Query gi|254781174|ref|YP_003065587.1| outer membrane assembly lipoprotein YfiO [Candidatus Liberibacter asiaticus str. psy62] Match_columns 271 No_of_seqs 187 out of 1985 Neff 8.2 Searched_HMMs 39220 Date Mon May 30 05:50:19 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781174.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR03302 OM_YfiO outer membra 100.0 0 0 376.2 25.9 233 24-259 3-235 (235) 2 PRK10866 outer membrane protei 100.0 0 0 375.8 24.8 232 19-257 1-242 (243) 3 COG4105 ComL DNA uptake lipopr 100.0 0 0 365.4 24.4 242 25-271 6-248 (254) 4 TIGR02795 tol_pal_ybgF tol-pal 99.9 5.9E-23 1.5E-27 152.0 11.8 118 57-182 2-119 (119) 5 PRK10803 hypothetical protein; 99.9 8.9E-22 2.3E-26 145.1 13.2 107 151-271 154-260 (262) 6 TIGR03302 OM_YfiO outer membra 99.9 5.5E-20 1.4E-24 134.6 16.6 165 91-269 30-208 (235) 7 COG1729 Uncharacterized protei 99.8 4.8E-20 1.2E-24 135.0 12.0 115 134-270 144-258 (262) 8 TIGR02795 tol_pal_ybgF tol-pal 99.8 1.3E-19 3.2E-24 132.6 12.3 117 132-270 3-119 (119) 9 PRK10803 hypothetical protein; 99.8 3.3E-19 8.4E-24 130.1 12.3 120 57-184 141-261 (262) 10 PRK10866 outer membrane protei 99.8 3.4E-17 8.6E-22 118.4 16.8 171 92-270 30-218 (243) 11 TIGR02521 type_IV_pilW type IV 99.7 4.8E-16 1.2E-20 111.6 18.9 207 29-258 3-247 (247) 12 TIGR02917 PEP_TPR_lipo putativ 99.7 1E-14 2.6E-19 103.9 22.3 228 25-271 8-280 (924) 13 COG1729 Uncharacterized protei 99.7 5.8E-16 1.5E-20 111.2 11.9 119 56-182 140-258 (262) 14 COG4105 ComL DNA uptake lipopr 99.7 1.4E-14 3.6E-19 103.1 16.9 165 52-230 66-244 (254) 15 TIGR02917 PEP_TPR_lipo putativ 99.6 4.5E-13 1.1E-17 94.3 19.1 162 95-262 452-656 (924) 16 COG3063 PilF Tfp pilus assembl 99.6 1.6E-12 4.1E-17 91.1 19.9 210 30-265 11-245 (250) 17 PRK11788 hypothetical protein; 99.5 9.9E-12 2.5E-16 86.5 20.6 201 59-265 37-287 (389) 18 PRK11189 lipoprotein NlpI; Pro 99.5 3E-11 7.6E-16 83.7 19.6 218 22-255 4-265 (297) 19 COG2956 Predicted N-acetylgluc 99.4 8.1E-11 2.1E-15 81.2 19.1 198 61-264 39-286 (389) 20 KOG2002 consensus 99.4 8E-11 2E-15 81.2 16.5 194 58-263 165-378 (1018) 21 KOG4626 consensus 99.3 2.7E-10 6.8E-15 78.2 13.6 180 53-252 248-447 (966) 22 KOG3785 consensus 99.2 5.2E-09 1.3E-13 70.6 17.7 182 64-266 29-224 (557) 23 PRK02603 photosystem I assembl 99.2 4.7E-10 1.2E-14 76.7 11.6 120 57-181 35-160 (172) 24 PRK11447 cellulose synthase su 99.2 1.7E-08 4.4E-13 67.6 19.3 69 198-269 470-538 (1158) 25 PRK11447 cellulose synthase su 99.2 1.9E-08 4.9E-13 67.3 19.4 213 52-267 58-348 (1158) 26 PRK11788 hypothetical protein; 99.2 2.9E-08 7.3E-13 66.3 20.2 197 54-263 104-318 (389) 27 KOG2002 consensus 99.2 1.7E-08 4.4E-13 67.6 19.0 194 56-259 269-528 (1018) 28 KOG1126 consensus 99.2 7.9E-09 2E-13 69.6 16.7 56 194-252 561-616 (638) 29 KOG1126 consensus 99.1 6.1E-09 1.5E-13 70.2 13.6 189 62-264 324-560 (638) 30 KOG0624 consensus 99.1 8.8E-08 2.2E-12 63.5 19.4 188 57-261 38-257 (504) 31 cd00189 TPR Tetratricopeptide 99.1 1.6E-09 4E-14 73.7 10.4 99 59-171 2-100 (100) 32 KOG0547 consensus 99.0 1.9E-08 4.8E-13 67.4 14.4 160 57-251 394-561 (606) 33 PRK02603 photosystem I assembl 99.0 1.5E-08 3.9E-13 67.9 13.8 145 67-223 9-153 (172) 34 TIGR02521 type_IV_pilW type IV 99.0 2.1E-08 5.3E-13 67.1 12.8 141 91-260 34-178 (247) 35 KOG0547 consensus 99.0 7.9E-08 2E-12 63.7 15.4 76 60-141 118-193 (606) 36 KOG1840 consensus 99.0 1.5E-07 3.7E-12 62.2 16.6 178 62-248 204-430 (508) 37 KOG1840 consensus 98.9 5.6E-07 1.4E-11 58.8 18.5 176 62-254 246-477 (508) 38 KOG4626 consensus 98.9 2E-07 5.1E-12 61.4 15.9 179 60-258 221-419 (966) 39 COG4783 Putative Zn-dependent 98.9 2.9E-07 7.5E-12 60.4 15.9 150 57-220 306-455 (484) 40 KOG1125 consensus 98.9 3.8E-07 9.6E-12 59.8 16.1 180 59-255 287-526 (579) 41 cd00189 TPR Tetratricopeptide 98.9 1.8E-08 4.7E-13 67.4 9.0 99 96-222 2-100 (100) 42 CHL00033 ycf3 photosystem I as 98.8 7.5E-08 1.9E-12 63.9 11.2 122 56-182 30-157 (165) 43 TIGR02552 LcrH_SycD type III s 98.8 1.1E-07 2.7E-12 63.0 11.9 116 54-183 15-131 (137) 44 COG4785 NlpI Lipoprotein NlpI, 98.8 7.3E-07 1.9E-11 58.1 16.0 183 57-255 65-265 (297) 45 COG4783 Putative Zn-dependent 98.8 1.5E-06 3.8E-11 56.3 17.0 126 92-248 304-429 (484) 46 KOG1155 consensus 98.8 7.3E-07 1.9E-11 58.1 14.9 59 195-256 437-495 (559) 47 KOG3060 consensus 98.8 1.9E-06 4.9E-11 55.6 16.6 166 59-258 53-222 (289) 48 KOG2003 consensus 98.7 6.7E-07 1.7E-11 58.3 13.4 182 55-254 488-720 (840) 49 KOG2076 consensus 98.7 5.5E-06 1.4E-10 53.0 18.0 68 57-127 139-206 (895) 50 pfam04733 Coatomer_E Coatomer 98.7 1E-05 2.6E-10 51.5 19.2 174 57-267 102-276 (290) 51 KOG0543 consensus 98.7 6.3E-07 1.6E-11 58.5 12.9 117 61-191 212-340 (397) 52 PRK10049 pgaA outer membrane p 98.7 1.5E-05 4E-10 50.4 19.1 75 56-130 30-135 (818) 53 PRK11189 lipoprotein NlpI; Pro 98.7 3.6E-06 9.3E-11 54.0 15.6 172 56-242 98-290 (297) 54 KOG1155 consensus 98.7 8.1E-06 2.1E-10 52.0 17.4 61 194-254 470-534 (559) 55 KOG1173 consensus 98.7 3.4E-06 8.6E-11 54.2 15.4 190 57-263 312-525 (611) 56 PRK10747 putative protoheme IX 98.6 2.7E-05 6.9E-10 48.9 21.7 187 57-252 118-386 (398) 57 KOG2003 consensus 98.6 5.6E-06 1.4E-10 52.9 15.2 26 61-86 423-448 (840) 58 TIGR00990 3a0801s09 mitochondr 98.6 2.7E-06 6.8E-11 54.8 13.2 170 54-252 428-601 (649) 59 KOG2376 consensus 98.6 2.9E-05 7.3E-10 48.8 18.2 71 56-126 174-256 (652) 60 PRK10049 pgaA outer membrane p 98.5 5.2E-05 1.3E-09 47.3 19.7 73 54-129 96-168 (818) 61 COG3063 PilF Tfp pilus assembl 98.5 6.6E-06 1.7E-10 52.5 12.7 141 91-260 32-172 (250) 62 KOG1173 consensus 98.5 6.5E-05 1.7E-09 46.7 17.7 199 55-260 242-488 (611) 63 CHL00033 ycf3 photosystem I as 98.5 4E-06 1E-10 53.8 11.0 130 83-223 18-149 (165) 64 KOG0548 consensus 98.5 3.7E-05 9.5E-10 48.1 15.9 182 61-253 228-418 (539) 65 KOG0553 consensus 98.4 4.7E-06 1.2E-10 53.4 10.8 109 53-175 77-185 (304) 66 KOG4340 consensus 98.4 3.3E-05 8.3E-10 48.5 14.7 178 67-255 20-206 (459) 67 TIGR00990 3a0801s09 mitochondr 98.4 1E-05 2.6E-10 51.4 11.5 156 63-252 369-524 (649) 68 KOG2076 consensus 98.4 0.00014 3.7E-09 44.7 17.4 63 60-122 244-308 (895) 69 KOG1156 consensus 98.3 0.00023 5.8E-09 43.5 17.2 63 196-261 225-288 (700) 70 KOG1174 consensus 98.3 8.2E-05 2.1E-09 46.2 14.5 208 57-271 232-515 (564) 71 KOG0543 consensus 98.2 5.6E-05 1.4E-09 47.1 12.6 69 194-265 261-329 (397) 72 KOG0624 consensus 98.2 6.9E-05 1.8E-09 46.6 13.0 74 57-133 72-145 (504) 73 KOG0548 consensus 98.2 0.00023 5.9E-09 43.5 15.7 111 97-235 361-471 (539) 74 KOG0550 consensus 98.2 4.7E-05 1.2E-09 47.5 12.1 176 63-252 135-346 (486) 75 COG2976 Uncharacterized protei 98.2 0.00012 3E-09 45.2 14.0 181 11-224 13-193 (207) 76 PRK10370 formate-dependent nit 98.2 0.00014 3.6E-09 44.8 14.4 129 72-228 62-190 (206) 77 COG2956 Predicted N-acetylgluc 98.2 0.00041 1E-08 42.1 18.9 184 54-260 104-315 (389) 78 PRK13184 pknD serine/threonine 98.1 0.00011 2.7E-09 45.5 12.5 72 57-132 512-590 (933) 79 KOG2376 consensus 98.1 0.00049 1.2E-08 41.6 15.9 195 55-268 44-266 (652) 80 KOG3081 consensus 98.1 0.00047 1.2E-08 41.7 15.7 94 155-268 189-283 (299) 81 COG5010 TadD Flp pilus assembl 98.1 0.00034 8.6E-09 42.6 14.0 157 61-251 70-226 (257) 82 KOG1586 consensus 98.1 0.00065 1.7E-08 40.9 18.1 198 57-270 33-241 (288) 83 TIGR02552 LcrH_SycD type III s 98.0 0.00013 3.3E-09 45.0 11.4 104 93-224 17-121 (137) 84 KOG3060 consensus 98.0 0.00076 1.9E-08 40.5 15.5 150 55-235 84-233 (289) 85 KOG3785 consensus 98.0 0.00078 2E-08 40.4 16.7 65 62-129 156-220 (557) 86 PRK10370 formate-dependent nit 98.0 0.00032 8.2E-09 42.7 12.9 111 116-255 69-180 (206) 87 KOG2053 consensus 98.0 0.00016 4.1E-09 44.4 11.0 76 57-136 43-118 (932) 88 COG4235 Cytochrome c biogenesi 98.0 0.00084 2.1E-08 40.3 14.5 130 69-226 134-264 (287) 89 KOG4234 consensus 98.0 0.00026 6.6E-09 43.2 11.8 115 57-182 95-211 (271) 90 KOG1129 consensus 98.0 0.0011 2.8E-08 39.5 14.8 183 62-259 228-427 (478) 91 COG5010 TadD Flp pilus assembl 97.9 0.0016 4E-08 38.7 18.6 194 23-258 4-199 (257) 92 PRK10747 putative protoheme IX 97.9 0.0016 4.1E-08 38.6 20.8 109 57-178 84-192 (398) 93 KOG0553 consensus 97.9 0.00032 8.2E-09 42.7 10.6 118 95-244 82-200 (304) 94 KOG0550 consensus 97.8 0.00024 6E-09 43.5 9.4 25 195-219 326-350 (486) 95 KOG4162 consensus 97.8 0.0022 5.5E-08 37.9 17.6 182 58-259 358-545 (799) 96 COG4785 NlpI Lipoprotein NlpI, 97.8 0.00086 2.2E-08 40.2 11.8 173 56-240 98-288 (297) 97 KOG1129 consensus 97.7 0.0025 6.4E-08 37.5 15.5 187 54-260 253-462 (478) 98 PRK13184 pknD serine/threonine 97.7 0.00063 1.6E-08 41.0 10.5 102 66-179 484-592 (933) 99 KOG1174 consensus 97.7 0.0027 6.9E-08 37.3 17.4 193 55-270 196-427 (564) 100 KOG4648 consensus 97.6 0.00052 1.3E-08 41.5 8.8 111 58-182 98-208 (536) 101 KOG1156 consensus 97.6 0.0039 9.9E-08 36.4 14.1 70 193-265 374-448 (700) 102 KOG0545 consensus 97.6 0.0016 4E-08 38.7 10.9 124 57-193 178-316 (329) 103 KOG1125 consensus 97.6 0.0046 1.2E-07 36.0 14.2 190 56-248 318-563 (579) 104 PRK12370 invasion protein regu 97.5 0.005 1.3E-07 35.7 15.9 187 52-255 253-469 (553) 105 KOG1130 consensus 97.4 0.0051 1.3E-07 35.7 11.7 90 54-143 51-147 (639) 106 COG0457 NrfG FOG: TPR repeat [ 97.4 0.0073 1.9E-07 34.8 18.4 61 195-258 207-267 (291) 107 KOG4162 consensus 97.4 0.0073 1.9E-07 34.8 12.3 70 55-126 476-545 (799) 108 COG2976 Uncharacterized protei 97.4 0.0075 1.9E-07 34.7 13.8 152 63-259 37-191 (207) 109 pfam09295 ChAPs ChAPs (Chs5p-A 97.4 0.003 7.6E-08 37.0 10.2 109 107-249 182-290 (395) 110 COG3118 Thioredoxin domain-con 97.3 0.0096 2.4E-07 34.1 15.0 164 56-252 133-298 (304) 111 COG4700 Uncharacterized protei 97.3 0.0089 2.3E-07 34.3 11.8 122 97-247 92-213 (251) 112 PRK09782 bacteriophage N4 rece 97.3 0.0063 1.6E-07 35.1 10.8 65 61-128 48-112 (987) 113 KOG0376 consensus 97.3 0.0023 5.8E-08 37.7 8.4 120 57-195 4-125 (476) 114 KOG1586 consensus 97.2 0.012 3.1E-07 33.4 15.3 200 13-238 40-245 (288) 115 COG4235 Cytochrome c biogenesi 97.2 0.012 3.2E-07 33.4 11.5 127 111-268 139-269 (287) 116 KOG1130 consensus 97.2 0.0045 1.1E-07 36.0 8.9 62 191-252 276-340 (639) 117 KOG1127 consensus 97.0 0.02 5E-07 32.3 15.6 189 62-270 404-639 (1238) 118 COG0457 NrfG FOG: TPR repeat [ 97.0 0.02 5E-07 32.3 19.6 175 57-259 59-234 (291) 119 COG4700 Uncharacterized protei 97.0 0.022 5.7E-07 31.9 16.0 138 54-222 86-225 (251) 120 KOG0495 consensus 96.9 0.024 6.1E-07 31.8 17.2 57 195-254 822-878 (913) 121 KOG4340 consensus 96.9 0.022 5.6E-07 32.0 10.4 74 52-128 139-213 (459) 122 KOG4234 consensus 96.9 0.027 6.8E-07 31.5 13.2 142 57-237 72-215 (271) 123 KOG2053 consensus 96.8 0.029 7.4E-07 31.3 19.5 97 61-171 13-109 (932) 124 cd05804 StaR_like StaR_like; a 96.8 0.03 7.8E-07 31.2 17.5 141 54-219 40-215 (355) 125 KOG0551 consensus 96.6 0.044 1.1E-06 30.2 10.6 85 57-144 81-166 (390) 126 pfam04781 DUF627 Protein of un 96.5 0.025 6.4E-07 31.7 8.6 73 100-172 2-77 (112) 127 KOG1585 consensus 96.5 0.052 1.3E-06 29.8 17.2 207 53-267 26-267 (308) 128 KOG0495 consensus 96.4 0.056 1.4E-06 29.6 20.8 99 71-172 564-684 (913) 129 PRK11619 lytic murein transgly 96.1 0.087 2.2E-06 28.5 14.7 69 55-131 31-99 (645) 130 KOG1128 consensus 96.0 0.092 2.3E-06 28.4 14.9 55 70-127 498-552 (777) 131 PRK10941 putative transcriptio 96.0 0.035 8.9E-07 30.8 6.9 72 195-269 186-257 (269) 132 KOG4642 consensus 95.9 0.11 2.7E-06 28.0 10.5 101 55-169 8-108 (284) 133 pfam04733 Coatomer_E Coatomer 95.7 0.12 3.1E-06 27.6 17.4 56 203-261 180-235 (290) 134 KOG3081 consensus 95.7 0.12 2.9E-06 27.8 8.7 55 203-260 186-240 (299) 135 KOG4642 consensus 95.5 0.063 1.6E-06 29.3 6.7 66 61-129 48-113 (284) 136 KOG1127 consensus 95.5 0.15 3.8E-06 27.1 14.8 63 63-128 568-630 (1238) 137 pfam10300 IML2 Putative mitoch 95.2 0.18 4.6E-06 26.6 10.7 90 56-147 249-339 (446) 138 PRK12450 foldase protein PrsA; 95.2 0.19 4.9E-06 26.5 9.4 49 18-66 3-51 (309) 139 KOG3364 consensus 95.0 0.15 3.8E-06 27.1 7.4 74 55-130 30-107 (149) 140 COG5159 RPN6 26S proteasome re 95.0 0.21 5.3E-06 26.3 13.3 49 62-110 8-61 (421) 141 pfam00515 TPR_1 Tetratricopept 95.0 0.024 6.2E-07 31.7 3.2 32 95-126 2-33 (34) 142 COG3071 HemY Uncharacterized e 94.9 0.23 5.8E-06 26.1 19.5 183 64-252 125-386 (400) 143 pfam03704 BTAD Bacterial trans 94.9 0.23 6E-06 26.0 10.5 58 195-255 67-124 (146) 144 KOG0545 consensus 94.8 0.24 6.2E-06 25.9 12.0 63 194-259 234-296 (329) 145 KOG2610 consensus 94.7 0.26 6.5E-06 25.8 9.1 156 55-221 101-279 (491) 146 KOG4555 consensus 94.6 0.27 7E-06 25.6 11.9 107 58-177 44-151 (175) 147 PRK10941 putative transcriptio 94.2 0.12 3.1E-06 27.6 5.4 59 153-228 195-253 (269) 148 smart00028 TPR Tetratricopepti 94.0 0.054 1.4E-06 29.7 3.2 32 95-126 2-33 (34) 149 pfam10602 RPN7 26S proteasome 93.7 0.42 1.1E-05 24.5 8.9 119 87-225 29-148 (174) 150 pfam07719 TPR_2 Tetratricopept 93.7 0.048 1.2E-06 30.0 2.4 31 96-126 3-33 (34) 151 KOG1941 consensus 93.5 0.44 1.1E-05 24.4 14.1 124 92-215 120-271 (518) 152 pfam00515 TPR_1 Tetratricopept 93.2 0.14 3.4E-06 27.4 4.1 32 58-89 2-33 (34) 153 PRK01326 prsA foldase protein 92.4 0.65 1.7E-05 23.4 8.0 47 19-65 1-48 (310) 154 TIGR03042 PS_II_psbQ_bact phot 92.1 0.71 1.8E-05 23.2 11.3 23 21-43 2-24 (142) 155 pfam07719 TPR_2 Tetratricopept 92.0 0.15 3.8E-06 27.1 3.0 32 58-89 2-33 (34) 156 PRK04405 prsA peptidylprolyl i 92.0 0.74 1.9E-05 23.1 9.7 48 17-66 3-50 (298) 157 COG3629 DnrI DNA-binding trans 91.7 0.79 2E-05 22.9 10.0 59 195-256 158-216 (280) 158 TIGR02561 HrpB1_HrpK type III 91.6 0.8 2E-05 22.9 6.4 68 61-129 54-121 (190) 159 smart00028 TPR Tetratricopepti 91.6 0.27 6.8E-06 25.6 3.9 31 59-89 3-33 (34) 160 COG3014 Uncharacterized protei 90.9 0.95 2.4E-05 22.4 10.2 20 24-43 6-25 (449) 161 KOG4814 consensus 90.6 1 2.6E-05 22.3 14.9 74 58-131 355-431 (872) 162 PRK03002 prsA peptidylprolyl i 90.6 1 2.6E-05 22.3 8.7 46 20-65 3-48 (285) 163 PRK11063 metQ DL-methionine tr 89.6 0.67 1.7E-05 23.3 4.5 53 17-70 3-55 (271) 164 PRK03095 prsA peptidylprolyl i 89.5 1.3 3.2E-05 21.7 9.2 47 19-67 1-47 (287) 165 PRK10802 peptidoglycan-associa 89.3 0.42 1.1E-05 24.5 3.3 27 19-45 3-29 (173) 166 pfam10255 Paf67 RNA polymerase 88.1 1.4 3.6E-05 21.5 5.3 57 67-123 132-193 (402) 167 pfam02259 FAT FAT domain. The 87.7 1.7 4.2E-05 21.0 12.3 30 207-239 275-304 (351) 168 COG3107 LppC Putative lipoprot 87.4 1.7 4.4E-05 20.9 16.3 143 17-172 6-166 (604) 169 pfam07901 DUF1672 Protein of u 87.0 0.52 1.3E-05 24.0 2.6 25 19-45 1-25 (304) 170 pfam10345 Cohesin_load Cohesin 86.3 2 5.1E-05 20.6 16.4 195 57-252 180-424 (593) 171 KOG1464 consensus 86.2 2 5.1E-05 20.5 9.4 55 68-122 38-93 (440) 172 COG2912 Uncharacterized conser 86.0 2.1 5.3E-05 20.5 7.5 64 200-266 191-254 (269) 173 pfam11817 Foie-gras_1 Foie gra 85.9 2.1 5.3E-05 20.4 11.8 56 194-249 189-247 (254) 174 pfam09986 DUF2225 Uncharacteri 85.8 2.1 5.4E-05 20.4 11.3 52 205-256 140-194 (214) 175 KOG1308 consensus 85.5 0.54 1.4E-05 23.9 2.0 65 62-129 119-183 (377) 176 TIGR03659 IsdE heme ABC transp 84.7 1.3 3.3E-05 21.7 3.7 27 21-47 2-28 (289) 177 PRK01622 OxaA-like protein pre 84.4 2.5 6.3E-05 20.0 6.9 69 18-90 3-79 (266) 178 pfam09670 Cas_Cas02710 CRISPR- 84.4 2.5 6.3E-05 20.0 13.3 64 60-123 134-198 (379) 179 COG2909 MalT ATP-dependent tra 84.2 2.5 6.4E-05 20.0 18.5 196 62-263 463-701 (894) 180 COG4669 EscJ Type III secretor 83.9 1.6 4.1E-05 21.1 3.9 19 21-39 3-21 (246) 181 pfam10579 Rapsyn_N Rapsyn N-te 83.9 2.6 6.6E-05 19.9 6.5 60 58-117 7-66 (80) 182 PRK10880 adenine DNA glycosyla 83.1 2.8 7.1E-05 19.7 9.1 69 98-172 33-107 (350) 183 COG4259 Uncharacterized protei 83.0 2.8 7.2E-05 19.7 6.9 92 26-128 10-106 (121) 184 pfam05053 Menin Menin. MEN1, t 82.9 2.8 7.2E-05 19.7 7.0 45 195-239 323-370 (618) 185 KOG2300 consensus 82.0 3.1 7.9E-05 19.4 14.3 56 194-249 449-507 (629) 186 COG4594 FecB ABC-type Fe3+-cit 81.9 0.96 2.4E-05 22.4 2.1 27 18-44 2-28 (310) 187 KOG1463 consensus 81.2 3.3 8.4E-05 19.3 14.4 59 191-249 210-271 (411) 188 KOG2796 consensus 80.9 3.4 8.6E-05 19.2 10.9 140 95-259 178-318 (366) 189 COG1704 LemA Uncharacterized c 80.9 3.4 8.6E-05 19.2 14.4 68 152-226 94-161 (185) 190 pfam06552 TOM20_plant Plant sp 80.6 3.5 8.8E-05 19.2 7.5 67 110-179 51-120 (186) 191 PRK06330 transcript cleavage f 80.4 3.5 9E-05 19.1 12.8 129 95-238 118-269 (906) 192 TIGR02747 TraV type IV conjuga 79.7 0.91 2.3E-05 22.6 1.4 22 19-40 1-22 (174) 193 PRK09859 multidrug efflux syst 79.6 2.4 6.1E-05 20.1 3.5 25 21-45 4-28 (385) 194 PRK11138 outer membrane protei 79.5 1.8 4.7E-05 20.8 2.9 24 24-47 6-29 (394) 195 COG3947 Response regulator con 79.1 3.9 9.8E-05 18.9 8.9 159 76-251 166-337 (361) 196 pfam10140 essB Predicted membr 78.7 4 0.0001 18.8 11.8 53 56-112 221-273 (359) 197 COG3417 FlgN Collagen-binding 78.6 2.2 5.7E-05 20.3 3.1 21 20-40 2-22 (200) 198 TIGR03524 GldJ gliding motilit 78.0 2.6 6.6E-05 19.9 3.3 27 18-44 2-28 (559) 199 KOG2471 consensus 77.3 4.4 0.00011 18.6 9.0 88 14-105 291-380 (696) 200 COG4976 Predicted methyltransf 75.9 4.7 0.00012 18.4 4.1 62 65-129 3-64 (287) 201 PRK10702 endonuclease III; Pro 75.5 4.9 0.00013 18.3 12.2 166 74-270 4-180 (211) 202 PRK10153 DNA-binding transcrip 74.8 5.1 0.00013 18.2 15.9 147 113-269 314-487 (512) 203 pfam11777 DUF3316 Protein of u 74.2 5.3 0.00014 18.1 4.8 52 19-70 1-52 (114) 204 TIGR02722 lp_ uncharacterized 73.5 4.1 0.00011 18.7 3.3 22 21-42 2-23 (215) 205 PRK09967 putative outer membra 73.4 3.3 8.3E-05 19.3 2.8 22 19-41 1-22 (160) 206 PRK09578 periplasmic multidrug 72.1 4.8 0.00012 18.3 3.4 24 21-44 6-29 (385) 207 TIGR01781 Trep_dent_lipo Trepo 72.1 3 7.6E-05 19.6 2.3 23 23-45 6-28 (463) 208 PRK11548 hypothetical protein; 71.1 2.8 7E-05 19.7 2.0 24 19-42 1-24 (113) 209 PRK00022 lolB outer membrane l 70.6 6.5 0.00016 17.6 3.9 19 25-43 5-23 (203) 210 KOG4459 consensus 70.5 6.5 0.00017 17.6 6.7 34 99-132 138-171 (471) 211 TIGR03516 ppisom_GldI peptidyl 69.7 6.8 0.00017 17.5 4.9 20 27-46 6-25 (177) 212 PRK02998 prsA peptidylprolyl i 69.0 7 0.00018 17.4 9.6 79 19-101 1-81 (283) 213 TIGR02887 spore_ger_x_C germin 68.6 3 7.8E-05 19.5 1.8 21 22-42 2-22 (400) 214 PRK09861 cytoplasmic membrane 67.5 7.5 0.00019 17.2 5.1 134 26-171 12-161 (272) 215 pfam11207 DUF2989 Protein of u 67.2 7.6 0.00019 17.2 11.7 53 194-247 145-198 (203) 216 KOG1070 consensus 66.8 7.8 0.0002 17.1 18.5 187 75-268 1442-1643(1710) 217 KOG2114 consensus 65.5 8.3 0.00021 17.0 14.3 147 62-214 339-514 (933) 218 pfam08631 SPO22 Meiosis protei 64.8 8.5 0.00022 16.9 17.0 194 55-253 33-274 (280) 219 PRK06481 fumarate reductase fl 63.8 8.9 0.00023 16.8 3.5 26 23-48 6-31 (506) 220 COG3017 LolB Outer membrane li 63.7 8.9 0.00023 16.8 3.8 29 19-47 3-31 (206) 221 COG5510 Predicted small secret 62.5 5.5 0.00014 18.0 2.1 23 18-40 1-24 (44) 222 KOG2063 consensus 62.0 9.6 0.00025 16.6 13.9 67 194-260 508-577 (877) 223 PRK02944 OxaA-like protein pre 61.1 10 0.00026 16.5 7.5 67 21-90 4-78 (255) 224 TIGR03504 FimV_Cterm FimV C-te 60.8 10 0.00026 16.4 4.5 20 64-83 6-25 (44) 225 TIGR02544 III_secr_YscJ type I 59.5 11 0.00027 16.3 4.3 16 25-40 4-19 (203) 226 PRK10796 LPS-assembly lipoprot 59.3 7.6 0.00019 17.2 2.4 28 21-48 2-29 (196) 227 PTZ00116 signal peptidase; Pro 58.6 11 0.00028 16.2 3.6 31 1-40 1-31 (185) 228 COG4499 Predicted membrane pro 58.3 11 0.00029 16.2 11.7 48 59-110 250-297 (434) 229 TIGR01742 SA_tandem_lipo Staph 58.0 4.8 0.00012 18.3 1.2 61 19-94 4-65 (267) 230 KOG2908 consensus 57.0 12 0.0003 16.1 8.4 105 71-178 89-201 (380) 231 TIGR03044 PS_II_psb27 photosys 56.1 12 0.00031 16.0 12.8 117 17-182 5-121 (135) 232 pfam04507 DUF576 Protein of un 56.0 6.2 0.00016 17.7 1.5 49 20-93 5-53 (257) 233 pfam04348 LppC LppC putative l 55.8 12 0.00032 15.9 5.3 58 61-118 28-85 (535) 234 TIGR03525 GldK gliding motilit 55.5 8.9 0.00023 16.8 2.2 17 28-44 7-23 (449) 235 COG2143 Thioredoxin-related pr 55.0 8.5 0.00022 16.9 2.0 21 23-43 4-24 (182) 236 pfam06474 MLTD_N MLTD_N. 54.1 13 0.00034 15.8 3.1 21 27-47 5-25 (93) 237 KOG3617 consensus 54.1 13 0.00034 15.8 16.4 84 177-260 1278-1363(1416) 238 TIGR02898 spore_YhcN_YlaJ spor 53.0 13 0.00033 15.8 2.7 23 108-130 68-90 (185) 239 TIGR02508 type_III_yscG type I 53.0 6.1 0.00015 17.7 1.0 49 57-111 40-88 (118) 240 PRK10722 hypothetical protein; 52.8 14 0.00035 15.7 4.9 22 28-49 21-42 (248) 241 TIGR01000 bacteriocin_acc bact 52.3 9 0.00023 16.7 1.8 44 17-60 17-60 (476) 242 PRK11627 hypothetical protein; 52.0 14 0.00036 15.6 3.1 15 29-43 9-23 (192) 243 PRK04841 transcriptional regul 51.7 14 0.00037 15.5 18.8 65 196-260 697-765 (903) 244 pfam07720 TPR_3 Tetratricopept 51.7 14 0.00037 15.5 3.8 25 59-83 3-27 (34) 245 COG5633 Predicted periplasmic 50.6 6.9 0.00018 17.4 1.0 18 28-45 7-24 (123) 246 PRK11023 hypothetical protein; 50.5 11 0.00027 16.3 1.9 19 22-40 3-21 (191) 247 PRK11671 mltC murein transglyc 50.5 12 0.0003 16.1 2.1 22 247-268 328-349 (360) 248 pfam07219 HemY_N HemY protein 49.9 15 0.00039 15.4 6.7 28 59-86 87-114 (134) 249 pfam04011 LemA LemA family. Th 49.6 16 0.0004 15.4 14.2 68 152-226 94-161 (186) 250 KOG2047 consensus 49.2 16 0.0004 15.3 14.6 20 102-121 256-275 (835) 251 PRK09973 putative outer membra 49.2 5.9 0.00015 17.8 0.5 22 22-43 4-25 (85) 252 PRK12696 flgH flagellar basal 48.4 16 0.00042 15.2 2.7 16 29-44 9-24 (238) 253 pfam11846 DUF3366 Domain of un 48.0 17 0.00042 15.2 9.5 53 210-266 131-183 (193) 254 PRK12700 flgH flagellar basal 47.8 17 0.00042 15.2 3.6 25 19-43 3-27 (230) 255 pfam09710 Trep_dent_lipo Trepo 47.5 7.3 0.00019 17.3 0.7 20 24-43 4-23 (394) 256 pfam12309 KBP_C KIF-1 binding 47.3 17 0.00043 15.1 7.3 64 155-220 259-333 (365) 257 pfam11153 DUF2931 Protein of u 47.2 14 0.00036 15.6 2.2 16 25-40 3-18 (202) 258 PRK13684 Ycf48-like protein; P 47.2 17 0.00043 15.1 2.7 20 21-40 7-26 (333) 259 pfam12273 RCR Chitin synthesis 46.5 17 0.00042 15.2 2.4 19 22-40 5-23 (124) 260 PRK10598 hypothetical protein; 46.4 13 0.00033 15.9 1.8 23 19-42 1-23 (186) 261 TIGR01084 mutY A/G-specific ad 45.1 18 0.00047 14.9 5.2 67 99-172 42-116 (297) 262 TIGR03511 GldH_lipo gliding mo 45.0 12 0.0003 16.1 1.4 23 18-42 2-24 (156) 263 pfam08085 Entericidin Enterici 44.5 16 0.00041 15.3 2.1 22 19-40 1-23 (42) 264 pfam11172 DUF2959 Protein of u 44.3 19 0.00048 14.9 10.1 91 33-135 2-98 (201) 265 pfam00244 14-3-3 14-3-3 protei 43.9 19 0.00049 14.8 6.6 63 58-121 2-64 (236) 266 PRK10871 nlpD lipoprotein NlpD 43.9 19 0.00049 14.8 2.8 31 17-47 5-35 (374) 267 TIGR02269 TIGR02269 Myxococcus 43.6 15 0.00037 15.5 1.8 19 26-44 2-20 (228) 268 PRK13731 conjugal transfer sur 43.2 17 0.00043 15.2 2.0 23 18-41 2-24 (243) 269 PRK00249 flgH flagellar basal 43.2 20 0.0005 14.8 2.7 21 24-44 5-25 (231) 270 COG4649 Uncharacterized protei 42.7 20 0.00051 14.7 17.0 136 58-219 59-196 (221) 271 COG2178 Predicted RNA-binding 42.0 21 0.00052 14.6 11.2 109 75-191 57-180 (204) 272 TIGR03300 assembly_YfgL outer 41.9 20 0.0005 14.8 2.2 21 26-46 4-24 (377) 273 KOG3824 consensus 41.7 21 0.00053 14.6 6.2 70 57-129 116-185 (472) 274 pfam09613 HrpB1_HrpK Bacterial 41.5 21 0.00053 14.6 9.4 63 61-127 48-110 (156) 275 pfam08139 LPAM_1 Prokaryotic m 41.3 20 0.00052 14.7 2.2 14 30-43 9-22 (26) 276 PRK12407 flgH flagellar basal 41.1 21 0.00054 14.6 2.8 18 26-43 4-21 (220) 277 cd02678 MIT_VPS4 MIT: domain c 40.3 22 0.00056 14.5 4.4 30 57-86 6-35 (75) 278 PRK11443 hypothetical protein; 40.1 22 0.00056 14.5 4.3 17 26-42 4-20 (120) 279 TIGR03352 VI_chp_3 type VI sec 40.1 19 0.00048 14.9 1.9 16 28-43 6-21 (146) 280 KOG2997 consensus 40.0 22 0.00056 14.5 3.7 39 54-92 16-54 (366) 281 PRK10781 rcsF outer membrane l 39.5 23 0.00057 14.4 2.4 18 26-43 4-21 (133) 282 COG3317 NlpB Uncharacterized l 39.1 23 0.00058 14.4 2.3 21 23-43 5-25 (342) 283 cd02683 MIT_1 MIT: domain cont 38.9 23 0.00059 14.4 4.2 31 56-86 5-35 (77) 284 PRK13910 DNA glycosylase MutY; 37.7 24 0.00061 14.3 2.4 18 112-129 10-28 (290) 285 pfam06957 COPI_C Coatomer (COP 37.7 24 0.00061 14.3 13.0 36 201-236 310-345 (421) 286 PRK04168 hypothetical protein; 37.7 18 0.00045 15.0 1.4 23 19-41 4-26 (336) 287 PRK09534 btuF corrinoid ABC tr 37.5 24 0.00062 14.2 2.8 27 17-43 2-28 (364) 288 PRK10175 hypothetical protein; 37.4 15 0.00039 15.4 1.1 20 25-44 3-22 (75) 289 PRK12698 flgH flagellar basal 37.3 24 0.00062 14.2 2.9 22 19-44 1-22 (224) 290 PRK11616 hypothetical protein; 36.6 15 0.00038 15.4 0.9 17 26-42 8-24 (109) 291 COG4238 Murein lipoprotein [Ce 36.6 9.5 0.00024 16.6 -0.1 22 24-45 7-28 (78) 292 pfam04010 DUF357 Protein of un 36.5 25 0.00064 14.1 2.8 55 153-214 5-59 (75) 293 cd02684 MIT_2 MIT: domain cont 36.1 26 0.00065 14.1 4.4 30 57-86 6-35 (75) 294 smart00101 14_3_3 14-3-3 homol 35.8 26 0.00066 14.1 6.9 65 59-123 3-68 (244) 295 COG3168 PilP Tfp pilus assembl 35.7 24 0.0006 14.3 1.8 19 25-43 3-21 (170) 296 PRK12701 flgH flagellar basal 35.7 26 0.00066 14.1 3.5 20 24-43 5-24 (230) 297 PRK13733 conjugal transfer pro 35.0 10 0.00025 16.5 -0.2 40 30-69 11-51 (171) 298 TIGR00620 sporelyase spore pho 34.3 20 0.0005 14.8 1.2 63 153-224 131-194 (199) 299 PRK12798 chemotaxis protein; R 34.2 27 0.0007 13.9 16.5 117 55-181 111-227 (416) 300 pfam06085 Rz1 Lipoprotein Rz1 34.1 25 0.00064 14.2 1.7 23 22-44 4-26 (59) 301 pfam03304 Mlp Mlp lipoprotein 33.7 18 0.00045 15.0 0.9 19 24-42 4-22 (150) 302 COG3065 Slp Starvation-inducib 33.7 28 0.00071 13.9 3.1 21 24-44 9-29 (191) 303 COG3056 Uncharacterized lipopr 33.7 28 0.00071 13.9 5.0 32 14-45 7-38 (204) 304 TIGR03362 VI_chp_7 type VI sec 33.4 28 0.00072 13.8 8.4 176 57-255 99-278 (301) 305 pfam09533 DUF2380 Predicted li 33.3 28 0.00072 13.8 2.8 24 22-45 3-26 (220) 306 PRK09193 indolepyruvate ferred 32.9 29 0.00074 13.8 8.1 88 97-185 878-975 (1155) 307 pfam04212 MIT MIT (microtubule 32.7 29 0.00074 13.8 4.5 30 57-86 5-34 (69) 308 PRK10523 lipoprotein involved 32.0 30 0.00076 13.7 3.8 27 18-44 1-27 (234) 309 TIGR00985 3a0801s04tom mitocho 32.0 30 0.00076 13.7 3.0 59 72-130 65-131 (155) 310 PRK06760 hypothetical protein; 31.6 17 0.00042 15.2 0.5 20 24-43 7-26 (223) 311 PHA00407 phage lambda Rz1-like 31.6 30 0.00078 13.7 3.1 30 21-50 32-61 (84) 312 COG5191 Uncharacterized conser 31.5 30 0.00078 13.7 5.5 81 57-142 107-190 (435) 313 KOG1310 consensus 30.9 31 0.0008 13.6 8.4 30 61-90 378-407 (758) 314 pfam04910 DUF654 Protein of un 30.2 32 0.00082 13.5 12.7 107 58-171 101-208 (343) 315 KOG2396 consensus 28.7 34 0.00087 13.4 12.6 73 70-145 118-191 (568) 316 PRK10461 thiamine biosynthesis 28.4 35 0.00088 13.3 2.2 17 27-43 9-25 (351) 317 smart00745 MIT Microtubule Int 27.9 35 0.0009 13.3 4.4 24 59-82 10-33 (77) 318 pfam02184 HAT HAT (Half-A-TPR) 27.7 32 0.00082 13.5 1.4 21 109-129 2-22 (32) 319 PRK10760 murein hydrolase B; P 27.7 36 0.00091 13.3 3.3 16 29-44 8-23 (357) 320 pfam07268 EppA_BapA Exported p 27.7 36 0.00091 13.3 2.6 24 57-80 26-49 (163) 321 PRK10215 hypothetical protein; 27.7 23 0.0006 14.3 0.7 21 22-42 8-28 (219) 322 COG0790 FOG: TPR repeat, SEL1 27.7 36 0.00091 13.3 15.4 162 57-241 41-236 (292) 323 COG5645 Predicted periplasmic 27.6 27 0.0007 13.9 1.0 12 33-44 12-23 (80) 324 PRK12788 flgH flagellar basal 27.6 36 0.00091 13.3 2.5 19 25-43 3-21 (231) 325 COG4939 Major membrane immunog 27.3 20 0.00052 14.7 0.3 22 25-46 6-27 (147) 326 pfam09577 Spore_YpjB Sporulati 26.9 37 0.00094 13.2 5.6 58 60-118 8-65 (232) 327 cd02656 MIT MIT: domain contai 26.9 37 0.00094 13.2 6.5 30 57-86 6-35 (75) 328 COG3521 Predicted component of 26.7 37 0.00094 13.2 2.0 14 31-44 13-26 (159) 329 pfam10516 SHNi-TPR SHNi-TPR. S 26.2 38 0.00097 13.1 2.8 24 62-85 6-29 (38) 330 COG4851 CamS Protein involved 26.0 38 0.00097 13.1 2.7 23 23-45 4-26 (382) 331 PRK02463 OxaA-like protein pre 25.9 38 0.00098 13.1 11.6 66 20-89 6-78 (307) 332 pfam07721 TPR_4 Tetratricopept 25.8 39 0.00098 13.1 2.0 19 98-116 5-23 (26) 333 TIGR00752 slp outer membrane l 25.7 39 0.00099 13.0 2.7 22 25-46 5-26 (190) 334 pfam02402 Lysis_col Lysis prot 25.5 26 0.00065 14.1 0.6 22 24-45 5-26 (46) 335 PRK12699 flgH flagellar basal 25.5 39 0.001 13.0 2.7 24 21-44 16-39 (246) 336 KOG2041 consensus 25.3 39 0.001 13.0 13.0 75 57-142 796-870 (1189) 337 PRK10510 putative outer membra 25.2 40 0.001 13.0 2.2 19 155-173 127-145 (219) 338 KOG0546 consensus 24.9 40 0.001 13.0 3.1 63 196-261 281-343 (372) 339 PRK10449 heat-inducible protei 24.7 41 0.001 12.9 2.6 19 26-44 5-23 (140) 340 pfam05462 Dicty_CAR Slime mold 24.4 41 0.001 12.9 9.0 96 2-98 20-120 (305) 341 KOG2066 consensus 24.4 41 0.001 12.9 9.4 21 63-83 511-531 (846) 342 pfam11839 DUF3359 Protein of u 24.2 40 0.001 13.0 1.3 25 20-44 2-26 (97) 343 PRK06804 flgA flagellar basal 24.2 36 0.00093 13.2 1.2 14 56-69 54-67 (272) 344 PRK08005 ribulose-phosphate 3- 23.5 31 0.00079 13.6 0.7 66 151-216 143-210 (210) 345 PRK00059 prsA peptidylprolyl i 23.4 43 0.0011 12.8 3.1 24 18-41 3-26 (336) 346 TIGR01004 PulS_OutS lipoprotei 23.0 44 0.0011 12.7 4.9 28 19-46 5-32 (136) 347 PRK10525 cytochrome o ubiquino 22.9 44 0.0011 12.7 2.5 21 20-40 7-27 (305) 348 PTZ00334 trans-sialidase; Prov 22.8 41 0.0011 12.9 1.2 24 18-41 38-61 (780) 349 cd07668 BAR_SNX9 The Bin/Amphi 22.8 44 0.0011 12.7 9.0 39 148-186 97-136 (210) 350 pfam07273 DUF1439 Protein of u 22.7 44 0.0011 12.7 1.8 11 30-40 10-20 (177) 351 PRK10759 hypothetical protein; 22.5 45 0.0011 12.7 1.6 18 26-43 3-20 (106) 352 KOG4563 consensus 22.5 45 0.0011 12.7 5.9 65 53-117 37-106 (400) 353 KOG1550 consensus 22.5 45 0.0011 12.7 15.3 98 57-168 288-393 (552) 354 COG5567 Predicted small peripl 22.3 45 0.0012 12.7 2.6 22 23-44 5-26 (58) 355 pfam03978 Borrelia_REV Borreli 22.2 45 0.0012 12.6 4.1 18 24-41 6-23 (160) 356 PRK11162 mltA murein transglyc 21.8 46 0.0012 12.6 5.1 19 26-44 6-24 (362) 357 PRK12697 flgH flagellar basal 21.3 47 0.0012 12.5 2.6 23 21-43 7-29 (227) 358 pfam01441 Lipoprotein_6 Lipopr 21.3 47 0.0012 12.5 4.3 25 19-44 1-25 (209) 359 COG1194 MutY A/G-specific DNA 20.7 49 0.0012 12.5 7.0 29 101-129 40-68 (342) 360 PRK10797 glutamate and asparta 20.7 49 0.0012 12.5 2.8 34 235-268 254-294 (302) No 1 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=100.00 E-value=0 Score=376.20 Aligned_cols=233 Identities=35% Similarity=0.606 Sum_probs=220.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 99999999999972268753334456777673689999999999980989999999999985304773026899877887 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFV 103 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~ 103 (271) .++++++++++++|||+..+.. ..+....+++++|++|+.++++|+|..|++.|++|...||+|+++++|++++|++ T Consensus 3 ~~~~~~~~~~~l~sCs~~~~~~---~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~fe~l~~~yp~s~~a~~A~l~~a~a 79 (235) T TIGR03302 3 LLILLLALLLLLAGCSSKKKKE---ADPVEERPAEELYNEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYA 79 (235) T ss_pred HHHHHHHHHHHHHHCCCCCCCC---CCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 9999999999998738997665---5555578999999999999987899999999999998689931379999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 76545556799999887540112110135554434445555443111234589999999999985200101345655555 Q gi|254781174|r 104 QYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVT 183 (271) Q Consensus 104 ~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~ 183 (271) ||..++|++|+..+++|++.||+|+++++|+|++|+|+++++++.++||+.+.+|+..|+.++++||+|+|+++|+.+|. T Consensus 80 ~y~~~~y~~A~~~~~~Fi~~yP~~~~~~ya~y~~a~s~~~~~~~~~rDq~~~~~A~~~f~~~i~~yP~S~ya~dA~~~i~ 159 (235) T TIGR03302 80 YYKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYKQIDDSDRDQTAAREAFEAFQELIRRYPNSEYAPDAKKRMD 159 (235) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 99764399999999999998877434999999987998873665445808999999999999988899657999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 5566666665556666654432234568999886528998115999999999999749879999999999785699 Q gi|254781174|r 184 VGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ 259 (271) Q Consensus 184 ~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~ 259 (271) .++++||+||+.+|++|+++|+|.+|++||+.++++||+|+++++|+++|+++|.++|+.++|+++.++|..|||+ T Consensus 160 ~l~~~LA~~e~~ia~~Y~k~~~y~aAi~r~~~vl~~Yp~t~~~~eAl~~l~eay~~lg~~~~A~~~~~~l~~n~p~ 235 (235) T TIGR03302 160 YLRNRLAGKELYVARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLALGLTDLAQDAAAVLGANYPD 235 (235) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 9999999999999999998417099999999999868998513999999999999869989999999999753989 No 2 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=100.00 E-value=0 Score=375.80 Aligned_cols=232 Identities=22% Similarity=0.375 Sum_probs=218.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 99999999999999999722687533344567776736899999999999809899999999999853047730268998 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLL 98 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~ 98 (271) |+|...++++++++++++|||+..+. ....+++++|++|...+++|+|.+|++.|+++..+||+|+++++|++ T Consensus 1 m~~~k~l~~~~~l~l~l~gCs~~~~~-------~~~~~~~~~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~~A~qA~L 73 (243) T PRK10866 1 MTRMKYLVAAATLSLFLAGCSGSKEE-------VPDNPPNEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQL 73 (243) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCC-------CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 94589999999999999876899888-------88999999999999999878999999999999986899767899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 77887765455567999998875401121101355544344455----------55443111234589999999999985 Q gi|254781174|r 99 MSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYA----------QMIRDVPYDQRATKLMLQYMSRIVER 168 (271) Q Consensus 99 ~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~----------~~~~~~~~d~~~~~~A~~~f~~~i~~ 168 (271) ++|+++|+.++|++|+..+++||+.||+||++|||+|++|+|++ .++.+.+||++.+++|++.|+.+|++ T Consensus 74 ~~ayayy~~~~y~~A~~~~~rFi~lyP~~~~v~Ya~Yl~gl~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~f~~li~~ 153 (243) T PRK10866 74 DLIYAYYKNADLPLAQAAIDRFMRLNPTHPNIDYVMYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFSDFSKLVRG 153 (243) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHH T ss_conf 99999873077899999999999867899973899999889877401358888743256430769999999999999987 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 20010134565555555666666655566666544322345689998865289981159999999999997498799999 Q gi|254781174|r 169 YTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEARE 248 (271) Q Consensus 169 yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~ 248 (271) ||+|+|+.||+.+|..+++.||+||+.||+||+++|+|.||++|++.||++||+|++++|||++|+++|..+|+.|+|.+ T Consensus 154 yP~S~Ya~dA~~r~~~lrn~LA~~El~Va~fY~kr~~y~AAi~R~~~vi~~Yp~t~~~~eAL~~l~eaY~~Lgl~~~A~~ 233 (243) T PRK10866 154 YPNSQYTTDATKRLVFLKDRLAKYEYSVAEYYTKRGAWVAVVNRVEGMLRDYPDTQATRDALPLMENAYRQMQLNAQADK 233 (243) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 89984589999999999999999999999999992656999999999997689996339999999999998699789999 Q ss_pred HHHHHHHHC Q ss_conf 999997856 Q gi|254781174|r 249 VVSLIQERY 257 (271) Q Consensus 249 ~~~~l~~~y 257 (271) +.++|..|- T Consensus 234 ~~~vL~~N~ 242 (243) T PRK10866 234 VAKIIAANS 242 (243) T ss_pred HHHHHHHHC T ss_conf 999996126 No 3 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=100.00 E-value=0 Score=365.40 Aligned_cols=242 Identities=34% Similarity=0.574 Sum_probs=225.8 Q ss_pred HHHHHHHHHH-HHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 9999999999-972268753334456777673689999999999980989999999999985304773026899877887 Q gi|254781174|r 25 TIFFSIAVCF-LVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFV 103 (271) Q Consensus 25 ~i~~~i~~~~-l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~ 103 (271) ++++.+++++ .++||....++ .....++.++|++|...+++|+|++|++.|+.+..+||+|+++++++++++++ T Consensus 6 ~l~~~l~~~~~~~~cs~~~~~~-----~~~~~p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~~~~~~qa~l~l~yA 80 (254) T COG4105 6 LLVIGLLLLVASTGCSGDKDKN-----GVYNLPASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYA 80 (254) T ss_pred HHHHHHHHHHHHCCCCCCHHHC-----CCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 9999999999712577623330-----55578899999999999863788999999999987099981128999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 76545556799999887540112110135554434445555443111234589999999999985200101345655555 Q gi|254781174|r 104 QYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVT 183 (271) Q Consensus 104 ~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~ 183 (271) +|+.++|++|+...++|+.+||+||+++||+|++|++++.++.++.|||+.++.|+..|++||++||||+|+.||+.+|. T Consensus 81 ~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~YlkgLs~~~~i~~~~rDq~~~~~A~~~f~~~i~ryPnS~Ya~dA~~~i~ 160 (254) T COG4105 81 YYKNGEYDLALAYIDRFIRLYPTHPNADYAYYLKGLSYFFQIDDVTRDQSAARAAFAAFKELVQRYPNSRYAPDAKARIV 160 (254) T ss_pred HHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 88135589999999999987899987169999999987526775444779999999999999987889702355999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHH Q ss_conf 55666666655566666544322345689998865289981159999999999997498799999999997856999789 Q gi|254781174|r 184 VGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWA 263 (271) Q Consensus 184 ~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~ 263 (271) .+++.||+||+.||+||+++|.|.+|++|++.++++||+|+.+.+||.+|.++|+.+|..|+|.+..++|..|||+|.|+ T Consensus 161 ~~~d~LA~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~t~~~~eaL~~l~eaY~~lgl~~~a~~~~~vl~~N~p~s~~~ 240 (254) T COG4105 161 KLNDALAGHEMAIARYYLKRGAYVAAINRFEEVLENYPDTSAVREALARLEEAYYALGLTDEAKKTAKVLGANYPDSQWY 240 (254) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCH T ss_conf 99999889999999999990576999999999987145552047789999999998287278999999997368988203 Q ss_pred HHHHHHHC Q ss_conf 99998609 Q gi|254781174|r 264 RYVETLVK 271 (271) Q Consensus 264 ~~a~~~l~ 271 (271) +.++.+++ T Consensus 241 ~~~~~~~~ 248 (254) T COG4105 241 KDAYRLLQ 248 (254) T ss_pred HHHHHHCC T ss_conf 56665043 No 4 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=99.90 E-value=5.9e-23 Score=151.96 Aligned_cols=118 Identities=21% Similarity=0.339 Sum_probs=112.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) ++..|+.|...+++|||..|+..|+..+.+||.|+++++|++|+|+++|.++++.+|+..|..+++.||+|+.+|+|++. T Consensus 2 ~~~~y~aa~~~lk~gdY~~A~~~F~~F~~~YP~S~y~pnA~YWLGe~~y~~~~~~~A~~~f~~v~~~yp~s~KaPdALLK 81 (119) T TIGR02795 2 EEAYYDAALLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLK 81 (119) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 16789999999853887899999999988687987671054799999999733999999999998645899860689999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 3444555544311123458999999999998520010134565555 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYV 182 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l 182 (271) +|++..++ +...+|...|++++++||+|+-+..|+.++ T Consensus 82 lg~~~~~~--------g~~~~A~~~l~~V~~~YP~s~aA~~A~~~l 119 (119) T TIGR02795 82 LGMSLQEL--------GDKEKAKATLQQVIKRYPGSSAAKLAQKRL 119 (119) T ss_pred HHHHHHHH--------CCHHHHHHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 99999872--------898899999999985669987889997509 No 5 >PRK10803 hypothetical protein; Provisional Probab=99.88 E-value=8.9e-22 Score=145.08 Aligned_cols=107 Identities=19% Similarity=0.314 Sum_probs=100.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 23458999999999998520010134565555555666666655566666544322345689998865289981159999 Q gi|254781174|r 151 DQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAM 230 (271) Q Consensus 151 d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl 230 (271) ..+...+|+..|+.|++.||+|.|+.+|.++ +|..|+.+++|..|+..|..++++||++..+++|+ T Consensus 154 ~~k~y~~Ai~aF~~Fi~~yP~s~y~~nA~YW--------------LGe~~y~~~~~~~A~~~F~~vv~~yP~s~K~pdAl 219 (262) T PRK10803 154 DKSRQDDAIVAFQNFIKKYPDSTYQPNANYW--------------LGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAM 219 (262) T ss_pred HHCCHHHHHHHHHHHHHHCCCCCCCCCCHHH--------------HHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 7356999999999999878799877430999--------------99999974378999999999998677998678999 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHC Q ss_conf 99999999749879999999999785699978999998609 Q gi|254781174|r 231 ARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETLVK 271 (271) Q Consensus 231 ~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~l~ 271 (271) +.+|.++.++|+.++|+.+++.++.+||+|.-++.|+..|+ T Consensus 220 lklg~~~~~lg~~~~Ak~~l~~vi~~YP~S~aA~lAk~~L~ 260 (262) T PRK10803 220 FKVGVIMQDKGDTAKAKAVYQQVIKKYPGTDGAKQAQKRLN 260 (262) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 99999999859999999999999998859989999999986 No 6 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=99.86 E-value=5.5e-20 Score=134.63 Aligned_cols=165 Identities=17% Similarity=0.161 Sum_probs=137.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 30268998778877654555679999988754011211013555443444555544311123458999999999998520 Q gi|254781174|r 91 GVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYT 170 (271) Q Consensus 91 ~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP 170 (271) ...++..+..|..++..|+|..|+..|+++...||.|+.++.|.++.|.|+|++ +...+|+..+++|++.|| T Consensus 30 ~~~~~~lY~~a~~~~~~~~y~~A~~~fe~l~~~yp~s~~a~~A~l~~a~a~y~~--------~~y~~A~~~~~~Fi~~yP 101 (235) T TIGR03302 30 ERPAEELYNEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKS--------GDYAEAIAAADRFIRLHP 101 (235) T ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHCC T ss_conf 789999999999999878999999999999986899313799999999999976--------439999999999999887 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH--------------HHH Q ss_conf 010134565555555666666655566666544322345689998865289981159999999--------------999 Q gi|254781174|r 171 NSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARL--------------VEA 236 (271) Q Consensus 171 ~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l--------------~~~ 236 (271) +|+.++.|...+..+.-.. +.......+....|+..|+.+++.||+|+++++|..++ |+- T Consensus 102 ~~~~~~ya~y~~a~s~~~~------~~~~~rDq~~~~~A~~~f~~~i~~yP~S~ya~dA~~~i~~l~~~LA~~e~~ia~~ 175 (235) T TIGR03302 102 NHPDADYAYYLRGLSNYKQ------IDDSDRDQTAAREAFEAFQELIRRYPNSEYAPDAKKRMDYLRNRLAGKELYVARF 175 (235) T ss_pred CCCHHHHHHHHHHHHHHHH------CCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7434999999987998873------6654458089999999999999888996579999999999999999999999999 Q ss_pred HHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 997498799999999997856999789999986 Q gi|254781174|r 237 YVALALMDEAREVVSLIQERYPQGYWARYVETL 269 (271) Q Consensus 237 y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~ 269 (271) |.+.|....|...++.+..+||++.+...|--+ T Consensus 176 Y~k~~~y~aAi~r~~~vl~~Yp~t~~~~eAl~~ 208 (235) T TIGR03302 176 YLKRGAYVAAINRFETVVENYPDTPATEEALAR 208 (235) T ss_pred HHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 998417099999999999868998513999999 No 7 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.84 E-value=4.8e-20 Score=135.00 Aligned_cols=115 Identities=22% Similarity=0.339 Sum_probs=91.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 54434445555443111234589999999999985200101345655555556666666555666665443223456899 Q gi|254781174|r 134 YYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRF 213 (271) Q Consensus 134 ~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~ 213 (271) .|..|+.+++. +....|.+.|+.|+++||+|.|+.+|.+++ |+.+|.+|+|..|...| T Consensus 144 ~Y~~A~~~~ks--------gdy~~A~~~F~~fi~~YP~s~~~~nA~yWL--------------Ge~~y~qg~y~~Aa~~f 201 (262) T COG1729 144 LYNAALDLYKS--------GDYAEAEQAFQAFIKKYPNSTYTPNAYYWL--------------GESLYAQGDYEDAAYIF 201 (262) T ss_pred HHHHHHHHHHC--------CCHHHHHHHHHHHHHCCCCCCCCCHHHHHH--------------HHHHHHCCCCHHHHHHH T ss_conf 99999999983--------878899999999997099975560249998--------------89998624616799999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 988652899811599999999999974987999999999978569997899999860 Q gi|254781174|r 214 QLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETLV 270 (271) Q Consensus 214 ~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~l 270 (271) ..++++||+++.++|+++.++.+-..+|..|+|..++..+..+||++.=++.|...+ T Consensus 202 ~~~~k~~P~s~KApdallKlg~~~~~l~~~d~A~atl~qv~k~YP~t~aA~~Ak~~~ 258 (262) T COG1729 202 ARVVKDYPKSPKAPDALLKLGVSLGRLGNTDEACATLQQVIKRYPGTDAAKLAKVAL 258 (262) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 999983899998828999999999982177999999999998789977999999987 No 8 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=99.83 E-value=1.3e-19 Score=132.55 Aligned_cols=117 Identities=21% Similarity=0.391 Sum_probs=101.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 55544344455554431112345899999999999852001013456555555566666665556666654432234568 Q gi|254781174|r 132 YVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIP 211 (271) Q Consensus 132 ~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~ 211 (271) .++|..|...++ .+....|++.|+.|+++||+|+|+.+|++++ |+.|+.+|+|..|+. T Consensus 3 ~~~y~aa~~~lk--------~gdY~~A~~~F~~F~~~YP~S~y~pnA~YWL--------------Ge~~y~~~~~~~A~~ 60 (119) T TIGR02795 3 EAYYDAALLVLK--------AGDYADAIQAFQAFLKKYPKSTYAPNAHYWL--------------GEAYYAQGDYADAAK 60 (119) T ss_pred HHHHHHHHHHHH--------CCCHHHHHHHHHHHHHHCCCCCCCCCHHHHH--------------HHHHHHHHHHHHHHH T ss_conf 678999999985--------3887899999999988687987671054799--------------999999733999999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99988652899811599999999999974987999999999978569997899999860 Q gi|254781174|r 212 RFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETLV 270 (271) Q Consensus 212 ~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~l 270 (271) .|..|+.+||+++.+|+||..||.+..++|+.++|+.+++.|+.+||+|.-.+.|+..| T Consensus 61 ~f~~v~~~yp~s~KaPdALLKlg~~~~~~g~~~~A~~~l~~V~~~YP~s~aA~~A~~~l 119 (119) T TIGR02795 61 AFLAVVKKYPKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKRYPGSSAAKLAQKRL 119 (119) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 99999864589986068999999999872898899999999985669987889997509 No 9 >PRK10803 hypothetical protein; Provisional Probab=99.81 E-value=3.3e-19 Score=130.10 Aligned_cols=120 Identities=16% Similarity=0.226 Sum_probs=104.3 Q ss_pred HHHHHHHHHHH-HHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 89999999999-98098999999999998530477302689987788776545556799999887540112110135554 Q gi|254781174|r 57 QREVYEKAVLF-LKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 57 ~~~lY~~a~~~-~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) ...-|+.|..+ +++++|++|+..|+.++..||.|.+++.|+||+|.+||..++|++|+..|..+++.||+|+.+++|+| T Consensus 141 e~~~Y~~A~~Lvl~~k~y~~Ai~aF~~Fi~~yP~s~y~~nA~YWLGe~~y~~~~~~~A~~~F~~vv~~yP~s~K~pdAll 220 (262) T PRK10803 141 ANTDYNAAIALVQDKSRQDDAIVAFQNFIKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMF 220 (262) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 27789999999997356999999999999878799877430999999999743789999999999986779986789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 4344455554431112345899999999999852001013456555555 Q gi|254781174|r 136 LVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTV 184 (271) Q Consensus 136 ~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~ 184 (271) .+|.|+.+. +...+|...|++++.+||+|+-+..|+.+|.. T Consensus 221 klg~~~~~l--------g~~~~Ak~~l~~vi~~YP~S~aA~lAk~~L~~ 261 (262) T PRK10803 221 KVGVIMQDK--------GDTAKAKAVYQQVIKKYPGTDGAKQAQKRLNA 261 (262) T ss_pred HHHHHHHHC--------CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC T ss_conf 999999985--------99999999999999988599899999999862 No 10 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=99.78 E-value=3.4e-17 Score=118.37 Aligned_cols=171 Identities=16% Similarity=0.169 Sum_probs=138.4 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 02689987788776545556799999887540112110135554434445555443111234589999999999985200 Q gi|254781174|r 92 VARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTN 171 (271) Q Consensus 92 ~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~ 171 (271) ..++..+..|...+..|+|..|+..|+.+..+||.|+.++.|..+.|.++|+. +...+|+..+++|++.||+ T Consensus 30 ~~~~~~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~~A~qA~L~~ayayy~~--------~~y~~A~~~~~rFi~lyP~ 101 (243) T PRK10866 30 NPPNEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKN--------ADLPLAQAAIDRFMRLNPT 101 (243) T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH--------CCHHHHHHHHHHHHHHCCC T ss_conf 99999999999999878999999999999986899767899999999998730--------7789999999999986789 Q ss_pred CHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH-------------- Q ss_conf 1013456555555566666665----5566666544322345689998865289981159999999-------------- Q gi|254781174|r 172 SPYVKGARFYVTVGRNQLAAKE----VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARL-------------- 233 (271) Q Consensus 172 S~ya~~A~~~l~~~~~~La~~e----~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l-------------- 233 (271) |+.++-|......+.-.+.... +.+-..--..+.-..|...|+.+++.||+|+++++|..++ T Consensus 102 ~~~v~Ya~Yl~gl~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~f~~li~~yP~S~Ya~dA~~r~~~lrn~LA~~El~V 181 (243) T PRK10866 102 HPNIDYVMYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFSDFSKLVRGYPNSQYTTDATKRLVFLKDRLAKYEYSV 181 (243) T ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99738999998898774013588887432564307699999999999999878998458999999999999999999999 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 9999974987999999999978569997899999860 Q gi|254781174|r 234 VEAYVALALMDEAREVVSLIQERYPQGYWARYVETLV 270 (271) Q Consensus 234 ~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~l 270 (271) |+=|.+.|....|...++.+..+||++.+...|=-++ T Consensus 182 a~fY~kr~~y~AAi~R~~~vi~~Yp~t~~~~eAL~~l 218 (243) T PRK10866 182 AEYYTKRGAWVAVVNRVEGMLRDYPDTQATRDALPLM 218 (243) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 9999992656999999999997689996339999999 No 11 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=99.75 E-value=4.8e-16 Score=111.63 Aligned_cols=207 Identities=16% Similarity=0.158 Sum_probs=155.9 Q ss_pred HHHHHHHHHCCCCC------CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 99999997226875------333445677767368999999999998098999999999998530477302689987788 Q gi|254781174|r 29 SIAVCFLVGWERQS------SRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAF 102 (271) Q Consensus 29 ~i~~~~l~~Cs~~~------~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~ 102 (271) ++++++++||++.. .......+.+.....+.--++|..|++.|++..|.+.+++.+..-|... +|...+|. T Consensus 3 l~~~~~l~GC~~~~P~~q~~~~~~~~~~~~~~~~a~~r~qLal~Yl~~G~~~~Ak~~l~~AL~~dp~~~---~a~~~lA~ 79 (247) T TIGR02521 3 LVLLLALTGCVTTPPTVQEQEVPESSRTTDREKAAKIRVQLALGYLEQGDLEKAKENLDKALEHDPDDY---DAYLALAL 79 (247) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC---HHHHHHHH T ss_conf 878467851578878977878888755102799999999999999860787899999999987376661---28999999 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH------------------------------HHHHH Q ss_conf 776545--5567999998875401121101355544344455554------------------------------43111 Q gi|254781174|r 103 VQYSAG--KYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMI------------------------------RDVPY 150 (271) Q Consensus 103 ~~y~~~--~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~------------------------------~~~~~ 150 (271) .|...| +-++|...|++-+...|++..+ +..=|.-.+.+. .--.+ T Consensus 80 ~Yq~~grP~~~~A~~~f~~Al~~~p~~g~~---~NNYG~FLC~~g~~~~~~A~~~F~~A~~~p~Y~~~~~~~eN~G~C~~ 156 (247) T TIGR02521 80 YYQQLGRPELEKAEQSFRKALTLKPNNGDV---LNNYGTFLCQQGNDKYEQAEQQFEAAIEDPLYPQPARSYENAGLCAL 156 (247) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCCCCCE---ECHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf 998717995578999999986248998523---20145776404897068999999999866889733024554999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 23458999999999998520010134565555555666666655566666544322345689998865289981159999 Q gi|254781174|r 151 DQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAM 230 (271) Q Consensus 151 d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl 230 (271) .++...+|...|++.+..-|+.+ ..-+++++.++++|+|..|...++.+....-.....+++| T Consensus 157 ~~g~~~~A~~~l~~Al~~dp~~p-----------------~~lleLa~l~~~~g~~~~A~~~l~ry~~~~G~~~~~a~sL 219 (247) T TIGR02521 157 KAGDTEKAEKYLTRALKIDPQRP-----------------ESLLELAELYYERGQYAQARALLERYQQTVGVKNQTAESL 219 (247) T ss_pred HCCCHHHHHHHHHHHHHHCCCCC-----------------HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 61786889999999998587888-----------------5579999998750772899999999998736788887347 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 9999999974987999999999978569 Q gi|254781174|r 231 ARLVEAYVALALMDEAREVVSLIQERYP 258 (271) Q Consensus 231 ~~l~~~y~~lg~~d~A~~~~~~l~~~yP 258 (271) .+.+++-..+|+.+.+..+...|..+|| T Consensus 220 ~Lg~riA~~~Gd~~~~~~y~~qL~~~fP 247 (247) T TIGR02521 220 WLGIRIARALGDVAAAKRYGAQLQKLFP 247 (247) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCCCC T ss_conf 8999999984429999999998500279 No 12 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=99.72 E-value=1e-14 Score=103.88 Aligned_cols=228 Identities=14% Similarity=0.125 Sum_probs=175.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 99999999999722687533344567776736899999999999809899999999999853047730268998778877 Q gi|254781174|r 25 TIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQ 104 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~ 104 (271) ++..+..+++|+|||+.. ..++++++..|..+.++|++..||..++.++..-|.. .+|.|.+|.+| T Consensus 8 ~~~~~~a~~lLsaCsGg~-----------~~~~~~l~~~Ak~yl~~~~~~aAiI~LKNAL~~~p~~---~EARflLg~~Y 73 (924) T TIGR02917 8 LTILLAAILLLSACSGGD-----------QLSPEELIEEAKKYLQKNKYKAAIIQLKNALQKDPND---AEARFLLGKIY 73 (924) T ss_pred HHHHHHHHHHHHHCCCCC-----------CCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCC---HHHHHHHHHHH T ss_conf 999999999986305788-----------8896899999999986189972267757877408674---78899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHH-H---HHHHHHHHHHHHHH----------------------HHHHHHHHHHHHH Q ss_conf 65455567999998875401121101-3---55544344455554----------------------4311123458999 Q gi|254781174|r 105 YSAGKYQQAASLGEEYITQYPESKNV-D---YVYYLVGMSYAQMI----------------------RDVPYDQRATKLM 158 (271) Q Consensus 105 y~~~~y~~A~~~~~~fi~~~P~s~~~-~---~A~y~~a~~~~~~~----------------------~~~~~d~~~~~~A 158 (271) -..|++..|...++|..+.-+....+ + .++.+.|- +++. .......+..++| T Consensus 74 ~~~Gd~~~AEKEL~kA~~lg~~~~~~~p~LAra~l~~~~--~~~~~~~~~~~~~~~~~~~a~ll~~~G~A~L~~g~~~~A 151 (924) T TIGR02917 74 LAQGDAAAAEKELRKALSLGYPKEQVLPLLARAYLLQGK--FQQVLDELEAKTSLDDEEAAELLALRGLAYLGLGELEEA 151 (924) T ss_pred HHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCC--HHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 984598888899999985689986553999999985595--789998547887787066999999999998850787899 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHH--HHHHH-----------------HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999852001013456555555566--66666-----------------555666665443223456899988652 Q gi|254781174|r 159 LQYMSRIVERYTNSPYVKGARFYVTVGRN--QLAAK-----------------EVEIGRYYLKRGEYVAAIPRFQLVLAN 219 (271) Q Consensus 159 ~~~f~~~i~~yP~S~ya~~A~~~l~~~~~--~La~~-----------------e~~ia~~Y~~~~~y~aA~~~~~~~i~~ 219 (271) ...|..-...-|.+.++.-+..++...++ .+.+- .+-.|++-+.+|++..|...|+.++.- T Consensus 152 ~~~~~~Al~~~p~~~~A~lglA~~~~~~~ss~~~~a~~~~d~vL~~~P~~~dA~~L~g~l~~~~~~~~~A~~~y~ka~~~ 231 (924) T TIGR02917 152 QKSYEQALAIAPRSLDAKLGLARLALAENSSKLDEARALIDEVLTADPKNVDALLLKGDLLLSSGNIEQALAAYRKAIEL 231 (924) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 99999987634027358999999999733778788999999997328752227999999999727768999999999963 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHC Q ss_conf 8998115999999999999749879999999999785699978999998609 Q gi|254781174|r 220 YSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETLVK 271 (271) Q Consensus 220 yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~l~ 271 (271) -|+.+ ..++.++.+-...|.+|+|.+..+.+...+|++....+.+-+|+ T Consensus 232 ~P~~~---~~~~~~~~~~l~~~~~e~A~~~~d~~~k~~p~~p~a~y~kAlv~ 280 (924) T TIGR02917 232 RPNNI---AVLLALATILLEAGEFEEAEKDADALLKKAPNSPQAAYLKALVD 280 (924) T ss_pred CCCCH---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 88874---78999999999736667889999999998477846899999999 No 13 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.68 E-value=5.8e-16 Score=111.17 Aligned_cols=119 Identities=18% Similarity=0.305 Sum_probs=111.1 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 68999999999998098999999999998530477302689987788776545556799999887540112110135554 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) ++..+|+.|..++..|+|..|...|...+..||+|.+++.|+||+|.++|.+|+|+.|...|.+.++.||+|+.+|++++ T Consensus 140 ~~~~~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdall 219 (262) T COG1729 140 PATKLYNAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALL 219 (262) T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 22689999999998387889999999999709997556024999889998624616799999999983899998828999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 43444555544311123458999999999998520010134565555 Q gi|254781174|r 136 LVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYV 182 (271) Q Consensus 136 ~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l 182 (271) .+|+|... .+++.+|...+++++.+||+++-+..|+.++ T Consensus 220 Klg~~~~~--------l~~~d~A~atl~qv~k~YP~t~aA~~Ak~~~ 258 (262) T COG1729 220 KLGVSLGR--------LGNTDEACATLQQVIKRYPGTDAAKLAKVAL 258 (262) T ss_pred HHHHHHHH--------HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 99999998--------2177999999999998789977999999987 No 14 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=99.66 E-value=1.4e-14 Score=103.07 Aligned_cols=165 Identities=16% Similarity=0.167 Sum_probs=94.5 Q ss_pred CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHH Q ss_conf 7673689999999999980989999999999985304773026899877887765455--------56799999887540 Q gi|254781174|r 52 TDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGK--------YQQAASLGEEYITQ 123 (271) Q Consensus 52 ~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~--------y~~A~~~~~~fi~~ 123 (271) ....+++.+...+..++++++|++|+...++.+..||+|+-++-+.++.|.+++..-+ -.+|+..++.||.+ T Consensus 66 ~~~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~YlkgLs~~~~i~~~~rDq~~~~~A~~~f~~~i~r 145 (254) T COG4105 66 FSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADYAYYLKGLSYFFQIDDVTRDQSAARAAFAAFKELVQR 145 (254) T ss_pred CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 98112899999999988135589999999999987899987169999999987526775444779999999999999987 Q ss_pred CCCCCHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1121101355544344455------5544311123458999999999998520010134565555555666666655566 Q gi|254781174|r 124 YPESKNVDYVYYLVGMSYA------QMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIG 197 (271) Q Consensus 124 ~P~s~~~~~A~y~~a~~~~------~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia 197 (271) ||+|+.+++|..++....- .++.......+....|+..++.++++||+++.+.+|..++ . T Consensus 146 yPnS~Ya~dA~~~i~~~~d~LA~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~t~~~~eaL~~l--------------~ 211 (254) T COG4105 146 YPNSRYAPDAKARIVKLNDALAGHEMAIARYYLKRGAYVAAINRFEEVLENYPDTSAVREALARL--------------E 211 (254) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHH--------------H T ss_conf 88970235599999999999889999999999990576999999999987145552047789999--------------9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 666544322345689998865289981159999 Q gi|254781174|r 198 RYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAM 230 (271) Q Consensus 198 ~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl 230 (271) ..|+..|....|-+.-+-+-.|||+|++..++. T Consensus 212 eaY~~lgl~~~a~~~~~vl~~N~p~s~~~~~~~ 244 (254) T COG4105 212 EAYYALGLTDEAKKTAKVLGANYPDSQWYKDAY 244 (254) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 999982872789999999973689882035666 No 15 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=99.60 E-value=4.5e-13 Score=94.35 Aligned_cols=162 Identities=15% Similarity=0.197 Sum_probs=91.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH------------H--------------HHHH Q ss_conf 8998778877654555679999988754011211013555443444555------------5--------------4431 Q gi|254781174|r 95 KSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQ------------M--------------IRDV 148 (271) Q Consensus 95 ~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~------------~--------------~~~~ 148 (271) +|.+.++..|-+.++.+.|+...++..+..|++|.+ +.++|.++.. + ...+ T Consensus 452 ~a~~~L~~~yL~~~~~dKAla~~~~l~~~~pd~p~~---~nl~G~i~l~~~~~~kA~~~f~~AL~i~p~~~~A~~nLA~~ 528 (924) T TIGR02917 452 RADLLLILSYLRSGEFDKALAAAKKLEKKQPDNPSL---HNLLGAIYLGKKDLAKAREAFEKALKIEPDFFPAAANLARI 528 (924) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH---HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 478999998874204899999999997528898268---99999999851787899999998753187784789999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 11234589999999999985200101345655555556666-----------------6665556666654432234568 Q gi|254781174|r 149 PYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQL-----------------AAKEVEIGRYYLKRGEYVAAIP 211 (271) Q Consensus 149 ~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~L-----------------a~~e~~ia~~Y~~~~~y~aA~~ 211 (271) +.+.+....|++.|+.++...|++--+..+...+...++.- -..-+.++.+|...|+...|+. T Consensus 529 d~~~~~~~~A~~r~~~~l~~~P~~~~a~~~La~L~~~~g~~~~~~~~l~~A~~~~~~~~~p~l~L~~~yl~~~~l~kAl~ 608 (924) T TIGR02917 529 DLQEGNPDDAKQRFEKVLTIDPKNLRALLALAELAARTGNEEEAVAWLEKAAEKNPKEIEPALALAQYYLSKGELKKALA 608 (924) T ss_pred HHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 87428968999999999875743379999999999851896899999999986281777789999999986589878999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHH Q ss_conf 999886528998115999999999999749879999999999785699978 Q gi|254781174|r 212 RFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYW 262 (271) Q Consensus 212 ~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~ 262 (271) -.+.+++..|+++ +++-.||.+|...|+.+.|...++.|...-|++.- T Consensus 609 ~l~~~~~~~~~~~---~~~~~Lg~~ql~~~d~~~A~~~f~Kl~~~~P~~~~ 656 (924) T TIGR02917 609 ILNEAADLAPDSP---EALELLGRAQLAAGDLNKAVESFKKLLELQPDSAL 656 (924) T ss_pred HHHHHHHHCCCCH---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCH T ss_conf 9999987314757---89999999999732467799999999986888723 No 16 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=99.58 E-value=1.6e-12 Score=91.11 Aligned_cols=210 Identities=14% Similarity=0.110 Sum_probs=147.3 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99999972268753334456777673689999999999980989999999999985304773026899877887765455 Q gi|254781174|r 30 IAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGK 109 (271) Q Consensus 30 i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~ 109 (271) |+.++.++|++...+.. ..+.....+..-+.|..+++.|++..|.+.+++++..-|.+. .++.-+|..|...|+ T Consensus 11 v~~La~~~cvt~~~~~~---~~~~~~aa~arlqLal~YL~~gd~~~A~~nlekAL~~DPs~~---~a~~~~A~~Yq~~Ge 84 (250) T COG3063 11 VFSLALGACVTDPAPSR---QTDRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYY---LAHLVRAHYYQKLGE 84 (250) T ss_pred HHHHHHHCCCCCCCCCC---CCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCC T ss_conf 99876320058997533---225889999999999999977899999988999988495208---899999999999077 Q ss_pred HHHHHHHHHHHHHHCCCCCHHH--HHHHHHHHHHHHH--------HHHH---------------HHHHHHHHHHHHHHHH Q ss_conf 5679999988754011211013--5554434445555--------4431---------------1123458999999999 Q gi|254781174|r 110 YQQAASLGEEYITQYPESKNVD--YVYYLVGMSYAQM--------IRDV---------------PYDQRATKLMLQYMSR 164 (271) Q Consensus 110 y~~A~~~~~~fi~~~P~s~~~~--~A~y~~a~~~~~~--------~~~~---------------~~d~~~~~~A~~~f~~ 164 (271) .+.|...|++.++..|++-++- |..|+.+.-.+.+ +.++ ...++....|.+.|++ T Consensus 85 ~~~A~e~YrkAlsl~p~~GdVLNNYG~FLC~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~r 164 (250) T COG3063 85 NDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKR 164 (250) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 55679999999852988640552106999708986899999999871888777530454457888643770447999999 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHH Q ss_conf 99852001013456555555566666665556666654432234568999886528998115999999999999749879 Q gi|254781174|r 165 IVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMD 244 (271) Q Consensus 165 ~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d 244 (271) -+..-|+-+.+ -.+.++..++.|+|..|.-++..+....+ ...+.|...+++-..+|+.+ T Consensus 165 aL~~dp~~~~~-----------------~l~~a~~~~~~~~y~~Ar~~~~~~~~~~~---~~A~sL~L~iriak~~gd~~ 224 (250) T COG3063 165 ALELDPQFPPA-----------------LLELARLHYKAGDYAPARLYLERYQQRGG---AQAESLLLGIRIAKRLGDRA 224 (250) T ss_pred HHHHCCCCCHH-----------------HHHHHHHHHHCCCCHHHHHHHHHHHHCCC---CCHHHHHHHHHHHHHHCCHH T ss_conf 98837578767-----------------89987988753130189999999970256---65899998999999965488 Q ss_pred HHHHHHHHHHHHCCCCHHHHH Q ss_conf 999999999785699978999 Q gi|254781174|r 245 EAREVVSLIQERYPQGYWARY 265 (271) Q Consensus 245 ~A~~~~~~l~~~yP~s~~~~~ 265 (271) .+..+-..|..+||+|..+.- T Consensus 225 ~a~~Y~~qL~r~fP~s~e~q~ 245 (250) T COG3063 225 AAQRYQAQLQRLFPYSEEYQT 245 (250) T ss_pred HHHHHHHHHHHHCCCCHHHHH T ss_conf 999999999986899677776 No 17 >PRK11788 hypothetical protein; Provisional Probab=99.53 E-value=9.9e-12 Score=86.49 Aligned_cols=201 Identities=20% Similarity=0.254 Sum_probs=129.6 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH--HHHHHHH Q ss_conf 99999999998098999999999998530477302689987788776545556799999887540112110--1355544 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN--VDYVYYL 136 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~--~~~A~y~ 136 (271) .-|-+|..++=+.+.++|++.|.+++...|.+ -+.++-+|..+-+.|+.+.||..=+..+.+ |+-+. ...|.+- T Consensus 37 ~~Y~~Gln~Ll~~q~dkAid~f~~~~~~~~~t---~e~hlaLG~LfRrrGE~dRAIriHq~Ll~r-p~L~~~~r~~a~~E 112 (389) T PRK11788 37 RDYFKGLNFLLNEQPDKAIDLFIEMLKVDPET---VELHLALGNLFRRRGEVDRAIRIHQNLLSR-PDLTREQRLLALQE 112 (389) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHC-CCCCHHHHHHHHHH T ss_conf 99998998872589459999999998648114---999999999999656088999999999728-79899999999999 Q ss_pred HHHHHHHH------------HHH--------------HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH-- Q ss_conf 34445555------------443--------------11123458999999999998520010134565555555666-- Q gi|254781174|r 137 VGMSYAQM------------IRD--------------VPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQ-- 188 (271) Q Consensus 137 ~a~~~~~~------------~~~--------------~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~-- 188 (271) .|.-|+.. ..+ +.-..+.-.+|++..+.+...-..+....-|.+.-....+. T Consensus 113 La~Dy~~aGllDRAE~~f~~l~~~~~~~~~al~~Ll~iye~~~dW~~Ai~~a~~L~~~~~~~~~~~iAhy~CElA~~~~~ 192 (389) T PRK11788 113 LGQDYLKAGLLDRAEELFLQLVDEDDFAEGALRQLLTIYQQEKDWQKAIEVAERLEKLGGDSLRKEIAHFYCELAQQALA 192 (389) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 99999867749999999999982854269999999999998645999999999998704765069999999999999987 Q ss_pred --------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf --------------------666655566666544322345689998865289981159999999999997498799999 Q gi|254781174|r 189 --------------------LAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEARE 248 (271) Q Consensus 189 --------------------La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~ 248 (271) -++-.+..|+.+...|+|.+|+..++.+.+. +..+.++++-.|.++|..+|..++..+ T Consensus 193 ~~~~~~A~~~L~kAL~~dp~cvRA~lllg~l~~~~g~~~~Ai~~l~~i~~q--~p~~l~~vl~~L~~~y~~l~~~~~~~~ 270 (389) T PRK11788 193 RGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQ--DPEYLPEVLPKLMECYQALGDEAEGLE 270 (389) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH--CHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 489999999999998748243418999999999879999999999999973--957799999999999998288889999 Q ss_pred HHHHHHHHCCCCHHHHH Q ss_conf 99999785699978999 Q gi|254781174|r 249 VVSLIQERYPQGYWARY 265 (271) Q Consensus 249 ~~~~l~~~yP~s~~~~~ 265 (271) ..+.+...+|+....-. T Consensus 271 ~L~~~~~~~~~~~l~l~ 287 (389) T PRK11788 271 FLRRAVEEYPGADLLLA 287 (389) T ss_pred HHHHHHHHCCCCHHHHH T ss_conf 99999985899359999 No 18 >PRK11189 lipoprotein NlpI; Provisional Probab=99.47 E-value=3e-11 Score=83.69 Aligned_cols=218 Identities=20% Similarity=0.192 Sum_probs=152.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCC---CC-----------------------CCCCCCCCHHHHHHHHHHHHHHCCHHH Q ss_conf 9999999999999972268753334---45-----------------------677767368999999999998098999 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWERQSSRDV---YL-----------------------DSVTDVRYQREVYEKAVLFLKEQNFSK 75 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~~~~~~~---~~-----------------------~~~~~~~~~~~lY~~a~~~~~~~~y~~ 75 (271) ++...+++++.+.++||++++.... .. ...++..-++-.|+.|.-+-.-|-..- T Consensus 4 ~~r~~~~~~~~l~LsGC~s~~~~~~~~~~~a~plqp~~q~Ev~laRl~qlL~~~~Lt~e~rA~l~yeRGvlyDSlGLr~L 83 (297) T PRK11189 4 FLRWCFVATAALLLAGCASSNTSWRKSEVLAVPLQPSYQQEVILARLNQILASRDLTDEERAQLHYERGVLYDSLGLRAL 83 (297) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHH T ss_conf 89999999999999864058876444865144678779999999999999743458989999999983147764120999 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999853047730268998778877654555679999988754011211013555443444555544311123458 Q gi|254781174|r 76 AYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRAT 155 (271) Q Consensus 76 A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~ 155 (271) |.-.|.+.+..-|.- +++.-.+|.-+-..++|+.|...|+..++..|++ +||+..+|++.|-. +.. T Consensus 84 AR~DF~qAL~l~P~~---adayN~LGiy~tq~~~fD~AyEAFDs~LEL~P~y---~YA~lNRGIAlYYg--------~R~ 149 (297) T PRK11189 84 ARNDFSQALALRPDM---ADAYNYLGIYLTQEGEFDAAYEAFDSVLELDPTY---DYAYLNRGIALYYG--------GRY 149 (297) T ss_pred HHHHHHHHHHCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHC--------CCH T ss_conf 986499897539985---8999999999999711999999987777269873---48998302788744--------637 Q ss_pred HHHHHHHHHHHHHHHCCHHHH-------------HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999998520010134-------------56555555566666665--556666654432234568999886528 Q gi|254781174|r 156 KLMLQYMSRIVERYTNSPYVK-------------GARFYVTVGRNQLAAKE--VEIGRYYLKRGEYVAAIPRFQLVLANY 220 (271) Q Consensus 156 ~~A~~~f~~~i~~yP~S~ya~-------------~A~~~l~~~~~~La~~e--~~ia~~Y~~~~~y~aA~~~~~~~i~~y 220 (271) ..|...+..+.+.-|+.||.. +|...+..-...+-... ..|.++|..+..-.+...+...-+++ T Consensus 150 ~LA~~D~~~fy~~dp~DpYR~LWLYl~e~~~d~~~A~~~L~~r~~~~~~~~W~~~iV~~ylg~ise~~~~~~~~~~~~~- 228 (297) T PRK11189 150 DLAQDDLLAFYQDDPNDPYRALWLYLIEQKLDPKQAKENLKQRYEKLDKEQWGWNIVDFYLGKISEEQLFERLKAGATD- 228 (297) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC- T ss_conf 8899999999960999827999999998116999999999999983571226899999994589999999999872737- Q ss_pred CCCHHH---HHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 998115---99999999999974987999999999978 Q gi|254781174|r 221 SDAEHA---EEAMARLVEAYVALALMDEAREVVSLIQE 255 (271) Q Consensus 221 p~t~~~---~eAl~~l~~~y~~lg~~d~A~~~~~~l~~ 255 (271) ++..+ -|+.+.|++.+...|+.++|..+++.-.. T Consensus 229 -n~~lAErLcEaYFYLaK~~~~~G~~~~A~~~fKLAla 265 (297) T PRK11189 229 -NTELAERLCETYFYLAKYYLSLGDLDEAANLFKLALA 265 (297) T ss_pred -CHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf -0999999999999999999987998999999999870 No 19 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=99.43 E-value=8.1e-11 Score=81.17 Aligned_cols=198 Identities=20% Similarity=0.246 Sum_probs=114.8 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHH Q ss_conf 999999998098999999999998530477302689987788776545556799999887540112110--135554434 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN--VDYVYYLVG 138 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~--~~~A~y~~a 138 (271) |-.|..++-+.+-++|++.|.++...-|. .-++++-+|..+-+.|+-+.||..-+..++. |+-+. -.-|.+..| T Consensus 39 Yv~GlNfLLs~Q~dKAvdlF~e~l~~d~~---t~e~~ltLGnLfRsRGEvDRAIRiHQ~L~~s-pdlT~~qr~lAl~qL~ 114 (389) T COG2956 39 YVKGLNFLLSNQPDKAVDLFLEMLQEDPE---TFEAHLTLGNLFRSRGEVDRAIRIHQTLLES-PDLTFEQRLLALQQLG 114 (389) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCCCH---HHHHHHHHHHHHHHCCHHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHH T ss_conf 88678898616963699999999835815---6789889888988635088999999988528-9986689999999998 Q ss_pred HHHHH------------HHHH--------------HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH--- Q ss_conf 44555------------5443--------------111234589999999999985200101345655555556666--- Q gi|254781174|r 139 MSYAQ------------MIRD--------------VPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQL--- 189 (271) Q Consensus 139 ~~~~~------------~~~~--------------~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~L--- 189 (271) .-|+. +..+ +.---+...+|++.-+++...-|+.--..-|++.-......+ T Consensus 115 ~Dym~aGl~DRAE~~f~~L~de~efa~~AlqqLl~IYQ~treW~KAIe~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~ 194 (389) T COG2956 115 RDYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASS 194 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99998564668999999973463432899999999999836899999999999971885212589999999999974031 Q ss_pred -------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf -------------------6665556666654432234568999886528998115999999999999749879999999 Q gi|254781174|r 190 -------------------AAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVV 250 (271) Q Consensus 190 -------------------a~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~ 250 (271) .+-.+..|+.+..+|+|.+|+.+++.+++. +..+.++.+-.|.++|..+|..++...+. T Consensus 195 ~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~~Q--n~~yl~evl~~L~~~Y~~lg~~~~~~~fL 272 (389) T COG2956 195 DVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQ--NPEYLSEVLEMLYECYAQLGKPAEGLNFL 272 (389) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH--CHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 1899999999998628200103545438887526669999999999984--95779999999999999947877899999 Q ss_pred HHHHHHCCCCHHHH Q ss_conf 99978569997899 Q gi|254781174|r 251 SLIQERYPQGYWAR 264 (271) Q Consensus 251 ~~l~~~yP~s~~~~ 264 (271) ..-...+|+..--- T Consensus 273 ~~~~~~~~g~~~~l 286 (389) T COG2956 273 RRAMETNTGADAEL 286 (389) T ss_pred HHHHHCCCCCCHHH T ss_conf 99986447851999 No 20 >KOG2002 consensus Probab=99.38 E-value=8e-11 Score=81.20 Aligned_cols=194 Identities=16% Similarity=0.194 Sum_probs=120.9 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999999999809899999999999853047730268998778877654555679999988754011211013555443 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLV 137 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~ 137 (271) -.+.-+|...|++|+|..|+..|+.++...|... ++....+|.|+.++++-+.|+..+++++.+.|+.-.+ ++ T Consensus 165 l~LlGkA~i~ynkkdY~~al~yyk~al~inp~~~--aD~rIgig~Cf~kl~~~~~a~~a~~ralqLdp~~v~a-----lv 237 (1018) T KOG2002 165 LALLGKARIAYNKKDYRGALKYYKKALRINPACK--ADVRIGIGHCFWKLGMSEKALLAFERALQLDPTCVSA-----LV 237 (1018) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCC--CCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCHHHHHH-----HH T ss_conf 8899899999612138999999999985086667--8763114667875034366799999998507155999-----99 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH--------------------HHHHHHHHHHHHHHH Q ss_conf 444555544311123458999999999998520010134565555--------------------555666666655566 Q gi|254781174|r 138 GMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYV--------------------TVGRNQLAAKEVEIG 197 (271) Q Consensus 138 a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l--------------------~~~~~~La~~e~~ia 197 (271) |+..+... ..|......+...++......|..|-+..-.... .......|+.-|.+| T Consensus 238 ~L~~~~l~---~~d~~s~~~~~~ll~~ay~~n~~nP~~l~~LAn~fyfK~dy~~v~~la~~ai~~t~~~~~~aes~Y~~g 314 (1018) T KOG2002 238 ALGEVDLN---FNDSDSYKKGVQLLQRAYKENNENPVALNHLANHFYFKKDYERVWHLAEHAIKNTENKSIKAESFYQLG 314 (1018) T ss_pred HHHHHHHH---CCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999987---015677888999988887606899579999977774135378999999999986101477778899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHH Q ss_conf 666544322345689998865289981159999999999997498799999999997856999789 Q gi|254781174|r 198 RYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWA 263 (271) Q Consensus 198 ~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~ 263 (271) +.|+.+|+|..|-.+|...+.--|+. -.-+++.+|..|...|+.+.|+..+..+...+|+.... T Consensus 315 Rs~Ha~Gd~ekA~~yY~~s~k~~~d~--~~l~~~GlgQm~i~~~dle~s~~~fEkv~k~~p~~~et 378 (1018) T KOG2002 315 RSYHAQGDFEKAFKYYMESLKADNDN--FVLPLVGLGQMYIKRGDLEESKFCFEKVLKQLPNNYET 378 (1018) T ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCC--CCCCCCCHHHHHHHHCHHHHHHHHHHHHHHHCCCHHHH T ss_conf 99886226899999999997248887--62001212578987132778899999999867650889 No 21 >KOG4626 consensus Probab=99.26 E-value=2.7e-10 Score=78.15 Aligned_cols=180 Identities=13% Similarity=0.122 Sum_probs=115.4 Q ss_pred CCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 67368999999999998098999999999998530477302689987788776545556799999887540112110135 Q gi|254781174|r 53 DVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDY 132 (271) Q Consensus 53 ~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~ 132 (271) +...+.+.++.|..+-+.+.|+.|+..|++....-|.+. .+.-++|-+||.+|.-+.||..|++.++.-|+-|. T Consensus 248 dP~f~dAYiNLGnV~ke~~~~d~Avs~Y~rAl~lrpn~A---~a~gNla~iYyeqG~ldlAI~~Ykral~~~P~F~~--- 321 (966) T KOG4626 248 DPNFLDAYINLGNVYKEARIFDRAVSCYLRALNLRPNHA---VAHGNLACIYYEQGLLDLAIDTYKRALELQPNFPD--- 321 (966) T ss_pred CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCH---HHCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCHH--- T ss_conf 976527786588999987220378999999974297303---12043479973144278889999998703999567--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH--------------------HHHHHH Q ss_conf 554434445555443111234589999999999985200101345655555556--------------------666666 Q gi|254781174|r 133 VYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGR--------------------NQLAAK 192 (271) Q Consensus 133 A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~--------------------~~La~~ 192 (271) |+-..|.+ -+|.+...+|...+.+.+..-|++. ||...+..+. ..+|+- T Consensus 322 Ay~NlanA--------Lkd~G~V~ea~~cYnkaL~l~p~ha---dam~NLgni~~E~~~~e~A~~ly~~al~v~p~~aaa 390 (966) T KOG4626 322 AYNNLANA--------LKDKGSVTEAVDCYNKALRLCPNHA---DAMNNLGNIYREQGKIEEATRLYLKALEVFPEFAAA 390 (966) T ss_pred HHHHHHHH--------HHHCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCHHHHHH T ss_conf 77579999--------7750546799999999987587637---788879899988612218899999998638024556 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 555666665443223456899988652899811599999999999974987999999999 Q gi|254781174|r 193 EVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 193 e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) .-.+|..|-++|++..|+..|+++++--|.. .+|+-.+|.+|..+|+.+.|.+-+.. T Consensus 391 ~nNLa~i~kqqgnl~~Ai~~YkealrI~P~f---Ada~~NmGnt~ke~g~v~~A~q~y~r 447 (966) T KOG4626 391 HNNLASIYKQQGNLDDAIMCYKEALRIKPTF---ADALSNMGNTYKEMGDVSAAIQCYTR 447 (966) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCHH---HHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 4229999996054778999999887149407---89987504277886417889999999 No 22 >KOG3785 consensus Probab=99.21 E-value=5.2e-09 Score=70.64 Aligned_cols=182 Identities=14% Similarity=0.135 Sum_probs=125.3 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 99999809899999999999853047730268998778877654555679999988754011211013555443444555 Q gi|254781174|r 64 AVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQ 143 (271) Q Consensus 64 a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~ 143 (271) -...+.+.+|..|+..++--. .-+.+...+..+|+|.|+|.+|+|++|...|+-.... +.+.. +.....|.|++- T Consensus 29 Ledfls~rDytGAislLefk~--~~~~EEE~~~~lWia~C~fhLgdY~~Al~~Y~~~~~~--~~~~~-el~vnLAcc~Fy 103 (557) T KOG3785 29 LEDFLSNRDYTGAISLLEFKL--NLDREEEDSLQLWIAHCYFHLGDYEEALNVYTFLMNK--DDAPA-ELGVNLACCKFY 103 (557) T ss_pred HHHHHHCCCCHHHHHHHHHHH--CCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC--CCCCC-CCCHHHHHHHHH T ss_conf 888875255145899998860--4441446789999999887521399999999988536--79970-102528999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH----------HHHHH----HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 544311123458999999999998520010134565----------55555----5666666655566666544322345 Q gi|254781174|r 144 MIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGAR----------FYVTV----GRNQLAAKEVEIGRYYLKRGEYVAA 209 (271) Q Consensus 144 ~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~----------~~l~~----~~~~La~~e~~ia~~Y~~~~~y~aA 209 (271) . +...+|.. +..+-|++|.-..-. .++.. +++.+ +..+.+|...+.|-.|..| T Consensus 104 L--------g~Y~eA~~----~~~ka~k~pL~~RLlfhlahklndEk~~~~fh~~LqD~~-EdqLSLAsvhYmR~HYQeA 170 (557) T KOG3785 104 L--------GQYIEAKS----IAEKAPKTPLCIRLLFHLAHKLNDEKRILTFHSSLQDTL-EDQLSLASVHYMRMHYQEA 170 (557) T ss_pred H--------HHHHHHHH----HHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH T ss_conf 9--------98999999----986189875999999999987176889999998875467-8777677779999989999 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 689998865289981159999999999997498799999999997856999789999 Q gi|254781174|r 210 IPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYV 266 (271) Q Consensus 210 ~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a 266 (271) |..|+.++.+-|+- .-....++-+|+++..+|-+.+..++...++|||..++.- T Consensus 171 IdvYkrvL~dn~ey---~alNVy~ALCyyKlDYydvsqevl~vYL~q~pdStiA~NL 224 (557) T KOG3785 171 IDVYKRVLQDNPEY---IALNVYMALCYYKLDYYDVSQEVLKVYLRQFPDSTIAKNL 224 (557) T ss_pred HHHHHHHHHCCHHH---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 99999997048636---4567889999985561666899999999968995888889 No 23 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=99.19 E-value=4.7e-10 Score=76.74 Aligned_cols=120 Identities=12% Similarity=0.078 Sum_probs=94.5 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +-..|+.|+.+-..|+|.+|++.|++.+..-|...-...+++++|.+|...|++++|+..|.+.|+..|+.+. |+.. T Consensus 35 A~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le~dp~D~s~ilyNiGlIY~~~G~~~kAie~Y~kALeiNP~~~~---AlNN 111 (172) T PRK02603 35 AFVYYRDGMSAQNDGDYAEALENYEEALKLEEDPNDRSYILYNMAIIYASNGEHDKALETYHQALELNPKQPS---ALNN 111 (172) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHH---HHHH T ss_conf 9999972257774579999999999987277896208999977999999869988999999999986899789---9999 Q ss_pred HHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 34445555443------1112345899999999999852001013456555 Q gi|254781174|r 137 VGMSYAQMIRD------VPYDQRATKLMLQYMSRIVERYTNSPYVKGARFY 181 (271) Q Consensus 137 ~a~~~~~~~~~------~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~ 181 (271) +|++|..+... .+......++|.+.+.+.|+.-|+.= .+|+.. T Consensus 112 lgvIY~k~G~~a~~~~~~~~~~~~fdkAa~yw~kAI~l~P~ny--i~aqn~ 160 (172) T PRK02603 112 IAVIYHKRGRKAEEAGDQDEAEAWFDKAAEYWKQAIRLAPNNY--IEAQNW 160 (172) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCH--HHHHHH T ss_conf 9999999859999845877899999999999999885099870--999999 No 24 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=99.18 E-value=1.7e-08 Score=67.59 Aligned_cols=69 Identities=14% Similarity=0.094 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 666544322345689998865289981159999999999997498799999999997856999789999986 Q gi|254781174|r 198 RYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETL 269 (271) Q Consensus 198 ~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~ 269 (271) +-...+|+...|+..++..+...|+.++. .+.|+.+|..+|..++|......+..+.|++....+|.-| T Consensus 470 ~~~~~~Gd~~~A~~~l~qa~~~~P~dpWl---R~~LAr~~~~~G~~~~A~~lM~~l~~~~p~~pe~~yA~AL 538 (1158) T PRK11447 470 EALENQGKWAQAAELQRQRLALDPGSVWI---TYRLAQDLWQAGQRSQADALMRRLAQQKPNDPEQVYAYGL 538 (1158) T ss_pred HHHHHCCCHHHHHHHHHHHHHHCCCCCHH---HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 99997889999999999999718997049---9999999997799889999999998449999899999999 No 25 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=99.18 E-value=1.9e-08 Score=67.32 Aligned_cols=213 Identities=17% Similarity=0.200 Sum_probs=129.7 Q ss_pred CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHH-------------HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 76736899999999999809899999999999853047730268-------------99877887765455567999998 Q gi|254781174|r 52 TDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARK-------------SLLMSAFVQYSAGKYQQAASLGE 118 (271) Q Consensus 52 ~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~-------------A~~~la~~~y~~~~y~~A~~~~~ 118 (271) .+...|+.+|..+...++.|+-++|...+.++....|+|+-..+ ..+..|..+=..|++++|+..|+ T Consensus 58 IdPnnpeaL~~~~~~~lr~Gd~~~A~~~L~rL~~~aP~s~~~~~l~~~~~l~~~~~~~~LqqARlLA~sGr~~eA~~~y~ 137 (1158) T PRK11447 58 IDPNNPDVIAARFRLLLRQGDSDGAQKLLDRLSQLAPSSTAYRSSRTTMLLSTPEGRQALQQARLLATTGHTEEALASYD 137 (1158) T ss_pred CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 59999899999999998689999999999999864889879999999987438644999999999987498699999999 Q ss_pred HH-----------------------------------HHHCCCCCHHHHHHHH---------HHHHHHHHHHHHH--HH- Q ss_conf 87-----------------------------------5401121101355544---------3444555544311--12- Q gi|254781174|r 119 EY-----------------------------------ITQYPESKNVDYVYYL---------VGMSYAQMIRDVP--YD- 151 (271) Q Consensus 119 ~f-----------------------------------i~~~P~s~~~~~A~y~---------~a~~~~~~~~~~~--~d- 151 (271) ++ ...||+++.+.-|+=. -|+....++.... ++ T Consensus 138 ~LF~g~~P~~~LalEYyq~~A~~p~~~~~Ai~~L~~l~~~~P~d~~~~lALA~~L~~~~tr~egl~~L~~La~~p~~~~~ 217 (1158) T PRK11447 138 KLFNGAPPEGELAVEYWRTVAKLPARRPEAINQLQRLNAQYPGNTGLRNALALLLFASGRRDEGFAVLEQMAKSPAGRDA 217 (1158) T ss_pred HHCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHH T ss_conf 96699997868899999996269543699999999999868798799999999874257668999999998549852579 Q ss_pred -----------HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf -----------3458999999999998520010134565555555666666655---56666654432234568999886 Q gi|254781174|r 152 -----------QRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEV---EIGRYYLKRGEYVAAIPRFQLVL 217 (271) Q Consensus 152 -----------~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~---~ia~~Y~~~~~y~aA~~~~~~~i 217 (271) +.....-...|+.++..||+++-+..++.++..-+..++...+ .-|---+.+|+..+|...|+.++ T Consensus 218 A~~~W~~~L~~~p~~~~~~~~~~~yL~~~P~d~~~~~~~~~l~~q~~~~~DP~~~Ar~~Gl~aL~~G~l~aAe~~l~~AL 297 (1158) T PRK11447 218 AAQIWYQQIKDMPVSDASVKALQAFLTVFSEGDSVAAARSQLAEQQKQLADPAFRARQQGLAAVDSGMGGKAIAELQQAV 297 (1158) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99999999845799866699999999869996699999999998752227804789998999997799788999999999 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC----CHHHHHHH Q ss_conf 528998115999999999999749879999999999785699----97899999 Q gi|254781174|r 218 ANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ----GYWARYVE 267 (271) Q Consensus 218 ~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~----s~~~~~a~ 267 (271) ...|+.+ |++--||.+|.+.|..++|..+++.-...=|+ ++|...++ T Consensus 298 ~~~PnD~---daLGgLG~v~lRqG~~~eA~~yf~rA~~~d~~~~~~~kW~sll~ 348 (1158) T PRK11447 298 RANPKDS---EALGALGQAYSQRGDRARAVAQFEKALALDPHSSNRDKWESLLK 348 (1158) T ss_pred HHCCCCH---HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 7199987---89988899999658889999999999970899854688999998 No 26 >PRK11788 hypothetical protein; Provisional Probab=99.17 E-value=2.9e-08 Score=66.31 Aligned_cols=197 Identities=11% Similarity=0.055 Sum_probs=131.8 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 73689999999999980989999999999985304773026899877887765455567999998875401121101355 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) ..-...+++.|..|+..|-++.|...|.++... +++...|+-.+...|...++|++|+...++..+.-..+....-| T Consensus 104 ~~r~~a~~ELa~Dy~~aGllDRAE~~f~~l~~~---~~~~~~al~~Ll~iye~~~dW~~Ai~~a~~L~~~~~~~~~~~iA 180 (389) T PRK11788 104 EQRLLALQELGQDYLKAGLLDRAEELFLQLVDE---DDFAEGALRQLLTIYQQEKDWQKAIEVAERLEKLGGDSLRKEIA 180 (389) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC---CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 999999999999998677499999999999828---54269999999999998645999999999998704765069999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH-----------HHHHHH-HHH------H Q ss_conf 5443444555544311123458999999999998520010134565555555-----------666666-655------5 Q gi|254781174|r 134 YYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVG-----------RNQLAA-KEV------E 195 (271) Q Consensus 134 ~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~-----------~~~La~-~e~------~ 195 (271) +|..-++-. ....+...+|.+.+++-++-.|++..+.-..-.+..- +..+.. .++ . T Consensus 181 hy~CElA~~------~~~~~~~~~A~~~L~kAL~~dp~cvRA~lllg~l~~~~g~~~~Ai~~l~~i~~q~p~~l~~vl~~ 254 (389) T PRK11788 181 HFYCELAQQ------ALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQDPEYLPEVLPK 254 (389) T ss_pred HHHHHHHHH------HHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHH T ss_conf 999999999------98748999999999999874824341899999999987999999999999997395779999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHH Q ss_conf 66666544322345689998865289981159999999999997498799999999997856999789 Q gi|254781174|r 196 IGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWA 263 (271) Q Consensus 196 ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~ 263 (271) +-..|...++-......++.+++.+|.+. .+..+.+.....|..+.|..+......+.|+-... T Consensus 255 L~~~y~~l~~~~~~~~~L~~~~~~~~~~~----l~l~l~~~l~~~~g~~~A~~~l~~~L~~~Psl~g~ 318 (389) T PRK11788 255 LMECYQALGDEAEGLEFLRRAVEEYPGAD----LLLALAQLLEEREGPEAAQALLREQLRRHPSLRGF 318 (389) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCH----HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHH T ss_conf 99999982888899999999998589935----99999999997699999999999997529988999 No 27 >KOG2002 consensus Probab=99.17 E-value=1.7e-08 Score=67.58 Aligned_cols=194 Identities=12% Similarity=0.127 Sum_probs=96.7 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 68999999999998098999999999998530477302689987788776545556799999887540112110135554 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) .|..+--.|..+|.+|+|..+....+-.+..--.+..-.++.|++|.+|+.+|+|++|-.+|..-+...|++ -..+++ T Consensus 269 nP~~l~~LAn~fyfK~dy~~v~~la~~ai~~t~~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~--~~l~~~ 346 (1018) T KOG2002 269 NPVALNHLANHFYFKKDYERVWHLAEHAIKNTENKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDN--FVLPLV 346 (1018) T ss_pred CCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC--CCCCCC T ss_conf 957999997777413537899999999998610147777889999999886226899999999997248887--620012 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---------HHHHHH--HHHHHH-------------------- Q ss_conf 4344455554431112345899999999999852001---------013456--555555-------------------- Q gi|254781174|r 136 LVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNS---------PYVKGA--RFYVTV-------------------- 184 (271) Q Consensus 136 ~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S---------~ya~~A--~~~l~~-------------------- 184 (271) .+|..+... +.+..|+..|+.++..+|+- -|+..+ +..... T Consensus 347 GlgQm~i~~--------~dle~s~~~fEkv~k~~p~~~etm~iLG~Lya~~~~~~~~~d~a~~~l~K~~~~~~~d~~a~l 418 (1018) T KOG2002 347 GLGQMYIKR--------GDLEESKFCFEKVLKQLPNNYETMKILGCLYAHSAKKQEKRDKASNVLGKVLEQTPVDSEAWL 418 (1018) T ss_pred CHHHHHHHH--------CHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 125789871--------327788999999998676508899998867886401257789999999887750534088899 Q ss_pred --------------------HHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHCC---CCHH----HHHH Q ss_conf --------------------566666665--------5566666544322345689998865289---9811----5999 Q gi|254781174|r 185 --------------------GRNQLAAKE--------VEIGRYYLKRGEYVAAIPRFQLVLANYS---DAEH----AEEA 229 (271) Q Consensus 185 --------------------~~~~La~~e--------~~ia~~Y~~~~~y~aA~~~~~~~i~~yp---~t~~----~~eA 229 (271) +.+.|..+. -.+|-+++..|.+..|...|...+..-- ++.. .... T Consensus 419 ~laql~e~~d~~~sL~~~~~A~d~L~~~~~~ip~E~LNNvaslhf~~g~~~~A~~~f~~A~~~~~~~~n~de~~~~~lt~ 498 (1018) T KOG2002 419 ELAQLLEQTDPWASLDAYGNALDILESKGKQIPPEVLNNVASLHFRLGNIEKALEHFKSALGKLLEVANKDEGKSTNLTL 498 (1018) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHH T ss_conf 99999981276889999998999999727889889997588999985076878988998864301220710023510689 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 999999999749879999999999785699 Q gi|254781174|r 230 MARLVEAYVALALMDEAREVVSLIQERYPQ 259 (271) Q Consensus 230 l~~l~~~y~~lg~~d~A~~~~~~l~~~yP~ 259 (271) .|.++.+...++..+.|.+.++.+...+|+ T Consensus 499 ~YNlarl~E~l~~~~~A~e~Yk~Ilkehp~ 528 (1018) T KOG2002 499 KYNLARLLEELHDTEVAEEMYKSILKEHPG 528 (1018) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 988999987430006799999999987840 No 28 >KOG1126 consensus Probab=99.16 E-value=7.9e-09 Score=69.56 Aligned_cols=56 Identities=20% Similarity=0.160 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 55666665443223456899988652899811599999999999974987999999999 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) |..|..++..++|..|..-++++.+--|+-. -+++.+|.+|-++|..+.|..-+.+ T Consensus 561 ~~~~~il~~~~~~~eal~~LEeLk~~vP~es---~v~~llgki~k~~~~~~~Al~~f~~ 616 (638) T KOG1126 561 YHRASILFSLGRYVEALQELEELKELVPQES---SVFALLGKIYKRLGNTDLALLHFSW 616 (638) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCHH---HHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 8877888762236999999999998575057---8999999999997330588876677 No 29 >KOG1126 consensus Probab=99.09 E-value=6.1e-09 Score=70.24 Aligned_cols=189 Identities=15% Similarity=0.100 Sum_probs=106.5 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-------------- Q ss_conf 999999980989999999999985304773026899877887765455567999998875401121-------------- Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPES-------------- 127 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s-------------- 127 (271) .+|....-.-+..+|+..|+++-..++++.|. +..+|.+||.+.+|++|...|+..-+..|-- T Consensus 324 ~~~~~~~s~y~~~~A~~~~~klp~h~~nt~wv---l~q~GrayFEl~~Y~~a~~~F~~~r~~~p~rv~~meiyST~LWHL 400 (638) T KOG1126 324 GEGYRSLSQYNCREALNLFEKLPSHHYNTGWV---LSQLGRAYFELIEYDQAERIFSLVRRIEPYRVKGMEIYSTTLWHL 400 (638) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 99999988887999999998647753784479---999889999898789999999999861454113246788888887 Q ss_pred -----------------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH---------- Q ss_conf -----------------10135554434445555443111234589999999999985200101345655---------- Q gi|254781174|r 128 -----------------KNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARF---------- 180 (271) Q Consensus 128 -----------------~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~---------- 180 (271) ++.|+.+=-.|.||-. |+..+.|++.|++.++.-|+-.|+---.- T Consensus 401 q~~v~Ls~Laq~Li~~~~~sPesWca~GNcfSL--------Qkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d 472 (638) T KOG1126 401 QDEVALSYLAQDLIDTDPNSPESWCALGNCFSL--------QKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFD 472 (638) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHCCHHHH--------HHHHHHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHHHHH T ss_conf 756778899898875087870778774025666--------567999999887765468763212353077345457777 Q ss_pred -HHHHHHHHH---HHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf -555556666---666---5556666654432234568999886528998115999999999999749879999999999 Q gi|254781174|r 181 -YVTVGRNQL---AAK---EVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLI 253 (271) Q Consensus 181 -~l~~~~~~L---a~~---e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l 253 (271) -+..-+..| .+| =+.+|..|++++++.-|.-.|+.+++--|.+. ..+-.+|.++.++|..|+|.+.+..- T Consensus 473 ~a~~~fr~Al~~~~rhYnAwYGlG~vy~Kqek~e~Ae~~fqkA~~INP~ns---vi~~~~g~~~~~~k~~d~AL~~~~~A 549 (638) T KOG1126 473 KAMKSFRKALGVDPRHYNAWYGLGTVYLKQEKLEFAEFHFQKAVEINPSNS---VILCHIGRIQHQLKRKDKALQLYEKA 549 (638) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHEECCCHHHHHHHHHHHHHCCCCCCH---HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 677877765157805667888620302204104678998876422796103---57765259999860132799999998 Q ss_pred HHHCCCCHHHH Q ss_conf 78569997899 Q gi|254781174|r 254 QERYPQGYWAR 264 (271) Q Consensus 254 ~~~yP~s~~~~ 264 (271) ..-=|.+...+ T Consensus 550 ~~ld~kn~l~~ 560 (638) T KOG1126 550 IHLDPKNPLCK 560 (638) T ss_pred HHCCCCCCHHH T ss_conf 81687881567 No 30 >KOG0624 consensus Probab=99.08 E-value=8.8e-08 Score=63.46 Aligned_cols=188 Identities=12% Similarity=0.086 Sum_probs=146.3 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) .+.-++.|...+.+|++.+|+..|-..+..-|.+ =+|.|..|-+|..+|+-..|+..+++.+++-|+-- -|.-. T Consensus 38 vekhlElGk~lla~~Q~sDALt~yHaAve~dp~~---Y~aifrRaT~yLAmGksk~al~Dl~rVlelKpDF~---~ARiQ 111 (504) T KOG0624 38 VEKHLELGKELLARGQLSDALTHYHAAVEGDPNN---YQAIFRRATVYLAMGKSKAALQDLSRVLELKPDFM---AARIQ 111 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCHH---HHHHH T ss_conf 9999998899997346778999988887078066---99999877777652577400330899986183188---88887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH------------------------ Q ss_conf 34445555443111234589999999999985200101345655555556666666------------------------ Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAK------------------------ 192 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~------------------------ 192 (271) +|....+ ++....|...|..+++.-|+..-+.+++.++..+.+..--. T Consensus 112 Rg~vllK--------~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~ql~s~~~~GD~~~ai~~i~~llE 183 (504) T KOG0624 112 RGVVLLK--------QGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQQLKSASGSGDCQNAIEMITHLLE 183 (504) T ss_pred HCHHHHH--------CCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 2203553--------25288898799999725987302688998877689999999999988438746559999988885 Q ss_pred ------H--HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH Q ss_conf ------5--55666665443223456899988652899811599999999999974987999999999978569997 Q gi|254781174|r 193 ------E--VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGY 261 (271) Q Consensus 193 ------e--~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~ 261 (271) + -..++.|...++...||.-++.+-..-.|+. +++|.+.+.++.+|+.+.+......-.+-=|+.+ T Consensus 184 i~~Wda~l~~~Rakc~i~~~e~k~AI~Dlk~askLs~DnT---e~~ykis~L~Y~vgd~~~sL~~iRECLKldpdHK 257 (504) T KOG0624 184 IQPWDASLRQARAKCYIAEGEPKKAIHDLKQASKLSQDNT---EGHYKISQLLYTVGDAENSLKEIRECLKLDPDHK 257 (504) T ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 3713368999998888853868888888999874046653---8889899899766118878999999980695333 No 31 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=99.08 E-value=1.6e-09 Score=73.69 Aligned_cols=99 Identities=21% Similarity=0.327 Sum_probs=83.2 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999999998098999999999998530477302689987788776545556799999887540112110135554434 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVG 138 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a 138 (271) .+|..|..+++.|++++|+..|++++...|.++ .+.+.+|.++...+++++|+..+++.++..|+++ .+++.+| T Consensus 2 ~~~~lg~~~~~~g~~~~A~~~~~~al~~~p~~~---~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~---~~~~~la 75 (100) T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNA---DAYYNLAAAYYKLGKYEEALEDYEKALELDPDNA---KAYYNLG 75 (100) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH---HHHHHHH T ss_conf 899999999984999999999999998599989---9999999999985985899999999998698989---9999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 445555443111234589999999999985200 Q gi|254781174|r 139 MSYAQMIRDVPYDQRATKLMLQYMSRIVERYTN 171 (271) Q Consensus 139 ~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~ 171 (271) .++..+ +...+|+..|+..++.-|+ T Consensus 76 ~~~~~~--------~~~~~A~~~~~~al~~~P~ 100 (100) T cd00189 76 LAYYKL--------GKYEEALEAYEKALELDPN 100 (100) T ss_pred HHHHHC--------CCHHHHHHHHHHHHHHCCC T ss_conf 999994--------9999999999999974908 No 32 >KOG0547 consensus Probab=99.04 E-value=1.9e-08 Score=67.36 Aligned_cols=160 Identities=16% Similarity=0.170 Sum_probs=81.7 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +...|.+|.-.|--++|++|+..|++.++.-|.+-+ ++..++.+.|++++.+++...|++.+++||+.+.++ T Consensus 394 ~dvYyHRgQm~flL~q~e~A~aDF~Kai~L~pe~~~---~~iQl~~a~Yr~~k~~~~m~~Fee~kkkFP~~~Evy----- 465 (606) T KOG0547 394 PDVYYHRGQMRFLLQQYEEAIADFQKAISLDPENAY---AYIQLCCALYRQHKIAESMKTFEEAKKKFPNCPEVY----- 465 (606) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH----- T ss_conf 832576778999999899999879987605825337---999999999998789999999999998688982377----- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 344455554431112345899999999999852001--------013456555555566666665556666654432234 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNS--------PYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVA 208 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S--------~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~a 208 (271) .....+-.||+...+|++.|...|..-|.- +++..| +.+.+ =+++... T Consensus 466 ------~~fAeiLtDqqqFd~A~k~YD~ai~LE~~~~~~~v~~~plV~Ka---------------~l~~q---wk~d~~~ 521 (606) T KOG0547 466 ------NLFAEILTDQQQFDKAVKQYDKAIELEPREHLIIVNAAPLVHKA---------------LLVLQ---WKEDINQ 521 (606) T ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHH---------------HHHHC---HHHHHHH T ss_conf ------89999983177677899988888850642011324605443046---------------76435---1654899 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 5689998865289981159999999999997498799999999 Q gi|254781174|r 209 AIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVS 251 (271) Q Consensus 209 A~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~ 251 (271) |++-++..++--|.+. .|.--|+..-.+.|..++|.+.+. T Consensus 522 a~~Ll~KA~e~Dpkce---~A~~tlaq~~lQ~~~i~eAielFE 561 (606) T KOG0547 522 AENLLRKAIELDPKCE---QAYETLAQFELQRGKIDEAIELFE 561 (606) T ss_pred HHHHHHHHHCCCCHHH---HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999872594478---999999899998865999999999 No 33 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=99.03 E-value=1.5e-08 Score=67.90 Aligned_cols=145 Identities=14% Similarity=0.108 Sum_probs=101.2 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99809899999999999853047730268998778877654555679999988754011211013555443444555544 Q gi|254781174|r 67 FLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIR 146 (271) Q Consensus 67 ~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~ 146 (271) -|-++.+..+++.+.+++..-|....| -+.+..|.++...|+|.+|+..|++.++.-|+..+..+++|.+|++|..+ T Consensus 9 nfidk~ftv~ad~~~k~~p~~~~~k~A-~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le~dp~D~s~ilyNiGlIY~~~-- 85 (172) T PRK02603 9 NFIDKSFTVMADLILKILPINKKAKEA-FVYYRDGMSAQNDGDYAEALENYEEALKLEEDPNDRSYILYNMAIIYASN-- 85 (172) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCHHHH-HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHC-- T ss_conf 114424899999999981588633689-99997225777457999999999998727789620899997799999986-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 31112345899999999999852001013456555555566666665556666654432234568999886528998 Q gi|254781174|r 147 DVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDA 223 (271) Q Consensus 147 ~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t 223 (271) +...+|++.|.+.|..-|+.+ +|...+..+.....+...+.|+.=.....++.|...++..|+--|+. T Consensus 86 ------G~~~kAie~Y~kALeiNP~~~---~AlNNlgvIY~k~G~~a~~~~~~~~~~~~fdkAa~yw~kAI~l~P~n 153 (172) T PRK02603 86 ------GEHDKALETYHQALELNPKQP---SALNNIAVIYHKRGRKAEEAGDQDEAEAWFDKAAEYWKQAIRLAPNN 153 (172) T ss_pred ------CCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf ------998899999999998689978---99999999999985999984587789999999999999988509987 No 34 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=98.98 E-value=2.1e-08 Score=67.10 Aligned_cols=141 Identities=14% Similarity=0.149 Sum_probs=118.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Q ss_conf 302689987788776545556799999887540112110135554434445555443111234--589999999999985 Q gi|254781174|r 91 GVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQR--ATKLMLQYMSRIVER 168 (271) Q Consensus 91 ~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~--~~~~A~~~f~~~i~~ 168 (271) .-+.++...||-.|+..|++..|...+++.|+.-|+... |+-++|..|.. .+ ...+|-..|++-++. T Consensus 34 ~~~a~~r~qLal~Yl~~G~~~~Ak~~l~~AL~~dp~~~~---a~~~lA~~Yq~--------~grP~~~~A~~~f~~Al~~ 102 (247) T TIGR02521 34 EKAAKIRVQLALGYLEQGDLEKAKENLDKALEHDPDDYD---AYLALALYYQQ--------LGRPELEKAEQSFRKALTL 102 (247) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHH--------HCCCCHHHHHHHHHHHHHC T ss_conf 999999999999998607878999999999873766612---89999999987--------1799557899999998624 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 200101345655555556666666555666665443--223456899988652899811599999999999974987999 Q gi|254781174|r 169 YTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRG--EYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEA 246 (271) Q Consensus 169 yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~--~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A 246 (271) -|++.-+ .. .=|.|..++| +|+.|..+|...++. |.-+....++-.+|-|..+.|+.++| T Consensus 103 ~p~~g~~---~N--------------NYG~FLC~~g~~~~~~A~~~F~~A~~~-p~Y~~~~~~~eN~G~C~~~~g~~~~A 164 (247) T TIGR02521 103 KPNNGDV---LN--------------NYGTFLCQQGNDKYEQAEQQFEAAIED-PLYPQPARSYENAGLCALKAGDTEKA 164 (247) T ss_pred CCCCCCE---EC--------------HHHHHHHHCCCCCHHHHHHHHHHHHHC-CCCCCCHHHHHHHHHHHHHCCCHHHH T ss_conf 8998523---20--------------145776404897068999999999866-88973302455499999861786889 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999997856999 Q gi|254781174|r 247 REVVSLIQERYPQG 260 (271) Q Consensus 247 ~~~~~~l~~~yP~s 260 (271) +++++....-=|+. T Consensus 165 ~~~l~~Al~~dp~~ 178 (247) T TIGR02521 165 EKYLTRALKIDPQR 178 (247) T ss_pred HHHHHHHHHHCCCC T ss_conf 99999999858788 No 35 >KOG0547 consensus Probab=98.97 E-value=7.9e-08 Score=63.74 Aligned_cols=76 Identities=13% Similarity=0.137 Sum_probs=39.3 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 99999999980989999999999985304773026899877887765455567999998875401121101355544344 Q gi|254781174|r 60 VYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGM 139 (271) Q Consensus 60 lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~ 139 (271) +=.+|..+|.+|+|++||+.|...++..|.-+ --.=+.+-||-..|+|.+.+.--.+.++..|++.. |+++++. T Consensus 118 lK~~GN~~f~~kkY~eAIkyY~~AI~l~p~ep---iFYsNraAcY~~lgd~~~Vied~TkALEl~P~Y~K---Al~RRA~ 191 (606) T KOG0547 118 LKTKGNKFFRNKKYDEAIKYYTQAIELCPDEP---IFYSNRAACYESLGDWEKVIEDCTKALELNPDYVK---ALLRRAS 191 (606) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH---HHHHHHH T ss_conf 87530455302538889999999985089976---13310899999985099999987877624908899---9999878 Q ss_pred HH Q ss_conf 45 Q gi|254781174|r 140 SY 141 (271) Q Consensus 140 ~~ 141 (271) ++ T Consensus 192 A~ 193 (606) T KOG0547 192 AH 193 (606) T ss_pred HH T ss_conf 87 No 36 >KOG1840 consensus Probab=98.97 E-value=1.5e-07 Score=62.17 Aligned_cols=178 Identities=20% Similarity=0.202 Sum_probs=87.8 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHH----C-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-----CCCCHHH Q ss_conf 9999999809899999999999853----0-4773026899877887765455567999998875401-----1211013 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRD----F-PFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQY-----PESKNVD 131 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~----~-P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~-----P~s~~~~ 131 (271) ..|..+...|+|+.|+..++..+.. + -..+.....+-.+|..|-.+++|.+|+..|++.+... ++||.+. T Consensus 204 ~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va 283 (508) T KOG1840 204 NLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVA 283 (508) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 99999998520778999999999999870376668999999999999997442899999999999999985389878999 Q ss_pred HHHHHHHHHHHHHHH----------------------------------HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 555443444555544----------------------------------3111234589999999999985200101345 Q gi|254781174|r 132 YVYYLVGMSYAQMIR----------------------------------DVPYDQRATKLMLQYMSRIVERYTNSPYVKG 177 (271) Q Consensus 132 ~A~y~~a~~~~~~~~----------------------------------~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~ 177 (271) -++..+|..|+++.+ .+.-.++.+.+|...++..+..+-+. T Consensus 284 ~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~~~~~~~~~~~Eea~~l~q~al~i~~~~----- 358 (508) T KOG1840 284 ATLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSELAAILQSMNEYEEAKKLLQKALKIYLDA----- 358 (508) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH----- T ss_conf 999889999860477577899999999999985036858999999899999988520568999999999999861----- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-----CCHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 65555555666666655566666544322345689998865289-----981159999999999997498799999 Q gi|254781174|r 178 ARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYS-----DAEHAEEAMARLVEAYVALALMDEARE 248 (271) Q Consensus 178 A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp-----~t~~~~eAl~~l~~~y~~lg~~d~A~~ 248 (271) ...-+..+|...-.+|..|+..|+|..|...++.+|..-. .+......+-.|+..|.+++.+++|.+ T Consensus 359 ----~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~~~~~~~~~~~~~~~l~~la~~~~~~k~~~~a~~ 430 (508) T KOG1840 359 ----PGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQILRELLGKKDYGVGKPLNQLAEAYEELKKYEEAEQ 430 (508) T ss_pred ----CCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf ----464105789899899999999446567999999999998861567671454789999999998346022899 No 37 >KOG1840 consensus Probab=98.93 E-value=5.6e-07 Score=58.76 Aligned_cols=176 Identities=15% Similarity=0.132 Sum_probs=104.4 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHH-----CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----CCHHH Q ss_conf 9999999809899999999999853-----0477302689987788776545556799999887540112-----11013 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRD-----FPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPE-----SKNVD 131 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~-----~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~-----s~~~~ 131 (271) ..|..+...++|.+|+..|++.... .+.++-..-.+.++|.+|+..|++.+|...+++.++.+-. ++.+. T Consensus 246 ~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~ 325 (508) T KOG1840 246 ILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVA 325 (508) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 99999997442899999999999999985389878999999889999860477577899999999999985036858999 Q ss_pred HHHHHHHHHHHHHHHHH-----------------------------------HHHHHHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 55544344455554431-----------------------------------1123458999999999998520010134 Q gi|254781174|r 132 YVYYLVGMSYAQMIRDV-----------------------------------PYDQRATKLMLQYMSRIVERYTNSPYVK 176 (271) Q Consensus 132 ~A~y~~a~~~~~~~~~~-----------------------------------~~d~~~~~~A~~~f~~~i~~yP~S~ya~ 176 (271) -.+-..+..+..+ ... .+-++...+|.+.++..+..- T Consensus 326 ~~l~~~~~~~~~~-~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~------- 397 (508) T KOG1840 326 AQLSELAAILQSM-NEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQIL------- 397 (508) T ss_pred HHHHHHHHHHHHH-CCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH------- T ss_conf 9998999999885-20568999999999999861464105789899899999999446567999999999998------- Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHH----HCCCCHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 56555555566666665-------5566666544322345689998865----289981159999999999997498799 Q gi|254781174|r 177 GARFYVTVGRNQLAAKE-------VEIGRYYLKRGEYVAAIPRFQLVLA----NYSDAEHAEEAMARLVEAYVALALMDE 245 (271) Q Consensus 177 ~A~~~l~~~~~~La~~e-------~~ia~~Y~~~~~y~aA~~~~~~~i~----~yp~t~~~~eAl~~l~~~y~~lg~~d~ 245 (271) ++....+. -.+|.-|.+.+.|..|-.-|.+.+. -=|+++-+.-.+..|+.+|..+|..+. T Consensus 398 ---------~~~~~~~~~~~~~~l~~la~~~~~~k~~~~a~~l~~~~~~i~~~~g~~~~~~~~~~~nL~~~Y~~~g~~e~ 468 (508) T KOG1840 398 ---------RELLGKKDYGVGKPLNQLAEAYEELKKYEEAEQLFEEAKDIMKLCGPDHPDVTYTYLNLAALYRAQGNYEA 468 (508) T ss_pred ---------HHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHH T ss_conf ---------86156767145478999999999834602289999999889998588997468999889999997356888 Q ss_pred HHHHHHHHH Q ss_conf 999999997 Q gi|254781174|r 246 AREVVSLIQ 254 (271) Q Consensus 246 A~~~~~~l~ 254 (271) |.++...+. T Consensus 469 a~~~~~~~~ 477 (508) T KOG1840 469 AEELEEKVL 477 (508) T ss_pred HHHHHHHHH T ss_conf 999999998 No 38 >KOG4626 consensus Probab=98.92 E-value=2e-07 Score=61.37 Aligned_cols=179 Identities=16% Similarity=0.103 Sum_probs=110.2 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 99999999980989999999999985304773026899877887765455567999998875401121101355544344 Q gi|254781174|r 60 VYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGM 139 (271) Q Consensus 60 lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~ 139 (271) +=+.|-.....|+-..|+..|++.+..-|. -.+|.+++|..|-..+.|+.|+..|.+.+..-|+|.- |+=.+|. T Consensus 221 wsnLg~~f~~~Gei~~aiq~y~eAvkldP~---f~dAYiNLGnV~ke~~~~d~Avs~Y~rAl~lrpn~A~---a~gNla~ 294 (966) T KOG4626 221 WSNLGCVFNAQGEIWLAIQHYEEAVKLDPN---FLDAYINLGNVYKEARIFDRAVSCYLRALNLRPNHAV---AHGNLAC 294 (966) T ss_pred EHHCCHHHHHCCHHHHHHHHHHHHHCCCCC---CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCHH---HCCCEEE T ss_conf 120004776043699999999876537976---5277865889999872203789999999742973031---2043479 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHH Q ss_conf 4555544311123458999999999998520010134565555555--------------------66666665556666 Q gi|254781174|r 140 SYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVG--------------------RNQLAAKEVEIGRY 199 (271) Q Consensus 140 ~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~--------------------~~~La~~e~~ia~~ 199 (271) .||+ |+..+-||..+++.+..-|+-+ +|...+... ...-|..-..+|.. T Consensus 295 iYye--------qG~ldlAI~~Ykral~~~P~F~---~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~hadam~NLgni 363 (966) T KOG4626 295 IYYE--------QGLLDLAIDTYKRALELQPNFP---DAYNNLANALKDKGSVTEAVDCYNKALRLCPNHADAMNNLGNI 363 (966) T ss_pred EEEC--------CCCHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 9731--------4427888999999870399956---7775799997750546799999999987587637788879899 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 65443223456899988652899811599999999999974987999999999978569 Q gi|254781174|r 200 YLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYP 258 (271) Q Consensus 200 Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP 258 (271) |-+.++...|+.-|...++-+|+.. .|.-.|+.+|.+.|..++|..-++....--| T Consensus 364 ~~E~~~~e~A~~ly~~al~v~p~~a---aa~nNLa~i~kqqgnl~~Ai~~YkealrI~P 419 (966) T KOG4626 364 YREQGKIEEATRLYLKALEVFPEFA---AAHNNLASIYKQQGNLDDAIMCYKEALRIKP 419 (966) T ss_pred HHHHCCCHHHHHHHHHHHHHCHHHH---HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 9886122188999999986380245---5642299999960547789999998871494 No 39 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=98.89 E-value=2.9e-07 Score=60.42 Aligned_cols=150 Identities=11% Similarity=0.004 Sum_probs=65.5 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +...|-.|...+..|+++.|...+..++...|+++|..++ .++++...++..+|+..+++.+...|+++. ..+. T Consensus 306 ~aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~---~~~i~~~~nk~~~A~e~~~kal~l~P~~~~---l~~~ 379 (484) T COG4783 306 LAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLEL---AGDILLEANKAKEAIERLKKALALDPNSPL---LQLN 379 (484) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHH---HHHHHHHCCCHHHHHHHHHHHHHCCCCCCH---HHHH T ss_conf 3778889999998327015789999999848998799999---999998758867799999999840898507---9999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 34445555443111234589999999999985200101345655555556666666555666665443223456899988 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLV 216 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~ 216 (271) .|..+.+. +...+|+..+++.....|+.+-.-+-..+--.....-++-+...|..|+-.|++.+|+..+... T Consensus 380 ~a~all~~--------g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~~g~~~~a~~A~AE~~~~~G~~~~A~~~l~~A 451 (484) T COG4783 380 LAQALLKG--------GKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARAEGYALAGRLEQAIIFLMRA 451 (484) T ss_pred HHHHHHHC--------CCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999956--------9978999999998644877736999999999984761779999999998678878999999999 Q ss_pred HHHC Q ss_conf 6528 Q gi|254781174|r 217 LANY 220 (271) Q Consensus 217 i~~y 220 (271) .+.. T Consensus 452 ~~~~ 455 (484) T COG4783 452 SQQV 455 (484) T ss_pred HHHC T ss_conf 9863 No 40 >KOG1125 consensus Probab=98.88 E-value=3.8e-07 Score=59.78 Aligned_cols=180 Identities=14% Similarity=0.131 Sum_probs=125.6 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999999998098999999999998530477302689987788776545556799999887540112110135554434 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVG 138 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a 138 (271) .-|..|..++++|+..+|+-.|+..+.+-|.+ .+|=.++|.++...++-..||.++.+-++.-|+.-. |+.-+| T Consensus 287 dPf~eG~~lm~nG~L~~A~LafEAAVkqdP~h---aeAW~~LG~~qaENE~E~~ai~AL~rcl~LdP~Nle---aLmaLA 360 (579) T KOG1125 287 DPFKEGCNLMKNGDLSEAALAFEAAVKQDPQH---AEAWQKLGITQAENENEQNAISALRRCLELDPTNLE---ALMALA 360 (579) T ss_pred CHHHHHHHHHHCCCCHHHHHHHHHHHHHCHHH---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH---HHHHHH T ss_conf 74899999986389059999999998619287---899999615765034208899999999846995199---999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH-----------------HHHHH-----HHHHH----HHH Q ss_conf 44555544311123458999999999998520010134565-----------------55555-----56666----666 Q gi|254781174|r 139 MSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGAR-----------------FYVTV-----GRNQL----AAK 192 (271) Q Consensus 139 ~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~-----------------~~l~~-----~~~~L----a~~ 192 (271) .||-++ +.-..|+..+...|...|.-.+...+. ..|.. .+..- ++- T Consensus 361 VSytNe--------g~q~~Al~~L~~Wi~~~p~y~~l~~a~~~~~~~~~~s~~~~~~l~~i~~~fLeaa~~~~~~~Dpdv 432 (579) T KOG1125 361 VSYTNE--------GLQNQALKMLDKWIRNKPKYVHLVSAGENEDFENTKSFLDSSHLAHIQELFLEAARQLPTKIDPDV 432 (579) T ss_pred HHHHHH--------HHHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 888614--------668889999999998486211114667665666776778878999999999999985778888427 Q ss_pred HHHHHHHHHHHHHH----------------------------------HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 55566666544322----------------------------------34568999886528998115999999999999 Q gi|254781174|r 193 EVEIGRYYLKRGEY----------------------------------VAAIPRFQLVLANYSDAEHAEEAMARLVEAYV 238 (271) Q Consensus 193 e~~ia~~Y~~~~~y----------------------------------~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~ 238 (271) ...+|-+|+-.++| ..||..|+..++..|+.. .+.|.||.++. T Consensus 433 Q~~LGVLy~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yV---R~RyNlgIS~m 509 (579) T KOG1125 433 QSGLGVLYNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYV---RVRYNLGISCM 509 (579) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEE---EEEHHHHHHHH T ss_conf 762678974014788899999999704984177898866886077553999999999984188726---53010145662 Q ss_pred HCCCHHHHHHHHHHHHH Q ss_conf 74987999999999978 Q gi|254781174|r 239 ALALMDEAREVVSLIQE 255 (271) Q Consensus 239 ~lg~~d~A~~~~~~l~~ 255 (271) .+|.+.+|.+.+-..+. T Consensus 510 NlG~ykEA~~hlL~AL~ 526 (579) T KOG1125 510 NLGAYKEAVKHLLEALS 526 (579) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 05669999999999998 No 41 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=98.86 E-value=1.8e-08 Score=67.42 Aligned_cols=99 Identities=21% Similarity=0.330 Sum_probs=75.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 99877887765455567999998875401121101355544344455554431112345899999999999852001013 Q gi|254781174|r 96 SLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYV 175 (271) Q Consensus 96 A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya 175 (271) +.+.+|.+++.+|++++|+..|++.++..|+++ .+++.+|.++..+ +...+|+..|++.++..|+++ T Consensus 2 ~~~~lg~~~~~~g~~~~A~~~~~~al~~~p~~~---~~~~~l~~~~~~~--------~~~~~A~~~~~~~~~~~~~~~-- 68 (100) T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNA---DAYYNLAAAYYKL--------GKYEEALEDYEKALELDPDNA-- 68 (100) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHC--------CCHHHHHHHHHHHHHHCCCCH-- T ss_conf 899999999984999999999999998599989---9999999999985--------985899999999998698989-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 45655555556666666555666665443223456899988652899 Q gi|254781174|r 176 KGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSD 222 (271) Q Consensus 176 ~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~ 222 (271) ++. +.+|..|...+++..|+..|+.+++--|+ T Consensus 69 -~~~--------------~~la~~~~~~~~~~~A~~~~~~al~~~P~ 100 (100) T cd00189 69 -KAY--------------YNLGLAYYKLGKYEEALEAYEKALELDPN 100 (100) T ss_pred -HHH--------------HHHHHHHHHCCCHHHHHHHHHHHHHHCCC T ss_conf -999--------------99999999949999999999999974908 No 42 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=98.83 E-value=7.5e-08 Score=63.85 Aligned_cols=122 Identities=10% Similarity=0.019 Sum_probs=93.5 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 68999999999998098999999999998530477302689987788776545556799999887540112110135554 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) .+-..|..|+.....|+|.+|++.|.+.+..-+..---..+++++|.+|...|+++.|+..|.+.|+.+|+.|. |+. T Consensus 30 ~af~yyr~GmsaQs~GeyaEALe~Y~eAL~Le~D~~Drs~iLYNIGlIy~~nGe~~kALeyY~qALeiNP~~pq---A~N 106 (165) T CHL00033 30 EAFTYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQ---ALN 106 (165) T ss_pred HHHHHHHCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHH---HHH T ss_conf 89999981510101353899999999862345685126689989999999858888999999999985999689---997 Q ss_pred HHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 434445555443------11123458999999999998520010134565555 Q gi|254781174|r 136 LVGMSYAQMIRD------VPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYV 182 (271) Q Consensus 136 ~~a~~~~~~~~~------~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l 182 (271) .+|+.|-.+... .+......++|-+.+.+.|..-|++ | .+|+..+ T Consensus 107 NlGvIyh~qG~~a~~~~~~~~~~~~fd~aa~~w~~ai~lap~n-y-~eaqnwl 157 (165) T CHL00033 107 NMAVICHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAIALAPGN-Y-IEAQNWL 157 (165) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCC-C-HHHHHHH T ss_conf 8999999987999984586789999999999999987038997-0-9999999 No 43 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=98.83 E-value=1.1e-07 Score=63.01 Aligned_cols=116 Identities=14% Similarity=0.140 Sum_probs=97.1 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 73689999999999980989999999999985304773026899877887765455567999998875401121101355 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) ...-+.+|..|...++.|+|++|...|.-+.---|. ..+-.+-||-|+..+++|++|+..|..-....|++|. + T Consensus 15 ~~~L~~~Y~~AY~~y~~G~Y~~A~~~F~~L~~~~~~---~~~Y~~gLa~c~q~~~~Y~~A~~~Y~~a~~l~~~~P~---p 88 (137) T TIGR02552 15 SEQLEAIYALAYNLYQQGRYDEALKLFQLLAAYDPW---NADYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPR---P 88 (137) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCH---HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCC---C T ss_conf 989999999999998536679999999999997430---3689999999999861678999999999752768695---1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCHHHHHHHHHHH Q ss_conf 5443444555544311123458999999999998520-0101345655555 Q gi|254781174|r 134 YYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYT-NSPYVKGARFYVT 183 (271) Q Consensus 134 ~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP-~S~ya~~A~~~l~ 183 (271) +|..|.|+... +....|.+.|+..+..=+ +++-=.+-+.|.. T Consensus 89 ~~~~a~C~l~~--------g~~~~A~~aL~~a~~~~~~~~~~~~~l~~rA~ 131 (137) T TIGR02552 89 YYHAAECYLAL--------GEPESALKALDLAIEIAGGENPEYSELKERAE 131 (137) T ss_pred HHHHHHHHHHH--------CCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 26998999872--------89789999999999984889963178999999 No 44 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=98.82 E-value=7.3e-07 Score=58.11 Aligned_cols=183 Identities=16% Similarity=0.147 Sum_probs=125.4 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +.-+|+.|.-+-.-|-+.-|...|.+.....|.- +.+.-.+|.-+-..|+|+.|...|+..++.-|.. +||... T Consensus 65 A~l~fERGvlYDSlGL~~LAR~DftQaLai~P~m---~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y---~Ya~lN 138 (297) T COG4785 65 AQLLFERGVLYDSLGLRALARNDFSQALAIRPDM---PEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTY---NYAHLN 138 (297) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCC---HHHHHC T ss_conf 8998861203422407999860145666349675---8999899999986022057999866671047751---077760 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH------HHHHHHHHHHHHHH---------HHHHHHHH Q ss_conf 3444555544311123458999999999998520010134565------55555566666665---------55666665 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGAR------FYVTVGRNQLAAKE---------VEIGRYYL 201 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~------~~l~~~~~~La~~e---------~~ia~~Y~ 201 (271) +|+..|-- +...-|.+.|..+.+.-|+.||..-=. ..-...+..|.+.- ..+-.+|+ T Consensus 139 Rgi~~YY~--------gR~~LAq~d~~~fYQ~D~~DPfR~LWLYl~E~k~dP~~A~tnL~qR~~~~d~e~WG~~iV~~yL 210 (297) T COG4785 139 RGIALYYG--------GRYKLAQDDLLAFYQDDPNDPFRSLWLYLNEQKLDPKQAKTNLKQRAEKSDKEQWGWNIVEFYL 210 (297) T ss_pred CCEEEEEC--------CCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 44023444--------7515559999999844989807899999876107868999999999870567655187999997 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 443223456899988652899811---599999999999974987999999999978 Q gi|254781174|r 202 KRGEYVAAIPRFQLVLANYSDAEH---AEEAMARLVEAYVALALMDEAREVVSLIQE 255 (271) Q Consensus 202 ~~~~y~aA~~~~~~~i~~yp~t~~---~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~ 255 (271) .+-.-.++..+.+...++ ++.. .-|+.+.|++-|..+|+.++|...++.-.. T Consensus 211 gkiS~e~l~~~~~a~a~~--n~~~Ae~LTEtyFYL~K~~l~~G~~~~A~~LfKLaia 265 (297) T COG4785 211 GKISEETLMERLKADATD--NTSLAEHLTETYFYLGKYYLSLGDLDEATALFKLAVA 265 (297) T ss_pred HHCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 040399999999863121--2899999999999999998613117889999999998 No 45 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=98.80 E-value=1.5e-06 Score=56.27 Aligned_cols=126 Identities=21% Similarity=0.260 Sum_probs=97.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 02689987788776545556799999887540112110135554434445555443111234589999999999985200 Q gi|254781174|r 92 VARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTN 171 (271) Q Consensus 92 ~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~ 171 (271) ...-+++-.|..+|..++++.|...++.++...|+++... ....++....+...+|.+.++..+..+|+ T Consensus 304 ~~~aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~-----------~~~~~i~~~~nk~~~A~e~~~kal~l~P~ 372 (484) T COG4783 304 GGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYL-----------ELAGDILLEANKAKEAIERLKKALALDPN 372 (484) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH-----------HHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 6537788899999983270157899999998489987999-----------99999998758867799999999840898 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 10134565555555666666655566666544322345689998865289981159999999999997498799999 Q gi|254781174|r 172 SPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEARE 248 (271) Q Consensus 172 S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~ 248 (271) ++-.. +..|+.|++.|++..|+..++..+.+.|+.+ .-+.+|+++|-.+|+..+|.. T Consensus 373 ~~~l~-----------------~~~a~all~~g~~~eai~~L~~~~~~~p~dp---~~w~~LAqay~~~g~~~~a~~ 429 (484) T COG4783 373 SPLLQ-----------------LNLAQALLKGGKPQEAIRILNRYLFNDPEDP---NGWDLLAQAYAELGNRAEALL 429 (484) T ss_pred CCHHH-----------------HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHCCHHHHHH T ss_conf 50799-----------------9999999956997899999999864487773---699999999998476177999 No 46 >KOG1155 consensus Probab=98.78 E-value=7.3e-07 Score=58.09 Aligned_cols=59 Identities=24% Similarity=0.301 Sum_probs=37.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 56666654432234568999886528998115999999999999749879999999999785 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQER 256 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~ 256 (271) .+|++|.+.+....|++-|+.++.- .|+ --.++.+|+..|.++++.++|.+++..-+.. T Consensus 437 aLG~CY~kl~~~~eAiKCykrai~~-~dt--e~~~l~~LakLye~l~d~~eAa~~yek~v~~ 495 (559) T KOG1155 437 ALGECYEKLNRLEEAIKCYKRAILL-GDT--EGSALVRLAKLYEELKDLNEAAQYYEKYVEV 495 (559) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHC-CCC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999855689999999888852-464--3389999999999987479999999999999 No 47 >KOG3060 consensus Probab=98.77 E-value=1.9e-06 Score=55.64 Aligned_cols=166 Identities=19% Similarity=0.215 Sum_probs=126.9 Q ss_pred HHHHH-HHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999-99999809899999999999853047730268998778877654555679999988754011211013555443 Q gi|254781174|r 59 EVYEK-AVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLV 137 (271) Q Consensus 59 ~lY~~-a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~ 137 (271) .+|++ ....+..|..+.|..-+..+..+||+|.-... +-|--+-..|.|++|+..|++.++..|.+. |.+.+ T Consensus 53 ~l~EqV~IAAld~~~~~lAq~C~~~L~~~fp~S~RV~~---lkam~lEa~~~~~~A~e~y~~lL~ddpt~~----v~~KR 125 (289) T KOG3060 53 TLYEQVFIAALDTGRDDLAQKCINQLRDRFPGSKRVGK---LKAMLLEATGNYKEAIEYYESLLEDDPTDT----VIRKR 125 (289) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHH---HHHHHHHHHHCHHHHHHHHHHHHCCCCCHH----HHHHH T ss_conf 99999999999833137899999999986789706798---999999986135459999999844686314----89999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44455554431112345899999999999852001013456555555566666665556666654432234568999886 Q gi|254781174|r 138 GMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVL 217 (271) Q Consensus 138 a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i 217 (271) -++- -+.++...+||+....+++.||+.. +|= .+.++.|+..++|..|+=-+.+++ T Consensus 126 KlAi-------lka~GK~l~aIk~ln~YL~~F~~D~---EAW--------------~eLaeiY~~~~~f~kA~fClEE~l 181 (289) T KOG3060 126 KLAI-------LKAQGKNLEAIKELNEYLDKFMNDQ---EAW--------------HELAEIYLSEGDFEKAAFCLEELL 181 (289) T ss_pred HHHH-------HHHCCCCHHHHHHHHHHHHHHCCCH---HHH--------------HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999-------9971885799999999999835769---999--------------999999976768999999999999 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHCCCH---HHHHHHHHHHHHHCC Q ss_conf 52899811599999999999974987---999999999978569 Q gi|254781174|r 218 ANYSDAEHAEEAMARLVEAYVALALM---DEAREVVSLIQERYP 258 (271) Q Consensus 218 ~~yp~t~~~~eAl~~l~~~y~~lg~~---d~A~~~~~~l~~~yP 258 (271) -.-| +.+--.-|+++.++-+|.. ..|++++..-...-| T Consensus 182 l~~P---~n~l~f~rlae~~Yt~gg~eN~~~arkyy~~alkl~~ 222 (289) T KOG3060 182 LIQP---FNPLYFQRLAEVLYTQGGAENLELARKYYERALKLNP 222 (289) T ss_pred HCCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH T ss_conf 7598---7189999999999998418889999999999987274 No 48 >KOG2003 consensus Probab=98.73 E-value=6.7e-07 Score=58.32 Aligned_cols=182 Identities=13% Similarity=0.146 Sum_probs=96.8 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 36899999999999809899999999999853047730268998778877654555679999988754011211013555 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) -.+..+-++|...|.+|+++.|.+.|++...+-. --.+|+|++|.++-.+|+.++|+..|-+ +|.---+..+++ T Consensus 488 yn~~a~~nkgn~~f~ngd~dka~~~ykeal~nda---sc~ealfniglt~e~~~~ldeald~f~k---lh~il~nn~evl 561 (840) T KOG2003 488 YNAAALTNKGNIAFANGDLDKAAEFYKEALNNDA---SCTEALFNIGLTAEALGNLDEALDCFLK---LHAILLNNAEVL 561 (840) T ss_pred CCHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCH---HHHHHHHHHCCCHHHHCCHHHHHHHHHH---HHHHHHHHHHHH T ss_conf 4777750677544415868999999999971742---8899999711239874478999999999---999997029999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH---------------------------------------- Q ss_conf 4434445555443111234589999999999985200101---------------------------------------- Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPY---------------------------------------- 174 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~y---------------------------------------- 174 (271) +.+|..|-. ..+...|++.+......-|+.|- T Consensus 562 ~qianiye~--------led~aqaie~~~q~~slip~dp~ilskl~dlydqegdksqafq~~ydsyryfp~nie~iewl~ 633 (840) T KOG2003 562 VQIANIYEL--------LEDPAQAIELLMQANSLIPNDPAILSKLADLYDQEGDKSQAFQCHYDSYRYFPCNIETIEWLA 633 (840) T ss_pred HHHHHHHHH--------HHCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHH T ss_conf 999999998--------548799999999850158898899999998861036444545664301023676237999999 Q ss_pred --------HHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCH Q ss_conf --------345655---555556666666555666665443223456899988652899811599999999999974987 Q gi|254781174|r 175 --------VKGARF---YVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALM 243 (271) Q Consensus 175 --------a~~A~~---~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~ 243 (271) .+.+.. +...++.....-.+.||.+.-+.|+|..|...|+.+.+.||+.- +.|..|+++--.+|.. T Consensus 634 ayyidtqf~ekai~y~ekaaliqp~~~kwqlmiasc~rrsgnyqka~d~yk~~hrkfpedl---dclkflvri~~dlgl~ 710 (840) T KOG2003 634 AYYIDTQFSEKAINYFEKAALIQPNQSKWQLMIASCFRRSGNYQKAFDLYKDIHRKFPEDL---DCLKFLVRIAGDLGLK 710 (840) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHCCCCCH T ss_conf 8887647789999999888734853789999999998753648889999999887585501---9999999984002644 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999997 Q gi|254781174|r 244 DEAREVVSLIQ 254 (271) Q Consensus 244 d~A~~~~~~l~ 254 (271) | ++++...|. T Consensus 711 d-~key~~kle 720 (840) T KOG2003 711 D-AKEYADKLE 720 (840) T ss_pred H-HHHHHHHHH T ss_conf 6-899999999 No 49 >KOG2076 consensus Probab=98.73 E-value=5.5e-06 Score=53.01 Aligned_cols=68 Identities=12% Similarity=0.085 Sum_probs=50.3 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPES 127 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s 127 (271) ...+..+|...+-.|++++|.+.+.+|+.+-| .++.|.+.+|.+|-++|+-+++....-..--+.|++ T Consensus 139 l~~ll~eAN~lfarg~~eeA~~i~~EvIkqdp---~~~~ay~tL~~IyEqrGd~eK~l~~~llAAHL~p~d 206 (895) T KOG2076 139 LRQLLGEANNLFARGDLEEAEEILMEVIKQDP---RNPIAYYTLGEIYEQRGDIEKALNFWLLAAHLNPKD 206 (895) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCC---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC T ss_conf 99999998899980778999999999997273---651369999999998032999999999999669988 No 50 >pfam04733 Coatomer_E Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex. Probab=98.72 E-value=1e-05 Score=51.46 Aligned_cols=174 Identities=11% Similarity=0.141 Sum_probs=130.8 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +-...--|..+...|++++|++.+..- ..-++......++-++++.+.|...++++-+...++.. .. T Consensus 102 ~~~~~~aati~~~~g~~e~AL~~l~~~--------~~lE~~al~VQi~L~~nR~DLA~ke~~~~q~~~eDs~l-----~q 168 (290) T pfam04733 102 STLRLLAAIIFIHEGDFDDALKHLHKG--------ENLEAMALNVQILLKMHRIDLAEQQLKKMQQIDEDATL-----TQ 168 (290) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHCC--------CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHH-----HH T ss_conf 899999999999779999999998558--------96679999999999728836799999998843510599-----99 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 34445555443111234589999999999985200101345655555556666666555666665443223456899988 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLV 216 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~ 216 (271) ++.++.... .......+|.-.|+++-.+|+.|+.... ..|-.+..+|+|+.|...+++. T Consensus 169 La~awv~l~----~Ggek~q~A~yif~EL~~~~~~t~llln-----------------g~Av~~m~~~~~~eAe~~L~eA 227 (290) T pfam04733 169 LANAWVKLA----VGGEKIQDAYYIFQEFSEKYDSTPLLLN-----------------GQAVCCMCLGRYEEAESLLKEA 227 (290) T ss_pred HHHHHHHHH----HCCHHHHHHHHHHHHHHHHCCCCHHHHH-----------------HHHHHHHHCCCHHHHHHHHHHH T ss_conf 999999997----2647789899999999852289789999-----------------9999998747879999999999 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHCCCHHH-HHHHHHHHHHHCCCCHHHHHHH Q ss_conf 65289981159999999999997498799-9999999978569997899999 Q gi|254781174|r 217 LANYSDAEHAEEAMARLVEAYVALALMDE-AREVVSLIQERYPQGYWARYVE 267 (271) Q Consensus 217 i~~yp~t~~~~eAl~~l~~~y~~lg~~d~-A~~~~~~l~~~yP~s~~~~~a~ 267 (271) ++..|+. ++.+..++-+...+|...+ ...+...|...+|++.|.+.-. T Consensus 228 l~~d~~d---~dtlaNliv~s~~~gK~~e~~~ryl~qL~~~~p~H~~v~~~~ 276 (290) T pfam04733 228 LDKDAKD---PETLINLVVCALHLGKPAEVSNRNLSQLKLSHPTHPLVKDLN 276 (290) T ss_pred HHHCCCC---HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 8728898---899999999999809987899999999986299985999999 No 51 >KOG0543 consensus Probab=98.71 E-value=6.3e-07 Score=58.46 Aligned_cols=117 Identities=14% Similarity=0.153 Sum_probs=93.4 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999998098999999999998530477302------------68998778877654555679999988754011211 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVA------------RKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESK 128 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a------------~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~ 128 (271) =..|..+|+.|+|..|+..|++++...+...-. --+++++|-|+-++++|..|+..-.+.+..-|++. T Consensus 212 ke~Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~ 291 (397) T KOG0543 212 KERGNVLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKLKEYKEAIESCNKVLELDPNNV 291 (397) T ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf 77515887512199999999999987511346997899999888877764799888741218899999999884589962 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 013555443444555544311123458999999999998520010134565555555666666 Q gi|254781174|r 129 NVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAA 191 (271) Q Consensus 129 ~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~ 191 (271) . |+|++|.++..+ +..+.|+..|+..+...|++ ++++..+..|+.+..+ T Consensus 292 K---ALyRrG~A~l~~--------~e~~~A~~df~ka~k~~P~N---ka~~~el~~l~~k~~~ 340 (397) T KOG0543 292 K---ALYRRGQALLAL--------GEYDLARDDFQKALKLEPSN---KAARAELIKLKQKIRE 340 (397) T ss_pred H---HHHHHHHHHHHH--------CCHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHHHH T ss_conf 6---777878898864--------20989999999999748886---8999999999999999 No 52 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=98.68 E-value=1.5e-05 Score=50.36 Aligned_cols=75 Identities=13% Similarity=0.216 Sum_probs=53.1 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC-------------------------------CCHHHHHHHHHHHH Q ss_conf 68999999999998098999999999998530477-------------------------------30268998778877 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFA-------------------------------GVARKSLLMSAFVQ 104 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s-------------------------------~~a~~A~~~la~~~ 104 (271) ..++.|+.....-++|++..|+..+++....+|-+ ++-..++-.+|.++ T Consensus 30 ~~d~~yd~~Ii~AR~G~~~~al~~L~~~~~~~~~~~~~v~D~l~i~sWag~d~~v~~~~~~~~~~~~lp~~~l~~~Aka~ 109 (818) T PRK10049 30 AADTAYDALIIEARAGDTAPALSWLQQKSKAGALSNGQVADWLQIALWAGQDQEVITVYERYQSHMQLPARGLAAAARAY 109 (818) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 32568999999986488236899999875338766788988999985347777999999961888889979999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 65455567999998875401121101 Q gi|254781174|r 105 YSAGKYQQAASLGEEYITQYPESKNV 130 (271) Q Consensus 105 y~~~~y~~A~~~~~~fi~~~P~s~~~ 130 (271) -..++|+.|+..+...+.+.|++++. T Consensus 110 Rn~kq~~~Al~ly~~~~~~~P~n~d~ 135 (818) T PRK10049 110 RNLKQWQNSLTLWQKALSLEPQNKDY 135 (818) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCH T ss_conf 63554677999999985029999407 No 53 >PRK11189 lipoprotein NlpI; Provisional Probab=98.67 E-value=3.6e-06 Score=54.03 Aligned_cols=172 Identities=20% Similarity=0.200 Sum_probs=110.2 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 68999999999998098999999999998530477302689987788776545556799999887540112110135554 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) .++..--.|..+.+.|+|+.|.+.|..+.+.-|..+++ .++.|.+.|--++++.|...+.+|...-|+.|...-=.| T Consensus 98 ~adayN~LGiy~tq~~~fD~AyEAFDs~LEL~P~y~YA---~lNRGIAlYYg~R~~LA~~D~~~fy~~dp~DpYR~LWLY 174 (297) T PRK11189 98 MADAYNYLGIYLTQEGEFDAAYEAFDSVLELDPTYDYA---YLNRGIALYYGGRYDLAQDDLLAFYQDDPNDPYRALWLY 174 (297) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH---HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 58999999999999711999999987777269873489---983027887446378899999999960999827999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHCCHHHHHHHHHH-------HHHHHHHHHHHHH Q ss_conf 4344455554431112345899999999-------------9998520010134565555-------5556666666555 Q gi|254781174|r 136 LVGMSYAQMIRDVPYDQRATKLMLQYMS-------------RIVERYTNSPYVKGARFYV-------TVGRNQLAAKEVE 195 (271) Q Consensus 136 ~~a~~~~~~~~~~~~d~~~~~~A~~~f~-------------~~i~~yP~S~ya~~A~~~l-------~~~~~~La~~e~~ 195 (271) +... ..|+. +|...++ .++.-|-+.--..+...++ ..+-++|-+--|+ T Consensus 175 l~e~---------~~d~~---~A~~~L~~r~~~~~~~~W~~~iV~~ylg~ise~~~~~~~~~~~~~n~~lAErLcEaYFY 242 (297) T PRK11189 175 LIEQ---------KLDPK---QAKENLKQRYEKLDKEQWGWNIVDFYLGKISEEQLFERLKAGATDNTELAERLCETYFY 242 (297) T ss_pred HHHC---------CCCHH---HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 9981---------16999---99999999998357122689999999458999999999987273709999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHCCC Q ss_conf 66666544322345689998865289-981159999999999997498 Q gi|254781174|r 196 IGRYYLKRGEYVAAIPRFQLVLANYS-DAEHAEEAMARLVEAYVALAL 242 (271) Q Consensus 196 ia~~Y~~~~~y~aA~~~~~~~i~~yp-~t~~~~eAl~~l~~~y~~lg~ 242 (271) +|++|...|++..|++.|+-.+.+-. +.....-|+.-|+.....+.. T Consensus 243 LaK~~~~~G~~~~A~~~fKLAla~NVyeFVEhRYA~lEL~~l~~~~~~ 290 (297) T PRK11189 243 LAKYYLSLGDLDEAANLFKLALANNVYNFVEHRYALLELALLGQDLDD 290 (297) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999879989999999998704825888999999999999999999 No 54 >KOG1155 consensus Probab=98.67 E-value=8.1e-06 Score=52.01 Aligned_cols=61 Identities=16% Similarity=0.171 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHC----CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 556666654432234568999886528----9981159999999999997498799999999997 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANY----SDAEHAEEAMARLVEAYVALALMDEAREVVSLIQ 254 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~y----p~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~ 254 (271) ..+|+.|.+.+++..|...|...|+.+ .-++...+|...|++=+.+.++.++|..+...+. T Consensus 470 ~~LakLye~l~d~~eAa~~yek~v~~~~~eg~~~~~t~ka~~fLA~~f~k~~~~~~As~Ya~~~~ 534 (559) T KOG1155 470 VRLAKLYEELKDLNEAAQYYEKYVEVSELEGEIDDETIKARLFLAEYFKKMKDFDEASYYATLVL 534 (559) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 99999999987479999999999999886145663899999999999985012578999999874 No 55 >KOG1173 consensus Probab=98.66 E-value=3.4e-06 Score=54.22 Aligned_cols=190 Identities=19% Similarity=0.210 Sum_probs=117.4 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +...|.-|..++--|++.+|.+.|.+....-|. +++ |-+..|-++-..++-++|+..|...-+.+|.+.. | ..- T Consensus 312 a~sW~aVg~YYl~i~k~seARry~SKat~lD~~--fgp-aWl~fghsfa~e~EhdQAmaaY~tAarl~~G~hl-P--~LY 385 (611) T KOG1173 312 ALSWFAVGCYYLMIGKYSEARRYFSKATTLDPT--FGP-AWLAFGHSFAGEGEHDQAMAAYFTAARLMPGCHL-P--SLY 385 (611) T ss_pred CCCHHHHHHHHHHHCCCHHHHHHHHHHHHCCCC--CCH-HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCC-H--HHH T ss_conf 731465778999814867999998887603765--327-9998767765010578899999999986667763-1--789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH--------------HHHH----HHHHHHHHHHH------ Q ss_conf 34445555443111234589999999999985200101345--------------6555----55556666666------ Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKG--------------ARFY----VTVGRNQLAAK------ 192 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~--------------A~~~----l~~~~~~La~~------ 192 (271) +|+-|... .+.+-|-..|..-+...|+.|++.. |... +..+.+.+.++ T Consensus 386 lgmey~~t--------~n~kLAe~Ff~~A~ai~P~Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~ 457 (611) T KOG1173 386 LGMEYMRT--------NNLKLAEKFFKQALAIAPSDPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSVLNEKIFWEPT 457 (611) T ss_pred HHHHHHHH--------CCHHHHHHHHHHHHHCCCCCCHHHHHHHHEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 99999874--------2398999999998711898604433144203327546889999999988766305432211577 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHH Q ss_conf 55566666544322345689998865289981159999999999997498799999999997856999789 Q gi|254781174|r 193 EVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWA 263 (271) Q Consensus 193 e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~ 263 (271) -..+|..|-+.+.|..||..|+..+..=|.+. .+.--+|-+|..+|+.|.|.+.+-.-..-=|+.... T Consensus 458 ~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~k~~---~~~asig~iy~llgnld~Aid~fhKaL~l~p~n~~~ 525 (611) T KOG1173 458 LNNLGHAYRKLNKYEEAIDYYQKALLLSPKDA---STHASIGYIYHLLGNLDKAIDHFHKALALKPDNIFI 525 (611) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHH T ss_conf 87678999997667999999999987187765---688889999999658077999999987148752899 No 56 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=98.64 E-value=2.7e-05 Score=48.95 Aligned_cols=187 Identities=16% Similarity=0.101 Sum_probs=112.7 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) .-..-.-|....+.|+++.+-..+++.....|.+..+. .+..|......|+|++|...+++..+..|+||.+-. + T Consensus 118 llnyL~AArAA~~~g~~~~rd~yL~~A~e~~p~a~lav--~lt~Ael~l~~~q~e~Ala~L~~L~~~~p~~~~VL~---L 192 (398) T PRK10747 118 VVNYLLAAEAAQQRGDEARANQHLERAAELAGNDTIPV--EITRVRLQLARNENHAARHGVDKLLEVTPRHPEVLR---L 192 (398) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHH--HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHH---H T ss_conf 88999999999988898999999999888589848899--999999999779989999999999861999999999---9 Q ss_pred HHHHHHHH---------HHHHH---------------------------------------------------------- Q ss_conf 34445555---------44311---------------------------------------------------------- Q gi|254781174|r 137 VGMSYAQM---------IRDVP---------------------------------------------------------- 149 (271) Q Consensus 137 ~a~~~~~~---------~~~~~---------------------------------------------------------- 149 (271) ...+|..+ .+... T Consensus 193 l~~~y~~~~dW~aL~~LLp~L~K~~~l~~~e~~~L~~~a~~~ll~~~~~~~~~~~L~~~W~~lp~~~r~~~~l~~~ya~~ 272 (398) T PRK10747 193 AEQAYIRTGAWSSLLDIIPSMAKAHVGDEEHRAMLEQQAWIGLMDQARADNGSEGLRNWWKNQSRKTRHQVALQVAMAEH 272 (398) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHCCHHHHHHHHHH T ss_conf 99999987459999999999987579999999999999999999998732347999999986899875699999999999 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHCCHHHH--------HHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHH Q ss_conf -123458999999999998520010134--------5655555556666666------5556666654432234568999 Q gi|254781174|r 150 -YDQRATKLMLQYMSRIVERYTNSPYVK--------GARFYVTVGRNQLAAK------EVEIGRYYLKRGEYVAAIPRFQ 214 (271) Q Consensus 150 -~d~~~~~~A~~~f~~~i~~yP~S~ya~--------~A~~~l~~~~~~La~~------e~~ia~~Y~~~~~y~aA~~~~~ 214 (271) .+.+...+|.....+.++..+++..+. +....+..+...+..+ .+..|+.+.+.+.+..|..+|+ T Consensus 273 Li~~g~~~~A~~ll~~~Lkk~wd~~Lv~ly~~l~~~d~~~~l~~~e~wlk~~~~~~~Ll~aLGrL~~~~~lwgkA~~yle 352 (398) T PRK10747 273 LIECDDHDTAQQIIIDGLKRQYDDRLVLPIPRLKTNNPEQLEKVLRQQIKQVGDRPLLWSTLGQSLMKHGEWQEASLAFR 352 (398) T ss_pred HHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCHHHHHHHHHH T ss_conf 98689968999999999861899899999875677998999999999873189998999999999997350999999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 88652899811599999999999974987999999999 Q gi|254781174|r 215 LVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 215 ~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) ..+..=|+ +++...|++.+.++|+.++|.+.++. T Consensus 353 ~sl~l~p~----~~~~~~La~l~e~lg~~~~A~~~yr~ 386 (398) T PRK10747 353 AALKQRPD----AYDYAWLADALDRLHKPEEAAAMRRD 386 (398) T ss_pred HHHHCCCC----HHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 99865999----99999999999985997999999999 No 57 >KOG2003 consensus Probab=98.61 E-value=5.6e-06 Score=52.93 Aligned_cols=26 Identities=23% Similarity=0.253 Sum_probs=16.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 99999999809899999999999853 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRD 86 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~ 86 (271) ..+|..++++|++..|++.++.+..+ T Consensus 423 i~ka~~~lk~~d~~~aieilkv~~~k 448 (840) T KOG2003 423 INKAGELLKNGDIEGAIEILKVFEKK 448 (840) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHC T ss_conf 46888888626778899999998732 No 58 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=98.59 E-value=2.7e-06 Score=54.83 Aligned_cols=170 Identities=15% Similarity=0.151 Sum_probs=113.9 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 73689999999999980989999999999985304773026899877887765455567999998875401121101355 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) ...+...|.+|.-.|=.|+|++|.+.|++.++.-|.--+ +++.||...|+.|..+.....|.+-++.||..|.++. T Consensus 428 ~~~~~~YYHRaQl~FI~~~f~~A~KDy~K~~~LDp~~IF---~~IQL~~~~Y~~~~~~~~~~~F~~~~~~Fp~~P~V~N- 503 (649) T TIGR00990 428 SEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIF---SYIQLGVLQYKLGSIASSMALFRRCKKNFPEAPDVYN- 503 (649) T ss_pred CCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH---HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCC- T ss_conf 898751101778998873077675457766432711022---6899999886303443478999999861778888530- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHH Q ss_conf 54434445555443111234589999999999985200101-3456555555566666665556666654---4322345 Q gi|254781174|r 134 YYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPY-VKGARFYVTVGRNQLAAKEVEIGRYYLK---RGEYVAA 209 (271) Q Consensus 134 ~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~y-a~~A~~~l~~~~~~La~~e~~ia~~Y~~---~~~y~aA 209 (271) -...+-.|+....+|++.|..-|..=-..+= .... +- |-. -+...++ ..+|..| T Consensus 504 ----------yy~ElL~D~~~F~~A~~~fD~AI~LE~~~~~~~~~v---~P-----L~~----Ka~~L~~nP~~~~f~EA 561 (649) T TIGR00990 504 ----------YYGELLLDKNKFDEAIEKFDLAIELEKELKPIYVNV---LP-----LIN----KALALFQNPWKQDFIEA 561 (649) T ss_pred ----------HHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCEEEEC---HH-----HHH----HHHHHHHCCCHHHHHHH T ss_conf ----------578886143679999998568999875059807631---33-----677----89997626521007899 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 6899988652899811599999999999974987999999999 Q gi|254781174|r 210 IPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 210 ~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) .+-++..+.-=|.+. .|.--|+..-.+.|..+||.++++. T Consensus 562 ~~L~EKA~~~DP~~~---~A~~~lAQl~LQQ~~v~EAl~~FE~ 601 (649) T TIGR00990 562 ENLLEKALIIDPECD---IAVATLAQLLLQQGDVDEALKLFER 601 (649) T ss_pred HHHHHHHHCCCCCHH---HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999888630488414---7899999987623468999999987 No 59 >KOG2376 consensus Probab=98.58 E-value=2.9e-05 Score=48.79 Aligned_cols=71 Identities=17% Similarity=0.273 Sum_probs=34.1 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHH----HCCCCCCH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 689999999999980989999999999985----30477302--------689987788776545556799999887540 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSR----DFPFAGVA--------RKSLLMSAFVQYSAGKYQQAASLGEEYITQ 123 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~----~~P~s~~a--------~~A~~~la~~~y~~~~y~~A~~~~~~fi~~ 123 (271) +-+.+|+.|-.+.+.|+|.+|++.++.... ..-.+... .-....+++++..+|+-.+|...|..+|+. T Consensus 174 syel~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~~i~~ 253 (652) T KOG2376 174 SYELLYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVDIIKR 253 (652) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 29999999999985034899999999999988876314665306688887789999999999943308789999999872 Q ss_pred CCC Q ss_conf 112 Q gi|254781174|r 124 YPE 126 (271) Q Consensus 124 ~P~ 126 (271) .|. T Consensus 254 ~~~ 256 (652) T KOG2376 254 NPA 256 (652) T ss_pred CCC T ss_conf 688 No 60 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=98.54 E-value=5.2e-05 Score=47.29 Aligned_cols=73 Identities=12% Similarity=0.129 Sum_probs=53.2 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 7368999999999998098999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) ..++..+=..|..+-+.++|..|++.++.+....|.++ +..+-++.+.=..+++.+|...+.++...+|+.+. T Consensus 96 ~lp~~~l~~~Aka~Rn~kq~~~Al~ly~~~~~~~P~n~---d~~lGlalt~Ada~~~~~Al~~~~~~~~~~p~~~~ 168 (818) T PRK10049 96 QLPARGLAAAARAYRNLKQWQNSLTLWQKALSLEPQNK---DYQSGQILTLADAGHYDEALVKAKQLVAGAPDVAN 168 (818) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC---CHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH T ss_conf 89979999999998635546779999999850299994---07899999998638869999999998762897678 No 61 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=98.48 E-value=6.6e-06 Score=52.53 Aligned_cols=141 Identities=19% Similarity=0.170 Sum_probs=101.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 30268998778877654555679999988754011211013555443444555544311123458999999999998520 Q gi|254781174|r 91 GVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYT 170 (271) Q Consensus 91 ~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP 170 (271) ..+.++.+.+|..|...|++..|...+++.++--|++.. |+-.+|..|..+ +..+.|-+.|+.-++.-| T Consensus 32 ~~aa~arlqLal~YL~~gd~~~A~~nlekAL~~DPs~~~---a~~~~A~~Yq~~--------Ge~~~A~e~YrkAlsl~p 100 (250) T COG3063 32 NEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYL---AHLVRAHYYQKL--------GENDLADESYRKALSLAP 100 (250) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHC--------CCHHHHHHHHHHHHHCCC T ss_conf 899999999999999778999999889999884952088---999999999990--------775567999999985298 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 01013456555555566666665556666654432234568999886528998115999999999999749879999999 Q gi|254781174|r 171 NSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVV 250 (271) Q Consensus 171 ~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~ 250 (271) ++--+ .. .-|-|.-.+|.|..|...|...+.. |..+....++-.+|.|-.+.|..+.|+.++ T Consensus 101 ~~GdV---LN--------------NYG~FLC~qg~~~eA~q~F~~Al~~-P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l 162 (250) T COG3063 101 NNGDV---LN--------------NYGAFLCAQGRPEEAMQQFERALAD-PAYGEPSDTLENLGLCALKAGQFDQAEEYL 162 (250) T ss_pred CCCCH---HH--------------HHHHHHHHCCCHHHHHHHHHHHHHC-CCCCCCCHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 86405---52--------------1069997089868999999998718-887775304544578886437704479999 Q ss_pred HHHHHHCCCC Q ss_conf 9997856999 Q gi|254781174|r 251 SLIQERYPQG 260 (271) Q Consensus 251 ~~l~~~yP~s 260 (271) +.-...=|+. T Consensus 163 ~raL~~dp~~ 172 (250) T COG3063 163 KRALELDPQF 172 (250) T ss_pred HHHHHHCCCC T ss_conf 9998837578 No 62 >KOG1173 consensus Probab=98.48 E-value=6.5e-05 Score=46.74 Aligned_cols=199 Identities=20% Similarity=0.167 Sum_probs=135.3 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 36899999999999809899999999999853047730268998778877654555679999988754011211013555 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) ...+-+-.+|..++...+|.+-.+.++.+....|...-.- -+.+| |++..|+...-...-.+.++.||+++.. + T Consensus 242 ~~~dll~~~ad~~y~~c~f~~c~kit~~lle~dpfh~~~~--~~~ia-~l~el~~~n~Lf~lsh~LV~~yP~~a~s---W 315 (611) T KOG1173 242 ENLDLLAEKADRLYYGCRFKECLKITEELLEKDPFHLPCL--PLHIA-CLYELGKSNKLFLLSHKLVDLYPSKALS---W 315 (611) T ss_pred HCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHH--HHHHH-HHHHHCCCCHHHHHHHHHHHHCCCCCCC---H T ss_conf 1479999999999971719999998599873278985139--99999-9998165216999888798728887731---4 Q ss_pred HHHHHHHHHHHHHHHHH---------------------------HHHHHHHHHHHHHHHHHHHCCH--HHHHHHHHHHHH Q ss_conf 44344455554431112---------------------------3458999999999998520010--134565555555 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYD---------------------------QRATKLMLQYMSRIVERYTNSP--YVKGARFYVTVG 185 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d---------------------------~~~~~~A~~~f~~~i~~yP~S~--ya~~A~~~l~~~ 185 (271) |-+|.-|+ .+...... .+-.+.|+..+...-+.+|++- +---.-+.+..- T Consensus 316 ~aVg~YYl-~i~k~seARry~SKat~lD~~fgpaWl~fghsfa~e~EhdQAmaaY~tAarl~~G~hlP~LYlgmey~~t~ 394 (611) T KOG1173 316 FAVGCYYL-MIGKYSEARRYFSKATTLDPTFGPAWLAFGHSFAGEGEHDQAMAAYFTAARLMPGCHLPSLYLGMEYMRTN 394 (611) T ss_pred HHHHHHHH-HHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHC T ss_conf 65778999-81486799999888760376532799987677650105788999999999866677631789999998742 Q ss_pred HHHHHHHHH---------------HHHHHHHHHHHHHHHHHHHHHHHHHC----CCCHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 666666655---------------56666654432234568999886528----99811599999999999974987999 Q gi|254781174|r 186 RNQLAAKEV---------------EIGRYYLKRGEYVAAIPRFQLVLANY----SDAEHAEEAMARLVEAYVALALMDEA 246 (271) Q Consensus 186 ~~~La~~e~---------------~ia~~Y~~~~~y~aA~~~~~~~i~~y----p~t~~~~eAl~~l~~~y~~lg~~d~A 246 (271) +-.||++-| ++|-..|..+.|..|.+.|+.++..- +..+.-+-.+-.||-+|-+++..++| T Consensus 395 n~kLAe~Ff~~A~ai~P~Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eA 474 (611) T KOG1173 395 NLKLAEKFFKQALAIAPSDPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEA 474 (611) T ss_pred CHHHHHHHHHHHHHCCCCCCHHHHHHHHEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 39899999999871189860443314420332754688999999998876630543221157787678999997667999 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999997856999 Q gi|254781174|r 247 REVVSLIQERYPQG 260 (271) Q Consensus 247 ~~~~~~l~~~yP~s 260 (271) ..+++.-..--|.+ T Consensus 475 I~~~q~aL~l~~k~ 488 (611) T KOG1173 475 IDYYQKALLLSPKD 488 (611) T ss_pred HHHHHHHHHCCCCC T ss_conf 99999998718776 No 63 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=98.45 E-value=4e-06 Score=53.81 Aligned_cols=130 Identities=15% Similarity=0.130 Sum_probs=100.1 Q ss_pred HHHHCCCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9853047730268--99877887765455567999998875401121101355544344455554431112345899999 Q gi|254781174|r 83 CSRDFPFAGVARK--SLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQ 160 (271) Q Consensus 83 i~~~~P~s~~a~~--A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~ 160 (271) ++...|.+.-..+ +.+.-|.+....|+|.+|+..|.+.++.-++..+..+++|.+|+++..+ +...+|++ T Consensus 18 ~~k~~p~~~~~k~af~yyr~GmsaQs~GeyaEALe~Y~eAL~Le~D~~Drs~iLYNIGlIy~~n--------Ge~~kALe 89 (165) T CHL00033 18 LLKILPTTSGEKEAFTYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSN--------GEHTKALE 89 (165) T ss_pred HHHHCCCCCCHHHHHHHHHCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHC--------CCHHHHHH T ss_conf 9998088842288999998151010135389999999986234568512668998999999985--------88889999 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999852001013456555555566666665556666654432234568999886528998 Q gi|254781174|r 161 YMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDA 223 (271) Q Consensus 161 ~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t 223 (271) .|.+.+..-|+- .+|...|..+.....++..+.|+.-....-++.|...++..|...|+. T Consensus 90 yY~qALeiNP~~---pqA~NNlGvIyh~qG~~a~~~~~~~~~~~~fd~aa~~w~~ai~lap~n 149 (165) T CHL00033 90 YYFQALERNPFL---PQALNNMAVICHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAIALAPGN 149 (165) T ss_pred HHHHHHHHCCCC---HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 999999859996---899978999999987999984586789999999999999987038997 No 64 >KOG0548 consensus Probab=98.45 E-value=3.7e-05 Score=48.13 Aligned_cols=182 Identities=16% Similarity=0.176 Sum_probs=106.5 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999999809899999999999853047730268998778877654555679999988754011211013555443444 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMS 140 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~ 140 (271) =+.|...++++++..|++.+...+... +... -..+.|-+|+..+.+.+.+..-++-++.--... .+ +-+++.+ T Consensus 228 k~lgnaaykkk~f~~a~q~y~~a~el~--~~it--~~~n~aA~~~e~~~~~~c~~~c~~a~E~gre~r-ad--~klIak~ 300 (539) T KOG0548 228 KELGNAAYKKKDFETAIQHYAKALELA--TDIT--YLNNIAAVYLERGKYAECIELCEKAVEVGRELR-AD--YKLIAKA 300 (539) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH--HHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHH-HH--HHHHHHH T ss_conf 998899887454899999999998675--6667--888898899731379874422088998757899-99--9999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--C-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5555443111234589999999999985200--1-------013456555555566666665556666654432234568 Q gi|254781174|r 141 YAQMIRDVPYDQRATKLMLQYMSRIVERYTN--S-------PYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIP 211 (271) Q Consensus 141 ~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~--S-------~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~ 211 (271) .-. +.......+.+..|+..+++-+..+-+ - +-+........+++..+|.-+..-|.-+++.|+|..|+. T Consensus 301 ~~r-~g~a~~k~~~~~~ai~~~~kaLte~Rt~~~ls~lk~~Ek~~k~~e~~a~~~pe~A~e~r~kGne~Fk~gdy~~Av~ 379 (539) T KOG0548 301 LAR-LGNAYTKREDYEGAIKYYQKALTEHRTPDLLSKLKEAEKALKEAERKAYINPEKAEEEREKGNEAFKKGDYPEAVK 379 (539) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 998-6311355776788999999986632588878777899999999888873184678999987788775358899999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 999886528998115999999999999749879999999999 Q gi|254781174|r 212 RFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLI 253 (271) Q Consensus 212 ~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l 253 (271) +|.+.|..-|+.. .+.-..+-+|.++|....|..-.+.- T Consensus 380 ~YteAIkr~P~Da---~lYsNRAac~~kL~~~~~aL~Da~~~ 418 (539) T KOG0548 380 HYTEAIKRDPEDA---RLYSNRAACYLKLGEYPEALKDAKKC 418 (539) T ss_pred HHHHHHHCCCCHH---HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999984298415---78888999999873179899889998 No 65 >KOG0553 consensus Probab=98.42 E-value=4.7e-06 Score=53.38 Aligned_cols=109 Identities=14% Similarity=0.181 Sum_probs=90.3 Q ss_pred CCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 67368999999999998098999999999998530477302689987788776545556799999887540112110135 Q gi|254781174|r 53 DVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDY 132 (271) Q Consensus 53 ~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~ 132 (271) +..-++.|=.+|...+++++|.+|+..|.+.+..-|....- .=+.|-+|-++|.|+.|+...+.-|...|.+.. T Consensus 77 ~~~~AE~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVy---ycNRAAAy~~Lg~~~~AVkDce~Al~iDp~ysk--- 150 (304) T KOG0553 77 DKALAESLKNEGNKLMKNKDYQEAVDKYTEAIELDPTNAVY---YCNRAAAYSKLGEYEDAVKDCESALSIDPHYSK--- 150 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH---HHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHH--- T ss_conf 79999999998899998625999999999998458876267---778999999860117899999997762908999--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 5544344455554431112345899999999999852001013 Q gi|254781174|r 133 VYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYV 175 (271) Q Consensus 133 A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya 175 (271) ||-++|+.|+. ++...+|+..|++.+...|+.+-. T Consensus 151 ay~RLG~A~~~--------~gk~~~A~~aykKaLeldP~Ne~~ 185 (304) T KOG0553 151 AYGRLGLAYLA--------LGKYEEAIEAYKKALELDPDNESY 185 (304) T ss_pred HHHHHHHHHHC--------CCCHHHHHHHHHHHHCCCCCCHHH T ss_conf 99998799980--------673799999997643168884899 No 66 >KOG4340 consensus Probab=98.41 E-value=3.3e-05 Score=48.48 Aligned_cols=178 Identities=20% Similarity=0.225 Sum_probs=106.8 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99809899999999999853047730268998778877654555679999988754011211013555443444555544 Q gi|254781174|r 67 FLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIR 146 (271) Q Consensus 67 ~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~ 146 (271) .+.+.+|.+||+.+....++.|.+. + ++-.+|+|||...+|..|...|+..-.++|+-.. |-.| .|.+.|+... T Consensus 20 lI~d~ry~DaI~~l~s~~Er~p~~r-A--gLSlLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~q--YrlY-~AQSLY~A~i 93 (459) T KOG4340 20 LIRDARYADAIQLLGSELERSPRSR-A--GLSLLGYCYYRLQEFALAAECYEQLGQLHPELEQ--YRLY-QAQSLYKACI 93 (459) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCHH-H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHH--HHHH-HHHHHHHCCC T ss_conf 9987667889999988874396006-7--8889999999999998888999999863718888--9999-9999997035 Q ss_pred --------HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf --------3111234589999999999985200101345655555556-6666665556666654432234568999886 Q gi|254781174|r 147 --------DVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGR-NQLAAKEVEIGRYYLKRGEYVAAIPRFQLVL 217 (271) Q Consensus 147 --------~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~-~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i 217 (271) ..-.|+....+-.-.++..| .|...+.+ .++--+..+- +.-|..-...|..-++.|+|.+|+.+|+..+ T Consensus 94 ~ADALrV~~~~~D~~~L~~~~lqLqaAI-kYse~Dl~-g~rsLveQlp~en~Ad~~in~gCllykegqyEaAvqkFqaAl 171 (459) T KOG4340 94 YADALRVAFLLLDNPALHSRVLQLQAAI-KYSEGDLP-GSRSLVEQLPSENEADGQINLGCLLYKEGQYEAAVQKFQAAL 171 (459) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHH-HCCCCCCC-CHHHHHHHCCCCCCHHHHHCCHHHEECCCCHHHHHHHHHHHH T ss_conf 2889999999568498999999999998-60545553-549999857788850233030120001654899999999987 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 52899811599999999999974987999999999978 Q gi|254781174|r 218 ANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQE 255 (271) Q Consensus 218 ~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~ 255 (271) +- +.+.+-..|.++-+++.-|+.+.|-+..+.++. T Consensus 172 qv---sGyqpllAYniALaHy~~~qyasALk~iSEIie 206 (459) T KOG4340 172 QV---SGYQPLLAYNLALAHYSSRQYASALKHISEIIE 206 (459) T ss_pred HH---CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 30---576727899999998731207789999999997 No 67 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=98.37 E-value=1e-05 Score=51.44 Aligned_cols=156 Identities=13% Similarity=0.136 Sum_probs=91.7 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 99999980989999999999985304773026899877887765455567999998875401121101355544344455 Q gi|254781174|r 63 KAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYA 142 (271) Q Consensus 63 ~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~ 142 (271) .|.-.+=.|+-.+|...+.+.++.+|. ..++.+..|...-.+|+.+++...|++.|...|+.+. +||.+|..+| T Consensus 369 ~G~F~~L~~~~~~A~~Dl~K~IEL~P~---~~qSYI~~A~~~~~~~~~~~~~~~F~~Al~~~~~~~~---~YYHRaQl~F 442 (649) T TIGR00990 369 RGIFKCLKGKHLEALADLSKSIELDPR---VTQSYIKRASINLELGDPDKAEEDFDKALKKNSEDPD---IYYHRAQLHF 442 (649) T ss_pred CCHHHHHCCCHHHHHHHHHHHHHCCCC---CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCC---CEEHHHHHHH T ss_conf 422233128815789987765422787---3178999999974157842258899999861889875---1101778998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 55443111234589999999999985200101345655555556666666555666665443223456899988652899 Q gi|254781174|r 143 QMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSD 222 (271) Q Consensus 143 ~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~ 222 (271) -. +...+|-+.|+.-++.-|+--+ .+.++|-.-|+.+........|+..+++||. T Consensus 443 I~--------~~f~~A~KDy~K~~~LDp~~IF-----------------~~IQL~~~~Y~~~~~~~~~~~F~~~~~~Fp~ 497 (649) T TIGR00990 443 IK--------GEFAQAGKDYQKSIDLDPDFIF-----------------SYIQLGVLQYKLGSIASSMALFRRCKKNFPE 497 (649) T ss_pred HH--------HHHHHHHHHHHHHHHCCCCCCH-----------------HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 87--------3077675457766432711022-----------------6899999886303443478999999861778 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 811599999999999974987999999999 Q gi|254781174|r 223 AEHAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 223 t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) +| +..-.-+|........++|.+-+.+ T Consensus 498 ~P---~V~Nyy~ElL~D~~~F~~A~~~fD~ 524 (649) T TIGR00990 498 AP---DVYNYYGELLLDKNKFDEAIEKFDL 524 (649) T ss_pred CC---CCCCHHHHHHHCHHHHHHHHHHHHH T ss_conf 88---8530578886143679999998568 No 68 >KOG2076 consensus Probab=98.37 E-value=0.00014 Score=44.73 Aligned_cols=63 Identities=21% Similarity=0.228 Sum_probs=27.9 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999809899999999999853047730--268998778877654555679999988754 Q gi|254781174|r 60 VYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGV--ARKSLLMSAFVQYSAGKYQQAASLGEEYIT 122 (271) Q Consensus 60 lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~--a~~A~~~la~~~y~~~~y~~A~~~~~~fi~ 122 (271) +|+++.-+.+.|+...|.+.|.++...-|.+++ ..+.....+..+...++-+.|+..++.+++ T Consensus 244 ~~ers~L~~~~G~~~~Am~~f~~l~~~~p~~d~er~~d~i~~~~~~~~~~~~~e~a~~~le~~~s 308 (895) T KOG2076 244 IYERSSLYQKTGDLKRAMETFLQLLQLDPPVDIERIEDLIRRVAHYFITHNERERAAKALEGALS 308 (895) T ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999984707888999999986289015899999999999999971078999999997774 No 69 >KOG1156 consensus Probab=98.29 E-value=0.00023 Score=43.54 Aligned_cols=63 Identities=16% Similarity=0.308 Sum_probs=27.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH-HHCCCHHHHHHHHHHHHHHCCCCH Q ss_conf 666665443223456899988652899811599999999999-974987999999999978569997 Q gi|254781174|r 196 IGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAY-VALALMDEAREVVSLIQERYPQGY 261 (271) Q Consensus 196 ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y-~~lg~~d~A~~~~~~l~~~yP~s~ 261 (271) -|.++.+.+.+..|+..|..++...|+.-..-+-+ ..++ .-.+....-+..+..+..+||-+. T Consensus 225 ka~l~~kl~~lEeA~~~y~~Ll~rnPdn~~Yy~~l---~~~lgk~~d~~~~lk~ly~~ls~~y~r~e 288 (700) T KOG1156 225 KADLLMKLGQLEEAVKVYRRLLERNPDNLDYYEGL---EKALGKIKDMLEALKALYAILSEKYPRHE 288 (700) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 88899987418768999999986293367899999---99999984339999999998752375322 No 70 >KOG1174 consensus Probab=98.29 E-value=8.2e-05 Score=46.15 Aligned_cols=208 Identities=15% Similarity=0.098 Sum_probs=137.1 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHH--H-----------------------------HHHHHHH Q ss_conf 89999999999980989999999999985304773026899--8-----------------------------7788776 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSL--L-----------------------------MSAFVQY 105 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~--~-----------------------------~la~~~y 105 (271) ..-+-..|..++-.|++++|+..|+++....|+.-.+-+-. + -.+...| T Consensus 232 vhLl~~lak~~~~~Gdn~~a~~~Fe~~~~~dpy~i~~MD~Ya~LL~~eg~~e~~~~L~~~Lf~~~~~ta~~wfV~~~~l~ 311 (564) T KOG1174 232 EHLMMALGKCLYYNGDYFQAEDIFSSTLCANPDNVEAMDLYAVLLGQEGGCEQDSALMDYLFAKVKYTASHWFVHAQLLY 311 (564) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCEEEEEEHHH T ss_conf 88998875565341675689999988750781555568999999986048767899999998522023240234310145 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH--------------------------HHHHHHHHHHHHHHH Q ss_conf 545556799999887540112110135554434445555--------------------------443111234589999 Q gi|254781174|r 106 SAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQM--------------------------IRDVPYDQRATKLML 159 (271) Q Consensus 106 ~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~--------------------------~~~~~~d~~~~~~A~ 159 (271) ..++|..|+...++-|+..|+|-. |+.++|...... .-.....++.+++|. T Consensus 312 ~~K~~~rAL~~~eK~I~~~~r~~~---alilKG~lL~~~~R~~~A~IaFR~Aq~Lap~rL~~Y~GL~hsYLA~~~~kEA~ 388 (564) T KOG1174 312 DEKKFERALNFVEKCIDSEPRNHE---ALILKGRLLIALERHTQAVIAFRTAQMLAPYRLEIYRGLFHSYLAQKRFKEAN 388 (564) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCH---HHHHCCHHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 776689999999887615855336---78861399885456288999999987425145789988999999614089999 Q ss_pred HHHHHHHHHHHCCHHHHHHH-----H---HHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999998520010134565-----5---5555566666665-----------556666654432234568999886528 Q gi|254781174|r 160 QYMSRIVERYTNSPYVKGAR-----F---YVTVGRNQLAAKE-----------VEIGRYYLKRGEYVAAIPRFQLVLANY 220 (271) Q Consensus 160 ~~f~~~i~~yP~S~ya~~A~-----~---~l~~~~~~La~~e-----------~~ia~~Y~~~~~y~aA~~~~~~~i~~y 220 (271) ..-+..++.+|+|-.+-.-. . .+..--..++++. ..+|......|.+..+|.-++..+.+| T Consensus 389 ~~An~~~~~~~~sA~~LtL~g~~V~~~dp~~rEKAKkf~ek~L~~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~ 468 (564) T KOG1174 389 ALANWTIRLFQNSARSLTLFGTLVLFPDPRMREKAKKFAEKSLKINPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLIIF 468 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHC T ss_conf 99999999765314467663000115685067999999984203588608999999999986276541899999987425 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHC Q ss_conf 998115999999999999749879999999999785699978999998609 Q gi|254781174|r 221 SDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETLVK 271 (271) Q Consensus 221 p~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~l~ 271 (271) ||.. ..-.||+..-..+..++|.+++..-..-=|+++-...-=+++| T Consensus 469 ~D~~----LH~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP~~~~sl~Gl~~lE 515 (564) T KOG1174 469 PDVN----LHNHLGDIMRAQNEPQKAMEYYYKALRQDPKSKRTLRGLRLLE 515 (564) T ss_pred CCHH----HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 6208----9999999999861079999999999705964077888899987 No 71 >KOG0543 consensus Probab=98.23 E-value=5.6e-05 Score=47.12 Aligned_cols=69 Identities=29% Similarity=0.259 Sum_probs=54.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 556666654432234568999886528998115999999999999749879999999999785699978999 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARY 265 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~ 265 (271) +.+|-.|.+.+.|.+|+.....+|+.=|+.. .||||-|+++..+|.++.|+..++....-.|+++-.+. T Consensus 261 lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~---KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~ 329 (397) T KOG0543 261 LNLAACYLKLKEYKEAIESCNKVLELDPNNV---KALYRRGQALLALGEYDLARDDFQKALKLEPSNKAARA 329 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 4799888741218899999999884589962---67778788988642098999999999974888689999 No 72 >KOG0624 consensus Probab=98.23 E-value=6.9e-05 Score=46.59 Aligned_cols=74 Identities=14% Similarity=0.134 Sum_probs=37.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) -..+|..|..++.-|+-..|+..|..|+..-|.= .-|....|..+.++|++++|+..|+..++-.|+....-+| T Consensus 72 Y~aifrRaT~yLAmGksk~al~Dl~rVlelKpDF---~~ARiQRg~vllK~Gele~A~~DF~~vl~~~~s~~~~~ea 145 (504) T KOG0624 72 YQAIFRRATVYLAMGKSKAALQDLSRVLELKPDF---MAARIQRGVVLLKQGELEQAEADFDQVLQHEPSNGLVLEA 145 (504) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCH---HHHHHHHCHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 9999987777765257740033089998618318---8888872203553252888987999997259873026889 No 73 >KOG0548 consensus Probab=98.23 E-value=0.00023 Score=43.50 Aligned_cols=111 Identities=21% Similarity=0.180 Sum_probs=64.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 98778877654555679999988754011211013555443444555544311123458999999999998520010134 Q gi|254781174|r 97 LLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVK 176 (271) Q Consensus 97 ~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~ 176 (271) .-.-|..+|+.++|..|+..|++.|.+.|+++. .+=.+|.||.+. .....|+...+..|+.-|+-. T Consensus 361 ~r~kGne~Fk~gdy~~Av~~YteAIkr~P~Da~---lYsNRAac~~kL--------~~~~~aL~Da~~~ieL~p~~~--- 426 (539) T KOG0548 361 EREKGNEAFKKGDYPEAVKHYTEAIKRDPEDAR---LYSNRAACYLKL--------GEYPEALKDAKKCIELDPNFI--- 426 (539) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHH--------HHHHHHHHHHHHHHHCCCHHH--- T ss_conf 998778877535889999999999842984157---888899999987--------317989988999875093189--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 56555555566666665556666654432234568999886528998115999999999 Q gi|254781174|r 177 GARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVE 235 (271) Q Consensus 177 ~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~ 235 (271) .+. +.-|..+.-.++|+.|...|+..++.-|+.....+.+.+..+ T Consensus 427 kgy--------------~RKg~al~~mk~ydkAleay~eale~dp~~~e~~~~~~rc~~ 471 (539) T KOG0548 427 KAY--------------LRKGAALRAMKEYDKALEAYQEALELDPSNAEAIDGYRRCVE 471 (539) T ss_pred HHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 999--------------999999999998999999999987349155899999999999 No 74 >KOG0550 consensus Probab=98.22 E-value=4.7e-05 Score=47.54 Aligned_cols=176 Identities=16% Similarity=0.095 Sum_probs=79.8 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 9999998098999999999998530477---3026899877887765455567999998875401121101355544344 Q gi|254781174|r 63 KAVLFLKEQNFSKAYEYFNQCSRDFPFA---GVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGM 139 (271) Q Consensus 63 ~a~~~~~~~~y~~A~~~f~~i~~~~P~s---~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~ 139 (271) .|...+++.+-..+.+.|..+...-|.. |--..+.+.-+.|+-..++++.|+...-+.+++.|.+ .+|.|.+|. T Consensus 135 ~A~~~~~~~~~~~~anal~~~~~~~~s~s~~pac~~a~~lka~cl~~~~~~~~a~~ea~~ilkld~~n---~~al~vrg~ 211 (486) T KOG0550 135 EAEEKLKSKQAYKAANALPTLEKLAPSHSREPACFKAKLLKAECLAFLGDYDEAQSEAIDILKLDATN---AEALYVRGL 211 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCH---HHHHHHCCC T ss_conf 89988621016677653122200242346773256788741056543034316778888887516301---678886153 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH-------------H------------ Q ss_conf 455554431112345899999999999852001013456555555566666665-------------5------------ Q gi|254781174|r 140 SYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKE-------------V------------ 194 (271) Q Consensus 140 ~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e-------------~------------ 194 (271) |+|-. .....|+..|++-++.-|++.-+..+......|...-.+.+ . T Consensus 212 ~~yy~--------~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le~~k~~gN~~fk~G~y~~A~E~Yteal~idP~n~ 283 (486) T KOG0550 212 CLYYN--------DNADKAINHFQQALRLDPDHQKSKSASMMPKKLEVKKERGNDAFKNGNYRKAYECYTEALNIDPSNK 283 (486) T ss_pred CCCCC--------CCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCC T ss_conf 31023--------2068888887652026802566776764078999887505667634424678998877633793222 Q ss_pred -HHHHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf -566666-------5443223456899988652899811599999999999974987999999999 Q gi|254781174|r 195 -EIGRYY-------LKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 195 -~ia~~Y-------~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) ..+.+| .+.|.-..|+.-.++++.-- +..-+|+.+-+++|..++..++|.+.++. T Consensus 284 ~~naklY~nra~v~~rLgrl~eaisdc~~Al~iD---~syikall~ra~c~l~le~~e~AV~d~~~ 346 (486) T KOG0550 284 KTNAKLYGNRALVNIRLGRLREAISDCNEALKID---SSYIKALLRRANCHLALEKWEEAVEDYEK 346 (486) T ss_pred CHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 2468999876766420677165665446666347---98889999988988899899999999999 No 75 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.22 E-value=0.00012 Score=45.24 Aligned_cols=181 Identities=15% Similarity=0.135 Sum_probs=111.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999999999999999997226875333445677767368999999999998098999999999998530477 Q gi|254781174|r 11 IFEAWAYQLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFA 90 (271) Q Consensus 11 ~~~~~~~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s 90 (271) -++.|+-.-.+. +++.++|.+..++|.--.....+ .....+...|+......+.+.- +.+...++++..+|.+ T Consensus 13 ~ik~wwkeNGk~-li~gviLg~~~lfGW~ywq~~q~-----~q~~~AS~~Y~~~i~~~~ak~~-~~~~~~ekf~~~n~~t 85 (207) T COG2976 13 AIKDWWKENGKA-LIVGVILGLGGLFGWRYWQSHQV-----EQAQEASAQYQNAIKAVQAKKP-KSIAAAEKFVQANGKT 85 (207) T ss_pred HHHHHHHHCCCH-HHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHCCCC T ss_conf 999999977907-89999999999998899999878-----8888999999999999855770-2599999998615635 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 30268998778877654555679999988754011211013555443444555544311123458999999999998520 Q gi|254781174|r 91 GVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYT 170 (271) Q Consensus 91 ~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP 170 (271) .|+..+.+.+|..++..++++.|+..++..+..-.++..-+-+-.++|. +...++..++|+..+..... T Consensus 86 ~Ya~laaL~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLAr--------vq~q~~k~D~AL~~L~t~~~--- 154 (207) T COG2976 86 IYAVLAALELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLAR--------VQLQQKKADAALKTLDTIKE--- 154 (207) T ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHCCCC--- T ss_conf 8999999999999986342999999999987145257899999999999--------99986038889998723011--- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH Q ss_conf 010134565555555666666655566666544322345689998865289981 Q gi|254781174|r 171 NSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAE 224 (271) Q Consensus 171 ~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~ 224 (271) ..++. ..+-..|+.+..+|+-.+|+.+|...+...++.. T Consensus 155 -~~w~~--------------~~~elrGDill~kg~k~~Ar~ay~kAl~~~~s~~ 193 (207) T COG2976 155 -ESWAA--------------IVAELRGDILLAKGDKQEARAAYEKALESDASPA 193 (207) T ss_pred -CCHHH--------------HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH T ss_conf -10788--------------9999742699972865999999999987157748 No 76 >PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional Probab=98.22 E-value=0.00014 Score=44.79 Aligned_cols=129 Identities=11% Similarity=0.024 Sum_probs=94.4 Q ss_pred CHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999999985304773026899877887765455567999998875401121101355544344455554431112 Q gi|254781174|r 72 NFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYD 151 (271) Q Consensus 72 ~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d 151 (271) .-+..+..++.-+...|+. .+.-+++|.+|-..++|+.|+..|++.++.-|+++.+- +.| |.+.+. ..| T Consensus 62 ~~e~~~~~L~~rL~~~Pdd---~~gW~~LGr~y~~~~~~~~A~~Ay~kA~~L~p~~~~il-~~y--A~aL~~-----~~~ 130 (206) T PRK10370 62 TPEAQLQALQDKIRANPQN---SEQWALLGEYYLWRNAYDNALLAYRQALQLRGENAELY-AAL--ATVLYY-----QAG 130 (206) T ss_pred HHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHH-HHH--HHHHHH-----HCC T ss_conf 1799999999999839988---89999999999976898999999999997579987899-999--999998-----759 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 34589999999999985200101345655555556666666555666665443223456899988652899811599 Q gi|254781174|r 152 QRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEE 228 (271) Q Consensus 152 ~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~e 228 (271) +..+.++....++.+..-|+..-+ . .-+|---|.+++|..||..++.+++.-|-..-..+ T Consensus 131 ~~~t~~~~~lL~~AL~lDp~~~~A---L--------------~Lla~~AFe~~dY~~AI~~Wq~lL~~~~p~~~R~~ 190 (206) T PRK10370 131 QHMTPQTREMIDKALALDANEVTA---L--------------MLLASDAFMQADYAQAIELWQKLLDLNSPRVNRTQ 190 (206) T ss_pred CCCCHHHHHHHHHHHHHCCCCHHH---H--------------HHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 877689999999998749897889---9--------------99999999826499999999999845799988999 No 77 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=98.17 E-value=0.00041 Score=42.07 Aligned_cols=184 Identities=14% Similarity=0.184 Sum_probs=121.8 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 73689999999999980989999999999985304773026899877887765455567999998875401121101355 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) ..-.-++++.|..++..|-++.|.+.|..+.+. .++++.|.=.+..+|....+|.+||...+++.+.-|+.-.+.=| T Consensus 104 ~qr~lAl~qL~~Dym~aGl~DRAE~~f~~L~de---~efa~~AlqqLl~IYQ~treW~KAIe~A~~L~k~~~q~~~~eIA 180 (389) T COG2956 104 EQRLLALQQLGRDYMAAGLLDRAEDIFNQLVDE---GEFAEGALQQLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIA 180 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 899999999989999856466899999997346---34328999999999998368999999999999718852125899 Q ss_pred HHHHHHHHHHH----------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 54434445555----------------------------44311123458999999999998520010134565555555 Q gi|254781174|r 134 YYLVGMSYAQM----------------------------IRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVG 185 (271) Q Consensus 134 ~y~~a~~~~~~----------------------------~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~ 185 (271) +|-.-++-... ..++..+.+.+..|++.++.++++-| +|..++.. T Consensus 181 qfyCELAq~~~~~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~~Qn~--~yl~evl~----- 253 (389) T COG2956 181 QFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNP--EYLSEVLE----- 253 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCH--HHHHHHHH----- T ss_conf 99999999974031189999999999862820010354543888752666999999999998495--77999999----- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC Q ss_conf 666666655566666544322345689998865289981159999999999997498799999999997856999 Q gi|254781174|r 186 RNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQG 260 (271) Q Consensus 186 ~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s 260 (271) .+...|...|+....++.+...++.+++.. ++.+++.+.+. ..-.+.|..+......+-|+- T Consensus 254 ---------~L~~~Y~~lg~~~~~~~fL~~~~~~~~g~~-~~l~l~~l~e~---~eG~~~Aq~~l~~Ql~r~Pt~ 315 (389) T COG2956 254 ---------MLYECYAQLGKPAEGLNFLRRAMETNTGAD-AELMLADLIEL---QEGIDAAQAYLTRQLRRKPTM 315 (389) T ss_pred ---------HHHHHHHHHCCHHHHHHHHHHHHHCCCCCC-HHHHHHHHHHH---HHCHHHHHHHHHHHHHHCCCH T ss_conf ---------999999994787789999999986447851-99999999997---107799999999998629818 No 78 >PRK13184 pknD serine/threonine-protein kinase; Reviewed Probab=98.14 E-value=0.00011 Score=45.49 Aligned_cols=72 Identities=15% Similarity=0.184 Sum_probs=45.9 Q ss_pred HHHHHHHHHHHHHH-----CC--HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 89999999999980-----98--999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 57 QREVYEKAVLFLKE-----QN--FSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 57 ~~~lY~~a~~~~~~-----~~--y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) -++.|..|...+++ ++ +.+|+..|+.+ -+++-||.-.+..|.+|..+++|++-+..+.-.+++||+||. T Consensus 512 ~ea~fr~git~le~a~~~~~~~~~~~al~~f~~l----h~~~~apleylgkalvyq~~~~~~eeik~~~la~kry~~hp~ 587 (933) T PRK13184 512 YEAQFRLGITLLEKASEQGGDQEFTQALSEFSYL----HGGVAAPLEYLGKALVYQRLGEYNEEIKSLLLALKRYSQHPE 587 (933) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH----CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 7889986389999887604725799999999885----389887578888899998730078999999999975679987 Q ss_pred HHH Q ss_conf 135 Q gi|254781174|r 130 VDY 132 (271) Q Consensus 130 ~~~ 132 (271) ++. T Consensus 588 i~~ 590 (933) T PRK13184 588 IPR 590 (933) T ss_pred CHH T ss_conf 367 No 79 >KOG2376 consensus Probab=98.14 E-value=0.00049 Score=41.62 Aligned_cols=195 Identities=12% Similarity=0.087 Sum_probs=118.4 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 36899999999999809899999999999853047730268998778877654555679999988754011211013555 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) .++.+++.+-...++.++|++|+ +++..++.......-.|.-|||.|++++.++|+..++.. + +.. +... T Consensus 44 dd~~a~~cKvValIq~~ky~~AL----k~ikk~~~~~~~~~~~fEKAYc~Yrlnk~Dealk~~~~~-~--~~~---~~ll 113 (652) T KOG2376 44 DDEDAIRCKVVALIQLDKYEDAL----KLIKKNGALLVINSFFFEKAYCEYRLNKLDEALKTLKGL-D--RLD---DKLL 113 (652) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH----HHHHHCCHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHCC-C--CCC---HHHH T ss_conf 94766866676420366799999----999851114311113578888998705678899987503-5--566---1889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH-------------H--HHHHHHH Q ss_conf 443444555544311123458999999999998520010134565555555666666-------------6--5556666 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAA-------------K--EVEIGRY 199 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~-------------~--e~~ia~~ 199 (271) -+.|...|.+ +...+|++.|+.++...-+ ++-.+-+..+......+.. + -|..|-. T Consensus 114 ~L~AQvlYrl--------~~ydealdiY~~L~kn~~d-d~d~~~r~nl~a~~a~l~~~~~q~v~~v~e~syel~yN~Ac~ 184 (652) T KOG2376 114 ELRAQVLYRL--------ERYDEALDIYQHLAKNNSD-DQDEERRANLLAVAAALQVQLLQSVPEVPEDSYELLYNTACI 184 (652) T ss_pred HHHHHHHHHH--------HHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 9999999987--------5389999999999855870-578999978999887404779874558885429999999999 Q ss_pred HHHHHHHHHHHHHHHHH----HHHCCCCHHH------H--HHHHHHHHHHHHCCCHHHHHH-HHHHHHHHCCCCHHHHHH Q ss_conf 65443223456899988----6528998115------9--999999999997498799999-999997856999789999 Q gi|254781174|r 200 YLKRGEYVAAIPRFQLV----LANYSDAEHA------E--EAMARLVEAYVALALMDEARE-VVSLIQERYPQGYWARYV 266 (271) Q Consensus 200 Y~~~~~y~aA~~~~~~~----i~~yp~t~~~------~--eAl~~l~~~y~~lg~~d~A~~-~~~~l~~~yP~s~~~~~a 266 (271) +...|+|..|+.-++.. .+..-+.... + -....|+-++..+|...+|.+ +..++..+-+|+....-+ T Consensus 185 ~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~~i~~~~~D~~~~Av~ 264 (652) T KOG2376 185 LIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVDIIKRNPADEPSLAVA 264 (652) T ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 98503489999999999998887631466530668888778999999999994330878999999987268882278999 Q ss_pred HH Q ss_conf 98 Q gi|254781174|r 267 ET 268 (271) Q Consensus 267 ~~ 268 (271) .| T Consensus 265 ~N 266 (652) T KOG2376 265 VN 266 (652) T ss_pred HC T ss_conf 60 No 80 >KOG3081 consensus Probab=98.13 E-value=0.00047 Score=41.74 Aligned_cols=94 Identities=15% Similarity=0.169 Sum_probs=72.9 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 89999999999985200101345655555556666666555666665443223456899988652899811599999999 Q gi|254781174|r 155 TKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLV 234 (271) Q Consensus 155 ~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~ 234 (271) ...|.-.|+++-++||-++... ...|......++|.+|..-++..++.+++. +++|-.+. T Consensus 189 ~qdAfyifeE~s~k~~~T~~ll-----------------nG~Av~~l~~~~~eeAe~lL~eaL~kd~~d---petL~Nli 248 (299) T KOG3081 189 IQDAFYIFEELSEKTPPTPLLL-----------------NGQAVCHLQLGRYEEAESLLEEALDKDAKD---PETLANLI 248 (299) T ss_pred HHHHHHHHHHHHCCCCCCHHHH-----------------CCHHHHHHHHCCHHHHHHHHHHHHHCCCCC---HHHHHHHH T ss_conf 5468999999835359976887-----------------069999897169999999999998534787---89999999 Q ss_pred HHHHHCCCHHHHH-HHHHHHHHHCCCCHHHHHHHH Q ss_conf 9999749879999-999999785699978999998 Q gi|254781174|r 235 EAYVALALMDEAR-EVVSLIQERYPQGYWARYVET 268 (271) Q Consensus 235 ~~y~~lg~~d~A~-~~~~~l~~~yP~s~~~~~a~~ 268 (271) .+-..+|..+++. .....|...+|++.|.+.... T Consensus 249 v~a~~~Gkd~~~~~r~l~QLk~~~p~h~~vk~~~e 283 (299) T KOG3081 249 VLALHLGKDAEVTERNLSQLKLSHPEHPFVKHLNE 283 (299) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 99998189917788899998740876537787888 No 81 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=98.08 E-value=0.00034 Score=42.56 Aligned_cols=157 Identities=13% Similarity=0.124 Sum_probs=103.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999999809899999999999853047730268998778877654555679999988754011211013555443444 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMS 140 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~ 140 (271) +..+..+.-.|+=+.+......+...+|.+.....+ +|...+..|+|..|+..+.+....-|+... ++-..|.+ T Consensus 70 ~~~a~a~~~~G~a~~~l~~~~~~~~~~~~d~~ll~~---~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~---~~~~lgaa 143 (257) T COG5010 70 AKLATALYLRGDADSSLAVLQKSAIAYPKDRELLAA---QGKNQIRNGNFGEAVSVLRKAARLAPTDWE---AWNLLGAA 143 (257) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHCCCCCCHHHHHH---HHHHHHHHCCHHHHHHHHHHHHCCCCCCHH---HHHHHHHH T ss_conf 887777774055550078875540147454999999---999999804568899999998436999746---66677899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 55554431112345899999999999852001013456555555566666665556666654432234568999886528 Q gi|254781174|r 141 YAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANY 220 (271) Q Consensus 141 ~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~y 220 (271) +.+. +....|...+.+.++..|+.+-+.. .+|..|+-+|++..|...+......- T Consensus 144 ldq~--------Gr~~~Ar~ay~qAl~L~~~~p~~~n-----------------Nlgms~~L~gd~~~A~~lll~a~l~~ 198 (257) T COG5010 144 LDQL--------GRFDEARRAYRQALELAPNEPSIAN-----------------NLGMSLLLRGDLEDAETLLLPAYLSP 198 (257) T ss_pred HHHC--------CCHHHHHHHHHHHHHHCCCCCHHHH-----------------HHHHHHHHCCCHHHHHHHHHHHHHCC T ss_conf 9973--------6756778999999984258844666-----------------58999997277989999999998678 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 9981159999999999997498799999999 Q gi|254781174|r 221 SDAEHAEEAMARLVEAYVALALMDEAREVVS 251 (271) Q Consensus 221 p~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~ 251 (271) +..+.+ ...++.+--..|+.++|+++.. T Consensus 199 ~ad~~v---~~NLAl~~~~~g~~~~A~~i~~ 226 (257) T COG5010 199 AADSRV---RQNLALVVGLQGDFREAEDIAV 226 (257) T ss_pred CCCHHH---HHHHHHHHHHCCCHHHHHHHCC T ss_conf 876389---9989999840488677776242 No 82 >KOG1586 consensus Probab=98.08 E-value=0.00065 Score=40.90 Aligned_cols=198 Identities=17% Similarity=0.176 Sum_probs=122.1 Q ss_pred HHHHHHHHHHHHH-HCCHHHHHHHHHHHHHHCCCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 8999999999998-09899999999999853047730268--99877887765455567999998875401121101355 Q gi|254781174|r 57 QREVYEKAVLFLK-EQNFSKAYEYFNQCSRDFPFAGVARK--SLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 57 ~~~lY~~a~~~~~-~~~y~~A~~~f~~i~~~~P~s~~a~~--A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) +-++|.+|-..++ .++|+.|-..|.+.-..+-...-.++ ..+.-|.--|+..+..+|+..+++.|+.|.+- . T Consensus 33 Aadl~~~Aan~yklaK~w~~AG~aflkaA~~h~k~~skhDaat~YveA~~cykk~~~~eAv~cL~~aieIyt~~-----G 107 (288) T KOG1586 33 AAELYERAANMYKLAKNWSAAGDAFLKAADLHLKAGSKHDAATTYVEAANCYKKVDPEEAVNCLEKAIEIYTDM-----G 107 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH-----H T ss_conf 99999998789998886888789999999998860881137889999988762158488999999999999852-----2 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 54434445555443-11123458999999999998520010134565555555666666655566666544322345689 Q gi|254781174|r 134 YYLVGMSYAQMIRD-VPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPR 212 (271) Q Consensus 134 ~y~~a~~~~~~~~~-~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~ 212 (271) .|.+|--+...+.+ ...|+.+..+|+..|+.--+-|.+-+-+..|..=+ +.+|.+=-..++|..||.- T Consensus 108 rf~~aAk~~~~iaEiyEsdl~d~ekaI~~YE~Aae~yk~ees~ssANKC~-----------lKvA~yaa~leqY~~Ai~i 176 (288) T KOG1586 108 RFTMAAKHHIEIAEIYESDLQDFEKAIAHYEQAAEYYKGEESVSSANKCL-----------LKVAQYAAQLEQYSKAIDI 176 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHH T ss_conf 77787751113999986227779999999999999970015552688999-----------9999999999999999999 Q ss_pred HHHHHHHCCCCHH----HHHHHHHHHHHHHHCCCHHH---HHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 9988652899811----59999999999997498799---9999999978569997899999860 Q gi|254781174|r 213 FQLVLANYSDAEH----AEEAMARLVEAYVALALMDE---AREVVSLIQERYPQGYWARYVETLV 270 (271) Q Consensus 213 ~~~~i~~yp~t~~----~~eAl~~l~~~y~~lg~~d~---A~~~~~~l~~~yP~s~~~~~a~~~l 270 (271) |..+...--+++. +.+-++.-+-+|.-.++.-- |.+-++.+-.+|.+|+..+...+|+ T Consensus 177 yeqva~~s~~n~LLKys~KdyflkAgLChl~~~D~v~a~~ALeky~~~dP~F~dsREckflk~L~ 241 (288) T KOG1586 177 YEQVARSSLDNNLLKYSAKDYFLKAGLCHLCKADEVNAQRALEKYQELDPAFTDSRECKFLKDLL 241 (288) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 99999986036487767888999999886730207789999998875398646408879999999 No 83 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=98.04 E-value=0.00013 Score=45.00 Aligned_cols=104 Identities=13% Similarity=0.167 Sum_probs=56.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 26899877887765455567999998875401121101355544344455554431112345899999999999852001 Q gi|254781174|r 93 ARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNS 172 (271) Q Consensus 93 a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S 172 (271) .=+..+..|+..|..|+|++|...|.-+.-..|.+.+ -.+.+|.|... ++...+|+..|.......|++ T Consensus 17 ~L~~~Y~~AY~~y~~G~Y~~A~~~F~~L~~~~~~~~~---Y~~gLa~c~q~--------~~~Y~~A~~~Y~~a~~l~~~~ 85 (137) T TIGR02552 17 QLEAIYALAYNLYQQGRYDEALKLFQLLAAYDPWNAD---YWLGLAACCQM--------LKEYEEAIDAYALAAALDPDD 85 (137) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHH---HHHHHHHHHHH--------HHCHHHHHHHHHHHHHCCCCC T ss_conf 9999999999998536679999999999997430368---99999999998--------616789999999997527686 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCH Q ss_conf 0134565555555666666655566666544322345689998865289-981 Q gi|254781174|r 173 PYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYS-DAE 224 (271) Q Consensus 173 ~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp-~t~ 224 (271) |... +..|+.|+..|+...|.+.|+.+++.=+ +.+ T Consensus 86 P~p~-----------------~~~a~C~l~~g~~~~A~~aL~~a~~~~~~~~~ 121 (137) T TIGR02552 86 PRPY-----------------YHAAECYLALGEPESALKALDLAIEIAGGENP 121 (137) T ss_pred CCCH-----------------HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCC T ss_conf 9512-----------------69989998728978999999999998488996 No 84 >KOG3060 consensus Probab=98.04 E-value=0.00076 Score=40.51 Aligned_cols=150 Identities=17% Similarity=0.132 Sum_probs=99.5 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 36899999999999809899999999999853047730268998778877654555679999988754011211013555 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) .+.+..=-+|+.+-..|+|++|++.|+.++..-|...-..+-.+-+ .-.+|+-.+||....++++.||+.+. |+ T Consensus 84 ~S~RV~~lkam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAi---lka~GK~l~aIk~ln~YL~~F~~D~E---AW 157 (289) T KOG3060 84 GSKRVGKLKAMLLEATGNYKEAIEYYESLLEDDPTDTVIRKRKLAI---LKAQGKNLEAIKELNEYLDKFMNDQE---AW 157 (289) T ss_pred CCHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHHHHHHH---HHHCCCCHHHHHHHHHHHHHHCCCHH---HH T ss_conf 9706798999999986135459999999844686314899999999---99718857999999999998357699---99 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44344455554431112345899999999999852001013456555555566666665556666654432234568999 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQ 214 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~ 214 (271) .-.+..|.. .+...+|.--+++++-.-|-++. -..++ |+--+.+|. ..++..|...|. T Consensus 158 ~eLaeiY~~--------~~~f~kA~fClEE~ll~~P~n~l---~f~rl-------ae~~Yt~gg----~eN~~~arkyy~ 215 (289) T KOG3060 158 HELAEIYLS--------EGDFEKAAFCLEELLLIQPFNPL---YFQRL-------AEVLYTQGG----AENLELARKYYE 215 (289) T ss_pred HHHHHHHHH--------HHHHHHHHHHHHHHHHCCCCCHH---HHHHH-------HHHHHHHHH----HHHHHHHHHHHH T ss_conf 999999976--------76899999999999975987189---99999-------999999841----888999999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHH Q ss_conf 886528998115999999999 Q gi|254781174|r 215 LVLANYSDAEHAEEAMARLVE 235 (271) Q Consensus 215 ~~i~~yp~t~~~~eAl~~l~~ 235 (271) ..++.-|... .+++-+-- T Consensus 216 ~alkl~~~~~---ral~GI~l 233 (289) T KOG3060 216 RALKLNPKNL---RALFGIYL 233 (289) T ss_pred HHHHHCHHHH---HHHHHHHH T ss_conf 9987274738---99999999 No 85 >KOG3785 consensus Probab=98.04 E-value=0.00078 Score=40.45 Aligned_cols=65 Identities=17% Similarity=0.285 Sum_probs=30.3 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 99999998098999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) ..|--.+-+-.|.+||+.|.+|....|. +.. --..+|.|||+++-|+-+-..++.+++.+|+|+- T Consensus 156 SLAsvhYmR~HYQeAIdvYkrvL~dn~e--y~a-lNVy~ALCyyKlDYydvsqevl~vYL~q~pdSti 220 (557) T KOG3785 156 SLASVHYMRMHYQEAIDVYKRVLQDNPE--YIA-LNVYMALCYYKLDYYDVSQEVLKVYLRQFPDSTI 220 (557) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHH--HHH-HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHH T ss_conf 6777799999899999999999704863--645-6788999998556166689999999996899588 No 86 >PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional Probab=98.01 E-value=0.00032 Score=42.70 Aligned_cols=111 Identities=15% Similarity=0.074 Sum_probs=47.4 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99887540112110135554434445555443111234589999999999985200101345655555556666666555 Q gi|254781174|r 116 LGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVE 195 (271) Q Consensus 116 ~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ 195 (271) .++.=++..|++. ..+++.|.+|... +....|...|++.++.-|+++-+.-+ +. T Consensus 69 ~L~~rL~~~Pdd~---~gW~~LGr~y~~~--------~~~~~A~~Ay~kA~~L~p~~~~il~~---------------yA 122 (206) T PRK10370 69 ALQDKIRANPQNS---EQWALLGEYYLWR--------NAYDNALLAYRQALQLRGENAELYAA---------------LA 122 (206) T ss_pred HHHHHHHHCCCCH---HHHHHHHHHHHHC--------CCHHHHHHHHHHHHHHCCCCHHHHHH---------------HH T ss_conf 9999998399888---9999999999976--------89899999999999757998789999---------------99 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 6666654432-23456899988652899811599999999999974987999999999978 Q gi|254781174|r 196 IGRYYLKRGE-YVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQE 255 (271) Q Consensus 196 ia~~Y~~~~~-y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~ 255 (271) .+.+|-..+. -..+..-++.++..-|+. ..|+..+|-..+..|++.+|..+++.+.. T Consensus 123 ~aL~~~~~~~~t~~~~~lL~~AL~lDp~~---~~AL~Lla~~AFe~~dY~~AI~~Wq~lL~ 180 (206) T PRK10370 123 TVLYYQAGQHMTPQTREMIDKALALDANE---VTALMLLASDAFMQADYAQAIELWQKLLD 180 (206) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99998759877689999999998749897---88999999999982649999999999984 No 87 >KOG2053 consensus Probab=97.99 E-value=0.00016 Score=44.42 Aligned_cols=76 Identities=17% Similarity=0.141 Sum_probs=48.8 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +-+.--+|...++.|++++|...++.+....|.+... +=-+-.+|-.++++++|...|++++..||+ ...-+.+|+ T Consensus 43 ~~a~vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~t---Lq~l~~~y~d~~~~d~~~~~Ye~~~~~~P~-eell~~lFm 118 (932) T KOG2053 43 LYAKVLKALSLFRLGKGDEALKLLEALYGLKGTDDLT---LQFLQNVYRDLGKLDEAVHLYERANQKYPS-EELLYHLFM 118 (932) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-HHHHHHHHH T ss_conf 7999999999998158646788875130479985689---999999999874256899999998710884-889999999 No 88 >COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones] Probab=97.98 E-value=0.00084 Score=40.26 Aligned_cols=130 Identities=10% Similarity=0.037 Sum_probs=100.1 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 80989999999999985304773026899877887765455567999998875401121101355544344455554431 Q gi|254781174|r 69 KEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDV 148 (271) Q Consensus 69 ~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~ 148 (271) ...+.+.-+..++..+...|++. +.-.++|.+|..+++++.|...|.+-++.-|+++.+- -..|...+.+ T Consensus 134 ~~~~~~~l~a~Le~~L~~nP~d~---egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~---~g~aeaL~~~---- 203 (287) T COG4235 134 AEQEMEALIARLETHLQQNPGDA---EGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEIL---LGLAEALYYQ---- 203 (287) T ss_pred CCCCHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHH---HHHHHHHHHH---- T ss_conf 73359999999999998498872---4499999999984443479999999987379988999---9999999995---- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CHHH Q ss_conf 11234589999999999985200101345655555556666666555666665443223456899988652899-8115 Q gi|254781174|r 149 PYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSD-AEHA 226 (271) Q Consensus 149 ~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~-t~~~ 226 (271) -|+..+.++...|++.+..-|+... |. +.+|.-++..|+|..|+..++.+++.-|. .+.. T Consensus 204 -a~~~~ta~a~~ll~~al~~D~~~ir---al--------------~lLA~~afe~g~~~~A~~~Wq~lL~~lp~~~~rr 264 (287) T COG4235 204 -AGQQMTAKARALLRQALALDPANIR---AL--------------SLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRR 264 (287) T ss_pred -CCCCCCHHHHHHHHHHHHCCCCCHH---HH--------------HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHH T ss_conf -3884449999999999843973288---99--------------9999999981459999999999985189998237 No 89 >KOG4234 consensus Probab=97.97 E-value=0.00026 Score=43.23 Aligned_cols=115 Identities=14% Similarity=0.156 Sum_probs=88.9 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 899999999999809899999999999853047730268998--778877654555679999988754011211013555 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLL--MSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~--~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) ...+=..|...|.+|+|.+|...|...+...|..+.-.++.+ +.|-|..+++++..||..-.+.|.+.|.... |+ T Consensus 95 ad~lK~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~k---Al 171 (271) T KOG4234 95 ADSLKKEGNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEK---AL 171 (271) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH---HH T ss_conf 888988777765134077899999999986866308889999720588998756689899998765750840679---99 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 443444555544311123458999999999998520010134565555 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYV 182 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l 182 (271) -++|..|-+. ....+|++.+..++..-|.-.-+..+..++ T Consensus 172 ~RRAeayek~--------ek~eealeDyKki~E~dPs~~ear~~i~rl 211 (271) T KOG4234 172 ERRAEAYEKM--------EKYEEALEDYKKILESDPSRREAREAIARL 211 (271) T ss_pred HHHHHHHHHH--------HHHHHHHHHHHHHHHHCCCHHHHHHHHHHC T ss_conf 9999998855--------517888999999987182068999999852 No 90 >KOG1129 consensus Probab=97.95 E-value=0.0011 Score=39.53 Aligned_cols=183 Identities=17% Similarity=0.165 Sum_probs=109.2 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 99999998098999999999998530477302689987788776545556799999887540112110135554434445 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSY 141 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~ 141 (271) +.|..++.-|-+.+|.+.|+.-..++|- ++..+.++.+|-..+++..|+..+..-++.+|.. +-|+.|+.- T Consensus 228 Q~gkCylrLgm~r~AekqlqssL~q~~~----~dTfllLskvY~ridQP~~AL~~~~~gld~fP~~-----VT~l~g~AR 298 (478) T KOG1129 228 QMGKCYLRLGMPRRAEKQLQSSLTQFPH----PDTFLLLSKVYQRIDQPERALLVIGEGLDSFPFD-----VTYLLGQAR 298 (478) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHCCCC----HHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCH-----HHHHHHHHH T ss_conf 9989999816826668999987643776----1699999999987235889999986465238832-----566663699 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH-----------HHHHHHHHHHHHHHHH----HH--HHHHHHHHHHH Q ss_conf 5554431112345899999999999852001013-----------4565555555666666----65--55666665443 Q gi|254781174|r 142 AQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYV-----------KGARFYVTVGRNQLAA----KE--VEIGRYYLKRG 204 (271) Q Consensus 142 ~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya-----------~~A~~~l~~~~~~La~----~e--~~ia~~Y~~~~ 204 (271) .. -..+..++|.+.++.+++..|+.--+ ..-..-+.+-+..|.- .| ..||-+-+--+ T Consensus 299 i~------eam~~~~~a~~lYk~vlk~~~~nvEaiAcia~~yfY~~~PE~AlryYRRiLqmG~~speLf~NigLCC~yaq 372 (478) T KOG1129 299 IH------EAMEQQEDALQLYKLVLKLHPINVEAIACIAVGYFYDNNPEMALRYYRRILQMGAQSPELFCNIGLCCLYAQ 372 (478) T ss_pred HH------HHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC T ss_conf 99------999707889999999984087650035665530013898689999999999832777677765888987540 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 2234568999886528998115999999999999749879999999999785699 Q gi|254781174|r 205 EYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ 259 (271) Q Consensus 205 ~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~ 259 (271) .++=++..|+..+..-.+...+.+.+|.++.+-.-.|+...|+.-+.+-..+=|+ T Consensus 373 Q~D~~L~sf~RAlstat~~~~aaDvWYNlg~vaV~iGD~nlA~rcfrlaL~~d~~ 427 (478) T KOG1129 373 QIDLVLPSFQRALSTATQPGQAADVWYNLGFVAVTIGDFNLAKRCFRLALTSDAQ 427 (478) T ss_pred CHHHHHHHHHHHHHHCCCCCHHHHHHHCCCEEEEECCCHHHHHHHHHHHHCCCCC T ss_conf 0303178899998633686415666640011677034248888888987505852 No 91 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=97.87 E-value=0.0016 Score=38.67 Aligned_cols=194 Identities=10% Similarity=0.030 Sum_probs=129.5 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH--HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 99999999999997226875333445677767368999--9999999980989999999999985304773026899877 Q gi|254781174|r 23 ALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREV--YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMS 100 (271) Q Consensus 23 ~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~l--Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~l 100 (271) +..+.+..+.+.+.||.+...... .+....++.. +... +...+-..|...+-......|+.... ... T Consensus 4 ~~~~~~a~l~~~~~~c~~~~~~~~----~~~g~~~~a~~~~~~~---~~~~q~~~a~~al~~~~~~~p~d~~i----~~~ 72 (257) T COG5010 4 LGGAVLAALALSLAGCAGQAGQRE----MSTGASAKATGAPESS---LAMRQTQGAAAALGAAVLRNPEDLSI----AKL 72 (257) T ss_pred HHHHHHHHHHHHHCCCCCCCCCHH----HCCCCCHHCCCCHHHH---HHHHHHHHHHHHHHHHHHCCCCHHHH----HHH T ss_conf 877999999999742421157400----0022210011447899---99876668999999998068201778----887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 88776545556799999887540112110135554434445555443111234589999999999985200101345655 Q gi|254781174|r 101 AFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARF 180 (271) Q Consensus 101 a~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~ 180 (271) +-+++..|+-+.+..........+|..+.+.-+ .|...+. .++...|+..+++....-|+.. .. T Consensus 73 a~a~~~~G~a~~~l~~~~~~~~~~~~d~~ll~~---~gk~~~~--------~g~~~~A~~~~rkA~~l~p~d~-----~~ 136 (257) T COG5010 73 ATALYLRGDADSSLAVLQKSAIAYPKDRELLAA---QGKNQIR--------NGNFGEAVSVLRKAARLAPTDW-----EA 136 (257) T ss_pred HHHHHHCCCCCCHHHHHHHHHCCCCCCHHHHHH---HHHHHHH--------HCCHHHHHHHHHHHHCCCCCCH-----HH T ss_conf 777774055550078875540147454999999---9999998--------0456889999999843699974-----66 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 555556666666555666665443223456899988652899811599999999999974987999999999978569 Q gi|254781174|r 181 YVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYP 258 (271) Q Consensus 181 ~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP 258 (271) + ...|-.|.+.|+...|...|.+.++-.|+.+ ...-.|+-+|+--|+.+.|..........-| T Consensus 137 ~------------~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~p---~~~nNlgms~~L~gd~~~A~~lll~a~l~~~ 199 (257) T COG5010 137 W------------NLLGAALDQLGRFDEARRAYRQALELAPNEP---SIANNLGMSLLLRGDLEDAETLLLPAYLSPA 199 (257) T ss_pred H------------HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 6------------6778999973675677899999998425884---4666589999972779899999999986788 No 92 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=97.87 E-value=0.0016 Score=38.63 Aligned_cols=109 Identities=15% Similarity=0.123 Sum_probs=80.9 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) +.....+|...+-+|+|..|.+.+.+-.+. |+.---..+..|.+-+.+|+++.+-..+++..+..|++..+ T Consensus 84 A~~~t~~Gl~~l~EG~~~~AeK~l~k~a~~---~~~pllnyL~AArAA~~~g~~~~rd~yL~~A~e~~p~a~la------ 154 (398) T PRK10747 84 ARKQTEQALLKLAEGDYQQVEKLMAKNADH---AEQPVVNYLLAAEAAQQRGDEARANQHLERAAELAGNDTIP------ 154 (398) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCC---CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHH------ T ss_conf 999999999998777999999999863435---67748899999999998889899999999988858984889------ Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 344455554431112345899999999999852001013456 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGA 178 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A 178 (271) ++++-- ....+++....|......+....|+++++-.- T Consensus 155 v~lt~A----el~l~~~q~e~Ala~L~~L~~~~p~~~~VL~L 192 (398) T PRK10747 155 VEITRV----RLQLARNENHAARHGVDKLLEVTPRHPEVLRL 192 (398) T ss_pred HHHHHH----HHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 999999----99997799899999999998619999999999 No 93 >KOG0553 consensus Probab=97.86 E-value=0.00032 Score=42.67 Aligned_cols=118 Identities=21% Similarity=0.175 Sum_probs=86.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 89987788776545556799999887540112110135554-43444555544311123458999999999998520010 Q gi|254781174|r 95 KSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY-LVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSP 173 (271) Q Consensus 95 ~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y-~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ 173 (271) +.+=.-|.-..+.++|.+|+..|.+.|...|+++ ++| .+|-+|.+ .+....|++..+.-+..-|.+. T Consensus 82 E~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nA----VyycNRAAAy~~--------Lg~~~~AVkDce~Al~iDp~ys 149 (304) T KOG0553 82 ESLKNEGNKLMKNKDYQEAVDKYTEAIELDPTNA----VYYCNRAAAYSK--------LGEYEDAVKDCESALSIDPHYS 149 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC----HHHHHHHHHHHH--------HCCHHHHHHHHHHHHHCCHHHH T ss_conf 9999988999986259999999999984588762----677789999998--------6011789999999776290899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHH Q ss_conf 13456555555566666665556666654432234568999886528998115999999999999749879 Q gi|254781174|r 174 YVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMD 244 (271) Q Consensus 174 ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d 244 (271) .|-- .+|.-|+.+|+|..|+..|+..++-.|+.+.....|- .+-.+++... T Consensus 150 ---kay~--------------RLG~A~~~~gk~~~A~~aykKaLeldP~Ne~~K~nL~---~Ae~~l~e~~ 200 (304) T KOG0553 150 ---KAYG--------------RLGLAYLALGKYEEAIEAYKKALELDPDNESYKSNLK---IAEQKLNEPK 200 (304) T ss_pred ---HHHH--------------HHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHH---HHHHHHCCCC T ss_conf ---9999--------------9879998067379999999764316888489998899---9998724787 No 94 >KOG0550 consensus Probab=97.81 E-value=0.00024 Score=43.46 Aligned_cols=25 Identities=8% Similarity=0.125 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5666665443223456899988652 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQLVLAN 219 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~~i~~ 219 (271) ..|+.|...++|..|+..|+..++. T Consensus 326 ~ra~c~l~le~~e~AV~d~~~a~q~ 350 (486) T KOG0550 326 RRANCHLALEKWEEAVEDYEKAMQL 350 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9889888998999999999999863 No 95 >KOG4162 consensus Probab=97.79 E-value=0.0022 Score=37.85 Aligned_cols=182 Identities=17% Similarity=0.152 Sum_probs=73.0 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHC--CCCCHHHHHH Q ss_conf 99999999999809899999999999853047730268998778-87765455567999998875401--1211013555 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSA-FVQYSAGKYQQAASLGEEYITQY--PESKNVDYVY 134 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la-~~~y~~~~y~~A~~~~~~fi~~~--P~s~~~~~A~ 134 (271) +-+|+.|..+...|.-..|+...++-....|..+- +-..++.+ .|.-..+.+.+++.+..+.++.. -.+...+.++ T Consensus 358 e~w~~~als~saag~~s~Av~ll~~~~~~~~~ps~-~s~~Lmasklc~e~l~~~eegldYA~kai~~~~~~~~~l~~~~~ 436 (799) T KOG4162 358 ERWYQLALSYSAAGSDSKAVNLLRESLKKSEQPSD-ISVLLMASKLCIERLKLVEEGLDYAQKAISLLGGQRSHLKPRGY 436 (799) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCCC-CHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999877788714644888877764156568884-36999999998755431566788999999875544531115689 Q ss_pred HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44344455554431112---345899999999999852001013456555555566666665556666654432234568 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYD---QRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIP 211 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d---~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~ 211 (271) ...|++|-.+......+ -....++++.+++.++.-|+.+ ++. |+++-.|-.+++-.+|.. T Consensus 437 l~lGi~y~~~A~~a~~~seR~~~h~kslqale~av~~d~~dp---~~i--------------f~lalq~A~~R~l~sAl~ 499 (799) T KOG4162 437 LFLGIAYGFQARQANLKSERDALHKKSLQALEEAVQFDPTDP---LVI--------------FYLALQYAEQRQLTSALD 499 (799) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC---HHH--------------HHHHHHHHHHHHHHHHHH T ss_conf 999988776765278867899999999999999986189982---489--------------999999999976999999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 999886528998115999999999999749879999999999785699 Q gi|254781174|r 212 RFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ 259 (271) Q Consensus 212 ~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~ 259 (271) +.++.+.--+.. ...+++.++.+.-..+...+|..........+|+ T Consensus 500 ~~~eaL~l~~~~--~~~~whLLALvlSa~kr~~~Al~vvd~al~E~~~ 545 (799) T KOG4162 500 YAREALALNRGD--SAKAWHLLALVLSAQKRLKEALDVVDAALEEFGD 545 (799) T ss_pred HHHHHHHHCCCC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999998715776--4889999999996224167789999999997412 No 96 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=97.78 E-value=0.00086 Score=40.18 Aligned_cols=173 Identities=21% Similarity=0.208 Sum_probs=106.4 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 68999999999998098999999999998530477302689987788776545556799999887540112110135554 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) .|+..--.|..+...|+|+.|.+.|..+.+.-|.-+++ .++.|.+.|--|+|..|...+..|-..-|+.|...-=.| T Consensus 98 m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya---~lNRgi~~YY~gR~~LAq~d~~~fYQ~D~~DPfR~LWLY 174 (297) T COG4785 98 MPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYA---HLNRGIALYYGGRYKLAQDDLLAFYQDDPNDPFRSLWLY 174 (297) T ss_pred CHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHHH---HHCCCEEEEECCCCHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 58999899999986022057999866671047751077---760440234447515559999999844989807899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH---------HHHH-HHHHHHHCCHHHHHHHHHH-------HHHHHHHHHHHHHHHH Q ss_conf 434445555443111234589999---------9999-9998520010134565555-------5556666666555666 Q gi|254781174|r 136 LVGMSYAQMIRDVPYDQRATKLML---------QYMS-RIVERYTNSPYVKGARFYV-------TVGRNQLAAKEVEIGR 198 (271) Q Consensus 136 ~~a~~~~~~~~~~~~d~~~~~~A~---------~~f~-~~i~~yP~S~ya~~A~~~l-------~~~~~~La~~e~~ia~ 198 (271) +. ..+.|++.+...+ +.+. .++.-|-+.--++.+..++ ...-+.|.+-.|++|+ T Consensus 175 l~---------E~k~dP~~A~tnL~qR~~~~d~e~WG~~iV~~yLgkiS~e~l~~~~~a~a~~n~~~Ae~LTEtyFYL~K 245 (297) T COG4785 175 LN---------EQKLDPKQAKTNLKQRAEKSDKEQWGWNIVEFYLGKISEETLMERLKADATDNTSLAEHLTETYFYLGK 245 (297) T ss_pred HH---------HHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87---------610786899999999987056765518799999704039999999986312128999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHC Q ss_conf 6654432234568999886528-99811599999999999974 Q gi|254781174|r 199 YYLKRGEYVAAIPRFQLVLANY-SDAEHAEEAMARLVEAYVAL 240 (271) Q Consensus 199 ~Y~~~~~y~aA~~~~~~~i~~y-p~t~~~~eAl~~l~~~y~~l 240 (271) .|...|+-..|..-|+-.+.+- -+.....-|++-|+..+... T Consensus 246 ~~l~~G~~~~A~~LfKLaiannVynfVE~RyA~~EL~~l~q~~ 288 (297) T COG4785 246 YYLSLGDLDEATALFKLAVANNVYNFVEHRYALLELSLLGQDQ 288 (297) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 9861311788999999999887999999999999999842311 No 97 >KOG1129 consensus Probab=97.75 E-value=0.0025 Score=37.48 Aligned_cols=187 Identities=17% Similarity=0.182 Sum_probs=117.9 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 73689999999999980989999999999985304773026899877887765455567999998875401121101355 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) ...++..--.+..|-.-.+-..|+..|.+.++.+|+.. .-...+|.++..++++++|...|+..++.+|.+-.+ T Consensus 253 ~~~~dTfllLskvY~ridQP~~AL~~~~~gld~fP~~V---T~l~g~ARi~eam~~~~~a~~lYk~vlk~~~~nvEa--- 326 (478) T KOG1129 253 FPHPDTFLLLSKVYQRIDQPERALLVIGEGLDSFPFDV---TYLLGQARIHEAMEQQEDALQLYKLVLKLHPINVEA--- 326 (478) T ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEE--- T ss_conf 77616999999999872358899999864652388325---666636999999970788999999998408765003--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH--------------HHHH--HHHH---- Q ss_conf 5443444555544311123458999999999998520010134565555555--------------6666--6665---- Q gi|254781174|r 134 YYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVG--------------RNQL--AAKE---- 193 (271) Q Consensus 134 ~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~--------------~~~L--a~~e---- 193 (271) .--+|..||-. ++.+-|+..++++++.=-.|+ +--..|..| ...| +..+ T Consensus 327 iAcia~~yfY~--------~~PE~AlryYRRiLqmG~~sp---eLf~NigLCC~yaqQ~D~~L~sf~RAlstat~~~~aa 395 (478) T KOG1129 327 IACIAVGYFYD--------NNPEMALRYYRRILQMGAQSP---ELFCNIGLCCLYAQQIDLVLPSFQRALSTATQPGQAA 395 (478) T ss_pred EEEEEECCCCC--------CCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCHHH T ss_conf 56655300138--------986899999999998327776---7776588898754003031788999986336864156 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC Q ss_conf ---5566666544322345689998865289981159999999999997498799999999997856999 Q gi|254781174|r 194 ---VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQG 260 (271) Q Consensus 194 ---~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s 260 (271) +.+|..-...|+...|..-|+-.+.+-|+.. +|+-.|+..-.+-|+.++|+..+..-...-|+- T Consensus 396 DvWYNlg~vaV~iGD~nlA~rcfrlaL~~d~~h~---ealnNLavL~~r~G~i~~Arsll~~A~s~~P~m 462 (478) T KOG1129 396 DVWYNLGFVAVTIGDFNLAKRCFRLALTSDAQHG---EALNNLAVLAARSGDILGARSLLNAAKSVMPDM 462 (478) T ss_pred HHHHCCCEEEEECCCHHHHHHHHHHHHCCCCCHH---HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC T ss_conf 6664001167703424888888898750585238---888768998851276387899998765418530 No 98 >PRK13184 pknD serine/threonine-protein kinase; Reviewed Probab=97.73 E-value=0.00063 Score=40.96 Aligned_cols=102 Identities=14% Similarity=0.201 Sum_probs=74.1 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH-----H--HHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 9998098999999999998530477302689987788776545-----5--56799999887540112110135554434 Q gi|254781174|r 66 LFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAG-----K--YQQAASLGEEYITQYPESKNVDYVYYLVG 138 (271) Q Consensus 66 ~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~-----~--y~~A~~~~~~fi~~~P~s~~~~~A~y~~a 138 (271) ..+..+.|+.|+..|++|...||+-...-+|+|.+|.+...+- + +.+|+..|+.. | +++-+|--|..+| T Consensus 484 afl~~k~y~~a~~~yrri~~sfpgr~eg~ea~fr~git~le~a~~~~~~~~~~~al~~f~~l---h-~~~~apleylgka 559 (933) T PRK13184 484 AFLAEKLYDRAAIFYRRIRESFPGRKEGYEAQFRLGITLLEKASEQGGDQEFTQALSEFSYL---H-GGVAAPLEYLGKA 559 (933) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH---C-CCCCCCHHHHHHH T ss_conf 87736668999999999986289953007889986389999887604725799999999885---3-8988757888889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 44555544311123458999999999998520010134565 Q gi|254781174|r 139 MSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGAR 179 (271) Q Consensus 139 ~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~ 179 (271) +.|..+ +...+-++.+.-.+++||++|-.+-.+ T Consensus 560 lvyq~~--------~~~~eeik~~~la~kry~~hp~i~~l~ 592 (933) T PRK13184 560 LVYQRL--------GEYNEEIKSLLLALKRYSQHPEIPRLK 592 (933) T ss_pred HHHHHH--------HHHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 999873--------007899999999997567998736789 No 99 >KOG1174 consensus Probab=97.73 E-value=0.0027 Score=37.27 Aligned_cols=193 Identities=13% Similarity=0.131 Sum_probs=111.4 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH--HH Q ss_conf 3689999999999980989999999999985304773026899877887765455567999998875401121101--35 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNV--DY 132 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~--~~ 132 (271) +.-..+|..|.-.--+..+..+...+-+....+|... .-+..+|.++|..|++++|+..|++.....|..-.. .| T Consensus 196 wls~wika~Aq~~~~~hs~a~~t~l~le~~~~lr~Nv---hLl~~lak~~~~~Gdn~~a~~~Fe~~~~~dpy~i~~MD~Y 272 (564) T KOG1174 196 WLSKWIKALAQMFNFKHSDASQTFLMLHDNTTLRCNE---HLMMALGKCLYYNGDYFQAEDIFSSTLCANPDNVEAMDLY 272 (564) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCH---HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 8999999999997035303665888787654067608---8998875565341675689999988750781555568999 Q ss_pred HHHHHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 554434445555443---------------------11123458999999999998520010134565555555666666 Q gi|254781174|r 133 VYYLVGMSYAQMIRD---------------------VPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAA 191 (271) Q Consensus 133 A~y~~a~~~~~~~~~---------------------~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~ 191 (271) |+.+--.+-+++... ...+.+..+.|+..-++.|+.-|+|- .+. T Consensus 273 a~LL~~eg~~e~~~~L~~~Lf~~~~~ta~~wfV~~~~l~~~K~~~rAL~~~eK~I~~~~r~~---~al------------ 337 (564) T KOG1174 273 AVLLGQEGGCEQDSALMDYLFAKVKYTASHWFVHAQLLYDEKKFERALNFVEKCIDSEPRNH---EAL------------ 337 (564) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCCCC---HHH------------ T ss_conf 99998604876789999999852202324023431014577668999999988761585533---678------------ Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH---------------- Q ss_conf 6555666665443223456899988652899811599999999999974987999999999978---------------- Q gi|254781174|r 192 KEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQE---------------- 255 (271) Q Consensus 192 ~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~---------------- 255 (271) +.-|+.....|...+|++.|+.....-| ..-+..--|.++|...|...||.-....... T Consensus 338 --ilKG~lL~~~~R~~~A~IaFR~Aq~Lap---~rL~~Y~GL~hsYLA~~~~kEA~~~An~~~~~~~~sA~~LtL~g~~V 412 (564) T KOG1174 338 --ILKGRLLIALERHTQAVIAFRTAQMLAP---YRLEIYRGLFHSYLAQKRFKEANALANWTIRLFQNSARSLTLFGTLV 412 (564) T ss_pred --HHCCHHHHHCCCHHHHHHHHHHHHHCCH---HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHEE T ss_conf --8613998854562889999999874251---45789988999999614089999999999997653144676630001 Q ss_pred HCCCCHHHHHHHHHH Q ss_conf 569997899999860 Q gi|254781174|r 256 RYPQGYWARYVETLV 270 (271) Q Consensus 256 ~yP~s~~~~~a~~~l 270 (271) .+|++.-...|++++ T Consensus 413 ~~~dp~~rEKAKkf~ 427 (564) T KOG1174 413 LFPDPRMREKAKKFA 427 (564) T ss_pred ECCCCHHHHHHHHHH T ss_conf 156850679999999 No 100 >KOG4648 consensus Probab=97.63 E-value=0.00052 Score=41.45 Aligned_cols=111 Identities=12% Similarity=0.070 Sum_probs=77.4 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999999999809899999999999853047730268998778877654555679999988754011211013555443 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLV 137 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~ 137 (271) .++=++|..+|++|+|++||+-|..-+..+|..+.. ..+.|.+|++++.|..|...-+..+.+.-.- --||-++ T Consensus 98 SEiKE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~---~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y---~KAYSRR 171 (536) T KOG4648 98 SEIKERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVY---HINRALAYLKQKSFAQAEEDCEAAIALDKLY---VKAYSRR 171 (536) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHH T ss_conf 888876336764254135666653012457888642---0058899998887775430288987632889---9998777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 444555544311123458999999999998520010134565555 Q gi|254781174|r 138 GMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYV 182 (271) Q Consensus 138 a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l 182 (271) |-+-. ..++..+|.+.++.++..-|++.-.......| T Consensus 172 ~~AR~--------~Lg~~~EAKkD~E~vL~LEP~~~ELkK~~a~i 208 (536) T KOG4648 172 MQARE--------SLGNNMEAKKDCETVLALEPKNIELKKSLARI 208 (536) T ss_pred HHHHH--------HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 78999--------87757877777889986085617899999986 No 101 >KOG1156 consensus Probab=97.62 E-value=0.0039 Score=36.38 Aligned_cols=70 Identities=21% Similarity=0.245 Sum_probs=42.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH-----HHCCCCHHHHH Q ss_conf 55566666544322345689998865289981159999999999997498799999999997-----85699978999 Q gi|254781174|r 193 EVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQ-----ERYPQGYWARY 265 (271) Q Consensus 193 e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~-----~~yP~s~~~~~ 265 (271) -+.++..|...|++.-|.......+++ ||-..|-...-++++-..|+.++|..+..... .+|-+|+-+++ T Consensus 374 ~y~laqh~D~~g~~~~A~~yId~AIdH---TPTliEly~~KaRI~kH~G~l~eAa~~l~ea~elD~aDR~INsKcAKY 448 (700) T KOG1156 374 LYFLAQHYDKLGDYEVALEYIDLAIDH---TPTLIELYLVKARIFKHAGLLDEAAAWLDEAQELDTADRAINSKCAKY 448 (700) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHCC---CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 999998888705579999999887525---860999999999999844886899998999872452557887788999 No 102 >KOG0545 consensus Probab=97.60 E-value=0.0016 Score=38.70 Aligned_cols=124 Identities=11% Similarity=0.084 Sum_probs=78.3 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHH--------HCCCCC-CHH------HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999999980989999999999985--------304773-026------899877887765455567999998875 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSR--------DFPFAG-VAR------KSLLMSAFVQYSAGKYQQAASLGEEYI 121 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~--------~~P~s~-~a~------~A~~~la~~~y~~~~y~~A~~~~~~fi 121 (271) ...+=++|...|..|+|.+|+..|.+.+. .-|+++ |.+ --.++.+.|+...++|-+++..-..++ T Consensus 178 v~~l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~e~yevleh~seiL 257 (329) T KOG0545 178 VPVLHQEGNRLFKLGRYKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKEEYYEVLEHCSEIL 257 (329) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 28898750146662217789999999999999887404899818999987633778867778744878999999889998 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 401121101355544344455554431112345899999999999852001013456555555566666665 Q gi|254781174|r 122 TQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKE 193 (271) Q Consensus 122 ~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e 193 (271) ..+|+... |+|.+|-++.... +..+|...|..++...|. .++-....+..+.++.+++. T Consensus 258 ~~~~~nvK---A~frRakAhaa~W--------n~~eA~~D~~~vL~ldps--lasvVsrElr~le~r~~ek~ 316 (329) T KOG0545 258 RHHPGNVK---AYFRRAKAHAAVW--------NEAEAKADLQKVLELDPS--LASVVSRELRLLENRMAEKQ 316 (329) T ss_pred HCCCCHHH---HHHHHHHHHHHHC--------CHHHHHHHHHHHHHCCHH--HHHHHHHHHHHHHHHHHHHH T ss_conf 50885089---9999877877536--------888888899999861956--67899999998999998750 No 103 >KOG1125 consensus Probab=97.58 E-value=0.0046 Score=35.97 Aligned_cols=190 Identities=14% Similarity=0.091 Sum_probs=111.6 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH--- Q ss_conf 68999999999998098999999999998530477302689987788776545556799999887540112110135--- Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDY--- 132 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~--- 132 (271) -.++.-..|....++++=..||..+++..+.-|.. -+|++.+|.+|-..+.-.+|...+++.|..+|.-....- T Consensus 318 haeAW~~LG~~qaENE~E~~ai~AL~rcl~LdP~N---leaLmaLAVSytNeg~q~~Al~~L~~Wi~~~p~y~~l~~a~~ 394 (579) T KOG1125 318 HAEAWQKLGITQAENENEQNAISALRRCLELDPTN---LEALMALAVSYTNEGLQNQALKMLDKWIRNKPKYVHLVSAGE 394 (579) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCCC T ss_conf 78999996157650342088999999998469951---999999888886146688899999999984862111146676 Q ss_pred -------------HHHH-HHHHHHHHH---H-HHHHH-----------HHHHHHHHHHHHHHHHHHHCC----------- Q ss_conf -------------5544-344455554---4-31112-----------345899999999999852001----------- Q gi|254781174|r 133 -------------VYYL-VGMSYAQMI---R-DVPYD-----------QRATKLMLQYMSRIVERYTNS----------- 172 (271) Q Consensus 133 -------------A~y~-~a~~~~~~~---~-~~~~d-----------~~~~~~A~~~f~~~i~~yP~S----------- 172 (271) ..+. +...+.+.. + .++-| -+..++|++.|+..+..-|+. T Consensus 395 ~~~~~~~~s~~~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtL 474 (579) T KOG1125 395 NEDFENTKSFLDSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATL 474 (579) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 65666776778878999999999999985778888427762678974014788899999999704984177898866886 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH---H----HHHHHHHHHHHHH Q ss_conf ------01345655555556666666555666665443223456899988652899811---5----9999999999997 Q gi|254781174|r 173 ------PYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEH---A----EEAMARLVEAYVA 239 (271) Q Consensus 173 ------~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~---~----~eAl~~l~~~y~~ 239 (271) .-|..|-.+...++...-+--+.+|-.|...|.|..|+..|-.+|.--+.+.- . +..+-.|-.+... T Consensus 475 AN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNlG~ykEA~~hlL~AL~mq~ks~~~~~~~~~se~iw~tLR~als~ 554 (579) T KOG1125 475 ANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCMNLGAYKEAVKHLLEALSMQRKSRNHNKAPMASENIWQTLRLALSA 554 (579) T ss_pred CCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 07755399999999998418872653010145662056699999999999986531555444775257899999999997 Q ss_pred CCCHHHHHH Q ss_conf 498799999 Q gi|254781174|r 240 LALMDEARE 248 (271) Q Consensus 240 lg~~d~A~~ 248 (271) ++..|.+.+ T Consensus 555 ~~~~D~l~~ 563 (579) T KOG1125 555 MNRSDLLQE 563 (579) T ss_pred CCCCHHHHH T ss_conf 088167887 No 104 >PRK12370 invasion protein regulator; Provisional Probab=97.55 E-value=0.005 Score=35.73 Aligned_cols=187 Identities=12% Similarity=-0.028 Sum_probs=91.2 Q ss_pred CCCCCHHHHHHHHHHHHHH---CCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH---------HHHHHHHHHH Q ss_conf 7673689999999999980---989999999999985304773026899877887765455---------5679999988 Q gi|254781174|r 52 TDVRYQREVYEKAVLFLKE---QNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGK---------YQQAASLGEE 119 (271) Q Consensus 52 ~~~~~~~~lY~~a~~~~~~---~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~---------y~~A~~~~~~ 119 (271) .+...+-..|-.|..++.+ .+...|+..|++.+..-|+.- .|.-.+++||++... -..|....++ T Consensus 253 ~~sLDA~~~Yl~Gl~hl~rft~edn~~A~~LF~~Av~~dP~fA---~AyA~la~cy~q~a~l~~~d~~~~~~~A~~~Aer 329 (553) T PRK12370 253 LNSIDSTMVYLRGKHELNQYTPYSLQQALKLLTQCVNMSPNSI---APYCALAECYLSMAQMGIFDKQNAMIKAKEHAIK 329 (553) T ss_pred CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 9997489999978999973188779999999999873397744---6678789999976340368828899999999999 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH-------- Q ss_conf 754011211013555443444555544311123458999999999998520010134565555555666666-------- Q gi|254781174|r 120 YITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAA-------- 191 (271) Q Consensus 120 fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~-------- 191 (271) -++..|.+|. |.++.|.++-.. +...-+...|++-...-|||.-+.-+.-.......++++ T Consensus 330 avEldp~dp~---a~~~lg~~~~L~--------~d~~~a~~~f~rA~~L~Pnsa~~~Y~~~W~~~maGr~~ea~~~i~~a 398 (553) T PRK12370 330 ATELDHNNPQ---ALGLLGLINTIH--------SEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINEC 398 (553) T ss_pred HHHCCCCCHH---HHHHHHHHHHHC--------CCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 8743878878---889999998723--------55224479998764259984257999999999758689999999998 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf ----------6555666665443223456899988652899811599999999999974987999999999978 Q gi|254781174|r 192 ----------KEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQE 255 (271) Q Consensus 192 ----------~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~ 255 (271) .-......|+.. .-+.|+.-..+...++|.+-..-.+..-| ..--.|..++|++..+.+.. T Consensus 399 ~~LdP~~~~~~~~~l~~~~~~~-~~d~a~~l~~~~~~q~~~~~~~~~s~~~m--~~~~~g~~~~a~~~~~~~~~ 469 (553) T PRK12370 399 LKLDPTRAAAGITKLWITYYHT-GIDDAIRLGDELRSQHLQDNPILLSMQVM--FLSLKGKHELARKLTKEIST 469 (553) T ss_pred HHCCCCCCHHHHHHHHHHHHHC-CCCHHHHHHHHHHCCCCCCCHHHHHHHHH--HHHHCCCHHHHHHHHHHCCH T ss_conf 8359865202799999999842-31079999998625077878369999999--99733838999999985180 No 105 >KOG1130 consensus Probab=97.44 E-value=0.0051 Score=35.70 Aligned_cols=90 Identities=13% Similarity=0.105 Sum_probs=33.8 Q ss_pred CCCHHHHHHH-HHHHHHHCCHHHHHHHHHH--HHHHCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-- Q ss_conf 7368999999-9999980989999999999--9853047730-26899877887765455567999998875401121-- Q gi|254781174|r 54 VRYQREVYEK-AVLFLKEQNFSKAYEYFNQ--CSRDFPFAGV-ARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPES-- 127 (271) Q Consensus 54 ~~~~~~lY~~-a~~~~~~~~y~~A~~~f~~--i~~~~P~s~~-a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s-- 127 (271) ..+-.++|.+ |..+|--++|.+|++.-.- .+.+--+..+ ..++--++|.++-..|.|++|+..-.+-++.--.- T Consensus 51 l~tLSAIYsQLGNAyfyL~DY~kAl~yH~hDltlar~lgdklGEAKssgNLGNtlKv~G~fdeA~~cc~rhLd~areLgD 130 (639) T KOG1130 51 LSTLSAIYSQLGNAYFYLKDYEKALKYHTHDLTLARLLGDKLGEAKSSGNLGNTLKVKGAFDEALTCCFRHLDFARELGD 130 (639) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCCCCCHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 88999999974114446763999976422357799874442012200455201456302320688999988679999747 Q ss_pred -CHHHHHHHHHHHHHHH Q ss_conf -1013555443444555 Q gi|254781174|r 128 -KNVDYVYYLVGMSYAQ 143 (271) Q Consensus 128 -~~~~~A~y~~a~~~~~ 143 (271) -.-..|+|.+|..|-. T Consensus 131 rv~e~RAlYNlgnvYha 147 (639) T KOG1130 131 RVLESRALYNLGNVYHA 147 (639) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 77665777411111221 No 106 >COG0457 NrfG FOG: TPR repeat [General function prediction only] Probab=97.42 E-value=0.0073 Score=34.78 Aligned_cols=61 Identities=30% Similarity=0.384 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 5666665443223456899988652899811599999999999974987999999999978569 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYP 258 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP 258 (271) .++..|...+.+..|+..+...+...|+ .......++..+...|..+++...........| T Consensus 207 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (291) T COG0457 207 NLGLLYLKLGKYEEALEYYEKALELDPD---NAEALYNLALLLLELGRYEEALEALEKALELDP 267 (291) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCC---CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCH T ss_conf 9999998752599999999999872910---179999999998838869999999999998798 No 107 >KOG4162 consensus Probab=97.42 E-value=0.0073 Score=34.76 Aligned_cols=70 Identities=10% Similarity=0.153 Sum_probs=51.6 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 368999999999998098999999999998530477302689987788776545556799999887540112 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPE 126 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~ 126 (271) ..|..+|..|.++-..++...|.+.-++.....|++. ..+...+|.+.-..+++.+|+..-+..+..+|+ T Consensus 476 ~dp~~if~lalq~A~~R~l~sAl~~~~eaL~l~~~~~--~~~whLLALvlSa~kr~~~Al~vvd~al~E~~~ 545 (799) T KOG4162 476 TDPLVIFYLALQYAEQRQLTSALDYAREALALNRGDS--AKAWHLLALVLSAQKRLKEALDVVDAALEEFGD 545 (799) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9824899999999999769999999999987157764--889999999996224167789999999997412 No 108 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.41 E-value=0.0075 Score=34.71 Aligned_cols=152 Identities=10% Similarity=0.057 Sum_probs=103.8 Q ss_pred HHHHHHHHCCHH---HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 999999809899---99999999985304773026899877887765455567999998875401121101355544344 Q gi|254781174|r 63 KAVLFLKEQNFS---KAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGM 139 (271) Q Consensus 63 ~a~~~~~~~~y~---~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~ 139 (271) -|+.+.+..+-+ .|...|+.++..-- .++. +.+...++|+..+|.+..+..+-...|- T Consensus 37 fGW~ywq~~q~~q~~~AS~~Y~~~i~~~~------------------ak~~-~~~~~~ekf~~~n~~t~Ya~laaL~lAk 97 (207) T COG2976 37 FGWRYWQSHQVEQAQEASAQYQNAIKAVQ------------------AKKP-KSIAAAEKFVQANGKTIYAVLAALELAK 97 (207) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------CCCC-HHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 98899999878888899999999999985------------------5770-2599999998615635899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 45555443111234589999999999985200101345655555556666666555666665443223456899988652 Q gi|254781174|r 140 SYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLAN 219 (271) Q Consensus 140 ~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~ 219 (271) .+. |.+.+.+|...++..+..--++.+-.-+. +.+|+.-+..|+++.|.+.+..+.+ T Consensus 98 ~~v--------e~~~~d~A~aqL~~~l~~t~De~lk~l~~--------------lRLArvq~q~~k~D~AL~~L~t~~~- 154 (207) T COG2976 98 AEV--------EANNLDKAEAQLKQALAQTKDENLKALAA--------------LRLARVQLQQKKADAALKTLDTIKE- 154 (207) T ss_pred HHH--------HHCCHHHHHHHHHHHHCCCHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHCCCC- T ss_conf 998--------63429999999999871452578999999--------------9999999986038889998723011- Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 8998115999999999999749879999999999785699 Q gi|254781174|r 220 YSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ 259 (271) Q Consensus 220 yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~ 259 (271) ..++.....+.|+++...|+.++|+..++.-....++ T Consensus 155 ---~~w~~~~~elrGDill~kg~k~~Ar~ay~kAl~~~~s 191 (207) T COG2976 155 ---ESWAAIVAELRGDILLAKGDKQEARAAYEKALESDAS 191 (207) T ss_pred ---CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf ---1078899997426999728659999999999871577 No 109 >pfam09295 ChAPs ChAPs (Chs5p-Arf1p-binding proteins). ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialized cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they form also interactions with each other. Probab=97.40 E-value=0.003 Score=37.04 Aligned_cols=109 Identities=17% Similarity=0.149 Sum_probs=74.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 45556799999887540112110135554434445555443111234589999999999985200101345655555556 Q gi|254781174|r 107 AGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGR 186 (271) Q Consensus 107 ~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~ 186 (271) .+.|+.++..++++...+|. +...++..+... ..-.+|+....+-+..-|.|.- T Consensus 182 t~~~~~~~~~le~L~~~~p~------v~~llakv~~~~--------~~E~~av~~i~~~L~~~p~d~~------------ 235 (395) T pfam09295 182 TKRYDRALNLLEKLREEEPE------VVSLLARVLLLM--------DEEIDAVKLMHDALQENPMDYE------------ 235 (395) T ss_pred CCCHHHHHHHHHHHHHCCCH------HHHHHHHHHHHC--------CCHHHHHHHHHHHHHCCCCCHH------------ T ss_conf 44389899999998751844------999999999965--------6089999999999861887678------------ Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 666666555666665443223456899988652899811599999999999974987999999 Q gi|254781174|r 187 NQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREV 249 (271) Q Consensus 187 ~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~ 249 (271) .| ..-++|...+++|+-|+.--+..+.--|+.- +.++.|+++|..+|+.+.|--. T Consensus 236 -LL----~~Qa~FLl~K~~yelAL~~A~~av~~~P~~F---~~W~~La~~Yi~l~d~e~ALl~ 290 (395) T pfam09295 236 -LL----CLQAEFLLSKKKDELALECAKRAVNSAPSEF---KTWALLVKVYIKLEDWENALLT 290 (395) T ss_pred -HH----HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHHHHHHHHHH T ss_conf -99----9999999817988999999999873178661---8899999999986329999999 No 110 >COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones] Probab=97.33 E-value=0.0096 Score=34.09 Aligned_cols=164 Identities=16% Similarity=0.148 Sum_probs=110.8 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 68999999999998098999999999998530477302689987788776545556799999887540112110135554 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) ..+..+.+|...++.|++.+|...|..+...-|.+ .++.+.+|.|+...|+.+.|...+... |.....+.+.- T Consensus 133 ~~e~~~~~~~~~~~~e~~~~a~~~~~~al~~~~~~---~~~~~~la~~~l~~g~~e~A~~iL~~l----P~~~~~~~~~~ 205 (304) T COG3118 133 EEEEALAEAKELIEAEDFGEAAPLLKQALQAAPEN---SEAKLLLAECLLAAGDVEAAQAILAAL----PLQAQDKAAHG 205 (304) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC---CHHHHHHHHHHHHCCCHHHHHHHHHHC----CCCCHHHHHHH T ss_conf 77888987666541654455899999999858456---637778999999818858999999728----50010457888 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 434-4455554431112345899999999999852001013456555555566666665556666654432234568999 Q gi|254781174|r 136 LVG-MSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQ 214 (271) Q Consensus 136 ~~a-~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~ 214 (271) +.+ +....+..+.. -+..++.-+..-|+.. ++. +..|.-|.-.|++++|..++- T Consensus 206 l~a~i~ll~qaa~~~--------~~~~l~~~~aadPdd~---~aa--------------~~lA~~~~~~g~~e~Ale~Ll 260 (304) T COG3118 206 LQAQIELLEQAAATP--------EIQDLQRRLAADPDDV---EAA--------------LALADQLHLVGRNEAALEHLL 260 (304) T ss_pred HHHHHHHHHHHHCCC--------CHHHHHHHHHHCCCCH---HHH--------------HHHHHHHHHCCCHHHHHHHHH T ss_conf 999999999974599--------7789999987099988---999--------------999999997388799999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHH-HHHHH Q ss_conf 886528998115999999999999749879999-99999 Q gi|254781174|r 215 LVLANYSDAEHAEEAMARLVEAYVALALMDEAR-EVVSL 252 (271) Q Consensus 215 ~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~-~~~~~ 252 (271) .++..-- +-...++..++.+.+..+|..|.+- ++... T Consensus 261 ~~l~~d~-~~~d~~~Rk~lle~f~~~g~~Dp~~~~~RRk 298 (304) T COG3118 261 ALLRRDR-GFEDGEARKTLLELFEAFGPADPLVLAYRRK 298 (304) T ss_pred HHHHHCC-CCCCCHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 9987503-4437689999999999638898789999999 No 111 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=97.32 E-value=0.0089 Score=34.27 Aligned_cols=122 Identities=16% Similarity=0.182 Sum_probs=49.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 98778877654555679999988754011211013555443444555544311123458999999999998520010134 Q gi|254781174|r 97 LLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVK 176 (271) Q Consensus 97 ~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~ 176 (271) .+.+|.+.-..|++.+|...|++-+. .+.++++..+.|+..-.. ..+..-.|...++++.+..|.- .. T Consensus 92 r~rLa~al~elGr~~EA~~hy~qals----G~fA~d~a~lLglA~Aqf------a~~~~A~a~~tLe~l~e~~pa~-r~- 159 (251) T COG4700 92 RYRLANALAELGRYHEAVPHYQQALS----GIFAHDAAMLLGLAQAQF------AIQEFAAAQQTLEDLMEYNPAF-RS- 159 (251) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC----CCCCCCHHHHHHHHHHHH------HHCCHHHHHHHHHHHHHCCCCC-CC- T ss_conf 99999999984105666889999854----656798899988898888------6165888888999986059765-89- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 56555555566666665556666654432234568999886528998115999999999999749879999 Q gi|254781174|r 177 GARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAR 247 (271) Q Consensus 177 ~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~ 247 (271) +...+..|+.|.-.|+|..|...|+..++-||+. ++.-+-++-.-++|..++|+ T Consensus 160 -------------pd~~Ll~aR~laa~g~~a~Aesafe~a~~~ypg~----~ar~~Y~e~La~qgr~~ea~ 213 (251) T COG4700 160 -------------PDGHLLFARTLAAQGKYADAESAFEVAISYYPGP----QARIYYAEMLAKQGRLREAN 213 (251) T ss_pred -------------CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCH----HHHHHHHHHHHHHCCHHHHH T ss_conf -------------9725899999984588055899999999858877----78989999999712216788 No 112 >PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional Probab=97.29 E-value=0.0063 Score=35.15 Aligned_cols=65 Identities=20% Similarity=0.123 Sum_probs=57.5 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 99999999809899999999999853047730268998778877654555679999988754011211 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESK 128 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~ 128 (271) .++|....++++|.+|+..++-+...-|++. .+...++++|-..|.|.+|...+++..+.+|... T Consensus 48 iDkAy~e~k~~~~~~a~re~~h~l~~vP~n~---~~~~~Laea~~h~g~~~eA~all~~~~~~~~gDA 112 (987) T PRK09782 48 LDKALKAQKNNDEATAIREFEYIHQQVPDNI---PLTLYLAEAYRHFGHDDRARLLLEDQLKRHPGDA 112 (987) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCHH T ss_conf 6799999871548889999999998688961---9999999999983893899999986134898257 No 113 >KOG0376 consensus Probab=97.28 E-value=0.0023 Score=37.73 Aligned_cols=120 Identities=16% Similarity=0.158 Sum_probs=94.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 89999999999980989999999999985304773--0268998778877654555679999988754011211013555 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAG--VARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~--~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) ++++=.+|.+.+..++|+.|+..+.+++...|+.. ++.+| .++.+.++|..|+.-+.+.|+.-|.- .-+| T Consensus 4 a~e~k~ean~~l~~~~fd~avdlysKaI~ldpnca~~~anRa-----~a~lK~e~~~~Al~Da~kaie~dP~~---~K~Y 75 (476) T KOG0376 4 AEELKNEANEALKDKVFDVAVDLYSKAIELDPNCAIYFANRA-----LAHLKVESFGGALHDALKAIELDPTY---IKAY 75 (476) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEECHHH-----HHHEEECHHHHHHHHHHHHHHCCCHH---HHEE T ss_conf 224146776541200199999999999852886324423021-----23432000336778777635408012---3303 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4434445555443111234589999999999985200101345655555556666666555 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVE 195 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ 195 (271) +++|.+.... .-..+|+..|+.....-|+-+ ++..++..|+....+.+|+ T Consensus 76 ~rrg~a~m~l--------~~~~~A~~~l~~~~~l~Pnd~---~~~r~~~Ec~~~vs~~~fe 125 (476) T KOG0376 76 VRRGTAVMAL--------GEFKKALLDLEKVKKLAPNDP---DATRKIDECNKIVSEEKFE 125 (476) T ss_pred EECCHHHHHH--------HHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHHHHHHH T ss_conf 4311778757--------779999999987662386768---8998789999999887641 No 114 >KOG1586 consensus Probab=97.23 E-value=0.012 Score=33.45 Aligned_cols=200 Identities=13% Similarity=0.050 Sum_probs=125.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC-C- Q ss_conf 99999999999999999999999722687533344567776736899999999999809899999999999853047-7- Q gi|254781174|r 13 EAWAYQLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPF-A- 90 (271) Q Consensus 13 ~~~~~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~-s- 90 (271) ..-.|||-|.--...- +++-..-|-.+.. ..-..-.-|-.|-..+++.+-.+|++-++..+.-|-. + T Consensus 40 Aan~yklaK~w~~AG~--aflkaA~~h~k~~---------skhDaat~YveA~~cykk~~~~eAv~cL~~aieIyt~~Gr 108 (288) T KOG1586 40 AANMYKLAKNWSAAGD--AFLKAADLHLKAG---------SKHDAATTYVEAANCYKKVDPEEAVNCLEKAIEIYTDMGR 108 (288) T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHHHCC---------CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 8789998886888789--9999999988608---------8113788999998876215848899999999999985227 Q ss_pred -CCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf -30268998778877654-5556799999887540112110135554434445555443111234589999999999985 Q gi|254781174|r 91 -GVARKSLLMSAFVQYSA-GKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVER 168 (271) Q Consensus 91 -~~a~~A~~~la~~~y~~-~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~ 168 (271) ..+.+-.+.+|+.|-.. .++.+||..|+..-+.|...+.+.-|- -|+.+ ..+..--.+...+|+..|+++... T Consensus 109 f~~aAk~~~~iaEiyEsdl~d~ekaI~~YE~Aae~yk~ees~ssAN----KC~lK-vA~yaa~leqY~~Ai~iyeqva~~ 183 (288) T KOG1586 109 FTMAAKHHIEIAEIYESDLQDFEKAIAHYEQAAEYYKGEESVSSAN----KCLLK-VAQYAAQLEQYSKAIDIYEQVARS 183 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7787751113999986227779999999999999970015552688----99999-999999999999999999999998 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH--HHHHHHHHH Q ss_conf 2001013456555555566666665556666654432234568999886528998115999--999999999 Q gi|254781174|r 169 YTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEA--MARLVEAYV 238 (271) Q Consensus 169 yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eA--l~~l~~~y~ 238 (271) --+++..+ -..-++-|..|..++-..+..++...++...+.+|...-..|. +.-|.++.. T Consensus 184 s~~n~LLK----------ys~KdyflkAgLChl~~~D~v~a~~ALeky~~~dP~F~dsREckflk~L~~aie 245 (288) T KOG1586 184 SLDNNLLK----------YSAKDYFLKAGLCHLCKADEVNAQRALEKYQELDPAFTDSRECKFLKDLLDAIE 245 (288) T ss_pred HCCCHHHH----------HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH T ss_conf 60364877----------678889999998867302077899999988753986464088799999999885 No 115 >COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones] Probab=97.20 E-value=0.012 Score=33.43 Aligned_cols=127 Identities=15% Similarity=0.084 Sum_probs=85.6 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 67999998875401121101355544344455554431112345899999999999852001013456555555566666 Q gi|254781174|r 111 QQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLA 190 (271) Q Consensus 111 ~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La 190 (271) +.-+..++.-+..+|++.. -+.+.|-.|.. ++....|...|.+-++.-|+++-..- T Consensus 139 ~~l~a~Le~~L~~nP~d~e---gW~~Lg~~ym~--------~~~~~~A~~AY~~A~rL~g~n~~~~~------------- 194 (287) T COG4235 139 EALIARLETHLQQNPGDAE---GWDLLGRAYMA--------LGRASDALLAYRNALRLAGDNPEILL------------- 194 (287) T ss_pred HHHHHHHHHHHHHCCCCCH---HHHHHHHHHHH--------HCCHHHHHHHHHHHHHHCCCCHHHHH------------- T ss_conf 9999999999984988724---49999999998--------44434799999999873799889999------------- Q ss_pred HHHHHHHH--HHHH-HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH-HHHCCCCHHHHHH Q ss_conf 66555666--6654-432234568999886528998115999999999999749879999999999-7856999789999 Q gi|254781174|r 191 AKEVEIGR--YYLK-RGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLI-QERYPQGYWARYV 266 (271) Q Consensus 191 ~~e~~ia~--~Y~~-~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l-~~~yP~s~~~~~a 266 (271) ..|. +|-. ...-..+..-++.++..=|+. ..|++.|+-.+.+.|++.+|...++.+ ...-|+..|.... T Consensus 195 ----g~aeaL~~~a~~~~ta~a~~ll~~al~~D~~~---iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~~~~rr~~i 267 (287) T COG4235 195 ----GLAEALYYQAGQQMTAKARALLRQALALDPAN---IRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRRSLI 267 (287) T ss_pred ----HHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf ----99999999538844499999999998439732---88999999999981459999999999985189998237899 Q ss_pred HH Q ss_conf 98 Q gi|254781174|r 267 ET 268 (271) Q Consensus 267 ~~ 268 (271) +. T Consensus 268 e~ 269 (287) T COG4235 268 ER 269 (287) T ss_pred HH T ss_conf 99 No 116 >KOG1130 consensus Probab=97.16 E-value=0.0045 Score=36.02 Aligned_cols=62 Identities=26% Similarity=0.131 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 6655566666544322345689998865---2899811599999999999974987999999999 Q gi|254781174|r 191 AKEVEIGRYYLKRGEYVAAIPRFQLVLA---NYSDAEHAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 191 ~~e~~ia~~Y~~~~~y~aA~~~~~~~i~---~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) ..-+.+|+.|+-.+++..||.+++.-+. .-.+-.--..|-+-|+.+|..+|.-++|..+... T Consensus 276 QscYSLgNtytll~e~~kAI~Yh~rHLaIAqeL~DriGe~RacwSLgna~~alg~h~kAl~fae~ 340 (639) T KOG1130 276 QSCYSLGNTYTLLKEVQKAITYHQRHLAIAQELEDRIGELRACWSLGNAFNALGEHRKALYFAEL 340 (639) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98987644899999999999999999999999887641489999987888764217888999999 No 117 >KOG1127 consensus Probab=97.03 E-value=0.02 Score=32.27 Aligned_cols=189 Identities=16% Similarity=0.088 Sum_probs=106.1 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH---------------------H-----HHHHHHHH Q ss_conf 999999980989999999999985304773026899877887765---------------------4-----55567999 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYS---------------------A-----GKYQQAAS 115 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~---------------------~-----~~y~~A~~ 115 (271) .+|....+.++|.+|......+...+-+. ++...+-+|+.+ . ++-..|.. T Consensus 404 akgl~~ie~~~y~Daa~tl~lv~~~s~nd----~slselswc~~~~~ek~mdva~~~~~e~~~~w~a~~~~rK~~~~al~ 479 (1238) T KOG1127 404 AKGLAPIEANVYTDAAITLDLVSSLSFND----DSLSELSWCLPRALEKMMDVALLLECENSEFWVALGCMRKNSALALH 479 (1238) T ss_pred HCCHHHHHHHHCHHHHHHHHHHHHHHCCC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 21131777742407999999998752376----26667667777767764249999998777999999996410788999 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------------CCHHHHHH Q ss_conf 9988754011211013555443444555544311123458999999999998520-----------------01013456 Q gi|254781174|r 116 LGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYT-----------------NSPYVKGA 178 (271) Q Consensus 116 ~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP-----------------~S~ya~~A 178 (271) .+-+-++.-| .+-.|.++.- ++.||.-....|.+.|+...+.-| +.+.+... T Consensus 480 ali~alrld~--------~~apaf~~LG---~iYrd~~Dm~RA~kCf~KAFeLDatdaeaaaa~adtyae~~~we~a~~I 548 (1238) T KOG1127 480 ALIRALRLDV--------SLAPAFAFLG---QIYRDSDDMKRAKKCFDKAFELDATDAEAAAASADTYAEESTWEEAFEI 548 (1238) T ss_pred HHHHHHHCCC--------CHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 9999985464--------2238999988---9998789899999999888637912344678888876315418899999 Q ss_pred HHHHHHHHHHHHH--HH--HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 5555555666666--65--5566666544322345689998865289981159999999999997498799999999997 Q gi|254781174|r 179 RFYVTVGRNQLAA--KE--VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQ 254 (271) Q Consensus 179 ~~~l~~~~~~La~--~e--~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~ 254 (271) .. ...+...++ ++ ..+|-+|.+.++...|+.-||..++.-|.+ -++..-+|++|..-|.+..|.+.+.... T Consensus 549 ~l--~~~qka~a~~~k~nW~~rG~yyLea~n~h~aV~~fQsALR~dPkD---~n~W~gLGeAY~~sGry~~AlKvF~kAs 623 (1238) T KOG1127 549 CL--RAAQKAPAFACKENWVQRGPYYLEAHNLHGAVCEFQSALRTDPKD---YNLWLGLGEAYPESGRYSHALKVFTKAS 623 (1238) T ss_pred HH--HHHHHCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHCCCEEHHHHHHHHHH T ss_conf 98--874215477877523540434368521555899999986389065---7889987788774376200777642457 Q ss_pred HHCCCCHHHHHHHHHH Q ss_conf 8569997899999860 Q gi|254781174|r 255 ERYPQGYWARYVETLV 270 (271) Q Consensus 255 ~~yP~s~~~~~a~~~l 270 (271) .-=|+|.+.++-+..+ T Consensus 624 ~LrP~s~y~~fk~A~~ 639 (1238) T KOG1127 624 LLRPLSKYGRFKEAVM 639 (1238) T ss_pred HCCCHHHHHHHHHHHH T ss_conf 4482768999999999 No 118 >COG0457 NrfG FOG: TPR repeat [General function prediction only] Probab=97.02 E-value=0.02 Score=32.25 Aligned_cols=175 Identities=21% Similarity=0.230 Sum_probs=126.6 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) .......+......+++..+...+...... ..........+..+......+++..++..+...+...|..... ... T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~---~~~ 134 (291) T COG0457 59 AGLLLLLALALLKLGRLEEALELLEKALEL-ELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLA---EAL 134 (291) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH---HHH T ss_conf 689999888999866379999989998877-5352359999999999999843999999999987336564269---999 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 344-4555544311123458999999999998520010134565555555666666655566666544322345689998 Q gi|254781174|r 137 VGM-SYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQL 215 (271) Q Consensus 137 ~a~-~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~ 215 (271) .+. ++ ...+....|...++..+...|. ....+......+..+...+.+..|+..+.. T Consensus 135 ~~~~~~--------~~~~~~~~a~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~ 192 (291) T COG0457 135 LALGAL--------YELGDYEEALELYEKALELDPE--------------LNELAEALLALGALLEALGRYEEALELLEK 192 (291) T ss_pred HHHHHH--------HHHHHHHHHHHHHHHHHHHCHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999--------9864099999999999874944--------------467899999999898870339999999999 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 86528998115999999999999749879999999999785699 Q gi|254781174|r 216 VLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ 259 (271) Q Consensus 216 ~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~ 259 (271) .+...|.. ...+...++..+...|..++|...+.......|+ T Consensus 193 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 234 (291) T COG0457 193 ALKLNPDD--DAEALLNLGLLYLKLGKYEEALEYYEKALELDPD 234 (291) T ss_pred HHHHCCCC--CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC T ss_conf 98628012--2799999999998752599999999999872910 No 119 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=96.96 E-value=0.022 Score=31.95 Aligned_cols=138 Identities=15% Similarity=0.119 Sum_probs=98.2 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 73689999999999980989999999999985304773026899--8778877654555679999988754011211013 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSL--LMSAFVQYSAGKYQQAASLGEEYITQYPESKNVD 131 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~--~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~ 131 (271) -.+....|..|....+-|++.+|...|++...- .++.++. +.+|.+.+..+++..|...+++..+.+|.-.. | T Consensus 86 ApTvqnr~rLa~al~elGr~~EA~~hy~qalsG----~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~-p 160 (251) T COG4700 86 APTVQNRYRLANALAELGRYHEAVPHYQQALSG----IFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRS-P 160 (251) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC----CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCC-C T ss_conf 613788999999999841056668899998546----56798899988898888616588888899998605976589-9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 55544344455554431112345899999999999852001013456555555566666665556666654432234568 Q gi|254781174|r 132 YVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIP 211 (271) Q Consensus 132 ~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~ 211 (271) +-..+.|..+-- ++....|...|+..+.-||+- .++- .-|.+..++|.-.+|.. T Consensus 161 d~~Ll~aR~laa--------~g~~a~Aesafe~a~~~ypg~----~ar~--------------~Y~e~La~qgr~~ea~a 214 (251) T COG4700 161 DGHLLFARTLAA--------QGKYADAESAFEVAISYYPGP----QARI--------------YYAEMLAKQGRLREANA 214 (251) T ss_pred CCHHHHHHHHHH--------CCCCHHHHHHHHHHHHHCCCH----HHHH--------------HHHHHHHHHCCHHHHHH T ss_conf 725899999984--------588055899999999858877----7898--------------99999997122167888 Q ss_pred HHHHHHHHCCC Q ss_conf 99988652899 Q gi|254781174|r 212 RFQLVLANYSD 222 (271) Q Consensus 212 ~~~~~i~~yp~ 222 (271) ++..++++-.. T Consensus 215 q~~~v~d~~~r 225 (251) T COG4700 215 QYVAVVDTAKR 225 (251) T ss_pred HHHHHHHHHHH T ss_conf 79999999976 No 120 >KOG0495 consensus Probab=96.93 E-value=0.024 Score=31.76 Aligned_cols=57 Identities=14% Similarity=0.183 Sum_probs=24.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 566666544322345689998865289981159999999999997498799999999997 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQ 254 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~ 254 (271) .+|+..+..+++..|..-|...|..-|+.. ++.-+.-.-+...|..+.-++++.... T Consensus 822 aia~lfw~e~k~~kar~Wf~Ravk~d~d~G---D~wa~fykfel~hG~eed~kev~~~c~ 878 (913) T KOG0495 822 AIAKLFWSEKKIEKAREWFERAVKKDPDNG---DAWAWFYKFELRHGTEEDQKEVLKKCE 878 (913) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCC---HHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 999999988889999999999970288641---299999999998377777999999876 No 121 >KOG4340 consensus Probab=96.88 E-value=0.022 Score=31.98 Aligned_cols=74 Identities=16% Similarity=0.121 Sum_probs=57.9 Q ss_pred CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCC Q ss_conf 767368999999999998098999999999998530477302689987788776545556799999887540-11211 Q gi|254781174|r 52 TDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQ-YPESK 128 (271) Q Consensus 52 ~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~-~P~s~ 128 (271) ....++..+-+.|-..++.|+|+.|++.|+....... +-+-.-+.+|.|+|+.++|+.|+....++|++ -.+|| T Consensus 139 p~en~Ad~~in~gCllykegqyEaAvqkFqaAlqvsG---yqpllAYniALaHy~~~qyasALk~iSEIieRG~r~HP 213 (459) T KOG4340 139 PSENEADGQINLGCLLYKEGQYEAAVQKFQAALQVSG---YQPLLAYNLALAHYSSRQYASALKHISEIIERGIRQHP 213 (459) T ss_pred CCCCCHHHHHCCHHHEECCCCHHHHHHHHHHHHHHCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 7888502330301200016548999999999873057---67278999999987312077899999999975542397 No 122 >KOG4234 consensus Probab=96.87 E-value=0.027 Score=31.48 Aligned_cols=142 Identities=18% Similarity=0.201 Sum_probs=94.8 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH--H Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355--5 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV--Y 134 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A--~ 134 (271) .++|...+..-+...+-+.|+..-..+ -.-|.-+|..|+|..|...|...+..-|..+.-..+ + T Consensus 72 eEeLmae~E~i~~deek~k~~~kad~l--------------K~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly 137 (271) T KOG4234 72 EEELMAEIEKIFSDEEKDKAIEKADSL--------------KKEGNELFKNGDYEEANSKYQEALESCPSTSTEERSILY 137 (271) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHH--------------HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 999999998743808888889888889--------------887777651340778999999999868663088899997 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44344455554431112345899999999999852001013456555555566666665556666654432234568999 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQ 214 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~ 214 (271) -.+|.|..+ ++....|+......|..-|.-..+.. ..|..|-+..+|..|+.-|+ T Consensus 138 ~Nraaa~iK--------l~k~e~aI~dcsKaiel~pty~kAl~-----------------RRAeayek~ek~eealeDyK 192 (271) T KOG4234 138 SNRAAALIK--------LRKWESAIEDCSKAIELNPTYEKALE-----------------RRAEAYEKMEKYEEALEDYK 192 (271) T ss_pred HHHHHHHHH--------HHHHHHHHHHHHHHHHCCCHHHHHHH-----------------HHHHHHHHHHHHHHHHHHHH T ss_conf 205889987--------56689899998765750840679999-----------------99999885551788899999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 88652899811599999999999 Q gi|254781174|r 215 LVLANYSDAEHAEEAMARLVEAY 237 (271) Q Consensus 215 ~~i~~yp~t~~~~eAl~~l~~~y 237 (271) .+++.-|.-..+.++..||-..- T Consensus 193 ki~E~dPs~~ear~~i~rl~~~i 215 (271) T KOG4234 193 KILESDPSRREAREAIARLPPKI 215 (271) T ss_pred HHHHHCCCHHHHHHHHHHCCHHH T ss_conf 99871820689999998527777 No 123 >KOG2053 consensus Probab=96.83 E-value=0.029 Score=31.27 Aligned_cols=97 Identities=19% Similarity=0.180 Sum_probs=76.5 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999999809899999999999853047730268998778877654555679999988754011211013555443444 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMS 140 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~ 140 (271) -+--...++.+++..|.....++..+||+++++ ...-|.+++++|++++|...++..-...|++...- T Consensus 13 ~rpi~d~ld~~qfkkal~~~~kllkk~Pn~~~a---~vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~tL--------- 80 (932) T KOG2053 13 LRPIYDLLDSSQFKKALAKLGKLLKKHPNALYA---KVLKALSLFRLGKGDEALKLLEALYGLKGTDDLTL--------- 80 (932) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHH--------- T ss_conf 768887752677999999999999878991799---99999999981586467888751304799856899--------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 5555443111234589999999999985200 Q gi|254781174|r 141 YAQMIRDVPYDQRATKLMLQYMSRIVERYTN 171 (271) Q Consensus 141 ~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~ 171 (271) ..+....+|++...+|...++..+..||+ T Consensus 81 --q~l~~~y~d~~~~d~~~~~Ye~~~~~~P~ 109 (932) T KOG2053 81 --QFLQNVYRDLGKLDEAVHLYERANQKYPS 109 (932) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf --99999999874256899999998710884 No 124 >cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure. Probab=96.80 E-value=0.03 Score=31.15 Aligned_cols=141 Identities=21% Similarity=0.251 Sum_probs=81.3 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHH---HH-------------------------------H Q ss_conf 7368999999999998098999999999998530477302689---98-------------------------------7 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKS---LL-------------------------------M 99 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A---~~-------------------------------~ 99 (271) -.+.+.++-++....-.|++..|...++++...||..-++-+. .| + T Consensus 40 ~t~RE~~hv~a~~~~~~g~~~~A~~~~~~~l~~~P~D~lalk~~~~~f~lg~~~g~~~~~~~~l~~~~p~~~~~~~~~gm 119 (355) T cd05804 40 ATERERAHVEALSAWIAGDLPKALALLEQLLDDYPRDLLALKLHLGAFGLGDFSGMRDHVARVLPLWAPENPDYWYLLGM 119 (355) T ss_pred CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHH T ss_conf 99899999999999987997899999999998698359999987674102541324056999987257348983699999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 78877654555679999988754011211013555443444555544311123458999999999998520010134565 Q gi|254781174|r 100 SAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGAR 179 (271) Q Consensus 100 la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~ 179 (271) .|+++-..|+|++|....++-++.+|++..+.-| ++=++++ ++...+.+..++.....+.++. T Consensus 120 ~AF~leE~g~~~~Ae~~a~~al~~np~d~wa~HA---vAHv~~~--------~g~~~eG~~~l~~~~~~w~~~~------ 182 (355) T cd05804 120 LAFGLEEAGQYDRAEEAARRALELNPDDAWAVHA---VAHVLEM--------QGRFKEGIAFMESWRDTWDCSS------ 182 (355) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHH---HHHHHHH--------CCCHHHHHHHHHHHHHCCCCCC------ T ss_conf 9999998588999999999999639999379999---9999998--------3887889999985175257788------ Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5555556666666-555666665443223456899988652 Q gi|254781174|r 180 FYVTVGRNQLAAK-EVEIGRYYLKRGEYVAAIPRFQLVLAN 219 (271) Q Consensus 180 ~~l~~~~~~La~~-e~~ia~~Y~~~~~y~aA~~~~~~~i~~ 219 (271) .+..| -.-.|-+|+.+|++++++..|...+.. T Consensus 183 --------~l~~H~wWH~AL~~Le~g~~~~~l~iyD~~i~~ 215 (355) T cd05804 183 --------MLRGHNWWHLALFYLERGDYEAALAIYDTHIAP 215 (355) T ss_pred --------CHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCC T ss_conf --------589899999999988778988999998887445 No 125 >KOG0551 consensus Probab=96.59 E-value=0.044 Score=30.24 Aligned_cols=85 Identities=14% Similarity=0.151 Sum_probs=71.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 89999999999980989999999999985-30477302689987788776545556799999887540112110135554 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSR-DFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~-~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) ++..=+.|..+|+.++|..|+..|.+-+. ..++.....--..+.|-|.+..|+|-.|+.--.+.+...|.|-. |+| T Consensus 81 Aen~KeeGN~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~K---a~~ 157 (390) T KOG0551 81 AENYKEEGNEYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLK---AYI 157 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH---HHH T ss_conf 99888876898777567689999988876418992378998842999999999899999999998614873044---343 Q ss_pred HHHHHHHHH Q ss_conf 434445555 Q gi|254781174|r 136 LVGMSYAQM 144 (271) Q Consensus 136 ~~a~~~~~~ 144 (271) +-|.|.++. T Consensus 158 R~Akc~~eL 166 (390) T KOG0551 158 RGAKCLLEL 166 (390) T ss_pred HHHHHHHHH T ss_conf 566899999 No 126 >pfam04781 DUF627 Protein of unknown function (DUF627). This family represents the N-terminal region of several plant proteins of unknown function. Probab=96.54 E-value=0.025 Score=31.66 Aligned_cols=73 Identities=14% Similarity=0.103 Sum_probs=52.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 7887765455567999998875401121101355544344455554---431112345899999999999852001 Q gi|254781174|r 100 SAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMI---RDVPYDQRATKLMLQYMSRIVERYTNS 172 (271) Q Consensus 100 la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~---~~~~~d~~~~~~A~~~f~~~i~~yP~S 172 (271) .|.-+|..|++.+|+...++++..|++++...-.+...|-.++++. .+++..+.....+++.|.+....-|++ T Consensus 2 ~A~~~~~~GnhiKAL~iied~is~h~~~~~~~~~h~~QG~if~~lA~~ten~d~K~~yL~~sve~~s~~~~Lsp~~ 77 (112) T pfam04781 2 LARDLFAKGDYIKALEIIEDSISDHGKDESAWLLHILQGTIFVKLAKKTENSDVKFTYLLGSVECFSEDVLLSPDA 77 (112) T ss_pred CHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCHHH T ss_conf 0589987657599999999999872676411999999859999999974582388999999999998875018789 No 127 >KOG1585 consensus Probab=96.47 E-value=0.052 Score=29.81 Aligned_cols=207 Identities=18% Similarity=0.183 Sum_probs=114.5 Q ss_pred CCCCHHHHHHHHHHHHH-HCCHHHHHHHHHHHHHHCCCCC---CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 67368999999999998-0989999999999985304773---0268998778877654555679999988754011211 Q gi|254781174|r 53 DVRYQREVYEKAVLFLK-EQNFSKAYEYFNQCSRDFPFAG---VARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESK 128 (271) Q Consensus 53 ~~~~~~~lY~~a~~~~~-~~~y~~A~~~f~~i~~~~P~s~---~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~ 128 (271) +-..+..+|.+|-..|+ .++|++|...+.+...-|-+.. .+.++.-..|-..-.+.++.++...+++....|-.+- T Consensus 26 d~dgaas~yekAAvafRnAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~eKAs~lY~E~G 105 (308) T KOG1585 26 DWDGAASLYEKAAVAFRNAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVECG 105 (308) T ss_pred CCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 70035789999999998513578889999999999883213878888899999999998872799999999999999709 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH--------------- Q ss_conf 01355544344455554431112345899999999999852001013456555555566666665--------------- Q gi|254781174|r 129 NVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKE--------------- 193 (271) Q Consensus 129 ~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e--------------- 193 (271) ..+.|---++-. -+.+.+ ...++|++.+++-..-+-+++....|-..+..+-+.|-+-+ T Consensus 106 spdtAAmaleKA-ak~len-----v~Pd~AlqlYqralavve~~dr~~ma~el~gk~sr~lVrl~kf~Eaa~a~lKe~~~ 179 (308) T KOG1585 106 SPDTAAMALEKA-AKALEN-----VKPDDALQLYQRALAVVEEDDRDQMAFELYGKCSRVLVRLEKFTEAATAFLKEGVA 179 (308) T ss_pred CCCHHHHHHHHH-HHHHHC-----CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 954699999999-988623-----88889999999999998505348788899987545733357756889999986359 Q ss_pred ------------HHHHH--HHHHHHHHHHHHHHHHHH--HHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC Q ss_conf ------------55666--665443223456899988--65289981159999999999997498799999999997856 Q gi|254781174|r 194 ------------VEIGR--YYLKRGEYVAAIPRFQLV--LANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERY 257 (271) Q Consensus 194 ------------~~ia~--~Y~~~~~y~aA~~~~~~~--i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~y 257 (271) ..+|. .|+-+++|..|-+-++.- +.+|- ++..-.++.+|...|. -|+.++..+....=.-++ T Consensus 180 ~~~~~~y~~~~k~~va~ilv~L~~~Dyv~aekc~r~~~qip~f~-~sed~r~lenLL~ayd-~gD~E~~~kvl~sp~~r~ 257 (308) T KOG1585 180 ADKCDAYNSQCKAYVAAILVYLYAHDYVQAEKCYRDCSQIPAFL-KSEDSRSLENLLTAYD-EGDIEEIKKVLSSPTVRN 257 (308) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCC-CHHHHHHHHHHHHHHC-CCCHHHHHHHHCCHHHHH T ss_conf 99986112389999999999720888999999835230276646-7477899999998753-378899999972717653 Q ss_pred CCCHHHHHHH Q ss_conf 9997899999 Q gi|254781174|r 258 PQGYWARYVE 267 (271) Q Consensus 258 P~s~~~~~a~ 267 (271) -|..+.+... T Consensus 258 MDneya~l~k 267 (308) T KOG1585 258 MDNEYAHLNK 267 (308) T ss_pred HHHHHHHHHH T ss_conf 0289999730 No 128 >KOG0495 consensus Probab=96.41 E-value=0.056 Score=29.60 Aligned_cols=99 Identities=12% Similarity=0.100 Sum_probs=51.3 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH--------------HHH Q ss_conf 989999999999985304773026899877887765455567999998875401121101355--------------544 Q gi|254781174|r 71 QNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV--------------YYL 136 (271) Q Consensus 71 ~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A--------------~y~ 136 (271) |.-+.-...|++.....|.+ +---+|.|..++..|+-..|...+....+.+|+|+.+=-| .-+ T Consensus 564 gt~Esl~Allqkav~~~pka---e~lwlM~ake~w~agdv~~ar~il~~af~~~pnseeiwlaavKle~en~e~eraR~l 640 (913) T KOG0495 564 GTRESLEALLQKAVEQCPKA---EILWLMYAKEKWKAGDVPAARVILDQAFEANPNSEEIWLAAVKLEFENDELERARDL 640 (913) T ss_pred CCHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 86999999999999738862---268998888887517848899999999983898188898988875224648999999 Q ss_pred HHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 34445--------5554431112345899999999999852001 Q gi|254781174|r 137 VGMSY--------AQMIRDVPYDQRATKLMLQYMSRIVERYTNS 172 (271) Q Consensus 137 ~a~~~--------~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S 172 (271) .|-+. +..+....+-++..++|+..+++.++.||+- T Consensus 641 lakar~~sgTeRv~mKs~~~er~ld~~eeA~rllEe~lk~fp~f 684 (913) T KOG0495 641 LAKARSISGTERVWMKSANLERYLDNVEEALRLLEEALKSFPDF 684 (913) T ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 99975028851267877689998640799999999998848755 No 129 >PRK11619 lytic murein transglycosylase; Provisional Probab=96.06 E-value=0.087 Score=28.51 Aligned_cols=69 Identities=12% Similarity=0.158 Sum_probs=47.9 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 36899999999999809899999999999853047730268998778877654555679999988754011211013 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVD 131 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~ 131 (271) ....+.|.+|...+++|+.......-.++ ..||-.|+.+--.+. ..=-.......+.|++.||++|.++ T Consensus 31 ~~QR~~Y~~A~~Al~~~d~~~~~~l~~~L-~dYPL~PYLdY~~L~-------~~L~~~~~~~V~aFL~~y~~tPla~ 99 (645) T PRK11619 31 DEQRQRYAQIKQAWDNRQMDVVEQLMPTL-KDYPLYPYLEYRQIT-------DDLMNQPAVTVTNFVRANPTLPPAR 99 (645) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHH-CCCCCHHHHHHHHHH-------HCCCCCCHHHHHHHHHHCCCCHHHH T ss_conf 99999999999999888999999998763-689877899999997-------0623099999999998779980779 No 130 >KOG1128 consensus Probab=96.01 E-value=0.092 Score=28.37 Aligned_cols=55 Identities=16% Similarity=0.139 Sum_probs=27.4 Q ss_pred HCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 0989999999999985304773026899877887765455567999998875401121 Q gi|254781174|r 70 EQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPES 127 (271) Q Consensus 70 ~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s 127 (271) +++|.+|.+.|+.-...+|.. ...-|..|.+..+.+++..|...|.+-+..-|++ T Consensus 498 ~~~fs~~~~hle~sl~~nplq---~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~ 552 (777) T KOG1128 498 NKDFSEADKHLERSLEINPLQ---LGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDN 552 (777) T ss_pred CCCHHHHHHHHHHHHHCCCCC---HHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 212899999999876506632---5578741489998763678999999896248882 No 131 >PRK10941 putative transcriptional regulator; Provisional Probab=95.98 E-value=0.035 Score=30.81 Aligned_cols=72 Identities=11% Similarity=0.076 Sum_probs=56.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 566666544322345689998865289981159999999999997498799999999997856999789999986 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETL 269 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~ 269 (271) .+-..|.+.++|..|+..-+.++.--|++++ -.---|-+|.++|....|..-+.......|+..-..-.+.. T Consensus 186 nLK~~~~~e~~~~~AL~~~e~lL~l~Pddp~---EiRDRGli~~qL~c~~~Al~DLe~fle~~P~dp~ae~ik~q 257 (269) T PRK10941 186 TLKAALMEEKQMELALRASEALLQFNPEDPY---EIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRAQ 257 (269) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 9999998511499999999999862969979---99999999998499489999999999978597289999999 No 132 >KOG4642 consensus Probab=95.87 E-value=0.11 Score=28.00 Aligned_cols=101 Identities=9% Similarity=0.111 Sum_probs=60.2 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 36899999999999809899999999999853047730268998778877654555679999988754011211013555 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) ..++++=.+|..+|..+.|..|+..|.+.+...|... .-.-+-|.||.+.++++.+...=.+.++.-|++-. +. T Consensus 8 ~~a~qlkE~gnk~f~~k~y~~ai~~y~raI~~nP~~~---~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk---~h 81 (284) T KOG4642 8 ESAEQLKEQGNKCFIPKRYDDAIDCYSRAICINPTVA---SYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVK---AH 81 (284) T ss_pred HHHHHHHHCCCCCCCHHHHCHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH---HH T ss_conf 6799998524602530111468999999885489730---24454788888762002666617889860728789---99 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44344455554431112345899999999999852 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERY 169 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~y 169 (271) |..|.+.... ....+||..+++....+ T Consensus 82 ~flg~~~l~s--------~~~~eaI~~Lqra~sl~ 108 (284) T KOG4642 82 YFLGQWLLQS--------KGYDEAIKVLQRAYSLL 108 (284) T ss_pred HHHHHHHHHH--------CCCCHHHHHHHHHHHHH T ss_conf 9988888752--------36607899999999887 No 133 >pfam04733 Coatomer_E Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex. Probab=95.72 E-value=0.12 Score=27.63 Aligned_cols=56 Identities=13% Similarity=0.098 Sum_probs=46.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH Q ss_conf 43223456899988652899811599999999999974987999999999978569997 Q gi|254781174|r 203 RGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGY 261 (271) Q Consensus 203 ~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~ 261 (271) .+++..|--.|+++-+.|+.|+. .+--++-++..+|.+++|......-....|+.. T Consensus 180 gek~q~A~yif~EL~~~~~~t~l---llng~Av~~m~~~~~~eAe~~L~eAl~~d~~d~ 235 (290) T pfam04733 180 GEKIQDAYYIFQEFSEKYDSTPL---LLNGQAVCCMCLGRYEEAESLLKEALDKDAKDP 235 (290) T ss_pred CHHHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH T ss_conf 47789899999999852289789---999999999874787999999999987288988 No 134 >KOG3081 consensus Probab=95.71 E-value=0.12 Score=27.78 Aligned_cols=55 Identities=9% Similarity=0.048 Sum_probs=44.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC Q ss_conf 4322345689998865289981159999999999997498799999999997856999 Q gi|254781174|r 203 RGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQG 260 (271) Q Consensus 203 ~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s 260 (271) .+++..|--.|+++-+.||-|+ ..+--++-++..+|.+++|....+....++|+- T Consensus 186 gek~qdAfyifeE~s~k~~~T~---~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~d 240 (299) T KOG3081 186 GEKIQDAFYIFEELSEKTPPTP---LLLNGQAVCHLQLGRYEEAESLLEEALDKDAKD 240 (299) T ss_pred CHHHHHHHHHHHHHHCCCCCCH---HHHCCHHHHHHHHCCHHHHHHHHHHHHHCCCCC T ss_conf 3245468999999835359976---887069999897169999999999998534787 No 135 >KOG4642 consensus Probab=95.52 E-value=0.063 Score=29.32 Aligned_cols=66 Identities=14% Similarity=0.048 Sum_probs=56.0 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 999999998098999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) =+.|+.+++-++|+.+...-...+..-|++ .++++.+|.+....+.|+.||..+.+..++.-.++. T Consensus 48 tnralchlk~~~~~~v~~dcrralql~~N~---vk~h~flg~~~l~s~~~~eaI~~Lqra~sl~r~~~~ 113 (284) T KOG4642 48 TNRALCHLKLKHWEPVEEDCRRALQLDPNL---VKAHYFLGQWLLQSKGYDEAIKVLQRAYSLLREQPF 113 (284) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCHHH---HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC T ss_conf 547888887620026666178898607287---899999888887523660789999999988732799 No 136 >KOG1127 consensus Probab=95.49 E-value=0.15 Score=27.13 Aligned_cols=63 Identities=21% Similarity=0.151 Sum_probs=26.4 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999809899999999999853047730268998778877654555679999988754011211 Q gi|254781174|r 63 KAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESK 128 (271) Q Consensus 63 ~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~ 128 (271) .|..+.+.++...|+..|+...+.-|.+- .+-..+|.+|-..|.|..|+..|++.....|.|. T Consensus 568 rG~yyLea~n~h~aV~~fQsALR~dPkD~---n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~ 630 (1238) T KOG1127 568 RGPYYLEAHNLHGAVCEFQSALRTDPKDY---NLWLGLGEAYPESGRYSHALKVFTKASLLRPLSK 630 (1238) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHHCCCEEHHHHHHHHHHHCCCHHH T ss_conf 04343685215558999999863890657---8899877887743762007776424574482768 No 137 >pfam10300 IML2 Putative mitochondrial outer membrane protein. This is a family of proteins conserved from fungi to humans. Members of this family also carry a TPR_2 domain pfam07719 at their C-terminus. Many of them are annotated as being mitochondrial outer membrane protein IML2. Probab=95.24 E-value=0.18 Score=26.65 Aligned_cols=90 Identities=18% Similarity=0.158 Sum_probs=64.9 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 689999999999980989999999999985-3047730268998778877654555679999988754011211013555 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSR-DFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~-~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) ..--++.+|.-..-+|+.++|+..|+.... .-+......-..+.+++++.-+.+|.+|...+.+..+...=|. +-| . T Consensus 249 s~l~l~~~gR~~~~~g~l~~Ai~~~~~~~~~~~~~kQ~~~lc~~El~w~~~~~~~w~~A~~~~~~L~~~s~WSk-a~Y-~ 326 (446) T pfam10300 249 SALWLFFEARIESLKGNLDEALELFEECIESQSEWKQIHHLCYWELAWCHVFLHRWKQAANYFLLLSDESKWSK-ALY-A 326 (446) T ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH-HHH-H T ss_conf 46899999999997379999999999867446544999999999999999999789999999999998454189-999-9 Q ss_pred HHHHHHHHHHHHH Q ss_conf 4434445555443 Q gi|254781174|r 135 YLVGMSYAQMIRD 147 (271) Q Consensus 135 y~~a~~~~~~~~~ 147 (271) |+.|.|+.....+ T Consensus 327 Y~~a~c~~~~~~~ 339 (446) T pfam10300 327 YLAAACLLMLGRE 339 (446) T ss_pred HHHHHHHHHHCCC T ss_conf 9999999865330 No 138 >PRK12450 foldase protein PrsA; Reviewed Probab=95.16 E-value=0.19 Score=26.51 Aligned_cols=49 Identities=12% Similarity=0.104 Sum_probs=34.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 9999999999999999997226875333445677767368999999999 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVL 66 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~ 66 (271) +|.|.++.+..+.+++.|.+||+.+.+...........+..++|+.-.. T Consensus 3 ~mKK~~~~~~~~~svl~LaAC~s~~~~~~v~t~kgg~IT~~efy~~~K~ 51 (309) T PRK12450 3 QMNKLITGVVTLATVVTLSACQSSHNNTKLVSMKGDTITVSDFYNETKN 51 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHC T ss_conf 7999999999999999998605899983689835992409999999853 No 139 >KOG3364 consensus Probab=95.05 E-value=0.15 Score=27.12 Aligned_cols=74 Identities=11% Similarity=0.229 Sum_probs=48.3 Q ss_pred CCHHHHHHHHHHHHHHCC---HHHHHHHHHHHHH-HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 368999999999998098---9999999999985-304773026899877887765455567999998875401121101 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQN---FSKAYEYFNQCSR-DFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNV 130 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~---y~~A~~~f~~i~~-~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~ 130 (271) .+.+..|+.|+......+ -.+-+..|+++.+ .+|.. -.+-.+.+|..+|+.++|+.|+...+.+++.-|+++.+ T Consensus 30 ~s~~s~f~lAwaLV~S~~~~dv~~GI~iLe~l~~~~~~~~--rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa 107 (149) T KOG3364 30 VSKQSQFNLAWALVRSRDTEDVQEGIVILEDLLKSAHPER--RRECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQA 107 (149) T ss_pred CHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 1388889999999746425899976999999861068400--01321356777888865899999999999608770789 No 140 >COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones] Probab=95.03 E-value=0.21 Score=26.29 Aligned_cols=49 Identities=18% Similarity=0.080 Sum_probs=21.2 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCC-----CCCHHHHHHHHHHHHHHHHHH Q ss_conf 9999999809899999999999853047-----730268998778877654555 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPF-----AGVARKSLLMSAFVQYSAGKY 110 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~-----s~~a~~A~~~la~~~y~~~~y 110 (271) +.|....+.+++++|+..+.++...... -+.-+.+.+.++..|-..|++ T Consensus 8 e~a~~~v~~~~~~~ai~~yk~iL~kg~s~dek~~nEqE~tvlel~~lyv~~g~~ 61 (421) T COG5159 8 ELANNAVKSNDIEKAIGEYKRILGKGVSKDEKTLNEQEATVLELFKLYVSKGDY 61 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 999876404459999999999965777712656647888999999899716773 No 141 >pfam00515 TPR_1 Tetratricopeptide repeat. Probab=94.96 E-value=0.024 Score=31.73 Aligned_cols=32 Identities=25% Similarity=0.341 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 89987788776545556799999887540112 Q gi|254781174|r 95 KSLLMSAFVQYSAGKYQQAASLGEEYITQYPE 126 (271) Q Consensus 95 ~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~ 126 (271) ++.+.+|.+|+.+|+|++|+..|++.++..|+ T Consensus 2 ~~~~~lG~~y~~~~~~~~A~~~~~~Al~l~p~ 33 (34) T pfam00515 2 KALYNLGNAYFKLGKYDEAIEYYEKALELNPN 33 (34) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 79999899999917899999999998843989 No 142 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=94.89 E-value=0.23 Score=26.06 Aligned_cols=183 Identities=16% Similarity=0.091 Sum_probs=88.7 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH----HHHHHHHH Q ss_conf 99999809899999999999853047730268998778877654555679999988754011211013----55544344 Q gi|254781174|r 64 AVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVD----YVYYLVGM 139 (271) Q Consensus 64 a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~----~A~y~~a~ 139 (271) |...-+.|+++.|-..+.++.+.-+++... ..+-.+...-.+++++.|....+...+..|.++.+- .++...|- T Consensus 125 A~AA~qrgd~~~an~yL~eaae~~~~~~l~--v~ltrarlll~~~d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~~~g~ 202 (400) T COG3071 125 AEAAQQRGDEDRANRYLAEAAELAGDDTLA--VELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGA 202 (400) T ss_pred HHHHHHCCCHHHHHHHHHHHHCCCCCCHHH--HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCC T ss_conf 999886305778989999986258994088--999999999867886568988999987286886999999999998511 Q ss_pred HH--HHHHH---------HH------------HHHHHHHHHHHHHHHHHHHHHHCCHHH------------------HHH Q ss_conf 45--55544---------31------------112345899999999999852001013------------------456 Q gi|254781174|r 140 SY--AQMIR---------DV------------PYDQRATKLMLQYMSRIVERYTNSPYV------------------KGA 178 (271) Q Consensus 140 ~~--~~~~~---------~~------------~~d~~~~~~A~~~f~~~i~~yP~S~ya------------------~~A 178 (271) -. +...+ +. -.++....+-.+.+....++-|.+... .+| T Consensus 203 ~~~ll~~l~~L~ka~~l~~~e~~~le~~a~~glL~q~~~~~~~~gL~~~W~~~pr~lr~~p~l~~~~a~~li~l~~~~~A 282 (400) T COG3071 203 WQALLAILPKLRKAGLLSDEEAARLEQQAWEGLLQQARDDNGSEGLKTWWKNQPRKLRNDPELVVAYAERLIRLGDHDEA 282 (400) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHCCCHHHH T ss_conf 89999988999873578829999999999999999873620014789999866497605834899999999975876889 Q ss_pred HHH----------------------------HHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH Q ss_conf 555----------------------------55556666666------55566666544322345689998865289981 Q gi|254781174|r 179 RFY----------------------------VTVGRNQLAAK------EVEIGRYYLKRGEYVAAIPRFQLVLANYSDAE 224 (271) Q Consensus 179 ~~~----------------------------l~~~~~~La~~------e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~ 224 (271) ... +..+...++.+ .+..|+.|++.+.|..|...|+..+..=|+. T Consensus 283 ~~~i~~~Lk~~~D~~L~~~~~~l~~~d~~~l~k~~e~~l~~h~~~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~s~- 361 (400) T COG3071 283 QEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSA- 361 (400) T ss_pred HHHHHHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH- T ss_conf 9999999874458569998865188993679999999998399981499999999998417899999999997258974- Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 1599999999999974987999999999 Q gi|254781174|r 225 HAEEAMARLVEAYVALALMDEAREVVSL 252 (271) Q Consensus 225 ~~~eAl~~l~~~y~~lg~~d~A~~~~~~ 252 (271) ...-.++.++.++|.+.+|.+.... T Consensus 362 ---~~~~~la~~~~~~g~~~~A~~~r~e 386 (400) T COG3071 362 ---SDYAELADALDQLGEPEEAEQVRRE 386 (400) T ss_pred ---HHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf ---3699999999981886889999999 No 143 >pfam03704 BTAD Bacterial transcriptional activator domain. Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats. Probab=94.85 E-value=0.23 Score=25.99 Aligned_cols=58 Identities=17% Similarity=0.094 Sum_probs=43.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 5666665443223456899988652899811599999999999974987999999999978 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQE 255 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~ 255 (271) ..+..+...|++..|+...+.++...| ..+.+...+..+|...|...+|...+..+.. T Consensus 67 ~l~~~~~~~g~~~~a~~~~~~~l~~dP---l~E~~~~~Lm~a~~~~G~~~~Al~~Y~~~r~ 124 (146) T pfam03704 67 RRIEADLRLGRHEEALAELRALVALHP---LRERLHRQLMRALYRSGRQAEALRVYRRLRR 124 (146) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHCC---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 999999885983021999999998688---7429999999999988999999999999999 No 144 >KOG0545 consensus Probab=94.79 E-value=0.24 Score=25.90 Aligned_cols=63 Identities=21% Similarity=0.202 Sum_probs=55.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 556666654432234568999886528998115999999999999749879999999999785699 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ 259 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~ 259 (271) +...++++..|+|-+++..-..++..+|+. ..|+|+-+.++-..=+.++|+.-++.+..+-|. T Consensus 234 lNy~QC~L~~~e~yevleh~seiL~~~~~n---vKA~frRakAhaa~Wn~~eA~~D~~~vL~ldps 296 (329) T KOG0545 234 LNYCQCLLKKEEYYEVLEHCSEILRHHPGN---VKAYFRRAKAHAAVWNEAEAKADLQKVLELDPS 296 (329) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHH T ss_conf 867778744878999999889998508850---899999877877536888888899999861956 No 145 >KOG2610 consensus Probab=94.69 E-value=0.26 Score=25.76 Aligned_cols=156 Identities=13% Similarity=0.112 Sum_probs=88.0 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCHHHHH Q ss_conf 368999999999998098999999999998530477302689987788776545556799999887540-1121101355 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQ-YPESKNVDYV 133 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~-~P~s~~~~~A 133 (271) .+.+.+=..+...+.+|+.-+|...++++...||..-.+-+ ..-.++|.+|+-+.....+++++.. +|+-|--.|+ T Consensus 101 ~arEk~h~~aai~~~~g~~h~a~~~wdklL~d~PtDlla~k---fsh~a~fy~G~~~~~k~ai~kIip~wn~dlp~~sYv 177 (491) T KOG2610 101 NAREKRHAKAAILWGRGKHHEAAIEWDKLLDDYPTDLLAVK---FSHDAHFYNGNQIGKKNAIEKIIPKWNADLPCYSYV 177 (491) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHH---HHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 26875102678763356641778889999875850355445---444678741414421457998433458998578998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH--------------------HHHHHHHHH Q ss_conf 544344455554431112345899999999999852001013456555555--------------------566666665 Q gi|254781174|r 134 YYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTV--------------------GRNQLAAKE 193 (271) Q Consensus 134 ~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~--------------------~~~~La~~e 193 (271) .=|.+.+..+- +..++|-+.-.+-++--|+.-.+.-|...+.. ....+|.|. T Consensus 178 ~GmyaFgL~E~--------g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVlem~~r~Keg~eFM~~ted~Wr~s~mlasHN 249 (491) T KOG2610 178 HGMYAFGLEEC--------GIYDDAEKQADRALQINRFDCWASHAKAHVLEMNGRHKEGKEFMYKTEDDWRQSWMLASHN 249 (491) T ss_pred HHHHHHHHHHH--------CCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 88998667872--------5502699999863147874168898899999840205657999985314121236887650 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHH-HHHHCC Q ss_conf 5-566666544322345689998-865289 Q gi|254781174|r 194 V-EIGRYYLKRGEYVAAIPRFQL-VLANYS 221 (271) Q Consensus 194 ~-~ia~~Y~~~~~y~aA~~~~~~-~i~~yp 221 (271) + -+|-||...+.|.+|..-|.. ++.... T Consensus 250 yWH~Al~~iE~aeye~aleIyD~ei~k~l~ 279 (491) T KOG2610 250 YWHTALFHIEGAEYEKALEIYDREIWKRLE 279 (491) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 688888641011135788888799998751 No 146 >KOG4555 consensus Probab=94.58 E-value=0.27 Score=25.60 Aligned_cols=107 Identities=14% Similarity=0.092 Sum_probs=78.8 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCCHHHHHHHH Q ss_conf 9999999999980989999999999985304773026899877887765455567999998875401-121101355544 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQY-PESKNVDYVYYL 136 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~-P~s~~~~~A~y~ 136 (271) .++=-+|...-+.|+.+.|++.|.+.+...|.. +.+.-+-|.++-.+|+.+.|+..+++.++.- |+..-+-.++-. T Consensus 44 ~~LEl~~valaE~g~Ld~AlE~F~qal~l~P~r---aSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trtacqa~vQ 120 (175) T KOG4555 44 RELELKAIALAEAGDLDGALELFGQALCLAPER---ASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRTACQAFVQ 120 (175) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC---HHHHCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 999999889885045678999999999854335---38661399999872881878998999998628620699999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 34445555443111234589999999999985200101345 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKG 177 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~ 177 (271) +|+.|-. ++.-+.|...|+..-+. +|+++.. T Consensus 121 Rg~lyRl--------~g~dd~AR~DFe~AA~L--GS~FAr~ 151 (175) T KOG4555 121 RGLLYRL--------LGNDDAARADFEAAAQL--GSKFARE 151 (175) T ss_pred HHHHHHH--------HCCHHHHHHHHHHHHHH--CCHHHHH T ss_conf 9999999--------37618888869999986--7878999 No 147 >PRK10941 putative transcriptional regulator; Provisional Probab=94.21 E-value=0.12 Score=27.61 Aligned_cols=59 Identities=14% Similarity=0.207 Sum_probs=30.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 4589999999999985200101345655555556666666555666665443223456899988652899811599 Q gi|254781174|r 153 RATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEE 228 (271) Q Consensus 153 ~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~e 228 (271) +....|+..-+.++...|+++|..- ..|-.|.+.+.+..|+.-++.+++.-|+.+-++- T Consensus 195 ~~~~~AL~~~e~lL~l~Pddp~EiR-----------------DRGli~~qL~c~~~Al~DLe~fle~~P~dp~ae~ 253 (269) T PRK10941 195 KQMELALRASEALLQFNPEDPYEIR-----------------DRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEM 253 (269) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHHH-----------------HHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 1499999999999862969979999-----------------9999999849948999999999997859728999 No 148 >smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism. Probab=94.00 E-value=0.054 Score=29.69 Aligned_cols=32 Identities=19% Similarity=0.221 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 89987788776545556799999887540112 Q gi|254781174|r 95 KSLLMSAFVQYSAGKYQQAASLGEEYITQYPE 126 (271) Q Consensus 95 ~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~ 126 (271) .+.+.+|.+|+.+|++++|+..|++.++..|+ T Consensus 2 ~~~~~lg~~y~~~~~~~~Ai~~~~~al~~~p~ 33 (34) T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPN 33 (34) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf 28999899999823699999999997735999 No 149 >pfam10602 RPN7 26S proteasome subunit RPN7. RPN7 (known as the non ATPase regulatory subunit 6 in higher eukaryotes) is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae to be required for structural integrity. The 26S proteasome is is involved in the ATP-dependent degradation of ubiquitinated proteins. Probab=93.70 E-value=0.42 Score=24.53 Aligned_cols=119 Identities=13% Similarity=-0.003 Sum_probs=67.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 04773026899877887765455567999998875401121101355544-34445555443111234589999999999 Q gi|254781174|r 87 FPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL-VGMSYAQMIRDVPYDQRATKLMLQYMSRI 165 (271) Q Consensus 87 ~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~-~a~~~~~~~~~~~~d~~~~~~A~~~f~~~ 165 (271) ....+....++..+|.-|...|+.+.|+..|.+..+.-....+.-+..+. +=++.+. .|-.....-+...+.+ T Consensus 29 n~~kesiR~~~~~l~~~y~~iGd~~~A~k~y~~~~~~~ts~~~~id~~l~~irv~i~~------~d~~~v~~~i~ka~~~ 102 (174) T pfam10602 29 NLGKEEIRMANLDLGEHYAKIGDLENALKAYSRAREYCTSLGHKIDMCLNLIRVGIFF------NDWDHVSKYIEKAKSL 102 (174) T ss_pred HCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHC T ss_conf 4270999999999899999966799999999998831788459999999999999997------0099999999999856 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 985200101345655555556666666555666665443223456899988652899811 Q gi|254781174|r 166 VERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEH 225 (271) Q Consensus 166 i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~ 225 (271) +..- .+ .+.+ ++|- -+-|-+.+..++|..|...|-..+..|+.+.. T Consensus 103 ~e~g--gD--w~r~-------nrlk---~y~Gl~~l~~r~f~~AA~~fl~~~~tf~~~el 148 (174) T pfam10602 103 IEKG--GD--WERR-------NRLK---VYEGLALLAIRDFKEAAKLFLDSLSTFTSTEL 148 (174) T ss_pred CCCC--CC--HHHH-------HHHH---HHHHHHHHHHCHHHHHHHHHHCCCCCCCCCCC T ss_conf 5144--55--8789-------8899---99999999716499999999637678884634 No 150 >pfam07719 TPR_2 Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515. Probab=93.67 E-value=0.048 Score=30.02 Aligned_cols=31 Identities=23% Similarity=0.349 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9987788776545556799999887540112 Q gi|254781174|r 96 SLLMSAFVQYSAGKYQQAASLGEEYITQYPE 126 (271) Q Consensus 96 A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~ 126 (271) +.+.+|.+++.+|+|++|+..|++.++..|+ T Consensus 3 ~~~~~g~~~~~~g~~~~A~~~~~~al~l~p~ 33 (34) T pfam07719 3 ALYNLGLAYYKLGDYEEALEAYEKALELDPN 33 (34) T ss_pred HHHHCCHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 6875014876216699999999997726999 No 151 >KOG1941 consensus Probab=93.55 E-value=0.44 Score=24.38 Aligned_cols=124 Identities=12% Similarity=0.049 Sum_probs=56.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 02689987788776545556799999887540112110-1355544344-455554431112345899999999999852 Q gi|254781174|r 92 VARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN-VDYVYYLVGM-SYAQMIRDVPYDQRATKLMLQYMSRIVERY 169 (271) Q Consensus 92 ~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~-~~~A~y~~a~-~~~~~~~~~~~d~~~~~~A~~~f~~~i~~y 169 (271) ..-++.+-++.++-.++.+++++..|+..++.--+..+ .-+-+--.++ +.|-+.+|.++......+|.+..+.+--.- T Consensus 120 ~~gq~~l~~~~Ahlgls~fq~~Lesfe~A~~~A~~~~D~~LElqvcv~Lgslf~~l~D~~Kal~f~~kA~~lv~s~~l~d 199 (518) T KOG1941 120 LGGQVSLSMGNAHLGLSVFQKALESFEKALRYAHNNDDAMLELQVCVSLGSLFAQLKDYEKALFFPCKAAELVNSYGLKD 199 (518) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 65035657777751378899999999999987650698403541353388999998744677761476999987527560 Q ss_pred HCCHHHHHHHHHHHHHHHH--------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0010134565555555666--------------------------666655566666544322345689998 Q gi|254781174|r 170 TNSPYVKGARFYVTVGRNQ--------------------------LAAKEVEIGRYYLKRGEYVAAIPRFQL 215 (271) Q Consensus 170 P~S~ya~~A~~~l~~~~~~--------------------------La~~e~~ia~~Y~~~~~y~aA~~~~~~ 215 (271) -++.|-.-+.++|...-.. -|+--.-.|++|-.+|+-+.|..||+. T Consensus 200 ~~~kyr~~~lyhmaValR~~G~LgdA~e~C~Ea~klal~~Gdra~~arc~~~~aDIyR~~gd~e~af~rYe~ 271 (518) T KOG1941 200 WSLKYRAMSLYHMAVALRLLGRLGDAMECCEEAMKLALQHGDRALQARCLLCFADIYRSRGDLERAFRRYEQ 271 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 568999999999999999841211399999999999998477798899999999998861307668999999 No 152 >pfam00515 TPR_1 Tetratricopeptide repeat. Probab=93.21 E-value=0.14 Score=27.38 Aligned_cols=32 Identities=19% Similarity=0.408 Sum_probs=24.6 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC Q ss_conf 99999999999809899999999999853047 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPF 89 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~ 89 (271) +..+..|..++..|+|++|++.|++++...|. T Consensus 2 ~~~~~lG~~y~~~~~~~~A~~~~~~Al~l~p~ 33 (34) T pfam00515 2 KALYNLGNAYFKLGKYDEAIEYYEKALELNPN 33 (34) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 79999899999917899999999998843989 No 153 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=92.40 E-value=0.65 Score=23.40 Aligned_cols=47 Identities=11% Similarity=0.128 Sum_probs=28.9 Q ss_pred HHHHHHH-HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 9999999-9999999999722687533344567776736899999999 Q gi|254781174|r 19 LYKFALT-IFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAV 65 (271) Q Consensus 19 m~k~~~~-i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~ 65 (271) |.|.+++ +..+++++.|.|||+.+.+...........+..++|+.-. T Consensus 1 MKKKl~~~~~~~~svl~LaaCs~~~~~~~vat~kgg~IT~~e~Y~~~K 48 (310) T PRK01326 1 MKKKLIAGAVTLLSVATLAACSKTNDNTKVISMKGDTITVSDFYNEVK 48 (310) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCEEHHHHHHHHH T ss_conf 908899999999999999871699988068982499342999999984 No 154 >TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of the cyanobacteria photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacteria protein PsbQ by homology. Probab=92.12 E-value=0.71 Score=23.19 Aligned_cols=23 Identities=13% Similarity=0.330 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999999972268753 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~ 43 (271) |.+..+.++++.+|++|||+... T Consensus 2 Rsi~s~~L~~~~~fLvsC~gp~a 24 (142) T TIGR03042 2 RSLASLLLVLLLTFLVSCSGPAA 24 (142) T ss_pred HHHHHHHHHHHHHHHHHCCCCCC T ss_conf 76999999999999988389988 No 155 >pfam07719 TPR_2 Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515. Probab=92.02 E-value=0.15 Score=27.15 Aligned_cols=32 Identities=16% Similarity=0.306 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC Q ss_conf 99999999999809899999999999853047 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPF 89 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~ 89 (271) ..++..|..++..|+|++|++.|++++...|. T Consensus 2 ~~~~~~g~~~~~~g~~~~A~~~~~~al~l~p~ 33 (34) T pfam07719 2 EALYNLGLAYYKLGDYEEALEAYEKALELDPN 33 (34) T ss_pred HHHHHCCHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 16875014876216699999999997726999 No 156 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=91.96 E-value=0.74 Score=23.09 Aligned_cols=48 Identities=13% Similarity=0.051 Sum_probs=31.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 99999999999999999997226875333445677767368999999999 Q gi|254781174|r 17 YQLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVL 66 (271) Q Consensus 17 ~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~ 66 (271) -.|.+.++.+..+++++.|.|||+.+..-... .....+..++|+.-.. T Consensus 3 ~~MKK~~l~~~~~~~~l~LaaCss~~~~Vat~--kg~~IT~~e~y~~~k~ 50 (298) T PRK04405 3 KKMKKWALAAASAGLLLSLAGCSSNNKTVATY--KGGKITESQYYKEMKQ 50 (298) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEE--CCCCCCHHHHHHHHHC T ss_conf 67899999999999999998717999769986--3987259999999864 No 157 >COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms] Probab=91.69 E-value=0.79 Score=22.90 Aligned_cols=59 Identities=19% Similarity=0.213 Sum_probs=46.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 56666654432234568999886528998115999999999999749879999999999785 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQER 256 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~ 256 (271) ..+.-+...+.+++++..++.++..-| +.+.+..++.++|+..|....|...+..+... T Consensus 158 ~lae~~~~~~~~~~~~~~l~~Li~~dp---~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~~ 216 (280) T COG3629 158 KLAEALIACGRADAVIEHLERLIELDP---YDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKT 216 (280) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCC---CCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 999998731218899999999986296---46499999999999758806899999999997 No 158 >TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK; InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.. Probab=91.63 E-value=0.8 Score=22.87 Aligned_cols=68 Identities=12% Similarity=0.090 Sum_probs=49.4 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 999999998098999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) .=.|+-++..|+|++|++.|.+|...-|....-+-+.=++|.|+.-.+|-+. ....++.|..-|+..- T Consensus 54 ~~dG~L~i~~G~y~dAaR~L~E~~~~~~~~svm~yg~ALlAlcL~~~~D~~W-~~~A~e~L~~~a~~da 121 (190) T TIGR02561 54 MFDGWLLIARGNYDDAARILRELLASAGAASVMPYGKALLALCLSAKGDDEW-HLHADEVLEEDADADA 121 (190) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHH-HHHHHHHHHCCCCCHH T ss_conf 9999998615875899999998724899720437899999998743288888-9999998744878506 No 159 >smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism. Probab=91.56 E-value=0.27 Score=25.64 Aligned_cols=31 Identities=23% Similarity=0.421 Sum_probs=21.3 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC Q ss_conf 9999999999809899999999999853047 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQCSRDFPF 89 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~ 89 (271) .++..|..++..|++++|++.|++.+...|. T Consensus 3 ~~~~lg~~y~~~~~~~~Ai~~~~~al~~~p~ 33 (34) T smart00028 3 ALYNLGNAYLKLGDYDEALEYYEKALELDPN 33 (34) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf 8999899999823699999999997735999 No 160 >COG3014 Uncharacterized protein conserved in bacteria [Function unknown] Probab=90.91 E-value=0.95 Score=22.44 Aligned_cols=20 Identities=20% Similarity=-0.031 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999972268753 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~ 43 (271) -.|+-.|.+++|+||..... T Consensus 6 qlI~s~I~~L~LTgC~N~~~ 25 (449) T COG3014 6 QLIMSVILGLSLTGCINYRM 25 (449) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 12344566763598776787 No 161 >KOG4814 consensus Probab=90.65 E-value=1 Score=22.30 Aligned_cols=74 Identities=9% Similarity=0.067 Sum_probs=59.2 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 999999999998098999999999998530477302---68998778877654555679999988754011211013 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVA---RKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVD 131 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a---~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~ 131 (271) ..+.+.|...|+-++|..+++.|..-...+|...+. .+.+=.++.||..+.+-|.|...+++.-+--|+++... T Consensus 355 ~iLWn~A~~~F~~~~Y~~s~~~y~~Sl~~i~~D~~~~~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~~d~~~~l~q 431 (872) T KOG4814 355 TLLWNTAKKLFKMEKYVVSIRFYKLSLKDIISDNYSDRFAKIQRALQVCYLKLEQLDNAVEVYQEAEEVDRQSPLCQ 431 (872) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 99998888775377689999999998874475046578999999999998259988779999999986073238899 No 162 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=90.62 E-value=1 Score=22.28 Aligned_cols=46 Identities=11% Similarity=0.067 Sum_probs=29.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 9999999999999999722687533344567776736899999999 Q gi|254781174|r 20 YKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAV 65 (271) Q Consensus 20 ~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~ 65 (271) .|.++++..++++++|.||+++....+...-.....+..++|+.-. T Consensus 3 kK~i~~~~~~~svl~LaaC~~~~~~~~VAtv~g~~IT~~df~~~lk 48 (285) T PRK03002 3 GKHIFIITALISILMLSACGQKNSSATVATATDSTITKSDFEKQLK 48 (285) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEECHHHHHHHHH T ss_conf 0189999999999999984579999779986897873999999999 No 163 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=89.60 E-value=0.67 Score=23.32 Aligned_cols=53 Identities=23% Similarity=0.142 Sum_probs=29.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 999999999999999999972268753334456777673689999999999980 Q gi|254781174|r 17 YQLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKE 70 (271) Q Consensus 17 ~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~ 70 (271) ++|..+..+ ..+++.++++||++.+.+...+.-.....+..++.+.+....++ T Consensus 3 ~~~k~~~~~-~~l~~~l~l~gCg~~~~~~~~ikVG~~~gp~~ei~e~~~~~~~e 55 (271) T PRK11063 3 FKFKTFAAV-GALIGSLALVGCGQDEKDPNHIKVGVIVGAEQQVAEVAQKVAKE 55 (271) T ss_pred HHHHHHHHH-HHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHH T ss_conf 029999999-99999997500589856798189984689879999998888886 No 164 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=89.48 E-value=1.3 Score=21.73 Aligned_cols=47 Identities=21% Similarity=0.174 Sum_probs=29.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 9999999999999999972268753334456777673689999999999 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLF 67 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~ 67 (271) |.|.++.+ ..++++.|++||+++.+.+ ........+..++|+.-... T Consensus 1 MKK~~la~-~~~svl~LaaC~~~~~~~v-at~kgg~IT~~e~y~~~k~~ 47 (287) T PRK03095 1 MKKAMLAL-AATSVIALSACGTSSSDKI-VTSKAGDITKDEFYEQMKTQ 47 (287) T ss_pred CCHHHHHH-HHHHHHHHHHCCCCCCCEE-EEECCCCCCHHHHHHHHHHH T ss_conf 90789999-9999999985168998758-98469827299999999988 No 165 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=89.29 E-value=0.42 Score=24.51 Aligned_cols=27 Identities=11% Similarity=0.191 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999999999999997226875333 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) |.|++..+++.+.+++|.+||+..... T Consensus 3 ~nk~~K~l~ia~~~l~LaACSS~~~~~ 29 (173) T PRK10802 3 LNKVLKGLMIALPVMAIAACSSNKNAS 29 (173) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 078999999999999998557999888 No 166 >pfam10255 Paf67 RNA polymerase I-associated factor PAF67. RNA polymerase I is a multisubunit enzyme and its transcription competence is dependent on the presence of PAF67. This family of proteins is conserved from worms to humans. Probab=88.08 E-value=1.4 Score=21.46 Aligned_cols=57 Identities=9% Similarity=0.030 Sum_probs=35.8 Q ss_pred HHHHCCHHHHHHHHHHHHHHC----CCCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998098999999999998530----477302-689987788776545556799999887540 Q gi|254781174|r 67 FLKEQNFSKAYEYFNQCSRDF----PFAGVA-RKSLLMSAFVQYSAGKYQQAASLGEEYITQ 123 (271) Q Consensus 67 ~~~~~~y~~A~~~f~~i~~~~----P~s~~a-~~A~~~la~~~y~~~~y~~A~~~~~~fi~~ 123 (271) +-=-|||..|++.++.|.-.- +..+-+ -...+..|.||..+++|.+|+..|...+-. T Consensus 132 hvLLGDY~~Alk~L~~Idl~~~~l~~kV~~~~vs~~YyvGFaYlMlrRY~DAir~f~~iL~y 193 (402) T pfam10255 132 HVLLGDYYLALKVLEPIDLSKKGLYSKVPAAYVSLYYYVGFAYLMLRRYADAIRVFSQILIY 193 (402) T ss_pred HHHCCCHHHHHHHHHHHCCCCCHHHCCCCHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87005689999975241446320102474040212458899999986589999999999999 No 167 >pfam02259 FAT FAT domain. The FAT domain is named after FRAP, ATM and TRRAP. Probab=87.74 E-value=1.7 Score=21.04 Aligned_cols=30 Identities=7% Similarity=-0.105 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 345689998865289981159999999999997 Q gi|254781174|r 207 VAAIPRFQLVLANYSDAEHAEEAMARLVEAYVA 239 (271) Q Consensus 207 ~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~ 239 (271) ..++..|+..++-.|+. ++|.+.+|..+.+ T Consensus 275 ~~il~~y~~A~~~~~~w---~Kaw~~wa~~~~~ 304 (351) T pfam02259 275 DEILKAYRTATQFDDQW---YKAWHSWALANFE 304 (351) T ss_pred HHHHHHHHHHHHHCCCC---HHHHHHHHHHHHH T ss_conf 99999999999878664---8999999999999 No 168 >COG3107 LppC Putative lipoprotein [General function prediction only] Probab=87.36 E-value=1.7 Score=20.91 Aligned_cols=143 Identities=10% Similarity=-0.037 Sum_probs=80.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC-----------------HHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 9999999999999999999722687533344567776736-----------------89999999999980989999999 Q gi|254781174|r 17 YQLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRY-----------------QREVYEKAVLFLKEQNFSKAYEY 79 (271) Q Consensus 17 ~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~-----------------~~~lY~~a~~~~~~~~y~~A~~~ 79 (271) +.+.++...+..+++.+++.||+..........-..+... ...+-.-+....++|+-..|... T Consensus 6 f~r~~~~r~l~~illa~vla~C~t~~~~~~~~~lq~~a~a~s~~yl~qa~qs~~~~~~~~~llAa~al~~e~k~~qA~~L 85 (604) T COG3107 6 FQRIKAPRLLTPILLALVLAGCSTFLPSGSVVLLQGTANASSQFYLQQAQQSSGEQQNDWLLLAARALVEEGKTAQAQAL 85 (604) T ss_pred HHHHHCCHHHHHHHHHHHHHHHCCCCCCCCHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 76763020038999999997604678998601116782111999999976337015664999999999974782889999 Q ss_pred HHHHHH-HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999985-3047730268998778877654555679999988754011211013555443444555544311123458999 Q gi|254781174|r 80 FNQCSR-DFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLM 158 (271) Q Consensus 80 f~~i~~-~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A 158 (271) +.++-. .-|. --.+..+..|+..-..+++..|...+.+.... ..+..-.+.|..+.+-.... -..+ T Consensus 86 l~ql~~~Ltd~--Q~~~~~LL~ael~la~~q~~~Al~~L~~~~~~--~ls~~Qq~Ry~q~~a~a~ea---------~~~~ 152 (604) T COG3107 86 LNQLPQELTDA--QRAEKSLLAAELALAQKQPAAALQQLAKLLPA--DLSQNQQARYYQARADALEA---------RGDS 152 (604) T ss_pred HHHCCCCCCHH--HHHHHHHHHHHHHHHCCCHHHHHHHHHHCCHH--HCCHHHHHHHHHHHHHHHHC---------CCCH T ss_conf 98656237989--99999999999998422729999998520321--16778899999999999861---------3215 Q ss_pred HHHHHHHHHHHHCC Q ss_conf 99999999852001 Q gi|254781174|r 159 LQYMSRIVERYTNS 172 (271) Q Consensus 159 ~~~f~~~i~~yP~S 172 (271) +...+..|..+|.. T Consensus 153 ~~a~rari~~~~lL 166 (604) T COG3107 153 IDAARARIAQDPLL 166 (604) T ss_pred HHHHHHHHHHHHHC T ss_conf 89999999753210 No 169 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=87.02 E-value=0.52 Score=23.99 Aligned_cols=25 Identities=24% Similarity=0.329 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999999999999997226875333 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) |+| +++.+++++++|+|||....+. T Consensus 1 MkK--~i~~il~~~llL~GCs~mn~~~ 25 (304) T pfam07901 1 MRK--LIKLILIATLLLSGCSTTNNES 25 (304) T ss_pred CCH--HHHHHHHHHHHHCCCCCCCCCC T ss_conf 901--8999999999971334455542 No 170 >pfam10345 Cohesin_load Cohesin loading factor. Cohesin_load is a common cohesin loading factor protein that is conserved in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes but dispensable in G2 when cohesion has been established. It is referred to as both Ssl3, in pombe, and Scc4, in S.cerevisiae. It complexes with Mis4. Probab=86.28 E-value=2 Score=20.56 Aligned_cols=195 Identities=8% Similarity=-0.046 Sum_probs=83.1 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC-------CCHHHHHHHHHHH--HHHHHH---HHHHHHHHHHHHHHC Q ss_conf 8999999999998098999999999998530477-------3026899877887--765455---567999998875401 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFA-------GVARKSLLMSAFV--QYSAGK---YQQAASLGEEYITQY 124 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s-------~~a~~A~~~la~~--~y~~~~---y~~A~~~~~~fi~~~ 124 (271) .-..+-.|...+..+...++++..+.++...-.- +.-=.+.+.+-+. +-..|+ -.+.+..++.+++.. T Consensus 180 v~~~l~~~~l~l~~~~~~d~l~~l~~~~~~~~~~q~~~~~~~~qL~~l~lll~l~~~l~~~~~~~~~~~L~~lq~~~~~~ 259 (593) T pfam10345 180 VVLSLLEALLHLYRGSPKDVLELLQEAIAAARSLQLDDSEQIPQLKALRLLLDLLCSLQKDDVKNVKEKLKQLQSFLDEL 259 (593) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 99999999897503982569999999999999731466457289999999999999997698666999999999999985 Q ss_pred CCCCHH------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 121101------------------------------35554434445555443111234589999999999985200101 Q gi|254781174|r 125 PESKNV------------------------------DYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPY 174 (271) Q Consensus 125 P~s~~~------------------------------~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~y 174 (271) -+.+.. .-+++.-|+++... ...++.++.+.+|+...+++....|..+- T Consensus 260 ~~~~~w~~~~~~~~i~~~~~~~~~l~f~wl~~~~l~~L~y~vs~l~~~~~-~~~~ka~k~l~k~L~~i~kl~~~~~~~~~ 338 (593) T pfam10345 260 KNLSDWSSWDIDGSIKLKIVIRDPLVFSWLPKEDLYALVYFLSGLHNMYD-KYTDKSQKFLPKALRQLEKLKLKRNPESP 338 (593) T ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 36677888776510145667787632665588899999999999997400-20227999999999999975414776678 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HCCC---CHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf --34565555555666666655566666544322345689998865---2899---811599999999999974987999 Q gi|254781174|r 175 --VKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLA---NYSD---AEHAEEAMARLVEAYVALALMDEA 246 (271) Q Consensus 175 --a~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~---~yp~---t~~~~eAl~~l~~~y~~lg~~d~A 246 (271) ..++..+..-.+...---.++++-.++-++++..|......+.. .-|+ ....+-..+..|-.+...|+.+.| T Consensus 339 ~sl~~~~~~~~~~~~l~~~~~~y~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~yL~gl~~q~~gdl~~A 418 (593) T pfam10345 339 FSLSELSEKIQWLSSIKCYVLFYIIWCAFVLGDWAKADRLNEFLRSLNQKLFEKLLKDLQPLLHYLEGLYAQSKGDLEAA 418 (593) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 78899976558999999999999999998653798899889999988650430145656799999999999871679999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781174|r 247 REVVSL 252 (271) Q Consensus 247 ~~~~~~ 252 (271) ..++.. T Consensus 419 ~~~y~~ 424 (593) T pfam10345 419 LYYYTK 424 (593) T ss_pred HHHHHH T ss_conf 999860 No 171 >KOG1464 consensus Probab=86.21 E-value=2 Score=20.54 Aligned_cols=55 Identities=18% Similarity=0.158 Sum_probs=47.4 Q ss_pred HHHCCHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9809899999999999853047-730268998778877654555679999988754 Q gi|254781174|r 68 LKEQNFSKAYEYFNQCSRDFPF-AGVARKSLLMSAFVQYSAGKYQQAASLGEEYIT 122 (271) Q Consensus 68 ~~~~~y~~A~~~f~~i~~~~P~-s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~ 122 (271) +...+..+|+..|++|+..-|. ++|+-+|+-.+..++|.+++|++-...|...+. T Consensus 38 l~e~~p~~Al~sF~kVlelEgEKgeWGFKALKQmiKI~f~l~~~~eMm~~Y~qlLT 93 (440) T KOG1464 38 LKEDEPKEALSSFQKVLELEGEKGEWGFKALKQMIKINFRLGNYKEMMERYKQLLT 93 (440) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 44368799999999999416643426799999888777504667999999999999 No 172 >COG2912 Uncharacterized conserved protein [Function unknown] Probab=86.01 E-value=2.1 Score=20.48 Aligned_cols=64 Identities=17% Similarity=0.106 Sum_probs=33.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 6544322345689998865289981159999999999997498799999999997856999789999 Q gi|254781174|r 200 YLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYV 266 (271) Q Consensus 200 Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a 266 (271) |...+.+..|...-+.++.-.|++++-- -=-|-+|.++|-..-|.+..+......|+.....-. T Consensus 191 ~~~e~~~~~al~~~~r~l~l~P~dp~ei---rDrGliY~ql~c~~vAl~dl~~~~~~~P~~~~a~~i 254 (269) T COG2912 191 LLRELQWELALRVAERLLDLNPEDPYEI---RDRGLIYAQLGCYHVALEDLSYFVEHCPDDPIAEMI 254 (269) T ss_pred HHHHHCHHHHHHHHHHHHHHCCCCHHHC---CCCHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 9984056999999999875189984431---670899986488445699999999849993578999 No 173 >pfam11817 Foie-gras_1 Foie gras liver health family 1. Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first. THe C-terminus of this region contains TPR repeats. Probab=85.91 E-value=2.1 Score=20.45 Aligned_cols=56 Identities=21% Similarity=0.309 Sum_probs=37.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 55666665443223456899988652899811---599999999999974987999999 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEH---AEEAMARLVEAYVALALMDEAREV 249 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~---~~eAl~~l~~~y~~lg~~d~A~~~ 249 (271) +.+|+-|+..|+|..|+.-|+.+.-.|-+..+ ..+.+..+-+|...+|+.+....+ T Consensus 189 ~~mA~Ey~~~g~~~~Al~ll~~~~~~yR~EgW~~Ll~~vl~~l~~CA~~~~d~~~~l~~ 247 (254) T pfam11817 189 LEMAEEYLRLGDWSKALELLDPVALSYRKEGWWTLLEEVLWALRECARLVGDVKDVLAV 247 (254) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 99999999768999999999999999997887999999999999999981899999999 No 174 >pfam09986 DUF2225 Uncharacterized protein conserved in bacteria (DUF2225). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=85.84 E-value=2.1 Score=20.43 Aligned_cols=52 Identities=15% Similarity=0.037 Sum_probs=32.6 Q ss_pred HHHHHHHHHHHHHH--HCCCCHHHH-HHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 22345689998865--289981159-99999999999749879999999999785 Q gi|254781174|r 205 EYVAAIPRFQLVLA--NYSDAEHAE-EAMARLVEAYVALALMDEAREVVSLIQER 256 (271) Q Consensus 205 ~y~aA~~~~~~~i~--~yp~t~~~~-eAl~~l~~~y~~lg~~d~A~~~~~~l~~~ 256 (271) -...|...|....+ ++|.++..+ ..+|++|+.+.++|+.++|.+++..+... T Consensus 140 fl~~Al~~y~~ay~~e~~~~~~~~e~~l~YLigeL~rRlG~~~eA~~wfs~vi~~ 194 (214) T pfam09986 140 FLKKALELYKEAYEKEDFPIEGIDEATLMYLIGELSRRLGNKEEALRWFSKVIGS 194 (214) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC T ss_conf 9999999999999606688777118899999999999949999999999999639 No 175 >KOG1308 consensus Probab=85.45 E-value=0.54 Score=23.89 Aligned_cols=65 Identities=22% Similarity=0.086 Sum_probs=53.3 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 99999998098999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) -+|.+.+.+|+++.|++.|...+...|.+. .-+-+.|-++.+++++..|+..+...++.+|++.. T Consensus 119 ~~A~ealn~G~~~~ai~~~t~ai~lnp~~A---~l~~kR~sv~lkl~kp~aairD~d~A~ein~Dsa~ 183 (377) T KOG1308 119 VQASEALNDGEFDTAIELFTSAIELNPPLA---ILYAKRASVFLKLKKPNAAIRDCDFAIEINPDSAK 183 (377) T ss_pred HHHHHHHCCCCHHHHHCCCCCCCCCCCCHH---HHCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCC T ss_conf 888987327643333115531203699434---41055341002236873465402455416831023 No 176 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=84.66 E-value=1.3 Score=21.67 Aligned_cols=27 Identities=15% Similarity=0.146 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999999999999999722687533344 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSSRDVY 47 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~~~~~ 47 (271) |...+++++++.++++|||+++.+... T Consensus 2 k~~~~~~~~~~~~~l~~C~~~~~~~~~ 28 (289) T TIGR03659 2 KILSLILLALLSLGLTGCSSSKAESKV 28 (289) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 129999999999998432899877766 No 177 >PRK01622 OxaA-like protein precursor; Validated Probab=84.39 E-value=2.5 Score=20.03 Aligned_cols=69 Identities=17% Similarity=0.251 Sum_probs=40.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH---HHHHHHHHHH---HCCHHHHHHHHHHHHHH--CCC Q ss_conf 999999999999999999722687533344567776736899---9999999998---09899999999999853--047 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQRE---VYEKAVLFLK---EQNFSKAYEYFNQCSRD--FPF 89 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~---lY~~a~~~~~---~~~y~~A~~~f~~i~~~--~P~ 89 (271) |.+|..+....+.++++++||+...+. +..+......- -.......+. .|+|.-||-.+.-+++. +|- T Consensus 3 k~~~~~l~~~~~~~~~~lsgc~~~~~~----~~~~~g~~~~~~v~p~~~~i~~~a~~~~gnyGlaIIl~TiivRlil~PL 78 (266) T PRK01622 3 KSYRAVLISLSLLLVFVLSGCSNAAPI----DAHSDGFFHHYFVYPFSFLIQFVAHHIHGSYGIAIIIVTLIIRSLMIPL 78 (266) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCC----CCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 388999999999999999546899998----8999970799999999999999998758849999999999999999870 Q ss_pred C Q ss_conf 7 Q gi|254781174|r 90 A 90 (271) Q Consensus 90 s 90 (271) + T Consensus 79 ~ 79 (266) T PRK01622 79 A 79 (266) T ss_pred H T ss_conf 2 No 178 >pfam09670 Cas_Cas02710 CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia). Probab=84.35 E-value=2.5 Score=20.02 Aligned_cols=64 Identities=14% Similarity=0.241 Sum_probs=51.6 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999998098999999999998530477-302689987788776545556799999887540 Q gi|254781174|r 60 VYEKAVLFLKEQNFSKAYEYFNQCSRDFPFA-GVARKSLLMSAFVQYSAGKYQQAASLGEEYITQ 123 (271) Q Consensus 60 lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s-~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~ 123 (271) ...++...++.++|..|...|+++..+.|.. .+..-..+-.|+-.+..-++.+|...+++.+.. T Consensus 134 e~~~a~~lfn~~~y~~A~~~l~~l~~rl~~~~~~~~~~~l~~~y~~WD~f~~~~A~~~L~~~~~~ 198 (379) T pfam09670 134 ERRQAIRLFNKGDYGSALRILEDLARRLPGRKDYAAYKLLSEGYLAWDSFDHEEAADYLSKVLAL 198 (379) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99999998845688999999999987379745799999999999998655899999999998875 No 179 >COG2909 MalT ATP-dependent transcriptional regulator [Transcription] Probab=84.16 E-value=2.5 Score=19.97 Aligned_cols=196 Identities=14% Similarity=0.057 Sum_probs=104.0 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHH Q ss_conf 99999998098999999999998530477302689--987788776545556799999887540112110---1355544 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKS--LLMSAFVQYSAGKYQQAASLGEEYITQYPESKN---VDYVYYL 136 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A--~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~---~~~A~y~ 136 (271) -.|......++.+.|.+.-+.+...-|...+..++ .--+|.+..-.|++++|...-....+.--.+.. ...+.+. T Consensus 463 L~a~val~~~~~e~a~~lar~al~~L~~~~~~~r~~~~sv~~~a~~~~G~~~~Al~~~~~a~~~a~~~~~~~l~~~~~~~ 542 (894) T COG2909 463 LRAQVALNRGDPEEAEDLARLALVQLPEAAYRSRIVALSVLGEAAHIRGELTQALALMQQAEQMARQHDVYHLALWSLLQ 542 (894) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 99999984587789999999999866122540002565405688887343999999888799999980448899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHCCHHHHHHHHHHHHHHHHHHHH-------------------- Q ss_conf 3444555544311123458999999999----9985200101345655555556666666-------------------- Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSR----IVERYTNSPYVKGARFYVTVGRNQLAAK-------------------- 192 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~----~i~~yP~S~ya~~A~~~l~~~~~~La~~-------------------- 192 (271) .+.....+. +....+....|.. ...+-|-|.+..-.+..+.....+++.- T Consensus 543 ~s~il~~qG------q~~~a~~~~~~~~~~~q~l~q~~~~~f~~~~r~~ll~~~~r~~~~~~ear~~~~~~~~~~~~~~~ 616 (894) T COG2909 543 QSEILEAQG------QVARAEQEKAFNLIREQHLEQKPRHEFLVRIRAQLLRAWLRLDLAEAEARLGIEVGSVYTPQPLL 616 (894) T ss_pred HHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHCCCCHHH T ss_conf 999999830------78899888887779988765425321578899999999998866667764330042203632367 Q ss_pred ---H-HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH--HHHHHHH-HH-HHCCCHHHHHHHHHH------HHHHCC Q ss_conf ---5-55666665443223456899988652899811599--9999999-99-974987999999999------978569 Q gi|254781174|r 193 ---E-VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEE--AMARLVE-AY-VALALMDEAREVVSL------IQERYP 258 (271) Q Consensus 193 ---e-~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~e--Al~~l~~-~y-~~lg~~d~A~~~~~~------l~~~yP 258 (271) . ...|+.++.+|++++|..+...+...+.++.+... |.-..+. .| ...|+..+|...... ....|| T Consensus 617 ~~~~~~~LA~l~~~~Gdl~~A~~~l~~~~~l~~~~~~~~~~~a~~~~v~~~lwl~qg~~~~a~~~l~~s~~~~~~~~~~~ 696 (894) T COG2909 617 SRLALSMLAELEFLRGDLDKALAQLDELERLLLNGQYHVDYLAAAYKVKLILWLAQGDKELAAEWLLKSGDPDKANAHFP 696 (894) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCHHHHHCC T ss_conf 99999999999875388888998999999985089987048888988667873156998999999986158203432053 Q ss_pred CCHHH Q ss_conf 99789 Q gi|254781174|r 259 QGYWA 263 (271) Q Consensus 259 ~s~~~ 263 (271) +..|. T Consensus 697 ~~~~r 701 (894) T COG2909 697 QLQWR 701 (894) T ss_pred CCCCC T ss_conf 52400 No 180 >COG4669 EscJ Type III secretory pathway, lipoprotein EscJ [Intracellular trafficking and secretion] Probab=83.92 E-value=1.6 Score=21.13 Aligned_cols=19 Identities=16% Similarity=0.408 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999997226 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWE 39 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs 39 (271) .....+++++++++|+||- T Consensus 3 ~~~~~l~~l~l~l~L~gCk 21 (246) T COG4669 3 SLRKYLFLLLLILLLTGCK 21 (246) T ss_pred HHHHHHHHHHHHHHHHCCH T ss_conf 2899999999999983356 No 181 >pfam10579 Rapsyn_N Rapsyn N-terminal myristoylation and linker region. Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane. Probab=83.90 E-value=2.6 Score=19.90 Aligned_cols=60 Identities=17% Similarity=0.161 Sum_probs=36.7 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999998098999999999998530477302689987788776545556799999 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLG 117 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~ 117 (271) +.--++|..+++..+...|+..+..++++-...+.--..+-.++-+|-..|+|.+.+... T Consensus 7 Kq~ie~GlkLY~~~~~~~Av~~W~~~L~k~~~~~~rF~~LGyL~qA~~e~GkYr~ml~fa 66 (80) T pfam10579 7 KQQIEKGLRLYQSNNTEKALEIWRKVLEKTTDREGRFRVLGCLITAHSEMGKYKEMLEFA 66 (80) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999712899999999999872470014789989999998988899999999 No 182 >PRK10880 adenine DNA glycosylase; Provisional Probab=83.11 E-value=2.8 Score=19.71 Aligned_cols=69 Identities=12% Similarity=0.189 Sum_probs=40.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 877887765455567999998875401121101355------54434445555443111234589999999999985200 Q gi|254781174|r 98 LMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYV------YYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTN 171 (271) Q Consensus 98 ~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A------~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~ 171 (271) .++.+.-.++-+-+..+.+|++|++.||+-.....| ....|+-||..... ..+.|....+++-..+|+ T Consensus 33 vwvSEiMLQQTqv~tV~~yy~rf~~~fP~~~~LA~A~~~~vl~~W~GLGYY~RArn------Lh~aA~~i~~~~~G~~P~ 106 (350) T PRK10880 33 VWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARN------LHKAAQQVATLHGGKFPE 106 (350) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHH------HHHHHHHHHHHCCCCCCC T ss_conf 99999998348778999999999998839999977999999998640693899999------999999999975898982 Q ss_pred C Q ss_conf 1 Q gi|254781174|r 172 S 172 (271) Q Consensus 172 S 172 (271) + T Consensus 107 ~ 107 (350) T PRK10880 107 T 107 (350) T ss_pred H T ss_conf 5 No 183 >COG4259 Uncharacterized protein conserved in bacteria [Function unknown] Probab=82.95 E-value=2.8 Score=19.67 Aligned_cols=92 Identities=23% Similarity=0.261 Sum_probs=52.8 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-CC---HHHHH-HHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 999999999972268753334456777673689999999999980-98---99999-99999985304773026899877 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKE-QN---FSKAY-EYFNQCSRDFPFAGVARKSLLMS 100 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~-~~---y~~A~-~~f~~i~~~~P~s~~a~~A~~~l 100 (271) +.+++.+..|.||+...+ +... ....+...|+ +++. .. -..|+ +.++++... +..-.|-.+-.+ T Consensus 10 ~al~~av~~LagC~~gpK--slY~---w~gYq~tvyE----y~K~~es~e~Q~~~le~~~ek~~ak--~~~vpPG~HAhL 78 (121) T COG4259 10 LALLLAVAALAGCGGGPK--SLYQ---WEGYQDTVYE----YFKGDESKEAQTAALEKYLEKIGAK--NGAVPPGYHAHL 78 (121) T ss_pred HHHHHHHHHHHHCCCCCC--CCCC---CCCCCHHHHH----HHCCCCCHHHHHHHHHHHHHHHHHC--CCCCCCCHHHHH T ss_conf 999999999988057986--4101---3786389999----9817887889999999999997514--899997278887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 8877654555679999988754011211 Q gi|254781174|r 101 AFVQYSAGKYQQAASLGEEYITQYPESK 128 (271) Q Consensus 101 a~~~y~~~~y~~A~~~~~~fi~~~P~s~ 128 (271) |..|-..|+.++|...|+.=-..||.|- T Consensus 79 GlLys~~G~~e~a~~eFetEKalFPES~ 106 (121) T COG4259 79 GLLYSNSGKDEQAVREFETEKALFPESG 106 (121) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCH T ss_conf 7788604982899999987664076601 No 184 >pfam05053 Menin Menin. MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23. Probab=82.92 E-value=2.8 Score=19.67 Aligned_cols=45 Identities=13% Similarity=0.192 Sum_probs=35.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 566666544322345689998---865289981159999999999997 Q gi|254781174|r 195 EIGRYYLKRGEYVAAIPRFQL---VLANYSDAEHAEEAMARLVEAYVA 239 (271) Q Consensus 195 ~ia~~Y~~~~~y~aA~~~~~~---~i~~yp~t~~~~eAl~~l~~~y~~ 239 (271) +.|.||+++++|..|+..+-+ ||..|--+...+|...-+.++-.. T Consensus 323 Y~ggy~yR~~~yreA~~~WA~Aa~Vir~YnY~ReDEEIYKEfldIANe 370 (618) T pfam05053 323 YLAGYHNRLRKVREALAAWANAADVIQDYNYCREDEEIYKEFFDIAND 370 (618) T ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 212056542018999999987888988614573369999999999987 No 185 >KOG2300 consensus Probab=81.95 E-value=3.1 Score=19.45 Aligned_cols=56 Identities=14% Similarity=0.162 Sum_probs=38.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCC---CHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 55666665443223456899988652899---811599999999999974987999999 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANYSD---AEHAEEAMARLVEAYVALALMDEAREV 249 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~---t~~~~eAl~~l~~~y~~lg~~d~A~~~ 249 (271) +.-|-+-+.++++..|..+..+.++.--. .....-.+..|+....-+|+.-++++- T Consensus 449 ~v~glfaf~qn~lnEaK~~l~e~Lkmanaed~~rL~a~~LvLLs~v~lslgn~~es~nm 507 (629) T KOG2300 449 YVYGLFAFKQNDLNEAKRFLRETLKMANAEDLNRLTACSLVLLSHVFLSLGNTVESRNM 507 (629) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHC T ss_conf 99999999855289999999998754040257788878999999999981452887725 No 186 >COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=81.86 E-value=0.96 Score=22.42 Aligned_cols=27 Identities=7% Similarity=0.189 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999999999722687533 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) .|.+..++.++++++++..|||++..+ T Consensus 2 ~~~~~~~i~~lll~lllva~C~~s~~~ 28 (310) T COG4594 2 HMKKTAIILTLLLLLLLVAACSSSDNN 28 (310) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 750359999999999999771476765 No 187 >KOG1463 consensus Probab=81.21 E-value=3.3 Score=19.29 Aligned_cols=59 Identities=14% Similarity=0.235 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC--C-HHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 66555666665443223456899988652899--8-11599999999999974987999999 Q gi|254781174|r 191 AKEVEIGRYYLKRGEYVAAIPRFQLVLANYSD--A-EHAEEAMARLVEAYVALALMDEAREV 249 (271) Q Consensus 191 ~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~--t-~~~~eAl~~l~~~y~~lg~~d~A~~~ 249 (271) .-++..|-++-..++|.-|-.+|-+..+.|.. + ..+-.+|..|.-+-..++..|+-... T Consensus 210 ~lDLqSGIlha~ekDykTafSYFyEAfEgf~s~~~~v~A~~sLKYMlLcKIMln~~ddv~~l 271 (411) T KOG1463 210 TLDLQSGILHAAEKDYKTAFSYFYEAFEGFDSLDDDVKALTSLKYMLLCKIMLNLPDDVAAL 271 (411) T ss_pred HHHHHCCCEEECCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 77885263110003420489999999705012477489999999999999982699999999 No 188 >KOG2796 consensus Probab=80.90 E-value=3.4 Score=19.23 Aligned_cols=140 Identities=11% Similarity=0.087 Sum_probs=67.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 8998778877654555679999988754011-211013555443444555544311123458999999999998520010 Q gi|254781174|r 95 KSLLMSAFVQYSAGKYQQAASLGEEYITQYP-ESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSP 173 (271) Q Consensus 95 ~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P-~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ 173 (271) ..++.++-++...++|.-+...+.+.++.+| ..|....+.-.+ ..|+ ++.+.|..+|+++-+ T Consensus 178 ~Vmy~~~~~llG~kEy~iS~d~~~~vi~~~~e~~p~L~s~Lgr~----~MQ~-------GD~k~a~~yf~~vek------ 240 (366) T KOG2796 178 RVMYSMANCLLGMKEYVLSVDAYHSVIKYYPEQEPQLLSGLGRI----SMQI-------GDIKTAEKYFQDVEK------ 240 (366) T ss_pred HHHHHHHHHHHCCHHHHEEHHHHHHHHHHCCCCCHHHHHHHHHH----HHHC-------CCHHHHHHHHHHHHH------ T ss_conf 99999999983630451109999999863873067999998788----8860-------438889989999998------ Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 13456555555566666665556666654432234568999886528998115999999999999749879999999999 Q gi|254781174|r 174 YVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLI 253 (271) Q Consensus 174 ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l 253 (271) ...++..++...-- ....+-.|.-.++|..|...+.+++..-|.++.+ .-.-+-+...+|.-..|.++.+.+ T Consensus 241 ----~~~kL~~~q~~~~V-~~n~a~i~lg~nn~a~a~r~~~~i~~~D~~~~~a---~NnKALcllYlg~l~DAiK~~e~~ 312 (366) T KOG2796 241 ----VTQKLDGLQGKIMV-LMNSAFLHLGQNNFAEAHRFFTEILRMDPRNAVA---NNNKALCLLYLGKLKDALKQLEAM 312 (366) T ss_pred ----HHHHHHCCCHHHHH-HHHHHHEEECCCCHHHHHHHHHHCCCCCCCCHHH---HCHHHHHHHHHHHHHHHHHHHHHH T ss_conf ----76143234046788-7634420221333599999885042248987022---051999999987788999999987 Q ss_pred HHHCCC Q ss_conf 785699 Q gi|254781174|r 254 QERYPQ 259 (271) Q Consensus 254 ~~~yP~ 259 (271) ...-|. T Consensus 313 ~~~~P~ 318 (366) T KOG2796 313 VQQDPR 318 (366) T ss_pred HCCCCC T ss_conf 611886 No 189 >COG1704 LemA Uncharacterized conserved protein [Function unknown] Probab=80.89 E-value=3.4 Score=19.23 Aligned_cols=68 Identities=19% Similarity=0.209 Sum_probs=50.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 345899999999999852001013456555555566666665556666654432234568999886528998115 Q gi|254781174|r 152 QRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHA 226 (271) Q Consensus 152 ~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~ 226 (271) ++....++...--+-+.||+ -.|......+...|.+-|=.|+ ..|+.|..|+..|+..++.||....+ T Consensus 94 q~~Ls~~L~rl~a~~E~YPd----LKAn~~f~~Lq~ql~~tEn~Ia---~aR~~YN~av~~yN~~i~~FPs~ivA 161 (185) T COG1704 94 QAELSSALGRLFAVAEAYPD----LKANENFLELQSQLEGTENRIA---VARRLYNEAVRDYNVKIRSFPSNIVA 161 (185) T ss_pred HHHHHHHHHHHHHHHHHCCC----HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999996842----2121689999999885789999---99999999999998999981699999 No 190 >pfam06552 TOM20_plant Plant specific mitochondrial import receptor subunit TOM20. This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesized in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria. Probab=80.57 E-value=3.5 Score=19.16 Aligned_cols=67 Identities=16% Similarity=0.250 Sum_probs=47.1 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 5679999988754011211013555443444555---544311123458999999999998520010134565 Q gi|254781174|r 110 YQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQ---MIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGAR 179 (271) Q Consensus 110 y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~---~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~ 179 (271) -.+|+.-+++.+...|.-.+ |++-+|.++-. ..+|........++|...|+..++.-|+.+....+. T Consensus 51 i~dAisKlEeAL~InP~khd---alWclGNA~TS~~Fl~pD~~~A~~~F~kA~~~Fq~AvdeeP~NelYrKsL 120 (186) T pfam06552 51 IQDAISKLEEALVINPKKHD---ALWCIGNAYTSHAFLTPDQTEAKDNFDKATQFFQQAVDEQPDNDLYRKSL 120 (186) T ss_pred HHHHHHHHHHHHCCCCCCCC---HHHEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 99999999998713988753---33300541101513178789999889999999999863498618999999 No 191 >PRK06330 transcript cleavage factor/unknown domain fusion protein; Validated Probab=80.37 E-value=3.5 Score=19.12 Aligned_cols=129 Identities=16% Similarity=0.256 Sum_probs=56.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH- Q ss_conf 8998778877654555679999988754011211013555443444555544311123458999999999998520010- Q gi|254781174|r 95 KSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSP- 173 (271) Q Consensus 95 ~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~- 173 (271) -|+..+|.+|-+.++-+++....+++++.--+.|.++. ..|..+ +. .+..+|+..|+..+.+|-+.+ T Consensus 118 ~AL~~la~~y~~~~~n~el~~iwerlv~~D~~~~e~~~---~~a~~~-e~--------~d~~kai~y~k~A~~rf~~~k~ 185 (906) T PRK06330 118 LALYTLAEAYAKLKKNKELKVIWERLVKADRDNPEIVK---KLATSY-EE--------EDKEKSITYLKKAIYRFIKKKQ 185 (906) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHH---HHHHHH-HH--------CCHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999835774389999999971344668999---998888-74--------0268899999999999987754 Q ss_pred HH-------------H-HHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 13-------------4-56555555566666665--------55666665443223456899988652899811599999 Q gi|254781174|r 174 YV-------------K-GARFYVTVGRNQLAAKE--------VEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMA 231 (271) Q Consensus 174 ya-------------~-~A~~~l~~~~~~La~~e--------~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~ 231 (271) ++ + +-.+-+...+..++..+ .-+=.+|....+|..++.-++.+++.=+.+. .|.. T Consensus 186 ~~~~~eiw~klv~~~~dd~~~f~~i~~~~~~~~~~~~~~~l~~~i~~~y~~~e~~~~~i~~lK~iL~~d~~~~---kaR~ 262 (906) T PRK06330 186 YVGIEEIWSKLVHNNSDDFDFFLRIERKILGHREFTRLVGLLYDIVEPYKQLENWDEVIYILKKILEHDNKSN---KARE 262 (906) T ss_pred HCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHCCHHHH---HHHH T ss_conf 1228899999986086317899999999987643470366416767887612768899999999973350106---8899 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254781174|r 232 RLVEAYV 238 (271) Q Consensus 232 ~l~~~y~ 238 (271) +|+++|- T Consensus 263 ~li~~~k 269 (906) T PRK06330 263 ELIRFYK 269 (906) T ss_pred HHHHHHH T ss_conf 9999999 No 192 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=79.69 E-value=0.91 Score=22.55 Aligned_cols=22 Identities=9% Similarity=0.377 Sum_probs=14.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999999972268 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWER 40 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~ 40 (271) |.|..+++.+.++.+.|+|||+ T Consensus 1 Mmr~~~L~~~~~~~f~LtGCsa 22 (174) T TIGR02747 1 MMRLKVLLLLACVAFLLTGCSA 22 (174) T ss_pred CCHHHHHHHHHHHHHHHHHHHH T ss_conf 9067899999999872002577 No 193 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=79.64 E-value=2.4 Score=20.10 Aligned_cols=25 Identities=16% Similarity=0.208 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999999999999997226875333 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) |..+++.++++.++|+||+.+..+. T Consensus 4 k~~~li~ll~~~~lL~gC~~~~~~~ 28 (385) T PRK09859 4 RRKLLIPLLFCGAMLTACDDKSAEN 28 (385) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 0678999999999995379997687 No 194 >PRK11138 outer membrane protein assembly complex subunit YfgL; Provisional Probab=79.47 E-value=1.8 Score=20.76 Aligned_cols=24 Identities=21% Similarity=0.141 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999999999999722687533344 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSSRDVY 47 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~~~~~ 47 (271) +++..++++.+|+|||+...++.. T Consensus 6 ~~~~~l~~~~ll~gCs~~~~~~~~ 29 (394) T PRK11138 6 LLLPGLLSVALLSGCSLFDSEEDV 29 (394) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 799999999998652378888777 No 195 >COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms] Probab=79.15 E-value=3.9 Score=18.89 Aligned_cols=159 Identities=16% Similarity=0.117 Sum_probs=89.9 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999985304773026899877887765455------5679999988754011211013555443444555544311 Q gi|254781174|r 76 AYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGK------YQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVP 149 (271) Q Consensus 76 A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~------y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~ 149 (271) ++...+-+...+|..+... +.-.+--+-|+..+ -.+++...+++-+.+-+.+..+.-.|..+. .+++... T Consensus 166 ~v~~~~~ie~lwpe~D~kk-a~s~lhTtvyqlRKaLs~L~~ne~vts~d~~Ykld~~~~k~Dv~e~es~~---rqi~~in 241 (361) T COG3947 166 EVTSWEAIEALWPEKDEKK-ASSLLHTTVYQLRKALSRLNANEAVTSQDRKYKLDAGLPKYDVQEYESLA---RQIEAIN 241 (361) T ss_pred CCCHHHHHHHHCCCCCHHH-HHHHHHHHHHHHHHHHCHHCCCCEEEECCCCEEEECCCCCCCHHHHHHHH---HHHHCCC T ss_conf 5507579999703200466-89999999999999845303675489737724774488632099999886---3543135 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC-------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 1234589999999999985200-------101345655555556666666555666665443223456899988652899 Q gi|254781174|r 150 YDQRATKLMLQYMSRIVERYTN-------SPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSD 222 (271) Q Consensus 150 ~d~~~~~~A~~~f~~~i~~yP~-------S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~ 222 (271) ..+...++++..|-+ ++.|.+-+.++..+.-.+- ..+++.|.+-|.+..|+..-+.++.--|- T Consensus 242 -------ltide~kelv~~ykgdyl~e~~y~Waedererle~ly~kll---gkva~~yle~g~~neAi~l~qr~ltldpL 311 (361) T COG3947 242 -------LTIDELKELVGQYKGDYLPEADYPWAEDERERLEQLYMKLL---GKVARAYLEAGKPNEAIQLHQRALTLDPL 311 (361) T ss_pred -------CCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHH---HHHHHHHHHCCCHHHHHHHHHHHHHCCHH T ss_conf -------67789999999853776774446510036899999999999---99999999759927899999877202704 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 81159999999999997498799999999 Q gi|254781174|r 223 AEHAEEAMARLVEAYVALALMDEAREVVS 251 (271) Q Consensus 223 t~~~~eAl~~l~~~y~~lg~~d~A~~~~~ 251 (271) .++....+..++..+|+.-.+.+-+. T Consensus 312 ---~e~~nk~lm~~la~~gD~is~~khye 337 (361) T COG3947 312 ---SEQDNKGLMASLATLGDEISAIKHYE 337 (361) T ss_pred ---HHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf ---46789999999998065043423899 No 196 >pfam10140 essB Predicted membrane protein essB. Members of this family of prokaryotic proteins include the virulence factor essB, which is required for the synthesis and secretion of EsxA and EsxB, both ESAT-6 like proteins. Probab=78.73 E-value=4 Score=18.81 Aligned_cols=53 Identities=11% Similarity=0.105 Sum_probs=36.0 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 689999999999980989999999999985304773026899877887765455567 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQ 112 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~ 112 (271) +.++.+..|...|=+.+|++.+..++.+.. ..+-..+++.+|++|-+..+..+ T Consensus 221 P~q~~i~~a~~~fi~~dYs~VI~~L~~~~~----~~lp~~~kY~LA~SyV~~e~Ls~ 273 (359) T pfam10140 221 PKQEAILEANTAFLNVDYSKVITTLEKYDP----ESLPKSVKYELAYSYVQVEKLSD 273 (359) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHCCH----HHCCHHHHHHHHHHHHHCCCCCH T ss_conf 048899999999986258999987620793----24997777789899996166679 No 197 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=78.57 E-value=2.2 Score=20.27 Aligned_cols=21 Identities=19% Similarity=0.232 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999972268 Q gi|254781174|r 20 YKFALTIFFSIAVCFLVGWER 40 (271) Q Consensus 20 ~k~~~~i~~~i~~~~l~~Cs~ 40 (271) .|......++++.++++||++ T Consensus 2 tk~k~~~~il~~al~l~GCs~ 22 (200) T COG3417 2 TKMKIYASILLLALFLSGCSS 22 (200) T ss_pred CHHHHHHHHHHHHHHHHHCCC T ss_conf 257999999999999841136 No 198 >TIGR03524 GldJ gliding motility-associated lipoprotein GldJ. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldJ is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldJ abolish the gliding phenotype. GldJ is homologous to GldK. There is a GldJ homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=77.96 E-value=2.6 Score=19.92 Aligned_cols=27 Identities=15% Similarity=0.232 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999999999722687533 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) .|.....++|++++.++++||+++... T Consensus 2 ~~~~~~~~~~~~~~~~~~~sC~~~~~~ 28 (559) T TIGR03524 2 NMKNVFKITFIVFVSLLFVSCKKKSSS 28 (559) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 467889999999999997642588888 No 199 >KOG2471 consensus Probab=77.28 E-value=4.4 Score=18.56 Aligned_cols=88 Identities=13% Similarity=0.086 Sum_probs=39.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999999999999999722687533--34456777673689999999999980989999999999985304773 Q gi|254781174|r 14 AWAYQLYKFALTIFFSIAVCFLVGWERQSSR--DVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAG 91 (271) Q Consensus 14 ~~~~~m~k~~~~i~~~i~~~~l~~Cs~~~~~--~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~ 91 (271) |-+|+|-+....++++.-.+- -.|+.-..+ ....-.......-+-+|+-|..++..|+-..|.+-|.+.+..|--.| T Consensus 291 cIh~~~~~y~~~~~~F~kAL~-N~c~qL~~g~~~~~~~tls~nks~eilYNcG~~~Lh~grPl~AfqCf~~av~vfh~nP 369 (696) T KOG2471 291 CIHYQLGCYQASSVLFLKALR-NSCSQLRNGLKPAKTFTLSQNKSMEILYNCGLLYLHSGRPLLAFQCFQKAVHVFHRNP 369 (696) T ss_pred EEEEEHHHHHHHHHHHHHHHH-HHHHHHHCCCCCCCCEEHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCC T ss_conf 475201257888999999999-9999996457888603321053024677423799863890799999999999982183 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 02689987788776 Q gi|254781174|r 92 VARKSLLMSAFVQY 105 (271) Q Consensus 92 ~a~~A~~~la~~~y 105 (271) +--+.+|+|-. T Consensus 370 ---rlWLRlAEcCi 380 (696) T KOG2471 370 ---RLWLRLAECCI 380 (696) T ss_pred ---HHHHHHHHHHH T ss_conf ---89999999999 No 200 >COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only] Probab=75.88 E-value=4.7 Score=18.37 Aligned_cols=62 Identities=23% Similarity=0.207 Sum_probs=46.5 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 99998098999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 65 VLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 65 ~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) ....+.++...|.+.+.++...-|. |+ ..-|.+|+...+.|+.+.|...|++.++..|.... T Consensus 3 ~~~~~~~D~~aaaely~qal~lap~--w~-~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~ 64 (287) T COG4976 3 YMLAESGDAEAAAELYNQALELAPE--WA-AGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHG 64 (287) T ss_pred CHHCCCCCHHHHHHHHHHHHHCCCH--HH-HHHHHCCHHHHHCCCHHHHHHHHHHHHCCCCCCCC T ss_conf 0010367848999999877504704--44-34554023666304189999999988707962122 No 201 >PRK10702 endonuclease III; Provisional Probab=75.52 E-value=4.9 Score=18.28 Aligned_cols=166 Identities=10% Similarity=0.030 Sum_probs=85.9 Q ss_pred HHHHHHHHHHHHHCCCCCCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH------HHHHHHHHHHH Q ss_conf 99999999998530477302----689987788776545556799999887540112110135------55443444555 Q gi|254781174|r 74 SKAYEYFNQCSRDFPFAGVA----RKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDY------VYYLVGMSYAQ 143 (271) Q Consensus 74 ~~A~~~f~~i~~~~P~s~~a----~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~------A~y~~a~~~~~ 143 (271) .+|.+.++.+.+.||...-. .-=..+++...-++-........++++.+.+|+-+.+.. ..+..+..||. T Consensus 4 ~~~~~i~~~l~~~~p~~~~~l~~~~P~~vLVs~ILsqqTtd~~v~~~~~~L~~~~~t~e~la~a~~~el~~~i~~~G~y~ 83 (211) T PRK10702 4 AKRLEILTRLRDNNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKATAKLYPVANTPAAMLELGVEGVKTYIKTIGLYN 83 (211) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 99999999999878699998588985899999999741858999999999999779999987099999999999863599 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHCCC Q ss_conf 5443111234589999999999985200101345655555556666666555666665443223456899988-652899 Q gi|254781174|r 144 MIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLV-LANYSD 222 (271) Q Consensus 144 ~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~-i~~yp~ 222 (271) + + -.+. ...-+.++++|-+ . .++....+..+ . . -|.|.|+....-.+ ....+= T Consensus 84 ~-----K-A~~L---~~~a~~i~~~~~G-~-vP~~~~~L~~L----p----G-------IG~kTA~aIl~~a~~~~~~~V 137 (211) T PRK10702 84 S-----K-AENV---IKTCRILLEKHNG-E-VPEDRAALEAL----P----G-------VGRKTANVVLNTAFGWPTIAV 137 (211) T ss_pred H-----H-HHHH---HHHHHHHHHHHCC-C-CHHHHHHHHHH----H----C-------CCHHHHHHHHHHHCCCCCCCC T ss_conf 9-----9-9999---9999999999099-8-76669999987----6----6-------358899999999849986525 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 811599999999999974987999999999978569997899999860 Q gi|254781174|r 223 AEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYVETLV 270 (271) Q Consensus 223 t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a~~~l 270 (271) ..++...+.|++-+ .+. ...+....+....|...|......++ T Consensus 138 DtnV~RV~~Rlg~~---~~~--~~~~~~~~l~~~~p~~~~~~~~~~li 180 (211) T PRK10702 138 DTHIFRVCNRTQFA---PGK--NVEQVEEKLLKVVPAEFKVDCHHWLI 180 (211) T ss_pred CCCHHHHHHHHCCC---CCC--CHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 73599999997657---789--99999999998289023799999999 No 202 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=74.85 E-value=5.1 Score=18.18 Aligned_cols=147 Identities=12% Similarity=0.049 Sum_probs=84.2 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH---------- Q ss_conf 9999988754011211013555443444555544311123458999999999998520010134565555---------- Q gi|254781174|r 113 AASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYV---------- 182 (271) Q Consensus 113 A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l---------- 182 (271) +....+..+...|+.+.+-.. |+.|.-+.. ..|.....+|++.+++.++.-|+..|+...+.-. T Consensus 314 ~~~~~~~l~~~lP~~~~aL~~-~~~A~~~l~-----~~d~~~l~~~idLl~~~~~~ePdn~Yv~Ae~yi~~~~~~~l~~~ 387 (512) T PRK10153 314 PERMQERLQQGLPHQGAALML-FYQAHHYLN-----SGDAKSLNKGSDLLEEILKSEPDFTYALAEKALADHVRQSLQPL 387 (512) T ss_pred HHHHHHHHHHHCCCCHHHHHH-HHHHHHHHH-----CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 488999998748998899999-998515663-----03478899999999999972998389999999999999854988 Q ss_pred -----HHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHH Q ss_conf -----555666666655------------566666544322345689998865289981159999999999997498799 Q gi|254781174|r 183 -----TVGRNQLAAKEV------------EIGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDE 245 (271) Q Consensus 183 -----~~~~~~La~~e~------------~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~ 245 (271) ..++..+....- ..|-.....|+.+.|-..+... |......-..-.+|+.+...|+.+. T Consensus 388 ~~~~l~~Ln~~l~~~~~~~~~~~~~riyeAlAl~al~~~~~~~A~~~L~~~----~~~~~Svl~YILlgK~aEl~G~~~~ 463 (512) T PRK10153 388 DEKQLAALNTELDNIVALPGVNVLSRIYEALAVQALVKGDTDEAYQALNKA----IELEMSWLNYVLLGKVAELKGDNRL 463 (512) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHC----HHHHHHHHHHHHHHHHHHHCCCHHH T ss_conf 899999999999998733246887288999999998479999999999867----5645569999999999996499668 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 999999997856999789999986 Q gi|254781174|r 246 AREVVSLIQERYPQGYWARYVETL 269 (271) Q Consensus 246 A~~~~~~l~~~yP~s~~~~~a~~~ 269 (271) |.+.+..-..-=|.-+-+.-++++ T Consensus 464 A~e~Y~~Af~~~~s~~tl~l~enL 487 (512) T PRK10153 464 AADAYSTAFNLRPGENTYYLIENL 487 (512) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 899999998038838999998625 No 203 >pfam11777 DUF3316 Protein of unknown function (DUF3316). This family of bacterial proteins has no known function. Several members are, however, annotated as being putative acyl-CoA synthetase, but this could not be confirmed. Probab=74.16 E-value=5.3 Score=18.07 Aligned_cols=52 Identities=10% Similarity=0.024 Sum_probs=29.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 9999999999999999972268753334456777673689999999999980 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKE 70 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~ 70 (271) |.+.+++.+++++....++|+............+...+..+.|+.|...+++ T Consensus 1 mKk~~ll~~~ll~s~~AfA~~~~~~~~~~~~~~~~~~tk~~AY~~G~~~~~~ 52 (114) T pfam11777 1 MKKLILLALLLLLSATAFAGNYVSTTQTDSLQTDVYATKQAAYGAGFTNLND 52 (114) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 9108999999999888875555365562589715530599999999999998 No 204 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=73.47 E-value=4.1 Score=18.73 Aligned_cols=22 Identities=27% Similarity=0.498 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999999997226875 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQS 42 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~ 42 (271) |+.++++++++.++|+||++.. T Consensus 2 ~~~~l~~~~~~Al~L~GC~~~~ 23 (215) T TIGR02722 2 KYKILIFVALLALLLSGCVSQR 23 (215) T ss_pred HHHHHHHHHHHHHHHHCCCCCC T ss_conf 3787899999999985468872 No 205 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=73.36 E-value=3.3 Score=19.32 Aligned_cols=22 Identities=14% Similarity=0.190 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999999999722687 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQ 41 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~ 41 (271) |.|..+ ..+++++++|+||++. T Consensus 1 m~k~~~-~~~l~~~l~lsgC~s~ 22 (160) T PRK09967 1 MIKHLV-APLIFTSLILTGCQSP 22 (160) T ss_pred CHHHHH-HHHHHHHHHHHHCCCC T ss_conf 907899-9999999999845899 No 206 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=72.13 E-value=4.8 Score=18.33 Aligned_cols=24 Identities=21% Similarity=0.211 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999999722687533 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) |+.+.++.+++.++|+||+..... T Consensus 6 r~~~~l~~l~~~~~L~gC~~~~~~ 29 (385) T PRK09578 6 RRRLALAALVAAFALAGCGKGDSD 29 (385) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 799999999999998167999878 No 207 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=72.06 E-value=3 Score=19.56 Aligned_cols=23 Identities=13% Similarity=0.260 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999999997226875333 Q gi|254781174|r 23 ALTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 23 ~~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) ..+|||+++.++|+||+..-++. T Consensus 6 lKlIFIL~LAvLLFSC~ke~ke~ 28 (463) T TIGR01781 6 LKLIFILMLAVLLFSCKKEVKES 28 (463) T ss_pred HHHHHHHHHHHHHHCCCCCCCCC T ss_conf 46888999999860476211331 No 208 >PRK11548 hypothetical protein; Provisional Probab=71.10 E-value=2.8 Score=19.75 Aligned_cols=24 Identities=13% Similarity=0.078 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999999999999997226875 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQS 42 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~ 42 (271) |....+....++++++++|||+.. T Consensus 1 m~~k~l~~a~ll~~lllsgCS~~~ 24 (113) T PRK11548 1 MRCKTLTAAAAVLLMLTAGCSTLE 24 (113) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 925788999999999980115787 No 209 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=70.61 E-value=6.5 Score=17.58 Aligned_cols=19 Identities=16% Similarity=0.268 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999972268753 Q gi|254781174|r 25 TIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~~~~ 43 (271) ..++++++++|+||++... T Consensus 5 ~~~l~~~~llLsgCat~p~ 23 (203) T PRK00022 5 LRLLPLAALLLAGCASLPP 23 (203) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 8999999999866148999 No 210 >KOG4459 consensus Probab=70.51 E-value=6.5 Score=17.57 Aligned_cols=34 Identities=12% Similarity=0.440 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 7788776545556799999887540112110135 Q gi|254781174|r 99 MSAFVQYSAGKYQQAASLGEEYITQYPESKNVDY 132 (271) Q Consensus 99 ~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~ 132 (271) .+-.+||+.|+...|+.....|+-.+|+++.+-+ T Consensus 138 YL~~ay~k~~~l~kAv~aa~tflv~~Pdde~ik~ 171 (471) T KOG4459 138 YLQFAYFKVGELEKAVAAAHTFLVANPDDEDIKQ 171 (471) T ss_pred HHHHHHHHHHHHHHHHHHCCEEEECCCCHHHHHH T ss_conf 9999999862199999742324421873789999 No 211 >TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype. Probab=69.71 E-value=6.8 Score=17.47 Aligned_cols=20 Identities=15% Similarity=0.268 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHCCCCCCCCC Q ss_conf 99999999972268753334 Q gi|254781174|r 27 FFSIAVCFLVGWERQSSRDV 46 (271) Q Consensus 27 ~~~i~~~~l~~Cs~~~~~~~ 46 (271) .+++++++++||+....... T Consensus 6 ~~i~~~~~~~sC~~~e~RkP 25 (177) T TIGR03516 6 AVILLLLLLLGCKTPEARRP 25 (177) T ss_pred HHHHHHHHHHHCCCCCCCCC T ss_conf 78989899872699646688 No 212 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=68.96 E-value=7 Score=17.37 Aligned_cols=79 Identities=13% Similarity=-0.022 Sum_probs=36.0 Q ss_pred HHHHHHHHH--HHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 999999999--999999997226875333445677767368999999999998098999999999998530477302689 Q gi|254781174|r 19 LYKFALTIF--FSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKS 96 (271) Q Consensus 19 m~k~~~~i~--~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A 96 (271) |.|..+++. .+.+++.|.+|++.. +.........+..++|+.-..... ++--.-.-.-..+...|.-+...-+. T Consensus 1 Mkkkk~~~~~~~~~~~l~LaaC~s~~---~vat~kgg~IT~~e~y~~lk~~~g-~~~l~~li~~kvl~~~ykVsdkev~~ 76 (283) T PRK02998 1 MKKKKLFIGTIISCVVLALSACGSSD---NVVTSKVGNITEKELSKELRQKYG-ESTLYQMVLSKALLDKYKVSDEEAKK 76 (283) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC---CEEEECCCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 93589999999999999998668998---179843992839999999999999-99999999999999983688999999 Q ss_pred HHHHH Q ss_conf 98778 Q gi|254781174|r 97 LLMSA 101 (271) Q Consensus 97 ~~~la 101 (271) .+... T Consensus 77 ~~~~~ 81 (283) T PRK02998 77 QVEEA 81 (283) T ss_pred HHHHH T ss_conf 99999 No 213 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=68.57 E-value=3 Score=19.49 Aligned_cols=21 Identities=14% Similarity=0.186 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999999999997226875 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWERQS 42 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~~~ 42 (271) ...++++++++++|+||.... T Consensus 2 ~~~~~~~~~~~~lL~GCwd~~ 22 (400) T TIGR02887 2 KLKILLLILALLLLTGCWDKR 22 (400) T ss_pred HHHHHHHHHHHHHHCCCCCCH T ss_conf 466789999999832665620 No 214 >PRK09861 cytoplasmic membrane lipoprotein-28; Provisional Probab=67.50 E-value=7.5 Score=17.19 Aligned_cols=134 Identities=14% Similarity=0.089 Sum_probs=56.9 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHCCCCCCH-HHHHHHHHHH Q ss_conf 9999999999722687533344567776736899999999999809-8999999999998530477302-6899877887 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQ-NFSKAYEYFNQCSRDFPFAGVA-RKSLLMSAFV 103 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~-~y~~A~~~f~~i~~~~P~s~~a-~~A~~~la~~ 103 (271) ..++++.++++||++.+++...+.-..-..+..++.+.+.....++ -++--+ -..+.+. +...+.-|+. T Consensus 12 a~ll~~~~~l~~C~~~~~~~k~ikVG~~~~p~~~i~e~~~~~~~ek~G~~lei---------v~FsDy~~PN~AL~~G~i 82 (272) T PRK09861 12 AALLLAGILLAGCDQSSSDAKHIKVGVINGAEQDVAEVAKKVAKEKYGLDVEL---------VGFSGSLLPNDATNHGEL 82 (272) T ss_pred HHHHHHHHHHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEE---------EEECCCCCHHHHHHCCCC T ss_conf 99999999987458976789738999678986999999998887617976899---------994686226289767983 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCC-------HHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHH--HHHHH Q ss_conf 7654555679999988754011211-------0135554434445555443-----11123458999999999--99852 Q gi|254781174|r 104 QYSAGKYQQAASLGEEYITQYPESK-------NVDYVYYLVGMSYAQMIRD-----VPYDQRATKLMLQYMSR--IVERY 169 (271) Q Consensus 104 ~y~~~~y~~A~~~~~~fi~~~P~s~-------~~~~A~y~~a~~~~~~~~~-----~~~d~~~~~~A~~~f~~--~i~~y 169 (271) .-++-+-...++.|.+.+..+- ..|-+.|-.-......+++ +..|..+..+|+..++. +|..= T Consensus 83 ---DaN~fQH~pyL~~~n~~~g~~L~~v~~~~~~P~glYS~K~ksl~dlp~Ga~IaIPnDpsN~~RAL~lL~~aGLI~Lk 159 (272) T PRK09861 83 ---DANVFQHRPFLEQDNQAHGYKLVAVGNTFVFPMAGYSKKIKTVAQIKEGATVAIPNDPTNLGRALLLLQKEKLITLK 159 (272) T ss_pred ---CCHHHHHHHHHHHHHHHCCCCEEEEEEEEEEEEEECCCCCCCHHHCCCCCEEEECCCHHHHHHHHHHHHHCCCEEEC T ss_conf ---60245579999999998699579973167830140344659875847999898047812699999999988978977 Q ss_pred HC Q ss_conf 00 Q gi|254781174|r 170 TN 171 (271) Q Consensus 170 P~ 171 (271) |+ T Consensus 160 ~~ 161 (272) T PRK09861 160 EG 161 (272) T ss_pred CC T ss_conf 99 No 215 >pfam11207 DUF2989 Protein of unknown function (DUF2989). Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed. Probab=67.21 E-value=7.6 Score=17.16 Aligned_cols=53 Identities=19% Similarity=0.273 Sum_probs=45.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 5566666544322345689998865289-98115999999999999749879999 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANYS-DAEHAEEAMARLVEAYVALALMDEAR 247 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~yp-~t~~~~eAl~~l~~~y~~lg~~d~A~ 247 (271) +.+|.||.+ .+...++.-+..+++.+. +....++.+.-|+-+|+.+|+.++|- T Consensus 145 ~aLAtyY~k-~d~~Kt~~Ll~~aL~l~~~~d~~n~eil~sLas~~~~~~~~~~AY 198 (203) T pfam11207 145 YALATYYIK-RDPDKTIQLLYRALELYTPGDKFNPEILLSLASIYQKQKNYEQAY 198 (203) T ss_pred HHHHHHHHC-CCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHE T ss_conf 999999980-488999999999998058767579799999999998714213430 No 216 >KOG1070 consensus Probab=66.82 E-value=7.8 Score=17.11 Aligned_cols=187 Identities=12% Similarity=0.196 Sum_probs=97.7 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999998530477302689987788776--545556799999887540-1121101355544344455554431112 Q gi|254781174|r 75 KAYEYFNQCSRDFPFAGVARKSLLMSAFVQY--SAGKYQQAASLGEEYITQ-YPESKNVDYVYYLVGMSYAQMIRDVPYD 151 (271) Q Consensus 75 ~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y--~~~~y~~A~~~~~~fi~~-~P~s~~~~~A~y~~a~~~~~~~~~~~~d 151 (271) +..+.|++++...|+|. ..|+-|-.| +.++-+.|...+++++.. +|+-+ -+----.+|+-+++..-..+-. T Consensus 1442 esaeDferlvrssPNSS-----i~WI~YMaf~LelsEiekAR~iaerAL~tIN~REe-eEKLNiWiA~lNlEn~yG~ees 1515 (1710) T KOG1070 1442 ESAEDFERLVRSSPNSS-----ILWIRYMAFHLELSEIEKARKIAERALKTINFREE-EEKLNIWIAYLNLENAYGTEES 1515 (1710) T ss_pred CCHHHHHHHHHCCCCCC-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHHHHHHCCHHH T ss_conf 67788999986099840-----28999999885024567789999998611786416-7777789999867875276899 Q ss_pred -HHHHH------HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH-HH--HHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf -34589------9999999999852001013456555555566666-66--55566666544322345689998865289 Q gi|254781174|r 152 -QRATK------LMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLA-AK--EVEIGRYYLKRGEYVAAIPRFQLVLANYS 221 (271) Q Consensus 152 -~~~~~------~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La-~~--e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp 221 (271) ++-.+ .|...+..+..-|-.++...+|.+-+...-.+.. .. =...|.+.+++.+-.+|..-++..+.--| T Consensus 1516 l~kVFeRAcqycd~~~V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lP 1595 (1710) T KOG1070 1516 LKKVFERACQYCDAYTVHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLP 1595 (1710) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCC T ss_conf 99999999874336889999887998762310489999999998653256899999998514478899999999986465 Q ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC--CHHHHHHHH Q ss_conf 98115999999999999749879999999999785699--978999998 Q gi|254781174|r 222 DAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQ--GYWARYVET 268 (271) Q Consensus 222 ~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~--s~~~~~a~~ 268 (271) ... ..+-+-..+..-++.|+.+-++..+.-+...||. .-|..++.. T Consensus 1596 k~e-Hv~~IskfAqLEFk~GDaeRGRtlfEgll~ayPKRtDlW~VYid~ 1643 (1710) T KOG1070 1596 KQE-HVEFISKFAQLEFKYGDAERGRTLFEGLLSAYPKRTDLWSVYIDM 1643 (1710) T ss_pred HHH-HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 023-688999999987651884356789887776385430279999998 No 217 >KOG2114 consensus Probab=65.49 E-value=8.3 Score=16.96 Aligned_cols=147 Identities=12% Similarity=0.130 Sum_probs=75.0 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHCCCCCHHHH Q ss_conf 999999980989999999999985304773026899877887765455567999998875---------40112110135 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYI---------TQYPESKNVDY 132 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi---------~~~P~s~~~~~ 132 (271) .+-.-.+++.-|..|+..=+ ..+-........+...|.-.|..|++++|...|-+-| +.||+...+-. T Consensus 339 ~kL~iL~kK~ly~~Ai~LAk---~~~~d~d~~~~i~~kYgd~Ly~Kgdf~~A~~qYI~tI~~le~s~Vi~kfLdaq~Ikn 415 (933) T KOG2114 339 TKLDILFKKNLYKVAINLAK---SQHLDEDTLAEIHRKYGDYLYGKGDFDEATDQYIETIGFLEPSEVIKKFLDAQRIKN 415 (933) T ss_pred HHHHHHHHHHHHHHHHHHHH---HCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHH T ss_conf 89999997641799999998---638987899999999878877447778899999987065785999998348999999 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH--------HHH----HH Q ss_conf -55443444555544311123458-------99999999999852001013456555555566--------666----66 Q gi|254781174|r 133 -VYYLVGMSYAQMIRDVPYDQRAT-------KLMLQYMSRIVERYTNSPYVKGARFYVTVGRN--------QLA----AK 192 (271) Q Consensus 133 -A~y~~a~~~~~~~~~~~~d~~~~-------~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~--------~La----~~ 192 (271) +.|+.++.--.. ...|...+ .+-.+-..+||+..|.+...-|...-+..++. .|| .| T Consensus 416 Lt~YLe~L~~~gl---a~~dhttlLLncYiKlkd~~kL~efI~~~~~g~~~fd~e~al~Ilr~snyl~~a~~LA~k~~~h 492 (933) T KOG2114 416 LTSYLEALHKKGL---ANSDHTTLLLNCYIKLKDVEKLTEFISKCDKGEWFFDVETALEILRKSNYLDEAELLATKFKKH 492 (933) T ss_pred HHHHHHHHHHCCC---CCCHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCEEEEHHHHHHHHHHHCHHHHHHHHHHHHCCC T ss_conf 9999999997143---5510588899999985245778999843887615530999999999707678999999985137 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5556666654432234568999 Q gi|254781174|r 193 EVEIGRYYLKRGEYVAAIPRFQ 214 (271) Q Consensus 193 e~~ia~~Y~~~~~y~aA~~~~~ 214 (271) |..+--.+...++|..|+.+++ T Consensus 493 e~vl~ille~~~ny~eAl~yi~ 514 (933) T KOG2114 493 EWVLDILLEDLHNYEEALRYIS 514 (933) T ss_pred HHHHHHHHHHHCCHHHHHHHHH T ss_conf 8899999997108899998886 No 218 >pfam08631 SPO22 Meiosis protein SPO22/ZIP4 like. SPO22/ZIP4 in yeast is a meiosis specific protein involved in sporulation. It has been shown to regulate crossover distribution by promoting synaptonemal complex formation. Probab=64.81 E-value=8.5 Score=16.88 Aligned_cols=194 Identities=15% Similarity=0.142 Sum_probs=94.1 Q ss_pred CCHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHCC---CC--------CCHHHHHHHHHHHHHHHH---HHHHHHHHHHH Q ss_conf 36899999999999809-89999999999985304---77--------302689987788776545---55679999988 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQ-NFSKAYEYFNQCSRDFP---FA--------GVARKSLLMSAFVQYSAG---KYQQAASLGEE 119 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~-~y~~A~~~f~~i~~~~P---~s--------~~a~~A~~~la~~~y~~~---~y~~A~~~~~~ 119 (271) .-....|+-|...+.++ +|.+|+.-+++..+..+ .. ++--..+-.++-++...+ .++.|.....- T Consensus 33 ~La~~~yn~G~~ll~~~~~~~~A~~wL~ra~d~l~~~~~~~~~~~~~~eLr~~iL~~La~a~l~~~~~es~~ka~~~l~~ 112 (280) T pfam08631 33 ELARVLYNIGKSLLERKTDFSEAVKWLQRALDLLEKGAKNDQKSTEGSELKFSILRLLAKALLETGESESVLKALKALSL 112 (280) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 99999999999999758987999999999999862665313368548889999999999998446874689999999999 Q ss_pred HHHHCCCCCHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 754011211013555443444------------55554431112345899999999999852001013456555555566 Q gi|254781174|r 120 YITQYPESKNVDYVYYLVGMS------------YAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRN 187 (271) Q Consensus 120 fi~~~P~s~~~~~A~y~~a~~------------~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~ 187 (271) ....||+++.+. .+.+..+. ...++...+.--.+...++.....+.+..|.+. ..... ..+.+ T Consensus 113 l~~e~~~~~~~~-~L~l~il~~~~~~~~~~~~~L~~mi~~~~~~e~~~~~~l~~i~~l~~~~~~~~--~~~l~--~~l~~ 187 (280) T pfam08631 113 LESEFPNKPELY-LLALEILKKRPGPEEEYEDVLMRMIKSVDVTESNFELAISHINKLSDKAPASA--AKCLD--YLLFN 187 (280) T ss_pred HHHHCCCCCHHH-HHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHH--HHHHH--HHHHH T ss_conf 986178872489-99999997389964899999999998467410229999999999985075779--99999--99999 Q ss_pred HHHHHH---H----HHHHHHHH-HHHHHHHHHH---HHHHH---HH---CCCCHHHHHH----HHHHHHHHHHCCCHHHH Q ss_conf 666665---5----56666654-4322345689---99886---52---8998115999----99999999974987999 Q gi|254781174|r 188 QLAAKE---V----EIGRYYLK-RGEYVAAIPR---FQLVL---AN---YSDAEHAEEA----MARLVEAYVALALMDEA 246 (271) Q Consensus 188 ~La~~e---~----~ia~~Y~~-~~~y~aA~~~---~~~~i---~~---yp~t~~~~eA----l~~l~~~y~~lg~~d~A 246 (271) ++..++ + -+-++... ++.-..+..+ .+.++ .. -|-+..+..| +...+..+++.+++++| T Consensus 188 r~~~~~~~~~~e~~vv~r~~~~t~~~~~~~~~~i~~l~~~~~~~~~~~~~~ls~~~~~a~~~LLW~~~~~~~~~k~Y~~A 267 (280) T pfam08631 188 RCNSSEDSNLLEKAVVTRVLLTTQSKELGAGEKIEVLAEFLSRVEKLLTKPLSAKAATAIHALLWNKGKKNYKTKNYDEA 267 (280) T ss_pred HHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 85678865699999999999861675432088999999999999987147777999999999999998999987279999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254781174|r 247 REVVSLI 253 (271) Q Consensus 247 ~~~~~~l 253 (271) ..++..- T Consensus 268 ~~W~~~a 274 (280) T pfam08631 268 LEWYKLA 274 (280) T ss_pred HHHHHHH T ss_conf 9999999 No 219 >PRK06481 fumarate reductase flavoprotein subunit; Validated Probab=63.80 E-value=8.9 Score=16.77 Aligned_cols=26 Identities=8% Similarity=0.040 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999999999997226875333445 Q gi|254781174|r 23 ALTIFFSIAVCFLVGWERQSSRDVYL 48 (271) Q Consensus 23 ~~~i~~~i~~~~l~~Cs~~~~~~~~~ 48 (271) +.++.+.++.++|.||++..+..... T Consensus 6 ~~~~~~~~~~~~~~~c~~~~~~~~~~ 31 (506) T PRK06481 6 TTLGMLLLLALILTGCGSNTTSKSDS 31 (506) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 99999999999986507999998888 No 220 >COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane] Probab=63.73 E-value=8.9 Score=16.76 Aligned_cols=29 Identities=21% Similarity=0.244 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999999999999722687533344 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSRDVY 47 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~ 47 (271) |-++...++..+++++|.||+...+.... T Consensus 3 ~~~~~~~~l~~~As~LL~aC~~~~~~~~~ 31 (206) T COG3017 3 MMKRLLFLLLALASLLLTACTLTASRPPN 31 (206) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 27778789999999999756566777877 No 221 >COG5510 Predicted small secreted protein [Function unknown] Probab=62.51 E-value=5.5 Score=18.00 Aligned_cols=23 Identities=17% Similarity=0.135 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHHHHH-HHHCCC Q ss_conf 99999999999999999-972268 Q gi|254781174|r 18 QLYKFALTIFFSIAVCF-LVGWER 40 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~-l~~Cs~ 40 (271) +|.|+++++++++++++ +.+|.. T Consensus 1 mmk~t~l~i~~vll~s~llaaCNT 24 (44) T COG5510 1 MMKKTILLIALVLLASTLLAACNT 24 (44) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 904899999999999999988663 No 222 >KOG2063 consensus Probab=62.02 E-value=9.6 Score=16.58 Aligned_cols=67 Identities=15% Similarity=0.213 Sum_probs=47.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHCCCH--HHHHHHHHHHHHHCCCC Q ss_conf 5566666544322345689998865289-9811599999999999974987--99999999997856999 Q gi|254781174|r 194 VEIGRYYLKRGEYVAAIPRFQLVLANYS-DAEHAEEAMARLVEAYVALALM--DEAREVVSLIQERYPQG 260 (271) Q Consensus 194 ~~ia~~Y~~~~~y~aA~~~~~~~i~~yp-~t~~~~eAl~~l~~~y~~lg~~--d~A~~~~~~l~~~yP~s 260 (271) ..++.+|..+|.+.+|+.-+..+++... ......+.+-++.+=..++|.. +..-++..-+....|+. T Consensus 508 ~~Li~LY~~kg~h~~AL~ll~~l~d~~~~~d~~~~~~~e~ii~YL~~l~~~~~~Li~~y~~wvl~~~p~~ 577 (877) T KOG2063 508 RELIELYATKGMHEKALQLLRDLVDEDSDTDSFQLDGLEKIIEYLKKLGAENLDLILEYADWVLNKNPEA 577 (877) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHH T ss_conf 9999999862655999999999723644232114443788999998745653668898766543149412 No 223 >PRK02944 OxaA-like protein precursor; Validated Probab=61.06 E-value=10 Score=16.48 Aligned_cols=67 Identities=15% Similarity=0.112 Sum_probs=34.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH-HH--HHHHHH---HHHHCCHHHHHHHHHHHHHH--CCCC Q ss_conf 99999999999999972268753334456777673689-99--999999---99809899999999999853--0477 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQR-EV--YEKAVL---FLKEQNFSKAYEYFNQCSRD--FPFA 90 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~-~l--Y~~a~~---~~~~~~y~~A~~~f~~i~~~--~P~s 90 (271) |..++..++.+.++|+||+..++..+. .+.....+ -. ...... ..-.|+|.-||-.+.-+++. .|-+ T Consensus 4 ~~~~~~~~~~~~~~lsgC~~~~~~~~~---~~~g~~~~~~v~p~~~~i~~~a~~~g~~yGlaIIl~TiiVRlillPL~ 78 (255) T PRK02944 4 KLGLLAMVVALMAILAGCSEVNQPITP---KSTGIWNEYFVYPLSQLITYFANLFGSNYGLAIIVVTLLIRLAILPLM 78 (255) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCC---CCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999624688999999---999818999999999999999986497199999999999999999889 No 224 >TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems. Probab=60.76 E-value=10 Score=16.44 Aligned_cols=20 Identities=15% Similarity=0.292 Sum_probs=6.6 Q ss_pred HHHHHHHCCHHHHHHHHHHH Q ss_conf 99999809899999999999 Q gi|254781174|r 64 AVLFLKEQNFSKAYEYFNQC 83 (271) Q Consensus 64 a~~~~~~~~y~~A~~~f~~i 83 (271) |..|++-|+-+.|...+++| T Consensus 6 A~aYiemGD~egAr~lL~EV 25 (44) T TIGR03504 6 ARAYIEMGDLEGARELLEEV 25 (44) T ss_pred HHHHHHCCCHHHHHHHHHHH T ss_conf 99999829889899999999 No 225 >TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family; InterPro: IPR003282 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia. The type III secretion system is of great interest as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis . However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself, type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. One of the outer membrane protein subunit families, termed "K" here for nomenclature purposes, aids in the structural assembly of the invasion complex . It is also described as a lipoprotein. Members of this family include the Salmonella PrgK and SsaJ genes, MxiJ from Shigella, YscJ from Yersinia, and from the plant enteropathogens NolT (Rhizobium) and HrcJ Erwinia). The flagellar M-ring protein FliF also shares a low level of similarity, presumably due to evolution of the type III secretion system from the flagellar biosynthetic pathway.; GO: 0009306 protein secretion. Probab=59.46 E-value=11 Score=16.31 Aligned_cols=16 Identities=13% Similarity=0.366 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 9999999999972268 Q gi|254781174|r 25 TIFFSIAVCFLVGWER 40 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~ 40 (271) .+++++++++|+||.. T Consensus 4 ~~~~~~l~llL~GC~~ 19 (203) T TIGR02544 4 KLLLLLLLLLLTGCKV 19 (203) T ss_pred HHHHHHHHHHHHCCCC T ss_conf 8899999999717874 No 226 >PRK10796 LPS-assembly lipoprotein RlpB; Provisional Probab=59.31 E-value=7.6 Score=17.16 Aligned_cols=28 Identities=18% Similarity=0.288 Sum_probs=19.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 9999999999999997226875333445 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSSRDVYL 48 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~~~~~~ 48 (271) |++..+++.++++++.||+-.-.+...+ T Consensus 2 r~l~~l~l~lavll~agCGFhLRg~~~l 29 (196) T PRK10796 2 RYLATLLLSLAVLVTAGCGWHLRGTTQV 29 (196) T ss_pred HHHHHHHHHHHHHHHCCCCCEECCCCCC T ss_conf 5699999999999974767157689999 No 227 >PTZ00116 signal peptidase; Provisional Probab=58.61 E-value=11 Score=16.22 Aligned_cols=31 Identities=26% Similarity=0.333 Sum_probs=20.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9467766899999999999999999999999999972268 Q gi|254781174|r 1 MSAVLGRAICIFEAWAYQLYKFALTIFFSIAVCFLVGWER 40 (271) Q Consensus 1 ~~~~~~~~~~~~~~~~~~m~k~~~~i~~~i~~~~l~~Cs~ 40 (271) |.+++-|++++|.. + ...++++|++.++|+. T Consensus 1 M~S~lsR~N~lf~~--~-------~~~L~vlcal~~~~~~ 31 (185) T PTZ00116 1 MDNFLNRLNVLFYS--M-------ALCFFILCAFNYGTSF 31 (185) T ss_pred CCCHHHHHHHHHHH--H-------HHHHHHHHHHHHCEEE T ss_conf 94167676899999--9-------9999999998742036 No 228 >COG4499 Predicted membrane protein [Function unknown] Probab=58.34 E-value=11 Score=16.20 Aligned_cols=48 Identities=8% Similarity=0.040 Sum_probs=25.8 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999809899999999999853047730268998778877654555 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKY 110 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y 110 (271) +....|...|-+.+|++-+..++.. ..+.+-..+++.+|.+|-...+- T Consensus 250 eai~~a~~aFL~~nY~qVittLe~y----dp~klPksv~Y~LA~SYV~~e~L 297 (434) T COG4499 250 EAIITANTAFLKNNYDQVITTLENY----DPEKLPKSVQYILAVSYVNLEDL 297 (434) T ss_pred HHHHHHHHHHHHCCHHHHHHHCCCC----CHHHCCHHHHHHHHHHHHHCCCC T ss_conf 8899889999853589886302348----91237688899998877400344 No 229 >TIGR01742 SA_tandem_lipo Staphylococcus tandem lipoproteins; InterPro: IPR007595 This family contains several uncharacterised staphylococcal proteins.. Probab=58.00 E-value=4.8 Score=18.32 Aligned_cols=61 Identities=16% Similarity=0.223 Sum_probs=34.4 Q ss_pred HHHHHHHHHHHHH-HHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 9999999999999-999972268753334456777673689999999999980989999999999985304773026 Q gi|254781174|r 19 LYKFALTIFFSIA-VCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVAR 94 (271) Q Consensus 19 m~k~~~~i~~~i~-~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~ 94 (271) ..|+.+.+.++|+ +++++||.....+....+.+ --.+++=.+--+.|+++++.||--.+-+ T Consensus 4 ~~k~~ly~s~~il~I~~vvgC~~~~~eqkkeeqt---------------~k~d~ke~~~Kksf~KtL~mYPIKNLE~ 65 (267) T TIGR01742 4 AKKLALYISVLILIISFVVGCGKFKDEQKKEEQT---------------TKEDSKEEQIKKSFEKTLSMYPIKNLED 65 (267) T ss_pred HHHHHHHHHHHHHHEEEEECCCCCHHHHHHHHHH---------------HHHCCHHHHHCCCHHHCCCCCCCCCCHH T ss_conf 5799999999864300012587520344344421---------------0201214442110110025788344212 No 230 >KOG2908 consensus Probab=57.02 E-value=12 Score=16.06 Aligned_cols=105 Identities=13% Similarity=0.151 Sum_probs=60.4 Q ss_pred CCHHHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHCCCCCH-HHHHHHHHHHHHHH Q ss_conf 9899999999999853047730---2689987788776545556799999887---540112110-13555443444555 Q gi|254781174|r 71 QNFSKAYEYFNQCSRDFPFAGV---ARKSLLMSAFVQYSAGKYQQAASLGEEY---ITQYPESKN-VDYVYYLVGMSYAQ 143 (271) Q Consensus 71 ~~y~~A~~~f~~i~~~~P~s~~---a~~A~~~la~~~y~~~~y~~A~~~~~~f---i~~~P~s~~-~~~A~y~~a~~~~~ 143 (271) ++-++|++.++++.++...... .......+|.++...+|-..+....++. ++.-++-+. +.--+|+.+.-||+ T Consensus 89 ~D~~~al~~Le~i~~~~~~~~e~~av~~~~t~~~r~~L~i~DLk~~kk~ldd~~~~ld~~~~v~~~Vh~~fY~lssqYyk 168 (380) T KOG2908 89 SDKDEALEFLEKIIEKLKEYKEPDAVIYILTEIARLKLEINDLKEIKKLLDDLKSMLDSLDGVTSNVHSSFYSLSSQYYK 168 (380) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 56889999999999998750562159999999998887504389999999999998740158874454338888899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHCCHHHHHH Q ss_conf 5443111234589999999999-9852001013456 Q gi|254781174|r 144 MIRDVPYDQRATKLMLQYMSRI-VERYTNSPYVKGA 178 (271) Q Consensus 144 ~~~~~~~d~~~~~~A~~~f~~~-i~~yP~S~ya~~A 178 (271) .+.+.. ...+.|+....-. ++..|.|+...-| T Consensus 169 ~~~d~a---~yYr~~L~YL~~~d~~~l~~se~~~lA 201 (380) T KOG2908 169 KIGDFA---SYYRHALLYLGCSDIDDLSESEKQDLA 201 (380) T ss_pred HHHHHH---HHHHHHHHHHCCCCCCCCCHHHHHHHH T ss_conf 987689---999999999350321325778999999 No 231 >TIGR03044 PS_II_psb27 photosystem II protein Psb27. Members of this family are the Psb27 protein of the cyanobacterial photosynthetic supracomplex, photosystem II. Although most protein components of both cyanobacterial and chloroplast versions of photosystem II are closely related and described together by single HMM families, this family is strictly bacterial. Some uncharacterized proteins with highly divergent sequences, from Arabidopsis, score between trusted and noise cutoffs for this model but are not at this time assigned as functionally equivalent photosystem II proteins. Probab=56.14 E-value=12 Score=15.98 Aligned_cols=117 Identities=15% Similarity=0.167 Sum_probs=61.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 99999999999999999997226875333445677767368999999999998098999999999998530477302689 Q gi|254781174|r 17 YQLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKS 96 (271) Q Consensus 17 ~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A 96 (271) .++++..+-++ +.+++.+.||+...+.. .+.. |.+ |-..-++.+...+..-..++-..++ T Consensus 5 ~rl~~~alal~-L~~~l~l~~cs~a~~s~-----ltG~------Y~~--------DT~~Vi~tlr~~i~lp~d~pn~~ea 64 (135) T TIGR03044 5 SRLSRAALALV-LGLCLLLTACSGAAKTR-----LTGD------YVE--------DTLAVIQTLREAIDLPDDDPNKSEA 64 (135) T ss_pred HHHHHHHHHHH-HHHHHHHHCCCCCCCCC-----CCCC------HHH--------HHHHHHHHHHHHHCCCCCCCCHHHH T ss_conf 99999999999-99999973456777656-----7875------299--------8999999999998089899888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 98778877654555679999988754011211013555443444555544311123458999999999998520010134 Q gi|254781174|r 97 LLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVK 176 (271) Q Consensus 97 ~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~ 176 (271) -.+|......|+.+|-..+.+. |+.-| +....|+..+.--...||+-|.-+ T Consensus 65 -------------~aear~~indyisrYRr~~~v~------gl~SF----------tTMqTALNalAGHY~sy~nrPlP~ 115 (135) T TIGR03044 65 -------------QAEARQLINDYISRYRRRPRVN------GLSSF----------TTMQTALNSLAGHYKSYANRPLPE 115 (135) T ss_pred -------------HHHHHHHHHHHHHHHCCCCCCC------CCCCH----------HHHHHHHHHHHHHHCCCCCCCCCH T ss_conf -------------9999999999999874777646------87528----------999999999878750688899999 Q ss_pred HHHHHH Q ss_conf 565555 Q gi|254781174|r 177 GARFYV 182 (271) Q Consensus 177 ~A~~~l 182 (271) .-+.++ T Consensus 116 klk~Rl 121 (135) T TIGR03044 116 KLKERL 121 (135) T ss_pred HHHHHH T ss_conf 999999 No 232 >pfam04507 DUF576 Protein of unknown function, DUF576. This family contains several uncharacterized staphylococcal proteins. Probab=56.01 E-value=6.2 Score=17.70 Aligned_cols=49 Identities=20% Similarity=0.272 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCH Q ss_conf 99999999999999997226875333445677767368999999999998098999999999998530477302 Q gi|254781174|r 20 YKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVA 93 (271) Q Consensus 20 ~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a 93 (271) .|+.+.|.++++++++.||+...+.+. +-.+--+.|.+.++.||--.+- T Consensus 5 kk~~L~is~liliifI~Gcg~~~k~ds-------------------------Ke~qIKksF~KtL~mYPiKNLE 53 (257) T pfam04507 5 KKLALYISVLILIIFISGCGFMNKEDS-------------------------KEAQIKKSFAKTLDMYPIKNLE 53 (257) T ss_pred HEEHHHHHHHHHHHHHEECCCCCCCCC-------------------------HHHHHHHHHHHHHHCCCCCCHH T ss_conf 100367889978652000223555665-------------------------7889999998776226644477 No 233 >pfam04348 LppC LppC putative lipoprotein. This family includes several bacterial outer membrane antigens, whose molecular function is unknown. Probab=55.76 E-value=12 Score=15.94 Aligned_cols=58 Identities=12% Similarity=0.029 Sum_probs=32.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999980989999999999985304773026899877887765455567999998 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGE 118 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~ 118 (271) -.-+..+++.|++..|...+.++...--......+..+..|......+++..|...+. T Consensus 28 L~Aa~a~i~eg~~~qA~~ll~~l~~~~L~~~q~~e~~Ll~A~l~~~~~~~~~Al~~L~ 85 (535) T pfam04348 28 LLAARVLIEENKIAQAQALLRELSVQQLTDEQKLDKSLLEAELAAAKRQNEVALQQLS 85 (535) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHC T ss_conf 9999999977999999999997404119999999999999999997599999999855 No 234 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=55.51 E-value=8.9 Score=16.77 Aligned_cols=17 Identities=12% Similarity=0.182 Sum_probs=10.0 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 99999999722687533 Q gi|254781174|r 28 FSIAVCFLVGWERQSSR 44 (271) Q Consensus 28 ~~i~~~~l~~Cs~~~~~ 44 (271) +++++++++||++.... T Consensus 7 ~~~~~~~~~SCg~~~~G 23 (449) T TIGR03525 7 FAALVVLVYSCGSGDKG 23 (449) T ss_pred HHHHHHHHHHCCCCCCC T ss_conf 99999987210589971 No 235 >COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=54.95 E-value=8.5 Score=16.89 Aligned_cols=21 Identities=24% Similarity=0.316 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHCCCCCC Q ss_conf 999999999999972268753 Q gi|254781174|r 23 ALTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 23 ~~~i~~~i~~~~l~~Cs~~~~ 43 (271) .++++++++.++++.|++... T Consensus 4 vl~i~Lliis~fl~a~~s~~e 24 (182) T COG2143 4 VLLIVLLIISLFLSACKSNNE 24 (182) T ss_pred HHHHHHHHHHHHHHHHHCCCH T ss_conf 199999999999999717804 No 236 >pfam06474 MLTD_N MLTD_N. Probab=54.12 E-value=13 Score=15.78 Aligned_cols=21 Identities=19% Similarity=0.150 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHCCCCCCCCCC Q ss_conf 999999999722687533344 Q gi|254781174|r 27 FFSIAVCFLVGWERQSSRDVY 47 (271) Q Consensus 27 ~~~i~~~~l~~Cs~~~~~~~~ 47 (271) .++++.++|+||.+....... T Consensus 5 ail~~alLLaGCQst~~~~~~ 25 (93) T pfam06474 5 AVLAAALLLVGCQSTQAVSST 25 (93) T ss_pred HHHHHHHHHHHHCCCCCCCCC T ss_conf 999999999981379999998 No 237 >KOG3617 consensus Probab=54.11 E-value=13 Score=15.78 Aligned_cols=84 Identities=15% Similarity=0.165 Sum_probs=40.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 56555555566666665556666654432234568999886528--9981159999999999997498799999999997 Q gi|254781174|r 177 GARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANY--SDAEHAEEAMARLVEAYVALALMDEAREVVSLIQ 254 (271) Q Consensus 177 ~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~y--p~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~ 254 (271) .+...+..+++.+|.---.+-..-.-+++....|.....+++.- |++......+-.|++.|-...++..|....+-|. T Consensus 1278 ~~~t~l~~Lq~~~a~vk~~l~~~q~~~eD~~~~i~qc~~lleep~ld~~Ir~~~~~a~lie~~v~~k~y~~AyRal~el~ 1357 (1416) T KOG3617 1278 MSTTGLDALQEDLAKVKVQLRKLQIMKEDAADGIRQCTTLLEEPILDDIIRCTRLFALLIEDHVSRKNYKPAYRALTELQ 1357 (1416) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 05778999999999999999999986545888899989876075778751167899999999875631108889999876 Q ss_pred HHCCCC Q ss_conf 856999 Q gi|254781174|r 255 ERYPQG 260 (271) Q Consensus 255 ~~yP~s 260 (271) .+-|+- T Consensus 1358 ~k~p~~ 1363 (1416) T KOG3617 1358 KKVPNV 1363 (1416) T ss_pred HCCCCC T ss_conf 307766 No 238 >TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family; InterPro: IPR014247 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain. . Probab=53.04 E-value=13 Score=15.83 Aligned_cols=23 Identities=13% Similarity=0.133 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 55567999998875401121101 Q gi|254781174|r 108 GKYQQAASLGEEYITQYPESKNV 130 (271) Q Consensus 108 ~~y~~A~~~~~~fi~~~P~s~~~ 130 (271) ..++-|-...++..+.-|.-+.+ T Consensus 68 ~~~~~a~~~A~~~a~k~~gvk~A 90 (185) T TIGR02898 68 DLLKVADKIASEVAKKVKGVKDA 90 (185) T ss_pred HHHHHHHHHHHHHHHHCCCCCEE T ss_conf 07899999999998627886211 No 239 >TIGR02508 type_III_yscG type III secretion protein, YscG family; InterPro: IPR013348 YscG is a molecular chaperone for YscE, where both are part of the type III secretion system that in Yersinia is designated Ysc (Yersinia secretion). The secretion system delivers effector proteins, designated Yops (Yersinia outer proteins), in Yersinia. This entry consists of YscG from Yersinia, and functionally equivalent type III secretion proteins in other species: e.g. AscG in Aeromonas and LscG in Photorhabdus luminescens.; GO: 0009405 pathogenesis. Probab=53.03 E-value=6.1 Score=17.75 Aligned_cols=49 Identities=10% Similarity=0.213 Sum_probs=33.2 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999998098999999999998530477302689987788776545556 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQ 111 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~ 111 (271) .-.-.-+....+++|+|..|.+..++.-.-||+ +.| ++|-|-++.|--+ T Consensus 40 E~v~LIrlsSLmN~G~Y~~Al~lg~~~~tayPd--Lep----wlALce~rlGl~~ 88 (118) T TIGR02508 40 EAVVLIRLSSLMNRGDYQEALQLGEELCTAYPD--LEP----WLALCEWRLGLLS 88 (118) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCC--HHH----HHHHHHHHHHHHH T ss_conf 999999986312755799999724336887778--778----9999999987999 No 240 >PRK10722 hypothetical protein; Provisional Probab=52.79 E-value=14 Score=15.65 Aligned_cols=22 Identities=9% Similarity=0.105 Sum_probs=15.1 Q ss_pred HHHHHHHHHHCCCCCCCCCCCC Q ss_conf 9999999972268753334456 Q gi|254781174|r 28 FSIAVCFLVGWERQSSRDVYLD 49 (271) Q Consensus 28 ~~i~~~~l~~Cs~~~~~~~~~~ 49 (271) +.+.+++|.||+.+........ T Consensus 21 ~~~~~l~L~gC~~~~~~~~~~~ 42 (248) T PRK10722 21 AGLPCLALAGCVQNANKPAIDT 42 (248) T ss_pred HHHHHHHHHHCCCCCCCCCCCC T ss_conf 7149998863047888753455 No 241 >TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein; InterPro: IPR005696 Bacteriocins are antibacterial proteinaceous compounds produced by bacteria. In Gram-positive bacteria, they are divided into four classes. Within the class II, constituted by non-modified peptides produced mainly by lactic acid bacteria, bacteriocins of the subclass IIa, such as mesentericin Y105, are of particular interest. They are active against the foodborne pathogen Listeria monocytogenes and share a similar primary structure, with a conserved N-terminal motif (YGNGV). Subclass IIa bacteriocins induce membrane permeabilization of sensitive strains, but their target specificity and their molecular mode of action remain elusive. This family of proteins is involved in the secretion of such bacteriocins although the mechanism of this process is not well understood. ; GO: 0005215 transporter activity, 0006810 transport. Probab=52.32 E-value=9 Score=16.74 Aligned_cols=44 Identities=11% Similarity=0.272 Sum_probs=27.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 99999999999999999997226875333445677767368999 Q gi|254781174|r 17 YQLYKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREV 60 (271) Q Consensus 17 ~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~l 60 (271) |+-|-..+++-++|++|++++.+..++++..+.......|++.+ T Consensus 17 y~NFs~l~I~P~~lLLv~~~lFs~fAkkEivi~~~g~~~p~~~~ 60 (476) T TIGR01000 17 YHNFSTLVIVPIFLLLVFLVLFSLFAKKEIVIRSTGSIEPAKVI 60 (476) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEEEECCCCCC T ss_conf 11012689999999999999972201078898411266143201 No 242 >PRK11627 hypothetical protein; Provisional Probab=51.99 E-value=14 Score=15.58 Aligned_cols=15 Identities=20% Similarity=0.383 Sum_probs=10.9 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 999999972268753 Q gi|254781174|r 29 SIAVCFLVGWERQSS 43 (271) Q Consensus 29 ~i~~~~l~~Cs~~~~ 43 (271) +++.++|.||++... T Consensus 9 l~a~~lL~gCas~p~ 23 (192) T PRK11627 9 LVALFMLAGCATPPT 23 (192) T ss_pred HHHHHHHHHHCCCCC T ss_conf 999999986069976 No 243 >PRK04841 transcriptional regulator MalT; Provisional Probab=51.67 E-value=14 Score=15.55 Aligned_cols=65 Identities=12% Similarity=0.101 Sum_probs=37.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH-HHHHHCCCC Q ss_conf 66666544322345689998865---289981159999999999997498799999999-997856999 Q gi|254781174|r 196 IGRYYLKRGEYVAAIPRFQLVLA---NYSDAEHAEEAMARLVEAYVALALMDEAREVVS-LIQERYPQG 260 (271) Q Consensus 196 ia~~Y~~~~~y~aA~~~~~~~i~---~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~-~l~~~yP~s 260 (271) +++.+...|++..|...+..+.. .++.....-+++..++.++..+|..++|..... .|..--|++ T Consensus 697 ~ar~~l~~g~~~~A~~~l~~l~~~a~~~~~~~~~~~~~ll~A~a~~~~g~~~~A~~~l~~AL~lA~~~g 765 (903) T PRK04841 697 IARAQILLGQFDEAEIVLEELNENARSLRLMSDLNRNLILLNQLYWQQGRKSDAQRALLEALKLANRTG 765 (903) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC T ss_conf 999999829979999999999999987275346999999999999987999999999999999761468 No 244 >pfam07720 TPR_3 Tetratricopeptide repeat. This Pfam entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones. Probab=51.67 E-value=14 Score=15.55 Aligned_cols=25 Identities=16% Similarity=0.315 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 9999999999809899999999999 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQC 83 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i 83 (271) .+|..|-....+|+|++|...|.-+ T Consensus 3 ~l~~~Aa~~~~k~ky~~A~~~y~~l 27 (34) T pfam07720 3 YLYSLAAVFYLKGKYQEAEHLYALL 27 (34) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 8999899998702134799999999 No 245 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=50.65 E-value=6.9 Score=17.41 Aligned_cols=18 Identities=28% Similarity=0.370 Sum_probs=11.8 Q ss_pred HHHHHHHHHHCCCCCCCC Q ss_conf 999999997226875333 Q gi|254781174|r 28 FSIAVCFLVGWERQSSRD 45 (271) Q Consensus 28 ~~i~~~~l~~Cs~~~~~~ 45 (271) +++..++|+||++..... T Consensus 7 ~~l~~~lLvGCsS~~~i~ 24 (123) T COG5633 7 LSLALLLLVGCSSHQEIL 24 (123) T ss_pred HHHHHHHHHCCCCCCCCC T ss_conf 999999942047887752 No 246 >PRK11023 hypothetical protein; Provisional Probab=50.53 E-value=11 Score=16.33 Aligned_cols=19 Identities=16% Similarity=0.077 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999972268 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWER 40 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~ 40 (271) ....+.+++++++|+||.. T Consensus 3 ~~~~l~~l~~~~~L~GC~~ 21 (191) T PRK11023 3 ALSPLAVLISALLLQGCVA 21 (191) T ss_pred HHHHHHHHHHHHHHCCCHH T ss_conf 4889999999999726823 No 247 >PRK11671 mltC murein transglycosylase C; Provisional Probab=50.52 E-value=12 Score=16.09 Aligned_cols=22 Identities=23% Similarity=0.327 Sum_probs=11.3 Q ss_pred HHHHHHHHHHCCCCHHHHHHHH Q ss_conf 9999999785699978999998 Q gi|254781174|r 247 REVVSLIQERYPQGYWARYVET 268 (271) Q Consensus 247 ~~~~~~l~~~yP~s~~~~~a~~ 268 (271) .+++..|..+.|-.+=..+..+ T Consensus 328 ~evY~~L~~~~P~~EtR~YL~K 349 (360) T PRK11671 328 GDVYQTLTTRHPSAESRRYLYK 349 (360) T ss_pred HHHHHHHHHHCCHHHHHHHHHH T ss_conf 9999999863986889999999 No 248 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=49.85 E-value=15 Score=15.38 Aligned_cols=28 Identities=14% Similarity=0.184 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 9999999999809899999999999853 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQCSRD 86 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i~~~ 86 (271) ..-.+|...+..||+.+|.+.+.+..+. T Consensus 87 ~al~~Gl~a~~~Gd~~~A~k~~~ka~k~ 114 (134) T pfam07219 87 KALSEGLLALAEGDWALAERLARKAAEL 114 (134) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHC T ss_conf 9999999999789999999999998763 No 249 >pfam04011 LemA LemA family. The members of this family are related to the LemA protein. LemA contains an amino terminal predicted transmembrane helix. It has been predicted that the small amino terminus is extracellular. The exact molecular function of this protein is uncertain. Probab=49.58 E-value=16 Score=15.35 Aligned_cols=68 Identities=19% Similarity=0.234 Sum_probs=47.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 345899999999999852001013456555555566666665556666654432234568999886528998115 Q gi|254781174|r 152 QRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANYSDAEHA 226 (271) Q Consensus 152 ~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~ 226 (271) ......++..+..+.+.||+ -.+......+...+++-|-.|+ ..|..|-.++..|+..+..||.+-.+ T Consensus 94 ~~~l~~aL~~l~a~~E~YP~----Lka~~~~~~L~~~l~~~E~~Ia---~aR~~YN~~V~~yN~~i~~FP~~lvA 161 (186) T pfam04011 94 QNELSSALGRLLVVAENYPD----LKANENFLELQSQLEGTENRIA---VARRDYNEAVQEYNTKIRQFPSVIVA 161 (186) T ss_pred HHHHHHHHHHHHHHHHHCCC----CCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 99999999999999974945----2334679999999999999999---99999999999998878838299999 No 250 >KOG2047 consensus Probab=49.24 E-value=16 Score=15.32 Aligned_cols=20 Identities=25% Similarity=0.248 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 87765455567999998875 Q gi|254781174|r 102 FVQYSAGKYQQAASLGEEYI 121 (271) Q Consensus 102 ~~~y~~~~y~~A~~~~~~fi 121 (271) +-|-..|.++.|...|++-+ T Consensus 256 dYYIr~g~~ekarDvyeeai 275 (835) T KOG2047 256 DYYIRSGLFEKARDVYEEAI 275 (835) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99987311677789999998 No 251 >PRK09973 putative outer membrane lipoprotein; Provisional Probab=49.17 E-value=5.9 Score=17.80 Aligned_cols=22 Identities=18% Similarity=0.265 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999999972268753 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~~~~ 43 (271) .+.+-.+++...+|.||++..+ T Consensus 4 k~vlGaviLaS~LLaGCsn~~k 25 (85) T PRK09973 4 IFTVGAVVLATCLLSGCVNEQK 25 (85) T ss_pred EEHHHHHHHHHHHHHCCCCHHH T ss_conf 2110699999999981555588 No 252 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=48.37 E-value=16 Score=15.24 Aligned_cols=16 Identities=31% Similarity=0.368 Sum_probs=10.4 Q ss_pred HHHHHHHHHCCCCCCC Q ss_conf 9999999722687533 Q gi|254781174|r 29 SIAVCFLVGWERQSSR 44 (271) Q Consensus 29 ~i~~~~l~~Cs~~~~~ 44 (271) ++++++|.||+..... T Consensus 9 ~~~~~~L~GC~~~~~~ 24 (238) T PRK12696 9 SCAVLLLSGCNAARQQ 24 (238) T ss_pred HHHHHHHHCCCCCCCC T ss_conf 9999995165688888 No 253 >pfam11846 DUF3366 Domain of unknown function (DUF3366). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is about 200 amino acids in length. Probab=48.01 E-value=17 Score=15.21 Aligned_cols=53 Identities=19% Similarity=0.145 Sum_probs=40.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 689998865289981159999999999997498799999999997856999789999 Q gi|254781174|r 210 IPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGYWARYV 266 (271) Q Consensus 210 ~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~~~~~a 266 (271) +.-...++..-|+ +....+++.++..+|+.++|++.......-||..++..-. T Consensus 131 ~~wa~~~l~~~P~----p~~y~~la~~~~~~G~~~~A~~~~~~a~~lyP~~~f~~~~ 183 (193) T pfam11846 131 HEQALKLLRYKPR----PAVYRRLALALALLGKPAEARDWMAQARYLYPLADFCAAL 183 (193) T ss_pred HHHHHHHHHCCCC----HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 9999999874997----5999999999998699999999999999839972336886 No 254 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=47.85 E-value=17 Score=15.19 Aligned_cols=25 Identities=8% Similarity=-0.008 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999999999972268753 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~ 43 (271) |...+.+.+.+.+++++.||++... T Consensus 3 ~~t~~Rl~~~~~~~~ll~GCa~~~~ 27 (230) T PRK12700 3 LKTVLRLPVCAALLALAAGCAMIPP 27 (230) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 5889999999999999744468999 No 255 >pfam09710 Trep_dent_lipo Treponema clustered lipoprotein (Trep_dent_lipo). This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown. Probab=47.54 E-value=7.3 Score=17.28 Aligned_cols=20 Identities=20% Similarity=0.311 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999972268753 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~ 43 (271) .++|++++.++|+|||+.-+ T Consensus 4 KlifILilavLLFSCske~K 23 (394) T pfam09710 4 KLIFILILAVLLFSCSKEVK 23 (394) T ss_pred EEEHHHHHHHHHHHHHHHHC T ss_conf 57279999999974036551 No 256 >pfam12309 KBP_C KIF-1 binding protein C terminal. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 365 and 621 amino acids in length. There is a conserved LLP sequence motif. KBP is a binding partner for KIF1Balpha that is a regulator of its transport function and thus represents a type of kinesin interacting protein. Probab=47.29 E-value=17 Score=15.14 Aligned_cols=64 Identities=13% Similarity=0.221 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHC Q ss_conf 8999999999998520010134565555--55566666665556666654---------432234568999886528 Q gi|254781174|r 155 TKLMLQYMSRIVERYTNSPYVKGARFYV--TVGRNQLAAKEVEIGRYYLK---------RGEYVAAIPRFQLVLANY 220 (271) Q Consensus 155 ~~~A~~~f~~~i~~yP~S~ya~~A~~~l--~~~~~~La~~e~~ia~~Y~~---------~~~y~aA~~~~~~~i~~y 220 (271) ...|+..|+.|+..|-.+. ..+....+ ...+..|-. .|.+|+.|.+ .++-..+...|+.+++-. T Consensus 259 ~~~aI~~y~~F~~s~~~~~-~~~~p~~~~~d~~rp~l~A-~f~~Arl~sK~is~d~~~~le~l~~SL~~Y~~vv~yc 333 (365) T pfam12309 259 CRKAIKYYEKFLDSLKDPS-TGKLPETLDEDELRPILVA-KFHIARLYSKLITADPKEKLENLSKSLEYYKQIVDYC 333 (365) T ss_pred HHHHHHHHHHHHHHHCCCC-CCCCCCCCCHHHHHHHHHH-HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999863776-4536654678888899999-9999999865047985999999999999999999999 No 257 >pfam11153 DUF2931 Protein of unknown function (DUF2931). Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function. Probab=47.19 E-value=14 Score=15.58 Aligned_cols=16 Identities=13% Similarity=0.280 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 9999999999972268 Q gi|254781174|r 25 TIFFSIAVCFLVGWER 40 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~ 40 (271) .+++++++++++||+. T Consensus 3 ~i~~ll~~lll~aCs~ 18 (202) T pfam11153 3 KILLLLLLLLLTACST 18 (202) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 1489999999876025 No 258 >PRK13684 Ycf48-like protein; Provisional Probab=47.17 E-value=17 Score=15.13 Aligned_cols=20 Identities=10% Similarity=0.056 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999999999972268 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWER 40 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~ 40 (271) +..-++++++++++|.||++ T Consensus 7 ~~~~l~l~~~~~~~~~~c~~ 26 (333) T PRK13684 7 SLLNLLLLLALGLVLSGCST 26 (333) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 89999999999986322356 No 259 >pfam12273 RCR Chitin synthesis regulation, resistance to Congo red. RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5. Probab=46.51 E-value=17 Score=15.20 Aligned_cols=19 Identities=26% Similarity=0.371 Sum_probs=10.3 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999972268 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWER 40 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~ 40 (271) +.++|+++++++|+++|-+ T Consensus 5 fav~i~~~~i~~f~~~~~n 23 (124) T pfam12273 5 FAIFIIALLILFFLTARIN 23 (124) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 8999999999999999873 No 260 >PRK10598 hypothetical protein; Provisional Probab=46.38 E-value=13 Score=15.86 Aligned_cols=23 Identities=26% Similarity=0.361 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999999999999997226875 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQS 42 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~ 42 (271) |.|+.+. ..+++..+++||++-+ T Consensus 1 mkk~~l~-~~l~l~~llsGC~~l~ 23 (186) T PRK10598 1 MKKFLLA-AALLVSGLLVGCNQLT 23 (186) T ss_pred CCHHHHH-HHHHHHHHHHCCCCCC T ss_conf 9126999-9999999971435336 No 261 >TIGR01084 mutY A/G-specific adenine glycosylase; InterPro: IPR005760 The DNA repair enzyme MutY plays an important role in the prevention of DNA mutations resulting from the presence of the oxidatively damaged lesion 7,8-dihydro-8-oxo-2'-deoxyguanosine (8-OxoG). 8-OxoG can mispair with 2'-deoxycytidine 5'-triphosphate or with 2'-deoxyadenosine triphosphate during DNA replication, forming C*8-oxoG and A*8-oxoG mispairs. Human MutY is responsible for recognition and removal of the inappropriately inserted adenine in an A=8-oxoG mispair. If unrepaired, the A=8-oxoG mispairs can result in deleterious C:G to A:T transversions. Multiple forms of mammalian MutY are formed by alternative splicing and locate to the nucleus or mitochondrion, where they have been shown to interact with several other proteins involved in the repair of DNA damage . The HhH-GPD domain within the protein binds the phosphate backbone of the substrate. This family represents bacterial MutY and a limited number of murine homologs. In rat, the Escherichia coli MutY DNA glycosylase homolog (MYH) is induced in response to mitochondrial DNA damage . ; GO: 0019104 DNA N-glycosylase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=45.10 E-value=18 Score=14.94 Aligned_cols=67 Identities=10% Similarity=0.165 Sum_probs=45.1 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHCCCCCHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q ss_conf 77887765455567999-998875401121101355------544344455554431112345899999-9999998520 Q gi|254781174|r 99 MSAFVQYSAGKYQQAAS-LGEEYITQYPESKNVDYV------YYLVGMSYAQMIRDVPYDQRATKLMLQ-YMSRIVERYT 170 (271) Q Consensus 99 ~la~~~y~~~~y~~A~~-~~~~fi~~~P~s~~~~~A------~y~~a~~~~~~~~~~~~d~~~~~~A~~-~f~~~i~~yP 170 (271) ++.++-.++-+-...+. +|.+|++.||+-.....| ....|+-||...+ +..+|-. .-+.+=..+| T Consensus 42 W~SEvMLQQTqV~tV~prYf~rFle~FPTv~~LA~A~~deVL~lW~GLGYYaRAR-------NL~kAA~~v~~~fGG~fP 114 (297) T TIGR01084 42 WVSEVMLQQTQVATVIPRYFERFLERFPTVQALANAPQDEVLKLWEGLGYYARAR-------NLHKAAQEVVEEFGGEFP 114 (297) T ss_pred EEEEECCCCCEEEECCCCCCHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHH-------HHHHHHHHHHHHHCCCCC T ss_conf 4002110011001126710047664278857874779657999862578678889-------999999999987188177 Q ss_pred CC Q ss_conf 01 Q gi|254781174|r 171 NS 172 (271) Q Consensus 171 ~S 172 (271) +. T Consensus 115 ~d 116 (297) T TIGR01084 115 QD 116 (297) T ss_pred CC T ss_conf 23 No 262 >TIGR03511 GldH_lipo gliding motility-associated lipoprotein GldH. Members of this protein family are predicted lipoproteins, exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). Members include GldH, a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family may have gliding motility. Probab=44.95 E-value=12 Score=16.10 Aligned_cols=23 Identities=22% Similarity=0.574 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999999999997226875 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQS 42 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~~ 42 (271) +|.|..+++++ ++++++||..++ T Consensus 2 ~~~~~~i~~~l--~~~ll~SC~~~~ 24 (156) T TIGR03511 2 SLVRNSISFFL--GACVLVSCTENT 24 (156) T ss_pred CHHHHHHHHHH--HHHHEEEECCCC T ss_conf 06854499999--999707638996 No 263 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=44.51 E-value=16 Score=15.29 Aligned_cols=22 Identities=14% Similarity=0.209 Sum_probs=11.2 Q ss_pred HHHHHHHHHH-HHHHHHHHHCCC Q ss_conf 9999999999-999999972268 Q gi|254781174|r 19 LYKFALTIFF-SIAVCFLVGWER 40 (271) Q Consensus 19 m~k~~~~i~~-~i~~~~l~~Cs~ 40 (271) |.|.+..++. .++++.+.||.+ T Consensus 1 Mk~~~~~~~~l~~~~~~lagCNT 23 (42) T pfam08085 1 MKKLIALLLALLLLALVLAGCNT 23 (42) T ss_pred CHHHHHHHHHHHHHHHHHHCCHH T ss_conf 92369999999999999861222 No 264 >pfam11172 DUF2959 Protein of unknown function (DUF2959). This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. Probab=44.28 E-value=19 Score=14.86 Aligned_cols=91 Identities=15% Similarity=0.142 Sum_probs=42.6 Q ss_pred HHHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHH--HHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH- Q ss_conf 999722687---533344567776736899999999999--80989999999999985304773026899877887765- Q gi|254781174|r 33 CFLVGWERQ---SSRDVYLDSVTDVRYQREVYEKAVLFL--KEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYS- 106 (271) Q Consensus 33 ~~l~~Cs~~---~~~~~~~~~~~~~~~~~~lY~~a~~~~--~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~- 106 (271) ++|+||++. .-+++-+.+-+.. ...-+.|.... .+.++..|++.|..+... .++++- ..|-. T Consensus 2 ~lL~GCqsaYY~aMEkvG~hKRDIl---vdRVe~ArdsQ~~aqeqF~sALe~~~a~~~~-dgg~Le--------~~Y~~L 69 (201) T pfam11172 2 LLLSGCQSAYYSAMEKVGVHKRDIL---VDRVEDARDSQEDAQEQFKSALEQFSSVVNF-DGGDLE--------DVYNQL 69 (201) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCHH--------HHHHHH T ss_conf 2166547889999999574177899---9999998987999999999999999887579-985299--------999999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 45556799999887540112110135554 Q gi|254781174|r 107 AGKYQQAASLGEEYITQYPESKNVDYVYY 135 (271) Q Consensus 107 ~~~y~~A~~~~~~fi~~~P~s~~~~~A~y 135 (271) ...|+.+....+..-.+--.-+.+..|.| T Consensus 70 n~~yE~S~~aA~~V~~RI~~ve~VA~aLF 98 (201) T pfam11172 70 NDEYEDSEAAAEEVSDRIDAVEDVAEALF 98 (201) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999899999999999999999889999 No 265 >pfam00244 14-3-3 14-3-3 protein. Probab=43.94 E-value=19 Score=14.83 Aligned_cols=63 Identities=14% Similarity=0.084 Sum_probs=40.5 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999980989999999999985304773026899877887765455567999998875 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYI 121 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi 121 (271) +++...|...-+.++|++.+...++++...|.-..-++-+|..|+-.. .|..-.|......+. T Consensus 2 e~~v~~Aklaeq~ery~dm~~~mk~~~~~~~~Ls~eERnLLsvayKn~-i~~rR~s~R~l~~ie 64 (236) T pfam00244 2 EELVYLAKLAEQAERYDDMVEAMKKVVELKEELSVEERNLLSVAYKNV-IGARRASWRIISSIE 64 (236) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH T ss_conf 789999999998468999999999998658989999999999999987-410099999998999 No 266 >PRK10871 nlpD lipoprotein NlpD; Provisional Probab=43.90 E-value=19 Score=14.83 Aligned_cols=31 Identities=13% Similarity=0.122 Sum_probs=21.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 9999999999999999999722687533344 Q gi|254781174|r 17 YQLYKFALTIFFSIAVCFLVGWERQSSRDVY 47 (271) Q Consensus 17 ~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~~~ 47 (271) ..|+++..+.++.+..+.|.||++....... T Consensus 5 s~~~~~~~~~~~~~~~~~l~gcs~~~~~~ap 35 (374) T PRK10871 5 SPKFTVRRIAALSLVSLWLAGCTNTSNPPAP 35 (374) T ss_pred CCHHHHHHHHHHHHHHHHHECCCCCCCCCCC T ss_conf 8216778999999999860103788888686 No 267 >TIGR02269 TIGR02269 Myxococcus xanthus paralogous lipoprotein family TIGR02269; InterPro: IPR011755 This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown.. Probab=43.65 E-value=15 Score=15.51 Aligned_cols=19 Identities=16% Similarity=0.420 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHCCCCCCC Q ss_conf 9999999999722687533 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~~ 44 (271) +.+.|+.++++||+++.++ T Consensus 2 l~~~lla~ll~GCa~~~~~ 20 (228) T TIGR02269 2 LLLLLLALLLVGCASTAPK 20 (228) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 5799889998752057877 No 268 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=43.24 E-value=17 Score=15.18 Aligned_cols=23 Identities=13% Similarity=0.132 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999999999722687 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQ 41 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~ 41 (271) +|.+. +++.++.++++|+||+.. T Consensus 2 ~~kk~-~~~~~~~~~l~lsGC~a~ 24 (243) T PRK13731 2 KTKKL-MMVALVSSTLALSGCGAM 24 (243) T ss_pred CHHHH-HHHHHHHHHHHHCCHHHH T ss_conf 33775-799999999997156876 No 269 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=43.21 E-value=20 Score=14.76 Aligned_cols=21 Identities=14% Similarity=0.225 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999722687533 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~~ 44 (271) ++.++++++++|.||+..... T Consensus 5 ~~~~~~~~~~~LsgCa~~~~~ 25 (231) T PRK00249 5 LIALALLLLLLLSGCASIPPK 25 (231) T ss_pred HHHHHHHHHHHHHHCCCCCCC T ss_conf 999999999998753489998 No 270 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=42.68 E-value=20 Score=14.72 Aligned_cols=136 Identities=13% Similarity=0.116 Sum_probs=84.3 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH--HHHHH Q ss_conf 9999999999980989999999999985304773026899877887765455567999998875401121101--35554 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNV--DYVYY 135 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~--~~A~y 135 (271) -..|..|...-+.++-++|+..|..+.+..- ..|--.|.+..|-..-..|+-..|+..|++.-.-.|- |.+ +-|.. T Consensus 59 gd~flaAL~lA~~~k~d~Alaaf~~lektg~-g~YpvLA~mr~at~~a~kgdta~AV~aFdeia~dt~~-P~~~rd~ARl 136 (221) T COG4649 59 GDAFLAALKLAQENKTDDALAAFTDLEKTGY-GSYPVLARMRAATLLAQKGDTAAAVAAFDEIAADTSI-PQIGRDLARL 136 (221) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC-CCHHHHHHHH T ss_conf 5899999999870796689999999986488-8556999999999986126478899989988525787-6013679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 43444555544311123458999999999998520010134565555555666666655566666544322345689998 Q gi|254781174|r 136 LVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQL 215 (271) Q Consensus 136 ~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~ 215 (271) +-|.... |.+....-....+-+ --|++++...|++-+ |--=++-|++..|..-|+. T Consensus 137 raa~lLv--------D~gsy~dV~srvepL--a~d~n~mR~sArEAL--------------glAa~kagd~a~A~~~F~q 192 (221) T COG4649 137 RAAYLLV--------DNGSYDDVSSRVEPL--AGDGNPMRHSAREAL--------------GLAAYKAGDFAKAKSWFVQ 192 (221) T ss_pred HHHHHHH--------CCCCHHHHHHHHHHC--CCCCCHHHHHHHHHH--------------HHHHHHCCCHHHHHHHHHH T ss_conf 8789874--------356488899874130--489970178899987--------------6888732246779999999 Q ss_pred HHHH Q ss_conf 8652 Q gi|254781174|r 216 VLAN 219 (271) Q Consensus 216 ~i~~ 219 (271) +.++ T Consensus 193 ia~D 196 (221) T COG4649 193 IAND 196 (221) T ss_pred HHCC T ss_conf 9701 No 271 >COG2178 Predicted RNA-binding protein of the translin family [Translation, ribosomal structure and biogenesis] Probab=41.95 E-value=21 Score=14.65 Aligned_cols=109 Identities=21% Similarity=0.269 Sum_probs=57.8 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCC--HHHHHHHHHHHHHHHH----HHH Q ss_conf 99999999985304773026899877887765455567999998875401-1211--0135554434445555----443 Q gi|254781174|r 75 KAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQY-PESK--NVDYVYYLVGMSYAQM----IRD 147 (271) Q Consensus 75 ~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~-P~s~--~~~~A~y~~a~~~~~~----~~~ 147 (271) +++..+++....|| ..+..|+|-..+++|.+|...+.-.-... |.+. .++.+.|..|++-.-- ..- T Consensus 57 ~~v~~Lk~~l~~~p-------el~~ag~~~~a~QEyvEA~~l~~~l~~~~~ps~~EL~V~~~~YilGl~D~vGELrR~~l 129 (204) T COG2178 57 EAVEKLKRLLAGFP-------ELYFAGFVTTALQEYVEATLLYSILKDGRLPSPEELGVPPIAYILGLADAVGELRRHVL 129 (204) T ss_pred HHHHHHHHHHHHHH-------HHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999986148-------99987753006999999999999996578998887399899999789999989999999 Q ss_pred HHHHHHHHHHHHHH-------HHHHHH-HHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 11123458999999-------999998-520010134565555555666666 Q gi|254781174|r 148 VPYDQRATKLMLQY-------MSRIVE-RYTNSPYVKGARFYVTVGRNQLAA 191 (271) Q Consensus 148 ~~~d~~~~~~A~~~-------f~~~i~-~yP~S~ya~~A~~~l~~~~~~La~ 191 (271) ...-.++.+.|-.. +..+.. .||. ..++.-+.+.+..+..+.+ T Consensus 130 e~l~~~~~~~Ae~~~~~ME~lY~~Lm~fdyP~-~l~~~LR~K~Dvar~~lek 180 (204) T COG2178 130 ELLRKGSFEEAERFLKFMEKLYEELMEFDYPK-ALVPGLRQKQDVARSLLEK 180 (204) T ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHHHH T ss_conf 99873429999999999999999998267845-3426588989999999998 No 272 >TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ. Probab=41.88 E-value=20 Score=14.76 Aligned_cols=21 Identities=24% Similarity=0.234 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHCCCCCCCCC Q ss_conf 999999999972268753334 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSSRDV 46 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~~~~ 46 (271) +.+++++++|.|||....++. T Consensus 4 ~~~~~~~~~l~gCs~~~~~~~ 24 (377) T TIGR03300 4 ALVIALAALLSGCSWFSSKDE 24 (377) T ss_pred HHHHHHHHHHHHCCCCCCCCC T ss_conf 899999999855657679988 No 273 >KOG3824 consensus Probab=41.66 E-value=21 Score=14.62 Aligned_cols=70 Identities=14% Similarity=0.042 Sum_probs=55.1 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 8999999999998098999999999998530477302689987788776545556799999887540112110 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) +....+.|......|+-+.|...|+-....-|.. ++++...|.-....++--+|-..|-+.+..-|.+.. T Consensus 116 A~~Al~~A~~~~~~Gk~ekA~~lfeHAlalaP~~---p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nse 185 (472) T KOG3824 116 AILALKAAGRSRKDGKLEKAMTLFEHALALAPTN---PQILIEMGQFREMHNEIVEADQCYVKALTISPGNSE 185 (472) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCHH T ss_conf 9999999889886434688999999998308997---799999768888514467665564202321798558 No 274 >pfam09613 HrpB1_HrpK Bacterial type III secretion protein (HrpB1_HrpK). This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia. Probab=41.48 E-value=21 Score=14.60 Aligned_cols=63 Identities=6% Similarity=0.063 Sum_probs=36.7 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999999980989999999999985304773026899877887765455567999998875401121 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPES 127 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s 127 (271) .=.|+-++..|+|.+|+..|.++...-|.++++ .-+++.|++.++|.+- ....++.+..-|+. T Consensus 48 ~~dg~l~i~rg~w~dA~r~lr~v~~~~~~~~~~---kAL~A~CL~a~~D~~W-r~~A~evLe~~~~~ 110 (156) T pfam09613 48 LFDGWLYIGRGQWDDAARLLRELEGRASHLPLC---KALMAQCLFALQDPAW-REYADEVLEEGGDP 110 (156) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHH---HHHHHHHHHHCCCHHH-HHHHHHHHHHCCCC T ss_conf 878778776065899999999986267784899---9999999987479679-99999998618990 No 275 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=41.30 E-value=20 Score=14.68 Aligned_cols=14 Identities=21% Similarity=0.405 Sum_probs=8.5 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 99999972268753 Q gi|254781174|r 30 IAVCFLVGWERQSS 43 (271) Q Consensus 30 i~~~~l~~Cs~~~~ 43 (271) ++.++++||++... T Consensus 9 ~~~l~LagCas~~~ 22 (26) T pfam08139 9 LALLLLAGCASXXX 22 (26) T ss_pred HHHHHHHCCCCCCC T ss_conf 99999823320013 No 276 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=41.13 E-value=21 Score=14.57 Aligned_cols=18 Identities=22% Similarity=0.287 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHCCCCCC Q ss_conf 999999999972268753 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSS 43 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~ 43 (271) +++++++++|+||++... T Consensus 4 l~l~~~~l~LsGC~~~~~ 21 (220) T PRK12407 4 LILTPMVLALCGCESPAL 21 (220) T ss_pred HHHHHHHHHHHCCCCCCC T ss_conf 799999999736588887 No 277 >cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear. Probab=40.27 E-value=22 Score=14.49 Aligned_cols=30 Identities=17% Similarity=0.189 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 899999999999809899999999999853 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRD 86 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~ 86 (271) +.++-.+|...-+.|+|.+|...|...+.. T Consensus 6 Ai~lv~~Av~~D~~~~Y~eA~~lY~~al~~ 35 (75) T cd02678 6 AIELVKKAIEEDNAGNYEEALRLYQHALEY 35 (75) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 999999999978737799999999999999 No 278 >PRK11443 hypothetical protein; Provisional Probab=40.09 E-value=22 Score=14.48 Aligned_cols=17 Identities=6% Similarity=0.184 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 99999999997226875 Q gi|254781174|r 26 IFFSIAVCFLVGWERQS 42 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~ 42 (271) +.+.+++++|+||.+.. T Consensus 4 ~~~~~~~lll~GCq~~p 20 (120) T PRK11443 4 FIAPLLALLVSGCQIDP 20 (120) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 89999999996667896 No 279 >TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PMID:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al. Probab=40.07 E-value=19 Score=14.90 Aligned_cols=16 Identities=19% Similarity=0.258 Sum_probs=10.1 Q ss_pred HHHHHHHHHHCCCCCC Q ss_conf 9999999972268753 Q gi|254781174|r 28 FSIAVCFLVGWERQSS 43 (271) Q Consensus 28 ~~i~~~~l~~Cs~~~~ 43 (271) +++++++++||++... T Consensus 6 ~l~~~lllagC~s~~~ 21 (146) T TIGR03352 6 LLAACLLLAGCSSAPP 21 (146) T ss_pred HHHHHHHHHHCCCCCC T ss_conf 8999999853259999 No 280 >KOG2997 consensus Probab=39.95 E-value=22 Score=14.46 Aligned_cols=39 Identities=21% Similarity=0.289 Sum_probs=33.5 Q ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 736899999999999809899999999999853047730 Q gi|254781174|r 54 VRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGV 92 (271) Q Consensus 54 ~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~ 92 (271) ..-+..+|.+|...-++|+..+|++.+.....-+|+-+. T Consensus 16 ~kkA~~l~~~av~~Eq~G~l~dai~fYR~AlqI~~diEs 54 (366) T KOG2997 16 AKKAIALYEKAVLKEQDGSLYDAINFYRDALQIVPDIES 54 (366) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHH T ss_conf 899999999998776347487778887765538800999 No 281 >PRK10781 rcsF outer membrane lipoprotein; Reviewed Probab=39.51 E-value=23 Score=14.42 Aligned_cols=18 Identities=17% Similarity=0.194 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHCCCCCC Q ss_conf 999999999972268753 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSS 43 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~ 43 (271) +.+.++.++++|||.... T Consensus 4 lpi~llal~ltGCs~l~~ 21 (133) T PRK10781 4 LPICLLALMLTGCSMLSR 21 (133) T ss_pred HHHHHHHHHHHCCCCCCC T ss_conf 899999999843134454 No 282 >COG3317 NlpB Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=39.11 E-value=23 Score=14.39 Aligned_cols=21 Identities=19% Similarity=0.199 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCC Q ss_conf 999999999999972268753 Q gi|254781174|r 23 ALTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 23 ~~~i~~~i~~~~l~~Cs~~~~ 43 (271) .-+++-.++++++.|||+... T Consensus 5 ~~lv~~al~v~~LaaCSs~~~ 25 (342) T COG3317 5 AKLVLGALLVLLLAACSSDSE 25 (342) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 889999999998741468850 No 283 >cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear. Probab=38.94 E-value=23 Score=14.37 Aligned_cols=31 Identities=23% Similarity=0.264 Sum_probs=19.8 Q ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 6899999999999809899999999999853 Q gi|254781174|r 56 YQREVYEKAVLFLKEQNFSKAYEYFNQCSRD 86 (271) Q Consensus 56 ~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~ 86 (271) .+.++-.+|...-+.|+|.+|...|+.-+.. T Consensus 5 ~A~~lv~rAve~D~~~~y~eAl~~Y~~gi~~ 35 (77) T cd02683 5 AAKEVLKRAVELDQEGRFQEALVCYQEGIDL 35 (77) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 8999999999988715699999999999999 No 284 >PRK13910 DNA glycosylase MutY; Provisional Probab=37.72 E-value=24 Score=14.26 Aligned_cols=18 Identities=11% Similarity=0.401 Sum_probs=8.4 Q ss_pred HHHH-HHHHHHHHCCCCCH Q ss_conf 7999-99887540112110 Q gi|254781174|r 112 QAAS-LGEEYITQYPESKN 129 (271) Q Consensus 112 ~A~~-~~~~fi~~~P~s~~ 129 (271) ..+. +|++|++.||+-.. T Consensus 10 tvip~y~~~f~~~fP~~~~ 28 (290) T PRK13910 10 TVVERFYSPFLEAFPTLKD 28 (290) T ss_pred CCCHHHHHHHHHHCCCHHH T ss_conf 0057899999998839999 No 285 >pfam06957 COPI_C Coatomer (COPI) alpha subunit C-terminus. This family represents the C-terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit. Coatomer (COPI) is a large cytosolic protein complex which forms a coat around vesicles budding from the Golgi apparatus. Such coatomer-coated vesicles have been proposed to play a role in many distinct steps of intracellular transport. Note that many family members also contain the pfam04053 domain. Probab=37.71 E-value=24 Score=14.25 Aligned_cols=36 Identities=19% Similarity=0.123 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 544322345689998865289981159999999999 Q gi|254781174|r 201 LKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEA 236 (271) Q Consensus 201 ~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~ 236 (271) +|.++|..|..--+.+|+.-|..+.+++|...+..+ T Consensus 310 fK~KNy~TAasFArRLLel~p~~~~a~qArKIl~~c 345 (421) T pfam06957 310 FKLKNFKTAASFARRLLELAPKPEVAQQARKVLQAC 345 (421) T ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 986229999999999986499889999999999998 No 286 >PRK04168 hypothetical protein; Provisional Probab=37.70 E-value=18 Score=15.05 Aligned_cols=23 Identities=4% Similarity=0.085 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999999999722687 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQ 41 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~ 41 (271) +.+.+.+++++++.+++.||.+. T Consensus 4 ~~~~~~~~~~~~~~~~~~g~~~~ 26 (336) T PRK04168 4 GRRKIVIVILLLLALVFLGCVNT 26 (336) T ss_pred CCEEHHHHHHHHHHHHHCCCCCC T ss_conf 53220379999999997055577 No 287 >PRK09534 btuF corrinoid ABC transporter substrate-binding protein; Reviewed Probab=37.48 E-value=24 Score=14.23 Aligned_cols=27 Identities=15% Similarity=-0.017 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 999999999999999999972268753 Q gi|254781174|r 17 YQLYKFALTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 17 ~~m~k~~~~i~~~i~~~~l~~Cs~~~~ 43 (271) .+|....++++.+++++.+.||+..++ T Consensus 2 ~~~r~~~l~~~~l~~~~~~~g~~~~~~ 28 (364) T PRK09534 2 HRMRFASLVIVALALVMTAAGGALAPA 28 (364) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 520236999999999998626787889 No 288 >PRK10175 hypothetical protein; Provisional Probab=37.45 E-value=15 Score=15.39 Aligned_cols=20 Identities=40% Similarity=0.403 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999999722687533 Q gi|254781174|r 25 TIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~~~~~ 44 (271) ++++.+.+++++||+|.-+. T Consensus 3 ~i~~~~~~lllsGC~SIms~ 22 (75) T PRK10175 3 LIVVSIMVTLLSGCGSIISR 22 (75) T ss_pred EHHHHHHHHHHCCCHHHHCC T ss_conf 44888899998164244305 No 289 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=37.29 E-value=24 Score=14.21 Aligned_cols=22 Identities=14% Similarity=0.332 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999999999999722687533 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) |.|++++ +++++|.||++.... T Consensus 1 M~r~~l~----~~~l~L~gCa~~~~~ 22 (224) T PRK12698 1 MARYILL----ALALLLAGCSSTPKK 22 (224) T ss_pred CHHHHHH----HHHHHHHCCCCCCCC T ss_conf 9779999----999998365688988 No 290 >PRK11616 hypothetical protein; Provisional Probab=36.64 E-value=15 Score=15.44 Aligned_cols=17 Identities=12% Similarity=0.202 Sum_probs=10.1 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 99999999997226875 Q gi|254781174|r 26 IFFSIAVCFLVGWERQS 42 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~ 42 (271) +++...+++++|||+-- T Consensus 8 f~~~~g~l~~sGCSSvM 24 (109) T PRK11616 8 FMICSGMLLLSGCSSVM 24 (109) T ss_pred HHHHHHHHHHCCCHHHH T ss_conf 99985899875751230 No 291 >COG4238 Murein lipoprotein [Cell envelope biogenesis, outer membrane] Probab=36.56 E-value=9.5 Score=16.60 Aligned_cols=22 Identities=14% Similarity=0.099 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999999999997226875333 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) ++....+..+++.||++..+.+ T Consensus 7 ~l~Avvlg~lllAGc~s~aK~d 28 (78) T COG4238 7 TLGAVVLGSLLLAGCSSNAKID 28 (78) T ss_pred HHHHHHHHHHHHHHCCCHHHHH T ss_conf 2789998779997011088898 No 292 >pfam04010 DUF357 Protein of unknown function (DUF357). Members of this family are short (less than 100 amino acid) proteins found in archaebacteria. The function of these proteins is unknown. Probab=36.45 E-value=25 Score=14.14 Aligned_cols=55 Identities=25% Similarity=0.350 Sum_probs=25.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 45899999999999852001013456555555566666665556666654432234568999 Q gi|254781174|r 153 RATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQ 214 (271) Q Consensus 153 ~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~ 214 (271) ..+.+|++..+... -++|+....+..-+. +|+.-+.=|++|+++|++.-|...+. T Consensus 5 ~~~~eAl~~~~~~~--~~~s~l~~~a~~~~~-----ma~~Y~~Da~~fl~~gD~v~A~~~~~ 59 (75) T pfam04010 5 RLLEEALSKVKIAP--PEGSPLHAAAEEVLE-----MAKSYLEDAKYYLEKGDLVTALACFS 59 (75) T ss_pred HHHHHHHHCCCCCC--CCCCHHHHHHHHHHH-----HHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99999997087889--999825899999999-----99999999999988787799999999 No 293 >cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear. Probab=36.08 E-value=26 Score=14.10 Aligned_cols=30 Identities=10% Similarity=-0.011 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 899999999999809899999999999853 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRD 86 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~ 86 (271) +.++-.+|...-+.|+|.+|...|...+.. T Consensus 6 Ai~lv~~Av~~D~~~~y~eA~~lY~~aley 35 (75) T cd02684 6 AIALVVQAVKKDQRGDAAAALSLYCSALQY 35 (75) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 999999999988726799999999999999 No 294 >smart00101 14_3_3 14-3-3 homologues. 14-3-3 homologues mediates signal transduction by binding to phosphoserine-containing proteins. They are involved in growth factor signalling and also interact with MEK kinases. Probab=35.81 E-value=26 Score=14.07 Aligned_cols=65 Identities=8% Similarity=0.059 Sum_probs=36.0 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Q ss_conf 999999999980989999999999985304773026899877887765-45556799999887540 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYS-AGKYQQAASLGEEYITQ 123 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~-~~~y~~A~~~~~~fi~~ 123 (271) ++-..|...-+.++|++.+...++++......++..+-.-.+..+|=. .|..-.+......+... T Consensus 3 ~~v~~AklaeqaeRyddM~~~mk~v~~~~~~~eLs~eERnLLSvayKn~i~~rR~s~R~i~~ie~k 68 (244) T smart00101 3 ENVYMAKLAEQAERYEEMVEFMEKVAKTVDSEELTVEERNLLSVAYKNVIGARRASWRIISSIEQK 68 (244) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 999999999997389999999999997457877999999999999874116329999998599986 No 295 >COG3168 PilP Tfp pilus assembly protein PilP [Cell motility and secretion / Intracellular trafficking and secretion] Probab=35.74 E-value=24 Score=14.30 Aligned_cols=19 Identities=32% Similarity=0.369 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999972268753 Q gi|254781174|r 25 TIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~~~~ 43 (271) ++.++++++.++||+.... T Consensus 3 ~~~~l~slL~lvgC~~~~d 21 (170) T COG3168 3 LILLLISLLALVGCGAGSD 21 (170) T ss_pred CHHHHHHHHHHHHCCCCCC T ss_conf 0368899999975046884 No 296 >PRK12701 flgH flagellar basal body L-ring protein; Reviewed Probab=35.67 E-value=26 Score=14.06 Aligned_cols=20 Identities=20% Similarity=0.202 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999972268753 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~ 43 (271) .+++.+++.++|+||++... T Consensus 5 ~~~~~~la~~lL~GC~~~~~ 24 (230) T PRK12701 5 NIAVSCLATALLFGCEALHP 24 (230) T ss_pred HHHHHHHHHHHHHCCCCCCC T ss_conf 99999999999755668899 No 297 >PRK13733 conjugal transfer protein TraV; Provisional Probab=34.99 E-value=10 Score=16.48 Aligned_cols=40 Identities=18% Similarity=0.125 Sum_probs=25.2 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCC-CCHHHHHHHHHHHHH Q ss_conf 9999997226875333445677767-368999999999998 Q gi|254781174|r 30 IAVCFLVGWERQSSRDVYLDSVTDV-RYQREVYEKAVLFLK 69 (271) Q Consensus 30 i~~~~l~~Cs~~~~~~~~~~~~~~~-~~~~~lY~~a~~~~~ 69 (271) ...++|+||...+++.......+|. .+.++..++|...-+ T Consensus 11 ~~~llLtGCAG~nSdFdCnaTtsDtCmTMeqANe~A~~~~~ 51 (171) T PRK13733 11 LGTLLLSGCAGTNSEFECNATTSDTCMTMEQANEKAKKLEQ 51 (171) T ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCEEHHHHHHHHHHHHH T ss_conf 26002602457775454699877620629999999986540 No 298 >TIGR00620 sporelyase spore photoproduct lyase; InterPro: IPR004594 Members of this family are involved in the direct repair of UV induced spore photoproducts (thymine dimer 5-thyminyl-5,6-dihydrothymine). The protein repairs UV radiation-induced DNA damage during spore germination by monomerization of these thymine dimers.. Probab=34.28 E-value=20 Score=14.76 Aligned_cols=63 Identities=17% Similarity=0.219 Sum_probs=42.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCH Q ss_conf 45899999999999852001013456555555566666665556666654432234568999886528-9981 Q gi|254781174|r 153 RATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPRFQLVLANY-SDAE 224 (271) Q Consensus 153 ~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~~~~~i~~y-p~t~ 224 (271) +.|..|.... .+.||++..--|-..| +.-...+.+|+|=|++.+-.+-...+..++++| |.+. T Consensus 131 RfT~~AK~vI---~~~yP~~KL~lD~EKR------~yK~G~yG~GKY~Y~K~~~~~l~~~l~~~ID~yFP~~~ 194 (199) T TIGR00620 131 RFTKPAKRVI---EKNYPKTKLELDEEKR------RYKYGRYGIGKYIYKKDEEHELREALEEYIDKYFPEAK 194 (199) T ss_pred CCCCHHHHHH---HHHCCCCCCCCCCCCC------CCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHCCCCC T ss_conf 4351578988---7607887787620011------12355323540231122379999999998652178650 No 299 >PRK12798 chemotaxis protein; Reviewed Probab=34.24 E-value=27 Score=13.92 Aligned_cols=117 Identities=11% Similarity=0.015 Sum_probs=73.6 Q ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 36899999999999809899999999999853047730268998778877654555679999988754011211013555 Q gi|254781174|r 55 RYQREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVY 134 (271) Q Consensus 55 ~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~ 134 (271) ..+..+ -.|...+-+|+-.+|.+.+..+...+-.+..++...+-.| ......+...|+..++..--.-|. ..+++|- T Consensus 111 ~~d~~l-~~g~laY~~Gr~~ea~~~L~~i~~~~lp~~lg~~laLv~a-~l~~~~dP~~Al~~lD~aRL~aPG-TLvEEAA 187 (416) T PRK12798 111 EVDQRL-ADGALAYLSGRGREARKLLSGVDPETLPAELGAYLALVQG-NLMAATDPATALDRLDQARLLAPG-TLVEEAA 187 (416) T ss_pred HHHHHH-HHHHHHHHCCCHHHHHHHHCCCCHHHCCCHHHHHHHHHHH-HHHCCCCHHHHHHHHHHHHHHCCC-HHHHHHH T ss_conf 032999-9999999769879999885137801047224577999988-875136889999998899981881-5899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 44344455554431112345899999999999852001013456555 Q gi|254781174|r 135 YLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFY 181 (271) Q Consensus 135 y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~ 181 (271) .++.+..-.+ .+...+....-..++.+|+.|+|+.+=..+ T Consensus 188 LRR~i~la~~-------~g~~~r~~~~a~~Y~rRf~~S~Ya~~F~~~ 227 (416) T PRK12798 188 LRRSLFIAAQ-------LGDADKFEALSRNYLRRFRHSPYAGNFAQR 227 (416) T ss_pred HHHHHHHHHH-------CCCHHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 9877699986-------376789999999999985068006789999 No 300 >pfam06085 Rz1 Lipoprotein Rz1 precursor. This family consists of several bacteria and phage lipoprotein Rz1 precursors. Rz1 is a proline-rich lipoprotein from bacteriophage lambda which is known to have fusogenic properties. Rz1-induced liposome fusion is thought to be mediated primarily by the generation of local perturbation in the bilayer lipid membrane and to a lesser extent by electrostatic forces. Probab=34.14 E-value=25 Score=14.16 Aligned_cols=23 Identities=0% Similarity=0.059 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999999999722687533 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) ....++..++.+.+++|++...- T Consensus 4 lkm~l~~~m~~L~vsaC~S~p~v 26 (59) T pfam06085 4 LKMKLCVMMLPLVVSACSSKPPV 26 (59) T ss_pred CHHHHHHHHHHHHHHHHCCCCCC T ss_conf 02569999989999982699986 No 301 >pfam03304 Mlp Mlp lipoprotein family. The Mlp (for Multicopy Lipoprotein) family of lipoproteins is found in Borrelia species. This family were previously known as 2.9 lipoprotein genes. These surface expressed genes may represent new candidate vaccinogens for Lyme disease. Members of this family generally are downstream of four ORFs called A,B,C and D that are involved in hemolytic activity. Probab=33.73 E-value=18 Score=15.01 Aligned_cols=19 Identities=21% Similarity=0.494 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999997226875 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQS 42 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~ 42 (271) +-|++.|+++.|.||.+.. T Consensus 4 INilfclfll~Ln~Cnsnd 22 (150) T pfam03304 4 INILFCLFLLMLNGCNSND 22 (150) T ss_pred EHHHHHHHHHHHCCCCCCC T ss_conf 1499999999982656786 No 302 >COG3065 Slp Starvation-inducible outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=33.72 E-value=28 Score=13.87 Aligned_cols=21 Identities=10% Similarity=0.056 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999722687533 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~~ 44 (271) +..++..+++.|+||.+-.+. T Consensus 9 ~~~l~~~laflLsgC~tiPk~ 29 (191) T COG3065 9 KGALIGTLAFLLSGCVTIPKA 29 (191) T ss_pred HHHHHHHHHHHHHHCCCCCHH T ss_conf 999999999997630357831 No 303 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=33.72 E-value=28 Score=13.87 Aligned_cols=32 Identities=13% Similarity=0.011 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999999999999999997226875333 Q gi|254781174|r 14 AWAYQLYKFALTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 14 ~~~~~m~k~~~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) |-.-+..+++.+++++..+++|.||+...... T Consensus 7 ~~i~~k~~t~k~L~~laa~~lLagC~a~~~tl 38 (204) T COG3056 7 CVIESKNMTKKILFPLAAIFLLAGCAAPPTTL 38 (204) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEE T ss_conf 88761788999999999999997537997503 No 304 >TIGR03362 VI_chp_7 type VI secretion-associated protein, VC_A0119 family. This protein family is one of two related families in type VI secretion systems that contain an ImpA-related N-terminal domain (pfam06812). Probab=33.40 E-value=28 Score=13.84 Aligned_cols=176 Identities=13% Similarity=0.077 Sum_probs=91.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHCCCCCHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999----99887540112110135 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAAS----LGEEYITQYPESKNVDY 132 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~----~~~~fi~~~P~s~~~~~ 132 (271) +.+.-..-...+.++++.+.++.-+..... +++==+.+++.+.+.-.+| |+.+.. .+..|+.+.|.-.. T Consensus 99 ~ad~~~~y~~~~~q~~~~~Ll~~~E~sl~~---~pfWLDgq~~~a~al~~LG-~~~~a~aI~~el~~fL~RlP~L~~--- 171 (301) T TIGR03362 99 PADRVADYQELLAQADWAALLQRVEQSLSL---APFWLDGQRLSAQALERLG-YAAVAQAIRDELAAFLERLPGLLE--- 171 (301) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHHHHHHH---CCHHHHHHHHHHHHHHHCC-CHHHHHHHHHHHHHHHHHCCCHHH--- T ss_conf 989999999999779989999999999882---7247359999999998768-599999999999999986776043--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 55443444555544311123458999999999998520010134565555555666666655566666544322345689 Q gi|254781174|r 133 VYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQLAAKEVEIGRYYLKRGEYVAAIPR 212 (271) Q Consensus 133 A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~La~~e~~ia~~Y~~~~~y~aA~~~ 212 (271) +++-.-.+- .-.............-+.+.-+..+..-. .....+ ....+.--...+.-.+|+.. T Consensus 172 ------L~F~DGtPF------ad~~T~~WL~~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~eA~~l~~~~gl~~Al~~ 235 (301) T TIGR03362 172 ------LKFSDGTPF------ADDETRAWLAQHATRSNAASVAPVAEVGE---ESDWEE-LREEARALAAEGGLEAALQR 235 (301) T ss_pred ------HCCCCCCCC------CCHHHHHHHHHCCCCCCCCCCCCCCCCCC---CCCHHH-HHHHHHHHHHCCCHHHHHHH T ss_conf ------023799987------98889999986034677776677777776---644899-99999999974799999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 9988652899811599999999999974987999999999978 Q gi|254781174|r 213 FQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQE 255 (271) Q Consensus 213 ~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~ 255 (271) ++.-+..-+.....--....+++....-|..+.|...+..|.. T Consensus 236 L~~~l~~~~s~R~rf~~rL~~ArL~~~~g~~~lA~~~l~~L~~ 278 (301) T TIGR03362 236 LQQRLAQAREPRERFHWRLLLARLLEQAGKAELAQQLYAALDQ 278 (301) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 9975324899799999999999999986987999999999999 No 305 >pfam09533 DUF2380 Predicted lipoprotein of unknown function (DUF2380). This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown. Probab=33.34 E-value=28 Score=13.84 Aligned_cols=24 Identities=17% Similarity=0.286 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999999999997226875333 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) ..+.+.+.++.++++||++..... T Consensus 3 ~~l~~~l~vLal~~~GCasaa~~p 26 (220) T pfam09533 3 RALVLWLLVLALLWVGCASAAPTP 26 (220) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 689999999999984501578986 No 306 >PRK09193 indolepyruvate ferredoxin oxidoreductase; Validated Probab=32.87 E-value=29 Score=13.79 Aligned_cols=88 Identities=14% Similarity=0.089 Sum_probs=51.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH------HH----HHHHHHHHHHHHHHHHHHHH Q ss_conf 987788776545556799999887540112110135554434445555------44----31112345899999999999 Q gi|254781174|r 97 LLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQM------IR----DVPYDQRATKLMLQYMSRIV 166 (271) Q Consensus 97 ~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~------~~----~~~~d~~~~~~A~~~f~~~i 166 (271) ++++|+++.+ |--.-.....++.|+++.-.-...-.-|..|.-.... .. ..........+-+......+ T Consensus 878 m~lLG~AwQk-GlIPlS~eAIe~AIeLNGvaVe~Nl~AF~wGR~aa~dP~~v~~~~~~~~~~~~~~~sLdelI~~R~~~L 956 (1155) T PRK09193 878 MFMLGYAWQK-GLLPLSAAAIERAIELNGVAVEMNLAAFRWGRRAAHDPAAVAALAPAAVIAAPLPPSLDELIARRVAFL 956 (1155) T ss_pred HHHHHHHHHC-CCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 9999999985-887789999999998569457889999999999852999999762245666678989999999999999 Q ss_pred HHHHCCHHHHHHHHHHHHH Q ss_conf 8520010134565555555 Q gi|254781174|r 167 ERYTNSPYVKGARFYVTVG 185 (271) Q Consensus 167 ~~yP~S~ya~~A~~~l~~~ 185 (271) ..|.|..|+..-...+..+ T Consensus 957 t~YQn~~YA~rY~~~V~~v 975 (1155) T PRK09193 957 TAYQNAAYAARYRALVERV 975 (1155) T ss_pred HHHCCHHHHHHHHHHHHHH T ss_conf 9805999999999999999 No 307 >pfam04212 MIT MIT (microtubule interacting and transport) domain. The MIT domain forms an asymmetric three-helix bundle and binds ESCRT-III (endosomal sorting complexes required for transport) substrates. Probab=32.69 E-value=29 Score=13.77 Aligned_cols=30 Identities=23% Similarity=0.262 Sum_probs=18.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 899999999999809899999999999853 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRD 86 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~ 86 (271) +..+..+|...-+.|+|.+|+..|.+.+.. T Consensus 5 A~~~~~~Av~~D~~g~y~~A~~~Y~~ai~~ 34 (69) T pfam04212 5 ALELVKKAVEADEAGNYEEALELYKEAIEY 34 (69) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 999999999988869999999999999999 No 308 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=32.01 E-value=30 Score=13.71 Aligned_cols=27 Identities=19% Similarity=0.271 Sum_probs=20.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999999999722687533 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) .|.|.++.+++.+.++.|+||...... T Consensus 1 ~mkk~~~~~~~a~g~~~l~GC~~r~~~ 27 (234) T PRK10523 1 MMKKAILTALAAVGLFALMGCNNRAEV 27 (234) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 905899999999999987634665434 No 309 >TIGR00985 3a0801s04tom mitochondrial import receptor subunit; InterPro: IPR016043 Virtually all mitochondrial precursors are imported via the same mechanism : precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 .. Probab=32.00 E-value=30 Score=13.70 Aligned_cols=59 Identities=12% Similarity=0.061 Sum_probs=33.9 Q ss_pred CHHHHHHHHHHHHHHCCC--CCCHHHHH----HHHHHHHHHHH--HHHHHHHHHHHHHHHCCCCCHH Q ss_conf 899999999999853047--73026899----87788776545--5567999998875401121101 Q gi|254781174|r 72 NFSKAYEYFNQCSRDFPF--AGVARKSL----LMSAFVQYSAG--KYQQAASLGEEYITQYPESKNV 130 (271) Q Consensus 72 ~y~~A~~~f~~i~~~~P~--s~~a~~A~----~~la~~~y~~~--~y~~A~~~~~~fi~~~P~s~~~ 130 (271) +.+.-++..-.....-|. ++...++. ...|+-+..++ +..+|...|-+.|..||+.... T Consensus 65 k~~~~~r~~~~~~~~~P~p~d~~e~e~fFl~~V~~GE~L~~q~P~~~~e~a~~fy~Al~Vyp~P~~L 131 (155) T TIGR00985 65 KLRKVVREAVAELAKEPDPSDAEEKEAFFLEEVQLGEELLAQQPDKVKEAALHFYKALKVYPQPQDL 131 (155) T ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHH T ss_conf 9989999999876407899988789879999999877850489951789999999887627992489 No 310 >PRK06760 hypothetical protein; Provisional Probab=31.60 E-value=17 Score=15.19 Aligned_cols=20 Identities=0% Similarity=0.112 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999972268753 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~ 43 (271) +.++-++++..+++||.... T Consensus 7 i~i~~iis~~~lsaCS~~~~ 26 (223) T PRK06760 7 IFMLTILLLISFSACSKKEN 26 (223) T ss_pred EHHHHHHHHHHHHHCCCCCC T ss_conf 49999999998400336787 No 311 >PHA00407 phage lambda Rz1-like protein Probab=31.56 E-value=30 Score=13.66 Aligned_cols=30 Identities=27% Similarity=0.202 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 999999999999999722687533344567 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSSRDVYLDS 50 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~ 50 (271) |-.++-.+++....++||.+.+.-..+..+ T Consensus 32 kaaLIGlllicv~tISGCaSes~lp~ep~k 61 (84) T PHA00407 32 KAALIGLLLICVATISGCASESKLPVEPQK 61 (84) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 999999999999997666502569999642 No 312 >COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only] Probab=31.53 E-value=30 Score=13.66 Aligned_cols=81 Identities=14% Similarity=0.224 Sum_probs=61.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 89999999999980989999999999985304773026899877---887765455567999998875401121101355 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMS---AFVQYSAGKYQQAASLGEEYITQYPESKNVDYV 133 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~l---a~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A 133 (271) ++-.-+-+....+.|-|.+--+.|.+....||... .+|+ ++-++..++...+...+.+-++.+|++|.+=.+ T Consensus 107 ~k~w~~y~~Y~~k~k~y~~~~nI~~~~l~khP~nv-----dlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~~p~iw~e 181 (435) T COG5191 107 PKIWSQYAAYVIKKKMYGEMKNIFAECLTKHPLNV-----DLWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWIE 181 (435) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-----EEEEEECCCHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 08999999999998789899999999996199874-----0565630003544156798999998653148898429999 Q ss_pred HHHHHHHHH Q ss_conf 544344455 Q gi|254781174|r 134 YYLVGMSYA 142 (271) Q Consensus 134 ~y~~a~~~~ 142 (271) ++..-+.|. T Consensus 182 yfr~El~yi 190 (435) T COG5191 182 YFRMELMYI 190 (435) T ss_pred HHHHHHHHH T ss_conf 999999999 No 313 >KOG1310 consensus Probab=30.89 E-value=31 Score=13.59 Aligned_cols=30 Identities=20% Similarity=0.029 Sum_probs=18.2 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCC Q ss_conf 999999998098999999999998530477 Q gi|254781174|r 61 YEKAVLFLKEQNFSKAYEYFNQCSRDFPFA 90 (271) Q Consensus 61 Y~~a~~~~~~~~y~~A~~~f~~i~~~~P~s 90 (271) .++|...+..+.+..|+..|...+...|.. T Consensus 378 ~~egnd~ly~~~~~~~i~~~s~a~q~~~~~ 407 (758) T KOG1310 378 KTEGNDGLYESIVSGAISHYSRAIQYVPDA 407 (758) T ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 860653103688877899988776645502 No 314 >pfam04910 DUF654 Protein of unknown function, DUF654. This family includes a number of poorly characterized eukaryotic proteins. Probab=30.15 E-value=32 Score=13.52 Aligned_cols=107 Identities=12% Similarity=0.138 Sum_probs=56.7 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 99999999999809899999999999853047-73026899877887765455567999998875401121101355544 Q gi|254781174|r 58 REVYEKAVLFLKEQNFSKAYEYFNQCSRDFPF-AGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 58 ~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~-s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) ..+|.--...-++|=|..|.+.-+-+.+.-|. .|++. .+.+=+---+.++|+-=+...+.+... .+-...|.-.|- T Consensus 101 lal~r~i~~L~~Rgc~rTAlE~~KLLLsLdp~~DP~g~--ll~ID~~ALrs~ey~~li~f~~~~~~~-~~~~~lPn~afS 177 (343) T pfam04910 101 LALFRYIQSLGRRGCWRTALEWCKLLLSLDPEEDPLGA--LLLIDYYALRAKEYEWLIDFSESPLAS-RNLSLLPNFAYS 177 (343) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHH--HHHHHHHHHHCCCHHHHHHHHHHHHCC-CCHHHCCCHHHH T ss_conf 99999999988665489999999999844988881367--999999999635377999999735303-406558487899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 34445555443111234589999999999985200 Q gi|254781174|r 137 VGMSYAQMIRDVPYDQRATKLMLQYMSRIVERYTN 171 (271) Q Consensus 137 ~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~yP~ 171 (271) .|++++.... +.+....|...+++.+..||. T Consensus 178 ~aLA~~~l~~----~~~~~~~A~~~L~~Ai~~fP~ 208 (343) T pfam04910 178 VALALFLLEK----DDGLSESARALLLRAILRFPW 208 (343) T ss_pred HHHHHHHHHC----CCCCCHHHHHHHHHHHHHHHH T ss_conf 9999999748----666607899999999998499 No 315 >KOG2396 consensus Probab=28.67 E-value=34 Score=13.37 Aligned_cols=73 Identities=14% Similarity=0.184 Sum_probs=53.8 Q ss_pred HCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 0989999999999985304773026899877887765455-567999998875401121101355544344455554 Q gi|254781174|r 70 EQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGK-YQQAASLGEEYITQYPESKNVDYVYYLVGMSYAQMI 145 (271) Q Consensus 70 ~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~-y~~A~~~~~~fi~~~P~s~~~~~A~y~~a~~~~~~~ 145 (271) .+.+.+-...|.+....||.++- -=...|.=.|..+. .+.|...+.+-++.+|++|..-..++..-+.+.... T Consensus 118 ~~~~~~v~ki~~~~l~~Hp~~~d---LWI~aA~wefe~n~ni~saRalflrgLR~npdsp~Lw~eyfrmEL~~~~Kl 191 (568) T KOG2396 118 KKTYGEVKKIFAAMLAKHPNNPD---LWIYAAKWEFEINLNIESARALFLRGLRFNPDSPKLWKEYFRMELMYAEKL 191 (568) T ss_pred HCCHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 42246899999999985899830---688665568750323589999999776418997689999999999999999 No 316 >PRK10461 thiamine biosynthesis lipoprotein ApbE; Provisional Probab=28.35 E-value=35 Score=13.33 Aligned_cols=17 Identities=24% Similarity=0.342 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 99999999972268753 Q gi|254781174|r 27 FFSIAVCFLVGWERQSS 43 (271) Q Consensus 27 ~~~i~~~~l~~Cs~~~~ 43 (271) .++.++++++||+...+ T Consensus 9 ~ll~~~~~l~gC~~~~~ 25 (351) T PRK10461 9 ALLAAALLLVGCDQAPQ 25 (351) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 99999999987248887 No 317 >smart00745 MIT Microtubule Interacting and Trafficking molecule domain. Probab=27.87 E-value=35 Score=13.28 Aligned_cols=24 Identities=21% Similarity=0.365 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 999999999980989999999999 Q gi|254781174|r 59 EVYEKAVLFLKEQNFSKAYEYFNQ 82 (271) Q Consensus 59 ~lY~~a~~~~~~~~y~~A~~~f~~ 82 (271) .+-.+|...-+.|+|.+|+..|.. T Consensus 10 ~~v~~Av~~D~~g~y~eA~~~Y~~ 33 (77) T smart00745 10 ELISKALKADEAGDYEEALELYKK 33 (77) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 999999998781889999999999 No 318 >pfam02184 HAT HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins. Probab=27.75 E-value=32 Score=13.52 Aligned_cols=21 Identities=24% Similarity=0.439 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHHHHHCCCCCH Q ss_conf 556799999887540112110 Q gi|254781174|r 109 KYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 109 ~y~~A~~~~~~fi~~~P~s~~ 129 (271) +.+.|...|++|+-.||+-.+ T Consensus 2 E~drAR~iyerFv~vHpevkn 22 (32) T pfam02184 2 EIDRARGIYERFVHVHPEVQN 22 (32) T ss_pred HHHHHHHHHHHHEEECCCHHH T ss_conf 167899999872053744252 No 319 >PRK10760 murein hydrolase B; Provisional Probab=27.72 E-value=36 Score=13.27 Aligned_cols=16 Identities=6% Similarity=0.021 Sum_probs=10.8 Q ss_pred HHHHHHHHHCCCCCCC Q ss_conf 9999999722687533 Q gi|254781174|r 29 SIAVCFLVGWERQSSR 44 (271) Q Consensus 29 ~i~~~~l~~Cs~~~~~ 44 (271) +.++++|.+||+.... T Consensus 8 ~~~~~~l~~css~p~~ 23 (357) T PRK10760 8 LPLFVLLAACSSKPKP 23 (357) T ss_pred HHHHHHHHHHCCCCCC T ss_conf 9999999985189999 No 320 >pfam07268 EppA_BapA Exported protein precursor (EppA/BapA). This family consists of a number of exported protein precursor (EppA and BapA) sequences which seem to be specific to Borrelia burgdorferi (Lyme disease spirochete). bapA gene sequences are quite stable but the encoded proteins do not provoke a strong immune response in most individuals. Conversely, EppA proteins are much more antigenic but are more variable in sequence. It is thought that BapA and EppA play important roles during the Borrelia burgdorferi infectious cycle. Probab=27.70 E-value=36 Score=13.27 Aligned_cols=24 Identities=17% Similarity=0.228 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 899999999999809899999999 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYF 80 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f 80 (271) -+.-|.+|...|...++.---+.+ T Consensus 26 i~~NYakAkk~fSkedfnLIkkRL 49 (163) T pfam07268 26 IEKNYAKAKKAFSKEDFNLIKKRL 49 (163) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 888799999852787799998650 No 321 >PRK10215 hypothetical protein; Provisional Probab=27.70 E-value=23 Score=14.33 Aligned_cols=21 Identities=19% Similarity=0.051 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999999999997226875 Q gi|254781174|r 22 FALTIFFSIAVCFLVGWERQS 42 (271) Q Consensus 22 ~~~~i~~~i~~~~l~~Cs~~~ 42 (271) +...++++.+.++|+||-+.. T Consensus 8 ~~k~~~l~~~~f~LSgC~t~~ 28 (219) T PRK10215 8 FFKAAGLLPLAFFLSGCISFG 28 (219) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 444524766899998776655 No 322 >COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only] Probab=27.69 E-value=36 Score=13.26 Aligned_cols=162 Identities=12% Similarity=0.090 Sum_probs=76.3 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 899999999999809899999999999853047730268998778877654----5556799999887540112110135 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSA----GKYQQAASLGEEYITQYPESKNVDY 132 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~----~~y~~A~~~~~~fi~~~P~s~~~~~ 132 (271) ....+..+......+++..|...++..... ....+.+.++..|..- .+..+|...|.... ....+. T Consensus 41 ~~~~~~~~~~~~~~~~~~~a~~~~~~a~~~-----~~~~a~~~l~~~y~~g~gv~~~~~~A~~~~~~~a-----~~g~~~ 110 (292) T COG0790 41 LKSALLNGAGSAYPPDYAKALKSYEKAAEL-----GDAAALALLGQMYGAGKGVSRDKTKAADWYRCAA-----ADGLAE 110 (292) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHC-----CCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH-----HCCCHH T ss_conf 887665310000221489889999988643-----5476789999998635552013899999999987-----369799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHHH------------------------- Q ss_conf 554434445555443111234589999999999985-200101345655555556------------------------- Q gi|254781174|r 133 VYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVER-YTNSPYVKGARFYVTVGR------------------------- 186 (271) Q Consensus 133 A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~-yP~S~ya~~A~~~l~~~~------------------------- 186 (271) +.+..|..+..-.. +.+ +..+|...|+..... .+.. ..+...+..+. T Consensus 111 a~~~lg~~~~~G~g-v~~---d~~~A~~~~~~Aa~~g~~~a---~~~~~~l~~~y~~g~~~~~~~~~~~~A~~~~~~aa~ 183 (292) T COG0790 111 ALFNLGLMYANGRG-VPL---DLVKALKYYEKAAKLGNVEA---ALAMYRLGLAYLSGLQALAVAYDDKKALYLYRKAAE 183 (292) T ss_pred HHHHHHHHHHCCCC-CCC---CHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99999999855899-745---89999999999998479305---999999999861886321215148999999999998 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 6666665556666654----432234568999886528998115999999999999749 Q gi|254781174|r 187 NQLAAKEVEIGRYYLK----RGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALA 241 (271) Q Consensus 187 ~~La~~e~~ia~~Y~~----~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg 241 (271) .......+.+|..|.. ..++..|+.-|+...+.=. ..+.++++ .++.-| T Consensus 184 ~~~~~a~~~lg~~y~~G~Gv~~d~~~A~~wy~~Aa~~g~-----~~a~~~~~-~~~~~g 236 (292) T COG0790 184 LGNPDAQLLLGRMYEKGLGVPRDLKKAFRWYKKAAEQGD-----GAACYNLG-LMYLNG 236 (292) T ss_pred HCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCC-----HHHHHHHH-HHHHCC T ss_conf 367999999999997588876689999999999997698-----99999999-998668 No 323 >COG5645 Predicted periplasmic lipoprotein [General function prediction only] Probab=27.65 E-value=27 Score=13.93 Aligned_cols=12 Identities=25% Similarity=0.135 Sum_probs=8.2 Q ss_pred HHHHHCCCCCCC Q ss_conf 999722687533 Q gi|254781174|r 33 CFLVGWERQSSR 44 (271) Q Consensus 33 ~~l~~Cs~~~~~ 44 (271) +.++||++--+- T Consensus 12 lllSGC~SV~s~ 23 (80) T COG5645 12 LLLSGCGSVISL 23 (80) T ss_pred HHHCCCCEEEEE T ss_conf 983764103774 No 324 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=27.62 E-value=36 Score=13.26 Aligned_cols=19 Identities=16% Similarity=0.153 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999972268753 Q gi|254781174|r 25 TIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~~~~ 43 (271) +++.++++++|.||+.... T Consensus 3 ~l~~~~a~l~LsGCa~~~~ 21 (231) T PRK12788 3 LLVAILACLALAGCANNLA 21 (231) T ss_pred HHHHHHHHHHHHHHCCCCC T ss_conf 8999999999764038743 No 325 >COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown] Probab=27.32 E-value=20 Score=14.70 Aligned_cols=22 Identities=9% Similarity=0.168 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9999999999972268753334 Q gi|254781174|r 25 TIFFSIAVCFLVGWERQSSRDV 46 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~~~~~~~ 46 (271) +.+.+.+.++|.+|+++...+. T Consensus 6 ~~~~~~~~~LL~aCg~sd~s~~ 27 (147) T COG4939 6 LVGMIVALSLLTACGKSDFSKM 27 (147) T ss_pred HHHHHHHHHHHHHHCCCCCCCC T ss_conf 6599999999987066644554 No 326 >pfam09577 Spore_YpjB Sporulation protein YpjB (SpoYpjB). These proteins are found in the endospore-forming bacteria which include Bacillus species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set. Probab=26.90 E-value=37 Score=13.18 Aligned_cols=58 Identities=16% Similarity=0.191 Sum_probs=38.1 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999980989999999999985304773026899877887765455567999998 Q gi|254781174|r 60 VYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGE 118 (271) Q Consensus 60 lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~ 118 (271) +=..+..+-++|+|++|.+.+..+.++|+......+- +-+....--.--|++|...+. T Consensus 8 lsD~~Lql~K~~~yeeAkqvL~yFs~~F~~~~~~~~~-~t~~elR~vT~~~~~a~~al~ 65 (232) T pfam09577 8 LSDTVLQLTKQQKYEEAKQVLQYFSKQFLSADYEDRE-LTMDELRQVTLAYEDAVKALT 65 (232) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-CCHHHHHHHHHHHHHHHHHHH T ss_conf 5899999987300889999999999998746354467-689999999999999999986 No 327 >cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear. Probab=26.85 E-value=37 Score=13.17 Aligned_cols=30 Identities=20% Similarity=0.310 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 899999999999809899999999999853 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRD 86 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~ 86 (271) +.++-.+|...-+.|+|.+|+..|...+.. T Consensus 6 A~~l~~~Av~~D~~g~y~eA~~lY~~ai~~ 35 (75) T cd02656 6 AKELIKQAVKEDEDGNYEEALELYKEALDY 35 (75) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 999999999988848899999999999999 No 328 >COG3521 Predicted component of the type VI protein secretion system [Intracellular trafficking, secretion, and vesicular transport] Probab=26.72 E-value=37 Score=13.16 Aligned_cols=14 Identities=21% Similarity=0.237 Sum_probs=8.4 Q ss_pred HHHHHHHCCCCCCC Q ss_conf 99999722687533 Q gi|254781174|r 31 AVCFLVGWERQSSR 44 (271) Q Consensus 31 ~~~~l~~Cs~~~~~ 44 (271) +++.++|||+..+. T Consensus 13 al~~~sgCsss~~~ 26 (159) T COG3521 13 ALLVLSGCSSSKPL 26 (159) T ss_pred HHHHHHHHCCCCCC T ss_conf 99985241258877 No 329 >pfam10516 SHNi-TPR SHNi-TPR. SHNi-TPR family members contain a reiterated sequence motif that is an interrupted form of TPR repeat. Probab=26.16 E-value=38 Score=13.10 Aligned_cols=24 Identities=21% Similarity=0.264 Sum_probs=14.2 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 999999980989999999999985 Q gi|254781174|r 62 EKAVLFLKEQNFSKAYEYFNQCSR 85 (271) Q Consensus 62 ~~a~~~~~~~~y~~A~~~f~~i~~ 85 (271) ..|...+++++|.+|+..|++.+. T Consensus 6 ~LgEislE~e~F~qA~~D~~~~L~ 29 (38) T pfam10516 6 LLGEISLENENFPQAVEDLRKALE 29 (38) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 777865401560778999999999 No 330 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=26.03 E-value=38 Score=13.09 Aligned_cols=23 Identities=22% Similarity=0.118 Sum_probs=14.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999999997226875333 Q gi|254781174|r 23 ALTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 23 ~~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) ++.|.++-++++|+||+...... T Consensus 4 tl~i~~ta~vliLs~C~~~~dd~ 26 (382) T COG4851 4 TLGIAATASVLILSGCFPFVDDT 26 (382) T ss_pred HHHHHHHHHHHHHHHCCCCCCCC T ss_conf 15589999999986116766873 No 331 >PRK02463 OxaA-like protein precursor; Provisional Probab=25.89 E-value=38 Score=13.07 Aligned_cols=66 Identities=11% Similarity=0.075 Sum_probs=34.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH--HHHHHHHHH---HCCHHHHHHHHHHHHHH--CCC Q ss_conf 99999999999999997226875333445677767368999--999999998---09899999999999853--047 Q gi|254781174|r 20 YKFALTIFFSIAVCFLVGWERQSSRDVYLDSVTDVRYQREV--YEKAVLFLK---EQNFSKAYEYFNQCSRD--FPF 89 (271) Q Consensus 20 ~k~~~~i~~~i~~~~l~~Cs~~~~~~~~~~~~~~~~~~~~l--Y~~a~~~~~---~~~y~~A~~~f~~i~~~--~P~ 89 (271) .|..+..+++.++++|+||+..+... +++......-. .......+. .++|.-||-.+.-+++. .|- T Consensus 6 k~~~~~~~~~~~~l~LsgC~~~~~~~----~~~G~~~~~~v~p~~~~i~~~a~~~g~nyG~aIIl~TiiIRlillPL 78 (307) T PRK02463 6 KRILFSGLALSMLLTLTGCVGRDKHG----NPTGMIWNFLGKPMSYFIDYFANNLGLGFGLAIIIVTIIVRTLILPL 78 (307) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCC----CCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999999996346899999----97740899999999999999998638773389999999999999830 No 332 >pfam07721 TPR_4 Tetratricopeptide repeat. This Pfam entry includes tetratricopeptide-like repeats not detected by the pfam00515, pfam07719 and pfam07720 models. Probab=25.78 E-value=39 Score=13.06 Aligned_cols=19 Identities=26% Similarity=0.272 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8778877654555679999 Q gi|254781174|r 98 LMSAFVQYSAGKYQQAASL 116 (271) Q Consensus 98 ~~la~~~y~~~~y~~A~~~ 116 (271) ..+|.++...|++++|... T Consensus 5 ~~la~~~~~~G~~~~A~~~ 23 (26) T pfam07721 5 LALARALLALGDLDEARAL 23 (26) T ss_pred HHHHHHHHHHCCHHHHHHH T ss_conf 9999999980677889988 No 333 >TIGR00752 slp outer membrane lipoprotein, Slp family; InterPro: IPR004658 Slp superfamily members are present in the Gram-negative gamma proteobacteria Escherichia coli (which also contains a close paralog), Haemophilus influenzae and Pasteurella multocida and Vibrio cholerae. The known members of the family to date share a motif LX[GA]C near the N-terminus, which is compatible with the possibility that the protein is modified into a lipoprotein with Cys as the new N-terminus. Slp from E. coli is known to be a lipoprotein of the outer membrane and to be expressed in response to carbon starvation.; GO: 0019867 outer membrane. Probab=25.71 E-value=39 Score=13.05 Aligned_cols=22 Identities=9% Similarity=0.073 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9999999999972268753334 Q gi|254781174|r 25 TIFFSIAVCFLVGWERQSSRDV 46 (271) Q Consensus 25 ~i~~~i~~~~l~~Cs~~~~~~~ 46 (271) .+++.-+++.|+||++-.+... T Consensus 5 g~l~~~l~f~L~gC~~~P~~ik 26 (190) T TIGR00752 5 GLLITALAFLLTGCIAVPKAIK 26 (190) T ss_pred HHHHHHHHHHHHHHCCCCCCCC T ss_conf 4568889988753146742345 No 334 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=25.55 E-value=26 Score=14.09 Aligned_cols=22 Identities=23% Similarity=0.200 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999999999997226875333 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQSSRD 45 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~~~~~ 45 (271) +.++++++.++|++|-.+--.+ T Consensus 5 ~~~~i~~~~~~L~aCQaNyiRD 26 (46) T pfam02402 5 LFIGILLLTVLLSACQANYIRD 26 (46) T ss_pred EEHHHHHHHHHHHHHHHHHEEE T ss_conf 0139999999999855504122 No 335 >PRK12699 flgH flagellar basal body L-ring protein; Reviewed Probab=25.48 E-value=39 Score=13.03 Aligned_cols=24 Identities=8% Similarity=0.072 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999999722687533 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) +..+-.+++++++++.||+....+ T Consensus 16 ~~~~~~~~~~~~~~l~gC~~~~~~ 39 (246) T PRK12699 16 GRLLGPVLIVMLALVGGCSLPTPA 39 (246) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCC T ss_conf 324888999999986044689998 No 336 >KOG2041 consensus Probab=25.25 E-value=39 Score=13.00 Aligned_cols=75 Identities=12% Similarity=0.146 Sum_probs=47.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 89999999999980989999999999985304773026899877887765455567999998875401121101355544 Q gi|254781174|r 57 QREVYEKAVLFLKEQNFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAGKYQQAASLGEEYITQYPESKNVDYVYYL 136 (271) Q Consensus 57 ~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~~y~~A~~~~~~fi~~~P~s~~~~~A~y~ 136 (271) ..+.=+-|....+...|.+|.+.|..--.. -..++|+|..++|++-......+-+.+|--|.+.++.-. T Consensus 796 e~A~r~ig~~fa~~~~We~A~~yY~~~~~~-----------e~~~ecly~le~f~~LE~la~~Lpe~s~llp~~a~mf~s 864 (1189) T KOG2041 796 EDAFRNIGETFAEMMEWEEAAKYYSYCGDT-----------ENQIECLYRLELFGELEVLARTLPEDSELLPVMADMFTS 864 (1189) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCH-----------HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 999999999999999899999999861346-----------767789888875515999987567655137999999875 Q ss_pred HHHHHH Q ss_conf 344455 Q gi|254781174|r 137 VGMSYA 142 (271) Q Consensus 137 ~a~~~~ 142 (271) +|+|-. T Consensus 865 vGMC~q 870 (1189) T KOG2041 865 VGMCDQ 870 (1189) T ss_pred HCHHHH T ss_conf 040889 No 337 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=25.20 E-value=40 Score=12.99 Aligned_cols=19 Identities=11% Similarity=0.228 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHHHHCCH Q ss_conf 8999999999998520010 Q gi|254781174|r 155 TKLMLQYMSRIVERYTNSP 173 (271) Q Consensus 155 ~~~A~~~f~~~i~~yP~S~ 173 (271) ....+...-.++++||++. T Consensus 127 ~~~~L~~vA~~L~~yp~~~ 145 (219) T PRK10510 127 GANTLTGVAMVLKEYPKTA 145 (219) T ss_pred HHHHHHHHHHHHHHCCCCE T ss_conf 8999999999999889927 No 338 >KOG0546 consensus Probab=24.95 E-value=40 Score=12.97 Aligned_cols=63 Identities=25% Similarity=0.149 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH Q ss_conf 666665443223456899988652899811599999999999974987999999999978569997 Q gi|254781174|r 196 IGRYYLKRGEYVAAIPRFQLVLANYSDAEHAEEAMARLVEAYVALALMDEAREVVSLIQERYPQGY 261 (271) Q Consensus 196 ia~~Y~~~~~y~aA~~~~~~~i~~yp~t~~~~eAl~~l~~~y~~lg~~d~A~~~~~~l~~~yP~s~ 261 (271) ++..=++.+.|..|+-+-..++. +.+....|++|.+.+|..+...++|.+..+.-...=|+.. T Consensus 281 ~~~~~lk~~~~~~a~~~~~~~~~---~~~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~ 343 (372) T KOG0546 281 LAAVGLKVKGRGGARFRTNEALR---DERSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDK 343 (372) T ss_pred HHHHCCCCCCCCCCEECCCCCCC---CCHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHH T ss_conf 67751213677762321640023---6811182888877688764345666778887542285449 No 339 >PRK10449 heat-inducible protein; Provisional Probab=24.67 E-value=41 Score=12.93 Aligned_cols=19 Identities=5% Similarity=0.072 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHCCCCCCC Q ss_conf 9999999999722687533 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~~ 44 (271) +.++++.+++.||+++... T Consensus 5 l~l~~~~llLagC~s~~~~ 23 (140) T PRK10449 5 VALVALSLLMAGCVSSGKI 23 (140) T ss_pred HHHHHHHHHHHHCCCCCCC T ss_conf 9999999999752389988 No 340 >pfam05462 Dicty_CAR Slime mold cyclic AMP receptor. This family consists of cyclic AMP receptor (CAR) proteins from slime molds. CAR proteins are responsible for controlling development in Dictyostelium discoideum. Probab=24.41 E-value=41 Score=12.90 Aligned_cols=96 Identities=13% Similarity=0.137 Sum_probs=54.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCH----HHH Q ss_conf 467766899999999999999999999999999-972268753334456777673689999999999980989----999 Q gi|254781174|r 2 SAVLGRAICIFEAWAYQLYKFALTIFFSIAVCF-LVGWERQSSRDVYLDSVTDVRYQREVYEKAVLFLKEQNF----SKA 76 (271) Q Consensus 2 ~~~~~~~~~~~~~~~~~m~k~~~~i~~~i~~~~-l~~Cs~~~~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y----~~A 76 (271) .|++|-.++|+..|.+|..|.-..-+++.+++. +..|-.+. --...+...+..-+=.++.-...+..-..+ --| T Consensus 20 ~SIIGc~lvii~F~~fklLRn~~tRiI~ylc~s~l~~dl~s~-~l~~~~~~~~~~~~C~l~Ai~ItY~~LS~~LWT~cIA 98 (305) T pfam05462 20 TSIIGCFLVLIGFWRLKLLRNHITKVISCFCLTSLLKDLIST-ILTLTNSAQYNGFPCYLYAIVITYGSLACWLWTLCLA 98 (305) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999879999999999999876179999999899999772442-2676023568984018999999999999999999999 Q ss_pred HHHHHHHHHHCCCCCCHHHHHH Q ss_conf 9999999853047730268998 Q gi|254781174|r 77 YEYFNQCSRDFPFAGVARKSLL 98 (271) Q Consensus 77 ~~~f~~i~~~~P~s~~a~~A~~ 98 (271) ...|+-++++.|+.+.-++-.. T Consensus 99 fsiY~liVKr~~e~E~~EkyYh 120 (305) T pfam05462 99 FSIYNLIVKREPEPEKFEKYYF 120 (305) T ss_pred HHHHHHHHHCCCCHHHHHHHHH T ss_conf 9999999608887899888688 No 341 >KOG2066 consensus Probab=24.41 E-value=41 Score=12.90 Aligned_cols=21 Identities=24% Similarity=0.403 Sum_probs=12.2 Q ss_pred HHHHHHHHCCHHHHHHHHHHH Q ss_conf 999999809899999999999 Q gi|254781174|r 63 KAVLFLKEQNFSKAYEYFNQC 83 (271) Q Consensus 63 ~a~~~~~~~~y~~A~~~f~~i 83 (271) .|..|+.+++|..|...+-.+ T Consensus 511 La~LYl~d~~Y~~Al~~ylkl 531 (846) T KOG2066 511 LAHLYLYDNKYEKALPIYLKL 531 (846) T ss_pred HHHHHHHCCCHHHHHHHHHHC T ss_conf 999998725727799999851 No 342 >pfam11839 DUF3359 Protein of unknown function (DUF3359). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are about 80 amino acids in length. Probab=24.22 E-value=40 Score=12.99 Aligned_cols=25 Identities=16% Similarity=0.049 Sum_probs=14.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9999999999999999722687533 Q gi|254781174|r 20 YKFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 20 ~k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) .+...+..+.+..++-.||++-+.. T Consensus 2 kkllk~s~i~laallaggcaslsee 26 (97) T pfam11839 2 KKLLKFSAIALAALLAGGCASLSEE 26 (97) T ss_pred CHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 3489999999999997431232188 No 343 >PRK06804 flgA flagellar basal body P-ring biosynthesis protein FlgA; Reviewed Probab=24.15 E-value=36 Score=13.20 Aligned_cols=14 Identities=21% Similarity=0.370 Sum_probs=7.4 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 68999999999998 Q gi|254781174|r 56 YQREVYEKAVLFLK 69 (271) Q Consensus 56 ~~~~lY~~a~~~~~ 69 (271) +.++++.++...+. T Consensus 54 ~~k~~~a~~~~~~~ 67 (272) T PRK06804 54 TRKQVYTQALHAAT 67 (272) T ss_pred HHHHHHHHHHHHHH T ss_conf 26789999999999 No 344 >PRK08005 ribulose-phosphate 3-epimerase; Validated Probab=23.47 E-value=31 Score=13.60 Aligned_cols=66 Identities=18% Similarity=0.257 Sum_probs=32.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 234589999999999985200101345655555556666-6-66555666665443223456899988 Q gi|254781174|r 151 DQRATKLMLQYMSRIVERYTNSPYVKGARFYVTVGRNQL-A-AKEVEIGRYYLKRGEYVAAIPRFQLV 216 (271) Q Consensus 151 d~~~~~~A~~~f~~~i~~yP~S~ya~~A~~~l~~~~~~L-a-~~e~~ia~~Y~~~~~y~aA~~~~~~~ 216 (271) .|+....+++..+++.+.+|+..-..|---....+.... | ..-+-.|...++.++|.+++.+++.+ T Consensus 143 GQ~Fi~~~~~KI~~~r~~~~~~~I~vDGGIn~~t~~~~~~aGad~~V~GSaiF~~~d~~~~i~~lr~l 210 (210) T PRK08005 143 GQQFIAAMCEKVSQSREHFPAAECWADGGITLRAARLLAAAGAQHLVIGRALFTTANYDVTLSQFTAL 210 (210) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHCCCCEEEECHHHHCCCCHHHHHHHHHCC T ss_conf 72117889999999996287788899788788999999986999999790653699999999998639 No 345 >PRK00059 prsA peptidylprolyl isomerase; Provisional Probab=23.38 E-value=43 Score=12.79 Aligned_cols=24 Identities=13% Similarity=0.177 Sum_probs=16.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999999999722687 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQ 41 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~ 41 (271) +|.|.++.+++.++++..+||+.. T Consensus 3 ~~~ki~~~~ii~~~~~~~vg~~~~ 26 (336) T PRK00059 3 SAKKIVASLLVGVFIFSAVGCNMV 26 (336) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 079999999999999998302533 No 346 >TIGR01004 PulS_OutS lipoprotein, PulS/OutS family; InterPro: IPR005699 This family comprises lipoproteins from four gamma proteobacterial species: PulS protein of Klebsiella pneumoniae (P20440 from SWISSPROT), the OutS protein of Erwinia chrysanthemi (Q01567 from SWISSPROT) and Pectobacterium chrysanthemi, and the functionally uncharacterised E. coli protein EtpO. PulS and OutS have been shown to interact with and facilitate insertion of secretins into the outer membrane, suggesting a chaperone-like, or piloting function for members of this family.; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport. Probab=23.02 E-value=44 Score=12.74 Aligned_cols=28 Identities=14% Similarity=0.182 Sum_probs=17.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9999999999999999972268753334 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSRDV 46 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~~~ 46 (271) ..|..-.++..+.++.++||.+-..-+. T Consensus 5 ~lKkln~~~~~~~~v~lsgCqtPaPvqn 32 (136) T TIGR01004 5 ILKKLNVVLSLLVVVLLSGCQTPAPVQN 32 (136) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 7899999999999998640478998888 No 347 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=22.87 E-value=44 Score=12.73 Aligned_cols=21 Identities=14% Similarity=0.004 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999972268 Q gi|254781174|r 20 YKFALTIFFSIAVCFLVGWER 40 (271) Q Consensus 20 ~k~~~~i~~~i~~~~l~~Cs~ 40 (271) .|....+.++...++|+||+. T Consensus 7 ~~~~~~l~l~~~~~lLsGC~~ 27 (305) T PRK10525 7 NKSLGWLSLFAGTVLLSGCNS 27 (305) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 778999999999998667876 No 348 >PTZ00334 trans-sialidase; Provisional Probab=22.77 E-value=41 Score=12.88 Aligned_cols=24 Identities=4% Similarity=0.037 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999999999722687 Q gi|254781174|r 18 QLYKFALTIFFSIAVCFLVGWERQ 41 (271) Q Consensus 18 ~m~k~~~~i~~~i~~~~l~~Cs~~ 41 (271) +|.|..+.-..+++++.+..|++. T Consensus 38 NMSRrvF~SAVLLLlvVmMCCgsg 61 (780) T PTZ00334 38 NMSRRVFTSAVLLLLVVMMCCGSG 61 (780) T ss_pred CCCHHHHHHHHHHHHHHHEECCCC T ss_conf 851344677899999984208982 No 349 >cd07668 BAR_SNX9 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 9. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. SNX9, also known as SH3PX1, is a cytosolic protein that interacts with proteins associated with clathrin-coated pits such as Cdc-42-associated tyrosine kinase 2 (ACK2). It binds class I polyproline sequences found in dynamin 1/2 and the WASP/N-WASP actin regulators. SNX9 is localized to plasma membrane endocytic sites and acts primarily in clathrin-mediated endocytosi Probab=22.76 E-value=44 Score=12.71 Aligned_cols=39 Identities=13% Similarity=-0.104 Sum_probs=20.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHH-HHHHHHHHHHHH Q ss_conf 111234589999999999985200101-345655555556 Q gi|254781174|r 148 VPYDQRATKLMLQYMSRIVERYTNSPY-VKGARFYVTVGR 186 (271) Q Consensus 148 ~~~d~~~~~~A~~~f~~~i~~yP~S~y-a~~A~~~l~~~~ 186 (271) +..|..+..+.+..|+-++..||+--- .+.|..+...++ T Consensus 97 pk~D~~pl~d~L~~Y~GlL~~fPDIi~v~K~A~~KvKE~e 136 (210) T cd07668 97 PKKDLHFLMETNHEYKGFLGCFPDIIGAHKGAIEKVKESD 136 (210) T ss_pred CCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 5323688998899985143258408999899999999999 No 350 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=22.74 E-value=44 Score=12.71 Aligned_cols=11 Identities=27% Similarity=0.422 Sum_probs=7.6 Q ss_pred HHHHHHHHCCC Q ss_conf 99999972268 Q gi|254781174|r 30 IAVCFLVGWER 40 (271) Q Consensus 30 i~~~~l~~Cs~ 40 (271) +++++++||++ T Consensus 10 ~l~~~LsgC~~ 20 (177) T pfam07273 10 LLVLLLSGCAS 20 (177) T ss_pred HHHHHHHCCCC T ss_conf 99999715575 No 351 >PRK10759 hypothetical protein; Provisional Probab=22.50 E-value=45 Score=12.68 Aligned_cols=18 Identities=11% Similarity=0.178 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHCCCCCC Q ss_conf 999999999972268753 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSS 43 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~ 43 (271) ++++.++++++||+-..+ T Consensus 3 ~~~~~~~l~~sgc~h~an 20 (106) T PRK10759 3 ALIVPLLLLLSGCSHLAN 20 (106) T ss_pred HHHHHHHHHHCCCHHHCC T ss_conf 399999999844234326 No 352 >KOG4563 consensus Probab=22.49 E-value=45 Score=12.68 Aligned_cols=65 Identities=9% Similarity=0.020 Sum_probs=49.3 Q ss_pred CCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH----C-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6736899999999999809899999999999853----0-477302689987788776545556799999 Q gi|254781174|r 53 DVRYQREVYEKAVLFLKEQNFSKAYEYFNQCSRD----F-PFAGVARKSLLMSAFVQYSAGKYQQAASLG 117 (271) Q Consensus 53 ~~~~~~~lY~~a~~~~~~~~y~~A~~~f~~i~~~----~-P~s~~a~~A~~~la~~~y~~~~y~~A~~~~ 117 (271) ...+.+.+-..|..++.+++++.|...|...... | -.+.....++|..|.+++..++....+... T Consensus 37 ~~~~~e~lv~~G~~~~~~~d~~~Avda~s~A~~l~~ei~Ge~~~e~~eal~~YGkslLela~~e~~VL~n 106 (400) T KOG4563 37 KEKTLEELVQAGRRALCNNDIDKAVDALSEATELSDEIYGEKHLETFEALFLYGKSLLELAKEESQVLGN 106 (400) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 7899999998626888605477788888999998888751557777899999879999988888776415 No 353 >KOG1550 consensus Probab=22.45 E-value=45 Score=12.68 Aligned_cols=98 Identities=14% Similarity=0.218 Sum_probs=52.5 Q ss_pred HHHHHHHHHHHHHHC-----CHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHCCCCC Q ss_conf 899999999999809-----8999999999998530477302689987788776545---55679999988754011211 Q gi|254781174|r 57 QREVYEKAVLFLKEQ-----NFSKAYEYFNQCSRDFPFAGVARKSLLMSAFVQYSAG---KYQQAASLGEEYITQYPESK 128 (271) Q Consensus 57 ~~~lY~~a~~~~~~~-----~y~~A~~~f~~i~~~~P~s~~a~~A~~~la~~~y~~~---~y~~A~~~~~~fi~~~P~s~ 128 (271) +...|..|..+++.. ++..|...|.+.... -.+++++.+|.++..-. ++..|...|...... .| T Consensus 288 ~~a~~~lg~~Y~~g~~~~~~d~~~A~~~~~~aA~~-----g~~~a~~~lg~~~~~g~~~~d~~~A~~yy~~Aa~~--G~- 359 (552) T KOG1550 288 PPAQYGLGRLYLQGLGVEKIDYEKALKLYTKAAEL-----GNPDAQYLLGVLYETGTKERDYRRAFEYYSLAAKA--GH- 359 (552) T ss_pred CHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHC-----CCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHC--CC- T ss_conf 42247999999818987512499999999999855-----99048999889876276321388999999999975--99- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0135554434445555443111234589999999999985 Q gi|254781174|r 129 NVDYVYYLVGMSYAQMIRDVPYDQRATKLMLQYMSRIVER 168 (271) Q Consensus 129 ~~~~A~y~~a~~~~~~~~~~~~d~~~~~~A~~~f~~~i~~ 168 (271) ..|.|.+|.||..- ..+. .+...|...+.+...+ T Consensus 360 --~~A~~~la~~y~~G-~gv~---r~~~~A~~~~k~aA~~ 393 (552) T KOG1550 360 --ILAIYRLALCYELG-LGVE---RNLELAFAYYKKAAEK 393 (552) T ss_pred --HHHHHHHHHHHHHC-CCCC---CCHHHHHHHHHHHHHC T ss_conf --89999999999747-7868---8778999999999982 No 354 >COG5567 Predicted small periplasmic lipoprotein [Cell motility and secretion] Probab=22.30 E-value=45 Score=12.66 Aligned_cols=22 Identities=9% Similarity=0.037 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9999999999999722687533 Q gi|254781174|r 23 ALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 23 ~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) +.....+..++.+.||+-+..- T Consensus 5 ~~s~~ala~l~sLA~CG~KGPL 26 (58) T COG5567 5 FKSLLALATLFSLAGCGLKGPL 26 (58) T ss_pred HHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999999999999852667876 No 355 >pfam03978 Borrelia_REV Borrelia burgdorferi REV protein. This family consists of several REV proteins from Borrelia burgdorferi (Lyme disease spirochete). The function of REV is unknown although it known that gene is induced during the ingesting of host blood suggesting a role in the metabolic activation of borreliae to adapt to physiological stimuli. Probab=22.19 E-value=45 Score=12.65 Aligned_cols=18 Identities=11% Similarity=0.371 Sum_probs=9.6 Q ss_pred HHHHHHHHHHHHHHCCCC Q ss_conf 999999999999722687 Q gi|254781174|r 24 LTIFFSIAVCFLVGWERQ 41 (271) Q Consensus 24 ~~i~~~i~~~~l~~Cs~~ 41 (271) ++-+++++++|+.+|-.. T Consensus 6 I~kLfF~smlfvmaCk~y 23 (160) T pfam03978 6 IVKLFFVSMLFVMACKAY 23 (160) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 899999999999999999 No 356 >PRK11162 mltA murein transglycosylase A; Provisional Probab=21.79 E-value=46 Score=12.60 Aligned_cols=19 Identities=16% Similarity=0.413 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHCCCCCCC Q ss_conf 9999999999722687533 Q gi|254781174|r 26 IFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 26 i~~~i~~~~l~~Cs~~~~~ 44 (271) +++.+++.++.+|++.... T Consensus 6 ~~~~~~~~~l~~c~~~p~~ 24 (362) T PRK11162 6 LLMGIVVALLAACSSKPTD 24 (362) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 9999999999760479876 No 357 >PRK12697 flgH flagellar basal body L-ring protein; Reviewed Probab=21.30 E-value=47 Score=12.54 Aligned_cols=23 Identities=13% Similarity=0.063 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999999972268753 Q gi|254781174|r 21 KFALTIFFSIAVCFLVGWERQSS 43 (271) Q Consensus 21 k~~~~i~~~i~~~~l~~Cs~~~~ 43 (271) |+.....+.+++++|.||+.... T Consensus 7 ~~~~~~~~al~~l~LsGCa~~~~ 29 (227) T PRK12697 7 RRPGAAACALAALALAGCAQIPR 29 (227) T ss_pred CCCCHHHHHHHHHHHHCCCCCCC T ss_conf 14038999999999726567899 No 358 >pfam01441 Lipoprotein_6 Lipoprotein. Members of this family are lipoproteins that are probably involved in evasion of the host immune system by pathogens. Probab=21.28 E-value=47 Score=12.54 Aligned_cols=25 Identities=8% Similarity=0.157 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999999999999722687533 Q gi|254781174|r 19 LYKFALTIFFSIAVCFLVGWERQSSR 44 (271) Q Consensus 19 m~k~~~~i~~~i~~~~l~~Cs~~~~~ 44 (271) |.|..+..+++.+++ +.||....++ T Consensus 1 MkkntlSAIlMTLfl-fiSCNNsG~~ 25 (209) T pfam01441 1 MKKNTLSAILMTLFL-FISCNNSGPE 25 (209) T ss_pred CCHHHHHHHHHHHHH-HHCCCCCCCC T ss_conf 961379999999999-8314589998 No 359 >COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair] Probab=20.71 E-value=49 Score=12.46 Aligned_cols=29 Identities=10% Similarity=0.244 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 88776545556799999887540112110 Q gi|254781174|r 101 AFVQYSAGKYQQAASLGEEYITQYPESKN 129 (271) Q Consensus 101 a~~~y~~~~y~~A~~~~~~fi~~~P~s~~ 129 (271) .+...++-.-+..+..|.+|++++|+-+. T Consensus 40 SEiMLQQT~v~~Vi~yy~~fl~rfPti~~ 68 (342) T COG1194 40 SEIMLQQTQVATVIPYYERFLERFPTIKA 68 (342) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCHHH T ss_conf 88876006074554569999986899899 No 360 >PRK10797 glutamate and aspartate transporter subunit; Provisional Probab=20.70 E-value=49 Score=12.46 Aligned_cols=34 Identities=0% Similarity=-0.148 Sum_probs=19.2 Q ss_pred HHHHHCCCHHH-HHHHHH------HHHHHCCCCHHHHHHHH Q ss_conf 99997498799-999999------99785699978999998 Q gi|254781174|r 235 EAYVALALMDE-AREVVS------LIQERYPQGYWARYVET 268 (271) Q Consensus 235 ~~y~~lg~~d~-A~~~~~------~l~~~yP~s~~~~~a~~ 268 (271) ....+-|..++ ..++|. .+..|+|-|...+++.. T Consensus 254 ~~l~~dGt~~~I~~KWF~~p~P~~~~~l~~~~~~~~~~~~~ 294 (302) T PRK10797 254 AQAQTSGEAEKWFDKWFKNPIPPKNLNMNFELSDEMKALFK 294 (302) T ss_pred HHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 99997782999999875999999984548988799999985 Done!