Query gi|254780750|ref|YP_003065163.1| DNA mismatch repair protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 920 No_of_seqs 324 out of 3234 Neff 8.0 Searched_HMMs 39220 Date Tue May 31 15:11:03 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780750.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01070 mutS1 DNA mismatch r 100.0 0 0 2046.2 58.5 850 25-899 1-863 (863) 2 PRK05399 DNA mismatch repair p 100.0 0 0 1947.7 83.2 842 21-902 4-848 (848) 3 COG0249 MutS Mismatch repair A 100.0 0 0 1745.8 74.9 837 21-899 2-843 (843) 4 KOG0218 consensus 100.0 0 0 1317.2 48.2 816 23-859 160-1047(1070) 5 KOG0217 consensus 100.0 0 0 1275.5 37.8 815 7-844 234-1094(1125) 6 KOG0219 consensus 100.0 0 0 970.3 46.2 787 26-845 13-840 (902) 7 KOG0220 consensus 100.0 0 0 817.7 40.9 772 106-905 56-865 (867) 8 PRK00409 recombination and DNA 100.0 0 0 726.1 46.1 502 292-836 2-509 (780) 9 KOG0221 consensus 100.0 0 0 729.4 33.6 710 151-884 49-828 (849) 10 pfam00488 MutS_V MutS domain V 100.0 0 0 587.5 23.8 233 601-841 1-233 (234) 11 cd03285 ABC_MSH2_euk MutS2 hom 100.0 0 0 563.0 22.6 220 613-840 1-222 (222) 12 cd03284 ABC_MutS1 MutS1 homolo 100.0 0 0 558.6 22.5 216 613-836 1-216 (216) 13 TIGR01069 mutS2 MutS2 family p 100.0 0 0 529.1 34.7 495 314-836 15-544 (834) 14 cd03287 ABC_MSH3_euk MutS3 hom 100.0 0 0 537.2 22.2 213 612-832 1-222 (222) 15 cd03286 ABC_MSH6_euk MutS6 hom 100.0 0 0 528.6 21.3 211 614-832 2-218 (218) 16 cd03282 ABC_MSH4_euk MutS4 hom 100.0 0 0 507.5 22.9 202 613-824 1-204 (204) 17 cd03281 ABC_MSH5_euk MutS5 hom 100.0 0 0 500.9 21.2 203 613-824 1-213 (213) 18 cd03280 ABC_MutS2 MutS2 homolo 100.0 0 0 479.5 21.2 199 613-824 1-200 (200) 19 cd03243 ABC_MutS_homologs The 100.0 0 0 479.0 21.1 202 613-824 1-202 (202) 20 cd03283 ABC_MutS-like MutS-lik 100.0 0 0 465.8 22.6 197 613-824 1-199 (199) 21 smart00534 MUTSac ATPase domai 100.0 0 0 466.9 20.2 185 652-836 1-185 (185) 22 COG1193 Mismatch repair ATPase 100.0 0 0 432.4 24.4 480 316-836 15-499 (753) 23 pfam05192 MutS_III MutS domain 100.0 0 0 423.3 26.4 305 287-598 1-306 (306) 24 smart00533 MUTSd DNA-binding d 100.0 0 0 410.1 26.7 307 311-624 1-307 (308) 25 pfam01624 MutS_I MutS domain I 100.0 5E-38 1.3E-42 302.5 12.6 113 26-143 1-113 (113) 26 cd03227 ABC_Class2 ABC-type Cl 100.0 3.2E-34 8.2E-39 274.1 16.5 156 614-791 2-159 (162) 27 pfam05188 MutS_II MutS domain 99.8 1E-17 2.5E-22 150.9 13.7 129 151-279 1-133 (133) 28 pfam05190 MutS_IV MutS family 99.6 2.8E-16 7.2E-21 140.0 3.6 91 456-552 1-91 (92) 29 KOG0219 consensus 99.5 1.5E-15 3.8E-20 134.6 -2.5 222 614-850 626-848 (902) 30 cd00267 ABC_ATPase ABC (ATP-bi 98.6 5.9E-07 1.5E-11 70.4 10.4 124 635-779 14-146 (157) 31 cd03238 ABC_UvrA The excision 98.5 2.8E-06 7.2E-11 65.3 10.6 140 635-780 10-156 (176) 32 cd03230 ABC_DR_subfamily_A Thi 98.3 1.6E-05 4.2E-10 59.6 10.7 127 635-780 15-163 (173) 33 PRK13538 cytochrome c biogenes 98.3 3.6E-05 9.2E-10 57.1 12.4 52 727-781 145-196 (204) 34 cd03217 ABC_FeS_Assembly ABC-t 98.2 4.4E-05 1.1E-09 56.4 12.2 137 634-778 14-169 (200) 35 cd03233 ABC_PDR_domain1 The pl 98.2 2.8E-05 7.1E-10 57.9 11.2 130 635-773 22-179 (202) 36 cd03216 ABC_Carb_Monos_I This 98.2 4E-05 1E-09 56.7 11.8 130 634-780 14-150 (163) 37 PRK13543 cytochrome c biogenes 98.2 0.00023 6E-09 51.0 15.7 153 603-776 3-200 (214) 38 PRK13541 cytochrome c biogenes 98.2 7.9E-05 2E-09 54.5 13.2 128 636-779 16-189 (195) 39 cd03221 ABCF_EF-3 ABCF_EF-3 E 98.2 3.6E-05 9.1E-10 57.1 11.3 118 634-777 14-131 (144) 40 PRK11144 modC molybdate transp 98.2 8.3E-06 2.1E-10 61.8 8.0 143 633-781 12-198 (352) 41 cd03232 ABC_PDR_domain2 The pl 98.2 0.00013 3.2E-09 53.0 13.1 136 634-780 21-177 (192) 42 cd03255 ABC_MJ0796_Lo1CDE_FtsE 98.1 3.8E-05 9.7E-10 56.9 10.3 135 635-780 19-208 (218) 43 cd03218 ABC_YhbG The ABC trans 98.1 4.1E-05 1E-09 56.7 10.3 131 635-780 15-201 (232) 44 cd03265 ABC_DrrA DrrA is the A 98.1 3.3E-05 8.3E-10 57.4 9.7 134 634-780 14-200 (220) 45 cd03271 ABC_UvrA_II The excisi 98.1 1E-04 2.5E-09 53.8 12.0 54 724-779 185-238 (261) 46 cd03240 ABC_Rad50 The catalyti 98.1 7.7E-05 2E-09 54.6 11.4 51 727-780 137-190 (204) 47 PRK11247 ssuB aliphatic sulfon 98.1 4.6E-05 1.2E-09 56.3 10.2 183 602-808 3-221 (257) 48 cd03267 ABC_NatA_like Similar 98.1 4.3E-05 1.1E-09 56.5 9.8 140 631-780 32-222 (236) 49 PRK11614 livF leucine/isoleuci 98.1 6.3E-05 1.6E-09 55.2 10.6 133 635-780 20-205 (237) 50 cd03213 ABCG_EPDR ABCG transpo 98.1 0.00011 2.7E-09 53.6 11.5 132 635-780 24-180 (194) 51 cd03299 ABC_ModC_like Archeal 98.1 9.9E-05 2.5E-09 53.8 11.1 133 634-780 13-198 (235) 52 PRK13547 hmuV hemin importer A 98.1 0.00015 3.8E-09 52.5 12.0 139 634-777 15-220 (273) 53 COG0396 sufC Cysteine desulfur 98.0 2.2E-05 5.7E-10 58.6 7.5 133 635-780 19-211 (251) 54 cd03222 ABC_RNaseL_inhibitor T 98.0 0.00012 3.2E-09 53.1 11.2 111 648-777 23-136 (177) 55 PRK10247 putative ABC transpor 98.0 9.3E-05 2.4E-09 54.0 10.5 141 633-778 20-203 (225) 56 PRK13544 consensus 98.0 6E-05 1.5E-09 55.4 9.2 133 634-775 15-188 (208) 57 cd03215 ABC_Carb_Monos_II This 98.0 7.1E-05 1.8E-09 54.9 9.6 135 634-780 14-172 (182) 58 cd03214 ABC_Iron-Siderophores_ 98.0 0.0001 2.6E-09 53.6 10.4 127 635-777 14-162 (180) 59 cd03223 ABCD_peroxisomal_ALDP 98.0 0.00025 6.3E-09 50.8 12.0 120 635-780 16-155 (166) 60 cd03229 ABC_Class3 This class 98.0 0.00018 4.5E-09 51.9 11.2 130 634-780 14-169 (178) 61 PRK13540 cytochrome c biogenes 98.0 0.0002 5.1E-09 51.5 11.4 136 635-779 16-193 (200) 62 cd03270 ABC_UvrA_I The excisio 98.0 0.00051 1.3E-08 48.5 13.4 55 724-780 152-206 (226) 63 TIGR03608 L_ocin_972_ABC putat 98.0 0.00017 4.4E-09 52.0 11.0 130 634-780 12-201 (206) 64 PRK13539 cytochrome c biogenes 98.0 3.8E-05 9.7E-10 56.9 7.6 46 727-774 142-187 (206) 65 cd03250 ABCC_MRP_domain1 Domai 98.0 0.00038 9.7E-09 49.4 12.7 35 633-671 18-52 (204) 66 cd03269 ABC_putative_ATPase Th 98.0 0.00027 6.9E-09 50.5 11.8 133 634-775 14-190 (210) 67 PRK10575 iron-hydroxamate tran 98.0 0.00022 5.7E-09 51.1 11.4 136 634-776 25-211 (265) 68 cd03220 ABC_KpsT_Wzt ABC_KpsT_ 98.0 0.00025 6.3E-09 50.8 11.5 132 634-780 36-210 (224) 69 PRK10771 thiQ thiamine transpo 98.0 0.00095 2.4E-08 46.5 14.4 133 635-780 14-198 (233) 70 COG0488 Uup ATPase components 97.9 0.0015 3.8E-08 45.0 15.3 134 634-777 336-500 (530) 71 TIGR01978 sufC FeS assembly AT 97.9 4.3E-05 1.1E-09 56.5 7.4 132 635-779 15-215 (248) 72 cd03268 ABC_BcrA_bacitracin_re 97.9 0.00043 1.1E-08 49.0 12.3 141 634-780 14-194 (208) 73 cd03231 ABC_CcmA_heme_exporter 97.9 0.00041 1E-08 49.2 12.2 131 635-780 15-193 (201) 74 PRK10253 iron-enterobactin tra 97.9 0.00016 4.1E-09 52.2 10.1 49 727-776 159-207 (265) 75 cd03263 ABC_subfamily_A The AB 97.9 0.00013 3.3E-09 53.0 9.5 130 635-781 17-201 (220) 76 PRK09544 znuC high-affinity zi 97.9 0.00014 3.6E-09 52.7 9.6 136 634-777 18-185 (251) 77 PRK10584 putative ABC transpor 97.9 0.00028 7.2E-09 50.4 11.2 131 635-779 25-213 (228) 78 cd03226 ABC_cobalt_CbiO_domain 97.9 0.00017 4.3E-09 52.0 10.0 134 634-780 14-194 (205) 79 cd03235 ABC_Metallic_Cations A 97.9 0.00014 3.5E-09 52.7 9.5 48 727-776 148-195 (213) 80 KOG0062 consensus 97.9 1.7E-05 4.3E-10 59.5 4.6 43 731-778 502-544 (582) 81 PRK13647 cbiO cobalt transport 97.9 0.00013 3.4E-09 52.8 9.2 138 635-780 20-206 (273) 82 cd03224 ABC_TM1139_LivF_branch 97.9 0.00012 3.1E-09 53.1 8.9 133 635-780 15-200 (222) 83 PRK09700 D-allose transporter 97.9 0.00023 6E-09 51.0 10.3 52 728-781 426-478 (510) 84 PRK13546 teichoic acids export 97.9 0.00011 2.7E-09 53.5 8.5 163 635-841 39-243 (264) 85 PRK03695 vitamin B12-transport 97.9 0.00011 2.9E-09 53.4 8.6 136 635-776 12-193 (245) 86 TIGR03522 GldA_ABC_ATP gliding 97.9 0.00029 7.3E-09 50.3 10.6 138 634-781 16-201 (301) 87 PRK13648 cbiO cobalt transport 97.9 0.00019 4.9E-09 51.6 9.7 131 631-777 20-207 (269) 88 PRK10938 putative molybdenum t 97.9 0.00026 6.7E-09 50.6 10.4 50 727-777 417-466 (490) 89 COG1131 CcmA ABC-type multidru 97.9 0.00016 4.1E-09 52.2 9.1 137 633-777 18-201 (293) 90 PRK11000 maltose/maltodextrin 97.9 0.00059 1.5E-08 48.0 12.0 140 634-780 17-202 (369) 91 PRK10895 putative ABC transpor 97.9 0.00072 1.8E-08 47.3 12.3 139 634-780 17-205 (241) 92 PRK13649 cbiO cobalt transport 97.8 0.00063 1.6E-08 47.8 11.8 171 635-834 22-255 (280) 93 cd03292 ABC_FtsE_transporter F 97.8 0.00038 9.6E-09 49.4 10.7 131 635-780 16-204 (214) 94 cd03228 ABCC_MRP_Like The MRP 97.8 0.00021 5.4E-09 51.3 9.3 123 634-780 16-162 (171) 95 cd03237 ABC_RNaseL_inhibitor_d 97.8 0.00017 4.3E-09 52.0 8.9 138 635-777 9-180 (246) 96 COG1121 ZnuC ABC-type Mn/Zn tr 97.8 4.5E-05 1.1E-09 56.3 5.8 133 635-774 19-200 (254) 97 TIGR03375 type_I_sec_LssB type 97.8 0.00052 1.3E-08 48.4 11.1 131 634-781 479-668 (694) 98 PRK13548 hmuV hemin importer A 97.8 0.00024 6.1E-09 50.9 9.3 136 635-776 17-203 (257) 99 cd03239 ABC_SMC_head The struc 97.8 0.00019 4.8E-09 51.7 8.8 129 651-787 23-172 (178) 100 PRK13640 cbiO cobalt transport 97.8 0.00016 4.2E-09 52.1 8.5 51 727-778 160-210 (283) 101 PRK10851 sulfate/thiosulfate t 97.8 0.00013 3.4E-09 52.8 7.9 141 635-781 17-206 (352) 102 PRK13632 cbiO cobalt transport 97.8 0.00022 5.6E-09 51.2 8.9 174 634-828 24-246 (273) 103 PRK11300 livG leucine/isoleuci 97.8 0.00034 8.6E-09 49.8 9.8 52 728-780 170-222 (255) 104 CHL00131 ycf16 sulfate ABC tra 97.8 0.0007 1.8E-08 47.4 11.4 135 634-777 20-214 (252) 105 PRK11231 fecE iron-dicitrate t 97.8 0.00077 2E-08 47.1 11.6 125 634-776 16-201 (255) 106 PRK10535 macrolide transporter 97.8 0.00021 5.4E-09 51.3 8.7 12 881-892 635-646 (648) 107 cd03234 ABCG_White The White s 97.8 0.00052 1.3E-08 48.4 10.7 33 635-671 22-54 (226) 108 PRK13537 lipooligosaccharide t 97.8 0.00067 1.7E-08 47.6 11.2 139 634-781 19-205 (304) 109 cd03225 ABC_cobalt_CbiO_domain 97.8 0.00013 3.3E-09 52.9 7.6 137 634-780 15-202 (211) 110 COG1119 ModF ABC-type molybden 97.8 0.00081 2.1E-08 47.0 11.4 21 651-671 58-78 (257) 111 PRK09493 glnQ glutamine ABC tr 97.8 0.00065 1.7E-08 47.7 10.9 134 635-777 16-200 (240) 112 PRK13542 consensus 97.8 0.00024 6.2E-09 50.8 8.8 131 635-772 33-206 (224) 113 cd03300 ABC_PotA_N PotA is an 97.8 0.00039 9.8E-09 49.4 9.7 133 634-780 14-199 (232) 114 cd03246 ABCC_Protease_Secretio 97.8 0.00042 1.1E-08 49.1 9.9 128 634-780 16-163 (173) 115 cd03245 ABCC_bacteriocin_expor 97.8 0.00028 7.1E-09 50.4 9.0 136 633-781 17-207 (220) 116 PRK09984 phosphonate/organopho 97.8 0.00043 1.1E-08 49.0 9.9 48 728-776 169-216 (262) 117 PRK09452 potA putrescine/sperm 97.8 0.00032 8.3E-09 49.9 9.2 142 634-781 31-217 (378) 118 PRK11629 lolD lipoprotein tran 97.7 0.0013 3.4E-08 45.4 12.3 130 635-779 24-212 (233) 119 cd03256 ABC_PhnC_transporter A 97.7 0.00021 5.3E-09 51.4 8.2 34 635-672 16-49 (241) 120 PRK11831 putative ABC transpor 97.7 0.0011 2.9E-08 45.9 11.9 138 635-780 23-213 (269) 121 PRK10619 histidine/lysine/argi 97.7 0.00058 1.5E-08 48.0 10.4 49 727-777 168-216 (257) 122 TIGR03410 urea_trans_UrtE urea 97.7 0.00037 9.4E-09 49.5 9.4 133 635-780 15-200 (230) 123 PRK11248 tauB taurine transpor 97.7 0.0011 2.8E-08 45.9 11.8 132 635-780 16-197 (255) 124 cd03296 ABC_CysA_sulfate_impor 97.7 0.0004 1E-08 49.2 9.5 132 635-780 17-205 (239) 125 cd03301 ABC_MalK_N The N-termi 97.7 0.0004 1E-08 49.2 9.5 134 634-781 14-200 (213) 126 cd03236 ABC_RNaseL_inhibitor_d 97.7 0.00062 1.6E-08 47.8 10.4 48 727-776 155-202 (255) 127 PRK11650 ugpC glycerol-3-phosp 97.7 0.00035 8.8E-09 49.7 9.1 131 635-781 19-204 (358) 128 cd03264 ABC_drug_resistance_li 97.7 0.001 2.6E-08 46.2 11.3 47 727-776 146-192 (211) 129 PRK11819 putative ABC transpor 97.7 0.00022 5.6E-09 51.2 7.7 134 634-777 338-506 (556) 130 PRK11432 fbpC ferric transport 97.7 0.00045 1.1E-08 48.9 9.3 153 607-781 2-206 (351) 131 cd03293 ABC_NrtD_SsuB_transpor 97.7 0.00014 3.6E-09 52.6 6.7 130 635-780 19-200 (220) 132 PRK13637 cbiO cobalt transport 97.7 0.0015 3.9E-08 44.9 11.9 139 635-780 22-213 (287) 133 PRK11288 araG L-arabinose tran 97.7 0.00068 1.7E-08 47.5 10.0 53 727-781 412-465 (501) 134 PRK13650 cbiO cobalt transport 97.7 0.0004 1E-08 49.2 8.9 139 633-777 17-202 (276) 135 PRK13635 cbiO cobalt transport 97.7 0.00015 3.9E-09 52.4 6.5 128 634-777 21-205 (279) 136 TIGR03269 met_CoM_red_A2 methy 97.7 0.00047 1.2E-08 48.7 9.0 32 636-671 300-331 (520) 137 cd03247 ABCC_cytochrome_bd The 97.7 0.0011 2.8E-08 45.9 10.7 126 634-777 16-161 (178) 138 COG3910 Predicted ATPase [Gene 97.6 0.00032 8E-09 50.0 7.8 127 651-790 38-201 (233) 139 cd03266 ABC_NatA_sodium_export 97.6 0.0015 3.7E-08 45.0 11.1 131 635-780 20-204 (218) 140 PRK09536 btuD corrinoid ABC tr 97.6 0.00039 9.8E-09 49.4 8.1 125 635-777 17-202 (409) 141 PRK13536 nodulation factor exp 97.6 0.00099 2.5E-08 46.3 10.2 47 728-776 155-201 (306) 142 cd03219 ABC_Mj1267_LivG_branch 97.6 0.0013 3.3E-08 45.4 10.8 137 635-780 15-211 (236) 143 PRK13641 cbiO cobalt transport 97.6 0.0015 3.8E-08 45.0 11.0 52 727-780 161-213 (286) 144 PRK11607 potG putrescine trans 97.6 0.0031 8E-08 42.6 12.7 141 635-781 34-219 (377) 145 PRK13631 cbiO cobalt transport 97.6 0.00076 1.9E-08 47.1 9.4 52 727-780 192-244 (320) 146 cd03278 ABC_SMC_barmotin Barmo 97.6 0.00083 2.1E-08 46.9 9.6 55 729-791 135-191 (197) 147 cd03259 ABC_Carb_Solutes_like 97.6 0.00047 1.2E-08 48.7 8.3 139 635-780 15-199 (213) 148 PRK11147 ABC transporter ATPas 97.6 0.00048 1.2E-08 48.7 8.2 134 634-777 333-501 (632) 149 COG2274 SunT ABC-type bacterio 97.6 0.0054 1.4E-07 40.8 13.6 128 633-777 486-672 (709) 150 PRK13651 cobalt transporter AT 97.6 0.0013 3.2E-08 45.5 10.3 48 727-776 177-224 (304) 151 cd03298 ABC_ThiQ_thiamine_tran 97.6 0.0034 8.7E-08 42.3 12.4 129 638-780 16-197 (211) 152 PRK10982 galactose/methyl gala 97.6 0.00075 1.9E-08 47.2 9.0 52 728-781 408-460 (491) 153 PRK10762 D-ribose transporter 97.6 0.0003 7.7E-09 50.2 7.0 53 727-781 411-464 (501) 154 PRK13636 cbiO cobalt transport 97.6 0.0025 6.4E-08 43.3 11.7 50 727-777 157-206 (285) 155 PRK11147 ABC transporter ATPas 97.6 0.0041 1.1E-07 41.7 12.8 13 308-320 39-51 (632) 156 cd03257 ABC_NikE_OppD_transpor 97.6 0.0019 4.8E-08 44.2 11.0 53 727-780 161-214 (228) 157 PRK11819 putative ABC transpor 97.6 0.00076 1.9E-08 47.2 8.9 16 697-715 353-368 (556) 158 PRK13634 cbiO cobalt transport 97.6 0.0022 5.6E-08 43.7 11.2 53 727-780 148-201 (276) 159 COG4586 ABC-type uncharacteriz 97.6 0.00068 1.7E-08 47.5 8.6 147 637-790 41-235 (325) 160 COG1120 FepC ABC-type cobalami 97.6 0.00086 2.2E-08 46.7 9.1 129 632-777 14-203 (258) 161 PRK13652 cbiO cobalt transport 97.6 0.0016 4.1E-08 44.7 10.4 52 728-780 154-206 (277) 162 PRK13638 cbiO cobalt transport 97.6 0.0013 3.2E-08 45.5 9.8 87 727-829 152-240 (271) 163 PRK10908 cell division protein 97.5 0.0024 6.1E-08 43.4 11.2 134 635-777 17-201 (222) 164 PRK13549 xylose transporter AT 97.5 0.00056 1.4E-08 48.1 7.9 53 727-781 421-474 (513) 165 PRK13633 cobalt transporter AT 97.5 0.00063 1.6E-08 47.8 8.1 130 634-777 25-210 (281) 166 PRK10938 putative molybdenum t 97.5 0.0034 8.6E-08 42.3 11.7 33 635-671 275-307 (490) 167 PRK11124 artP arginine transpo 97.5 0.0031 8E-08 42.6 11.5 52 727-780 157-209 (242) 168 PRK10636 putative ABC transpor 97.5 0.00053 1.3E-08 48.3 7.5 134 634-777 326-491 (638) 169 PRK09580 sufC cysteine desulfu 97.5 0.002 5.2E-08 44.0 10.5 35 634-672 15-49 (248) 170 PRK13643 cbiO cobalt transport 97.5 0.00073 1.8E-08 47.3 8.1 135 635-777 21-208 (288) 171 PRK10636 putative ABC transpor 97.5 0.0033 8.3E-08 42.4 11.4 17 308-324 37-53 (638) 172 PRK11701 phnK phosphonates tra 97.5 0.0013 3.3E-08 45.5 9.3 53 727-780 167-220 (258) 173 PRK13642 cbiO cobalt transport 97.5 0.0012 3E-08 45.8 9.0 129 635-777 22-205 (277) 174 COG4555 NatA ABC-type Na+ tran 97.5 0.00059 1.5E-08 48.0 7.5 129 648-780 26-201 (245) 175 PRK13646 cbiO cobalt transport 97.5 0.00059 1.5E-08 48.0 7.4 53 727-780 161-214 (286) 176 PRK13644 cbiO cobalt transport 97.5 0.0011 2.8E-08 45.9 8.7 133 635-775 17-198 (274) 177 cd03261 ABC_Org_Solvent_Resist 97.5 0.0021 5.4E-08 43.8 10.0 133 635-781 15-206 (235) 178 TIGR02673 FtsE cell division A 97.5 0.0017 4.4E-08 44.5 9.6 132 637-779 19-203 (215) 179 PRK11264 putative amino-acid A 97.5 0.0022 5.5E-08 43.8 10.0 35 634-672 15-49 (248) 180 TIGR03265 PhnT2 putative 2-ami 97.5 0.00017 4.3E-09 52.0 4.4 142 635-781 19-204 (353) 181 cd03251 ABCC_MsbA MsbA is an e 97.5 0.0032 8.1E-08 42.5 10.9 41 633-685 15-55 (234) 182 PRK10744 phosphate transporter 97.4 0.003 7.7E-08 42.7 10.5 36 634-673 24-59 (257) 183 PRK13639 cbiO cobalt transport 97.4 0.00071 1.8E-08 47.4 7.2 48 728-777 154-201 (275) 184 TIGR03258 PhnT 2-aminoethylpho 97.4 0.0017 4.4E-08 44.5 9.1 137 635-776 20-202 (362) 185 cd03248 ABCC_TAP TAP, the Tran 97.4 0.0032 8.1E-08 42.5 10.4 131 633-780 27-216 (226) 186 cd03289 ABCC_CFTR2 The CFTR su 97.4 0.0042 1.1E-07 41.7 10.9 34 634-671 18-51 (275) 187 COG1118 CysA ABC-type sulfate/ 97.4 0.0022 5.7E-08 43.7 9.5 138 637-780 19-206 (345) 188 cd03260 ABC_PstB_phosphate_tra 97.4 0.0064 1.6E-07 40.3 11.7 37 634-674 14-50 (227) 189 TIGR03411 urea_trans_UrtD urea 97.4 0.0028 7.2E-08 42.9 9.8 132 635-776 17-205 (242) 190 PRK13409 putative ATPase RIL; 97.4 0.0057 1.5E-07 40.6 11.1 157 649-840 364-555 (590) 191 cd03290 ABCC_SUR1_N The SUR do 97.4 0.0031 8E-08 42.6 9.8 33 635-671 16-48 (218) 192 cd03254 ABCC_Glucan_exporter_l 97.3 0.0028 7E-08 43.0 9.5 34 634-671 17-50 (229) 193 cd03297 ABC_ModC_molybdenum_tr 97.3 0.0057 1.5E-07 40.6 11.1 141 633-781 12-201 (214) 194 cd03369 ABCC_NFT1 Domain 2 of 97.3 0.0055 1.4E-07 40.7 11.0 34 634-671 22-55 (207) 195 PRK13549 xylose transporter AT 97.3 0.0052 1.3E-07 40.9 10.7 33 635-671 277-309 (513) 196 cd03253 ABCC_ATM1_transporter 97.3 0.0088 2.3E-07 39.2 11.8 41 634-686 15-55 (236) 197 PRK11288 araG L-arabinose tran 97.3 0.0042 1.1E-07 41.7 10.1 33 635-671 268-300 (501) 198 COG4619 ABC-type uncharacteriz 97.3 0.0058 1.5E-07 40.6 10.6 133 633-774 16-195 (223) 199 COG4152 ABC-type uncharacteriz 97.3 0.0017 4.2E-08 44.6 7.7 121 635-773 17-190 (300) 200 cd03295 ABC_OpuCA_Osmoprotecti 97.3 0.0073 1.9E-07 39.8 11.0 139 635-781 16-205 (242) 201 PRK10762 D-ribose transporter 97.3 0.0063 1.6E-07 40.3 10.5 31 637-671 269-299 (501) 202 cd03291 ABCC_CFTR1 The CFTR su 97.2 0.0089 2.3E-07 39.2 11.2 124 635-776 52-222 (282) 203 PRK09700 D-allose transporter 97.2 0.013 3.4E-07 37.9 12.1 31 637-671 280-310 (510) 204 pfam01695 IstB IstB-like ATP b 97.2 0.015 3.7E-07 37.6 12.1 101 652-773 49-150 (178) 205 TIGR01189 ccmA heme ABC export 97.2 0.00036 9.2E-09 49.6 3.7 124 633-772 15-190 (204) 206 PRK13645 cbiO cobalt transport 97.2 0.0035 9E-08 42.2 8.4 53 727-780 166-219 (289) 207 KOG0927 consensus 97.2 0.025 6.5E-07 35.8 12.7 144 635-791 405-585 (614) 208 cd03249 ABC_MTABC3_MDL1_MDL2 M 97.2 0.009 2.3E-07 39.1 10.4 34 634-671 17-50 (238) 209 COG1122 CbiO ABC-type cobalt t 97.1 0.0035 9E-08 42.2 7.9 49 728-777 155-203 (235) 210 cd03252 ABCC_Hemolysin The ABC 97.1 0.015 3.7E-07 37.6 11.0 36 632-671 14-49 (237) 211 pfam00154 RecA recA bacterial 97.1 0.0069 1.8E-07 40.0 9.3 130 637-774 40-190 (322) 212 COG3845 ABC-type uncharacteriz 97.0 0.012 3.1E-07 38.1 10.1 54 728-785 420-476 (501) 213 PRK06526 transposase; Provisio 97.0 0.023 5.8E-07 36.1 11.4 95 653-773 101-201 (254) 214 cd03262 ABC_HisP_GlnQ_permease 97.0 0.014 3.6E-07 37.7 10.2 138 635-780 15-203 (213) 215 TIGR02982 heterocyst_DevA ABC 97.0 0.0069 1.8E-07 40.0 8.5 134 632-782 17-211 (220) 216 PRK13657 cyclic beta-1,2-gluca 97.0 0.025 6.3E-07 35.9 11.2 32 635-670 350-381 (585) 217 pfam03266 DUF265 Protein of un 97.0 0.019 4.8E-07 36.7 10.6 134 653-792 2-157 (168) 218 PRK11153 metN DL-methionine tr 96.9 0.019 5E-07 36.6 10.5 49 727-776 156-204 (343) 219 cd03288 ABCC_SUR2 The SUR doma 96.9 0.029 7.3E-07 35.4 11.4 35 633-671 34-68 (257) 220 PRK09354 recA recombinase A; P 96.9 0.017 4.4E-07 37.1 10.0 137 635-775 46-199 (350) 221 PRK11160 cysteine/glutathione 96.9 0.011 2.7E-07 38.6 8.9 34 634-671 355-388 (575) 222 PRK10261 glutathione transport 96.9 0.013 3.3E-07 38.0 9.0 32 636-671 340-371 (623) 223 cd00009 AAA The AAA+ (ATPases 96.9 0.016 4.1E-07 37.2 9.5 108 650-775 19-132 (151) 224 PRK10418 nikD nickel transport 96.8 0.038 9.7E-07 34.5 11.2 34 634-671 17-50 (254) 225 KOG0064 consensus 96.8 0.011 2.7E-07 38.6 8.3 44 648-698 506-549 (728) 226 PRK10982 galactose/methyl gala 96.8 0.014 3.6E-07 37.7 8.8 31 637-671 265-295 (491) 227 COG2401 ABC-type ATPase fused 96.7 0.017 4.4E-07 37.1 9.0 137 633-778 396-573 (593) 228 PRK10419 nikE nickel transport 96.7 0.054 1.4E-06 33.3 11.5 51 727-780 167-220 (266) 229 PRK08116 hypothetical protein; 96.7 0.058 1.5E-06 33.1 12.6 113 649-780 107-221 (262) 230 TIGR00954 3a01203 Peroxysomal 96.7 0.0014 3.6E-08 45.2 3.1 50 636-687 544-605 (788) 231 PRK13545 tagH teichoic acids e 96.7 0.0073 1.9E-07 39.8 6.7 24 43-66 5-28 (549) 232 COG4133 CcmA ABC-type transpor 96.7 0.0023 5.8E-08 43.6 4.1 34 633-672 17-50 (209) 233 COG4988 CydD ABC-type transpor 96.7 0.064 1.6E-06 32.8 11.4 81 576-670 281-367 (559) 234 TIGR03415 ABC_choXWV_ATP choli 96.7 0.013 3.2E-07 38.1 7.7 34 636-673 40-73 (382) 235 COG4178 ABC-type uncharacteriz 96.6 0.03 7.5E-07 35.3 9.3 197 561-780 331-581 (604) 236 COG1136 SalX ABC-type antimicr 96.6 0.031 7.9E-07 35.1 9.4 51 727-781 158-211 (226) 237 cd03294 ABC_Pro_Gly_Bertaine T 96.5 0.025 6.4E-07 35.8 8.6 138 634-780 38-229 (269) 238 COG1129 MglA ABC-type sugar tr 96.5 0.081 2.1E-06 32.0 11.9 54 727-781 417-470 (500) 239 KOG0927 consensus 96.5 0.021 5.3E-07 36.4 7.9 38 299-336 95-139 (614) 240 TIGR02858 spore_III_AA stage I 96.5 0.016 4.2E-07 37.2 7.4 129 614-780 105-248 (282) 241 TIGR01842 type_I_sec_PrtD type 96.5 0.042 1.1E-06 34.2 9.4 147 609-782 318-535 (556) 242 COG3839 MalK ABC-type sugar tr 96.5 0.0036 9.2E-08 42.1 4.0 131 635-781 18-203 (338) 243 PRK08181 transposase; Validate 96.4 0.089 2.3E-06 31.7 11.0 96 653-773 109-209 (269) 244 COG0410 LivF ABC-type branched 96.3 0.017 4.4E-07 37.1 6.8 130 635-772 18-196 (237) 245 PRK08939 primosomal protein Dn 96.3 0.1 2.6E-06 31.3 12.1 105 649-782 156-270 (306) 246 KOG0056 consensus 96.3 0.1 2.6E-06 31.3 15.6 150 600-771 526-731 (790) 247 TIGR03269 met_CoM_red_A2 methy 96.3 0.052 1.3E-06 33.5 9.1 54 727-781 443-497 (520) 248 PRK09183 transposase/IS protei 96.2 0.046 1.2E-06 33.8 8.4 95 653-773 104-205 (258) 249 pfam03215 Rad17 Rad17 cell cyc 96.2 0.052 1.3E-06 33.4 8.6 34 658-692 449-483 (490) 250 cd03258 ABC_MetN_methionine_tr 96.2 0.099 2.5E-06 31.4 10.0 138 635-780 20-209 (233) 251 KOG0059 consensus 96.2 0.091 2.3E-06 31.6 9.7 64 727-792 714-778 (885) 252 cd00983 recA RecA is a bacter 96.1 0.07 1.8E-06 32.5 9.0 122 648-775 53-194 (325) 253 TIGR01288 nodI nodulation ABC 96.1 0.027 6.8E-07 35.6 6.8 140 631-776 15-198 (303) 254 KOG0217 consensus 96.0 1.8E-05 4.7E-10 59.3 -9.6 176 648-833 752-941 (1125) 255 PRK10790 putative multidrug tr 96.0 0.14 3.5E-06 30.3 13.4 123 634-773 355-535 (593) 256 PRK00440 rfc replication facto 96.0 0.12 3.1E-06 30.7 9.8 99 653-773 40-141 (318) 257 cd01122 GP4d_helicase GP4d_hel 95.9 0.025 6.4E-07 35.8 6.0 44 648-692 28-71 (271) 258 PRK13695 putative NTPase; Prov 95.8 0.13 3.3E-06 30.5 9.3 115 653-773 6-139 (174) 259 PRK04195 replication factor C 95.8 0.15 3.8E-06 30.0 9.6 15 530-544 324-338 (403) 260 PRK09473 oppD oligopeptide tra 95.8 0.12 3.2E-06 30.6 9.0 33 635-671 31-63 (330) 261 smart00382 AAA ATPases associa 95.7 0.0092 2.3E-07 39.1 3.1 90 651-746 3-95 (148) 262 KOG0060 consensus 95.7 0.085 2.2E-06 31.9 8.0 125 648-777 459-631 (659) 263 COG0411 LivG ABC-type branched 95.6 0.013 3.3E-07 37.9 3.4 133 635-772 19-209 (250) 264 PRK10869 recombination and rep 95.5 0.22 5.6E-06 28.8 26.0 112 710-837 436-552 (553) 265 cd03244 ABCC_MRP_domain2 Domai 95.4 0.024 6.1E-07 36.0 4.3 35 633-671 17-51 (221) 266 cd03275 ABC_SMC1_euk Eukaryoti 95.4 0.23 5.8E-06 28.6 10.2 74 704-785 154-231 (247) 267 PRK07952 DNA replication prote 95.4 0.16 4E-06 29.9 8.5 103 652-772 98-201 (242) 268 PRK11308 dppF dipeptide transp 95.2 0.27 6.8E-06 28.2 11.2 47 727-777 170-219 (327) 269 PRK12402 replication factor C 95.2 0.22 5.6E-06 28.8 8.7 104 653-773 39-165 (337) 270 PRK10261 glutathione transport 95.2 0.27 6.8E-06 28.1 11.2 47 727-776 479-527 (623) 271 COG1618 Predicted nucleotide k 95.1 0.052 1.3E-06 33.4 5.3 127 653-786 8-156 (179) 272 COG4559 ABC-type hemin transpo 95.1 0.02 5.2E-07 36.5 3.2 40 635-686 16-55 (259) 273 pfam05729 NACHT NACHT domain. 95.1 0.18 4.6E-06 29.4 8.0 121 651-782 1-138 (165) 274 PRK06921 hypothetical protein; 95.1 0.26 6.5E-06 28.3 8.7 130 652-806 118-262 (265) 275 cd00984 DnaB_C DnaB helicase C 94.9 0.18 4.5E-06 29.5 7.5 42 649-691 12-53 (242) 276 PRK10789 putative multidrug tr 94.9 0.08 2E-06 32.1 5.6 33 635-671 330-362 (569) 277 cd03279 ABC_sbcCD SbcCD and ot 94.7 0.052 1.3E-06 33.4 4.4 35 637-674 18-52 (213) 278 PRK03918 chromosome segregatio 94.7 0.033 8.5E-07 34.9 3.3 71 704-781 790-865 (882) 279 cd01120 RecA-like_NTPases RecA 94.6 0.05 1.3E-06 33.6 4.1 86 653-738 2-94 (165) 280 pfam00004 AAA ATPase family as 94.6 0.38 9.7E-06 27.0 10.0 103 653-777 1-115 (131) 281 PRK06835 DNA replication prote 94.5 0.4 1E-05 26.9 10.9 102 652-773 185-289 (330) 282 PRK13342 recombination factor 94.4 0.41 1E-05 26.8 8.9 13 534-546 393-405 (417) 283 COG0497 RecN ATPase involved i 94.4 0.42 1.1E-05 26.7 11.6 113 710-838 437-554 (557) 284 COG3842 PotA ABC-type spermidi 94.4 0.067 1.7E-06 32.6 4.3 136 633-776 18-200 (352) 285 COG3950 Predicted ATP-binding 94.1 0.066 1.7E-06 32.7 3.8 38 731-772 297-335 (440) 286 TIGR00618 sbcc exonuclease Sbc 94.1 0.056 1.4E-06 33.2 3.4 72 705-778 970-1046(1063) 287 TIGR00968 3a0106s01 sulfate AB 94.1 0.065 1.7E-06 32.7 3.7 149 635-791 15-214 (241) 288 KOG0922 consensus 94.0 0.048 1.2E-06 33.7 3.0 29 84-112 48-76 (674) 289 PRK10070 glycine betaine trans 94.0 0.062 1.6E-06 32.9 3.5 33 636-672 44-76 (400) 290 TIGR03185 DNA_S_dndD DNA sulfu 93.9 0.1 2.5E-06 31.3 4.4 36 66-101 28-77 (650) 291 COG4138 BtuD ABC-type cobalami 93.9 0.51 1.3E-05 26.0 10.5 22 648-669 23-44 (248) 292 KOG0065 consensus 93.9 0.13 3.3E-06 30.5 4.9 112 654-776 821-993 (1391) 293 cd03241 ABC_RecN RecN ATPase i 93.9 0.52 1.3E-05 26.0 11.3 92 719-823 182-275 (276) 294 cd03277 ABC_SMC5_euk Eukaryoti 93.8 0.069 1.8E-06 32.5 3.5 103 649-760 22-175 (213) 295 PRK02224 chromosome segregatio 93.8 0.071 1.8E-06 32.4 3.5 75 704-781 780-862 (880) 296 PRK01156 chromosome segregatio 93.8 0.063 1.6E-06 32.8 3.3 74 704-781 800-878 (895) 297 COG0419 SbcC ATPase involved i 93.8 0.072 1.8E-06 32.4 3.4 73 704-781 814-891 (908) 298 PTZ00243 ABC transporter; Prov 93.8 0.54 1.4E-05 25.9 9.6 31 634-668 1324-1354(1560) 299 TIGR02324 CP_lyasePhnL phospho 93.7 0.059 1.5E-06 33.0 2.9 32 635-670 23-54 (224) 300 PRK13409 putative ATPase RIL; 93.5 0.12 3.1E-06 30.6 4.3 16 696-714 367-382 (590) 301 pfam09818 ABC_ATPase Predicted 93.5 0.23 5.7E-06 28.7 5.6 48 652-704 245-292 (447) 302 PRK09112 DNA polymerase III su 93.5 0.26 6.6E-06 28.2 5.9 119 649-774 44-182 (352) 303 pfam01580 FtsK_SpoIIIE FtsK/Sp 93.4 0.22 5.6E-06 28.8 5.5 119 653-773 41-200 (202) 304 COG1106 Predicted ATPases [Gen 93.4 0.1 2.5E-06 31.3 3.7 50 730-780 271-320 (371) 305 TIGR01166 cbiO cobalt ABC tran 93.4 0.063 1.6E-06 32.8 2.6 114 648-775 16-189 (190) 306 PRK10246 exonuclease subunit S 93.4 0.097 2.5E-06 31.4 3.6 95 680-777 919-1021(1047) 307 cd01394 radB RadB. The archaea 93.3 0.64 1.6E-05 25.3 9.2 119 649-772 18-157 (218) 308 COG1126 GlnQ ABC-type polar am 93.2 0.11 2.8E-06 31.1 3.7 130 634-774 16-197 (240) 309 PRK10522 multidrug transporter 93.2 0.56 1.4E-05 25.7 7.3 33 634-670 337-369 (547) 310 TIGR03015 pepcterm_ATPase puta 93.2 0.27 7E-06 28.1 5.7 83 650-738 43-132 (269) 311 pfam03796 DnaB_C DnaB-like hel 93.2 0.59 1.5E-05 25.6 7.4 116 649-773 18-139 (186) 312 TIGR02169 SMC_prok_A chromosom 93.1 0.04 1E-06 34.3 1.3 21 173-193 131-153 (1202) 313 cd03276 ABC_SMC6_euk Eukaryoti 93.0 0.1 2.7E-06 31.2 3.3 24 650-673 21-44 (198) 314 PRK13341 recombination factor 92.9 0.55 1.4E-05 25.8 6.9 23 813-837 682-704 (726) 315 COG2884 FtsE Predicted ATPase 92.9 0.12 3E-06 30.8 3.4 47 728-778 154-202 (223) 316 PRK11174 cysteine/glutathione 92.8 0.74 1.9E-05 24.8 7.4 33 634-670 364-396 (588) 317 TIGR00972 3a0107s01c2 phosphat 92.8 0.11 2.9E-06 31.0 3.2 121 635-772 16-202 (248) 318 COG4987 CydC ABC-type transpor 92.7 0.61 1.6E-05 25.5 6.8 45 729-779 492-537 (573) 319 PRK00064 recF recombination pr 92.7 0.13 3.3E-06 30.5 3.4 59 726-791 297-355 (355) 320 COG1193 Mismatch repair ATPase 92.7 0.0024 6E-08 43.5 -5.5 119 650-772 532-653 (753) 321 PRK09361 radB DNA repair and r 92.6 0.79 2E-05 24.6 7.9 121 649-772 22-160 (224) 322 pfam00910 RNA_helicase RNA hel 92.6 0.54 1.4E-05 25.8 6.5 64 654-745 2-65 (105) 323 PRK11664 ATP-dependent RNA hel 92.5 0.32 8.2E-06 27.5 5.3 27 83-109 1-27 (812) 324 pfam05496 RuvB_N Holliday junc 92.5 0.56 1.4E-05 25.8 6.5 62 653-742 53-114 (234) 325 COG3638 ABC-type phosphate/pho 92.5 0.24 6.2E-06 28.4 4.6 35 634-672 18-52 (258) 326 TIGR02315 ABC_phnC phosphonate 92.5 0.13 3.2E-06 30.6 3.1 30 638-671 20-49 (253) 327 smart00763 AAA_PrkA PrkA AAA d 92.4 0.15 3.8E-06 30.1 3.4 18 651-668 79-96 (361) 328 TIGR00602 rad24 checkpoint pro 92.4 0.12 3.2E-06 30.6 3.0 39 290-332 250-288 (670) 329 TIGR02012 tigrfam_recA protein 92.4 0.17 4.3E-06 29.6 3.7 133 635-772 41-191 (322) 330 TIGR03420 DnaA_homol_Hda DnaA 92.4 0.84 2.1E-05 24.4 10.6 96 650-773 38-133 (226) 331 COG1134 TagH ABC-type polysacc 92.3 0.18 4.6E-06 29.4 3.7 127 634-776 41-210 (249) 332 COG1480 Predicted membrane-ass 92.3 0.58 1.5E-05 25.6 6.3 76 748-834 498-582 (700) 333 PRK11176 lipid transporter ATP 92.2 0.43 1.1E-05 26.6 5.6 33 634-670 356-388 (581) 334 COG4604 CeuD ABC-type enteroch 92.2 0.22 5.5E-06 28.8 4.1 72 632-728 13-84 (252) 335 KOG0057 consensus 92.2 0.87 2.2E-05 24.3 7.2 122 653-781 381-554 (591) 336 COG4615 PvdE ABC-type sideroph 92.2 0.44 1.1E-05 26.5 5.6 151 601-773 311-509 (546) 337 pfam05707 Zot Zonular occluden 92.2 0.88 2.3E-05 24.3 9.8 108 652-775 2-118 (183) 338 PRK06995 flhF flagellar biosyn 92.1 0.9 2.3E-05 24.2 7.9 159 649-831 175-346 (404) 339 COG3044 Predicted ATPase of th 92.0 0.18 4.7E-06 29.3 3.5 21 652-672 244-264 (554) 340 KOG0743 consensus 91.9 0.85 2.2E-05 24.4 6.8 145 648-841 233-397 (457) 341 COG4161 ArtP ABC-type arginine 91.9 0.25 6.5E-06 28.3 4.1 36 635-674 17-52 (242) 342 KOG0979 consensus 91.8 0.24 6E-06 28.5 3.8 68 40-110 19-94 (1072) 343 COG1125 OpuBA ABC-type proline 91.6 0.23 6E-06 28.6 3.7 32 635-670 16-47 (309) 344 pfam00931 NB-ARC NB-ARC domain 91.5 0.68 1.7E-05 25.1 5.9 174 648-836 17-206 (285) 345 PRK09519 recA recombinase A; R 91.4 0.53 1.3E-05 25.9 5.3 10 823-832 719-728 (790) 346 COG1127 Ttg2A ABC-type transpo 91.3 1.1 2.7E-05 23.6 13.1 126 633-772 21-205 (263) 347 COG4717 Uncharacterized conser 91.3 0.15 3.8E-06 30.1 2.4 89 685-777 863-971 (984) 348 cd03242 ABC_RecF RecF is a rec 91.2 0.23 5.9E-06 28.6 3.3 30 728-761 209-238 (270) 349 PRK12727 flagellar biosynthesi 90.9 1.2 3E-05 23.3 7.2 127 649-786 347-483 (557) 350 TIGR01186 proV glycine betaine 90.8 0.27 6.9E-06 28.1 3.4 11 603-613 339-349 (372) 351 PRK11022 dppD dipeptide transp 90.8 0.3 7.7E-06 27.8 3.6 33 635-671 22-54 (327) 352 COG4674 Uncharacterized ABC-ty 90.8 0.11 2.8E-06 31.1 1.3 45 729-776 165-209 (249) 353 pfam02463 SMC_N RecF/RecN/SMC 90.6 0.29 7.4E-06 27.9 3.4 12 180-191 117-128 (1162) 354 COG4637 Predicted ATPase [Gene 90.6 0.36 9.1E-06 27.2 3.8 124 655-786 207-344 (373) 355 TIGR02533 type_II_gspE general 90.5 0.16 4E-06 29.9 1.9 19 652-670 247-265 (495) 356 pfam01637 Arch_ATPase Archaeal 90.3 1.3 3.4E-05 23.0 10.7 24 649-672 19-42 (223) 357 PRK12726 flagellar biosynthesi 90.2 0.79 2E-05 24.6 5.3 89 648-742 204-298 (407) 358 PRK04296 thymidine kinase; Pro 90.2 1.1 2.7E-05 23.7 5.9 147 650-808 2-161 (197) 359 pfam08298 AAA_PrkA PrkA AAA do 90.1 0.31 7.9E-06 27.7 3.2 21 649-669 84-104 (358) 360 COG0470 HolB ATPase involved i 90.1 1.4 3.5E-05 22.9 7.7 107 651-773 25-149 (325) 361 pfam00265 TK Thymidine kinase. 90.1 0.6 1.5E-05 25.5 4.6 142 650-808 1-155 (175) 362 COG4778 PhnL ABC-type phosphon 90.1 0.4 1E-05 26.9 3.7 33 635-671 26-58 (235) 363 KOG1970 consensus 90.0 0.27 6.9E-06 28.1 2.8 25 658-682 516-540 (634) 364 KOG0996 consensus 89.9 1.4 3.6E-05 22.7 7.8 13 124-136 104-116 (1293) 365 COG1116 TauB ABC-type nitrate/ 89.8 0.37 9.4E-06 27.1 3.4 127 635-774 18-192 (248) 366 COG1196 Smc Chromosome segrega 89.8 0.32 8.1E-06 27.6 3.0 17 612-628 525-541 (1163) 367 TIGR01194 cyc_pep_trnsptr cycl 89.7 0.39 1E-05 26.9 3.4 58 611-686 337-396 (555) 368 TIGR01420 pilT_fam twitching m 89.7 0.22 5.5E-06 28.8 2.1 65 582-667 68-144 (350) 369 PRK06851 hypothetical protein; 89.6 0.42 1.1E-05 26.7 3.5 54 630-692 202-255 (368) 370 PRK10787 DNA-binding ATP-depen 89.6 0.46 1.2E-05 26.4 3.7 60 745-833 679-740 (784) 371 TIGR00929 VirB4_CagE type IV s 89.6 0.26 6.7E-06 28.2 2.5 184 653-845 519-751 (931) 372 PRK05703 flhF flagellar biosyn 89.6 1.5 3.7E-05 22.6 7.0 165 648-835 208-384 (412) 373 PRK00411 cdc6 cell division co 89.5 0.69 1.8E-05 25.1 4.6 121 653-777 58-187 (394) 374 COG1101 PhnK ABC-type uncharac 89.5 0.32 8.1E-06 27.6 2.8 128 635-772 21-208 (263) 375 TIGR01188 drrA daunorubicin re 89.4 0.38 9.6E-06 27.0 3.1 124 648-777 19-193 (343) 376 TIGR00611 recf DNA replication 89.3 0.29 7.5E-06 27.8 2.5 25 576-600 339-365 (399) 377 cd03115 SRP The signal recogni 89.1 1.6 4.1E-05 22.3 7.1 154 652-826 2-169 (173) 378 COG0466 Lon ATP-dependent Lon 89.0 0.73 1.9E-05 24.9 4.4 103 651-775 351-460 (782) 379 COG1643 HrpA HrpA-like helicas 88.9 1.6 4.2E-05 22.3 6.7 30 188-217 188-218 (845) 380 TIGR00960 3a0501s02 Type II (G 88.9 0.58 1.5E-05 25.7 3.8 123 649-778 28-203 (216) 381 cd01393 recA_like RecA is a b 88.8 1.7 4.3E-05 22.2 8.0 116 649-773 18-169 (226) 382 COG3854 SpoIIIAA ncharacterize 88.6 1.7 4.4E-05 22.1 8.2 107 653-780 140-260 (308) 383 COG1117 PstB ABC-type phosphat 88.3 0.52 1.3E-05 26.0 3.2 35 635-673 22-56 (253) 384 KOG0061 consensus 88.3 1.8 4.6E-05 21.9 10.2 15 125-139 44-58 (613) 385 KOG2355 consensus 88.2 0.51 1.3E-05 26.1 3.1 38 731-773 167-208 (291) 386 KOG0744 consensus 88.1 1.8 4.7E-05 21.9 6.9 101 650-771 177-304 (423) 387 TIGR00606 rad50 rad50; InterPr 88.0 0.32 8.1E-06 27.6 2.0 50 128-177 21-95 (1328) 388 PRK12377 putative replication 87.9 1.9 4.8E-05 21.8 11.7 103 652-772 103-205 (248) 389 KOG2004 consensus 87.9 1.6 4E-05 22.4 5.5 104 648-776 436-549 (906) 390 TIGR02142 modC_ABC molybdate A 87.7 1.7 4.4E-05 22.1 5.6 139 636-782 13-202 (361) 391 PRK10078 ribose 1,5-bisphospho 87.4 0.59 1.5E-05 25.6 3.1 21 650-670 2-22 (184) 392 PRK06647 DNA polymerase III su 87.4 2 5.1E-05 21.6 7.2 15 255-269 131-145 (560) 393 PRK06893 DNA replication initi 87.2 2.1 5.2E-05 21.5 10.5 112 650-791 39-159 (229) 394 TIGR02868 CydC ABC transporter 87.1 0.74 1.9E-05 24.8 3.4 144 608-772 356-565 (566) 395 pfam00437 GSPII_E Type II/IV s 87.0 0.65 1.7E-05 25.2 3.1 22 650-671 139-160 (283) 396 COG1484 DnaC DNA replication p 86.9 2.1 5.5E-05 21.4 10.8 97 652-774 107-210 (254) 397 COG2256 MGS1 ATPase related to 86.9 1.9 4.9E-05 21.7 5.5 14 535-548 408-421 (436) 398 cd01123 Rad51_DMC1_radA Rad51_ 86.8 2.2 5.5E-05 21.4 12.1 116 649-772 18-169 (235) 399 KOG0250 consensus 86.8 0.67 1.7E-05 25.1 3.1 11 754-764 586-596 (1074) 400 COG1419 FlhF Flagellar GTP-bin 86.6 2.2 5.6E-05 21.3 6.8 118 649-784 202-337 (407) 401 KOG1533 consensus 86.4 0.58 1.5E-05 25.6 2.6 37 285-337 95-131 (290) 402 TIGR00763 lon ATP-dependent pr 86.3 1.4 3.6E-05 22.8 4.5 104 649-780 448-568 (941) 403 COG1124 DppF ABC-type dipeptid 86.2 2.3 5.9E-05 21.1 9.4 133 635-777 22-206 (252) 404 cd01131 PilT Pilus retraction 86.2 0.71 1.8E-05 25.0 2.9 19 652-670 3-21 (198) 405 TIGR03263 guanyl_kin guanylate 86.1 0.84 2.1E-05 24.4 3.3 22 650-671 1-22 (180) 406 pfam05621 TniB Bacterial TniB 86.1 1.2 3E-05 23.3 4.0 48 721-771 138-189 (302) 407 pfam06745 KaiC KaiC. This fami 86.0 2.4 6E-05 21.1 10.3 122 649-775 18-167 (231) 408 COG4618 ArpD ABC-type protease 85.8 2.4 6.2E-05 21.0 11.1 53 728-782 489-541 (580) 409 cd01130 VirB11-like_ATPase Typ 85.7 0.75 1.9E-05 24.8 2.9 105 652-776 27-138 (186) 410 COG2804 PulE Type II secretory 85.3 0.81 2.1E-05 24.5 2.9 20 651-670 259-278 (500) 411 COG4185 Uncharacterized protei 85.2 0.33 8.3E-06 27.5 0.8 18 650-667 2-19 (187) 412 COG0194 Gmk Guanylate kinase [ 85.0 0.94 2.4E-05 24.0 3.1 110 649-783 3-113 (191) 413 PRK08118 topology modulation p 84.7 0.89 2.3E-05 24.2 2.9 62 653-736 4-65 (167) 414 COG4148 ModC ABC-type molybdat 84.5 2.7 7E-05 20.6 8.8 149 636-790 14-207 (352) 415 cd01129 PulE-GspE PulE/GspE Th 84.5 0.93 2.4E-05 24.1 2.9 19 651-669 81-99 (264) 416 COG2805 PilT Tfp pilus assembl 84.3 0.74 1.9E-05 24.8 2.3 63 651-746 126-188 (353) 417 PRK10436 hypothetical protein; 84.1 0.99 2.5E-05 23.9 2.9 74 651-738 216-293 (461) 418 PRK01184 hypothetical protein; 83.9 1.4 3.5E-05 22.9 3.5 122 651-810 2-126 (183) 419 pfam00625 Guanylate_kin Guanyl 83.9 1.1 2.9E-05 23.5 3.1 110 651-783 2-112 (182) 420 cd03273 ABC_SMC2_euk Eukaryoti 83.9 0.99 2.5E-05 23.9 2.8 22 651-672 26-47 (251) 421 PRK00300 gmk guanylate kinase; 83.8 1.2 3E-05 23.3 3.2 23 648-670 5-27 (208) 422 PRK13768 GTPase; Provisional 83.7 1.2 2.9E-05 23.4 3.1 30 652-683 4-33 (253) 423 smart00072 GuKc Guanylate kina 83.6 1.2 3.2E-05 23.2 3.2 22 650-671 2-23 (184) 424 PRK11519 tyrosine kinase; Prov 83.5 2 5.1E-05 21.6 4.3 61 612-683 497-558 (720) 425 KOG0085 consensus 83.5 0.89 2.3E-05 24.2 2.4 11 606-616 264-274 (359) 426 pfam00308 Bac_DnaA Bacterial d 83.5 3 7.7E-05 20.3 6.6 97 652-772 36-139 (219) 427 PRK07261 topology modulation p 83.4 1.1 2.8E-05 23.6 2.8 63 653-737 3-65 (171) 428 COG4175 ProV ABC-type proline/ 83.2 1.3 3.3E-05 23.0 3.2 56 301-356 55-113 (386) 429 KOG1969 consensus 83.1 3.1 7.9E-05 20.2 9.1 42 768-809 711-756 (877) 430 pfam00448 SRP54 SRP54-type pro 83.1 3.1 7.9E-05 20.2 6.5 163 651-841 2-184 (196) 431 TIGR01832 kduD 2-deoxy-D-gluco 83.1 1.3 3.3E-05 23.0 3.1 13 340-352 87-99 (249) 432 COG1132 MdlB ABC-type multidru 83.0 1.5 3.8E-05 22.6 3.4 122 635-772 344-523 (567) 433 PRK06731 flhF flagellar biosyn 82.9 3.2 8.1E-05 20.1 10.4 171 649-840 74-255 (270) 434 TIGR00634 recN DNA repair prot 82.8 3.2 8.1E-05 20.1 10.6 119 710-837 471-604 (605) 435 KOG0734 consensus 82.7 3.2 8.2E-05 20.1 8.9 43 653-702 340-382 (752) 436 COG4167 SapF ABC-type antimicr 82.3 1.9 5E-05 21.7 3.8 24 307-330 48-71 (267) 437 pfam08497 Radical_SAM_N Radica 82.0 2 5.2E-05 21.6 3.8 41 64-112 15-56 (298) 438 COG3172 NadR Predicted ATPase/ 81.8 1.3 3.3E-05 23.0 2.8 23 651-673 9-31 (187) 439 PRK06217 hypothetical protein; 81.7 1.4 3.5E-05 22.8 2.8 21 653-673 4-24 (185) 440 COG1137 YhbG ABC-type (unclass 81.6 1.1 2.8E-05 23.5 2.3 33 631-667 15-47 (243) 441 PRK00955 hypothetical protein; 81.3 2.2 5.6E-05 21.3 3.8 42 64-112 12-53 (599) 442 TIGR00956 3a01205 Pleiotropic 81.3 1.4 3.5E-05 22.8 2.7 108 654-780 857-1034(1466) 443 pfam03205 MobB Molybdopterin g 81.2 1.6 4.1E-05 22.3 3.0 30 652-683 2-31 (122) 444 COG4938 Uncharacterized conser 81.1 2.4 6E-05 21.1 3.9 47 727-775 258-304 (374) 445 CHL00181 cbbX CbbX; Provisiona 81.0 3.7 9.4E-05 19.6 10.3 115 653-785 62-183 (287) 446 PRK05917 DNA polymerase III su 80.8 3.7 9.5E-05 19.6 5.9 105 650-773 19-135 (290) 447 PRK08451 DNA polymerase III su 80.8 3.7 9.5E-05 19.6 6.1 12 49-60 11-22 (523) 448 COG4639 Predicted kinase [Gene 80.7 2.3 6E-05 21.1 3.7 77 745-840 52-138 (168) 449 COG3911 Predicted ATPase [Gene 80.7 2.1 5.5E-05 21.4 3.5 91 650-758 9-101 (183) 450 PRK09841 cryptic autophosphory 80.6 3.1 7.9E-05 20.2 4.3 35 648-684 529-564 (726) 451 TIGR01187 potA polyamine ABC t 80.2 1 2.7E-05 23.7 1.8 16 340-355 73-88 (331) 452 PRK13853 type IV secretion sys 80.1 1.1 2.8E-05 23.6 1.9 41 649-695 426-466 (789) 453 cd03274 ABC_SMC4_euk Eukaryoti 80.0 1.5 3.9E-05 22.5 2.6 58 724-785 144-202 (212) 454 PRK10416 cell division protein 79.9 4 0.0001 19.4 7.1 23 649-671 294-316 (499) 455 CHL00176 ftsH cell division pr 79.8 4 0.0001 19.4 6.1 16 612-627 435-450 (631) 456 TIGR02788 VirB11 P-type DNA tr 79.7 1.7 4.3E-05 22.1 2.8 32 652-692 160-191 (328) 457 TIGR00958 3a01208 antigen pept 79.3 4.1 0.00011 19.3 7.4 75 650-733 559-653 (770) 458 cd00227 CPT Chloramphenicol (C 78.9 2.2 5.7E-05 21.2 3.2 23 649-671 1-23 (175) 459 PRK11331 5-methylcytosine-spec 78.8 2.5 6.3E-05 20.9 3.4 84 651-744 195-287 (459) 460 pfam06414 Zeta_toxin Zeta toxi 78.7 2.6 6.6E-05 20.8 3.5 24 648-671 10-33 (191) 461 TIGR00630 uvra excinuclease AB 78.7 2.7 6.8E-05 20.7 3.5 47 611-675 644-690 (956) 462 PRK07078 hypothetical protein; 78.6 4.3 0.00011 19.1 7.2 80 648-759 240-319 (510) 463 pfam02492 cobW CobW/HypB/UreG, 78.3 2 5.2E-05 21.6 2.8 20 652-671 2-21 (174) 464 cd02019 NK Nucleoside/nucleoti 78.0 1.7 4.2E-05 22.2 2.3 22 653-674 2-23 (69) 465 PRK07132 DNA polymerase III su 78.0 4.5 0.00011 19.0 7.0 110 649-773 19-132 (303) 466 cd01124 KaiC KaiC is a circadi 78.0 4.5 0.00011 19.0 10.0 119 653-775 2-142 (187) 467 TIGR03574 selen_PSTK L-seryl-t 77.7 4.6 0.00012 18.9 6.4 29 653-683 2-30 (249) 468 COG4650 RtcR Sigma54-dependent 77.7 3.8 9.8E-05 19.5 4.1 72 652-740 210-293 (531) 469 cd03114 ArgK-like The function 77.6 4.6 0.00012 18.9 7.9 87 654-746 3-108 (148) 470 PRK08727 hypothetical protein; 77.6 4.6 0.00012 18.9 9.8 94 649-772 40-135 (233) 471 COG4598 HisP ABC-type histidin 77.6 2.3 5.9E-05 21.2 2.9 26 648-673 30-55 (256) 472 COG4088 Predicted nucleotide k 77.4 2.1 5.3E-05 21.5 2.6 85 651-749 2-89 (261) 473 TIGR01193 bacteriocin_ABC ABC- 77.4 2.2 5.5E-05 21.4 2.7 141 592-778 476-676 (710) 474 cd00071 GMPK Guanosine monopho 77.0 2.1 5.3E-05 21.5 2.6 107 652-784 1-111 (137) 475 KOG2373 consensus 77.0 1.9 4.8E-05 21.8 2.3 138 616-771 257-423 (514) 476 cd02021 GntK Gluconate kinase 76.8 2.1 5.3E-05 21.5 2.5 20 652-671 1-20 (150) 477 TIGR02755 TraX_Ftype type-F co 76.7 0.67 1.7E-05 25.2 -0.0 56 652-720 47-107 (232) 478 PRK01254 hypothetical protein; 76.6 3.8 9.6E-05 19.5 3.8 41 64-112 37-78 (742) 479 PRK08084 DNA replication initi 76.5 4.9 0.00012 18.7 10.4 95 649-773 44-141 (235) 480 TIGR03029 EpsG chain length de 76.4 4.9 0.00013 18.7 4.5 88 649-746 102-192 (274) 481 TIGR02633 xylG D-xylose ABC tr 76.3 3.1 8E-05 20.1 3.3 79 720-799 411-491 (501) 482 PRK05416 hypothetical protein; 76.3 2.8 7.1E-05 20.5 3.1 11 612-622 253-263 (292) 483 PRK13830 conjugal transfer pro 76.1 2.2 5.6E-05 21.3 2.5 29 653-686 459-487 (818) 484 PRK04690 murD UDP-N-acetylmura 76.0 4.5 0.00012 19.0 4.1 17 815-831 445-461 (468) 485 pfam07724 AAA_2 AAA domain (Cd 75.9 4.7 0.00012 18.8 4.1 23 652-674 5-27 (168) 486 KOG0962 consensus 75.8 2.5 6.3E-05 20.9 2.7 18 126-143 18-35 (1294) 487 PRK04301 radA DNA repair and r 75.7 5.1 0.00013 18.5 8.5 116 649-771 102-252 (318) 488 COG4608 AppF ABC-type oligopep 75.6 5.2 0.00013 18.5 11.3 120 635-777 28-174 (268) 489 TIGR01087 murD UDP-N-acetylmur 75.6 3.3 8.3E-05 20.0 3.3 21 95-115 17-37 (476) 490 COG0771 MurD UDP-N-acetylmuram 75.6 3.7 9.4E-05 19.6 3.5 20 95-114 23-42 (448) 491 TIGR02857 CydD ABC transporter 75.3 3.5 8.9E-05 19.8 3.3 61 591-671 337-399 (570) 492 PRK00081 coaE dephospho-CoA ki 75.3 3.7 9.4E-05 19.6 3.5 26 652-683 4-29 (199) 493 PRK13900 type IV secretion sys 75.3 2.8 7.1E-05 20.5 2.8 19 653-671 163-181 (332) 494 cd01121 Sms Sms (bacterial rad 75.2 5.3 0.00013 18.5 7.1 156 650-815 82-250 (372) 495 KOG0082 consensus 75.0 3 7.5E-05 20.3 2.9 21 653-673 36-56 (354) 496 PRK11823 DNA repair protein Ra 74.9 1.6 4E-05 22.4 1.5 80 650-736 90-172 (454) 497 PRK03806 murD UDP-N-acetylmura 74.6 5.4 0.00014 18.4 4.2 54 618-680 78-131 (438) 498 PRK13851 type IV secretion sys 74.6 3 7.6E-05 20.3 2.8 19 653-671 165-183 (343) 499 pfam09439 SRPRB Signal recogni 74.4 3.6 9.3E-05 19.7 3.2 24 648-671 1-24 (181) 500 TIGR00017 cmk cytidylate kinas 74.3 2.2 5.6E-05 21.3 2.1 17 655-671 7-23 (223) No 1 >TIGR01070 mutS1 DNA mismatch repair protein MutS; InterPro: IPR005748 Mismatch repair contributes to the overall fidelity of DNA replication and is essential for combating the adverse effects of damage to the genome. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex. The post-replicative Mismatch Repair System (MMRS) of Escherichia coli involves MutS (Mutator S), MutL and MutH proteins, and acts to correct point mutations or small insertion/deletion loops produced during DNA replication . MutS and MutL are involved in preventing recombination between partially homologous DNA sequences. The assembly of MMRS is initiated by MutS, which recognises and binds to mispaired nucleotides and allows further action of MutL and MutH to eliminate a portion of newly synthesized DNA strand containing the mispaired base . MutS can also collaborate with methyltransferases in the repair of O(6)-methylguanine damage, which would otherwise pair with thymine during replication to create an O(6)mG:T mismatch . MutS exists as a dimer, where the two monomers have different conformations and form a heterodimer at the structural level . Only one monomer recognises the mismatch specifically and has ADP bound. Non-specific major groove DNA-binding domains from both monomers embrace the DNA in a clamp-like structure. Mismatch binding induces ATP uptake and a conformational change in the MutS protein, resulting in a clamp that translocates on DNA. MutS is a modular protein with a complex structure , and is composed of: N-terminal mismatch-recognition domain, which is similar in structure to tRNA endonuclease. Connector domain, which is similar in structure to Holliday junction resolvase ruvC. Core domain, which is composed of two separate subdomains that join together to form a helical bundle; from within the core domain, two helices act as levers that extend towards (but do not touch) the DNA. Clamp domain, which is inserted between the two subdomains of the core domain at the top of the lever helices; the clamp domain has a beta-sheet structure. ATPase domain (connected to the core domain), which has a classical Walker A motif. HTH (helix-turn-helix) domain, which is involved in dimer contacts. Homologues of MutS have been found in many species including eukaryotes (MSH 1, 2, 3, 4, 5, and 6 proteins), archaea and bacteria, and together these proteins have been grouped into the MutS family. Although many of these proteins have similar activities to the E. coli MutS, there is significant diversity of function among the MutS family members. This diversity is even seen within species, where many species encode multiple MutS homologues with distinct functions . Inter-species homologues may have arisen through frequent ancient horizontal gene transfer of MutS (and MutL) from bacteria to archaea and eukaryotes via endosymbiotic ancestors of mitochondria and chloroplasts . This entry represents a family of MutS proteins.; GO: 0005524 ATP binding, 0030983 mismatched DNA binding, 0006298 mismatch repair. Probab=100.00 E-value=0 Score=2046.16 Aligned_cols=850 Identities=37% Similarity=0.642 Sum_probs=794.4 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCE Q ss_conf 89168999999987799299998088222007999999986181891078899888762656176699999999988985 Q gi|254780750|r 25 STPMMRQYIEIKSINPDSLVFYRMGDFYELFFDDALLASRCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQKLIKIGHR 104 (920) Q Consensus 25 ~TP~~~Qy~~iK~~~~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~Lv~~Gyk 104 (920) +||||||||+||++|||.+||||||||||||||||..||++|+|+||+||++.|+++||||||+|+++.|+.+||+.||| T Consensus 1 ~tPmmQQY~~lK~~~PD~lLffRmGDFYElFyeDAk~Aa~~L~I~LT~R~~sA~~~IPMAGiPyHs~~~Y~~~L~~~G~~ 80 (863) T TIGR01070 1 LTPMMQQYLKLKAEHPDALLFFRMGDFYELFYEDAKKAAQLLDISLTSRGQSADEPIPMAGIPYHSVEAYLEKLVKQGYK 80 (863) T ss_pred CCCCHHHHHHHHHHCCCHHHHHHCCCEEECCHHHHHHHHHHHCCHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCE T ss_conf 98702578999864751577641363001107899999985032002457888888884787588899999999863986 Q ss_pred EEEEEECCCHHHHHHCCCCCCEEEEEEEEECCCCEECCHHCCCCCCCEEEEEEECCCEEEEEEEECCCCEEEEEECC-HH Q ss_conf 99995028978874128998455458999775303020103878773699999529849999999786559996058-78 Q gi|254780750|r 105 IAICEQIESPLEAKKRGNKSLVRRNVVRLVTPGTLTEDQLLSPTDSNYLMAVARIRTEWAIAWIDISTGIFKISTSN-HD 183 (920) Q Consensus 105 VaiveQ~E~p~~~~~~~~~~~v~R~Vt~IiTpGT~~d~~~l~~~~~nyL~aI~~~~~~~Gia~iDisTG~~~~~~~~-~d 183 (920) ||||||+|+|+.+ +|+|+|+|||||||||+.|+.++.+.++|||+||+.++..||+|++|+|||+|.++... .+ T Consensus 81 VAIcEQ~eDP~~a-----kG~V~ReVVq~iTPGTv~Deall~er~~N~lAai~~~~~~fGLA~lD~~tG~F~~~~~~~~e 155 (863) T TIGR01070 81 VAICEQIEDPKTA-----KGVVKREVVQIITPGTVSDEALLPERQDNLLAAIAQESNGFGLATLDLTTGEFKVTELADKE 155 (863) T ss_pred EEEECCCCCHHHC-----CCCEEEEEEEEECCCEECCCCCCHHHHHHHHEEEEECCCCCCEEEEEEEECCEEEECCCCHH T ss_conf 8985368886465-----78625689898528711135664033411212100068852147852243510022000178 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCEECCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHH-HHHHHHHHH Q ss_conf 9999985229857997077689678998874238815718521147134478899870878642102444-346888777 Q gi|254780750|r 184 RLISDIMRIDPREIIFSEKELSHAEYKSLFETLGNVAVAQPNVFFDHSISESRIADYYNITTVDTFGSFS-QVEKTAAAA 262 (920) Q Consensus 184 ~l~~~L~~l~P~EIii~~~~~~~~~~~~l~~~~~~~~~~~~~~~f~~~~~~~~l~~~~~~~~l~~~~~~~-~~~~~a~~a 262 (920) .+.++|.+++|+|||+++...+.... .. ..-...+|-|.|+...+...++..|...++..|+... ...++|.|+ T Consensus 156 ~l~~El~r~~p~E~l~~~~l~e~~~~---~~--rrG~~RR~~Wef~~~~a~~~~e~~f~~~dl~~~~~~~~~~~~~A~g~ 230 (863) T TIGR01070 156 TLLAELQRLNPAEVLLAEDLSEEEAI---EL--RRGLRRRPLWEFRLDTAVLSLEKQFETEDLKGLGLRNAPLGLKAAGC 230 (863) T ss_pred HHHHHHHCCCCEEEEECCCCCHHHHH---HH--CCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 99999860797388616572012799---96--37875788542331136887625766346410000234036678889 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCHHHCCEEEEECCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHCCCHH Q ss_conf 76436874112222221134457454014251332015454127877641278763012343378999986310023456 Q gi|254780750|r 263 AISYIKKTQMVNKPTIGYPEREDIQSTLFIDSAARSNLEILRTLSGSREQSLLKTIDYSITGAGGRLFAERIASPLTDST 342 (920) Q Consensus 263 ll~Yl~~~~~~~~~~i~~~~~~~~~~~m~LD~~Tl~nLEI~~~~~g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~PL~d~~ 342 (920) |+.|++.||...++|++++..++.+++|.||..|.+||||++|++|++++||+|+||+|+||||+|||++||-+||.|.+ T Consensus 231 LL~Ya~~TQ~~~L~Hl~p~~~~~~~d~~~ld~At~rnLEl~~nlrgG~~~tL~svLD~t~TAMG~RLL~~Wl~~PL~d~~ 310 (863) T TIGR01070 231 LLQYAKRTQRTALPHLRPVRLYELQDFVQLDAATRRNLELTENLRGGKQNTLFSVLDETKTAMGSRLLKRWLHRPLRDRE 310 (863) T ss_pred HHHHHHHHHCCCCCCCCCEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCHH T ss_conf 99999986102443036302455487773578876531000344678877630000446799861589986116632137 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 78989999997402212588999997520483444456775210000212068999998655675430258457787676 Q gi|254780750|r 343 KINTRLDSINFFIQNPILINPIQKILKSSPDVPRSLSRLKIGRGEPKDIAVIRDGIRSGINILNLFLKNDLPEELRGAVE 422 (920) Q Consensus 343 ~I~~R~daVe~l~~~~~~~~~l~~~L~~i~DleRll~ri~~~~~~p~dl~~L~~sl~~~~~i~~~l~~~~~~~~l~~~~~ 422 (920) .|++|||.|+.|+++...++.++..|+.++||||+++|+.+++++||||..|+++|..+|.+...+.... .+.+..+.. T Consensus 311 ~l~~Rq~~V~~l~~~f~~~~~L~~~L~~v~DlERla~Rv~l~~a~PrDL~~Lr~~L~~lP~lra~l~~~~-~~~l~~l~~ 389 (863) T TIGR01070 311 VLEARQDTVETLLRHFFLREDLRPLLKEVGDLERLAARVALGNARPRDLARLRTALEQLPELRALLEELE-QPKLQALAK 389 (863) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCC-CHHHHHHHH T ss_conf 7999999999998521012115898888877999999987327898899999999998699999862115-568999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEE Q ss_conf 57767899999999985431012210460101685411455799886389999977687788738875110255862237 Q gi|254780750|r 423 KLKKLPQSLEKTLSSMLSDDLPTFKRDGGFLRDGADASLDETRSLRDQSKRIIASLQLKYAEETKIKNLKIKHNNNLGYF 502 (920) Q Consensus 423 ~l~~l~~~l~~~l~~~i~~~~~~~~~~~~~ik~g~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~~i~sLki~~n~~~gy~ 502 (920) .+..+++ +.++|..+|.++||..++|||+||+|||++||++|.+.+++.+||.+|+.++|++|||++|||+||+++||| T Consensus 390 ~~~~~~e-l~eLL~~Al~e~PP~~v~dGG~I~~GYd~~LDe~R~~~~~g~~yl~~LE~rErE~TGi~tLKvGyN~v~GYY 468 (863) T TIGR01070 390 QLDDLSE-LLELLERALIENPPLVVRDGGVIREGYDEELDEYRALSREGTDYLAELEARERERTGIPTLKVGYNAVFGYY 468 (863) T ss_pred HHHCCCC-HHHHHHHHCCCCCCCEEECCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEE T ss_conf 5001674-699999704688692650486123578863799999987578999999988776358864551454211026 Q ss_pred EEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 62041143334411168765279852120001113456664334643777776433447888765430112579888899 Q gi|254780750|r 503 IEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLELIDLENRITNATNSALLIELESFEILSNAIIEQSESLDNASQV 582 (920) Q Consensus 503 iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ 582 (920) ||||+++.+.+|. .+|+|+||+||++||++|||++.+.++..++.++..+|+++|.+|++.+..+.+.|+..+.. T Consensus 469 i~vt~~q~~~~Pe-----~~Y~R~qTLKnaERyi~PELKe~E~~~L~a~~~~~~lE~elf~elre~~~~~~~~Lq~~A~~ 543 (863) T TIGR01070 469 IEVTRGQLHKVPE-----AHYIRRQTLKNAERYITPELKEKEDKVLEAKGKALALEKELFEELRELLKKYLEALQEAAKA 543 (863) T ss_pred EECCCHHHCCCCC-----CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 5226133304875-----56446667788755675789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEECCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCC Q ss_conf 98677887788877525885121047871400004680588763102877044434563587777664399996778440 Q gi|254780750|r 583 IAIIDISIALAILAKEQNYCRPIIDNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGG 662 (920) Q Consensus 583 ia~lD~l~SlA~~a~~~~y~rP~i~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgG 662 (920) +|+||||.|||.+|...+|+||+|.|++.+.|++|||||||+.|. ..+||||||.|+ ++.+++||||||||| T Consensus 544 laELDVL~~LAe~A~~~~y~~P~F~d~~~~~I~~GRHPVVE~Vl~---~~~fvpN~~~m~-----~nr~~lliTGPNM~G 615 (863) T TIGR01070 544 LAELDVLANLAEVAETLHYTRPRFGDDPELRIREGRHPVVEKVLD---RQPFVPNDLEMA-----DNRRLLLITGPNMGG 615 (863) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEECC---CCCCCCCCCCCC-----CCCEEEEEECCCCCC T ss_conf 889999999999997448968701367614675088770434465---778677756558-----885688886687975 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC Q ss_conf 78999999999999971985303532068221056765237661138532899999999999958998569993258898 Q gi|254780750|r 663 KSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGT 742 (920) Q Consensus 663 KSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGT 742 (920) |||||||+|||+|||||||||||++|+|||||+|||||||+|||..|+|||||||.|+++||++||+|||||+||||||| T Consensus 616 KSTYmRQtALIallAqiGSfVPA~~a~Lp~~D~IFTRIGAsDDLasG~STFMVEM~E~aNil~~AT~~SLvL~DE~GRGT 695 (863) T TIGR01070 616 KSTYMRQTALIALLAQIGSFVPAESAELPLFDRIFTRIGASDDLASGRSTFMVEMTEAANILHNATENSLVLLDEIGRGT 695 (863) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 31799999999999982785350423028846320111513324799643006789999999986651022331034530 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHC Q ss_conf 80567999999999999726984999748757976643068858999999960992778777744789887789999982 Q gi|254780750|r 743 ATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLA 822 (920) Q Consensus 743 st~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~la 822 (920) |||||+|||||+.||||++++|+|||||||||||.|++.+|++.|.|+...|.+++|+|+|++.||++++||||+||+|| T Consensus 696 STyDGlaLA~A~aE~lh~~irA~TLFATHYfELT~L~~~L~~l~NvHv~A~E~~g~~vFlH~v~~GpAsKSYG~~VA~lA 775 (863) T TIGR01070 696 STYDGLALAWAIAEYLHEHIRAKTLFATHYFELTALEEKLPRLANVHVAALEHNGTIVFLHQVLPGPASKSYGLAVAALA 775 (863) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCEEEEEEEEEECCCCEEEEEEECCCCCCCCHHHHHHHHC T ss_conf 01678999999999986232121200145567513753486415435433661881589863157988643369999871 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHCCCHHHHHCCCCCHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 9998999999999999987630001111211-----------11100001120035473899999860782227989999 Q gi|254780750|r 823 GLPNTVISRAYDILKTFEKLYHHNQKDMRLY-----------YPEIQTKESKNNLSKNDKLFVEKIKCLNLDEMSPLTAL 891 (920) Q Consensus 823 G~p~~vi~~A~~~~~~le~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~al 891 (920) |+|.+||.||+++|++||+............ ....+.......+.+.++++++..++||||+|||+||| T Consensus 776 GlP~~vi~RA~~~L~~LE~~s~~~~~p~~~~~~~~~~~q~~lf~~~e~~~~~e~L~~kekqvidafksLdpd~ltP~qAl 855 (863) T TIGR01070 776 GLPKDVIARARQILTELEARSTESEAPLRKAQKKAEPEQLSLFDEAETHPLLEELAKKEKQVIDAFKSLDPDDLTPLQAL 855 (863) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHH T ss_conf 89888999999999999852888741333465412104667634303627899998621368877652787787979999 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254780750|r 892 KTLYAVKA 899 (920) Q Consensus 892 ~~l~~lk~ 899 (920) +.|++||+ T Consensus 856 ~~Ly~LK~ 863 (863) T TIGR01070 856 NLLYELKK 863 (863) T ss_pred HHHHHHCC T ss_conf 99997239 No 2 >PRK05399 DNA mismatch repair protein; Provisional Probab=100.00 E-value=0 Score=1947.74 Aligned_cols=842 Identities=44% Similarity=0.730 Sum_probs=777.3 Q ss_pred HHHHCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 67728916899999998779929999808822200799999998618189107889988876265617669999999998 Q gi|254780750|r 21 NLKSSTPMMRQYIEIKSINPDSLVFYRMGDFYELFFDDALLASRCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQKLIK 100 (920) Q Consensus 21 ~~~~~TP~~~Qy~~iK~~~~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~Lv~ 100 (920) +++|+||||+|||+||++|||||||||||||||+||+||++||++|||+||+|+++.++++||||||+|+++.|++|||+ T Consensus 4 ~~~k~TPmm~Qy~~iK~~~~D~iLffr~GdFYElF~eDA~~as~~L~i~LT~R~~~~~~~~pm~GvP~ha~~~yl~kLv~ 83 (848) T PRK05399 4 DLSKHTPMMQQYLEIKAQYPDALLFYRMGDFYELFFEDAKKASRLLDITLTKRGKSAGEPIPMAGVPYHAAEGYLAKLVK 83 (848) T ss_pred CCCCCCHHHHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHHCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 42228989999999999789979999627310353888999998619699547888999888088978899999999998 Q ss_pred CCCEEEEEEECCCHHHHHHCCCCCCEEEEEEEEECCCCEECCHHCCCCCCCEEEEEEECCCEEEEEEEECCCCEEEEEEC Q ss_conf 89859999502897887412899845545899977530302010387877369999952984999999978655999605 Q gi|254780750|r 101 IGHRIAICEQIESPLEAKKRGNKSLVRRNVVRLVTPGTLTEDQLLSPTDSNYLMAVARIRTEWAIAWIDISTGIFKISTS 180 (920) Q Consensus 101 ~GykVaiveQ~E~p~~~~~~~~~~~v~R~Vt~IiTpGT~~d~~~l~~~~~nyL~aI~~~~~~~Gia~iDisTG~~~~~~~ 180 (920) +|||||||||+|+|+.+ ||+|+|+|||||||||++|++++++.++|||+||+..+..||+||+|+|||+|.+.++ T Consensus 84 ~G~kVaiceQ~e~p~~~-----kg~v~R~V~rIiTPGT~~d~~~L~~~~~nyL~ai~~~~~~~gla~~DiSTGef~~~~~ 158 (848) T PRK05399 84 KGYKVAICEQVEDPATA-----KGPVKREVVRIVTPGTVTDEALLDEKQNNYLAAIYQDKNRYGLAWLDLSTGEFRVTEL 158 (848) T ss_pred CCCEEEEEECCCCCCCC-----CCCCCCCEEEEECCCCEECCCCCCCCCCCEEEEEEECCCEEEEEEEECCCCEEEEEEE T ss_conf 79969999616782104-----9983203489987861347243688877679999967985999999966663999983 Q ss_pred C-HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCEECCCCCCCCHHHHHHHHHHHCCCCCCCCHHHH-HHHHHH Q ss_conf 8-78999998522985799707768967899887423881571852114713447889987087864210244-434688 Q gi|254780750|r 181 N-HDRLISDIMRIDPREIIFSEKELSHAEYKSLFETLGNVAVAQPNVFFDHSISESRIADYYNITTVDTFGSF-SQVEKT 258 (920) Q Consensus 181 ~-~d~l~~~L~~l~P~EIii~~~~~~~~~~~~l~~~~~~~~~~~~~~~f~~~~~~~~l~~~~~~~~l~~~~~~-~~~~~~ 258 (920) . .+++.++|.+++|+|||+++...+... ........+.|.|+...+.+.+.++|++.++++++.. ...+++ T Consensus 159 ~~~~~L~~~L~r~~P~EIl~~~~~~~~~~-------~~~~~~~~~~~~f~~~~a~~~l~~~f~~~~l~~~g~~~~~~~~~ 231 (848) T PRK05399 159 ADEEELLAELARLNPAEILLPEDLAELLL-------ILFGVTRRPDWEFDLDTAEKRLLEQFGVASLDGFGLEDLELALR 231 (848) T ss_pred CCHHHHHHHHHHCCCCEEEECCCHHHHHH-------HHCCEEECCCCCCCHHHHHHHHHHHHCCCCHHHCCCCCCHHHHH T ss_conf 89999999998449734874574024566-------42025547720058678999999984858813217667569999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCHHHCCEEEEECCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHC Q ss_conf 87777643687411222222113445745401425133201545412787764127876301234337899998631002 Q gi|254780750|r 259 AAAAAISYIKKTQMVNKPTIGYPEREDIQSTLFIDSAARSNLEILRTLSGSREQSLLKTIDYSITGAGGRLFAERIASPL 338 (920) Q Consensus 259 a~~all~Yl~~~~~~~~~~i~~~~~~~~~~~m~LD~~Tl~nLEI~~~~~g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~PL 338 (920) |+|+++.|+.++|...++|+..+..++.+++|.||.+|++||||++|.+|+ +|||||+||+|+|+||+||||+||++|| T Consensus 232 A~g~LL~YL~~tq~~~l~~i~~~~~~~~~~~M~LD~~T~rNLEL~~~~~g~-kgSLl~vLd~T~T~mG~RLLr~WL~~PL 310 (848) T PRK05399 232 AAGALLQYLKETQKGSLPHLRSPKRYEEGDYLILDAATRRNLELTETLRGG-GGSLLSVLDRTVTAMGGRLLRRWLHRPL 310 (848) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCEEECCCCEEEECHHHHHHHHHHCCCCCC-CCCHHHHHCCCCCHHHHHHHHHHHHHHC T ss_conf 999999999986112334557775876789799887887468874167999-9857888477898589999999987356 Q ss_pred CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 34567898999999740221258899999752048344445677521000021206899999865567543025845778 Q gi|254780750|r 339 TDSTKINTRLDSINFFIQNPILINPIQKILKSSPDVPRSLSRLKIGRGEPKDIAVIRDGIRSGINILNLFLKNDLPEELR 418 (920) Q Consensus 339 ~d~~~I~~R~daVe~l~~~~~~~~~l~~~L~~i~DleRll~ri~~~~~~p~dl~~L~~sl~~~~~i~~~l~~~~~~~~l~ 418 (920) +|+++|++|||+|++|.++..+++.++..|++++|+||+++|++.|+++|+|+..+++++..++.+...+... ..+. T Consensus 311 ~d~~~I~~RldaVe~l~~~~~~~~~lr~~Lk~i~DlERllsRi~~~~~~prDl~~L~~sL~~~~~i~~~l~~~---~~l~ 387 (848) T PRK05399 311 RDREAINARLDAVEELLDNPLLREDLRELLKGVYDLERILSRVALGRANPRDLAALRDSLEALPELKELLEES---PLLA 387 (848) T ss_pred CCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCC---HHHH T ss_conf 2778999999999999749799999999985489889999999808978899999999999999999998467---7899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 76765776789999999998543101221046010168541145579988638999997768778873887511025586 Q gi|254780750|r 419 GAVEKLKKLPQSLEKTLSSMLSDDLPTFKRDGGFLRDGADASLDETRSLRDQSKRIIASLQLKYAEETKIKNLKIKHNNN 498 (920) Q Consensus 419 ~~~~~l~~l~~~l~~~l~~~i~~~~~~~~~~~~~ik~g~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~~i~sLki~~n~~ 498 (920) .+...+..+ ..+.+.|..++.+++|...++|++|++|||++||++|.+.++.++|+.+++.++++++||++||++||++ T Consensus 388 ~l~~~l~~~-~~l~~~i~~~i~~~~p~~~~dg~~Ik~G~d~eLDelr~~~~~~~~~l~~le~~er~~tgI~sLKi~yn~v 466 (848) T PRK05399 388 ELLAQLDPL-PELADLLERAIVEEPPLLIRDGGVIADGYDAELDELRALSDNGKDWLAELEARERERTGISSLKVGYNKV 466 (848) T ss_pred HHHHHCCCH-HHHHHHHHHHHHCCCHHHCCCCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHCCC T ss_conf 998853148-9999999998710557554368863588798899999999868999999999999870865110111475 Q ss_pred CEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 22376204114333441116876527985212000111345666433464377777643344788876543011257988 Q gi|254780750|r 499 LGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLELIDLENRITNATNSALLIELESFEILSNAIIEQSESLDN 578 (920) Q Consensus 499 ~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~ 578 (920) +|||||||+++.+.+| +.|+++||++|++||+||||++++.++.++++++..+|.++|.++++.+.++.+.|+. T Consensus 467 ~GY~iEv~k~~~~~vp------~~~ir~qTl~n~~Rf~T~eL~~~e~~il~a~~~~~~lE~~~f~~l~~~~~~~~~~l~~ 540 (848) T PRK05399 467 FGYYIEVTKANADKVP------DDYIRRQTLKNAERYITPELKELEDKILSAEERALALEKELFEELREEVLEHIERLQK 540 (848) T ss_pred CEEEEEECHHHHHCCC------CCEEEEEECCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 2289984043340498------4438876124871544588999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECC Q ss_conf 88999867788778887752588512104787140000468058876310287704443456358777766439999677 Q gi|254780750|r 579 ASQVIAIIDISIALAILAKEQNYCRPIIDNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGP 658 (920) Q Consensus 579 ~~~~ia~lD~l~SlA~~a~~~~y~rP~i~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGp 658 (920) +++++|+|||++|||.+|..++||||+|++++.|.|++|||||||..+. .+||||||.|+ ++.+++||||| T Consensus 541 ~~~~ia~LD~l~SlA~~A~~~~y~rP~i~~~~~l~I~~gRHPvvE~~~~----~~fVpND~~l~-----~~~~~~iiTGP 611 (848) T PRK05399 541 LAKALAELDVLASLAEVAEELNYVRPEFTDSREIDIEAGRHPVVEQVLG----GPFVPNDCDLD-----PERRMLLITGP 611 (848) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEECCCCCEEEEECCCCEEEECCC----CCEEEEEEEEC-----CCCCEEEEECC T ss_conf 9999999999999999999669647713589865898356857640368----87576568866-----87617999568 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECC Q ss_conf 84407899999999999997198530353206822105676523766113853289999999999995899856999325 Q gi|254780750|r 659 NMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEI 738 (920) Q Consensus 659 NmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDEl 738 (920) ||||||||||||||+||||||||||||++|++||||+|||||||+|||.+|+|||||||.|||+||++||+|||||+||| T Consensus 612 NM~GKSTylRQvalivilAQiGsfVPA~~a~i~i~D~IftRiGa~D~l~~g~STFmvEM~E~a~IL~~AT~~SLVilDEl 691 (848) T PRK05399 612 NMAGKSTYMRQVALIVLLAQIGSFVPAKSARIGIVDRIFTRIGASDDLASGRSTFMVEMTETANILNNATERSLVLLDEI 691 (848) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECC T ss_conf 87770899999999999997089741876786555767773676300345787599999999999984898848999616 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHH Q ss_conf 88988056799999999999972698499974875797664306885899999996099277877774478988778999 Q gi|254780750|r 739 GRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQV 818 (920) Q Consensus 739 GrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~v 818 (920) ||||||+||+||||||+|||+++++|+||||||||||++++..+++|.|+||++.+ +++|+|||||++|+|++||||+| T Consensus 692 GRGTST~DG~aIA~Av~e~l~~~~~~~tlFaTHyheL~~l~~~~~~v~n~h~~v~e-~~~i~Flykl~~G~~~~SyGi~V 770 (848) T PRK05399 692 GRGTSTYDGLSIAWAVAEYLHDKIKARTLFATHYHELTELAEKLPGVKNVHVAAKE-DGDIVFLHKVVPGAADKSYGIHV 770 (848) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHCCCEEEEEEEEEE-CCEEEEEEEECCCCCCCCHHHHH T ss_conf 78888506799999999999865798699988757888776437782899999998-89278998767788987779999 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCC-CCCHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 998299989999999999999876300011112111110000112003-5473899999860782227989999999999 Q gi|254780750|r 819 GKLAGLPNTVISRAYDILKTFEKLYHHNQKDMRLYYPEIQTKESKNNL-SKNDKLFVEKIKCLNLDEMSPLTALKTLYAV 897 (920) Q Consensus 819 A~laG~p~~vi~~A~~~~~~le~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~al~~l~~l 897 (920) |||||+|++||+||++++++||+.......... ...+...+ ....+.++++++++|+++|||+|||+.|++| T Consensus 771 A~lAGlP~~vi~rA~~~l~~lE~~~~~~~~~~~-------~~~q~~lf~~~~~~~~~~~l~~~~~~~~tp~~al~~l~~l 843 (848) T PRK05399 771 AKLAGLPKSVIKRAREILAQLESASEKAAKAAA-------ESDQLSLFAAPEESPVLEALKALDPDELTPREALNLLYEL 843 (848) T ss_pred HHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCC-------CCCCCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 998398999999999999998446554566667-------6322233368983699999970897779999999999999 Q ss_pred HHHHH Q ss_conf 99986 Q gi|254780750|r 898 KAWTL 902 (920) Q Consensus 898 k~~~~ 902 (920) |+.++ T Consensus 844 k~~~~ 848 (848) T PRK05399 844 KKLLK 848 (848) T ss_pred HHHHC T ss_conf 99629 No 3 >COG0249 MutS Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=1745.83 Aligned_cols=837 Identities=46% Similarity=0.739 Sum_probs=776.2 Q ss_pred HHHHCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 67728916899999998779929999808822200799999998618189107889988876265617669999999998 Q gi|254780750|r 21 NLKSSTPMMRQYIEIKSINPDSLVFYRMGDFYELFFDDALLASRCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQKLIK 100 (920) Q Consensus 21 ~~~~~TP~~~Qy~~iK~~~~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~Lv~ 100 (920) .++++||||+|||+||++|||++||||||||||+||+||.+||++|+|+||+|++ +||||||+|+++.|+.+||+ T Consensus 2 ~~~~~tP~m~qy~~ik~~~~d~llffr~GdfYelf~~DA~~~s~~l~itlT~r~~-----~pm~gvP~h~~~~yl~~li~ 76 (843) T COG0249 2 SKAKLTPMMQQYLEIKAQYPDTLLFFRMGDFYELFFEDAKIAARLLDITLTKRGN-----IPMAGVPYHALDYYLAKLIE 76 (843) T ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCEEECCCC-----CCCCCCCCCCHHHHHHHHHH T ss_conf 7233783899999988758763899845632663699999998760842315788-----76777764327799999996 Q ss_pred CCCEEEEEEECCCHHHHHHCCCCCCEEEEEEEEECCCCEECCHHCCCCCCCEEEEEEECCC-EEEEEEEECCCCEEEEEE Q ss_conf 8985999950289788741289984554589997753030201038787736999995298-499999997865599960 Q gi|254780750|r 101 IGHRIAICEQIESPLEAKKRGNKSLVRRNVVRLVTPGTLTEDQLLSPTDSNYLMAVARIRT-EWAIAWIDISTGIFKIST 179 (920) Q Consensus 101 ~GykVaiveQ~E~p~~~~~~~~~~~v~R~Vt~IiTpGT~~d~~~l~~~~~nyL~aI~~~~~-~~Gia~iDisTG~~~~~~ 179 (920) +|||||||||+|+|..+ +++|+|+|+||+||||++|+.++++..+|||+|+...+. .||+||+|+|||+|.+.+ T Consensus 77 ~g~kVAiceQ~e~~~~~-----k~~v~R~v~rv~TpGt~~d~~~l~~~~~n~l~a~~~~~~~~~gla~~dlstGef~~~~ 151 (843) T COG0249 77 LGYKVAICEQLEDPAEA-----KGVVERKVVRVVTPGTLTDEALLSDRENNYLAALAIDRDGKVGLAFIDLSTGEFFVSE 151 (843) T ss_pred CCCEEEEEECCCCHHHH-----CCCCEEEEEEEECCCEEECCCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCEEEEEE T ss_conf 79838999715581641-----6850257899988976422442346666549999966898799999981267499998 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCEECCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHH-HHHHH Q ss_conf 58789999985229857997077689678998874238815718521147134478899870878642102444-34688 Q gi|254780750|r 180 SNHDRLISDIMRIDPREIIFSEKELSHAEYKSLFETLGNVAVAQPNVFFDHSISESRIADYYNITTVDTFGSFS-QVEKT 258 (920) Q Consensus 180 ~~~d~l~~~L~~l~P~EIii~~~~~~~~~~~~l~~~~~~~~~~~~~~~f~~~~~~~~l~~~~~~~~l~~~~~~~-~~~~~ 258 (920) +..+++.++|.|++|+|||+++...+... ....... .....+.|.|+...+...+..+|++.++++++..+ ..+++ T Consensus 152 ~~~~~l~~~l~r~~p~Eil~~~~~~~~~~-~~~~~~~--~~~~~~~~~f~~~~~~~~l~~~~~~~~l~~~~~~~~~~~~~ 228 (843) T COG0249 152 FEREKLLSELKRLQPKELLLSESLLSSTL-AEIQKGL--FVVRLPASFFDEDDAEEILSEYFGVRDLDGFGLISTPLALS 228 (843) T ss_pred EEHHHHHHHHHHCCCCEEEECCCCCCHHH-HHHHHCE--EEEECCHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHH T ss_conf 32899999998489857996655576245-6653100--13654445337427999999983866454424224538999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCHHHCCEEEEECCCC-CCCCHHHHHHCCCCCCHHHHHHHHHHHHH Q ss_conf 87777643687411222222113445745401425133201545412787-76412787630123433789999863100 Q gi|254780750|r 259 AAAAAISYIKKTQMVNKPTIGYPEREDIQSTLFIDSAARSNLEILRTLSG-SREQSLLKTIDYSITGAGGRLFAERIASP 337 (920) Q Consensus 259 a~~all~Yl~~~~~~~~~~i~~~~~~~~~~~m~LD~~Tl~nLEI~~~~~g-~~~gSL~~~Ln~t~T~~G~RlLr~wL~~P 337 (920) |+++++.|+..+|...++++..++.+...++|.||.+|++||||++|.+| +++|||||+||+|+|+||+|+|++||.+| T Consensus 229 a~~~ll~Y~~~t~~~~l~~~~~~~~~~~~~~m~lD~~t~~nLEl~~~~~~~~~~gSL~~~ld~t~T~mG~RlL~~wl~~P 308 (843) T COG0249 229 AAGALLSYLKDTQKTFLPHIQIIQRYDLEDYMVLDAATRRNLELFSNLRGNGKKGSLFWLLDRTVTPMGSRLLKRWLLQP 308 (843) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCEEECCCCEEEECHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHCC T ss_conf 99999999998653224355634564358679973899711102224778998873999853677824679989976480 Q ss_pred CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 23456789899999974022125889999975204834444567752100002120689999986556754302584577 Q gi|254780750|r 338 LTDSTKINTRLDSINFFIQNPILINPIQKILKSSPDVPRSLSRLKIGRGEPKDIAVIRDGIRSGINILNLFLKNDLPEEL 417 (920) Q Consensus 338 L~d~~~I~~R~daVe~l~~~~~~~~~l~~~L~~i~DleRll~ri~~~~~~p~dl~~L~~sl~~~~~i~~~l~~~~~~~~l 417 (920) |+|.++|+.|||+|++|..+..+++.++..|++++|+||+++|+++|+++|+|+..+++++..++.+...+........+ T Consensus 309 L~~~~~I~~Rld~Ve~l~~~~~l~~~L~~~L~~v~DleRl~~Rl~~~~~~~rDl~~l~~~l~~~~~i~~~l~~~~~~~~l 388 (843) T COG0249 309 LLDKEEIEERLDAVEELKDNPELREKLREMLKKVPDLERLLSRLSLGRASPRDLLRLRDSLEKIPEIFKLLSSLKSESDL 388 (843) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 35899999999999999865689999999985076799999999737998656999999999889999998525430235 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 87676577678--9999999998543101221046010168541145579988638999997768778873887511025 Q gi|254780750|r 418 RGAVEKLKKLP--QSLEKTLSSMLSDDLPTFKRDGGFLRDGADASLDETRSLRDQSKRIIASLQLKYAEETKIKNLKIKH 495 (920) Q Consensus 418 ~~~~~~l~~l~--~~l~~~l~~~i~~~~~~~~~~~~~ik~g~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~~i~sLki~~ 495 (920) ......+..+. ..+...+..++.+++|...++ ++|+.||+.+||++|...++.++++.+++.++++++|++++|+.| T Consensus 389 ~~~~~~i~~~~~~~e~~~ll~~~i~~~~~~~~~~-~ii~~g~~~eLd~lr~~~~~~~~~i~~le~~~r~~~gi~slki~~ 467 (843) T COG0249 389 LLLLEDIESLDYLAELLELLETAINEDPPLAVRD-GIIKEGYNIELDELRDLLNNAKEWIAKLELEERERTGIKSLKIKY 467 (843) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHCCH-HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 6776444313407999999998721166233015-577511039999999999888999999999888864972121122 Q ss_pred CCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 58622376204114333441116876527985212000111345666433464377777643344788876543011257 Q gi|254780750|r 496 NNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLELIDLENRITNATNSALLIELESFEILSNAIIEQSES 575 (920) Q Consensus 496 n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~ 575 (920) |+++||||||++++.+.+|.+ |+++||++|++||+||+|++++.++.++++++..+|+++|.++++.+..|.+. T Consensus 468 n~v~Gy~ievt~~~~~~~p~~------~ir~qt~kn~~rf~t~el~~~e~~i~~a~~~i~~lE~~l~~~~~~~i~~~~~~ 541 (843) T COG0249 468 NKVYGYYIEVTKSNAKLVPDD------YIRRQTLKNAERFTTPELKELEEKLLDAEEKILALEYELFDELREKILAHINE 541 (843) T ss_pred HCCCEEEEEECHHCCCCCCHH------HHHHHHHHCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 034605999542014558437------89998773246864888899999999999998999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEE Q ss_conf 98888999867788778887752588512104787140000468058876310287704443456358777766439999 Q gi|254780750|r 576 LDNASQVIAIIDISIALAILAKEQNYCRPIIDNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLL 655 (920) Q Consensus 576 l~~~~~~ia~lD~l~SlA~~a~~~~y~rP~i~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~ii 655 (920) |+.++.++|+|||++|||.+|...+||||+++++..+.|++|||||||..+..+ ||||||.|+ ++.+++|| T Consensus 542 l~~~a~aLa~lD~l~slA~~a~~~~y~rP~~~~~~~l~i~~gRHPvvE~~~~~~----fVpNd~~L~-----~~~~i~lI 612 (843) T COG0249 542 LQALAKALAELDVLSSLAEIAAEQNYVRPEFVDSNDLEIKEGRHPVVEAVLDNG----FVPNDIDLS-----GNRRIILI 612 (843) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCCCEEEEECCCCHHHHHCCCC----CCCCCEEEC-----CCCEEEEE T ss_conf 999999999999999999998657987866638988799845761122212577----446865307-----99548999 Q ss_pred ECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 67784407899999999999997198530353206822105676523766113853289999999999995899856999 Q gi|254780750|r 656 TGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVIL 735 (920) Q Consensus 656 TGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVll 735 (920) |||||||||||||||||++|||||||||||++|++||||+|||||||+|||.+|+|||||||.||++||++||++||||| T Consensus 613 TGPNM~GKSTylRQvali~imAQiGsfVPA~~A~i~ivD~IfTRiGa~DDL~~G~STFMvEM~Eta~IL~~AT~~SLvil 692 (843) T COG0249 613 TGPNMGGKSTYLRQVALIVILAQIGSFVPAEKARIGIVDRIFTRIGAADDLASGRSTFMVEMLETANILDNATERSLVIL 692 (843) T ss_pred ECCCCCCCHHHHHHHHHHHHHHHCCCCEEHHHCCCCCCCEEEECCCCCCHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 78998861999999999999987499852887436653201121565100211651899999999999985798848999 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHH Q ss_conf 32588988056799999999999972698499974875797664306885899999996099277877774478988778 Q gi|254780750|r 736 DEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYG 815 (920) Q Consensus 736 DElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Syg 815 (920) ||+||||||+||+||||||+|||+++++|+||||||||||+++....+++.|+||.+.+.+++|+|+|++++|+|++||| T Consensus 693 DEiGRGTsT~DGlaIA~Av~eyL~~~~~~~tLFATHy~ELt~l~~~~~~v~N~h~~~~e~~~~i~Fl~kv~~G~a~~SyG 772 (843) T COG0249 693 DEIGRGTSTYDGLAIAWAVLEYLHEKIGCRTLFATHYHELTELEEKLPQVKNYHMSAVEEGGDITFLYKVKPGIADKSYG 772 (843) T ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCCCCEEEEEEEECCCCEEEEEEECCCCCCCCHH T ss_conf 64668877413689999999999863583699961688887765115445504788897489658999834688886589 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 99999829998999999999999987630001111211111000011200354738999998607822279899999999 Q gi|254780750|r 816 IQVGKLAGLPNTVISRAYDILKTFEKLYHHNQKDMRLYYPEIQTKESKNNLSKNDKLFVEKIKCLNLDEMSPLTALKTLY 895 (920) Q Consensus 816 i~vA~laG~p~~vi~~A~~~~~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~al~~l~ 895 (920) |+||++||+|.+||+||++++++||+.......+. .+..... ..++.++++++|+++|||+ ||+.|+ T Consensus 773 i~VAklaGlP~~Vi~rA~~il~~le~~~~~~~~~~-----------~~~~~~~-~~~i~~~~~~~~~~~~tp~-al~~l~ 839 (843) T COG0249 773 IHVAKLAGLPEEVIERAREILAELEKESRSSNLEL-----------NQKDLSL-FPKVLKALKSLDPDELTPR-ALNLLY 839 (843) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCH-----------HHHHHHH-HHHHHHHHHHCCHHHCCHH-HHHHHH T ss_conf 99999929999999999999998750356753231-----------4566767-9999999870790004878-999999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780750|r 896 AVKA 899 (920) Q Consensus 896 ~lk~ 899 (920) +||+ T Consensus 840 ~lk~ 843 (843) T COG0249 840 ELKK 843 (843) T ss_pred HHHC T ss_conf 8529 No 4 >KOG0218 consensus Probab=100.00 E-value=0 Score=1317.17 Aligned_cols=816 Identities=28% Similarity=0.444 Sum_probs=665.5 Q ss_pred HHCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 72891689999999877992999980882220079999999861818910788998887626561766999999999889 Q gi|254780750|r 23 KSSTPMMRQYIEIKSINPDSLVFYRMGDFYELFFDDALLASRCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQKLIKIG 102 (920) Q Consensus 23 ~~~TP~~~Qy~~iK~~~~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~Lv~~G 102 (920) +++||+++||.++|.+|+|+||.++|||.|.+||+||++||++|||+| +.++|+..|+||.|+++.|++|||++| T Consensus 160 s~yTPLeqQ~~elK~~h~d~VL~ievGYkyRfFgeDAeiasrvLgIyc-----h~dhnFmtaS~P~~Rl~vHleRLv~~g 234 (1070) T KOG0218 160 SKYTPLEQQVKELKMHHRDKVLVIEVGYKYRFFGEDAEIASRVLGIYC-----HNDHNFMTASFPDVRLNVHLERLVHHG 234 (1070) T ss_pred CCCCCHHHHHHHHHHHCCCEEEEEEECCEEEEECCCHHHHHHHHCEEE-----EECCCEEECCCCCCHHHHHHHHHHHCC T ss_conf 556808999999986388638999942067750530889987641378-----712540104677402467899987527 Q ss_pred CEEEEEEECCCHHHHHHCC-CCCCEEEEEEEEECCCCEECCH--HC-----CCCCCCEEEEEEECC---------CEEEE Q ss_conf 8599995028978874128-9984554589997753030201--03-----878773699999529---------84999 Q gi|254780750|r 103 HRIAICEQIESPLEAKKRG-NKSLVRRNVVRLVTPGTLTEDQ--LL-----SPTDSNYLMAVARIR---------TEWAI 165 (920) Q Consensus 103 ykVaiveQ~E~p~~~~~~~-~~~~v~R~Vt~IiTpGT~~d~~--~l-----~~~~~nyL~aI~~~~---------~~~Gi 165 (920) ||||||+|+||++.++..+ ..++|.|++++|||+||+.+|. ++ -..+++|++|+..+. -.+|+ T Consensus 235 ~KVaVVkQtETAAiKs~gasRsslF~RklsavyTKaTl~eds~~~~r~e~~~~~~ssfllcv~dn~~ksk~ksg~v~vgl 314 (1070) T KOG0218 235 LKVAVVKQTETAAIKSHGASRSSLFERKLSAVYTKATLGEDSTFVLRGERILGDTSSFLLCVRDNVHKSKEKSGVVKVGL 314 (1070) T ss_pred CEEEEEEEHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCEEEEEEEHHHHHHHHHCCCEEEEE T ss_conf 34899962356787751775453677788888532000245654304123047754089997200554454347358999 Q ss_pred EEEECCCCEEEEEECCHH----HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCC-------CEECCCCCCCCHHHHH Q ss_conf 999978655999605878----99999852298579970776896789988742388-------1571852114713447 Q gi|254780750|r 166 AWIDISTGIFKISTSNHD----RLISDIMRIDPREIIFSEKELSHAEYKSLFETLGN-------VAVAQPNVFFDHSISE 234 (920) Q Consensus 166 a~iDisTG~~~~~~~~~d----~l~~~L~~l~P~EIii~~~~~~~~~~~~l~~~~~~-------~~~~~~~~~f~~~~~~ 234 (920) ..+.++||++..++|.++ +|++.+..++|.|+|++.. +++.....++...-. .........++.+.+. T Consensus 315 igVqlstGevVydeFqdnf~rseLqtrisslqP~ElLl~~~-ls~qt~all~~~~Vsve~~~~rv~r~~naV~q~ikla~ 393 (1070) T KOG0218 315 IGVQLSTGEVVYDEFQDNFARSELQTRISSLQPIELLLNTD-LSPQTVALLFKDNVSVELIHKRVYRLENAVVQAIKLAN 393 (1070) T ss_pred EEEECCCCCEEHHHHHHHHHHHHHHHHHHCCCCCEEECCCC-CCHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHH T ss_conf 99962788474686665677777899986068410004898-75888999985344224444345110037999999999 Q ss_pred HHHHHHCCCCCC-CC------HHHHHHH-HHHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCCEEEECCHHHCCEEEEEC Q ss_conf 889987087864-21------0244434-68887777643687411222222-113445745401425133201545412 Q gi|254780750|r 235 SRIADYYNITTV-DT------FGSFSQV-EKTAAAAAISYIKKTQMVNKPTI-GYPEREDIQSTLFIDSAARSNLEILRT 305 (920) Q Consensus 235 ~~l~~~~~~~~l-~~------~~~~~~~-~~~a~~all~Yl~~~~~~~~~~i-~~~~~~~~~~~m~LD~~Tl~nLEI~~~ 305 (920) +.+.++|..... ++ .-..+.+ .++.+++++.|+.+.....+... +..+.++...+|.|+++|+++||||.| T Consensus 394 e~~q~f~~~k~~l~gs~ii~li~nl~~psvic~la~~is~lkefnlE~~l~~psf~s~~ss~e~Mtls~ntLq~Leif~n 473 (1070) T KOG0218 394 EKIQNFEDDKEMLEGSEIICLIMNLSHPSVICLLAKLISHLKEFNLEQVLLIPSFYSPFSSKEHMTLSPNTLQSLEIFTN 473 (1070) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCHHHHEECCCCCCCCCCCCEEEECHHHHHCEEEEEE T ss_conf 99765311155541245666654278706999999999999970667741033336854553036655423201433550 Q ss_pred CC-CCCCCHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC---CCHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 78-776412787630123433789999863100234567898999999740---22125889999975204834444567 Q gi|254780750|r 306 LS-GSREQSLLKTIDYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFI---QNPILINPIQKILKSSPDVPRSLSRL 381 (920) Q Consensus 306 ~~-g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~---~~~~~~~~l~~~L~~i~DleRll~ri 381 (920) .+ |+.+|||||+||||.|.+|.|+||+|+.+||+|...|++|+|||+++. ++...++.++.+|.+.|||+|.++|| T Consensus 474 qtd~~~kGSLfwvldhT~TsfG~RmLr~WvtkPLvd~~~I~eRLDAVeeitshssnS~vf~si~~~l~rlpDl~rgL~rI 553 (1070) T KOG0218 474 QTDGSEKGSLFWVLDHTRTSFGLRMLREWVTKPLVDVHQIEERLDAVEEITSHSSNSIVFESINQMLNRLPDLLRGLNRI 553 (1070) T ss_pred CCCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 68997454268873331548889999998744002198899987789999861542499999999987480767667887 Q ss_pred HHHHHCH-HHHHHHHHHHHHHHHHHHHH---HH----C-----CCCHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH Q ss_conf 7521000-02120689999986556754---30----2-----58457787676577--678999999999854310122 Q gi|254780750|r 382 KIGRGEP-KDIAVIRDGIRSGINILNLF---LK----N-----DLPEELRGAVEKLK--KLPQSLEKTLSSMLSDDLPTF 446 (920) Q Consensus 382 ~~~~~~p-~dl~~L~~sl~~~~~i~~~l---~~----~-----~~~~~l~~~~~~l~--~l~~~l~~~l~~~i~~~~~~~ 446 (920) .+++|+| +++..+.+.+.....-++.+ .+ . ..+..|+.++..+. .+...+...+ .+++...... T Consensus 554 y~~tCtp~~eff~vlk~iy~a~s~fq~~~~~~~~~~~s~~~s~~qS~LLrrlisel~~p~~~s~~~hfL-~mln~~aa~~ 632 (1070) T KOG0218 554 YYGTCTPRKEFFFVLKQIYSAVSHFQMHQSYLEHFKSSDGRSGKQSPLLRRLISELNEPLSTSQLPHFL-TMLNVSAAME 632 (1070) T ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCHHHHHHHHHHCCCCCCCCHHHHH-HHHHHHHHHH T ss_conf 505679479999999999999999998765554311654310101189999999846720014278999-8843898861 Q ss_pred HCCCCEECCCCC-CHHHHHHHHH---HHHHHH---HHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCHHHHHCCCCCCC Q ss_conf 104601016854-1145579988---638999---997768778873887511025586223762041143334411168 Q gi|254780750|r 447 KRDGGFLRDGAD-ASLDETRSLR---DQSKRI---IASLQLKYAEETKIKNLKIKHNNNLGYFIEVTSSSASVLKKDLES 519 (920) Q Consensus 447 ~~~~~~ik~g~d-~eLD~lr~~~---~~~~~~---l~~l~~~~~~~~~i~sLki~~n~~~gy~iEV~~~~~~~i~~~~~~ 519 (920) ......+++-.| +.+|+-+++. ++.+.. +.+-.+++|+.++.++|.+.......|.|||+++..+++|.+ T Consensus 633 gnk~d~fkd~snfpl~~e~~di~~virE~~ms~~~~~~hLaeiRk~Lk~pnlef~~vsgv~flIEvkns~~kkiP~d--- 709 (1070) T KOG0218 633 GNKDDQFKDFSNFPLYDESEDIIKVIRESEMSRSQLKEHLAEIRKYLKRPNLEFRQVSGVDFLIEVKNSQIKKIPDD--- 709 (1070) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCEEEEEEECCCCCCCCCCC--- T ss_conf 77587662220385111246689999998877877899999999996189850687648069998154313549901--- Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 76527985212000111345666433464377777643344788876543011257988889998677887788877525 Q gi|254780750|r 520 KDRFIHRQTMSNLTRFTTLELIDLENRITNATNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKEQ 599 (920) Q Consensus 520 ~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~ 599 (920) |++.++++...||+||++.++..++..+++.+.......|..+.+.+..||..|+++...+|.+||+.|||.+|.+. T Consensus 710 ---WiKvnsTk~vsRfhtP~iq~~l~eL~~~~e~L~i~sea~~~sFL~kiSehYtelrkat~~LatlDCi~SlA~~s~n~ 786 (1070) T KOG0218 710 ---WIKVNSTKMVSRFHTPRIQKLLQELEYYKELLIIESEAQYKSFLNKISEHYTELRKATLNLATLDCILSLAATSCNV 786 (1070) T ss_pred ---CEEECCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf ---35634612244137888999999999888776663488999999999999999999999888999999999986158 Q ss_pred CCCCEECCCC-CCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH Q ss_conf 8851210478-714000046805887631028770444345635877776643999967784407899999999999997 Q gi|254780750|r 600 NYCRPIIDNS-TNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ 678 (920) Q Consensus 600 ~y~rP~i~~~-~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQ 678 (920) |||||+|+|+ ..|.|++||||+||.+|..+ |||||+.|++ ++.|++|||||||||||+|+|||||+.|||| T Consensus 787 nYvRPtfvd~~~eI~ikngRhPvIe~Ll~d~----fVPNdi~ls~----egerc~IITGPNMGGKSsyIrQvALitIMAQ 858 (1070) T KOG0218 787 NYVRPTFVDGQQEIIIKNGRHPVIESLLVDY----FVPNDIMLSP----EGERCNIITGPNMGGKSSYIRQVALITIMAQ 858 (1070) T ss_pred CCCCCCCCCCHHHHHHHCCCCCHHHHHHHHC----CCCCCCEECC----CCCEEEEEECCCCCCCHHHHHHHHHHHHHHH T ss_conf 9548540241245343157883389875312----5788623468----8865899837998870499999999999998 Q ss_pred CCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 19853035320682210567652376611385328999999999999589985699932588988056799999999999 Q gi|254780750|r 679 MGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYL 758 (920) Q Consensus 679 iG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l 758 (920) +||||||++|++++||.|||||||+|||..|.||||+||.+|++||++||++||||+||+||||||+||+|||||+++|| T Consensus 859 iGsfVPAeea~l~IfdgvltRmGAsDnI~~grSTFm~Emldt~eil~kat~~SlvilDElGRGTsThDGiAIsYAtL~yf 938 (1070) T KOG0218 859 IGSFVPAEEARLSIFDGVLTRMGASDNIINGRSTFMVEMLDTLEILKKATKRSLVILDELGRGTSTHDGIAISYATLKYF 938 (1070) T ss_pred HCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 54766367753157766787605543310451689999988999987603000011576548876665455799999999 Q ss_pred HHHCCCEEEEECCCHHHHHHHHHCCC-EEEEEEEEEEEC-------CEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHH Q ss_conf 97269849997487579766430688-589999999609-------9277877774478988778999998299989999 Q gi|254780750|r 759 HETNRCRGLLATHFHELTDLSKSLKR-FHNATLQVSDSN-------EGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVIS 830 (920) Q Consensus 759 ~~~~~~~~lfaTHy~eL~~l~~~~~~-v~n~~~~~~~~~-------~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~ 830 (920) .+..+|.+||+|||+.|++++..+++ +.||||.+-+.. +.|+|||||++|+|.+|||++||+||++|.++|. T Consensus 939 ~~~~k~l~LFvTHfP~l~eie~~f~gqv~nyHmgyl~sedk~~~d~dsVtfLYklvrGlasrSyGlnVAklA~ip~sii~ 1018 (1070) T KOG0218 939 SELSKCLILFVTHFPMLGEIESGFPGQVRNYHMGYLESEDKTGEDWDSVTFLYKLVRGLASRSYGLNVAKLARIPKSIIN 1018 (1070) T ss_pred HHHHCEEEEEEECCCCHHHHHCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHCCCHHHHH T ss_conf 97520257765338406536217985224035312333134578840446599873103103445018988279989999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999987630001111211111000 Q gi|254780750|r 831 RAYDILKTFEKLYHHNQKDMRLYYPEIQT 859 (920) Q Consensus 831 ~A~~~~~~le~~~~~~~~~~~~~~~~~~~ 859 (920) ||.+++++||+.....++..+.++..+.. T Consensus 1019 rA~siSeeleke~rn~rk~lk~fAkL~~i 1047 (1070) T KOG0218 1019 RAFSISEELEKESRNERKLLKLFAKLKRI 1047 (1070) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99888899998876699999999999998 No 5 >KOG0217 consensus Probab=100.00 E-value=0 Score=1275.49 Aligned_cols=815 Identities=30% Similarity=0.425 Sum_probs=669.3 Q ss_pred CCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCC Q ss_conf 99856778203575677289168999999987799299998088222007999999986181891078899888762656 Q gi|254780750|r 7 SPQKLYSEQIDLEENLKSSTPMMRQYIEIKSINPDSLVFYRMGDFYELFFDDALLASRCLGITLTKRGKHLGKDIPMCGV 86 (920) Q Consensus 7 ~p~~~~~~~~~~~~~~~~~TP~~~Qy~~iK~~~~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~~~~~~~~pmaGv 86 (920) -|.+++.+| .+++++||+++|||+||++|+|||+||++|+|||+|+.||.+++++|+|++|. .|+||||| T Consensus 234 Dp~TLyiP~----s~~~kftpg~kqwWeiKs~~fD~ivfFk~GKFyELye~DA~vg~~~~dl~f~~------vN~~~~Gf 303 (1125) T KOG0217 234 DPRTLYIPP----SFWKKFTPGEKQWWEIKSDNFDTIVFFKKGKFYELYEMDALVGARLFDLKFMD------VNMPHSGF 303 (1125) T ss_pred CCCCEECCH----HHHHCCCCHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHCCEEECC------CCCCCCCC T ss_conf 831130487----89731793245465424321617888715528888764666533320335513------66534799 Q ss_pred CHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHCCC-----CCCEEEEEEEEECCCCEECCHHCCCCCCCEEEEEEECCC Q ss_conf 176699999999988985999950289788741289-----984554589997753030201038787736999995298 Q gi|254780750|r 87 PVHTANHYIQKLIKIGHRIAICEQIESPLEAKKRGN-----KSLVRRNVVRLVTPGTLTEDQLLSPTDSNYLMAVARIRT 161 (920) Q Consensus 87 P~~~l~~yl~~Lv~~GykVaiveQ~E~p~~~~~~~~-----~~~v~R~Vt~IiTpGT~~d~~~l~~~~~nyL~aI~~~~~ 161 (920) |+++++.|+.+++++|||||.|||+|+|..+..+.. .++|+|+|++|+|.||++|..++.+..+.||+||.+... T Consensus 304 PE~sf~~~a~q~iq~GYkvarVEQtEt~l~~e~r~~~~~~kdkvvrRev~~ilt~GT~td~~l~~~~~akylmai~e~~~ 383 (1125) T KOG0217 304 PEGSFDYWADQFIQKGYKVARVEQTETPLGKEIRERKTGKKDKVVRREVCRILTNGTLTDIALLTSDLAKYLMAIKESEE 383 (1125) T ss_pred CCCCHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCHHHHHHCCHHHHHHHHHHHCCC T ss_conf 85430567999996064566313666767765420124530256788999986288501577743678888888760677 Q ss_pred -------EEEEEEEECCCCEEEEEECCHH----HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCC-CEE--CCCCCC Q ss_conf -------4999999978655999605878----99999852298579970776896789988742388-157--185211 Q gi|254780750|r 162 -------EWAIAWIDISTGIFKISTSNHD----RLISDIMRIDPREIIFSEKELSHAEYKSLFETLGN-VAV--AQPNVF 227 (920) Q Consensus 162 -------~~Gia~iDisTG~~~~~~~~~d----~l~~~L~~l~P~EIii~~~~~~~~~~~~l~~~~~~-~~~--~~~~~~ 227 (920) .+|+|++|++||++.+++|.|| .|.+.|++..|+|+|...+.++......+.-.+.+ ... ..+..+ T Consensus 384 ~~~~~~~s~gvc~iDtstge~~~~eF~DDr~~s~L~tlLs~~rp~E~I~e~~~ls~~t~~~ik~~~~~~~~~n~~~~~eF 463 (1125) T KOG0217 384 SYSTGEVSFGVCFIDTSTGEINIGEFCDDRHCSKLDTLLSQVRPVELIEERKNLSDPTKVIIKLKLSSNLVENLVPKSEF 463 (1125) T ss_pred CCCCCCEEEEEEEEECCCCEEEEEEEECCHHHHHHHHHHHCCCHHHHHHHHHCCCCCEEEEEEEEECCHHHHCCCCHHHH T ss_conf 77767514558999835553889874050556488899870668999987510477514421002235344323673540 Q ss_pred CCHHHHHHHH--HHHCCC-------CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCC--CCEEEECCH Q ss_conf 4713447889--987087-------8642102444346888777764368741122-222211344574--540142513 Q gi|254780750|r 228 FDHSISESRI--ADYYNI-------TTVDTFGSFSQVEKTAAAAAISYIKKTQMVN-KPTIGYPEREDI--QSTLFIDSA 295 (920) Q Consensus 228 f~~~~~~~~l--~~~~~~-------~~l~~~~~~~~~~~~a~~all~Yl~~~~~~~-~~~i~~~~~~~~--~~~m~LD~~ 295 (920) |+.......+ .++|.. ..+...+...+.+..|+|+++.||+...... +..+..+..++. .+.|.||.+ T Consensus 464 wdsek~~~eii~~dy~~~~g~e~~~sil~~p~~~~~la~safg~~~~Ylk~~~id~~llsm~n~~ey~~~~~s~mvlD~~ 543 (1125) T KOG0217 464 WDSEKTGREIISEDYFESLGLEDSPSILKSPNTDKELALSAFGGLFYYLKKLLIDEELLSMKNFEEYDSLDQSEMVLDGA 543 (1125) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCEEECCH T ss_conf 33235888875455430136667433216888542035987788999999876679886213244222011212010433 Q ss_pred HHCCEEEEECC-CCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCH Q ss_conf 32015454127-87764127876301234337899998631002345678989999997402212588999997520483 Q gi|254780750|r 296 ARSNLEILRTL-SGSREQSLLKTIDYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFIQNPILINPIQKILKSSPDV 374 (920) Q Consensus 296 Tl~nLEI~~~~-~g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~~~~~l~~~L~~i~Dl 374 (920) |++|||||.|. +|+.+|+||..+|+|.||||+|||+.||++||+|.+.|++|||+|+.|..++..+.++...|+++||+ T Consensus 544 tL~NleIf~Ns~~G~~~gtL~~~~~~csTpfGKRllk~Wl~~Pl~~~~~I~~R~dav~~l~~~~~~~~~~~e~l~klPDl 623 (1125) T KOG0217 544 TLENLEIFSNSRNGGDEGTLFYAVNRCSTPFGKRLLKTWLMAPLCDKEDIKQRQDAVDSLGKAPMDRTKVGETLKKLPDL 623 (1125) T ss_pred HHHHHHHHCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCH T ss_conf 33545540157789973208998762247177788999852767788889998999998741740699999998647729 Q ss_pred HHHHHHHHHHHHC-HHHHHHHHHHHHHHHHHHHHH---HHCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHC Q ss_conf 4444567752100-002120689999986556754---30258-4577876765776789999999998543101-2210 Q gi|254780750|r 375 PRSLSRLKIGRGE-PKDIAVIRDGIRSGINILNLF---LKNDL-PEELRGAVEKLKKLPQSLEKTLSSMLSDDLP-TFKR 448 (920) Q Consensus 375 eRll~ri~~~~~~-p~dl~~L~~sl~~~~~i~~~l---~~~~~-~~~l~~~~~~l~~l~~~l~~~l~~~i~~~~~-~~~~ 448 (920) ||+|.|++.+..+ ++++..+...|.....+...+ ..... ......+.+.+..++ ++...|........- ...+ T Consensus 624 ERlL~Rih~~~~~~~k~i~~f~rvLegfk~~~~~~~~~~~v~~~~~~~~~is~~~~~~p-~~~~~i~~~~~af~r~~a~~ 702 (1125) T KOG0217 624 ERLLVRIHNGGEKNKKKIADFIRVLEGFKEINKLLGSLINVLKEGEGLRLISELLESMP-NLSEEIENWTEAFDRVKAVK 702 (1125) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHHHHHH T ss_conf 99999997047641357999999999999999999999999886678889998746750-03578999999987777754 Q ss_pred CCCEE-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEE Q ss_conf 46010-16854114557998863899999776877887388751102558622376204114333441116876527985 Q gi|254780750|r 449 DGGFL-RDGADASLDETRSLRDQSKRIIASLQLKYAEETKIKNLKIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQ 527 (920) Q Consensus 449 ~~~~i-k~g~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~~i~sLki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~q 527 (920) ++.|+ ..|+|.++|+..+..++..+.+.++...++.+.+++++.+....+.-|++|||.+-...++ ..|+..+ T Consensus 703 eg~i~P~~Gfd~eyD~a~k~~~e~e~~L~~~L~~~rk~l~c~si~~~~vGk~~y~lEvP~n~~~~s~------~~~~~~S 776 (1125) T KOG0217 703 EGVIVPLEGFDEEYDEAMKRVDEAENELLAYLEEYRKRLGCSSIVFVDVGKDVYQLEVPENGGVPSS------LRYELMS 776 (1125) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCEEEEEECCCCCCCCCC------HHHHHHH T ss_conf 2752778665277887766788899999999999987517883267633754799963765687783------6899987 Q ss_pred ECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCCEE Q ss_conf 212000111345666433464377777643344788876543011257988889998677887788877525--885121 Q gi|254780750|r 528 TMSNLTRFTTLELIDLENRITNATNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKEQ--NYCRPI 605 (920) Q Consensus 528 t~~~~~Rf~t~eL~~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~--~y~rP~ 605 (920) +.++..||.+|++..+...+.+++++........-++++.++..++..|+..+.++|.|||++|+|.+|... ++|||+ T Consensus 777 ~~Kg~~RY~tp~~~kli~~l~~aee~~~~~~~d~~~r~~~~f~~~~~~w~~tv~~~a~iD~l~sla~~s~~~~~~~Crp~ 856 (1125) T KOG0217 777 AKKGVSRYYTPDLRKLIAHLDEAEERKKSSLSDLKRRLIVRFDEHYIIWQATVKALASIDCLLSLAETSKGLGGPMCRPE 856 (1125) T ss_pred HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCE T ss_conf 53273324687799999999999999987777889999998632379999999999988999877776503798753524 Q ss_pred CCC---C-CCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 047---8-714000046805887631028770444345635877776643999967784407899999999999997198 Q gi|254780750|r 606 IDN---S-TNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS 681 (920) Q Consensus 606 i~~---~-~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~ 681 (920) +.+ . +.+.++.+||||+-... .+..|||||+.+|.. ...++.++||||||||||+|||+|++|||||||| T Consensus 857 i~~~~dt~~~l~~~~~~Hpcfsl~s---~~~~fipN~v~~g~~---~e~~~~llTGpNmgGKSTllRq~c~~vilaq~G~ 930 (1125) T KOG0217 857 IVESTDTPGFLIVKGLRHPCFSLPS---TGTSFIPNDVELGGA---EENREGLLTGPNMGGKSTLLRQACIAVILAQIGC 930 (1125) T ss_pred EECCCCCCCEEEEECCCCCEEECCC---CCCCCCCCHHHCCCC---CCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 6414688861688426672462376---777645632220555---4320223206875772489999999999998578 Q ss_pred CCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 53035320682210567652376611385328999999999999589985699932588988056799999999999972 Q gi|254780750|r 682 YVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET 761 (920) Q Consensus 682 fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~ 761 (920) +|||+.|+++|+|+||||+||+|+|..|+||||+|+.||+.||++||++|||++||+||||||+||.|||.||+|||... T Consensus 931 ~VPa~~~~~tpidrI~tRlGA~D~im~g~STF~vELsET~~IL~~aT~~SLvi~DELGRGtst~DG~aIA~aVLe~l~~~ 1010 (1125) T KOG0217 931 DVPAEVMELTPIDRIFTRLGANDDIMSGESTFFVELSETKKILKHATRHSLVIVDELGRGTSTFDGTAIAEAVLEHLSEG 1010 (1125) T ss_pred CCCHHHHCCCCHHHHHHHCCCCCCHHCCCCEEEEECCCHHHHHHHCCCCCEEEEHHHCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 76588860661677766406653000377528986044387874337652232123237642558742189999999846 Q ss_pred CCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECC-EEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 698499974875797664306885899999996099-2778777744789887789999982999899999999999998 Q gi|254780750|r 762 NRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNE-GIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFE 840 (920) Q Consensus 762 ~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~-~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le 840 (920) ++|+++|+||||.|+.....+|.|++.||.....+. ++|||||+.+|+|++|||++||||||+|..||++|...+.+|| T Consensus 1011 i~c~~fFSTHYhsl~~~~~~~p~Vrl~~Ma~~vd~e~~vtFLYkl~~G~cpkSyG~~vArmaglp~~vi~~a~~~a~E~e 1090 (1125) T KOG0217 1011 IQCLGFFSTHYHSLCVDFMHHPQVRLLHMACVVDEEIDVTFLYKLEEGICPKSYGFNVARMAGLPDQVIDRAEIKAHELE 1090 (1125) T ss_pred CCCCCCCCCCCCCHHHHHHCCCCCCCHHHEEEECCCCCEEEEEHHCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 63227764342303376624843000011135437750798601104789720668999865985999999999999999 Q ss_pred HHHH Q ss_conf 7630 Q gi|254780750|r 841 KLYH 844 (920) Q Consensus 841 ~~~~ 844 (920) +... T Consensus 1091 ~~~~ 1094 (1125) T KOG0217 1091 KSSA 1094 (1125) T ss_pred HHHH T ss_conf 8777 No 6 >KOG0219 consensus Probab=100.00 E-value=0 Score=970.25 Aligned_cols=787 Identities=26% Similarity=0.406 Sum_probs=637.7 Q ss_pred CHHHHHHHHH-HHH--CCCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCC---CCCCCCCCCCCCCHHHHHHHHHHH- Q ss_conf 9168999999-987--799299998088222007999999986181891078---899888762656176699999999- Q gi|254780750|r 26 TPMMRQYIEI-KSI--NPDSLVFYRMGDFYELFFDDALLASRCLGITLTKRG---KHLGKDIPMCGVPVHTANHYIQKL- 98 (920) Q Consensus 26 TP~~~Qy~~i-K~~--~~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~---~~~~~~~pmaGvP~~~l~~yl~~L- 98 (920) +-+++-|..+ +.. -.++|-||..|+||-.||+||..+|+.---+-.-.+ -...+++-.+.+--..++..++-| T Consensus 13 ~~~~~~f~~f~~~l~~p~~tvr~fdrge~ytv~geDa~~va~~v~kt~~~~k~l~~~~~~~~~~v~ls~~~~e~~vr~~l 92 (902) T KOG0219 13 EAADRNFLKFYEGLPKPTTTVRFFDRGEYYTVHGEDAEFVARNVYKTQAVIKTLSPDNAKNTKEVSLSKGNFEKVVRELL 92 (902) T ss_pred CHHHHHHHHHHHCCCCCCCEEEEECCCCEEEEECCCHHHHHHHHHHHHHHHEECCCCCCCCCEEEEECHHHHHHHHHHHH T ss_conf 06678999998448998765899427625899566233456555310333001477553343489864777999999999 Q ss_pred HHCCCEEEEEEECCCHHHHHHCCCCCCEEEEEEEEECCCCEEC-CHHCCCCCC-----CEEEEEE---ECCCEEEEEEEE Q ss_conf 9889859999502897887412899845545899977530302-010387877-----3699999---529849999999 Q gi|254780750|r 99 IKIGHRIAICEQIESPLEAKKRGNKSLVRRNVVRLVTPGTLTE-DQLLSPTDS-----NYLMAVA---RIRTEWAIAWID 169 (920) Q Consensus 99 v~~GykVaiveQ~E~p~~~~~~~~~~~v~R~Vt~IiTpGT~~d-~~~l~~~~~-----nyL~aI~---~~~~~~Gia~iD 169 (920) +..+|+|-+.+--+ +...+++|+ +||...+ ++++..+.. -|+.-.+ .+...+|+|++| T Consensus 93 ~~~~~~Ve~y~~~~--------~~w~l~~~~-----sPGN~~~fedll~~~~~v~is~~~~~v~~~~~~~~~~vgv~~~d 159 (902) T KOG0219 93 LVLRYRVEVYSSNQ--------GDWKLTKRG-----SPGNLVQFEDLLFSNNDVPISIISLIVKHPGVDGQRRVGVAFVD 159 (902) T ss_pred HHHCCCEEEEECCC--------CCEEEEECC-----CCCCHHHHHHHHCCCCCCHHHHHHEEEEEECCCCCCEEEEEEEC T ss_conf 87502068963476--------640677537-----99968999999713543213101025775224787346799704 Q ss_pred CCCCEEEEEECCHH----HHHHHHHHCCCCEEEEECCCCCH--HHHHHHHHHCCCCEECCCCCCCCHHHHHHHHHHHCCC Q ss_conf 78655999605878----99999852298579970776896--7899887423881571852114713447889987087 Q gi|254780750|r 170 ISTGIFKISTSNHD----RLISDIMRIDPREIIFSEKELSH--AEYKSLFETLGNVAVAQPNVFFDHSISESRIADYYNI 243 (920) Q Consensus 170 isTG~~~~~~~~~d----~l~~~L~~l~P~EIii~~~~~~~--~~~~~l~~~~~~~~~~~~~~~f~~~~~~~~l~~~~~~ 243 (920) .+--.+.+++|-++ +++..+..+.|+|+|++.+.... .....+++..+...+......|........+...+.. T Consensus 160 ~~~~k~~~~ef~Dn~~~snle~~l~~lg~kEcll~~~~~~~~~~kl~~~~~r~g~~~t~~~~~e~~~kdv~~~l~~~l~~ 239 (902) T KOG0219 160 TINVKIGLSEFVDDDSFSNLEALLIQLGPKECVLPEGSVAGEMKKLRNIIDRCGVLITLRKKSESSWKDVEQDLNRLLKS 239 (902) T ss_pred HHHEEEEHHHHCCCHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHCCCC T ss_conf 04604412240573878799999986197379801766406788999898426748887406402477899998753012 Q ss_pred CC--CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCHHHCCEEEEECCCC--CCCCHHHH-HH Q ss_conf 86--421024443468887777643687411222222113445745401425133201545412787--76412787-63 Q gi|254780750|r 244 TT--VDTFGSFSQVEKTAAAAAISYIKKTQMVNKPTIGYPEREDIQSTLFIDSAARSNLEILRTLSG--SREQSLLK-TI 318 (920) Q Consensus 244 ~~--l~~~~~~~~~~~~a~~all~Yl~~~~~~~~~~i~~~~~~~~~~~m~LD~~Tl~nLEI~~~~~g--~~~gSL~~-~L 318 (920) .. ..........+..++.+++.|+....-........+..++...+|.+|..|+++|++|+...+ ....+|.. +| T Consensus 240 ~~~~~~~~e~~~q~a~~~~~~~i~yl~~~~e~~~s~~~ei~~~~~~~~m~ld~~av~alnlf~~~~~~~~~s~~L~~~~L 319 (902) T KOG0219 240 QEHAAYLPELELQLAMSALSALIKYLDLENEYSNSGKYELTNHGLHQFMRLDSAAVRALNLFPLPYNNPEKSNNLALSLL 319 (902) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH T ss_conf 02221563877699999999999998640540100047876334477756889999887535799887432124338998 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHH-HHHHHHCCHHHHHHHHHHHHHCHHHHHHHHHH Q ss_conf 01234337899998631002345678989999997402212588999-99752048344445677521000021206899 Q gi|254780750|r 319 DYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFIQNPILINPIQ-KILKSSPDVPRSLSRLKIGRGEPKDIAVIRDG 397 (920) Q Consensus 319 n~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~~~~~l~-~~L~~i~DleRll~ri~~~~~~p~dl~~L~~s 397 (920) |||+|++|.|||++|+.+||+|+..|++|+|.|+.+..+...+..++ +.|..+|||-|+..|+. +++-.|...+++. T Consensus 320 N~c~t~~G~rll~~w~~qpL~~~~ri~~r~d~v~~l~~~~~~rq~L~~~lL~~~pdi~rl~~~l~--~~~L~d~~r~yq~ 397 (902) T KOG0219 320 NHCKTLQGERLLRQWLKQPLRDIDRINERHDLVEALVEDAEIRQKLRDDLLRRIPDISRLARRLM--KANLQDVNRIYQA 397 (902) T ss_pred HHCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH--HCCHHHHHHHHHH T ss_conf 64102202466635643125778889887656999985248899999987520724777512322--1206777888988 Q ss_pred HHHHHHHHHHHHHCCC------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHH Q ss_conf 9998655675430258------4577876765776789999999998543101221046010168541145579988638 Q gi|254780750|r 398 IRSGINILNLFLKNDL------PEELRGAVEKLKKLPQSLEKTLSSMLSDDLPTFKRDGGFLRDGADASLDETRSLRDQS 471 (920) Q Consensus 398 l~~~~~i~~~l~~~~~------~~~l~~~~~~l~~l~~~l~~~l~~~i~~~~~~~~~~~~~ik~g~d~eLD~lr~~~~~~ 471 (920) ...++.+...+.+..- ...+......+.++.+.++. .++.+ .......+||..||++|-++|+..++. T Consensus 398 ~~~l~~~~~~l~~~~~~~~~ll~~~l~~~~~~~~kf~~~ve~----t~D~d--a~ee~ey~VR~eFdeeL~eLrq~LdeL 471 (902) T KOG0219 398 AKLLPTVVQVLISLSESHNRLLKSPLTEHLKKLEKFQEMVET----TVDLD--AEEENEYRVRVDFDEELQELREKLDEL 471 (902) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHH--HHHCCCEEEECCCCHHHHHHHHHHHHH T ss_conf 877688999888640156666650565643128999999998----73376--775473798446588899999999999 Q ss_pred HHHHHHHHHHHHHHCCCCC---CCCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 9999977687788738875---1102558622376204114333441116876527985212000111345666433464 Q gi|254780750|r 472 KRIIASLQLKYAEETKIKN---LKIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLELIDLENRIT 548 (920) Q Consensus 472 ~~~l~~l~~~~~~~~~i~s---Lki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~ 548 (920) +..+.++..+....++... ||...+..+||++.+|.+..+.+++ ..+|...+|.||+++|+|.+|..+++... T Consensus 472 ~~~m~~~hkrv~~dl~~D~~kklkLe~~~~~G~~~RlTr~e~~~LR~----~k~y~eLstqK~GV~FTtk~L~slN~e~~ 547 (902) T KOG0219 472 ERKMEKLHKKVSADLGLDPKKQLKLENSAQFGWYFRVTRKEEKVLRK----KKNYTELSTQKGGVKFTTKKLSSLNDEFM 547 (902) T ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEECCCHHHEEEEEEEHHHHHHHC----CCCCEEEEEEECCEEEEHHHHHHHHHHHH T ss_conf 99999999998765078942010110300211000001206657630----58936999852768987415766678899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCEECCCCC--CEEECCCCCHHHHH Q ss_conf 3777776433447888765430112579888899986778877888775--2588512104787--14000046805887 Q gi|254780750|r 549 NATNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAK--EQNYCRPIIDNST--NFIVKDGRHPIVEK 624 (920) Q Consensus 549 ~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~--~~~y~rP~i~~~~--~l~i~~gRHPviE~ 624 (920) +.+.+....+.++.++++.....|.+.|..+.+.+|.|||+.|||.+|. --+|+||.+.+.+ .+.++++||||+|. T Consensus 548 ~~qk~Y~~~Q~~ivrevikia~tY~Ppleal~~vlAhLDv~~SFa~~st~a~~pYvRP~~l~~gs~rl~l~~~rHp~lE~ 627 (902) T KOG0219 548 SLQKEYDEAQNEIVREIIKIAATYTPPLEALNQVLAHLDVFVSFAHAATVAPIPYVRPKLLPLGSKRLELKQSRHPVLEG 627 (902) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHEEEHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHC T ss_conf 98888999999999999999851488188999999877740010111256877753741255522688887425632012 Q ss_pred HHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCC Q ss_conf 63102877044434563587777664399996778440789999999999999719853035320682210567652376 Q gi|254780750|r 625 TLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSAD 704 (920) Q Consensus 625 ~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D 704 (920) |...+|||||+.+. .+..+++|||||||||||||+||+|++++||||||||||++|.++++|.|+||+||+| T Consensus 628 ----Qd~~~fIpNdv~le----~~~~~~~IiTGpNMGGKSTyir~~Gvi~lmAQIGcfVPce~A~i~IvD~Il~RVGA~D 699 (902) T KOG0219 628 ----QDEIPFIPNDVVLE----KGKCRMLIITGPNMGGKSTYIRQVGVIVLMAQIGCFVPCESATISIVDGILTRVGAGD 699 (902) T ss_pred ----CCCCCCCCCCCCCC----CCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCEEHHHCCCCHHHHHHHHHCCCH T ss_conf ----45677777754001----5775189985788676302121336999999818823565538760167876416441 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCC Q ss_conf 61138532899999999999958998569993258898805679999999999997269849997487579766430688 Q gi|254780750|r 705 NLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKR 784 (920) Q Consensus 705 ~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~ 784 (920) ++++|.||||+||.||+.||+.||++|||++||+||||||+||.+|||||-||++.+++|++||||||||++.++..++. T Consensus 700 ~q~kG~STFM~Emleta~Ilr~at~~SliiiDELGRGTSt~DGfgiAwai~ehi~~ki~cf~lfATHfhElt~lae~~~~ 779 (902) T KOG0219 700 SQLKGISTFMAEMLETASILRRATKNSLIIIDELGRGTSTYDGFGIAWAIAEHIATKIGCFCLFATHFHELTKLAEQLPT 779 (902) T ss_pred HHHCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 54323178999999999999856878289981367874300476178999999998875868778678888766531035 Q ss_pred EEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 5899999996099277877774478988778999998299989999999999999876300 Q gi|254780750|r 785 FHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFEKLYHH 845 (920) Q Consensus 785 v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le~~~~~ 845 (920) ++|.||.+..++++++.+|++.+|+|++||||+||++||||+.||+.|+..+.+||+.... T Consensus 780 vKn~h~~a~i~~~~~~llY~V~~Gv~d~SFGi~VA~~a~fp~~vie~A~~~~~ele~~~~~ 840 (902) T KOG0219 780 VKNLHVTAQIENDDITLLYKVFEGVCDQSFGIHVAELVGFPEHVIEMAKQKAEELEDELVI 840 (902) T ss_pred HHHHEEEEEECCCCHHHHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 5310003676476301299885222257511467877189757799999999877787744 No 7 >KOG0220 consensus Probab=100.00 E-value=0 Score=817.65 Aligned_cols=772 Identities=24% Similarity=0.344 Sum_probs=595.1 Q ss_pred EEEEECCCHHHHHHCCCCCCEEEEEEEEECCCCEE---CCHHCCCCCCCEEEEEEECC----CEEEEEEEECCCCEEEEE Q ss_conf 99950289788741289984554589997753030---20103878773699999529----849999999786559996 Q gi|254780750|r 106 AICEQIESPLEAKKRGNKSLVRRNVVRLVTPGTLT---EDQLLSPTDSNYLMAVARIR----TEWAIAWIDISTGIFKIS 178 (920) Q Consensus 106 aiveQ~E~p~~~~~~~~~~~v~R~Vt~IiTpGT~~---d~~~l~~~~~nyL~aI~~~~----~~~Gia~iDisTG~~~~~ 178 (920) -+.+|+-.|.+.+..-+-+...|..++--+|.--. ....+.+.....++++++.+ +.+|+|.+|..+|+.+++ T Consensus 56 ~~a~~t~~~q~~~s~~S~~~~qrs~~~~~a~sl~~sssn~s~~~~~~~~v~~~v~e~r~~~~~~Ig~~~~~~~~~q~~l~ 135 (867) T KOG0220 56 SSARDTNYPQTLKTPLSTGNPQRSGYKSWAPSLGYSSSNSSAISAHSPSVIVAVVEGRGLARGEIGMASIDLKNPQIILS 135 (867) T ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHCCCHHHCCCCCHHHHCCCCCCEEEEEEECCCCCCCEEEEEEECCCCCCEEHH T ss_conf 14654156311257532235222245313524310344224423678753799874377566605899843889705343 Q ss_pred ECCHHH----HHHHHHHCCCCEEEEECCCCCHHHHHHHH----HHCCCC-EECCCCCCCCHHHHHHHHHHHCCCCCCCCH Q ss_conf 058789----99998522985799707768967899887----423881-571852114713447889987087864210 Q gi|254780750|r 179 TSNHDR----LISDIMRIDPREIIFSEKELSHAEYKSLF----ETLGNV-AVAQPNVFFDHSISESRIADYYNITTVDTF 249 (920) Q Consensus 179 ~~~~d~----l~~~L~~l~P~EIii~~~~~~~~~~~~l~----~~~~~~-~~~~~~~~f~~~~~~~~l~~~~~~~~l~~~ 249 (920) +|-++. +..-+.-+.|-||+++.+.......+.++ +..++. .......+|+...+.+.+.+++....--.+ T Consensus 136 ~f~dst~y~~v~~~l~i~s~~ei~i~~~~~a~~~skl~~~~~~e~~~~v~~~~~s~k~fns~~gl~~i~~~~~~~~s~vl 215 (867) T KOG0220 136 QFADSTTYAKVITKLKILSPLEIIISNTACAVGNSKLLFTLITENFKNVNFTTISRKYFNSTKGLEYIEQLCIAEFSTVL 215 (867) T ss_pred HHHCCCHHHHHHHHHHCCCHHHEEECCCCCCCCHHHHHHHHHHHCCCCCCEEEHHHHHCCCHHHHHHHHHHHHHHHHHHH T ss_conf 55355056777867620571432403654453167899998760055522101566641824359999998745667999 Q ss_pred H--HHHHHHHHHHHHHHHHHHHHHCCCC-CCCCCCCCCCCCCEEEECCHHHCCEEEEECCCCCCCCHHHHHHCCCCCCHH Q ss_conf 2--4443468887777643687411222-222113445745401425133201545412787764127876301234337 Q gi|254780750|r 250 G--SFSQVEKTAAAAAISYIKKTQMVNK-PTIGYPEREDIQSTLFIDSAARSNLEILRTLSGSREQSLLKTIDYSITGAG 326 (920) Q Consensus 250 ~--~~~~~~~~a~~all~Yl~~~~~~~~-~~i~~~~~~~~~~~m~LD~~Tl~nLEI~~~~~g~~~gSL~~~Ln~t~T~~G 326 (920) + .....+++|+++++.|+.+++..-. +.-..+.+....+.+.||..+.++|||+.+..-....+|++++|+|.|++| T Consensus 216 e~i~~k~~al~a~a~llky~~~~~~~~~~~~slri~~~gs~nT~~id~~~~~~lelV~~~~~kn~~~l~~vl~~T~t~~g 295 (867) T KOG0220 216 EEIQSKTYALGAAAALLKYVEEIQSSVYAPKSLRICFQGSENTAMIDSSSAQSLELVINNQYKNNHTLFGVLNYTKTPGG 295 (867) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCEEEEECCCCCCEEEECHHHHHCCCCEEEEEEEEECCCC T ss_conf 99999999999999999999998876401000477751245335641146563487562454133320231310104540 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHH----HH-----CHHHHHHHHHH Q ss_conf 8999986310023456789899999974022125889999975204834444567752----10-----00021206899 Q gi|254780750|r 327 GRLFAERIASPLTDSTKINTRLDSINFFIQNPILINPIQKILKSSPDVPRSLSRLKIG----RG-----EPKDIAVIRDG 397 (920) Q Consensus 327 ~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~~~~~l~~~L~~i~DleRll~ri~~~----~~-----~p~dl~~L~~s 397 (920) .|.||..+++||+|...|+.|+++++++..++.+...++..+++.+|+++++++..-- .+ ....+..+..+ T Consensus 296 ~r~lRssilqpl~d~~ti~~rleaiqeL~a~~~L~~~Lr~~~k~~~dld~~~s~~~~~~~~~~i~~~~s~I~~~~~Lk~t 375 (867) T KOG0220 296 YRRLRSSILQPLTDIETINMRLEAIQELLADEELFFGLRSVIKRFLDLDQLLSVLVQIPTQDTVNAAESKINNLIYLKHT 375 (867) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 36677641044520203467999999875283576542788752143789999998653677643226678899998888 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH-------HHHCCCCEECCCCCCHHHHHHHHHH Q ss_conf 9998655675430258457787676577-6789999999998543101-------2210460101685411455799886 Q gi|254780750|r 398 IRSGINILNLFLKNDLPEELRGAVEKLK-KLPQSLEKTLSSMLSDDLP-------TFKRDGGFLRDGADASLDETRSLRD 469 (920) Q Consensus 398 l~~~~~i~~~l~~~~~~~~l~~~~~~l~-~l~~~l~~~l~~~i~~~~~-------~~~~~~~~ik~g~d~eLD~lr~~~~ 469 (920) +..+..+...+.... ...+....+.+. +-...+.+.+...|+++.- ...+....+|.+++.-||..|..+. T Consensus 376 L~lv~~~~~al~~~~-s~~~~e~~~~~~~~r~~~i~~~i~e~I~dd~l~a~~~l~~~~qkcyAvks~i~~~LDiaR~ty~ 454 (867) T KOG0220 376 LELVDPLKIALKNCN-SNLLREYYGSFKDKRFGIILEKIKEVINDDALYAKGCLNLRTQKCYAVKSNINGFLDIARRTYT 454 (867) T ss_pred HHHHHHHHHHHHHCH-HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCEEEECCCCCHHHHHHHHHHH T ss_conf 988988999986020-5789999977444278999878887752787733310554035345421363089999999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf 38999997768778873887511025586223762041143334411168765279852120001113456664334643 Q gi|254780750|r 470 QSKRIIASLQLKYAEETKIKNLKIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLELIDLENRITN 549 (920) Q Consensus 470 ~~~~~l~~l~~~~~~~~~i~sLki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~~ 549 (920) +..+...+...++.+.. ..+++..|+...||++.++..... ++. ...|+.|+.....++..+|++..+.+++.++.+ T Consensus 455 ei~~~~~~~i~~l~E~~-~~nl~~~f~sarGF~~ri~~~~~~-~~~-~~lP~~fi~~~~~~~~~~~~s~~~ik~N~Rlk~ 531 (867) T KOG0220 455 EIVDDIAGMISQLGEKF-SLNLRLSFSSARGFFIRITTDCIA-LPS-DTLPSEFIKISKVKNSYSFTSADLIKMNERLKE 531 (867) T ss_pred HHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCEEEEEECCCCC-CCC-CCCCHHHHHHHHHCCEEEECHHHHHHHHHHHHH T ss_conf 99999999999988624-854103566445179995034202-431-247556642343234255214888887799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCEEECCCCCHHHHHHHHHC Q ss_conf 77777643344788876543011257988889998677887788877525885121047871400004680588763102 Q gi|254780750|r 550 ATNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKEQNYCRPIIDNSTNFIVKDGRHPIVEKTLKQQ 629 (920) Q Consensus 550 ~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~~y~rP~i~~~~~l~i~~gRHPviE~~l~~~ 629 (920) ...++......+...+...+..+.+.+..++++++.||+++|||......+||||+|+++ +.|++||||++|.... T Consensus 532 ~~~E~~l~te~~v~~lld~i~~~I~~L~~iae~~~~lD~l~sfa~~~~~~~y~~P~fT~s--laI~qGRHPILe~i~~-- 607 (867) T KOG0220 532 VLREIFLMTEMIVCKLLDEIYEHISCLYKLAEAVSMLDMLCSFAHACTLSDYVRPEFTDS--LAIKQGRHPILEKISA-- 607 (867) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC--EEECCCCCCHHHHHCC-- T ss_conf 999899999999999999998767889999999999999988887501225556666886--0233678730454402-- Q ss_pred CCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCC Q ss_conf 87704443456358777766439999677844078999999999999971985303532068221056765237661138 Q gi|254780750|r 630 SSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASG 709 (920) Q Consensus 630 ~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g 709 (920) +.+|.||+.+. .+.+++|||||||+||||||||||+++|||||||||||.+|.+|+|+|||||||..|+|..+ T Consensus 608 --ek~i~N~t~~t-----~~s~f~IITGPNMsGKSTYLKQvAl~~IMAQIGc~IPA~YaS~pIf~RIFtRmg~nDele~N 680 (867) T KOG0220 608 --EKPIANNTYVT-----EGSNFLIITGPNMSGKSTYLKQVALCQIMAQIGSYVPAEYASFRVFKRIFTRMGTNDELERN 680 (867) T ss_pred --CCCCCCCCEEE-----CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHHHCCCHHHHHC T ss_conf --47666752131-----26623788779877614999999999999986267641102125899998874574455514 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEE Q ss_conf 53289999999999995899856999325889880567999999999999726984999748757976643068858999 Q gi|254780750|r 710 RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNAT 789 (920) Q Consensus 710 ~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~ 789 (920) .||||.||.|+|.|+++|+.+|||+||||||||||.||+||+|||+|||+.. ++.|+|||||.+|+.++.-+|.|.|+| T Consensus 681 sS~F~sEMke~A~Ilq~a~~~SLiVlDELgR~TSteeGiaityAvCE~lL~L-kayTflATHFldIa~lan~~paVdnlH 759 (867) T KOG0220 681 SSTFMSEMKEAAYILQNANKNSLIVLDELGRGTSTEEGIAITYAVCEYLLSL-KAYTFLATHFLDIAAIANYYPAVDNLH 759 (867) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEHHHCCCCCCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 4678998998999997377672797765216776100044589999999876-576889998877999963374421025 Q ss_pred EEEEEECCEEEE-EEEEEECCC-CCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 999960992778-777744789-887789999982999899999999999998763000111121111100001120035 Q gi|254780750|r 790 LQVSDSNEGIIF-LHKVIPGIA-DHSYGIQVGKLAGLPNTVISRAYDILKTFEKLYHHNQKDMRLYYPEIQTKESKNNLS 867 (920) Q Consensus 790 ~~~~~~~~~i~f-lykl~~G~~-~~Sygi~vA~laG~p~~vi~~A~~~~~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~ 867 (920) |.++-.++...| .|||..|.. +.-||+++|++.-+|.+|++.|+.+..++-+.....+++... .+.+..-.. T Consensus 760 F~~q~~eNssk~~k~kLsrg~~~~~~yG~~~vE~s~iPd~i~e~a~~~~t~i~A~v~~~~rd~~~----~~rq~~Vy~-- 833 (867) T KOG0220 760 FLVQTDENSSKNKKYKLSRGLTEEKNYGLKAAEVSSLPDSIVEDAKEITTQITAQILQNQRDTPE----MERQRAVYH-- 833 (867) T ss_pred EEEEECCCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCHH----HHHHHHHHH-- T ss_conf 53453355105656664122330545561478875187899986567899999988751457668----888999999-- Q ss_pred CCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHC Q ss_conf 47389999986078222798999999999999986521 Q gi|254780750|r 868 KNDKLFVEKIKCLNLDEMSPLTALKTLYAVKAWTLEKY 905 (920) Q Consensus 868 ~~~~~~~~~~~~~~~~~~~p~~al~~l~~lk~~~~~~~ 905 (920) ..+.+.+..-+. .|.+|+.+|.+|++.+.|+- T Consensus 834 -----~a~~~~~t~gn~-~e~~~~~klk~l~k~~ve~~ 865 (867) T KOG0220 834 -----LATRLVQTAGNS-DEDSLRIKLKNLKKKYKEDF 865 (867) T ss_pred -----HHHHHHHHHCCC-CCHHHHHHHHHHHHHHHHHC T ss_conf -----999999860777-76789999999999998661 No 8 >PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed Probab=100.00 E-value=0 Score=726.06 Aligned_cols=502 Identities=21% Similarity=0.321 Sum_probs=430.1 Q ss_pred ECCHHHCCEEEEECCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 25133201545412787764127876301234337899998631002345678989999997402212588999997520 Q gi|254780750|r 292 IDSAARSNLEILRTLSGSREQSLLKTIDYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFIQNPILINPIQKILKSS 371 (920) Q Consensus 292 LD~~Tl~nLEI~~~~~g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~~~~~l~~~L~~i 371 (920) |+..|++.||.-. =+=.+.++|.|++|+++++. ++|+.|.++|+.|++.++++.+-.... -.-.+.++ T Consensus 2 m~~~~l~~Lef~~--------i~~~l~~~~~t~~Gk~~l~~--L~P~~~~~~i~~~l~~t~E~~~l~~~~--~~~pl~~i 69 (780) T PRK00409 2 MNEETLRVLEFNK--------IKEQLKTFAATKLGKEKVED--LKPLTSFEEVKEHLEETDEAAKILRLK--GFPPFEGV 69 (780) T ss_pred CCHHHHHHCCHHH--------HHHHHHHHCCCHHHHHHHHC--CCCCCCHHHHHHHHHHHHHHHHHHHHC--CCCCCCCC T ss_conf 4767888659899--------99999717589999999976--899899999999999999999999707--99996785 Q ss_pred CCHHHHHHHHHHH-HHCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 4834444567752-100002120689999986556754302584577876765776789999999998543101221046 Q gi|254780750|r 372 PDVPRSLSRLKIG-RGEPKDIAVIRDGIRSGINILNLFLKNDLPEELRGAVEKLKKLPQSLEKTLSSMLSDDLPTFKRDG 450 (920) Q Consensus 372 ~DleRll~ri~~~-~~~p~dl~~L~~sl~~~~~i~~~l~~~~~~~~l~~~~~~l~~l~~~l~~~l~~~i~~~~~~~~~~~ 450 (920) +|+++++.|+..| ..+|.+|..+...+..+..+...+......+.|..+...+..++ .+.+.|..++. +. T Consensus 70 ~Di~~~l~r~~~g~~L~~~EL~~i~~~L~~~~~lk~~l~~~~~~p~L~~l~~~l~~~~-~L~~~I~~~Id--------~~ 140 (780) T PRK00409 70 TDIDDALKRAEKGGVLSGDELLEIAKTLRYFRQLKRFIEDLEELPILEEWVAQIRTLP-ELEQEIHNCID--------EE 140 (780) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCH-HHHHHHHHHCC--------CC T ss_conf 1379999998658978999999999999999999999865544456999998575868-99999998758--------99 Q ss_pred CEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHH-HHCCCCCCCC--CCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEE Q ss_conf 010168541145579988638999997768778-8738875110--2558622376204114333441116876527985 Q gi|254780750|r 451 GFLRDGADASLDETRSLRDQSKRIIASLQLKYA-EETKIKNLKI--KHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQ 527 (920) Q Consensus 451 ~~ik~g~d~eLD~lr~~~~~~~~~l~~l~~~~~-~~~~i~sLki--~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~q 527 (920) |.|+++++++|+.+|......+..+.+...++. .....+.|.- .......|+|.|.....+.+| ++++.+ T Consensus 141 G~I~D~AS~eL~~IR~~i~~~~~~i~~~l~~~l~~~~~~~~Lqd~~it~R~gRyvlpVk~~~k~~v~-------GiVhd~ 213 (780) T PRK00409 141 GEVKDSASEKLRGIRRQLRRKNSRIREKLNRIVRSKSLQKYLQDTIITIRNDRYVLPVKAEYKHAIK-------GIVHDQ 213 (780) T ss_pred CEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCEEEEECCEEEEEEEHHHCCCCC-------EEEEEE T ss_conf 8288887989999999999999999999999985111124012342799999899996367627788-------189998 Q ss_pred ECCCCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEC Q ss_conf 21200011134-56664334643777776433447888765430112579888899986778877888775258851210 Q gi|254780750|r 528 TMSNLTRFTTL-ELIDLENRITNATNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKEQNYCRPII 606 (920) Q Consensus 528 t~~~~~Rf~t~-eL~~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~~y~rP~i 606 (920) |.++.+.|+.| +++++++++..+..+....+..++.+|...+..+.+.|....+.+++||+++|+|.+|..++|++|.+ T Consensus 214 S~SG~T~fIEP~~vveLnN~l~~L~~~e~~Ei~rIL~eLs~~v~~~~~~l~~~~~~l~~LD~~~AkA~~A~~~~~~~P~~ 293 (780) T PRK00409 214 SSSGATLYIEPQSVVELNNEIRELRNKEKEEIERILKELSAKVAKNLDFLKFLNKIFDTLDFIFARARYAKALKATFPLF 293 (780) T ss_pred CCCCCEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 07998588552999988899999999999999999999999999879999999999999999999999999779863127 Q ss_pred CCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 47871400004680588763102877044434563587777664399996778440789999999999999719853035 Q gi|254780750|r 607 DNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS 686 (920) Q Consensus 607 ~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~ 686 (920) ++++.|.++++|||+++. ..+||||+.|+ .+.+++|||||||||||++|||+||+++|||+|+||||. T Consensus 294 ~~~~~i~L~~aRHPLL~~-------~~vVP~di~l~-----~~~~~liITGPNtGGKTv~LKtvgL~~lMaq~Gl~vPa~ 361 (780) T PRK00409 294 NDEGKIDLRQARHPLLDG-------EKVVPKDIELG-----FDKTVLVITGPNTGGKTVTLKTLGLAALMAKSGLPIPAA 361 (780) T ss_pred CCCCEEEEEECCCCCCCC-------CCCCCCEEEEC-----CCCEEEEEECCCCCCCEEHHHHHHHHHHHHHHCCCCCCC T ss_conf 799808986035776366-------77257347866-----984189996898888563799999999999829997516 Q ss_pred H-CCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 3-206822105676523766113853289999999999995899856999325889880567999999999999726984 Q gi|254780750|r 687 Y-AHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCR 765 (920) Q Consensus 687 ~-a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~ 765 (920) . +++|+||+||++||..+||.+++|||+.+|..++.||.+|+++|||||||||+||+|.+|.|||.||+|||.++ +|+ T Consensus 362 e~s~~~~f~~i~adIGD~QSie~~LSTFS~hm~~i~~il~~a~~~sLVLlDElG~GTDP~EGaALa~aile~l~~~-~~~ 440 (780) T PRK00409 362 EPSEIPVFKEVFADIGDEQSIEQSLSTFSGHMTNIVRILEKADENSLVLFDELGAGTDPDEGAALAISILDYLRKR-GAK 440 (780) T ss_pred CCCCCCCCCEEEEEECCCHHHHHCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHC-CCE T ss_conf 8980423463899827712133265249999999999997389980881232358998456599999999999977-997 Q ss_pred EEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 99974875797664306885899999996099277877774478988778999998299989999999999 Q gi|254780750|r 766 GLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDIL 836 (920) Q Consensus 766 ~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~ 836 (920) ++++|||.+|+.++...+++.|++|++|.+. +..+|+|..|++++||+++||+..|||++||++|++.+ T Consensus 441 ~i~TTH~~~lK~~a~~~~~~~nas~~FD~~t--l~PtYrl~~G~pG~S~A~~IA~rlGlp~~ii~~A~~~l 509 (780) T PRK00409 441 IIATTHYKELKALKYNREGVENASVEFDEET--LRPTYRLLIGIPGRSNAFEIAKRLGLPPNIIEEAKKVY 509 (780) T ss_pred EEEECCHHHHHHHHHCCCCCEEEEEEECCCC--CCCCEEEECCCCCCCHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 9994776999999727998189888874023--78606870599976369999999297999999999885 No 9 >KOG0221 consensus Probab=100.00 E-value=0 Score=729.38 Aligned_cols=710 Identities=23% Similarity=0.344 Sum_probs=529.4 Q ss_pred CEEEEEEECCCEEEEEEEECCCCEEEEEECC-HHHHH----HHHHHCCCCEEEEECCCCCHHHHHHHH--HHCCCCEECC Q ss_conf 3699999529849999999786559996058-78999----998522985799707768967899887--4238815718 Q gi|254780750|r 151 NYLMAVARIRTEWAIAWIDISTGIFKISTSN-HDRLI----SDIMRIDPREIIFSEKELSHAEYKSLF--ETLGNVAVAQ 223 (920) Q Consensus 151 nyL~aI~~~~~~~Gia~iDisTG~~~~~~~~-~d~l~----~~L~~l~P~EIii~~~~~~~~~~~~l~--~~~~~~~~~~ 223 (920) --.+||....+..|+|++|.|+.....-... +++-. ..|--.+|.-++-+... .....+.+- .......... T Consensus 49 Eivlcv~f~~G~LG~ayyd~s~~tlk~m~d~~~~~~~~~l~r~ldd~~p~s~~~~~~q-d~~~i~fl~~~~s~~~vp~~~ 127 (849) T KOG0221 49 EIVLCVLFNSGYLGIAYYDTSDSTLKFMPDAPDHESLKLLQRVLDDINPQSVVTSAKQ-DENMIRFLGKLASQEHVPPKR 127 (849) T ss_pred HEEEEEEECCCEEEEEEEECCCHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH-HHHHHHHHCCCCCCCCCCCCC T ss_conf 1788987258711047630430577746785266677899887540375145656765-589999851478511378777 Q ss_pred CCCCCCHHHH----------------------HHHHHHHCCCC--CCCCH-----------H-HHHHHHHHHHHHHHHHH Q ss_conf 5211471344----------------------78899870878--64210-----------2-44434688877776436 Q gi|254780750|r 224 PNVFFDHSIS----------------------ESRIADYYNIT--TVDTF-----------G-SFSQVEKTAAAAAISYI 267 (920) Q Consensus 224 ~~~~f~~~~~----------------------~~~l~~~~~~~--~l~~~-----------~-~~~~~~~~a~~all~Yl 267 (920) +.|.|++... .....-+|..+ .-+.+ + ..-..+..+.++++.++ T Consensus 128 ~~~if~~~~t~~~ei~k~~~l~~~~~f~p~~l~~~~~~~f~~s~i~~D~l~t~~~~~~it~~~~~i~~~~r~~g~ll~fl 207 (849) T KOG0221 128 PEIIFLPSVTFGLEISKQRLLSGNYSFIPDALTATEKILFLSSIIPFDCLLTEEDKYIITKMRFDIEAVVRALGGLLKFL 207 (849) T ss_pred CCEEECCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHC T ss_conf 44244352015777777775187312382877545776403035652010488885699863144499998611487653 Q ss_pred HHHHCC-------CCCCCCCCCCCCCCCEEEECCHHHCCEEEEECCCC---CC----C-CHHHHHHCCCCCCHHHHHHHH Q ss_conf 874112-------22222113445745401425133201545412787---76----4-127876301234337899998 Q gi|254780750|r 268 KKTQMV-------NKPTIGYPEREDIQSTLFIDSAARSNLEILRTLSG---SR----E-QSLLKTIDYSITGAGGRLFAE 332 (920) Q Consensus 268 ~~~~~~-------~~~~i~~~~~~~~~~~m~LD~~Tl~nLEI~~~~~g---~~----~-gSL~~~Ln~t~T~~G~RlLr~ 332 (920) ..+... ....|..++.+...+.|.||.+|++.|.||.+-+. .+ . -|||+++|+|....|+|+||. T Consensus 208 ~~~rigv~l~~~~v~~pI~gik~f~l~~lv~iD~nTisaL~Ifp~e~~~~~~k~~~~~g~Slf~l~n~c~s~~g~k~Lr~ 287 (849) T KOG0221 208 GRRRIGVELEDYNVSVPILGIKKFMLTHLVNIDQNTISALQIFPSESHPSKVKSGLKEGLSLFGLLNRCHSKWGEKLLRL 287 (849) T ss_pred CCCEEEEEECCCCCCCCCCCEEEEEECCEEEECCCHHHHHHHCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 43413346325511121011158751232663422477877524355532124545232169999988756999999999 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHCC--CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 631002345678989999997402--212588999997520483444456775210000212068999998655675430 Q gi|254780750|r 333 RIASPLTDSTKINTRLDSINFFIQ--NPILINPIQKILKSSPDVPRSLSRLKIGRGEPKDIAVIRDGIRSGINILNLFLK 410 (920) Q Consensus 333 wL~~PL~d~~~I~~R~daVe~l~~--~~~~~~~l~~~L~~i~DleRll~ri~~~~~~p~dl~~L~~sl~~~~~i~~~l~~ 410 (920) |+.+|++|..+|..||++|.+|.. |.++...+...|++++|+--++.|+..|+++..+|-.++.++.+...+.+.+.. T Consensus 288 Wf~nPttd~~~l~sR~~~i~~fl~~qNa~~~~~Ls~~lgr~k~~~~~~~~~~sg~t~l~~W~~~~stv~~~~~i~~~~rs 367 (849) T KOG0221 288 WFTNPTTDLGELSSRLDVIQFFLLPQNADMAQMLSRLLGRIKNVPLILKRMKSGHTKLSDWQVLYSTVYSALGIRDACRS 367 (849) T ss_pred HHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCEECHHHHHHHHHHHHHHHHHHHHH T ss_conf 82288873787888999999984612069999999998524337899999854775211389999999999999999974 Q ss_pred CCCCHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC- Q ss_conf 258457787676--57767899999999985431012210460101685411455799886389999977687788738- Q gi|254780750|r 411 NDLPEELRGAVE--KLKKLPQSLEKTLSSMLSDDLPTFKRDGGFLRDGADASLDETRSLRDQSKRIIASLQLKYAEETK- 487 (920) Q Consensus 411 ~~~~~~l~~~~~--~l~~l~~~l~~~l~~~i~~~~~~~~~~~~~ik~g~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~~- 487 (920) ...+..+....+ .+.++ ..+...+...++-+ ........-+.+|+|++||+.|..+.+...++.+...++.+..+ T Consensus 368 lp~s~~~~~~~~~~~~~~l-~eia~~~g~vIdF~-~S~~~~r~Tv~~giD~elDE~r~~y~~lp~~Lt~vAr~e~~~L~~ 445 (849) T KOG0221 368 LPQSIQLFRDIAQEFSDDL-HEIASLIGKVIDFE-GSLAENRFTVLPGIDPELDEKRRRYMGLPSFLTEVARKELENLDS 445 (849) T ss_pred CCCCHHHHHHHHHHHHHHH-HHHHHHHHHEECCC-CCCCCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHCC T ss_conf 7431455667788889999-99999861102133-321136388548997678999999706508999999999986179 Q ss_pred -CCCCCCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf -8751102558622376204114333441116876527985212000111345666433464377777643344788876 Q gi|254780750|r 488 -IKNLKIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLELIDLENRITNATNSALLIELESFEILS 566 (920) Q Consensus 488 -i~sLki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~~~~~~~~~~e~~~~~~l~ 566 (920) ++++.+.|.+..||++.||.--.-....++...-.++.. ..---+|.+...+++.+.+.+..-++..-|..+.-.|. T Consensus 446 ~~psv~~VYIPliGfllsiprl~~~~~~~d~~~~~~~mf~--s~E~l~~rnart~eLD~~~GDIy~~i~D~et~i~~~Lq 523 (849) T KOG0221 446 RIPSVSVVYIPLIGFLLSIPRLPSMVEASDFENGLDFMFL--SEEKLHYRNARTKELDALLGDIYCEIRDQETLIMYQLQ 523 (849) T ss_pred CCCCEEEEEEECEEEEEECCCCCCHHHCCCCCCCHHHHHC--CCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9983369985020037853356415326782214477744--53005763340776888764688765136889999999 Q ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCC-EEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCC Q ss_conf 543011257988889998677887788877525885121047871-4000046805887631028770444345635877 Q gi|254780750|r 567 NAIIEQSESLDNASQVIAIIDISIALAILAKEQNYCRPIIDNSTN-FIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPN 645 (920) Q Consensus 567 ~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~~y~rP~i~~~~~-l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~ 645 (920) .++......+-+.....++||+|+|||..|.++||.||.++++.. +.|.+||||.+|... ..||||++..| T Consensus 524 ~qvl~rk~~lt~~l~laSrldvLls~a~~aa~~gy~~P~lv~e~~il~I~ngrh~l~e~~~-----dtfvPNst~ig--- 595 (849) T KOG0221 524 CQVLARKAVLTRVLDLASRLDVLLSLASAAADYGYSRPRLVPEVLILRIQNGRHPLMELCA-----DTFVPNSTEIG--- 595 (849) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCHHHHHHHH-----HHCCCCCEEEC--- T ss_conf 9999888789999888888999999998887639888735607889999707704999999-----73489860236--- Q ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 77664399996778440789999999999999719853035320682210567652376611385328999999999999 Q gi|254780750|r 646 DKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILN 725 (920) Q Consensus 646 ~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~ 725 (920) ++.+|+.||||||.+|||.||+||||+|.||||||||||+.|+||++|||||||++.|++..|+||||..+.+++..|+ T Consensus 596 -gdkgri~vITGpNasGKSiYlkqvglivfLahIGsFVPAe~A~IGivDrI~tri~s~esv~~gqSTFmiD~~Qva~aLr 674 (849) T KOG0221 596 -GDKGRIKVITGPNASGKSIYLKQVGLIVFLAHIGSFVPAEEAEIGIVDRIFTRIHSCESVSLGQSTFMIDLNQVAKALR 674 (849) T ss_pred -CCCCEEEEEECCCCCCCEEEEEECHHHHHHHHHCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -9884499985898888557886203565797643666455533250899998862144554003588884999999998 Q ss_pred HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC--EEEEECCCHHHHHHH--HHCCCEEEEEEEEE-EECCEEE Q ss_conf 589985699932588988056799999999999972698--499974875797664--30688589999999-6099277 Q gi|254780750|r 726 QATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRC--RGLLATHFHELTDLS--KSLKRFHNATLQVS-DSNEGII 800 (920) Q Consensus 726 ~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~--~~lfaTHy~eL~~l~--~~~~~v~n~~~~~~-~~~~~i~ 800 (920) +||.+||||+||+|+||.|.||.++-.+|+.|+..+..| +.+.+||||+|-.+- .+.+-+..+.|++. +..++|+ T Consensus 675 ~AT~~SLvlIDEfGKGT~tedGlsLlasvm~~w~~rg~~~PrifvcThfheL~ne~~L~~n~i~qfltm~vlr~~ge~I~ 754 (849) T KOG0221 675 NATAQSLVLIDEFGKGTNTEDGLSLLASVMRHWLARGPTCPRIFVCTHFHELVNEQLLPQNPIVQFLTMEVLREDGEDIV 754 (849) T ss_pred HHHCCCEEEEHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCEE T ss_conf 74037279873106876433118999999999983499997389830177751003477552666541999986068749 Q ss_pred EEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHHHHHCC Q ss_conf 87777447898877899999829998999999999999987630001111211111000011200354738999998607 Q gi|254780750|r 801 FLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFEKLYHHNQKDMRLYYPEIQTKESKNNLSKNDKLFVEKIKCL 880 (920) Q Consensus 801 flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 880 (920) |+||+.+|.+..|||+++|+.+|+|++||.||+++++.++...........+...+.+ .-+..++++..+ T Consensus 755 flyrv~~gl~k~sfal~~ak~~glp~~vV~Ra~~v~~ai~sg~~vk~~k~~l~~~q~~----------~~q~~Vdkf~~l 824 (849) T KOG0221 755 FLYRVCEGLAKASFALHTAKQAGLPDKVVARAKEVSDAIRSGKPVKPVKDLLKKNQME----------NCQTLVDKFMKL 824 (849) T ss_pred EEEEECCCHHHHCCCCHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH----------HHHHHHHHHHHH T ss_conf 9987345265423221157662999799999999999997699704666554055789----------899999999874 Q ss_pred CCCC Q ss_conf 8222 Q gi|254780750|r 881 NLDE 884 (920) Q Consensus 881 ~~~~ 884 (920) |++- T Consensus 825 Dl~l 828 (849) T KOG0221 825 DLEL 828 (849) T ss_pred CCCC T ss_conf 1347 No 10 >pfam00488 MutS_V MutS domain V. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam01624, pfam05188, pfam05192 and pfam05190. The mutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain V of Thermus aquaticus MutS, which contains a Walker A motif, and is structurally similar to the ATPase domain of ABC transporters. Probab=100.00 E-value=0 Score=587.45 Aligned_cols=233 Identities=56% Similarity=0.920 Sum_probs=223.7 Q ss_pred CCCEECCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 85121047871400004680588763102877044434563587777664399996778440789999999999999719 Q gi|254780750|r 601 YCRPIIDNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG 680 (920) Q Consensus 601 y~rP~i~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG 680 (920) ||||+|++++.+.|++||||++|... ..+|||||+.|+. +..+++||||||||||||||||+|++++|||+| T Consensus 1 y~~P~~~~~~~l~i~~~rHPll~~~~----~~~~VpNdi~l~~----~~~~~~iiTGpN~sGKSt~Lk~igl~~~lAq~G 72 (234) T pfam00488 1 YVRPEFTESDALHIEGGRHPVVEAVL----AENFVPNDVSLGK----ERSRILLITGPNMGGKSTYLRQVALIVIMAQIG 72 (234) T ss_pred CCCCEECCCCCEEEEECCCCEEECCC----CCCEECCEEEECC----CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 97887858996899976687697257----9976876589779----961699997887776199999999999999836 Q ss_pred CCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 85303532068221056765237661138532899999999999958998569993258898805679999999999997 Q gi|254780750|r 681 SYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHE 760 (920) Q Consensus 681 ~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~ 760 (920) |||||++|++++||+||||||..||+.+|+|||+.||.+++.||+++|++||||+||+||||++.||.|||+|++|||.+ T Consensus 73 ~~VPA~~a~~~~~d~I~~~i~~~dsl~~~~StF~~e~~~~~~il~~~~~~sLvliDEl~~GT~~~eg~al~~aile~L~~ 152 (234) T pfam00488 73 SFVPAESARIGIVDRIFTRIGASDDLASGRSTFMVEMLETANILHNATDKSLVILDELGRGTSTYDGLAIAWAVAEHLAE 152 (234) T ss_pred CCEEECCEEEECCCEEEEEECCCCHHHCCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 87422205996365599985675334466117999999999999738877322014245899834679999999999997 Q ss_pred HCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 26984999748757976643068858999999960992778777744789887789999982999899999999999998 Q gi|254780750|r 761 TNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFE 840 (920) Q Consensus 761 ~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le 840 (920) +.+|+++||||||+|+++....+++.|+||++++.+++++|+|||.+|+++.|||++||+++|+|++||+||++++++|| T Consensus 153 ~~~~~~i~tTH~~~L~~l~~~~~~v~~~~m~~~~~~~~~~f~Ykl~~G~~~~s~ai~ia~~~G~p~~ii~rA~~i~~~le 232 (234) T pfam00488 153 KIRARTLFATHYHELTKLAEKLPAVKNVHMAAVETNGDIVFLYKVKPGAADKSYGIHVAELAGLPESVVERAREVLAELE 232 (234) T ss_pred HCCCEEEEEECHHHHHHHHHHCCCCCCEEEEEEEECCEEEEEEEECCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 35977999713577999987476444148999998997869999775899983999999991969999999999999986 Q ss_pred H Q ss_conf 7 Q gi|254780750|r 841 K 841 (920) Q Consensus 841 ~ 841 (920) + T Consensus 233 ~ 233 (234) T pfam00488 233 D 233 (234) T ss_pred C T ss_conf 6 No 11 >cd03285 ABC_MSH2_euk MutS2 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=100.00 E-value=0 Score=563.04 Aligned_cols=220 Identities=45% Similarity=0.754 Sum_probs=207.4 Q ss_pred EECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 00004680588763102877044434563587777664399996778440789999999999999719853035320682 Q gi|254780750|r 613 IVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 613 ~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) .|++||||++|.. ....|||||+.|+. +..+++||||||||||||||||+|++++|||+||||||++|++++ T Consensus 1 ~ik~~RHPlle~~----~~~~~VpNdi~l~~----~~~~i~iiTGpN~sGKSt~lk~i~l~~~laq~G~~vpa~~~~~~~ 72 (222) T cd03285 1 VLKEARHPCVEAQ----DDVAFIPNDVTLTR----GKSRFLIITGPNMGGKSTYIRQIGVIVLMAQIGCFVPCDSADIPI 72 (222) T ss_pred CCCCCCCCEEECC----CCCCEECCEEEECC----CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEEEC T ss_conf 9667748659706----89977776689768----982599998999887189999999999999868754632389952 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 21056765237661138532899999999999958998569993258898805679999999999997269849997487 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) ||+||||||+.|++..|+||||+||.+++.||+++|++||||+||+||||+|.||.|||||+++||.++.+|+++||||| T Consensus 73 ~d~i~t~i~~~d~~~~~~StF~~e~~~~~~il~~~~~~sLvliDElg~GT~~~eg~aia~aile~L~~~~~~~~i~tTH~ 152 (222) T cd03285 73 VDCILARVGASDSQLKGVSTFMAEMLETAAILKSATENSLIIIDELGRGTSTYDGFGLAWAIAEYIATQIKCFCLFATHF 152 (222) T ss_pred CCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEC T ss_conf 76499998899710033528999999999999847677320002346898822679999999999985068509998200 Q ss_pred HHHHHHHHHCCCEEEEEEEEE--EECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 579766430688589999999--60992778777744789887789999982999899999999999998 Q gi|254780750|r 773 HELTDLSKSLKRFHNATLQVS--DSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFE 840 (920) Q Consensus 773 ~eL~~l~~~~~~v~n~~~~~~--~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le 840 (920) |+|+.++...+++.|+||.+. +.+++++|+|||.+|+|++|||+++|+++|+|++||+||++++++|| T Consensus 153 ~~L~~l~~~~~~v~~~~~~~~~~~~~~~l~f~Ykl~~G~~~~s~ai~iA~~~G~p~~ii~rA~~~~~~lE 222 (222) T cd03285 153 HELTALADEVPNVKNLHVTALTDDASRTLTMLYKVEKGACDQSFGIHVAELANFPKEVIEMAKQKALELE 222 (222) T ss_pred HHHHHHHHCCCCCEEEEEEEEEEECCCCEEEEEEECCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHCC T ss_conf 7799986408551357899999605992789999676899984999999992959999999999999629 No 12 >cd03284 ABC_MutS1 MutS1 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clam Probab=100.00 E-value=0 Score=558.59 Aligned_cols=216 Identities=58% Similarity=0.987 Sum_probs=206.7 Q ss_pred EECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 00004680588763102877044434563587777664399996778440789999999999999719853035320682 Q gi|254780750|r 613 IVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 613 ~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) +|++||||++|..+. .+.|||||+.++ ++.+++||||||||||||||||||++++|||+||||||++|++++ T Consensus 1 ~i~~~RHPlle~~~~---~~~~VpNdi~l~-----~~~~~~iiTGpN~sGKSt~Lk~igl~~ilaq~G~~vpA~~a~i~~ 72 (216) T cd03284 1 EIEGGRHPVVEQVLD---NEPFVPNDTELD-----PERQILLITGPNMAGKSTYLRQVALIALLAQIGSFVPASKAEIGV 72 (216) T ss_pred CCCCCCCCEEEEECC---CCCEECCEEEEC-----CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCEEEECCEEEEE T ss_conf 977586863851058---897776568978-----984599998998774599999999999999868758766159970 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 21056765237661138532899999999999958998569993258898805679999999999997269849997487 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) ||+||||||+.|++..|+|||++||.+++.||++||++||||+||+||||++.||.|||||++|+|+++.+|+++||||| T Consensus 73 ~d~I~t~i~~~d~i~~~~StF~~e~~~~~~il~~a~~~sLvliDEl~~gT~~~eg~ala~aile~L~~~~~~~~i~tTH~ 152 (216) T cd03284 73 VDRIFTRIGASDDLAGGRSTFMVEMVETANILNNATERSLVLLDEIGRGTSTYDGLSIAWAIVEYLHEKIGAKTLFATHY 152 (216) T ss_pred CCEEEEEECCCCHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECH T ss_conf 44699950686213337265899999999999848776156334456899857889999999999997079728975050 Q ss_pred HHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 5797664306885899999996099277877774478988778999998299989999999999 Q gi|254780750|r 773 HELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDIL 836 (920) Q Consensus 773 ~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~ 836 (920) |+|+.++...++++|+||.+++.+++++|+|||.+|+|++|||+++|+++|+|++||+||++++ T Consensus 153 ~~L~~l~~~~~~v~n~~~~~~~~~~~~~f~Ykl~~G~~~~s~ai~~A~~~glp~~ii~rA~eil 216 (216) T cd03284 153 HELTELEGKLPRVKNFHVAVKEKGGGVVFLHKIVEGAADKSYGIEVARLAGLPEEVIERAREIL 216 (216) T ss_pred HHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEECCCCCCCCHHHHHHHHHCCCHHHHHHHHHHC T ss_conf 8899987628661566789997099571887976489997099999999295999999999659 No 13 >TIGR01069 mutS2 MutS2 family protein; InterPro: IPR005747 Mismatch repair contributes to the overall fidelity of DNA replication and is essential for combating the adverse effects of damage to the genome. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex. The post-replicative Mismatch Repair System (MMRS) of Escherichia coli involves MutS (Mutator S), MutL and MutH proteins, and acts to correct point mutations or small insertion/deletion loops produced during DNA replication . MutS and MutL are involved in preventing recombination between partially homologous DNA sequences. The assembly of MMRS is initiated by MutS, which recognises and binds to mispaired nucleotides and allows further action of MutL and MutH to eliminate a portion of newly synthesized DNA strand containing the mispaired base . MutS can also collaborate with methyltransferases in the repair of O(6)-methylguanine damage, which would otherwise pair with thymine during replication to create an O(6)mG:T mismatch . MutS exists as a dimer, where the two monomers have different conformations and form a heterodimer at the structural level . Only one monomer recognises the mismatch specifically and has ADP bound. Non-specific major groove DNA-binding domains from both monomers embrace the DNA in a clamp-like structure. Mismatch binding induces ATP uptake and a conformational change in the MutS protein, resulting in a clamp that translocates on DNA. MutS is a modular protein with a complex structure , and is composed of: N-terminal mismatch-recognition domain, which is similar in structure to tRNA endonuclease. Connector domain, which is similar in structure to Holliday junction resolvase ruvC. Core domain, which is composed of two separate subdomains that join together to form a helical bundle; from within the core domain, two helices act as levers that extend towards (but do not touch) the DNA. Clamp domain, which is inserted between the two subdomains of the core domain at the top of the lever helices; the clamp domain has a beta-sheet structure. ATPase domain (connected to the core domain), which has a classical Walker A motif. HTH (helix-turn-helix) domain, which is involved in dimer contacts. Homologues of MutS have been found in many species including eukaryotes (MSH 1, 2, 3, 4, 5, and 6 proteins), archaea and bacteria, and together these proteins have been grouped into the MutS family. Although many of these proteins have similar activities to the E. coli MutS, there is significant diversity of function among the MutS family members. This diversity is even seen within species, where many species encode multiple MutS homologues with distinct functions . Inter-species homologues may have arisen through frequent ancient horizontal gene transfer of MutS (and MutL) from bacteria to archaea and eukaryotes via endosymbiotic ancestors of mitochondria and chloroplasts . This entry represents a family of MutS proteins.; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0045005 maintenance of fidelity during DNA-dependent DNA replication. Probab=100.00 E-value=0 Score=529.08 Aligned_cols=495 Identities=22% Similarity=0.320 Sum_probs=400.5 Q ss_pred HHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH-HHHHHHHHHHHHCCHHHHHHHHHHHH-HCHHH- Q ss_conf 7876301234337899998631002345678989999997402212-58899999752048344445677521-00002- Q gi|254780750|r 314 LLKTIDYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFIQNPI-LINPIQKILKSSPDVPRSLSRLKIGR-GEPKD- 390 (920) Q Consensus 314 L~~~Ln~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~-~~~~l~~~L~~i~DleRll~ri~~~~-~~p~d- 390 (920) +=.++.+|.|+.|+=.++. +.|..+++++....+-+..+..=.. .+. .-.|.++-||--.|+|.-++- ..+-| T Consensus 15 ~e~~~~~t~t~l~~~~~~~--L~p~~~~~e~~~~l~~~~~~~~I~~s~~~--n~~f~g~~DIr~~L~rae~~~~~~~le~ 90 (834) T TIGR01069 15 KENLLKYTATPLGKEKALE--LKPPKSVEESKELLEKLKALVSIENSLRS--NVRFKGLADIREALKRAELGGIVVGLEE 90 (834) T ss_pred HHHHHHHHCCHHHHHHHHH--CCCCCCHHHHHHHHHHHHHHHHHHHCCCC--CCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9887575138045899860--87954078999999988777778732012--6554442144668744456435676899 Q ss_pred HHHHHHHHHHHHHHHH----HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCC-CCCHHHHHH Q ss_conf 1206899999865567----54302584577876765776789999999998543101221046010168-541145579 Q gi|254780750|r 391 IAVIRDGIRSGINILN----LFLKNDLPEELRGAVEKLKKLPQSLEKTLSSMLSDDLPTFKRDGGFLRDG-ADASLDETR 465 (920) Q Consensus 391 l~~L~~sl~~~~~i~~----~l~~~~~~~~l~~~~~~l~~l~~~l~~~l~~~i~~~~~~~~~~~~~ik~g-~d~eLD~lr 465 (920) ++.+...|.....+.. ...+....+.|......+..++ .+++.|...|++ .|.++++ ++++||.+| T Consensus 91 ~l~I~~~l~~~~~lk~fit~~~~~~~~~~~L~~~~~~~~~L~-~L~n~i~~~iD~--------~G~v~~~GAs~~L~~ir 161 (834) T TIGR01069 91 ILKIQNLLKKVKELKRFITRLAEERLDLEILSKLLLKLITLS-PLENDIIECIDD--------DGKVKDEGASEELDEIR 161 (834) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCC-HHHHHHHHHCCC--------CCCHHHHHCCHHHHHHH T ss_conf 999999999989999999999731244268999998424484-689999974285--------41004222046789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCC-CCCC--CCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHH-HH Q ss_conf 9886389999977687788738-8751--102558622376204114333441116876527985212000111345-66 Q gi|254780750|r 466 SLRDQSKRIIASLQLKYAEETK-IKNL--KIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLE-LI 541 (920) Q Consensus 466 ~~~~~~~~~l~~l~~~~~~~~~-i~sL--ki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~e-L~ 541 (920) ......++.+.+-...+..+.. -+.| .+.+.+.-.|.|-|.......++ +.++..|.+|.+-|+.|+ ++ T Consensus 162 ~~l~~l~~~~~~~l~~~i~~~~~a~~L~d~~vt~r~~r~vlp~K~~~~~~i~-------GiV~~~SsSG~T~Y~EPq~iv 234 (834) T TIGR01069 162 ESLKALEERVKKRLKKIIRSKELAKYLSDTIVTIRNDRYVLPLKSGFKGKIK-------GIVHDTSSSGETLYVEPQAIV 234 (834) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCEEEECCEEEEEEECCCCCCCC-------CEEEECCCCCCCCEECHHHHH T ss_conf 9999999999999998734433455440022265267389866615300147-------616520587750101628999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCC---CCEEECCCC Q ss_conf 64334643777776433447888765430112579888899986778877888775258851210478---714000046 Q gi|254780750|r 542 DLENRITNATNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKEQNYCRPIIDNS---TNFIVKDGR 618 (920) Q Consensus 542 ~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~~y~rP~i~~~---~~l~i~~gR 618 (920) ++++++.++..+......++++.|.+.+.+|...|....+.+..+|++.|.+.+|...+...|.+.+. ..+.+.++| T Consensus 235 ~~nnkL~~~~~~~~~~i~~iLr~Ls~~~~~~~~~L~~l~~~~~~lD~~~Ar~ry~~~~~~~fP~~~~tGd~k~~~L~~ar 314 (834) T TIGR01069 235 KLNNKLAQLKNEEEEEIEKILRTLSEKVQEYLLELKELFKEFLELDSLQARARYAKAVKAEFPEISNTGDEKVIELEEAR 314 (834) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEHHHCC T ss_conf 99889999877888999999998889999999999999999988658999988778734726778988863112450177 Q ss_pred CHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH-H-CCCCCCCEE Q ss_conf 80588763102877044434563587777664399996778440789999999999999719853035-3-206822105 Q gi|254780750|r 619 HPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS-Y-AHIGIVDKL 696 (920) Q Consensus 619 HPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~-~-a~i~~~D~I 696 (920) ||++...=...++...||+++.|+ .+.++++|||||.||||+.|||+||.++|||.|.||||. . .++++|+.| T Consensus 315 HPlL~~~kE~eGGPavv~~~~~l~-----~~~~~~~iTGPNtGGKTVtLKTLGL~~lm~q~gipi~~~e~hS~~p~F~~i 389 (834) T TIGR01069 315 HPLLKESKEDEGGPAVVPVTLELK-----SEKRVLIITGPNTGGKTVTLKTLGLLALMAQSGIPIPAKESHSELPLFEEI 389 (834) T ss_pred CCCCCCCCCCCCCCEEEECCCCCC-----CCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCHHHHHH T ss_conf 989887432257964664462316-----641178886779961789999999999998727833117862211101212 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC------------------CCEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 676523766113853289999999999995899------------------85699932588988056799999999999 Q gi|254780750|r 697 FSRVGSADNLASGRSTFMVEMIETASILNQATN------------------QSFVILDEIGRGTATLDGLSIAWATIEYL 758 (920) Q Consensus 697 ftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~------------------~SLVllDElGrGTst~DG~aiA~aile~l 758 (920) |+.||...||.++.|||...|...+.||.+.++ +||||+||||-||++.+|.|||.|+|||| T Consensus 390 faDIGDEQSi~qnLSTFS~Hm~~i~~iL~~~~eGvqdvldPeidsPnhPifnsLVl~DE~gaGTDp~EGsALai~iLe~L 469 (834) T TIGR01069 390 FADIGDEQSIEQNLSTFSSHMKNISEILKKVDEGVQDVLDPEIDSPNHPIFNSLVLFDELGAGTDPDEGSALAISILEYL 469 (834) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 01358842424337789999999999972144013330265668889874202566633015788868999999999998 Q ss_pred HHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 972698499974875797664306885899999996099277877774478988778999998299989999999999 Q gi|254780750|r 759 HETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDIL 836 (920) Q Consensus 759 ~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~ 836 (920) ++. .|+++.+|||.+|..+..+..+|.|+.|.+|.+.- ..+|+|..|+.++||+++||..+|||..||++|++.. T Consensus 470 ~~~-n~~v~~TTHy~~LK~~~y~~~~v~nASv~fD~Etl--~PtY~ll~g~~G~S~AF~iA~r~G~P~~iie~AK~~~ 544 (834) T TIGR01069 470 LKQ-NALVLITTHYKELKAYAYNNEGVENASVLFDEETL--SPTYKLLFGVPGRSYAFEIAQRLGIPSEIIEQAKEKY 544 (834) T ss_pred HCC-CCEEEEECCCHHHHHHCCCCCCCCHHHHHHCHHHC--CCCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 538-98499973534652200246640037766233422--7431220037871068899998479888999986742 No 14 >cd03287 ABC_MSH3_euk MutS3 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=100.00 E-value=0 Score=537.19 Aligned_cols=213 Identities=47% Similarity=0.838 Sum_probs=198.5 Q ss_pred EEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCC Q ss_conf 40000468058876310287704443456358777766439999677844078999999999999971985303532068 Q gi|254780750|r 612 FIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIG 691 (920) Q Consensus 612 l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~ 691 (920) |.|++||||++|.+. .+.|||||+.++. +..+++||||||||||||||||+|++++|||+||||||++|+++ T Consensus 1 i~ik~~RHPlle~~~----~~~~VpNdi~l~~----~~~~~~iiTGpN~~GKSt~lk~i~l~~ilaq~G~~vpa~~~~~~ 72 (222) T cd03287 1 ILIKEGRHPMIESLL----DKSFVPNDIHLSA----EGGYCQIITGPNMGGKSSYIRQVALITIMAQIGSFVPASSATLS 72 (222) T ss_pred CEEECCCCCEEECCC----CCCEECCEEEECC----CCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHCEEE T ss_conf 937267486784047----9976877479768----88448999789988728999999999999972677214563996 Q ss_pred CCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 22105676523766113853289999999999995899856999325889880567999999999999726984999748 Q gi|254780750|r 692 IVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATH 771 (920) Q Consensus 692 ~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTH 771 (920) +||+||||||..|++..|+|||+.||.+++.|++++|++||||+||+||||++.||.|||||++|||.+..+|+++|+|| T Consensus 73 ~~d~i~~~i~~~d~~~~~~StF~~e~~~~~~il~~~~~~sLvliDEl~~GT~~~eg~ala~aile~l~~~~~~~~i~tTH 152 (222) T cd03287 73 IFDSVLTRMGASDSIQHGMSTFMVELSETSHILSNCTSRSLVILDELGRGTSTHDGIAIAYATLHYLLEEKKCLVLFVTH 152 (222) T ss_pred CCCEEEEECCCCCHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 35759995067862344522799999999999986788754423224689983577999999999998605889999476 Q ss_pred CHHHHHHHHHCC-CEEEEEEEEEE--------ECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHH Q ss_conf 757976643068-85899999996--------09927787777447898877899999829998999999 Q gi|254780750|r 772 FHELTDLSKSLK-RFHNATLQVSD--------SNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRA 832 (920) Q Consensus 772 y~eL~~l~~~~~-~v~n~~~~~~~--------~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A 832 (920) ||+|+++....+ .++|+||.+.+ .+++++|+|||.+|+|++||||+|||++|+|++||+|| T Consensus 153 ~~~L~~l~~~~~~~v~n~~~~~~~~~~~~~~~~~~~l~f~YkL~~G~~~~s~ai~iA~~~GlP~~ii~rA 222 (222) T cd03287 153 YPSLGEILRRFEGSIRNYHMSYLESQKDFETSDSQSITFLYKLVRGLASRSFGLNVARLAGLPKSIISRA 222 (222) T ss_pred CHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHCCCHHHHHCC T ss_conf 2789999875878457888889873044444668803288587779999719999999949199998029 No 15 >cd03286 ABC_MSH6_euk MutS6 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=100.00 E-value=0 Score=528.55 Aligned_cols=211 Identities=45% Similarity=0.743 Sum_probs=197.8 Q ss_pred ECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCC Q ss_conf 00046805887631028770444345635877776643999967784407899999999999997198530353206822 Q gi|254780750|r 614 VKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIV 693 (920) Q Consensus 614 i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~ 693 (920) ++++|||++|.. ...+|||||+.|+. ++.+++||||||||||||||||||+++||||+||||||++|++++| T Consensus 2 lk~~RHPll~~~----~~~~~VpNdi~l~~----~~~~~~iiTGpN~sGKSt~lk~i~l~~ilaq~G~~vpA~~~~~~~~ 73 (218) T cd03286 2 FEELRHPCLNAS----TASSFVPNDVDLGA----TSPRILVLTGPNMGGKSTLLRTVCLAVIMAQMGMDVPAKSMRLSLV 73 (218) T ss_pred CCCCCCCEEECC----CCCCEECCEEEECC----CCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEECCCCEEECC T ss_conf 766758779726----89975875688679----9740899989998873899999999999998288430146477346 Q ss_pred CEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 10567652376611385328999999999999589985699932588988056799999999999972698499974875 Q gi|254780750|r 694 DKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 694 D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) |+||||||+.||+..|+||||.||.+++.||++++++||||+||+||||++.||.|||+|++|||.++.+|+++|+|||| T Consensus 74 d~i~~~i~~~d~~~~~~StF~~e~~~~~~il~~~~~~sLvllDE~~~GT~~~eg~ala~aile~L~~~~~~~~i~tTH~~ 153 (218) T cd03286 74 DRIFTRIGARDDIMKGESTFMVELSETANILRHATPDSLVILDELGRGTSTHDGYAIAHAVLEYLVKKVKCLTLFSTHYH 153 (218) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH T ss_conf 64899745866143115069999999999998679985010255468998116799999999999863797699976728 Q ss_pred HHHHHHHHCCCEEEEEEEEEEE------CCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHH Q ss_conf 7976643068858999999960------9927787777447898877899999829998999999 Q gi|254780750|r 774 ELTDLSKSLKRFHNATLQVSDS------NEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRA 832 (920) Q Consensus 774 eL~~l~~~~~~v~n~~~~~~~~------~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A 832 (920) +|+++....++++++||.+..+ +++++|+|||.+|+|++|||++||+++|+|++||+|| T Consensus 154 el~~~~~~~~~v~~~~m~~~~~~~~~~~~~~l~f~YkL~~G~~~~s~ai~iA~~~G~p~~ii~rA 218 (218) T cd03286 154 SLCDEFHEHGGVRLGHMACAVKNESDPTIRDITFLYKLVAGICPKSYGLYVALMAGIPDGVVERA 218 (218) T ss_pred HHHHHHHHCCCCEEEEEEEEEECCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHCCCHHHHHCC T ss_conf 99998864889189889999963677878713288698879998719999999959099999229 No 16 >cd03282 ABC_MSH4_euk MutS4 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=100.00 E-value=0 Score=507.47 Aligned_cols=202 Identities=38% Similarity=0.647 Sum_probs=189.0 Q ss_pred EECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 00004680588763102877044434563587777664399996778440789999999999999719853035320682 Q gi|254780750|r 613 IVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 613 ~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) .|++||||++|.. .++|||||+.++. +..+++||||||||||||||||+|++++|||+||||||++|++++ T Consensus 1 iik~~rHPlle~~-----~~~~VpNdi~l~~----~~~~~~iITGpN~gGKSt~Lktigl~~ilAq~G~~vpA~~a~~~~ 71 (204) T cd03282 1 IIRDSRHPILDRD-----KKNFIPNDIYLTR----GSSRFHIITGPNMSGKSTYLKQIALLAIMAQIGCFVPAEYATLPI 71 (204) T ss_pred CCCCCCCCEEECC-----CCCEECCEEEECC----CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCEEEHHHCCCCC T ss_conf 9557708638568-----8967876688669----972599998999887199999999999999968916713210336 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 21056765237661138532899999999999958998569993258898805679999999999997269849997487 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) ||+||||||..||+.+|+|||+.||.+++.|++.++++||||+||+||||+|.||.|+|||++++|.+. +|+++||||| T Consensus 72 ~d~I~t~i~~~d~i~~~~StF~~E~~~~~~il~~~~~~sLvliDEl~~GT~~~eg~a~a~aile~l~~~-~~~~i~tTH~ 150 (204) T cd03282 72 FNRLLSRLSNDDSMERNLSTFASEMSETAYILDYADGDSLVLIDELGRGTSSADGFAISLAILECLIKK-ESTVFFATHF 150 (204) T ss_pred CCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHC-CCEEEEECCH T ss_conf 677899987666100664589999999999998588898798543348998678999999999999967-9879998976 Q ss_pred HHHHHHHHHCCCEEEEEEEEEE-ECCEEEEEEEEEECCCC-CCHHHHHHHHCCC Q ss_conf 5797664306885899999996-09927787777447898-8778999998299 Q gi|254780750|r 773 HELTDLSKSLKRFHNATLQVSD-SNEGIIFLHKVIPGIAD-HSYGIQVGKLAGL 824 (920) Q Consensus 773 ~eL~~l~~~~~~v~n~~~~~~~-~~~~i~flykl~~G~~~-~Sygi~vA~laG~ 824 (920) |+|+++.+..+++.|+||++.+ .+++++|+|||.+|++. +||||++||+||+ T Consensus 151 ~~L~~l~~~~~~v~n~~m~~~~~~~~~l~f~YkL~~G~~~~~~ygi~~ar~~~l 204 (204) T cd03282 151 RDIAAILGNKSCVVHLHMKAQSINSNGIEMAYKLVLGLYRIVDDGIRFVRVLAL 204 (204) T ss_pred HHHHHHHHHCCCEEEEEEEEEEEECCEEEEEEEECCCCCCCCCCHHHHHHHCCC T ss_conf 899998863888499999999973992889879752788886355999997479 No 17 >cd03281 ABC_MSH5_euk MutS5 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=100.00 E-value=0 Score=500.91 Aligned_cols=203 Identities=41% Similarity=0.714 Sum_probs=186.4 Q ss_pred EECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 00004680588763102877044434563587777664399996778440789999999999999719853035320682 Q gi|254780750|r 613 IVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 613 ~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) +|++||||++|... .+|||||+.|+. +..+++||||||||||||||||+|++++|||+||||||++|++++ T Consensus 1 ~i~~~RHPlle~~~-----~~~VpNdi~l~~----~~~~~~iiTGpN~gGKSt~lkti~l~~ilaq~G~~vpa~~~~~~~ 71 (213) T cd03281 1 EIQGGRHPLLELFV-----DSFVPNDTEIGG----GGPSIMVITGPNSSGKSVYLKQVALIVFLAHIGSFVPADSATIGL 71 (213) T ss_pred CCCCCCCCEEECCC-----CCEECCEEEECC----CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEECCEEEEC T ss_conf 97757586582567-----964676589779----982599998999876599999999999999858857674039972 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC--EEEEEC Q ss_conf 210567652376611385328999999999999589985699932588988056799999999999972698--499974 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRC--RGLLAT 770 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~--~~lfaT 770 (920) ||+||||||..|++.+|+|||++||.+++.||++++++||||+||+||||++.||.|||||++|||.+...+ +++||| T Consensus 72 ~~~i~~~i~~~d~~~~~~StF~~E~~~~~~il~~~~~~sLvliDEl~~GT~~~eg~ala~a~le~l~~~~~~~~~~~~tT 151 (213) T cd03281 72 VDKIFTRMSSRESVSSGQSAFMIDLYQVSKALRLATRRSLVLIDEFGKGTDTEDGAGLLIATIEHLLKRGPECPRVIVST 151 (213) T ss_pred CCCEEEEECCHHHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEC T ss_conf 12237676764344413258999999999999858888658773134788835679999999999996588766499989 Q ss_pred CCHHHHHHH--HHCCCEEEEEEEEE------EECCEEEEEEEEEECCCCCCHHHHHHHHCCC Q ss_conf 875797664--30688589999999------6099277877774478988778999998299 Q gi|254780750|r 771 HFHELTDLS--KSLKRFHNATLQVS------DSNEGIIFLHKVIPGIADHSYGIQVGKLAGL 824 (920) Q Consensus 771 Hy~eL~~l~--~~~~~v~n~~~~~~------~~~~~i~flykl~~G~~~~Sygi~vA~laG~ 824 (920) |||+|+++. .+.+.+.++||++. ..+++++|+|||.+|+|++|||++||+++|+ T Consensus 152 H~~~L~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~l~f~YkL~~G~~~~S~ai~vA~~~Gl 213 (213) T cd03281 152 HFHELFNRSLLPERLKIKFLTMEVLLNPTSTSPNEDITYLYRLVPGLADTSFAIHCAKLAGI 213 (213) T ss_pred CHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHCCC T ss_conf 72787654401656443788999986364411288485987878689986099999998589 No 18 >cd03280 ABC_MutS2 MutS2 homologs in bacteria and eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, a Probab=100.00 E-value=0 Score=479.52 Aligned_cols=199 Identities=27% Similarity=0.491 Sum_probs=186.4 Q ss_pred EECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH-HCCCC Q ss_conf 00004680588763102877044434563587777664399996778440789999999999999719853035-32068 Q gi|254780750|r 613 IVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS-YAHIG 691 (920) Q Consensus 613 ~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~-~a~i~ 691 (920) .|++||||+++. ..+.|||||+.++ .+.+++||||||||||||||||+|+++||||+||||||+ .+.+| T Consensus 1 ki~~~rHPll~~-----~~~~~V~Ndi~l~-----~~~~~~iiTGpN~sGKSt~lk~i~l~~iLAq~G~~vpa~~~~~~~ 70 (200) T cd03280 1 RLREARHPLLPL-----QGEKVVPLDIQLG-----ENKRVLVITGPNAGGKTVTLKTLGLLTLMAQSGLPIPAAEGSSLP 70 (200) T ss_pred CCCCCCCCEEEC-----CCCCEECCEEEEC-----CCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEECCCEECC T ss_conf 987772885975-----7895476258977-----993399998898775099999999999999977780011004726 Q ss_pred CCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 22105676523766113853289999999999995899856999325889880567999999999999726984999748 Q gi|254780750|r 692 IVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATH 771 (920) Q Consensus 692 ~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTH 771 (920) +||+||||||..|++..|+|||+.||.+++.|+++++++||||+||+||||++.||.|+|||++++|.++ +|+++|+|| T Consensus 71 ~~d~i~~~i~~~d~~~~~~S~F~~E~~~~~~il~~~~~~sLvliDE~~~gTn~~eg~~~a~a~l~~l~~~-~~~~i~tTH 149 (200) T cd03280 71 VFENIFADIGDEQSIEQSLSTFSSHMKNIARILQHADPDSLVLLDELGSGTDPVEGAALAIAILEELLER-GALVIATTH 149 (200) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHC-CCEEEEECC T ss_conf 6567888846733666778799999999999998588888797556668988789999999999999857-997999897 Q ss_pred CHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCC Q ss_conf 75797664306885899999996099277877774478988778999998299 Q gi|254780750|r 772 FHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGL 824 (920) Q Consensus 772 y~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~ 824 (920) ||+|+.+....+++.|+||+++++ ++.|+|||++|+|++|||++||+++|+ T Consensus 150 ~~eL~~l~~~~~~v~~~~~~~d~~--~l~~~YKl~~G~~~~s~al~ia~~~Gi 200 (200) T cd03280 150 YGELKAYAYKREGVENASMEFDPE--TLKPTYRLLIGVPGRSNALEIARRLGL 200 (200) T ss_pred CHHHHHHHHHCCCEEEEEEEEECC--CCEEEEEEEECCCCCCHHHHHHHHCCC T ss_conf 178998775135627889999888--267989981489998699999998589 No 19 >cd03243 ABC_MutS_homologs The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA stru Probab=100.00 E-value=0 Score=479.01 Aligned_cols=202 Identities=49% Similarity=0.837 Sum_probs=192.7 Q ss_pred EECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 00004680588763102877044434563587777664399996778440789999999999999719853035320682 Q gi|254780750|r 613 IVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 613 ~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) +|++||||++|.... .+.|||||+.|+ ..+++||||||||||||||||+|++++|||+||||||++|++++ T Consensus 1 ~i~~~rHPlle~~~~---~~~~VpNdi~l~------~~~~~iiTGpN~sGKSt~lkti~l~~~laq~G~~vpa~~~~~~~ 71 (202) T cd03243 1 EIKGGRHPVLLALTK---GETFVPNDINLG------SGRLLLITGPNMGGKSTYLRSIGLAVLLAQIGCFVPAESASIPL 71 (202) T ss_pred CCCCCCCCEEEEECC---CCCEECCEEEEC------CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCEEEECCCEEEC T ss_conf 977574866853437---897787828867------98289998998875399999999999999838737204468944 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 21056765237661138532899999999999958998569993258898805679999999999997269849997487 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) ||+|||+|+..|++..|+|||+.||.+++.|+++++++||||+||+|+||++.||.|||+|++++|.++ +|+++||||| T Consensus 72 ~~~i~~~~~~~d~~~~~~S~F~~e~~~~~~i~~~~~~~slvliDE~~~gT~~~eg~~la~a~l~~l~~~-~~~~i~tTH~ 150 (202) T cd03243 72 VDRIFTRIGAEDSISDGRSTFMAELLELKEILSLATPRSLVLIDELGRGTSTAEGLAIAYAVLEHLLEK-GCRTLFATHF 150 (202) T ss_pred CCEEEEEECCCHHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHC-CCEEEEEECC T ss_conf 666999846602444353549999999999998677777242052347998678799999999999853-6849998253 Q ss_pred HHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCC Q ss_conf 5797664306885899999996099277877774478988778999998299 Q gi|254780750|r 773 HELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGL 824 (920) Q Consensus 773 ~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~ 824 (920) |+|+++....++++|+||+++..+++++|+|||.+|+|++|||+++||++|+ T Consensus 151 ~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~Ykl~~G~~~~s~ai~ia~~~Gl 202 (202) T cd03243 151 HELADLPEQVPGVKNLHMEELITTGGLTFTYKLIDGICDPSYALQIAELAGL 202 (202) T ss_pred HHHHHHHHCCCCEEEEEEEEEEECCEEEEEEEECCCCCCCCHHHHHHHHCCC T ss_conf 8888753138974999999999799086988988899986599999998589 No 20 >cd03283 ABC_MutS-like MutS-like homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form slid Probab=100.00 E-value=0 Score=465.82 Aligned_cols=197 Identities=30% Similarity=0.433 Sum_probs=184.3 Q ss_pred EECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 00004680588763102877044434563587777664399996778440789999999999999719853035320682 Q gi|254780750|r 613 IVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 613 ~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) +++++|||+++. +++||||+.|+ ..+++||||||||||||||||+|++++|||+||||||+++.+++ T Consensus 1 E~k~~rHPll~~-------~~~V~Ndi~l~------~~~~~iiTGpN~~GKSt~Lk~i~l~~ilaq~G~~vpa~~~~~~~ 67 (199) T cd03283 1 EAKNLGHPLIGR-------EKRVANDIDME------KKNGILITGSNMSGKSTFLRTIGVNVILAQAGAPVCASSFELPP 67 (199) T ss_pred CCCCCCCCCCCC-------CCCCCCCEEEC------CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCEEEHHHEEECC T ss_conf 944368883478-------98327725767------98589998999986599999999999999968938624478233 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC--CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEC Q ss_conf 2105676523766113853289999999999995899--85699932588988056799999999999972698499974 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATN--QSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLAT 770 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~--~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaT 770 (920) ++|||||+..||+..|+|||+.||.+++.|++.+++ +||||+||+||||++.||.|||+|++|||.++ +|+++||| T Consensus 68 -~~i~t~i~~~d~~~~~~S~F~~E~~~~~~il~~~~~~~~sLvliDEl~~GT~~~eg~a~a~a~le~l~~~-~~~~iitT 145 (199) T cd03283 68 -VKIFTSIRVSDDLRDGISYFYAELRRLKEIVEKAKKGEPVLFLLDEIFKGTNSRERQAASAAVLKFLKNK-NTIGIIST 145 (199) T ss_pred -CEEEEEEECCCHHHCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHC-CCEEEEEC T ss_conf -0899998347512245357999999999999972489857996143247998678999999999999867-98799988 Q ss_pred CCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCC Q ss_conf 875797664306885899999996099277877774478988778999998299 Q gi|254780750|r 771 HFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGL 824 (920) Q Consensus 771 Hy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~ 824 (920) |||+|+.+.+..+.+.|+||.+...+++++|+|||.+|+|+.|||++|||++|+ T Consensus 146 H~~~l~~~~~~~~~v~~~~~~~~~~~~~~~~~Ykl~~G~~~~s~a~~vak~~Gl 199 (199) T cd03283 146 HDLELADLLDLDSAVRNYHFREDIDDNKLIFDYKLKPGVSPTRNALRLMKKIGI 199 (199) T ss_pred CCHHHHHHHCCCCCEEEEEEEEEEECCEEEEEEEECCCCCCCCHHHHHHHHCCC T ss_conf 728988643248773899999999999871885977689987199999998589 No 21 >smart00534 MUTSac ATPase domain of DNA mismatch repair MUTS family. Probab=100.00 E-value=0 Score=466.87 Aligned_cols=185 Identities=55% Similarity=0.863 Sum_probs=180.7 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCC Q ss_conf 99996778440789999999999999719853035320682210567652376611385328999999999999589985 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQS 731 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~S 731 (920) ++||||||||||||||||+|++++|||+||||||++|++++||+||||||..|++.+|+|||+.||.+++.|+++++++| T Consensus 1 v~iiTGpN~sGKSt~lk~i~l~~~laq~G~~vpa~~~~~~~~d~i~~~i~~~d~~~~~~S~F~~e~~~~~~il~~~~~~s 80 (185) T smart00534 1 VVIITGPNMGGKSTYLRQVGLIVIMAQIGSFVPAESAELPVFDRIFTRIGASDSLAQGLSTFMVEMKETANILKNATENS 80 (185) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEECCCEEECCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCE T ss_conf 98997999884899999999999999978886157719950310023356410021564189999999999998389873 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCC Q ss_conf 69993258898805679999999999997269849997487579766430688589999999609927787777447898 Q gi|254780750|r 732 FVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIAD 811 (920) Q Consensus 732 LVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~ 811 (920) |||+||+||||++.||.|+|+|+++||.++.+|+++||||||+|+++.+..+++.|+||.+++++++++|+|||.+|+|+ T Consensus 81 LvliDEl~~GT~~~eg~ala~aile~l~~~~~~~~iitTH~~eL~~~~~~~~~v~~~~~~~~~~~~~l~~~Ykl~~G~~~ 160 (185) T smart00534 81 LVLLDELGRGTSTYDGVAIAAAVLEYLLEKIGALTLFATHYHELTKLADEHPGVRNLHMSADEETENLTFLYKLTPGVAG 160 (185) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHCCCCEEEEEEEEECCCCEEEEEEECCCCCC T ss_conf 76304113688814789999999999996369759986043789998874732000246776417834178886138999 Q ss_pred CCHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 8778999998299989999999999 Q gi|254780750|r 812 HSYGIQVGKLAGLPNTVISRAYDIL 836 (920) Q Consensus 812 ~Sygi~vA~laG~p~~vi~~A~~~~ 836 (920) +|||+++|+++|+|++||+||++++ T Consensus 161 ~s~ai~ia~~~G~p~~ii~rA~~il 185 (185) T smart00534 161 KSYGIEVAKLAGLPKEVIERAKEIL 185 (185) T ss_pred CCHHHHHHHHHCCCHHHHHHHHHHC T ss_conf 7299999999492999999999759 No 22 >COG1193 Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=432.44 Aligned_cols=480 Identities=23% Similarity=0.290 Sum_probs=379.7 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHH-HHHCHHHHHHH Q ss_conf 76301234337899998631002345678989999997402212588999997520483444456775-21000021206 Q gi|254780750|r 316 KTIDYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFIQNPILINPIQKILKSSPDVPRSLSRLKI-GRGEPKDIAVI 394 (920) Q Consensus 316 ~~Ln~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~~~~~l~~~L~~i~DleRll~ri~~-~~~~p~dl~~L 394 (920) .+..++.|+.|.-.|+. +.|..|.+.|+..++-+.++......... -.+.++.|+--.+.|+-. +..++.++..+ T Consensus 15 ~~~~~~~s~~g~~~~~~--l~p~~~~~~i~~~~~e~~~~~~~~~~~g~--~~~~~l~~i~~~l~~~e~g~~l~~~el~~i 90 (753) T COG1193 15 LLASYAQSSLGLEELKN--LKPRTDLELIEEELSETAEALDILEDEGL--PPLGGLNDVSEALGRLEKGGRLHVEELLEI 90 (753) T ss_pred HHHHHCCCHHHHHHHHH--CCCCCCHHHHHHHHHHHHHHHHHHHCCCC--CCCHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 99985468789999982--68654389998777778999998724688--881125545789898864243789999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 89999986556754302584577876765776789999999998543101221046010168541145579988638999 Q gi|254780750|r 395 RDGIRSGINILNLFLKNDLPEELRGAVEKLKKLPQSLEKTLSSMLSDDLPTFKRDGGFLRDGADASLDETRSLRDQSKRI 474 (920) Q Consensus 395 ~~sl~~~~~i~~~l~~~~~~~~l~~~~~~l~~l~~~l~~~l~~~i~~~~~~~~~~~~~ik~g~d~eLD~lr~~~~~~~~~ 474 (920) .+.+.....+...+....... ......+..+. .++..+. ....+++.+++..+..|+.+|......... T Consensus 91 ~~~l~~~~~lkr~~~~~e~~~--~~~~~~~~~~~-~l~~~i~--------~~id~~g~i~d~as~~l~~ir~~lr~~~~~ 159 (753) T COG1193 91 SDFLRGFRALKRAIKKLERIK--RTLALALIELS-DLELEIN--------IPIDDDGLIKDRASFELDAIRRQLRDLEEE 159 (753) T ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHCCH-HHHHHHH--------HHHCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 999877999999999865678--88987510300-7999976--------410554443332058999988536779999 Q ss_pred HHHHHHHHHHHCCCCCC--CCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHH-HHHHHHHHHHHHH Q ss_conf 99776877887388751--10255862237620411433344111687652798521200011134-5666433464377 Q gi|254780750|r 475 IASLQLKYAEETKIKNL--KIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTL-ELIDLENRITNAT 551 (920) Q Consensus 475 l~~l~~~~~~~~~i~sL--ki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~-eL~~l~~~i~~~~ 551 (920) +.+....+......+.| .+...+...|++.|.......++ ..++-++.++.+-|+.| ....+++++.... T Consensus 160 ir~~l~~~~~~~~~~~L~e~~v~~r~~r~vlpvk~~fk~~i~-------giv~d~sssg~tl~ieP~~vv~l~n~~~~l~ 232 (753) T COG1193 160 IRDKLESLIRSKEAKYLQDRIVTTRDGREVLPVKAEFKGAIK-------GIVHDTSSSGATLYIEPRSVVKLNNELRALL 232 (753) T ss_pred HHHHHHHHHHHHHHHHHHHCEEECCCCEEEEHHHHHHHHHCC-------CEEEECCCCCCEEEECCHHHHHHCCHHHHHH T ss_conf 999999987331567634104861388588578887665158-------5585512465702225437776345766531 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCEEECCCCCHHHHHHHHHCCC Q ss_conf 77764334478887654301125798888999867788778887752588512104787140000468058876310287 Q gi|254780750|r 552 NSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKEQNYCRPIIDNSTNFIVKDGRHPIVEKTLKQQSS 631 (920) Q Consensus 552 ~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~~y~rP~i~~~~~l~i~~gRHPviE~~l~~~~~ 631 (920) .+....+..+...+...+.++.+.+..+...++++|++.+.+.++....+++|.+.+...+.+.++|||++.. T Consensus 233 ~eE~~e~e~il~~lsa~v~~~~~~l~~~~~~~~~lD~i~Ak~~~~~~~~~v~P~~~~~~~l~l~~~~HPll~~------- 305 (753) T COG1193 233 VEEDEEEERILRELSALVAPVIPELEILLEIIGELDFIEAKVRYAKALKGVKPDFSNDGVLELLDARHPLLKE------- 305 (753) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCCCCC------- T ss_conf 4204759999998788875516789888988633488998999887650578866787038740244866765------- Q ss_pred CCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHC-CCCCCCEEEEEEECCCCCCCCC Q ss_conf 704443456358777766439999677844078999999999999971985303532-0682210567652376611385 Q gi|254780750|r 632 KPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYA-HIGIVDKLFSRVGSADNLASGR 710 (920) Q Consensus 632 ~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a-~i~~~D~IftRiGa~D~l~~g~ 710 (920) -||||+.++. +-+.++|||||||||++.|+++|++++|||+|.|+||.+. ++++|++||+.||..++|.++. T Consensus 306 --~v~~~i~~~~-----e~~~l~ITGpN~GGKtvtLKTlgl~~lm~q~gl~i~a~~gsei~~F~~i~aDIGDeQsIeqsL 378 (753) T COG1193 306 --DVPNDLELGE-----ELDRLIITGPNTGGKTVTLKTLGLLRLMAQSGLPIPALEGSELPVFVKIFADIGDEQSIEQSL 378 (753) T ss_pred --CCCCCCCCCC-----CCCEEEEECCCCCCCEEHHHHHHHHHHHHHCCCCEECCCCCCCHHHHHHHHCCCCHHHHHHHH T ss_conf --5665422561-----212124764788760216998899999997689703267871042777652458388788887 Q ss_pred CHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEE Q ss_conf 32899999999999958998569993258898805679999999999997269849997487579766430688589999 Q gi|254780750|r 711 STFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATL 790 (920) Q Consensus 711 STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~ 790 (920) |||..+|...+.||..++ |||++||+|.||++.+|.|+|.|+++|+.++ +|+++.+|||.+|..+..+.+++.|..| T Consensus 379 STFSshm~~i~~il~~~d--sLvl~DElg~GTdp~EgaaLai~ile~l~~~-~~~~v~tTH~~elk~~~~~~~~v~nas~ 455 (753) T COG1193 379 STFSSHMTNIVEILEKAD--SLVLFDELGSGTDPDEGAALAIAILEDLLEK-PAKIVATTHYRELKALAAEREGVENASM 455 (753) T ss_pred HHHHHHHHHHHHHHHHCC--HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHC-CCCEEHHHHHHHHHHHHHCCHHHHCHHH T ss_conf 312889999999985311--5677787605788424377799889999862-4404257558999998721022211021 Q ss_pred EEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 9996099277877774478988778999998299989999999999 Q gi|254780750|r 791 QVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDIL 836 (920) Q Consensus 791 ~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~ 836 (920) +++.+ .+..+|++..|++++||++++|...|+|..+|++|+... T Consensus 456 ~fd~e--tL~ptY~l~~G~~g~S~Af~ia~rlGl~~~iie~a~~~~ 499 (753) T COG1193 456 EFDAE--TLRPTYRLLEGVPGRSNAFDIALRLGLPEPIIEEAKTEF 499 (753) T ss_pred HHHHH--HHHHHHHHHCCCCCCCHHHHHHHHCCCCHHHHHHHHHHC T ss_conf 32387--766778876078661027999998399889999987750 No 23 >pfam05192 MutS_III MutS domain III. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam00488, pfam05188, pfam01624 and pfam05190. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain III, which is central to the structure of Thermus aquaticus MutS as characterized in. Probab=100.00 E-value=0 Score=423.31 Aligned_cols=305 Identities=38% Similarity=0.601 Sum_probs=275.1 Q ss_pred CCEEEECCHHHCCEEEEECC-CCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 54014251332015454127-87764127876301234337899998631002345678989999997402212588999 Q gi|254780750|r 287 QSTLFIDSAARSNLEILRTL-SGSREQSLLKTIDYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFIQNPILINPIQ 365 (920) Q Consensus 287 ~~~m~LD~~Tl~nLEI~~~~-~g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~~~~~l~ 365 (920) ++||.||++|++||||++|. +|+++||||++||+|+|+||+|+||+||++|++|+++|++|||+|++|.+++.+++.++ T Consensus 1 ~~~m~lD~~tl~~Lei~~n~~~~~~~~SL~~~Ln~t~T~~G~RlLr~wl~~Pl~d~~~I~~R~d~Ve~l~~~~~~~~~l~ 80 (306) T pfam05192 1 EDYMLLDSATRRNLELVENLSGDGKNGSLFSVLDRTVTAMGSRLLRRWLLQPLRDREEINARLDAVEELLEDPELRQTLR 80 (306) T ss_pred CCCEECCHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCHHHHHHHH T ss_conf 98453289999852688788899989839999656889088999999985816699999999999999985989999999 Q ss_pred HHHHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99752048344445677521000021206899999865567543025845778767657767899999999985431012 Q gi|254780750|r 366 KILKSSPDVPRSLSRLKIGRGEPKDIAVIRDGIRSGINILNLFLKNDLPEELRGAVEKLKKLPQSLEKTLSSMLSDDLPT 445 (920) Q Consensus 366 ~~L~~i~DleRll~ri~~~~~~p~dl~~L~~sl~~~~~i~~~l~~~~~~~~l~~~~~~l~~l~~~l~~~l~~~i~~~~~~ 445 (920) ..|++++|+||+++|++.++++|+++..+++++..++.+...+.....+............ ...+.+.+...+..+.+. T Consensus 81 ~~Lk~i~Dlerl~~ri~~~~~~~~~~~~l~~~l~~i~~i~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~i~~~~~~ 159 (306) T pfam05192 81 ELLKRVPDLERLLSRLALGRANPRDLLRLRDSLERLPTIKELLEELDSPLLLKELRKSIDD-LSELLDLLETAINEDPPL 159 (306) T ss_pred HHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCC-HHHHHHHHHHHHHCCHHH T ss_conf 9986489789999999828978899999999999999999999857884678888744234-899999999998246176 Q ss_pred HHCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEE Q ss_conf 21046010168541145579988638999997768778873887511025586223762041143334411168765279 Q gi|254780750|r 446 FKRDGGFLRDGADASLDETRSLRDQSKRIIASLQLKYAEETKIKNLKIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIH 525 (920) Q Consensus 446 ~~~~~~~ik~g~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~~i~sLki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~ 525 (920) ...++++|++|++++||++|..+++..+.+.++..+++++++++++++.|++..||++++|+.....+| ..|+. T Consensus 160 ~~~~~~~i~~g~~~~LD~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~g~~i~~~~~~~~~~p------~~~~~ 233 (306) T pfam05192 160 LIREGGVIKSGYDAELDELRDLRDEAKEWIAELEAKEREETGIDSLKIGYNKVGGYYIEVTKSNADLLP------KHFIR 233 (306) T ss_pred HCCCCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCEEEEEEECCCHHCCCC------CCCEE T ss_conf 556788548998878999999999789999999999999818986125665822799986031010286------40024 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8521200011134566643346437777764334478887654301125798888999867788778887752 Q gi|254780750|r 526 RQTMSNLTRFTTLELIDLENRITNATNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKE 598 (920) Q Consensus 526 ~qt~~~~~Rf~t~eL~~l~~~i~~~~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~ 598 (920) .++.++.+||++|++.++..++.++.+++...+.+++.++.+.+.++.+.|..+++++|+|||++|||.+|.+ T Consensus 234 ~~~~k~~~~f~t~~l~~l~~~l~~~~~~i~~~~~~i~~~l~~~~~~~~~~l~~~~~~ia~LD~l~S~A~vA~e 306 (306) T pfam05192 234 RQTLKNAERFTTPELKELEDKILNAEERILALEKELFEELRERVVEHAEPLQALAEALAELDVLVSFAELAAE 306 (306) T ss_pred EEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 4301571588568899999999999999999999999999999999999999999999999999999999578 No 24 >smart00533 MUTSd DNA-binding domain of DNA mismatch repair MUTS family. Probab=100.00 E-value=0 Score=410.12 Aligned_cols=307 Identities=34% Similarity=0.488 Sum_probs=269.8 Q ss_pred CCHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCHHH Q ss_conf 41278763012343378999986310023456789899999974022125889999975204834444567752100002 Q gi|254780750|r 311 EQSLLKTIDYSITGAGGRLFAERIASPLTDSTKINTRLDSINFFIQNPILINPIQKILKSSPDVPRSLSRLKIGRGEPKD 390 (920) Q Consensus 311 ~gSL~~~Ln~t~T~~G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~~~~~~~~l~~~L~~i~DleRll~ri~~~~~~p~d 390 (920) +||||++||+|+|+||+|+||+||++|++|+++|++|||+|++|.+++.+++.++..|++++|+||+++|+..++++|.+ T Consensus 1 kgSL~~~ln~t~T~~G~RlLr~wl~~Pl~d~~~I~~R~~~Ve~l~~~~~~~~~l~~~L~~i~Dlerl~~k~~~~~~~~~~ 80 (308) T smart00533 1 KGSLFELLNHTKTPMGKRLLRRWLLQPLLDLKEINERLDAVEELVENPELRQKLRQLLKRIPDLERLLSRIERGRASPRD 80 (308) T ss_pred CCCHHHHHCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHH T ss_conf 96668886678893899999999867179999999999999999819799999999987489889999999948998899 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHH Q ss_conf 12068999998655675430258457787676577678999999999854310122104601016854114557998863 Q gi|254780750|r 391 IAVIRDGIRSGINILNLFLKNDLPEELRGAVEKLKKLPQSLEKTLSSMLSDDLPTFKRDGGFLRDGADASLDETRSLRDQ 470 (920) Q Consensus 391 l~~L~~sl~~~~~i~~~l~~~~~~~~l~~~~~~l~~l~~~l~~~l~~~i~~~~~~~~~~~~~ik~g~d~eLD~lr~~~~~ 470 (920) +..+++++..+..+...+..... .........+..........+...+.+.......++++|++|++++||.+|..+++ T Consensus 81 ~~~l~~~l~~~~~i~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~i~~g~~~~Ld~l~~~~~~ 159 (308) T smart00533 81 LLRLYDSLEGLKEIRQLLESLDG-PLLGLLLKVILEPLLELLELLLELLNDDDPLEVNDGGLIKDGFDPELDELREKLEE 159 (308) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC-HHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHCCCCCEECCCCCHHHHHHHHHHHH T ss_conf 99999999999999999985776-24899998753113999999999873457755667783389989899999999999 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHH Q ss_conf 89999977687788738875110255862237620411433344111687652798521200011134566643346437 Q gi|254780750|r 471 SKRIIASLQLKYAEETKIKNLKIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRFTTLELIDLENRITNA 550 (920) Q Consensus 471 ~~~~l~~l~~~~~~~~~i~sLki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf~t~eL~~l~~~i~~~ 550 (920) ..+.+.++...++++.+++++++.+++..||+|+||++....+| ..|+..+++++..+|+++++.+++.++.++ T Consensus 160 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~ 233 (308) T smart00533 160 LEEELEELLKKEREELGIDSLKLGYNKVHGYYIEVTKSEAKKVP------KDFIRRSSLKNTERFTTPELKELENELLEA 233 (308) T ss_pred HHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEEECHHHCCCC------CCCEEEEECCCCEEEEECHHHHHHHHHHHH T ss_conf 99999999999999838851221636772289998630311597------301134404771799800388999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCEEECCCCCHHHHH Q ss_conf 77776433447888765430112579888899986778877888775258851210478714000046805887 Q gi|254780750|r 551 TNSALLIELESFEILSNAIIEQSESLDNASQVIAIIDISIALAILAKEQNYCRPIIDNSTNFIVKDGRHPIVEK 624 (920) Q Consensus 551 ~~~~~~~e~~~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~~~~y~rP~i~~~~~l~i~~gRHPviE~ 624 (920) .+++...+.+++.++.+.+.++.+.|..+++++|+|||++|||.+|..++||||+|++++.+.|++||||+||. T Consensus 234 ~~~i~~~~~~i~~~l~~~~~~~~~~l~~~~~~ia~LD~l~S~A~~a~~~~~~rP~i~~~~~l~i~~~RHPllE~ 307 (308) T smart00533 234 KEEIERLEKEILRELLEKVLEYLEELRALAEALAELDVLLSLATLAAEGNYVRPEFVDSGELEIKNGRHPVLEL 307 (308) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCCCEEEEECCCCEEEC T ss_conf 99999999999999999999999999999999999999999999983787889577489868999786884618 No 25 >pfam01624 MutS_I MutS domain I. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam00488, pfam05188, pfam05192 and pfam05190. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with globular domain I, which is involved in DNA binding, in Thermus aquaticus MutS as characterized in. Probab=100.00 E-value=5e-38 Score=302.52 Aligned_cols=113 Identities=58% Similarity=1.004 Sum_probs=107.6 Q ss_pred CHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 91689999999877992999980882220079999999861818910788998887626561766999999999889859 Q gi|254780750|r 26 TPMMRQYIEIKSINPDSLVFYRMGDFYELFFDDALLASRCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQKLIKIGHRI 105 (920) Q Consensus 26 TP~~~Qy~~iK~~~~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~Lv~~GykV 105 (920) ||||+|||+||++|||||||||||+|||+|++||+++|++|||++|.|++..++++||||||+|++++|+++|+++|||| T Consensus 1 tp~~~Qy~~iK~~~~d~vvl~qvG~FYE~y~~DA~~~a~~l~i~lt~~~~~~~~~~~m~GfP~~~l~~y~~~Lv~~G~~V 80 (113) T pfam01624 1 TPMMRQYLELKSKYPDAVLFFRVGDFYELFGEDAEIAARELGITLTVRGGGSGKRIPMAGVPEHAFETYLRRLVNKGYKV 80 (113) T ss_pred CHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHHHHHHCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCEE T ss_conf 90899999999879886999984885787886299999861818851256788875344775005999999999879979 Q ss_pred EEEEECCCHHHHHHCCCCCCEEEEEEEEECCCCEECCH Q ss_conf 99950289788741289984554589997753030201 Q gi|254780750|r 106 AICEQIESPLEAKKRGNKSLVRRNVVRLVTPGTLTEDQ 143 (920) Q Consensus 106 aiveQ~E~p~~~~~~~~~~~v~R~Vt~IiTpGT~~d~~ 143 (920) |||||+|+|.. ++++++|+||+||||||++|++ T Consensus 81 ~i~eQ~e~~~~-----~~~~~~R~vt~i~TpGT~~d~e 113 (113) T pfam01624 81 AICEQTEDPAE-----AKGVVKREVVRVITPGTLTDEE 113 (113) T ss_pred EEEEECCCCCC-----CCCCCEEEEEEEECCCCCCCCC T ss_conf 99996258423-----5896246588999997565889 No 26 >cd03227 ABC_Class2 ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterised by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins. Probab=100.00 E-value=3.2e-34 Score=274.09 Aligned_cols=156 Identities=30% Similarity=0.321 Sum_probs=132.4 Q ss_pred ECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCC Q ss_conf 00046805887631028770444345635877776643999967784407899999999999997198530353206822 Q gi|254780750|r 614 VKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIV 693 (920) Q Consensus 614 i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~ 693 (920) +.-+|||. .|||||+.++ ++++++|||||||||||+||++|++++|||.|+++||..+....+ T Consensus 2 ~~~~~~~~-----------~~vpndi~l~------~g~~~iItGpN~sGKSt~Lr~i~l~~~~a~~~~~~~~~~~~~~~~ 64 (162) T cd03227 2 IVLGRFPS-----------YFVPNDVTFG------EGSLTIITGPNGSGKSTILDAIGLALGGAQSATRRRSGVKAGCIV 64 (162) T ss_pred CCHHCCCC-----------CEECCCEECC------CCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCC T ss_conf 01310899-----------7507702608------986899989987757999999999999863267752555427764 Q ss_pred CEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC--CCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 10567652376611385328999999999999589--9856999325889880567999999999999726984999748 Q gi|254780750|r 694 DKLFSRVGSADNLASGRSTFMVEMIETASILNQAT--NQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATH 771 (920) Q Consensus 694 D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at--~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTH 771 (920) ..+..... ....+.|+|+.||.+++.+|..+. ++||+|+||+|+||++.||.+||++|++++. . +|.++|+|| T Consensus 65 ~~~~~~~~---~~~~~lSgg~~~~~~l~~~l~~~~~~~~~lillDE~~~Gtd~~~~~~l~~~i~~~~~-~-~~~~i~tTH 139 (162) T cd03227 65 AAVSAELI---FTRLQLSGGEKELSALALILALASLKPRPLYILDEIDRGLDPRDGQALAEAILEHLV-K-GAQVIVITH 139 (162) T ss_pred HHHEEEEE---ECCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHH-C-CCEEEEECC T ss_conf 02305766---412000542999999999998542489848996365579998899999999999997-6-998999797 Q ss_pred CHHHHHHHHHCCCEEEEEEE Q ss_conf 75797664306885899999 Q gi|254780750|r 772 FHELTDLSKSLKRFHNATLQ 791 (920) Q Consensus 772 y~eL~~l~~~~~~v~n~~~~ 791 (920) ||||+++++.+.+++|.|+. T Consensus 140 ~~eL~~lad~~~~~~~~~~~ 159 (162) T cd03227 140 LPELAELADKLIHIKKVITG 159 (162) T ss_pred HHHHHHHHHHHCCEEEEEEE T ss_conf 39999998750074787750 No 27 >pfam05188 MutS_II MutS domain II. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam00488, pfam01624, pfam05192 and pfam05190. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. This domain corresponds to domain II in Thermus aquaticus MutS as characterized in, and has similarity resembles RNAse-H-like domains (see pfam00075). Probab=99.78 E-value=1e-17 Score=150.88 Aligned_cols=129 Identities=28% Similarity=0.445 Sum_probs=109.1 Q ss_pred CEEEEEEEC-CCEEEEEEEECCCCEEEEEECC-HHHHHHHHHHCCCCEEEEECCCCCHHH-HHHHHHHCCCCEECCCCCC Q ss_conf 369999952-9849999999786559996058-789999985229857997077689678-9988742388157185211 Q gi|254780750|r 151 NYLMAVARI-RTEWAIAWIDISTGIFKISTSN-HDRLISDIMRIDPREIIFSEKELSHAE-YKSLFETLGNVAVAQPNVF 227 (920) Q Consensus 151 nyL~aI~~~-~~~~Gia~iDisTG~~~~~~~~-~d~l~~~L~~l~P~EIii~~~~~~~~~-~~~l~~~~~~~~~~~~~~~ 227 (920) |||+||+.. +..||+||+|+|||+|.+++++ .+++.++|.|++|+|||++++..+... ............+..+.|. T Consensus 1 NyL~ai~~~~~~~~glA~~DiSTGef~~~~~~~~~~l~~el~r~~P~Eiii~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (133) T pfam05188 1 NYLAAISRGDGNRYGLAFLDLSTGEFGVSEFEDFEELLAELSRLSPKELLLPESLSSSTVAESQKLLELRLRVGRRPTWL 80 (133) T ss_pred CEEEEEEECCCCEEEEEEEECCCCCEEEEECCCHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHCCCCEEECCCCC T ss_conf 97999998799859999999677967888528999999999825994899534422001566777640365317567532 Q ss_pred CCHHHHHHHHHHHCCCCCCCCHHH-HHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 471344788998708786421024-4434688877776436874112222221 Q gi|254780750|r 228 FDHSISESRIADYYNITTVDTFGS-FSQVEKTAAAAAISYIKKTQMVNKPTIG 279 (920) Q Consensus 228 f~~~~~~~~l~~~~~~~~l~~~~~-~~~~~~~a~~all~Yl~~~~~~~~~~i~ 279 (920) |+...+.+.+.++|++.+++++|. ..+.+++|+|++|.|+++||+..++|++ T Consensus 81 f~~~~a~~~l~~~f~~~~l~~~g~~~~~~~i~A~galL~Yl~~Tqk~~L~hi~ 133 (133) T pfam05188 81 FELEHAYEDLNEDFGVEDLDGFGLEELPLALCAAGALISYLKETQKENLPHIQ 133 (133) T ss_pred CCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHCC T ss_conf 48899999999983858764468766668999999999999999671624219 No 28 >pfam05190 MutS_IV MutS family domain IV. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam01624, pfam05188, pfam05192 and pfam00488. The mutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds in part with globular domain IV, which is involved in DNA binding, in Thermus aquaticus MutS as characterized in. Probab=99.61 E-value=2.8e-16 Score=140.03 Aligned_cols=91 Identities=41% Similarity=0.544 Sum_probs=86.5 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCHHHHHCCCCCCCCCCEEEEEECCCCHHH Q ss_conf 85411455799886389999977687788738875110255862237620411433344111687652798521200011 Q gi|254780750|r 456 GADASLDETRSLRDQSKRIIASLQLKYAEETKIKNLKIKHNNNLGYFIEVTSSSASVLKKDLESKDRFIHRQTMSNLTRF 535 (920) Q Consensus 456 g~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~~i~sLki~~n~~~gy~iEV~~~~~~~i~~~~~~~~~~i~~qt~~~~~Rf 535 (920) |||++||++|.+.+++.++|.+++.++++++||++||++||+++|||||||+++...+| +.|+++||++|++|| T Consensus 1 G~d~eLD~lr~~~~~~~~~l~~l~~~~~~~~gi~~LKi~~n~~~Gy~ieV~~~~~~~vp------~~~i~~qt~kn~~Rf 74 (92) T pfam05190 1 GFDEELDELRDLLEELESELDELLAKERERLGIKSLKVGYNKVFGYYIEVTRSEAKKVP------KDYIRRQTLKNGVRF 74 (92) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCEEEEEEEECHHHHHCCC------HHHHHHHHHHCCEEE T ss_conf 98879999999998589999999999999939973999774688999996044341296------779999846076355 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 13456664334643777 Q gi|254780750|r 536 TTLELIDLENRITNATN 552 (920) Q Consensus 536 ~t~eL~~l~~~i~~~~~ 552 (920) +|++|.+++.++.++++ T Consensus 75 tt~eL~~le~~~~~a~e 91 (92) T pfam05190 75 TTPELKKLEDELLEAEE 91 (92) T ss_pred CCHHHHHHHHHHHHHCC T ss_conf 58899999999984035 No 29 >KOG0219 consensus Probab=99.47 E-value=1.5e-15 Score=134.60 Aligned_cols=222 Identities=7% Similarity=-0.161 Sum_probs=180.0 Q ss_pred ECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH-CCCCC Q ss_conf 00046805887631028770444345635877776643999967784407899999999999997198530353-20682 Q gi|254780750|r 614 VKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY-AHIGI 692 (920) Q Consensus 614 i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~-a~i~~ 692 (920) -.+.+||+|+. +..-.+.||+..+-++.+. ....|+|||+|++|||+|+|-.|-++++||.||+.= ++.|. T Consensus 626 E~Qd~~~fIpN----dv~le~~~~~~~IiTGpNM----GGKSTyir~~Gvi~lmAQIGcfVPce~A~i~IvD~Il~RVGA 697 (902) T KOG0219 626 EGQDEIPFIPN----DVVLEKGKCRMLIITGPNM----GGKSTYIRQVGVIVLMAQIGCFVPCESATISIVDGILTRVGA 697 (902) T ss_pred HCCCCCCCCCC----CCCCCCCCCEEEEEECCCC----CCCCHHHHHHHHHHHHHHHCCCEEHHHCCCCHHHHHHHHHCC T ss_conf 12456777777----5400157751899857886----763021213369999998188235655387601678764164 Q ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 21056765237661138532899999999999958998569993258898805679999999999997269849997487 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) =|..+. -++.+-..++.++||.+|.+.+.++.-++..+....+|.|+++++.++.-++|+.+.+|.....|.+.+.+|+ T Consensus 698 ~D~q~k-G~STFM~Emleta~Ilr~at~~SliiiDELGRGTSt~DGfgiAwai~ehi~~ki~cf~lfATHfhElt~lae~ 776 (902) T KOG0219 698 GDSQLK-GISTFMAEMLETASILRRATKNSLIIIDELGRGTSTYDGFGIAWAIAEHIATKIGCFCLFATHFHELTKLAEQ 776 (902) T ss_pred CHHHHC-CHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 415432-3178999999999999856878289981367874300476178999999998875868778678888766531 Q ss_pred HHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 579766430688589999999609927787777447898877899999829998999999999999987630001111 Q gi|254780750|r 773 HELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFEKLYHHNQKDM 850 (920) Q Consensus 773 ~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le~~~~~~~~~~ 850 (920) |+-+........+.| ++-.-+.-.|.-+.|.+..+++.++|..+..|-+..+.+.+-+++++.....+.+.. T Consensus 777 ~~~vKn~h~~a~i~~------~~~~llY~V~~Gv~d~SFGi~VA~~a~fp~~vie~A~~~~~ele~~~~~~e~k~kKe 848 (902) T KOG0219 777 LPTVKNLHVTAQIEN------DDITLLYKVFEGVCDQSFGIHVAELVGFPEHVIEMAKQKAEELEDELVIKESKEKKE 848 (902) T ss_pred HHHHHHHEEEEEECC------CCHHHHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 035531000367647------630129988522225751146787718975779999999987778774431666787 No 30 >cd00267 ABC_ATPase ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=98.62 E-value=5.9e-07 Score=70.37 Aligned_cols=124 Identities=29% Similarity=0.370 Sum_probs=77.2 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCE---------EEEEEECCCC Q ss_conf 4434563587777664399996778440789999999999999719853035320682210---------5676523766 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDK---------LFSRVGSADN 705 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~---------IftRiGa~D~ 705 (920) +-+|+++.. ..+.+..|.|||.+||||++|.++-+. -| .+-++.+... +--+||--.. T Consensus 14 ~l~~i~~~i----~~Ge~~~i~G~nGaGKSTLl~~l~gl~--------~~-~~G~i~~~g~~~~~~~~~~~~~~i~~v~Q 80 (157) T cd00267 14 ALDNVSLTL----KAGEIVALVGPNGSGKSTLLRAIAGLL--------KP-TSGEILIDGKDIAKLPLEELRRRIGYVPQ 80 (157) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCC--------CC-CCCEEEECCEECCCCCHHHHHHCCCCCCC T ss_conf 782117898----799799998788999899999995884--------79-96289999999997999999940608766 Q ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHH Q ss_conf 11385328999999999999589985699932588988056799999999999972698499974875797664 Q gi|254780750|r 706 LASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLS 779 (920) Q Consensus 706 l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~ 779 (920) |+.| |.-...+|..| +..-.++|+||...|.++.....+.. .+..+.+. +...+++||..+..... T Consensus 81 LSgG----qkqrv~iA~al--~~~p~ililDEPtsgLD~~~~~~l~~-~i~~l~~~-g~tii~vtH~~~~~~~~ 146 (157) T cd00267 81 LSGG----QRQRVALARAL--LLNPDLLLLDEPTSGLDPASRERLLE-LLRELAEE-GRTVIIVTHDPELAELA 146 (157) T ss_pred CCHH----HHHHHHHHHHH--HCCCCEEEEECCCCCCCHHHHHHHHH-HHHHHHHC-CCEEEEEECCHHHHHHH T ss_conf 8869----99999999999--70999999969876689999999999-99999968-99999990899999997 No 31 >cd03238 ABC_UvrA The excision repair protein UvrA; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=98.47 E-value=2.8e-06 Score=65.31 Aligned_cols=140 Identities=22% Similarity=0.203 Sum_probs=74.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHC-CCCCCCEE--EEEEECCC-CCCC-- Q ss_conf 443456358777766439999677844078999999999999971985303532-06822105--67652376-6113-- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYA-HIGIVDKL--FSRVGSAD-NLAS-- 708 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a-~i~~~D~I--ftRiGa~D-~l~~-- 708 (920) +=+|+++.. ..+.+..|.|||-+||||+++......--.....+.|.-.. .+-.+|.+ +.++|-.. .+-+ T Consensus 10 aL~~isl~i----~~Ge~~~iiG~nGsGKSTLl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~l~~~~l~~~~ 85 (176) T cd03238 10 NLQNLDVSI----PLNVLVVVTGVSGSGKSTLVNEGLYASGKARLISFLPKFSRNKLIFIDQLQFLIDVGLGYLTLGQKL 85 (176) T ss_pred CCCCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHHHHHCCHHHCCHHHCCCCCEEEHHHHHHHHHCCCCCCCCCCCC T ss_conf 467548788----8998999999999989999998887610311203210137553688577999997488667789916 Q ss_pred -CCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf -853289999999999995899856999325889880567999999999999726984999748757976643 Q gi|254780750|r 709 -GRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 709 -g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) ..|--+---..+|..|-.-.+.+++|+||--.|-++..-..| +.++..|.+. +..++++||..++.++++ T Consensus 86 ~~LSGGqkQRvaiAraL~~~p~~~ililDEPtsgLD~~~~~~l-~~~l~~l~~~-g~TvI~vtHd~~~~~~aD 156 (176) T cd03238 86 STLSGGELQRVKLASELFSEPPGTLFILDEPSTGLHQQDINQL-LEVIKGLIDL-GNTVILIEHNLDVLSSAD 156 (176) T ss_pred CCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHCC T ss_conf 8689999999999999986899868997177445898799999-9999999987-998999947879998399 No 32 >cd03230 ABC_DR_subfamily_A This family of ATP-binding proteins belongs to a multisubunit transporter involved in drug resistance (BcrA and DrrA), nodulation, lipid transport, and lantibiotic immunity. In bacteria and archaea, these transporters usually include an ATP-binding protein and one or two integral membrane proteins. Eukaryote systems of the ABCA subfamily display ABC domains that are quite similar to this family. The ATP-binding domain shows the highest similarity between all members of the ABC transporter family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=98.28 E-value=1.6e-05 Score=59.61 Aligned_cols=127 Identities=24% Similarity=0.365 Sum_probs=72.7 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC-------------------- Q ss_conf 443456358777766439999677844078999999999999971985303532068221-------------------- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD-------------------- 694 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D-------------------- 694 (920) |=+|++|.. ..+.+.-|-|||-+||||+||.++ |..-| ++-++.+.. T Consensus 15 ~L~~vsl~i----~~Gei~gl~G~NGaGKSTLl~~i~--------Gl~~p-~~G~i~i~g~~~~~~~~~~~~~ig~v~q~ 81 (173) T cd03230 15 ALDDISLTV----EKGEIYGLLGPNGAGKTTLIKIIL--------GLLKP-DSGEIKVLGKDIKKEPEEVKRRIGYLPEE 81 (173) T ss_pred EEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCC-CCCEEEECCEECCCCHHHHHCCEEEEECC T ss_conf 982208788----799399998789979999999997--------68577-87889999999886848886578999568 Q ss_pred -EEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf -0567652376611385328999999999999589985699932588988056799999999999972698499974875 Q gi|254780750|r 695 -KLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 695 -~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) .+|..+-..|++. .|.=|--....|.. -+..-.++|+||--.|=++.--..+ |.++..+.+. ++-++++||.. T Consensus 82 ~~l~~~ltv~e~l~--LSgG~kqrv~ia~a--l~~~p~lllLDEPt~gLD~~~~~~i-~~~i~~l~~~-g~tvi~~tH~l 155 (173) T cd03230 82 PSLYENLTVRENLK--LSGGMKQRLALAQA--LLHDPELLILDEPTSGLDPESRREF-WELLRELKKE-GKTILLSSHIL 155 (173) T ss_pred CCCCCCCCHHHHHH--CCHHHHHHHHHHHH--HHCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCH T ss_conf 76671267789863--39899999999999--9649999999088657999999999-9999999968-99999992838 Q ss_pred HHH-HHHH Q ss_conf 797-6643 Q gi|254780750|r 774 ELT-DLSK 780 (920) Q Consensus 774 eL~-~l~~ 780 (920) +.. .+++ T Consensus 156 ~~~~~~~d 163 (173) T cd03230 156 EEAERLCD 163 (173) T ss_pred HHHHHHCC T ss_conf 99998699 No 33 >PRK13538 cytochrome c biogenesis protein CcmA; Provisional Probab=98.27 E-value=3.6e-05 Score=57.05 Aligned_cols=52 Identities=23% Similarity=0.177 Sum_probs=34.8 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHH Q ss_conf 8998569993258898805679999999999997269849997487579766430 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKS 781 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~ 781 (920) +.+..++|+||--.|=++. +...-+..++.+.+. +..++++|| |+|..+++. T Consensus 145 ~~~p~lllLDEPt~~LD~~-~~~~l~~~l~~~~~~-g~~vi~~tH-~dl~~~ad~ 196 (204) T PRK13538 145 LTRAPLWILDEPFTAIDKQ-GVARLEQLLAQHAEQ-GGMVILTTH-QDLPGASDK 196 (204) T ss_pred HCCCCEEEEECCCCCCCHH-HHHHHHHHHHHHHHC-CCEEEEEEC-HHHHHHCCE T ss_conf 6099989997886578999-999999999999858-998999986-698987699 No 34 >cd03217 ABC_FeS_Assembly ABC-type transport system involved in Fe-S cluster assembly, ATPase component. Biosynthesis of iron-sulfur clusters (Fe-S) depends on multiprotein systems. The SUF system of E. coli and Erwinia chrysanthemi is important for Fe-S biogenesis under stressful conditions. The SUF system is made of six proteins: SufC is an atypical cytoplasmic ABC-ATPase, which forms a complex with SufB and SufD; SufA plays the role of a scaffold protein for assembly of iron-sulfur clusters and delivery to target proteins; SufS is a cysteine desulfurase which mobilizes the sulfur atom from cysteine and provides it to the cluster; SufE has no associated function yet. Probab=98.24 E-value=4.4e-05 Score=56.44 Aligned_cols=137 Identities=27% Similarity=0.289 Sum_probs=79.4 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH-HHHHHCCCCC----------CHHHCCCCCC--C---EEE Q ss_conf 4443456358777766439999677844078999999999-9999719853----------0353206822--1---056 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI-VIMAQMGSYV----------PASYAHIGIV--D---KLF 697 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~-vilAQiG~fV----------PA~~a~i~~~--D---~If 697 (920) -|=+|++|.. ..+.+.-|-|||-+||||++|.++=. .+-..-|.-. |.+.++.+++ - ..| T Consensus 14 ~vL~~vsl~v----~~Gei~~iiGpnGaGKSTLl~~i~G~~~~~~~~G~I~~~g~~i~~~~~~~~~~~gi~~~~q~~~~~ 89 (200) T cd03217 14 EILKGVNLTI----KKGEVHALMGPNGSGKSTLAKTIMGHPKYEVTEGEILFKGEDITDLPPEERARLGIFLAFQYPPEI 89 (200) T ss_pred EEEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHCCCCCCCCCCCEEEECCEECCCCCHHHHHHCCEEEECCCHHHC T ss_conf 9885505688----799899999689999999999970777778520079999999886999999976948963676870 Q ss_pred EEEECCC---CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 7652376---6113853289999999999995899856999325889880567999999999999726984999748757 Q gi|254780750|r 698 SRVGSAD---NLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 698 tRiGa~D---~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) ..+...| ++..+.|-=+.-...+|..|- ..-.++|+||--.|=++. ....-|.++..+.+. ++-++++||+.+ T Consensus 90 ~~~~~~~~l~~~~~~LSGGekqrv~iaral~--~~P~lllLDEPtsgLD~~-~~~~i~~~i~~l~~~-g~tiiiitH~~~ 165 (200) T cd03217 90 PGVKNADFLRYVNEGFSGGEKKRNEILQLLL--LEPDLAILDEPDSGLDID-ALRLVAEVINKLREE-GKSVLIITHYQR 165 (200) T ss_pred CCCCHHHHHHHHCCCCCHHHHHHHHHHHHHH--HCCCEEEEECCCCCCCHH-HHHHHHHHHHHHHHC-CCEEEEEEECHH T ss_conf 7984999997646367999999999999996--099999996962269999-999999999999857-999999996368 Q ss_pred HHHH Q ss_conf 9766 Q gi|254780750|r 775 LTDL 778 (920) Q Consensus 775 L~~l 778 (920) +... T Consensus 166 ~~~~ 169 (200) T cd03217 166 LLDY 169 (200) T ss_pred HHHH T ss_conf 7766 No 35 >cd03233 ABC_PDR_domain1 The pleiotropic drug resistance (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=98.24 E-value=2.8e-05 Score=57.91 Aligned_cols=130 Identities=20% Similarity=0.237 Sum_probs=66.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH-----------------------HHHHCCCCCCHHHCCC- Q ss_conf 4434563587777664399996778440789999999999-----------------------9997198530353206- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV-----------------------IMAQMGSYVPASYAHI- 690 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v-----------------------ilAQiG~fVPA~~a~i- 690 (920) |=+|+++.. ..+.+..|.|||-+||||+||.++=.. ...+++ |||.+...+ T Consensus 22 vL~~is~~i----~~Gei~~llG~nGsGKSTLl~~l~G~~~~~~~~~G~i~~~g~~~~~~~~~~~~~~~-~v~q~~~~~~ 96 (202) T cd03233 22 ILKDFSGVV----KPGEMVLVLGRPGSGCSTLLKALANRTEGNVSVEGDIHYNGIPYKEFAEKYPGEII-YVSEEDVHFP 96 (202) T ss_pred EEECEEEEE----CCCEEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEEEEECCEECCHHHHHHHCCEE-EECCCCCCCC T ss_conf 997708898----09849999989999889999998378789875137999999994051486420199-9867322376 Q ss_pred --CCCCEE-EE-EEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf --822105-67-65237661138532899999999999958998569993258898805679999999999997269849 Q gi|254780750|r 691 --GIVDKL-FS-RVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRG 766 (920) Q Consensus 691 --~~~D~I-ft-RiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~ 766 (920) ++.+.+ |. +. ..|...++.|.=+--...+|.. -+..-+++|+||--.|-++.....| |..+..+.+..+..+ T Consensus 97 ~ltv~e~l~~~~~~-~~~~~~~~LSgGqkqRv~iA~a--L~~~P~illlDEPt~gLD~~~~~~i-~~~l~~l~~~~~~t~ 172 (202) T cd03233 97 TLTVRETLDFALRC-KGNEFVRGISGGERKRVSIAEA--LVSRASVLCWDNSTRGLDSSTALEI-LKCIRTMADVLKTTT 172 (202) T ss_pred CCCHHHHHHHHHHH-CCCCCCCCCCHHHHHHHHHHHH--HHHCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHHCCCEE T ss_conf 88099999999984-6587444589999999999999--9529988998387656899999999-999999998779989 Q ss_pred EEECCCH Q ss_conf 9974875 Q gi|254780750|r 767 LLATHFH 773 (920) Q Consensus 767 lfaTHy~ 773 (920) +++||.+ T Consensus 173 ii~~~~~ 179 (202) T cd03233 173 FVSLYQA 179 (202) T ss_pred EEEEECC T ss_conf 9999069 No 36 >cd03216 ABC_Carb_Monos_I This family represents the domain I of the carbohydrate uptake proteins that transport only monosaccharides (Monos). The Carb_Monos family is involved in the uptake of monosaccharides, such as pentoses (such as xylose, arabinose, and ribose) and hexoses (such as xylose, arabinose, and ribose), that cannot be broken down to simple sugars by hydrolysis. Pentoses include xylose, arabinose, and ribose. Important hexoses include glucose, galactose, and fructose. In members of the Carb_monos family, the single hydrophobic gene product forms a homodimer while the ABC protein represents a fusion of two nucleotide-binding domains. However, it is assumed that two copies of the ABC domains are present in the assembled transporter. Probab=98.23 E-value=4e-05 Score=56.73 Aligned_cols=130 Identities=25% Similarity=0.284 Sum_probs=78.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCC---- Q ss_conf 4443456358777766439999677844078999999999999971985303532068221056765237661138---- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASG---- 709 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g---- 709 (920) .|=+|++|.. ..+.+.-|.|||-+||||++|.++ |..-| ++-++.+..+-.+.....|....| T Consensus 14 ~aL~~vsl~i----~~Gei~~lvG~nGaGKSTl~~~i~--------Gl~~p-~~G~i~i~G~~i~~~~~~~~~~~gi~~v 80 (163) T cd03216 14 KALDGVSLSV----RRGEVHALLGENGAGKSTLMKILS--------GLYKP-DSGEILVDGKEVSFASPRDARRAGIAMV 80 (163) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCC-CCCEEEECCEECCCCCHHHHHHCCCCEE T ss_conf 9885548898----799899999889989999999995--------77689-8578999999999999999998799489 Q ss_pred --CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH-HHHHH Q ss_conf --532899999999999958998569993258898805679999999999997269849997487579-76643 Q gi|254780750|r 710 --RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHEL-TDLSK 780 (920) Q Consensus 710 --~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL-~~l~~ 780 (920) .|-=|--....+..| +.+-.++|+||--.|=++..-..+ |.++..+.+. ++-.+|+||.-+. .++++ T Consensus 81 ~qLSgG~~Qrv~iaral--~~~p~llilDEPt~gLD~~~~~~i-~~~l~~l~~~-G~til~vtH~l~~~~~~~D 150 (163) T cd03216 81 YQLSVGERQMVEIARAL--ARNARLLILDEPTAALTPAEVERL-FKVIRRLRAQ-GVAVIFISHRLDEVFEIAD 150 (163) T ss_pred CCCCHHHHHHHHHHHHH--HHCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCC T ss_conf 46998999999999999--729999999097557999999999-9999999878-9899999384999998699 No 37 >PRK13543 cytochrome c biogenesis protein CcmA; Provisional Probab=98.23 E-value=0.00023 Score=50.98 Aligned_cols=153 Identities=18% Similarity=0.196 Sum_probs=73.1 Q ss_pred CEECCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 12104787140000468058876310287704443456358777766439999677844078999999999999971985 Q gi|254780750|r 603 RPIIDNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY 682 (920) Q Consensus 603 rP~i~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f 682 (920) -|.=+..+.|.+++...- -+..-|=+++++.. +.+.+..|.|||-+||||+||.++=+. -.+-|.- T Consensus 3 ~p~~~~~P~L~~~~ls~~---------~~~~~vl~~isf~v----~~Ge~~~l~GpNGaGKTTLlr~l~Gl~-~p~~G~I 68 (214) T PRK13543 3 EPLHTAPPLLAAHALAFS---------RNEEPVFGPLDFHV----DAGEALLVQGDNGAGKTTLLRVLAGLL-HVESGQI 68 (214) T ss_pred CCCCCCCCEEEEEEEEEE---------ECCEEEEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCC-CCCCEEE T ss_conf 988999975999827999---------79999982638898----189899999999987999999997697-7884199 Q ss_pred C----CHHHCCCCCCCEEEEEEECCCCCCCCCCHH-----------------------------------------HHHH Q ss_conf 3----035320682210567652376611385328-----------------------------------------9999 Q gi|254780750|r 683 V----PASYAHIGIVDKLFSRVGSADNLASGRSTF-----------------------------------------MVEM 717 (920) Q Consensus 683 V----PA~~a~i~~~D~IftRiGa~D~l~~g~STF-----------------------------------------~vEm 717 (920) . |.... -..+...-++-...+....+++ +--- T Consensus 69 ~~~g~~~~~~---~~~~~~~~~~~~~~l~~~lt~~enl~~~~~l~~~~~~~~~~~~l~~~gl~~~~~~~~~~LSgGqkqR 145 (214) T PRK13543 69 QIDGKTATRG---DRSRFMAYLGHLPGLKADLSTLENLHFLCGLHGRRAKQMPGSALAIVGLAGYEDALVRQLSAGQRKR 145 (214) T ss_pred EECCEECCCH---HHCCEEEEECCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHCCCHHHCCHHHHHH T ss_conf 9999999754---0213589980145446887699999999862587799999999998699440007823489999999 Q ss_pred HHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 99999999589985699932588988056799999999999972698499974875797 Q gi|254780750|r 718 IETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 718 ~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) ...+.. -+.+..++|+||--.|=++ ++..+-+.++..+.+. +..+|++||..+.. T Consensus 146 v~lA~a--l~~~p~illLDEPt~~LD~-~~~~~l~~~l~~~~~~-g~tvl~~tHd~~~~ 200 (214) T PRK13543 146 LALARL--WLSPAPLWLLDEPYANLDL-EGITLVNRMISAHLRG-GGAALVTTHGAYAA 200 (214) T ss_pred HHHHHH--HHCCCCEEEEECCCCCCCH-HHHHHHHHHHHHHHHC-CCEEEEECCCHHHH T ss_conf 999999--8579999998099888999-9999999999999867-99999991998999 No 38 >PRK13541 cytochrome c biogenesis protein CcmA; Provisional Probab=98.22 E-value=7.9e-05 Score=54.51 Aligned_cols=128 Identities=20% Similarity=0.211 Sum_probs=69.8 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCC-C-----C-CCCEEEEEEECCCCCCC Q ss_conf 434563587777664399996778440789999999999999719853035320-6-----8-22105676523766113 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAH-I-----G-IVDKLFSRVGSADNLAS 708 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~-i-----~-~~D~IftRiGa~D~l~~ 708 (920) -+|++++. ..+.+..|+|||-+||||++|.++ |..-|.+-.. + . .-....+.+|-...+.- T Consensus 16 l~disl~i----~~G~i~~i~G~NGsGKSTLlk~i~--------Gl~~p~~G~I~~~~~~~~~~~~~~~~~i~~~~~l~~ 83 (195) T PRK13541 16 LFDLSITF----LPSAITYIKGANGCGKSSLLRMIA--------GIMQPSSGNIYYKNCNINNIAKPYCTYIGHNLGLKL 83 (195) T ss_pred EEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCCCCEEEEECCEEHHHHCHHHCEEECCCCCCCC T ss_conf 99777787----799799999999981999999996--------798898408999999920324635366355346787 Q ss_pred CC---------------------------------------CHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 85---------------------------------------328999999999999589985699932588988056799 Q gi|254780750|r 709 GR---------------------------------------STFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 709 g~---------------------------------------STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~a 749 (920) .. |.=|--....|.. -+.+..++|+||--.|=+. ++.. T Consensus 84 ~ltv~enl~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LS~G~kqrv~iAra--l~~~p~llllDEPt~~LD~-~~~~ 160 (195) T PRK13541 84 EMTVFENLKFWSEIYNSVETLYAAIHYFKLHDLLDEKCYSLSSGMQKVVALARL--IACQSDLWLLDEVETNLSK-ENRD 160 (195) T ss_pred CCCHHHHHHHHHHHCCHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHH--HHHCCCEEEEECCCCCCCH-HHHH T ss_conf 776999999999862839999999998498756648664599999999999999--9709999999787655999-9999 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHHHHHHH Q ss_conf 999999999972698499974875797664 Q gi|254780750|r 750 IAWATIEYLHETNRCRGLLATHFHELTDLS 779 (920) Q Consensus 750 iA~aile~l~~~~~~~~lfaTHy~eL~~l~ 779 (920) ..+-++..+.+ .+..++++||..+..+-+ T Consensus 161 ~i~~li~~~~~-~g~~ili~sH~~~~i~~a 189 (195) T PRK13541 161 LLNNLIVMKAN-SGGIVLLSSHLESSIKSA 189 (195) T ss_pred HHHHHHHHHHH-CCCEEEEECCCHHHHHHC T ss_conf 99999999983-999999983798999419 No 39 >cd03221 ABCF_EF-3 ABCF_EF-3 Elongation factor 3 (EF-3) is a cytosolic protein required by fungal ribosomes for in vitro protein synthesis and for in vivo growth. EF-3 stimulates the binding of the EF-1: GTP: aa-tRNA ternary complex to the ribosomal A site by facilitated release of the deacylated tRNA from the E site. The reaction requires ATP hydrolysis. EF-3 contains two ATP nucleotide binding sequence (NBS) motifs. NBSI is sufficient for the intrinsic ATPase activity. NBSII is essential for the ribosome-stimulated functions. Probab=98.21 E-value=3.6e-05 Score=57.08 Aligned_cols=118 Identities=27% Similarity=0.281 Sum_probs=68.8 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHH Q ss_conf 44434563587777664399996778440789999999999999719853035320682210567652376611385328 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTF 713 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF 713 (920) .|=|++++.. ..+.+..|.|||-+||||++|.+ +|..-| .+-++.+-+++ +|+--+-++.| T Consensus 14 ~vl~~is~~i----~~ge~~~l~G~NGsGKTTl~~~l--------~G~~~~-~~G~i~~~~~~--~i~y~~QLSgG---- 74 (144) T cd03221 14 LLLKDISLTI----NPGDRIGLVGRNGAGKSTLLKLI--------AGELEP-DEGIVTWGSTV--KIGYFEQLSGG---- 74 (144) T ss_pred EEEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHH--------HCCCCC-CCEEEEECCCC--EEEEEHHCCHH---- T ss_conf 9996348998----79999999989998499999998--------489889-85099999960--89987007999---- Q ss_pred HHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 9999999999995899856999325889880567999999999999726984999748757976 Q gi|254780750|r 714 MVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 714 ~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) |...++-..--+.+..++|+||--.|=++ .+. ..+.+.|.+. +.-.+|+||..+... T Consensus 75 --qkqr~~la~al~~~p~iliLDEPt~~LD~---~~~-~~i~~~l~~~-~~tii~vsHd~~~~~ 131 (144) T cd03221 75 --EKMRLALAKLLLENPNLLLLDEPTNHLDL---ESI-EALEEALKEY-PGTVILVSHDRYFLD 131 (144) T ss_pred --HHHHHHHHHHHCCCCCEEEEECCCCCCCH---HHH-HHHHHHHHHC-CCEEEEEECCHHHHH T ss_conf --99999999997259989999577555899---999-9999999970-999999967989999 No 40 >PRK11144 modC molybdate transporter ATP-binding protein; Provisional Probab=98.21 E-value=8.3e-06 Score=61.84 Aligned_cols=143 Identities=18% Similarity=0.245 Sum_probs=84.8 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------HHHHCCCCCCHHHCCCCCC---CE Q ss_conf 044434563587777664399996778440789999999999--------------9997198530353206822---10 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------IMAQMGSYVPASYAHIGIV---DK 695 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------ilAQiG~fVPA~~a~i~~~---D~ 695 (920) .|.- |+++.. ..+.+.-|-|||-+||||+||.++=+. ....-|-++|...-.++.+ .+ T Consensus 12 ~f~l-dv~l~i----~~g~i~~l~GpsGaGKTTLl~~iaGl~~p~~G~I~~~g~~~~~~~~~~~l~~~~r~ig~vfQ~~~ 86 (352) T PRK11144 12 QLCL-TVNLTL----PAQGITAIFGRSGAGKTSLINLISGLTRPQKGRIVLNGRVLFDAEKGICLPPEKRRIGYVFQDAR 86 (352) T ss_pred CEEE-EEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCCCCCCCHHHCCEEEEECCCC T ss_conf 9999-999998----89989999999996299999999768999965999999998555410137676688689935763 Q ss_pred EEEEEECCCCCCCCCCHHH-------HHHHHHHHHHH-------------------HCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 5676523766113853289-------99999999999-------------------589985699932588988056799 Q gi|254780750|r 696 LFSRVGSADNLASGRSTFM-------VEMIETASILN-------------------QATNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 696 IftRiGa~D~l~~g~STF~-------vEm~e~~~IL~-------------------~at~~SLVllDElGrGTst~DG~a 749 (920) +|-.+-..+|+.-|...-+ +|+.+....++ -+++-.|+|+||--.|=+..--.. T Consensus 87 LfphltV~~Nl~~g~~~~~~~~~~~~~~~l~l~~l~~r~p~~LSGGq~QRvaiARAL~~~P~lLllDEP~s~LD~~~~~~ 166 (352) T PRK11144 87 LFPHYKVRGNLRYGMAKSMPAQFDKIVSLLGIEPLLDRYPGSLSGGEKQRVAIGRALLTAPELLLMDEPLASLDLPRKRE 166 (352) T ss_pred CCCCCCHHHHHHHHCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHH T ss_conf 37776889966510005659999999977599567627864659245234999998724999999878400279779999 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHH-HHHHHHH Q ss_conf 9999999999726984999748757-9766430 Q gi|254780750|r 750 IAWATIEYLHETNRCRGLLATHFHE-LTDLSKS 781 (920) Q Consensus 750 iA~aile~l~~~~~~~~lfaTHy~e-L~~l~~~ 781 (920) | +..++.|.+..+.-++++||..+ ...+++. T Consensus 167 i-~~~l~~l~~~~~~til~VTHd~~e~~~laD~ 198 (352) T PRK11144 167 L-LPYLERLAQEINIPILYVSHSLDEILRLADH 198 (352) T ss_pred H-HHHHHHHHHHHCCEEEEEECCHHHHHHHCCE T ss_conf 9-9999999997398899993999999986999 No 41 >cd03232 ABC_PDR_domain2 The pleiotropic drug resistance-like (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=98.15 E-value=0.00013 Score=53.02 Aligned_cols=136 Identities=19% Similarity=0.178 Sum_probs=66.8 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHC----------CCCCCHHH-CCCCCCCE---EEEE Q ss_conf 4443456358777766439999677844078999999999999971----------98530353-20682210---5676 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQM----------GSYVPASY-AHIGIVDK---LFSR 699 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQi----------G~fVPA~~-a~i~~~D~---IftR 699 (920) -|=+|+++.. ..+.+..|-|||-+||||+||.++-. -.. |-.++... ..++.+.+ .|-. T Consensus 21 ~vL~~is~~i----~~Ge~~~llGpnGaGKSTLl~~l~g~---~~~~~~~G~i~~~g~~~~~~~~~~igyv~q~~~~~~~ 93 (192) T cd03232 21 QLLNNISGYV----KPGTLTALMGESGAGKTTLLDVLAGR---KTAGVITGEILINGRPLDKNFQRSTGYVEQQDVHSPN 93 (192) T ss_pred EEEECCEEEE----ECCEEEEEECCCCCCHHHHHHHHHCC---CCCCCEEEEEEECCEECCHHHHHHEEEEECCHHCCCC T ss_conf 9998838899----28839999999999889999998379---8788317899987827667756227999411330734 Q ss_pred EECCCCC-----CCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH- Q ss_conf 5237661-----1385328999999999999589985699932588988056799999999999972698499974875- Q gi|254780750|r 700 VGSADNL-----ASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH- 773 (920) Q Consensus 700 iGa~D~l-----~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~- 773 (920) +-..+++ .++.|.=+.....+|. --+.+-.++|+||--.|=++.--..| |..++.+.+. +..++++||.. T Consensus 94 ~tv~e~l~~~~~l~~LS~gqrqrv~iA~--aL~~~P~lllLDEPt~gLD~~~~~~i-~~~l~~l~~~-g~tiii~th~~~ 169 (192) T cd03232 94 LTVREALRFSALLRGLSVEQRKRLTIGV--ELAAKPSILFLDEPTSGLDSQAAYNI-VRFLKKLADS-GQAILCTIHQPS 169 (192) T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHH--HHHCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCCH T ss_conf 5499999866777337976765899999--98449988998488768898999999-9999999969-999999983637 Q ss_pred -HHHHHHH Q ss_conf -7976643 Q gi|254780750|r 774 -ELTDLSK 780 (920) Q Consensus 774 -eL~~l~~ 780 (920) ++.++++ T Consensus 170 ~~i~~~~D 177 (192) T cd03232 170 ASIFEKFD 177 (192) T ss_pred HHHHHHCC T ss_conf 99998799 No 42 >cd03255 ABC_MJ0796_Lo1CDE_FtsE This family is comprised of MJ0796 ATP-binding cassette, macrolide-specific ABC-type efflux carrier (MacAB), and proteins involved in cell division (FtsE), and release of liporoteins from the cytoplasmic membrane (LolCDE). They are clustered together phylogenetically. MacAB is an exporter that confers resistance to macrolides, while the LolCDE system is not a transporter at all. An FtsE null mutants showed filamentous growth and appeared viable on high salt medium only, indicating a role for FtsE in cell division and/or salt transport. The LolCDE complex catalyses the release of lipoproteins from the cytoplasmic membrane prior to their targeting to the outer membrane. Probab=98.15 E-value=3.8e-05 Score=56.88 Aligned_cols=135 Identities=19% Similarity=0.243 Sum_probs=73.2 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HH-----HHCCCCCCHHHC- Q ss_conf 4434563587777664399996778440789999999999--------------------99-----971985303532- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IM-----AQMGSYVPASYA- 688 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------il-----AQiG~fVPA~~a- 688 (920) +=+|++|.. ..+.+..|.|||-+||||+||.++-+. -+ .+|| |||-+.+ T Consensus 19 al~~isl~i----~~Ge~~~iiG~sGsGKTTll~~i~Gl~~p~~G~I~~~g~~i~~~~~~~~~~~rr~~Ig-~v~Q~~~L 93 (218) T cd03255 19 ALKGVSLSI----EKGEFVAIVGPSGSGKSTLLNILGGLDRPTSGEVRVDGTDISKLSEKELAAFRRRHIG-FVFQSFNL 93 (218) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHHHHCCEE-EECCCCCC T ss_conf 985628998----6998999999999869999999966999996499999999887998999998650478-98667521 Q ss_pred --CCCCCCEEEEEEECCCCCCCCCCHHH--------HHHHHHH------------------HHH-HHCCCCCEEEEECCC Q ss_conf --06822105676523766113853289--------9999999------------------999-958998569993258 Q gi|254780750|r 689 --HIGIVDKLFSRVGSADNLASGRSTFM--------VEMIETA------------------SIL-NQATNQSFVILDEIG 739 (920) Q Consensus 689 --~i~~~D~IftRiGa~D~l~~g~STF~--------vEm~e~~------------------~IL-~~at~~SLVllDElG 739 (920) .++++|.|.- |.. ..|.+..- .|+.+.. .|- --+..-.++|+||-- T Consensus 94 ~~~ltV~eni~~--~~~---~~~~~~~~~~~~v~~~l~~l~l~~~~~~~~~~LSGG~kQRv~iAraL~~~P~llllDEPT 168 (218) T cd03255 94 LPDLTALENVEL--PLL---LAGVPKKERRERAEELLERVGLGDRLNHYPSELSGGQQQRVAIARALANDPKIILADEPT 168 (218) T ss_pred CCCCCHHHHHHH--HHH---HCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 556439999999--999---849998999999998767679378873887638999999999999985599999981888 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 89880567999999999999726984999748757976643 Q gi|254780750|r 740 RGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 740 rGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) .|=++.-..- -|..+..|.+..+-.++++||..++.++++ T Consensus 169 s~LD~~~~~~-i~~~l~~l~~~~~~tii~itHd~~~~~~aD 208 (218) T cd03255 169 GNLDSETGKE-VMELLRELNKEAGTTIVVVTHDPELAEYAD 208 (218) T ss_pred CCCCHHHHHH-HHHHHHHHHHHCCCEEEEECCCHHHHHHCC T ss_conf 7689999999-999999999962989999896889998699 No 43 >cd03218 ABC_YhbG The ABC transporters belonging to the YhbG family are similar to members of the Mj1267_LivG family, which is involved in the transport of branched-chain amino acids. The genes yhbG and yhbN are located in a single operon and may function together in cell envelope during biogenesis. YhbG is the putative ATP-binding cassette component and YhbN is the putative periplasmic-binding protein. Depletion of each gene product leads to growth arrest, irreversible cell damage and loss of viability in E. coli. The YhbG homolog (NtrA) is essential in Rhizobium meliloti, a symbiotic nitrogen-fixing bacterium. Probab=98.14 E-value=4.1e-05 Score=56.67 Aligned_cols=131 Identities=28% Similarity=0.361 Sum_probs=77.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCC-CCCCHHHCCCCCC Q ss_conf 4434563587777664399996778440789999999999--------------------999719-8530353206822 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMG-SYVPASYAHIGIV 693 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG-~fVPA~~a~i~~~ 693 (920) |=+|+++.. ..+.+.-|-|||-+||||+||.++=+. --+..| +|||=+. T Consensus 15 ~l~~vs~~v----~~Gei~~llGpNGAGKSTll~~i~Gl~~p~~G~I~~~g~di~~~~~~~r~r~gig~~pQ~~------ 84 (232) T cd03218 15 VVNGVSLSV----KQGEIVGLLGPNGAGKTTTFYMIVGLVKPDSGKILLDGQDITKLPMHKRARLGIGYLPQEA------ 84 (232) T ss_pred EECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHCCEEECCCCC------ T ss_conf 882606798----9995999999999619999999977999986299999999999999999971979877777------ Q ss_pred CEEEEEEECCCCCC----------------------------------CCCCHHHHHHHHHHHHHHHCCCCCEEEEECCC Q ss_conf 10567652376611----------------------------------38532899999999999958998569993258 Q gi|254780750|r 694 DKLFSRVGSADNLA----------------------------------SGRSTFMVEMIETASILNQATNQSFVILDEIG 739 (920) Q Consensus 694 D~IftRiGa~D~l~----------------------------------~g~STF~vEm~e~~~IL~~at~~SLVllDElG 739 (920) .+|..+-..|||. ...|-=|--+.++|..| ++.-.++|+||-- T Consensus 85 -~l~~~ltV~enl~~~~~~~~~~~~~~~~~~~~~l~~~~L~~~~~~~~~~LSgG~~qrv~iAraL--~~~P~illlDEPt 161 (232) T cd03218 85 -SIFRKLTVEENILAVLEIRGLSKKEREEKLEELLEEFHITHLRKSKASSLSGGERRRVEIARAL--ATNPKFLLLDEPF 161 (232) T ss_pred -CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHCCCHHHCCHHHHHHHHHHHHH--HCCCCEEEECCCC T ss_conf -6788888999999999972999999999999999876982465394255999999999999999--6699999988985 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 8988056799999999999972698499974875797-6643 Q gi|254780750|r 740 RGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 740 rGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) .|=++..-..| |.++..|.+. ++-.+++||+-+.. .+++ T Consensus 162 ~GLDp~~~~~i-~~~i~~l~~~-g~tili~tH~l~~~~~~~d 201 (232) T cd03218 162 AGVDPIAVQDI-QKIIKILKDR-GIGVLITDHNVRETLSITD 201 (232) T ss_pred CCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCC T ss_conf 68899999999-9999999958-9999999283999998699 No 44 >cd03265 ABC_DrrA DrrA is the ATP-binding protein component of a bacterial exporter complex that confers resistance to the antibiotics daunorubicin and doxorubicin. In addition to DrrA, the complex includes an integral membrane protein called DrrB. DrrA belongs to the ABC family of transporters and shares sequence and functional similarities with a protein found in cancer cells called P-glycoprotein. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=98.13 E-value=3.3e-05 Score=57.38 Aligned_cols=134 Identities=22% Similarity=0.291 Sum_probs=72.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH--------------------HHHHHCCCCCCHHH---CCC Q ss_conf 4443456358777766439999677844078999999999--------------------99997198530353---206 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI--------------------VIMAQMGSYVPASY---AHI 690 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~--------------------vilAQiG~fVPA~~---a~i 690 (920) .+=+|+++.. ..+.+.-+-|||-+||||++|.++=+ .+-.++| |||=.. ..+ T Consensus 14 ~al~~is~~v----~~Gei~gllGpNGAGKSTll~~i~Gl~~p~~G~i~i~G~~~~~~~~~~r~~ig-~~pq~~~l~~~l 88 (220) T cd03265 14 EAVRGVSFRV----RRGEIFGLLGPNGAGKTTTIKMLTTLLKPTSGRATVAGHDVVREPREVRRRIG-IVFQDLSVDDEL 88 (220) T ss_pred EEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCHHHHHHCCE-EEECCCCCCCCC T ss_conf 9985826798----89839999999987199999999769788962899999998839899982838-990787679889 Q ss_pred CCCCEE--EEEEECCCCCCCCCCHH--------HHH------------------HHH-HHHHHHHCCCCCEEEEECCCCC Q ss_conf 822105--67652376611385328--------999------------------999-9999995899856999325889 Q gi|254780750|r 691 GIVDKL--FSRVGSADNLASGRSTF--------MVE------------------MIE-TASILNQATNQSFVILDEIGRG 741 (920) Q Consensus 691 ~~~D~I--ftRiGa~D~l~~g~STF--------~vE------------------m~e-~~~IL~~at~~SLVllDElGrG 741 (920) ++.+.+ |.|+ .|.+.- ..| |.. ++-+.--+..-.++|+||---| T Consensus 89 Tv~e~l~~~~~l-------~g~~~~~~~~~~~~ll~~~~L~~~~~~~~~~LS~G~kqrv~ia~Al~~~P~lliLDEPt~g 161 (220) T cd03265 89 TGWENLYIHARL-------YGVPGAERRERIDELLDFVGLLEAADRLVKTYSGGMRRRLEIARSLVHRPEVLFLDEPTIG 161 (220) T ss_pred CHHHHHHHHHHH-------CCCCHHHHHHHHHHHHHHCCCHHHHCCHHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 999999999998-------1999999999999999977996797370434799999999999998569998998088668 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 88056799999999999972698499974875797-6643 Q gi|254780750|r 742 TATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 742 Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) -++.--.. -|.++..+.+..+.-++++||+-+-. .+.+ T Consensus 162 LDp~~~~~-i~~~i~~l~~~~g~tiilssH~l~eve~l~d 200 (220) T cd03265 162 LDPQTRAH-VWEYIEKLKEEFGMTILLTTHYMEEAEQLCD 200 (220) T ss_pred CCHHHHHH-HHHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 89999999-9999999998389799998888899998699 No 45 >cd03271 ABC_UvrA_II The excision repair protein UvrA domain II; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=98.12 E-value=1e-04 Score=53.76 Aligned_cols=54 Identities=17% Similarity=0.195 Sum_probs=36.3 Q ss_pred HHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHH Q ss_conf 99589985699932588988056799999999999972698499974875797664 Q gi|254780750|r 724 LNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLS 779 (920) Q Consensus 724 L~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~ 779 (920) ++.++..+++|+||--.|=++.+-..+ +.+++.|.+. +..++++||.-++...+ T Consensus 185 ~~~~~~P~lllLDEPTs~LD~~~~~~l-~~~l~~l~~~-G~Tvi~itH~l~~~~~a 238 (261) T cd03271 185 SKRSTGKTLYILDEPTTGLHFHDVKKL-LEVLQRLVDK-GNTVVVIEHNLDVIKCA 238 (261) T ss_pred HHCCCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEEECHHHHHHC T ss_conf 725888967995486345998999999-9999999978-99999984778899738 No 46 >cd03240 ABC_Rad50 The catalytic domains of Rad50 are similar to the ATP-binding cassette of ABC transporters, but are not associated with membrane-spanning domains. The conserved ATP-binding motifs common to Rad50 and the ABC transporter family include the Walker A and Walker B motifs, the Q loop, a histidine residue in the switch region, a D-loop, and a conserved LSGG sequence. This conserved sequence, LSGG, is the most specific and characteristic motif of this family and is thus known as the ABC signature sequence. Probab=98.12 E-value=7.7e-05 Score=54.58 Aligned_cols=51 Identities=29% Similarity=0.299 Sum_probs=34.7 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHH---HHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 899856999325889880567999---999999999726984999748757976643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSI---AWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~ai---A~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) +....++|+||- |+..|-... -+.+++.+.+..+..++++||..++...++ T Consensus 137 ~~~p~lllLDEP---Ts~LD~~~~~~~l~~ll~~~~~~~~~tiIivtHd~e~~~~aD 190 (204) T cd03240 137 GSNCGILALDEP---TTNLDEENIEESLAEIIEERKSQKNFQLIVITHDEELVDAAD 190 (204) T ss_pred HCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHCC T ss_conf 459998998187---666997899999999999998569989999944989998499 No 47 >PRK11247 ssuB aliphatic sulfonates transport ATP-binding subunit; Provisional Probab=98.12 E-value=4.6e-05 Score=56.27 Aligned_cols=183 Identities=17% Similarity=0.167 Sum_probs=88.0 Q ss_pred CCEECCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 51210478714000046805887631028770444345635877776643999967784407899999999999997198 Q gi|254780750|r 602 CRPIIDNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS 681 (920) Q Consensus 602 ~rP~i~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~ 681 (920) .-|..+.+..|.+++...= -+..-|=+||++.. ..+.+..|.|||-+||||++|.++=+. -..-|. T Consensus 3 ~~~~~~~g~~l~v~nlsk~---------yg~~~~L~dIs~~I----~~GEiv~LiG~nGaGKSTLlr~i~Gl~-~p~~G~ 68 (257) T PRK11247 3 NTARLNQGTPLLLNAVSKH---------YGERTVLNQLDLHI----PAGQFVAVVGRSGCGKSTLLRLLAGLE-TPTAGD 68 (257) T ss_pred CCCCCCCCCCEEEEEEEEE---------ECCEEEECCEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCC-CCCCCE T ss_conf 8677899985899768999---------89988982407588----799899999899888999999996589-888870 Q ss_pred C----CCHHH--CCCCCCC---EEEEEEECCCCCCCCCCHH----HHHHH---------------------HHHHHHH-H Q ss_conf 5----30353--2068221---0567652376611385328----99999---------------------9999999-5 Q gi|254780750|r 682 Y----VPASY--AHIGIVD---KLFSRVGSADNLASGRSTF----MVEMI---------------------ETASILN-Q 726 (920) Q Consensus 682 f----VPA~~--a~i~~~D---~IftRiGa~D~l~~g~STF----~vEm~---------------------e~~~IL~-~ 726 (920) - .|-.. ..++.+- .+|...-..||+.-|...- ..|+. +--.|.+ - T Consensus 69 I~~~~~~i~~~~~~i~~vfQ~~~l~~~~tV~eni~~gl~~~~~~~~~e~l~~vgL~~~~~~~p~~LSGGqkQRvaiAraL 148 (257) T PRK11247 69 LLAGTAPLAEAQEDTRLMFQDARLLPWKKVIDNVGLGLKGQWRDAALQALAAVGLADRANEWPAALSGGQKQRVALARAL 148 (257) T ss_pred EEECCEEHHHHHHCEEEEECCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHCCHHHCCHHHHHHHHHHHHH T ss_conf 89898755443110079932564476778999986321410699999999985991355369444899999999999998 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHHCCCEEEEEEEEEEECCEEEEEEEE Q ss_conf 89985699932588988056799999999999972698499974875797-66430688589999999609927787777 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKSLKRFHNATLQVSDSNEGIIFLHKV 805 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~~~~v~n~~~~~~~~~~~i~flykl 805 (920) +..-.++|+||--.|-++.-...+ +..+..|.+..+...+|+||+.+.. .+++. +. - -.+++|++.-.+ T Consensus 149 ~~~P~lLlLDEPtsgLD~~~~~~i-~~ll~~L~~e~g~TIi~vTHdl~ea~~laDR---I~--v----m~~G~Iv~d~~~ 218 (257) T PRK11247 149 IHRPRLLLLDEPLGALDALTRIEM-QDLIESLWQQHGFTVLLVTHDVSEAVAMADR---VL--L----IEEGKIGLDLTV 218 (257) T ss_pred HCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE---EE--E----EECCEEEEEEEC T ss_conf 459999998098765799999999-9999999996098999988799999996999---99--9----989999997118 Q ss_pred EEC Q ss_conf 447 Q gi|254780750|r 806 IPG 808 (920) Q Consensus 806 ~~G 808 (920) -.. T Consensus 219 d~~ 221 (257) T PRK11247 219 DLP 221 (257) T ss_pred CCC T ss_conf 999 No 48 >cd03267 ABC_NatA_like Similar in sequence to NatA, this is the ATPase component of a bacterial ABC-type Na+ transport system called NatAB, which catalyzes ATP-dependent electrogenic Na+ extrusion without mechanically coupled to proton or K+ uptake. NatB possess six putative membrane spanning regions at its C-terminus. In B. subtilis, NatAB is inducible by agents such as ethanol and protonophores, which lower the protonmotive force across the membrane. The closest sequence similarity to NatA is exhibited by DrrA of the two-component daunomycin- and doxorubicin-efflux system. Hence, the functional NatAB is presumably assembled with two copies of the single ATP-binding protein and the single intergral membrane protein. Probab=98.10 E-value=4.3e-05 Score=56.49 Aligned_cols=140 Identities=16% Similarity=0.248 Sum_probs=72.6 Q ss_pred CCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHHH--- Q ss_conf 77044434563587777664399996778440789999999999--------------------9997198530353--- Q gi|254780750|r 631 SKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPASY--- 687 (920) Q Consensus 631 ~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~~--- 687 (920) +..-+=+|+++.. ..+.+.-|-|||-+||||+||.++=+. +.+++|..+|-.. T Consensus 32 g~~~al~~vsf~i----~~Gei~gLlGpNGaGKSTllk~l~Gl~~p~~G~I~v~G~~~~~~~~~~~~~ig~v~~q~~~l~ 107 (236) T cd03267 32 REVEALKGISFTI----EKGEIVGFIGPNGAGKTTTLKILSGLLQPTSGEVRVAGLVPWKRRKKFLRRIGVVFGQKTQLW 107 (236) T ss_pred CCEEEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCHHHHHHHEEEEECCCCCCC T ss_conf 9989866805788----489599999999830999999996494887159999999851040988843799957754246 Q ss_pred CCCCCCCEEEEEEECCCCCCCCCCH--H------HHHHHHHHH-------------------HHHHCCCCCEEEEECCCC Q ss_conf 2068221056765237661138532--8------999999999-------------------999589985699932588 Q gi|254780750|r 688 AHIGIVDKLFSRVGSADNLASGRST--F------MVEMIETAS-------------------ILNQATNQSFVILDEIGR 740 (920) Q Consensus 688 a~i~~~D~IftRiGa~D~l~~g~ST--F------~vEm~e~~~-------------------IL~~at~~SLVllDElGr 740 (920) ..+++.+.+. +++. ..|.+. . +.|+.++.. ..--+..-.++|+||--. T Consensus 108 ~~ltv~e~l~--~~~~---~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSgG~rqrv~ia~aL~~~P~lllLDEPt~ 182 (236) T cd03267 108 WDLPVIDSFY--LLAA---IYDLPPARFKKRLDELSELLDLEELLDTPVRQLSLGQRMRAEIAAALLHEPEILFLDEPTI 182 (236) T ss_pred CCCCHHHHHH--HHHH---HHCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 7993999999--9999---8573899999999999997486877549345699999999999999967999999979876 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 988056799999999999972698499974875797-6643 Q gi|254780750|r 741 GTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 741 GTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |=++.--..+ |..+..+....+.-++++||.-+.. .+.+ T Consensus 183 gLD~~~~~~i-~~~l~~l~~~~g~till~tH~l~ev~~~~D 222 (236) T cd03267 183 GLDVVAQENI-RNFLKEYNRERGTTVLLTSHYMKDIEALAR 222 (236) T ss_pred CCCHHHHHHH-HHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 8899999999-999999997389899998887899999799 No 49 >PRK11614 livF leucine/isoleucine/valine transporter ATP-binding subunit; Provisional Probab=98.10 E-value=6.3e-05 Score=55.25 Aligned_cols=133 Identities=28% Similarity=0.330 Sum_probs=75.6 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH---------------------HHHHHCCCCCCHHHCCCCCC Q ss_conf 443456358777766439999677844078999999999---------------------99997198530353206822 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI---------------------VIMAQMGSYVPASYAHIGIV 693 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~---------------------vilAQiG~fVPA~~a~i~~~ 693 (920) |=+||+|.. ..+.+.-|-|||-+||||++|.++=. -++.+..+|||-. T Consensus 20 ~L~~isl~i----~~Gei~~liG~NGaGKSTLl~~i~G~~~~~~G~I~~~G~~i~~~~~~~~~r~~i~~vpq~------- 88 (237) T PRK11614 20 ALHEVSLHI----NQGEIVTLIGANGAGKTTLLGTLCGDPRATSGRIVFDGKDITDWQTAKIMREAVAIVPEG------- 88 (237) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHHCEEEECCC------- T ss_conf 881127898----699799998799975999999996799889628999999888799899987064783556------- Q ss_pred CEEEEEEECCCCCCCC-----CCHHHHHHHHHHH-------------------------H-HHHCCCCCEEEEECCCCCC Q ss_conf 1056765237661138-----5328999999999-------------------------9-9958998569993258898 Q gi|254780750|r 694 DKLFSRVGSADNLASG-----RSTFMVEMIETAS-------------------------I-LNQATNQSFVILDEIGRGT 742 (920) Q Consensus 694 D~IftRiGa~D~l~~g-----~STF~vEm~e~~~-------------------------I-L~~at~~SLVllDElGrGT 742 (920) ..+|.++-..+|+.-| ...+...+.+... | .--+..-.++|+||--.|= T Consensus 89 ~~~~~~ltv~enl~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~LSgG~~Qrv~iAraL~~~P~lLlLDEPt~gL 168 (237) T PRK11614 89 RRVFSRMTVEENLAMGGFFAERDQFQERIKWVYELFPRLHERRIQRAGTMSGGEQQMLAIGRALMSQPRLLLLDEPSLGL 168 (237) T ss_pred CCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 64577788999987510137867899999999986555567654422348998859999999985699999995975579 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH-HHHHH Q ss_conf 805679999999999997269849997487579-76643 Q gi|254780750|r 743 ATLDGLSIAWATIEYLHETNRCRGLLATHFHEL-TDLSK 780 (920) Q Consensus 743 st~DG~aiA~aile~l~~~~~~~~lfaTHy~eL-~~l~~ 780 (920) ++..-..| |.+++.|.+. ++-++++||.-+. ..+++ T Consensus 169 D~~~~~~i-~~~l~~l~~~-g~tii~vsH~l~~~~~~aD 205 (237) T PRK11614 169 APIIIQQI-FDTIEQLREQ-GMTIFLVEQNANQALKLAD 205 (237) T ss_pred CHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCC T ss_conf 99999999-9999999967-9999999475899999699 No 50 >cd03213 ABCG_EPDR ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus. Probab=98.08 E-value=0.00011 Score=53.57 Aligned_cols=132 Identities=22% Similarity=0.289 Sum_probs=70.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCC-HHHCCCCCCC------EEEEEEE--CCCC Q ss_conf 44345635877776643999967784407899999999999997198530-3532068221------0567652--3766 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVP-ASYAHIGIVD------KLFSRVG--SADN 705 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVP-A~~a~i~~~D------~IftRiG--a~D~ 705 (920) |=+|+++.. ..+.+..|.|||-+||||+||.++ |.-.| +.+-++.+-. .+-.+|| .+|+ T Consensus 24 iL~~vs~~v----~~Gei~~ilGpnGaGKSTLl~~l~--------Gl~~~~~~~G~i~~~g~~~~~~~~~~~ig~v~Q~~ 91 (194) T cd03213 24 LLKNVSGKA----KPGELTAIMGPSGAGKSTLLNALA--------GRRTGLGVSGEVLINGRPLDKRSFRKIIGYVPQDD 91 (194) T ss_pred EEECCEEEE----ECCEEEEEECCCCCHHHHHHHHHH--------CCCCCCCCEEEEEECCEECCHHHHHHEEEEECCCC T ss_conf 788838899----088199999899951999999985--------77778996289999999997578431289984665 Q ss_pred -CCCCCCH-----HHHH--------HHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf -1138532-----8999--------9999999995899856999325889880567999999999999726984999748 Q gi|254780750|r 706 -LASGRST-----FMVE--------MIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATH 771 (920) Q Consensus 706 -l~~g~ST-----F~vE--------m~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTH 771 (920) +....+- |..+ ...++-..--+..-+++|+||--.|-++..-..| |..+..+.+. +..++++|| T Consensus 92 ~l~~~ltv~e~l~~~a~l~~LSgGqrqRv~iA~aL~~~P~illlDEPTsgLD~~~~~~i-~~~l~~l~~~-g~tvi~~tH 169 (194) T cd03213 92 ILHPTLTVRETLMFAAKLRGLSGGERKRVSIALELVSNPSLLFLDEPTSGLDSSSALQV-MSLLRRLADT-GRTIICSIH 169 (194) T ss_pred CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEEC T ss_conf 23776849999999987269888999999999999639988999489878898999999-9999999968-989999958 Q ss_pred CH--HHHHHHH Q ss_conf 75--7976643 Q gi|254780750|r 772 FH--ELTDLSK 780 (920) Q Consensus 772 y~--eL~~l~~ 780 (920) .. ++.++++ T Consensus 170 ~~~~~~~~~~D 180 (194) T cd03213 170 QPSSEIFELFD 180 (194) T ss_pred CCHHHHHHHCC T ss_conf 88599999799 No 51 >cd03299 ABC_ModC_like Archeal protein closely related to ModC. ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=98.07 E-value=9.9e-05 Score=53.76 Aligned_cols=133 Identities=24% Similarity=0.306 Sum_probs=73.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCE------------------ Q ss_conf 44434563587777664399996778440789999999999999719853035320682210------------------ Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDK------------------ 695 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~------------------ 695 (920) ++=+|++|.. +.+.+..|-|||-+||||+||.++ |..=| ++-++.+-++ T Consensus 13 ~~L~~vs~~v----~~Ge~~~iiGpSGsGKSTLlr~i~--------Gl~~p-~~G~I~~~G~di~~~~~~~r~ig~vfQ~ 79 (235) T cd03299 13 FKLKNVSLEV----ERGDYFVILGPTGSGKSVLLETIA--------GFIKP-DSGKILLNGKDITNLPPEKRDISYVPQN 79 (235) T ss_pred CEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCC-CCEEEEECCEECCCCCHHHCCEEEECCC T ss_conf 4990148798----899899999999635999999997--------49999-9659999999999999767897894579 Q ss_pred --EEEEEECCCCCCCC-----CCHH--------HHHHHHHHHHHH-------------------HCCCCCEEEEECCCCC Q ss_conf --56765237661138-----5328--------999999999999-------------------5899856999325889 Q gi|254780750|r 696 --LFSRVGSADNLASG-----RSTF--------MVEMIETASILN-------------------QATNQSFVILDEIGRG 741 (920) Q Consensus 696 --IftRiGa~D~l~~g-----~STF--------~vEm~e~~~IL~-------------------~at~~SLVllDElGrG 741 (920) +|-.+-..|||.-| .+.- ..||..+...++ -+.+-.++|+||--.| T Consensus 80 ~~Lfp~~tV~eNi~~~l~~~~~~~~e~~~rv~e~l~~~gl~~~~~~~p~~LSGGq~QRVaiARAl~~~P~llllDEP~s~ 159 (235) T cd03299 80 YALFPHMTVYKNIAYGLKKRKVDKKEIERKVLEIAEMLGIDHLLNRKPETLSGGEQQRVAIARALVVNPKILLLDEPFSA 159 (235) T ss_pred CCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 86689990999999999876999999999999999877997787489445899999999999999738998999288764 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 880567999999999999726984999748757976-643 Q gi|254780750|r 742 TATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 742 Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) =++.-...| +..+..|.+..+..++|+||..+.+. +++ T Consensus 160 LD~~~~~~i-~~~l~~l~~~~~~T~i~vTHd~~~a~~~aD 198 (235) T cd03299 160 LDVRTKEKL-REELKKIRKEFGVTVLHVTHDFEEAWALAD 198 (235) T ss_pred CCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 699999999-999999999829999998789999999699 No 52 >PRK13547 hmuV hemin importer ATP-binding subunit; Provisional Probab=98.07 E-value=0.00015 Score=52.48 Aligned_cols=139 Identities=22% Similarity=0.238 Sum_probs=77.2 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH-H---------------------------HHHHHCCCCCCH Q ss_conf 444345635877776643999967784407899999999-9---------------------------999971985303 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL-I---------------------------VIMAQMGSYVPA 685 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval-~---------------------------vilAQiG~fVPA 685 (920) -|=||++|.. ..+.+.-|-|||-+||||++|.++= + .-+++.+.++|- T Consensus 15 ~vL~~vsl~i----~~Ge~~aliG~nGaGKSTLl~~i~G~l~~~~~~~g~~~~G~i~~~g~~i~~~~~~~~~~~~~~~~q 90 (273) T PRK13547 15 AILRDLSLRI----EPGRVTALLGRNGAGKSTLLKVLAGELTGGGAPRGVRVTGDVALNGEPLAAIDAPRLARLRAVLPQ 90 (273) T ss_pred EEEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCEECCCCCHHHHHHHHEEECC T ss_conf 9997608899----899899999999976999999995678876566775247799999998553999999774258643 Q ss_pred HH---CCCCCCCEEE-------------------------EEEECCCCC---CCCCCHHHHHHHHHHHHHHH-------- Q ss_conf 53---2068221056-------------------------765237661---13853289999999999995-------- Q gi|254780750|r 686 SY---AHIGIVDKLF-------------------------SRVGSADNL---ASGRSTFMVEMIETASILNQ-------- 726 (920) Q Consensus 686 ~~---a~i~~~D~If-------------------------tRiGa~D~l---~~g~STF~vEm~e~~~IL~~-------- 726 (920) .. ..+++.+.++ .+.|..+-. ....|-=+.-....|..|-+ T Consensus 91 ~~~~~~~~~v~~~v~~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~r~~~~LSGGq~qRv~iAraL~~l~~~~~al 170 (273) T PRK13547 91 AAQPAFPFSVDEIVLLGRYPHARRAGALTHRDREIAWQALARAGATALVGRDVTTLSGGELARVQFARVLAQLWPPDPGA 170 (273) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCCHHCCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 24555677599999851223322024114899999999998649754527871126999999999999998510111013 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 899856999325889880567999999999999726984999748757976 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) +++-.++|+||--.|=++..-.-| +..+..|.+..+.-++|+||..++.. T Consensus 171 ~~~P~lLlLDEPts~LD~~~~~~i-~~~l~~l~~e~g~tvl~vtHdl~~~~ 220 (273) T PRK13547 171 AQPPRYLLLDEPTAALDLAHQHRL-LDTVRRLAREWNLGVLAIVHDPNLAA 220 (273) T ss_pred CCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHHCCCEEEEEECCHHHHH T ss_conf 689868997287444899999999-99999999837989999988999999 No 53 >COG0396 sufC Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones] Probab=98.05 E-value=2.2e-05 Score=58.61 Aligned_cols=133 Identities=31% Similarity=0.400 Sum_probs=83.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC----------------CHHHCCCCCCCE--- Q ss_conf 4434563587777664399996778440789999999999999719853----------------035320682210--- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV----------------PASYAHIGIVDK--- 695 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fV----------------PA~~a~i~~~D~--- 695 (920) |=.+++|.. +.+.++.|-|||-+||||+...+ |-+-+.=| |.+.|+.|+|=. T Consensus 19 ILkgvnL~v----~~GEvhaiMGPNGsGKSTLa~~i-----~G~p~Y~Vt~G~I~~~GedI~~l~~~ERAr~GifLafQ~ 89 (251) T COG0396 19 ILKGVNLTV----KEGEVHAIMGPNGSGKSTLAYTI-----MGHPKYEVTEGEILFDGEDILELSPDERARAGIFLAFQY 89 (251) T ss_pred HHCCCCEEE----CCCCEEEEECCCCCCHHHHHHHH-----HCCCCCEEECCEEEECCCCCCCCCHHHHHHCCCEEEECC T ss_conf 003741467----59828999889987889999997-----289974675556998785425599868886187765117 Q ss_pred ----------EEEEEECCCCCCCC----CCHHHHHHHHHHHHHH--------------------------H-CCCCCEEE Q ss_conf ----------56765237661138----5328999999999999--------------------------5-89985699 Q gi|254780750|r 696 ----------LFSRVGSADNLASG----RSTFMVEMIETASILN--------------------------Q-ATNQSFVI 734 (920) Q Consensus 696 ----------IftRiGa~D~l~~g----~STF~vEm~e~~~IL~--------------------------~-at~~SLVl 734 (920) -|-|.+.+. ..+ ..-|.-++.|.+..|. - +=+-.|+| T Consensus 90 P~ei~GV~~~~fLr~a~n~--~~~~~~~~~~~~~~~~e~~~~l~~~~~~l~R~vN~GFSGGEkKR~EilQ~~~lePkl~I 167 (251) T COG0396 90 PVEIPGVTNSDFLRAAMNA--RRGARGILPEFIKELKEKAELLGLDEEFLERYVNEGFSGGEKKRNEILQLLLLEPKLAI 167 (251) T ss_pred CCCCCCEEHHHHHHHHHHH--HHCCCCCCHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCEEE T ss_conf 7547780099999999975--40356433889999999998839998996165677747315779999999845998899 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 9325889880567999999999999726984999748757976643 Q gi|254780750|r 735 LDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 735 lDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) |||.-.|-+ -|++.+..-.++.|.+. +.-+|..|||.+|.++-. T Consensus 168 LDE~DSGLD-Idalk~V~~~i~~lr~~-~~~~liITHy~rll~~i~ 211 (251) T COG0396 168 LDEPDSGLD-IDALKIVAEGINALREE-GRGVLIITHYQRLLDYIK 211 (251) T ss_pred ECCCCCCCC-HHHHHHHHHHHHHHHCC-CCEEEEEECHHHHHHHCC T ss_conf 558876755-89999999999998658-972999955799984268 No 54 >cd03222 ABC_RNaseL_inhibitor The ABC ATPase RNase L inhibitor (RLI) is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins, and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains, which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=98.04 E-value=0.00012 Score=53.05 Aligned_cols=111 Identities=23% Similarity=0.270 Sum_probs=66.6 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECC-C--CCCCCCCHHHHHHHHHHHHH Q ss_conf 66439999677844078999999999999971985303532068221056765237-6--61138532899999999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSA-D--NLASGRSTFMVEMIETASIL 724 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~-D--~l~~g~STF~vEm~e~~~IL 724 (920) ..+.+.-|-|||-+||||++|.++ |..-|-+.- +. +|.+ ++.-. . +|+.|+ ...++-.. T Consensus 23 ~~GEiv~ilGpNGaGKSTllk~i~--------G~l~p~~G~-i~-~~g~--~~~~~pq~~~LSGGq------rQRv~iAr 84 (177) T cd03222 23 KEGEVIGIVGPNGTGKTTAVKILA--------GQLIPNGDN-DE-WDGI--TPVYKPQYIDLSGGE------LQRVAIAA 84 (177) T ss_pred CCCCEEEEECCCCCCHHHHHHHHH--------CCCCCCCCC-EE-ECCC--CEECCCCCCCCCHHH------HHHHHHHH T ss_conf 899899998999999999999996--------886788994-66-6686--122155515079899------99999999 Q ss_pred HHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 95899856999325889880567999999999999726984999748757976 Q gi|254780750|r 725 NQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 725 ~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) --+..-.++|+||--.|-++.--.. .+.+++.+.+..+..++++||.-+... T Consensus 85 al~~~p~lllLDEPts~LD~~~r~~-i~~~ik~l~~~~~~Tvl~vsHdl~~a~ 136 (177) T cd03222 85 ALLRNATFYLFDEPSAYLDIEQRLN-AARAIRRLSEEGKKTALVVEHDLAVLD 136 (177) T ss_pred HHHCCCCEEEECCCCCCCCHHHHHH-HHHHHHHHHHHCCCEEEEECCCHHHHH T ss_conf 9823999999748865389999999-999999999965977999858899999 No 55 >PRK10247 putative ABC transporter ATP-binding protein YbbL; Provisional Probab=98.03 E-value=9.3e-05 Score=53.99 Aligned_cols=141 Identities=22% Similarity=0.259 Sum_probs=69.2 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHHHCCC-- Q ss_conf 044434563587777664399996778440789999999999--------------------9997198530353206-- Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPASYAHI-- 690 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~~a~i-- 690 (920) .-|=+|+++.. ..+.+..|.|||-+||||++|.++=+. .+.|--+|||-...-+ T Consensus 20 ~~iL~~is~~i----~~Ge~~~i~G~sGsGKSTLlk~i~gl~~p~~G~I~~~g~~i~~~~~~~~r~~i~~v~Q~~~l~~~ 95 (225) T PRK10247 20 TKILNNINFSL----RAGEFKLITGPSGCGKSTLLKIVASLISPTSGTLLFEGEDISTLKPEIYRQQVSYCAQTPTLFGD 95 (225) T ss_pred EEEEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHHCCEEECCCCCCHH T ss_conf 89994517998----59969999999999999999999646688876599999997749999998527457045543415 Q ss_pred CCCCEEE-------------------EEEECCCCC-CCCCCHHHHHHHHHHHHHH-HCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 8221056-------------------765237661-1385328999999999999-589985699932588988056799 Q gi|254780750|r 691 GIVDKLF-------------------SRVGSADNL-ASGRSTFMVEMIETASILN-QATNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 691 ~~~D~If-------------------tRiGa~D~l-~~g~STF~vEm~e~~~IL~-~at~~SLVllDElGrGTst~DG~a 749 (920) ++.|.|. .++|-.+++ .+.-+++.--+.+--.|.+ -+..-.++|+||.-.|=++..-.. T Consensus 96 tv~eni~~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~~LSGGqkQRv~iARaL~~~p~iLllDEPts~LD~~~~~~ 175 (225) T PRK10247 96 TVYDNLIFPWQIRNQQPDPAIFLDDLERFALPDTILTKNIAELSGGEKQRISLIRNLQFMPKVLLLDEITSALDESNKHN 175 (225) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHH T ss_conf 39999985787667667899999999875995667618811189999999999999860999999959766689999999 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHHHHHH Q ss_conf 99999999997269849997487579766 Q gi|254780750|r 750 IAWATIEYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 750 iA~aile~l~~~~~~~~lfaTHy~eL~~l 778 (920) | +..+..+.+..+.-++|+||..+.... T Consensus 176 i-~~~i~~l~~~~~~tvi~vtHd~~~~~~ 203 (225) T PRK10247 176 V-NEIIHRYVREQNIAVLWVTHDKDEINH 203 (225) T ss_pred H-HHHHHHHHHHCCCEEEEEECCHHHHHC T ss_conf 9-999999998389899999039999970 No 56 >PRK13544 consensus Probab=98.02 E-value=6e-05 Score=55.38 Aligned_cols=133 Identities=15% Similarity=0.202 Sum_probs=68.9 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH-------------------HHHCCCCCCHHHC---CCC Q ss_conf 444345635877776643999967784407899999999999-------------------9971985303532---068 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI-------------------MAQMGSYVPASYA---HIG 691 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi-------------------lAQiG~fVPA~~a---~i~ 691 (920) -|=+|+++.. ..+.+..|.|||-+||||+||.++=+.- +.+++ |||-+.+ .++ T Consensus 15 ~il~~vs~~i----~~Gei~~l~G~NGsGKSTLl~~i~Gl~~p~~G~I~~~G~~i~~~~~~~~~i~-~~~q~~~~~~~lt 89 (208) T PRK13544 15 VLFSNLSFTA----KQNSLTLVIGNNGSGKTSLLRLLAGLIPITHGNIKYNGENIHDSDTYITSIT-YIGHKNACNDNLT 89 (208) T ss_pred EEEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCHHHHCCEE-EECCCCCCCCCCC T ss_conf 9994415898----2994999999999989999999958806897489999999987837660727-8766444576778 Q ss_pred CCCEE--EEEEECCCCCC-----------------CCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 22105--67652376611-----------------385328999999999999589985699932588988056799999 Q gi|254780750|r 692 IVDKL--FSRVGSADNLA-----------------SGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAW 752 (920) Q Consensus 692 ~~D~I--ftRiGa~D~l~-----------------~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~ 752 (920) +.+.+ +.++-..+++. ...|.=+.-....+.. -+....++|+||--.|=++.- ..+.+ T Consensus 90 v~e~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~LSgG~kqrv~la~a--L~~~~~illLDEPt~gLD~~s-~~~i~ 166 (208) T PRK13544 90 VLENIEFWANIRNTHELIAAAICFFELQPVLDIKYKELSSGWKRRVALSRL--LIYNTNVWIIDEPFANLDSAT-KELIL 166 (208) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHH--HHCCCCEEEEECCCCCCCHHH-HHHHH T ss_conf 999999998617838999999998499788727153579999999999999--856999999979866689999-99999 Q ss_pred HHHHHHHHHCCCEEEEECCCHHH Q ss_conf 99999997269849997487579 Q gi|254780750|r 753 ATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 753 aile~l~~~~~~~~lfaTHy~eL 775 (920) .++..+.+. +..++++||.-+- T Consensus 167 ~~i~~~~~~-g~~vIi~sHd~~e 188 (208) T PRK13544 167 ELILTRLEQ-NGIVIISDHSKTE 188 (208) T ss_pred HHHHHHHHC-CCEEEEECCCHHH T ss_conf 999999868-9999998699999 No 57 >cd03215 ABC_Carb_Monos_II This family represents domain II of the carbohydrate uptake proteins that transport only monosaccharides (Monos). The Carb_Monos family is involved in the uptake of monosaccharides, such as pentoses (such as xylose, arabinose, and ribose) and hexoses (such as xylose, arabinose, and ribose), that cannot be broken down to simple sugars by hydrolysis. In members of Carb_Monos family the single hydrophobic gene product forms a homodimer, while the ABC protein represents a fusion of two nucleotide-binding domains. However, it is assumed that two copies of the ABC domains are present in the assembled transporter. Probab=98.02 E-value=7.1e-05 Score=54.86 Aligned_cols=135 Identities=19% Similarity=0.172 Sum_probs=70.6 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH--------------------HHHCC-CCCCHHHCCCCC Q ss_conf 444345635877776643999967784407899999999999--------------------99719-853035320682 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI--------------------MAQMG-SYVPASYAHIGI 692 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi--------------------lAQiG-~fVPA~~a~i~~ 692 (920) .+=+||+|.. ..+.+.-|-|||-+||||++|.++=+.- +.+.| .|||.+... T Consensus 14 ~aL~~vsl~v----~~GEi~~liG~nGaGKSTll~~l~G~~~p~~G~I~~~G~~~~~~~~~~~~~~~i~~vp~~r~~--- 86 (182) T cd03215 14 GAVRDVSFEV----RAGEIVGIAGLVGNGQTELAEALFGLRPPASGEITLDGKPVTRRSPRDAIRAGIAYVPEDRKR--- 86 (182) T ss_pred CCEECEEEEE----CCCCEEEEECCCCCCCCHHHHHHCCCCCCCCCEEEECCEECCCCCHHHHHHCCEEEEEHHHHH--- T ss_conf 8762317898----599699998889999263778766986788775999999988649999997896996020766--- Q ss_pred CCEEEEEEECCCCCCCC--CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEC Q ss_conf 21056765237661138--5328999999999999589985699932588988056799999999999972698499974 Q gi|254780750|r 693 VDKLFSRVGSADNLASG--RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLAT 770 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g--~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaT 770 (920) ..+|..+...+|+.-. .|-=+.-....|.. -+..-.++|+||--.|=++.-- .--|..+..+.+. ++-++|+| T Consensus 87 -~~l~~~~~v~en~~~~~~LSGG~~Qrv~lAra--l~~~p~llllDEPT~gLD~~~~-~~i~~~i~~l~~~-g~tvi~is 161 (182) T cd03215 87 -EGLVLDLSVAENIALSSLLSGGNQQKVVLARW--LARDPRVLILDEPTRGVDVGAK-AEIYRLIRELADA-GKAVLLIS 161 (182) T ss_pred -CCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHH--HHCCCCEEEECCCCCCCCHHHH-HHHHHHHHHHHHC-CCEEEEEE T ss_conf -25678990999951855799899999999999--9719999998687545899999-9999999999978-99999996 Q ss_pred CCHHH-HHHHH Q ss_conf 87579-76643 Q gi|254780750|r 771 HFHEL-TDLSK 780 (920) Q Consensus 771 Hy~eL-~~l~~ 780 (920) |+-+. ..+.+ T Consensus 162 Hdl~~~~~~~D 172 (182) T cd03215 162 SELDELLGLCD 172 (182) T ss_pred CCHHHHHHHCC T ss_conf 87999999799 No 58 >cd03214 ABC_Iron-Siderophores_B12_Hemin ABC transporters, involved in the uptake of siderophores, heme, and vitamin B12, are widely conserved in bacteria and archaea. Only very few species lack representatives of the siderophore family transporters. The E. coli BtuCD protein is an ABC transporter mediating vitamin B12 uptake. The two ATP-binding cassettes (BtuD) are in close contact with each other, as are the two membrane-spanning subunits (BtuC); this arrangement is distinct from that observed for the E. coli lipid flippase MsbA. The BtuC subunits provide 20 transmembrane helices grouped around a translocation pathway that is closed to the cytoplasm by a gate region, whereas the dimer arrangement of the BtuD subunits resembles the ATP-bound form of the Rad50 DNA repair enzyme. A prominent cytoplasmic loop of BtuC forms the contact region with the ATP-binding cassette and represent a conserved motif among the ABC transporters. Probab=98.01 E-value=0.0001 Score=53.63 Aligned_cols=127 Identities=21% Similarity=0.254 Sum_probs=69.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHH Q ss_conf 44345635877776643999967784407899999999999997198530353206822105676523766113853289 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFM 714 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~ 714 (920) |=+|+++.. ..+.+.-|.|||-+||||++|.++ |.+-|. +-++-+.++-.+.+.. ..+.+..+ |. T Consensus 14 il~~is~~i----~~Ge~~~liG~nGsGKTTLl~~i~--------G~~~~~-~G~I~~~g~~i~~~~~-~~~~~~i~-~v 78 (180) T cd03214 14 VLDDLSLSI----EAGEIVGILGPNGAGKSTLLKTLA--------GLLKPS-SGEILLDGKDLASLSP-KELARKIA-YV 78 (180) T ss_pred EEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCCC-CCEEEECCEECCCCCH-HHHHCCCC-HH T ss_conf 880437788----699799999899988999999995--------798998-7289999999896999-99955464-99 Q ss_pred ---HH-------------------HHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf ---99-------------------99999999958998569993258898805679999999999997269849997487 Q gi|254780750|r 715 ---VE-------------------MIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 715 ---vE-------------------m~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) .| ...++-..--+.+-.++|+||--.|=++.--.. -+.++..+.+..+..++++||+ T Consensus 79 ~Q~l~~~~l~~~~~~~~~~LSGGqkQrv~iA~aL~~~P~ililDEPts~LD~~~~~~-i~~~i~~l~~~~~~tii~itHd 157 (180) T cd03214 79 PQALELLGLAHLADRPFNELSGGERQRVLLARALAQEPPILLLDEPTSHLDIAHQIE-LLELLRRLARERGKTVVMVLHD 157 (180) T ss_pred HHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHH-HHHHHHHHHHHCCCEEEEEECC T ss_conf 999998599778649910379999999999999986896478858754479999999-9999999998469899999079 Q ss_pred HHHHH Q ss_conf 57976 Q gi|254780750|r 773 HELTD 777 (920) Q Consensus 773 ~eL~~ 777 (920) -++.. T Consensus 158 l~~~~ 162 (180) T cd03214 158 LNLAA 162 (180) T ss_pred HHHHH T ss_conf 89999 No 59 >cd03223 ABCD_peroxisomal_ALDP Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asympt Probab=97.99 E-value=0.00025 Score=50.80 Aligned_cols=120 Identities=28% Similarity=0.306 Sum_probs=67.6 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH-CCCCCCCEEEEEE--------E---- Q ss_conf 44345635877776643999967784407899999999999997198530353-2068221056765--------2---- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY-AHIGIVDKLFSRV--------G---- 701 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~-a~i~~~D~IftRi--------G---- 701 (920) +=+|+++.. ..+....|+|||-+||||++|.+ +|.+-|-+- -.++.-.+|.- + | T Consensus 16 il~~isl~i----~~Ge~v~i~G~sGsGKSTLl~~l--------~Gl~~~~~G~i~~~~~~~i~~-v~Q~~~l~~~tl~e 82 (166) T cd03223 16 LLKDLSFEI----KPGDRLLITGPSGTGKSSLFRAL--------AGLWPWGSGRIGMPEGEDLLF-LPQRPYLPLGTLRE 82 (166) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHH--------CCCCCCCCCEEEECCCCCEEE-ECCCCCCCCCCHHH T ss_conf 894458898----89999999958999889999998--------698769986799769987999-85646658875999 Q ss_pred -----CCCCCCCCCCHHHHHHHHH--HHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf -----3766113853289999999--999995899856999325889880567999999999999726984999748757 Q gi|254780750|r 702 -----SADNLASGRSTFMVEMIET--ASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 702 -----a~D~l~~g~STF~vEm~e~--~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) -.+.|+.| |...+ |..| +.+-.++|+||- ||.-|-.+. ..+++.|.+. +..+++.||-.+ T Consensus 83 ~l~~p~~~~LSGG------qkQRvalARal--~~~p~iliLDEp---Ts~LD~~~~-~~l~~~l~~~-~~Tvi~VtH~~~ 149 (166) T cd03223 83 QLIYPWDDVLSGG------EQQRLAFARLL--LHKPKFVFLDEA---TSALDEESE-DRLYQLLKEL-GITVISVGHRPS 149 (166) T ss_pred HHHCCCCCCCCHH------HHHHHHHHHHH--HCCCCEEEECCC---CCCCCHHHH-HHHHHHHHHC-CCEEEEEECCHH T ss_conf 9636154678999------99999999999--649999997585---332899999-9999999977-998999943469 Q ss_pred HHHHHH Q ss_conf 976643 Q gi|254780750|r 775 LTDLSK 780 (920) Q Consensus 775 L~~l~~ 780 (920) +..+++ T Consensus 150 ~~~~aD 155 (166) T cd03223 150 LWKFHD 155 (166) T ss_pred HHHCCC T ss_conf 997299 No 60 >cd03229 ABC_Class3 This class is comprised of all BPD (Binding Protein Dependent) systems that are largely represented in archaea and eubacteria and are primarily involved in scavenging solutes from the environment. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.99 E-value=0.00018 Score=51.91 Aligned_cols=130 Identities=26% Similarity=0.288 Sum_probs=78.8 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCC-------------------- Q ss_conf 444345635877776643999967784407899999999999997198530353206822-------------------- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIV-------------------- 693 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~-------------------- 693 (920) -|=+|+++.. ..+.+..|.||+-+||||+||.++ |..-|- +-++-+- T Consensus 14 ~~l~~vs~~i----~~Ge~~~ivGpSG~GKSTllr~i~--------Gl~~p~-~G~I~~~g~~i~~~~~~~~~~rr~ig~ 80 (178) T cd03229 14 TVLNDVSLNI----EAGEIVALLGPSGSGKSTLLRCIA--------GLEEPD-SGSILIDGEDLTDLEDELPPLRRRIGM 80 (178) T ss_pred EEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCC-CEEEEECCEECCCCCCHHHHHCCCEEE T ss_conf 9983707698----899899999999983999999998--------599999-639999999998886102454177599 Q ss_pred ----CEEEEEEECCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf ----10567652376611385328999999999999-5899856999325889880567999999999999726984999 Q gi|254780750|r 694 ----DKLFSRVGSADNLASGRSTFMVEMIETASILN-QATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLL 768 (920) Q Consensus 694 ----D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~-~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lf 768 (920) -.+|.++-..||+.-+.|-=| .+=..|-| -+++-.++|+||--.+-++.-...| +..+..+.+..+..+++ T Consensus 81 vFQ~~~L~p~~tv~eNv~~~LSGGq---~QRvaIARAL~~~P~ill~DEPts~LD~~~~~~i-~~~l~~l~~~~~~t~i~ 156 (178) T cd03229 81 VFQDFALFPHLTVLENIALGLSGGQ---QQRVALARALAMDPDVLLLDEPTSALDPITRREV-RALLKSLQAQLGITVVL 156 (178) T ss_pred EECCCCCCCCCCHHHHHCEECCCHH---HHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEE T ss_conf 9269988998928996008177268---8999999998529999997089764799999999-99999999964999999 Q ss_pred ECCCHHHH-HHHH Q ss_conf 74875797-6643 Q gi|254780750|r 769 ATHFHELT-DLSK 780 (920) Q Consensus 769 aTHy~eL~-~l~~ 780 (920) +||..+.+ .+++ T Consensus 157 vTHd~~~a~~~aD 169 (178) T cd03229 157 VTHDLDEAARLAD 169 (178) T ss_pred ECCCHHHHHHHCC T ss_conf 9899999998699 No 61 >PRK13540 cytochrome c biogenesis protein CcmA; Provisional Probab=97.98 E-value=0.0002 Score=51.48 Aligned_cols=136 Identities=18% Similarity=0.165 Sum_probs=64.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH--------------------HHHHHCCCCCCHHHC---CCC Q ss_conf 443456358777766439999677844078999999999--------------------999971985303532---068 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI--------------------VIMAQMGSYVPASYA---HIG 691 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~--------------------vilAQiG~fVPA~~a---~i~ 691 (920) |=.||++.. ..+.+..|+|||-+||||++|.++=+ ..-.|++ |||-... .++ T Consensus 16 vl~~is~~i----~~G~i~~l~G~NGaGKSTLlkli~Gl~~p~~G~I~~~~~~i~~~~~~~~~~~~-~~~~~~~i~~~lt 90 (200) T PRK13540 16 LLQQISFHL----PAGGLLHLKGSNGAGKTTLLKLIAGLLNPEKGEILFERQSIKKDLCTYQKQLC-FVGHRSGINPYLT 90 (200) T ss_pred EEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCHHHHHHHEE-EECCCCCCCCCCC T ss_conf 881227898----79979999889998799999999778588985699999864634477763378-7444346786776 Q ss_pred CCCEEEEEEECC---------------CCC----CCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 221056765237---------------661----1385328999999999999589985699932588988056799999 Q gi|254780750|r 692 IVDKLFSRVGSA---------------DNL----ASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAW 752 (920) Q Consensus 692 ~~D~IftRiGa~---------------D~l----~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~ 752 (920) +.+.+.-.+... +.+ ....|.=+.-...++..+ +.+..++|+||--.|=+...-..| + T Consensus 91 v~e~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~~LSgGqkqrv~lar~l--~~~p~illLDEPt~gLD~~~~~~i-~ 167 (200) T PRK13540 91 LRENCLYDIHFSPGAVGITELCRLFSLEHLIDYPCGLLSSGQKRQVALLRLW--MSKAKLWLLDEPLVALDELSLLTI-I 167 (200) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHH--HHCCCEEEEECCCCCCCHHHHHHH-H T ss_conf 9999875543383166799999972975564497124999999999999999--839998999177643899999999-9 Q ss_pred HHHHHHHHHCCCEEEEECCCHHHHHHH Q ss_conf 999999972698499974875797664 Q gi|254780750|r 753 ATIEYLHETNRCRGLLATHFHELTDLS 779 (920) Q Consensus 753 aile~l~~~~~~~~lfaTHy~eL~~l~ 779 (920) ..+..+... ++.++++||-.+.-..+ T Consensus 168 ~~l~~~~~~-g~tiii~sH~~~~l~~a 193 (200) T PRK13540 168 TKIQEHRAK-GGAVLLTSHQDLPLNKA 193 (200) T ss_pred HHHHHHHHC-CCEEEEEECCHHHHHHC T ss_conf 999999868-99999994264777768 No 62 >cd03270 ABC_UvrA_I The excision repair protein UvrA domain I; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=97.98 E-value=0.00051 Score=48.49 Aligned_cols=55 Identities=16% Similarity=0.190 Sum_probs=36.4 Q ss_pred HHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 995899856999325889880567999999999999726984999748757976643 Q gi|254780750|r 724 LNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 724 L~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) |-...+..|+|+||--.|=++.....|. .+++.|.+. +..++++||-.++...++ T Consensus 152 L~~~~~~~lliLDEPTsgLD~~~~~~i~-~~i~~l~~~-G~Tvi~VsHd~~~~~~aD 206 (226) T cd03270 152 IGSGLTGVLYVLDEPSIGLHPRDNDRLI-ETLKRLRDL-GNTVLVVEHDEDTIRAAD 206 (226) T ss_pred HHHCCCCCEEEECCCCCCCCHHHHHHHH-HHHHHHHHC-CCEEEEEEECHHHHHHCC T ss_conf 9738987168832873337989999999-999999976-998999972578998489 No 63 >TIGR03608 L_ocin_972_ABC putative bacteriocin export ABC transporter, lactococcin 972 group. A gene pair with a fairly wide distribution consists of a polypeptide related to the lactococcin 972 (see TIGR01653) and multiple-membrane-spanning putative immunity protein (see TIGR01654). This model represents a small clade within the ABC transporters that regularly are found adjacent to these bacteriocin system gene pairs and are likely serve as export proteins. Probab=97.98 E-value=0.00017 Score=51.98 Aligned_cols=130 Identities=19% Similarity=0.251 Sum_probs=70.9 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEE-------------EE Q ss_conf 44434563587777664399996778440789999999999999719853035320682210567-------------65 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFS-------------RV 700 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~Ift-------------Ri 700 (920) .|=+|++|.. ..+.+..|.||+-+||||+||.+|.+ -+.++-++.+.++-.. +| T Consensus 12 ~vL~~vsl~i----~~Ge~~~i~GpSGsGKSTLL~~i~gl---------~~p~sG~i~~~g~~~~~~~~~~~~~~rr~~i 78 (206) T TIGR03608 12 TILDDLNLTI----EKGKMVAIVGESGSGKSTLLNIIGLL---------EKPDSGQVYLNGQETPPINSKKASKFRREKL 78 (206) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCC---------CCCCCEEEEECCEECCCCCHHHHHHHHHCCE T ss_conf 9975807798----69989999879997099999999759---------9989759999999999899889999986588 Q ss_pred EC--CC-CCCCCCCHH--------------------HHHHHHH---------------------HHHHH-HCCCCCEEEE Q ss_conf 23--76-611385328--------------------9999999---------------------99999-5899856999 Q gi|254780750|r 701 GS--AD-NLASGRSTF--------------------MVEMIET---------------------ASILN-QATNQSFVIL 735 (920) Q Consensus 701 Ga--~D-~l~~g~STF--------------------~vEm~e~---------------------~~IL~-~at~~SLVll 735 (920) |- +| +|....+-+ ..|+.+. ..|-| -++.-+++|. T Consensus 79 G~VfQ~~~L~~~ltV~eNi~l~l~~~~~~~~~~~~~~~~~L~~vgl~~~~~~~p~~LSGGe~QRVAIARAL~~~P~illa 158 (206) T TIGR03608 79 GYLFQNFALIENETVEENLDLALKYSKLSKKEKREKKKEALEKVGLNLKLKQKIYELSGGEQQRVALARAILKPSELILA 158 (206) T ss_pred EEECCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHCCCHHCCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 99857987679891999999999865999999999999999986990565299244486999999999998249999996 Q ss_pred ECCCCCCCHHHHH--HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 3258898805679--99999999999726984999748757976643 Q gi|254780750|r 736 DEIGRGTATLDGL--SIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 736 DElGrGTst~DG~--aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) ||- |+..|-. ..-+..+..+.+. +...+++||..++...++ T Consensus 159 DEP---T~~LD~~~~~~i~~ll~~l~~~-g~tii~vTHd~~~a~~~D 201 (206) T TIGR03608 159 DEP---TGSLDPKNRDEVLDLLLELNDE-GKTIIIVTHDPEVAEQAD 201 (206) T ss_pred CCC---CCCCCHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHCC T ss_conf 399---8778999999999999999867-999999898789998699 No 64 >PRK13539 cytochrome c biogenesis protein CcmA; Provisional Probab=97.98 E-value=3.8e-05 Score=56.89 Aligned_cols=46 Identities=17% Similarity=0.085 Sum_probs=32.4 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 899856999325889880567999999999999726984999748757 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) +.+..++|+||--.|=++ ++...-+.+++.+.+. +..++++||-+. T Consensus 142 ~~~p~vllLDEPtsgLD~-~~~~~v~~~i~~~~~~-g~tiIi~tH~p~ 187 (206) T PRK13539 142 VSNRPIWLLDEPTAALDS-ASQALFAELIRAHLAQ-GGIVIAATHIPL 187 (206) T ss_pred HHCCCEEEEECCCCCCCH-HHHHHHHHHHHHHHHC-CCEEEEEECCCC T ss_conf 869898999799777899-9999999999999958-999999938988 No 65 >cd03250 ABCC_MRP_domain1 Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=97.98 E-value=0.00038 Score=49.41 Aligned_cols=35 Identities=31% Similarity=0.479 Sum_probs=28.5 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 044434563587777664399996778440789999999 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .||=+|+++.. ..+.+.-|.|||-+||||++|.++ T Consensus 18 ~~vL~~isl~i----~~Ge~~~IvG~sGsGKSTLl~~i~ 52 (204) T cd03250 18 SFTLKDINLEV----PKGELVAIVGPVGSGKSSLLSALL 52 (204) T ss_pred CCEEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHC T ss_conf 76252148997----699899999999985899999981 No 66 >cd03269 ABC_putative_ATPase This subfamily is involved in drug resistance, nodulation, lipid transport, and bacteriocin and lantibiotic immunity. In eubacteria and archaea, the typical organization consists of one ABC and one or two IMs. Eukaryote systems of the ABCA subfamily display ABC domains strongly similar to this family. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.97 E-value=0.00027 Score=50.53 Aligned_cols=133 Identities=23% Similarity=0.337 Sum_probs=71.4 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH-----------------HHHCCCCCCHHHC---CCCCC Q ss_conf 444345635877776643999967784407899999999999-----------------9971985303532---06822 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI-----------------MAQMGSYVPASYA---HIGIV 693 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi-----------------lAQiG~fVPA~~a---~i~~~ 693 (920) .|=+|++++. ..+.+.=|-|||-+||||++|.++=+.- -..+| |||-+.. .+++. T Consensus 14 ~al~~vs~~v----~~Gei~gllG~NGaGKTTll~~i~Gl~~p~~G~i~i~G~~~~~~~~~~ig-~~pq~~~l~~~ltv~ 88 (210) T cd03269 14 TALDDISFSV----EKGEIFGLLGPNGAGKTTTIRMILGIILPDSGEVLFDGKPLDIAARNRIG-YLPEERGLYPKMKVI 88 (210) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCHHHHCCEE-EECCCCCCCCCCCHH T ss_conf 9975426788----79959999989998499999999600266899899999868844360199-964766679999999 Q ss_pred CEE--EEEE-------------------ECCC---CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 105--6765-------------------2376---611385328999999999999589985699932588988056799 Q gi|254780750|r 694 DKL--FSRV-------------------GSAD---NLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 694 D~I--ftRi-------------------Ga~D---~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~a 749 (920) +.+ |.++ |-.+ ......|-=|.-....+ .--+..-.++|+||--.|=++.- .. T Consensus 89 e~l~~~~~l~g~~~~~~~~~~~~~l~~~~L~~~~~~~~~~LSgG~kqrv~la--~al~~~p~lllLDEPt~gLDp~~-~~ 165 (210) T cd03269 89 DQLVYLAQLKGLKKEEARRRIDEWLERLELSEYANKRVEELSKGNQQKVQFI--AAVIHDPELLILDEPFSGLDPVN-VE 165 (210) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHCCHHHHCCHHHHHHHHHH--HHHHCCCCEEEEECCCCCCCHHH-HH T ss_conf 9999999974999999999999999986997054880577899899999999--99957998999928866799999-99 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 99999999997269849997487579 Q gi|254780750|r 750 IAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 750 iA~aile~l~~~~~~~~lfaTHy~eL 775 (920) .-|.++..+.+. +..++++||.-+- T Consensus 166 ~i~~~i~~~~~~-g~til~ssH~l~e 190 (210) T cd03269 166 LLKDVIRELARA-GKTVILSTHQMEL 190 (210) T ss_pred HHHHHHHHHHHC-CCEEEEECCCHHH T ss_conf 999999999968-9899998884899 No 67 >PRK10575 iron-hydroxamate transporter ATP-binding subunit; Provisional Probab=97.97 E-value=0.00022 Score=51.14 Aligned_cols=136 Identities=21% Similarity=0.207 Sum_probs=68.6 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHHH---CCC Q ss_conf 44434563587777664399996778440789999999999--------------------9997198530353---206 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPASY---AHI 690 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~~---a~i 690 (920) -|=+|++|.. ..+.+.-|-|||-+||||++|.++-.. -+++--+|||-+. ..+ T Consensus 25 ~vL~~vsl~i----~~Ge~~~liG~NGaGKSTLl~~l~gl~~p~~G~I~~~g~~i~~~~~~~~~~~i~~v~Q~~~~~~~~ 100 (265) T PRK10575 25 TLLHPLSLTF----PAGKVTGLIGHNGSGKSTLLKMLGRHQPPSEGEILLDAQPLASWSSKAFARKVAYLPQQLPPAEGM 100 (265) T ss_pred EEEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHHEEEECCCCCCCCCC T ss_conf 9881508898----799899999999980999999995688998738999976567589899874466631124546688 Q ss_pred CCCCEEE----------EEEECCC------------------CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC Q ss_conf 8221056----------7652376------------------61138532899999999999958998569993258898 Q gi|254780750|r 691 GIVDKLF----------SRVGSAD------------------NLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGT 742 (920) Q Consensus 691 ~~~D~If----------tRiGa~D------------------~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGT 742 (920) ++.+.+. .+.++.| ......|-=+--....|.. -+.+-.++|+||--.|= T Consensus 101 tv~e~i~~g~~~~~~~~~~~~~~~~~~v~~~l~~~gl~~~~~~~~~~LSGGq~QRv~iAra--L~~~P~lLlLDEPts~L 178 (265) T PRK10575 101 TVRELVAIGRYPWHGALGRFGAADREKVEEAISLVGLKPLAHRLVDSLSGGERQRAWIAML--VAQDSRCLLLDEPTSAL 178 (265) T ss_pred CHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHCCHHHCCHHHHHHHHHHHH--HHCCCCEEEEECCCCCC T ss_conf 0988887165301123324777799999999998599136516833389999999999998--75699989981776558 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 8056799999999999972698499974875797 Q gi|254780750|r 743 ATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 743 st~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) ++.--.. .+.++..|.+..+.-++++||.-++. T Consensus 179 D~~~~~~-i~~~l~~l~~~~g~tvi~vtHdl~~~ 211 (265) T PRK10575 179 DIAHQVD-VLALVHRLSQERGLTVIAVLHDINMA 211 (265) T ss_pred CHHHHHH-HHHHHHHHHHCCCCEEEEEECCHHHH T ss_conf 9999999-99999999862898999993888999 No 68 >cd03220 ABC_KpsT_Wzt ABC_KpsT_Wzt The KpsT/Wzt ABC transporter subfamily is involved in extracellular polysaccharide export. Among the variety of membrane-linked or extracellular polysaccharides excreted by bacteria, only capsular polysaccharides, lipopolysaccharides, and teichoic acids have been shown to be exported by ABC transporters. A typical system is made of a conserved integral membrane and an ABC. In addition to these proteins, capsular polysaccharide exporter systems require two 'accessory' proteins to perform their function: a periplasmic (E.coli) or a lipid-anchored outer membrane protein called OMA (Neisseria meningitidis and Haemophilus influenzae) and a cytoplasmic membrane protein MPA2. Probab=97.96 E-value=0.00025 Score=50.83 Aligned_cols=132 Identities=22% Similarity=0.259 Sum_probs=67.5 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEE----------CC Q ss_conf 44434563587777664399996778440789999999999999719853035320682210567652----------37 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVG----------SA 703 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiG----------a~ 703 (920) .+=+||+|.. ..+.+.-|-|||-+||||++|.++ |..-| ++-++-+.-++-..++ .. T Consensus 36 ~AL~~isf~i----~~GeivgilG~NGaGKSTLl~~i~--------Gl~~p-~~G~I~i~G~~~~~~~~~~~~~p~ltv~ 102 (224) T cd03220 36 WALKDVSFEV----PRGERIGLIGRNGAGKSTLLRLLA--------GIYPP-DSGTVTVRGRVSSLLGLGGGFNPELTGR 102 (224) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCC-CCCEEEECCEECCCCCCCEECCCCCCHH T ss_conf 9876707898----389899999799981999999997--------58777-8776999989843015742039988299 Q ss_pred CCCC-----CCCCH--------HHHHHHHHHHH-------------------HHHCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 6611-----38532--------89999999999-------------------9958998569993258898805679999 Q gi|254780750|r 704 DNLA-----SGRST--------FMVEMIETASI-------------------LNQATNQSFVILDEIGRGTATLDGLSIA 751 (920) Q Consensus 704 D~l~-----~g~ST--------F~vEm~e~~~I-------------------L~~at~~SLVllDElGrGTst~DG~aiA 751 (920) ||+. .|.+. ...|+.+.... .--+.+-.++|+||.-.|-++. ...-. T Consensus 103 enl~~~~~~~g~~~~~~~~~~~~~l~~~~L~~~~~~~~~~LSgG~kqRl~iA~al~~~P~illLDEPt~gLD~~-~~~~i 181 (224) T cd03220 103 ENIYLNGRLLGLSRKEIDEKIDEIIEFSELGDFIDLPVKTYSSGMKARLAFAIATALEPDILLIDEVLAVGDAA-FQEKC 181 (224) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHH-HHHHH T ss_conf 99999999829658999999999998636755653866546999999999999996699999991886656999-99999 Q ss_pred HHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 9999999972698499974875797-6643 Q gi|254780750|r 752 WATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 752 ~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |..+..+.+. +.-++++||.-+.. .+.+ T Consensus 182 ~~~i~~l~~~-g~tiii~sH~l~~v~~lcd 210 (224) T cd03220 182 QRRLRELLKQ-GKTVILVSHDPSSIKRLCD 210 (224) T ss_pred HHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 9999999858-9999998898899999699 No 69 >PRK10771 thiQ thiamine transporter ATP-binding subunit; Provisional Probab=97.95 E-value=0.00095 Score=46.45 Aligned_cols=133 Identities=21% Similarity=0.266 Sum_probs=68.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH----------------CCCCCC---CE Q ss_conf 44345635877776643999967784407899999999999997198530353----------------206822---10 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY----------------AHIGIV---DK 695 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~----------------a~i~~~---D~ 695 (920) +|.+++|.. ..+.+.-|.|||-+||||++|.++ |..-|.+- -.++.+ .. T Consensus 14 ~~~~isl~i----~~GE~v~iiG~nGaGKSTLl~~i~--------Gll~p~sG~I~i~G~~~~~~~~~~r~i~~v~Q~~~ 81 (233) T PRK10771 14 LPMRFTLTV----ERGEQVAILGPSGAGKSTLLNLIA--------GFLTPASGSLLINGVDHTTTPPSRRPVSMLFQENN 81 (233) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCC T ss_conf 876278898----899899999999981999999996--------59999855999999998879988888799905776 Q ss_pred EEEEEECCCCCCCCCC---HHHHHHH-HHHHHHH----------------------------HCCCCCEEEEECCCCCCC Q ss_conf 5676523766113853---2899999-9999999----------------------------589985699932588988 Q gi|254780750|r 696 LFSRVGSADNLASGRS---TFMVEMI-ETASILN----------------------------QATNQSFVILDEIGRGTA 743 (920) Q Consensus 696 IftRiGa~D~l~~g~S---TF~vEm~-e~~~IL~----------------------------~at~~SLVllDElGrGTs 743 (920) +|..+-..|||.-|.. ..-.+.. ....+++ -+.+-.++|+||--.|=+ T Consensus 82 l~~~ltv~eni~~~~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSGGq~QRv~iAraL~~~P~vLllDEPts~LD 161 (233) T PRK10771 82 LFSHLTVAQNIGLGLNPGLKLNAAQREKLHAIARQMGIENLLARLPGELSGGQRQRVALARCLVREQPILLLDEPFSALD 161 (233) T ss_pred CCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 68899099999878622678768899999999987799668608945599999999999999855999999928775579 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCHH-HHHHHH Q ss_conf 0567999999999999726984999748757-976643 Q gi|254780750|r 744 TLDGLSIAWATIEYLHETNRCRGLLATHFHE-LTDLSK 780 (920) Q Consensus 744 t~DG~aiA~aile~l~~~~~~~~lfaTHy~e-L~~l~~ 780 (920) +.--.. -+..+..|.+..++-++++||.-+ ...+++ T Consensus 162 ~~~~~~-i~~ll~~l~~~~~~til~vtHdl~~~~~~ad 198 (233) T PRK10771 162 PALRQE-MLTLVSQVCQQRQLTLLMVSHSVEDAARIAP 198 (233) T ss_pred HHHHHH-HHHHHHHHHHHCCCEEEEEECCHHHHHHHCC T ss_conf 999999-9999999998369899999248999999699 No 70 >COG0488 Uup ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only] Probab=97.95 E-value=0.0015 Score=44.96 Aligned_cols=134 Identities=25% Similarity=0.222 Sum_probs=75.7 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECC----CCCCCC Q ss_conf 4443456358777766439999677844078999999999999971985303532068221056765237----661138 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSA----DNLASG 709 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~----D~l~~g 709 (920) .|=.++++... .+.-+-|.|||.+||||+||.++=.. =...|........+++-||+-....... |.+..+ T Consensus 336 ~l~~~~s~~i~----~gdrIaiiG~NG~GKSTLlk~l~g~~-~~~~G~v~~g~~v~igyf~Q~~~~l~~~~t~~d~l~~~ 410 (530) T COG0488 336 LLLKDLSFRID----RGDRIAIVGPNGAGKSTLLKLLAGEL-GPLSGTVKVGETVKIGYFDQHRDELDPDKTVLEELSEG 410 (530) T ss_pred EEECCCEEEEC----CCCEEEEECCCCCCHHHHHHHHHCCC-CCCCCEEEECCCEEEEEEEEEHHHCCCCCCHHHHHHHH T ss_conf 46637267765----89889998999877899999985213-56772599579678999870031027667799999864 Q ss_pred CCH--------------HHHHHH-------------HHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC Q ss_conf 532--------------899999-------------99999995899856999325889880567999999999999726 Q gi|254780750|r 710 RST--------------FMVEMI-------------ETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETN 762 (920) Q Consensus 710 ~ST--------------F~vEm~-------------e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~ 762 (920) ... |--|+. .+.-..--..+.-+.||||- |+-.|=-++ .|.-+.|.+-. T Consensus 411 ~~~~~e~~~r~~L~~f~F~~~~~~~~v~~LSGGEk~Rl~La~ll~~~pNvLlLDEP---TNhLDi~s~-eaLe~aL~~f~ 486 (530) T COG0488 411 FPDGDEQEVRAYLGRFGFTGEDQEKPVGVLSGGEKARLLLAKLLLQPPNLLLLDEP---TNHLDIESL-EALEEALLDFE 486 (530) T ss_pred CCCCCHHHHHHHHHHCCCCHHHHCCCHHHCCHHHHHHHHHHHHHCCCCCEEEECCC---CCCCCHHHH-HHHHHHHHHCC T ss_conf 65432899999999849986796395222587799999999985669978997289---876798899-99999998589 Q ss_pred CCEEEEECCCHHHHH Q ss_conf 984999748757976 Q gi|254780750|r 763 RCRGLLATHFHELTD 777 (920) Q Consensus 763 ~~~~lfaTHy~eL~~ 777 (920) |+ +||++|.+.+.+ T Consensus 487 Gt-vl~VSHDr~Fl~ 500 (530) T COG0488 487 GT-VLLVSHDRYFLD 500 (530) T ss_pred CE-EEEEECCHHHHH T ss_conf 86-999948999998 No 71 >TIGR01978 sufC FeS assembly ATPase SufC; InterPro: IPR010230 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufC, which acts as an ATPase in the SUF system. SufC belongs to the ATP-binding cassette transporter family (IPR003439 from INTERPRO) but is no longer thought to be part of a transporter. The complex is reported as cytosolic or associated with the membrane.; GO: 0005524 ATP binding, 0006810 transport. Probab=97.95 E-value=4.3e-05 Score=56.46 Aligned_cols=132 Identities=31% Similarity=0.434 Sum_probs=84.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC----------------CHHHCCCCCC----- Q ss_conf 4434563587777664399996778440789999999999999719853----------------0353206822----- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV----------------PASYAHIGIV----- 693 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fV----------------PA~~a~i~~~----- 693 (920) |=.+++|.. ..+-++.|=|||-+||||+.+++ |-+=.|=| |=+.|+.|+| T Consensus 15 IL~gvnL~v----~~GE~HAiMGPNGsGKSTL~~~i-----aGhp~y~vt~G~I~f~G~Dll~l~~~ERAR~GlFLaFQ~ 85 (248) T TIGR01978 15 ILKGVNLTV----KKGEIHAIMGPNGSGKSTLSKTI-----AGHPKYEVTSGTILFKGQDLLELEPDERARAGLFLAFQY 85 (248) T ss_pred CCCCCCCCC----CCCEEEEEECCCCCCHHHHHHHH-----HCCCCEEEEEEEEEECCEECCCCCHHHHHCCCCEECCCC T ss_conf 167867621----68517998688998478887776-----179933784208987765200189655640565101588 Q ss_pred --------CEEEEEEECCCCC---CCC-----CCHHHHHHHHHHHHHH------HCCCCC-------------------- Q ss_conf --------1056765237661---138-----5328999999999999------589985-------------------- Q gi|254780750|r 694 --------DKLFSRVGSADNL---ASG-----RSTFMVEMIETASILN------QATNQS-------------------- 731 (920) Q Consensus 694 --------D~IftRiGa~D~l---~~g-----~STF~vEm~e~~~IL~------~at~~S-------------------- 731 (920) -..|-|-- =|- .+| ..-|.-+|.+.-..|+ .--.|| T Consensus 86 P~EIPGV~~~~FlR~A--~NA~R~~~G~~~l~~~~F~~~l~~~~~~l~m~~~~d~~l~R~lNeGFSGGEKKrnEILQm~~ 163 (248) T TIGR01978 86 PEEIPGVSNLEFLRSA--LNARREARGEEPLDLLDFEKLLKEKLALLDMDELSDEFLNRSLNEGFSGGEKKRNEILQMAL 163 (248) T ss_pred CCCCCCCCHHHHHHHH--HHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 8556885778899999--99999863899879889999999999985688313887117787454387115768998875 Q ss_pred ----EEEEECCCCCCCHHHHHHHHHHHHHHHHHH--CCCEEEEECCCHHHHHHH Q ss_conf ----699932588988056799999999999972--698499974875797664 Q gi|254780750|r 732 ----FVILDEIGRGTATLDGLSIAWATIEYLHET--NRCRGLLATHFHELTDLS 779 (920) Q Consensus 732 ----LVllDElGrGTst~DG~aiA~aile~l~~~--~~~~~lfaTHy~eL~~l~ 779 (920) |+||||+=.|=+ -|++=+....+..|-+. .++ +|..|||..|-++- T Consensus 164 L~P~laiLDE~DSGLD-iDALk~V~~~in~lr~~~P~~~-~liiTHY~rlL~~I 215 (248) T TIGR01978 164 LEPKLAILDEIDSGLD-IDALKIVAEGINRLREEHPDRA-FLIITHYQRLLNYI 215 (248) T ss_pred CCCCEEEEECCCCCCC-HHHHHHHHHHHHHHHCCCCCCE-EEEEECHHHHHHHC T ss_conf 1995798606888763-7888999999998730689800-89987517884131 No 72 >cd03268 ABC_BcrA_bacitracin_resist The BcrA subfamily represents ABC transporters involved in peptide antibiotic resistance. Bacitracin is a dodecapeptide antibiotic produced by B. licheniformis and B. subtilis. The synthesis of bacitracin is non-ribosomally catalyzed by a multienzyme complex BcrABC. Bacitracin has potent antibiotic activity against gram-positive bacteria. The inhibition of peptidoglycan biosynthesis is the best characterized bacterial effect of bacitracin. The bacitracin resistance of B. licheniformis is mediated by the ABC transporter Bcr which is composed of two identical BcrA ATP-binding subunits and one each of the integral membrane proteins, BcrB and BcrC. B. subtilis cells carrying bcr genes on high-copy number plasmids develop collateral detergent sensitivity, a similar phenomenon in human cells with overexpressed multi-drug resistance P-glycoprotein. Probab=97.93 E-value=0.00043 Score=48.99 Aligned_cols=141 Identities=19% Similarity=0.282 Sum_probs=72.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH-------------------HHHHCCCCCCHHH--CCCCC Q ss_conf 44434563587777664399996778440789999999999-------------------9997198530353--20682 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV-------------------IMAQMGSYVPASY--AHIGI 692 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v-------------------ilAQiG~fVPA~~--a~i~~ 692 (920) .|=+|+++.. ..+.+.-|-|||-+||||+||.++=.. ...++|....... ..+++ T Consensus 14 ~al~~vs~~v----~~Gei~gllG~NGaGKSTLl~~i~Gl~~p~~G~i~i~G~~~~~~~~~~~~ig~~~~~~~l~~~ltv 89 (208) T cd03268 14 RVLDDISLHV----KKGEIYGFLGPNGAGKTTTMKIILGLIKPDSGEITFDGKSYQKNIEALRRIGALIEAPGFYPNLTA 89 (208) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCHHHHHHEEEEECCCCCCCCCCH T ss_conf 9871516688----698199999999999999999995783789899999999999796857108999477767898899 Q ss_pred CCE-----------------EEEEEECCCCCCCCCCHHHHHHHH-HHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 210-----------------567652376611385328999999-99999958998569993258898805679999999 Q gi|254780750|r 693 VDK-----------------LFSRVGSADNLASGRSTFMVEMIE-TASILNQATNQSFVILDEIGRGTATLDGLSIAWAT 754 (920) Q Consensus 693 ~D~-----------------IftRiGa~D~l~~g~STF~vEm~e-~~~IL~~at~~SLVllDElGrGTst~DG~aiA~ai 754 (920) .+. ++.++|-.+..-+--+++.-=|.+ ++-..--+.+-.++|+||--.|=++..-.. -|.+ T Consensus 90 ~e~l~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LS~G~kqrl~la~al~~~p~lliLDEPt~GLD~~~~~~-i~~~ 168 (208) T cd03268 90 RENLRLLARLLGIRKKRIDEVLDVVGLKDSAKKKVKGFSLGMKQRLGIALALLGNPDLLILDEPTNGLDPDGIKE-LREL 168 (208) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHH-HHHH T ss_conf 999999998749988999999998099503369035699999999999999856999999938876899999999-9999 Q ss_pred HHHHHHHCCCEEEEECCCHH-HHHHHH Q ss_conf 99999726984999748757-976643 Q gi|254780750|r 755 IEYLHETNRCRGLLATHFHE-LTDLSK 780 (920) Q Consensus 755 le~l~~~~~~~~lfaTHy~e-L~~l~~ 780 (920) +..+.+. +.-.+++||..+ +..+.+ T Consensus 169 l~~l~~~-g~til~~sH~l~e~~~~~d 194 (208) T cd03268 169 ILSLRDQ-GITVLISSHLLSEIQKVAD 194 (208) T ss_pred HHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 9999958-9999998986899999699 No 73 >cd03231 ABC_CcmA_heme_exporter CcmA, the ATP-binding component of the bacterial CcmAB transporter. The CCM family is involved in bacterial cytochrome c biogenesis. Cytochrome c maturation in E. coli requires the ccm operon, which encodes eight membrane proteins (CcmABCDEFGH). CcmE is a periplasmic heme chaperone that binds heme covalently and transfers it onto apocytochrome c in the presence of CcmF, CcmG, and CcmH. The CcmAB proteins represent an ABC transporter and the CcmCD proteins participate in heme transfer to CcmE. Probab=97.93 E-value=0.00041 Score=49.16 Aligned_cols=131 Identities=18% Similarity=0.131 Sum_probs=64.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC---------------------C Q ss_conf 4434563587777664399996778440789999999999999719853035320682---------------------2 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI---------------------V 693 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~---------------------~ 693 (920) |=+|+++.. ..+.+..|+|||-+||||++|.++=+ +|..+-++.+ . T Consensus 15 il~~vs~~i----~~Ge~~~l~G~NGsGKSTLlk~i~Gl---------~~p~sG~i~~~g~~~~~~~~~~~~~i~~~~~~ 81 (201) T cd03231 15 LFSGLSFTL----AAGEALQVTGPNGSGKTTLLRILAGL---------SPPLAGRVLLNGGPLDFQRDSIARGLLYLGHA 81 (201) T ss_pred EEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCC---------CCCCCEEEEECCEECCCCCHHHHHHEEEECCC T ss_conf 995307888----79959999999999999999999667---------78885299999998331487767117997876 Q ss_pred CEEEEEEECCCCCCCCCCHHHH--------------------------HHHHHHHHHHHCCCCCEEEEECCCCCCCHHHH Q ss_conf 1056765237661138532899--------------------------99999999995899856999325889880567 Q gi|254780750|r 694 DKLFSRVGSADNLASGRSTFMV--------------------------EMIETASILNQATNQSFVILDEIGRGTATLDG 747 (920) Q Consensus 694 D~IftRiGa~D~l~~g~STF~v--------------------------Em~e~~~IL~~at~~SLVllDElGrGTst~DG 747 (920) +.++..+.+.+++.-..+.... |...++-..--+..-.+.|+||--.|=+...- T Consensus 82 ~~~~~~ltv~en~~~~~~~~~~~~~~~~L~~~~l~~~~~~~~~~LSgGqkqRv~lA~al~~~p~llllDEPt~gLD~~s~ 161 (201) T cd03231 82 PGIKTTLSVLENLRFWHADHSDEQVEEALARVGLNGFEDRPVAQLSAGQQRRVALARLLLSGRPLWILDEPTTALDKAGV 161 (201) T ss_pred CCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHH T ss_conf 54557878999998753223699999999985991032588234799999999999999749999998088655799999 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECC-CHHHHHHHH Q ss_conf 999999999999726984999748-757976643 Q gi|254780750|r 748 LSIAWATIEYLHETNRCRGLLATH-FHELTDLSK 780 (920) Q Consensus 748 ~aiA~aile~l~~~~~~~~lfaTH-y~eL~~l~~ 780 (920) ..+ +.++..+.+. ++.++++|| ..++.+... T Consensus 162 ~~~-~~~l~~~~~~-g~~ii~~sH~~~~~~~~~~ 193 (201) T cd03231 162 ARF-AEAMAGHCAR-GGMVVLTTHQDLGLSEAGA 193 (201) T ss_pred HHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCC T ss_conf 999-9999999868-9999999867146787229 No 74 >PRK10253 iron-enterobactin transporter ATP-binding protein; Provisional Probab=97.93 E-value=0.00016 Score=52.17 Aligned_cols=49 Identities=12% Similarity=0.069 Sum_probs=34.7 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 89985699932588988056799999999999972698499974875797 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) |.+-.++|+||--.|=++.-...+ +.++..|.+..+..++++||.-++. T Consensus 159 ~~~P~illLDEPts~LD~~~~~~i-~~~i~~l~~~~g~tvi~vtHdl~~a 207 (265) T PRK10253 159 AQETAIMLLDEPTTWLDISHQIDL-LELLSELNREKGYTLAAVLHDLNQA 207 (265) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEEEECCHHHH T ss_conf 569998998188766899999999-9999999985098999993599999 No 75 >cd03263 ABC_subfamily_A The ABCA subfamily mediates the transport of a variety of lipid compounds. Mutations of members of ABCA subfamily are associated with human genetic diseases, such as, familial high-density lipoprotein (HDL) deficiency, neonatal surfactant deficiency, degenerative retinopathies, and congenital keratinization disorders. The ABCA1 protein is involved in disorders of cholesterol transport and high-density lipoprotein (HDL) biosynthesis. The ABCA4 (ABCR) protein transports vitamin A derivatives in the outer segments of photoreceptor cells, and therefore, performs a crucial step in the visual cycle. The ABCA genes are not present in yeast. However, evolutionary studies of ABCA genes indicate that they arose as transporters that subsequently duplicated and that certain sets of ABCA genes were lost in different eukaryotic lineages. Probab=97.92 E-value=0.00013 Score=52.95 Aligned_cols=130 Identities=23% Similarity=0.292 Sum_probs=68.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHHHCCCCCCC Q ss_conf 4434563587777664399996778440789999999999--------------------99971985303532068221 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPASYAHIGIVD 694 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~~a~i~~~D 694 (920) |=+|+++.. ..+.+.-|-|||-+||||+||.++=+. +-.++| |||-+. T Consensus 17 aL~~is~~i----~~Gei~~llG~NGaGKSTLl~~i~Gl~~p~~G~I~i~G~~i~~~~~~~~~~ig-~v~q~~------- 84 (220) T cd03263 17 AVDDLSLNV----YKGEIFGLLGHNGAGKTTTLKMLTGELRPTSGTAYINGYSIRTDRKAARQSLG-YCPQFD------- 84 (220) T ss_pred EEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCEEECCEECCCCHHHHHCCEE-EEECCC------- T ss_conf 984408898----49959999989997399999999669878899779999977658898860569-992356------- Q ss_pred EEEEEEECCCCCC-----CCCCH-----------------------------HHHHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf 0567652376611-----38532-----------------------------8999999999999589985699932588 Q gi|254780750|r 695 KLFSRVGSADNLA-----SGRST-----------------------------FMVEMIETASILNQATNQSFVILDEIGR 740 (920) Q Consensus 695 ~IftRiGa~D~l~-----~g~ST-----------------------------F~vEm~e~~~IL~~at~~SLVllDElGr 740 (920) .+|..+-..|++. .|.+. =|--....+ .--+..-.++|+||--. T Consensus 85 ~l~~~ltv~e~l~~~~~~~g~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSgG~kqrv~ia--~al~~~P~lliLDEPt~ 162 (220) T cd03263 85 ALFDELTVREHLRFYARLKGLPKSEIKEEVELLLRVLGLTDKANKRARTLSGGMKRKLSLA--IALIGGPSVLLLDEPTS 162 (220) T ss_pred CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHH--HHHHCCCCEEEEECCCC T ss_conf 5687887999999989756999899999999999876967775075767899999999999--99956999999958876 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 988056799999999999972698499974875797-66430 Q gi|254780750|r 741 GTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 741 GTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) |=++.--. .-|..+..+. . +.-++++||.-+.. .+++. T Consensus 163 gLD~~~~~-~i~~~l~~~~-~-~~tii~~tH~l~e~~~l~dr 201 (220) T cd03263 163 GLDPASRR-AIWDLILEVR-K-GRSIILTTHSMDEAEALCDR 201 (220) T ss_pred CCCHHHHH-HHHHHHHHHH-C-CCEEEEECCCHHHHHHHCCE T ss_conf 88999999-9999999984-8-99899996878999996999 No 76 >PRK09544 znuC high-affinity zinc transporter ATPase; Reviewed Probab=97.92 E-value=0.00014 Score=52.66 Aligned_cols=136 Identities=25% Similarity=0.345 Sum_probs=72.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC----------CCCCHHH---CCCCCCCEEEEE- Q ss_conf 44434563587777664399996778440789999999999999719----------8530353---206822105676- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG----------SYVPASY---AHIGIVDKLFSR- 699 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG----------~fVPA~~---a~i~~~D~IftR- 699 (920) -|=+|+++.. ..+.++-|.|||-+||||++|.++=+.- ..-| .|||-+- ..++....-|.+ T Consensus 18 ~vL~~vs~~i----~~Gei~~LiGpNGaGKSTLlk~I~Gl~~-p~~G~I~~~~~~~igyvpq~~~~~~~~~~~~~~~~~~ 92 (251) T PRK09544 18 RVLSDVSLEL----KPGKILTLLGPNGAGKSTLVRVVLGLVA-PDEGVIKRNGKLRIGYVPQKLYLDTTLPLTVNRFLRL 92 (251) T ss_pred EEEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCC-CCCCEEEECCCCEEECCCHHHHCCCCCHHHHHHHHHC T ss_conf 9996307898----7997999998999889999999966888-9860899999402620437762187621899998632 Q ss_pred ---------------EECCCCC---CCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf ---------------5237661---1385328999999999999589985699932588988056799999999999972 Q gi|254780750|r 700 ---------------VGSADNL---ASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET 761 (920) Q Consensus 700 ---------------iGa~D~l---~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~ 761 (920) ++..+-+ ....|.=+--....|..| +.+-.++|+||--.|=++.--..+ +.++..|.+. T Consensus 93 ~~~~~~~~i~~~l~~~~~~~~~~~~~~~LSGGq~QRv~iAraL--~~~P~lLiLDEPTsgLD~~~~~~i-~~li~~L~~e 169 (251) T PRK09544 93 RPGTKKEDILPALKRVQAGHLIDAPMQKLSGGETQRVLLARAL--LNRPQLLVLDEPTQGVDVNGQVAL-YDLIDQLRRE 169 (251) T ss_pred CCCCCHHHHHHHHHHHCCCHHHHCCHHHCCHHHHHHHHHHHHH--HCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHH T ss_conf 7665389999999873852243265445899999999999999--749998998098646899999999-9999999983 Q ss_pred CCCEEEEECCCHHHHH Q ss_conf 6984999748757976 Q gi|254780750|r 762 NRCRGLLATHFHELTD 777 (920) Q Consensus 762 ~~~~~lfaTHy~eL~~ 777 (920) .++-+|++||.-+... T Consensus 170 ~g~til~vtHDl~~~~ 185 (251) T PRK09544 170 LDCAVLMVSHDLHLVM 185 (251) T ss_pred CCCEEEEEECCHHHHH T ss_conf 2989999906899999 No 77 >PRK10584 putative ABC transporter ATP-binding protein YbbA; Provisional Probab=97.92 E-value=0.00028 Score=50.38 Aligned_cols=131 Identities=21% Similarity=0.208 Sum_probs=69.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC-------------------- Q ss_conf 443456358777766439999677844078999999999999971985303532068221-------------------- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD-------------------- 694 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D-------------------- 694 (920) |=+||+|.. ..+.+.-|.|||-+||||+||.++=+ .|.++-++.+.. T Consensus 25 vL~~isl~i----~~GE~v~ivG~sGsGKSTLl~~i~Gl---------~~p~~G~I~~~G~~~~~~~~~~~~~~r~~~ig 91 (228) T PRK10584 25 ILTGVELVV----KRGETIALIGESGSGKSTLLAILAGL---------DDGSSGEVSLVGQPLHNMDEEARAKLRAKHVG 91 (228) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCC---------CCCCCEEEEECCEECCCCCHHHHHHHHCCCEE T ss_conf 984738899----99989999999985899999999669---------99996799999999997998899876306477 Q ss_pred ------EEEEEEECCCCCCC-----CCCH-----HHHHHHH---HH------------------HHH-HHCCCCCEEEEE Q ss_conf ------05676523766113-----8532-----8999999---99------------------999-958998569993 Q gi|254780750|r 695 ------KLFSRVGSADNLAS-----GRST-----FMVEMIE---TA------------------SIL-NQATNQSFVILD 736 (920) Q Consensus 695 ------~IftRiGa~D~l~~-----g~ST-----F~vEm~e---~~------------------~IL-~~at~~SLVllD 736 (920) .+|..+...||+.- |.+. -..|+.+ .. .|- --+..-.++|+| T Consensus 92 ~v~Q~~~l~~~~tv~eni~~~~~~~~~~~~~~~~~~~~~l~~~gl~~~~~~~p~~LSGGq~QRv~iAraL~~~P~llllD 171 (228) T PRK10584 92 FVFQSFMLIPTLNALENVELPALLRGESSAQSRNGAKALLEQLGLGKRLDHLPAQLSGGEQQRVALARAFNGRPDVLFAD 171 (228) T ss_pred EEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 98140224798702123346898808998999999864554217344540887889979999999999987599999984 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHH Q ss_conf 2588988056799999999999972698499974875797664 Q gi|254780750|r 737 EIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLS 779 (920) Q Consensus 737 ElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~ 779 (920) |--.|-++.....| +..+..+.+..+...+|+||..++.... T Consensus 172 EPT~~LD~~~~~~i-~~~l~~l~~~~g~tii~vtHd~~~~~~~ 213 (228) T PRK10584 172 EPTGNLDRQTGDKI-ADLLFSLNREHGTTLILVTHDLQLAARC 213 (228) T ss_pred CCCCCCCHHHHHHH-HHHHHHHHHHCCCEEEEECCCHHHHHHC T ss_conf 99767899999999-9999999997298999988669999858 No 78 >cd03226 ABC_cobalt_CbiO_domain2 Domain II of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. The CbiMNQO family ABC transport system is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems. Probab=97.91 E-value=0.00017 Score=52.04 Aligned_cols=134 Identities=22% Similarity=0.248 Sum_probs=68.9 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC------------------CCCCHHHCCCCCCCE Q ss_conf 44434563587777664399996778440789999999999999719------------------853035320682210 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG------------------SYVPASYAHIGIVDK 695 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG------------------~fVPA~~a~i~~~D~ 695 (920) .|=+|++|.. ..+.+..|.|||-+||||++|.++=+.- .+-| .|||-+. -+. T Consensus 14 ~vL~~vsl~i----~~Gei~~iiG~nGaGKSTLl~~i~Gl~~-p~~G~I~~~g~~~~~~~~~~~ig~v~Q~~-----~~~ 83 (205) T cd03226 14 EILDDLSLDL----YAGEIIALTGKNGAGKTTLAKILAGLIK-ESSGSILLNGKPIKAKERRKSIGYVMQDV-----DYQ 83 (205) T ss_pred EEEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCC-CCCCEEEECCEECCHHHHHCCEEEEEECC-----CCC T ss_conf 7864037888----6998999988999989999999956857-77873899999996578744489996278-----644 Q ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHHH----------------------------HCCCCCEEEEECCCCCCCHHHH Q ss_conf 567652376611385328999999999999----------------------------5899856999325889880567 Q gi|254780750|r 696 LFSRVGSADNLASGRSTFMVEMIETASILN----------------------------QATNQSFVILDEIGRGTATLDG 747 (920) Q Consensus 696 IftRiGa~D~l~~g~STF~vEm~e~~~IL~----------------------------~at~~SLVllDElGrGTst~DG 747 (920) +|.. -..+++.-+....-..-.++..+|. -+..-.++|+||--.|=++. + T Consensus 84 l~~~-tv~~~l~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSGGqkQrv~iA~aL~~~P~illLDEPt~gLD~~-~ 161 (205) T cd03226 84 LFTD-SVREELLLGLKELDAGNEQAETVLKDLDLYALKERHPLSLSGGQKQRLAIAAALLSGKDLLIFDEPTSGLDYK-N 161 (205) T ss_pred CCHH-HHHHHHHHCCCCCCCHHHHHHHHHHHCCCCHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHH-H T ss_conf 2064-7999997048785607999999999769923553891128999999999999997599999997997658999-9 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 99999999999972698499974875797-6643 Q gi|254780750|r 748 LSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 748 ~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) ...-+.++..|.+. +..++++||..+.. .+++ T Consensus 162 ~~~i~~ll~~l~~~-g~tvi~itHdl~~~~~~~d 194 (205) T cd03226 162 MERVGELIRELAAQ-GKAVIVITHDYEFLAKVCD 194 (205) T ss_pred HHHHHHHHHHHHHC-CCEEEEEECCHHHHHHHCC T ss_conf 99999999999979-9999998039899999799 No 79 >cd03235 ABC_Metallic_Cations ABC component of the metal-type transporters. This family includes transporters involved in the uptake of various metallic cations such as iron, manganese, and zinc. The ATPases of this group of transporters are very similar to members of iron-siderophore uptake family suggesting that they share a common ancestor. The best characterized metal-type ABC transporters are the YfeABCD system of Y. pestis, the SitABCD system of Salmonella enterica serovar Typhimurium, and the SitABCD transporter of Shigella flexneri. Moreover other uncharacterized homologs of these metal-type transporters are mainly found in pathogens like Haemophilus or enteroinvasive E. coli isolates. Probab=97.91 E-value=0.00014 Score=52.71 Aligned_cols=48 Identities=21% Similarity=0.216 Sum_probs=34.7 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 89985699932588988056799999999999972698499974875797 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +.+-.++|+||--.|=++.--..+ +.++..|.+. +..++|+||.-+.. T Consensus 148 ~~~P~lLlLDEPtsgLD~~~~~~~-~~~i~~l~~~-g~tii~vtHdl~~~ 195 (213) T cd03235 148 VQDPDLLLLDEPFAGVDPKTQEDI-YELLRELRRE-GMTILVVTHDLGLV 195 (213) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHH T ss_conf 669999998188667899999999-9999999968-99999990798999 No 80 >KOG0062 consensus Probab=97.89 E-value=1.7e-05 Score=59.55 Aligned_cols=43 Identities=30% Similarity=0.279 Sum_probs=28.1 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHH Q ss_conf 569993258898805679999999999997269849997487579766 Q gi|254780750|r 731 SFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 731 SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l 778 (920) -|++|||- |+-.|--+| -|..+.|-...|- ++..||--++... T Consensus 502 hlLVLDEP---TNhLD~dsl-~AL~~Al~~F~GG-Vv~VSHd~~fi~~ 544 (582) T KOG0062 502 HLLVLDEP---TNHLDRDSL-GALAKALKNFNGG-VVLVSHDEEFISS 544 (582) T ss_pred CEEEECCC---CCCCCHHHH-HHHHHHHHHCCCC-EEEEECCHHHHHH T ss_conf 68984488---763367778-9999999855894-7999775899863 No 81 >PRK13647 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.89 E-value=0.00013 Score=52.84 Aligned_cols=138 Identities=26% Similarity=0.288 Sum_probs=72.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC------CC---HH--HCCCCCC-----CEEEE Q ss_conf 443456358777766439999677844078999999999999971985------30---35--3206822-----10567 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY------VP---AS--YAHIGIV-----DKLFS 698 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f------VP---A~--~a~i~~~-----D~Ift 698 (920) +=+++++.. ..+.+.-|.|||-+||||++|.++=+ +-.+-|.- +. .. ...+|.+ +.+|. T Consensus 20 aL~~vs~~i----~~Ge~~aiiG~NGsGKSTLl~~l~Gl-~~p~~G~I~i~G~~i~~~~~~~lr~~iG~vfQ~p~~ql~~ 94 (273) T PRK13647 20 ALDGVSLVI----PEGSKTAILGPNGAGKSTLLLHLNGI-YTAQRGRVKVMGKVVNEENIRWIRKKVGLVFQDPDDQLFS 94 (273) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCC-CCCCCEEEEECCEECCCCCHHHHHHHCEEEECCCHHHHCC T ss_conf 988117898----89989999999997599999999669-8888619999999999689899987524881070243052 Q ss_pred EEECCCCCCCCCCHH---HHHHHH-HHHHHH----------------------------HCCCCCEEEEECCCCCCCHHH Q ss_conf 652376611385328---999999-999999----------------------------589985699932588988056 Q gi|254780750|r 699 RVGSADNLASGRSTF---MVEMIE-TASILN----------------------------QATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 699 RiGa~D~l~~g~STF---~vEm~e-~~~IL~----------------------------~at~~SLVllDElGrGTst~D 746 (920) . -..|++.-|.-.+ --|+.+ +..+|. -|..-.++|+||--.|=++.- T Consensus 95 ~-tV~e~v~fg~~~~g~~~~e~~~rv~~~L~~~~l~~~~~~~p~~LSGGqkqRvaiA~aL~~~P~lliLDEPtagLDp~~ 173 (273) T PRK13647 95 S-TVWDDVAFGPQNLELDKKEILRRVEEALKAVDMWDLREKAPYHLSYGQKKRVAIAGVLAMEPDIIVLDEPMAYLDPRG 173 (273) T ss_pred C-HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHH T ss_conf 4-199999999988599999999999999987795876647933399989999999999981999999979765799999 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 799999999999972698499974875797-6643 Q gi|254780750|r 747 GLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 747 G~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) - ..-+.+++.|.+. +..++++||.-+.. .+++ T Consensus 174 ~-~~l~~~l~~L~~~-G~Tvi~vtHdl~~~~~~aD 206 (273) T PRK13647 174 K-EELTAILNRLNNE-GKTVIVATHDVDLALEWAD 206 (273) T ss_pred H-HHHHHHHHHHHHC-CCEEEEEECCHHHHHHHCC T ss_conf 9-9999999999848-9999999417899999699 No 82 >cd03224 ABC_TM1139_LivF_branched LivF (TM1139) is part of the LIV-I bacterial ABC-type two-component transport system that imports neutral, branched-chain amino acids. The E. coli branched-chain amino acid transporter comprises a heterodimer of ABC transporters (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. Probab=97.89 E-value=0.00012 Score=53.09 Aligned_cols=133 Identities=23% Similarity=0.302 Sum_probs=72.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCC-CCCCHHHCCCCCC Q ss_conf 4434563587777664399996778440789999999999--------------------999719-8530353206822 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMG-SYVPASYAHIGIV 693 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG-~fVPA~~a~i~~~ 693 (920) |=+|+++.. ..+.+.-|-|||-+||||++|.++=+. -.++.| .|||-+. T Consensus 15 ~L~~vs~~v----~~Gei~~liG~nGaGKSTLl~~i~Gl~~p~~G~I~~~G~~i~~~~~~~~~~~gi~~v~Q~~------ 84 (222) T cd03224 15 ILFGVSLTV----PEGEIVALLGRNGAGKTTLLKTIMGLLPPRSGSIRFDGRDITGLPPHERARAGIGYVPEGR------ 84 (222) T ss_pred EECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHCCEEEECCCC------ T ss_conf 981408998----8998999999999859999999977988996099999999999999999975938963566------ Q ss_pred CEEEEEEECCCCCCCCCCH-----HH---HHHHH----HHHHH-------------------HHCCCCCEEEEECCCCCC Q ss_conf 1056765237661138532-----89---99999----99999-------------------958998569993258898 Q gi|254780750|r 694 DKLFSRVGSADNLASGRST-----FM---VEMIE----TASIL-------------------NQATNQSFVILDEIGRGT 742 (920) Q Consensus 694 D~IftRiGa~D~l~~g~ST-----F~---vEm~e----~~~IL-------------------~~at~~SLVllDElGrGT 742 (920) .+|..+-..||+.-|... .. -++.+ ....+ --+..-.++|+||--.|= T Consensus 85 -~l~~~ltv~enl~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~~LSgG~~Qrv~iAraL~~~P~lllLDEPt~gL 163 (222) T cd03224 85 -RIFPELTVEENLLLGAYARRRAKRKARLERVYELFPRLKERRKQLAGTLSGGEQQMLAIARALMSRPKLLLLDEPSEGL 163 (222) T ss_pred -CCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf -5688990999999987635813599999999988663799874845448999999999999996499999993865479 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 8056799999999999972698499974875797-6643 Q gi|254780750|r 743 ATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 743 st~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) ++..-..| |..+..+.+. +.-++++||.-+.. .+++ T Consensus 164 D~~~~~~i-~~~l~~l~~~-g~tii~vtHdl~~~~~~~D 200 (222) T cd03224 164 APKIVEEI-FEAIRELRDE-GVTILLVEQNARFALEIAD 200 (222) T ss_pred CHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 99999999-9999999956-9999999085899999699 No 83 >PRK09700 D-allose transporter ATP-binding protein; Provisional Probab=97.88 E-value=0.00023 Score=50.99 Aligned_cols=52 Identities=15% Similarity=0.168 Sum_probs=36.7 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHH-HHHHHHH Q ss_conf 99856999325889880567999999999999726984999748757-9766430 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHE-LTDLSKS 781 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~e-L~~l~~~ 781 (920) ..-.++|+||--+|=++..-..| |.++..+.+. +.-++|.||.-+ +..+.+. T Consensus 426 ~~p~lLilDEPT~GlD~~~~~~i-~~li~~l~~~-G~tvl~ishdl~ev~~~~DR 478 (510) T PRK09700 426 CCPEVIIFDEPTRGIDVGAKAEI-YKVMRQLADD-GKVILMVSSELPEIITVCDR 478 (510) T ss_pred HCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCCE T ss_conf 59988999797558999999999-9999999968-99999990758999986999 No 84 >PRK13546 teichoic acids export protein ATP-binding subunit; Provisional Probab=97.88 E-value=0.00011 Score=53.52 Aligned_cols=163 Identities=18% Similarity=0.194 Sum_probs=82.6 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC---------EEEEEEECCCC Q ss_conf 443456358777766439999677844078999999999999971985303532068221---------05676523766 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD---------KLFSRVGSADN 705 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D---------~IftRiGa~D~ 705 (920) .=||++|.. ..+.++-|-|||-+||||++|.++ |..-|-+ -++.+.- .++-++...++ T Consensus 39 AL~dVsf~i----~~GEivgllG~NGaGKSTLlk~I~--------Gl~~P~~-G~I~~~G~i~~~~~~~~l~~~lt~~en 105 (264) T PRK13546 39 ALDDISLKA----YEGDVIGLVGINGSGKSTLSNIIG--------GSLSPTV-GKVDRNGEVSVIAISAGLSGQLTGIEN 105 (264) T ss_pred EECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCC-CEEEECCEEEEEEECCCCCCCCEEEHH T ss_conf 952707888----599899999899861999999996--------7988887-479999887488503565744300015 Q ss_pred CC-----CCCCH-----HH---HHHHHHHHHH-------------------HHCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 11-----38532-----89---9999999999-------------------95899856999325889880567999999 Q gi|254780750|r 706 LA-----SGRST-----FM---VEMIETASIL-------------------NQATNQSFVILDEIGRGTATLDGLSIAWA 753 (920) Q Consensus 706 l~-----~g~ST-----F~---vEm~e~~~IL-------------------~~at~~SLVllDElGrGTst~DG~aiA~a 753 (920) +. .|.+- .. .|+.+....+ --+..-.++|+||--.|-++.--..+ |. T Consensus 106 i~~~~~~~g~~~~~~~~~~~~~le~~~l~~~~~~~~~~LSgGqkqrl~lA~al~~~P~iLiLDEPts~LD~~~~~~i-~~ 184 (264) T PRK13546 106 IEFKMLCMGFKRKEIKAMTPKIIEFSELGEFIYQPVKKYSSGMRAKLGFSINITVNPDILVIDEALSVGDQTFAQKC-LD 184 (264) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHH-HH T ss_conf 88899872424999999999999851205565175534799999999999999569999999598754899999999-99 Q ss_pred HHHHHHHHCCCEEEEECCCHHHH-HHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHH Q ss_conf 99999972698499974875797-66430688589999999609927787777447898877899999829998999999 Q gi|254780750|r 754 TIEYLHETNRCRGLLATHFHELT-DLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRA 832 (920) Q Consensus 754 ile~l~~~~~~~~lfaTHy~eL~-~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A 832 (920) .+..+.+. +...+|+||.-+.. .+.+. +. + -.+++|. ..|=|++|+++- T Consensus 185 ~i~~l~~~-g~TiilvSH~l~~v~~lcDr---i~-----v-l~~GkIi--------------------~~G~~~evl~ky 234 (264) T PRK13546 185 KIYEFKEQ-NKTIFFVSHNLGQVRQFCTK---IA-----W-IEGGKLK--------------------DYGELDDVLPKY 234 (264) T ss_pred HHHHHHHC-CCEEEEECCCHHHHHHHCCE---EE-----E-EECCEEE--------------------EECCHHHHHHHH T ss_conf 99999968-98999984878999986999---99-----9-9898899--------------------985999999999 Q ss_pred HHHHHHHHH Q ss_conf 999999987 Q gi|254780750|r 833 YDILKTFEK 841 (920) Q Consensus 833 ~~~~~~le~ 841 (920) ++....+-+ T Consensus 235 ~~~~~~~~~ 243 (264) T PRK13546 235 EAFLNDFKK 243 (264) T ss_pred HHHHHHHHH T ss_conf 999999986 No 85 >PRK03695 vitamin B12-transporter ATPase; Provisional Probab=97.88 E-value=0.00011 Score=53.36 Aligned_cols=136 Identities=19% Similarity=0.193 Sum_probs=71.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH-------------------HHHHCCCCCCHHHC---CCCC Q ss_conf 4434563587777664399996778440789999999999-------------------99971985303532---0682 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV-------------------IMAQMGSYVPASYA---HIGI 692 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v-------------------ilAQiG~fVPA~~a---~i~~ 692 (920) -=+|+++.. ..+.+.-|-|||-+||||++|.+.=+. -+|+.++|+|-+.. .+++ T Consensus 12 ~L~~isl~v----~~Ge~v~iiGpNGaGKSTLlk~i~Gl~p~~G~I~i~g~~i~~~~~~~~~~~~~~l~q~~~~~~~~~v 87 (245) T PRK03695 12 RLGPLSGEV----RAGEILHLVGPNGAGKSTLLARMAGLLSGSGEIQFAGQPLEAWSAAELARHRAYLSQQQTPPFAMPV 87 (245) T ss_pred CCCCEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEECCEECCCCCHHHHHHCEEEEEECCCCCCCCCH T ss_conf 050748999----5998999997899419999999846688896599999973538988874306899623564557739 Q ss_pred CC----------------EEEEEEECCCCC---CCCCCHHHHHHHHHHHHHHH---C--CCCCEEEEECCCCCCCHHHHH Q ss_conf 21----------------056765237661---13853289999999999995---8--998569993258898805679 Q gi|254780750|r 693 VD----------------KLFSRVGSADNL---ASGRSTFMVEMIETASILNQ---A--TNQSFVILDEIGRGTATLDGL 748 (920) Q Consensus 693 ~D----------------~IftRiGa~D~l---~~g~STF~vEm~e~~~IL~~---a--t~~SLVllDElGrGTst~DG~ 748 (920) ++ .+..++|-.|.+ .+..|.=+.-+...+..|-+ | .+-.++|+||--.|=++. .. T Consensus 88 ~~~~~l~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSgGq~Qrv~la~all~i~~a~~p~p~illLDEPt~gLD~~-~~ 166 (245) T PRK03695 88 FHYLTLHQPAKTRTGALNDVAGALGLDDKLGRSVNQLSGGEWQRVRLAAVVLQVWPDINPAGQLLLLDEPMNSLDVA-QQ 166 (245) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHCCCCHHHCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHH-HH T ss_conf 99986038621189999999998599415487926689889999999999963272327888789973876678999-99 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 9999999999972698499974875797 Q gi|254780750|r 749 SIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 749 aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) ..-+.+++.+.+. +.-++++||.-+.. T Consensus 167 ~~l~~~i~~l~~~-g~tIi~vtHdl~~~ 193 (245) T PRK03695 167 SALDRLLSELCQQ-GIAVVMSSHDLNHT 193 (245) T ss_pred HHHHHHHHHHHHC-CCEEEEEECCHHHH T ss_conf 9999999999847-99999994268999 No 86 >TIGR03522 GldA_ABC_ATP gliding motility-associated ABC transporter ATP-binding subunit GldA. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldA is an ABC transporter ATP-binding protein (pfam00005) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldA abolish the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=97.87 E-value=0.00029 Score=50.34 Aligned_cols=138 Identities=19% Similarity=0.224 Sum_probs=72.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH--------------------HHHHHCCCCCCHHHC---CC Q ss_conf 4443456358777766439999677844078999999999--------------------999971985303532---06 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI--------------------VIMAQMGSYVPASYA---HI 690 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~--------------------vilAQiG~fVPA~~a---~i 690 (920) .+=+|+++.. ..+.+.=+-|||-+||||+||.++=. .+-.++| |||-... .+ T Consensus 16 ~al~~vsf~v----~~Gei~gllGpNGAGKTTl~~~l~Gl~~p~~G~i~i~G~~~~~~~~~~~~~ig-~~pq~~~l~~~l 90 (301) T TIGR03522 16 NALDEVSFEA----QKGRIVGFLGPNGAGKSTTMKIITGYLPPDSGSVQVCGEDVLQNPKEVQRNIG-YLPEHNPLYLDM 90 (301) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCHHHHHHHCE-EECCCCCCCCCC T ss_conf 9973606788----59819999999998199999999679568977799927513448799985376-745556567888 Q ss_pred CCCCEE--EE-------------------EEECCC---CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH Q ss_conf 822105--67-------------------652376---611385328999999999999589985699932588988056 Q gi|254780750|r 691 GIVDKL--FS-------------------RVGSAD---NLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 691 ~~~D~I--ft-------------------RiGa~D---~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~D 746 (920) ++.+.+ +. ++|-.+ ......|.=|.-....+.. -+..-.++|+||--.|-++.- T Consensus 91 tv~e~l~~~~~~~g~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LS~G~kqrl~la~a--L~~~P~lliLDEPt~GLDp~~ 168 (301) T TIGR03522 91 YVREYLQFIAGIYGMKGQLLKQRVEEMIELVGLRPEQHKKIGQLSKGYRQRVGLAQA--LIHDPKVLILDEPTTGLDPNQ 168 (301) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHCCHHHHCCHHHHHHHHHHHH--HCCCCCEEEEECCCCCCCHHH T ss_conf 999999999997399989999999999988188566548276779988445998898--707998999948866789899 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 799999999999972698499974875797-66430 Q gi|254780750|r 747 GLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 747 G~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) ..--|.++..+. + +..++++||+-+-. .+.+. T Consensus 169 -~~~~~~~l~~l~-~-~~TillssH~l~e~e~lcdr 201 (301) T TIGR03522 169 -LVEIRNVIKNIG-K-DKTIILSTHIMQEVEAICDR 201 (301) T ss_pred -HHHHHHHHHHHC-C-CCEEEEECCCHHHHHHHCCE T ss_conf -999999999875-9-99999987858999986999 No 87 >PRK13648 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.87 E-value=0.00019 Score=51.63 Aligned_cols=131 Identities=25% Similarity=0.242 Sum_probs=68.9 Q ss_pred CCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH------------------CCCCC Q ss_conf 770444345635877776643999967784407899999999999997198530353------------------20682 Q gi|254780750|r 631 SKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY------------------AHIGI 692 (920) Q Consensus 631 ~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~------------------a~i~~ 692 (920) ...++=||+++.. ..+.+.-|.|||-+||||++|.++ |..-|-+- -.+++ T Consensus 20 ~~~~~L~~is~~i----~~Ge~vaiiG~sGsGKSTLl~ll~--------Gl~~p~~G~I~~~g~~i~~~~~~~~~~~ig~ 87 (269) T PRK13648 20 DASFTLKDVSFNI----PKGQWTSIVGHNGSGKSTIAKLMI--------GIEKVKSGEIFYNNQAITDDNFEKLRKDIGI 87 (269) T ss_pred CCCEEEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCCCCEEEEECCEECCCCCHHHHHHHCEE T ss_conf 9963566458998----599899999999997999999996--------4979985099999999998898999750269 Q ss_pred C----C-EEEEEEECCCCCCCCCCHHH---HHH-HHHHHHHH----------------------------HCCCCCEEEE Q ss_conf 2----1-05676523766113853289---999-99999999----------------------------5899856999 Q gi|254780750|r 693 V----D-KLFSRVGSADNLASGRSTFM---VEM-IETASILN----------------------------QATNQSFVIL 735 (920) Q Consensus 693 ~----D-~IftRiGa~D~l~~g~STF~---vEm-~e~~~IL~----------------------------~at~~SLVll 735 (920) + + .+|..+ ..+++.-|..-+. .|+ ..+..+|. -+..-.++|+ T Consensus 88 vfQ~p~~~~~~~t-v~~~i~~gl~~~~~~~~e~~~~v~~~L~~~~l~~~~~~~p~~LSGGqkQRvaiAraL~~~P~iLil 166 (269) T PRK13648 88 VFQNPDNQFVGSI-VKYDVAFGLENHAVPYDEMHRRVSEALKQVDMLERADYEPNALSGGQKQRVAIASVLALNPSVIIL 166 (269) T ss_pred EEECHHHHCCCCH-HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEE T ss_conf 9887132047217-999997336446999999999999999876991344189643899999999999999759899998 Q ss_pred ECCCCCCCHHHHH--HHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 3258898805679--99999999999726984999748757976 Q gi|254780750|r 736 DEIGRGTATLDGL--SIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 736 DElGrGTst~DG~--aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) ||- ||..|-. .--+..++.|....+..++++||.-+... T Consensus 167 DEP---Ts~LD~~~~~~i~~ll~~L~~~~~~TvI~itHdl~~a~ 207 (269) T PRK13648 167 DEA---TSMLDPDARQNLLDLVRKVKSEHNITIISITHDLSEAM 207 (269) T ss_pred CCC---CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECHHHHH T ss_conf 187---55489999999999999999737989999976789997 No 88 >PRK10938 putative molybdenum transport ATP-binding protein ModF; Provisional Probab=97.87 E-value=0.00026 Score=50.60 Aligned_cols=50 Identities=18% Similarity=0.212 Sum_probs=33.7 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 899856999325889880567999999999999726984999748757976 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) ++.-.++|+||--+|=+. ...+.-+..++.|.+..++-.||+||+.|... T Consensus 417 ~~~P~vLiLDEPT~gLD~-~~~~~i~~ll~~l~~~g~~~il~vSHd~e~~~ 466 (490) T PRK10938 417 VKHPTLLILDEPLQGLDP-LNRQLVRRFVDVLISEGETQLLFVSHHAEDAP 466 (490) T ss_pred HHCCCEEEEECCCCCCCH-HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH T ss_conf 719998999687547699-99999999999999779929999748999999 No 89 >COG1131 CcmA ABC-type multidrug transport system, ATPase component [Defense mechanisms] Probab=97.86 E-value=0.00016 Score=52.19 Aligned_cols=137 Identities=24% Similarity=0.342 Sum_probs=73.5 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH--------------------HHHCCCCCCHHHC---C Q ss_conf 0444345635877776643999967784407899999999999--------------------9971985303532---0 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI--------------------MAQMGSYVPASYA---H 689 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi--------------------lAQiG~fVPA~~a---~ 689 (920) ..+=||+++.. ..+.+.-+-|||-+||||+||.++=++- ..++| |||-+-. . T Consensus 18 ~~~l~~vs~~i----~~Gei~gllG~NGAGKTTllk~l~Gl~~p~~G~i~i~G~~~~~~~~~~~~~ig-y~~~~~~~~~~ 92 (293) T COG1131 18 KTALDGVSFEV----EPGEIFGLLGPNGAGKTTLLKILAGLLKPTSGEILVLGYDVVKEPAKVRRRIG-YVPQEPSLYPE 92 (293) T ss_pred CEEEEEEEEEE----ECCEEEEEECCCCCCHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHCEEE-EEECCCCCCCC T ss_conf 78886049998----28959999899999899999999679778864999958627512676505299-99478777714 Q ss_pred CC---------------------CCCEEEEEEECCC---CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHH Q ss_conf 68---------------------2210567652376---61138532899999999999958998569993258898805 Q gi|254780750|r 690 IG---------------------IVDKLFSRVGSAD---NLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATL 745 (920) Q Consensus 690 i~---------------------~~D~IftRiGa~D---~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~ 745 (920) ++ .++++..++|-.+ ...++.|.=|---.-.+ +--+..-+|+|+||--.|=++. T Consensus 93 lTv~e~l~~~~~l~~~~~~~~~~~~~~~l~~~~L~~~~~~~~~~lS~G~kqrl~ia--~aL~~~P~lliLDEPt~GLDp~ 170 (293) T COG1131 93 LTVRENLEFFARLYGLSKEEAEERIEELLELFGLEDKANKKVRTLSGGMKQRLSIA--LALLHDPELLILDEPTSGLDPE 170 (293) T ss_pred CCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHCCCHHHCCHHHHHHHHHH--HHHHCCCCEEEEECCCCCCCHH T ss_conf 75999999999984997166799999999986996032881023798899999999--9996699999996997787999 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 67999999999999726984999748757976 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) - ...-|.++..+.+..+...+++||+.+-.+ T Consensus 171 ~-~~~~~~~l~~l~~~g~~tvlissH~l~e~~ 201 (293) T COG1131 171 S-RREIWELLRELAKEGGVTILLSTHILEEAE 201 (293) T ss_pred H-HHHHHHHHHHHHHCCCCEEEEECCCHHHHH T ss_conf 9-999999999999679959999838869999 No 90 >PRK11000 maltose/maltodextrin transporter ATP-binding protein; Provisional Probab=97.86 E-value=0.00059 Score=47.97 Aligned_cols=140 Identities=26% Similarity=0.356 Sum_probs=72.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH----HC-CCC---CCHHHCCCCCC---CEEEEEEEC Q ss_conf 44434563587777664399996778440789999999999999----71-985---30353206822---105676523 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMA----QM-GSY---VPASYAHIGIV---DKLFSRVGS 702 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilA----Qi-G~f---VPA~~a~i~~~---D~IftRiGa 702 (920) -+=+|++|.. ..+.+..|-|||-+||||+||.++=+.--. .+ |-. +|...-.++.+ ..+|-.+-. T Consensus 17 ~al~~vsl~i----~~Ge~~~llGpsG~GKSTllr~i~Gl~~p~~G~I~i~g~~v~~~~~~~r~ig~vfQ~~~L~p~ltV 92 (369) T PRK11000 17 VISKDINLDI----HEGEFVVFVGPSGCGKSTLLRMIAGLEDITSGDLFIGEKRMNDVPPAERGVGMVFQSYALYPHLSV 92 (369) T ss_pred EEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCH T ss_conf 9986438898----799899999999736999999997799999549999999998799778786999407854789899 Q ss_pred CCCCCCCC-----CH--------HHHHHHHHHHHHH-------------------HCCCCCEEEEECCCCCCCHHHHH-- Q ss_conf 76611385-----32--------8999999999999-------------------58998569993258898805679-- Q gi|254780750|r 703 ADNLASGR-----ST--------FMVEMIETASILN-------------------QATNQSFVILDEIGRGTATLDGL-- 748 (920) Q Consensus 703 ~D~l~~g~-----ST--------F~vEm~e~~~IL~-------------------~at~~SLVllDElGrGTst~DG~-- 748 (920) .|||.-|. +. =+.|+.++..+++ -+.+-.++|+||- ||..|-. T Consensus 93 ~eNi~~~l~~~~~~~~e~~~rv~~~l~~~~l~~~~~r~p~~LSGGq~QRvaiARAL~~~P~illlDEP---~s~LD~~~r 169 (369) T PRK11000 93 AENMSFGLKLAGAKKEEINQRVNQVAEVLQLAHLLDRKPKALSGGQRQRVAIGRTLVAEPSVFLLDEP---LSNLDAALR 169 (369) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCC---CCCCCHHHE T ss_conf 99997788763898899999999999863745355588746694277999999886259985884366---678886665 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 9999999999972698499974875797-6643 Q gi|254780750|r 749 SIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 749 aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) .--+..+..|++..+..++|+||..+-+ .+++ T Consensus 170 ~~~~~~l~~l~~~~g~T~i~vTHD~~eA~~laD 202 (369) T PRK11000 170 VQMRIEISRLHKRLGRTMIYVTHDQVEAMTLAD 202 (369) T ss_pred EHHHHHHHHHHHHHCCEEEEEECCHHHHHHHCC T ss_conf 247899999999869859999089999998599 No 91 >PRK10895 putative ABC transporter ATP-binding protein YhbG; Provisional Probab=97.85 E-value=0.00072 Score=47.34 Aligned_cols=139 Identities=27% Similarity=0.341 Sum_probs=72.2 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH--------------------HHHCC-CCCCHHHC---C Q ss_conf 444345635877776643999967784407899999999999--------------------99719-85303532---0 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI--------------------MAQMG-SYVPASYA---H 689 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi--------------------lAQiG-~fVPA~~a---~ 689 (920) -|=+|+++.. ..+.+.-|-|||-+||||+||.++=+.- .++-| .|||-+.. . T Consensus 17 ~~l~~vsl~i----~~Gei~~liGpNGaGKSTLl~~i~Gl~~~~~G~I~i~g~~i~~~~~~~~~~~~ig~v~Q~~~l~~~ 92 (241) T PRK10895 17 RVVEDVSLTV----NSGEIVGLLGPNGAGKTTTFYMVVGIVPRDAGNIIIDDEDISLLPLHARARRGIGYLPQEASIFRR 92 (241) T ss_pred EEEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHCCEEEECCCCCCCCC T ss_conf 9995207898----399799998899986999999996788888762776345234489889985776996243545778 Q ss_pred CCCCCEEEEEEECC-------------------------CCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCH Q ss_conf 68221056765237-------------------------66113853289999999999995899856999325889880 Q gi|254780750|r 690 IGIVDKLFSRVGSA-------------------------DNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTAT 744 (920) Q Consensus 690 i~~~D~IftRiGa~-------------------------D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst 744 (920) ++++|.++...... |......|-=+--....|..| +..-.++|+||--.|=++ T Consensus 93 ltv~enl~~~~~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSgG~kqrv~iAraL--~~~P~illLDEPt~gLD~ 170 (241) T PRK10895 93 LSVYDNLMAVLQIRDDLSAEQREDRANELMEEFHIEHLRDSMGQSLSGGERRRVEIARAL--AANPKFILLDEPFAGVDP 170 (241) T ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHH--HCCCCEEEEECCCCCCCH T ss_conf 889999999999844899899999999999977991464110666898889999999999--669988999587547999 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 56799999999999972698499974875797-6643 Q gi|254780750|r 745 LDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 745 ~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) .--.. -|.+++.+.+. +.-++++||.-+.. .+.+ T Consensus 171 ~~~~~-i~~~l~~l~~~-g~tvl~~tHdl~~~~~~~d 205 (241) T PRK10895 171 ISVID-IKRIIEHLRDS-GLGVLITDHNVRETLAVCE 205 (241) T ss_pred HHHHH-HHHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 99999-99999999964-9999999072999999799 No 92 >PRK13649 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.84 E-value=0.00063 Score=47.76 Aligned_cols=171 Identities=22% Similarity=0.252 Sum_probs=87.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH----------------------CCCCC Q ss_conf 44345635877776643999967784407899999999999997198530353----------------------20682 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY----------------------AHIGI 692 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~----------------------a~i~~ 692 (920) +=+||++.. ..+.+..|.|||-+||||++|.++ |.+.|-+- -.+|. T Consensus 22 aL~~Vsl~i----~~Ge~~aiiG~nGsGKSTLl~~l~--------Gl~~p~~G~i~~~g~~i~~~~~~~~~~~~~~~iG~ 89 (280) T PRK13649 22 ALFDVNLDI----LDGSYTAFIGHTGSGKSTIMQLLN--------GLHVPTTGIVSVDDTDITSHSKNKEIKSIRKKVGL 89 (280) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCCEEEEECCEECCCCCCHHHHHHHHHHEEE T ss_conf 541026898----799899999599986999999996--------69998860899999998777820139999876469 Q ss_pred C-----CEEEEEEECCCCCCCCCCHHH---HHHHHH-HHHHH-----------------------------HCCCCCEEE Q ss_conf 2-----105676523766113853289---999999-99999-----------------------------589985699 Q gi|254780750|r 693 V-----DKLFSRVGSADNLASGRSTFM---VEMIET-ASILN-----------------------------QATNQSFVI 734 (920) Q Consensus 693 ~-----D~IftRiGa~D~l~~g~STF~---vEm~e~-~~IL~-----------------------------~at~~SLVl 734 (920) + .++|.. -..|++.-|..-|- -|+.+. ...|. -|..-.++| T Consensus 90 vfQ~p~~ql~~~-tV~eev~fg~~~~g~~~~e~~~~v~~~l~~~gL~e~~~~r~p~~LSGGqkqRvaiA~aL~~~P~iLl 168 (280) T PRK13649 90 VFQFPESQLFEE-TVLKDVAFGPQNFGVSPEEAEALAREKLALVGISENLFEKNPFELSGGQMRRVAIAGILAMEPKILV 168 (280) T ss_pred EEECCCCCCCCC-HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 974652123603-0999998689886999999999999999876997466542900099999999999999974999999 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCC Q ss_conf 932588988056799999999999972698499974875797-6643068858999999960992778777744789887 Q gi|254780750|r 735 LDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHS 813 (920) Q Consensus 735 lDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~S 813 (920) +||--.|=++. +..--+.++..|.+. ++.++++||.-+.. .+++. +.. | .+++| +..|....= T Consensus 169 lDEPTsgLDp~-~~~~i~~ll~~l~~~-G~Tii~vTHdl~~v~~~aDr---v~v--l----~~G~i-----v~~G~p~ev 232 (280) T PRK13649 169 LDEPTAGLDPK-GRKELMTIFKKLHQS-GMTIVLVTHLMDDVANYADF---VYV--L----EKGKI-----VLSGKPKDI 232 (280) T ss_pred EECCCCCCCHH-HHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHHCCE---EEE--E----ECCEE-----EEECCHHHH T ss_conf 84875548999-999999999999863-99999987589999997999---999--9----89999-----998789999 Q ss_pred HH-HHHHHHCCC-CHHHHHHHHH Q ss_conf 78-999998299-9899999999 Q gi|254780750|r 814 YG-IQVGKLAGL-PNTVISRAYD 834 (920) Q Consensus 814 yg-i~vA~laG~-p~~vi~~A~~ 834 (920) |. .+.-+.+|+ |+.+++-|.+ T Consensus 233 f~~~~~l~~~~l~~P~~~~~~~~ 255 (280) T PRK13649 233 FQQVSFLEKKQLGVPKITKFAQR 255 (280) T ss_pred HCCHHHHHHCCCCCCHHHHHHHH T ss_conf 75989998779999919999999 No 93 >cd03292 ABC_FtsE_transporter FtsE is a hydrophilic nucleotide-binding protein that binds FtsX to form a heterodimeric ATP-binding cassette (ABC)-type transporter that associates with the bacterial inner membrane. The FtsE/X transporter is thought to be involved in cell division and is important for assembly or stability of the septal ring. Probab=97.84 E-value=0.00038 Score=49.45 Aligned_cols=131 Identities=21% Similarity=0.277 Sum_probs=69.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEE------------EEEC Q ss_conf 4434563587777664399996778440789999999999999719853035320682210567------------6523 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFS------------RVGS 702 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~Ift------------RiGa 702 (920) +=+|+++.. ..+.+..|.|||-+||||+||.++- ..-| ++-++-+.+.-.+ .||- T Consensus 16 aL~~vsl~i----~~Ge~v~i~GpSGsGKSTLl~~i~g--------l~~p-~sG~i~i~g~~~~~~~~~~~~~~Rr~iG~ 82 (214) T cd03292 16 ALDGINISI----SAGEFVFLVGPSGAGKSTLLKLIYK--------EELP-TSGTIRVNGQDVSDLRGRAIPYLRRKIGV 82 (214) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHHC--------CCCC-CCEEEEECCEECCCCCHHHHHHHHCCEEE T ss_conf 982217798----5998999997999539999999962--------9898-86499999999898997789998667499 Q ss_pred --C-CCCCCCCCHHH-------------HHHHH-HHHHH---------------------------H-HCCCCCEEEEEC Q ss_conf --7-66113853289-------------99999-99999---------------------------9-589985699932 Q gi|254780750|r 703 --A-DNLASGRSTFM-------------VEMIE-TASIL---------------------------N-QATNQSFVILDE 737 (920) Q Consensus 703 --~-D~l~~g~STF~-------------vEm~e-~~~IL---------------------------~-~at~~SLVllDE 737 (920) + .++....|-+- -|..+ +..+| + -+..-.++|+|| T Consensus 83 VfQ~~~L~~~ltV~eNv~~~l~~~~~~~~~~~~rv~~~L~~vgL~~~~~~~p~~LSGGqkQRvaIARALv~~P~ill~DE 162 (214) T cd03292 83 VFQDFRLLPDRNVYENVAFALEVTGVPPREIRKRVPAALELVGLSHKHRALPAELSGGEQQRVAIARAIVNSPTILIADE 162 (214) T ss_pred EEECCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEECC T ss_conf 90187647999799999999998499999999999999987799657549942488899999999999972999999839 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 5889880567999999999999726984999748757976-643 Q gi|254780750|r 738 IGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 738 lGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) ---+=++.-.. .-+..++.+.+. +...+++||..++.. +++ T Consensus 163 PT~~LD~~~~~-~i~~ll~~l~~~-g~Tii~vTHd~~~~~~~~d 204 (214) T cd03292 163 PTGNLDPDTTW-EIMNLLKKINKA-GTTVVVATHAKELVDTTRH 204 (214) T ss_pred CCCCCCHHHHH-HHHHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 87877989999-999999999850-9999998989899998689 No 94 >cd03228 ABCC_MRP_Like The MRP (Mutidrug Resistance Protein)-like transporters are involved in drug, peptide, and lipid export. They belong to the subfamily C of the ATP-binding cassette (ABC) superfamily of transport proteins. The ABCC subfamily contains transporters with a diverse functional spectrum that includes ion transport, cell surface receptor, and toxin secretion activities. The MRP-like family, simlar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains, each composed of six transmembrane (TM) helices, and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.83 E-value=0.00021 Score=51.33 Aligned_cols=123 Identities=27% Similarity=0.327 Sum_probs=66.7 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCC-------------------CCCC Q ss_conf 444345635877776643999967784407899999999999997198530353206-------------------8221 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHI-------------------GIVD 694 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i-------------------~~~D 694 (920) +|=+|++|.. ..+.+.-|.|||-+||||++|.++ |.+-|.+ -++ +.+. T Consensus 16 ~iL~~isl~i----~~Ge~i~ivG~sGsGKSTLl~ll~--------gl~~p~~-G~I~i~g~~i~~~~~~~~~~~i~~v~ 82 (171) T cd03228 16 PVLKDVSLTI----KPGEKVAIVGPSGSGKSTLLKLLL--------RLYDPTS-GEILIDGVDLRDLDLESLRKNIAYVP 82 (171) T ss_pred CCEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCC-CEEEECCEECCCCCHHHHHHHEEEEC T ss_conf 6167718998----599899999999983999999997--------6775897-48999999988599899863189996 Q ss_pred ---EEEEEEECCCCC-CCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEE Q ss_conf ---056765237661-1385328999999999999589985699932588988056799999999999972-69849997 Q gi|254780750|r 695 ---KLFSRVGSADNL-ASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET-NRCRGLLA 769 (920) Q Consensus 695 ---~IftRiGa~D~l-~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~-~~~~~lfa 769 (920) .+|.. --.||| +.|+ .-..-.|..| +.+..++|+||- ||..|..+ ...+.+.|.+. .++.++++ T Consensus 83 Q~~~lf~~-tv~eNiLSgGq----~Qri~lARal--~~~p~illlDEp---ts~LD~~~-~~~i~~~l~~~~~~~Tvi~v 151 (171) T cd03228 83 QDPFLFSG-TIRENILSGGQ----RQRIAIARAL--LRDPPILILDEA---TSALDPET-EALILEALRALAKGKTVIVI 151 (171) T ss_pred CCCCCCCC-CHHHHCCCHHH----HHHHHHHHHH--HHCCCEEEEECC---CCCCCHHH-HHHHHHHHHHHCCCCEEEEE T ss_conf 66843757-79997744889----9999999999--748998999577---66799899-99999999998099989999 Q ss_pred CCCHHHHHHHH Q ss_conf 48757976643 Q gi|254780750|r 770 THFHELTDLSK 780 (920) Q Consensus 770 THy~eL~~l~~ 780 (920) ||-.++...++ T Consensus 152 tH~~~~~~~~D 162 (171) T cd03228 152 AHRLSTIRDAD 162 (171) T ss_pred ECCHHHHHHCC T ss_conf 57999997099 No 95 >cd03237 ABC_RNaseL_inhibitor_domain2 The ATPase domain 2 of RNase L inhibitor. The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity of more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=97.83 E-value=0.00017 Score=52.01 Aligned_cols=138 Identities=21% Similarity=0.169 Sum_probs=67.8 Q ss_pred EEEEEEECCCCC-CCCCEEEEEECCCCCCHHHHHHHHHHHH--------HHHHCCCCCCHHH---CCCCCCCEEEEEEEC Q ss_conf 443456358777-7664399996778440789999999999--------9997198530353---206822105676523 Q gi|254780750|r 635 IANDCDLSCPND-KNSGKLWLLTGPNMGGKSTFLRQNALIV--------IMAQMGSYVPASY---AHIGIVDKLFSRVGS 702 (920) Q Consensus 635 VpNdi~l~~~~~-~~~~~~~iiTGpNmgGKSt~lRqval~v--------ilAQiG~fVPA~~---a~i~~~D~IftRiGa 702 (920) +=+|++|..... -..+.+.-|.|||-+||||+||.++=+. +--+-=+|+|-.- ..+.+.+-++... T Consensus 9 ~l~~~sL~i~~Gti~~GEiv~liGpNGaGKSTLlk~l~Gll~p~~G~I~~~g~~i~~~pq~~~~~~~~tv~~~l~~~~-- 86 (246) T cd03237 9 TLGEFTLEVEGGSISESEVIGILGPNGIGKTTFIKMLAGVLKPDEGDIEIELDTVSYKPQYIKADYEGTVRDLLSSIT-- 86 (246) T ss_pred ECCEEEEEECCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCHHHCCCCCCCHHHHHHHHH-- T ss_conf 615068985688465798999997999769999999977878886079989820576874332577727999999886-- Q ss_pred CCCCCCCCCHHHHHHHH----------------------HHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 76611385328999999----------------------99999958998569993258898805679999999999997 Q gi|254780750|r 703 ADNLASGRSTFMVEMIE----------------------TASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHE 760 (920) Q Consensus 703 ~D~l~~g~STF~vEm~e----------------------~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~ 760 (920) +.....+.|..|+.+ ++-..--+.+-.++|+||--.|=++.--.. -+.++..|.+ T Consensus 87 --~~~~~~~~~~~e~~~~l~l~~~~~r~~~~LSGGqkQRv~iA~aL~~~p~ilLLDEPts~LD~~~~~~-i~~~i~~l~~ 163 (246) T cd03237 87 --KDFYTHPYFKTEIAKPLQIEQILDREVPELSGGELQRVAIAACLSKDADIYLLDEPSAYLDVEQRLM-ASKVIRRFAE 163 (246) T ss_pred --HHCCCCHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHH-HHHHHHHHHH T ss_conf --4312127999999988499567648700289859999999999841999999848987689999999-9999999998 Q ss_pred HCCCEEEEECCCHHHHH Q ss_conf 26984999748757976 Q gi|254780750|r 761 TNRCRGLLATHFHELTD 777 (920) Q Consensus 761 ~~~~~~lfaTHy~eL~~ 777 (920) ..+..++++||.-++.. T Consensus 164 ~~~~Tvi~VtHDl~~a~ 180 (246) T cd03237 164 NNEKTAFVVEHDIIMID 180 (246) T ss_pred HCCCEEEEECCCHHHHH T ss_conf 67989999837899999 No 96 >COG1121 ZnuC ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism] Probab=97.83 E-value=4.5e-05 Score=56.34 Aligned_cols=133 Identities=26% Similarity=0.358 Sum_probs=71.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------HHH----------HCCCCCCHHH---CCCC-- Q ss_conf 4434563587777664399996778440789999999999--------999----------7198530353---2068-- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------IMA----------QMGSYVPASY---AHIG-- 691 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------ilA----------QiG~fVPA~~---a~i~-- 691 (920) |=.||++.. ..+.+.-|.|||-+||||++|.+.=+. +.. -| .|||=.. ..+| T Consensus 19 vl~~i~l~v----~~G~~~~iiGPNGaGKSTLlK~iLGll~p~~G~i~~~g~~~~~~~~~~~I-gYVPQ~~~~d~~fP~t 93 (254) T COG1121 19 VLEDISLSV----EKGEITALIGPNGAGKSTLLKAILGLLKPSSGEIKIFGKPVRKRRKRLRI-GYVPQKSSVDRSFPIT 93 (254) T ss_pred EEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCEE-EECCCCCCCCCCCCCC T ss_conf 041538997----48968999999888889999999678767742699836663334667769-9757610267679967 Q ss_pred CCC----------EEEEEEECCCCCCCCCCHHHHHHHHHH-------------HH-HH--HCCCCCEEEEECCCCCCCHH Q ss_conf 221----------056765237661138532899999999-------------99-99--58998569993258898805 Q gi|254780750|r 692 IVD----------KLFSRVGSADNLASGRSTFMVEMIETA-------------SI-LN--QATNQSFVILDEIGRGTATL 745 (920) Q Consensus 692 ~~D----------~IftRiGa~D~l~~g~STF~vEm~e~~-------------~I-L~--~at~~SLVllDElGrGTst~ 745 (920) +.| ..|-|.++.|.-.-..+==.|+|.+.+ .+ |- -|++--|.+|||---|-+.. T Consensus 94 V~d~V~~g~~~~~g~~~~~~~~d~~~v~~aL~~Vgm~~~~~r~i~~LSGGQ~QRV~lARAL~~~p~lllLDEP~~gvD~~ 173 (254) T COG1121 94 VKDVVLLGRYGKKGWFRRLNKKDKEKVDEALERVGMEDLRDRQIGELSGGQKQRVLLARALAQNPDLLLLDEPFTGVDVA 173 (254) T ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCCHH T ss_conf 99998606754466013666777999999999839266647955546727999999999853699989966875457987 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 67999999999999726984999748757 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) |...-+.++..|.+. +|..|..||... T Consensus 174 -~~~~i~~lL~~l~~e-g~tIl~vtHDL~ 200 (254) T COG1121 174 -GQKEIYDLLKELRQE-GKTVLMVTHDLG 200 (254) T ss_pred -HHHHHHHHHHHHHHC-CCEEEEEECCCH T ss_conf -899999999999878-988999958817 No 97 >TIGR03375 type_I_sec_LssB type I secretion system ATPase, LssB family. Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N-terminus, but rather carry signals located toward the extreme C-terminus to direct type I secretion. This model is related to models TIGR01842 and TIGR01846, and to bacteriocin ABC transporters that cleave their substrates during export. Probab=97.81 E-value=0.00052 Score=48.39 Aligned_cols=131 Identities=27% Similarity=0.321 Sum_probs=72.6 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHC------------------CCCCCC- Q ss_conf 4443456358777766439999677844078999999999999971985303532------------------068221- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYA------------------HIGIVD- 694 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a------------------~i~~~D- 694 (920) .|=+|+++.. ..+...-|+||+-+||||++|-+ .|.|-|.+-. .++.+. T Consensus 479 ~vl~~vsl~i----~~Ge~vaIvG~sGsGKSTL~kll--------~Gl~~p~~G~i~idg~~~~~~~~~~~r~~i~~v~Q 546 (694) T TIGR03375 479 PALDNVSLTI----RPGEKVAIIGRIGSGKSTLLKLL--------LGLYQPTEGSVLLDGVDIRQIDPADLRRNIGYVPQ 546 (694) T ss_pred HHHCCHHHEE----CCCCEEEEEECCCCCHHHHHHHH--------CCCCCCCCCEEEECCEECCCCCHHHHHHHCEEECC T ss_conf 2213631188----79978999805898788999985--------56758998879989854254999999730213576 Q ss_pred --EEEEEEECCCCCCCCCCHH-HHHHHHHHHHH----------------------------------HH--CCCCCEEEE Q ss_conf --0567652376611385328-99999999999----------------------------------95--899856999 Q gi|254780750|r 695 --KLFSRVGSADNLASGRSTF-MVEMIETASIL----------------------------------NQ--ATNQSFVIL 735 (920) Q Consensus 695 --~IftRiGa~D~l~~g~STF-~vEm~e~~~IL----------------------------------~~--at~~SLVll 735 (920) .+|.. -=.|||.-+.... ..+|.+++.+. -. ..+-+++|+ T Consensus 547 ~~~lf~g-Ti~eNi~~~~~~~~~~~i~~a~~~a~l~~~I~~lp~g~~t~i~e~G~~LSgGqrQri~lARAl~~~p~ilil 625 (694) T TIGR03375 547 DPRLFYG-TLRDNIALGAPYADDEEILRAAELAGVTEFVRRHPDGLDMQIGERGRSLSGGQRQAVALARALLRDPPILLL 625 (694) T ss_pred CCCCCCC-CHHHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 7711074-699998416999999999999998197999971856678774689994689999999999999579998999 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECCCHHHHHHHHH Q ss_conf 32588988056799999999999972-69849997487579766430 Q gi|254780750|r 736 DEIGRGTATLDGLSIAWATIEYLHET-NRCRGLLATHFHELTDLSKS 781 (920) Q Consensus 736 DElGrGTst~DG~aiA~aile~l~~~-~~~~~lfaTHy~eL~~l~~~ 781 (920) ||- ||.-|-.. ...+.+.|.+. .++.+++.||-.+....++. T Consensus 626 DE~---ts~LD~~~-e~~i~~~l~~~~~~~T~i~itHrls~i~~aD~ 668 (694) T TIGR03375 626 DEP---TSAMDNRS-EERFKDRLKRWLAGKTLVLVTHRTSLLDLVDR 668 (694) T ss_pred ECC---CCCCCHHH-HHHHHHHHHHHCCCCEEEEEECCHHHHHHCCE T ss_conf 787---56889999-99999999986699989998168999984999 No 98 >PRK13548 hmuV hemin importer ATP-binding subunit; Provisional Probab=97.81 E-value=0.00024 Score=50.91 Aligned_cols=136 Identities=20% Similarity=0.239 Sum_probs=71.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH-HH-------------------HHHHCCCCCCHHHC---CCC Q ss_conf 44345635877776643999967784407899999999-99-------------------99971985303532---068 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL-IV-------------------IMAQMGSYVPASYA---HIG 691 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval-~v-------------------ilAQiG~fVPA~~a---~i~ 691 (920) |=+|+++.. ..+.+.-|-|||-+||||++|.++= .. -+|+...|||-... .++ T Consensus 17 ~L~~vsl~i----~~Gei~~liGpNGaGKSTLlk~i~Gl~~p~~G~I~~~g~~i~~~~~~~~~~~~g~v~Q~~~l~~~~t 92 (257) T PRK13548 17 VLDDVSLTL----RPGEVVAILGPNGAGKSTLLRALSGELPPSSGEVRLFGRPLADWSPAELARRRAVLPQHSSLSFPFT 92 (257) T ss_pred EECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEEECEECCCCCHHHHHHHEEEECCCCCCCCCCC T ss_conf 880337898----6998999999999879999999856757787569993657675899999754658612366789983 Q ss_pred CCCEE---------------------EEEEECCC---CCCCCCCHHHHHHHHHHHHHHH----CCCCCEEEEECCCCCCC Q ss_conf 22105---------------------67652376---6113853289999999999995----89985699932588988 Q gi|254780750|r 692 IVDKL---------------------FSRVGSAD---NLASGRSTFMVEMIETASILNQ----ATNQSFVILDEIGRGTA 743 (920) Q Consensus 692 ~~D~I---------------------ftRiGa~D---~l~~g~STF~vEm~e~~~IL~~----at~~SLVllDElGrGTs 743 (920) +.+.| ..++|-.+ .-....|-=+--....|..|-+ +..-.++|+||--.|=+ T Consensus 93 v~e~v~~g~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSGGq~QRv~iAraL~q~~~~~~~P~lLlLDEPtsgLD 172 (257) T PRK13548 93 VEEVVAMGRYPHTLSQKEDDQLVAAALAQVDLAHLAGRSYPTLSGGEQQRVQLARVLAQLWEPGGGPRWLLLDEPTSALD 172 (257) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCC T ss_conf 99999977876589989999999999987699024169816699999999999999962001047998899889876689 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 056799999999999972698499974875797 Q gi|254780750|r 744 TLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 744 t~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +..-..+ +.++..|.+. +.-++++||.-+.. T Consensus 173 ~~~~~~i-~~ll~~l~~~-g~tvl~vtHdl~~~ 203 (257) T PRK13548 173 LAHQHHV-LRLARQLAER-GAAVIVVLHDLNLA 203 (257) T ss_pred HHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHH T ss_conf 9999999-9999999855-99999992788999 No 99 >cd03239 ABC_SMC_head The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression. Probab=97.81 E-value=0.00019 Score=51.66 Aligned_cols=129 Identities=22% Similarity=0.291 Sum_probs=67.0 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHH-----------HHHHCCCCC--CHHHCCCCC--CCEEE-EEEECCCC-CCCCCCHH Q ss_conf 399996778440789999999999-----------999719853--035320682--21056-76523766-11385328 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIV-----------IMAQMGSYV--PASYAHIGI--VDKLF-SRVGSADN-LASGRSTF 713 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~v-----------ilAQiG~fV--PA~~a~i~~--~D~If-tRiGa~D~-l~~g~STF 713 (920) ++-+|.|||-+|||+++..++... .+...+.++ ++.+|.+.. .+.-| ..-|..+. |..|+-+ T Consensus 23 ~~~~ivG~nGsGKSni~~ai~~~~g~~~~~~~~~~~l~~~~~~~~~~~~~a~v~~~~~~~~~~v~qg~~~~lLSGGEks- 101 (178) T cd03239 23 SFNAIVGPNGSGKSNIVDAICFVLGGKAAKLRRGSLLFLAGGGVKAGINSASVEITFDKSYFLVLQGKVEQILSGGEKS- 101 (178) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHCCCCCHHH- T ss_conf 8179989998877899999999986642765200135432443556744236799998754044124410016752589- Q ss_pred HHHHHHHHHHH--HHCCCCCEEEEECCCCCCCHHHHHHH--HHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEE Q ss_conf 99999999999--95899856999325889880567999--9999999997269849997487579766430688589 Q gi|254780750|r 714 MVEMIETASIL--NQATNQSFVILDEIGRGTATLDGLSI--AWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHN 787 (920) Q Consensus 714 ~vEm~e~~~IL--~~at~~SLVllDElGrGTst~DG~ai--A~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n 787 (920) |..++.++ ....+..++|+||+ ++..|.... ....+..+.+. ++-.+++||-.+.-+.++.+=+|.+ T Consensus 102 ---l~alal~~ai~~~~p~p~~iLDEv---dAaLD~~N~~r~~~~i~el~~~-~sQfIiITH~~~~m~~ad~l~gVtm 172 (178) T cd03239 102 ---LSALALIFALQEIKPSPFYVLDEI---DAALDPTNRRRVSDMIKEMAKH-TSQFIVITLKKEMFENADKLIGVLF 172 (178) T ss_pred ---HHHHHHHHHHHCCCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHC-CCEEEEEECCHHHHHHCCEEEEEEE T ss_conf ---999999999971389966998276---5547988999999999999738-9989999868999974785899980 No 100 >PRK13640 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.81 E-value=0.00016 Score=52.14 Aligned_cols=51 Identities=22% Similarity=0.104 Sum_probs=33.3 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHH Q ss_conf 8998569993258898805679999999999997269849997487579766 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l 778 (920) |.+-.++|+||--.|=++. +..--+.+++.|.+..+..++++||.-+.+.+ T Consensus 160 a~~P~iLllDEPTs~LD~~-~~~~i~~~l~~l~~e~g~TvI~itHd~~~a~~ 210 (283) T PRK13640 160 AVEPQIIILDESTSMLDPA-GKEQILKLIRKLMKDNNLTIISITHDIDEAAG 210 (283) T ss_pred HCCCCEEEECCCCCCCCHH-HHHHHHHHHHHHHHCCCCEEEEEEECHHHHHH T ss_conf 7199999976874548989-99999999999997069899999788789970 No 101 >PRK10851 sulfate/thiosulfate transporter subunit; Provisional Probab=97.80 E-value=0.00013 Score=52.78 Aligned_cols=141 Identities=26% Similarity=0.346 Sum_probs=73.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC------C---CCHHHCCCCCC---CEEEEEEEC Q ss_conf 44345635877776643999967784407899999999999997198------5---30353206822---105676523 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS------Y---VPASYAHIGIV---DKLFSRVGS 702 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~------f---VPA~~a~i~~~---D~IftRiGa 702 (920) |=+|++|.. ..+.+.-|-|||.+||||+||.++=+.- ...|. - .|+..-.++.+ ..+|-.+-. T Consensus 17 al~~vsl~i----~~GE~~~llGpSGsGKSTLlr~iaGL~~-p~sG~I~~~G~dv~~~~~~~r~Ig~vfQ~~~L~p~ltV 91 (352) T PRK10851 17 VLNDISLDI----PSGQMVALLGPSGSGKTTLLRIIAGLEH-QTSGHIRFHGTDVSRLHARDRKVGFVFQHYALFRHMTV 91 (352) T ss_pred EEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCC-CCCEEEEECCEECCCCCCCCCCEEEEECCCHHCCCCCH T ss_conf 990637699----9998999999998469999999976999-99569999999999899300848999407121458809 Q ss_pred CCCCCCCCCHH-----------------HHHHHHHHHHHH-------------------HCCCCCEEEEECCCCCCCHHH Q ss_conf 76611385328-----------------999999999999-------------------589985699932588988056 Q gi|254780750|r 703 ADNLASGRSTF-----------------MVEMIETASILN-------------------QATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 703 ~D~l~~g~STF-----------------~vEm~e~~~IL~-------------------~at~~SLVllDElGrGTst~D 746 (920) .||+.-|..-+ +.|+......++ -+.+-.++|+||--.|=++.- T Consensus 92 ~eNi~~gl~~~~~~~~~~~~~~~~rv~~~l~~vgL~~~~~r~p~~LSGGqrQRVaiARAL~~~P~vLLLDEPts~LD~~~ 171 (352) T PRK10851 92 FDNIAFGLTVLPRRERPNAAAIKAKVTQLLEMVQLAHLADRYPAQLSGGQKQRVALARALAVEPQILLLDEPFGALDAQV 171 (352) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHH T ss_conf 99998777542211376899999999999987599447609931499999999999999865999999908876689899 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 799999999999972698499974875797-66430 Q gi|254780750|r 747 GLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 747 G~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) -..+ +..+..|.+..+..++|+||..+.+ .+++. T Consensus 172 r~~i-~~~l~~L~~e~g~T~i~VTHD~~eA~~laDr 206 (352) T PRK10851 172 RKEL-RRWLRQLHEELKFTSVFVTHDQEEAMEVADR 206 (352) T ss_pred HHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 9999-9999999997399899998899999986989 No 102 >PRK13632 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.79 E-value=0.00022 Score=51.16 Aligned_cols=174 Identities=20% Similarity=0.200 Sum_probs=83.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC------CCHH-----HCCCCCC----C-EEE Q ss_conf 4443456358777766439999677844078999999999999971985------3035-----3206822----1-056 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY------VPAS-----YAHIGIV----D-KLF 697 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f------VPA~-----~a~i~~~----D-~If 697 (920) .+=+|++|.. ..+.+.-|.|||-+||||++|.++=+. ..+-|.- +... .-.+|.+ | .+| T Consensus 24 ~~L~~isl~i----~~Ge~vaivG~nGsGKSTLlk~l~Gll-~p~~G~I~v~G~~i~~~~~~~~~~~ig~VfQ~Pd~q~~ 98 (273) T PRK13632 24 NALKNVSFTI----NEGEYVAILGHNGSGKSTISKILTGLL-KPQSGEIKIFGITISKENLKYLRKKIGIIFQNPDNQFI 98 (273) T ss_pred CEEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCC-CCCCCEEEECCEECCCCCHHHHHHHEEEEEECCCHHCC T ss_conf 6066428898----499899999999986999999997387-78887599999999968989987435699877102027 Q ss_pred EEEECCCCCCCCCCHH---HHHHHH-HHHHHH----------------------------HCCCCCEEEEECCCCCCCHH Q ss_conf 7652376611385328---999999-999999----------------------------58998569993258898805 Q gi|254780750|r 698 SRVGSADNLASGRSTF---MVEMIE-TASILN----------------------------QATNQSFVILDEIGRGTATL 745 (920) Q Consensus 698 tRiGa~D~l~~g~STF---~vEm~e-~~~IL~----------------------------~at~~SLVllDElGrGTst~ 745 (920) . .-..|++.-|.-.+ -.|+.+ +..+++ -|..-.++|+||--.|=++. T Consensus 99 ~-~tV~e~iafgl~~~~~~~~~~~~~v~~~l~~~gl~~~~~~~p~~LSGGqkQRvaiA~aLa~~P~iliLDEPTs~LD~~ 177 (273) T PRK13632 99 G-ITVEDDIAFGLENKKIPPKKMKDIIDDLAKKVGMEDYLKKEPQNLSGGQKQRVAIASVLALNPEIIIFDESTSMLDPK 177 (273) T ss_pred C-CCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHH T ss_conf 7-517888886786679999999999999999869888774782009999999999999997199999980775569989 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCH-HHHHHHHCCC Q ss_conf 679999999999997269849997487579766430688589999999609927787777447898877-8999998299 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSY-GIQVGKLAGL 824 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sy-gi~vA~laG~ 824 (920) +..--|.++..|.+..+..++++||.-+....+ .++. . -.+++|++ .|....=| -.++-+.+|+ T Consensus 178 -~~~~l~~~l~~l~~~~g~TvI~iTHd~~~~~~a---Drv~--v----m~~G~iv~-----~G~p~el~~~~e~l~~~~l 242 (273) T PRK13632 178 -GKREIKKIMVDLRKDRKKTLISITHDMDEAILA---DKVI--V----FSNGKLIA-----QGKPKEILNDKEILEKAKL 242 (273) T ss_pred -HHHHHHHHHHHHHHHCCCEEEEEEECHHHHHCC---CEEE--E----EECCEEEE-----ECCHHHHHCCHHHHHHCCC T ss_conf -999999999999984698999994288899719---9999--9----98999999-----7699998779999997799 Q ss_pred CHHH Q ss_conf 9899 Q gi|254780750|r 825 PNTV 828 (920) Q Consensus 825 p~~v 828 (920) ..+. T Consensus 243 ~~P~ 246 (273) T PRK13632 243 DSPF 246 (273) T ss_pred CCCH T ss_conf 9986 No 103 >PRK11300 livG leucine/isoleucine/valine transporter ATP-binding subunit; Provisional Probab=97.79 E-value=0.00034 Score=49.78 Aligned_cols=52 Identities=23% Similarity=0.255 Sum_probs=36.6 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 9985699932588988056799999999999972698499974875797-6643 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) ..-.++|+||--.|=++..-..| +.++..|.+..++-++++||.-+.. .+++ T Consensus 170 ~~P~lLlLDEPt~gLD~~~~~~i-~~~i~~l~~~~g~tvl~itHdl~~v~~~aD 222 (255) T PRK11300 170 TQPRILMLDEPAAGLNPKETKEL-DELIAELRNEHNVTVLLIEHDMKLVMGISD 222 (255) T ss_pred HCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHCCCCEEEEECCCHHHHHHHCC T ss_conf 29996998187546999999999-999999997159799999273899998699 No 104 >CHL00131 ycf16 sulfate ABC transporter protein; Validated Probab=97.79 E-value=0.0007 Score=47.42 Aligned_cols=135 Identities=30% Similarity=0.344 Sum_probs=68.7 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH-HHHHHCCC----------CCCHHHCCCCCCCEEE----- Q ss_conf 4443456358777766439999677844078999999999-99997198----------5303532068221056----- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI-VIMAQMGS----------YVPASYAHIGIVDKLF----- 697 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~-vilAQiG~----------fVPA~~a~i~~~D~If----- 697 (920) -|=+||+|.. ..+.++-|-|||-+||||++|.++=. .+-..-|. .-|.+.++.+++ .+| T Consensus 20 ~vL~~isl~i----~~Gei~aiiG~nGsGKSTL~~~i~G~~~~~~~~G~I~~~G~~i~~~~~~~~~~~gi~-~~~Q~~~~ 94 (252) T CHL00131 20 EILKGINLSI----NAGEIHAIMGPNGSGKSTLSKVIAGHPAYTVLEGDILFKGESILELEPEERAHLGIF-LAFQYPIE 94 (252) T ss_pred EEEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHCCCCCCCCCCCEEEECCEECCCCCHHHHHHCCEE-EEEECHHH T ss_conf 9885617788----799899999999999999999972787667664259987727685999999865967-85420243 Q ss_pred -E--------EE------------------------------ECCC-----CCCCCCCHHHHHHHHHHHHHHHCCCCCEE Q ss_conf -7--------65------------------------------2376-----61138532899999999999958998569 Q gi|254780750|r 698 -S--------RV------------------------------GSAD-----NLASGRSTFMVEMIETASILNQATNQSFV 733 (920) Q Consensus 698 -t--------Ri------------------------------Ga~D-----~l~~g~STF~vEm~e~~~IL~~at~~SLV 733 (920) . |+ |-.+ ++..+.|-=+--...+|..| +..-.++ T Consensus 95 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~l~~~~l~r~~~~~LSGGqkqRv~iaral--~~~P~iL 172 (252) T CHL00131 95 IPGVSNADFLRLAYNAKRKFQGLPELDPLEFFEIISEKLKLVGMDPSFLNRNVNEGFSGGEKKRNEILQMA--LLDSKLA 172 (252) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHHHHHHHHH--HHCCCEE T ss_conf 25755999999887666554043315889999999999987499857753365545789999999999999--6399999 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 99325889880567999999999999726984999748757976 Q gi|254780750|r 734 ILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 734 llDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) |+||--.|=++.. ...-+.++..|.+. +.-++++||+.++.+ T Consensus 173 iLDEPTsgLD~~~-~~~i~~~l~~l~~~-g~tii~itH~~~~~~ 214 (252) T CHL00131 173 ILDETDSGLDIDA-LKIIAEGINKLATS-DNAIILITHYQRLLD 214 (252) T ss_pred EEECCCCCCCHHH-HHHHHHHHHHHHHC-CCEEEEEEECHHHHH T ss_conf 9979876699999-99999999999858-999999998669898 No 105 >PRK11231 fecE iron-dicitrate transporter ATP-binding subunit; Provisional Probab=97.79 E-value=0.00077 Score=47.14 Aligned_cols=125 Identities=26% Similarity=0.251 Sum_probs=69.1 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHHHC---CC Q ss_conf 44434563587777664399996778440789999999999--------------------99971985303532---06 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPASYA---HI 690 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~~a---~i 690 (920) -|=+|++|.. ..+.+.-|-|||-+||||++|.++=+. -+++.=.|||-... .+ T Consensus 16 ~~L~~isl~i----~~Gei~~liGpNGaGKSTLlk~i~Gl~~p~sG~I~i~g~~i~~~~~~~~~~~i~~vpQ~~~~~~~~ 91 (255) T PRK11231 16 RILNDLSLSL----PTGKITALIGPNGCGKSTLLKCFARLLTPQSGTVFLGDKPISMLSARQLARRLSLLPQHHLTPEGI 91 (255) T ss_pred EEEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHHEEEECCCCCCCCCC T ss_conf 9982308899----899799999999981999999997598888648999999836299899851189976767578998 Q ss_pred CCCCEEEEEEECCCCCCCCCCHH--------------------------------------HHHHHHHHHHHHHCCCCCE Q ss_conf 82210567652376611385328--------------------------------------9999999999995899856 Q gi|254780750|r 691 GIVDKLFSRVGSADNLASGRSTF--------------------------------------MVEMIETASILNQATNQSF 732 (920) Q Consensus 691 ~~~D~IftRiGa~D~l~~g~STF--------------------------------------~vEm~e~~~IL~~at~~SL 732 (920) ++. |++.-|++.+ +--....|..| +..-.+ T Consensus 92 tv~----------e~v~~g~~~~~~~~~~~~~~~~~~v~~~l~~~~l~~~~~~~~~~LSGGqkQRv~iAraL--~~~p~l 159 (255) T PRK11231 92 TVR----------ELVSYGRSPWLSLWGRLSAEDNARVNQAMNQTRINHLADRRLTELSGGQRQRAFLAMVL--AQDTPV 159 (255) T ss_pred CHH----------HHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHCCCCCCCCHHHHHHHHHHHHH--HCCCCE T ss_conf 899----------99970550123441568688999999999882982564797452999999999999999--539997 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 99932588988056799999999999972698499974875797 Q gi|254780750|r 733 VILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 733 VllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +|+||--.|-++..-..+ +.++..+.+. +..++++||.-+.. T Consensus 160 llLDEPtsgLD~~~~~~i-~~li~~l~~~-g~tvi~vtHdl~~~ 201 (255) T PRK11231 160 VLLDEPTTYLDINHQVEL-MRLMRELNTQ-GKTVVTVLHDLNQA 201 (255) T ss_pred EEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHH T ss_conf 998388644899999999-9999999868-99999993788999 No 106 >PRK10535 macrolide transporter ATP-binding /permease protein; Provisional Probab=97.78 E-value=0.00021 Score=51.33 Aligned_cols=12 Identities=33% Similarity=0.595 Sum_probs=7.4 Q ss_pred CCCCCCHHHHHH Q ss_conf 822279899999 Q gi|254780750|r 881 NLDEMSPLTALK 892 (920) Q Consensus 881 ~~~~~~p~~al~ 892 (920) ...++.|+|||- T Consensus 635 kAArLdPIeALR 646 (648) T PRK10535 635 NAARLDPVDALA 646 (648) T ss_pred HHHCCCHHHHHC T ss_conf 887789989821 No 107 >cd03234 ABCG_White The White subfamily represents ABC transporters homologous to the Drosophila white gene, which acts as a dimeric importer for eye pigment precursors. The eye pigmentation of Drosophila is developed from the synthesis and deposition in the cells of red pigments, which are synthesized from guanine, and brown pigments, which are synthesized from tryptophan. The pigment precursors are encoded by the white, brown, and scarlet genes, respectively. Evidence from genetic and biochemical studies suggest that the White and Brown proteins function as heterodimers to import guanine, while the White and Scarlet proteins function to import tryptophan. However, a recent study also suggests that White may be involved in the transport of a metabolite, such as 3-hydroxykynurenine, across intracellular membranes. Mammalian ABC transporters belonging to the White subfamily (ABCG1, ABCG5, and ABCG8) have been shown to be involved in the regulation of lipid-trafficking mechanisms in Probab=97.78 E-value=0.00052 Score=48.40 Aligned_cols=33 Identities=33% Similarity=0.509 Sum_probs=27.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) |=+|+++.. ..+.+.-|-|||-+||||+|+.++ T Consensus 22 vL~~vs~~i----~~Ge~~~ilGpnGsGKSTLl~~i~ 54 (226) T cd03234 22 ILNDVSLHV----ESGQVMAILGSSGSGKTTLLDAIS 54 (226) T ss_pred EEECCEEEE----ECCCEEEEECCCCCHHHHHHHHHH T ss_conf 988977899----188099999899960999999996 No 108 >PRK13537 lipooligosaccharide transporter ATP-binding subunit; Provisional Probab=97.78 E-value=0.00067 Score=47.56 Aligned_cols=139 Identities=24% Similarity=0.343 Sum_probs=72.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHC------CCCCCH----HHCCCCCC---CEEEEEE Q ss_conf 4443456358777766439999677844078999999999999971------985303----53206822---1056765 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQM------GSYVPA----SYAHIGIV---DKLFSRV 700 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQi------G~fVPA----~~a~i~~~---D~IftRi 700 (920) .+=+|++|.. ..+.+.=+-|||-+||||++|.+.=+.- ..- |-.|+- ....+|.+ ..+|-.+ T Consensus 19 ~al~~vs~~V----~~Gei~gllGpNGAGKTTli~~l~Gl~~-p~sG~v~i~G~~i~~~~~~~r~~iG~~pq~~~l~~~l 93 (304) T PRK13537 19 LVVDGLSFHV----QPGECFGLLGPNGAGKTTTLKMLLGLTH-PDAGTISLCGEPVPSRARHARQRVGVVPQFDNLDPDF 93 (304) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCC-CCCCEEEECCEECCCCHHHHHHCEEEEECCCCCCCCC T ss_conf 9983717788----6995999999989729999999977956-8976899999988756288873559991776568898 Q ss_pred ECCCCCC----------------------------------CCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH Q ss_conf 2376611----------------------------------385328999999999999589985699932588988056 Q gi|254780750|r 701 GSADNLA----------------------------------SGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 701 Ga~D~l~----------------------------------~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~D 746 (920) -..+++. ...|.=|--....+.. -+..-.++++||--.|=++.- T Consensus 94 tv~e~l~~~~~~~g~~~~~~~~~~~~ll~~~~L~~~~~~~~~~lSgG~kqrl~ia~a--l~~~P~lliLDEPT~GLDp~~ 171 (304) T PRK13537 94 TVRENLLVFGRYFGLSAAAARALVPPLLEFAKLENKADAKVGELSGGMKRRLTLARA--LVNDPDVLVLDEPTTGLDPQA 171 (304) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHH--HHCCCCEEEEECCCCCCCHHH T ss_conf 999999999997299999999999999997799568567366799999999999999--837999999938866789999 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHHH Q ss_conf 7999999999999726984999748757976-6430 Q gi|254780750|r 747 GLSIAWATIEYLHETNRCRGLLATHFHELTD-LSKS 781 (920) Q Consensus 747 G~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~~ 781 (920) - ..-|..+..+.+. +..++++|||-+-.+ +.+. T Consensus 172 r-~~i~~~i~~l~~~-G~TillttH~l~E~e~lcDr 205 (304) T PRK13537 172 R-HLMWERLRSLLAR-GKTILLTTHFMEEAERLCDR 205 (304) T ss_pred H-HHHHHHHHHHHHC-CCEEEEECCCHHHHHHHCCE T ss_conf 9-9999999999968-99999988848999986999 No 109 >cd03225 ABC_cobalt_CbiO_domain1 Domain I of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. This ABC transport system of the CbiMNQO family is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most of cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems. Probab=97.78 E-value=0.00013 Score=52.88 Aligned_cols=137 Identities=24% Similarity=0.306 Sum_probs=69.2 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC---------CHHH--CCCCCC----CE-EE Q ss_conf 44434563587777664399996778440789999999999999719853---------0353--206822----10-56 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV---------PASY--AHIGIV----DK-LF 697 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fV---------PA~~--a~i~~~----D~-If 697 (920) .|=+|+++.. ..+.+.-|.|||-+||||++|.++-+. -.+-|.-- +... -.++.+ +. +| T Consensus 15 ~~L~~vsl~i----~~Gei~~iiG~nGaGKSTLlk~i~Gl~-~p~~G~I~~~g~~i~~~~~~~~~~~ig~v~Q~p~~~~~ 89 (211) T cd03225 15 PALDDISLTI----KKGEFVLIVGPNGSGKSTLLRLLNGLL-GPTSGEVLVDGKDLTKLSLKELRRKVGLVFQNPDDQFF 89 (211) T ss_pred EEEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCC-CCCCCCEEECCEECCCCCHHHHHCCEEEEEECCHHHCC T ss_conf 5775317888----499799998899998999999996467-79888778999999979989984038999778325305 Q ss_pred EEEECCCCCCC-----CCCHHHHHHHH-HHHHHH----------------------------HCCCCCEEEEECCCCCCC Q ss_conf 76523766113-----85328999999-999999----------------------------589985699932588988 Q gi|254780750|r 698 SRVGSADNLAS-----GRSTFMVEMIE-TASILN----------------------------QATNQSFVILDEIGRGTA 743 (920) Q Consensus 698 tRiGa~D~l~~-----g~STF~vEm~e-~~~IL~----------------------------~at~~SLVllDElGrGTs 743 (920) . .-..|+|.- +.|. .|+.+ +..+|. -++.-.++|+||--.|=+ T Consensus 90 ~-~tv~e~i~~~~~~~~~~~--~~~~~~~~~~l~~~~l~~~~~~~~~~LSGGqkQrv~iAral~~~P~ililDEPTsgLD 166 (211) T cd03225 90 G-PTVEEEVAFGLENLGLPE--EEIEERVEEALELVGLEGLRDRSPFTLSGGQKQRVAIAGVLAMDPDILLLDEPTAGLD 166 (211) T ss_pred C-CCHHHHHHHHHHHCCCCH--HHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 5-869999999999869999--9999999999998699466638954599989999999999975999999979855589 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 0567999999999999726984999748757976-643 Q gi|254780750|r 744 TLDGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 744 t~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) +.- ...-|..+..+.+. +.-++++||.-+... +++ T Consensus 167 ~~~-~~~i~~~l~~l~~~-g~tii~itHdl~~~~~~~d 202 (211) T cd03225 167 PAG-RRELLELLKKLKAE-GKTIIIVTHDLDLLLELAD 202 (211) T ss_pred HHH-HHHHHHHHHHHHHC-CCEEEEEECCHHHHHHHCC T ss_conf 999-99999999999978-9999999259999999799 No 110 >COG1119 ModF ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA [Inorganic ion transport and metabolism] Probab=97.76 E-value=0.00081 Score=46.97 Aligned_cols=21 Identities=38% Similarity=0.725 Sum_probs=18.4 Q ss_pred EEEEEECCCCCCHHHHHHHHH Q ss_conf 399996778440789999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqva 671 (920) .=|.|+|||-+||||+||.++ T Consensus 58 e~W~I~G~NGsGKTTLL~ll~ 78 (257) T COG1119 58 EHWAIVGPNGAGKTTLLSLLT 78 (257) T ss_pred CCEEEECCCCCCHHHHHHHHH T ss_conf 847998889877899999996 No 111 >PRK09493 glnQ glutamine ABC transporter ATP-binding protein; Reviewed Probab=97.76 E-value=0.00065 Score=47.66 Aligned_cols=134 Identities=22% Similarity=0.294 Sum_probs=70.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC------CCC---HHH----CCCCCC-C--EEEE Q ss_conf 44345635877776643999967784407899999999999997198------530---353----206822-1--0567 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS------YVP---ASY----AHIGIV-D--KLFS 698 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~------fVP---A~~----a~i~~~-D--~Ift 698 (920) +=+|+++.. ..+.+..|.|||-+||||++|.++=+. -..-|. -|. +.. -.++.+ . .+|. T Consensus 16 ~L~dvs~~i----~~Ge~~~liG~nGsGKSTll~~i~Gl~-~~~~G~i~~~G~~i~~~~~~~~~~r~~ig~v~Q~~~l~~ 90 (240) T PRK09493 16 VLHNIDLNI----DQGEVVVIIGPSGSGKSTLLRCINKLE-EITSGDLIVDGLKVNDPKVDERLIRQEAGMVFQQFYLFP 90 (240) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCC-CCCCCEEEECCEECCCCCHHHHHHHHHCEEECCCCCCCC T ss_conf 881307898----799899999999980999999996389-999974878999878876658998752428801122478 Q ss_pred EEECCCCCCCC------CCHH-----HHHHHH------------------------HHHHHHHCCCCCEEEEECCCCCCC Q ss_conf 65237661138------5328-----999999------------------------999999589985699932588988 Q gi|254780750|r 699 RVGSADNLASG------RSTF-----MVEMIE------------------------TASILNQATNQSFVILDEIGRGTA 743 (920) Q Consensus 699 RiGa~D~l~~g------~STF-----~vEm~e------------------------~~~IL~~at~~SLVllDElGrGTs 743 (920) .+-..||+.-| .+.- ..|+.+ .|.. -+.+-.++|+||--.|-+ T Consensus 91 ~ltv~eni~~~~~~~~~~~~~~~~~~~~~~l~~~gl~~~~~~~~~~LSGGq~QRvaiAra--L~~~P~lLllDEPt~~LD 168 (240) T PRK09493 91 HLTALENVMFGPLRVRGASKEEAEKQARELLAKVGLAERAHHYPSELSGGQQQRVAIARA--LAVKPKLMLFDEPTSALD 168 (240) T ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCCCCCCHHHHHHHHHHHH--HHCCCCEEEEECCCCCCC T ss_conf 877999998789997599878999999999997699246629877289999999999987--735999999908876689 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 0567999999999999726984999748757976 Q gi|254780750|r 744 TLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 744 t~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) +.--.-| +..+..|.+. +.-++++||.-++.. T Consensus 169 ~~~~~~i-~~ll~~l~~~-g~tvi~vtHdl~~~~ 200 (240) T PRK09493 169 PELRHEV-LKVMQDLAEE-GMTMVIVTHEIGFAE 200 (240) T ss_pred HHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHH T ss_conf 9999999-9999999976-998999947999999 No 112 >PRK13542 consensus Probab=97.76 E-value=0.00024 Score=50.84 Aligned_cols=131 Identities=19% Similarity=0.243 Sum_probs=63.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHH---HCCCC Q ss_conf 4434563587777664399996778440789999999999--------------------999719853035---32068 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPAS---YAHIG 691 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~---~a~i~ 691 (920) |=.+++|.. ..+.+.-|.|||-+||||+||.++=+. +..++| |||=. .-.++ T Consensus 33 il~~isl~i----~~Gei~~liGpNGaGKTTLlk~l~Gll~p~~G~I~~~G~~i~~~~~~~~~~~~-~v~~~~~l~~~lt 107 (224) T PRK13542 33 VFRGIDISL----APGDLLQVMGPNGSGKTSLLRVLSGLMPPAEGDLYWRGRAVRAGDPDYLQQLA-YVGHANGIDPDLS 107 (224) T ss_pred EEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHEE-EECCCCCCCCCCC T ss_conf 884616787----59979999999999999999999579788852899999999879988884447-8666333587872 Q ss_pred CCCEEE-------------------EEEECCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 221056-------------------7652376611385328999999999999-58998569993258898805679999 Q gi|254780750|r 692 IVDKLF-------------------SRVGSADNLASGRSTFMVEMIETASILN-QATNQSFVILDEIGRGTATLDGLSIA 751 (920) Q Consensus 692 ~~D~If-------------------tRiGa~D~l~~g~STF~vEm~e~~~IL~-~at~~SLVllDElGrGTst~DG~aiA 751 (920) +.+.+- .+.|-.+-.-.--.++.--|..--.|.+ -+.+..++|+||-..|=++. +.... T Consensus 108 v~enl~~~~~~~~~~~~~~~~~~~l~~~gl~~~~~~~~~~LSgGqrqRv~lA~al~~~p~illLDEPtagLD~~-~~~~l 186 (224) T PRK13542 108 AFENLRFAARLAGQQATADAVHHALARFGLDRVMHAPARTLSQGQRRRVALARLALTPRALWLLDEPLTSLDDA-SAACF 186 (224) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHH-HHHHH T ss_conf 99999999987388746999999999849902546881249999999999999980799889973853548999-99999 Q ss_pred HHHHHHHHHHCCCEEEEECCC Q ss_conf 999999997269849997487 Q gi|254780750|r 752 WATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 752 ~aile~l~~~~~~~~lfaTHy 772 (920) +.++..+.+. +..++++||. T Consensus 187 ~~~i~~~~~~-g~tvIi~tH~ 206 (224) T PRK13542 187 HTLLDEHLRR-GGMAVVATHQ 206 (224) T ss_pred HHHHHHHHHC-CCEEEEEECC T ss_conf 9999999968-9989999588 No 113 >cd03300 ABC_PotA_N PotA is an ABC-type transporter and the ATPase component of the spermidine/putrescine-preferential uptake system consisting of PotA, -B, -C, and -D. PotA has two domains with the N-terminal domain containing the ATPase activity and the residues required for homodimerization with PotA and heterdimerization with PotB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.76 E-value=0.00039 Score=49.36 Aligned_cols=133 Identities=22% Similarity=0.264 Sum_probs=73.7 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC------------------- Q ss_conf 4443456358777766439999677844078999999999999971985303532068221------------------- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD------------------- 694 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D------------------- 694 (920) -|=+|++|.. ..+.+..|-|||-+||||+||.++ |..=| ++-++-+-+ T Consensus 14 ~vl~~vsl~v----~~Ge~~~iiGpSGsGKSTllr~i~--------Gl~~p-~~G~I~~~g~~v~~~~~~~r~ig~VfQ~ 80 (232) T cd03300 14 VALDGVSLDI----KEGEFFTLLGPSGCGKTTLLRLIA--------GFETP-TSGEILLDGKDITNLPPHKRPVNTVFQN 80 (232) T ss_pred EEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCC-CCEEEEECCEECCCCCHHHCCEEEEECC T ss_conf 8987617488----799899999999983999999997--------79999-8539999999999999545775699148 Q ss_pred -EEEEEEECCCCCCCC-----CCHH-----HHHHHHH---------------------HHHHH-HCCCCCEEEEECCCCC Q ss_conf -056765237661138-----5328-----9999999---------------------99999-5899856999325889 Q gi|254780750|r 695 -KLFSRVGSADNLASG-----RSTF-----MVEMIET---------------------ASILN-QATNQSFVILDEIGRG 741 (920) Q Consensus 695 -~IftRiGa~D~l~~g-----~STF-----~vEm~e~---------------------~~IL~-~at~~SLVllDElGrG 741 (920) .+|-.+-..||+.-| .|.= .-|+.+. ..|-+ -|++-.++|+||--.+ T Consensus 81 ~~Lfp~ltV~~Nva~~l~~~~~~~~e~~~rv~e~l~~v~l~~~~~~~p~~LSGGqkQRVaiARAl~~~P~llllDEP~s~ 160 (232) T cd03300 81 YALFPHLTVFENIAFGLRLKKLPKAEIKERVAEALDLVQLEGYANRKPSQLSGGQQQRVAIARALVNEPKVLLLDEPLGA 160 (232) T ss_pred CCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 85477891999987799876999999999999998758977876199666998999999999998659999998088764 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 88056799999999999972698499974875797-6643 Q gi|254780750|r 742 TATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 742 Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) =++.--..| +..+..+.+..+..++++||..+-+ .+++ T Consensus 161 LD~~~~~~i-~~~l~~l~~~~~~T~i~VTHd~~ea~~lad 199 (232) T cd03300 161 LDLKLRKDM-QLELKRLQKELGITFVFVTHDQEEALTMSD 199 (232) T ss_pred CCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 699999999-999999999859999999999999999699 No 114 >cd03246 ABCC_Protease_Secretion This family represents the ABC component of the protease secretion system PrtD, a 60-kDa integral membrane protein sharing 37% identity with HlyB, the ABC component of the alpha-hemolysin secretion pathway, in the C-terminal domain. They export degradative enzymes by using a type I protein secretion system and lack an N-terminal signal peptide, but contain a C-terminal secretion signal. The Type I secretion apparatus is made up of three components, an ABC transporter, a membrane fusion protein (MFP), and an outer membrane protein (OMP). For the HlyA transporter complex, HlyB (ABC transporter) and HlyD (MFP) reside in the inner membrane of E. coli. The OMP component is TolC, which is thought to interact with the MFP to form a continuous channel across the periplasm from the cytoplasm to the exterior. HlyB belongs to the family of ABC transporters, which are ubiquitous, ATP-dependent transmembrane pumps or channels. The spectrum of transport substra Probab=97.76 E-value=0.00042 Score=49.08 Aligned_cols=128 Identities=23% Similarity=0.250 Sum_probs=66.1 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC------------EEEEEEE Q ss_conf 4443456358777766439999677844078999999999999971985303532068221------------0567652 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD------------KLFSRVG 701 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D------------~IftRiG 701 (920) .|=+|++|.. ..+.+.-|+|||-+||||+++.++ |.+-|-+- ++-+-. +-+.=+. T Consensus 16 ~vL~~i~l~i----~~G~~vaIvG~sGsGKSTLl~ll~--------gl~~p~~G-~i~i~g~~~~~~~~~~~~~~i~~v~ 82 (173) T cd03246 16 PVLRNVSFSI----EPGESLAIIGPSGSGKSTLARLIL--------GLLRPTSG-RVRLDGADISQWDPNELGDHVGYLP 82 (173) T ss_pred CCEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCCC-EEEECCEECCCCCHHHHHCCEEEEE T ss_conf 6454769998----599999999999980999999996--------66667999-8999999933289989842089990 Q ss_pred CCCCCCCC------CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH--HHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 37661138------5328999999999999589985699932588988056--799999999999972698499974875 Q gi|254780750|r 702 SADNLASG------RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLD--GLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 702 a~D~l~~g------~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~D--G~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) -.+.++.| .|-=..-..-.|..| +....++|+||- ||..| +...-+..++.+..+ +..++++||-. T Consensus 83 Q~~~lf~~ti~eNiLSGGQkQRvalARal--~~~p~ililDEp---ts~LD~~~e~~i~~~l~~l~~~-~~Tvi~vtH~~ 156 (173) T cd03246 83 QDDELFSGSIAENILSGGQRQRLGLARAL--YGNPRILVLDEP---NSHLDVEGERALNQAIAALKAA-GATRIVIAHRP 156 (173) T ss_pred CCCEECCCCHHHHCCCHHHHHHHHHHHHH--HCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHC-CCEEEEECCCH T ss_conf 88836777589976769999999999998--279999999687---6689989999999999978648-98999984799 Q ss_pred HHHHHHH Q ss_conf 7976643 Q gi|254780750|r 774 ELTDLSK 780 (920) Q Consensus 774 eL~~l~~ 780 (920) ++...++ T Consensus 157 ~~~~~aD 163 (173) T cd03246 157 ETLASAD 163 (173) T ss_pred HHHHHCC T ss_conf 9998499 No 115 >cd03245 ABCC_bacteriocin_exporters ABC-type bacteriocin exporters. Many non-lantibiotic bacteriocins of lactic acid bacteria are produced as precursors which have N-terminal leader peptides that share similarities in amino acid sequence and contain a conserved processing site of two glycine residues in positions -1 and -2. A dedicated ATP-binding cassette (ABC) transporter is responsible for the proteolytic cleavage of the leader peptides and subsequent translocation of the bacteriocins across the cytoplasmic membrane. Probab=97.76 E-value=0.00028 Score=50.43 Aligned_cols=136 Identities=24% Similarity=0.279 Sum_probs=69.8 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH---------------------HHHHHCCCCCCHHHC--C Q ss_conf 04443456358777766439999677844078999999999---------------------999971985303532--0 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI---------------------VIMAQMGSYVPASYA--H 689 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~---------------------vilAQiG~fVPA~~a--~ 689 (920) .+|=+|++|.. ..+....|.|||-+||||++|.+.=. .+..++ +|||-+.- . T Consensus 17 ~~~L~~isl~i----~~G~~v~ivG~sGsGKSTLl~ll~gl~~p~~G~I~i~g~~~~~~~~~~~r~~i-~~v~Q~~~lf~ 91 (220) T cd03245 17 IPALDNVSLTI----RAGEKVAIIGRVGSGKSTLLKLLAGLYKPTSGSVLLDGTDIRQLDPADLRRNI-GYVPQDVTLFY 91 (220) T ss_pred CCCEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEECCEEHHHHCHHHHHHCE-EEECCCCEEEC T ss_conf 85153459998----79999999999998599999999672547865899999995772599997326-99916896766 Q ss_pred CCCCCEEEEEEEC---CC-CCC-----CCCCHHHHHH----------------------HHHHHHHHHCCCCCEEEEECC Q ss_conf 6822105676523---76-611-----3853289999----------------------999999995899856999325 Q gi|254780750|r 690 IGIVDKLFSRVGS---AD-NLA-----SGRSTFMVEM----------------------IETASILNQATNQSFVILDEI 738 (920) Q Consensus 690 i~~~D~IftRiGa---~D-~l~-----~g~STF~vEm----------------------~e~~~IL~~at~~SLVllDEl 738 (920) -++.+.| ++|. .| .+. .|...|...+ .-.|..| +.+..++|+||- T Consensus 92 ~Ti~eNi--~~~~~~~~~~~i~~~~~~~~l~~~i~~~~~g~~t~i~~~g~~LSgGqkQri~lARal--~~~~~ililDEp 167 (220) T cd03245 92 GTLRDNI--TLGAPLADDERILRAAELAGVTDFVNKHPNGLDLQIGERGRGLSGGQRQAVALARAL--LNDPPILLLDEP 167 (220) T ss_pred CCHHHHH--HCCCCCCCHHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCEECHHHHHHHHHHHHH--HCCCCEEEEECC T ss_conf 7599985--357977897999999999597899973755434535899972189999999999999--559999999687 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHHHH Q ss_conf 889880567999999999999726-9849997487579766430 Q gi|254780750|r 739 GRGTATLDGLSIAWATIEYLHETN-RCRGLLATHFHELTDLSKS 781 (920) Q Consensus 739 GrGTst~DG~aiA~aile~l~~~~-~~~~lfaTHy~eL~~l~~~ 781 (920) ||..|-..- ..+++.|.+.. +..++++||-.++...++. T Consensus 168 ---ts~LD~~~~-~~i~~~l~~~~~~~Tvi~itH~~~~~~~~D~ 207 (220) T cd03245 168 ---TSAMDMNSE-ERLKERLRQLLGDKTLIIITHRPSLLDLVDR 207 (220) T ss_pred ---CCCCCHHHH-HHHHHHHHHHCCCCEEEEEECCHHHHHHCCE T ss_conf ---568898999-9999999987699989999359889984999 No 116 >PRK09984 phosphonate/organophosphate ester transporter subunit; Provisional Probab=97.76 E-value=0.00043 Score=49.02 Aligned_cols=48 Identities=8% Similarity=0.080 Sum_probs=31.0 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 9985699932588988056799999999999972698499974875797 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) .+--++|+||--.|=++.--.- -+.++..|.+..+..++++||.-+.. T Consensus 169 ~~P~iLllDEPta~LDp~~~~~-i~~~l~~l~~~~g~Til~vtHdl~~a 216 (262) T PRK09984 169 QQAKVILADEPIASLDPESARI-VMDTLRDINQNDGITVVVTLHQVDYA 216 (262) T ss_pred CCCCEEEECCCCCCCCHHHHHH-HHHHHHHHHHHCCCEEEEECCCHHHH T ss_conf 1999999838867799999999-99999999985497999988898999 No 117 >PRK09452 potA putrescine/spermidine ABC transporter ATPase protein; Reviewed Probab=97.75 E-value=0.00032 Score=49.93 Aligned_cols=142 Identities=23% Similarity=0.244 Sum_probs=72.1 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC---------CCCHHHCCCCCC---CEEEEEEE Q ss_conf 444345635877776643999967784407899999999999997198---------530353206822---10567652 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS---------YVPASYAHIGIV---DKLFSRVG 701 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~---------fVPA~~a~i~~~---D~IftRiG 701 (920) -+=+|++|.. ..+.+.-|.|||-+||||+||.++=+.-- .-|. -+|++.-.++.+ ..+|-.+- T Consensus 31 ~aL~~vsl~I----~~GE~~~llGpSGsGKSTLlr~iaGl~~p-~sG~I~~~G~~v~~~~~~~R~ig~VfQ~~aLfp~lT 105 (378) T PRK09452 31 EVISNLDLTI----NNGEFLTLLGPSGCGKTTVLRLIAGFETP-DSGRIMLDGQDITHVPAENRYVNTVFQSYALFPHMT 105 (378) T ss_pred EEEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCC-CCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCC T ss_conf 9993627799----99989999989997699999999769999-846999999998989978988589922764378986 Q ss_pred CCCCCCCCC-----CHH-----HHHHH---HHHHHHH-------------------HCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 376611385-----328-----99999---9999999-------------------589985699932588988056799 Q gi|254780750|r 702 SADNLASGR-----STF-----MVEMI---ETASILN-------------------QATNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 702 a~D~l~~g~-----STF-----~vEm~---e~~~IL~-------------------~at~~SLVllDElGrGTst~DG~a 749 (920) ..||+.-|. +.= ..|+. .+...++ -+.+-.++|+||--.+=++.--.. T Consensus 106 V~eNv~~~l~~~~~~~~e~~~rv~e~L~~v~L~~~~~r~p~~LSGGqqQRVaiARAL~~~P~vLLLDEPts~LD~~~r~~ 185 (378) T PRK09452 106 VFENVAFGLRMQKTPAAEITPRVMEALRMVQLEEFAQRKPHQLSGGQQQRVAIARAVVNKPKVLLLDESLSALDYKLRKQ 185 (378) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHH T ss_conf 99999989976599879999999999875073435436835499889999999998623998999578644479999999 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 999999999972698499974875797-66430 Q gi|254780750|r 750 IAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 750 iA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) + +..+..|.+..+..++|+||..+.+ .+++. T Consensus 186 ~-~~~l~~l~~~~g~T~i~VTHD~~eA~~laDr 217 (378) T PRK09452 186 M-QNELKALQRKLGITFVFVTHDQEEALTMSDR 217 (378) T ss_pred H-HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 9-9999999998499899998899999986998 No 118 >PRK11629 lolD lipoprotein transporter ATP-binding subunit; Provisional Probab=97.75 E-value=0.0013 Score=45.38 Aligned_cols=130 Identities=24% Similarity=0.346 Sum_probs=69.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH-----------------C-----CCCC Q ss_conf 44345635877776643999967784407899999999999997198530353-----------------2-----0682 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY-----------------A-----HIGI 692 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~-----------------a-----~i~~ 692 (920) |=+|+++.. ..+.+.-|.|||-+||||++|.++ |..-|.+- + .++. T Consensus 24 ~l~~vs~~i----~~GE~v~iiG~sGsGKSTLl~~i~--------Gl~~p~~G~I~~~g~~i~~~~~~~~~~~r~~~ig~ 91 (233) T PRK11629 24 VLHNVSFSI----GEGEMMAIVGSSGSGKSTLLHLLG--------GLDTPTSGDVIFNGQPMSKLSSAAKAELRNQKLGF 91 (233) T ss_pred EEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCCCCEEEEECCEECCCCCHHHHHHHHCCCEEE T ss_conf 984628998----899899999999940999999996--------69999863999999998869988999873797899 Q ss_pred C---CEEEEEEECCCCCCC-----CCCHH-----HHHHHHHHH---------------------HHH-HCCCCCEEEEEC Q ss_conf 2---105676523766113-----85328-----999999999---------------------999-589985699932 Q gi|254780750|r 693 V---DKLFSRVGSADNLAS-----GRSTF-----MVEMIETAS---------------------ILN-QATNQSFVILDE 737 (920) Q Consensus 693 ~---D~IftRiGa~D~l~~-----g~STF-----~vEm~e~~~---------------------IL~-~at~~SLVllDE 737 (920) + ..+|.++-..||+.. +.+.- ..|+.|.-. |-+ -++.-.++|+|| T Consensus 92 v~Q~~~l~~~~tv~eni~~~l~~~~~~~~~~~~~~~~~l~~~gl~~~~~~~~~~LSGGqkQRvaiAraL~~~P~illlDE 171 (233) T PRK11629 92 IYQFHHLLPDFTALENVAMPLLIGKKKPAEINSRALEMLKAVGLEHRANHRPSELSGGERQRVAIARALVNNPRLVLADE 171 (233) T ss_pred ECCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCEECHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 91675237786699999889998499999999989999987273667749846638999999999999965999999928 Q ss_pred CCCCCCHHHHHHH--HHHHHHHHHHHCCCEEEEECCCHHHHHHH Q ss_conf 5889880567999--99999999972698499974875797664 Q gi|254780750|r 738 IGRGTATLDGLSI--AWATIEYLHETNRCRGLLATHFHELTDLS 779 (920) Q Consensus 738 lGrGTst~DG~ai--A~aile~l~~~~~~~~lfaTHy~eL~~l~ 779 (920) - |+..|..+- -+.++..+.+..+...+|+||..++.... T Consensus 172 P---Ts~LD~~~~~~i~~~l~~l~~~~g~tvi~vtHdl~~a~~~ 212 (233) T PRK11629 172 P---TGNLDARNADSIFQLLGELNRLQGTAFLVVTHDLQLAKRM 212 (233) T ss_pred C---CCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHH T ss_conf 8---8879999999999999999997098999986899999960 No 119 >cd03256 ABC_PhnC_transporter ABC-type phosphate/phosphonate transport system. Phosphonates are a class of organophosphorus compounds characterized by a chemically stable carbon-to-phosphorus (C-P) bond. Phosphonates are widespread among naturally occurring compounds in all kingdoms of wildlife, but only procaryotic microorganisms are able to cleave this bond. Certain bacteria such as E. coli can use alkylphosphonates as a phosphorus source. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.75 E-value=0.00021 Score=51.35 Aligned_cols=34 Identities=41% Similarity=0.490 Sum_probs=27.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 44345635877776643999967784407899999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval 672 (920) |=+||++.. ..+.+.-|-|||-+||||+||.++- T Consensus 16 ~L~~isl~i----~~Ge~~~iiGpsGsGKSTLl~~i~g 49 (241) T cd03256 16 ALKDVSLSI----NPGEFVALIGPSGAGKSTLLRCLNG 49 (241) T ss_pred EEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHC T ss_conf 997838899----9998999999998339999999974 No 120 >PRK11831 putative ABC transporter ATP-binding protein YrbF; Provisional Probab=97.75 E-value=0.0011 Score=45.89 Aligned_cols=138 Identities=22% Similarity=0.265 Sum_probs=72.7 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHC------CCCCCHH--------HCCCCCC---CEEE Q ss_conf 443456358777766439999677844078999999999999971------9853035--------3206822---1056 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQM------GSYVPAS--------YAHIGIV---DKLF 697 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQi------G~fVPA~--------~a~i~~~---D~If 697 (920) |=+|+++.. ..+.+.-|.|||-+||||++|.++=+.- ..- |--+|.- ...++.+ ..+| T Consensus 23 vL~~is~~i----~~Gei~~iiGpnGsGKSTLlk~i~Gl~~-p~~G~I~~~G~~i~~~~~~~~~~~r~~i~~v~Q~~~l~ 97 (269) T PRK11831 23 IFDNISLTV----PRGKITAIMGPSGIGKTTLLRLIGGQIA-PDHGEILFDGENIPAMSRSRLYTVRKRMSMLFQSGALF 97 (269) T ss_pred EEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCC-CCCCEEEECCEECCCCCHHHHHHHHHCEEEECCCCCCC T ss_conf 994716688----7998999993999759999999967988-89866999998887658878998761468985376326 Q ss_pred EEEECCCCCCCCC-----------------------------------CHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC Q ss_conf 7652376611385-----------------------------------32899999999999958998569993258898 Q gi|254780750|r 698 SRVGSADNLASGR-----------------------------------STFMVEMIETASILNQATNQSFVILDEIGRGT 742 (920) Q Consensus 698 tRiGa~D~l~~g~-----------------------------------STF~vEm~e~~~IL~~at~~SLVllDElGrGT 742 (920) ..+-..||+.-+. |-=+--...+|..| +.+-.++|+||-..|- T Consensus 98 ~~ltv~eni~~~l~~~~~~~~~~~~~~v~~~Le~~gL~~~~~~~~~~LSGGq~QRv~iAraL--~~~P~iLlLDEPtsgL 175 (269) T PRK11831 98 TDMNVFDNVAYPLREHTQLPAPLLHSTVMMKLEAVGLRGAAKLMPSELSGGMARRAALARAI--ALEPDLIMFDEPFVGQ 175 (269) T ss_pred CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHH--HCCCCEEEECCCCCCC T ss_conf 78859999989999955899999999999999861765456388231899999999999999--7599999982875679 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEECCCH-HHHHHHH Q ss_conf 8056799999999999972698499974875-7976643 Q gi|254780750|r 743 ATLDGLSIAWATIEYLHETNRCRGLLATHFH-ELTDLSK 780 (920) Q Consensus 743 st~DG~aiA~aile~l~~~~~~~~lfaTHy~-eL~~l~~ 780 (920) ++..-..| +.++..|.+..+..++++||.- +...+++ T Consensus 176 D~~~~~~i-~~li~~l~~~~g~TiiivtHdl~~v~~iaD 213 (269) T PRK11831 176 DPITMGVL-VKLISELNSALGVTCVVVSHDVPEVLSIAD 213 (269) T ss_pred CHHHHHHH-HHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 99999999-999999998529899998649899998699 No 121 >PRK10619 histidine/lysine/arginine/ornithine transporter subunit; Provisional Probab=97.74 E-value=0.00058 Score=48.04 Aligned_cols=49 Identities=14% Similarity=0.070 Sum_probs=33.3 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 899856999325889880567999999999999726984999748757976 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) +.+-.++|+||--.|-++.--..| +.++..|.+. +.-++|+||.-+... T Consensus 168 ~~~P~lLllDEPts~LD~~~~~~i-~~ll~~l~~~-g~tii~vtHdl~~~~ 216 (257) T PRK10619 168 AMEPEVLLFDEPTSALDPELVGEV-LRIMQQLAEE-GKTMVVVTHEMGFAR 216 (257) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHH T ss_conf 639989997688665898999999-9999999975-999999948999999 No 122 >TIGR03410 urea_trans_UrtE urea ABC transporter, ATP-binding protein UrtE. Members of this protein family are ABC transporter ATP-binding subunits associated with urea transport and metabolism. This protein is found in a conserved five-gene transport operon typically found adjacent to urease genes. It was shown in Cyanobacteria that disruption leads to the loss of high-affinity urea transport activity. Probab=97.74 E-value=0.00037 Score=49.52 Aligned_cols=133 Identities=25% Similarity=0.318 Sum_probs=73.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCH-----------------HHCCCCC----- Q ss_conf 443456358777766439999677844078999999999999971985303-----------------5320682----- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPA-----------------SYAHIGI----- 692 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA-----------------~~a~i~~----- 692 (920) |=+|+++.. ..+.+.-|-|||-+||||++|.+ +|.+-|- +.++.++ T Consensus 15 ~L~~vs~~v----~~Gei~~liGpNGaGKSTL~~~i--------~Gl~~p~~G~I~~~G~~i~~~~~~~~~r~gig~v~Q 82 (230) T TIGR03410 15 ILRGVSLEV----PKGEVTCVLGRNGVGKTTLLKTL--------MGLLPVKSGSIRLDGEDITKLPPHERARAGIAYVPQ 82 (230) T ss_pred EECCEEEEE----CCCCEEEEECCCCCHHHHHHHHH--------HCCCCCCCEEEEECCEECCCCCHHHHHHCCEEEECC T ss_conf 888717799----99979999999994099999999--------779999954999999999999989999829599377 Q ss_pred CCEEEEEEECCCCCCCCCCHH-------HHHHHHHHHHHH-----------------------HCCCCCEEEEECCCCCC Q ss_conf 210567652376611385328-------999999999999-----------------------58998569993258898 Q gi|254780750|r 693 VDKLFSRVGSADNLASGRSTF-------MVEMIETASILN-----------------------QATNQSFVILDEIGRGT 742 (920) Q Consensus 693 ~D~IftRiGa~D~l~~g~STF-------~vEm~e~~~IL~-----------------------~at~~SLVllDElGrGT 742 (920) -..+|.++-..+|+.-|...+ .-++.|.-.+|. -++.-.++|+||--.|= T Consensus 83 ~~~lf~~lTV~Enl~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~r~~~~LSGGq~Qrv~iAraL~~~P~lLlLDEPt~gL 162 (230) T TIGR03410 83 GREIFPRLTVEENLLTGLAALPRRSRKIPDEIYELFPVLKEMLGRRGGDLSGGQQQQLAIARALVTRPKLLLLDEPTEGI 162 (230) T ss_pred CCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 74257678899999999987496678899999999999999983851119999999999999996299889993852269 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 8056799999999999972698499974875797-6643 Q gi|254780750|r 743 ATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 743 st~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) ++..-..| |.++..|.+..++-++++||.-+.. .+++ T Consensus 163 D~~~~~~i-~~~i~~l~~~~g~tvl~vtH~l~~~~~~ad 200 (230) T TIGR03410 163 QPSIIKDI-GRVIRRLRAEGGMAILLVEQYLDFARELAD 200 (230) T ss_pred CHHHHHHH-HHHHHHHHHCCCCEEEEEECCHHHHHHHCC T ss_conf 99999999-999999997179899999088899999699 No 123 >PRK11248 tauB taurine transporter ATP-binding subunit; Provisional Probab=97.74 E-value=0.0011 Score=45.95 Aligned_cols=132 Identities=27% Similarity=0.286 Sum_probs=69.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCE-----------------EE Q ss_conf 4434563587777664399996778440789999999999999719853035320682210-----------------56 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDK-----------------LF 697 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~-----------------If 697 (920) |=+|++|.. ..+.+.-|-|||-+||||+||.++ |..-| ++-++.+.++ +| T Consensus 16 ~L~dvsl~i----~~Ge~~~lvGpnGaGKSTLl~~i~--------Gl~~p-~~G~I~~~G~~i~~~~~~~g~vfQ~~~l~ 82 (255) T PRK11248 16 ALEDINLTL----ESGELLVVLGPSGCGKTTLLNLIA--------GFVPP-QHGSITLDGKPVEGPGAERGVVFQNEGLL 82 (255) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCC-CCCEEEECCCCCCCCCHHCEEEECCCCCC T ss_conf 881317798----699899999999846999999997--------59988-99718579964788621106994557547 Q ss_pred EEEECCCCCCCCCCHH---HHHHHHHH-HHH----------------------------HHCCCCCEEEEECCCCCCCHH Q ss_conf 7652376611385328---99999999-999----------------------------958998569993258898805 Q gi|254780750|r 698 SRVGSADNLASGRSTF---MVEMIETA-SIL----------------------------NQATNQSFVILDEIGRGTATL 745 (920) Q Consensus 698 tRiGa~D~l~~g~STF---~vEm~e~~-~IL----------------------------~~at~~SLVllDElGrGTst~ 745 (920) -..-..||+.-+.--. -.|..+.+ .+| --++.-.++|+||--.|=++. T Consensus 83 p~~tv~env~~~l~~~g~~~~~~~~~~~~~L~~vgL~~~~~~~p~~LSGGqkQRVaiArAL~~~P~iLllDEPt~~LD~~ 162 (255) T PRK11248 83 PWRNVQDNVAFGLQLAGVEKMQRLEIAHQMLKKVGLEGAEKRYIWQLSGGQRQRVGIARALAANPQLLLLDEPFGALDAF 162 (255) T ss_pred CCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHH T ss_conf 56879999998998748987899999999999769902441893349999999999999997299999980887779989 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 6799999999999972698499974875797-6643 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) --..+ +..+..|.+..+.-++|+||.-+.+ .+++ T Consensus 163 ~r~~l-~~ll~~l~~~~g~Til~vTHdl~ea~~lad 197 (255) T PRK11248 163 TREQM-QELLLKLWQETGKQVLLITHDIEEAVFMAT 197 (255) T ss_pred HHHHH-HHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 99999-999999999619999998868999999699 No 124 >cd03296 ABC_CysA_sulfate_importer Part of the ABC transporter complex cysAWTP involved in sulfate import. Responsible for energy coupling to the transport system. The complex is composed of two ATP-binding proteins (cysA), two transmembrane proteins (cysT and cysW), and a solute-binding protein (cysP). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.73 E-value=0.0004 Score=49.21 Aligned_cols=132 Identities=23% Similarity=0.277 Sum_probs=73.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC-------------------- Q ss_conf 443456358777766439999677844078999999999999971985303532068221-------------------- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD-------------------- 694 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D-------------------- 694 (920) |=+|+++.. ..+.+..|.|||-+||||++|.++=+ .=| ++-++-+-+ T Consensus 17 ~l~~is~~v----~~Ge~~~iiGpSGsGKSTll~~i~Gl--------~~p-~~G~I~~~g~~i~~~~~~~r~ig~vfQ~~ 83 (239) T cd03296 17 ALDDVSLDI----PSGELVALLGPSGSGKTTLLRLIAGL--------ERP-DSGTILFGGEDATDVPVQERNVGFVFQHY 83 (239) T ss_pred EECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCC--------CCC-CCEEEEECCEECCCCCHHHCCEEEECCCC T ss_conf 986638698----89989999999997799999999769--------999-86399999999999996567767981782 Q ss_pred EEEEEEECCCCCCCCC------C---HH--------HHHHHHHHHHH------------------H-HCCCCCEEEEECC Q ss_conf 0567652376611385------3---28--------99999999999------------------9-5899856999325 Q gi|254780750|r 695 KLFSRVGSADNLASGR------S---TF--------MVEMIETASIL------------------N-QATNQSFVILDEI 738 (920) Q Consensus 695 ~IftRiGa~D~l~~g~------S---TF--------~vEm~e~~~IL------------------~-~at~~SLVllDEl 738 (920) .+|-.+-..||+.-|. + .= ..|+.+.+..+ | -|.+-.++|+||- T Consensus 84 ~Lfp~ltV~eNi~~~l~~~~~~~~~~~~e~~~rv~~~l~~v~l~~~~~~~p~eLSGGq~QRVaiARAl~~~P~vlllDEP 163 (239) T cd03296 84 ALFRHMTVFDNVAFGLRVKPRSERPPEAEIRAKVHELLKLVQLDWLADRYPAQLSGGQRQRVALARALAVEPKVLLLDEP 163 (239) T ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 10679969999987997335456998999999999998654997677489666999898999999987649998997388 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 88988056799999999999972698499974875797-6643 Q gi|254780750|r 739 GRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 739 GrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) -.|=++.-...| +..+..|.+..+..++|+||..+.+ .+++ T Consensus 164 ~s~LD~~~~~~i-~~~l~~l~~e~~~T~i~vTHd~~~a~~laD 205 (239) T cd03296 164 FGALDAKVRKEL-RRWLRRLHDELHVTTVFVTHDQEEALEVAD 205 (239) T ss_pred CCCCCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 664699999999-999999999859989999889999999699 No 125 >cd03301 ABC_MalK_N The N-terminal ATPase domain of the maltose transporter, MalK. ATP binding cassette (ABC) proteins function from bacteria to human, mediating the translocation of substances into and out of cells or organelles. ABC transporters contain two transmembrane-spanning domains (TMDs) or subunits and two nucleotide binding domains (NBDs) or subunits that couple transport to the hydrolysis of ATP. In the maltose transport system, the periplasmic maltose binding protein (MBP) stimulates the ATPase activity of the membrane-associated transporter, which consists of two transmembrane subunits, MalF and MalG, and two copies of the ATP binding subunit, MalK, and becomes tightly bound to the transporter in the catalytic transition state, ensuring that maltose is passed to the transporter as ATP is hydrolyzed. Probab=97.73 E-value=0.0004 Score=49.23 Aligned_cols=134 Identities=21% Similarity=0.255 Sum_probs=71.5 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCC----------CCCC--------- Q ss_conf 444345635877776643999967784407899999999999997198530353206----------8221--------- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHI----------GIVD--------- 694 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i----------~~~D--------- 694 (920) .|=+|++|.. ..+.+..|-||+-+||||+||.++ |..=| ++-++ ++-+ T Consensus 14 ~~l~~isl~v----~~Ge~~~i~GpSG~GKSTlLr~ia--------Gl~~p-~~G~I~~~g~~i~~~~~~~R~ig~VfQ~ 80 (213) T cd03301 14 TALDDLNLDI----ADGEFVVLLGPSGCGKTTTLRMIA--------GLEEP-TSGRIYIGGRDVTDLPPKDRDIAMVFQN 80 (213) T ss_pred EEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCC-CCEEEEECCEECCCCCHHHCCEEEEECC T ss_conf 9987617798----699899999999880999999997--------69999-8639999999999999767887899458 Q ss_pred -EEEEEEECCCCCCCC-----CCHH--------HHHHHHHHHH------------------HH-HCCCCCEEEEECCCCC Q ss_conf -056765237661138-----5328--------9999999999------------------99-5899856999325889 Q gi|254780750|r 695 -KLFSRVGSADNLASG-----RSTF--------MVEMIETASI------------------LN-QATNQSFVILDEIGRG 741 (920) Q Consensus 695 -~IftRiGa~D~l~~g-----~STF--------~vEm~e~~~I------------------L~-~at~~SLVllDElGrG 741 (920) .+|-.+-..|||.-| .+.= +.|+...... -| -+++-.++|+||--.+ T Consensus 81 ~~LfP~ltV~eNI~~~l~~~~~~~~e~~~~v~~~l~~~gl~~~~~~~P~~LSGGqkQRVaiARAl~~~P~lLLlDEP~sa 160 (213) T cd03301 81 YALYPHMTVYDNIAFGLKLRKVPKDEIDERVREVAELLQIEHLLDRKPKQLSGGQRQRVALGRAIVREPKVFLMDEPLSN 160 (213) T ss_pred CCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 76465470999999899985999899999999999875992465099556999999999999998759998998388764 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 88056799999999999972698499974875797-66430 Q gi|254780750|r 742 TATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 742 Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) =++.-...| ...+..+.+..+..++|+||..+-+ .+++. T Consensus 161 LD~~~r~~i-~~~l~~~~~~~~~T~i~vTHd~~ea~~l~dr 200 (213) T cd03301 161 LDAKLRVQM-RAELKRLQQRLGTTTIYVTHDQVEAMTMADR 200 (213) T ss_pred CCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 298999999-9999999997499899999998999996998 No 126 >cd03236 ABC_RNaseL_inhibitor_domain1 The ATPase domain 1 of RNase L inhibitor. The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI s are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLIs have an N-terminal Fe-S domain and two nucleotide binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=97.73 E-value=0.00062 Score=47.82 Aligned_cols=48 Identities=21% Similarity=0.066 Sum_probs=31.7 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 89985699932588988056799999999999972698499974875797 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +..-.++|+||--.|=++.. ..-.+.++..|.+. +-.++++||.-... T Consensus 155 ~~~P~iLlLDEPTs~LD~~~-~~~v~~li~~L~~~-G~Tvi~vtHDl~~~ 202 (255) T cd03236 155 ARDADFYFFDEPSSYLDIKQ-RLNAARLIRELAED-DNYVLVVEHDLAVL 202 (255) T ss_pred CCCCCEEEEECCCCCCCHHH-HHHHHHHHHHHHHC-CCEEEEECCCHHHH T ss_conf 68999999979876589999-99999999999978-99999990789999 No 127 >PRK11650 ugpC glycerol-3-phosphate transporter ATP-binding subunit; Provisional Probab=97.73 E-value=0.00035 Score=49.72 Aligned_cols=131 Identities=24% Similarity=0.325 Sum_probs=71.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC-------------------- Q ss_conf 443456358777766439999677844078999999999999971985303532068221-------------------- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD-------------------- 694 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D-------------------- 694 (920) +=+|++|.. ..+.+.-|-|||-+||||+||.++=+ .+.++-++-+.+ T Consensus 19 al~~vsl~i----~~Ge~~~llGpsG~GKTTllr~iaGl---------~~p~~G~I~~~g~~v~~~~~~~R~i~~vfQ~~ 85 (358) T PRK11650 19 VIKGIDLDV----ADGEFIVLVGPSGCGKSTLLRMVAGL---------ERITSGEIWIGGRVVNELEPADRDIAMVFQNY 85 (358) T ss_pred EEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCC---------CCCCCEEEEECCEECCCCCHHHCCEEEECCCC T ss_conf 982527798----89989999999863699999999769---------99886299999999999997787576772555 Q ss_pred EEEEEEECCCCCCCC-----CCHH--------HHHHHHHHHHHH-------------------HCCCCCEEEEECCCCCC Q ss_conf 056765237661138-----5328--------999999999999-------------------58998569993258898 Q gi|254780750|r 695 KLFSRVGSADNLASG-----RSTF--------MVEMIETASILN-------------------QATNQSFVILDEIGRGT 742 (920) Q Consensus 695 ~IftRiGa~D~l~~g-----~STF--------~vEm~e~~~IL~-------------------~at~~SLVllDElGrGT 742 (920) .+|-.+-..||+.-+ .|.= +.|+.++...++ -+.+-.++|+||- | T Consensus 86 ~L~p~ltV~eni~~~l~~~~~~~~~~~~rv~~~l~~l~l~~~~~r~p~~LSGGq~QRvalARAL~~~P~vlllDEP---~ 162 (358) T PRK11650 86 ALYPHMSVRENMAYGLKIRGMPKAEIEERVAEAARILELEPLLDRKPRELSGGQRQRVAMGRAIVREPAVFLFDEP---L 162 (358) T ss_pred CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCC---C T ss_conf 4487874878665578762886467889999998752262422489747895678999983575049986887388---7 Q ss_pred CHHHHH--HHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 805679--9999999999972698499974875797-66430 Q gi|254780750|r 743 ATLDGL--SIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 743 st~DG~--aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) |..|-. .--+..+..|++..+..++|+||..+-+ .+++. T Consensus 163 s~LD~~~r~~~~~~l~~l~~~~g~T~i~vTHd~~eA~~laDr 204 (358) T PRK11650 163 SNLDAKLRVQMRLEIQRLHRRLGTTSLYVTHDQVEAMTLADR 204 (358) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHCCE T ss_conf 767998999999999999997597799998999999986999 No 128 >cd03264 ABC_drug_resistance_like ABC-type multidrug transport system, ATPase component. The biological function of this family is not well characterized, but display ABC domains similar to members of ABCA subfamily. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.71 E-value=0.001 Score=46.17 Aligned_cols=47 Identities=21% Similarity=0.213 Sum_probs=30.7 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 89985699932588988056799999999999972698499974875797 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +..-.++|+||--.|=++.--..+ |..+..+. . +...+++||+-+-. T Consensus 146 ~~~P~lliLDEPt~gLDp~~~~~~-~~ll~~l~-~-~~tii~stH~l~e~ 192 (211) T cd03264 146 VGDPSILIVDEPTAGLDPEERIRF-RNLLSELG-E-DRIVILSTHIVEDV 192 (211) T ss_pred HCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHC-C-CCEEEEECCCHHHH T ss_conf 289999999489767899999999-99999972-9-98999989988999 No 129 >PRK11819 putative ABC transporter ATP-binding protein; Reviewed Probab=97.70 E-value=0.00022 Score=51.22 Aligned_cols=134 Identities=20% Similarity=0.229 Sum_probs=66.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEEC----CCCCCCC Q ss_conf 444345635877776643999967784407899999999999997198530353206822105676523----7661138 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGS----ADNLASG 709 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa----~D~l~~g 709 (920) .+=+|+++.. ..+..+-|.|||-+||||+||.++=. +-..-|...-....+++-||+-.-.+-. .+++..| T Consensus 338 ~ll~~vs~~i----~~Ge~iaivG~NGsGKSTLlk~l~G~-~~p~~G~i~~g~~v~igy~~Q~~~~l~~~~tv~e~v~~~ 412 (556) T PRK11819 338 LLIDDLSFKL----PPGGIVGIIGPNGAGKSTLFKMITGQ-EQPDSGTIKIGETVKLAYVDQSRDALDPNKTVWEEISGG 412 (556) T ss_pred EEEEEECCCC----CCCCEEEEECCCCCCHHHHHHHHHCC-CCCCCCEEEECCCCCCCCCCHHHHHCCCCCCHHHHHHCH T ss_conf 7887640235----78824789889877588999998386-568885599899665032212054269768499987452 Q ss_pred CCH----------------H---------------HHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 532----------------8---------------999999999999589985699932588988056799999999999 Q gi|254780750|r 710 RST----------------F---------------MVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYL 758 (920) Q Consensus 710 ~ST----------------F---------------~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l 758 (920) ... | .-|-..++-..--+++--|.|+||- |+-.|=.++ .|+-+.| T Consensus 413 ~~~~~~~~~e~~~r~~L~~f~~~~~~~~~~v~~LSGGek~Rv~lA~~l~~~p~lLiLDEP---Tn~LDi~s~-e~Le~aL 488 (556) T PRK11819 413 LDIIKVGNREIPSRAYVGRFNFKGGDQQKKVGVLSGGERNRLHLAKTLKSGGNVLLLDEP---TNDLDVETL-RALEDAL 488 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC---CCCCCHHHH-HHHHHHH T ss_conf 666542167789999998707872455197031889999999999999629898999297---756799999-9999999 Q ss_pred HHHCCCEEEEECCCHHHHH Q ss_conf 9726984999748757976 Q gi|254780750|r 759 HETNRCRGLLATHFHELTD 777 (920) Q Consensus 759 ~~~~~~~~lfaTHy~eL~~ 777 (920) .+-.|+ +||++|...+.+ T Consensus 489 ~~y~Gt-vl~VSHDr~fi~ 506 (556) T PRK11819 489 LEFPGC-AVVISHDRWFLD 506 (556) T ss_pred HHCCCE-EEEEECCHHHHH T ss_conf 877996-999978999999 No 130 >PRK11432 fbpC ferric transporter ATP-binding subunit; Provisional Probab=97.70 E-value=0.00045 Score=48.90 Aligned_cols=153 Identities=24% Similarity=0.259 Sum_probs=80.8 Q ss_pred CCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 47871400004680588763102877044434563587777664399996778440789999999999999719853035 Q gi|254780750|r 607 DNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS 686 (920) Q Consensus 607 ~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~ 686 (920) ++++.+.+++..+- -+..-+=+|++|.. ..+.+..|-||+-+||||+||.+| |..-|.+ T Consensus 2 ~~~~~i~~~~lsk~---------yg~~~al~~vsl~i----~~Ge~~~llGpSG~GKTTlLr~ia--------Gl~~p~~ 60 (351) T PRK11432 2 SNKNFVVLKNITKR---------FGKNTVIDNLDLTI----KQGTMVTLLGPSGCGKTTVLRLVA--------GLEKPTS 60 (351) T ss_pred CCCCEEEEEEEEEE---------ECCEEEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCC T ss_conf 99976999647999---------89948984457498----899899999999649999999997--------6999883 Q ss_pred ----------------HCCCCCC---CEEEEEEECCCCCCCCC-----CHH-----HHHHHH---HHHHH---------- Q ss_conf ----------------3206822---10567652376611385-----328-----999999---99999---------- Q gi|254780750|r 687 ----------------YAHIGIV---DKLFSRVGSADNLASGR-----STF-----MVEMIE---TASIL---------- 724 (920) Q Consensus 687 ----------------~a~i~~~---D~IftRiGa~D~l~~g~-----STF-----~vEm~e---~~~IL---------- 724 (920) .-.++.+ -.+|-.+-..||+.-|. +.= ..|+.+ +..++ T Consensus 61 G~I~~~g~~v~~~~~~~R~i~~VfQ~~aLfPh~tV~eNi~~~l~~~~~~~~e~~~rv~e~l~~v~L~~~~~r~P~~LSGG 140 (351) T PRK11432 61 GQIFIDGEDVTKRSIQQRDICMVFQSYALFPHMSLGENVGYGLKMLGVPKEERKQRVKEALELVDLAGFEDRYVDQISGG 140 (351) T ss_pred EEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHCCHHHCCHH T ss_conf 69999999999999545886999448876766809999977998759999999999999997649966145895578998 Q ss_pred --------H-HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf --------9-589985699932588988056799999999999972698499974875797-66430 Q gi|254780750|r 725 --------N-QATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 725 --------~-~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) | -+.+-+++|+||--.+=++.=-..+ +.-+..|++..+..++|+||..+-+ .+++. T Consensus 141 q~QRValARAL~~~P~vlLlDEP~s~LD~~lR~~~-~~~l~~l~~~~~~T~i~VTHD~~EA~~laDr 206 (351) T PRK11432 141 QQQRVALARALVLKPKVLLFDEPLSNLDANLRRSM-REKIRELQQQFNITSLYVTHDQSEAFAVSDT 206 (351) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 99999999998449989998687543699999999-9999999998699999999998999986999 No 131 >cd03293 ABC_NrtD_SsuB_transporters NrtD and SsuB are the ATP-binding subunits of the bacterial ABC-type nitrate and sulfonate transport systems, respectively. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.69 E-value=0.00014 Score=52.61 Aligned_cols=130 Identities=25% Similarity=0.243 Sum_probs=66.2 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCE-----------------EE Q ss_conf 4434563587777664399996778440789999999999999719853035320682210-----------------56 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDK-----------------LF 697 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~-----------------If 697 (920) |=+|++|.. ..+.+..|.|||-+||||+||.++= ..-| ++-++.+..+ +| T Consensus 19 al~~vsl~i----~~Ge~~~iiGpsGsGKSTLl~~i~G--------l~~p-~~G~I~~~G~~i~~~~~~ig~vfQ~~~L~ 85 (220) T cd03293 19 ALEDISLSV----EEGEFVALVGPSGCGKSTLLRIIAG--------LERP-TSGEVLVDGEPVTGPGPDRGYVFQQDALL 85 (220) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHC--------CCCC-CCCEEEECCEECCCCCCCEEEEECCCCCC T ss_conf 996718898----7998999999999579999999975--------9998-87389999996788898879992488537 Q ss_pred EEEECCCCCCCC-----CCHHHHHHHH-HHHHHH----------------------------HCCCCCEEEEECCCCCCC Q ss_conf 765237661138-----5328999999-999999----------------------------589985699932588988 Q gi|254780750|r 698 SRVGSADNLASG-----RSTFMVEMIE-TASILN----------------------------QATNQSFVILDEIGRGTA 743 (920) Q Consensus 698 tRiGa~D~l~~g-----~STF~vEm~e-~~~IL~----------------------------~at~~SLVllDElGrGTs 743 (920) -.+-..+|+.-+ .+. .|..+ +..+|+ -+.+-.++|+||--.|=+ T Consensus 86 p~~tv~eni~~~l~~~~~~~--~~~~~~v~~~l~~~gL~~~~~~~p~~LSGGqkQRvaiARaL~~~P~llllDEPts~LD 163 (220) T cd03293 86 PWLTVLDNVALGLELQGVPK--AEARERAEELLELVGLSGFENAYPHQLSGGMRQRVALARALAVDPDVLLLDEPFSALD 163 (220) T ss_pred CCCCHHHHHHHHHHHCCCCH--HHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 78879999988998659998--9999999999998789547618931299999999999999866999999808876569 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 056799999999999972698499974875797-6643 Q gi|254780750|r 744 TLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 744 t~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) +.-... -|..+..+.+..+..++|+||..+.+ .+++ T Consensus 164 ~~~~~~-i~~~l~~l~~~~g~tii~vTHdl~~a~~laD 200 (220) T cd03293 164 ALTREQ-LQEELLDIWRETGKTVLLVTHDIDEAVFLAD 200 (220) T ss_pred HHHHHH-HHHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 999999-9999999998519999998888999999699 No 132 >PRK13637 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.69 E-value=0.0015 Score=44.85 Aligned_cols=139 Identities=19% Similarity=0.270 Sum_probs=69.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC------CCCHHH-------CCCCCC-----CEE Q ss_conf 44345635877776643999967784407899999999999997198------530353-------206822-----105 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS------YVPASY-------AHIGIV-----DKL 696 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~------fVPA~~-------a~i~~~-----D~I 696 (920) +=+|++|.. ..+.+..|.|||-+||||++|.++=+.- .+-|. -|...+ -++|.+ ..+ T Consensus 22 aL~~isl~i----~~GE~v~iiG~nGsGKSTLl~~l~GLl~-p~~G~V~i~G~~i~~~~~~~~~~r~~iG~VfQ~P~~~l 96 (287) T PRK13637 22 ALDNVNIEI----EDGEFVALIGHTGSGKSTLIQHLNGLLK-PTSGKIIIDGVDITDKKVKLSDIRKKVGLVFQYPEYQL 96 (287) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCC-CCCCEEEECCEECCCCCCCHHHHHHCEEEEEECCCCCC T ss_conf 753207698----7998999999999399999999973998-88726999999987888677888741789961752023 Q ss_pred EEEEECCCCCCCCCCHH---HHHHHH-HHHHH----------H--------------------HCCCCCEEEEECCCCCC Q ss_conf 67652376611385328---999999-99999----------9--------------------58998569993258898 Q gi|254780750|r 697 FSRVGSADNLASGRSTF---MVEMIE-TASIL----------N--------------------QATNQSFVILDEIGRGT 742 (920) Q Consensus 697 ftRiGa~D~l~~g~STF---~vEm~e-~~~IL----------~--------------------~at~~SLVllDElGrGT 742 (920) |.+ -..|++.-|.... -.|+.+ +..+| . -+..-.++|+||--.|= T Consensus 97 ~~~-tV~e~i~fg~~~~g~~~~e~~~rv~~~l~~vgL~~~~~~~~~p~~LSGGqkQRvaiA~aL~~~P~iLllDEPTs~L 175 (287) T PRK13637 97 FEE-TIEKDIAFGPINLGLSEEEIENRVKEAMNIVGLDYEVYKDKSPFELSGGQKRRVAIAGVVAMEPKVLILDEPTAGL 175 (287) T ss_pred CCC-CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 703-0999998689886999999999999999766998488706891129988999999999998399999983886648 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 8056799999999999972698499974875797-6643 Q gi|254780750|r 743 ATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 743 st~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) ++.- ..--+..+..|.+..+..++++||.-+.. .+++ T Consensus 176 Dp~~-~~~i~~~l~~L~~e~g~Tvi~vTHdl~~v~~~aD 213 (287) T PRK13637 176 DPKG-RDDILEKIKALHKEYNMTIILVSHSMEDVAKIAD 213 (287) T ss_pred CHHH-HHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 9999-9999999999998509899999579999999699 No 133 >PRK11288 araG L-arabinose transporter ATP-binding protein; Provisional Probab=97.68 E-value=0.00068 Score=47.53 Aligned_cols=53 Identities=15% Similarity=0.124 Sum_probs=33.9 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 89985699932588988056799999999999972698499974875797-66430 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) +.+-.++|+||--+|=+..-- .--|..+..|.++ +.-+||.||.-+.. ++++. T Consensus 412 ~~~p~vLilDEPT~GLD~~~~-~~i~~ll~~l~~~-G~tvl~ITHDl~~~~~~aDR 465 (501) T PRK11288 412 SEDMKVILLDEPTRGIDVGAK-HEIYNVIYELAAQ-GVAVLVVSSDLPEVLGVADR 465 (501) T ss_pred HHCCCEEEEECCCCCCCHHHH-HHHHHHHHHHHHC-CCEEEEEECCHHHHHHHCCE T ss_conf 709998999798778999999-9999999999968-99999990768999986999 No 134 >PRK13650 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.68 E-value=0.0004 Score=49.23 Aligned_cols=139 Identities=24% Similarity=0.259 Sum_probs=67.1 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCC------HH-----HCCCCCC----CEEE Q ss_conf 0444345635877776643999967784407899999999999997198530------35-----3206822----1056 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVP------AS-----YAHIGIV----DKLF 697 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVP------A~-----~a~i~~~----D~If 697 (920) .++=+|+++.. ..+.+.-|.|||-+||||++|.++=+. -.+-|.-.- .. .-.++.+ |..| T Consensus 17 ~~aL~~vsl~i----~~GE~vaivG~nGsGKSTL~~~l~Gll-~p~~G~I~i~G~~i~~~~~~~lr~~ig~VfQ~p~~~~ 91 (276) T PRK13650 17 KYTLDDVSFHV----KQGEWLSIIGHNGSGKSTTVRLIDGLL-EAESGSIIIDGDLLTEENVWEIRHKIGMVFQNPDNQF 91 (276) T ss_pred CEEEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCC-CCCCEEEEECCEECCCCCHHHHHHCEEEEEECCCHHC T ss_conf 77878758799----899899999999987999999997388-9886089999999986776887641469976720105 Q ss_pred EEEECCCCCCCCCCHH---HHHHHH-HHHHHH----------------------------HCCCCCEEEEECCCCCCCHH Q ss_conf 7652376611385328---999999-999999----------------------------58998569993258898805 Q gi|254780750|r 698 SRVGSADNLASGRSTF---MVEMIE-TASILN----------------------------QATNQSFVILDEIGRGTATL 745 (920) Q Consensus 698 tRiGa~D~l~~g~STF---~vEm~e-~~~IL~----------------------------~at~~SLVllDElGrGTst~ 745 (920) .-.-..|++.-|.... ..|+.+ +..+|. -|..-.++|+||--.|=++. T Consensus 92 ~~~tV~e~i~fgl~~~g~~~~e~~~rv~~~l~~~gl~~~~~r~p~~LSGGQrQRvaIA~aLa~~P~lLilDEPTs~LD~~ 171 (276) T PRK13650 92 VGATVEDDVAFGLENKGIPHEEMKERVNEALELVGMQDFKEREPARLSGGQKQRVAIAGAVAMRPKIIILDEATSMLDPE 171 (276) T ss_pred CCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHH T ss_conf 63639999987998779999999999999998779924553890338999999999999997399999983886658999 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 67999999999999726984999748757976 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) +..--+..+..|.+..+-.++++||.-+... T Consensus 172 -~~~~i~~~l~~l~~~~g~Tvi~iTHdl~~v~ 202 (276) T PRK13650 172 -GRLELIKTIKNIRDDYQLTVISITHDLDEVA 202 (276) T ss_pred -HHHHHHHHHHHHHHHCCCEEEEEEECHHHHH T ss_conf -9999999999999842989999957789996 No 135 >PRK13635 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.67 E-value=0.00015 Score=52.37 Aligned_cols=128 Identities=29% Similarity=0.353 Sum_probs=67.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCC-------------------CCCC- Q ss_conf 44434563587777664399996778440789999999999999719853035320-------------------6822- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAH-------------------IGIV- 693 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~-------------------i~~~- 693 (920) ++=+|++|.. ..+.+.-|-|||-+||||++|.+ .|.+-|- +-+ ++.+ T Consensus 21 ~aL~~isl~i----~~GE~vaivG~nGsGKSTL~k~l--------~Gl~~p~-~G~I~i~G~~i~~~~~~~lr~~ig~Vf 87 (279) T PRK13635 21 YALKDVSFSV----YEGEWVAIVGHNGSGKSTLAKLL--------NGLLLPE-AGTITVGGMVLSEETVWDVRKQIGMVF 87 (279) T ss_pred EEEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHH--------HCCCCCC-CCEEEECCEECCCCCHHHHHHHEEEEE T ss_conf 2576307688----79989999999996599999999--------7288888-964999999998578799974366882 Q ss_pred ---CEEEEEEECCCCCCCCCCHH---HHHHHH-HHHHHH----------------------------HCCCCCEEEEECC Q ss_conf ---10567652376611385328---999999-999999----------------------------5899856999325 Q gi|254780750|r 694 ---DKLFSRVGSADNLASGRSTF---MVEMIE-TASILN----------------------------QATNQSFVILDEI 738 (920) Q Consensus 694 ---D~IftRiGa~D~l~~g~STF---~vEm~e-~~~IL~----------------------------~at~~SLVllDEl 738 (920) |.-|...--.|++.-|.... -.|+.+ +..+|. -+..-.++|+||- T Consensus 88 Q~P~~~l~~~tV~e~iafgl~~~g~~~~e~~~rv~~~l~~~gl~~~~~~~p~~LSGGQrQRvaIAraL~~~P~iLilDEP 167 (279) T PRK13635 88 QNPDNQFVGTTVQDDVAFGLENIGVPREEMVERVDQALRQVGMEDFLNREPHRLSGGQKQRVAIAGVLALQPDILILDEA 167 (279) T ss_pred CCHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 18565257626899998899877999999999999999877997886179343999999999999999709998997387 Q ss_pred CCCCCHHHHHH--HHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 88988056799--9999999999726984999748757976 Q gi|254780750|r 739 GRGTATLDGLS--IAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 739 GrGTst~DG~a--iA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) ||..|-.+ --+..+..|.+..+..++++||.-+... T Consensus 168 ---Ts~LD~~~~~~i~~~l~~L~~~~g~TvI~itHdl~~~~ 205 (279) T PRK13635 168 ---TSMLDPQGRREVLETVRQLKEQKGITVLSITHDLDEAA 205 (279) T ss_pred ---CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECHHHHH T ss_conf ---45489899999999999999837989999976789996 No 136 >TIGR03269 met_CoM_red_A2 methyl coenzyme M reductase system, component A2. The enzyme that catalyzes the final step in methanogenesis, methyl coenzyme M reductase, contains alpha, beta, and gamma chains. In older literature, the complex of alpha, beta, and gamma chains was termed component C, while this single chain protein was termed methyl coenzyme M reductase system component A2. Probab=97.67 E-value=0.00047 Score=48.70 Aligned_cols=32 Identities=22% Similarity=0.433 Sum_probs=25.4 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 434563587777664399996778440789999999 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) =+|++|.. ..+.++=|.|||-+||||++|.++ T Consensus 300 l~~vs~~v----~~GEi~gi~G~nGsGKsTL~k~l~ 331 (520) T TIGR03269 300 VDNVSLEV----KEGEIFGIVGTSGAGKTTLSKIIA 331 (520) T ss_pred EECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 51206897----289689998788887899999994 No 137 >cd03247 ABCC_cytochrome_bd The CYD subfamily implicated in cytochrome bd biogenesis. The CydC and CydD proteins are important for the formation of cytochrome bd terminal oxidase of E. coli and it has been proposed that they were necessary for biosynthesis of the cytochrome bd quinol oxidase and for periplasmic c-type cytochromes. CydCD were proposed to determine a heterooligomeric complex important for heme export into the periplasm or to be involved in the maintenance of the proper redox state of the periplasmic space. In Bacillus subtilius, the absence of CydCD does not affect the presence of halo-cytochrome c in the membrane and this observation suggests that CydCD proteins are not involved in the export of heme in this organism. Probab=97.65 E-value=0.0011 Score=45.95 Aligned_cols=126 Identities=21% Similarity=0.158 Sum_probs=67.1 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC--------------------CCCCHHHCCCCCC Q ss_conf 44434563587777664399996778440789999999999999719--------------------8530353206822 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG--------------------SYVPASYAHIGIV 693 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG--------------------~fVPA~~a~i~~~ 693 (920) .|=++++|.. ..+.+.-|.|||-+||||++|.+.=.. -.+-| +|||-.. T Consensus 16 ~~L~~i~l~i----~~Ge~~aivG~sGsGKSTLl~~l~G~~-~p~~G~i~i~g~~i~~~~~~~~~~i~~v~Q~~------ 84 (178) T cd03247 16 QVLKNLSLEL----KQGEKIALLGRSGSGKSTLLQLLTGDL-KPQQGEITLDGVPVSDLEKALSSLISVLNQRP------ 84 (178) T ss_pred CCEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHHCC-CCCCCEEEECCEEHHHHHHHHHHHEEEECCCC------ T ss_conf 6332558998----699999999999875999999998617-66788699999988997899997208983556------ Q ss_pred CEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 10567652376611385328999999999999589985699932588988056799999999999972698499974875 Q gi|254780750|r 694 DKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 694 D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) .+|.. .-.||+....|-=+.--...|..| +....++|+||--.|=++.-...|-..+. .+. + +..++++||.- T Consensus 85 -~lf~~-ti~~nlg~~LSgGqkqRv~iAral--~~~p~ililDEpts~LD~~t~~~i~~~l~-~~~-~-~~Tvi~itH~~ 157 (178) T cd03247 85 -YLFDT-TLRNNLGRRFSGGERQRLALARIL--LQDAPIVLLDEPTVGLDPITERQLLSLIF-EVL-K-DKTLIWITHHL 157 (178) T ss_pred -CCCCC-HHHHHCCCCCCHHHHHHHHHHHHH--HHCCCEEEECCCCCCCCHHHHHHHHHHHH-HHC-C-CCEEEEEECCH T ss_conf -36454-199862888899999999999999--64979767228655699899999999999-983-9-99999980589 Q ss_pred HHHH Q ss_conf 7976 Q gi|254780750|r 774 ELTD 777 (920) Q Consensus 774 eL~~ 777 (920) +... T Consensus 158 ~~l~ 161 (178) T cd03247 158 TGIE 161 (178) T ss_pred HHHH T ss_conf 8998 No 138 >COG3910 Predicted ATPase [General function prediction only] Probab=97.64 E-value=0.00032 Score=50.02 Aligned_cols=127 Identities=26% Similarity=0.268 Sum_probs=82.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC------CHH------HCCC----CCCCEEEEE--------------- Q ss_conf 399996778440789999999999999719853------035------3206----822105676--------------- Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV------PAS------YAHI----GIVDKLFSR--------------- 699 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fV------PA~------~a~i----~~~D~IftR--------------- 699 (920) ++-+|||-|-+||||+|-.+|...=.--+|--= -++ ++++ -+-++-|-| T Consensus 38 pIT~i~GENGsGKSTLLEaiA~~~~~n~aGg~~n~~~~~~~s~s~l~~~~k~~~~~k~~~g~FlRAEs~yn~as~~De~~ 117 (233) T COG3910 38 PITFITGENGSGKSTLLEAIAAGMGFNAAGGGKNFKGELDASHSALVDYAKLHKRKKPPIGFFLRAESFYNVASYLDEAD 117 (233) T ss_pred CEEEEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHH T ss_conf 64899768986578899999965651455787675751143314687768875067887506874467777998887622 Q ss_pred ----EECC--CCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf ----5237--6611385328999999999999589985699932588988056799999999999972698499974875 Q gi|254780750|r 700 ----VGSA--DNLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 700 ----iGa~--D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) .|.. +..+.|.|-| +-+.+..+..-+-||||--.|-|+.--++++ |++..|.+. |+-.+.|||-+ T Consensus 118 ~e~~~~~~sLh~~SHGEsf~-------~i~~~rf~~~GiYiLDEPEa~LSp~RQlell-a~l~~la~s-GaQ~IiATHSP 188 (233) T COG3910 118 GEANYGGRSLHHMSHGESFL-------AIFHNRFNGQGIYILDEPEAALSPSRQLELL-AILRDLADS-GAQIIIATHSP 188 (233) T ss_pred HHCCCCCCCHHHHCCCHHHH-------HHHHHHHCCCCEEEECCCCCCCCHHHHHHHH-HHHHHHHHC-CCEEEEEECCH T ss_conf 01025774235541431899-------9999983158418856864447888899999-999998736-78399983681 Q ss_pred HHHHHHHHCCCEEEEEE Q ss_conf 79766430688589999 Q gi|254780750|r 774 ELTDLSKSLKRFHNATL 790 (920) Q Consensus 774 eL~~l~~~~~~v~n~~~ 790 (920) -|-. .|+..-+.+ T Consensus 189 iLlA----iP~A~I~~~ 201 (233) T COG3910 189 ILLA----IPGAEIYEI 201 (233) T ss_pred HHEE----CCCCEEEEE T ss_conf 4200----788479998 No 139 >cd03266 ABC_NatA_sodium_exporter NatA is the ATPase component of a bacterial ABC-type Na+ transport system called NatAB, which catalyzes ATP-dependent electrogenic Na+ extrusion without mechanically coupled proton or K+ uptake. NatB possess six putative membrane spanning regions at its C-terminus. In B. subtilus, NatAB is inducible by agents such as ethanol and protonophores, which lower the protonmotive force across the membrane. The closest sequence similarity to NatA is exhibited by DrrA of the two-component daunomycin- and doxorubicin-efflux system. Hence, the functional NatAB is presumably assembled with two copies of a single ATP-binding protein and a single intergral membrane protein. Probab=97.63 E-value=0.0015 Score=45.04 Aligned_cols=131 Identities=24% Similarity=0.313 Sum_probs=69.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCC------------------CCCC--- Q ss_conf 4434563587777664399996778440789999999999999719853035320------------------6822--- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAH------------------IGIV--- 693 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~------------------i~~~--- 693 (920) +=||+++.. ..+.+.-|-|||-+||||++|.++ |..-| ++-+ ++.+ T Consensus 20 al~~vs~~i----~~Gei~gllG~NGaGKSTllk~i~--------Gl~~p-~~G~i~i~G~d~~~~~~~~r~~ig~~~q~ 86 (218) T cd03266 20 AVDGVSFTV----KPGEVTGLLGPNGAGKTTTLRMLA--------GLLEP-DAGFATVDGFDVVKEPAEARRRLGFVSDS 86 (218) T ss_pred EEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCC-CCCEEEECCEECCCCHHHHHCCEEEECCC T ss_conf 872627898----598299999999984999999997--------79778-97489999999886979896287998077 Q ss_pred CEEEEEEECCCCCC-----CCCC--------HHHHHHHHH-------------------HHHHHHCCCCCEEEEECCCCC Q ss_conf 10567652376611-----3853--------289999999-------------------999995899856999325889 Q gi|254780750|r 694 DKLFSRVGSADNLA-----SGRS--------TFMVEMIET-------------------ASILNQATNQSFVILDEIGRG 741 (920) Q Consensus 694 D~IftRiGa~D~l~-----~g~S--------TF~vEm~e~-------------------~~IL~~at~~SLVllDElGrG 741 (920) +.+|..+-..|++. .|.+ -.+.|+.+. +-..--+..-.++|+||--.| T Consensus 87 ~~l~~~ltv~e~l~~~~~~~g~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LS~G~kqrv~la~al~~~P~lliLDEPt~g 166 (218) T cd03266 87 TGLYDRLTARENLEYFAGLYGLKGDELTARLEELADRLGMEELLDRRVGGFSTGMRQKVAIARALVHDPPVLLLDEPTTG 166 (218) T ss_pred CCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCCHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 66799998999999999984999899999999999974995575144322782688999999998669989999798767 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 88056799999999999972698499974875797-6643 Q gi|254780750|r 742 TATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 742 Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) =++. +...-|.++..+.+. +.-.+|+||.-+-. .+.+ T Consensus 167 LD~~-~~~~i~~~l~~l~~~-g~til~~sH~l~e~~~l~d 204 (218) T cd03266 167 LDVM-ATRALREFIRQLRAL-GKCILFSTHIMQEVERLCD 204 (218) T ss_pred CCHH-HHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 6999-999999999999857-9999998987899999699 No 140 >PRK09536 btuD corrinoid ABC transporter ATPase; Reviewed Probab=97.63 E-value=0.00039 Score=49.37 Aligned_cols=125 Identities=24% Similarity=0.301 Sum_probs=66.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHHHC---CCC Q ss_conf 4434563587777664399996778440789999999999--------------------99971985303532---068 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPASYA---HIG 691 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~~a---~i~ 691 (920) |=+|++|.. ..+.++-|-|||-+||||+||.+.=.. -+|+.-.|||=+.. .++ T Consensus 17 ~L~~Vsl~I----~~GEi~gLIGPNGAGKSTLLk~I~Gll~P~~G~V~l~G~~i~~~~~~elar~ia~vpQ~~~l~~~~t 92 (409) T PRK09536 17 ILDGVDLSV----REGHLVGVVGPNGAGKTTLLRAMNGLITPTAGTVLVAGDDVHALSAAAASRQVATVPQDTSLSFEFD 92 (409) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHHHEEECCCCCCCCCCC T ss_conf 892508898----8998999999987279999999966888896399999999887998999623348433346677877 Q ss_pred CCCEEEEEEECCCCCCCCCCH----H-------------HHHH---------------------HHHHHHHHHCCCCCEE Q ss_conf 221056765237661138532----8-------------9999---------------------9999999958998569 Q gi|254780750|r 692 IVDKLFSRVGSADNLASGRST----F-------------MVEM---------------------IETASILNQATNQSFV 733 (920) Q Consensus 692 ~~D~IftRiGa~D~l~~g~ST----F-------------~vEm---------------------~e~~~IL~~at~~SLV 733 (920) +. |.+..|..- | ..|+ ..+|..| |..-.++ T Consensus 93 v~----------e~V~~Gr~p~~~~~~~~~~~d~~~v~~aLe~~~l~~l~dr~~~~LSGGqrQRV~IARAL--aq~P~IL 160 (409) T PRK09536 93 VR----------QVVEMGRTPHRSRFGTWTETDRAAVERAMERTGVAQFADRPVTSLSGGERQRVLLARAL--AQATPVL 160 (409) T ss_pred HH----------HHHHHCCHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHH--HCCCCEE T ss_conf 99----------99982502333203675789999999999874997685588002899999999999999--6799989 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 99325889880567999999999999726984999748757976 Q gi|254780750|r 734 ILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 734 llDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) ||||--.|=+..--.-+ +.++..|.+. +.-++++||.-++.. T Consensus 161 LLDEPTs~LDi~~q~el-l~lLr~L~~~-G~TVI~vtHDL~lA~ 202 (409) T PRK09536 161 LLDEPTASLDINHQIRT-LELVRDLADD-GKTVVAAIHDLNLAA 202 (409) T ss_pred EEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHH T ss_conf 99587667999999999-9999999858-999999956899999 No 141 >PRK13536 nodulation factor exporter subunit NodI; Provisional Probab=97.63 E-value=0.00099 Score=46.30 Aligned_cols=47 Identities=28% Similarity=0.383 Sum_probs=34.1 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 9985699932588988056799999999999972698499974875797 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) ..-.++++||--.|=++.--. --|.++..|.+. +..++++|||-+-. T Consensus 155 ~~P~lliLDEPT~GLDp~~r~-~i~~~i~~l~~~-G~TillttH~l~E~ 201 (306) T PRK13536 155 NDPQLLILDEPTTGLDPHARH-LIWERLRSLLAR-GKTILLTTHFMEEA 201 (306) T ss_pred CCCCEEEECCCCCCCCHHHHH-HHHHHHHHHHHC-CCEEEEECCCHHHH T ss_conf 599899975875678999999-999999999968-98999988838999 No 142 >cd03219 ABC_Mj1267_LivG_branched The Mj1267/LivG ABC transporter subfamily is involved in the transport of the hydrophobic amino acids leucine, isoleucine and valine. MJ1267 is a branched-chain amino acid transporter with 29% similarity to both the LivF and LivG components of the E. coli branched-chain amino acid transporter. MJ1267 contains an insertion from residues 114 to 123 characteristic of LivG (Leucine-Isoleucine-Valine) homologs. The branched-chain amino acid transporter from E. coli comprises a heterodimer of ABCs (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). Probab=97.63 E-value=0.0013 Score=45.43 Aligned_cols=137 Identities=27% Similarity=0.368 Sum_probs=72.7 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC------C----CHHHCCCCCC-----CEEEEE Q ss_conf 443456358777766439999677844078999999999999971985------3----0353206822-----105676 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY------V----PASYAHIGIV-----DKLFSR 699 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f------V----PA~~a~i~~~-----D~IftR 699 (920) +=+|++|.. ..+.+.-|.|||-+||||++|.++=+. -..-|.- + |.+.++.++. ..+|.. T Consensus 15 aL~~vsl~i----~~Gei~gliG~nGaGKSTL~~~i~Gl~-~p~~G~I~~~G~~i~~~~~~~~~~~gi~~v~Q~~~l~~~ 89 (236) T cd03219 15 ALDDVSFSV----RPGEIHGLIGPNGAGKTTLFNLISGFL-RPTSGSVLFDGEDITGLPPHEIARLGIGRTFQIPRLFPE 89 (236) T ss_pred EECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCC-CCCCEEEEECCEECCCCCHHHHHHCCCEEEECCCCCCCC T ss_conf 872338998----899899999899973999999996798-788318999999668899999997597676014102655 Q ss_pred EECCCCCCCC-----CCHHH--------HHHHH-------------------------------HHHHHHHCCCCCEEEE Q ss_conf 5237661138-----53289--------99999-------------------------------9999995899856999 Q gi|254780750|r 700 VGSADNLASG-----RSTFM--------VEMIE-------------------------------TASILNQATNQSFVIL 735 (920) Q Consensus 700 iGa~D~l~~g-----~STF~--------vEm~e-------------------------------~~~IL~~at~~SLVll 735 (920) +...||+.-| ...|. .++.+ +|..| +..-.++|+ T Consensus 90 ltv~enl~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~LSGG~~Qrv~iAral--~~~P~lliL 167 (236) T cd03219 90 LTVLENVMVAAQARTGSGLLLARARREEREARERAEELLERVGLADLADRPAGELSYGQQRRLEIARAL--ATDPKLLLL 167 (236) T ss_pred CCHHHHHHHHHHHHCCCCCHHHCCCCCHHHHHHHHHHHHHHCCCCCHHCCCHHHCCHHHHHHHHHHHHH--HCCCCEEEE T ss_conf 438998988887604543001102358999999999999974998043886266999999999999999--659999999 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH-HHHHH Q ss_conf 3258898805679999999999997269849997487579-76643 Q gi|254780750|r 736 DEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHEL-TDLSK 780 (920) Q Consensus 736 DElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL-~~l~~ 780 (920) ||--+|=++..-.-| |..+..+.+. +.-.+++||.-+. .++.+ T Consensus 168 DEPT~gLD~~~~~~i-~~~l~~l~~~-G~tii~vsHdl~~~~~~~D 211 (236) T cd03219 168 DEPAAGLNPEETEEL-AELIRELRER-GITVLLVEHDMDVVMSLAD 211 (236) T ss_pred ECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 487658999999999-9999999965-9999999174899999699 No 143 >PRK13641 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.63 E-value=0.0015 Score=45.02 Aligned_cols=52 Identities=15% Similarity=0.153 Sum_probs=36.8 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 89985699932588988056799999999999972698499974875797-6643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |..-.++|+||--.|=++.-...| +.+++.|.+. +..++++||.-+.. .+++ T Consensus 161 a~~P~iLlLDEPTsgLDp~~~~~i-~~ll~~l~~~-G~Tii~vtHd~~~v~~~ad 213 (286) T PRK13641 161 AYEPEILCLDEPAAGLDPEGRKEM-MQIFKDYQKA-GHTVILVTHNMDDVAEYAD 213 (286) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 749999997397343899999999-9999999963-9999999159999999799 No 144 >PRK11607 potG putrescine transporter ATP-binding subunit; Provisional Probab=97.62 E-value=0.0031 Score=42.57 Aligned_cols=141 Identities=23% Similarity=0.241 Sum_probs=71.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC---------CCCCHHHCCCCCC---CEEEEEEEC Q ss_conf 4434563587777664399996778440789999999999999719---------8530353206822---105676523 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG---------SYVPASYAHIGIV---DKLFSRVGS 702 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG---------~fVPA~~a~i~~~---D~IftRiGa 702 (920) +=+|++|+. ..+.+.-|-|||-+||||+||.++=+.- ..-| ..+|...-.++.+ ..+|-.+-. T Consensus 34 al~~vsl~I----~~GE~~~llGpsGsGKSTllr~i~Gl~~-p~~G~I~i~G~di~~~~~~~r~ig~vfQ~~~L~p~ltV 108 (377) T PRK11607 34 AVDDVSLTI----YKGEIFALLGASGCGKSTLLRMLAGFEQ-PSAGQIMLDGVDLSHVPPYQRPINMMFQSYALFPHMTV 108 (377) T ss_pred EECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCC-CCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCH T ss_conf 990518799----9998999999998489999999976999-98659999999988798666650467012655877575 Q ss_pred CCCCCCCC-----CH-----HHHHHHH---HHHHHH-------------------HCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 76611385-----32-----8999999---999999-------------------5899856999325889880567999 Q gi|254780750|r 703 ADNLASGR-----ST-----FMVEMIE---TASILN-------------------QATNQSFVILDEIGRGTATLDGLSI 750 (920) Q Consensus 703 ~D~l~~g~-----ST-----F~vEm~e---~~~IL~-------------------~at~~SLVllDElGrGTst~DG~ai 750 (920) .||+.-|. +. -..|+.+ ....++ -+.+-.++|+||--.|=++.--..+ T Consensus 109 ~eNi~~~l~~~~~~~~e~~~rv~e~l~~v~L~~~~~~~p~~LSGGqrQRVaiArAL~~~P~lLllDEPts~LD~~~r~~l 188 (377) T PRK11607 109 EQNIAFGLKQDKLPKAEIASRVNEMLGLVHMQEFAKRKPHQLSGGQRQRVALARSLAKRPKLLLLDEPMGALDKKLRDRM 188 (377) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHH T ss_conf 45245478665999899999999998544627666589657898687899999987449978996487544799999999 Q ss_pred HHHHHHHHHHHCCCEEEEECCCHHHHH-HHHH Q ss_conf 999999999726984999748757976-6430 Q gi|254780750|r 751 AWATIEYLHETNRCRGLLATHFHELTD-LSKS 781 (920) Q Consensus 751 A~aile~l~~~~~~~~lfaTHy~eL~~-l~~~ 781 (920) - .-+..|.+..+..++|+||+.+-+. +++. T Consensus 189 ~-~~l~~l~~~~g~Tii~VTHD~~eA~~laDr 219 (377) T PRK11607 189 Q-LEVVDILERVGVTCVMVTHDQEEAMTMAGR 219 (377) T ss_pred H-HHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 9-999999997399999999899999986999 No 145 >PRK13631 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.61 E-value=0.00076 Score=47.15 Aligned_cols=52 Identities=17% Similarity=0.156 Sum_probs=34.5 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 89985699932588988056799999999999972698499974875797-6643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |..-.++|+||--.|=++.--..| +..+..+.+. +..++++||.-+.. .+++ T Consensus 192 a~~P~iLilDEPTagLDp~~~~~i-~~li~~l~~~-g~TiilvTHdm~~v~~~aD 244 (320) T PRK13631 192 AIQPEILIFDEPTAGLDPKGEHEM-MQLILDAKAN-NKTVFVITHTMEHVLEVAD 244 (320) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 239999997587555998999999-9999999962-9999999478999999799 No 146 >cd03278 ABC_SMC_barmotin Barmotin is a tight junction-associated protein expressed in rat epithelial cells which is thought to have an important regulatory role in tight junction barrier function. Barmotin belongs to the SMC protein family. SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, w Probab=97.61 E-value=0.00083 Score=46.88 Aligned_cols=55 Identities=20% Similarity=0.275 Sum_probs=33.3 Q ss_pred CCCEEEEECCCCCCCHHHHHH--HHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEE Q ss_conf 985699932588988056799--999999999972698499974875797664306885899999 Q gi|254780750|r 729 NQSFVILDEIGRGTATLDGLS--IAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQ 791 (920) Q Consensus 729 ~~SLVllDElGrGTst~DG~a--iA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~ 791 (920) +..++|+||- ||..|-.. .-+..++.|.+ +..+++.||.....+.++. +.-.+|. T Consensus 135 p~~iliLDEP---Ts~LD~~~~~~i~~~l~~l~~--~~t~IiITH~~~~i~~AD~---Ii~v~m~ 191 (197) T cd03278 135 PSPFCVLDEV---DAALDDANVERFARLLKEFSK--ETQFIVITHRKGTMEAADR---LYGVTMQ 191 (197) T ss_pred CCCEEEECCC---CCCCCHHHHHHHHHHHHHHHC--CCEEEEEECCHHHHHHCCE---EEEEEEC T ss_conf 9978997178---553898999999999999856--9989999849999985899---9999837 No 147 >cd03259 ABC_Carb_Solutes_like ABC Carbohydrate and Solute Transporters-like subgroup. This family is comprised of proteins involved in the transport of apparently unrelated solutes and proteins specific for di- and oligosaccharides and polyols. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.61 E-value=0.00047 Score=48.75 Aligned_cols=139 Identities=26% Similarity=0.311 Sum_probs=69.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH-----CCCC---CCHHHCCCCCC---CEEEEEEECC Q ss_conf 44345635877776643999967784407899999999999997-----1985---30353206822---1056765237 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-----MGSY---VPASYAHIGIV---DKLFSRVGSA 703 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQ-----iG~f---VPA~~a~i~~~---D~IftRiGa~ 703 (920) |=+|+++.. ..+.+.-|-|||-+||||++|.++=..-... -|.- .|...-.++.+ ..+|-.+-.. T Consensus 15 al~~vs~~i----~~Gei~~iiGpnGaGKSTl~~~i~Gl~~p~~G~I~~~g~~i~~~~~~~r~ig~v~Q~~~l~~~ltv~ 90 (213) T cd03259 15 ALDDLSLTV----EPGEFLALLGPSGCGKTTLLRLIAGLERPDSGEILIDGRDVTGVPPERRNIGMVFQDYALFPHLTVA 90 (213) T ss_pred EECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHH T ss_conf 984617798----8998999999999739999999975999897089999999888997787869990698658898199 Q ss_pred CCCCCC-----CCHHHHHHHH-HHHHH---------------------------H-HCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 661138-----5328999999-99999---------------------------9-589985699932588988056799 Q gi|254780750|r 704 DNLASG-----RSTFMVEMIE-TASIL---------------------------N-QATNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 704 D~l~~g-----~STF~vEm~e-~~~IL---------------------------~-~at~~SLVllDElGrGTst~DG~a 749 (920) ||+.-+ .+. .|..+ +..+| + -+.+--++|+||--.|=++.-... T Consensus 91 enl~~~~~~~~~~~--~~~~~~~~~~l~~~~l~~~~~~~~~~LSGG~kQrv~iAraL~~~P~illlDEPt~gLD~~~~~~ 168 (213) T cd03259 91 ENIAFGLKLRGVPK--AEIRARVRELLELVGLEGLLNRYPHELSGGQQQRVALARALAREPSLLLLDEPLSALDAKLREE 168 (213) T ss_pred HHHHHHHHHCCCCH--HHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHH T ss_conf 99988998759998--9999999999998699647637703389899999999876227999999839864379999999 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 999999999972698499974875797-6643 Q gi|254780750|r 750 IAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 750 iA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) | |..+..+.+..+.-.+++||..+.. .+++ T Consensus 169 i-~~li~~l~~~~g~tii~vtHdl~~~~~~~d 199 (213) T cd03259 169 L-REELKELQRELGITTIYVTHDQEEALALAD 199 (213) T ss_pred H-HHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 9-999999999629999999689999999699 No 148 >PRK11147 ABC transporter ATPase component; Reviewed Probab=97.60 E-value=0.00048 Score=48.68 Aligned_cols=134 Identities=25% Similarity=0.254 Sum_probs=66.1 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEE----CCCCCCCC Q ss_conf 44434563587777664399996778440789999999999999719853035320682210567652----37661138 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVG----SADNLASG 709 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiG----a~D~l~~g 709 (920) .|-.|+++.. ..+..+-|.|||-+||||+||.++=. +=...|..--....+++-||+-...+- .-|++..| T Consensus 333 ~~l~~vsl~i----~~Ge~ialvG~NGsGKSTLlk~l~G~-l~p~~G~i~~g~~~~i~y~~Q~~~~l~~~~tv~~~l~~~ 407 (632) T PRK11147 333 QLVKDFSAQV----QRGDKIALIGPNGCGKTTLLKLMLGQ-LQADSGRIHCGTKLEVAYFDQYRAELDPEKTVMDNLAEG 407 (632) T ss_pred EEEEEECCCC----CCCCEEEEECCCCCCHHHHHHHHHCC-CCCCCCEEEECCCCEEEEEHHHHHHCCCCCCHHHHHHHC T ss_conf 6776533335----78877999889884277999986066-689987799899870775515476459768699999732 Q ss_pred CCHHHH-------------------------------HHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 532899-------------------------------9999999999589985699932588988056799999999999 Q gi|254780750|r 710 RSTFMV-------------------------------EMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYL 758 (920) Q Consensus 710 ~STF~v-------------------------------Em~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l 758 (920) ....+. |-..++-..--+++-.|.||||- |+-.|=.++ .+.-+.| T Consensus 408 ~~~~~~~~~~r~~~~~L~~f~f~~~~~~~~v~~LSGGEk~Rl~LA~~l~~~pnlLiLDEP---TNhLDi~s~-e~Le~aL 483 (632) T PRK11147 408 KQEVMVNGKPRHVLGYLQDFLFHPKRAMTPVKALSGGERNRLLLARLFLKPSNLLILDEP---TNDLDVETL-ELLEELL 483 (632) T ss_pred CHHCCCCHHHHHHHHHHHHHCCCHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECC---CCCCCHHHH-HHHHHHH T ss_conf 321011558999999999857798896391553999999999999985779978999898---765799999-9999999 Q ss_pred HHHCCCEEEEECCCHHHHH Q ss_conf 9726984999748757976 Q gi|254780750|r 759 HETNRCRGLLATHFHELTD 777 (920) Q Consensus 759 ~~~~~~~~lfaTHy~eL~~ 777 (920) .+-.| -.||+||.....+ T Consensus 484 ~~y~G-tvl~VSHDr~fl~ 501 (632) T PRK11147 484 DSYQG-TLLLVSHDRQFVD 501 (632) T ss_pred HHCCC-EEEEEECCHHHHH T ss_conf 85898-3999979899998 No 149 >COG2274 SunT ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms] Probab=97.59 E-value=0.0054 Score=40.80 Aligned_cols=128 Identities=28% Similarity=0.369 Sum_probs=74.5 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH-CCC-----------------CCC- Q ss_conf 0444345635877776643999967784407899999999999997198530353-206-----------------822- Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY-AHI-----------------GIV- 693 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~-a~i-----------------~~~- 693 (920) +.|=+|++|..+ .+...-|.|+|.+||||++|-+ .|.|.|-+- -.+ +.| T Consensus 486 ~~vL~~isL~I~----~Ge~vaIvG~SGsGKSTL~KLL--------~gly~p~~G~I~~dg~dl~~i~~~~lR~~ig~V~ 553 (709) T COG2274 486 PPVLEDLSLEIP----PGEKVAIVGRSGSGKSTLLKLL--------LGLYKPQQGRILLDGVDLNDIDLASLRRQVGYVL 553 (709) T ss_pred CCHHHCEEEEEC----CCCEEEEECCCCCCHHHHHHHH--------HCCCCCCCCEEEECCEEHHHCCHHHHHHHEEEEC T ss_conf 441215027767----9988999879999889999998--------3678888855999987278669999986546874 Q ss_pred --CEEEEEEECCCCCCCCCCHH-HHHHHHHHHH----------------------------------HHH--CCCCCEEE Q ss_conf --10567652376611385328-9999999999----------------------------------995--89985699 Q gi|254780750|r 694 --DKLFSRVGSADNLASGRSTF-MVEMIETASI----------------------------------LNQ--ATNQSFVI 734 (920) Q Consensus 694 --D~IftRiGa~D~l~~g~STF-~vEm~e~~~I----------------------------------L~~--at~~SLVl 734 (920) +.+|++- -.|||..|.-.- +-|..+.+.+ |-. ..+...+| T Consensus 554 Q~~~Lf~gS-I~eNi~l~~p~~~~e~i~~A~~~ag~~~fI~~lP~gy~t~v~E~G~~LSGGQrQrlalARaLl~~P~ILl 632 (709) T COG2274 554 QDPFLFSGS-IRENIALGNPEATDEEIIEAAQLAGAHEFIENLPMGYDTPVGEGGANLSGGQRQRLALARALLSKPKILL 632 (709) T ss_pred CCCHHHCCC-HHHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEE T ss_conf 665320473-9879746899999799999999837689998360545623204898888889999999998546999899 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHH Q ss_conf 9325889880567999999999999726-984999748757976 Q gi|254780750|r 735 LDEIGRGTATLDGLSIAWATIEYLHETN-RCRGLLATHFHELTD 777 (920) Q Consensus 735 lDElGrGTst~DG~aiA~aile~l~~~~-~~~~lfaTHy~eL~~ 777 (920) +||= ||..|=.+-+ .|.+.|.+.. ++.+++.||-+.... T Consensus 633 LDEa---TSaLD~~sE~-~I~~~L~~~~~~~T~I~IaHRl~ti~ 672 (709) T COG2274 633 LDEA---TSALDPETEA-IILQNLLQILQGRTVIIIAHRLSTIR 672 (709) T ss_pred EECC---CCCCCHHHHH-HHHHHHHHHHCCCEEEEEECCCHHHH T ss_conf 7074---2236986799-99999999845886999976616864 No 150 >PRK13651 cobalt transporter ATP-binding subunit; Provisional Probab=97.59 E-value=0.0013 Score=45.49 Aligned_cols=48 Identities=21% Similarity=0.214 Sum_probs=33.8 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 89985699932588988056799999999999972698499974875797 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) |.+-.++|+||--.|-++..-.-| +.++..|.+. +-.++++||.-+.. T Consensus 177 a~~P~iLlLDEPTagLDp~~~~~i-~~~l~~L~~~-G~TVI~vTHdm~~v 224 (304) T PRK13651 177 AMEPDFLVFDEPTAGLDPQGVKEI-LEIFDTLNKK-GKTIILVTHDLDNV 224 (304) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHH T ss_conf 459999997298665898999999-9999999977-99999986789999 No 151 >cd03298 ABC_ThiQ_thiamine_transporter ABC-type thiamine tranport system; part of the binding-protein-dependent transport system tbpA-thiPQ for thiamine and TPP. Probably responsible for the translocation of thiamine across the membrane. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.58 E-value=0.0034 Score=42.30 Aligned_cols=129 Identities=23% Similarity=0.324 Sum_probs=67.0 Q ss_pred EEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCC-----------------CCC---CEEE Q ss_conf 45635877776643999967784407899999999999997198530353206-----------------822---1056 Q gi|254780750|r 638 DCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHI-----------------GIV---DKLF 697 (920) Q Consensus 638 di~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i-----------------~~~---D~If 697 (920) +++|.. ..+.+..|-|||-+||||+||.++ |...|- +-++ +.+ ..+| T Consensus 16 ~i~l~i----~~Ge~~~ilGpSGsGKSTLl~li~--------Gl~~p~-sG~I~i~G~di~~~~~~~r~ig~vfQ~~~L~ 82 (211) T cd03298 16 HFDLTF----AQGEITAIVGPSGSGKSTLLNLIA--------GFETPQ-SGRVLINGVDVTAAPPADRPVSMLFQENNLF 82 (211) T ss_pred EEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCC-CEEEEECCEECCCCCHHHCCEEEEECCCCCC T ss_conf 278898----899899999999955999999997--------699988-5299999999999998898679995388668 Q ss_pred EEEECCCCCCCCCCH---HHHH-------HHH---HHHHH------------------H-HCCCCCEEEEECCCCCCCHH Q ss_conf 765237661138532---8999-------999---99999------------------9-58998569993258898805 Q gi|254780750|r 698 SRVGSADNLASGRST---FMVE-------MIE---TASIL------------------N-QATNQSFVILDEIGRGTATL 745 (920) Q Consensus 698 tRiGa~D~l~~g~ST---F~vE-------m~e---~~~IL------------------~-~at~~SLVllDElGrGTst~ 745 (920) ..+-..||+.-|.+- +-.| +.+ ....+ | -+++-.++|+||--.+=++. T Consensus 83 p~ltV~eNi~~~l~~~~~~~~~~~~~v~~~l~~~gl~~~~~~~p~~LSGGqkQRvaiARAL~~~P~ilLlDEPts~LD~~ 162 (211) T cd03298 83 AHLTVEQNVGLGLSPGLKLTAEDRQAIEVALARVGLAGLEKRLPGELSGGERQRVALARVLVRDKPVLLLDEPFAALDPA 162 (211) T ss_pred CCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHH T ss_conf 99949999875886468882999999999998769987872894558989999999999986599999971887655989 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 67999999999999726984999748757976-643 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) --.. -+..+..+.+..+..++++||..+.+. +++ T Consensus 163 ~~~~-l~~~l~~l~~~~~~Tvi~vTHd~~ea~~~ad 197 (211) T cd03298 163 LRAE-MLDLVLDLHAETKMTVLMVTHQPEDAKRLAQ 197 (211) T ss_pred HHHH-HHHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 9999-9999999999749989999889999999699 No 152 >PRK10982 galactose/methyl galaxtoside transporter ATP-binding protein; Provisional Probab=97.58 E-value=0.00075 Score=47.22 Aligned_cols=52 Identities=17% Similarity=0.186 Sum_probs=32.7 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 9985699932588988056799999999999972698499974875797-66430 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) +.-.++|+||--+|=+..--..| +..+..|.+. +.-++|.||..+.. .+.+. T Consensus 408 ~~p~iLilDEPTsGLD~~~~~~i-~~ll~~l~~~-G~~il~iSHDl~~~~~~~DR 460 (491) T PRK10982 408 TQPEILMLDEPTRGIDVGAKFEI-YQLIAELAKK-DKGIIIISSEMPELLGITDR 460 (491) T ss_pred HCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCCE T ss_conf 49988999787557999999999-9999999968-99999995858999986999 No 153 >PRK10762 D-ribose transporter ATP binding protein; Provisional Probab=97.58 E-value=0.0003 Score=50.17 Aligned_cols=53 Identities=21% Similarity=0.191 Sum_probs=36.1 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 89985699932588988056799999999999972698499974875797-66430 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) ++.-.++|+||--+|=+..--..| |..+..+.+. |.-++|.||.-+.. ++++. T Consensus 411 ~~~p~lLilDEPT~GLD~~~~~~i-~~ll~~l~~~-G~til~isHDl~~v~~~aDR 464 (501) T PRK10762 411 MTRPKVLILDEPTRGVDVGAKKEI-YQLINQFKAD-GLSIILVSSEMPEVLGMSDR 464 (501) T ss_pred HCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCCE T ss_conf 729988999798668999999999-9999999967-99999991868999986999 No 154 >PRK13636 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.58 E-value=0.0025 Score=43.27 Aligned_cols=50 Identities=16% Similarity=0.147 Sum_probs=34.2 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 899856999325889880567999999999999726984999748757976 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) |..-.++|+||--.|=++..- .--+..+..|.+..+...+++||.-+... T Consensus 157 a~~P~iLlLDEPTagLDp~~~-~~i~~ll~~l~~e~g~TiilvtHd~~~v~ 206 (285) T PRK13636 157 VMEPKVLVLDEPTAGLDPMGV-SEIMKLLVEMQKELGLTIIIATHDIDIVP 206 (285) T ss_pred HCCCCEEEEECCCCCCCHHHH-HHHHHHHHHHHHHCCCEEEEECCCHHHHH T ss_conf 749989999787555999999-99999999999844989999948899999 No 155 >PRK11147 ABC transporter ATPase component; Reviewed Probab=97.58 E-value=0.0041 Score=41.66 Aligned_cols=13 Identities=23% Similarity=0.470 Sum_probs=5.5 Q ss_pred CCCCCHHHHHHCC Q ss_conf 7764127876301 Q gi|254780750|r 308 GSREQSLLKTIDY 320 (920) Q Consensus 308 g~~~gSL~~~Ln~ 320 (920) |+.|.||+.+|+. T Consensus 39 GsGKSTLl~iL~G 51 (632) T PRK11147 39 GAGKSTLMKILSG 51 (632) T ss_pred CCHHHHHHHHHHC T ss_conf 9879999999838 No 156 >cd03257 ABC_NikE_OppD_transporters The ABC transporter subfamily specific for the transport of dipeptides, oligopeptides (OppD), and nickel (NikDE). The NikABCDE system of E. coli belongs to this family and is composed of the periplasmic binding protein NikA, two integral membrane components (NikB and NikC), and two ATPase (NikD and NikE). The NikABCDE transporter is synthesized under anaerobic conditions to meet the increased demand for nickel resulting from hydrogenase synthesis. The molecular mechanism of nickel uptake in many bacteria and most archaea is not known. Many other members of this ABC family are also involved in the uptake of dipeptides and oligopeptides. The oligopeptide transport system (Opp) is a five-component ABC transport composed of a membrane-anchored substrate binding proteins (SRP), OppA, two transmembrane proteins, OppB and OppC, and two ATP-binding domains, OppD and OppF. Probab=97.57 E-value=0.0019 Score=44.24 Aligned_cols=53 Identities=21% Similarity=0.142 Sum_probs=36.6 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 899856999325889880567999999999999726984999748757976-643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) +..-.++|+||--.|-++.-..-| +.++..|.+..+.-++++||..+... +++ T Consensus 161 ~~~P~iLllDEPTs~LD~~~~~~i-~~~l~~l~~~~~~tii~vtHd~~~~~~~aD 214 (228) T cd03257 161 ALNPKLLIADEPTSALDVSVQAQI-LDLLKKLQEELGLTLLFITHDLGVVAKIAD 214 (228) T ss_pred CCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 479999999488764799999999-999999998509899998689999999699 No 157 >PRK11819 putative ABC transporter ATP-binding protein; Reviewed Probab=97.57 E-value=0.00076 Score=47.15 Aligned_cols=16 Identities=25% Similarity=0.530 Sum_probs=9.4 Q ss_pred EEEEECCCCCCCCCCHHHH Q ss_conf 6765237661138532899 Q gi|254780750|r 697 FSRVGSADNLASGRSTFMV 715 (920) Q Consensus 697 ftRiGa~D~l~~g~STF~v 715 (920) ..=+|.+ -.|+|||+- T Consensus 353 iaivG~N---GsGKSTLlk 368 (556) T PRK11819 353 VGIIGPN---GAGKSTLFK 368 (556) T ss_pred EEEECCC---CCCHHHHHH T ss_conf 7898898---775889999 No 158 >PRK13634 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.57 E-value=0.0022 Score=43.74 Aligned_cols=53 Identities=21% Similarity=0.188 Sum_probs=33.5 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 89985699932588988056799999999999972698499974875797-6643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |..-.++|+||--.|=++.-... -+.++..|.+..+..++++||.-+.. .+++ T Consensus 148 a~~P~iLllDEPTs~LD~~~~~~-i~~ll~~L~~e~g~Tii~vTHdl~~~~~~aD 201 (276) T PRK13634 148 AMEPEVLVLDEPTAGLDPKGRKE-IMEMFYKLHKEKGLTTVLVTHSMEDAARYAD 201 (276) T ss_pred HCCCCEEEECCCCCCCCHHHHHH-HHHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 72999899769854279999999-9999999999619999998679999999799 No 159 >COG4586 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=97.56 E-value=0.00068 Score=47.51 Aligned_cols=147 Identities=24% Similarity=0.334 Sum_probs=77.0 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH-HHHHHH---HCCCCCCHH-----HCCCCCC----C--------- Q ss_conf 34563587777664399996778440789999999-999999---719853035-----3206822----1--------- Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA-LIVIMA---QMGSYVPAS-----YAHIGIV----D--------- 694 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva-l~vilA---QiG~fVPA~-----~a~i~~~----D--------- 694 (920) -|+++..+ .+.+...-|||.+||||.||..- ++.-.+ -+|.++|=. -..++.| - T Consensus 41 qdisf~IP----~G~ivgflGaNGAGKSTtLKmLTGll~p~~G~v~V~G~~Pf~~~~~~~~~~~~v~gqk~ql~Wdlp~~ 116 (325) T COG4586 41 QDISFEIP----KGEIVGFLGANGAGKSTTLKMLTGLLLPTSGKVRVNGKDPFRRREEYLRSIGLVMGQKLQLWWDLPAL 116 (325) T ss_pred HEEEEECC----CCCEEEEECCCCCCCHHHHHHHHCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHEEEEECHHH T ss_conf 51145348----98689887588886033398973860368875874586852337999988788763222025623025 Q ss_pred ------EEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC-------------------CCCCEEEEECCCCCCCHHHHHH Q ss_conf ------056765237661138532899999999999958-------------------9985699932588988056799 Q gi|254780750|r 695 ------KLFSRVGSADNLASGRSTFMVEMIETASILNQA-------------------TNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 695 ------~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a-------------------t~~SLVllDElGrGTst~DG~a 749 (920) ++.-+|| |+-++..=-|++||.+....|+.- -+--.+.|||.-=|-+-.--.+ T Consensus 117 ds~~v~~~Iy~Ip--d~~F~~r~~~l~eiLdl~~~lk~~vr~LSlGqRmraeLaaaLLh~p~VLfLDEpTvgLDV~aq~~ 194 (325) T COG4586 117 DSLEVLKLIYEIP--DDEFAERLDFLTEILDLEGFLKWPVRKLSLGQRMRAELAAALLHPPKVLFLDEPTVGLDVNAQAN 194 (325) T ss_pred HHHHHHHHHHHCC--HHHHHHHHHHHHHHHCCHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHH T ss_conf 4699999997198--89999999889998632456401356536057889999998568983897448745751438999 Q ss_pred HHHHHHHHHHHHCCCEEEEECCC-HHHHHHHHHCCCEEEEEE Q ss_conf 99999999997269849997487-579766430688589999 Q gi|254780750|r 750 IAWATIEYLHETNRCRGLLATHF-HELTDLSKSLKRFHNATL 790 (920) Q Consensus 750 iA~aile~l~~~~~~~~lfaTHy-~eL~~l~~~~~~v~n~~~ 790 (920) |-..+-||=.+ -+|.+|.+||| ..++.+.+..-.+.+..+ T Consensus 195 ir~Flke~n~~-~~aTVllTTH~~~di~~lc~rv~~I~~Gql 235 (325) T COG4586 195 IREFLKEYNEE-RQATVLLTTHIFDDIATLCDRVLLIDQGQL 235 (325) T ss_pred HHHHHHHHHHH-HCCEEEEEECCHHHHHHHHHHEEEEECCCE T ss_conf 99999998775-373699984112138886343699607828 No 160 >COG1120 FepC ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism] Probab=97.56 E-value=0.00086 Score=46.75 Aligned_cols=129 Identities=26% Similarity=0.290 Sum_probs=73.7 Q ss_pred CCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------------HHHHCCCCCCHHH---C Q ss_conf 7044434563587777664399996778440789999999999--------------------9997198530353---2 Q gi|254780750|r 632 KPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------------IMAQMGSYVPASY---A 688 (920) Q Consensus 632 ~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------------ilAQiG~fVPA~~---a 688 (920) ...|-+|+++.. ..+.+.-|=|||.+||||+||.++=+. -+|+.=+|||-+. . T Consensus 14 ~~~il~~ls~~i----~~G~i~~iiGpNG~GKSTLLk~l~~~l~p~~G~V~l~g~~i~~~~~kelAk~ia~vpQ~~~~~~ 89 (258) T COG1120 14 GKPILDDLSFSI----PKGEITGILGPNGSGKSTLLKCLAGLLKPKSGEVLLDGKDIASLSPKELAKKLAYVPQSPSAPF 89 (258) T ss_pred CEEEEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCCCHHHCCHHHHHHHEEEECCCCCCCC T ss_conf 916872236886----5997999989988899999999865678888779999972454698887561899356788999 Q ss_pred CCCCCCEEEEEEECCCCCCCC----CCHHH---------H-HHHHHHHHH----------------------HHCCCCCE Q ss_conf 068221056765237661138----53289---------9-999999999----------------------95899856 Q gi|254780750|r 689 HIGIVDKLFSRVGSADNLASG----RSTFM---------V-EMIETASIL----------------------NQATNQSF 732 (920) Q Consensus 689 ~i~~~D~IftRiGa~D~l~~g----~STF~---------v-Em~e~~~IL----------------------~~at~~SL 732 (920) .++++| -+..| ++.|. | +..+.-.+. --|.+-.+ T Consensus 90 ~~tV~d----------~V~~GR~p~~~~~~~~~~~D~~~v~~aL~~~~~~~la~r~~~~LSGGerQrv~iAraLaQ~~~i 159 (258) T COG1120 90 GLTVYE----------LVLLGRYPHLGLFGRPSKEDEEIVEEALELLGLEHLADRPVDELSGGERQRVLIARALAQETPI 159 (258) T ss_pred CCEEEE----------HHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCE T ss_conf 958736----------1742677465533578876899999999982947776685511686688999999998458997 Q ss_pred EEEECCCCCCCHHHH-HHH-HHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 999325889880567-999-999999999726984999748757976 Q gi|254780750|r 733 VILDEIGRGTATLDG-LSI-AWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 733 VllDElGrGTst~DG-~ai-A~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) +|+||- ||..|= -.+ -..++..|.+..+.-+++++|...++. T Consensus 160 LLLDEP---Ts~LDi~~Q~evl~ll~~l~~~~g~tvv~vlHDln~A~ 203 (258) T COG1120 160 LLLDEP---TSHLDIAHQIEVLELLRDLNREKGLTVVMVLHDLNLAA 203 (258) T ss_pred EEECCC---CCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHH T ss_conf 882797---20038777999999999999855978999955988999 No 161 >PRK13652 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.55 E-value=0.0016 Score=44.69 Aligned_cols=52 Identities=21% Similarity=0.263 Sum_probs=34.7 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 99856999325889880567999999999999726984999748757976-643 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) .+-.++|+||--.|=++..-..| |..+..|.+..+...+++||.-+... +++ T Consensus 154 ~~P~lLlLDEPtagLDp~~~~~i-~~~l~~l~~~~g~Tii~vtHdl~~v~~~aD 206 (277) T PRK13652 154 MEPQVLVLDEPTAGLDPQGVKEL-FDFLNALPETYGMTVIFSTHQVELVAEMAD 206 (277) T ss_pred HCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 29999998397454899999999-999999998509899999148999999799 No 162 >PRK13638 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.55 E-value=0.0013 Score=45.55 Aligned_cols=87 Identities=16% Similarity=0.228 Sum_probs=48.8 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHHCCCEEEEEEEEEEECCEEEEEEEE Q ss_conf 89985699932588988056799999999999972698499974875797-66430688589999999609927787777 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKSLKRFHNATLQVSDSNEGIIFLHKV 805 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~~~~v~n~~~~~~~~~~~i~flykl 805 (920) +..-.++|+||--.|=++..-..| +.++..|.+. +..++++||.-+.. .+++. +.. -.+++| + T Consensus 152 ~~~P~lLlLDEPtagLD~~~~~~i-~~ll~~l~~~-G~tiiivsHdl~~v~~~aDr---v~v------l~~G~i-----v 215 (271) T PRK13638 152 VLQARYLLLDEPTAGLDPAGRTQM-IAIIRRIVAQ-GNHVIISSHDIDLIYEISDA---VYV------LRQGQI-----L 215 (271) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHHHHCCE---EEE------EECCEE-----E T ss_conf 659998998387545899999999-9999999978-99999984888999996999---999------989989-----9 Q ss_pred EECCCCCCHH-HHHHHHCCCCHHHH Q ss_conf 4478988778-99999829998999 Q gi|254780750|r 806 IPGIADHSYG-IQVGKLAGLPNTVI 829 (920) Q Consensus 806 ~~G~~~~Syg-i~vA~laG~p~~vi 829 (920) ..|.+..=|. -++=+.+|+..+.+ T Consensus 216 a~Gtp~ev~~~~~~~~~~g~~~p~~ 240 (271) T PRK13638 216 THGAPGEVFACTEAMEQAGLTQPWL 240 (271) T ss_pred EECCHHHHHCCHHHHHHCCCCCCHH T ss_conf 9868999968989999769999879 No 163 >PRK10908 cell division protein FtsE; Provisional Probab=97.54 E-value=0.0024 Score=43.43 Aligned_cols=134 Identities=21% Similarity=0.192 Sum_probs=69.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC------CCCC---H-H----HCCCCCC---CEEE Q ss_conf 4434563587777664399996778440789999999999999719------8530---3-5----3206822---1056 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG------SYVP---A-S----YAHIGIV---DKLF 697 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG------~fVP---A-~----~a~i~~~---D~If 697 (920) |=+|++|.. ..+.+..|.|||-+||||+||.++=+.-- .-| --|. . + ...++.+ +.+| T Consensus 17 ~L~~vsl~i----~~Ge~~~liG~nGsGKSTLl~~i~Gl~~p-~~G~i~~~g~~i~~~~~~~~~~~r~~ig~v~Q~~~l~ 91 (222) T PRK10908 17 ALQGVTFHM----RPGEMAFLTGHSGAGKSTLLKLICGIERP-SAGKIWFSGHDITRLKNREVPFLRRQIGMIFQDHHLL 91 (222) T ss_pred EEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCC-CCEEEEECCEECCCCCHHHHHHHHHCCEEECCCCCCC T ss_conf 986438799----69989999999980799999999659999-8629999999987566667799873024774683016 Q ss_pred EEEECCCCCCCC-----CCHHHH-----HHHH------------------------HHHHHHHCCCCCEEEEECCCCCCC Q ss_conf 765237661138-----532899-----9999------------------------999999589985699932588988 Q gi|254780750|r 698 SRVGSADNLASG-----RSTFMV-----EMIE------------------------TASILNQATNQSFVILDEIGRGTA 743 (920) Q Consensus 698 tRiGa~D~l~~g-----~STF~v-----Em~e------------------------~~~IL~~at~~SLVllDElGrGTs 743 (920) ..+-..||+.-+ .+.--. |+.+ +|.. -+.+-.++|+||--.|=+ T Consensus 92 ~~~tv~eni~~~~~~~~~~~~~~~~~~~~~l~~~gl~~~~~~~p~~LSGGq~QRvaiAra--L~~~P~iLllDEPt~~LD 169 (222) T PRK10908 92 MDRTVYDNVAIPLIIAGASGDDIRRRVSAALDKVGLLDKAKNFPIQLSGGEQQRVGIARA--VVNKPAVLLADEPTGNLD 169 (222) T ss_pred CCCEEEHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHH--HHCCCCEEEEECCCCCCC T ss_conf 897700456578988499989999999999987487657648876689689999999999--976999999909876679 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 0567999999999999726984999748757976 Q gi|254780750|r 744 TLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 744 t~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) +..-..+ +.+++.+.+ .+.-++++||..+... T Consensus 170 ~~~~~~v-~~~l~~l~~-~g~tvl~vtHd~~~~~ 201 (222) T PRK10908 170 DALSEGI-LRLFEEFNR-VGVTVLMATHDIGLIS 201 (222) T ss_pred HHHHHHH-HHHHHHHHH-CCCEEEEECCCHHHHH T ss_conf 9999999-999999986-1999999947999999 No 164 >PRK13549 xylose transporter ATP-binding subunit; Provisional Probab=97.53 E-value=0.00056 Score=48.13 Aligned_cols=53 Identities=23% Similarity=0.196 Sum_probs=35.4 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 89985699932588988056799999999999972698499974875797-66430 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) ++.-.++|+||--+|=+..--..| |..+..|.+. |.-++|.||.-+.. ++++. T Consensus 421 ~~~P~iLilDEPT~GLD~~~~~~i-~~ll~~l~~~-G~tvl~iSHDl~~v~~~aDR 474 (513) T PRK13549 421 LLNPRILILDEPTRGIDVGAKYEI-YKLINQLVQQ-GVAIIVISSELPEVLGLSDR 474 (513) T ss_pred HCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEEECCHHHHHHHCCE T ss_conf 719989999798668999999999-9999999957-99999991868999986999 No 165 >PRK13633 cobalt transporter ATP-binding subunit; Provisional Probab=97.53 E-value=0.00063 Score=47.77 Aligned_cols=130 Identities=21% Similarity=0.279 Sum_probs=67.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH-C------------------CCCCC- Q ss_conf 444345635877776643999967784407899999999999997198530353-2------------------06822- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY-A------------------HIGIV- 693 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~-a------------------~i~~~- 693 (920) .+=+||++.. ..+.+.-|-|||-+||||++|.++ |...|-+- . .++.+ T Consensus 25 ~al~~Isl~i----~~GE~v~iiG~nGsGKSTL~r~l~--------gl~~P~~G~I~i~G~~~~~~~~~~~~r~~ig~vf 92 (281) T PRK13633 25 IALDDVNLEV----KKGEFLVILGHNGSGKSTIAKHMN--------ALLLPSEGKVYVDGLDTSDEENLWDIRNKAGMVF 92 (281) T ss_pred CEEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCCCEEEECCEECCCHHHHHHHHHCEEEEE T ss_conf 2673407688----799899999999984999999997--------5887888569999998788566999873608986 Q ss_pred ----CEEEEEEECCCCCCCCCCHHH---HHHH-HHHHHHH----------------------------HCCCCCEEEEEC Q ss_conf ----105676523766113853289---9999-9999999----------------------------589985699932 Q gi|254780750|r 694 ----DKLFSRVGSADNLASGRSTFM---VEMI-ETASILN----------------------------QATNQSFVILDE 737 (920) Q Consensus 694 ----D~IftRiGa~D~l~~g~STF~---vEm~-e~~~IL~----------------------------~at~~SLVllDE 737 (920) +.+|... ..|++.-|...+. .|+. .+..+|+ -|..-.++|+|| T Consensus 93 Q~P~~~l~~~t-V~e~i~fg~~~~g~~~~e~~~rv~~~l~~~gL~~~~~~~p~~LSGGqkQRvaiA~aLa~~P~iLilDE 171 (281) T PRK13633 93 QNPDNQIVATI-VEEDVAFGPENLGIPPEEIRERVDESLKKVGMYEYRRHAPHLLSGGQKQRVAIAGILAMRPECIIFDE 171 (281) T ss_pred ECCCCCCCHHH-HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHCCHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 68864202889-99999988988699999999999999986794876638910089859999999999985999999818 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 5889880567999999999999726984999748757976 Q gi|254780750|r 738 IGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 738 lGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) --.|=++.--. --+..+..|.+..+..++|+||+-+... T Consensus 172 PTs~LDp~~~~-~i~~~l~~l~~e~g~Tii~vTHdl~~~~ 210 (281) T PRK13633 172 PTAMLDPSGRR-EVVNTIKELNKKYGITIILITHYMEEAV 210 (281) T ss_pred CCCCCCHHHHH-HHHHHHHHHHHHCCCEEEEECCCHHHHH T ss_conf 73438989999-9999999999840989999867889997 No 166 >PRK10938 putative molybdenum transport ATP-binding protein ModF; Provisional Probab=97.52 E-value=0.0034 Score=42.34 Aligned_cols=33 Identities=39% Similarity=0.650 Sum_probs=24.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) |-+|+++.. ..+.++-|.|||-+||||++|.++ T Consensus 275 vl~~vs~~v----~~GE~~~i~G~nGsGKSTLl~~l~ 307 (490) T PRK10938 275 ILNNLSWQV----NPGEHWQIVGPNGAGKSTLLSLIT 307 (490) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 885357898----389889998678887999999980 No 167 >PRK11124 artP arginine transporter ATP-binding subunit; Provisional Probab=97.52 E-value=0.0031 Score=42.55 Aligned_cols=52 Identities=19% Similarity=0.155 Sum_probs=33.8 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 89985699932588988056799999999999972698499974875797-6643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) +.+--++|+||--.|-++.--..| +.++..|.+. +.-.+++||.-+.. .+++ T Consensus 157 ~~~P~iLllDEPTs~LD~~~~~~i-~~ll~~l~~~-g~tii~vtHdl~~~~~~ad 209 (242) T PRK11124 157 MMEPQVLLFDEPTAALDPEITAQI-VSIIRELAET-GITQVIVTHEVEVARKTAS 209 (242) T ss_pred CCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 379979997688654899999999-9999999842-9989998889999999699 No 168 >PRK10636 putative ABC transporter ATP-binding protein; Provisional Probab=97.52 E-value=0.00053 Score=48.34 Aligned_cols=134 Identities=20% Similarity=0.115 Sum_probs=64.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEE-EECCCCCCCCCCH Q ss_conf 444345635877776643999967784407899999999999997198530353206822105676-5237661138532 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSR-VGSADNLASGRST 712 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftR-iGa~D~l~~g~ST 712 (920) .|-+|+++.. ..+..+-|.|||-+||||+||.++=. +=..-|...-+...+++-|++-... +-....+...... T Consensus 326 ~vl~~vsl~i----~~GeriaIvG~NGsGKSTLlk~L~G~-l~p~~G~i~~~~~v~igy~~Q~~~~~l~~~~t~l~~~~~ 400 (638) T PRK10636 326 IILDSIKLNL----VPGSRIGLLGRNGAGKSTLIKLLAGE-LAPVSGEIGLAKGIKLGYFAQHQLEFLRADESPLQHLAR 400 (638) T ss_pred CCCCCCCCEE----CCCCEEEEECCCCCCHHHHHHHHCCC-CCCCCCEEEECCCCCCCCCHHHHHHHCCCHHHHHHHHHH T ss_conf 2013775056----37847999747871388999997288-788885699844443341107677650611249999988 Q ss_pred H---H----------------------------HHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 8---9----------------------------99999999999589985699932588988056799999999999972 Q gi|254780750|r 713 F---M----------------------------VEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET 761 (920) Q Consensus 713 F---~----------------------------vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~ 761 (920) + . -|-..++-..--+.+-.+.||||- |+-.|=.++- +.-+.|.+- T Consensus 401 ~~~~~~~~~~r~~L~~f~f~~~~~~~~v~~LSGGEk~Rl~LA~~l~~~pnlLiLDEP---TNhLDi~s~e-~Le~aL~~y 476 (638) T PRK10636 401 LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEP---TNHLDLDMRQ-ALTEALIDF 476 (638) T ss_pred HCHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC---CCCCCHHHHH-HHHHHHHHC T ss_conf 572546999999998668897786391133999999999999998259988998588---7668889999-999999848 Q ss_pred CCCEEEEECCCHHHHH Q ss_conf 6984999748757976 Q gi|254780750|r 762 NRCRGLLATHFHELTD 777 (920) Q Consensus 762 ~~~~~lfaTHy~eL~~ 777 (920) .|+ +||++|...+.+ T Consensus 477 ~Gt-vl~VSHDr~fl~ 491 (638) T PRK10636 477 EGA-LVVVSHDRHLLR 491 (638) T ss_pred CCE-EEEEECCHHHHH T ss_conf 983-999978999999 No 169 >PRK09580 sufC cysteine desulfurase ATPase component; Reviewed Probab=97.51 E-value=0.002 Score=43.95 Aligned_cols=35 Identities=31% Similarity=0.382 Sum_probs=28.4 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 444345635877776643999967784407899999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval 672 (920) -|=+|+++.. ..+.++-|.|||-+||||++|.++= T Consensus 15 ~vL~~isl~i----~~Gei~~iiG~nGaGKSTLl~~i~G 49 (248) T PRK09580 15 AILRGLNLEV----RPGEVHAIMGPNGSGKSTLSATLAG 49 (248) T ss_pred EEEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHC T ss_conf 9996518898----4997999999999999999999837 No 170 >PRK13643 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.51 E-value=0.00073 Score=47.32 Aligned_cols=135 Identities=23% Similarity=0.277 Sum_probs=69.7 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC------CCHH---------HCCCCC-----CC Q ss_conf 443456358777766439999677844078999999999999971985------3035---------320682-----21 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY------VPAS---------YAHIGI-----VD 694 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f------VPA~---------~a~i~~-----~D 694 (920) +=+||+|.. ..+.+.-|-|||-+||||++|.++=+. ..+-|.- |... .-.+|. .+ T Consensus 21 aL~dIsl~I----~~Ge~vaiiG~nGsGKSTLl~~l~Gll-~p~~G~V~~~~~~i~~~~~~~~~~~~~~~vG~VfQ~p~~ 95 (288) T PRK13643 21 ALFDIDLEV----KKGSYTALIGHTGSGKSTLLQHLNGLL-QPTEGKVTVGDIVVSSTSKQKEIKPVRKKVGVVFQFPES 95 (288) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCC-CCCCCEEEECCEECCCCCHHHHHHHHHHCEEEEEECCCC T ss_conf 366336798----599899999999947999999997488-888856999999856877354479877517999977732 Q ss_pred EEEEEEECCCCCCCCCCHHH---HHHHHH-HHHHH-----------------------------HCCCCCEEEEECCCCC Q ss_conf 05676523766113853289---999999-99999-----------------------------5899856999325889 Q gi|254780750|r 695 KLFSRVGSADNLASGRSTFM---VEMIET-ASILN-----------------------------QATNQSFVILDEIGRG 741 (920) Q Consensus 695 ~IftRiGa~D~l~~g~STF~---vEm~e~-~~IL~-----------------------------~at~~SLVllDElGrG 741 (920) .+|... ..|++.-|..-+. -|+.+. ...|. -|..--++|+||--.| T Consensus 96 ql~~~t-V~e~vafg~~n~g~~~~e~~~~v~~~l~~vgl~d~~~~r~p~~LSGGqkqRvaiA~aLa~~P~vLlLDEPTs~ 174 (288) T PRK13643 96 QLFEET-VLKDVAFGPQNFGIPKEKAEKIAAEKLEMVGLADEFWEKSPFELSGGQMRRVAIAGILAMEPEVLVLDEPTAG 174 (288) T ss_pred CCCHHH-HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 024336-9999998999869998999999999999759936675279763999999999999999749999999588555 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 880567999999999999726984999748757976 Q gi|254780750|r 742 TATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 742 Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) =++. |..--+.+++.|.+. +..++++||.-+... T Consensus 175 LDp~-~~~~i~~ll~~l~~~-G~TiI~vtHd~~~v~ 208 (288) T PRK13643 175 LDPK-ARIEMMQLFESIHQS-GQTVVLVTHLMDDVA 208 (288) T ss_pred CCHH-HHHHHHHHHHHHHHC-CCEEEEECCCHHHHH T ss_conf 8999-999999999999953-999999860899999 No 171 >PRK10636 putative ABC transporter ATP-binding protein; Provisional Probab=97.50 E-value=0.0033 Score=42.45 Aligned_cols=17 Identities=24% Similarity=0.452 Sum_probs=7.2 Q ss_pred CCCCCHHHHHHCCCCCC Q ss_conf 77641278763012343 Q gi|254780750|r 308 GSREQSLLKTIDYSITG 324 (920) Q Consensus 308 g~~~gSL~~~Ln~t~T~ 324 (920) |+.|.||+.+|..-.+| T Consensus 37 GsGKSTLlklL~G~~~~ 53 (638) T PRK10636 37 GCGKSTLLALLKNEISA 53 (638) T ss_pred CCHHHHHHHHHCCCCCC T ss_conf 98899999998089988 No 172 >PRK11701 phnK phosphonates transport ATP-binding protein; Provisional Probab=97.50 E-value=0.0013 Score=45.45 Aligned_cols=53 Identities=17% Similarity=0.120 Sum_probs=34.5 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 899856999325889880567999999999999726984999748757976-643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) +..-.++|+||--.|=++.--..| +..+..|.+..+.-++|+||.-+... +++ T Consensus 167 ~~~P~llllDEPtsgLD~~~~~~i-~~~l~~l~~~~g~til~vtHdl~~~~~laD 220 (258) T PRK11701 167 VTHPRLVFMDEPTGGLDVSVQARL-LDLLRGLVVELGLAVVIVTHDLAVARLLAH 220 (258) T ss_pred HCCCCEEEECCCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 649999998598656899999999-999999999609899999378899999799 No 173 >PRK13642 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.49 E-value=0.0012 Score=45.79 Aligned_cols=129 Identities=23% Similarity=0.192 Sum_probs=67.6 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEE---------EEEE---- Q ss_conf 443456358777766439999677844078999999999999971985303532068221056---------7652---- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLF---------SRVG---- 701 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~If---------tRiG---- 701 (920) +=|||++.. ..+.+.-|-|||-+||||++|.+ .|.+-| .+-++.+..+-. -+|| T Consensus 22 aL~~is~~i----~~Ge~~aiiG~sGsGKSTL~~~l--------~Gl~~~-~~G~I~~~G~~i~~~~~~~~r~~ig~VfQ 88 (277) T PRK13642 22 QLNGVSFSI----TKGEWVSIIGQNGSGKSTTARLI--------DGLFEE-FEGIVKIDGERLTAENVWNLRRKIGMVFQ 88 (277) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHH--------HCCCCC-CCCEEEECCEECCCCCHHHHHCCCEEEEE T ss_conf 644307998----89989999999996899999999--------638998-88489999999985788888517689998 Q ss_pred ----------CCCCCCCCCCHH---HHHHHH-HHHHHH----------------------------HCCCCCEEEEECCC Q ss_conf ----------376611385328---999999-999999----------------------------58998569993258 Q gi|254780750|r 702 ----------SADNLASGRSTF---MVEMIE-TASILN----------------------------QATNQSFVILDEIG 739 (920) Q Consensus 702 ----------a~D~l~~g~STF---~vEm~e-~~~IL~----------------------------~at~~SLVllDElG 739 (920) -.|++.-|.... --|+.+ +...|. -|.+-.++|+||-- T Consensus 89 ~p~~~l~~~tV~e~i~~g~~~~~~~~~e~~~~v~~~l~~~~l~~~~~~~P~~LSGGqrQRvaIA~aLa~~P~ililDEPT 168 (277) T PRK13642 89 NPDNQFVGATVEDDVAFGMENQGIPREEMIKRVDEALLAVNMLDFKTREPARLSGGQKQRVAVAGIIALRPEIIILDEST 168 (277) T ss_pred CCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 97632575508888987776669999999999999998779965655791228999999999999996699999995887 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 89880567999999999999726984999748757976 Q gi|254780750|r 740 RGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 740 rGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) .|=++.-- .--+..+..|.+..+..++|+||.-++.. T Consensus 169 s~LD~~~~-~~i~~ll~~L~~~~~~Tii~iTHdl~~~~ 205 (277) T PRK13642 169 SMLDPTGR-SEIMRVIHEIKDKYHLTVLSITHDLDEAA 205 (277) T ss_pred CCCCHHHH-HHHHHHHHHHHHHCCCEEEEEEECHHHHH T ss_conf 65898999-99999999999816989999945889997 No 174 >COG4555 NatA ABC-type Na+ transport system, ATPase component [Energy production and conversion / Inorganic ion transport and metabolism] Probab=97.49 E-value=0.00059 Score=47.96 Aligned_cols=129 Identities=27% Similarity=0.339 Sum_probs=64.1 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHC-----CCCC---CHH-HCCCCCCC---EEEEEEECCCCCC-----CCC Q ss_conf 66439999677844078999999999999971-----9853---035-32068221---0567652376611-----385 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQM-----GSYV---PAS-YAHIGIVD---KLFSRVGSADNLA-----SGR 710 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQi-----G~fV---PA~-~a~i~~~D---~IftRiGa~D~l~-----~g~ 710 (920) ..+++.=|-|||-+||||.||.+|-..+-.|= ||-. |.. .-+||+.- .+|.|+-+..||. .|. T Consensus 26 e~Gei~GlLG~NGAGKTT~LRmiatlL~P~~G~v~idg~d~~~~p~~vrr~IGVl~~e~glY~RlT~rEnl~~Fa~L~~l 105 (245) T COG4555 26 EEGEITGLLGENGAGKTTLLRMIATLLIPDSGKVTIDGVDTVRDPSFVRRKIGVLFGERGLYARLTARENLKYFARLNGL 105 (245) T ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHCCEECCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 06649998768988712379999983258886499840021017187752021313776703553089999999999624 Q ss_pred -----------------------------CHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf -----------------------------328999999999999589985699932588988056799999999999972 Q gi|254780750|r 711 -----------------------------STFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET 761 (920) Q Consensus 711 -----------------------------STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~ 761 (920) ||=|--=.-+|.-|- -.-|++++||-..|-+-.--- +-.-.+.++-+. T Consensus 106 ~~~~~kari~~l~k~l~l~~~~~rRv~~~S~G~kqkV~iARAlv--h~P~i~vlDEP~sGLDi~~~r-~~~dfi~q~k~e 182 (245) T COG4555 106 SRKEIKARIAELSKRLQLLEYLDRRVGEFSTGMKQKVAIARALV--HDPSILVLDEPTSGLDIRTRR-KFHDFIKQLKNE 182 (245) T ss_pred HHHHHHHHHHHHHHHHCHHHHHHHHHHHHCHHHHHHHHHHHHHH--CCCCEEEECCCCCCCCHHHHH-HHHHHHHHHHCC T ss_conf 02678999999998867598998887641400578899999984--398769976898774278799-999999985257 Q ss_pred CCCEEEEECCC-HHHHHHHH Q ss_conf 69849997487-57976643 Q gi|254780750|r 762 NRCRGLLATHF-HELTDLSK 780 (920) Q Consensus 762 ~~~~~lfaTHy-~eL~~l~~ 780 (920) +-.++|+||- .|+..+.+ T Consensus 183 -gr~viFSSH~m~EvealCD 201 (245) T COG4555 183 -GRAVIFSSHIMQEVEALCD 201 (245) T ss_pred -CCEEEEECCCHHHHHHHHH T ss_conf -9489996131799998613 No 175 >PRK13646 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.48 E-value=0.00059 Score=47.98 Aligned_cols=53 Identities=15% Similarity=0.150 Sum_probs=34.9 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 89985699932588988056799999999999972698499974875797-6643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |..-.++|+||--.|=++.- ..--+..+..|....+..++++||.-+.. .+++ T Consensus 161 a~~P~iLilDEPTagLDp~~-~~~i~~ll~~l~~~~g~TiI~iTHdm~~v~~~ad 214 (286) T PRK13646 161 AMNPDIIVVDEPTAGLDPQS-KRQVMRLLKSLQTDENKAIILISHDMNEVARYAD 214 (286) T ss_pred HCCCCEEEEECCCCCCCHHH-HHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 51989999838744389899-9999999999999539899999138999999699 No 176 >PRK13644 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.47 E-value=0.0011 Score=45.95 Aligned_cols=133 Identities=23% Similarity=0.265 Sum_probs=66.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC-CCC---H-H-------HCCCCCC-----CEEE Q ss_conf 44345635877776643999967784407899999999999997198-530---3-5-------3206822-----1056 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS-YVP---A-S-------YAHIGIV-----DKLF 697 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~-fVP---A-~-------~a~i~~~-----D~If 697 (920) +=+|++|.. ..+.+.-|.|||-+||||++|.++=+. -.+-|. +|- . + .-.++.+ +.+| T Consensus 17 aL~~vsl~i----~~Ge~vaiiG~nGsGKSTL~~~l~Gll-~P~~G~I~v~G~d~~~~~~~~~~r~~ig~vfQ~p~~q~~ 91 (274) T PRK13644 17 ALENINLVI----KKGEYIGIIGKNGSGKSTLALHLNGLL-RPQKGKVLVSGIDTGDFSKLQGIRKLVGIVFQNPETQFV 91 (274) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCC-CCCCCEEEECCEECCCCHHHHHHHHHEEEEEECCCCCCC T ss_conf 663117798----489999999999980999999997068-588872999999878705679998731799658211036 Q ss_pred EEEECCCCCCCCCCHH---HHHHHH-HHHHH----------------------------HHCCCCCEEEEECCCCCCCHH Q ss_conf 7652376611385328---999999-99999----------------------------958998569993258898805 Q gi|254780750|r 698 SRVGSADNLASGRSTF---MVEMIE-TASIL----------------------------NQATNQSFVILDEIGRGTATL 745 (920) Q Consensus 698 tRiGa~D~l~~g~STF---~vEm~e-~~~IL----------------------------~~at~~SLVllDElGrGTst~ 745 (920) .+ --.|++.-|.-.. -.|+.+ +...| --|.+-.++|+||--.|=++. T Consensus 92 ~~-tV~e~iafg~~~~~l~~~e~~~~v~~aL~~~gl~~~~~~~p~~LSGGqkQRvaiA~aLa~~P~iLiLDEPTs~LD~~ 170 (274) T PRK13644 92 GR-TVEEDLAFGPENLCLPPIEIRKRVDRALAEIGLEKYRHRSPKTLSGGQGQCVALAGILTMEPECLIFDEVTSMLDPD 170 (274) T ss_pred CC-CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHH T ss_conf 15-19999962197669999999999999999859687762891109976999999999998299999997986678999 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 679999999999997269849997487579 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) -- .--+..+..|.+. +..++++||.-+. T Consensus 171 ~~-~~i~~~l~~L~~~-g~TvI~itHdl~~ 198 (274) T PRK13644 171 SG-IAVLERIKKLHEK-GKTIVYITHNLEE 198 (274) T ss_pred HH-HHHHHHHHHHHHC-CCEEEEEEECHHH T ss_conf 99-9999999999868-9999998337899 No 177 >cd03261 ABC_Org_Solvent_Resistant ABC (ATP-binding cassette) transport system involved in resistant to organic solvents; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.46 E-value=0.0021 Score=43.83 Aligned_cols=133 Identities=23% Similarity=0.281 Sum_probs=70.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCC--------------------- Q ss_conf 44345635877776643999967784407899999999999997198530353206822--------------------- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIV--------------------- 693 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~--------------------- 693 (920) |=+||+|.. ..+.+..|-|||-+||||+||.++ |..-| ++-++-+. T Consensus 15 ~L~~isl~i----~~Ge~~~iiG~SGsGKSTll~~i~--------gL~~p-~~G~I~~~g~~i~~~~~~~~~~~r~~ig~ 81 (235) T cd03261 15 VLKGVDLDV----RRGEILAIIGPSGSGKSTLLRLIV--------GLLRP-DSGEVLIDGEDISGLSEAELYRLRRRMGM 81 (235) T ss_pred EECCEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCC-CCCEEEECCEECCCCCHHHHHHHHCCEEE T ss_conf 882606488----799899999999972999999997--------59998-98589999999998998899997578299 Q ss_pred ----CEEEEEEECCCCCCCCCCHH----HHHHHH-HHHHHH----------------------------HCCCCCEEEEE Q ss_conf ----10567652376611385328----999999-999999----------------------------58998569993 Q gi|254780750|r 694 ----DKLFSRVGSADNLASGRSTF----MVEMIE-TASILN----------------------------QATNQSFVILD 736 (920) Q Consensus 694 ----D~IftRiGa~D~l~~g~STF----~vEm~e-~~~IL~----------------------------~at~~SLVllD 736 (920) ..+|.++-..||+.-+.--. -.|+.+ +..+|+ -+++-.++|+| T Consensus 82 vfQ~~~Lf~~lTv~eNv~~~l~~~~~~~~~~~~~r~~~~L~~vgL~~~~~~~p~~LSGGq~QRvaIARALv~~P~illlD 161 (235) T cd03261 82 LFQSGALFDSLTVFENVAFPLREHTRLSEEEIREIVLEKLEAVGLRGAEDLYPAELSGGMKKRVALARALALDPELLLYD 161 (235) T ss_pred EECCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 70498658999699999999999579999999999999998679925764784106999999999999985489989980 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 2588988056799999999999972698499974875797-66430 Q gi|254780750|r 737 EIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 737 ElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) |--.|=++.-...| +..+..|.+..+..++|+||.-+.. .+++. T Consensus 162 EPts~LDp~~~~~i-~~li~~l~~~~g~T~i~vTHd~~~a~~~~Dr 206 (235) T cd03261 162 EPTAGLDPIASGVI-DDLIRSLKKELGLTSIMVTHDLDTAFAIADR 206 (235) T ss_pred CCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 88664798999999-9999999997299999989898999996998 No 178 >TIGR02673 FtsE cell division ATP-binding protein FtsE; InterPro: IPR005286 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). FtsE is an ABC transporter ATP-binding protein. This protein and its permease partner FtsX, localize to the cell division site. In a number of species, the ftsEX gene pair is located next to ftsY, which encodes the signal recognition particle-docki ng protein.; GO: 0005524 ATP binding, 0051301 cell division. Probab=97.46 E-value=0.0017 Score=44.50 Aligned_cols=132 Identities=26% Similarity=0.359 Sum_probs=74.5 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH-------------HCCCCCCHHHCCCCCC--C-EEEEEE Q ss_conf 34563587777664399996778440789999999999999-------------7198530353206822--1-056765 Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMA-------------QMGSYVPASYAHIGIV--D-KLFSRV 700 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilA-------------QiG~fVPA~~a~i~~~--D-~IftRi 700 (920) +||+|.. ..+.|.-||||=-+||||+||-++-.-.-. .=|..+|+=.=+||++ | ++.-.. T Consensus 19 ~~v~l~i----~kG~F~FLtG~SGAGKttLLKLl~~~~~P~~G~v~~~G~~~~~l~~~~~P~LRR~iGvvFQDf~LL~~r 94 (215) T TIGR02673 19 HDVSLHI----RKGEFLFLTGPSGAGKTTLLKLLYGALTPSRGQVRVAGEDVSRLRGRQLPLLRRRIGVVFQDFRLLPDR 94 (215) T ss_pred CCCCEEE----CCCCEEEEECCCCCCHHHHHHHHHHHCCCCCCEEEECCEECCCCCCCCCHHHHCCCCEEEECCCCCCCC T ss_conf 2764475----277407887277861789999998526987580888874046677564312213154378422110116 Q ss_pred ECCCCCC-----CCCC-HHHHHHHHHHHHHH--------HCCCC--------------------CEEEEECCCCCCCHHH Q ss_conf 2376611-----3853-28999999999999--------58998--------------------5699932588988056 Q gi|254780750|r 701 GSADNLA-----SGRS-TFMVEMIETASILN--------QATNQ--------------------SFVILDEIGRGTATLD 746 (920) Q Consensus 701 Ga~D~l~-----~g~S-TF~vEm~e~~~IL~--------~at~~--------------------SLVllDElGrGTst~D 746 (920) -..||++ .|.+ ++.- .++..+|+ ++.+. .|+|-||- |-.-| T Consensus 95 Tv~eNVAl~L~V~G~~~~~I~--~rV~~~L~~vGL~~K~~~~P~~LSGGEQQRvaIARAiv~~P~lLLADEP---TGNLD 169 (215) T TIGR02673 95 TVYENVALPLEVRGKKKREIQ--RRVEAALRLVGLEHKADAFPEQLSGGEQQRVAIARAIVNSPELLLADEP---TGNLD 169 (215) T ss_pred CHHHHCCHHHHCCCCCCHHHH--HHHHHHHHHCCCHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECC---CCCCC T ss_conf 613411210111388803367--8999999852863254257210047257888887653048967987788---99968 Q ss_pred HHHHHHHH---HHHHHHHCCCEEEEECCCHHHHHHH Q ss_conf 79999999---9999972698499974875797664 Q gi|254780750|r 747 GLSIAWAT---IEYLHETNRCRGLLATHFHELTDLS 779 (920) Q Consensus 747 G~aiA~ai---le~l~~~~~~~~lfaTHy~eL~~l~ 779 (920) - .+++.| ++.++. .|..+++|||.++|.+-. T Consensus 170 ~-~~~~~iL~ll~~~n~-~GtTV~vATHD~~L~~~~ 203 (215) T TIGR02673 170 P-ALSERILDLLKELNK-RGTTVIVATHDLELVERV 203 (215) T ss_pred H-HHHHHHHHHHHHHHH-CCCEEEEECCCHHHHHCC T ss_conf 7-678999999999841-898799980787998437 No 179 >PRK11264 putative amino-acid ABC transporter ATP-binding protein YecC; Provisional Probab=97.46 E-value=0.0022 Score=43.78 Aligned_cols=35 Identities=31% Similarity=0.506 Sum_probs=27.4 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 444345635877776643999967784407899999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval 672 (920) .|=+|+++.. ..+.+.-|.|||-+||||++|.++. T Consensus 15 ~vl~~vsl~i----~~Ge~~aliG~sGsGKSTLl~~l~g 49 (248) T PRK11264 15 TVLHGIDLEV----KPGEVVAIIGPSGSGKTTLLRCINL 49 (248) T ss_pred EEEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHHC T ss_conf 9894317798----7998999999999809999999975 No 180 >TIGR03265 PhnT2 putative 2-aminoethylphosphonate ABC transport system, ATP-binding protein component. This ABC transporter ATP-binding protein is found in a number of genomes in operon-like contexts strongly suggesting a substrate specificity for 2-aminoethylphosphonate (2-AEP). The characterized PhnSTUV system is absent in the genomes in which this system is found. These genomes encode systems for the catabolism of 2-AEP, making the need for a 2-AEP-specific transporter likely. Probab=97.46 E-value=0.00017 Score=52.02 Aligned_cols=142 Identities=24% Similarity=0.274 Sum_probs=77.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH--------HHHCCCCCCHHHCCCCCC---CEEEEEEECC Q ss_conf 44345635877776643999967784407899999999999--------997198530353206822---1056765237 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI--------MAQMGSYVPASYAHIGIV---DKLFSRVGSA 703 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi--------lAQiG~fVPA~~a~i~~~---D~IftRiGa~ 703 (920) +=+|++|.. ..+.++.|-||+-+||||+||.+|=+.- --+--..+|...-.++.| -.+|-.|-.. T Consensus 19 al~~v~l~v----~~Ge~~~llGpSG~GKtTlLr~iaGl~~p~~G~I~~~g~~v~~~~p~~R~ig~VfQ~~aLfPh~tV~ 94 (353) T TIGR03265 19 ALKDISLSV----KKGEFVCLLGPSGCGKTTLLRIIAGLERQTAGTIYQGGRDITRLPPQKRDYGIVFQSYALFPNLTVA 94 (353) T ss_pred EECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHH T ss_conf 886648699----8999999999995359999999976999987399999999999995258859997888546789299 Q ss_pred CCCCCCC-----CHH--------HHHHHHHHHHHH-------------------HCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 6611385-----328--------999999999999-------------------58998569993258898805679999 Q gi|254780750|r 704 DNLASGR-----STF--------MVEMIETASILN-------------------QATNQSFVILDEIGRGTATLDGLSIA 751 (920) Q Consensus 704 D~l~~g~-----STF--------~vEm~e~~~IL~-------------------~at~~SLVllDElGrGTst~DG~aiA 751 (920) |||.-|. +.= +.|+..+...++ -+++-.++|+||--..=+..--..+ T Consensus 95 eNiafgl~~~~~~~~e~~~rv~~~l~~~~l~~~~~r~p~~LSGGq~QRVAlARAL~~~P~vlLlDEPlsaLD~~lr~~l- 173 (353) T TIGR03265 95 DNIAYGLKNRGMGRAEVAERVAELLDLVGLPGSERKYPGQLSGGQQQRVALARALATSPGLLLLDEPLSALDARVREHL- 173 (353) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHH- T ss_conf 9998899876999999999999999876995576569646898887999999998549989999087653599999999- Q ss_pred HHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 9999999972698499974875797-66430 Q gi|254780750|r 752 WATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 752 ~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) +..+..|++..+..++|+||..+=+ .+++. T Consensus 174 ~~~l~~l~~~~~~T~i~VTHD~~EA~~laDr 204 (353) T TIGR03265 174 RTEIRQLQRRLGVTTIMVTHDQEEALSMADR 204 (353) T ss_pred HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 9999999998699899999898999986998 No 181 >cd03251 ABCC_MsbA MsbA is an essential ABC transporter, closely related to eukaryotic MDR proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.45 E-value=0.0032 Score=42.50 Aligned_cols=41 Identities=29% Similarity=0.353 Sum_probs=29.6 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 04443456358777766439999677844078999999999999971985303 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPA 685 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA 685 (920) .+|=+|++|.. ..+....|+|||-+||||++|.+ .|.+-|- T Consensus 15 ~~vL~ninl~i----~~Ge~i~IvG~sGsGKSTLl~ll--------~gl~~p~ 55 (234) T cd03251 15 PPVLRDISLDI----PAGETVALVGPSGSGKSTLVNLI--------PRFYDVD 55 (234) T ss_pred CCCEECEEEEE----CCCCEEEEECCCCCHHHHHHHHH--------HCCCCCC T ss_conf 67353608998----79999999989998299999999--------6676678 No 182 >PRK10744 phosphate transporter subunit; Provisional Probab=97.43 E-value=0.003 Score=42.68 Aligned_cols=36 Identities=25% Similarity=0.348 Sum_probs=28.2 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 4443456358777766439999677844078999999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~ 673 (920) .+=+|+++.. ..+.+.-|-|||-+||||++|.++-+ T Consensus 24 ~aL~~vsl~i----~~Ge~~~liG~nGaGKSTLlk~i~gl 59 (257) T PRK10744 24 HALKNINLDI----AKNQVTAFIGPSGCGKSTLLRTFNKM 59 (257) T ss_pred EEECCCEEEE----CCCCEEEEECCCCCCHHHHHHHHHHH T ss_conf 6781428998----89989999999998199999999876 No 183 >PRK13639 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.42 E-value=0.00071 Score=47.38 Aligned_cols=48 Identities=23% Similarity=0.272 Sum_probs=34.0 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 99856999325889880567999999999999726984999748757976 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) ..-.++|+||--.|=++. |..--+.++..|.+. +...+++||.-++.. T Consensus 154 ~~P~iliLDEPTagLDp~-~~~~i~~ll~~l~~~-G~Tii~iTHdm~~~~ 201 (275) T PRK13639 154 MNPEIMVLDEPTSGLDPM-GASQIMKLLYDLNKE-GITIIISTHDVDLVP 201 (275) T ss_pred CCCCEEEECCCCCCCCHH-HHHHHHHHHHHHHHC-CCEEEEECCCHHHHH T ss_conf 699899977975548999-999999999999976-999999938999999 No 184 >TIGR03258 PhnT 2-aminoethylphosphonate ABC transport system, ATP-binding component PhnT. This ATP-binding component of an ABC transport system is found in Salmonella and Burkholderia lineages in the vicinity of enzymes for the breakdown of 2-aminoethylphosphonate. Probab=97.41 E-value=0.0017 Score=44.54 Aligned_cols=137 Identities=24% Similarity=0.266 Sum_probs=70.7 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH-HCCC---------CCCHHHCCCCCC---CEEEEEEE Q ss_conf 4434563587777664399996778440789999999999999-7198---------530353206822---10567652 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMA-QMGS---------YVPASYAHIGIV---DKLFSRVG 701 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilA-QiG~---------fVPA~~a~i~~~---D~IftRiG 701 (920) |=+|++|.. ..+.+..|-||+-+||||+||.+|=+.--. --|. .+|...-.++.+ ..+|-.+- T Consensus 20 al~dvsl~i----~~GE~~~llGpSG~GKTTlLr~iaGL~~p~~~~G~I~~~g~~v~~~~~~~R~ig~VFQ~~aLfPhlt 95 (362) T TIGR03258 20 VLDDLSLEI----EAGELLALIGKSGCGKTTLLRAIAGFVKAAGLTGRIAIADRDLTHAPPHKRGLALLFQNYALFPHLK 95 (362) T ss_pred EECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCEEEEECCEECCCCCHHHCCEEEECCCCCCCCCCC T ss_conf 993767199----9998999999997459999999977767778817999999999989988899489717985368980 Q ss_pred CCCCCCCC-----CCHH-----HHHHHH---HHHHH------------------H-HCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 37661138-----5328-----999999---99999------------------9-589985699932588988056799 Q gi|254780750|r 702 SADNLASG-----RSTF-----MVEMIE---TASIL------------------N-QATNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 702 a~D~l~~g-----~STF-----~vEm~e---~~~IL------------------~-~at~~SLVllDElGrGTst~DG~a 749 (920) ..|||.-| .+.= ..|+.+ +...+ | -+.+-.++|+||--.+=++.--.. T Consensus 96 V~eNia~~L~~~~~~~~e~~~rv~e~l~~vgL~~~~~r~P~~LSGGq~QRVAlARAL~~~P~ilLlDEP~saLD~~~r~~ 175 (362) T TIGR03258 96 VEDNVAFGLRAQKMPKADIAERVADALKLVGLGDAAAHLPAQLSGGMQQRIAIARAIAIEPDVLLLDEPLSALDANIRAN 175 (362) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHH T ss_conf 99999899986599999999999999877899678626966789989999999999755999899818876559999999 Q ss_pred HHHHHHHHHHHHC-CCEEEEECCCHHHH Q ss_conf 9999999999726-98499974875797 Q gi|254780750|r 750 IAWATIEYLHETN-RCRGLLATHFHELT 776 (920) Q Consensus 750 iA~aile~l~~~~-~~~~lfaTHy~eL~ 776 (920) + +.-+..|++.. +..++|.||..+=+ T Consensus 176 l-~~~l~~l~~~l~~~T~i~VTHD~~EA 202 (362) T TIGR03258 176 M-REEIAALHEELPELTILCVTHDQDDA 202 (362) T ss_pred H-HHHHHHHHHHCCCCEEEEECCCHHHH T ss_conf 9-99999999976798899989998999 No 185 >cd03248 ABCC_TAP TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes. Probab=97.41 E-value=0.0032 Score=42.51 Aligned_cols=131 Identities=27% Similarity=0.331 Sum_probs=69.1 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHC------------------CCCCCC Q ss_conf 04443456358777766439999677844078999999999999971985303532------------------068221 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYA------------------HIGIVD 694 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a------------------~i~~~D 694 (920) ..|=+|+++.. ..+.+.-|.|||-+||||++|.+. |.+-|-+-. .++.+. T Consensus 27 ~~vL~~is~~i----~~Ge~vaIvG~sGsGKSTL~~ll~--------gl~~p~~G~I~idg~~i~~~~~~~lr~~i~~v~ 94 (226) T cd03248 27 TLVLQDVSFTL----HPGEVTALVGPSGSGKSTVVALLE--------NFYQPQGGQVLLDGKPISQYEHKYLHSKVSLVG 94 (226) T ss_pred CEEEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCCCEEEECCEEHHHCCHHHHHHCEEEEE T ss_conf 94374538998----299999999999984999999996--------454678878999999934489999973269992 Q ss_pred ---EEEEEEECCCCCCCCCCHHH-HHHHHHHH-----------------------------------HHH-HCCCCCEEE Q ss_conf ---05676523766113853289-99999999-----------------------------------999-589985699 Q gi|254780750|r 695 ---KLFSRVGSADNLASGRSTFM-VEMIETAS-----------------------------------ILN-QATNQSFVI 734 (920) Q Consensus 695 ---~IftRiGa~D~l~~g~STF~-vEm~e~~~-----------------------------------IL~-~at~~SLVl 734 (920) .+|.. --.|||.-|....- -++.+... |.| -..+..++| T Consensus 95 Q~~~lf~~-ti~eNi~~g~~~~~~~~i~~~~~~~~~~~~i~~l~~gl~t~i~~~g~~LSgGqkQRialARal~~~p~ili 173 (226) T cd03248 95 QEPVLFAR-SLQDNIAYGLQSCSFECVKEAAQKAHAHSFISELASGYDTEVGEKGSQLSGGQKQRVAIARALIRNPQVLI 173 (226) T ss_pred CCCEECCC-CHHHHHHCCCCCCCHHHHHHHHHHHCHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 47957677-35666632789999999999999966146776263666406168488769999999999999975999999 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHHH Q ss_conf 9325889880567999999999999726-984999748757976643 Q gi|254780750|r 735 LDEIGRGTATLDGLSIAWATIEYLHETN-RCRGLLATHFHELTDLSK 780 (920) Q Consensus 735 lDElGrGTst~DG~aiA~aile~l~~~~-~~~~lfaTHy~eL~~l~~ 780 (920) +||- ||..|-.+ ...+.+.|.+.. +..+++.||-.++...++ T Consensus 174 lDEp---tSaLD~~t-e~~i~~~l~~~~~~~Tvi~ItH~l~~~~~~D 216 (226) T cd03248 174 LDEA---TSALDAES-EQQVQQALYDWPERRTVLVIAHRLSTVERAD 216 (226) T ss_pred EECC---CCCCCHHH-HHHHHHHHHHHCCCCEEEEEECCHHHHHHCC T ss_conf 9797---66889999-9999999998669999999937999998499 No 186 >cd03289 ABCC_CFTR2 The CFTR subfamily domain 2. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=97.40 E-value=0.0042 Score=41.65 Aligned_cols=34 Identities=26% Similarity=0.266 Sum_probs=26.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 44434563587777664399996778440789999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .|=+|+++.. ..+.+.-|.|||-+||||+++.+. T Consensus 18 ~vL~~isf~I----~~Ge~vaIvG~sGsGKSTLl~lL~ 51 (275) T cd03289 18 AVLENISFSI----SPGQRVGLLGRTGSGKSTLLSAFL 51 (275) T ss_pred CCEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 6242507998----799999999999997999999996 No 187 >COG1118 CysA ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism] Probab=97.39 E-value=0.0022 Score=43.67 Aligned_cols=138 Identities=27% Similarity=0.314 Sum_probs=76.0 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH------------HHHCCCCCCHHHCCCCCCC---EEEEEEE Q ss_conf 345635877776643999967784407899999999999------------9971985303532068221---0567652 Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI------------MAQMGSYVPASYAHIGIVD---KLFSRVG 701 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi------------lAQiG~fVPA~~a~i~~~D---~IftRiG 701 (920) +||+|.. .++.+.-++|||.+||||+||-+|=+-- |... .-+|....++|.+= ..|--|- T Consensus 19 ~di~l~i----~~Ge~vaLlGpSGaGKsTlLRiIAGLe~p~~G~I~~~~~~l~D~-~~~~~~~R~VGfvFQ~YALF~Hmt 93 (345) T COG1118 19 DDISLDI----KSGELVALLGPSGAGKSTLLRIIAGLETPDAGRIRLNGRVLFDV-SNLAVRDRKVGFVFQHYALFPHMT 93 (345) T ss_pred CCCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCH-HCCCHHHCCEEEEEECHHHCCCCH T ss_conf 4631550----68868999778987678899998575778875699999861000-003234402148972166556452 Q ss_pred CCCCCCCCC------------CHHHHHHHHHHHH--------------------HH--HCCCCCEEEEECCCCCCCHHHH Q ss_conf 376611385------------3289999999999--------------------99--5899856999325889880567 Q gi|254780750|r 702 SADNLASGR------------STFMVEMIETASI--------------------LN--QATNQSFVILDEIGRGTATLDG 747 (920) Q Consensus 702 a~D~l~~g~------------STF~vEm~e~~~I--------------------L~--~at~~SLVllDElGrGTst~DG 747 (920) -.|||+=|. .+=..|+.++-.. |- -|-+-.+.||||-.+.-++.=- T Consensus 94 Va~NIAFGl~~~~~~p~~~~~r~rv~elL~lvqL~~la~ryP~qLSGGQrQRVALARALA~eP~vLLLDEPf~ALDa~vr 173 (345) T COG1118 94 VADNIAFGLKVRKERPSEAEIRARVEELLRLVQLEGLADRYPAQLSGGQRQRVALARALAVEPKVLLLDEPFGALDAKVR 173 (345) T ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHCCCHHCCHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHH T ss_conf 87654210000355887356789999999986401443439421271788999999875349986863587214519999 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 99999999999972698499974875797-6643 Q gi|254780750|r 748 LSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 748 ~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) .- -.+-|..+++..++.++|.||..+=+ ++++ T Consensus 174 ~~-lr~wLr~~~~~~~~ttvfVTHD~eea~~lad 206 (345) T COG1118 174 KE-LRRWLRKLHDRLGVTTVFVTHDQEEALELAD 206 (345) T ss_pred HH-HHHHHHHHHHHHCCEEEEEECCHHHHHHHCC T ss_conf 99-9999999998609639999589999986416 No 188 >cd03260 ABC_PstB_phosphate_transporter Phosphate uptake is of fundamental importance in the cell physiology of bacteria because phosphate is required as a nutrient. The Pst system of E. coli comprises four distinct subunits encoded by the pstS, pstA, pstB, and pstC genes. The PstS protein is a phosphate-binding protein located in the periplasmic space. P stA and PstC are hydrophobic and they form the transmembrane portion of the Pst system. PstB is the catalytic subunit, which couples the energy of ATP hydrolysis to the import of phosphate across cellular membranes through the Pst system, often referred as ABC-protein. PstB belongs to one of the largest superfamilies of proteins characterized by a highly conserved adenosine triphosphate (ATP) binding cassette (ABC), which is also a nucleotide binding domain (NBD). Probab=97.38 E-value=0.0064 Score=40.27 Aligned_cols=37 Identities=32% Similarity=0.389 Sum_probs=28.5 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 44434563587777664399996778440789999999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV 674 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v 674 (920) .|=+|++|.. ..+.+.-|.|||-+||||++|.++-+. T Consensus 14 ~~L~~isl~i----~~Ge~~~iiG~SGsGKSTll~~i~gL~ 50 (227) T cd03260 14 HALKDISLDI----PKGEITALIGPSGCGKSTLLRLLNRLN 50 (227) T ss_pred EEEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHHHHH T ss_conf 8883406788----799899999999981999999997445 No 189 >TIGR03411 urea_trans_UrtD urea ABC transporter, ATP-binding protein UrtD. Members of this protein family are ABC transporter ATP-binding subunits associated with urea transport and metabolism. This protein is found in a conserved five-gene transport operon typically found adjacent to urease genes. It was shown in Cyanobacteria that disruption leads to the loss of high-affinity urea transport activity. Probab=97.38 E-value=0.0028 Score=42.92 Aligned_cols=132 Identities=23% Similarity=0.306 Sum_probs=68.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC------CC----CHHHCCCCC-----CCEEEEE Q ss_conf 44345635877776643999967784407899999999999997198------53----035320682-----2105676 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS------YV----PASYAHIGI-----VDKLFSR 699 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~------fV----PA~~a~i~~-----~D~IftR 699 (920) +=+|++|.. ..+.+.-|-|||-+||||++|.++=+.- ..-|. -+ |.+.++.++ ...+|-. T Consensus 17 al~~vsl~v----~~Gei~~liGpNGaGKSTLl~~i~Gl~~-p~~G~I~~~G~~i~~~~~~~~~~~gIg~~~Q~~~l~~~ 91 (242) T TIGR03411 17 ALNDLSLYV----DPGELRVIIGPNGAGKTTMMDVITGKTR-PDEGSVLFGGTDLTGLPEHQIARAGIGRKFQKPTVFEN 91 (242) T ss_pred EECCCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCC-CCCEEEEECCEECCCCCHHHHHHCCCCEECCCCCCCCC T ss_conf 874507898----8998999998999759999999967957-88559999999888999999997488545266676766 Q ss_pred EECCCCCCCC----CCHH--------------------------------------HHHHHHHHHHHHHCCCCCEEEEEC Q ss_conf 5237661138----5328--------------------------------------999999999999589985699932 Q gi|254780750|r 700 VGSADNLASG----RSTF--------------------------------------MVEMIETASILNQATNQSFVILDE 737 (920) Q Consensus 700 iGa~D~l~~g----~STF--------------------------------------~vEm~e~~~IL~~at~~SLVllDE 737 (920) +-..||+.-+ .+-| +--...++..| +++-.++|+|| T Consensus 92 ltv~enl~~~~~~~~~~~~~l~~~~~~~~~~~v~~~l~~~~L~~~~~~~~~~LSgGqkqrv~iA~aL--~~~P~lllLDE 169 (242) T TIGR03411 92 LTVFENLELALPRDKSVFASLFFRLSAEEKDRIEEVLETIGLADEADRLAGLLSHGQKQWLEIGMLL--MQDPKLLLLDE 169 (242) T ss_pred CCHHHHHHHHHHCCCCHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHCCCHHHCCHHHHHHHHHHHHH--HHCCCEEEEEC T ss_conf 9799999998751555024430366499999999999877997265586345997899999999999--73899899918 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 588988056799999999999972698499974875797 Q gi|254780750|r 738 IGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 738 lGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) --+|=++..-..| |.+++.|.+ +.-++++||.-+.. T Consensus 170 Pt~gLD~~~~~~i-~~ll~~l~~--~~tvi~isHdl~~~ 205 (242) T TIGR03411 170 PVAGMTDEETEKT-AELLKSLAG--KHSVVVVEHDMEFV 205 (242) T ss_pred CCCCCCHHHHHHH-HHHHHHHHC--CCEEEEEECCHHHH T ss_conf 6436998999999-999999857--99799997859999 No 190 >PRK13409 putative ATPase RIL; Provisional Probab=97.35 E-value=0.0057 Score=40.63 Aligned_cols=157 Identities=24% Similarity=0.251 Sum_probs=80.7 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH-----CCCCCCCEEEEE---EECCCCCCC-----CCCHHHH Q ss_conf 643999967784407899999999999997198530353-----206822105676---523766113-----8532899 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASY-----AHIGIVDKLFSR---VGSADNLAS-----GRSTFMV 715 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~-----a~i~~~D~IftR---iGa~D~l~~-----g~STF~v 715 (920) .+.++-|.|||-+||||++|.++ |..=|-+- .+++-+++-... .-..|-+.+ +.+.|-. T Consensus 364 ~GEiigIvG~NGaGKTTLlKiLa--------G~lkPd~G~V~~~~~isY~pQ~i~~d~~~tV~~~L~~~~~~~~~~~~~~ 435 (590) T PRK13409 364 KGEVIGIVGPNGIGKTTFVKLLA--------GVLKPDEGEVDTDLKISYKPQYIKPDYDGTVEDLLRSITDDLGSSYYKS 435 (590) T ss_pred CCCEEEEECCCCCCHHHHHHHHH--------CCCCCCCCEEEECCEEEECCHHCCCCCCCCHHHHHHHHCCCCCHHHHHH T ss_conf 47489998888887899999982--------8877887447547748733600146868819999986244225389999 Q ss_pred HH----------------------HHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 99----------------------99999999589985699932588988056799999999999972698499974875 Q gi|254780750|r 716 EM----------------------IETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 716 Em----------------------~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) |+ ..++-..--+.+-.+.||||---+=+-..-+++|-+|-+++.+. +.-+++.+|.- T Consensus 436 ell~~L~l~~lldk~v~~LSGGEkQRvaLA~~L~~~anvLLLDEPTn~LDvE~R~~~~k~i~~~~~~~-~~t~~vV~HD~ 514 (590) T PRK13409 436 EIIKPLNLEKLLDRNVKDLSGGELQRVAIAACLSRDADLYLLDEPSAYLDVEQRLAVARAIRRIAEEK-EKAALVVDHDI 514 (590) T ss_pred HHHHHHCCHHHHCCCHHHCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHC-CCEEEEEECCH T ss_conf 99988788867649624409899999999998667999999948988778899999999999999866-97599994709 Q ss_pred HHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 7976643068858999999960992778777744789887789999982999899999999999998 Q gi|254780750|r 774 ELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFE 840 (920) Q Consensus 774 eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le 840 (920) .+.++... ++ ++| .=.||+-+ .|.=|.+..+--...++.|+ T Consensus 515 ~~~d~lsd--rv-------------~vf--~G~p~~~~---------~a~~p~~~~~gmn~fl~~~~ 555 (590) T PRK13409 515 YMIDYISD--RL-------------MVF--EGEPGKHG---------HASGPLDMREGMNRFLKELG 555 (590) T ss_pred HHHHHHCC--EE-------------EEE--CCCCCCCC---------EECCCCCHHHHHHHHHHHCC T ss_conf 99987616--69-------------998--17887651---------40598447899999998779 No 191 >cd03290 ABCC_SUR1_N The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=97.35 E-value=0.0031 Score=42.55 Aligned_cols=33 Identities=21% Similarity=0.310 Sum_probs=25.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) |=+|+++.. ..+.+..|.|||-+||||+|+.+. T Consensus 16 vL~~inl~i----~~Ge~~~IvG~sGsGKSTLl~~l~ 48 (218) T cd03290 16 TLSNINIRI----PTGQLTMIVGQVGCGKSSLLLAIL 48 (218) T ss_pred EEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 564769998----699999999999980999999985 No 192 >cd03254 ABCC_Glucan_exporter_like Glucan exporter ATP-binding protein. In A. tumefaciens cyclic beta-1, 2-glucan must be transported into the periplasmic space to exert its action as a virluence factor. This subfamily belongs to the MRP-like family and is involved in drug, peptide, and lipid export. The MRP-like family, similar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains each composed of six transmembrane (TM) helices and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.35 E-value=0.0028 Score=42.99 Aligned_cols=34 Identities=24% Similarity=0.391 Sum_probs=26.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 44434563587777664399996778440789999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +|=++++|.. ..+.+..|+|||-+||||++|-+. T Consensus 17 ~vL~~inl~i----~~Ge~vaivG~sGsGKSTLl~ll~ 50 (229) T cd03254 17 PVLKDINFSI----KPGETVAIVGPTGAGKTTLINLLM 50 (229) T ss_pred EEEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH T ss_conf 0874629998----799999999999980999999996 No 193 >cd03297 ABC_ModC_molybdenum_transporter ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.34 E-value=0.0057 Score=40.62 Aligned_cols=141 Identities=22% Similarity=0.280 Sum_probs=68.3 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC---------------CCCCHHHCCCCCCC--- Q ss_conf 044434563587777664399996778440789999999999999719---------------85303532068221--- Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG---------------SYVPASYAHIGIVD--- 694 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG---------------~fVPA~~a~i~~~D--- 694 (920) .|.- ++++.. ++.+.-|-||+-+||||+||.++-+. -...| ..+|...-.++.+= T Consensus 12 ~f~l-~i~f~i-----~ge~~~iiGpSGsGKSTll~~i~GL~-~p~sG~I~~~G~~~~~~~~~~~~~~~~r~ig~VfQ~~ 84 (214) T cd03297 12 DFTL-KIDFDL-----NEEVTGIFGASGAGKSTLLRCIAGLE-KPDGGTIVLNGTVLFDSRKKINLPPQQRKIGLVFQQY 84 (214) T ss_pred CEEE-EEEEEC-----CCCEEEEECCCCCHHHHHHHHHHCCC-CCCCEEEEECCEECCCCCCCCCCCCCCCCEEEEECCC T ss_conf 9999-999862-----99799999999735999999998499-9996499999999766541246771348758976787 Q ss_pred EEEEEEECCCCCCCC---CCHH--------HHHHHHHHHHHH-------------------HCCCCCEEEEECCCCCCCH Q ss_conf 056765237661138---5328--------999999999999-------------------5899856999325889880 Q gi|254780750|r 695 KLFSRVGSADNLASG---RSTF--------MVEMIETASILN-------------------QATNQSFVILDEIGRGTAT 744 (920) Q Consensus 695 ~IftRiGa~D~l~~g---~STF--------~vEm~e~~~IL~-------------------~at~~SLVllDElGrGTst 744 (920) .+|-.+-..||+.-+ .+.= +.|+.+...+++ -+.+-.++|+||--.+=++ T Consensus 85 ~Lfp~ltV~eNi~~~l~~~~~~~~~~~v~e~l~~~gl~~~~~~~P~~LSGGq~QRVaiARAL~~~P~llLlDEP~saLD~ 164 (214) T cd03297 85 ALFPHLNVRENLAFGLKRKRNREDRISVDELLDLLGLDHLLNRYPAQLSGGEKQRVALARALAAQPELLLLDEPFSALDR 164 (214) T ss_pred CCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCH T ss_conf 65788919999988876798689999999999877997786089777992999999999998719999998088766699 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 56799999999999972698499974875797-66430 Q gi|254780750|r 745 LDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 745 ~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) .-...| ...+..+.+..+..++++||..+-. .+++. T Consensus 165 ~~~~~i-~~~l~~l~~~~~~t~i~VTHd~~e~~~ladr 201 (214) T cd03297 165 ALRLQL-LPELKQIKKNLNIPVIFVTHDLSEAEYLADR 201 (214) T ss_pred HHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 999999-9999999998599899998999999996999 No 194 >cd03369 ABCC_NFT1 Domain 2 of NFT1 (New full-length MRP-type transporter 1). NFT1 belongs to the MRP (mulrtidrug resisitance-associated protein) family of ABC transporters. Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions such as glutathione, glucuronate, and sulfate. Probab=97.34 E-value=0.0055 Score=40.75 Aligned_cols=34 Identities=18% Similarity=0.224 Sum_probs=26.8 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 44434563587777664399996778440789999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .|=+|++|.. ..+...-|.|||-+||||++|.++ T Consensus 22 ~iL~~isl~i----~~Ge~v~ivG~sGsGKSTLl~ll~ 55 (207) T cd03369 22 PVLKNVSFKV----KAGEKIGIVGRTGAGKSTLILALF 55 (207) T ss_pred CCEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 7240258898----699999999999987999999999 No 195 >PRK13549 xylose transporter ATP-binding subunit; Provisional Probab=97.32 E-value=0.0052 Score=40.94 Aligned_cols=33 Identities=15% Similarity=0.246 Sum_probs=24.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +=+|+++.. ..+.++-|+|||-+||||++|.++ T Consensus 277 ~l~~vsf~v----~~GEi~gi~G~nGsGKsTLl~~L~ 309 (513) T PRK13549 277 RVDDVSFSL----RRGEILGIAGLVGAGRTELVQCLF 309 (513) T ss_pred EEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 652335788----688489974798865899999983 No 196 >cd03253 ABCC_ATM1_transporter ATM1 is an ABC transporter that is expressed in the mitochondria. Although the specific function of ATM1 is unknown, its disruption results in the accumulation of excess mitochondrial iron, loss of mitochondrial cytochromes, oxidative damage to mitochondrial DNA, and decreased levels of cytosolic heme proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.32 E-value=0.0088 Score=39.21 Aligned_cols=41 Identities=32% Similarity=0.401 Sum_probs=29.5 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 44434563587777664399996778440789999999999999719853035 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS 686 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~ 686 (920) .|=+|+++.. ..+.+.-|.|||-+||||++|-+ .|.+-|.+ T Consensus 15 ~iL~~isl~i----~~Ge~v~ivG~sGsGKSTLl~ll--------~gl~~p~~ 55 (236) T cd03253 15 PVLKDVSFTI----PAGKKVAIVGPSGSGKSTILRLL--------FRFYDVSS 55 (236) T ss_pred CEEECEEEEE----CCCCEEEEECCCCCCHHHHHHHH--------CCCCCCCC T ss_conf 6330568998----69999999999999899999997--------43854887 No 197 >PRK11288 araG L-arabinose transporter ATP-binding protein; Provisional Probab=97.31 E-value=0.0042 Score=41.65 Aligned_cols=33 Identities=12% Similarity=0.200 Sum_probs=25.2 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +.+|++|.. ..+.++-|.|||-+||||++|.++ T Consensus 268 ~~~~vsl~v----~~GEivgivG~nGsGKSTL~k~L~ 300 (501) T PRK11288 268 LREPISFAV----RRGEIVGFFGLVGAGRSELMKLLY 300 (501) T ss_pred CCCCEEEEE----ECCEEEEEECCCCCCHHHHHHHHC T ss_conf 456634787----088399975688864879999843 No 198 >COG4619 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=97.29 E-value=0.0058 Score=40.58 Aligned_cols=133 Identities=29% Similarity=0.344 Sum_probs=67.0 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH-----CCCCC-----CHH----------HCCC-- Q ss_conf 0444345635877776643999967784407899999999999997-----19853-----035----------3206-- Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-----MGSYV-----PAS----------YAHI-- 690 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQ-----iG~fV-----PA~----------~a~i-- 690 (920) .-+-|.++|.. ..+-+..||||-.+||||++|++|.+.=.-- =|-.| ||= .+-+ T Consensus 16 a~il~~isl~v----~~Ge~iaitGPSG~GKStllk~va~Lisp~~G~l~f~Ge~vs~~~pea~Rq~VsY~~Q~paLfg~ 91 (223) T COG4619 16 AKILNNISLSV----RAGEFIAITGPSGCGKSTLLKIVASLISPTSGTLLFEGEDVSTLKPEAYRQQVSYCAQTPALFGD 91 (223) T ss_pred CEEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCCC T ss_conf 74432413665----38854887678876688999999813699885288747334434859999999999728421466 Q ss_pred CCCCEE-E------------------EEEECCCCCCCCCCHHH--HHHHHHHHHHHHC-CCCCEEEEECCCCCCCHHH-- Q ss_conf 822105-6------------------76523766113853289--9999999999958-9985699932588988056-- Q gi|254780750|r 691 GIVDKL-F------------------SRVGSADNLASGRSTFM--VEMIETASILNQA-TNQSFVILDEIGRGTATLD-- 746 (920) Q Consensus 691 ~~~D~I-f------------------tRiGa~D~l~~g~STF~--vEm~e~~~IL~~a-t~~SLVllDElGrGTst~D-- 746 (920) ++.|++ | .|.+-.|.+....-|=| -|-... .+.++- +---..|+||+ ||..| T Consensus 92 tVeDNlifP~~~r~rr~dr~aa~~llar~~l~~~~L~k~it~lSGGE~Qri-AliR~Lq~~P~ILLLDE~---TsALD~~ 167 (223) T COG4619 92 TVEDNLIFPWQIRNRRPDRAAALDLLARFALPDSILTKNITELSGGEKQRI-ALIRNLQFMPKILLLDEI---TSALDES 167 (223) T ss_pred CHHHCCCCCHHHHCCCCCHHHHHHHHHHCCCCHHHHCCHHHHCCCHHHHHH-HHHHHHHCCCCEEEECCC---HHHCCHH T ss_conf 332231142577536888679999998707964664140233166078999-999986307746873371---4333825 Q ss_pred -HHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf -7999999999999726984999748757 Q gi|254780750|r 747 -GLSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 747 -G~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) --+|-.-+..|+.+. .--.+..||... T Consensus 168 nkr~ie~mi~~~v~~q-~vAv~WiTHd~d 195 (223) T COG4619 168 NKRNIEEMIHRYVREQ-NVAVLWITHDKD 195 (223) T ss_pred HHHHHHHHHHHHHHHH-CEEEEEEECCHH T ss_conf 6888999999970042-458999953858 No 199 >COG4152 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=97.28 E-value=0.0017 Score=44.63 Aligned_cols=121 Identities=19% Similarity=0.268 Sum_probs=73.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC-----EEEEEEECCC---CC Q ss_conf 443456358777766439999677844078999999999999971985303532068221-----0567652376---61 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD-----KLFSRVGSAD---NL 706 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D-----~IftRiGa~D---~l 706 (920) +-||+++.. ..++++=+-|||-+||+|.+|.+.= |.++.+-+|++.. .+.-|||-=- -| T Consensus 17 av~~isf~v----~~G~i~GllG~NGAGKTTtfRmILg---------lle~~~G~I~~~g~~~~~~~~~rIGyLPEERGL 83 (300) T COG4152 17 AVDNISFEV----PPGEIFGLLGPNGAGKTTTFRMILG---------LLEPTEGEITWNGGPLSQEIKNRIGYLPEERGL 83 (300) T ss_pred EECCEEEEE----CCCEEEEEECCCCCCCCCHHHHHHC---------CCCCCCCEEEECCCCHHHHHHHHCCCCHHHHCC T ss_conf 662413665----4871787665888973233999864---------578667668985851005556540658154066 Q ss_pred CC-----CCCHHHHHHH----------------------------------------HHHHHHHHCCCCCEEEEECCCCC Q ss_conf 13-----8532899999----------------------------------------99999995899856999325889 Q gi|254780750|r 707 AS-----GRSTFMVEMI----------------------------------------ETASILNQATNQSFVILDEIGRG 741 (920) Q Consensus 707 ~~-----g~STF~vEm~----------------------------------------e~~~IL~~at~~SLVllDElGrG 741 (920) .. .+=-|..++. =++.+++ +--||||||-..| T Consensus 84 y~k~tv~dql~yla~LkGm~~~e~~~~~~~wLer~~i~~~~~~kIk~LSKGnqQKIQfisaviH---ePeLlILDEPFSG 160 (300) T COG4152 84 YPKMTVEDQLKYLAELKGMPKAEIQKKLQAWLERLEIVGKKTKKIKELSKGNQQKIQFISAVIH---EPELLILDEPFSG 160 (300) T ss_pred CCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC---CCCEEEECCCCCC T ss_conf 7567199999999986499689999999999996065654442477753011678999999852---8877996688668 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 88056799999999999972698499974875 Q gi|254780750|r 742 TATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 742 Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) -+|-.--++--+|.+.= + .|+..+|+||-- T Consensus 161 LDPVN~elLk~~I~~lk-~-~GatIifSsH~M 190 (300) T COG4152 161 LDPVNVELLKDAIFELK-E-EGATIIFSSHRM 190 (300) T ss_pred CCHHHHHHHHHHHHHHH-H-CCCEEEEECCHH T ss_conf 87232999999999987-3-587799853337 No 200 >cd03295 ABC_OpuCA_Osmoprotection OpuCA is a the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment. ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition, to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=97.27 E-value=0.0073 Score=39.81 Aligned_cols=139 Identities=20% Similarity=0.273 Sum_probs=69.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC------CC---CHHH--CCCCCC---CEEEEEE Q ss_conf 44345635877776643999967784407899999999999997198------53---0353--206822---1056765 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS------YV---PASY--AHIGIV---DKLFSRV 700 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~------fV---PA~~--a~i~~~---D~IftRi 700 (920) +=+|++|.. ..+.+..|-|||-+||||+||.++-+.= ..-|. -| +... -.++.+ ..+|-.+ T Consensus 16 al~~vsl~i----~~Ge~~~ilGpSG~GKSTllr~i~gl~~-p~~G~I~i~g~~i~~~~~~~lrr~ig~vfQ~~~Lfp~l 90 (242) T cd03295 16 AVNNLNLEI----AKGEFLVLIGPSGSGKTTTMKMINRLIE-PTSGEIFIDGEDIREQDPVELRRKIGYVIQQIGLFPHM 90 (242) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCC-CCCEEEEECCEECCCCCHHHHHCCCEEEECCCCCCCCC T ss_conf 983027688----6998999999999569999999975999-98159999999999999789738867991799758888 Q ss_pred ECCCCCC-----CCCCHHH-----HHHHHH-----HHHHH-------------------HCCCCCEEEEECCCCCCCHHH Q ss_conf 2376611-----3853289-----999999-----99999-------------------589985699932588988056 Q gi|254780750|r 701 GSADNLA-----SGRSTFM-----VEMIET-----ASILN-------------------QATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 701 Ga~D~l~-----~g~STF~-----vEm~e~-----~~IL~-------------------~at~~SLVllDElGrGTst~D 746 (920) -..||+. .|.|.-- .|+.+. +.+++ -|..-.++|+||- ||..| T Consensus 91 tV~eNi~~~~~~~~~~~~~~~~rv~ell~~v~L~~~~~~~~~p~~LSGGqkQRvaiARAl~~~P~ilLlDEP---~saLD 167 (242) T cd03295 91 TVEENIALVPKLLKWPKEKIRERADELLALVGLDPAEFADRYPHELSGGQQQRVGVARALAADPPLLLMDEP---FGALD 167 (242) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHCCCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCC---CCCCC T ss_conf 299999999997599999999999999987499930110079566899999999999999629999998187---65469 Q ss_pred HHH--HHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHH Q ss_conf 799--999999999972698499974875797-66430 Q gi|254780750|r 747 GLS--IAWATIEYLHETNRCRGLLATHFHELT-DLSKS 781 (920) Q Consensus 747 G~a--iA~aile~l~~~~~~~~lfaTHy~eL~-~l~~~ 781 (920) -.. --|..+..|.+..+-.++|+||.-+.+ .+++. T Consensus 168 ~~~~~~i~~~l~~l~~~~~~T~i~vTHd~~ea~~~aDr 205 (242) T cd03295 168 PITRDQLQEEFKRLQQELGKTIVFVTHDIDEAFRLADR 205 (242) T ss_pred HHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHCCE T ss_conf 89999999999999997599999999899999996998 No 201 >PRK10762 D-ribose transporter ATP binding protein; Provisional Probab=97.26 E-value=0.0063 Score=40.28 Aligned_cols=31 Identities=16% Similarity=0.333 Sum_probs=23.9 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 34563587777664399996778440789999999 Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +|++|.. ..+.++-|.|||-+||||++|.++ T Consensus 269 ~~vs~~v----~~GEi~gi~G~nGsGKsTL~~~l~ 299 (501) T PRK10762 269 NDVSFTL----RKGEILGVSGLMGAGRTELMKVLY 299 (501) T ss_pred CCEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 6344476----688189966788876889999981 No 202 >cd03291 ABCC_CFTR1 The CFTR subfamily domain 1. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits, or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=97.25 E-value=0.0089 Score=39.18 Aligned_cols=124 Identities=18% Similarity=0.218 Sum_probs=63.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHC-----CCCCCCE---EEEEEECCCCC Q ss_conf 443456358777766439999677844078999999999999971985303532-----0682210---56765237661 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYA-----HIGIVDK---LFSRVGSADNL 706 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a-----~i~~~D~---IftRiGa~D~l 706 (920) |=+|+++.. ..+...-|.|||-+||||+++.+. |.+-|-+-. +++.+.+ +|.. --.||| T Consensus 52 VLk~Isf~I----~~Ge~vaIVG~sGSGKSTLl~lL~--------gl~~p~~G~I~~~g~i~~v~Q~~~lf~~-TireNI 118 (282) T cd03291 52 VLKNINLKI----EKGEMLAITGSTGSGKTSLLMLIL--------GELEPSEGKIKHSGRISFSSQFSWIMPG-TIKENI 118 (282) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH--------CCCCCCCCEEEECCEEEEECCCCCCCCC-CHHHHH T ss_conf 141648998----499999999999981999999995--------7872786589999999986574422671-099997 Q ss_pred CCCCCHH---HHHHHHHH---HHHH------------------------------HCCCCCEEEEECCCCCCCHHHHH-- Q ss_conf 1385328---99999999---9999------------------------------58998569993258898805679-- Q gi|254780750|r 707 ASGRSTF---MVEMIETA---SILN------------------------------QATNQSFVILDEIGRGTATLDGL-- 748 (920) Q Consensus 707 ~~g~STF---~vEm~e~~---~IL~------------------------------~at~~SLVllDElGrGTst~DG~-- 748 (920) .-|.+.= +.+..+.+ ..+. -.....++|+||- ||..|-. T Consensus 119 ~~g~~~~~~~~~~~~~~~~l~~~i~~lp~g~~t~vge~G~~LSGGQkQRlaiARALl~~p~IliLDEp---TS~LD~~tE 195 (282) T cd03291 119 IFGVSYDEYRYKSVVKACQLEEDITKFPEKDNTVLGEGGITLSGGQRARISLARAVYKDADLYLLDSP---FGYLDVFTE 195 (282) T ss_pred HCCCCCCHHHHHHHHHHHCCHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC---CCCCCHHHH T ss_conf 51688688999999998514999984634255230036775899999999999998428998998687---766898789 Q ss_pred -HHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf -9999999999972698499974875797 Q gi|254780750|r 749 -SIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 749 -aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) .|-.+++..+. .+..+++.||-.+.. T Consensus 196 ~~I~~~~l~~~~--~~kTvI~ItHrL~~i 222 (282) T cd03291 196 KEIFESCVCKLM--ANKTRILVTSKMEHL 222 (282) T ss_pred HHHHHHHHHHHH--CCCEEEEEECCHHHH T ss_conf 999999999986--899899993788889 No 203 >PRK09700 D-allose transporter ATP-binding protein; Provisional Probab=97.25 E-value=0.013 Score=37.85 Aligned_cols=31 Identities=16% Similarity=0.236 Sum_probs=25.2 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 34563587777664399996778440789999999 Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +|+++.. ..+.++-|.|||-+||||++|.++ T Consensus 280 ~~vsf~v----~~GEi~gl~G~nGsGKsTL~~~l~ 310 (510) T PRK09700 280 RDISFSV----CRGEILGFAGLVGSGRTELMNCLF 310 (510) T ss_pred EEEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 4335787----488189997688862889999981 No 204 >pfam01695 IstB IstB-like ATP binding protein. This protein contains an ATP/GTP binding P-loop motif. It is found associated with IS21 family insertion sequences. The function of this protein is unknown, but it may perform a transposase function. Probab=97.22 E-value=0.015 Score=37.57 Aligned_cols=101 Identities=24% Similarity=0.298 Sum_probs=65.3 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEEC-CCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 999967784407899999999999997198530353206822105676523-7661138532899999999999958998 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGS-ADNLASGRSTFMVEMIETASILNQATNQ 730 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa-~D~l~~g~STF~vEm~e~~~IL~~at~~ 730 (920) =++++||...|||-+.-.+|.-.+. -|.-| +|.++.. -+.+....+. .+....++.-.+- T Consensus 49 Nlll~G~~GtGKThLA~Ai~~~~~~--~g~~v------------~f~~~~~L~~~l~~~~~~-----~~~~~~l~~~~~~ 109 (178) T pfam01695 49 NLLLLGPPGVGKTHLACALGHQACR--AGYSV------------LFTRTPDLVEQLKRARGD-----GRLARTLQRLAKA 109 (178) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHH--CCCEE------------EEEECHHHHHHHHHHHHC-----CCHHHHHHHHHCC T ss_conf 6899899998789999999999998--69859------------999616799999987526-----7499999996258 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 5699932588988056799999999999972698499974875 Q gi|254780750|r 731 SFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 731 SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) .|+|+||+|.-..+..+..+-+-++++-.++ + .++|+|.++ T Consensus 110 dlLIiDDlG~~~~s~~~~~~lf~li~~Rye~-~-stIiTSN~~ 150 (178) T pfam01695 110 DLLILDDIGYLPLSQEAAHLLFELISDRYER-R-STILTSNLP 150 (178) T ss_pred CEEEEEHHCCCCCCHHHHHHHHHHHHHHHCC-C-CEEEECCCC T ss_conf 9788720016568989999999999999756-8-868776899 No 205 >TIGR01189 ccmA heme ABC exporter, ATP-binding protein CcmA; InterPro: IPR005895 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains the cytochrome c biogenesis protein encoded by ccmA in bacteria and one arabidopsis protein, possibly encoded by an organelle. Bacterial c-type cytochromes are located on the periplasmic side of the cytoplasmic membrane. Several gene products encoded in a locus designated as 'ccm' are implicated in the transport and assembly of the functional cytochrome C. This cluster includes genes: ccmA;B;C;D;E;F;G and H. The post-translational pathway includes the transport of a heme moiety, the secretion of the apoprotein and the covalent attachment of the heme with the apoprotein. The proteins ccmA and B represent an ABC transporter; ccmC and D participate in heme transfer to ccmE, which functions as a periplasmic heme chaperone. The presence of ccmF, G and H is suggested to be obligatory for the final functional assembly of cytochrome c.; GO: 0005215 transporter activity, 0017004 cytochrome complex assembly, 0030288 outer membrane-bounded periplasmic space. Probab=97.21 E-value=0.00036 Score=49.57 Aligned_cols=124 Identities=27% Similarity=0.280 Sum_probs=64.6 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH-----------CCCCCCHHH-CCCCCCCEEEEEE Q ss_conf 0444345635877776643999967784407899999999999997-----------198530353-2068221056765 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-----------MGSYVPASY-AHIGIVDKLFSRV 700 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQ-----------iG~fVPA~~-a~i~~~D~IftRi 700 (920) -|-+-+++++ .+.+++|+|||-+||||+||.+|=+..-.. -.-.-|.+. +-+|=-|.|=... T Consensus 15 ~f~~l~F~l~------aGe~l~v~GpNG~GKTtLLR~LAGL~~P~~G~v~~~~~~~~~~~~~~~~~~~YlGH~~GlK~~L 88 (204) T TIGR01189 15 LFEGLSFTLN------AGEALQVEGPNGIGKTTLLRILAGLLRPESGEVRLNGTALAEQRDEPHENLLYLGHLDGLKPEL 88 (204) T ss_pred EECCCEEEEE------CCCEEEEEECCCCCHHHHHHHHHHCCCCCCCEEEECCCCCHHHCCCHHHHHHHHHHHCCCCCCC T ss_conf 3011234540------7827898606987357899999850588665575288224211564778788885100345013 Q ss_pred ECCCCCCCCCCHHHHHHHH---------------------HHHHHH----------------HCCCCCEEEEECCCCCCC Q ss_conf 2376611385328999999---------------------999999----------------589985699932588988 Q gi|254780750|r 701 GSADNLASGRSTFMVEMIE---------------------TASILN----------------QATNQSFVILDEIGRGTA 743 (920) Q Consensus 701 Ga~D~l~~g~STF~vEm~e---------------------~~~IL~----------------~at~~SLVllDElGrGTs 743 (920) -+..|| +|..+... .+.+.- --|.+-|=||||- |- T Consensus 89 sa~ENL-----~F~~~~~~stCs~~~~~~~~AL~~vgL~g~e~~p~~~LSAGQqRRlaLARL~l~~~PlWiLDEP---~t 160 (204) T TIGR01189 89 SALENL-----QFWAAIHGSTCSSEQLTIEDALAAVGLTGLEDLPAAQLSAGQQRRLALARLWLSRAPLWILDEP---TT 160 (204) T ss_pred CHHHHH-----HHHHHHHHCCCCCCCCCHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEEECCC---HH T ss_conf 778879-----9999985201683124779999760843314589874061468999999886337972220365---14 Q ss_pred HHH--HHHHHHHH-HHHHHHHCCCEEEEECCC Q ss_conf 056--79999999-999997269849997487 Q gi|254780750|r 744 TLD--GLSIAWAT-IEYLHETNRCRGLLATHF 772 (920) Q Consensus 744 t~D--G~aiA~ai-le~l~~~~~~~~lfaTHy 772 (920) ..| |+++-.++ -.|+ .+ |=.+|.|||- T Consensus 161 ALD~~Gv~~l~~~~~~H~-~r-GG~vlltTH~ 190 (204) T TIGR01189 161 ALDKAGVALLAALLAAHL-AR-GGIVLLTTHQ 190 (204) T ss_pred HCCHHHHHHHHHHHHHHH-HC-CCCEEEECCC T ss_conf 306899999999999998-60-0513663475 No 206 >PRK13645 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=97.16 E-value=0.0035 Score=42.19 Aligned_cols=53 Identities=19% Similarity=0.178 Sum_probs=34.9 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 89985699932588988056799999999999972698499974875797-6643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |..-.++|+||--.|=++.- ..--+..++.|.+..+..++++||.-+.. .+++ T Consensus 166 a~~P~iLilDEPTagLDp~~-~~~i~~ll~~L~~~~g~Tvi~vtHdm~~v~~~aD 219 (289) T PRK13645 166 AMDGNTLVLDEPTGGLDPKG-EEDFINLFERLNKEYKKRIIMVTHNMDQVLRIAD 219 (289) T ss_pred HCCCCEEEEECCCCCCCHHH-HHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHCC T ss_conf 63999999958876489899-9999999999999569999999159999999799 No 207 >KOG0927 consensus Probab=97.16 E-value=0.025 Score=35.79 Aligned_cols=144 Identities=21% Similarity=0.209 Sum_probs=83.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHH Q ss_conf 44345635877776643999967784407899999999999997198530353206822105676523766113853289 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFM 714 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~ 714 (920) +-+++.+|.+- +.|+. +-|||..||||+||.+-- -+.-.+|.-.|-..+.++.|.+= .+-.=++...-++|| T Consensus 405 iy~~l~fgid~---~srvA-lVGPNG~GKsTLlKl~~g-dl~p~~G~vs~~~H~~~~~y~Qh---~~e~ldl~~s~le~~ 476 (614) T KOG0927 405 IYKKLNFGIDL---DSRVA-LVGPNGAGKSTLLKLITG-DLQPTIGMVSRHSHNKLPRYNQH---LAEQLDLDKSSLEFM 476 (614) T ss_pred HHHHHHCCCCC---CCCEE-EECCCCCCHHHHHHHHHH-CCCCCCCCCCCCCCCCCHHHHHH---HHHHCCCCHHHHHHH T ss_conf 55420114575---54224-766898764666788750-34666551242223320023355---676417515478989 Q ss_pred HH-------HHHHHHHHHHC-----------------------------CCCCEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 99-------99999999958-----------------------------9985699932588988056799999999999 Q gi|254780750|r 715 VE-------MIETASILNQA-----------------------------TNQSFVILDEIGRGTATLDGLSIAWATIEYL 758 (920) Q Consensus 715 vE-------m~e~~~IL~~a-----------------------------t~~SLVllDElGrGTst~DG~aiA~aile~l 758 (920) .+ ..+...||... +.--|+||||---|-++.-=.++|-|+= T Consensus 477 ~~~~~~~~~~e~~r~ilgrfgLtgd~q~~p~~~LS~Gqr~rVlFa~l~~kqP~lLlLDEPtnhLDi~tid~laeaiN--- 553 (614) T KOG0927 477 MPKFPDEKELEEMRSILGRFGLTGDAQVVPMSQLSDGQRRRVLFARLAVKQPHLLLLDEPTNHLDIETIDALAEAIN--- 553 (614) T ss_pred HHHCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHH--- T ss_conf 97625646299999999871877521013364426100014789998843884798548876788506999999985--- Q ss_pred HHHCCCEEEEECCCHHH-HHHHHHCCCEEEEEEE Q ss_conf 97269849997487579-7664306885899999 Q gi|254780750|r 759 HETNRCRGLLATHFHEL-TDLSKSLKRFHNATLQ 791 (920) Q Consensus 759 ~~~~~~~~lfaTHy~eL-~~l~~~~~~v~n~~~~ 791 (920) +..| -+++.+|..-| ...+++.+-+.|+.+. T Consensus 554 -e~~G-gvv~vSHDfrlI~qVaeEi~~c~~~~~~ 585 (614) T KOG0927 554 -EFPG-GVVLVSHDFRLISQVAEEIWVCENGTVT 585 (614) T ss_pred -CCCC-CEEEEECHHHHHHHHHHHHHHHCCCCEE T ss_conf -2678-5155321234899887776701058366 No 208 >cd03249 ABC_MTABC3_MDL1_MDL2 MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another. Probab=97.15 E-value=0.009 Score=39.14 Aligned_cols=34 Identities=29% Similarity=0.252 Sum_probs=25.4 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 44434563587777664399996778440789999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .|=+|++|.. ..+...-|.||+-+||||+++.+. T Consensus 17 ~vL~~isl~i----~~G~~iaIvG~sGsGKSTLl~ll~ 50 (238) T cd03249 17 PILKGLSLTI----PPGKTVALVGSSGCGKSTVVSLLE 50 (238) T ss_pred EEECCEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 5222558997----699999999999998999999982 No 209 >COG1122 CbiO ABC-type cobalt transport system, ATPase component [Inorganic ion transport and metabolism] Probab=97.10 E-value=0.0035 Score=42.17 Aligned_cols=49 Identities=22% Similarity=0.192 Sum_probs=31.1 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 99856999325889880567999999999999726984999748757976 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) ..-.++|+||-..|=++. |..-....+..|....+...+.+||..+... T Consensus 155 ~~P~iliLDEPta~LD~~-~~~~l~~~l~~L~~~~~~tiii~tHd~~~~~ 203 (235) T COG1122 155 MGPEILLLDEPTAGLDPK-GRRELLELLKKLKEEGGKTIIIVTHDLELVL 203 (235) T ss_pred CCCCEEEECCCCCCCCHH-HHHHHHHHHHHHHHCCCCEEEEEECCHHHHH T ss_conf 189899974998898978-9999999999988607976999947478988 No 210 >cd03252 ABCC_Hemolysin The ABC-transporter hemolysin B is a central component of the secretion machinery that translocates the toxin, hemolysin A, in a Sec-independent fashion across both membranes of E. coli. The hemolysin A (HlyA) transport machinery is composed of the ATP-binding cassette (ABC) transporter HlyB located in the inner membrane, hemolysin D (HlyD), also anchored in the inner membrane, and TolC, which resides in the outer membrane. HlyD apparently forms a continuous channel that bridges the entire periplasm, interacting with TolC and HlyB. This arrangement prevents the appearance of periplasmic intermediates of HlyA during substrate transport. Little is known about the molecular details of HlyA transport, but it is evident that ATP-hydrolysis by the ABC-transporter HlyB is a necessary source of energy. Probab=97.10 E-value=0.015 Score=37.57 Aligned_cols=36 Identities=25% Similarity=0.365 Sum_probs=27.4 Q ss_pred CCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 7044434563587777664399996778440789999999 Q gi|254780750|r 632 KPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 632 ~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .++|=+++++.. ..+.+.-|.||+-+||||++|.+. T Consensus 14 ~~~~L~~is~~i----~~G~~vaivG~sGsGKSTll~ll~ 49 (237) T cd03252 14 GPVILDNISLRI----KPGEVVGIVGRSGSGKSTLTKLIQ 49 (237) T ss_pred CCCCEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHH T ss_conf 957251508998----799999999999985999999996 No 211 >pfam00154 RecA recA bacterial DNA recombination protein. RecA is a DNA-dependent ATPase and functions in DNA repair systems. RecA protein catalyses an ATP-dependent DNA strand-exchange reaction that is the central step in the repair of dsDNA breaks by homologous recombination. Probab=97.09 E-value=0.0069 Score=40.01 Aligned_cols=130 Identities=20% Similarity=0.260 Sum_probs=71.5 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHH-HHHHHHHHHHHH-CCC---CCCHHHCCCCCCCEEEEEEEC-CCCCCCCC Q ss_conf 345635877776643999967784407899-999999999997-198---530353206822105676523-76611385 Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTF-LRQNALIVIMAQ-MGS---YVPASYAHIGIVDKLFSRVGS-ADNLASGR 710 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~-lRqval~vilAQ-iG~---fVPA~~a~i~~~D~IftRiGa-~D~l~~g~ 710 (920) -|.-||..+ -..+|+..|.||..+||||+ +..++- || .|. |+-|+.|-=+ .-...+|- .|++.--+ T Consensus 40 lD~aLg~GG-lP~GRi~ei~G~essGKTtlal~~ia~----aQk~gg~~~~iD~E~a~d~---~~a~~lGVD~~~l~~~q 111 (322) T pfam00154 40 LDIALGIGG-LPKGRIIEIYGPESSGKTTLALHAIAE----AQKAGGTAAFIDAEHALDP---VYAKKLGVDIDNLLVSQ 111 (322) T ss_pred HHHHHCCCC-CCCCEEEEEECCCCCCHHHHHHHHHHH----HHHCCCEEEEEEHHHHCCH---HHHHHCCCCHHHEEEEC T ss_conf 999875899-778708999889877789999999999----9734993899853660598---89998098802538977 Q ss_pred CHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH----------H-----HHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 328999999999999589985699932588988056----------7-----999999999999726984999748757 Q gi|254780750|r 711 STFMVEMIETASILNQATNQSFVILDEIGRGTATLD----------G-----LSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 711 STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~D----------G-----~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) ....-++.++..-|-+...-.||++|=+|-=....+ | ++-|+-.+...+.+.+|..+|..|..+ T Consensus 112 pd~~Eqal~i~~~li~~~~~~liViDSvaal~p~~E~e~~~~d~~~g~~Ar~ms~alRklt~~l~k~~~~~IfiNQ~R~ 190 (322) T pfam00154 112 PDTGEQALEIADMLVRSGAVDLIVVDSVAALVPKAEIEGEMGDSHVGLQARLMSQALRKLTGNISKSNTTVIFINQIRM 190 (322) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEHH T ss_conf 8839999999999853799765998253456768887524322321357999999999999997305854999765511 No 212 >COG3845 ABC-type uncharacterized transport systems, ATPase components [General function prediction only] Probab=97.03 E-value=0.012 Score=38.14 Aligned_cols=54 Identities=20% Similarity=0.258 Sum_probs=33.7 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHH---HCCCEEEEECCCHHHHHHHHHCCCE Q ss_conf 998569993258898805679999999999997---2698499974875797664306885 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHE---TNRCRGLLATHFHELTDLSKSLKRF 785 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~---~~~~~~lfaTHy~eL~~l~~~~~~v 785 (920) ....|+|...--||-+-- |++ .|-+.|.+ ..++-.|+++-..|+-.+++...-+ T Consensus 420 ~~p~lLI~~qPTrGLDvg---A~~-~I~~~l~e~r~~G~AVLLiS~dLDEil~lsDrIaVi 476 (501) T COG3845 420 RRPDLLIAAQPTRGLDVG---AIE-FIHERLLELRDAGKAVLLISEDLDEILELSDRIAVI 476 (501) T ss_pred CCCCEEEECCCCCCCCHH---HHH-HHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHEEEEE T ss_conf 599889986878664689---999-999999999866987999961078999752144100 No 213 >PRK06526 transposase; Provisional Probab=97.01 E-value=0.023 Score=36.12 Aligned_cols=95 Identities=16% Similarity=0.206 Sum_probs=65.5 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHH------HHHHHHH Q ss_conf 9996778440789999999999999719853035320682210567652376611385328999999------9999995 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIE------TASILNQ 726 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e------~~~IL~~ 726 (920) +||+||-..|||-+.-.+|.-++.+ |.-| .|+++. +| +-++.+ ....++. T Consensus 101 vil~G~~GtGKThLA~Alg~~A~~~--G~~v------------~f~~~~---~L-------~~~L~~a~~~g~~~~~~~~ 156 (254) T PRK06526 101 VVFLGPPGTGKTHLAIGLGIRACQA--GHRV------------LFATAA---QW-------VARLAAAHHAGRLQDELVK 156 (254) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHC--CCCE------------EEEEHH---HH-------HHHHHHHHHCCCHHHHHHH T ss_conf 8998999986899999999999986--9967------------998779---99-------9999998855809999998 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 89985699932588988056799999999999972698499974875 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) -.+-.|+|+||+|----+.+|..+-+.+++.-.++ ++ ++++|.++ T Consensus 157 l~~~dLLIiDe~g~~~~~~~~a~~lf~li~~Rye~-~S-~IiTSn~~ 201 (254) T PRK06526 157 LGRIPLLIVDEVGYIPFEAEAANLFFQLVSSRYER-AS-LIVTSNKP 201 (254) T ss_pred HHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHCC-CC-EEEECCCC T ss_conf 51368776502136447889999999999999745-88-67665898 No 214 >cd03262 ABC_HisP_GlnQ_permeases HisP and GlnQ are the ATP-binding components of the bacterial periplasmic histidine and glutamine permeases, repectively. Histidine permease is a multisubunit complex containing the HisQ and HisM integral membrane subunits and two copies of HisP. HisP has properties intermediate between those of integral and peripheral membrane proteins and is accessible from both sides of the membrane, presumably by its interaction with HisQ and HisM. The two HisP subunits form a homodimer within the complex. The domain structure of the amino acid uptake systems is typical for prokaryote extracellular solute binding protein-dependent uptake systems. All of the amino acid uptake systems also have at least one, and in a few cases, two extracellular solute binding proteins located in the periplasm of Gram-negative bacteria, or attached to the cell membrane of Gram-positive bacteria. The best-studied member of the PAAT (polar amino acid transport) family is the HisJQM Probab=97.00 E-value=0.014 Score=37.71 Aligned_cols=138 Identities=23% Similarity=0.294 Sum_probs=70.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH-----CCCCCCHHHC-------CCCCC---CEEEEE Q ss_conf 44345635877776643999967784407899999999999997-----1985303532-------06822---105676 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-----MGSYVPASYA-------HIGIV---DKLFSR 699 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQ-----iG~fVPA~~a-------~i~~~---D~IftR 699 (920) |=+|++|.. ..+.+..|.||.-+||||+||.++-+.--.. -|--|..... ++|.+ -.+|-. T Consensus 15 vL~~vsl~i----~~Ge~~~ivGpSGsGKSTLL~~i~gL~~p~~G~i~i~g~~i~~~~~~~~~~rr~iG~VFQ~~~L~p~ 90 (213) T cd03262 15 VLKGIDLTV----KKGEVVVIIGPSGSGKSTLLRCINLLEEPDSGTIIIDGLKLTDDKKNINELRQKVGMVFQQFNLFPH 90 (213) T ss_pred EECCEEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCHHHHHHHHCCCEEEECCCCCCCC T ss_conf 886707598----8998999999998449999999981999986499999999999815699986782799679875899 Q ss_pred EECCCCCCC------CCCHHHHHHHHHH----------------------------HHHH-HCCCCCEEEEECCCCCCCH Q ss_conf 523766113------8532899999999----------------------------9999-5899856999325889880 Q gi|254780750|r 700 VGSADNLAS------GRSTFMVEMIETA----------------------------SILN-QATNQSFVILDEIGRGTAT 744 (920) Q Consensus 700 iGa~D~l~~------g~STF~vEm~e~~----------------------------~IL~-~at~~SLVllDElGrGTst 744 (920) +-+.||+.- |.|. .|..+.+ .|-| -|++-.++|+||--.+=++ T Consensus 91 ltv~eNV~~~l~~~~~~~~--~e~~~~a~~~L~~vgL~~~~~~~P~~LSGGqqQRVAIARALa~~P~ilL~DEPts~LD~ 168 (213) T cd03262 91 LTVLENITLAPIKVKGMSK--AEAEERALELLEKVGLADKADAYPAQLSGGQQQRVAIARALAMNPKVMLFDEPTSALDP 168 (213) T ss_pred CCHHHHHHHHHHHHCCCCH--HHHHHHHHHHHHHCCCHHHHCCCHHHCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCCH T ss_conf 9199999999999769999--99999999999868997887499444692999999999996379999997088887798 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 56799999999999972698499974875797-6643 Q gi|254780750|r 745 LDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 745 ~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) .-...| ...+..|.+. +..++++||..+.+ .+++ T Consensus 169 ~~~~~i-~~ll~~l~~~-g~T~i~VTHD~~~a~~~aD 203 (213) T cd03262 169 ELVGEV-LDVMKDLAEE-GMTMVVVTHEMGFAREVAD 203 (213) T ss_pred HHHHHH-HHHHHHHHHC-CCEEEEECCCHHHHHHHCC T ss_conf 999999-9999999862-9999999989999999689 No 215 >TIGR02982 heterocyst_DevA ABC exporter ATP-binding subunit, DevA family; InterPro: IPR014324 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP-binding subunit DevA, found mostly in the Cyanobacteria, but also in the Planctomycetes. Cyanobacterial examples are involved in heterocyst formation, by which some fraction of members of the colony undergo a developmental change and become capable of nitrogen fixation. The ABC transporter encoded by the devBCA operon is induced by nitrogen deficiency and is necessary for the formation of the laminated layer which envelops heterocysts , . It is thought to be involved in the export of either the heterocyst-specific glycolipids found in the laminated layer, or an enzyme essential for their formation.. Probab=96.98 E-value=0.0069 Score=40.01 Aligned_cols=134 Identities=23% Similarity=0.358 Sum_probs=83.1 Q ss_pred CCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEE------------- Q ss_conf 7044434563587777664399996778440789999999999999719853035320682210567------------- Q gi|254780750|r 632 KPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFS------------- 698 (920) Q Consensus 632 ~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~Ift------------- 698 (920) ..=|=.||+|.. +.+-+-|+|||=-|||||+|-- ||+-=-+.+-.+.++++=.. T Consensus 17 rkQvL~di~L~i----~~GEiViltGPSGSGKTTLLtL---------iG~LR~~Q~G~L~vlg~~L~ga~~~~l~~~RR~ 83 (220) T TIGR02982 17 RKQVLFDINLEI----NPGEIVILTGPSGSGKTTLLTL---------IGGLRSVQEGSLKVLGQELKGASKKELVQVRRN 83 (220) T ss_pred CCEEECCCCEEE----CCCEEEEEECCCCCCHHHHHHH---------HHHHCCCCCCEEEEECCHHCCCCHHHHHHHHHH T ss_conf 210012763177----1764798437889846889998---------876256555604782201026788899999876 Q ss_pred ------------EEECCCCCCCCC----CHHHHHHHHH-HHHHHH----------------------------CCCCCEE Q ss_conf ------------652376611385----3289999999-999995----------------------------8998569 Q gi|254780750|r 699 ------------RVGSADNLASGR----STFMVEMIET-ASILNQ----------------------------ATNQSFV 733 (920) Q Consensus 699 ------------RiGa~D~l~~g~----STF~vEm~e~-~~IL~~----------------------------at~~SLV 733 (920) -+-|..|+..+. .....|+.+. ..+|.. ...-.|| T Consensus 84 iGyIFQ~HNLl~~LTA~QNVqM~~eL~~~~~~~~~~~~a~~~L~~VGL~~~~~y~P~~LSGGQKQRVAIARALv~~P~Lv 163 (220) T TIGR02982 84 IGYIFQAHNLLGFLTARQNVQMALELQPNLSAQEAREKARAMLEAVGLGDRLDYYPENLSGGQKQRVAIARALVARPKLV 163 (220) T ss_pred CCCEECCCHHCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCEE T ss_conf 39144120001000177888648988761168899999999998606012554052436786168999999973389767 Q ss_pred EEECCCCCCCHHHH---HHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHC Q ss_conf 99325889880567---99999999999972698499974875797664306 Q gi|254780750|r 734 ILDEIGRGTATLDG---LSIAWATIEYLHETNRCRGLLATHFHELTDLSKSL 782 (920) Q Consensus 734 llDElGrGTst~DG---~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~ 782 (920) |=||- |+.-|. -.+ .-++..|...-+|-.|..||.+.+-++++.. T Consensus 164 LADEP---TAALD~~SGr~V-V~Lm~~lA~eqGc~iL~VTHD~RIlDvADRI 211 (220) T TIGR02982 164 LADEP---TAALDSKSGRDV-VELMQKLAREQGCTILLVTHDNRILDVADRI 211 (220) T ss_pred ECCCC---CHHHHHCCCHHH-HHHHHHHHHHHCCEEEEEECCCCHHHHHHHH T ss_conf 62577---233221133899-9999998877198899983673120065444 No 216 >PRK13657 cyclic beta-1,2-glucan ABC transporter; Provisional Probab=96.97 E-value=0.025 Score=35.88 Aligned_cols=32 Identities=25% Similarity=0.436 Sum_probs=24.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHH Q ss_conf 443456358777766439999677844078999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqv 670 (920) |=+|+++.. ..+...-|.||+-+||||+++-+ T Consensus 350 vL~~isl~i----~~Ge~vaiVG~SGsGKSTL~~LL 381 (585) T PRK13657 350 AVEDVSFEA----KPGQTVAIVGPTGAGKSTLINLL 381 (585) T ss_pred CCCCCEEEE----CCCCEEEEECCCCCCHHHHHHHH T ss_conf 536703897----59988999889898699999998 No 217 >pfam03266 DUF265 Protein of unknown function, DUF265. Probab=96.97 E-value=0.019 Score=36.75 Aligned_cols=134 Identities=25% Similarity=0.340 Sum_probs=69.5 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHH----C-CCCCC---HHHCCCC--CCC------EEEEEEECCCCCCCCC-CHHHH Q ss_conf 99967784407899999999999997----1-98530---3532068--221------0567652376611385-32899 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQ----M-GSYVP---ASYAHIG--IVD------KLFSRVGSADNLASGR-STFMV 715 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQ----i-G~fVP---A~~a~i~--~~D------~IftRiGa~D~l~~g~-STF~v 715 (920) .+||||--.||||+++-++= .|.. + |.+.| -..-++| ++| .+++|++..-..--|+ ..+.- T Consensus 2 i~ITG~pGvGKTTli~kv~~--~l~~~~~~v~GF~T~evre~g~R~GF~iv~l~~g~~~~la~~~~~~~~~vGky~v~~~ 79 (168) T pfam03266 2 IFITGPPGVGKTTLVKKVIE--LLKSEGVKVGGFYTPEVREGGRRIGFDIVDLASGERGPLARVGGVSGPRVGKYVVNLE 79 (168) T ss_pred EEEECCCCCCHHHHHHHHHH--HHHHCCCEEEEEEECCEECCCCEEEEEEEECCCCCEEEEEEECCCCCCCCCCCEECHH T ss_conf 89978999889999999999--9986797074899302125893789999990478267744406887754577166689 Q ss_pred HHHHH-HHHHHHC-CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC---HHHHHHHHHCCCEEEEEE Q ss_conf 99999-9999958-998569993258898805679999999999997269849997487---579766430688589999 Q gi|254780750|r 716 EMIET-ASILNQA-TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF---HELTDLSKSLKRFHNATL 790 (920) Q Consensus 716 Em~e~-~~IL~~a-t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy---~eL~~l~~~~~~v~n~~~ 790 (920) +..+. ..+|++| ....|+++||+|+=--. .-....+|.+-| +..+ .+|.+=|. |.+.+--...+.+.-+++ T Consensus 80 ~fe~~~~~~L~~a~~~~dlivIDEIG~mEl~--s~~F~~~v~~~l-~~~~-~vl~ti~~~~~~~~v~~i~~~~d~~i~~v 155 (168) T pfam03266 80 EFEEIALPALRRALEEADLIIIDEIGPMELK--SPKFREAIEEVL-SSNK-PVLAVVHRRSDSPLVERIRRRPDVKIFVV 155 (168) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCCHHHHH--HHHHHHHHHHHH-CCCC-CEEEEEEECCCCHHHHHHHCCCCCEEEEE T ss_conf 9999999999840668989999763145331--499999999996-6999-79999972589838999741799389997 Q ss_pred EE Q ss_conf 99 Q gi|254780750|r 791 QV 792 (920) Q Consensus 791 ~~ 792 (920) .. T Consensus 156 t~ 157 (168) T pfam03266 156 TE 157 (168) T ss_pred CH T ss_conf 86 No 218 >PRK11153 metN DL-methionine transporter ATP-binding subunit; Provisional Probab=96.94 E-value=0.019 Score=36.65 Aligned_cols=49 Identities=20% Similarity=0.178 Sum_probs=32.5 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 89985699932588988056799999999999972698499974875797 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) |+.-.++|+||--.|=++.--..| +..+..|.+..+-..+|.||.-+.. T Consensus 156 a~~P~iLl~DEPTsaLDp~t~~~I-l~lL~~l~~e~g~TivlITHdm~~v 204 (343) T PRK11153 156 ASNPKVLLCDEATSALDPATTRSI-LELLKDINRRLGLTILLITHEMDVV 204 (343) T ss_pred HCCCCEEEEECCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHH T ss_conf 669999999288765899999999-9999999996198999988899999 No 219 >cd03288 ABCC_SUR2 The SUR domain 2. The sulfonylurea receptor SUR is an ATP binding cassette (ABC) protein of the ABCC/MRP family. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=96.94 E-value=0.029 Score=35.40 Aligned_cols=35 Identities=14% Similarity=0.061 Sum_probs=27.0 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 044434563587777664399996778440789999999 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .+|=++++|.. ..+....|.|||-+||||++|.+. T Consensus 34 ~~vL~~inl~I----~~Ge~vaIvG~sGsGKSTL~~ll~ 68 (257) T cd03288 34 KPVLKHVKAYI----KPGQKVGICGRTGSGKSSLSLAFF 68 (257) T ss_pred CCCEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 57310538998----799999999999981999999996 No 220 >PRK09354 recA recombinase A; Provisional Probab=96.91 E-value=0.017 Score=37.06 Aligned_cols=137 Identities=18% Similarity=0.216 Sum_probs=79.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHH-HHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEEC-CCCCCCCCCH Q ss_conf 44345635877776643999967784407899-999999999997198530353206822105676523-7661138532 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTF-LRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGS-ADNLASGRST 712 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~-lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa-~D~l~~g~ST 712 (920) ..-|+-||..+ -..+|+.-|.||..+||||+ |..+|-.+=+--+-.|+-|+.|-=+-+ ..++|- .|++.--+.. T Consensus 46 l~LD~aLGiGG-~P~GRivEi~G~esSGKTtlal~~iaeaQk~Gg~~a~iDaE~ald~~~---a~~lGVd~d~llv~qpd 121 (350) T PRK09354 46 LALDIALGIGG-LPKGRIVEIYGPESSGKTTLALHAIAEAQKAGGTAAFIDAEHALDPVY---AKKLGVDIDNLLVSQPD 121 (350) T ss_pred HHHHHHHCCCC-CCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHH---HHHCCCCHHHEEEECCC T ss_conf 78999875899-678708999889877799999999999997599479996000279889---99849771571785686 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH----------H-----HHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 8999999999999589985699932588988056----------7-----9999999999997269849997487579 Q gi|254780750|r 713 FMVEMIETASILNQATNQSFVILDEIGRGTATLD----------G-----LSIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 713 F~vEm~e~~~IL~~at~~SLVllDElGrGTst~D----------G-----~aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) ..-++.|+..-|-..+.-.+|++|=++-=+...+ | ++-|.-.+...+.+.+|.++|..|..+= T Consensus 122 ~~Eqal~i~e~Lvrsg~vd~IVvDSVaAL~pk~Eieg~mgd~~vG~qARlmSqalRKlt~~i~ks~t~~IfINQlR~k 199 (350) T PRK09354 122 TGEQALEIADALVRSGAVDLIVVDSVAALVPKAEIEGEMGDSHVGLQARLMSQALRKLTGNINKSNTTVIFINQIRMK 199 (350) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEHHH T ss_conf 799999999999854884189982533457688873133542263899999999999999985578289997432321 No 221 >PRK11160 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=96.91 E-value=0.011 Score=38.64 Aligned_cols=34 Identities=26% Similarity=0.260 Sum_probs=25.0 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 44434563587777664399996778440789999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .|=+|+++.. ..+...-|+||.-+||||+++-++ T Consensus 355 ~vL~~isl~i----~~Ge~vaiVG~SGsGKSTL~~LL~ 388 (575) T PRK11160 355 PVLKGLSLQI----KAGEKVALLGRTGCGKSTLLQLLT 388 (575) T ss_pred CCCCCEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 7636715897----699889998899975999999986 No 222 >PRK10261 glutathione transporter ATP-binding protein; Provisional Probab=96.86 E-value=0.013 Score=37.99 Aligned_cols=32 Identities=25% Similarity=0.206 Sum_probs=25.0 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 434563587777664399996778440789999999 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) =+|+++.. ..+.++-|-|+|-+||||++|.++ T Consensus 340 v~~vsf~i----~~GE~l~lvG~sGsGKSTl~r~l~ 371 (623) T PRK10261 340 VEKVSFDL----WPGETLSLVGESGSGKSTTGRALL 371 (623) T ss_pred EECCCCCC----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 52340035----899589997678766899999985 No 223 >cd00009 AAA The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases. Probab=96.86 E-value=0.016 Score=37.24 Aligned_cols=108 Identities=23% Similarity=0.189 Sum_probs=54.6 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCC--CCCCCHHHHHHHHHHHHHHHC Q ss_conf 439999677844078999999999999971985303532068221056765237661--138532899999999999958 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNL--ASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l--~~g~STF~vEm~e~~~IL~~a 727 (920) .+-++|+||--.|||++.|.+|-..- .-| .-|..+.++|.. ..+.+-+.........-.... T Consensus 19 ~~~ill~GppGtGKT~la~~ia~~~~--~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (151) T cd00009 19 PKNLLLYGPPGTGKTTLARAIANELF--RPG--------------APFLYLNASDLLEGLVVAELFGHFLVRLLFELAEK 82 (151) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHC--CCC--------------CCEEEEEHHHCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98089989999886599999999712--137--------------98278547770467777576057788989999997 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHH----HCCCEEEEECCCHHH Q ss_conf 998569993258898805679999999999997----269849997487579 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHE----TNRCRGLLATHFHEL 775 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~----~~~~~~lfaTHy~eL 775 (920) ++.+.+++||+.+=.. ...-+ ...+++.+.. ...+.++++|+..+. T Consensus 83 ~~~~vl~iDEi~~l~~-~~~~~-~~~~l~~~~~~~~~~~~~~vI~~tn~~~~ 132 (151) T cd00009 83 AKPGVLFIDEIDSLSR-GAQNA-LLRVLETLNDLRIDRENVRVIGATNRPLL 132 (151) T ss_pred CCCCEEEEECHHHCCH-HHHHH-HHHHHHHHCCCCCCCCCEEEEEEECCCCC T ss_conf 6998698201665599-99999-99999871575406788899995289988 No 224 >PRK10418 nikD nickel transporter ATP-binding protein; Provisional Probab=96.83 E-value=0.038 Score=34.47 Aligned_cols=34 Identities=21% Similarity=0.258 Sum_probs=26.9 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 44434563587777664399996778440789999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .|=+||++.. ..+.+.-|-|||-+||||++|++. T Consensus 17 ~vL~~Isl~v----~~Ge~~aiiG~SGsGKStl~k~ll 50 (254) T PRK10418 17 PLVHGVSLTL----QRGRVLALVGGSGSGKSLTCAAAL 50 (254) T ss_pred EEEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH T ss_conf 0886607289----899999999999878999999995 No 225 >KOG0064 consensus Probab=96.81 E-value=0.011 Score=38.61 Aligned_cols=44 Identities=30% Similarity=0.332 Sum_probs=30.1 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEE Q ss_conf 664399996778440789999999999999719853035320682210567 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFS 698 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~Ift 698 (920) +.+.-++|||||.+|||++-|-.|=+ -+|-.....+|.-++||- T Consensus 506 ~~G~hLLItGPNGCGKSSLfRILggL-------WPvy~g~L~~P~~~~mFY 549 (728) T KOG0064 506 EPGMHLLITGPNGCGKSSLFRILGGL-------WPVYNGLLSIPRPNNIFY 549 (728) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHCC-------CCCCCCEEECCCCCCEEE T ss_conf 58826998789976588999998644-------723277563489763685 No 226 >PRK10982 galactose/methyl galaxtoside transporter ATP-binding protein; Provisional Probab=96.79 E-value=0.014 Score=37.67 Aligned_cols=31 Identities=10% Similarity=0.183 Sum_probs=24.1 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 34563587777664399996778440789999999 Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +|+++.. ..+.++-|.|||-+||||++|.++ T Consensus 265 ~~vsf~v----~~GEivgl~G~nGsGKsTL~~~l~ 295 (491) T PRK10982 265 RDVSFDL----HKGEILGIAGLVGAKRTDIVETLF 295 (491) T ss_pred EEEEEEE----ECCCEEEEECCCCCCHHHHHHHHH T ss_conf 0267999----689689977899997889999981 No 227 >COG2401 ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only] Probab=96.75 E-value=0.017 Score=37.05 Aligned_cols=137 Identities=23% Similarity=0.332 Sum_probs=83.3 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC-------------------CCCHHH-CCCC- Q ss_conf 0444345635877776643999967784407899999999999997198-------------------530353-2068- Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS-------------------YVPASY-AHIG- 691 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~-------------------fVPA~~-a~i~- 691 (920) .|+--|++|.. ..+.+-+|+|+-.+||||++|.+.=...- .|- .+|-+. .+++ T Consensus 396 ryvlr~vNL~i----kpGdvvaVvGqSGaGKttllRmi~G~~~~--~~ee~y~p~sg~v~~p~nt~~a~iPge~Ep~f~~ 469 (593) T COG2401 396 RYVLRNLNLEI----KPGDVVAVVGQSGAGKTTLLRMILGAQKG--RGEEKYRPDSGKVEVPKNTVSALIPGEYEPEFGE 469 (593) T ss_pred EEEEECEEEEE----CCCCEEEEEECCCCCCCHHHHHHHHHHHC--CCCCCCCCCCCCEECCCCCHHHCCCCCCCCCCCC T ss_conf 66640203686----47876899924887731199999877643--5620247877721034431321067655544573 Q ss_pred --CCCE-------------EEEEEECCCCC-----CCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf --2210-------------56765237661-----138532899999999999958998569993258898805679999 Q gi|254780750|r 692 --IVDK-------------LFSRVGSADNL-----ASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIA 751 (920) Q Consensus 692 --~~D~-------------IftRiGa~D~l-----~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA 751 (920) +.++ |..|.|.+|-. +...||=.-|=.-+|..+. ..-++.++||++.--++....-+| T Consensus 470 ~tilehl~s~tGD~~~AveILnraGlsDAvlyRr~f~ELStGQKeR~KLAklla--erpn~~~iDEF~AhLD~~TA~rVA 547 (593) T COG2401 470 VTILEHLRSKTGDLNAAVEILNRAGLSDAVLYRRKFSELSTGQKERAKLAKLLA--ERPNVLLIDEFAAHLDELTAVRVA 547 (593) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHCCCHHHHHHCCHHHCCCCHHHHHHHHHHHH--CCCCCEEHHHHHHHCCHHHHHHHH T ss_conf 118998752368636789999760453054300467553854577789999973--489817735666431779999999 Q ss_pred HHHHHHHHHHCCCEEEEECCCHHHHHH Q ss_conf 999999997269849997487579766 Q gi|254780750|r 752 WATIEYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 752 ~aile~l~~~~~~~~lfaTHy~eL~~l 778 (920) ..+-+ |....+...+.+||-.|+.+- T Consensus 548 rkise-lARe~giTlivvThrpEv~~A 573 (593) T COG2401 548 RKISE-LAREAGITLIVVTHRPEVGNA 573 (593) T ss_pred HHHHH-HHHHHCCEEEEEECCHHHHHC T ss_conf 99999-999709739999648777744 No 228 >PRK10419 nikE nickel transporter ATP-binding protein; Provisional Probab=96.74 E-value=0.054 Score=33.33 Aligned_cols=51 Identities=24% Similarity=0.260 Sum_probs=33.9 Q ss_pred CCCCCEEEEECCCCCCCHHHHHH--HHHHHHHHHHHHCCCEEEEECCCHHHHH-HHH Q ss_conf 89985699932588988056799--9999999999726984999748757976-643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLS--IAWATIEYLHETNRCRGLLATHFHELTD-LSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~a--iA~aile~l~~~~~~~~lfaTHy~eL~~-l~~ 780 (920) +.+-.|+|+||- ||..|-.. --...+..|.+..++-.+|.||.-.+.. +++ T Consensus 167 ~~~P~lLi~DEP---tsaLD~~~q~~il~ll~~l~~~~g~t~i~ITHDl~~a~~~ad 220 (266) T PRK10419 167 AVEPKLLILDEA---VSNLDLVLQAGVIRLLKKLQQQFGTACLFITHDLRLVERFCQ 220 (266) T ss_pred CCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 069878999688---653699999999999999999759899998899999999689 No 229 >PRK08116 hypothetical protein; Validated Probab=96.73 E-value=0.058 Score=33.10 Aligned_cols=113 Identities=22% Similarity=0.227 Sum_probs=64.9 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHH--HHHHHHHHHHH Q ss_conf 6439999677844078999999999999971985303532068221056765237661138532899--99999999995 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMV--EMIETASILNQ 726 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~v--Em~e~~~IL~~ 726 (920) .++-++++||-..|||-+.=.+|--. +. -|-.| +|+.+. |=+..=+++|-- +..+ ..+++. T Consensus 107 ~~~GLll~G~~GtGKThLa~aIa~~l-~~-~g~~V------------~~~~~~--~ll~~lk~~~~~~~~~~~-~e~l~~ 169 (262) T PRK08116 107 ESVGLLLWGSPGNGKTYLAAAIANEL-IE-KGVPV------------VFVNVP--ELLNRIKSTYNSEGKEDE-NEIIRA 169 (262) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHH-HH-CCCEE------------EEEEHH--HHHHHHHHHHHCCCCHHH-HHHHHH T ss_conf 68618998989998999999999999-98-79939------------998899--999999999863561019-999998 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 899856999325889880567999999999999726984999748757976643 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) ...-.|+|||+||..-.|.-+....+.|+.+-.... -.|+|+|-+. +.+|.+ T Consensus 170 l~~~dLLIiDDlG~e~~t~w~~e~lf~IIn~Ry~~~-kptIiTTNl~-~~eL~~ 221 (262) T PRK08116 170 LDNADLLILDDLGAEKDTEWVREKLYNIIDSRYRKG-LPTIFTTNLS-LEELKN 221 (262) T ss_pred HHCCCEEEEEHHCCCCCCHHHHHHHHHHHHHHHHCC-CCEEEECCCC-HHHHHH T ss_conf 612998998322145698789999999999999769-9989987999-999999 No 230 >TIGR00954 3a01203 Peroxysomal long chain fatty acyl transporter; InterPro: IPR005283 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). The members of this family are integral membrane proteins and they are involved in the import of activated long-chain fatty acids from the cytosol to the peroxisomal matrix. ; GO: 0005524 ATP binding, 0006810 transport, 0016021 integral to membrane. Probab=96.70 E-value=0.0014 Score=45.19 Aligned_cols=50 Identities=36% Similarity=0.423 Sum_probs=36.4 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH-HHH----------HCC-CCCCHHH Q ss_conf 434563587777664399996778440789999999999-999----------719-8530353 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV-IMA----------QMG-SYVPASY 687 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v-ilA----------QiG-~fVPA~~ 687 (920) +|||.|. .-.++++=+||+|||.+|||++=|-+|=+= +++ -|| ..+|++. T Consensus 544 andiklP--flqGsG~~lLi~GPNGCGKSSLFRiLGeLWP~~g~~nknhqsklimG~Lt~P~~~ 605 (788) T TIGR00954 544 ANDIKLP--FLQGSGNHLLICGPNGCGKSSLFRILGELWPVYGKNNKNHQSKLIMGRLTKPAKG 605 (788) T ss_pred HCCCCCC--CCCCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCEEECCEECCCCCC T ss_conf 3024355--1215887668768899864789999864302357897443320344410558888 No 231 >PRK13545 tagH teichoic acids export protein ATP-binding subunit; Provisional Probab=96.68 E-value=0.0073 Score=39.83 Aligned_cols=24 Identities=13% Similarity=0.116 Sum_probs=12.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHH Q ss_conf 999980882220079999999861 Q gi|254780750|r 43 LVFYRMGDFYELFFDDALLASRCL 66 (920) Q Consensus 43 il~fr~G~FYE~f~~DA~~aa~~L 66 (920) |-|=-+.+.|.+|.....-.-.+| T Consensus 5 V~~~nVsK~y~ly~~~~drLk~~f 28 (549) T PRK13545 5 VKFEHVTKKYKLYNKPFDKLKDLF 28 (549) T ss_pred EEEEECCEEEECCCCHHHHHHHHH T ss_conf 999852325665689678999996 No 232 >COG4133 CcmA ABC-type transport system involved in cytochrome c biogenesis, ATPase component [Posttranslational modification, protein turnover, chaperones] Probab=96.68 E-value=0.0023 Score=43.59 Aligned_cols=34 Identities=38% Similarity=0.467 Sum_probs=25.9 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 0444345635877776643999967784407899999999 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval 672 (920) .|-+-++.++ .+.+..|||||-+||||+||.+|= T Consensus 17 lf~~L~f~l~------~Ge~~~i~G~NG~GKTtLLRilaG 50 (209) T COG4133 17 LFSDLSFTLN------AGEALQITGPNGAGKTTLLRILAG 50 (209) T ss_pred EECCEEEEEC------CCCEEEEECCCCCCHHHHHHHHHC T ss_conf 2123047874------887799989998758899999971 No 233 >COG4988 CydD ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=96.66 E-value=0.064 Score=32.80 Aligned_cols=81 Identities=22% Similarity=0.197 Sum_probs=45.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC-CC-CCEECCCCCCEEEC--C--CCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCC Q ss_conf 988889998677887788877525-88-51210478714000--0--468058876310287704443456358777766 Q gi|254780750|r 576 LDNASQVIAIIDISIALAILAKEQ-NY-CRPIIDNSTNFIVK--D--GRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNS 649 (920) Q Consensus 576 l~~~~~~ia~lD~l~SlA~~a~~~-~y-~rP~i~~~~~l~i~--~--gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~ 649 (920) +....+..|-.|.+.+|-.--... +. ..+.....+.+++. + -++| .++.+++|++++. .. T Consensus 281 fH~~~~g~aa~d~i~~~l~~~~~~~~~~~~~~~~~~~~~ei~~~~l~~~y~----------~g~~~l~~l~~t~----~~ 346 (559) T COG4988 281 FHAAAAGEAAADKLFTLLESPVATPGSGEKAEVANEPPIEISLENLSFRYP----------DGKPALSDLNLTI----KA 346 (559) T ss_pred HHHHHHCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEECCEEEECC----------CCCCCCCCCEEEE----CC T ss_conf 999850366898999972598777887643213568974666021478558----------9985566710675----48 Q ss_pred CEEEEEECCCCCCHHHHHHHH Q ss_conf 439999677844078999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqv 670 (920) ++...|.||+-+||||++.-+ T Consensus 347 g~~talvG~SGaGKSTLl~lL 367 (559) T COG4988 347 GQLTALVGASGAGKSTLLNLL 367 (559) T ss_pred CCEEEEECCCCCCHHHHHHHH T ss_conf 967999889999789999998 No 234 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=96.66 E-value=0.013 Score=38.07 Aligned_cols=34 Identities=26% Similarity=0.435 Sum_probs=26.8 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 43456358777766439999677844078999999999 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~ 673 (920) =+|++|.. ..+-+..|-||.-+||||+||.++=+ T Consensus 40 V~dvsl~I----~~GEi~~lvGpSGsGKSTLLr~i~GL 73 (382) T TIGR03415 40 VANASLDI----EEGEICVLMGLSGSGKSSLLRAVNGL 73 (382) T ss_pred EECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCC T ss_conf 96517488----79989999999973499999999759 No 235 >COG4178 ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only] Probab=96.59 E-value=0.03 Score=35.28 Aligned_cols=197 Identities=19% Similarity=0.257 Sum_probs=87.5 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCEEC-------CC--CCCEEECC--CCCHHHHHHHH Q ss_conf 7888765430112579888899986778877888775--258851210-------47--87140000--46805887631 Q gi|254780750|r 561 SFEILSNAIIEQSESLDNASQVIAIIDISIALAILAK--EQNYCRPII-------DN--STNFIVKD--GRHPIVEKTLK 627 (920) Q Consensus 561 ~~~~l~~~i~~~~~~l~~~~~~ia~lD~l~SlA~~a~--~~~y~rP~i-------~~--~~~l~i~~--gRHPviE~~l~ 627 (920) .|..+...+.-+......+++.-|.++=+.+|...=. +.+.++|.- ++ ...+.+++ .++| T Consensus 331 aF~~v~sslswfi~~~~~ia~~rA~~~Rl~~f~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~nl~l~~p------- 403 (604) T COG4178 331 AFGQVHSSLSWFIDNYDAIADWRATLLRLAEFRQALEAAQMDTEKPARTGRRIDFDDNADHGITLENLSLRTP------- 403 (604) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEECC------- T ss_conf 9999998878999876889999999999999999998514760125765431000124565058854367779------- Q ss_pred HCCCCCEE-EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH------CC---CCCCHH-HCCCCCC-CE Q ss_conf 02877044-4345635877776643999967784407899999999999997------19---853035-3206822-10 Q gi|254780750|r 628 QQSSKPFI-ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ------MG---SYVPAS-YAHIGIV-DK 695 (920) Q Consensus 628 ~~~~~~fV-pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQ------iG---~fVPA~-~a~i~~~-D~ 695 (920) .+...| +.++.+++ +.=++|+||+-+||||++|.+|=+==-++ -| .|+|=. ++.-|-. |. T Consensus 404 --~~~~ll~~l~~~v~~------G~~llI~G~SG~GKTsLlRaiaGLWP~g~G~I~~P~~~~~lflpQ~PY~p~GtLre~ 475 (604) T COG4178 404 --DGQTLLSELNFEVRP------GERLLITGESGAGKTSLLRALAGLWPWGSGRISMPADSALLFLPQRPYLPQGTLREA 475 (604) T ss_pred --CCCEEECCCEEEECC------CCEEEEECCCCCCHHHHHHHHHCCCCCCCCCEECCCCCCEEEECCCCCCCCCCHHHH T ss_conf --987421465265479------987998789998788999999645856787441689875577148877787658999 Q ss_pred EE-----EEEECCC-CCC-----CCCCHHHHHHHH-----------------HHHHHHHCCCCCEEEEECCCCCCCHHHH Q ss_conf 56-----7652376-611-----385328999999-----------------9999995899856999325889880567 Q gi|254780750|r 696 LF-----SRVGSAD-NLA-----SGRSTFMVEMIE-----------------TASILNQATNQSFVILDEIGRGTATLDG 747 (920) Q Consensus 696 If-----tRiGa~D-~l~-----~g~STF~vEm~e-----------------~~~IL~~at~~SLVllDElGrGTst~DG 747 (920) |. .+++ | .+. -|.--+.-.|.+ .|.||= ++-..|+|||- ||..|. T Consensus 476 l~YP~~~~~~~--d~~l~~vL~~vgL~~L~~rl~~~~~W~~vLS~GEqQRlafARilL--~kP~~v~LDEA---TsALDe 548 (604) T COG4178 476 LCYPNAAPDFS--DAELVAVLHKVGLGDLAERLDEEDRWDRVLSGGEQQRLAFARLLL--HKPKWVFLDEA---TSALDE 548 (604) T ss_pred HHCCCCCCCCC--HHHHHHHHHHCCCHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHH--CCCCEEEEECC---HHCCCH T ss_conf 80899977799--599999999819198999873327576645852789999999997--09998998060---112595 Q ss_pred HHHHHHHHHHHHHH-CCCEEEEECCCHHHHHHHH Q ss_conf 99999999999972-6984999748757976643 Q gi|254780750|r 748 LSIAWATIEYLHET-NRCRGLLATHFHELTDLSK 780 (920) Q Consensus 748 ~aiA~aile~l~~~-~~~~~lfaTHy~eL~~l~~ 780 (920) -+-. ...+-|-+. .++-.+...|=+.|..+.. T Consensus 549 ~~e~-~l~q~l~~~lp~~tvISV~Hr~tl~~~h~ 581 (604) T COG4178 549 ETED-RLYQLLKEELPDATVISVGHRPTLWNFHS 581 (604) T ss_pred HHHH-HHHHHHHHHCCCCEEEEECCCHHHHHHHH T ss_conf 7899-99999985489978999556000578875 No 236 >COG1136 SalX ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms] Probab=96.59 E-value=0.031 Score=35.13 Aligned_cols=51 Identities=22% Similarity=0.289 Sum_probs=34.1 Q ss_pred CCCCCEEEEECCCCCCCHHH---HHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHH Q ss_conf 89985699932588988056---79999999999997269849997487579766430 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLD---GLSIAWATIEYLHETNRCRGLLATHFHELTDLSKS 781 (920) Q Consensus 727 at~~SLVllDElGrGTst~D---G~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~ 781 (920) +.+-.++|.||- |...| |-.+ ...+..+.+..+.-++++||.++++..++. T Consensus 158 ~~~P~iilADEP---TgnLD~~t~~~V-~~ll~~~~~~~g~tii~VTHD~~lA~~~dr 211 (226) T COG1136 158 INNPKIILADEP---TGNLDSKTAKEV-LELLRELNKERGKTIIMVTHDPELAKYADR 211 (226) T ss_pred HCCCCEEEEECC---CCCCCHHHHHHH-HHHHHHHHHHCCCEEEEECCCHHHHHHCCE T ss_conf 249986996076---665886789999-999999987469899999089899974898 No 237 >cd03294 ABC_Pro_Gly_Bertaine This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=96.54 E-value=0.025 Score=35.82 Aligned_cols=138 Identities=21% Similarity=0.267 Sum_probs=69.5 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC------C---CHHH------CCCCCC---CE Q ss_conf 4443456358777766439999677844078999999999999971985------3---0353------206822---10 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY------V---PASY------AHIGIV---DK 695 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f------V---PA~~------a~i~~~---D~ 695 (920) -+=+|++|.. ..+.+..|-||.-+||||+||.++-+.- ..-|.- | |.+. -.++.+ .. T Consensus 38 ~aL~~vsl~i----~~GE~~~ivG~SGsGKSTLLr~i~GL~~-p~~G~I~~~G~~i~~~~~~~l~~~r~~~igmVFQ~~a 112 (269) T cd03294 38 VGVNDVSLDV----REGEIFVIMGLSGSGKSTLLRCINRLIE-PTSGKVLIDGQDIAAMSRKELRELRRKKISMVFQSFA 112 (269) T ss_pred EEECCCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCCC-CCCEEEEECCEECCCCCHHHHHHHHCCCEEEEEECCC T ss_conf 7977747588----8999999998998489999999975999-9975999999999999989998852564699961575 Q ss_pred EEEEEECCCCCCCC-----CCHH-----HHHHHHH---HH------------------HH-HHCCCCCEEEEECCCCCCC Q ss_conf 56765237661138-----5328-----9999999---99------------------99-9589985699932588988 Q gi|254780750|r 696 LFSRVGSADNLASG-----RSTF-----MVEMIET---AS------------------IL-NQATNQSFVILDEIGRGTA 743 (920) Q Consensus 696 IftRiGa~D~l~~g-----~STF-----~vEm~e~---~~------------------IL-~~at~~SLVllDElGrGTs 743 (920) +|-.+-..||+.-| .|.= ..|+.+. +. |- --|.+-.++|+||- || T Consensus 113 L~P~ltV~eNV~~~L~~~~~~~~e~~~rv~e~L~~vgL~~~~~~~P~qLSGGq~QRVaIARALa~~P~iLLlDEP---ts 189 (269) T cd03294 113 LLPHRTVLENVAFGLEVQGVPRAEREERAAEALELVGLEGWEHKYPDELSGGMQQRVGLARALAVDPDILLMDEA---FS 189 (269) T ss_pred CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHHCHHHHHHHHHHHHHHCCCCEEEECCC---CC T ss_conf 476787999986888852899789999999999867986777569678494888899999998639989997587---54 Q ss_pred HHHH---HHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 0567---99999999999972698499974875797-6643 Q gi|254780750|r 744 TLDG---LSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 744 t~DG---~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) ..|- ..| ...+..|.+..+..++|.||..+.+ .+++ T Consensus 190 aLD~~~~~~i-~~~l~~l~~~~~~T~i~VTHD~~eA~~laD 229 (269) T cd03294 190 ALDPLIRREM-QDELLRLQAELQKTIVFITHDLDEALRLGD 229 (269) T ss_pred CCCHHHHHHH-HHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 2599999999-999999999749999999998999999799 No 238 >COG1129 MglA ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism] Probab=96.50 E-value=0.081 Score=32.03 Aligned_cols=54 Identities=26% Similarity=0.313 Sum_probs=40.8 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHH Q ss_conf 8998569993258898805679999999999997269849997487579766430 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKS 781 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~ 781 (920) ++.-.++|+||--||-+---=.- -|.++..|.....+..++++-.+||..+.+. T Consensus 417 ~~~p~vLilDEPTRGIDVGAK~e-Iy~li~~lA~~G~ail~iSSElpEll~~~DR 470 (500) T COG1129 417 ATDPKVLILDEPTRGIDVGAKAE-IYRLIRELAAEGKAILMISSELPELLGLSDR 470 (500) T ss_pred HHCCCEEEECCCCCCCCCCHHHH-HHHHHHHHHHCCCEEEEEECCHHHHHHHCCE T ss_conf 75899999889987754145899-9999999997799899994975998840977 No 239 >KOG0927 consensus Probab=96.49 E-value=0.021 Score=36.43 Aligned_cols=38 Identities=18% Similarity=0.155 Sum_probs=25.2 Q ss_pred CEEEEEC-------CCCCCCCHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 1545412-------7877641278763012343378999986310 Q gi|254780750|r 299 NLEILRT-------LSGSREQSLLKTIDYSITGAGGRLFAERIAS 336 (920) Q Consensus 299 nLEI~~~-------~~g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~ 336 (920) .|||... .+|+.+.+++..+-.|-+|.-.++--.-+.+ T Consensus 95 ~~El~~g~rygLiG~nG~Gkst~L~~i~~~e~P~p~~~d~y~ls~ 139 (614) T KOG0927 95 TLELNRGRRYGLIGPNGSGKSTFLRAIAGREVPIPEHIDFYLLSR 139 (614) T ss_pred EEEECCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCHHHHCC T ss_conf 478627864899767997376899887537789984201333136 No 240 >TIGR02858 spore_III_AA stage III sporulation protein AA; InterPro: IPR014217 Proteins in this entry include the stage III sporulation protein AA that is encoded by one of several genes in the spoIIIA locus. This protein is only found in species that are capable of endospore formation.. Probab=96.48 E-value=0.016 Score=37.21 Aligned_cols=129 Identities=28% Similarity=0.426 Sum_probs=75.4 Q ss_pred ECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH--HHHCCCCCCHHHCCCC Q ss_conf 00046805887631028770444345635877776643999967784407899999999999--9971985303532068 Q gi|254780750|r 614 VKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI--MAQMGSYVPASYAHIG 691 (920) Q Consensus 614 i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi--lAQiG~fVPA~~a~i~ 691 (920) +.++=-++++.+...+. .+-| .||=||=-+||||+||=+|=+.= .+|+|. ...+.+ T Consensus 105 ~~G~A~~~~~yL~d~~~---~~~N---------------TLiIsPPq~GKTTlLRDlaR~~StG~~~~~~----~g~KVg 162 (282) T TIGR02858 105 VLGAADKILPYLVDRNG---RVLN---------------TLIISPPQCGKTTLLRDLARILSTGISKLGL----KGKKVG 162 (282) T ss_pred CCCCCHHHHHHHHCCCC---CEEE---------------EEEECCCCCCCCCHHHHHHHHHHCCCCCCCC----CCCCEE T ss_conf 05775666887730589---4467---------------8888688988510488898886078542468----997469 Q ss_pred CCCE---EEE--------EEECCCCCCCC--CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 2210---567--------65237661138--5328999999999999589985699932588988056799999999999 Q gi|254780750|r 692 IVDK---LFS--------RVGSADNLASG--RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYL 758 (920) Q Consensus 692 ~~D~---Ift--------RiGa~D~l~~g--~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l 758 (920) ++|- |=. .+|..=|++-+ +|==| | -++|..+|. .|+.||||| +.| ..|++|.+ T Consensus 163 ivDERSEIAgC~~GvPQ~~vG~RtDVLD~CPKAEGm--M----M~iRSMSP~-Viv~DEIGr---~ED----~~Al~eA~ 228 (282) T TIGR02858 163 IVDERSEIAGCVNGVPQLDVGIRTDVLDGCPKAEGM--M----MLIRSMSPD-VIVVDEIGR---EED----VEALLEAL 228 (282) T ss_pred EEECCHHHHHHCCCCCCCCCCCCEEECCCCCHHHHH--H----HHHHCCCCC-EEEEECCCC---HHH----HHHHHHHH T ss_conf 984324656545882414467606751788537899--9----999706985-799814889---533----89999986 Q ss_pred HHHCCCEEEEECCCHHHHHHHH Q ss_conf 9726984999748757976643 Q gi|254780750|r 759 HETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 759 ~~~~~~~~lfaTHy~eL~~l~~ 780 (920) |. |-..+.+-|=+.+-++.+ T Consensus 229 na--GV~~I~TaHg~~~~Dl~k 248 (282) T TIGR02858 229 NA--GVSVIATAHGRDLEDLKK 248 (282) T ss_pred CC--CCEEEEEECCCCHHHHHC T ss_conf 16--756887640488126650 No 241 >TIGR01842 type_I_sec_PrtD type I secretion system ATPase; InterPro: IPR010128 Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N terminus, but rather carry signals located toward the extreme C terminus to direct type I secretion.; GO: 0005524 ATP binding, 0015031 protein transport, 0016021 integral to membrane. Probab=96.47 E-value=0.042 Score=34.18 Aligned_cols=147 Identities=27% Similarity=0.352 Sum_probs=87.4 Q ss_pred CCCEEEC--CCCCHHHHHHHHHCCCCCE----EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 8714000--0468058876310287704----443456358777766439999677844078999999999999971985 Q gi|254780750|r 609 STNFIVK--DGRHPIVEKTLKQQSSKPF----IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY 682 (920) Q Consensus 609 ~~~l~i~--~gRHPviE~~l~~~~~~~f----VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f 682 (920) ++.|.+. ...||-=++ =..+. +=++|++. -..+.++-|=||=.+||||+.|.+ .|.- T Consensus 318 ~G~L~vE~v~~~PP~~~~-----WsqPivPk~~l~gi~F~----~~aGe~laIIGPSgSGKStLaR~~--------vG~W 380 (556) T TIGR01842 318 EGHLSVENVTIVPPGGKK-----WSQPIVPKPTLRGISFA----IQAGEALAIIGPSGSGKSTLARIL--------VGIW 380 (556) T ss_pred CCEEEEEEEEEECCCCCC-----CCCCCCCHHHHCCCCEE----ECCCCEEEEECCCCCCHHHHHHHH--------HHCC T ss_conf 636888776510786313-----57897761422786215----637745888747865258898788--------7210 Q ss_pred CCHH------HCCCCCCCE---------------EE--------EEEECCCCCCCCCCHHHHHHHHHHHH-------HH- Q ss_conf 3035------320682210---------------56--------76523766113853289999999999-------99- Q gi|254780750|r 683 VPAS------YAHIGIVDK---------------LF--------SRVGSADNLASGRSTFMVEMIETASI-------LN- 725 (920) Q Consensus 683 VPA~------~a~i~~~D~---------------If--------tRiGa~D~l~~g~STF~vEm~e~~~I-------L~- 725 (920) -|+. .|.|-=+|+ +| +|+| ||... -+..|.|.+ |+ T Consensus 381 ~~~~G~VRLDGadl~qWD~e~lG~~iGYLPQdvELF~GTva~NIARF~--en~d~------~~iieAAklAGvHElIl~l 452 (556) T TIGR01842 381 PPASGSVRLDGADLKQWDRETLGKHIGYLPQDVELFSGTVAENIARFG--ENADP------EKIIEAAKLAGVHELILRL 452 (556) T ss_pred CCCCCCEEEEHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHHCCCCC--CCCCH------HHHHHHHHHHCHHHHHHCC T ss_conf 135653364033440237536588015479850507676764024468--87887------8999999760303575169 Q ss_pred --------------------------HC--CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf --------------------------58--99856999325889880567999999999999726984999748757976 Q gi|254780750|r 726 --------------------------QA--TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 726 --------------------------~a--t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) .| ..=.||+|||-=.-=++.==.|++.|+.+ + ++-||.+++.||=+.+.. T Consensus 453 P~GYDT~iG~~G~~LSGGQRQRIaLARAlyG~P~lvvLDEPNsNLD~~GE~AL~~Ai~~-l-K~rg~tvv~itHRp~lL~ 530 (556) T TIGR01842 453 PDGYDTDIGEGGATLSGGQRQRIALARALYGDPKLVVLDEPNSNLDEEGEQALANAIKA-L-KARGITVVVITHRPSLLG 530 (556) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHH-H-HHCCCEEEEEECHHHHHH T ss_conf 68854431377777861468999999987179837873288987661789999999999-9-867972899841068999 Q ss_pred HHHHC Q ss_conf 64306 Q gi|254780750|r 778 LSKSL 782 (920) Q Consensus 778 l~~~~ 782 (920) ..+.. T Consensus 531 ~vDkI 535 (556) T TIGR01842 531 LVDKI 535 (556) T ss_pred HHHHH T ss_conf 99999 No 242 >COG3839 MalK ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism] Probab=96.46 E-value=0.0036 Score=42.10 Aligned_cols=131 Identities=28% Similarity=0.356 Sum_probs=73.6 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEE---------------- Q ss_conf 4434563587777664399996778440789999999999999719853035320682210567---------------- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFS---------------- 698 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~Ift---------------- 698 (920) +=++++|.. .++.+++|-||+-+||||+||.+| |.--|- +-+|.+-++..| T Consensus 18 ~l~~i~l~i----~~Gef~vllGPSGcGKSTlLr~IA--------GLe~p~-~G~I~i~g~~vt~l~P~~R~iamVFQ~y 84 (338) T COG3839 18 VLKDVNLDI----EDGEFVVLLGPSGCGKSTLLRMIA--------GLEEPT-SGEILIDGRDVTDLPPEKRGIAMVFQNY 84 (338) T ss_pred EEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHH--------CCCCCC-CCEEEECCEECCCCCHHHCCEEEEECCC T ss_conf 563326897----479799998999888899999996--------887788-7159999999998995578889993783 Q ss_pred ----EEECCCCCCCCCCH-------HHHHHHHHHHHH------HH-------------------CCCCCEEEEECCCCCC Q ss_conf ----65237661138532-------899999999999------95-------------------8998569993258898 Q gi|254780750|r 699 ----RVGSADNLASGRST-------FMVEMIETASIL------NQ-------------------ATNQSFVILDEIGRGT 742 (920) Q Consensus 699 ----RiGa~D~l~~g~ST-------F~vEm~e~~~IL------~~-------------------at~~SLVllDElGrGT 742 (920) .|-..|||.-|.-. ---...|++.+| +. ..+-+++|+||- . T Consensus 85 ALyPhMtV~~Niaf~Lk~~~~~k~ei~~rV~eva~~L~l~~lL~r~P~~LSGGQrQRVAlaRAlVr~P~v~L~DEP---l 161 (338) T COG3839 85 ALYPHMTVYENIAFGLKLRGVPKAEIDKRVKEVAKLLGLEHLLNRKPLQLSGGQRQRVALARALVRKPKVFLLDEP---L 161 (338) T ss_pred CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCC---H T ss_conf 0157876999734166447995688899999999873986677359011772567899998777547887884476---4 Q ss_pred CHHHHH--HHHHHHHHHHHHHCCCEEEEECCCH-HHHHHHHH Q ss_conf 805679--9999999999972698499974875-79766430 Q gi|254780750|r 743 ATLDGL--SIAWATIEYLHETNRCRGLLATHFH-ELTDLSKS 781 (920) Q Consensus 743 st~DG~--aiA~aile~l~~~~~~~~lfaTHy~-eL~~l~~~ 781 (920) |..|-- .-..+.+..|++..+..+++.||.. |.-.+++. T Consensus 162 SnLDaklR~~mr~eik~l~~~l~~T~IYVTHDq~EAmtladr 203 (338) T COG3839 162 SNLDAKLRVLMRSEIKKLHERLGTTTIYVTHDQVEAMTLADR 203 (338) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHCCE T ss_conf 676599999999999999986098489980899999840887 No 243 >PRK08181 transposase; Validated Probab=96.42 E-value=0.089 Score=31.71 Aligned_cols=96 Identities=28% Similarity=0.327 Sum_probs=63.4 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCC---CC--CCCCCHHHHHHHHHHHHHHHC Q ss_conf 9996778440789999999999999719853035320682210567652376---61--138532899999999999958 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSAD---NL--ASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D---~l--~~g~STF~vEm~e~~~IL~~a 727 (920) +|++||-..|||-+--.+|..++. -|.- -.|+++. | .+ .+...++ ...++.- T Consensus 109 vil~Gp~GtGKThLA~Alg~~A~~--~G~~------------V~f~~~~--~L~~~L~~a~~~~~~-------~~~~~~l 165 (269) T PRK08181 109 LLLFGPPGGGKSHLAAAIGLALIE--NGWR------------VLFTRTT--DLVQKLQVARRELQL-------ESAIAKL 165 (269) T ss_pred EEEECCCCCCHHHHHHHHHHHHHH--CCCE------------EEEEEHH--HHHHHHHHHHHCCCH-------HHHHHHH T ss_conf 899899998788999999999998--7993------------9997899--999999997755839-------9999997 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 9985699932588988056799999999999972698499974875 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) .+-.|+||||+|--.-+.+|..+-+-++..-+++ + -++++|.++ T Consensus 166 ~~~dLLIiDe~G~~~~~~~~~~~lf~lI~~Rye~-~-S~IITSn~~ 209 (269) T PRK08181 166 DKFDLLILDDLAYVTKDQAETSVLFELISARYER-R-SILITANQP 209 (269) T ss_pred HCCCEEEEHHCCCCCCCHHHHHHHHHHHHHHHCC-C-CEEEECCCC T ss_conf 4446012201056679989999999999998578-8-889988999 No 244 >COG0410 LivF ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism] Probab=96.33 E-value=0.017 Score=37.06 Aligned_cols=130 Identities=26% Similarity=0.339 Sum_probs=68.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC---------C-CHHHCCCCCC-----CEEEEE Q ss_conf 443456358777766439999677844078999999999999971985---------3-0353206822-----105676 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY---------V-PASYAHIGIV-----DKLFSR 699 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f---------V-PA~~a~i~~~-----D~IftR 699 (920) |=.++++.. ..+.+.-+-|||-+||||+||+++=++=- .-|.- + |.+.++.|+. .+||.+ T Consensus 18 ~L~gvsl~v----~~Geiv~llG~NGaGKTTlLkti~Gl~~~-~~G~I~~~G~dit~~p~~~r~r~Gi~~VPegR~iF~~ 92 (237) T COG0410 18 ALRGVSLEV----ERGEIVALLGRNGAGKTTLLKTIMGLVRP-RSGRIIFDGEDITGLPPHERARLGIAYVPEGRRIFPR 92 (237) T ss_pred EEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCC-CCEEEEECCEECCCCCHHHHHHCCEEECCCCCCCHHH T ss_conf 885110587----68988999899988889999998589878-8706998983567799789985776867521361000 Q ss_pred EECCCCCCCCCCHHH-------------------HH---------------HHHHHHHHHHCCCCCEEEEECCCCCCCHH Q ss_conf 523766113853289-------------------99---------------99999999958998569993258898805 Q gi|254780750|r 700 VGSADNLASGRSTFM-------------------VE---------------MIETASILNQATNQSFVILDEIGRGTATL 745 (920) Q Consensus 700 iGa~D~l~~g~STF~-------------------vE---------------m~e~~~IL~~at~~SLVllDElGrGTst~ 745 (920) +--.+||..|..+.- .| |.-++..|- +.-.|.||||-.-|-+|. T Consensus 93 LTVeENL~~g~~~~~~~~~~~~~~e~v~~lFP~Lker~~~~aG~LSGGEQQMLAiaRALm--~~PklLLLDEPs~GLaP~ 170 (237) T COG0410 93 LTVEENLLLGAYARRDKEAQERDLEEVYELFPRLKERRNQRAGTLSGGEQQMLAIARALM--SRPKLLLLDEPSEGLAPK 170 (237) T ss_pred CCHHHHHHHHHHCCCCCCCCCCCHHHHHHHCHHHHHHHHCCCCCCCHHHHHHHHHHHHHH--CCCCEEEECCCCCCCCHH T ss_conf 759998742310245310001138999997863898840844677819999999999996--199889965886676889 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 679999999999997269849997487 Q gi|254780750|r 746 DGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 746 DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) ==--| ..++..|.+..+--.|.+-.+ T Consensus 171 iv~~I-~~~i~~l~~~~g~tIlLVEQn 196 (237) T COG0410 171 IVEEI-FEAIKELRKEGGMTILLVEQN 196 (237) T ss_pred HHHHH-HHHHHHHHHCCCCEEEEEECC T ss_conf 99999-999999997489489999425 No 245 >PRK08939 primosomal protein DnaI; Reviewed Probab=96.32 E-value=0.1 Score=31.32 Aligned_cols=105 Identities=20% Similarity=0.266 Sum_probs=62.1 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHH---------H Q ss_conf 6439999677844078999999999999971985303532068221056765237661138532899999---------9 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMI---------E 719 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~---------e 719 (920) ..+-+.|.||...|||-++- |++--||+-|-.| +|..+ ++|+.+|. + T Consensus 156 ~~kGlyl~G~~G~GKTyL~~--aian~La~~g~~v------------~~v~~----------p~~~~~lK~s~~d~s~~~ 211 (306) T PRK08939 156 KVKGLYLYGDFGVGKTYLLA--AIANELAKKGVSS------------TLVHF----------PEFIRELKNAISDGSVKE 211 (306) T ss_pred CCCEEEEECCCCCCHHHHHH--HHHHHHHHCCCEE------------EEEEH----------HHHHHHHHHHHCCCCHHH T ss_conf 88778898999998999999--9999999869929------------99875----------999999999864898899 Q ss_pred HHHHHHHCCCCCEEEEECCCCCCCHHHHH-HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHC Q ss_conf 99999958998569993258898805679-9999999999972698499974875797664306 Q gi|254780750|r 720 TASILNQATNQSFVILDEIGRGTATLDGL-SIAWATIEYLHETNRCRGLLATHFHELTDLSKSL 782 (920) Q Consensus 720 ~~~IL~~at~~SLVllDElGrGTst~DG~-aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~ 782 (920) .-..++++ .+.+||.||..+.|.=-. -+-..|++|=... +-.|+|+|-|. +.+|+..+ T Consensus 212 ~i~~~k~~---~vLiLDDiGaE~~t~W~rd~vl~~IL~~Rm~~-~lPTffTSN~~-~~eLe~~l 270 (306) T PRK08939 212 KIDAVKEA---PVLMLDDIGAEQMSSWVRDEVLGVILQYRMQE-ELPTFFTSNFD-FDELEHHL 270 (306) T ss_pred HHHHHHCC---CEEEEECCCCCCCCHHHHHHHHHHHHHHHHHC-CCCEEEECCCC-HHHHHHHH T ss_conf 99998449---98998444654267778998999999999974-99979977999-99999998 No 246 >KOG0056 consensus Probab=96.32 E-value=0.1 Score=31.30 Aligned_cols=150 Identities=22% Similarity=0.323 Sum_probs=81.5 Q ss_pred CCCCEECCCCCCEEECC---CCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHH-------- Q ss_conf 88512104787140000---4680588763102877044434563587777664399996778440789999-------- Q gi|254780750|r 600 NYCRPIIDNSTNFIVKD---GRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLR-------- 668 (920) Q Consensus 600 ~y~rP~i~~~~~l~i~~---gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lR-------- 668 (920) +-+-|--+..+.++..+ ++.| +.-|--||++.. ..++..-+-||-.|||||.|| T Consensus 526 P~a~pl~~~~G~i~fsnvtF~Y~p-----------~k~vl~disF~v----~pGktvAlVG~SGaGKSTimRlLfRffdv 590 (790) T KOG0056 526 PGAPPLKVTQGKIEFSNVTFAYDP-----------GKPVLSDISFTV----QPGKTVALVGPSGAGKSTIMRLLFRFFDV 590 (790) T ss_pred CCCCCCCCCCCEEEEEEEEEECCC-----------CCCEEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHHHHHC T ss_conf 999973305770899876786489-----------986123214885----69968999778988666899999999405 Q ss_pred -------------HHHHHHHHHHCCCCCCHHHC--CCCCCCEE-EEEEECCCC-CCC----------------CCCHHH- Q ss_conf -------------99999999971985303532--06822105-676523766-113----------------853289- Q gi|254780750|r 669 -------------QNALIVIMAQMGSYVPASYA--HIGIVDKL-FSRVGSADN-LAS----------------GRSTFM- 714 (920) Q Consensus 669 -------------qval~vilAQiG~fVPA~~a--~i~~~D~I-ftRiGa~D~-l~~----------------g~STF~- 714 (920) +|-+..+=.|||- ||-+.. .=++++.| |.|.+|+|+ +.. |.-|=- T Consensus 591 ~sGsI~iDgqdIrnvt~~SLRs~IGV-VPQDtvLFNdTI~yNIryak~~AsneevyaAAkAA~IHdrIl~fPegY~t~VG 669 (790) T KOG0056 591 NSGSITIDGQDIRNVTQSSLRSSIGV-VPQDTVLFNDTILYNIRYAKPSASNEEVYAAAKAAQIHDRILQFPEGYNTRVG 669 (790) T ss_pred CCCEEEECCCHHHHHHHHHHHHHCCC-CCCCCEEECCEEEEHEEECCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 57608986701788879989975683-56751354210000102168899718999999875578988628425543320 Q ss_pred --------HHHHH---HHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf --------99999---9999995899856999325889880567999999999999726984999748 Q gi|254780750|r 715 --------VEMIE---TASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATH 771 (920) Q Consensus 715 --------vEm~e---~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTH 771 (920) -|-.. ...||+ .-|.|++||--..-+|..--+| .|.++.+.. ++ .++..-| T Consensus 670 ERGLkLSGGEKQRVAiARtiLK---~P~iIlLDEATSALDT~tER~I-QaaL~rlca-~R-TtIVvAH 731 (790) T KOG0056 670 ERGLKLSGGEKQRVAIARTILK---APSIILLDEATSALDTNTERAI-QAALARLCA-NR-TTIVVAH 731 (790) T ss_pred HCCCCCCCCCHHHHHHHHHHHC---CCCEEEECCHHHHCCCCCHHHH-HHHHHHHHC-CC-CEEEEEE T ss_conf 0243557750356899999861---8958997130432378628999-999999856-88-5599864 No 247 >TIGR03269 met_CoM_red_A2 methyl coenzyme M reductase system, component A2. The enzyme that catalyzes the final step in methanogenesis, methyl coenzyme M reductase, contains alpha, beta, and gamma chains. In older literature, the complex of alpha, beta, and gamma chains was termed component C, while this single chain protein was termed methyl coenzyme M reductase system component A2. Probab=96.30 E-value=0.052 Score=33.48 Aligned_cols=54 Identities=15% Similarity=0.202 Sum_probs=32.6 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH-HHHHHH Q ss_conf 8998569993258898805679999999999997269849997487579-766430 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHEL-TDLSKS 781 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL-~~l~~~ 781 (920) ++.-.++|+||--+|=+..--..|...+++-. +..+.-++|.||..+. ..+.+. T Consensus 443 ~~~P~vlilDEPT~glD~~~~~~i~~~l~~~~-~~~g~tvi~iShDl~~~~~~~dR 497 (520) T TIGR03269 443 IKEPRIVILDEPTGTMDPITKVDVTHSILKAR-EEMEQTFIIVSHDMDFVLDVCDR 497 (520) T ss_pred HHCCCEEEEECCHHHHHHHHHHHHHHHHHHHH-HHCCCEEEEECCCHHHHHHHCCE T ss_conf 97989899938601133899999999999999-83298999977888999986999 No 248 >PRK09183 transposase/IS protein; Provisional Probab=96.19 E-value=0.046 Score=33.84 Aligned_cols=95 Identities=18% Similarity=0.262 Sum_probs=60.9 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHH------HHHHHH- Q ss_conf 9996778440789999999999999719853035320682210567652376611385328999999------999999- Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIE------TASILN- 725 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e------~~~IL~- 725 (920) +|++||-..|||-+-=.+|..++. -|- .-.|+++. +| +.++.+ ....++ T Consensus 104 vil~G~~GtGKThLA~Alg~~A~~--~G~------------~v~f~~~~---~L-------~~~L~~a~~~~~~~~~l~r 159 (258) T PRK09183 104 IVLLGPSGVGKTHLAIALGYEAVR--AGI------------KVRFTTAA---DL-------LLQLSTAQRQGRYKTTLQR 159 (258) T ss_pred EEEECCCCCCHHHHHHHHHHHHHH--CCC------------EEEEEEHH---HH-------HHHHHHHHHCCCHHHHHHH T ss_conf 799899998689999999999998--799------------39997899---99-------9999999876859999998 Q ss_pred HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 589985699932588988056799999999999972698499974875 Q gi|254780750|r 726 QATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 726 ~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) ....-.|+||||+|--.-+..+..+-+-+++.-.++ + -++++|.++ T Consensus 160 ~l~~~dLLIiDdlG~~~~~~~~~~~lfeli~~Rye~-~-S~IiTSn~~ 205 (258) T PRK09183 160 GVMAPRLLIIDEIGYLPFSQEEANLFFQVIAKRYEK-G-AMILTSNLP 205 (258) T ss_pred HHCCCCEEEEHHHHCCCCCHHHHHHHHHHHHHHHCC-C-CEEEECCCC T ss_conf 743465144313315468888999999999998576-7-789988999 No 249 >pfam03215 Rad17 Rad17 cell cycle checkpoint protein. Probab=96.18 E-value=0.052 Score=33.43 Aligned_cols=34 Identities=18% Similarity=0.272 Sum_probs=24.7 Q ss_pred CCCCCHHHH-HHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 784407899-99999999999719853035320682 Q gi|254780750|r 658 PNMGGKSTF-LRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 658 pNmgGKSt~-lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) |=|| |.|+ ||.-+.|.+.-+||-+.-+++..=|+ T Consensus 449 ~~~~-~~~~~~~~~~~i~~i~~i~~~~~~~~~~~~~ 483 (490) T pfam03215 449 PMIG-AKDYAMLTCPYITIIKDIIDPQRIEYFLSGP 483 (490) T ss_pred HHHH-HHCCCCCCCHHHHHHHHHCCHHHHHHCCCCC T ss_conf 5877-5023112614657998613313333203687 No 250 >cd03258 ABC_MetN_methionine_transporter MetN (also known as YusC) is an ABC-type transporter encoded by metN of the metNPQ operon in Bacillus subtilis that is involved in methionine transport. Other members of this system include the MetP permease and the MetQ substrate binding protein. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=96.18 E-value=0.099 Score=31.35 Aligned_cols=138 Identities=25% Similarity=0.292 Sum_probs=67.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCC------CCH-HH-------CCCCCCC---EEE Q ss_conf 443456358777766439999677844078999999999999971985------303-53-------2068221---056 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSY------VPA-SY-------AHIGIVD---KLF 697 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~f------VPA-~~-------a~i~~~D---~If 697 (920) +=+|++++. ..+.+.-|-||.-+||||++|.++-+. -..-|.- +.. .. -.++.+= ..| T Consensus 20 al~~vsl~i----~~Ge~~~ivG~SGsGKSTllr~i~gL~-~p~sG~I~~~g~~i~~~~~~~~~~~Rr~ig~VFQ~~~L~ 94 (233) T cd03258 20 ALKDVSLSV----PKGEIFGIIGRSGAGKSTLIRCINGLE-RPTSGSVLVDGTDLTLLSGKELRKARRRIGMIFQHFNLL 94 (233) T ss_pred EEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHCCC-CCCCCEEEECCEECCCCCHHHHHHHHCCCCEEECCCCCC T ss_conf 984828899----999999998898058999999996799-999808999999989799999999862587794377889 Q ss_pred EEEECCCCCCC-----CCCH-----HHHHHHHHH---------------------HH-HHHCCCCCEEEEECCCCCCCHH Q ss_conf 76523766113-----8532-----899999999---------------------99-9958998569993258898805 Q gi|254780750|r 698 SRVGSADNLAS-----GRST-----FMVEMIETA---------------------SI-LNQATNQSFVILDEIGRGTATL 745 (920) Q Consensus 698 tRiGa~D~l~~-----g~ST-----F~vEm~e~~---------------------~I-L~~at~~SLVllDElGrGTst~ 745 (920) ..+-..||+.. |.|. -..|+.+.- .| ---+.+-.++|.||- |+.. T Consensus 95 ~~~tv~~nv~~~l~~~~~~~~~~~~r~~~lL~~vgL~~~~~~yP~eLSGGq~QRVaIARAL~~~P~lllaDEP---Ts~L 171 (233) T cd03258 95 SSRTVFENVALPLEIAGVPKAEIEERVLELLELVGLEDKADAYPAQLSGGQKQRVGIARALANNPKVLLCDEA---TSAL 171 (233) T ss_pred CCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCC---CCCC T ss_conf 9883999999999974999999999999999867991676269652677888999999998339989996597---6646 Q ss_pred HHHH--HHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 6799--999999999972698499974875797-6643 Q gi|254780750|r 746 DGLS--IAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 746 DG~a--iA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) |-.. --...+..|.+..+...+|+||.-++. .+++ T Consensus 172 D~~~~~~il~ll~~l~~e~g~t~i~vTHDl~~~~~~ad 209 (233) T cd03258 172 DPETTQSILALLRDINRELGLTIVLITHEMEVVKRICD 209 (233) T ss_pred CHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHCC T ss_conf 98899999999999999729899998989999998699 No 251 >KOG0059 consensus Probab=96.15 E-value=0.091 Score=31.65 Aligned_cols=64 Identities=20% Similarity=0.279 Sum_probs=45.3 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH-HHHHCCCEEEEEEEE Q ss_conf 899856999325889880567999999999999726984999748757976-643068858999999 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTD-LSKSLKRFHNATLQV 792 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~-l~~~~~~v~n~~~~~ 792 (920) ...-++|+|||---|-+|. +-=..|.++..+.+..+ -.+.+||+-|=++ +.....-..+.++.+ T Consensus 714 ig~p~vi~LDEPstGmDP~-arr~lW~ii~~~~k~g~-aiiLTSHsMeE~EaLCtR~aImv~G~l~c 778 (885) T KOG0059 714 IGDPSVILLDEPSTGLDPK-ARRHLWDIIARLRKNGK-AIILTSHSMEEAEALCTRTAIMVIGQLRC 778 (885) T ss_pred CCCCCEEEECCCCCCCCHH-HHHHHHHHHHHHHHCCC-EEEEEECCHHHHHHHHCCEEEEECCEEEE T ss_conf 1698679981787668988-99999999999985595-79999356999999862434687577999 No 252 >cd00983 recA RecA is a bacterial enzyme which has roles in homologous recombination, DNA repair, and the induction of the SOS response. RecA couples ATP hydrolysis to DNA strand exchange. Probab=96.13 E-value=0.07 Score=32.51 Aligned_cols=122 Identities=19% Similarity=0.267 Sum_probs=69.4 Q ss_pred CCCEEEEEECCCCCCHHHH-HHHHHHHHHHHHCCC---CCCHHHCCCCCCCEEEEEEEC-CCCCCCCCCHHHHHHHHHHH Q ss_conf 6643999967784407899-999999999997198---530353206822105676523-76611385328999999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTF-LRQNALIVIMAQMGS---YVPASYAHIGIVDKLFSRVGS-ADNLASGRSTFMVEMIETAS 722 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~-lRqval~vilAQiG~---fVPA~~a~i~~~D~IftRiGa-~D~l~~g~STF~vEm~e~~~ 722 (920) ..+|+.-|.||..+||||+ |..++-++ ..|. |+=|+.+- .+ .-...+|- .|++.--++...-++.++.. T Consensus 53 P~GRivei~G~essGKTtlal~~ia~aQ---k~gg~~~~iDaE~a~-d~--~~a~~lGVD~~~l~~~qp~~~Eq~l~i~~ 126 (325) T cd00983 53 PKGRIIEIYGPESSGKTTLALHAIAEAQ---KLGGTVAFIDAEHAL-DP--VYAKKLGVDLDNLLISQPDTGEQALEIAD 126 (325) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHH---HCCCEEEEEEHHHCC-CH--HHHHHCCCCHHHEEEECCCHHHHHHHHHH T ss_conf 6880899988987779999999999987---359839999625425-98--99998099846758966638999999999 Q ss_pred HHHHCCCCCEEEEECCCCCCCHHH----------H-H----HHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 999589985699932588988056----------7-9----999999999997269849997487579 Q gi|254780750|r 723 ILNQATNQSFVILDEIGRGTATLD----------G-L----SIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 723 IL~~at~~SLVllDElGrGTst~D----------G-~----aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) -|-+...-.||++|=+|-=+...+ | . +-|+-.+...+.+.+|..+|.-|..+= T Consensus 127 ~li~s~~~dliViDSvaal~p~~E~e~~~~d~~vg~~ArlmskalRklt~~l~k~~~~lIfiNQ~R~k 194 (325) T cd00983 127 SLVRSGAVDLIVVDSVAALVPKAEIEGEMGDSHVGLQARLMSQALRKLTGSINKSNTTVIFINQLREK 194 (325) T ss_pred HHHCCCCCCEEEECCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEHHH T ss_conf 97515887679981511236578876011321143899999999999998753378079995543221 No 253 >TIGR01288 nodI nodulation ABC transporter NodI; InterPro: IPR005978 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Nodulation ABC transporter, NodI is required for normal nodulation by nitrogen-fixing root nodule bacteria such as Mesorhizobium loti. It is a member of the family of ABC transporter ATP binding proteins and works with NodJ IPR005981 from INTERPRO to export a variety of modified carbohydrate molecules as signals to plant hosts to establish root nodules. This model does not recognize the highly divergent NodI from Azorhizobium caulinodans.; GO: 0005215 transporter activity, 0005524 ATP binding, 0006810 transport, 0009877 nodulation, 0016020 membrane. Probab=96.10 E-value=0.027 Score=35.62 Aligned_cols=140 Identities=32% Similarity=0.446 Sum_probs=82.6 Q ss_pred CCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH-----HHHHCCCCCCHH----HCCCCCC---CEEEE Q ss_conf 77044434563587777664399996778440789999999999-----999719853035----3206822---10567 Q gi|254780750|r 631 SKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV-----IMAQMGSYVPAS----YAHIGIV---DKLFS 698 (920) Q Consensus 631 ~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v-----ilAQiG~fVPA~----~a~i~~~---D~Ift 698 (920) +...|-||+.+.. ..+.+.=+-|||.+||||.-|.+-=++ -..-.|-.+|+. .+++|++ |.+=. T Consensus 15 G~k~~v~~lsf~~----~~GeCfGllGPnGaGkst~~r~~lG~~~P~~G~itvl~~~~P~~~r~ar~~~G~v~qfd~l~~ 90 (303) T TIGR01288 15 GDKVVVNDLSFTI----AAGECFGLLGPNGAGKSTIARLVLGLISPDEGKITVLGEPVPARARLARAKIGVVPQFDNLDR 90 (303) T ss_pred CCEEEEECCEEEE----ECCCEEECCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHCCEEECCCCCCH T ss_conf 7737873131455----335122222788762577766765235888752788247774255433444011202012320 Q ss_pred EEECCCCCCC-----CCCHHHHH-----HHHHHHH--------------------HHHC--CCCCEEEEECCCCCCCHHH Q ss_conf 6523766113-----85328999-----9999999--------------------9958--9985699932588988056 Q gi|254780750|r 699 RVGSADNLAS-----GRSTFMVE-----MIETASI--------------------LNQA--TNQSFVILDEIGRGTATLD 746 (920) Q Consensus 699 RiGa~D~l~~-----g~STF~vE-----m~e~~~I--------------------L~~a--t~~SLVllDElGrGTst~D 746 (920) ..-+.+|+.- +.||=.+| +.|.+.+ |-.| ..-.|.++||--.|-+|+- T Consensus 91 eft~renllv~Gryf~~~~r~~e~~~P~ll~farleska~~~v~~lsGGm~rrltla~alindP~ll~ldePttGldPha 170 (303) T TIGR01288 91 EFTVRENLLVFGRYFGLSTREIEEVIPSLLEFARLESKADVRVADLSGGMKRRLTLARALINDPQLLILDEPTTGLDPHA 170 (303) T ss_pred HHHHHHHEEEEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHH T ss_conf 11222110222000000045587754667776642100254054532514678888888743983799737877877146 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 799999999999972698499974875797 Q gi|254780750|r 747 GLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 747 G~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) - -+-|--+..|... +-..|.+||+-|=+ T Consensus 171 r-hliWerlr~lla~-Gktilltth~meea 198 (303) T TIGR01288 171 R-HLIWERLRSLLAR-GKTILLTTHFMEEA 198 (303) T ss_pred H-HHHHHHHHHHHHC-CCEEEEHHHHHHHH T ss_conf 7-8999999999853-86112034677777 No 254 >KOG0217 consensus Probab=96.04 E-value=1.8e-05 Score=59.25 Aligned_cols=176 Identities=17% Similarity=0.074 Sum_probs=120.4 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 66439999677844078999999999999971985303532068221056765237661138532899999999999958 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a 727 (920) +....+|-++||.||+|. +|-+.-....--.+.|+|+.++.+.-+|.+--|+-+.++ .+++.|.++|.+-- |.-.+ T Consensus 752 Gk~~y~lEvP~n~~~~s~-~~~~~~S~~Kg~~RY~tp~~~kli~~l~~aee~~~~~~~--d~~~r~~~~f~~~~-~~w~~ 827 (1125) T KOG0217 752 GKDVYQLEVPENGGVPSS-LRYELMSAKKGVSRYYTPDLRKLIAHLDEAEERKKSSLS--DLKRRLIVRFDEHY-IIWQA 827 (1125) T ss_pred CCEEEEEECCCCCCCCCC-HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH-HHHHH T ss_conf 754799963765687783-689998753273324687799999999999999987777--88999999863237-99999 Q ss_pred CCCCEEEEECCCCCCCHHHH----HHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHH----CCCEEEEEEEEEEECCEE Q ss_conf 99856999325889880567----9999999999997269849997487579766430----688589999999609927 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDG----LSIAWATIEYLHETNRCRGLLATHFHELTDLSKS----LKRFHNATLQVSDSNEGI 799 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG----~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~----~~~v~n~~~~~~~~~~~i 799 (920) |-++++.||+|.++|-+.+| +.++ .|++-.- . ..+..|.+|-|..-.+... .|+.. .+...+.+ .+ T Consensus 828 tv~~~a~iD~l~sla~~s~~~~~~~Crp-~i~~~~d-t-~~~l~~~~~~Hpcfsl~s~~~~fipN~v--~~g~~~e~-~~ 901 (1125) T KOG0217 828 TVKALASIDCLLSLAETSKGLGGPMCRP-EIVESTD-T-PGFLIVKGLRHPCFSLPSTGTSFIPNDV--ELGGAEEN-RE 901 (1125) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCC-EEECCCC-C-CCEEEEECCCCCEEECCCCCCCCCCCHH--HCCCCCCC-EE T ss_conf 9999998899987777650379875352-4641468-8-8616884266724623767776456322--20555432-02 Q ss_pred EE----EEEEEECCCCCCHHHHHHHHCC--CCHHHHHHHH Q ss_conf 78----7777447898877899999829--9989999999 Q gi|254780750|r 800 IF----LHKVIPGIADHSYGIQVGKLAG--LPNTVISRAY 833 (920) Q Consensus 800 ~f----lykl~~G~~~~Sygi~vA~laG--~p~~vi~~A~ 833 (920) .. .-..+.+++.++--..++...| +|.++++++- T Consensus 902 ~llTGpNmgGKSTllRq~c~~vilaq~G~~VPa~~~~~tp 941 (1125) T KOG0217 902 GLLTGPNMGGKSTLLRQACIAVILAQIGCDVPAEVMELTP 941 (1125) T ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHCCCC T ss_conf 2320687577248999999999999857876588860661 No 255 >PRK10790 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=96.04 E-value=0.14 Score=30.32 Aligned_cols=123 Identities=20% Similarity=0.236 Sum_probs=63.3 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHC------------------CCCCCC- Q ss_conf 4443456358777766439999677844078999999999999971985303532------------------068221- Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYA------------------HIGIVD- 694 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a------------------~i~~~D- 694 (920) .|=+|+++.. ..+...-|.||+.+||||+++-+. |.|-|-+-. .++.|- T Consensus 355 ~vL~~is~~I----~~Ge~vaIVG~SGsGKSTL~~LL~--------rly~p~~G~I~idG~di~~i~~~~lR~~i~~V~Q 422 (593) T PRK10790 355 LVLKNINLSV----PSRNFVALVGHTGSGKSTLASLLM--------GYYPLTEGEIRLDGRPLSSLSHSVLRQGVAMVQQ 422 (593) T ss_pred HHHHCCCCCC----CCCCEEEEECCCCCCHHHHHHHHH--------HHCCCCCCCCCCCCEECCCCCHHHHHHCCCCCCC T ss_conf 0142601044----899789987999886899999999--------8556789941659932442468888631575166 Q ss_pred --EEEEEEECCCCCCCCCCHHHHHHHHHHHHH------H----------------------------H--CCCCCEEEEE Q ss_conf --056765237661138532899999999999------9----------------------------5--8998569993 Q gi|254780750|r 695 --KLFSRVGSADNLASGRSTFMVEMIETASIL------N----------------------------Q--ATNQSFVILD 736 (920) Q Consensus 695 --~IftRiGa~D~l~~g~STF~vEm~e~~~IL------~----------------------------~--at~~SLVllD 736 (920) -+|.. --.|||.-|...=..|+.+.+... . . ..+..++|+| T Consensus 423 ~~~LF~g-TI~eNi~~g~~~~~~~i~~a~~~a~l~~~i~~lp~G~dT~vge~G~~LSgGQrQRiaiARall~~p~iliLD 501 (593) T PRK10790 423 DPVVLAD-TFLANVTLGRDISEEQVWQALETVQLAELARSLSDGLYTPLGEQGNTLSVGQKQLLALARVLVETPQILILD 501 (593) T ss_pred CCCCCCC-CHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHCCHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 6514565-299997760023679999999997789999857420104423876887999999999999995598999983 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECCCH Q ss_conf 2588988056799999999999972-698499974875 Q gi|254780750|r 737 EIGRGTATLDGLSIAWATIEYLHET-NRCRGLLATHFH 773 (920) Q Consensus 737 ElGrGTst~DG~aiA~aile~l~~~-~~~~~lfaTHy~ 773 (920) |- ||.-|-.. -..|.+.|-+. .++.++..||=- T Consensus 502 Ea---TSaLD~~t-E~~i~~~l~~~~~~~T~i~IaHRl 535 (593) T PRK10790 502 EA---TASIDSGT-EQAIQQALAAIREHTTLVVIAHRL 535 (593) T ss_pred CC---CCCCCHHH-HHHHHHHHHHHCCCCEEEEECCCH T ss_conf 87---77889999-999999999972899899970789 No 256 >PRK00440 rfc replication factor C small subunit; Reviewed Probab=96.01 E-value=0.12 Score=30.74 Aligned_cols=99 Identities=20% Similarity=0.215 Sum_probs=52.0 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC-CCCC Q ss_conf 999677844078999999999999971985303532068221056765237661138532899999999999958-9985 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA-TNQS 731 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a-t~~S 731 (920) +|++||...||+|..+-+| .++.+.- ++.=+..+-|+|+ +|....--...+.+..-... .+.- T Consensus 40 lLf~GppG~GKTt~a~~la-----~~l~~~~---------~~~~~lelnasd~--r~id~vr~~i~~~~~~~~~~~~~~k 103 (318) T PRK00440 40 LLFAGPPGTGKTTAALALA-----RELYGEY---------WRENFLELNASDE--RGIDVIRNKIKEFARTAPVGGAPFK 103 (318) T ss_pred EEEECCCCCCHHHHHHHHH-----HHHCCCC---------CCCCEEEECCCCC--CCHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 9888959988999999999-----9976986---------4347689516456--6717899999999972677899738 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHH--CCCEEEEECCCH Q ss_conf 699932588988056799999999999972--698499974875 Q gi|254780750|r 732 FVILDEIGRGTATLDGLSIAWATIEYLHET--NRCRGLLATHFH 773 (920) Q Consensus 732 LVllDElGrGTst~DG~aiA~aile~l~~~--~~~~~lfaTHy~ 773 (920) +|++||.-+=|. ++ +..+....+. ..|+.+++|.+. T Consensus 104 iiiiDE~d~l~~--~a----q~aL~~~mE~~~~~~~fil~~n~~ 141 (318) T PRK00440 104 IIFLDEADNLTS--DA----QQALRRTMEMYSQTTRFILSCNYS 141 (318) T ss_pred EEEEECCCCCCH--HH----HHHHHHHHHCCCCCCEEEEECCCC T ss_conf 999868553225--56----788876431056662588634883 No 257 >cd01122 GP4d_helicase GP4d_helicase is a homohexameric 5'-3' helicases. Helicases couple NTP hydrolysis to the unwinding of nucleic acid duplexes into their component strands. Probab=95.93 E-value=0.025 Score=35.80 Aligned_cols=44 Identities=36% Similarity=0.403 Sum_probs=31.9 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 664399996778440789999999999999719853035320682 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) ..+.+.+|+|+-..|||+|+.++++..+ +|-|-.|--=+.+++. T Consensus 28 ~~GeL~viaarpg~GKT~f~~~~a~~~~-~~~g~~vl~~SlEm~~ 71 (271) T cd01122 28 RKGELIILTAGTGVGKTTFLREYALDLI-TQHGVRVGTISLEEPV 71 (271) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHHHH-HHCCCEEEEEECCCCH T ss_conf 9980899996899869999999999999-9769908999704999 No 258 >PRK13695 putative NTPase; Provisional Probab=95.83 E-value=0.13 Score=30.52 Aligned_cols=115 Identities=22% Similarity=0.278 Sum_probs=59.4 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHH-----CCCCCCH--H-HCCC--CCCC------EEEEEEECCCCCCCCC-CHHHH Q ss_conf 99967784407899999999999997-----1985303--5-3206--8221------0567652376611385-32899 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQ-----MGSYVPA--S-YAHI--GIVD------KLFSRVGSADNLASGR-STFMV 715 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQ-----iG~fVPA--~-~a~i--~~~D------~IftRiGa~D~l~~g~-STF~v 715 (920) ..||||--.||||+++-+.=. |.. .|.+.+- + .-++ -++| .+++|++..-..--|+ .-+.- T Consensus 6 I~iTG~PGvGKTTli~Kv~~~--L~~~g~~v~GF~T~Evre~G~R~GF~vv~l~~g~~~~lA~~~~~~~~~VgkY~V~~~ 83 (174) T PRK13695 6 IGITGMPGVGKTTLVLKIAEL--LAREGYKVGGFITEEVREGGKRIGFKIIDLDTGEEGILARVGAVSRPRVGKYVVNLE 83 (174) T ss_pred EEEECCCCCCHHHHHHHHHHH--HHHCCCEEEEEEEEEEECCCEEEEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEHH T ss_conf 998789998899999999999--863696174699525603882850599990588568767537889855456687168 Q ss_pred HHHHH-HHHHHHC-CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 99999-9999958-9985699932588988056799999999999972698499974875 Q gi|254780750|r 716 EMIET-ASILNQA-TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 716 Em~e~-~~IL~~a-t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) +..+. ..+|++| ++..|+++||+|+=--.... .-.+|.+-| +..+ .+|.+=|-+ T Consensus 84 ~~e~~~~~~l~~a~~~~dlivIDEIG~MEl~s~~--F~~~V~~~L-~s~k-pvl~tih~p 139 (174) T PRK13695 84 DLERIAIPAISRALREADLIIIDEIGPMELKSKK--FVSAVEEVL-KSEK-PVIATVHRP 139 (174) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHH--HHHHHHHHH-CCCC-CEEEEECHH T ss_conf 9789989999835357879999631033110499--999999997-3899-899997758 No 259 >PRK04195 replication factor C large subunit; Provisional Probab=95.83 E-value=0.15 Score=30.02 Aligned_cols=15 Identities=27% Similarity=0.339 Sum_probs=6.8 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 200011134566643 Q gi|254780750|r 530 SNLTRFTTLELIDLE 544 (920) Q Consensus 530 ~~~~Rf~t~eL~~l~ 544 (920) .+.+||..|..-.+. T Consensus 324 ~~~~~y~~P~~~~~l 338 (403) T PRK04195 324 RGFTRYQPPSRIRLL 338 (403) T ss_pred CCCCCCCCCHHHHHH T ss_conf 996457895698998 No 260 >PRK09473 oppD oligopeptide transporter ATP-binding component; Provisional Probab=95.77 E-value=0.12 Score=30.63 Aligned_cols=33 Identities=24% Similarity=0.226 Sum_probs=26.2 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +=+||+|+. ..+.++=|-|+|-+||||++|.+. T Consensus 31 av~~Vsf~i----~~GEilgivGeSGsGKSTl~~~i~ 63 (330) T PRK09473 31 AVNDLNFSL----RAGETLGIVGESGSGKSQTAFALM 63 (330) T ss_pred EEECCEEEE----CCCCEEEEECCCCCHHHHHHHHHH T ss_conf 866747688----899899998689877999999997 No 261 >smart00382 AAA ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment. Probab=95.73 E-value=0.0092 Score=39.08 Aligned_cols=90 Identities=20% Similarity=0.175 Sum_probs=43.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHH-CCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 3999967784407899999999999997-198530353206822105676523766113853289999999999995899 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQ-MGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATN 729 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQ-iG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~ 729 (920) +..+|.||.-+||||.++.+|...-... .-.++-++...-......+..... ++ ...+.-+..+. .++..|-. T Consensus 3 ~~ill~G~~GsGKTtl~~~la~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~---~~~~~~~~ 76 (148) T smart00382 3 EVILIVGPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVG-GK--KASGSGELRLR---LALALARK 76 (148) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHHHHHHHHHHC-CC--CCCCCHHHHHH---HHHHHHHH T ss_conf 78999999970299999999987266899689987599898889876530001-12--21051999999---99999984 Q ss_pred --CCEEEEECCCCCCCHHH Q ss_conf --85699932588988056 Q gi|254780750|r 730 --QSFVILDEIGRGTATLD 746 (920) Q Consensus 730 --~SLVllDElGrGTst~D 746 (920) .+++++||+++=..... T Consensus 77 ~~~~viiiDei~~~~~~~~ 95 (148) T smart00382 77 LKPDVLILDEITSLLDAEQ 95 (148) T ss_pred CCCCEEEEECCHHHCCCCC T ss_conf 4998999827502147620 No 262 >KOG0060 consensus Probab=95.71 E-value=0.085 Score=31.88 Aligned_cols=125 Identities=22% Similarity=0.294 Sum_probs=59.8 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHH-------CC-----CCCCHH-HCCCC-CCCEEEEEEECCCCCCCCCCHH Q ss_conf 6643999967784407899999999999997-------19-----853035-32068-2210567652376611385328 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-------MG-----SYVPAS-YAHIG-IVDKLFSRVGSADNLASGRSTF 713 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQ-------iG-----~fVPA~-~a~i~-~~D~IftRiGa~D~l~~g~STF 713 (920) .+++-++|||||.+|||++||..|=+==.-+ -| .|||-. ++.+| +-|+|.=-.++.|...++.|+= T Consensus 459 ~~g~~LLItG~sG~GKtSLlRvlggLWp~~~G~l~k~~~~~~~~lfflPQrPYmt~GTLRdQvIYP~~~~~~~~~~~~d~ 538 (659) T KOG0060 459 PSGQNLLITGPSGCGKTSLLRVLGGLWPSTGGKLTKPTDGGPKDLFFLPQRPYMTLGTLRDQVIYPLKAEDMDSKSASDE 538 (659) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCCCCCCCEEEECCCCCCCCCCHHHHEECCCCCCCCCCCCCCHH T ss_conf 58975999789987636899998532516787276056788775588368877666544550332575322101377678 Q ss_pred ----HHHHHHHHHHHHHC------------------------------CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHH Q ss_conf ----99999999999958------------------------------99856999325889880567999999999999 Q gi|254780750|r 714 ----MVEMIETASILNQA------------------------------TNQSFVILDEIGRGTATLDGLSIAWATIEYLH 759 (920) Q Consensus 714 ----~vEm~e~~~IL~~a------------------------------t~~SLVllDElGrGTst~DG~aiA~aile~l~ 759 (920) +.|-....+|+... .+--+.++||= ||.-+ ...-.|+-+.+ T Consensus 539 ~i~r~Le~v~L~hl~~r~ggld~~~~~dW~dvLS~GEqQRLa~ARLfy~kPk~AiLDE~---TSAv~-~dvE~~~Yr~~- 613 (659) T KOG0060 539 DILRILENVQLGHLLEREGGLDQQVDWDWMDVLSPGEQQRLAFARLFYHKPKFAILDEC---TSAVT-EDVEGALYRKC- 613 (659) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHCCHHHHCCHHHHHHHHHHHHHHCCCCEEEEECC---HHHCC-HHHHHHHHHHH- T ss_conf 99999998545558998578882222057763698888899999998608836876030---22235-76799999999- Q ss_pred HHCCCEEEEECCCHHHHH Q ss_conf 726984999748757976 Q gi|254780750|r 760 ETNRCRGLLATHFHELTD 777 (920) Q Consensus 760 ~~~~~~~lfaTHy~eL~~ 777 (920) +..+-..+-.-|-..|.. T Consensus 614 r~~giT~iSVgHRkSL~k 631 (659) T KOG0060 614 REMGITFISVGHRKSLWK 631 (659) T ss_pred HHCCCEEEEECCHHHHHH T ss_conf 980976999635788986 No 263 >COG0411 LivG ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism] Probab=95.58 E-value=0.013 Score=37.93 Aligned_cols=133 Identities=27% Similarity=0.343 Sum_probs=76.2 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH--------HHHHHHHCCCCCC-HHHCCCCCCC-----EEEEEE Q ss_conf 4434563587777664399996778440789999999--------9999997198530-3532068221-----056765 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA--------LIVIMAQMGSYVP-ASYAHIGIVD-----KLFSRV 700 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva--------l~vilAQiG~fVP-A~~a~i~~~D-----~IftRi 700 (920) .=||+++.. ..+.+.-|-|||-+||||+.--+. -+.+-.+--.-.| .+-+++|++- ++|..| T Consensus 19 Al~~Vsl~v----~~Gei~~LIGPNGAGKTTlfNlitG~~~P~~G~v~~~G~~it~l~p~~iar~Gi~RTFQ~~rlF~~l 94 (250) T COG0411 19 AVNDVSLEV----RPGEIVGLIGPNGAGKTTLFNLITGFYKPSSGTVIFRGRDITGLPPHRIARLGIARTFQITRLFPGL 94 (250) T ss_pred EEECEEEEE----CCCEEEEEECCCCCCCEEEEEEECCCCCCCCCEEEECCCCCCCCCHHHHHHCCCEEECCCCCCCCCC T ss_conf 970414787----3872899988998882456653236405887369988803677888899862504632034126897 Q ss_pred ECCCCCCCCCC-------H-----H---HHHHHHHH-HHHH----------------------------HCCCCCEEEEE Q ss_conf 23766113853-------2-----8---99999999-9999----------------------------58998569993 Q gi|254780750|r 701 GSADNLASGRS-------T-----F---MVEMIETA-SILN----------------------------QATNQSFVILD 736 (920) Q Consensus 701 Ga~D~l~~g~S-------T-----F---~vEm~e~~-~IL~----------------------------~at~~SLVllD 736 (920) -.-||+.-|.. . + ..|..|-| .+|. -|+.--|+||| T Consensus 95 TVlENv~va~~~~~~~~~~l~~~~~~~~e~~~~e~A~~~Le~vgL~~~a~~~A~~LsyG~qR~LEIArALa~~P~lLLLD 174 (250) T COG0411 95 TVLENVAVGAHARLGLSGLLGRPRARKEEREARERARELLEFVGLGELADRPAGNLSYGQQRRLEIARALATQPKLLLLD 174 (250) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHHCCHHHCCCHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 38898998865311035440563100147999999999998739932533601028856768999999986699878856 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 258898805679999999999997269849997487 Q gi|254780750|r 737 EIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 737 ElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) |--.|-++.+--.++. .+..+-+..+--.|..-|- T Consensus 175 EPaAGln~~e~~~l~~-~i~~i~~~~g~tillIEHd 209 (250) T COG0411 175 EPAAGLNPEETEELAE-LIRELRDRGGVTILLIEHD 209 (250) T ss_pred CCCCCCCHHHHHHHHH-HHHHHHHCCCCEEEEEEEC T ss_conf 7657989899999999-9999885489689999742 No 264 >PRK10869 recombination and repair protein; Provisional Probab=95.47 E-value=0.22 Score=28.77 Aligned_cols=112 Identities=16% Similarity=0.262 Sum_probs=67.7 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHHHHCCCEEEE Q ss_conf 53289999999999995899856999325889880567999999999999726-98499974875797664306885899 Q gi|254780750|r 710 RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETN-RCRGLLATHFHELTDLSKSLKRFHNA 788 (920) Q Consensus 710 ~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~-~~~~lfaTHy~eL~~l~~~~~~v~n~ 788 (920) .|-+|-= ...++-....-.-+|+||+=.|.|-.- |.+|-+.|.+.. .+=++..||.+.++..++. T Consensus 436 lSRimLA---lk~v~a~~~~~~tlIFDEID~GigG~~----a~~vg~~L~~ls~~~QVi~ITHlpQvAa~ad~------- 501 (553) T PRK10869 436 LSRIALA---IQVITARKMETPALIFDEVDVGISGPT----AAVVGKLLRQLGESTQVMCVTHLPQVAGCGHQ------- 501 (553) T ss_pred HHHHHHH---HHHHHHCCCCCCEEEEECCCCCCCHHH----HHHHHHHHHHHHCCCEEEEECCHHHHHHCCCC------- T ss_conf 9999999---999970579998289857888998799----99999999998529879998036999854487------- Q ss_pred EEEEE-EECCEEEEEEEEEECCCCCCHHHHHHHHCC---CCHHHHHHHHHHHH Q ss_conf 99999-609927787777447898877899999829---99899999999999 Q gi|254780750|r 789 TLQVS-DSNEGIIFLHKVIPGIADHSYGIQVGKLAG---LPNTVISRAYDILK 837 (920) Q Consensus 789 ~~~~~-~~~~~i~flykl~~G~~~~Sygi~vA~laG---~p~~vi~~A~~~~~ 837 (920) |+.|. +..++.|+. ++.+ ..+.-==-++|||.| +-+.-++.|+++++ T Consensus 502 H~~V~K~~~~~~t~t-~i~~-L~~~eRv~EiARMlsG~~it~~al~~A~eLL~ 552 (553) T PRK10869 502 HMFVSKETDGGMTET-HMQP-LDKKARLQELARLLGGSEVTRNTLANAKELLA 552 (553) T ss_pred EEEEEEECCCCEEEE-EEEE-CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHC T ss_conf 789999737982689-8788-98407999999984799876999999999865 No 265 >cd03244 ABCC_MRP_domain2 Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=95.43 E-value=0.024 Score=35.98 Aligned_cols=35 Identities=23% Similarity=0.276 Sum_probs=27.4 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 044434563587777664399996778440789999999 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .+|=+|++|.. ..+.+..|.|||-+||||++|.+. T Consensus 17 ~~vL~~isl~i----~~Ge~v~ivG~sGsGKSTLl~ll~ 51 (221) T cd03244 17 PPVLKNISFSI----KPGEKVGIVGRTGSGKSSLLLALF 51 (221) T ss_pred CCCEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 75175448998----699899999999998999999996 No 266 >cd03275 ABC_SMC1_euk Eukaryotic SMC1 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (for Probab=95.42 E-value=0.23 Score=28.64 Aligned_cols=74 Identities=23% Similarity=0.380 Sum_probs=45.9 Q ss_pred CCCCCCCCHHHHHHHHHHHH--HHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC--CCEEEEECCCHHHHHHH Q ss_conf 66113853289999999999--995899856999325889880567999999999999726--98499974875797664 Q gi|254780750|r 704 DNLASGRSTFMVEMIETASI--LNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETN--RCRGLLATHFHELTDLS 779 (920) Q Consensus 704 D~l~~g~STF~vEm~e~~~I--L~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~--~~~~lfaTHy~eL~~l~ 779 (920) |+|+.|+-|. .-+|.| +...-|--+.+|||+ ..+.|..-+. -+..||.+.. .+-.++.||-..+-+.+ T Consensus 154 ~~LSGGEKtl----~alallFai~~~~PsPF~vLDEV---DAaLD~~Nv~-r~~~~i~~~~~~~~QfIvITh~~~~~~~A 225 (247) T cd03275 154 DNLSGGEKTM----AALALLFAIHSYQPAPFFVLDEV---DAALDNTNVG-KVASYIREQAGPNFQFIVISLKEEFFSKA 225 (247) T ss_pred HHCCCCHHHH----HHHHHHHHHHCCCCCCEEEEECC---CCCCCHHHHH-HHHHHHHHHHCCCCEEEEEECCHHHHHHC T ss_conf 3338208999----99999999971289955887067---6346889999-99999998725686799998888998630 Q ss_pred HHCCCE Q ss_conf 306885 Q gi|254780750|r 780 KSLKRF 785 (920) Q Consensus 780 ~~~~~v 785 (920) +.+=+| T Consensus 226 d~L~GV 231 (247) T cd03275 226 DALVGV 231 (247) T ss_pred CEEEEE T ss_conf 018989 No 267 >PRK07952 DNA replication protein DnaC; Validated Probab=95.41 E-value=0.16 Score=29.85 Aligned_cols=103 Identities=22% Similarity=0.268 Sum_probs=61.1 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHH-HHHHHHHHHHHHCCCC Q ss_conf 999967784407899999999999997198530353206822105676523766113853289-9999999999958998 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFM-VEMIETASILNQATNQ 730 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~-vEm~e~~~IL~~at~~ 730 (920) -+|.+||-..|||.+-=.+|-..+ +-|-.| +|+.+. |=+.+=++||- -++.| ..+++....- T Consensus 98 gLlF~G~~GTGKThLA~aIan~Li--~~G~sV------------lf~t~~--dLl~~lr~t~~~~~~~e-~~~l~~l~~~ 160 (242) T PRK07952 98 SFIFSGKPGTGKNHLAAAICNELL--LRGKSV------------LIITVA--DIMSAMKDTFRNSETSE-EQLLNDLSNV 160 (242) T ss_pred EEEEECCCCCCHHHHHHHHHHHHH--HCCCEE------------EEEEHH--HHHHHHHHHHHCCCCCH-HHHHHHHHCC T ss_conf 179978999978999999999999--879949------------997799--99999999980687569-9999986318 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 569993258898805679999999999997269849997487 Q gi|254780750|r 731 SFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 731 SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) .|+||||||--..|..+..+-+.++..=....++ ++++|-+ T Consensus 161 dLLIiDdlG~e~~t~~~~~~lf~iId~Ry~~~kp-~IitTNl 201 (242) T PRK07952 161 DLLVIDEIGVQTESRYEKVIINQIVDRRSSSKRP-TGMLTNS 201 (242) T ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCC-EEEEECC T ss_conf 9898730146658888999999999999971698-8998179 No 268 >PRK11308 dppF dipeptide transporter ATP-binding subunit; Provisional Probab=95.20 E-value=0.27 Score=28.17 Aligned_cols=47 Identities=17% Similarity=0.212 Sum_probs=30.0 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHH---HHHHCCCEEEEECCCHHHHH Q ss_conf 8998569993258898805679999999999---99726984999748757976 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEY---LHETNRCRGLLATHFHELTD 777 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~---l~~~~~~~~lfaTHy~eL~~ 777 (920) +.+-.|+|+||- ||..|- .+...|++- |.+..+.-.||.||.-.+.. T Consensus 170 ~~~P~lLi~DEP---TsaLD~-~~q~~Il~ll~~l~~~~~~t~l~ITHDl~~v~ 219 (327) T PRK11308 170 MLDPDVVVADEP---VSALDV-SVQAQVLNLMMDLQQELGLSYVFISHDLSVVE 219 (327) T ss_pred HCCCCEEEEECC---CCCCCH-HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH T ss_conf 428989998478---654699-99999999999999700976999869899999 No 269 >PRK12402 replication factor C small subunit 2; Reviewed Probab=95.19 E-value=0.22 Score=28.76 Aligned_cols=104 Identities=19% Similarity=0.291 Sum_probs=50.1 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCC-CCCHHHCCCCCCCEEEEEEECC----C---------CCCCCCCHHHHHHH Q ss_conf 99967784407899999999999997198-5303532068221056765237----6---------61138532899999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGS-YVPASYAHIGIVDKLFSRVGSA----D---------NLASGRSTFMVEMI 718 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~-fVPA~~a~i~~~D~IftRiGa~----D---------~l~~g~STF~vEm~ 718 (920) +|++||-..||+|..|-+|-- ++| .+.....++..-| .+.| |.. | .-..|.|. .. T Consensus 39 lLf~GPpG~GKTt~A~~lA~~-----l~~~~~~~~~~~~nasd-~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~----~d 107 (337) T PRK12402 39 LVVYGPSGSGKTAAVRALARE-----LYGDPWENNFTYFNVSD-FFDQ-GKKYLVEDPRFAHFYDDPKRKYKSV----ID 107 (337) T ss_pred EEEECCCCCCHHHHHHHHHHH-----HCCCCCCCCCCEECCCC-CCCC-CCCEEECCCCHHHHHCCHHHCCCCH----HH T ss_conf 988892984899999999999-----67997567833311653-1135-6400101664234420153327737----89 Q ss_pred HHHHHHHH-C------CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH--CCCEEEEECCCH Q ss_conf 99999995-8------9985699932588988056799999999999972--698499974875 Q gi|254780750|r 719 ETASILNQ-A------TNQSFVILDEIGRGTATLDGLSIAWATIEYLHET--NRCRGLLATHFH 773 (920) Q Consensus 719 e~~~IL~~-a------t~~SLVllDElGrGTst~DG~aiA~aile~l~~~--~~~~~lfaTHy~ 773 (920) ....+++. | ++.-+|||||..+-|.. |+..+..+++. ..|+.+|+|.+. T Consensus 108 ~i~~ii~~~a~~~p~~~~~KiiIlDEad~lt~~------Aq~aLlk~lEe~~~~~~fIl~t~~~ 165 (337) T PRK12402 108 NFKHILKEYASMRPLSADYKLILFDNAEALRED------AQQALRRIMERYSETCRFIFSTTQP 165 (337) T ss_pred HHHHHHHHHHCCCCCCCCEEEEEEECCCCCCHH------HHHHHHHHHHCCCCCEEEEEECCCC T ss_conf 999999998614887788049997071317999------9999998874088766998723864 No 270 >PRK10261 glutathione transporter ATP-binding protein; Provisional Probab=95.19 E-value=0.27 Score=28.15 Aligned_cols=47 Identities=21% Similarity=0.139 Sum_probs=30.0 Q ss_pred CCCCCEEEEECCCCCCCHHHHH--HHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 8998569993258898805679--9999999999972698499974875797 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGL--SIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~--aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +..-.++|+||- ||..|=. |--+..+..|.+..+.-.||.||.-.+. T Consensus 479 ~~~P~lLI~DEP---Ts~LDv~~qa~il~Ll~~L~~~~g~til~IsHDl~~v 527 (623) T PRK10261 479 ALNPKVIIADEA---VSALDVSIRGQIINLLLDLQRDFGIAYLFISHDMAVV 527 (623) T ss_pred HHCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHH T ss_conf 969999999688---6667999999999999999997298999986899999 No 271 >COG1618 Predicted nucleotide kinase [Nucleotide transport and metabolism] Probab=95.14 E-value=0.052 Score=33.45 Aligned_cols=127 Identities=24% Similarity=0.280 Sum_probs=63.8 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCC----CHH------HCCCCCCC------EEEEEEECCCCCCCCCCHHHHH Q ss_conf 9996778440789999999999999719853----035------32068221------0567652376611385328999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYV----PAS------YAHIGIVD------KLFSRVGSADNLASGRSTFMVE 716 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fV----PA~------~a~i~~~D------~IftRiGa~D~l~~g~STF~vE 716 (920) ..||||--.||||++.-++ --|.+-|+=| ..+ .--+.++| .+|++.|.+- ---|.=.--+| T Consensus 8 i~ITG~PGvGKtTl~~ki~--e~L~~~g~kvgGf~t~EVR~gGkR~GF~Ivdl~tg~~~~la~~~~~~-~rvGkY~V~v~ 84 (179) T COG1618 8 IFITGRPGVGKTTLVLKIA--EKLREKGYKVGGFITPEVREGGKRIGFKIVDLATGEEGILARVGFSR-PRVGKYGVNVE 84 (179) T ss_pred EEEECCCCCCHHHHHHHHH--HHHHHCCCEEEEEEEEEEECCCEEEEEEEEECCCCCEEEEEECCCCC-CCCCEEEEEHH T ss_conf 9986799845899999999--99985596651398311420882751599981479557988847887-62104786278 Q ss_pred HHH--HHHHHHHCCCC-CEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC---HHHHHHHHHCCCEE Q ss_conf 999--99999958998-569993258898805679999999999997269849997487---57976643068858 Q gi|254780750|r 717 MIE--TASILNQATNQ-SFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF---HELTDLSKSLKRFH 786 (920) Q Consensus 717 m~e--~~~IL~~at~~-SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy---~eL~~l~~~~~~v~ 786 (920) -.| ....|++|-.. -+||+||+|.=--..-. . ...++..++..++ .+++-|- |.+.+-......+. T Consensus 85 ~le~i~~~al~rA~~~aDvIIIDEIGpMElks~~--f-~~~ve~vl~~~kp-liatlHrrsr~P~v~~ik~~~~v~ 156 (179) T COG1618 85 GLEEIAIPALRRALEEADVIIIDEIGPMELKSKK--F-REAVEEVLKSGKP-LIATLHRRSRHPLVQRIKKLGGVY 156 (179) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCHHHCCHH--H-HHHHHHHHCCCCC-EEEEEECCCCCHHHHHHHHCCCEE T ss_conf 8899868999988634998999433633020088--9-9999999658993-799996256775899864248779 No 272 >COG4559 ABC-type hemin transport system, ATPase component [Inorganic ion transport and metabolism] Probab=95.13 E-value=0.02 Score=36.52 Aligned_cols=40 Identities=33% Similarity=0.552 Sum_probs=27.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 4434563587777664399996778440789999999999999719853035 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS 686 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~ 686 (920) +-+++++.. ..+-+.-|-|||-+||||+||.. +|-+-|.. T Consensus 16 ll~~vsl~~----~pGev~ailGPNGAGKSTlLk~L--------sGel~p~~ 55 (259) T COG4559 16 LLDGVSLDL----RPGEVLAILGPNGAGKSTLLKAL--------SGELSPDS 55 (259) T ss_pred ECCCCCEEC----CCCCEEEEECCCCCCHHHHHHHH--------HCCCCCCC T ss_conf 104731541----68727999888986588899986--------17637888 No 273 >pfam05729 NACHT NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931. Probab=95.10 E-value=0.18 Score=29.39 Aligned_cols=121 Identities=26% Similarity=0.350 Sum_probs=56.8 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHC------CCCCCHHHCC----CCCCCEEEEEEECCCCCCCCCCHHHHHHHHH Q ss_conf 39999677844078999999999999971------9853035320----6822105676523766113853289999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQM------GSYVPASYAH----IGIVDKLFSRVGSADNLASGRSTFMVEMIET 720 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQi------G~fVPA~~a~----i~~~D~IftRiGa~D~l~~g~STF~vEm~e~ 720 (920) |..+|.|+-..||||++|-+++....-+. =-|+++.... .++.|-|+.--. .+. .+..|. T Consensus 1 r~i~i~G~aG~GKTtll~kl~~~wa~g~~~~~~~~vf~~~~r~~~~~~~~sl~~ll~~~~~--------~~~--~~~~~~ 70 (165) T pfam05729 1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQDFDFVFFLPCRELSRSGEASLADLLFSQWP--------EPA--APVSEV 70 (165) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEHHHHCCCCCCCHHHHHHHHCC--------CCC--CCHHHH T ss_conf 9899982798989999999999998698436972899999567077766899999998767--------745--763789 Q ss_pred -HHHHHHCCCCCEEEE---ECCCCCCCHHHHHHHHHHHHHHHHHH---CCCEEEEECCCHHHHHHHHHC Q ss_conf -999995899856999---32588988056799999999999972---698499974875797664306 Q gi|254780750|r 721 -ASILNQATNQSFVIL---DEIGRGTATLDGLSIAWATIEYLHET---NRCRGLLATHFHELTDLSKSL 782 (920) Q Consensus 721 -~~IL~~at~~SLVll---DElGrGTst~DG~aiA~aile~l~~~---~~~~~lfaTHy~eL~~l~~~~ 782 (920) ..|+.+. .+-|+|+ ||+..--+..+...-....+..|++. .+|+.+++|.-+...++...+ T Consensus 71 ~~~~~~~~-~k~L~ilDGlDE~~~~~~~~~~~~~~~~~l~~ll~~~~lp~~~vliTsRp~~~~~l~~~~ 138 (165) T pfam05729 71 WAVILELP-ERVLLILDGLDELASDLGQLDGPLPVLTLLSSLLRKKLLPGASLLLTSRPDALRDLRRGL 138 (165) T ss_pred HHHHHHCC-CCEEEEECCHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHC T ss_conf 99998397-728999648455144435644457799999999841527886499996803798857764 No 274 >PRK06921 hypothetical protein; Provisional Probab=95.08 E-value=0.26 Score=28.27 Aligned_cols=130 Identities=18% Similarity=0.325 Sum_probs=73.5 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCC---CCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 999967784407899999999999997198---53035320682210567652376611385328999999999999589 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGS---YVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQAT 728 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~---fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at 728 (920) -++++|+-..|| |+|=......+|-+.|- |+|.. .+|..+- -+|.. ..+.-..+++| T Consensus 118 ~l~f~G~~G~GK-ThLa~aIa~~Ll~~~~~~Vly~~~~--------~~~~~lk---------~~~~~-~~~~l~~~~~~- 177 (265) T PRK06921 118 SIALLGQPGSGK-THLLTAAANELMRKKGVPVLYFPFV--------EGFGDLK---------DDFDL-LEAKLNRMKKV- 177 (265) T ss_pred CEEEECCCCCCH-HHHHHHHHHHHHHHCCCEEEEEEHH--------HHHHHHH---------HHHHH-HHHHHHHHHCC- T ss_conf 279972898988-9999999999999629719998879--------9999999---------88888-99999986329- Q ss_pred CCCEEEEECC-----CCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH--HHHHHHHHC-----CCEEEEEEEEEEEC Q ss_conf 9856999325-----88988056799999999999972698499974875--797664306-----88589999999609 Q gi|254780750|r 729 NQSFVILDEI-----GRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH--ELTDLSKSL-----KRFHNATLQVSDSN 796 (920) Q Consensus 729 ~~SLVllDEl-----GrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~--eL~~l~~~~-----~~v~n~~~~~~~~~ 796 (920) .|.|||.| |....|.=.....+.++.|=....+ .++++|-+. +|.++.+.. ...+.+-+.+. + T Consensus 178 --dlLIIDDLfk~~~G~e~~te~~~~~lf~iIN~Ry~~~k-ptIiSSNl~~~~L~~i~e~i~SRi~emc~~~~v~~~--G 252 (265) T PRK06921 178 --EVLFIDDLFKPVNGKPRATEWQIEQTYSVVNYRYLNHK-PILISSELTIDELLDIDEALGSRIVEMCKDYLVIIN--G 252 (265) T ss_pred --CEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CEEEECCCCHHHHHHHHHHHHHHHHHHCCCCEEEEE--C T ss_conf --99998221223479878988999999999999997699-989986899899987637988889997257489996--7 Q ss_pred CEEEEEEEEE Q ss_conf 9277877774 Q gi|254780750|r 797 EGIIFLHKVI 806 (920) Q Consensus 797 ~~i~flykl~ 806 (920) ++.-..|||. T Consensus 253 ~~~~ln~rl~ 262 (265) T PRK06921 253 DSFLLNHRLE 262 (265) T ss_pred CHHHHHHHHC T ss_conf 4113433421 No 275 >cd00984 DnaB_C DnaB helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=94.89 E-value=0.18 Score=29.46 Aligned_cols=42 Identities=19% Similarity=0.169 Sum_probs=30.0 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCC Q ss_conf 6439999677844078999999999999971985303532068 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIG 691 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~ 691 (920) .+.+++|+|+--.|||||+.++|+.... +-|-.|--=+.+++ T Consensus 12 ~G~L~vi~a~~g~GKS~~~~~la~~~a~-~~g~~V~~~SlEm~ 53 (242) T cd00984 12 PGDLIIIAARPSMGKTAFALNIAENIAK-KQGKPVLFFSLEMS 53 (242) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHH-HCCCCEEEEECCCC T ss_conf 9818999968999999999999999999-77995999933353 No 276 >PRK10789 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=94.87 E-value=0.08 Score=32.07 Aligned_cols=33 Identities=24% Similarity=0.424 Sum_probs=24.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) |=+|+++.. ..+...-|.||+.+||||+++-+. T Consensus 330 vL~~isl~I----~~Ge~vaIVG~SGsGKSTLl~LL~ 362 (569) T PRK10789 330 ALENVNFTL----KPGQMLGICGPTGSGKSTLLSLIQ 362 (569) T ss_pred HHCCCCCEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 230765688----899789987999998799999999 No 277 >cd03279 ABC_sbcCD SbcCD and other Mre11/Rad50 (MR) complexes are implicated in the metabolism of DNA ends. They cleave ends sealed by hairpin structures and are thought to play a role in removing protein bound to DNA termini. Probab=94.74 E-value=0.052 Score=33.45 Aligned_cols=35 Identities=29% Similarity=0.445 Sum_probs=24.9 Q ss_pred EEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 34563587777664399996778440789999999999 Q gi|254780750|r 637 NDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV 674 (920) Q Consensus 637 Ndi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v 674 (920) +.|.++. .++..+.+|+|||.+||||.+-.+..+- T Consensus 18 ~~IDF~~---~~~~~lflI~G~nGsGKSTIlDAI~~aL 52 (213) T cd03279 18 QVIDFTG---LDNNGLFLICGPTGAGKSTILDAITYAL 52 (213) T ss_pred EEEECCC---CCCCCEEEEECCCCCCHHHHHHHHHHHH T ss_conf 1784876---7878889998899997889999999998 No 278 >PRK03918 chromosome segregation protein; Provisional Probab=94.67 E-value=0.033 Score=34.91 Aligned_cols=71 Identities=24% Similarity=0.236 Sum_probs=41.9 Q ss_pred CCCCCCCCHHHHHHH---HHHHHHHHCCCCCEEEEECCCCCCCHHHHHHH--HHHHHHHHHHHCCCEEEEECCCHHHHHH Q ss_conf 661138532899999---99999995899856999325889880567999--9999999997269849997487579766 Q gi|254780750|r 704 DNLASGRSTFMVEMI---ETASILNQATNQSFVILDEIGRGTATLDGLSI--AWATIEYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 704 D~l~~g~STF~vEm~---e~~~IL~~at~~SLVllDElGrGTst~DG~ai--A~aile~l~~~~~~~~lfaTHy~eL~~l 778 (920) +.|+.|.+ |++=+. -.+..+ +.+-.+++|||. |.+.|-..+ +.-+++.+... +..+++.||-.+|.+. T Consensus 790 ~~LSGGE~-~~~aLAL~Lals~~~--~~~~~~lflDEg---fg~LD~~~~~~~~~~L~~l~~~-~~qi~IISH~~el~~~ 862 (882) T PRK03918 790 GFLSGGER-IALGLAFRLALSMYL--AGEISLLILDEP---TPFLDEERRRKLVDIMERYLRK-IPQVIIVSHDEELKDA 862 (882) T ss_pred CCCCHHHH-HHHHHHHHHHHHHHH--HCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHCC-CCEEEEEECHHHHHHH T ss_conf 77888899-999999999999987--149994998799---7887989999999999998658-9989999575789986 Q ss_pred HHH Q ss_conf 430 Q gi|254780750|r 779 SKS 781 (920) Q Consensus 779 ~~~ 781 (920) .+. T Consensus 863 ~d~ 865 (882) T PRK03918 863 ADH 865 (882) T ss_pred CCC T ss_conf 796 No 279 >cd01120 RecA-like_NTPases RecA-like NTPases. This family includes the NTP binding domain of F1 and V1 H+ATPases, DnaB and related helicases as well as bacterial RecA and related eukaryotic and archaeal recombinases. This group also includes bacterial conjugation proteins and related DNA transfer proteins involved in type II and type IV secretion. Probab=94.61 E-value=0.05 Score=33.56 Aligned_cols=86 Identities=27% Similarity=0.339 Sum_probs=46.5 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHH-HHCCCCCCHHHCCCCCCCEE--EEEEECCCCCC----CCCCHHHHHHHHHHHHHH Q ss_conf 999677844078999999999999-97198530353206822105--67652376611----385328999999999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIM-AQMGSYVPASYAHIGIVDKL--FSRVGSADNLA----SGRSTFMVEMIETASILN 725 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vil-AQiG~fVPA~~a~i~~~D~I--ftRiGa~D~l~----~g~STF~vEm~e~~~IL~ 725 (920) .+|+||-..||||++.|++..+.- .....||-.+...-.++.++ +..-++.+++. ..-.+......+.+..+. T Consensus 2 ~li~g~~g~GKttl~~~~~~~~~~~~~~~~~~~~ee~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (165) T cd01120 2 ILVFGPTGSGKTTLALQLALNIATKGGKVVYVDIEEEIEELTERLIGESLKGALDNLIIVFATADDPAAARLLSKAERLR 81 (165) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 89998999989999999999987639979999866644899999998622467130799935999769999999999999 Q ss_pred HCCCCCEEEEECC Q ss_conf 5899856999325 Q gi|254780750|r 726 QATNQSFVILDEI 738 (920) Q Consensus 726 ~at~~SLVllDEl 738 (920) ......||++|.+ T Consensus 82 ~~~~~vliiiDSi 94 (165) T cd01120 82 ERGGDDLIILDEL 94 (165) T ss_pred HCCCCEEEEEECH T ss_conf 8699779999288 No 280 >pfam00004 AAA ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes. Probab=94.56 E-value=0.38 Score=27.00 Aligned_cols=103 Identities=20% Similarity=0.220 Sum_probs=49.0 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC--CCC Q ss_conf 999677844078999999999999971985303532068221056765237661138532899999999999958--998 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA--TNQ 730 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a--t~~ 730 (920) +++.||--.|||++.|.+| .++|..+ .++..-|-+-. ..|.+. ..+..+++.| ... T Consensus 1 iLl~GppGtGKT~~a~~la-----~~~~~~~----~~v~~~~~~~~--------~~g~~~-----~~i~~~f~~a~~~~p 58 (131) T pfam00004 1 LLLYGPPGTGKTTLAKAVA-----KELGAPF----IEISGSELVSK--------YVGESE-----KRLRELFEAAKKLAP 58 (131) T ss_pred CEEECCCCCCHHHHHHHHH-----HHHCCCC----EECCCCCCCCC--------CCCHHH-----HHHHHHHHHHHHCCC T ss_conf 9878999999999999999-----9978985----33242012223--------345068-----889999999997499 Q ss_pred CEEEEECCC----CCCCHHH--HHHHHHHHHHHHHHH----CCCEEEEECCCHHHHH Q ss_conf 569993258----8988056--799999999999972----6984999748757976 Q gi|254780750|r 731 SFVILDEIG----RGTATLD--GLSIAWATIEYLHET----NRCRGLLATHFHELTD 777 (920) Q Consensus 731 SLVllDElG----rGTst~D--G~aiA~aile~l~~~----~~~~~lfaTHy~eL~~ 777 (920) +.+++||+- +...+.+ +..+..+.+..+-.. .+...+.+|.+.+.-+ T Consensus 59 ~Il~iDe~d~l~~~~~~~~~~~~~~~~~~ll~~ld~~~~~~~~v~~I~tTN~~~~ld 115 (131) T pfam00004 59 CVIFIDEIDALAGSRGSGGDSESRRVVNQLLTELDGFTSSLSKVIVIAATNRPDKLD 115 (131) T ss_pred CEEEECHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEECCCHHHCC T ss_conf 189831167775167888887513268789999850224688769999759904499 No 281 >PRK06835 DNA replication protein DnaC; Validated Probab=94.47 E-value=0.4 Score=26.85 Aligned_cols=102 Identities=16% Similarity=0.156 Sum_probs=56.4 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC---CHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 99996778440789999999999999719853---035320682210567652376611385328999999999999589 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV---PASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQAT 728 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fV---PA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at 728 (920) =++++||-..|||-+.=.+|-.++- -|--| .|.. +++.| -. .-+.+... ..+.-..+.+|. T Consensus 185 nLlf~G~~G~GKTfLa~~IA~ell~--~g~sViy~ta~~----L~~~l----~~--~~~~~~~~----~~~~~~~l~~~D 248 (330) T PRK06835 185 NLLFYGNTGTGKTFLSNCIAKELLD--RGKTVIYRTSDE----LIENL----RE--IRFNNDND----APELEDLLINCD 248 (330) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHH--CCCEEEEEEHHH----HHHHH----HH--HHCCCCCC----HHHHHHHHHHCC T ss_conf 6698899999889999999999998--799499962999----99999----99--75457644----899999996189 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 985699932588988056799999999999972698499974875 Q gi|254780750|r 729 NQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 729 ~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) |.|||+||--..|.=..+.-+-++.+-... +..|+++|-+. T Consensus 249 ---LLIIDDLG~E~~t~~~~~~Lf~iIN~R~~~-~k~tIITTNl~ 289 (330) T PRK06835 249 ---LLIIDDLGTESITEFSKTELFNLINKRLLM-NKKMIISTNLS 289 (330) T ss_pred ---EEEEECCCCCCCCHHHHHHHHHHHHHHHHC-CCCEEEECCCC T ss_conf ---899721034558868999999999999867-99979988999 No 282 >PRK13342 recombination factor protein RarA; Reviewed Probab=94.41 E-value=0.41 Score=26.76 Aligned_cols=13 Identities=15% Similarity=0.243 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHHH Q ss_conf 1113456664334 Q gi|254780750|r 534 RFTTLELIDLENR 546 (920) Q Consensus 534 Rf~t~eL~~l~~~ 546 (920) +|..|.-...+.+ T Consensus 393 ~fY~p~~~g~E~~ 405 (417) T PRK13342 393 RYYEPTERGFEKK 405 (417) T ss_pred EEECCCCCCHHHH T ss_conf 4558999718999 No 283 >COG0497 RecN ATPase involved in DNA repair [DNA replication, recombination, and repair] Probab=94.36 E-value=0.42 Score=26.69 Aligned_cols=113 Identities=19% Similarity=0.312 Sum_probs=64.3 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHHHHCCCEEEE Q ss_conf 53289999999999995899856999325889880567999999999999726-98499974875797664306885899 Q gi|254780750|r 710 RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETN-RCRGLLATHFHELTDLSKSLKRFHNA 788 (920) Q Consensus 710 ~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~-~~~~lfaTHy~eL~~l~~~~~~v~n~ 788 (920) .|-||. -.+.|+.....---+|+||+--|-|- +.|.+|-+.|..-. ++-+|..||.+.++.+++. T Consensus 437 LSRimL---Alk~i~~~~~~~ptlIFDEVD~GIsG----~~A~aVg~~L~~Ls~~~QVl~VTHlPQVAa~ad~------- 502 (557) T COG0497 437 LSRIML---ALKVILSRKDDTPTLIFDEVDTGISG----RVAQAVGKKLRRLSEHHQVLCVTHLPQVAAMADT------- 502 (557) T ss_pred HHHHHH---HHHHHHHCCCCCCEEEEECCCCCCCH----HHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHCC------- T ss_conf 999999---99999716689985997464579975----9999999999998369649999357878752025------- Q ss_pred EEEEEE-ECCEEEEEEEEEECCCCCCHHHHHHHHCC---CCHHHHHHHHHHHHH Q ss_conf 999996-09927787777447898877899999829---998999999999999 Q gi|254780750|r 789 TLQVSD-SNEGIIFLHKVIPGIADHSYGIQVGKLAG---LPNTVISRAYDILKT 838 (920) Q Consensus 789 ~~~~~~-~~~~i~flykl~~G~~~~Sygi~vA~laG---~p~~vi~~A~~~~~~ 838 (920) |+-|.. ..++.|-. ++++= ...-===++|||.| +-+..+..|+++++. T Consensus 503 H~~V~K~~~~~~T~s-~V~~L-~~eeRveEiARMl~G~~iT~~a~a~AkeLL~~ 554 (557) T COG0497 503 HFLVEKESEDGRTES-RVRPL-DKEERVEEIARMLGGSEVTDEALAHAKELLAA 554 (557) T ss_pred EEEEEEECCCCCEEE-EEEEC-CHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 589997247882677-63538-87678999999856843149999999999975 No 284 >COG3842 PotA ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism] Probab=94.36 E-value=0.067 Score=32.62 Aligned_cols=136 Identities=25% Similarity=0.262 Sum_probs=74.2 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH--------HCCCCCCHHHCCCCCCC---EEEEEEE Q ss_conf 044434563587777664399996778440789999999999999--------71985303532068221---0567652 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMA--------QMGSYVPASYAHIGIVD---KLFSRVG 701 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilA--------QiG~fVPA~~a~i~~~D---~IftRiG 701 (920) ..+=.|++|+. ..+-|.-+=||.-+||||+||.+|=..--. +-=.-||+..=.++.|= .+|--|- T Consensus 18 ~~al~~isl~i----~~Gef~tlLGPSGcGKTTlLR~IAGfe~p~~G~I~l~g~~i~~lpp~kR~ig~VFQ~YALFPHmt 93 (352) T COG3842 18 FTAVDDISLDI----KKGEFVTLLGPSGCGKTTLLRMIAGFEQPSSGEILLDGEDITDVPPEKRPIGMVFQSYALFPHMT 93 (352) T ss_pred EEEEECCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHCCCCEEECCCCCCCCCC T ss_conf 26773214454----48868999899888889999999677788886599999998889942265232606766688885 Q ss_pred CCCCCCCCCC-------H----HHHHHHHHHHH--------------------HH--HCCCCCEEEEECCCCCCCHHH-- Q ss_conf 3766113853-------2----89999999999--------------------99--589985699932588988056-- Q gi|254780750|r 702 SADNLASGRS-------T----FMVEMIETASI--------------------LN--QATNQSFVILDEIGRGTATLD-- 746 (920) Q Consensus 702 a~D~l~~g~S-------T----F~vEm~e~~~I--------------------L~--~at~~SLVllDElGrGTst~D-- 746 (920) -.||+.=|+. . ...||.+.-.+ |- -+.+-.+.|+||-- |..| T Consensus 94 V~~NVafGLk~~~~~~~~ei~~rV~e~L~lV~L~~~~~R~p~qLSGGQqQRVALARAL~~~P~vLLLDEPl---SaLD~k 170 (352) T COG3842 94 VEENVAFGLKVRKKLKKAEIKARVEEALELVGLEGFADRKPHQLSGGQQQRVALARALVPEPKVLLLDEPL---SALDAK 170 (352) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHCHHHHCHHHHHHHHHHHHHHCCCCHHHHCCCC---CCHHHH T ss_conf 89975533310577877899999999998748544444276664827899999999742183544342740---023189 Q ss_pred -HHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf -799999999999972698499974875797 Q gi|254780750|r 747 -GLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 747 -G~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) ...+ ..-+..+++..+-.++|.||.++=+ T Consensus 171 LR~~m-r~Elk~lq~~~giT~i~VTHDqeEA 200 (352) T COG3842 171 LREQM-RKELKELQRELGITFVYVTHDQEEA 200 (352) T ss_pred HHHHH-HHHHHHHHHHCCCEEEEEECCHHHH T ss_conf 99999-9999999985597299997898998 No 285 >COG3950 Predicted ATP-binding protein involved in virulence [General function prediction only] Probab=94.09 E-value=0.066 Score=32.70 Aligned_cols=38 Identities=18% Similarity=0.057 Sum_probs=17.5 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC-EEEEECCC Q ss_conf 5699932588988056799999999999972698-49997487 Q gi|254780750|r 731 SFVILDEIGRGTATLDGLSIAWATIEYLHETNRC-RGLLATHF 772 (920) Q Consensus 731 SLVllDElGrGTst~DG~aiA~aile~l~~~~~~-~~lfaTHy 772 (920) -.||+||+-=--.|. --.-|++.|...-.. ..+.+||- T Consensus 297 givLiDeIdlflhP~----WQQqi~qkL~saFp~IQfIvstHs 335 (440) T COG3950 297 GIVLIDEIDLFLHPK----WQQQINQKLLSAFPEIQFIVSTHS 335 (440) T ss_pred CEEEEEHHHHHCCHH----HHHHHHHHHHHHCHHHHHHHHCCC T ss_conf 669742201102888----999999999864434500233388 No 286 >TIGR00618 sbcc exonuclease SbcC; InterPro: IPR004592 All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. SbcC may have nuclease activity that is functionally related to one of the nuclease activities of the RecBCD enzyme (IPR004586 from INTERPRO).; GO: 0004527 exonuclease activity, 0006259 DNA metabolic process. Probab=94.09 E-value=0.056 Score=33.21 Aligned_cols=72 Identities=24% Similarity=0.321 Sum_probs=48.2 Q ss_pred CCCCCCCHHHHHHH---HHHHHHH--HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHH Q ss_conf 61138532899999---9999999--58998569993258898805679999999999997269849997487579766 Q gi|254780750|r 705 NLASGRSTFMVEMI---ETASILN--QATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 705 ~l~~g~STF~vEm~---e~~~IL~--~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l 778 (920) .++.|. ||||-++ -+|..|. +.|.=-+.+||| |-||=-.||.-=|.++|+.|-....|..=++.|-|++.+- T Consensus 970 tlSGGE-~Fl~slSlaLAladl~~~~~~~~L~sLfiDE-GFGslD~~~~d~~~~~L~ai~~~~~~~iGv~SHv~~~~er 1046 (1063) T TIGR00618 970 TLSGGE-TFLASLSLALALADLLSQRGGTALDSLFIDE-GFGSLDEDSLDRAVEILDAIKEDVAKMIGVISHVPEFRER 1046 (1063) T ss_pred CCCCHH-HHHHHHHHHHHHHHHHHHCCCHHHHHHHHCC-CCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHH T ss_conf 344179-9999999999999998630551335543205-7777777789899999999884489558654075889742 No 287 >TIGR00968 3a0106s01 sulfate ABC transporter, ATP-binding protein; InterPro: IPR005666 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). These proteins are involved in the transmembrane transport of sulphate and thiosulphate.; GO: 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0016020 membrane. Probab=94.08 E-value=0.065 Score=32.72 Aligned_cols=149 Identities=27% Similarity=0.378 Sum_probs=80.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH--------HHHHHCCCCCCHHHCCCCCCC---EEEEEEECC Q ss_conf 443456358777766439999677844078999999999--------999971985303532068221---056765237 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI--------VIMAQMGSYVPASYAHIGIVD---KLFSRVGSA 703 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~--------vilAQiG~fVPA~~a~i~~~D---~IftRiGa~ 703 (920) +=+||+|.. .++++.=+=||--|||||+||.+|=+ .|--|=-+-+++..=.||-|= .+|--+--. T Consensus 15 al~~v~l~v----~~G~lvaLLGPSGSGKsTLLR~iAGLe~pd~G~I~~~G~D~t~~~~~~R~iGFVFQ~YAlF~HlTv~ 90 (241) T TIGR00968 15 ALDDVDLEV----PTGSLVALLGPSGSGKSTLLRVIAGLEQPDSGRIILNGRDATRVKLRDREIGFVFQHYALFKHLTVR 90 (241) T ss_pred EEEEEEEEE----CCCEEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEHHHHCCCCCHH T ss_conf 753455574----3852798546898737899999835799984269985200221320136212277001312565111 Q ss_pred CCCCCCCCHH-----------------HHHHHHHHHH-----------------HH--HCCCCCEEEEECCCCCCCHHHH Q ss_conf 6611385328-----------------9999999999-----------------99--5899856999325889880567 Q gi|254780750|r 704 DNLASGRSTF-----------------MVEMIETASI-----------------LN--QATNQSFVILDEIGRGTATLDG 747 (920) Q Consensus 704 D~l~~g~STF-----------------~vEm~e~~~I-----------------L~--~at~~SLVllDElGrGTst~DG 747 (920) |||+=|..-= ..+|.+.+.. |- -|-+-...||||-.- ..|. T Consensus 91 ~NiAFGL~~~p~~~k~~~~~~k~~V~~LL~lvqL~~l~~rYP~QLSGGQrQRvALARALAv~P~vLLLDEPFg---ALDA 167 (241) T TIGR00968 91 DNIAFGLEIRPRREKHPKAKIKARVEELLELVQLEGLGDRYPNQLSGGQRQRVALARALAVEPQVLLLDEPFG---ALDA 167 (241) T ss_pred HHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCHH---HHHH T ss_conf 0100142230210267578899999999998746544312742035733789999988633981576208714---5428 Q ss_pred ---HHHHHHHHHHHHHHCCCEEEEECCCH-HHHHHHHHCCCEEEEEEE Q ss_conf ---99999999999972698499974875-797664306885899999 Q gi|254780750|r 748 ---LSIAWATIEYLHETNRCRGLLATHFH-ELTDLSKSLKRFHNATLQ 791 (920) Q Consensus 748 ---~aiA~aile~l~~~~~~~~lfaTHy~-eL~~l~~~~~~v~n~~~~ 791 (920) --+ .+=|..||+.+.-.++|.||.. |=.+.++..--+.+..++ T Consensus 168 kvRk~L-R~WLR~LH~e~~~T~VfVTHD~~EA~evAd~ivv~~~G~i~ 214 (241) T TIGR00968 168 KVRKEL-RAWLRKLHDEVHVTTVFVTHDQEEAMEVADRIVVLSNGKIE 214 (241) T ss_pred HHHHHH-HHHHHHHCCEEEEEEEEEECCHHHHHHHHHHHHHHCCCCEE T ss_conf 999999-99998740305677999862858998887440132178278 No 288 >KOG0922 consensus Probab=94.04 E-value=0.048 Score=33.71 Aligned_cols=29 Identities=7% Similarity=0.086 Sum_probs=19.9 Q ss_pred CCCCHHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 65617669999999998898599995028 Q gi|254780750|r 84 CGVPVHTANHYIQKLIKIGHRIAICEQIE 112 (920) Q Consensus 84 aGvP~~~l~~yl~~Lv~~GykVaiveQ~E 112 (920) -+.|.......+=..|+..--+.|+-.|. T Consensus 48 ~~LPI~~~r~~il~~ve~nqvlIviGeTG 76 (674) T KOG0922 48 ESLPIYKYRDQILYAVEDNQVLIVIGETG 76 (674) T ss_pred CCCCHHHHHHHHHHHHHHCCEEEEECCCC T ss_conf 15998999999999998787799984898 No 289 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=93.98 E-value=0.062 Score=32.87 Aligned_cols=33 Identities=27% Similarity=0.481 Sum_probs=23.3 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 4345635877776643999967784407899999999 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval 672 (920) =+|++|.. ..+.+..|-||.-+||||+||.++= T Consensus 44 v~dVsl~I----~~GEi~~ivG~SGsGKSTLlr~i~g 76 (400) T PRK10070 44 VKDASLAI----EEGEIFVIMGLSGSGKSTMVRLLNR 76 (400) T ss_pred EECCEEEE----CCCCEEEEECCCCCHHHHHHHHHHC T ss_conf 97407688----7999999999998469999999975 No 290 >TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria. Probab=93.93 E-value=0.1 Score=31.34 Aligned_cols=36 Identities=17% Similarity=0.126 Sum_probs=21.1 Q ss_pred HCCEEECCCCCCCCCCCC--------------CCCCHHHHHHHHHHHHHC Q ss_conf 181891078899888762--------------656176699999999988 Q gi|254780750|r 66 LGITLTKRGKHLGKDIPM--------------CGVPVHTANHYIQKLIKI 101 (920) Q Consensus 66 L~i~lt~r~~~~~~~~pm--------------aGvP~~~l~~yl~~Lv~~ 101 (920) =+|+|....++.|+..-| -+-+....+.|+..+++. T Consensus 28 k~i~li~G~NG~GKTTll~Ai~~~LYG~~~~~~~~~~~~y~~~~~~~~n~ 77 (650) T TIGR03185 28 KPIILIGGLNGAGKTTLLDAIQLGLYGKRAYCSKRGNQSYEQYLHGYINR 77 (650) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHC T ss_conf 98799977999978999999999956951003454544699999998612 No 291 >COG4138 BtuD ABC-type cobalamin transport system, ATPase component [Coenzyme metabolism] Probab=93.89 E-value=0.51 Score=26.03 Aligned_cols=22 Identities=45% Similarity=0.755 Sum_probs=17.3 Q ss_pred CCCEEEEEECCCCCCHHHHHHH Q ss_conf 6643999967784407899999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQ 669 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRq 669 (920) ..+-++-+-|||.+||||+|-. T Consensus 23 ~aGe~~HliGPNGaGKSTLLA~ 44 (248) T COG4138 23 RAGEILHLVGPNGAGKSTLLAR 44 (248) T ss_pred CCCEEEEEECCCCCCHHHHHHH T ss_conf 4660799987898658899999 No 292 >KOG0065 consensus Probab=93.89 E-value=0.13 Score=30.52 Aligned_cols=112 Identities=28% Similarity=0.347 Sum_probs=61.9 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHC--CCCCCHHHC--CCCCCCEEEEEEE---CCCCCCCCCCHH------------- Q ss_conf 99677844078999999999999971--985303532--0682210567652---376611385328------------- Q gi|254780750|r 654 LLTGPNMGGKSTFLRQNALIVIMAQM--GSYVPASYA--HIGIVDKLFSRVG---SADNLASGRSTF------------- 713 (920) Q Consensus 654 iiTGpNmgGKSt~lRqval~vilAQi--G~fVPA~~a--~i~~~D~IftRiG---a~D~l~~g~STF------------- 713 (920) =|-|++-+||||+|.- |||= |-+|--+=. -.|..+.-|.|+- .++||...+.|- T Consensus 821 ALMG~SGAGKTTLLdv------LA~R~t~G~I~Gdi~i~G~p~~q~tF~R~~GYvqQ~DiH~~~~TVrESL~fSA~LRlp 894 (1391) T KOG0065 821 ALMGESGAGKTTLLDV------LAGRKTGGYIEGDILISGFPKDQETFARVSGYVEQQDIHSPELTVRESLRFSAALRLP 894 (1391) T ss_pred EHHCCCCCCHHHHHHH------HHCCCCCCEEEEEEEECCEECCHHHHCCCCCEEECCCCCCCCCCHHHHHHHHHHHCCC T ss_conf 0124778765779999------8567446568757898883273665123101142256567540419899999997187 Q ss_pred -----------------HHHHHHHHHHHH---------------------HCCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf -----------------999999999999---------------------589985699932588988056799999999 Q gi|254780750|r 714 -----------------MVEMIETASILN---------------------QATNQSFVILDEIGRGTATLDGLSIAWATI 755 (920) Q Consensus 714 -----------------~vEm~e~~~IL~---------------------~at~~SLVllDElGrGTst~DG~aiA~ail 755 (920) .+||.+.+..|- -|.|.||+.|||- ||-.|.-| ||.|+ T Consensus 895 ~~v~~~ek~~yVe~Vi~lleL~~~~daiVG~~G~GLs~eQRKrLTIgVELvA~P~~ilFLDEP---TSGLDsqa-A~~i~ 970 (1391) T KOG0065 895 KEVSDEEKYEYVEEVIELLELKEYADALVGLPGSGLSTEQRKRLTIGVELVANPSSILFLDEP---TSGLDSQA-AAIVM 970 (1391) T ss_pred CCCCHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCHHHHCEEEEEEEEECCCCEEEEECCC---CCCCCHHH-HHHHH T ss_conf 769978999999999998376256655506888888977832356899983287556885699---87745789-99999 Q ss_pred HHH---HHHCCCEEEEECCCHHHH Q ss_conf 999---972698499974875797 Q gi|254780750|r 756 EYL---HETNRCRGLLATHFHELT 776 (920) Q Consensus 756 e~l---~~~~~~~~lfaTHy~eL~ 776 (920) +.+ .+. |=-.|.+-|=+... T Consensus 971 ~~lrkla~t-GqtIlCTIHQPS~~ 993 (1391) T KOG0065 971 RFLRKLADT-GQTILCTIHQPSID 993 (1391) T ss_pred HHHHHHHHC-CCEEEEEECCCCHH T ss_conf 999999864-88699981587089 No 293 >cd03241 ABC_RecN RecN ATPase involved in DNA repair; ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=93.86 E-value=0.52 Score=26.00 Aligned_cols=92 Identities=15% Similarity=0.202 Sum_probs=52.9 Q ss_pred HHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECC Q ss_conf 99999995899856999325889880567999999999999726-98499974875797664306885899999996099 Q gi|254780750|r 719 ETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETN-RCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNE 797 (920) Q Consensus 719 e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~-~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~ 797 (920) -.+.++.....-..+++||+-.|.|- ..|++|-+.|.+.. .+-++..||.+.++.+++.+ |.+.=...++ T Consensus 182 Alk~~~~~~~~~~tliFDEID~GigG----~~A~~vg~~l~~ls~~~QVi~ITHlpQvAa~ad~H-----~~V~K~~~~~ 252 (276) T cd03241 182 ALKAILARKDAVPTLIFDEIDTGISG----EVAQAVGKKLKELSRSHQVLCITHLPQVAAMADNH-----FLVEKEVEGG 252 (276) T ss_pred HHHHHHCCCCCCCEEEEECCCCCCCH----HHHHHHHHHHHHHHCCCEEEEEECHHHHHHCCCCE-----EEEEEEECCC T ss_conf 99999627889985798445789898----99999999999984798799981779988413767-----9999980499 Q ss_pred -EEEEEEEEEECCCCCCHHHHHHHHCC Q ss_conf -27787777447898877899999829 Q gi|254780750|r 798 -GIIFLHKVIPGIADHSYGIQVGKLAG 823 (920) Q Consensus 798 -~i~flykl~~G~~~~Sygi~vA~laG 823 (920) ..+-..+| ....==-++|||.| T Consensus 253 ~t~t~i~~L----~~~eRv~EiARMl~ 275 (276) T cd03241 253 RTVTKVREL----DKEERVEEIARMLS 275 (276) T ss_pred EEEEEEEEC----CHHHHHHHHHHHHC T ss_conf 289999999----86089999999728 No 294 >cd03277 ABC_SMC5_euk Eukaryotic SMC5 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (for Probab=93.81 E-value=0.069 Score=32.52 Aligned_cols=103 Identities=24% Similarity=0.431 Sum_probs=55.2 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHH-----HH---HHCCCCCC--HHHCC--CCCCC-----------------EE--E Q ss_conf 64399996778440789999999999-----99---97198530--35320--68221-----------------05--6 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIV-----IM---AQMGSYVP--ASYAH--IGIVD-----------------KL--F 697 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~v-----il---AQiG~fVP--A~~a~--i~~~D-----------------~I--f 697 (920) ..++-+|.|||-+|||+.+-.+++.- +| ..+|.||- ++++. |.+.. |+ | T Consensus 22 gp~lN~IiGpNGSGKSsIv~AI~lgLGG~p~~lgRa~~v~~fVK~G~~~~~iEIeL~~~~~~iqvdNLcqFLPQdrV~eF 101 (213) T cd03277 22 GPSLNMIIGPNGSGKSSIVCAICLGLGGKPKLLGRAKKVGEFVKRGCDEGTIEIELYGNPGNIQVDNLCQFLPQDRVGEF 101 (213) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCCHHHCCCHHHHHHHHHCCCCCEEEEEEEECCCCCEEECCHHHHCCHHHHHHH T ss_conf 99757998899887899999999881898000452656999985787642899999659997565405442762778887 Q ss_pred EEE-----------ECCCC------CCCC---CCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 765-----------23766------1138---532899999999999958998569993258898805679999999999 Q gi|254780750|r 698 SRV-----------GSADN------LASG---RSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEY 757 (920) Q Consensus 698 tRi-----------Ga~D~------l~~g---~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~ 757 (920) +++ ++++- .+.| .||-+-=| -|...|+.-+=++|||--|-++.---- |-++ T Consensus 102 a~l~p~eLLVKFRk~~~L~~L~~~~QSGGErsvst~~yll-----alQ~l~~~pFr~vDEINQgmD~~nEr~----v~~~ 172 (213) T cd03277 102 AKLSPIELLVKFREGEQLQELDPHHQSGGERSVSTMLYLL-----SLQELTRCPFRVVDEINQGMDPTNERK----VFDM 172 (213) T ss_pred HHCCHHHHHHHHHHHCCCCCCCHHHHHCCHHHHHHHHHHH-----HHHHHHCCCCEEEEHHHCCCCCHHHHH----HHHH T ss_conf 7079799999998520355578778712055899999999-----997753499278534335798178999----9999 Q ss_pred HHH Q ss_conf 997 Q gi|254780750|r 758 LHE 760 (920) Q Consensus 758 l~~ 760 (920) +.+ T Consensus 173 ~v~ 175 (213) T cd03277 173 LVE 175 (213) T ss_pred HHH T ss_conf 999 No 295 >PRK02224 chromosome segregation protein; Provisional Probab=93.81 E-value=0.071 Score=32.45 Aligned_cols=75 Identities=20% Similarity=0.222 Sum_probs=42.4 Q ss_pred CCCCCCCCHHHHHHHH--HHHHHHH-C---CCCCEEEEECCCCCCCHHHHHHH--HHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 6611385328999999--9999995-8---99856999325889880567999--9999999997269849997487579 Q gi|254780750|r 704 DNLASGRSTFMVEMIE--TASILNQ-A---TNQSFVILDEIGRGTATLDGLSI--AWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 704 D~l~~g~STF~vEm~e--~~~IL~~-a---t~~SLVllDElGrGTst~DG~ai--A~aile~l~~~~~~~~lfaTHy~eL 775 (920) +.++.|+.+...=-.- ++.+|.. + .+-.+++|||. |.+.|...+ ...++..|.+......++.||-.+| T Consensus 780 ~~LSGGE~~~~aLaLrlAl~~~l~~~~~g~~~~~~lilDE~---~~~LD~~~~~~l~~~l~~l~~~~~~qiiiITH~~el 856 (880) T PRK02224 780 KQLSGGERALFNLSLRCAIYRLLAEGIGGDAPLPPLILDEP---TVFLDSGHVSQLVKLIEMMRRIGVEQIIVVSHDDEL 856 (880) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECC---CCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECHHHH T ss_conf 88887899999999999999999873378888980788599---988798899999999999872899989999554889 Q ss_pred HHHHHH Q ss_conf 766430 Q gi|254780750|r 776 TDLSKS 781 (920) Q Consensus 776 ~~l~~~ 781 (920) .+.++. T Consensus 857 ~~~~d~ 862 (880) T PRK02224 857 IAAADH 862 (880) T ss_pred HHHCCE T ss_conf 986896 No 296 >PRK01156 chromosome segregation protein; Provisional Probab=93.81 E-value=0.063 Score=32.81 Aligned_cols=74 Identities=24% Similarity=0.326 Sum_probs=41.9 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHH-CCCCCEEEEECCCCCCCHHHHH--HHHHHHHHHHHHH-CCC-EEEEECCCHHHHHH Q ss_conf 66113853289999999999995-8998569993258898805679--9999999999972-698-49997487579766 Q gi|254780750|r 704 DNLASGRSTFMVEMIETASILNQ-ATNQSFVILDEIGRGTATLDGL--SIAWATIEYLHET-NRC-RGLLATHFHELTDL 778 (920) Q Consensus 704 D~l~~g~STF~vEm~e~~~IL~~-at~~SLVllDElGrGTst~DG~--aiA~aile~l~~~-~~~-~~lfaTHy~eL~~l 778 (920) +.|+.|++ |++=+.=.-.|.+- +.+-.+++|||. |.+.|-- .-...+++++... ... .+++.||-.+|.+. T Consensus 800 ~~LSGGE~-~~~aLaL~laL~~~~~~~~~~~ilDE~---~~~LD~~~~~~l~~~l~~~~~~~~~~~qi~IISH~~el~~~ 875 (895) T PRK01156 800 DSLSGGEK-TAVAFALRVAVAQFLNNDKSLLIMDEP---TAFLDEDRRTNLKDIIEYSLKDSSDIPQVIMISHHRELLSV 875 (895) T ss_pred CCCCHHHH-HHHHHHHHHHHHHHHHCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH T ss_conf 67887799-999999999999998458995998599---87758778999999999998636898879998244889985 Q ss_pred HHH Q ss_conf 430 Q gi|254780750|r 779 SKS 781 (920) Q Consensus 779 ~~~ 781 (920) .+. T Consensus 876 ~d~ 878 (895) T PRK01156 876 ADV 878 (895) T ss_pred CCC T ss_conf 893 No 297 >COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair] Probab=93.76 E-value=0.072 Score=32.41 Aligned_cols=73 Identities=21% Similarity=0.309 Sum_probs=44.8 Q ss_pred CCCCCCCCHHHHHHHH---HHHHHHHCCCCCEEEEECCCCCCCHH--HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHH Q ss_conf 6611385328999999---99999958998569993258898805--679999999999997269849997487579766 Q gi|254780750|r 704 DNLASGRSTFMVEMIE---TASILNQATNQSFVILDEIGRGTATL--DGLSIAWATIEYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 704 D~l~~g~STF~vEm~e---~~~IL~~at~~SLVllDElGrGTst~--DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l 778 (920) +.++.|. ||++=+.= .+..+..-++=.+.+|||- |.+. |+..-+..+++.+... +..+++.||-.+|.+. T Consensus 814 ~~LSGGE-~~~~sLalrLALs~~~~~~~~l~~l~LDEp---f~~LD~e~l~~l~~~l~~i~~~-~~qiiIISH~eel~e~ 888 (908) T COG0419 814 KTLSGGE-RFLASLALRLALSDLLQGRARLELLFLDEP---FGTLDEERLEKLAEILEELLSD-GRQIIIISHVEELKER 888 (908) T ss_pred CCCCCHH-HHHHHHHHHHHHHHHHHCCCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHC-CCEEEEEECHHHHHHH T ss_conf 2048618-999999999999999724888882334289---8878989999999999999845-9979999566999985 Q ss_pred HHH Q ss_conf 430 Q gi|254780750|r 779 SKS 781 (920) Q Consensus 779 ~~~ 781 (920) ++. T Consensus 889 ~~~ 891 (908) T COG0419 889 ADV 891 (908) T ss_pred HCC T ss_conf 075 No 298 >PTZ00243 ABC transporter; Provisional Probab=93.75 E-value=0.54 Score=25.86 Aligned_cols=31 Identities=23% Similarity=0.263 Sum_probs=16.2 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHH Q ss_conf 44434563587777664399996778440789999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLR 668 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lR 668 (920) .|=+|+++... ...+ .=|-|+-.+||||++- T Consensus 1324 ~VLk~Isf~I~---pGEK-VGIVGRTGSGKSSLi~ 1354 (1560) T PTZ00243 1324 LVLRGVSFRIA---PREK-VGIVGRTGSGKSTLLL 1354 (1560) T ss_pred CCCCCEEEEEC---CCCE-EEEECCCCCHHHHHHH T ss_conf 71057469988---9999-9998798743999999 No 299 >TIGR02324 CP_lyasePhnL phosphonate C-P lyase system protein PhnL; InterPro: IPR012701 Members of this family are the PhnL protein of C-P lyase systems for utilization of phosphonates. These systems resemble phosphonatase-based systems in having a three-component ABC transporter, where IPR005769 from INTERPRO is the permease, IPR005770 from INTERPRO is the phosphonates binding protein, and IPR012693 from INTERPRO is the ATP-binding cassette (ABC) protein. They differ, however, in having, typically, ten or more additional genes, many of which are believed to form a membrane-associated C-P lyase complex. This protein (PhnL) and the adjacent-encoded PhnK (IPR012700 from INTERPRO) resemble transporter ATP-binding proteins but are suggested, based on mutagenesis studies, to be part of this C-P lyase complex rather than part of a transporter per se .. Probab=93.67 E-value=0.059 Score=33.03 Aligned_cols=32 Identities=34% Similarity=0.587 Sum_probs=25.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHH Q ss_conf 443456358777766439999677844078999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqv 670 (920) |=++++|.- +.+.+..|.||=-+||||+||++ T Consensus 23 Vl~~v~l~V----~aGEcv~L~G~SGaGKSTlLk~l 54 (224) T TIGR02324 23 VLKNVSLTV----NAGECVALSGPSGAGKSTLLKSL 54 (224) T ss_pred ECCCCEEEE----ECCCEEEEECCCCCCHHHHHHHH T ss_conf 006743787----36735885368887678999976 No 300 >PRK13409 putative ATPase RIL; Provisional Probab=93.51 E-value=0.12 Score=30.65 Aligned_cols=16 Identities=31% Similarity=0.657 Sum_probs=9.2 Q ss_pred EEEEEECCCCCCCCCCHHH Q ss_conf 5676523766113853289 Q gi|254780750|r 696 LFSRVGSADNLASGRSTFM 714 (920) Q Consensus 696 IftRiGa~D~l~~g~STF~ 714 (920) +..=+|.+ -.|+|||+ T Consensus 367 iigIvG~N---GaGKTTLl 382 (590) T PRK13409 367 VIGIVGPN---GIGKTTFV 382 (590) T ss_pred EEEEECCC---CCCHHHHH T ss_conf 89998888---88789999 No 301 >pfam09818 ABC_ATPase Predicted ATPase of the ABC class. Members of this family include various bacterial predicted ABC class ATPases. Probab=93.50 E-value=0.23 Score=28.70 Aligned_cols=48 Identities=23% Similarity=0.263 Sum_probs=32.7 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCC Q ss_conf 99996778440789999999999999719853035320682210567652376 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSAD 704 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D 704 (920) +-+|+|.+--||||+|+.+..=|. -.||-+-=++-+.|.=-..|.|.| T Consensus 245 ITlIvGGGyHGKSTLL~Ale~GVY-----nHipGDGRE~VvT~~~avKiRAED 292 (447) T pfam09818 245 ITLIVGGGYHGKSTLLEALERGVY-----DHIPGDGREFVVTDPDAVKIRAED 292 (447) T ss_pred EEEEECCCCCCHHHHHHHHHHCCC-----CCCCCCCEEEEEECCCEEEEEECC T ss_conf 699978987767889999982777-----778899807999659658888458 No 302 >PRK09112 DNA polymerase III subunit delta'; Validated Probab=93.49 E-value=0.26 Score=28.22 Aligned_cols=119 Identities=22% Similarity=0.224 Sum_probs=57.4 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC-CCCCHHHCCCCCCCEEEEEE--ECCCCC----------CCCCCHHHH Q ss_conf 64399996778440789999999999999719-85303532068221056765--237661----------138532899 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG-SYVPASYAHIGIVDKLFSRV--GSADNL----------ASGRSTFMV 715 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG-~fVPA~~a~i~~~D~IftRi--Ga~D~l----------~~g~STF~v 715 (920) -.+.||+|||..-||+|+-+..|-.. +++-. +-.|+..+...+-..++.+| |++-|+ .....|. + T Consensus 44 l~HA~Lf~GP~GiGKaTlA~~~A~~L-l~~~~~~~~~~~~~~pd~~~~~~r~i~~g~hpdl~~i~r~~d~k~~~~~~~-I 121 (352) T PRK09112 44 LHHALLFEGPEGIGKATLAFHLANHI-LSHPDPNEAPETLADPDPASPLWRQIAQGAHPNLLHLTRPFDEKTGKFKTA-I 121 (352) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHH-HCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCHHHHHHHCC-C T ss_conf 65246535899808999999999998-669986668655678887877899997489999565534322021454335-7 Q ss_pred HHHHHHHHHHHC------CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECCCHH Q ss_conf 999999999958------9985699932588988056799999999999972-6984999748757 Q gi|254780750|r 716 EMIETASILNQA------TNQSFVILDEIGRGTATLDGLSIAWATIEYLHET-NRCRGLLATHFHE 774 (920) Q Consensus 716 Em~e~~~IL~~a------t~~SLVllDElGrGTst~DG~aiA~aile~l~~~-~~~~~lfaTHy~e 774 (920) -..++..+.+.. ...-.||+||.-.=|. .-|-|.++-|-+- .++..+++||+.+ T Consensus 122 ~vd~iR~l~~~~~~~~~~~~~kv~Iid~ad~m~~-----~aaNALLK~LEEPp~~~~fiLit~~~~ 182 (352) T PRK09112 122 TVDEIRRVTHFLSQTSGDGNWRIVIIDPADDMNR-----NAANAILKTLEEPPARALFILISHSSG 182 (352) T ss_pred CHHHHHHHHHHHCCCCCCCCEEEEEECCHHHHHH-----HHHHHHHHHHHCCCCCEEEEEEECCHH T ss_conf 7799999999845488668806999818787469-----999999998534898748998869977 No 303 >pfam01580 FtsK_SpoIIIE FtsK/SpoIIIE family. FtsK has extensive sequence similarity to wide variety of proteins from prokaryotes and plasmids, termed the FtsK/SpoIIIE family. This domain contains a putative ATP binding P-loop motif. It is found in the FtsK cell division protein from E. coli and the stage III sporulation protein E SpoIIIE which has roles in regulation of prespore specific gene expression in B. subtilis. A mutation in FtsK causes a temperature sensitive block in cell division and it is involved in peptidoglycan synthesis or modification. The SpoIIIE protein is implicated in intercellular chromosomal DNA transfer. Probab=93.42 E-value=0.22 Score=28.77 Aligned_cols=119 Identities=24% Similarity=0.249 Sum_probs=61.0 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHH----CCCCCC-HHHCCCCCCCEE---EEEEECCCCCCCCCCHH---HHHHHHHH Q ss_conf 99967784407899999999999997----198530-353206822105---67652376611385328---99999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQ----MGSYVP-ASYAHIGIVDKL---FSRVGSADNLASGRSTF---MVEMIETA 721 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQ----iG~fVP-A~~a~i~~~D~I---ftRiGa~D~l~~g~STF---~vEm~e~~ 721 (920) ++|.|.-.+|||++||++-...++.+ +-.|+- .+...+..|..+ ++.+ + ++...-...+ ..||.+=. T Consensus 41 ~Lv~G~tGsGKS~~l~~li~sl~~~~~p~~v~l~liD~K~~~~~~~~~~~h~~~~~-~-~d~e~~~~~l~~l~~em~rR~ 118 (202) T pfam01580 41 LLIAGATGSGKSTFLNTLILSLAARHSPEEVRLYLIDPKGGELAALEDLPHLLSAV-A-TDPEDALSALRALVAEMERRY 118 (202) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHCCCCCCCC-C-CCHHHHHHHHHHHHHHHHHHH T ss_conf 89965899980099999999998737962069999748961267676356544337-6-899999999999999999999 Q ss_pred HHHHHCC-------------------------CCCEEEEECCCCCC--CHHHHHHHHHHHHHHHHHHC---CCEEEEECC Q ss_conf 9999589-------------------------98569993258898--80567999999999999726---984999748 Q gi|254780750|r 722 SILNQAT-------------------------NQSFVILDEIGRGT--ATLDGLSIAWATIEYLHETN---RCRGLLATH 771 (920) Q Consensus 722 ~IL~~at-------------------------~~SLVllDElGrGT--st~DG~aiA~aile~l~~~~---~~~~lfaTH 771 (920) .+++.+. +.-+|++||+.-=. .+.|-..-+.+.+..|.... |...++||- T Consensus 119 ~ll~~~g~~~i~~~~~~~~~~~~~~~~~~~~~p~ivvviDE~~~l~~~~~~~~~~~~~~~l~~iar~GRa~GihlilatQ 198 (202) T pfam01580 119 ALLKQLGVRSIEEYNGEIAEDILGGAGWLEELPPIVVIVDERAELMLAAPKDSEMRVEGALARLARMGRAAGIHLLLATQ 198 (202) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEEHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCEEEEEEEC T ss_conf 99998388768999998664321245543347818986445999986555046899999999999988733829999818 Q ss_pred CH Q ss_conf 75 Q gi|254780750|r 772 FH 773 (920) Q Consensus 772 y~ 773 (920) ++ T Consensus 199 rP 200 (202) T pfam01580 199 RP 200 (202) T ss_pred CC T ss_conf 99 No 304 >COG1106 Predicted ATPases [General function prediction only] Probab=93.40 E-value=0.1 Score=31.35 Aligned_cols=50 Identities=24% Similarity=0.254 Sum_probs=31.1 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 856999325889880567999999999999726984999748757976643 Q gi|254780750|r 730 QSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 730 ~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) .-.+++||+-+|=-+.-=.-+..++.... .+...-..++||..++-++.- T Consensus 271 ~k~l~iDEie~~lHp~lm~~~l~~~~~~~-~~~niq~~~TTH~~e~id~~l 320 (371) T COG1106 271 DKVLLIDEIENGLHPSLMILILETLEDKV-KNNNIQVFLTTHSTEFIDLLL 320 (371) T ss_pred CCEEEEEHHHHCCCHHHHHHHHHHHHHHC-CCCEEEEEEECCCHHHHHHHH T ss_conf 74699503544208799999999987631-474378873100389999999 No 305 >TIGR01166 cbiO cobalt ABC transporter, ATP-binding protein; InterPro: IPR005876 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP binding subunit of the multisubunit cobalt transporter in bacteria and its equivalents in archaea. This superfamily includes two groups, one which catalyses the uptake of small molecules, including ions from the external milieu and the other group which is engaged in the efflux of small molecular weight compounds and ions from within the cell. Energy derived from the hydrolysis of ATP drives both the process of uptake and efflux.; GO: 0006824 cobalt ion transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=93.38 E-value=0.063 Score=32.83 Aligned_cols=114 Identities=31% Similarity=0.360 Sum_probs=62.9 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCC--------------------C-----CCCEEEE-EEE Q ss_conf 6643999967784407899999999999997198530353206--------------------8-----2210567-652 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHI--------------------G-----IVDKLFS-RVG 701 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i--------------------~-----~~D~Ift-RiG 701 (920) ..+++.-|=|||-+||||+|+-. -|.-=|-+-+.+ . |=|+||+ +|. T Consensus 16 ~~G~~~aLlG~NGaGKsTLl~~L--------nG~LrP~~G~v~~dG~~l~YsrkgL~~~R~~V~~VfQdPDDQlF~a~V~ 87 (190) T TIGR01166 16 ERGEVLALLGANGAGKSTLLLHL--------NGLLRPQSGKVLLDGEPLDYSRKGLLEVRQRVGLVFQDPDDQLFAADVD 87 (190) T ss_pred CCCCEEEEECCCCCCHHHHHHHH--------CCCCCCCCCEEEECCEECCCCCHHHHHHCCEEEEEEECHHHHCCCCCCC T ss_conf 05716898728998578998874--------3677797555876785403572446752503003762634420267622 Q ss_pred CCCCCCC-----CCCHHHHH-----------H----------------HHH--HHHHHHCCCCCEEEEECCCCCCCHHHH Q ss_conf 3766113-----85328999-----------9----------------999--999995899856999325889880567 Q gi|254780750|r 702 SADNLAS-----GRSTFMVE-----------M----------------IET--ASILNQATNQSFVILDEIGRGTATLDG 747 (920) Q Consensus 702 a~D~l~~-----g~STF~vE-----------m----------------~e~--~~IL~~at~~SLVllDElGrGTst~DG 747 (920) +| ++= |.|-=-|| | ..+ |-++ |=+---.||||--.|=++ +| T Consensus 88 -~D-VaFGPlNLGL~e~Ev~~RV~eAL~~vg~~~~~~rp~h~LS~GekkRvAIAGAv--AM~Pd~l~LDEPTAGLDp-~G 162 (190) T TIGR01166 88 -QD-VAFGPLNLGLSEAEVERRVREALTAVGISGLEERPTHLLSGGEKKRVAIAGAV--AMRPDVLLLDEPTAGLDP-AG 162 (190) T ss_pred -CC-CCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHCCHHHCCCCCHHHHHHHHHH--HHCCCEEEEECCCCCCCH-HH T ss_conf -10-03354567337157678789999860632244122411558613577777588--616634664278889787-47 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 9999999999997269849997487579 Q gi|254780750|r 748 LSIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 748 ~aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) ..=-.++++.|.+. |--++||||.=+| T Consensus 163 ~~q~~~~l~~L~~~-G~tvv~STHDvdL 189 (190) T TIGR01166 163 AEQLLAILRRLRAE-GTTVVISTHDVDL 189 (190) T ss_pred HHHHHHHHHHHHHC-CCEEEEEECCCCC T ss_conf 99999998878723-9989997253201 No 306 >PRK10246 exonuclease subunit SbcC; Provisional Probab=93.35 E-value=0.097 Score=31.43 Aligned_cols=95 Identities=21% Similarity=0.368 Sum_probs=54.6 Q ss_pred CCCCC----HHHCCCCCCCEEEEE-EECCCCCCCCCCHHHHHHH---HHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 98530----353206822105676-5237661138532899999---999999958998569993258898805679999 Q gi|254780750|r 680 GSYVP----ASYAHIGIVDKLFSR-VGSADNLASGRSTFMVEMI---ETASILNQATNQSFVILDEIGRGTATLDGLSIA 751 (920) Q Consensus 680 G~fVP----A~~a~i~~~D~IftR-iGa~D~l~~g~STF~vEm~---e~~~IL~~at~~SLVllDElGrGTst~DG~aiA 751 (920) |-|.- -+...|-|+|.--+. +-....|+.|. ||++=+. -+|.++..-..=-...||| |-||=-.|++-.| T Consensus 919 gry~l~r~~~~gL~l~V~D~~~g~~~R~v~TLSGGE-tF~aSLALALgLsdvvq~~~~ldtlFIDE-GFGSLD~e~Ld~a 996 (1047) T PRK10246 919 GRYLLQRKASEALELEVVDTWQADAVRDTRTLSGGE-SFLVSLALALALSDLVSHKTRIDSLFLDE-GFGTLDSETLDTA 996 (1047) T ss_pred CCEEEEECCCCCCEEEEEECCCCCCEECCCCCCHHH-HHHHHHHHHHHHHHHHHCCCCCCCEEECC-CCCCCCHHHHHHH T ss_conf 872687617999827998857899746897887259-99999999999999984899888465679-9777698899999 Q ss_pred HHHHHHHHHHCCCEEEEECCCHHHHH Q ss_conf 99999999726984999748757976 Q gi|254780750|r 752 WATIEYLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 752 ~aile~l~~~~~~~~lfaTHy~eL~~ 777 (920) ..+|+.|... +-.+-+.||-.+|.+ T Consensus 997 ~~aL~~L~~~-gR~VGIISHV~ELke 1021 (1047) T PRK10246 997 LDALDALNAS-GKTIGVISHVEAMKE 1021 (1047) T ss_pred HHHHHHHHHC-CCEEEEECCHHHHHH T ss_conf 9999999968-898999888788998 No 307 >cd01394 radB RadB. The archaeal protein radB shares similarity radA, the archaeal functional homologue to the bacterial RecA. The precise function of radB is unclear. Probab=93.26 E-value=0.64 Score=25.30 Aligned_cols=119 Identities=15% Similarity=0.156 Sum_probs=59.6 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC---CCCCHHHCCCCCCCEEE----EEEECCCCCC-CCCCHHHHHHHHH Q ss_conf 64399996778440789999999999999719---85303532068221056----7652376611-3853289999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG---SYVPASYAHIGIVDKLF----SRVGSADNLA-SGRSTFMVEMIET 720 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG---~fVPA~~a~i~~~D~If----tRiGa~D~l~-~g~STF~vEm~e~ 720 (920) .+++.+|.||--.||||++=|+|..+. .-| .||-.+.+.-.-+.+|. -++ .|++. .....| .|+... T Consensus 18 ~G~it~i~G~pG~GKStl~lq~a~~~~--~~g~~v~YidtE~~~~er~~qi~~~~~~~~--~~~i~v~~~~~~-~~~~~~ 92 (218) T cd01394 18 RGTVTQVYGPPGTGKTNIAIQLAVETA--GQGKKVAYIDTEGLSSERFRQIAGDRPERA--ASSIIVFEPMDF-NEQGRA 92 (218) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHH--HCCCEEEEEECCCCCCHHHHHHHHHHHHHH--HCCCEEECCCCH-HHHHHH T ss_conf 887999989999849999999999986--369869999665567699999987536665--305146267876-889999 Q ss_pred HHHHHH--CCCCCEEEEECCC---------CCCCHHHHHHHHHH--HHHHHHHHCCCEEEEECCC Q ss_conf 999995--8998569993258---------89880567999999--9999997269849997487 Q gi|254780750|r 721 ASILNQ--ATNQSFVILDEIG---------RGTATLDGLSIAWA--TIEYLHETNRCRGLLATHF 772 (920) Q Consensus 721 ~~IL~~--at~~SLVllDElG---------rGTst~DG~aiA~a--ile~l~~~~~~~~lfaTHy 772 (920) -..+.. ...-+||++|=+- +..+...-..++.+ .+..+.++.++-++++-|- T Consensus 93 i~~~~~~~~~~~~lvViDSi~tl~~~e~~~~~~~~~~~r~l~~~~~~L~~~Ak~~~~~vil~nqV 157 (218) T cd01394 93 IQETETFADEKVDLVVVDSATALYRLELGDDDTTIKNYRELAKQLTFLLWLARKHDVAVVITNQV 157 (218) T ss_pred HHHHHHHHHCCCCEEEEECCHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEE T ss_conf 99999764147729999140455455406896479999999999999999987669889999215 No 308 >COG1126 GlnQ ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism] Probab=93.24 E-value=0.11 Score=31.08 Aligned_cols=130 Identities=25% Similarity=0.283 Sum_probs=68.4 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCC------------HHHCCCCCCC---EEEE Q ss_conf 444345635877776643999967784407899999999999997198530------------3532068221---0567 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVP------------ASYAHIGIVD---KLFS 698 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVP------------A~~a~i~~~D---~Ift 698 (920) -|=.+|++.. ..+.+..|-||--|||||+||.+..+--.- -|.-.- +-.-++|.|= .+|- T Consensus 16 ~VLkgi~l~v----~~Gevv~iiGpSGSGKSTlLRclN~LE~~~-~G~I~i~g~~~~~~~~~~~~R~~vGmVFQ~fnLFP 90 (240) T COG1126 16 EVLKGISLSV----EKGEVVVIIGPSGSGKSTLLRCLNGLEEPD-SGSITVDGEDVGDKKDILKLRRKVGMVFQQFNLFP 90 (240) T ss_pred EEECCCCEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCC-CCEEEECCEECCCHHHHHHHHHHCCEECCCCCCCC T ss_conf 7764851167----389789998999998889999997786887-86499998722545469999985576624665465 Q ss_pred EEECCCCCCC------CCCHHHHHHHHHH----------------------------HHHH-HCCCCCEEEEECCCCCCC Q ss_conf 6523766113------8532899999999----------------------------9999-589985699932588988 Q gi|254780750|r 699 RVGSADNLAS------GRSTFMVEMIETA----------------------------SILN-QATNQSFVILDEIGRGTA 743 (920) Q Consensus 699 RiGa~D~l~~------g~STF~vEm~e~~----------------------------~IL~-~at~~SLVllDElGrGTs 743 (920) .+-+-||+.- |.|.= |-.+.| .|-| -|-.--+.|+||. || T Consensus 91 H~TvleNv~lap~~v~~~~k~--eA~~~A~~lL~~VGL~dka~~yP~qLSGGQqQRVAIARALaM~P~vmLFDEP---TS 165 (240) T COG1126 91 HLTVLENVTLAPVKVKKLSKA--EAREKALELLEKVGLADKADAYPAQLSGGQQQRVAIARALAMDPKVMLFDEP---TS 165 (240) T ss_pred CCHHHHHHHHHHHHHCCCCHH--HHHHHHHHHHHHCCCHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCEEEECCC---CC T ss_conf 532988877753997298999--9999999999986955666539510480788999999987179988863697---54 Q ss_pred HHHHHHH--HHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 0567999--999999999726984999748757 Q gi|254780750|r 744 TLDGLSI--AWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 744 t~DG~ai--A~aile~l~~~~~~~~lfaTHy~e 774 (920) ..|---+ .-.++..|.+. +-..+..||--. T Consensus 166 ALDPElv~EVL~vm~~LA~e-GmTMivVTHEM~ 197 (240) T COG1126 166 ALDPELVGEVLDVMKDLAEE-GMTMIIVTHEMG 197 (240) T ss_pred CCCHHHHHHHHHHHHHHHHC-CCEEEEEECHHH T ss_conf 37988999999999999976-986999950367 No 309 >PRK10522 multidrug transporter membrane component/ATP-binding component; Provisional Probab=93.23 E-value=0.56 Score=25.74 Aligned_cols=33 Identities=33% Similarity=0.428 Sum_probs=24.9 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHH Q ss_conf 4443456358777766439999677844078999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqv 670 (920) ++=+|+++.. ..+...-|.||+.+||||+++-+ T Consensus 337 ~~L~~isl~i----~~Ge~vaiVG~SGsGKSTL~~LL 369 (547) T PRK10522 337 FSVGPINLTI----KRGELLFLIGGNGSGKSTLAMLL 369 (547) T ss_pred CCCCCEEEEE----CCCCEEEEECCCCCCHHHHHHHH T ss_conf 7207804798----59988999899999779999998 No 310 >TIGR03015 pepcterm_ATPase putative secretion ATPase, PEP-CTERM locus subfamily. Members of this protein are marked as probable ATPases by the nucleotide binding P-loop motif GXXGXGKTT, a motif DEAQ similar to the DEAD/H box of helicases, and extensive homology to ATPases of MSHA-type pilus systems and to GspA proteins associated with type II protein secretion systems. Probab=93.22 E-value=0.27 Score=28.06 Aligned_cols=83 Identities=17% Similarity=0.304 Sum_probs=41.5 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCC-C-CCCHH--HCCCC---CCCEEEEEEECCCCCCCCCCHHHHHHHHHHH Q ss_conf 4399996778440789999999999999719-8-53035--32068---2210567652376611385328999999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMG-S-YVPAS--YAHIG---IVDKLFSRVGSADNLASGRSTFMVEMIETAS 722 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG-~-fVPA~--~a~i~---~~D~IftRiGa~D~l~~g~STF~vEm~e~~~ 722 (920) ..+.+|||+-.+||||++|.. +.+.. . ++++- ...++ ++..|....|...+ ..+.....-.+.+.-. T Consensus 43 ~g~~lltGe~GtGKTtllr~l-----~~~l~~~~~~~~~i~~~~l~~~~ll~~i~~~lg~~~~-~~~~~~~~~~l~~~L~ 116 (269) T TIGR03015 43 EGFILITGEVGAGKTTLIRNL-----LKRLDQERVVAAKLVNTRVDAEDLLRMVAADFGLETE-GRDKAALLRELEDFLI 116 (269) T ss_pred CCEEEEECCCCCCHHHHHHHH-----HHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCC-CCCHHHHHHHHHHHHH T ss_conf 965999729989889999999-----9845934548999769999999999999998598988-9899999999999999 Q ss_pred HHHHCCCCCEEEEECC Q ss_conf 9995899856999325 Q gi|254780750|r 723 ILNQATNQSFVILDEI 738 (920) Q Consensus 723 IL~~at~~SLVllDEl 738 (920) -+.....+-++|+||- T Consensus 117 ~~~~~g~~~vliIDEA 132 (269) T TIGR03015 117 EQFAAGKRALLVVDEA 132 (269) T ss_pred HHHHCCCCEEEEEECH T ss_conf 9996699469997242 No 311 >pfam03796 DnaB_C DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=93.18 E-value=0.59 Score=25.56 Aligned_cols=116 Identities=17% Similarity=0.170 Sum_probs=54.3 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC---CCEEE---EEEECCCCCCCCCCHHHHHHHHHHH Q ss_conf 64399996778440789999999999999719853035320682---21056---7652376611385328999999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI---VDKLF---SRVGSADNLASGRSTFMVEMIETAS 722 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~---~D~If---tRiGa~D~l~~g~STF~vEm~e~~~ 722 (920) .+.+.+|.|+-..|||+++-++|....+.| |-.|--=+.+++. ..|++ +++- .+++..|.-+ ..|+..... T Consensus 18 ~G~l~vi~g~pg~GKS~~~~~~a~~~a~~~-g~~Vl~~slEm~~~~~~~R~~a~~~~v~-~~~i~~~~~~-~~~~~~~~~ 94 (186) T pfam03796 18 KGDLIIIAARPSMGKTAFALNIARNAALKQ-DKPVLFFSLEMSAEQLAERLLSSESRIS-SSKLRSGQLS-DEDWERLAE 94 (186) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHHHHHCCC-HHHHHCCCHH-HHHHHHHHH T ss_conf 881799996799987999999999999970-9966875475529999999999862676-5554125121-679999999 Q ss_pred HHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 999589985699932588988056799999999999972698499974875 Q gi|254780750|r 723 ILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 723 IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) ........-+.+-|. + +++..+- .+.++++....++..+|.=|.+ T Consensus 95 ~~~~~~~~~l~i~~~-~-~~t~~~i----~~~i~~~~~~~~~~~vvvDyl~ 139 (186) T pfam03796 95 AAGELSEAPLYIDDT-P-GLSLSEL----RAQARRLKREHGLGLIVIDYLQ 139 (186) T ss_pred HHHHHHCCCEEEECC-C-CCCHHHH----HHHHHHHHHHCCCCEEEEEHHH T ss_conf 999985398688479-9-9989999----9999999985599889974898 No 312 >TIGR02169 SMC_prok_A chromosome segregation protein SMC; InterPro: IPR011891 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by IPR011890 from INTERPRO. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent .. Probab=93.06 E-value=0.04 Score=34.27 Aligned_cols=21 Identities=0% Similarity=0.059 Sum_probs=12.9 Q ss_pred CEEEEE--ECCHHHHHHHHHHCC Q ss_conf 559996--058789999985229 Q gi|254780750|r 173 GIFKIS--TSNHDRLISDIMRID 193 (920) Q Consensus 173 G~~~~~--~~~~d~l~~~L~~l~ 193 (920) .-|++. .++..++.+.|+++. T Consensus 131 SyY~lNG~~~~l~ei~d~L~~~g 153 (1202) T TIGR02169 131 SYYYLNGKSVRLSEIHDFLAAAG 153 (1202) T ss_pred EEEEECCCCCCHHHHHHHHHHCC T ss_conf 88887082035766899998617 No 313 >cd03276 ABC_SMC6_euk Eukaryotic SMC6 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (for Probab=92.99 E-value=0.1 Score=31.19 Aligned_cols=24 Identities=25% Similarity=0.329 Sum_probs=20.1 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 439999677844078999999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~ 673 (920) .++-+|.|||-+|||+.|-.+++. T Consensus 21 p~lN~IiG~NGsGKSsIl~AI~lg 44 (198) T cd03276 21 PRVNFIVGNNGSGKSAILTALTIG 44 (198) T ss_pred CCEEEEECCCCCCHHHHHHHHHHH T ss_conf 982899889999889999999986 No 314 >PRK13341 recombination factor protein RarA/unknown domain fusion protein; Reviewed Probab=92.94 E-value=0.55 Score=25.77 Aligned_cols=23 Identities=26% Similarity=0.468 Sum_probs=14.2 Q ss_pred CHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 7789999982999899999999999 Q gi|254780750|r 813 SYGIQVGKLAGLPNTVISRAYDILK 837 (920) Q Consensus 813 Sygi~vA~laG~p~~vi~~A~~~~~ 837 (920) .|.-.+++ .+++..|.+=++.++ T Consensus 682 ~Yr~~l~~--~l~~~~i~~L~~~l~ 704 (726) T PRK13341 682 DYRQAVGT--NLEEEVLCNLDELLT 704 (726) T ss_pred HHHHHHHC--CCCHHHHHHHHHHHH T ss_conf 49999853--299999999999999 No 315 >COG2884 FtsE Predicted ATPase involved in cell division [Cell division and chromosome partitioning] Probab=92.90 E-value=0.12 Score=30.79 Aligned_cols=47 Identities=28% Similarity=0.415 Sum_probs=32.3 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH--CCCEEEEECCCHHHHHH Q ss_conf 9985699932588988056799999999999972--69849997487579766 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHET--NRCRGLLATHFHELTDL 778 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~--~~~~~lfaTHy~eL~~l 778 (920) ..-.+.|-||- |...| -.+||-|+.-|.+. .|..++.|||..+|.+- T Consensus 154 ~~P~vLlADEP---TGNLD-p~~s~~im~lfeeinr~GtTVl~ATHd~~lv~~ 202 (223) T COG2884 154 NQPAVLLADEP---TGNLD-PDLSWEIMRLFEEINRLGTTVLMATHDLELVNR 202 (223) T ss_pred CCCCEEEECCC---CCCCC-HHHHHHHHHHHHHHHHCCCEEEEEECHHHHHHH T ss_conf 69876860488---78888-588999999999986448579997160999975 No 316 >PRK11174 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=92.80 E-value=0.74 Score=24.83 Aligned_cols=33 Identities=24% Similarity=0.402 Sum_probs=24.6 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHH Q ss_conf 4443456358777766439999677844078999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqv 670 (920) .|-+|+++.. ..+...-|.||+.+||||+++-+ T Consensus 364 ~vL~~is~~i----~~Ge~vaIVG~SGsGKSTl~~LL 396 (588) T PRK11174 364 TLAGPLNFTL----PAGQRVALVGPSGAGKTSLLNAL 396 (588) T ss_pred EECCCEEEEE----CCCCEEEEECCCCCCHHHHHHHH T ss_conf 5103646997----49978999899986499999999 No 317 >TIGR00972 3a0107s01c2 phosphate ABC transporter, ATP-binding protein; InterPro: IPR005670 This is a family of phosphate transport system permease proteins.; GO: 0005315 inorganic phosphate transmembrane transporter activity, 0015114 phosphate transmembrane transporter activity, 0006817 phosphate transport, 0016020 membrane. Probab=92.76 E-value=0.11 Score=30.97 Aligned_cols=121 Identities=31% Similarity=0.414 Sum_probs=64.6 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCC--C-----CCCCEEE------EEEE Q ss_conf 4434563587777664399996778440789999999999999719853035320--6-----8221056------7652 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAH--I-----GIVDKLF------SRVG 701 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~--i-----~~~D~If------tRiG 701 (920) +=.||+|.. ....+--|=||=-+|||||||++=-. .=-||--..+ + -|+|.-+ .||| T Consensus 16 AL~~i~~~I----~~n~vTAlIGPSGCGKSTlLR~lNRM------nDl~~~~r~~G~v~f~G~dIy~~~~D~~~LR~~vG 85 (248) T TIGR00972 16 ALKNINLDI----PKNQVTALIGPSGCGKSTLLRSLNRM------NDLVPGVRIEGKVLFDGQDIYDKKIDVVELRKRVG 85 (248) T ss_pred EEECCCCEE----CCCEEEEEECCCCCCHHHHHHHHHHH------HHHCCCEEEEEEEEECCCCCCCCCCCHHHHHHHCC T ss_conf 862156200----37705898778898678999998877------64078816888898645114565668788762258 Q ss_pred C------------CCCCCCCCCHH-------HHHHHHHHHHHHHC------------CCCC------------------- Q ss_conf 3------------76611385328-------99999999999958------------9985------------------- Q gi|254780750|r 702 S------------ADNLASGRSTF-------MVEMIETASILNQA------------TNQS------------------- 731 (920) Q Consensus 702 a------------~D~l~~g~STF-------~vEm~e~~~IL~~a------------t~~S------------------- 731 (920) - -|||+=|.=+. +-|..|.| |+.| ..-| T Consensus 86 MVFQ~PNPFpmSIydNiayG~r~~G~~~K~~L~e~Ve~s--L~~AALWDEVKD~L~~sa~~LSGGQQQRLCIARalA~eP 163 (248) T TIGR00972 86 MVFQKPNPFPMSIYDNIAYGPRLHGIKDKKELDEIVEES--LKKAALWDEVKDRLHDSALGLSGGQQQRLCIARALAVEP 163 (248) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHH--HHHCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 521478978840556754524521633778999999999--861687135524213588978726889999998752488 Q ss_pred -EEEEECCCCCCCHHHHHHHHHHHHHHHHHHC-CCEE-EEECCC Q ss_conf -6999325889880567999999999999726-9849-997487 Q gi|254780750|r 732 -FVILDEIGRGTATLDGLSIAWATIEYLHETN-RCRG-LLATHF 772 (920) Q Consensus 732 -LVllDElGrGTst~DG~aiA~aile~l~~~~-~~~~-lfaTHy 772 (920) -+||||- ||..|=+| -+-+|.|+... +.+| +..||= T Consensus 164 eVlLlDEP---TSALDPIa--T~~IEeLi~eLk~~YTivIVTHn 202 (248) T TIGR00972 164 EVLLLDEP---TSALDPIA--TGKIEELIQELKKKYTIVIVTHN 202 (248) T ss_pred CEECCCCC---CCCCCHHH--HHHHHHHHHHHHHCCEEEEEECC T ss_conf 52105788---87578778--99999999987652979988177 No 318 >COG4987 CydC ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=92.68 E-value=0.61 Score=25.47 Aligned_cols=45 Identities=24% Similarity=0.318 Sum_probs=26.7 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCC-CEEEEECCCHHHHHHH Q ss_conf 98569993258898805679999999999997269-8499974875797664 Q gi|254780750|r 729 NQSFVILDEIGRGTATLDGLSIAWATIEYLHETNR-CRGLLATHFHELTDLS 779 (920) Q Consensus 729 ~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~-~~~lfaTHy~eL~~l~ 779 (920) .-+++|+||---|=++.- ...+++-|.++.+ -..||+||- |+.++ T Consensus 492 dapl~lLDEPTegLD~~T----E~~vL~ll~~~~~~kTll~vTHr--L~~le 537 (573) T COG4987 492 DAPLWLLDEPTEGLDPIT----ERQVLALLFEHAEGKTLLMVTHR--LRGLE 537 (573) T ss_pred CCCEEEECCCCCCCCHHH----HHHHHHHHHHHHCCCEEEEEECC--CCCHH T ss_conf 798688448866678643----99999999998459859999645--42176 No 319 >PRK00064 recF recombination protein F; Reviewed Probab=92.68 E-value=0.13 Score=30.46 Aligned_cols=59 Identities=10% Similarity=0.079 Sum_probs=34.0 Q ss_pred HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEE Q ss_conf 589985699932588988056799999999999972698499974875797664306885899999 Q gi|254780750|r 726 QATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQ 791 (920) Q Consensus 726 ~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~ 791 (920) ......++|||++...=+..= . .+++++|.+. ++=++++|=-.+ .+.......+++||+ T Consensus 297 ~~~~~PIlLlDDv~sELD~~r---~-~~ll~~l~~~-~~QvfiT~td~~--~~~~~~~~~~~FhVe 355 (355) T PRK00064 297 ETGEAPILLLDDVASELDDGR---R-AALLETLIAL-GAQVFITATDLE--GFDALWENAKIFHVE 355 (355) T ss_pred HHCCCCEEEEECCHHCCCHHH---H-HHHHHHHHHC-CCEEEEECCCHH--HHHHHHCCCCEEEEC T ss_conf 549997899948300169999---9-9999998747-986999757878--876554348489969 No 320 >COG1193 Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair] Probab=92.67 E-value=0.0024 Score=43.48 Aligned_cols=119 Identities=26% Similarity=0.355 Sum_probs=78.0 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHH-HHHHHHHCC Q ss_conf 4399996778440789999999999999719853035320682210567652376611385328999999-999999589 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIE-TASILNQAT 728 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e-~~~IL~~at 728 (920) ..+.+++|-|-+||+++++...-++.+|++|..||++.|+..+.+-|+ .+.. ..+..+...|-.-+.+ ....+..+. T Consensus 532 ~e~~~l~~~~s~~~~~l~e~~~~~~~~a~~~~~~~~~~a~~e~~~~~~-~~~~-~~~~~~~~~~e~~~~~~~~~~~~~~~ 609 (753) T COG1193 532 DEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEAVDEVK-FLKK-KKGILDAGAFESTLKDKKNKVLPEAK 609 (753) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 999999776433326799999999999972138999999988999999-8646-47778898899874542134586334 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHH--HHHHHHHHHCCCEEEEECCC Q ss_conf 985699932588988056799999--99999997269849997487 Q gi|254780750|r 729 NQSFVILDEIGRGTATLDGLSIAW--ATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 729 ~~SLVllDElGrGTst~DG~aiA~--aile~l~~~~~~~~lfaTHy 772 (920) .+.|++.||+--=| ..-|.++-. ...+++++. +...++.+|- T Consensus 610 ~~~l~~gDev~~~t-~e~G~v~~i~a~~~e~~v~~-g~~kv~V~~~ 653 (753) T COG1193 610 KRKLKLGDEVEVIT-GEPGAVVKIIAGILEALVQS-GILKVIVSHL 653 (753) T ss_pred CCCCEECCEEEEEC-CCCCCEEEEECCCCEEEEEC-CEEEEEEEHH T ss_conf 55751345357605-88510356532676468760-4069997526 No 321 >PRK09361 radB DNA repair and recombination protein RadB; Provisional Probab=92.59 E-value=0.79 Score=24.63 Aligned_cols=121 Identities=13% Similarity=0.132 Sum_probs=59.8 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHH-CCCCCCHHHCCCCCCCEEEEEE-E-CCCCCCC-CCCHHHHH---HHHHH Q ss_conf 643999967784407899999999999997-1985303532068221056765-2-3766113-85328999---99999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-MGSYVPASYAHIGIVDKLFSRV-G-SADNLAS-GRSTFMVE---MIETA 721 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQ-iG~fVPA~~a~i~~~D~IftRi-G-a~D~l~~-g~STF~vE---m~e~~ 721 (920) .+++..|+||--+||||+.=|++..+.... --.||-++.+...-+..|..+- . ..+++.- .-++|.-+ +.... T Consensus 22 ~G~itei~G~pG~GKTtl~lq~a~~~~~~g~~vlYidtE~~~~er~~qi~~~~~~~~~~~i~~~~~~~~~~~~~~i~~~~ 101 (224) T PRK09361 22 RGTITQIYGPPGSGKTNICIQLAVEAARQGKKVIYIDTEGLSPERFKQIAGEDFEELLSNIIIFEPSSFEEQREAIQKAE 101 (224) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHCCCHHHHHHCCEEECCCCHHHHHHHHHHHH T ss_conf 88799998999985999999999999974990999678767889999985657345420614724798899999999999 Q ss_pred HHHHHCCCCCEEEEECCCC--------CCCHHHHH-HHHH--HHHHHHHHHCCCEEEEECCC Q ss_conf 9999589985699932588--------98805679-9999--99999997269849997487 Q gi|254780750|r 722 SILNQATNQSFVILDEIGR--------GTATLDGL-SIAW--ATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 722 ~IL~~at~~SLVllDElGr--------GTst~DG~-aiA~--aile~l~~~~~~~~lfaTHy 772 (920) .+.+ ..-.||++|=+-- +..+ ... .++. ..|..+.++.++-++|+-|- T Consensus 102 ~~~~--~~~~lvVIDSi~~~~~~e~~~~~~~-~~~~~l~~~~~~L~~~a~~~~~~vvl~nqV 160 (224) T PRK09361 102 KIAK--ENVGLIVLDSATSLYRLELDDNDNS-KLNRELGKQISHLLSLARKHNIAVVITNQV 160 (224) T ss_pred HHHH--CCCCEEEEECCHHHEEHHCCCCCCH-HHHHHHHHHHHHHHHHHHHCCCEEEEEEEE T ss_conf 8750--5873899962301000001457658-999999999999999999719869999668 No 322 >pfam00910 RNA_helicase RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses. Probab=92.57 E-value=0.54 Score=25.85 Aligned_cols=64 Identities=30% Similarity=0.388 Sum_probs=44.2 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 99677844078999999999999971985303532068221056765237661138532899999999999958998569 Q gi|254780750|r 654 LLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSFV 733 (920) Q Consensus 654 iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SLV 733 (920) .|.||.-.|||++.+.+|-.+.=+ ...+.-+++|+|- ..|+-..|. +..--| T Consensus 2 ~l~G~~G~GKS~~a~~la~~~~~~----------~~~~~~~~~Y~~~-~~~~~wdgY-----------------~gq~vv 53 (105) T pfam00910 2 WLYGPPGCGKSTLAKYLARALLDH----------LGLPKKDSVYSRN-PDDDFWDGY-----------------TGQPVV 53 (105) T ss_pred EEECCCCCCHHHHHHHHHHHHHHH----------HCCCCCCCEEECC-CCCCCCCCC-----------------CCCEEE T ss_conf 897999898899999999999998----------3778789779678-877656788-----------------998579 Q ss_pred EEECCCCCCCHH Q ss_conf 993258898805 Q gi|254780750|r 734 ILDEIGRGTATL 745 (920) Q Consensus 734 llDElGrGTst~ 745 (920) ++||+|+-.+.. T Consensus 54 i~DD~~~~~~~~ 65 (105) T pfam00910 54 IIDDFGQNPDGP 65 (105) T ss_pred EEECCCCCCCCC T ss_conf 996577788862 No 323 >PRK11664 ATP-dependent RNA helicase HrpB; Provisional Probab=92.54 E-value=0.32 Score=27.53 Aligned_cols=27 Identities=15% Similarity=0.236 Sum_probs=11.9 Q ss_pred CCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 265617669999999998898599995 Q gi|254780750|r 83 MCGVPVHTANHYIQKLIKIGHRIAICE 109 (920) Q Consensus 83 maGvP~~~l~~yl~~Lv~~GykVaive 109 (920) |...|.+++..-+..-++.+-.|+|+- T Consensus 1 m~~LPi~~~~~~i~~~l~~~~~~vl~a 27 (812) T PRK11664 1 MSSLPVAAVLPELLTALKTAPQVLLKA 27 (812) T ss_pred CCCCCHHHHHHHHHHHHHHCCEEEEEE T ss_conf 998984789999999999799799990 No 324 >pfam05496 RuvB_N Holliday junction DNA helicase ruvB N-terminus. The RuvB protein makes up part of the RuvABC revolvasome which catalyses the resolution of Holliday junctions that arise during genetic recombination and DNA repair. Branch migration is catalysed by the RuvB protein that is targeted to the Holliday junction by the structure specific RuvA protein. This family contains the N-terminal region of the protein. Probab=92.51 E-value=0.56 Score=25.75 Aligned_cols=62 Identities=27% Similarity=0.257 Sum_probs=41.3 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCE Q ss_conf 99967784407899999999999997198530353206822105676523766113853289999999999995899856 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSF 732 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SL 732 (920) +|++||-..||||+-|-+|-. ++|-.-+ ..+ . -.-+..+...++.+...+.+ T Consensus 53 ~lf~GPPG~GKTTlAriiAk~-----~~~~~~~--------------~s~-~--------~i~~~~di~~~l~~~~~~~I 104 (234) T pfam05496 53 VLLYGPPGLGKTTLANIIANE-----MGVNIRI--------------TSG-P--------ALEKPGDLAAILTNLEPGDV 104 (234) T ss_pred EEEECCCCCCHHHHHHHHHHH-----HCCCEEE--------------ECC-H--------HHHHHHHHHHHHHHCCCCCE T ss_conf 788789999888999999984-----0875376--------------142-6--------66438999999984589988 Q ss_pred EEEECCCCCC Q ss_conf 9993258898 Q gi|254780750|r 733 VILDEIGRGT 742 (920) Q Consensus 733 VllDElGrGT 742 (920) +.+||+-|=+ T Consensus 105 LFIDEIHr~n 114 (234) T pfam05496 105 LFIDEIHRLN 114 (234) T ss_pred EEEECHHHCC T ss_conf 9996654358 No 325 >COG3638 ABC-type phosphate/phosphonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=92.46 E-value=0.24 Score=28.45 Aligned_cols=35 Identities=34% Similarity=0.468 Sum_probs=26.5 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 444345635877776643999967784407899999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval 672 (920) ..=.||++.. ..+.+..|-||.-+||||+||.+.= T Consensus 18 ~aL~~Vnl~I----~~GE~VaiIG~SGaGKSTLLR~lng 52 (258) T COG3638 18 QALKDVNLEI----NQGEMVAIIGPSGAGKSTLLRSLNG 52 (258) T ss_pred EEEEEEEEEE----CCCCEEEEECCCCCCHHHHHHHHHC T ss_conf 6663576775----7986899987888868999999866 No 326 >TIGR02315 ABC_phnC phosphonate ABC transporter, ATP-binding protein; InterPro: IPR012693 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Phosphonates are a class of phosphorus-containing organic compound with a stable direct C-P bond rather than a C-O-P linkage. A number of bacterial species have operons, typically about 14 genes in size, with genes for ATP-dependent transport of phosphonates, degradation, and regulation of the expression of the system. Members of this protein family are the ATP-binding cassette component of tripartite ABC transporters of phosphonates.; GO: 0005524 ATP binding, 0015416 phosphonate transmembrane-transporting ATPase activity, 0015716 phosphonate transport, 0016020 membrane. Probab=92.46 E-value=0.13 Score=30.58 Aligned_cols=30 Identities=37% Similarity=0.552 Sum_probs=21.5 Q ss_pred EEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4563587777664399996778440789999999 Q gi|254780750|r 638 DCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 638 di~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) ||+|.. ..+-|..|=||=-+||||+||++= T Consensus 20 ~inl~i----~~GE~~~~IG~SGAGKSTLLR~iN 49 (253) T TIGR02315 20 NINLEI----NPGEFVAVIGPSGAGKSTLLRCIN 49 (253) T ss_pred EECCEE----ECCEEEEEECCCCCCHHHHHHHHH T ss_conf 311434----165179997378872679998775 No 327 >smart00763 AAA_PrkA PrkA AAA domain. This is a family of PrkA bacterial and archaeal serine kinases approximately 630 residues long. This is the N-terminal AAA domain. Probab=92.43 E-value=0.15 Score=30.06 Aligned_cols=18 Identities=39% Similarity=0.724 Sum_probs=6.6 Q ss_pred EEEEEECCCCCCHHHHHH Q ss_conf 399996778440789999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLR 668 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lR 668 (920) +++++-||+.|||||+.+ T Consensus 79 ~IllL~GPvGsGKStl~~ 96 (361) T smart00763 79 QILYLLGPVGGGKSSLVE 96 (361) T ss_pred EEEEEECCCCCCHHHHHH T ss_conf 699998899887799999 No 328 >TIGR00602 rad24 checkpoint protein rad24; InterPro: IPR004582 To be effective as a mechanism that preserves genomic integrity, the DNA damage checkpoint must be extremely sensitive in its ability to detect DNA damage. In Saccharomyces cerevisiae the Ddc1/Rad17/Mec3 complex and Rad24 are DNA damage checkpoint components which may promote checkpoint activation by "sensing" DNA damage directly . Rad24 shares sequence homology with RF-c, a protein that recognises DNA template/RNA primer hybrids during DNA replication. The Ddc1 complex has structural homology to proliferating-cell nuclear antigen (PCNA), which clamps onto DNA and confers processivity to DNA polymerases delta and epsilon. Rad24 is postulated to recognise DNA lesions and then recruit the Ddc1 complex to generate checkpoint signals. ; GO: 0006281 DNA repair, 0007049 cell cycle, 0005634 nucleus. Probab=92.38 E-value=0.12 Score=30.62 Aligned_cols=39 Identities=18% Similarity=0.149 Sum_probs=18.0 Q ss_pred EEECCHHHCCEEEEECCCCCCCCHHHHHHCCCCCCHHHHHHHH Q ss_conf 1425133201545412787764127876301234337899998 Q gi|254780750|r 290 LFIDSAARSNLEILRTLSGSREQSLLKTIDYSITGAGGRLFAE 332 (920) Q Consensus 290 m~LD~~Tl~nLEI~~~~~g~~~gSL~~~Ln~t~T~~G~RlLr~ 332 (920) |+|...-+.|||=.+|.+.-.---+|. ..|-||+-+|.+ T Consensus 250 ~~~te~~~~nlegdNnq~kfGir~~F~----~~~IM~~~il~~ 288 (670) T TIGR00602 250 IIITESELNNLEGDNNQDKFGIRLLFP----AVTIMNKEILEE 288 (670) T ss_pred EEEECCCCCCCCCCCCCHHCCCCCCCC----HHHHHHHHHHCC T ss_conf 997124344457755510104101352----577752665407 No 329 >TIGR02012 tigrfam_recA protein RecA; InterPro: IPR001553 The recA gene product is a multifunctional enzyme that plays a role in homologous recombination, DNA repair and induction of the SOS response . In homologous recombination, the protein functions as a DNA-dependent ATPase, promoting synapsis, heteroduplex formation and strand exchange between homologous DNAs . RecA also acts as a protease cofactor that promotes autodigestion of the lexA product and phage repressors. The proteolytic inactivation of the lexA repressor by an activated form of recA may cause a derepression of the 20 or so genes involved in the SOS response, which regulates DNA repair, induced mutagenesis, delayed cell division and prophage induction in response to DNA damage . RecA is a protein of about 350 amino-acid residues. Its sequence is very well conserved , , among eubacterial species. It is also found in the chloroplast of plants . RecA-like proteins are found in archaea and diverse eukaryotic organisms, like fission yeast, mouse or human. In the filament visualised by X-ray crystallography, ß-strand 3, the loop C-terminal to ß-strand 2, and alpha-helix D of the core domain form one surface that packs against alpha-helix A and ß-strand 0 (the N-terminal domain) of an adjacent monomer during polymerisation [Lusetti and Cox, Annu. Rev. Biochem. 2002. 71:71-100.]. The core ATP-binding site domain is well conserved, with 14 invariant residues. It contains the nucleotide binding loop between ß-strand 1 and alpha-helix C. The Escherichia coli sequence GPESSGKT matches the consensus sequence of amino acids (G/A)XXXXGK(T/S) for the Walker A box (also referred to as the P-loop) found in a number of nucleoside triphosphate (NTP)-binding proteins. Another nucleotide binding motif, the Walker B box is found at ß-strand 4 in the RecA structure. The Walker B box is characterised by four hydrophobic amino acids followed by an acidic residue (usually aspartate). Nucleotide specificity and additional ATP binding interactions are contributed by the amino acid residues at ß-strand 2 and the loop C-terminal to that strand, all of which are greater than 90 Probab=92.37 E-value=0.17 Score=29.64 Aligned_cols=133 Identities=19% Similarity=0.279 Sum_probs=85.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHH-HHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEE-EECC-CCCCCCCC Q ss_conf 44345635877776643999967784407899-999999999997198530353206822105676-5237-66113853 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTF-LRQNALIVIMAQMGSYVPASYAHIGIVDKLFSR-VGSA-DNLASGRS 711 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~-lRqval~vilAQiG~fVPA~~a~i~~~D~IftR-iGa~-D~l~~g~S 711 (920) +.-|+-||..+- ..+|+.=|.||-.|||||+ |.-||-++=.--.-.||-|++|- |..|++ +|.. |+|.-.|- T Consensus 41 l~LD~AlG~GG~-P~GRi~EiYGpESsGKTTLal~~iA~~Qk~Gg~~afiDAEHAl----D~~YA~~LGv~~~~L~~sQP 115 (322) T TIGR02012 41 LALDLALGVGGL-PKGRIIEIYGPESSGKTTLALHVIAEAQKAGGVAAFIDAEHAL----DPVYAKKLGVDIDNLLVSQP 115 (322) T ss_pred HHHHHHHCCCCC-CCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECHHHC----CHHHHHHHCCCHHHHEEECC T ss_conf 134355167989-8750799854898847899999999997439838998451303----77889983645247112088 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEECCCCCC----------CHHHHHH--HHHHHHHHH---HHHCCCEEEEECCC Q ss_conf 2899999999999958998569993258898----------8056799--999999999---97269849997487 Q gi|254780750|r 712 TFMVEMIETASILNQATNQSFVILDEIGRGT----------ATLDGLS--IAWATIEYL---HETNRCRGLLATHF 772 (920) Q Consensus 712 TF~vEm~e~~~IL~~at~~SLVllDElGrGT----------st~DG~a--iA~aile~l---~~~~~~~~lfaTHy 772 (920) -.--.-.|+...|-+++.=-+|++|-+-.=| +..-|+. |-.--|..| +++..+.++|.--. T Consensus 116 d~GE~ALeI~~~L~rSgAvD~iVvDSVAAL~P~aEieGemgd~~~Gl~ARLMS~ALRKl~g~~~k~~t~~iFiNQ~ 191 (322) T TIGR02012 116 DTGEQALEIAETLVRSGAVDIIVVDSVAALVPKAEIEGEMGDSHVGLQARLMSQALRKLTGALSKSNTTLIFINQI 191 (322) T ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEEEHH T ss_conf 8714699999998723761179973400138712317543544232578889999998887653205237640022 No 330 >TIGR03420 DnaA_homol_Hda DnaA regulatory inactivator Hda. Members of this protein family are Hda (Homologous to DnaA). These proteins are about half the length of DnaA and homologous over length of Hda. In the model species Escherichia coli, the initiation of DNA replication requires DnaA bound to ATP rather than ADP; Hda helps facilitate the conversion of DnaA-ATP to DnaA-ADP. Probab=92.37 E-value=0.84 Score=24.44 Aligned_cols=96 Identities=16% Similarity=0.187 Sum_probs=52.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 43999967784407899999999999997198530353206822105676523766113853289999999999995899 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATN 729 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~ 729 (920) .+.+.|.||-.+|||.+++.+|- .+.+. |- ..+|.. ...|+-.+. .++..-.. T Consensus 38 ~~~l~i~G~~GsGKTHLl~a~~~-~~~~~-~~------------~~~yl~----------~~~~~~~~~---~~l~~l~~ 90 (226) T TIGR03420 38 DRFLYLWGESGSGKSHLLQAACA-AAEER-GK------------SAIYLP----------LAELAQADP---EVLEGLEQ 90 (226) T ss_pred CCEEEEECCCCCCHHHHHHHHHH-HHHCC-CC------------CEEEEC----------HHHHHHHHH---HHHHHCCC T ss_conf 88699989999988999999999-98626-99------------579952----------999877539---99972744 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 85699932588988056799999999999972698499974875 Q gi|254780750|r 730 QSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 730 ~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) -.++++|.+-+=....+.--.-+-+..++.+. ++..+++.+.+ T Consensus 91 ~d~l~iDDi~~i~~~~~~e~~lF~l~N~~~~~-~~~ilits~~~ 133 (226) T TIGR03420 91 ADLVCLDDVEAIAGQPEWQEALFHLYNRVREA-GGRLLIAGRAA 133 (226) T ss_pred CCEEEEECHHHHCCCHHHHHHHHHHHHHHHHH-CCEEEEECCCC T ss_conf 89999966333437837899999999999865-28289867888 No 331 >COG1134 TagH ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane] Probab=92.28 E-value=0.18 Score=29.38 Aligned_cols=127 Identities=24% Similarity=0.359 Sum_probs=64.5 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEE--EEEC--------C Q ss_conf 44434563587777664399996778440789999999999999719853035320682210567--6523--------7 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFS--RVGS--------A 703 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~Ift--RiGa--------~ 703 (920) ++=+||++.. ..+-.+=|-|+|-+||||+||-++ |.+-|-+ -++.+--+|.. -+|+ . T Consensus 41 ~aL~disf~i----~~Ge~vGiiG~NGaGKSTLlklia--------Gi~~Pt~-G~v~v~G~v~~li~lg~Gf~pelTGr 107 (249) T COG1134 41 WALKDISFEI----YKGERVGIIGHNGAGKSTLLKLIA--------GIYKPTS-GKVKVTGKVAPLIELGAGFDPELTGR 107 (249) T ss_pred EEECCCEEEE----ECCCEEEEECCCCCCHHHHHHHHH--------CCCCCCC-CEEEECCEEEHHHHCCCCCCCCCCHH T ss_conf 8752735886----079899898789985899999995--------8717988-25998146705644156778542058 Q ss_pred CCC-----CCCCCH-----HHHHHHHHHHH----------------------HHHCCCCCEEEEEC-CCCCCCHHHHHHH Q ss_conf 661-----138532-----89999999999----------------------99589985699932-5889880567999 Q gi|254780750|r 704 DNL-----ASGRST-----FMVEMIETASI----------------------LNQATNQSFVILDE-IGRGTATLDGLSI 750 (920) Q Consensus 704 D~l-----~~g~ST-----F~vEm~e~~~I----------------------L~~at~~SLVllDE-lGrGTst~DG~ai 750 (920) ||+ ..|.|. .+-|..|-|.+ +.-+.+---.|+|| |+-|+.++---|+ T Consensus 108 eNi~l~~~~lG~~~~ei~~~~~eIieFaELG~fi~~PvktYSSGM~aRLaFsia~~~~pdILllDEvlavGD~~F~~K~~ 187 (249) T COG1134 108 ENIYLRGLILGLTRKEIDEKVDEIIEFAELGDFIDQPVKTYSSGMYARLAFSVATHVEPDILLLDEVLAVGDAAFQEKCL 187 (249) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEEEHHHHCCCHHHHHHHH T ss_conf 78999999846669999988899999987788761744112588999988875331378789986134407799999999 Q ss_pred HHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 99999999972698499974875797 Q gi|254780750|r 751 AWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 751 A~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) . -++.+.++ ....+|++|-+... T Consensus 188 ~--rl~e~~~~-~~tiv~VSHd~~~I 210 (249) T COG1134 188 E--RLNELVEK-NKTIVLVSHDLGAI 210 (249) T ss_pred H--HHHHHHHC-CCEEEEEECCHHHH T ss_conf 9--99999975-98799997887999 No 332 >COG1480 Predicted membrane-associated HD superfamily hydrolase [General function prediction only] Probab=92.26 E-value=0.58 Score=25.61 Aligned_cols=76 Identities=14% Similarity=0.277 Sum_probs=47.4 Q ss_pred HHHHHHHHHHHHHHCCCE---EEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCC------CCCHHHHH Q ss_conf 999999999999726984---999748757976643068858999999960992778777744789------88778999 Q gi|254780750|r 748 LSIAWATIEYLHETNRCR---GLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIA------DHSYGIQV 818 (920) Q Consensus 748 ~aiA~aile~l~~~~~~~---~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~------~~Sygi~v 818 (920) ..||.|-++.+ ||- +-.++|||++..+.+-.--+.| .|.-... ..++.|-.| .-+.|.+. T Consensus 498 AnLAEaAa~~I----Gan~lLaRVgayYHDIGK~~rP~~FiEN-Q~~g~N~------Hd~lsP~lSa~II~sHv~eGv~m 566 (700) T COG1480 498 ANLAEAAAEEI----GANSLLARVGAYYHDIGKMKRPLFFIEN-QMGGKNP------HDDLSPQLSALIIISHVKEGVEM 566 (700) T ss_pred HHHHHHHHHHH----CCCHHHHHHHHHHHHCCCCCCCCCCCCC-CCCCCCC------CCCCCHHHHHHHHHHHHHHHHHH T ss_conf 77899999983----8863888887877501022588641111-1389997------66478788899999851568999 Q ss_pred HHHCCCCHHHHHHHHH Q ss_conf 9982999899999999 Q gi|254780750|r 819 GKLAGLPNTVISRAYD 834 (920) Q Consensus 819 A~laG~p~~vi~~A~~ 834 (920) ||..|+|++|++-+.+ T Consensus 567 ar~y~lPq~iidii~e 582 (700) T COG1480 567 AREYKLPQEIIDIIPE 582 (700) T ss_pred HHHCCCCHHHHHHHHH T ss_conf 9983997799999998 No 333 >PRK11176 lipid transporter ATP-binding/permease protein; Provisional Probab=92.24 E-value=0.43 Score=26.59 Aligned_cols=33 Identities=24% Similarity=0.216 Sum_probs=24.7 Q ss_pred EEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHH Q ss_conf 4443456358777766439999677844078999999 Q gi|254780750|r 634 FIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 634 fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqv 670 (920) +|=+|+++.. ..+...-|.||+.+||||+++-+ T Consensus 356 ~vL~~isl~I----~~G~~vaiVG~SGsGKSTL~~LL 388 (581) T PRK11176 356 PALRNINFKI----PAGKTVALVGRSGSGKSTIANLL 388 (581) T ss_pred CCCCCCCCCC----CCCCEEECCCCCCCCHHHHHHHH T ss_conf 2010663357----99944312289998678999999 No 334 >COG4604 CeuD ABC-type enterochelin transport system, ATPase component [Inorganic ion transport and metabolism] Probab=92.22 E-value=0.22 Score=28.83 Aligned_cols=72 Identities=29% Similarity=0.321 Sum_probs=41.1 Q ss_pred CCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCC Q ss_conf 70444345635877776643999967784407899999999999997198530353206822105676523766113853 Q gi|254780750|r 632 KPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRS 711 (920) Q Consensus 632 ~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~S 711 (920) ..-|-+|+++.. ..+.+-=|-|||.+||||+|-.++= .+|.++-++.+= . -++ .+ T Consensus 13 ~~~vl~~isl~i----~~g~iTs~IGPNGAGKSTLLS~~sR---------L~~~d~G~i~i~-g--------~~~---~~ 67 (252) T COG4604 13 TKVVLDDVSLDI----PKGGITSIIGPNGAGKSTLLSMMSR---------LLKKDSGEITID-G--------LEL---TS 67 (252) T ss_pred CEEEECCCEEEE----CCCCEEEEECCCCCCHHHHHHHHHH---------HCCCCCCEEEEE-E--------EEC---CC T ss_conf 777653614541----5886368888998648889999998---------526678638981-1--------662---56 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 28999999999999589 Q gi|254780750|r 712 TFMVEMIETASILNQAT 728 (920) Q Consensus 712 TF~vEm~e~~~IL~~at 728 (920) |=--|+...-.||++.. T Consensus 68 ~~s~~LAk~lSILkQ~N 84 (252) T COG4604 68 TPSKELAKKLSILKQEN 84 (252) T ss_pred CCHHHHHHHHHHHHHHC T ss_conf 87699999988987632 No 335 >KOG0057 consensus Probab=92.21 E-value=0.87 Score=24.30 Aligned_cols=122 Identities=24% Similarity=0.294 Sum_probs=68.1 Q ss_pred EEEECCCCCCHHHHHHHHHHH-------------------HHHHHCCCCCCHHHCCC--CCCCEE-E------------- Q ss_conf 999677844078999999999-------------------99997198530353206--822105-6------------- Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALI-------------------VIMAQMGSYVPASYAHI--GIVDKL-F------------- 697 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~-------------------vilAQiG~fVPA~~a~i--~~~D~I-f------------- 697 (920) .=|-|+|.+||||++|.+==. -=+-|+=.|||-++.-+ +++-.| | T Consensus 381 VaIvG~nGsGKSTilr~LlrF~d~sG~I~IdG~dik~~~~~SlR~~Ig~VPQd~~LFndTIl~NI~YGn~sas~eeV~e~ 460 (591) T KOG0057 381 VAIVGSNGSGKSTILRLLLRFFDYSGSILIDGQDIKEVSLESLRQSIGVVPQDSVLFNDTILYNIKYGNPSASDEEVVEA 460 (591) T ss_pred EEEECCCCCCHHHHHHHHHHHHCCCCCEEECCEEHHHHCHHHHHHHEEEECCCCCCCCHHHHHHHHCCCCCCCHHHHHHH T ss_conf 98978999878899999999744688599987337650757765221676776643006599886328987688999999 Q ss_pred -EEEECCC---CCCCCCCHHHH---------HHHHHH---HHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf -7652376---61138532899---------999999---9999589985699932588988056799999999999972 Q gi|254780750|r 698 -SRVGSAD---NLASGRSTFMV---------EMIETA---SILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET 761 (920) Q Consensus 698 -tRiGa~D---~l~~g~STF~v---------Em~e~~---~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~ 761 (920) -|-|-+| ++..|.+|--- |+..++ .+|++|- .+++||- ||..|-.+= ..+++.+.+. T Consensus 461 ~k~a~~hd~i~~l~~GY~T~VGerG~~LSGGekQrvslaRa~lKda~---Il~~DEa---TS~LD~~TE-~~i~~~i~~~ 533 (591) T KOG0057 461 CKRAGLHDVISRLPDGYQTLVGERGLMLSGGEKQRVSLARAFLKDAP---ILLLDEA---TSALDSETE-REILDMIMDV 533 (591) T ss_pred HHHCCCHHHHHHCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHCCCC---EEEECCC---CCCCCHHHH-HHHHHHHHHH T ss_conf 99728378887366630326753444256406789999999845898---6886376---532365669-9999999875 Q ss_pred CCC-EEEEECCCHHHHHHHHH Q ss_conf 698-49997487579766430 Q gi|254780750|r 762 NRC-RGLLATHFHELTDLSKS 781 (920) Q Consensus 762 ~~~-~~lfaTHy~eL~~l~~~ 781 (920) ... .++|.-|-|.+..-.+. T Consensus 534 ~~~rTvI~IvH~l~ll~~~Dk 554 (591) T KOG0057 534 MSGRTVIMIVHRLDLLKDFDK 554 (591) T ss_pred CCCCEEEEEEECCHHHHCCCE T ss_conf 179769999963034750888 No 336 >COG4615 PvdE ABC-type siderophore export system, fused ATPase and permease components [Secondary metabolites biosynthesis, transport, and catabolism / Inorganic ion transport and metabolism] Probab=92.20 E-value=0.44 Score=26.54 Aligned_cols=151 Identities=21% Similarity=0.257 Sum_probs=73.9 Q ss_pred CCCEEC-CCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 851210-4787140000468058876310287704443456358777766439999677844078999999999999971 Q gi|254780750|r 601 YCRPII-DNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQM 679 (920) Q Consensus 601 y~rP~i-~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQi 679 (920) +-+|+- -+=..+++++-|-.-. ...|--|-|+|.. ..+-+..|+|-|.+||||+++- -. T Consensus 311 ~~~~q~~p~~~~lelrnvrfay~--------~~~FhvgPiNl~i----krGelvFliG~NGsGKST~~~L--------Lt 370 (546) T COG4615 311 FPRPQAFPDWKTLELRNVRFAYQ--------DNAFHVGPINLTI----KRGELVFLIGGNGSGKSTLAML--------LT 370 (546) T ss_pred CCCCCCCCCCCCCEEEEEEECCC--------CCCCEECCEEEEE----ECCCEEEEECCCCCCHHHHHHH--------HH T ss_conf 88877687545201320020367--------6552115613687----3373899988899638899999--------97 Q ss_pred CCCCCHHHCC---------------CCCCCEEEEEEECCCCCCCCCC-------HHHHHHHHHHH--------------- Q ss_conf 9853035320---------------6822105676523766113853-------28999999999--------------- Q gi|254780750|r 680 GSYVPASYAH---------------IGIVDKLFSRVGSADNLASGRS-------TFMVEMIETAS--------------- 722 (920) Q Consensus 680 G~fVPA~~a~---------------i~~~D~IftRiGa~D~l~~g~S-------TF~vEm~e~~~--------------- 722 (920) |.|-|-+--. =++|..||++.---|.+..+.- +|..+-.|.++ T Consensus 371 GL~~PqsG~I~ldg~pV~~e~ledYR~LfSavFsDyhLF~~ll~~e~~as~q~i~~~LqrLel~~ktsl~d~~fs~~kLS 450 (546) T COG4615 371 GLYQPQSGEILLDGKPVSAEQLEDYRKLFSAVFSDYHLFDQLLGPEGKASPQLIEKWLQRLELAHKTSLNDGRFSNLKLS 450 (546) T ss_pred HCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHEECCCCCCCCCCC T ss_conf 06688888266789348844789999999999645652687629656798678999999998766500057863200045 Q ss_pred ---------HHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEE-CCCH Q ss_conf ---------99958998569993258898805679999999999997269849997-4875 Q gi|254780750|r 723 ---------ILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLA-THFH 773 (920) Q Consensus 723 ---------IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfa-THy~ 773 (920) ++--.-+|-..++||--.--+|.=.-- -|-++--+.+. .-+|+|| ||.- T Consensus 451 tGQkKRlAll~AllEeR~Ilv~DEWAADQDPaFRR~-FY~~lLp~LK~-qGKTI~aIsHDd 509 (546) T COG4615 451 TGQKKRLALLLALLEERDILVLDEWAADQDPAFRRE-FYQVLLPLLKE-QGKTIFAISHDD 509 (546) T ss_pred CCHHHHHHHHHHHHHHCCEEEEEHHHCCCCHHHHHH-HHHHHHHHHHH-HCCEEEEEECCC T ss_conf 305878999999973098688601212467599999-99998599997-098599994473 No 337 >pfam05707 Zot Zonular occludens toxin (Zot). This family consists of bacterial and viral proteins which are very similar to the Zonular occludens toxin (Zot). Zot is elaborated by bacteriophages present in toxigenic strains of Vibrio cholerae. Zot is a single polypeptide chain of 44.8 kDa, with the ability to reversibly alter intestinal epithelial tight junctions, allowing the passage of macromolecules through mucosal barriers Probab=92.15 E-value=0.88 Score=24.26 Aligned_cols=108 Identities=14% Similarity=0.146 Sum_probs=50.9 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCC-C-CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 9999677844078999999999999971985303532068-2-2105676523766113853289999999999995899 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIG-I-VDKLFSRVGSADNLASGRSTFMVEMIETASILNQATN 729 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~-~-~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~ 729 (920) +.+|||.+-+|||-+.=+--++--+ .-|..|-+. |+ + .|+|....|.. ...|+ |..+....-+.... T Consensus 2 I~litG~pGsGKS~~aV~~~i~~al-~~GR~V~tN---I~gL~~~~~~~~~~~~------~~~~~-~~~~~~~~w~~~p~ 70 (183) T pfam05707 2 IYLITGKPGSGKTLEAVSYHILPAL-KKGRKVITN---IDGLNLERFPKVFGED------VRERL-EDIGYMDPWRTYPK 70 (183) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHH-HCCCEEEEC---CCCCCCHHCCCCCCCC------CCCCC-CCCCCCHHHCCCCC T ss_conf 7999359999622999999999998-789989987---8653522101223444------54320-00012223314999 Q ss_pred CCEEEEECCCC-----CCCHHHHHHHHHHHHHHHHH--HCCCEEEEECCCHHH Q ss_conf 85699932588-----98805679999999999997--269849997487579 Q gi|254780750|r 730 QSFVILDEIGR-----GTATLDGLSIAWATIEYLHE--TNRCRGLLATHFHEL 775 (920) Q Consensus 730 ~SLVllDElGr-----GTst~DG~aiA~aile~l~~--~~~~~~lfaTHy~eL 775 (920) .|||++||.++ ++...+ -..+++|.. +-|--.+|.|.-..+ T Consensus 71 g~liViDE~~~~~~~r~~~~~~-----~~~i~~l~~HRH~G~DiiliTQ~~~~ 118 (183) T pfam05707 71 GALLVIDEAQTWFPSRRGGDKV-----PPVLDAFSTHRHLGWDIILITQNPSK 118 (183) T ss_pred CCEEEEECCHHHCCCCCCCCCC-----CHHHHHHHHCCCCCCEEEEEECCHHH T ss_conf 8799998976554887778888-----38999999807788208999189799 No 338 >PRK06995 flhF flagellar biosynthesis regulator FlhF; Validated Probab=92.09 E-value=0.9 Score=24.21 Aligned_cols=159 Identities=21% Similarity=0.293 Sum_probs=86.6 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC---CCCCHHHCCCCCCCEE--EEEEECCCCCCCCCCHH-HHHHHHHHH Q ss_conf 64399996778440789999999999999719---8530353206822105--67652376611385328-999999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG---SYVPASYAHIGIVDKL--FSRVGSADNLASGRSTF-MVEMIETAS 722 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG---~fVPA~~a~i~~~D~I--ftRiGa~D~l~~g~STF-~vEm~e~~~ 722 (920) .+.+..+-||-..||||.+==+|--..|.+=+ .+|-++.-+|+=++++ |++|= |..-. .-.-.|... T Consensus 175 ~ggV~alVGPTGVGKTTTiAKLAAr~~l~~g~~kVaLIT~DTYRIgAvEQLktYa~Il-------gvPv~vv~~~~eL~~ 247 (404) T PRK06995 175 RGGVFALVGPTGVGKTTTTAKLAARCVMRHGASKVALLTTDSYRIGGHEQLRIYGKIL-------GVPVHAVKDAADLRL 247 (404) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHC-------CCEEEEECCHHHHHH T ss_conf 4755898668887637589999999999838983799976875478999999999875-------955999599999999 Q ss_pred HHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC----EEEE-ECCCHHHHHHHHHCCCEEEEEEEEEEECC Q ss_conf 999589985699932588988056799999999999972698----4999-74875797664306885899999996099 Q gi|254780750|r 723 ILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRC----RGLL-ATHFHELTDLSKSLKRFHNATLQVSDSNE 797 (920) Q Consensus 723 IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~----~~lf-aTHy~eL~~l~~~~~~v~n~~~~~~~~~~ 797 (920) .|.....+-|||||=-|| |..|-. .. --++.|...... ++|= +|++..|.+....+..+... T Consensus 248 aL~~l~~~dlILIDTaGr--s~rD~~-~~-e~l~~l~~~~~~~~~~LVLsat~~~~dl~~i~~~f~~~~~~--------- 314 (404) T PRK06995 248 ALAELRNKHIVLIDTVGM--SQRDRM-VS-EQIAMLHGAGAPVQRLLLLNATSHGDTLNEVVQAYRGPGLA--------- 314 (404) T ss_pred HHHHHCCCCEEEEECCCC--CCCCHH-HH-HHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHCCCCCC--------- T ss_conf 999708999999809998--976888-99-99999973578852899977989999999999984469998--------- Q ss_pred EEEEEEEEEECCCCCCHH--HHHHHHCCCCHHHHHH Q ss_conf 277877774478988778--9999982999899999 Q gi|254780750|r 798 GIIFLHKVIPGIADHSYG--IQVGKLAGLPNTVISR 831 (920) Q Consensus 798 ~i~flykl~~G~~~~Syg--i~vA~laG~p~~vi~~ 831 (920) +++|+ ||=+-. ||| ++++-..++|-.-+-. T Consensus 315 ~~I~T-KLDEt~---~~G~iln~~~~~~lPlsy~T~ 346 (404) T PRK06995 315 GCILT-KLDEAA---SLGGALDTVIRHKLPLHYVSN 346 (404) T ss_pred EEEEE-CCCCCC---CHHHHHHHHHHHCCCEEEECC T ss_conf 39983-040679---723999999997898599818 No 339 >COG3044 Predicted ATPase of the ABC class [General function prediction only] Probab=91.97 E-value=0.18 Score=29.34 Aligned_cols=21 Identities=38% Similarity=0.509 Sum_probs=18.5 Q ss_pred EEEEECCCCCCHHHHHHHHHH Q ss_conf 999967784407899999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval 672 (920) +.+|||||.-||||++-.+.- T Consensus 244 it~ItG~nfhGKTTLl~Aie~ 264 (554) T COG3044 244 ITLITGGNFHGKTTLLTAIER 264 (554) T ss_pred EEEEECCCCCCHHHHHHHHHH T ss_conf 389855876662689999972 No 340 >KOG0743 consensus Probab=91.92 E-value=0.85 Score=24.39 Aligned_cols=145 Identities=19% Similarity=0.360 Sum_probs=83.9 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 66439999677844078999999999999971985303532068221056765237661138532899999999999958 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a 727 (920) .-.|-.++.||-..||||++-. | |......|+|==.|-++..| |+..+|..+ T Consensus 233 awKRGYLLYGPPGTGKSS~IaA------m--------An~L~ydIydLeLt~v~~n~--------------dLr~LL~~t 284 (457) T KOG0743 233 AWKRGYLLYGPPGTGKSSFIAA------M--------ANYLNYDIYDLELTEVKLDS--------------DLRHLLLAT 284 (457) T ss_pred CCCCCCEEECCCCCCHHHHHHH------H--------HHHCCCCEEEEEECCCCCCH--------------HHHHHHHHC T ss_conf 5000412047999988899999------9--------72058736774400236838--------------999999728 Q ss_pred CCCCEEEEECCC-------CCCC---HHHH--HHHHHH-HHHHHHHHC--CC----EEEEECCCHHHHHHHHHCCCEEEE Q ss_conf 998569993258-------8988---0567--999999-999999726--98----499974875797664306885899 Q gi|254780750|r 728 TNQSFVILDEIG-------RGTA---TLDG--LSIAWA-TIEYLHETN--RC----RGLLATHFHELTDLSKSLKRFHNA 788 (920) Q Consensus 728 t~~SLVllDElG-------rGTs---t~DG--~aiA~a-ile~l~~~~--~~----~~lfaTHy~eL~~l~~~~~~v~n~ 788 (920) +++|.|+|--|- |+.. .++| .-+..+ .|..+ +-. -| -++|+|.|.|=-+-+--.| . T Consensus 285 ~~kSIivIEDIDcs~~l~~~~~~~~~~~~~~~~~VTlSGLLNfi-DGlwSscg~ERIivFTTNh~EkLDPALlRp----G 359 (457) T KOG0743 285 PNKSILLIEDIDCSFDLRERRKKKKENFEGDLSRVTLSGLLNFL-DGLWSSCGDERIIVFTTNHKEKLDPALLRP----G 359 (457) T ss_pred CCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEHHHHHHHH-CCCCCCCCCCEEEEEECCCHHHCCHHHCCC----C T ss_conf 99718999612432304434555664546776606647756641-343004887349999468710068866288----7 Q ss_pred EEEEEEECCEEEEEEEEEECCCCC-CHHHHHHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 999996099277877774478988-77899999829998999999999999987 Q gi|254780750|r 789 TLQVSDSNEGIIFLHKVIPGIADH-SYGIQVGKLAGLPNTVISRAYDILKTFEK 841 (920) Q Consensus 789 ~~~~~~~~~~i~flykl~~G~~~~-Sygi~vA~laG~p~~vi~~A~~~~~~le~ 841 (920) .|+ +++--|-|.- .|-.-...-.|+|+ ...+..++|+ T Consensus 360 RmD-----------mhI~mgyCtf~~fK~La~nYL~~~~-----~h~L~~eie~ 397 (457) T KOG0743 360 RMD-----------MHIYMGYCTFEAFKTLASNYLGIEE-----DHRLFDEIER 397 (457) T ss_pred CCE-----------EEEECCCCCHHHHHHHHHHHCCCCC-----CCHHHHHHHH T ss_conf 522-----------5667266987999999998338988-----7306799998 No 341 >COG4161 ArtP ABC-type arginine transport system, ATPase component [Amino acid transport and metabolism] Probab=91.86 E-value=0.25 Score=28.30 Aligned_cols=36 Identities=39% Similarity=0.639 Sum_probs=25.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV 674 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v 674 (920) +--||+|+++ ++..+++-||-.+||||++|..-|+- T Consensus 17 ~lfdi~l~~~----~getlvllgpsgagkssllr~lnlle 52 (242) T COG4161 17 ALFDITLDCP----EGETLVLLGPSGAGKSSLLRVLNLLE 52 (242) T ss_pred EEEEEEECCC----CCCEEEEECCCCCCHHHHHHHHHHHH T ss_conf 0035532268----89779997788876467999988871 No 342 >KOG0979 consensus Probab=91.77 E-value=0.24 Score=28.53 Aligned_cols=68 Identities=19% Similarity=0.222 Sum_probs=32.3 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCC-------CCCCCHH-HHHHHHHHHHHCCCEEEEEEE Q ss_conf 9929999808822200799999998618189107889988876-------2656176-699999999988985999950 Q gi|254780750|r 40 PDSLVFYRMGDFYELFFDDALLASRCLGITLTKRGKHLGKDIP-------MCGVPVH-TANHYIQKLIKIGHRIAICEQ 110 (920) Q Consensus 40 ~D~il~fr~G~FYE~f~~DA~~aa~~L~i~lt~r~~~~~~~~p-------maGvP~~-~l~~yl~~Lv~~GykVaiveQ 110 (920) |..|+=+++-.|-..=+ =-..-.+-||+.+.+- +.|+.-- .+|=|.. --.+-+.-.|.+|-.+|-+|= T Consensus 19 ~GsIvrI~l~NF~Ty~~-~e~~pgpsLNmIiGpN--GSGKSSiVcAIcLglgG~Pk~lGRak~VgeyIK~G~~~g~IEI 94 (1072) T KOG0979 19 DGSIVRIELHNFLTYDH-TEFLPGPSLNMIIGPN--GSGKSSIVCAICLGLGGKPKLLGRAKKVGEYIKRGEDEGYIEI 94 (1072) T ss_pred CCCEEEEEEEEEEEEEE-EEECCCCCEEEEECCC--CCCCHHHHHHHHHHCCCCHHHCCCHHHHHHHHHCCCCCCEEEE T ss_conf 98669999740044433-3443788612687789--8970488999999727974431435579999964776636999 No 343 >COG1125 OpuBA ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism] Probab=91.65 E-value=0.23 Score=28.57 Aligned_cols=32 Identities=31% Similarity=0.570 Sum_probs=16.2 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHH Q ss_conf 443456358777766439999677844078999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqv 670 (920) +-+|++|.. ..+-++++-||--+||||.||++ T Consensus 16 av~~v~l~I----~~gef~vliGpSGsGKTTtLkMI 47 (309) T COG1125 16 AVDDVNLTI----EEGEFLVLIGPSGSGKTTTLKMI 47 (309) T ss_pred EEEEEEEEE----CCCEEEEEECCCCCCHHHHHHHH T ss_conf 332225776----59728999878997578799999 No 344 >pfam00931 NB-ARC NB-ARC domain. Probab=91.45 E-value=0.68 Score=25.11 Aligned_cols=174 Identities=14% Similarity=0.115 Sum_probs=84.9 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHH---HHCCCCCCHHH--CCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHH Q ss_conf 66439999677844078999999999999---97198530353--20682210567652376611385328999999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIM---AQMGSYVPASY--AHIGIVDKLFSRVGSADNLASGRSTFMVEMIETAS 722 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vil---AQiG~fVPA~~--a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~ 722 (920) ++-+++=|.|+-.-||||+.|.+.=-.-. -..-++|.... -...+...|+.+++..+. .+.+. +..+.+. T Consensus 17 ~~~~vI~I~G~gGiGKTtLA~~v~~~~~i~~~F~~~~wv~vs~~~~~~~i~~~i~~~l~~~~~----~~~~~-~~~~l~~ 91 (285) T pfam00931 17 ENLGVVGIVGMGGVGKTTLAKQIYNDDSVGGHFDSVAWVVVSKTYTEFRLQKDILQELGLDDS----DWVEK-NESELAV 91 (285) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHCCHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCC----CCCCC-CHHHHHH T ss_conf 895399988999563999999997165565059838999979766689999999998566654----55557-8999999 Q ss_pred HHHH--CCCCCEEEEECCCCCCCHHHHHHHHHH-HHHHHH-HHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCE Q ss_conf 9995--899856999325889880567999999-999999-726984999748757976643068858999999960992 Q gi|254780750|r 723 ILNQ--ATNQSFVILDEIGRGTATLDGLSIAWA-TIEYLH-ETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEG 798 (920) Q Consensus 723 IL~~--at~~SLVllDElGrGTst~DG~aiA~a-ile~l~-~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~ 798 (920) .|+. .+++-||++|-+... .+ |- +...+- ...||+.|++|...+++...... ...+.+..-..++. T Consensus 92 ~l~~~L~~kr~LiVLDDVw~~---~~-----~~~l~~~~~~~~~gSrIIvTTR~~~V~~~~~~~--~~~~~l~~L~~~es 161 (285) T pfam00931 92 KIKEALLRKRFLLVLDDVWEK---ND-----WDKIGVPFPDGENGSRVIVTTRSESVAGRMGGT--SKPHEVESLEPEES 161 (285) T ss_pred HHHHHHCCCCEEEEECCCCCH---HH-----HHHHHCCCCCCCCCCEEEEECCCHHHHHHCCCC--CCEEECCCCCHHHH T ss_conf 999997279669996388878---99-----999734575789982799855758999873788--83476168987999 Q ss_pred E-EEEEEEEECCCC-----CCHHHHHHH-HCCCCHHHHHHHHHHH Q ss_conf 7-787777447898-----877899999-8299989999999999 Q gi|254780750|r 799 I-IFLHKVIPGIAD-----HSYGIQVGK-LAGLPNTVISRAYDIL 836 (920) Q Consensus 799 i-~flykl~~G~~~-----~Sygi~vA~-laG~p~~vi~~A~~~~ 836 (920) . .|..+.-.+..+ ...|-++++ ..|+|-.+.--|..+. T Consensus 162 ~~Lf~~~a~~~~~~~~~~l~~~~~~Iv~~C~GlPLai~~lg~~L~ 206 (285) T pfam00931 162 WELFSNKVFEKELPPCPELEEVAKEIVEKCKGLPLALKVLGGLLA 206 (285) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHC T ss_conf 999999846898999767999999999985899499999999971 No 345 >PRK09519 recA recombinase A; Reviewed Probab=91.42 E-value=0.53 Score=25.94 Aligned_cols=10 Identities=20% Similarity=0.557 Sum_probs=4.6 Q ss_pred CCCHHHHHHH Q ss_conf 9998999999 Q gi|254780750|r 823 GLPNTVISRA 832 (920) Q Consensus 823 G~p~~vi~~A 832 (920) |+.+.+|+.| T Consensus 719 GVe~g~vrKs 728 (790) T PRK09519 719 GVDQGLIRKS 728 (790) T ss_pred CCCCCEEECC T ss_conf 3200401025 No 346 >COG1127 Ttg2A ABC-type transport system involved in resistance to organic solvents, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=91.34 E-value=1.1 Score=23.64 Aligned_cols=126 Identities=24% Similarity=0.268 Sum_probs=65.1 Q ss_pred CEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCC------------CEEEEEE Q ss_conf 0444345635877776643999967784407899999999999997198530353206822------------1056765 Q gi|254780750|r 633 PFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIV------------DKLFSRV 700 (920) Q Consensus 633 ~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~------------D~IftRi 700 (920) .-|=.++.|.. ..+.++.|-||--+||||+||+++- .-=|-+ -++-++ ..|..|| T Consensus 21 ~~Ild~v~l~V----~~Gei~~iiGgSGsGKStlLr~I~G--------ll~P~~-GeI~i~G~~i~~ls~~~~~~ir~r~ 87 (263) T COG1127 21 RVILDGVDLDV----PRGEILAILGGSGSGKSTLLRLILG--------LLRPDK-GEILIDGEDIPQLSEEELYEIRKRM 87 (263) T ss_pred EEEECCCEEEE----CCCCEEEEECCCCCCHHHHHHHHHC--------CCCCCC-CEEEECCCCHHCCCHHHHHHHHHHE T ss_conf 77862731355----0781899988988689999999856--------578988-7599847641104988999998435 Q ss_pred EC---CCCCCCCCCHH------HH--------HHHHHH-----------------------------HHHH-HCCCCCEE Q ss_conf 23---76611385328------99--------999999-----------------------------9999-58998569 Q gi|254780750|r 701 GS---ADNLASGRSTF------MV--------EMIETA-----------------------------SILN-QATNQSFV 733 (920) Q Consensus 701 Ga---~D~l~~g~STF------~v--------Em~e~~-----------------------------~IL~-~at~~SLV 733 (920) |- +--|+...+.| |- ++.|++ .+-| -|..-.|+ T Consensus 88 GvlFQ~gALFssltV~eNVafplre~~~lp~~~i~~lv~~KL~~VGL~~~~~~~~PsELSGGM~KRvaLARAialdPell 167 (263) T COG1127 88 GVLFQQGALFSSLTVFENVAFPLREHTKLPESLIRELVLMKLELVGLRGAAADLYPSELSGGMRKRVALARAIALDPELL 167 (263) T ss_pred EEEEECCCCCCCCCHHHHHHEEHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHCCHHHCCHHHHHHHHHHHHHCCCCEE T ss_conf 17860561235565457310006864259999999999999986499856666393320435889999999986498779 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 993258898805679999999999997269849997487 Q gi|254780750|r 734 ILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 734 llDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) ++||--.|-+|.--..|. ..+..|-+..++.++..||. T Consensus 168 ~~DEPtsGLDPI~a~~~~-~LI~~L~~~lg~T~i~VTHD 205 (263) T COG1127 168 FLDEPTSGLDPISAGVID-ELIRELNDALGLTVIMVTHD 205 (263) T ss_pred EECCCCCCCCCCHHHHHH-HHHHHHHHHHCCEEEEEECC T ss_conf 855997788830277999-99999998639879999777 No 347 >COG4717 Uncharacterized conserved protein [Function unknown] Probab=91.29 E-value=0.15 Score=30.08 Aligned_cols=89 Identities=20% Similarity=0.231 Sum_probs=48.4 Q ss_pred HHHCCCCCCCEEEEEEECC---CCC----CCCCCHHHHHHHHH-H----------HHHHHCCCCCE-EEEECCCCCCCHH Q ss_conf 3532068221056765237---661----13853289999999-9----------99995899856-9993258898805 Q gi|254780750|r 685 ASYAHIGIVDKLFSRVGSA---DNL----ASGRSTFMVEMIET-A----------SILNQATNQSF-VILDEIGRGTATL 745 (920) Q Consensus 685 A~~a~i~~~D~IftRiGa~---D~l----~~g~STF~vEm~e~-~----------~IL~~at~~SL-VllDElGrGTst~ 745 (920) |...=.++.|.=||+|-.+ |+| ..|.|-|..|++.- + -|---.+.-+| +|+|-+. -++ T Consensus 863 A~~~F~hlT~G~Yt~Iy~~e~~d~I~V~~~~G~~~~~~ELSqgT~EQLYlAlRfali~~~~~~~~LP~i~DD~f---VhF 939 (984) T COG4717 863 ASEFFMHLTDGRYTGIYTQEDKDSIIVEHRAGGSKLAEELSQGTKEQLYLALRFALIHEVRTREPLPFIADDIF---VHF 939 (984) T ss_pred HHHHHHHCCCCCEEEEECCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCH---HCC T ss_conf 99998642488523665356776058872365523589883657999999999988863135788875632300---005 Q ss_pred HHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHH Q ss_conf 67999999999999726-984999748757976 Q gi|254780750|r 746 DGLSIAWATIEYLHETN-RCRGLLATHFHELTD 777 (920) Q Consensus 746 DG~aiA~aile~l~~~~-~~~~lfaTHy~eL~~ 777 (920) |- .=+..++++|.+.. +--+++=||..+.++ T Consensus 940 D~-~R~~r~~e~l~dls~~~QviYFTCHe~~~d 971 (984) T COG4717 940 DD-ERAKRMLELLADLSEGNQVIYFTCHEHTCD 971 (984) T ss_pred CH-HHHHHHHHHHHHHCCCCEEEEEEECHHHHC T ss_conf 88-899999999997355780799970366541 No 348 >cd03242 ABC_RecF RecF is a recombinational DNA repair ATPase that maintains replication in the presence of DNA damage. When replication is prematurely disrupted by DNA damage, several recF pathway gene products play critical roles processing the arrested replication fork, allowing it to resume and complete its task. This CD represents the nucleotide binding domain of RecF. RecF belongs to a large superfamily of ABC transporters involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases with a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.17 E-value=0.23 Score=28.61 Aligned_cols=30 Identities=10% Similarity=0.081 Sum_probs=15.2 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 9985699932588988056799999999999972 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHET 761 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~ 761 (920) .....+|+|++..-=+..- ..++++.|.+. T Consensus 209 ~~~PilLlDDi~seLD~~r----~~~l~~~l~~~ 238 (270) T cd03242 209 GEYPVLLLDDVLAELDLGR----QAALLDAIEGR 238 (270) T ss_pred CCCCEEEECCHHHHHCHHH----HHHHHHHHCCC T ss_conf 9997899723155409999----99999987018 No 349 >PRK12727 flagellar biosynthesis regulator FlhF; Provisional Probab=90.85 E-value=1.2 Score=23.32 Aligned_cols=127 Identities=20% Similarity=0.293 Sum_probs=71.9 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC---CCCCHHHCCCCCCCE--EEEEE-ECCCCCCCCCCHHHHHHHHHHH Q ss_conf 64399996778440789999999999999719---853035320682210--56765-2376611385328999999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG---SYVPASYAHIGIVDK--LFSRV-GSADNLASGRSTFMVEMIETAS 722 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG---~fVPA~~a~i~~~D~--IftRi-Ga~D~l~~g~STF~vEm~e~~~ 722 (920) .+-+.-+-||-..||+|.+=-+|--..|-+=- .+|-.+..+||-+++ +|.|| |-.=.+.. +-.|+.. T Consensus 347 ~gGv~AlvGpTGvGKTTT~aKlAa~~~~~~g~~~valit~DtyRiga~eQL~~y~~ilgvpv~~~~-------~~~~l~~ 419 (557) T PRK12727 347 RGGVIALVGPTGAGKTTTIAKLAQRFAAQHAPRDVALVTTDTQRVGGREQLHSYGRQLGIAVHEAD-------SAESLLD 419 (557) T ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCEEEEEC-------CHHHHHH T ss_conf 076478743777673117999999999973998189997266408799999999998397579828-------9999999 Q ss_pred HHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHH--HCC-CEEEE-ECCCHHHHHHHHHCCCEE Q ss_conf 99958998569993258898805679999999999997--269-84999-748757976643068858 Q gi|254780750|r 723 ILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHE--TNR-CRGLL-ATHFHELTDLSKSLKRFH 786 (920) Q Consensus 723 IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~--~~~-~~~lf-aTHy~eL~~l~~~~~~v~ 786 (920) .|..-..+.|||||-.|+|-. |-.- ..-+..|.. .++ .++|= +||+..|.+....+..+. T Consensus 420 ~l~~l~~~~lvliDTaG~~~r--d~~~--~~~~~~l~~~~~~~~~Lvl~a~~~~~~l~~~~~~~~~~~ 483 (557) T PRK12727 420 LLERLRDYKLVLIDTAGMGQR--DRAL--AAQLNWLRAARQVTSLLVLPANAHFSDLDEVVRRFAHAK 483 (557) T ss_pred HHHHHCCCCEEEEECCCCCCC--CHHH--HHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHCCCC T ss_conf 999836999899949998846--9999--999998751477635999968899899999999853799 No 350 >TIGR01186 proV glycine betaine/L-proline transport ATP binding subunit; InterPro: IPR005892 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. Functionally, this transport system is involved in osmoregulation. Under conditions of stress, the organism recruits this transport system to accumulate glycine betaine and other solutes which offer osmo-protection. It has been demonstrated that glycine betaine uptake is accompanied by symport with sodium ions. The locus has been named variously as proU or opuA. A gene library from L. lactis functionally complements an Escherichia coli proU mutant. The complementing locus is similar to a opuA locus in Bacillus subtlis. This clarifies the differences in nomenclature.; GO: 0005524 ATP binding, 0015171 amino acid transmembrane transporter activity, 0006865 amino acid transport, 0016020 membrane. Probab=90.84 E-value=0.27 Score=28.11 Aligned_cols=11 Identities=18% Similarity=0.706 Sum_probs=6.4 Q ss_pred CEECCCCCCEE Q ss_conf 12104787140 Q gi|254780750|r 603 RPIIDNSTNFI 613 (920) Q Consensus 603 rP~i~~~~~l~ 613 (920) -|+++++..+. T Consensus 339 vpVVDE~~~~~ 349 (372) T TIGR01186 339 VPVVDEDQRLV 349 (372) T ss_pred EEEECCCCCEE T ss_conf 66661456458 No 351 >PRK11022 dppD dipeptide transporter ATP-binding subunit; Provisional Probab=90.76 E-value=0.3 Score=27.75 Aligned_cols=33 Identities=18% Similarity=0.147 Sum_probs=25.3 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +=+|++|.. ..+.++=|-|+|-+||||++|.+. T Consensus 22 Av~~Vsf~i----~~GEilgivGeSGsGKSTl~~~il 54 (327) T PRK11022 22 AVDRISYSV----KQGEVVGIVGESGSGKSVSSLAIM 54 (327) T ss_pred EEECEEEEE----CCCCEEEEECCCCCHHHHHHHHHH T ss_conf 984418798----899999999999878999999997 No 352 >COG4674 Uncharacterized ABC-type transport system, ATPase component [General function prediction only] Probab=90.75 E-value=0.11 Score=31.06 Aligned_cols=45 Identities=18% Similarity=0.184 Sum_probs=29.4 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 985699932588988056799999999999972698499974875797 Q gi|254780750|r 729 NQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 729 ~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +--|.|+||--.|-.-.+-...|. .+..|. .+|..+..-|.-+.. T Consensus 165 ~P~lLLlDEPvAGMTd~Et~~tae-Ll~~la--~~hsilVVEHDM~Fv 209 (249) T COG4674 165 DPKLLLLDEPVAGMTDAETEKTAE-LLKSLA--GKHSILVVEHDMGFV 209 (249) T ss_pred CCCEEEECCCCCCCCHHHHHHHHH-HHHHHH--CCCEEEEEECCHHHH T ss_conf 886788558657885788999999-999873--174499994537999 No 353 >pfam02463 SMC_N RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination. Probab=90.60 E-value=0.29 Score=27.87 Aligned_cols=12 Identities=0% Similarity=-0.050 Sum_probs=4.6 Q ss_pred CCHHHHHHHHHH Q ss_conf 587899999852 Q gi|254780750|r 180 SNHDRLISDIMR 191 (920) Q Consensus 180 ~~~d~l~~~L~~ 191 (920) ++..++.+.|.. T Consensus 117 ~~~~dv~~ll~~ 128 (1162) T pfam02463 117 VTKKDVAELLES 128 (1162) T ss_pred CCHHHHHHHHHH T ss_conf 379999999997 No 354 >COG4637 Predicted ATPase [General function prediction only] Probab=90.58 E-value=0.36 Score=27.21 Aligned_cols=124 Identities=20% Similarity=0.185 Sum_probs=68.4 Q ss_pred EECCCCCCHHHHHHHHH---HHHHHHHCCCCCCH-HHCCCCCC-----CEEEEEEECCC-----CCCCCCCHHHHHHHHH Q ss_conf 96778440789999999---99999971985303-53206822-----10567652376-----6113853289999999 Q gi|254780750|r 655 LTGPNMGGKSTFLRQNA---LIVIMAQMGSYVPA-SYAHIGIV-----DKLFSRVGSAD-----NLASGRSTFMVEMIET 720 (920) Q Consensus 655 iTGpNmgGKSt~lRqva---l~vilAQiG~fVPA-~~a~i~~~-----D~IftRiGa~D-----~l~~g~STF~vEm~e~ 720 (920) =||-||+-==-+||++. .-.|.|-+- .+|- ...+-.|. .=.|+.=|-.| -++.|.=-||. . T Consensus 207 ~tG~nlAs~l~~lR~t~~D~~q~i~~v~~-aFpG~~~~~~~P~~~g~~~l~~~d~~~~~P~~~~eLSDGTlRfl~----l 281 (373) T COG4637 207 PTGDNLASVLYTLRETHPDIFQLIEAVMD-AFPGIRLLEPKPGEDGRIELKWQDRGFLDPFRDRELSDGTLRFLA----L 281 (373) T ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHH-CCCCCHHCCCCCCCCCEEEEEECCCCCCCCHHHHHCCCCHHHHHH----H T ss_conf 85455999999998708209999999886-079712135898767258999616666675044203533899999----9 Q ss_pred HHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEE Q ss_conf 999995899856999325889880567999999999999726984999748757976643068858 Q gi|254780750|r 721 ASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFH 786 (920) Q Consensus 721 ~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~ 786 (920) +..|-+-.+.-|+++||---+--|.==-++|.-+-++= + .+-++++||-.+|....+..+.+. T Consensus 282 ~t~Llsp~~p~ll~ldEPE~sLHP~lL~~La~~~~sAa--k-~sQv~VsTHS~rLl~~~e~~~v~~ 344 (373) T COG4637 282 ATLLLSPRPPPLLLLDEPETSLHPDLLPALAELMRSAA--K-RSQVIVSTHSPRLLNAVEEHSVFW 344 (373) T ss_pred HHHHCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHH--C-CCEEEEEECCHHHHHHCCCCCEEE T ss_conf 99983999996267458523358769999999999862--0-551899827799995356364588 No 355 >TIGR02533 type_II_gspE general secretory pathway protein E; InterPro: IPR013369 GspE, the E protein of the type II secretion system, is also referred to as the main terminal branch of the general secretion pathway. ; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=90.47 E-value=0.16 Score=29.88 Aligned_cols=19 Identities=42% Similarity=0.600 Sum_probs=16.7 Q ss_pred EEEEECCCCCCHHHHHHHH Q ss_conf 9999677844078999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqv 670 (920) +.|+|||=-|||||.|=.. T Consensus 247 IiLVTGPTGSGKtTTLYaa 265 (495) T TIGR02533 247 IILVTGPTGSGKTTTLYAA 265 (495) T ss_pred EEEECCCCCCCHHHHHHHH T ss_conf 1884177898525889999 No 356 >pfam01637 Arch_ATPase Archaeal ATPase. This family contain a conserved P-loop motif that is involved in binding ATP. This family is almost exclusively found in archaebacteria and particularly in Methanococcus jannaschii that encodes sixteen members of this family. Probab=90.28 E-value=1.3 Score=22.97 Aligned_cols=24 Identities=25% Similarity=0.385 Sum_probs=20.2 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 643999967784407899999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval 672 (920) ...+.+|+||=..|||++++.++- T Consensus 19 ~~~~ivi~G~RR~GKTsLi~~~~~ 42 (223) T pfam01637 19 TYPIIVVYGPRRCGKTALLREFLE 42 (223) T ss_pred CCCEEEEECCCCCCHHHHHHHHHH T ss_conf 971899986887879999999998 No 357 >PRK12726 flagellar biosynthesis regulator FlhF; Provisional Probab=90.18 E-value=0.79 Score=24.63 Aligned_cols=89 Identities=17% Similarity=0.274 Sum_probs=46.9 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC---CCCHHHCCCCCCCEEE--EE-EECCCCCCCCCCHHHHHHHHHH Q ss_conf 6643999967784407899999999999997198---5303532068221056--76-5237661138532899999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS---YVPASYAHIGIVDKLF--SR-VGSADNLASGRSTFMVEMIETA 721 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~---fVPA~~a~i~~~D~If--tR-iGa~D~l~~g~STF~vEm~e~~ 721 (920) ..+++.++-|||.+||||.+==.|- .+.+-|- .|.|+..+.+-+|++- .+ +|. .+..+.+ -.++.... T Consensus 204 ~~g~VIaLVGvnGvGKTTTiAKLA~--~l~~~gkkV~LVAaDTFRaAAiEQLk~~g~rlgV--pV~~~~d--pa~l~~av 277 (407) T PRK12726 204 SNHRIISLIGQTGVGKTTTLVKLGW--QLLKQNRTVGFITTDTFRSGAVEQFQGYADKLDV--ELIVATS--PAELEEAV 277 (407) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHH--HHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCC--EEEEECC--HHHHHHHH T ss_conf 3690899989998978999999999--9997799179997066778899999999999796--4998188--89999999 Q ss_pred HHHHHCCCCCEEEEECCCCCC Q ss_conf 999958998569993258898 Q gi|254780750|r 722 SILNQATNQSFVILDEIGRGT 742 (920) Q Consensus 722 ~IL~~at~~SLVllDElGrGT 742 (920) .-+..+...-+||+|=-||-- T Consensus 278 ~~~a~~~~~DvVIIDTAGRl~ 298 (407) T PRK12726 278 QYMTYVNCVDHILIDTVGRNY 298 (407) T ss_pred HHHHHCCCCCEEEEECCCCCC T ss_conf 999862899989996999881 No 358 >PRK04296 thymidine kinase; Provisional Probab=90.17 E-value=1.1 Score=23.70 Aligned_cols=147 Identities=20% Similarity=0.249 Sum_probs=71.6 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHH--CCCCCCHHHCCCCCCCEEEEEEECCC-CCCCCCCHHHHHHHHHHHHHHH Q ss_conf 43999967784407899999999999997--19853035320682210567652376-6113853289999999999995 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQ--MGSYVPASYAHIGIVDKLFSRVGSAD-NLASGRSTFMVEMIETASILNQ 726 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQ--iG~fVPA~~a~i~~~D~IftRiGa~D-~l~~g~STF~vEm~e~~~IL~~ 726 (920) +++-+||||=.|||||-|=+.+-..-.|. +=.|.|+---+.+- +.|-||.|..- .+.-..+ .++.+.-..... T Consensus 2 g~L~~i~GpMfSGKTteLi~~~~~~~~~gkkvl~~kp~~D~Ry~~-~~I~Sh~g~~~~~~~v~~~---~~i~~~~~~~~~ 77 (197) T PRK04296 2 AKLYFIYGAMNSGKSTELLQRAYNYEERGMKVLVFTPAIDDRYGE-GKVVSRIGLSREAILISSD---TDIFELIEEEHE 77 (197) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCC-CCEECCCCCEEEEEEECCH---HHHHHHHHHHHC T ss_conf 559999934278889999999999998799599998534465777-8578678983468994878---999999987630 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCC---CEEEEECC----CHHHHHH---HHHCCCEEEEEEEEEEEC Q ss_conf 8998569993258898805679999999999997269---84999748----7579766---430688589999999609 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNR---CRGLLATH----FHELTDL---SKSLKRFHNATLQVSDSN 796 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~---~~~lfaTH----y~eL~~l---~~~~~~v~n~~~~~~~~~ 796 (920) ...-..|.+||.=-=+. .+ ...+++.+.+... |.+|=.+. |.....| ++....+...+... + T Consensus 78 ~~~~dvI~IDEaQFf~~-~~----i~~~~~~~~~~~~~Viv~GLd~Df~~~~F~~~~~Li~~Ad~V~kl~a~C~~C---g 149 (197) T PRK04296 78 GGKIDCVLIDEAQFLDK-EQ----VVQLAEVVDDLGIPVICYGLDTDFRGEPFEGSPYLLALADKVTELKAICVHC---G 149 (197) T ss_pred CCCCCEEEECCHHCCCH-HH----HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHCCCEEEEEEECCCC---C T ss_conf 47856899720212798-99----9999999983185899976503302486543999997268099974397789---9 Q ss_pred CEEEEEEEEEEC Q ss_conf 927787777447 Q gi|254780750|r 797 EGIIFLHKVIPG 808 (920) Q Consensus 797 ~~i~flykl~~G 808 (920) ..=+|.+|+.+| T Consensus 150 ~~A~fs~R~~~~ 161 (197) T PRK04296 150 RKATMTQRLING 161 (197) T ss_pred CEEEEEEEECCC T ss_conf 931017999489 No 359 >pfam08298 AAA_PrkA PrkA AAA domain. This is a family of PrkA bacterial and archaeal serine kinases approximately 630 residues long. This is the N-terminal AAA domain. Probab=90.11 E-value=0.31 Score=27.67 Aligned_cols=21 Identities=33% Similarity=0.577 Sum_probs=10.8 Q ss_pred CCEEEEEECCCCCCHHHHHHH Q ss_conf 643999967784407899999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQ 669 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRq 669 (920) ..+++++-||+.|||||+.+. T Consensus 84 ~kqIllL~GPVGsGKSsl~e~ 104 (358) T pfam08298 84 RKQILYLLGPVGGGKSSLAER 104 (358) T ss_pred CEEEEEEECCCCCCHHHHHHH T ss_conf 105899977898775899999 No 360 >COG0470 HolB ATPase involved in DNA replication [DNA replication, recombination, and repair] Probab=90.11 E-value=1.4 Score=22.87 Aligned_cols=107 Identities=24% Similarity=0.225 Sum_probs=57.0 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCC-----------CCCCCEEEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 399996778440789999999999999719853035320-----------682210567652376611385328999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAH-----------IGIVDKLFSRVGSADNLASGRSTFMVEMIE 719 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~-----------i~~~D~IftRiGa~D~l~~g~STF~vEm~e 719 (920) ..+++.||.-.||+|...-+| ..++|..|-.... .+-... |..+.++|.=.. + ..+| + T Consensus 25 halL~~Gp~G~Gktt~a~~lA-----~~l~~~~~~~~~~~~~~~~~~~~~~~~~~d-~lel~~s~~~~~--~-i~~~--~ 93 (325) T COG0470 25 HALLFYGPPGVGKTTAALALA-----KELLCENPTGLLPCGHCRSCKLIPAGNHPD-FLELNPSDLRKI--D-IIVE--Q 93 (325) T ss_pred CCEEEECCCCCCHHHHHHHHH-----HHHCCCCCCCCCCCCHHHHHHHHCCCCCCC-EEEECCCCCCCC--C-CHHH--H T ss_conf 610037999997899999999-----996586643345520022444320256886-599773213333--0-0699--9 Q ss_pred HHHHHHHC------CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECCCH Q ss_conf 99999958------9985699932588988056799999999999972-698499974875 Q gi|254780750|r 720 TASILNQA------TNQSFVILDEIGRGTATLDGLSIAWATIEYLHET-NRCRGLLATHFH 773 (920) Q Consensus 720 ~~~IL~~a------t~~SLVllDElGrGTst~DG~aiA~aile~l~~~-~~~~~lfaTHy~ 773 (920) +..+.+.. +..=+||+||.-+-|. |. |-|.+..+.+. ..++.+++||+. T Consensus 94 vr~~~~~~~~~~~~~~~kviiidead~mt~--~A---~nallk~lEep~~~~~~il~~n~~ 149 (325) T COG0470 94 VRELAEFLSESPLEGGYKVVIIDEADKLTE--DA---ANALLKTLEEPPKNTRFILITNDP 149 (325) T ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCCCH--HH---HHHHHHHCCCCCCCEEEEEEECCH T ss_conf 999998604465667726999732032698--88---876754332488871699974985 No 361 >pfam00265 TK Thymidine kinase. Probab=90.08 E-value=0.6 Score=25.52 Aligned_cols=142 Identities=17% Similarity=0.169 Sum_probs=70.1 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCC--CCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 4399996778440789999999999999719--85303532068221056765237661138532899999999999958 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMG--SYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG--~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a 727 (920) +++-+|+||=-|||||-|=+.+-..-.++-- .+-|+.--+.+- +.|-||.|..=. ..........+...++ . T Consensus 1 G~L~~i~GpMfsGKTteLi~~~~~~~~~gkkvl~i~p~~D~R~~~-~~i~Sh~g~~~~---~~~~~~~~~~~~~~~~--~ 74 (175) T pfam00265 1 GSIELIIGPMFSGKSTELIRRVYRYQLAQYKCVVIKYAIDNRYGT-GKVVTHDGISME---AKLIKTSLLDDIKDII--S 74 (175) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCC-CEEECCCCCCCC---CEECCCCHHHHHHHHH--C T ss_conf 949999925177899999999999998799399994611277899-969889998114---5673653199999864--3 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEE---CC-----CHHHHHH---HHHCCCEEEEEEEEEEEC Q ss_conf 998569993258898805679999999999997269849997---48-----7579766---430688589999999609 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLA---TH-----FHELTDL---SKSLKRFHNATLQVSDSN 796 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfa---TH-----y~eL~~l---~~~~~~v~n~~~~~~~~~ 796 (920) .+-..|++||.---+ . ...+++.+.+. +-.++.+ +- |.+...| ++....+.-.++.. + T Consensus 75 ~~~dvI~IDEaQFf~----~---l~~~~~~~~~~-~~~Viv~GLd~D~~~~~F~~i~~Li~~Ad~V~kl~a~C~~C---g 143 (175) T pfam00265 75 NTIHVVGIDEAQFFD----D---IVEFCETMANL-GKIVIVAGLDGDFQRKPFGNILNLLPLAEKVTKLKAVCMKC---Y 143 (175) T ss_pred CCCCEEEEEHHHHHH----H---HHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHCCEEEEEEEEECCC---C T ss_conf 687899983377526----4---89999999967-99499987503011585447999996279699980493899---9 Q ss_pred CEEEEEEEEEEC Q ss_conf 927787777447 Q gi|254780750|r 797 EGIIFLHKVIPG 808 (920) Q Consensus 797 ~~i~flykl~~G 808 (920) ..-+|.+|+..+ T Consensus 144 ~~A~~t~R~~~~ 155 (175) T pfam00265 144 KDASFTKRLNNE 155 (175) T ss_pred CEEEEEEEECCC T ss_conf 961068988599 No 362 >COG4778 PhnL ABC-type phosphonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=90.05 E-value=0.4 Score=26.86 Aligned_cols=33 Identities=39% Similarity=0.580 Sum_probs=25.4 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 4434563587777664399996778440789999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) |-++++++. +.+-+.++.||-.+||||+||+.= T Consensus 26 V~~~vslsV----~aGECvvL~G~SG~GKStllr~LY 58 (235) T COG4778 26 VLRNVSLSV----NAGECVVLHGPSGSGKSTLLRSLY 58 (235) T ss_pred EEECEEEEE----CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 340415776----375079966898876889999998 No 363 >KOG1970 consensus Probab=89.97 E-value=0.27 Score=28.11 Aligned_cols=25 Identities=24% Similarity=0.398 Sum_probs=13.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 7844078999999999999971985 Q gi|254780750|r 658 PNMGGKSTFLRQNALIVIMAQMGSY 682 (920) Q Consensus 658 pNmgGKSt~lRqval~vilAQiG~f 682 (920) |--||++.++|..+-+.+.--||=. T Consensus 516 p~ig~~~~avr~~~gi~~~~di~d~ 540 (634) T KOG1970 516 PRIGLLTVAVRNCAGISFINDIGDL 540 (634) T ss_pred CCCCCHHHHHHCCCCCHHHHHCCCC T ss_conf 0014304655415530133303656 No 364 >KOG0996 consensus Probab=89.87 E-value=1.4 Score=22.74 Aligned_cols=13 Identities=15% Similarity=0.376 Sum_probs=7.9 Q ss_pred CCEEEEEEEEECC Q ss_conf 8455458999775 Q gi|254780750|r 124 SLVRRNVVRLVTP 136 (920) Q Consensus 124 ~~v~R~Vt~IiTp 136 (920) |+|.-..|.||-| T Consensus 104 GPFHksFtaIvGP 116 (1293) T KOG0996 104 GPFHKSFTAIVGP 116 (1293) T ss_pred CCCCCCCEEEECC T ss_conf 6778773156789 No 365 >COG1116 TauB ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=89.81 E-value=0.37 Score=27.08 Aligned_cols=127 Identities=29% Similarity=0.296 Sum_probs=63.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC------C----------CCCHHHCCCCCCCEEEE Q ss_conf 4434563587777664399996778440789999999999999719------8----------53035320682210567 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG------S----------YVPASYAHIGIVDKLFS 698 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG------~----------fVPA~~a~i~~~D~Ift 698 (920) |=.|++|.. ..+.|.-|-||--+||||+||-+|=+.- .+.| - ||+=+.+-+|. T Consensus 18 vl~~i~L~v----~~GEfvsilGpSGcGKSTLLriiAGL~~-p~~G~V~~~g~~v~~p~~~~~~vFQ~~~LlPW------ 86 (248) T COG1116 18 VLEDINLSV----EKGEFVAILGPSGCGKSTLLRLIAGLEK-PTSGEVLLDGRPVTGPGPDIGYVFQEDALLPW------ 86 (248) T ss_pred EECCCEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCC-CCCCEEEECCCCCCCCCCCEEEEECCCCCCCH------ T ss_conf 760503587----7997999989997889999999968787-77755998882157899877999266764514------ Q ss_pred EEECCCCCCC-----------------------CCCHHH--------HHHHHHHHHHH-HCCCCCEEEEECCCCCCCHHH Q ss_conf 6523766113-----------------------853289--------99999999999-589985699932588988056 Q gi|254780750|r 699 RVGSADNLAS-----------------------GRSTFM--------VEMIETASILN-QATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 699 RiGa~D~l~~-----------------------g~STF~--------vEm~e~~~IL~-~at~~SLVllDElGrGTst~D 746 (920) .-+.||+.- |.+-|. -=|.+=..|.| -+++--+.||||-.--=+..- T Consensus 87 -~Tv~~NV~l~l~~~~~~~~e~~~~a~~~L~~VgL~~~~~~~P~qLSGGMrQRVaiARAL~~~P~lLLlDEPFgALDalT 165 (248) T COG1116 87 -LTVLDNVALGLELRGKSKAEARERAKELLELVGLAGFEDKYPHQLSGGMRQRVAIARALATRPKLLLLDEPFGALDALT 165 (248) T ss_pred -HHHHHHHEEHHHCCCCCHHHHHHHHHHHHHHCCCCCHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHH T ss_conf -6688443504412565617689999999997597421016960018479999999999714999798769741201999 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 7999999999999726984999748757 Q gi|254780750|r 747 GLSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 747 G~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) -..+ +--+..|++..+..++|.||.-+ T Consensus 166 R~~l-q~~l~~lw~~~~~TvllVTHdi~ 192 (248) T COG1116 166 REEL-QDELLRLWEETRKTVLLVTHDVD 192 (248) T ss_pred HHHH-HHHHHHHHHHHCCEEEEEECCHH T ss_conf 9999-99999999964988999908989 No 366 >COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Probab=89.78 E-value=0.32 Score=27.58 Aligned_cols=17 Identities=24% Similarity=0.280 Sum_probs=8.6 Q ss_pred EEECCCCCHHHHHHHHH Q ss_conf 40000468058876310 Q gi|254780750|r 612 FIVKDGRHPIVEKTLKQ 628 (920) Q Consensus 612 l~i~~gRHPviE~~l~~ 628 (920) +.+...++.-||..++. T Consensus 525 i~v~~~y~~Aie~alG~ 541 (1163) T COG1196 525 IKVKEKYETALEAALGN 541 (1163) T ss_pred CCCCHHHHHHHHHHHCC T ss_conf 26065799999999563 No 367 >TIGR01194 cyc_pep_trnsptr cyclic peptide transporter; InterPro: IPR005898 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Bacteria have elaborate pathways for the production of toxins and secondary metabolites. Many such compounds, including syringomycin and pyoverdine are synthesized on non-ribosomal templates consisting of a multienzyme complex. On several occasions the proteins of the complex and transporter protein are present on the same operon. Often times these compounds cross the biological membrane by specific transporters. This model describes a family of cyclic peptide transporters in bacteria. Syringomycin is an amphipathic, cyclic lipodepsipeptide when inserted into host causes formation of channels, permeable to a variety of cations. On the other hand, pyoverdine is a cyclic octa-peptidyl dihydroxyquinoline, which is efficient in sequestering iron for uptake.; GO: 0005524 ATP binding, 0015197 peptide transporter activity, 0015833 peptide transport, 0016021 integral to membrane. Probab=89.73 E-value=0.39 Score=26.88 Aligned_cols=58 Identities=33% Similarity=0.470 Sum_probs=36.7 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCEE--EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 1400004680588763102877044--434563587777664399996778440789999999999999719853035 Q gi|254780750|r 611 NFIVKDGRHPIVEKTLKQQSSKPFI--ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS 686 (920) Q Consensus 611 ~l~i~~gRHPviE~~l~~~~~~~fV--pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~ 686 (920) .+++++-|--- -..++..+|- |-|..+ .++.+.-|-|-|.+||||+.|-. +|.|.|-+ T Consensus 337 S~~L~~V~~~~----~~~~~S~~F~LGPI~L~I------~~G~~VyIVG~NGCGK~TL~K~l--------~GLY~PQ~ 396 (555) T TIGR01194 337 SLELKDVRYSY----KDVESSEGFALGPIDLRI------ASGDLVYIVGENGCGKSTLIKLL--------LGLYIPQE 396 (555) T ss_pred CEEEECEECCC----CCCCCCCCCCCCCCCCEE------ECCCEEEEECCCCCCHHHHHHHH--------HHCCCCCC T ss_conf 20210000478----888888884217611257------43528999648897389999999--------72587876 No 368 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=89.65 E-value=0.22 Score=28.81 Aligned_cols=65 Identities=28% Similarity=0.272 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHHHHHC-C---------CCCEECCCCCCEEECCCCCH--HHHHHHHHCCCCCEEEEEEEECCCCCCCC Q ss_conf 998677887788877525-8---------85121047871400004680--58876310287704443456358777766 Q gi|254780750|r 582 VIAIIDISIALAILAKEQ-N---------YCRPIIDNSTNFIVKDGRHP--IVEKTLKQQSSKPFIANDCDLSCPNDKNS 649 (920) Q Consensus 582 ~ia~lD~l~SlA~~a~~~-~---------y~rP~i~~~~~l~i~~gRHP--viE~~l~~~~~~~fVpNdi~l~~~~~~~~ 649 (920) .=.++||.++++.+++=. | .|.=. ..+....+++.+=| |+..+.. .. T Consensus 68 ~~~E~Dfs~~~~~~~RfRvN~f~QRg~~a~vlR~-ip~~Ip~fe~LGLP~~v~~~~a~--------------------~~ 126 (350) T TIGR01420 68 ENGELDFSFSLPGLARFRVNAFKQRGGVALVLRL-IPSKIPTFEELGLPRPVLRELAE--------------------RP 126 (350) T ss_pred HCCCCCEEEEECCCCEEEEHHHHHCCHHHHHHHH-CCCCCCCHHHCCCCHHHHHHHHH--------------------CC T ss_conf 5066444663067322122032350006423231-15346216663798789999983--------------------66 Q ss_pred CEEEEEECCCCCCHHHHH Q ss_conf 439999677844078999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFL 667 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~l 667 (920) .-+.|+|||=-|||||.| T Consensus 127 ~GLiLVTGPTGSGKSTTl 144 (350) T TIGR01420 127 RGLILVTGPTGSGKSTTL 144 (350) T ss_pred CCCEEEECCCCCCHHHHH T ss_conf 993898768898678999 No 369 >PRK06851 hypothetical protein; Provisional Probab=89.65 E-value=0.42 Score=26.70 Aligned_cols=54 Identities=19% Similarity=0.142 Sum_probs=35.2 Q ss_pred CCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 877044434563587777664399996778440789999999999999719853035320682 Q gi|254780750|r 630 SSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 630 ~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) +...||+|=+. +-.+..+|.|.--+||||+|+.++-.+.- -|..|=.=.|-+-| T Consensus 202 G~v~~i~~l~~-------~~~~~y~ikG~pGtGKstlL~~i~~~A~~--~G~dvevyhc~fdP 255 (368) T PRK06851 202 GAVDHVPSLTE-------GVKNRYFLKGRPGTGKSTMLKKVAKAAEE--RGLDVEVYHCPFDP 255 (368) T ss_pred CHHHHHHHHHH-------CCCEEEEECCCCCCCHHHHHHHHHHHHHH--CCCEEEEEECCCCC T ss_conf 64514787860-------67869998189998779999999999998--59828998079898 No 370 >PRK10787 DNA-binding ATP-dependent protease La; Provisional Probab=89.64 E-value=0.46 Score=26.41 Aligned_cols=60 Identities=17% Similarity=0.354 Sum_probs=30.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEE--CCCCCCHHHHHHHHC Q ss_conf 567999999999999726984999748757976643068858999999960992778777744--789887789999982 Q gi|254780750|r 745 LDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIP--GIADHSYGIQVGKLA 822 (920) Q Consensus 745 ~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~--G~~~~Sygi~vA~la 822 (920) .-|++++.|++-.+..+ .-+ + +..|. ++|+..=++.| |+-.+ +-.|++| T Consensus 679 SAGit~~tal~S~~~~~-~v~-----------------~---~~amT-----GEitL~G~VlpiGG~keK---~laA~r~ 729 (784) T PRK10787 679 SAGIAMCTALVSCLTGN-PVR-----------------A---DVAMT-----GEITLRGQVLPIGGLKEK---LLAAHRG 729 (784) T ss_pred HHHHHHHHHHHHHHHCC-CCC-----------------C---CCCEE-----EEEEECEEEEEECCHHHH---HHHHHHC T ss_conf 28999999999998699-989-----------------9---96555-----656602027820789999---9999984 Q ss_pred CCCHHHHHHHH Q ss_conf 99989999999 Q gi|254780750|r 823 GLPNTVISRAY 833 (920) Q Consensus 823 G~p~~vi~~A~ 833 (920) |+-.-+|-.+. T Consensus 730 gi~~vi~P~~N 740 (784) T PRK10787 730 GIKTVLIPFEN 740 (784) T ss_pred CCCEEEECHHH T ss_conf 99899945212 No 371 >TIGR00929 VirB4_CagE type IV secretion/conjugal transfer ATPase, VirB4 family; InterPro: IPR004346 This family includes the Helicobacter pylori protein CagE (see examples), which together with other proteins from the cag pathogenicity island (PAI), encodes a type IV transporter secretion system. The precise role of CagE is not known, but studies in animal models have shown that it is essential for pathogenesis in Helicobacter pylori induced gastritis and peptic ulceration . Indeed, the expression of the cag PAI has been shown to be essential for stimulating human gastric epithelial cell apoptosis in vitro . Similar type IV transport systems are also found in other bacteria. This family includes proteins from the trb and Vir conjugal transfer systems in Agrobacterium tumefaciens and homologues of VirB proteins from other species.; GO: 0005524 ATP binding. Probab=89.64 E-value=0.26 Score=28.18 Aligned_cols=184 Identities=17% Similarity=0.227 Sum_probs=92.3 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCE-----EEEEEECCC--CCC-------CCCCHHH--HH Q ss_conf 9996778440789999999999999719853035320682210-----567652376--611-------3853289--99 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDK-----LFSRVGSAD--NLA-------SGRSTFM--VE 716 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~-----IftRiGa~D--~l~-------~g~STF~--vE 716 (920) -+|.||.-+||||+|- -+|||++=|.|+..+++=.||| ||+|-..+. -|. .|+.+=+ -. T Consensus 519 T~IfG~~G~GKTtLl~-----fL~a~~~ky~~~~a~~~~~fDkd~g~~~~~~a~gG~Y~~i~G~~T~~~~g~~~~~NPl~ 593 (931) T TIGR00929 519 TLIFGPTGSGKTTLLN-----FLLAQLQKYKPNFALTIFAFDKDRGMEIFIRALGGKYLEIKGDATVAADGEPFGFNPLQ 593 (931) T ss_pred EEEECCCCCCHHHHHH-----HHHHHHHCCCCCCCEEEEEEECCCCCHHHHEEECCEEEEECCCEECCCCCCCCCCCCHH T ss_conf 7788888984699999-----99999742488987069998878982104111174565303301016788866568020 Q ss_pred HHHHHHHHHHCCCCCE------EEEECCCCC---CCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEE Q ss_conf 9999999995899856------999325889---8805679999999999997269849997487579766430688589 Q gi|254780750|r 717 MIETASILNQATNQSF------VILDEIGRG---TATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHN 787 (920) Q Consensus 717 m~e~~~IL~~at~~SL------VllDElGrG---Tst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n 787 (920) |..++.+...++.+++ .|...=|++ ++..+--+|+.||.--. +.. ..- --+...|.++...++.-.+ T Consensus 594 ~~~g~~l~~t~~n~~F~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~-~~~--~~~~r~l~~l~~~l~~~~~ 669 (931) T TIGR00929 594 GIDGEALEDTERNRAFLKEWLAMLVTLDGETDEEISAEDRNALSSAIDSLY-EED-QKE--HPAIRSLSDLLNFLPKDSI 669 (931) T ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHH-HCC-CCC--CCCCCCHHHHHHHHCCCCC T ss_conf 123367777724407999999999851276654367899999999988887-314-411--2552208999997257753 Q ss_pred EEEEEE----------EECCEEE---EEEEEEE-CCC------C-CCHHHHHHHHCCCC---HHHHHHHHHHHHHHHHHH Q ss_conf 999999----------6099277---8777744-789------8-87789999982999---899999999999998763 Q gi|254780750|r 788 ATLQVS----------DSNEGII---FLHKVIP-GIA------D-HSYGIQVGKLAGLP---NTVISRAYDILKTFEKLY 843 (920) Q Consensus 788 ~~~~~~----------~~~~~i~---flykl~~-G~~------~-~Sygi~vA~laG~p---~~vi~~A~~~~~~le~~~ 843 (920) -.|+.. -.++..- -++.|-+ +.. . +-+|+++-.+.--| +...--+--++.+.++.. T Consensus 670 ~~~~~~~~l~~~L~~w~~~~~~G~~Gef~wLFD~~~~D~Ldl~~~~~~gfd~~~ll~~~e~~~~~~p~~~YLf~Ri~~~~ 749 (931) T TIGR00929 670 EVVEARNSLADRLERWLKGGDYGGDGEFQWLFDGNETDSLDLSKADIIGFDLTELLDNPELEKVATPVLLYLFHRIEEAL 749 (931) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEEEHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 00045023899988751789875552032110689754123789714787554642476866789999999999999972 Q ss_pred HH Q ss_conf 00 Q gi|254780750|r 844 HH 845 (920) Q Consensus 844 ~~ 845 (920) .. T Consensus 750 dg 751 (931) T TIGR00929 750 DG 751 (931) T ss_pred CC T ss_conf 41 No 372 >PRK05703 flhF flagellar biosynthesis regulator FlhF; Validated Probab=89.63 E-value=1.5 Score=22.61 Aligned_cols=165 Identities=23% Similarity=0.325 Sum_probs=87.7 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHH----CCCCCCHHHCCCCCCCEE--EEEEECCCCCCCCCCHH-HHHHHHH Q ss_conf 6643999967784407899999999999997----198530353206822105--67652376611385328-9999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ----MGSYVPASYAHIGIVDKL--FSRVGSADNLASGRSTF-MVEMIET 720 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQ----iG~fVPA~~a~i~~~D~I--ftRiGa~D~l~~g~STF-~vEm~e~ 720 (920) ..+++..+-||=..||||.+==+|-...+.+ +| +|-++.-+++-++++ |++|= |..-+ .-.-.|. T Consensus 208 ~~~~vvalVGPTGVGKTTTiAKLAA~~~l~~~~~kV~-lIT~DtyRigA~eQLk~Ya~il-------gvp~~v~~~~~~l 279 (412) T PRK05703 208 EQGGVVALVGPTGVGKTTTLAKLAARYALEEGKDKVA-LITLDTYRIGAVEQLKTYAKIM-------GIPVKVAYDPKEL 279 (412) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEE-EEEECCCCHHHHHHHHHHHHHC-------CCEEEEECCHHHH T ss_conf 5673699988888756769999999999972998179-9983767777999999999971-------9737984799999 Q ss_pred HHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCC----CEEEE-ECCCHHHHHHHHHCCCEEEEEEEEEEE Q ss_conf 9999958998569993258898805679999999999997269----84999-748757976643068858999999960 Q gi|254780750|r 721 ASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNR----CRGLL-ATHFHELTDLSKSLKRFHNATLQVSDS 795 (920) Q Consensus 721 ~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~----~~~lf-aTHy~eL~~l~~~~~~v~n~~~~~~~~ 795 (920) ..+|+......|||+|=-||. ++|..-+. -+..|.+... .++|= +|++.+|.+....+..+.. T Consensus 280 ~~al~~~~~~dlILIDTaG~s--~~d~~~~~--eL~~~~~~~~~~~~~LVlsat~~~~dl~~i~~~f~~~~~-------- 347 (412) T PRK05703 280 AKALEQLANCDLILIDTAGRS--QRDPRLIS--ELKALIENSKPIDVYLVLSATTKYRDLKDIVKHFSRLPL-------- 347 (412) T ss_pred HHHHHHHCCCCEEEEECCCCC--CCCHHHHH--HHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHCCCCC-------- T ss_conf 999987158997999689889--78999999--999998624887189997598998999999998467999-------- Q ss_pred CCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 9927787777447898877899999829998999999999 Q gi|254780750|r 796 NEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDI 835 (920) Q Consensus 796 ~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~ 835 (920) +.++|+ ||=+=.+-- --++++...|+|=.-+-.-+.| T Consensus 348 -~~lI~T-KlDEt~~~G-~il~~~~~~~lplsy~t~GQ~V 384 (412) T PRK05703 348 -DGLILT-KLDETSSLG-SILSLLIESGLPISYLTDGQRV 384 (412) T ss_pred -CEEEEE-EECCCCCCC-HHHHHHHHHCCCEEEEECCCCC T ss_conf -879997-112899862-9999999988796999469997 No 373 >PRK00411 cdc6 cell division control protein 6; Reviewed Probab=89.54 E-value=0.69 Score=25.07 Aligned_cols=121 Identities=16% Similarity=0.114 Sum_probs=57.3 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHC----CCCCCHHHC--CCCCCCEEEEEEECCCCC-CCCCCHHHHHHHHHHHHHH Q ss_conf 999677844078999999999999971----985303532--068221056765237661-1385328999999999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQM----GSYVPASYA--HIGIVDKLFSRVGSADNL-ASGRSTFMVEMIETASILN 725 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQi----G~fVPA~~a--~i~~~D~IftRiGa~D~l-~~g~STF~vEm~e~~~IL~ 725 (920) ++|.||-..|||+..|.|.=-. -.+. =.||-|... ...++-+|+..++. .++ ..|.|+-.+ +...-..|. T Consensus 58 ~~I~G~pGTGKT~~vk~v~~~l-~~~~~~~~~vyINc~~~~t~~~i~~~i~~~L~~-~~~p~~G~s~~~~-~~~l~~~l~ 134 (394) T PRK00411 58 VLILGPPGTGKTTTVKKVFEEL-EEAALKVVYVYINCQIDRTRYAILSEIARSLFG-HPPPSSGLSFDEL-FDKIAEYLD 134 (394) T ss_pred EEEECCCCCCHHHHHHHHHHHH-HHHCCCCEEEEEECCCCCCHHHHHHHHHHHHCC-CCCCCCCCCHHHH-HHHHHHHHH T ss_conf 7998899998999999999999-974689659999696689899999999999569-9898778789999-999999861 Q ss_pred HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHH--HHHCCCEEEEECCCHHHHH Q ss_conf 589985699932588988056799999999999--9726984999748757976 Q gi|254780750|r 726 QATNQSFVILDEIGRGTATLDGLSIAWATIEYL--HETNRCRGLLATHFHELTD 777 (920) Q Consensus 726 ~at~~SLVllDElGrGTst~DG~aiA~aile~l--~~~~~~~~lfaTHy~eL~~ 777 (920) .-...-+|+|||+-.=.+ .+|--+-|...+-- +...++-.+..++--.+.+ T Consensus 135 ~~~~~~ivvLDEiD~L~~-~~~~~vLY~L~r~~~~~~~~~~~vI~IsN~~~~~~ 187 (394) T PRK00411 135 ERDRVLIVALDDINYLVE-KEGNDVLYSLLRAHEEYPGARIGVIGISSDLTFLY 187 (394) T ss_pred CCCCEEEEEEECCCCCCC-CCCCHHHHHHHHCCCCCCCCCEEEEEEECCCCHHH T ss_conf 669758999965540203-66508999998540226887389999976871776 No 374 >COG1101 PhnK ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=89.52 E-value=0.32 Score=27.59 Aligned_cols=128 Identities=27% Similarity=0.332 Sum_probs=64.8 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH--HC---CCCC---C-HHHCCCCCCCEEE-------- Q ss_conf 4434563587777664399996778440789999999999999--71---9853---0-3532068221056-------- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMA--QM---GSYV---P-ASYAHIGIVDKLF-------- 697 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilA--Qi---G~fV---P-A~~a~i~~~D~If-------- 697 (920) +=|.++|.. ..+.|..|-|.|-+||||++..++=-+..- || |--| | ++.| ..+-|+| T Consensus 21 ~l~~~sL~I----~~g~FvtViGsNGAGKSTlln~iaG~l~~t~G~I~Idg~dVtk~~~~~RA--~~larVfQdp~~gt~ 94 (263) T COG1101 21 ALNGLSLEI----AEGDFVTVIGSNGAGKSTLLNAIAGDLKPTSGQILIDGVDVTKKSVAKRA--NLLARVFQDPLAGTA 94 (263) T ss_pred HHHCCCEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHH--HHHHHHHCCHHHCCC T ss_conf 872375122----47846999767986388888886075036776599876443437788875--678987136520776 Q ss_pred EEEECCCCC-----------------CCCCCHHHHHHHHH-----------------------HHHHHHCCCCCEEEEEC Q ss_conf 765237661-----------------13853289999999-----------------------99999589985699932 Q gi|254780750|r 698 SRVGSADNL-----------------ASGRSTFMVEMIET-----------------------ASILNQATNQSFVILDE 737 (920) Q Consensus 698 tRiGa~D~l-----------------~~g~STF~vEm~e~-----------------------~~IL~~at~~SLVllDE 737 (920) +++--..|+ .+-.+.|+-++.++ +-++-.+.+--+.++|| T Consensus 95 ~~lTieENl~la~~Rg~~rgl~~~ln~~~~~~f~~~l~~l~lgLenrL~~~iglLSGGQRQalsL~MAtl~~pkiLLLDE 174 (263) T COG1101 95 PELTIEENLALAESRGKKRGLSSALNERRRSSFRERLARLGLGLENRLSDRIGLLSGGQRQALSLLMATLHPPKILLLDE 174 (263) T ss_pred CCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHHCCCHHHHHHHHHHHHCCCCCEEEECC T ss_conf 42519988999986076556335452778999999986615323555247133236518999999998547984887601 Q ss_pred CCCCCCHHH-HHH--HHHHHHHHHHHHCCCEEEEECCC Q ss_conf 588988056-799--99999999997269849997487 Q gi|254780750|r 738 IGRGTATLD-GLS--IAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 738 lGrGTst~D-G~a--iA~aile~l~~~~~~~~lfaTHy 772 (920) = |...| +.| +-..+ +.+++.-+-.++-.||- T Consensus 175 H---TAALDPkta~~vm~lT-~kiV~~~klTtlMVTHn 208 (263) T COG1101 175 H---TAALDPKTAEFVMELT-AKIVEEHKLTTLMVTHN 208 (263) T ss_pred H---HHCCCCCHHHHHHHHH-HHHHHHCCCCEEEEECC T ss_conf 1---2117930679999999-99998569706887415 No 375 >TIGR01188 drrA daunorubicin resistance ABC transporter, ATP-binding protein; InterPro: IPR005894 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains the daunorubicin resistance ABC transporter, ATP binding subunit in bacteria and archaea. This model is restricted in its scope to preferentially recognize the ATP binding subunit associated with effux of the drug, daunorubicin. In other words it functions as an ATP dependent antiporter. . Probab=89.43 E-value=0.38 Score=27.04 Aligned_cols=124 Identities=31% Similarity=0.439 Sum_probs=67.1 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHH-------CCCCC---CH--H-HCCCCCC------CEEEEEEECCCCCC- Q ss_conf 6643999967784407899999999999997-------19853---03--5-3206822------10567652376611- Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-------MGSYV---PA--S-YAHIGIV------DKLFSRVGSADNLA- 707 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQ-------iG~fV---PA--~-~a~i~~~------D~IftRiGa~D~l~- 707 (920) ..+.++=+=|||-+||||.+|..+ +++== .|.=| |+ + .=+||+| |..+| |..||. T Consensus 19 ~~G~vfGfLGPNGAGKTTti~mLt--Tll~P~sG~A~V~GYDvvreP~kl~VRr~IG~v~Q~~s~D~~LT---g~ENl~m 93 (343) T TIGR01188 19 REGEVFGFLGPNGAGKTTTIRMLT--TLLKPTSGTARVAGYDVVREPRKLKVRRRIGIVPQYASVDEDLT---GRENLVM 93 (343) T ss_pred CCCEEEEEECCCCCCHHHHHHHHC--CCCCCCCCEEEEEEEECCCCHHHCCCCEEEEEECCCCCCCCCCC---HHHHHHH T ss_conf 062489976879985133563410--25579987689983210236304032113204468555564577---4754445 Q ss_pred ----CCCCHHH-----HHHHHHHHHHHHCCC----------------CC------EEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf ----3853289-----999999999995899----------------85------6999325889880567999999999 Q gi|254780750|r 708 ----SGRSTFM-----VEMIETASILNQATN----------------QS------FVILDEIGRGTATLDGLSIAWATIE 756 (920) Q Consensus 708 ----~g~STF~-----vEm~e~~~IL~~at~----------------~S------LVllDElGrGTst~DG~aiA~aile 756 (920) .|.|-=. .|+.|.=.+-..|.+ .| +.-|||-=-|=+|.---+| |-+++ T Consensus 94 ~g~LyGlp~~~~~~Ra~ELLe~~~L~~aa~r~V~tySGGMrRRL~iA~sli~~P~vLFLDEPT~GLDP~tR~~i-Wd~i~ 172 (343) T TIGR01188 94 MGRLYGLPKKEAEERAEELLELFELTEAADRKVKTYSGGMRRRLEIAASLIHQPEVLFLDEPTTGLDPRTRRAI-WDYIE 172 (343) T ss_pred HHHHCCCCHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCEEHHHHHHHHHCCCCEEECCCCCCCCCHHHHHHH-HHHHH T ss_conf 33334896899998888876230014662543322677112144543111038825651488768888669999-99999 Q ss_pred HHHHHCCCEEEEECCCHHHHH Q ss_conf 999726984999748757976 Q gi|254780750|r 757 YLHETNRCRGLLATHFHELTD 777 (920) Q Consensus 757 ~l~~~~~~~~lfaTHy~eL~~ 777 (920) .|-+..+--+|.+|||=|=++ T Consensus 173 ~lk~~~g~TilLTThYmeEAd 193 (343) T TIGR01188 173 ALKKEEGVTILLTTHYMEEAD 193 (343) T ss_pred HHHHCCCCEEEEECCCHHHHH T ss_conf 987407969997437869998 No 376 >TIGR00611 recf DNA replication and repair protein RecF; InterPro: IPR001238 All proteins in this family, including recF, for which functions are known are DNA binding proteins that assist the filamentation of RecA onto DNA for the initiation of recombination or recombinational repair. RecF is involved in DNA metabolism and is required for recombinational DNA repair and for induction of the SOS response , . ; GO: 0003697 single-stranded DNA binding, 0005524 ATP binding, 0006281 DNA repair. Probab=89.31 E-value=0.29 Score=27.85 Aligned_cols=25 Identities=20% Similarity=0.037 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHH--HHHHHHHHHCC Q ss_conf 98888999867788--77888775258 Q gi|254780750|r 576 LDNASQVIAIIDIS--IALAILAKEQN 600 (920) Q Consensus 576 l~~~~~~ia~lD~l--~SlA~~a~~~~ 600 (920) +.-+=+..||||-. -.||..+...+ T Consensus 339 ilLlDDv~SELD~~Rr~~L~~~~~~~~ 365 (399) T TIGR00611 339 ILLLDDVASELDDQRRRALAETLQSLG 365 (399) T ss_pred EEEECCHHHCCHHHHHHHHHHHHHHCC T ss_conf 264425232260789999999997169 No 377 >cd03115 SRP The signal recognition particle (SRP) mediates the transport to or across the plasma membrane in bacteria and the endoplasmic reticulum in eukaryotes. SRP recognizes N-terminal sighnal sequences of newly synthesized polypeptides at the ribosome. The SRP-polypeptide complex is then targeted to the membrane by an interaction between SRP and its cognated receptor (SR). In mammals, SRP consists of six protein subunits and a 7SL RNA. One of these subunits is a 54 kd protein (SRP54), which is a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 is a multidomain protein that consists of an N-terminal domain, followed by a central G (GTPase) domain and a C-terminal M domain. Probab=89.08 E-value=1.6 Score=22.33 Aligned_cols=154 Identities=18% Similarity=0.198 Sum_probs=67.9 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCC---CCCHHHCCCCCCCEE--EEEEECCCCCC--CCCCHHHHHHHHH-HHH Q ss_conf 999967784407899999999999997198---530353206822105--67652376611--3853289999999-999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGS---YVPASYAHIGIVDKL--FSRVGSADNLA--SGRSTFMVEMIET-ASI 723 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~---fVPA~~a~i~~~D~I--ftRiGa~D~l~--~g~STF~vEm~e~-~~I 723 (920) +.++-|||-+||||.+=-.|.-. .+-|- +|.|+..+++=+|++ |+++-.- .+. ...+ ++.+. ... T Consensus 2 Vi~lvGptGvGKTTTiaKLA~~~--~~~~~kV~lit~Dt~R~gA~eQL~~~a~~l~v-~~~~~~~~~----~~~~~~~~~ 74 (173) T cd03115 2 VILLVGLQGVGKTTTAAKLALYL--KKKGKKVLLVAADTYRPAAIEQLRVLGEQVGV-PVFEEGEGK----DPVSIAKRA 74 (173) T ss_pred EEEEECCCCCCHHHHHHHHHHHH--HHCCCEEEEEECCCCCHHHHHHHHHHHHHCCC-EEEECCCCC----CHHHHHHHH T ss_conf 99998999998899999999999--97699289997488757799999999997498-599227755----879999999 Q ss_pred HHH--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC----EEEEECCCHHHHHHHHHCCCEEEEEEEEEEECC Q ss_conf 995--89985699932588988056799999999999972698----499974875797664306885899999996099 Q gi|254780750|r 724 LNQ--ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRC----RGLLATHFHELTDLSKSLKRFHNATLQVSDSNE 797 (920) Q Consensus 724 L~~--at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~----~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~ 797 (920) +.. .....+||+|=-||. .+|..-+ .=++.+.+..++ +++-+|.=++..+....+....+ -+ T Consensus 75 ~~~~~~~~~D~IlIDTaGr~--~~d~~~~--~el~~l~~~~~p~~~~LVl~a~~~~~~~~~~~~f~~~~~--------~~ 142 (173) T cd03115 75 IEHAREENFDVVIVDTAGRL--QIDENLM--EELKKIKRVVKPDEVLLVVDAMTGQDAVNQAKAFNEALG--------IT 142 (173) T ss_pred HHHHHHCCCCEEEEECCCCC--CCCHHHH--HHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHHHHHCCC--------CC T ss_conf 99987568998999788878--7999999--999999864489721574246550658999999874279--------97 Q ss_pred EEEEEEEEEECCCCCCHHHHHHHHCCCCH Q ss_conf 27787777447898877899999829998 Q gi|254780750|r 798 GIIFLHKVIPGIADHSYGIQVGKLAGLPN 826 (920) Q Consensus 798 ~i~flykl~~G~~~~Sygi~vA~laG~p~ 826 (920) .++|. ||=+ .+.---.++++...|+|= T Consensus 143 ~~I~T-KlDe-t~~~G~~ls~~~~~~~Pi 169 (173) T cd03115 143 GVILT-KLDG-DARGGAALSIRAVTGKPI 169 (173) T ss_pred EEEEE-EECC-CCCCCHHHHHHHHHCCCE T ss_conf 89997-1438-997579999999989090 No 378 >COG0466 Lon ATP-dependent Lon protease, bacterial type [Posttranslational modification, protein turnover, chaperones] Probab=89.04 E-value=0.73 Score=24.86 Aligned_cols=103 Identities=23% Similarity=0.368 Sum_probs=41.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEE---ECCC--CCCCCCCHHHHHHHH-HHHHH Q ss_conf 39999677844078999999999999971985303532068221056765---2376--611385328999999-99999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRV---GSAD--NLASGRSTFMVEMIE-TASIL 724 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRi---Ga~D--~l~~g~STF~vEm~e-~~~IL 724 (920) .++.+.||-.-|||++-|++|=. ..|=|.|| |-.| .|-..+=|+---|-- .-.-+ T Consensus 351 pILcLVGPPGVGKTSLgkSIA~a-------------------l~RkfvR~sLGGvrDEAEIRGHRRTYIGaMPGrIiQ~m 411 (782) T COG0466 351 PILCLVGPPGVGKTSLGKSIAKA-------------------LGRKFVRISLGGVRDEAEIRGHRRTYIGAMPGKIIQGM 411 (782) T ss_pred CEEEEECCCCCCCHHHHHHHHHH-------------------HCCCEEEEECCCCCCHHHHCCCCCCCCCCCCHHHHHHH T ss_conf 57999789988701189999999-------------------58977999547654277753553123356872899999 Q ss_pred HH-CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 95-8998569993258898805679999999999997269849997487579 Q gi|254780750|r 725 NQ-ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 725 ~~-at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) +. -+.|-++||||+-+=+|.+-|-- |.|.||-|=-. +- .=|.-||-++ T Consensus 412 kka~~~NPv~LLDEIDKm~ss~rGDP-aSALLEVLDPE-QN-~~F~DhYLev 460 (782) T COG0466 412 KKAGVKNPVFLLDEIDKMGSSFRGDP-ASALLEVLDPE-QN-NTFSDHYLEV 460 (782) T ss_pred HHHCCCCCEEEEECHHHCCCCCCCCH-HHHHHHHCCHH-HC-CCHHHCCCCC T ss_conf 98677687478640333167777886-88888626976-56-7612220167 No 379 >COG1643 HrpA HrpA-like helicases [DNA replication, recombination, and repair] Probab=88.91 E-value=1.6 Score=22.25 Aligned_cols=30 Identities=13% Similarity=0.211 Sum_probs=17.4 Q ss_pred HHHHCCC-CEEEEECCCCCHHHHHHHHHHCC Q ss_conf 9852298-57997077689678998874238 Q gi|254780750|r 188 DIMRIDP-REIIFSEKELSHAEYKSLFETLG 217 (920) Q Consensus 188 ~L~~l~P-~EIii~~~~~~~~~~~~l~~~~~ 217 (920) .+.+..| --||+.+-.++...+...+...+ T Consensus 188 ~~~~rr~DLKiIimSATld~~rfs~~f~~ap 218 (845) T COG1643 188 LLARRRDDLKLIIMSATLDAERFSAYFGNAP 218 (845) T ss_pred HHHHCCCCCEEEEEECCCCHHHHHHHCCCCC T ss_conf 9864688705999725358899997628998 No 380 >TIGR00960 3a0501s02 Type II (General) Secretory Pathway (IISP) Family protein; InterPro: IPR013505 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). FtsE is an ABC transporter ATP-binding protein. This protein and its permease partner FtsX, localize to the cell division site. In a number of species, the ftsEX gene pair is located next to ftsY, which encodes the signal recognition particle-docki ng protein.; GO: 0005524 ATP binding, 0051301 cell division. Probab=88.90 E-value=0.58 Score=25.65 Aligned_cols=123 Identities=28% Similarity=0.319 Sum_probs=61.4 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHC--------------CCCCCHHHCCCCCCC---EEEEEEECCCCC----- Q ss_conf 6439999677844078999999999999971--------------985303532068221---056765237661----- Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQM--------------GSYVPASYAHIGIVD---KLFSRVGSADNL----- 706 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQi--------------G~fVPA~~a~i~~~D---~IftRiGa~D~l----- 706 (920) .+.|.-|+||--+||||+||-+.=+.- +-. |--||-=.=.||.|= ++++.--+-||. T Consensus 28 kGem~fL~GHSGaGKST~lkLi~~~~~-P~~G~i~~~G~d~~~L~~r~~P~LRr~iG~VFQD~~LL~drtv~dNVa~pL~ 106 (216) T TIGR00960 28 KGEMVFLVGHSGAGKSTLLKLILGIEK-PTRGKIRFSGQDLTRLRGREVPFLRRHIGMVFQDHRLLSDRTVYDNVALPLR 106 (216) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHCC-CCCCEEEECCCCCCCCCCCCCCCEEEECCEEECCHHHHHHCCHHHHHHHHHH T ss_conf 850799856888607899999985228-9986078715421001577467300010426701155311655455243355 Q ss_pred CCCCCHHHHHHHHHHHHHH--------------------------HC--CCCCEEEEECCCCCCCHHHHHHHHHHHH--- Q ss_conf 1385328999999999999--------------------------58--9985699932588988056799999999--- Q gi|254780750|r 707 ASGRSTFMVEMIETASILN--------------------------QA--TNQSFVILDEIGRGTATLDGLSIAWATI--- 755 (920) Q Consensus 707 ~~g~STF~vEm~e~~~IL~--------------------------~a--t~~SLVllDElGrGTst~DG~aiA~ail--- 755 (920) .-|.++=-.+-. ++..|+ .| .+=.|.|=||- |...|- +.+.-|+ T Consensus 107 iiG~~~~~~~~r-v~~aL~~VGL~~K~~~lP~~LSGGEQQRv~IARA~V~~P~lLLADEP---TGNLD~-~~S~~il~Lf 181 (216) T TIGR00960 107 IIGVPGRDINER-VSAALEKVGLKGKAHALPVQLSGGEQQRVAIARAVVNKPALLLADEP---TGNLDP-ELSRDILRLF 181 (216) T ss_pred HCCCCCCCHHHH-HHHHHHHCCCCCHHHCCCCCCCCCCHHHHHHHHHHCCCCCCEEEECC---CCCCCH-HHHHHHHHHH T ss_conf 228997426789-99998730611212407620048503455664443067970131088---988788-8899999999 Q ss_pred HHHHHHCCCEEEEECCCHHHHHH Q ss_conf 99997269849997487579766 Q gi|254780750|r 756 EYLHETNRCRGLLATHFHELTDL 778 (920) Q Consensus 756 e~l~~~~~~~~lfaTHy~eL~~l 778 (920) |.++ +.|..+|.|||.+.|.+- T Consensus 182 ~~~n-~~G~TVl~ATHD~~L~~~ 203 (216) T TIGR00960 182 EEFN-RAGTTVLVATHDINLVES 203 (216) T ss_pred HHHH-CCCCEEEECCCHHHHHHH T ss_conf 8750-378547771024889973 No 381 >cd01393 recA_like RecA is a bacterial enzyme which has roles in homologous recombination, DNA repair, and the induction of the SOS response. RecA couples ATP hydrolysis to DNA strand exchange. While prokaryotes have a single RecA protein, eukaryotes have multiple RecA homologs such as Rad51, DMC1 and Rad55/57. Archaea have the RecA-like homologs radA and radB. Probab=88.78 E-value=1.7 Score=22.19 Aligned_cols=116 Identities=16% Similarity=0.306 Sum_probs=57.3 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHH-C----C--CCCCHHHC-------CCC---------CCCEEEEEEECCCC Q ss_conf 643999967784407899999999999997-1----9--85303532-------068---------22105676523766 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-M----G--SYVPASYA-------HIG---------IVDKLFSRVGSADN 705 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQ-i----G--~fVPA~~a-------~i~---------~~D~IftRiGa~D~ 705 (920) .+++.-|.||..+|||++.=|+|+.+.+.- . | .|+-++.. .++ +.|+|+.- ..++ T Consensus 18 ~G~ItEi~G~~gsGKT~l~lqla~~~q~~~~~~~~~g~vvyIDtE~~f~~~rl~~i~~~~~~~~~~~l~~i~~~--~~~~ 95 (226) T cd01393 18 TGRITEIFGEFGSGKTQLCLQLAVEAQLPGELGGLEGKVVYIDTEGAFRPERLVQLAVRFGLDPEEVLDNIYVA--RPYN 95 (226) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCHHHHHHHEEEE--CCCC T ss_conf 88399999999998999999999998542211699961999955775319999999876032667764333684--3799 Q ss_pred CCCCCCHHHHHHHH-HHHHHHHCCCCCEEEEECCC---------CCCCHHHHHHHH---HHHHHHHHHHCCCEEEEECCC Q ss_conf 11385328999999-99999958998569993258---------898805679999---999999997269849997487 Q gi|254780750|r 706 LASGRSTFMVEMIE-TASILNQATNQSFVILDEIG---------RGTATLDGLSIA---WATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 706 l~~g~STF~vEm~e-~~~IL~~at~~SLVllDElG---------rGTst~DG~aiA---~aile~l~~~~~~~~lfaTHy 772 (920) ..+ ..++.+ ...++. ..+-.||++|=+. +++. .+...+. ...|..+..+.++-++++-|. T Consensus 96 ~e~-----~~~~~~~l~~~~~-~~~v~liViDSi~al~r~~~~~~~~~-~~r~~~l~~~~~~L~~la~~~~~avv~tNQv 168 (226) T cd01393 96 GEQ-----QLEIVEELERIMS-SGRVDLVVVDSVAALFRKEFIGRGML-AERARLLSQALRKLLRLADKFNVAVVFTNQV 168 (226) T ss_pred HHH-----HHHHHHHHHHHHH-CCCCEEEEEECCCEECCHHHCCCCCH-HHHHHHHHHHHHHHHHHHHHCCCEEEEECEE T ss_conf 999-----9999999998752-47842899932200111444276207-8999999999999999999849799996811 Q ss_pred H Q ss_conf 5 Q gi|254780750|r 773 H 773 (920) Q Consensus 773 ~ 773 (920) . T Consensus 169 ~ 169 (226) T cd01393 169 R 169 (226) T ss_pred E T ss_conf 7 No 382 >COG3854 SpoIIIAA ncharacterized protein conserved in bacteria [Function unknown] Probab=88.63 E-value=1.7 Score=22.12 Aligned_cols=107 Identities=28% Similarity=0.445 Sum_probs=58.1 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCC------------CCCCC--CCHHHHHHH Q ss_conf 9996778440789999999999999719853035320682210567652376------------61138--532899999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSAD------------NLASG--RSTFMVEMI 718 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D------------~l~~g--~STF~vEm~ 718 (920) .+|-||--.||+|+||-+|=.+--- +--|.|- +.+++|-= .-|++.| ++..+ ++-=|.+|. T Consensus 140 tLiigpP~~GKTTlLRdiaR~~s~g-~~~~l~k---kv~IiDer-sEIag~~~gvpq~~~g~R~dVld~cpk~~gmmmaI 214 (308) T COG3854 140 TLIIGPPQVGKTTLLRDIARLLSDG-INQFLPK---KVGIIDER-SEIAGCLNGVPQHGRGRRMDVLDPCPKAEGMMMAI 214 (308) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCC-CCCCCCC---EEEEEECC-CHHHCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 6996599887077999999986315-1126773---28997150-04303435886032322101046561788899999 Q ss_pred HHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH Q ss_conf 99999995899856999325889880567999999999999726984999748757976643 Q gi|254780750|r 719 ETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 719 e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~ 780 (920) ++..+. .+++||+|+- .| |-|+++.++. |-..+-+-|=.++-++.+ T Consensus 215 ------rsm~PE-ViIvDEIGt~---~d----~~A~~ta~~~--GVkli~TaHG~~iedl~k 260 (308) T COG3854 215 ------RSMSPE-VIIVDEIGTE---ED----ALAILTALHA--GVKLITTAHGNGIEDLIK 260 (308) T ss_pred ------HHCCCC-EEEEECCCCH---HH----HHHHHHHHHC--CCEEEEEECCCCHHHHHC T ss_conf ------954995-7998343647---77----9999999854--858999504411777650 No 383 >COG1117 PstB ABC-type phosphate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=88.26 E-value=0.52 Score=26.01 Aligned_cols=35 Identities=37% Similarity=0.470 Sum_probs=23.9 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 443456358777766439999677844078999999999 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~ 673 (920) +=.||+|..+ ...+--+-||-.+|||||||..--+ T Consensus 22 aL~~i~l~i~----~~~VTAlIGPSGcGKST~LR~lNRm 56 (253) T COG1117 22 ALKDINLDIP----KNKVTALIGPSGCGKSTLLRCLNRM 56 (253) T ss_pred HHCCCCEECC----CCCEEEEECCCCCCHHHHHHHHHHH T ss_conf 2315722145----7806998889886788899998754 No 384 >KOG0061 consensus Probab=88.26 E-value=1.8 Score=21.95 Aligned_cols=15 Identities=27% Similarity=0.470 Sum_probs=9.0 Q ss_pred CEEEEEEEEECCCCE Q ss_conf 455458999775303 Q gi|254780750|r 125 LVRRNVVRLVTPGTL 139 (920) Q Consensus 125 ~v~R~Vt~IiTpGT~ 139 (920) -+=+.|+....||.+ T Consensus 44 ~iL~~vsg~~~~Gel 58 (613) T KOG0061 44 TILKGVSGTAKPGEL 58 (613) T ss_pred EEECCCEEEEECCEE T ss_conf 432187799867868 No 385 >KOG2355 consensus Probab=88.17 E-value=0.51 Score=26.05 Aligned_cols=38 Identities=37% Similarity=0.558 Sum_probs=25.0 Q ss_pred CEEEEECCCCCCCHHHHHHHHHH-HHHHHHHH---CCCEEEEECCCH Q ss_conf 56999325889880567999999-99999972---698499974875 Q gi|254780750|r 731 SFVILDEIGRGTATLDGLSIAWA-TIEYLHET---NRCRGLLATHFH 773 (920) Q Consensus 731 SLVllDElGrGTst~DG~aiA~a-ile~l~~~---~~~~~lfaTHy~ 773 (920) -..|+||+ |--.| -+|.| .+|+|-+. -+|..+.|||-. T Consensus 167 kVLLLDEV---TVDLD--VlARadLLeFlkeEce~RgatIVYATHIF 208 (291) T KOG2355 167 KVLLLDEV---TVDLD--VLARADLLEFLKEECEQRGATIVYATHIF 208 (291) T ss_pred EEEEEEEE---EEEHH--HHHHHHHHHHHHHHHHHCCCEEEEEEEEC T ss_conf 37776315---74067--87788899999998863486799973000 No 386 >KOG0744 consensus Probab=88.13 E-value=1.8 Score=21.89 Aligned_cols=101 Identities=27% Similarity=0.338 Sum_probs=56.3 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCC----EEEEEEECCCCCCCCCCHHHHHHHH------ Q ss_conf 439999677844078999999999999971985303532068221----0567652376611385328999999------ Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVD----KLFSRVGSADNLASGRSTFMVEMIE------ 719 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D----~IftRiGa~D~l~~g~STF~vEm~e------ 719 (920) .|+++++||-.-||+++.|.. ||- ..|..-| .+..-|.+ .|-|..=-.| T Consensus 177 NRliLlhGPPGTGKTSLCKaL------aQk--------LSIR~~~~y~~~~liEins-------hsLFSKWFsESgKlV~ 235 (423) T KOG0744 177 NRLILLHGPPGTGKTSLCKAL------AQK--------LSIRTNDRYYKGQLIEINS-------HSLFSKWFSESGKLVA 235 (423) T ss_pred EEEEEEECCCCCCHHHHHHHH------HHH--------HEEEECCCCCCCEEEEEEH-------HHHHHHHHHHHHHHHH T ss_conf 148998579998822799999------875--------1465237644406999704-------6788988712113899 Q ss_pred -----HHHHHHHCCCCCEEEEECCC----------CCCCHHHHHHHHHHHHHHHHHHCCC--EEEEECC Q ss_conf -----99999958998569993258----------8988056799999999999972698--4999748 Q gi|254780750|r 720 -----TASILNQATNQSFVILDEIG----------RGTATLDGLSIAWATIEYLHETNRC--RGLLATH 771 (920) Q Consensus 720 -----~~~IL~~at~~SLVllDElG----------rGTst~DG~aiA~aile~l~~~~~~--~~lfaTH 771 (920) +...+..-..=-+|||||+- .|+.|.||+-..-|++..|-+-.+| ..+.+|- T Consensus 236 kmF~kI~ELv~d~~~lVfvLIDEVESLa~aR~s~~S~~EpsDaIRvVNalLTQlDrlK~~~NvliL~TS 304 (423) T KOG0744 236 KMFQKIQELVEDRGNLVFVLIDEVESLAAARTSASSRNEPSDAIRVVNALLTQLDRLKRYPNVLILATS 304 (423) T ss_pred HHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECC T ss_conf 999999999717896899980787888999875413799821899999999989986047977999626 No 387 >TIGR00606 rad50 rad50; InterPro: IPR004584 Rad50 is involved in recombination, recombinational repair, and/or non-homologous end joining. It is a component of an exonuclease complex with MRE11 homologs. The Saccharomyces cerevisiae Rad50/MRE11 complex possesses single-stranded endonuclease activity and ATP-dependent double-strand-specific exonuclease activity. Rad50 provides an ATP-dependent control of MRE11 by unwinding and repositioning DNA ends into the MRE11 active site. This family is distantly related to the SbcC family of bacterial proteins. When the N- and C-terminal globular regions of Rad50 from Pyrococcus furiosus P58301 from SWISSPROT are co-expressed in Escherichia coli, they spontaneously associate to form a stable complex that possesses ATP-binding and weak ATP-hydrolysing activities. The structure formed is known as the Rad50 catalytic domain (Rad50cd1). In the presence of ATP, two Rad50cd1 molecules interact via their ATP-binding and highly conserved 'signature' motifs to form a dimer. As ATP is buried deep within this dimer interface, the two Rad50cd1 molecules may have to completely disengage after ATP hydrolysis to allow the release of ADP before binding of a new ATP molecule. ATP binding is also accompanied by a 30° rotation of two distinct domains within each Rad50cd1 part of the dimer. This rotation and dimerisation creates a positively charged surface which, potentially, could provide a DNA-binding site capable of accommodating two DNA molecules. The Mre11-docking site within Rad50 has been mapped to two 40-residue heptad-repeat sequences that lie adjacent to the N- and C-terminal ATPase segments. A distinct region within this domain forms a conserved hydrophobic patch that is believed to be the actual Mre11-binding site and lies immediately adjacent to the putative DNA-binding site of Rad50. As Rad50 dimerises in the presence of ATP and forms a stoichiometric complex with Mre11 (one Mre11 subunit binding to one Rad50 subunit), it is possible that the MR complex forms a closely coordinated DNA-binding unit that has the potential to act on two DNA molecules simultaneously. Within this unit, ATP-dependent control of nuclease action might be achieved via Rad50 unwinding or repositioning DNA ends into the active-site of Mre11 . ; GO: 0005524 ATP binding, 0006281 DNA repair, 0030870 Mre11 complex. Probab=88.01 E-value=0.32 Score=27.57 Aligned_cols=50 Identities=10% Similarity=0.232 Sum_probs=25.4 Q ss_pred EEEEEEECCCCEECCHHCC-------------------CC-CCCEEEEEE-----ECCCEEEEEEEECCCCEEEE Q ss_conf 4589997753030201038-------------------78-773699999-----52984999999978655999 Q gi|254780750|r 128 RNVVRLVTPGTLTEDQLLS-------------------PT-DSNYLMAVA-----RIRTEWAIAWIDISTGIFKI 177 (920) Q Consensus 128 R~Vt~IiTpGT~~d~~~l~-------------------~~-~~nyL~aI~-----~~~~~~Gia~iDisTG~~~~ 177 (920) ..+..-+||-|++-+-.-. .. .|.|+-+-. ..+..+-+-|.||.--.+.+ T Consensus 21 ~~~I~F~SP~T~l~GPNG~GKTT~IE~L~y~~TG~~P~~~K~NtFvH~~~VA~~T~V~A~~~L~F~DV~G~~~~~ 95 (1328) T TIGR00606 21 KQIIDFLSPLTLLVGPNGAGKTTIIEALKYVTTGDFPPGKKGNTFVHDPKVARETDVRAQVRLEFRDVKGEEVAV 95 (1328) T ss_pred CCEEEEECCCCEEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHCCCCEEEEEEEEEEECCCEEEE T ss_conf 423321166010127788752589875433204889888888600037222110532058899888617607887 No 388 >PRK12377 putative replication protein; Provisional Probab=87.91 E-value=1.9 Score=21.80 Aligned_cols=103 Identities=17% Similarity=0.187 Sum_probs=56.1 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCC Q ss_conf 99996778440789999999999999719853035320682210567652376611385328999999999999589985 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQS 731 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~S 731 (920) -+|+.||---|||-+-=.+|.-++. -|--| +|+++. |=+.+=..+|--...+ ...++....-. T Consensus 103 NlIf~G~pGtGKTHLA~AIg~~a~~--~G~sV------------lF~t~~--dLv~~L~~a~~~g~~~-~k~l~~l~~~d 165 (248) T PRK12377 103 NFVFSGKPGTGKNHLAAAIGNRLLA--KGRSV------------IVVTVP--DVMSRLHESYDNGQSG-EKFLQELCKVD 165 (248) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHH--CCCEE------------EEEEHH--HHHHHHHHHHHCCCCH-HHHHHHHHCCC T ss_conf 0899899998788999999999998--79969------------998899--9999999999848509-99999973389 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 69993258898805679999999999997269849997487 Q gi|254780750|r 732 FVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 732 LVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) |+||||+|--..+..+..+-.-++..=.+..++ ++++|-. T Consensus 166 LLIIDElG~~~~s~~~~~llfqlI~~Ry~~~ks-~IiTTNL 205 (248) T PRK12377 166 LLVLDEIGIQRETKNEQVVLNQIIDRRTASMRS-VGMLTNL 205 (248) T ss_pred EEEEHHCCCCCCCHHHHHHHHHHHHHHHHCCCC-EEEECCC T ss_conf 898600057889867999999999999855798-6897589 No 389 >KOG2004 consensus Probab=87.88 E-value=1.6 Score=22.39 Aligned_cols=104 Identities=25% Similarity=0.440 Sum_probs=63.0 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEE---ECCC--CCCCCCCHHHHHHH-HHH Q ss_conf 66439999677844078999999999999971985303532068221056765---2376--61138532899999-999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRV---GSAD--NLASGRSTFMVEMI-ETA 721 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRi---Ga~D--~l~~g~STF~vEm~-e~~ 721 (920) -.+.++-.+||-.-||++.-|+||=. .+|=|-|+ |-.| +|-..+-|+---|- .+- T Consensus 436 ~qGkIlCf~GPPGVGKTSI~kSIA~A-------------------LnRkFfRfSvGG~tDvAeIkGHRRTYVGAMPGkiI 496 (906) T KOG2004 436 VQGKILCFVGPPGVGKTSIAKSIARA-------------------LNRKFFRFSVGGMTDVAEIKGHRRTYVGAMPGKII 496 (906) T ss_pred CCCCEEEEECCCCCCCCCHHHHHHHH-------------------HCCCEEEEECCCCCCHHHHCCCCEEEECCCCHHHH T ss_conf 78837998689987732189999998-------------------48746998536634277642542110014884899 Q ss_pred HHHHHC-CCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 999958-99856999325---88988056799999999999972698499974875797 Q gi|254780750|r 722 SILNQA-TNQSFVILDEI---GRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 722 ~IL~~a-t~~SLVllDEl---GrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) +-|+.+ |.|-||||||+ |||-- |-- |.|.||-|--. +- .=|--||-.+. T Consensus 497 q~LK~v~t~NPliLiDEvDKlG~g~q---GDP-asALLElLDPE-QN-anFlDHYLdVp 549 (906) T KOG2004 497 QCLKKVKTENPLILIDEVDKLGSGHQ---GDP-ASALLELLDPE-QN-ANFLDHYLDVP 549 (906) T ss_pred HHHHHHCCCCCEEEEEHHHHHCCCCC---CCH-HHHHHHHCCHH-HC-CCHHHHCCCCC T ss_conf 99986177886588532234178877---986-89998743965-35-53454202664 No 390 >TIGR02142 modC_ABC molybdate ABC transporter, ATP-binding protein; InterPro: IPR011868 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP-binding cassette (ABC) protein of the three subunit molybdate ABC transporter. The three proteins of this complex are homologous to proteins of the sulphate ABC transporter. Molybdenum may be used in nitrogenases of nitrogen-fixing bacteria and in molybdopterin cofactors. In some cases, molybdate may be transported by a sulphate transporter rather than by a specific molybdate transporter.; GO: 0005524 ATP binding, 0015098 molybdate ion transmembrane transporter activity, 0015689 molybdate ion transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=87.66 E-value=1.7 Score=22.07 Aligned_cols=139 Identities=25% Similarity=0.339 Sum_probs=84.7 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH--H------------CCCCCCHHHCCCCCC---CEEEE Q ss_conf 434563587777664399996778440789999999999999--7------------198530353206822---10567 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMA--Q------------MGSYVPASYAHIGIV---DKLFS 698 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilA--Q------------iG~fVPA~~a~i~~~---D~Ift 698 (920) .-|+.+..++ .-+.=|-|+=-+||||++|-||=++=.. | -|-+.|+++=++|.| =++|+ T Consensus 13 ~Ld~~~~~pg----~GvtAlFG~SGsGKTtli~~iaGL~rp~~G~i~l~G~~L~ds~k~i~Lp~ekRr~GYVFQeA~LFP 88 (361) T TIGR02142 13 ELDVDLTLPG----QGVTALFGRSGSGKTTLIRLIAGLTRPDEGEIVLNGEVLFDSRKGIFLPPEKRRIGYVFQEARLFP 88 (361) T ss_pred EEEEEEECCC----CEEEEEECCCCCHHHHHHHHHHHCCCCCCCEEEECCEEEECCCCCCCCCCCCCEEEEEEECCEECC T ss_conf 7777653287----406871258997078999998731675668799887462056776678720113536885355078 Q ss_pred EEECCCCCCCCCCHHHHHH-----------HHHHHHHHH-----------------C--CCCCEEEEECCCCCCCHHHHH Q ss_conf 6523766113853289999-----------999999995-----------------8--998569993258898805679 Q gi|254780750|r 699 RVGSADNLASGRSTFMVEM-----------IETASILNQ-----------------A--TNQSFVILDEIGRGTATLDGL 748 (920) Q Consensus 699 RiGa~D~l~~g~STF~vEm-----------~e~~~IL~~-----------------a--t~~SLVllDElGrGTst~DG~ 748 (920) -.+-..||.-|.+--+.+- .-+...|.. | |.=-|.||||-= |.-|-. T Consensus 89 Hl~Vr~NL~YG~~~~~~~~r~i~~~~v~~lLgi~hLL~R~p~~LSGGEkQRVAIGRALLs~P~LLLMDEPL---aaLD~~ 165 (361) T TIGR02142 89 HLSVRGNLRYGMKRARPKERRISFERVIELLGIEHLLERLPGRLSGGEKQRVAIGRALLSSPRLLLMDEPL---AALDEP 165 (361) T ss_pred CHHHHHHHHCCCCCCCHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHCCCCH---HHCCHH T ss_conf 52334551257210574121378899998746751121678875784047788998874187411104662---340644 Q ss_pred HHHHHH---HHHHHHHCCCEEEEECCC-HHHHHHHHHC Q ss_conf 999999---999997269849997487-5797664306 Q gi|254780750|r 749 SIAWAT---IEYLHETNRCRGLLATHF-HELTDLSKSL 782 (920) Q Consensus 749 aiA~ai---le~l~~~~~~~~lfaTHy-~eL~~l~~~~ 782 (920) .= .-| ||.|++..+--+||..|- .|+..|++.. T Consensus 166 rk-~EilPYLerL~~e~~iP~lyVSHsl~Ev~rLADrv 202 (361) T TIGR02142 166 RK-KEILPYLERLHAELKIPILYVSHSLDEVARLADRV 202 (361) T ss_pred HH-HHHCCHHHHHHHHHCCCEEEEECCHHHHHHHHCEE T ss_conf 46-64161676789872798899904979998760747 No 391 >PRK10078 ribose 1,5-bisphosphokinase; Provisional Probab=87.40 E-value=0.59 Score=25.57 Aligned_cols=21 Identities=43% Similarity=0.646 Sum_probs=18.5 Q ss_pred CEEEEEECCCCCCHHHHHHHH Q ss_conf 439999677844078999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqv 670 (920) +++.+|+||-.+||||+++.+ T Consensus 2 G~LivvsgPSGaGK~Tli~~l 22 (184) T PRK10078 2 GKLIWLMGPSGSGKDSLLAAL 22 (184) T ss_pred CEEEEEECCCCCCHHHHHHHH T ss_conf 709999899869999999999 No 392 >PRK06647 DNA polymerase III subunits gamma and tau; Validated Probab=87.38 E-value=2 Score=21.58 Aligned_cols=15 Identities=20% Similarity=0.284 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 468887777643687 Q gi|254780750|r 255 VEKTAAAAAISYIKK 269 (920) Q Consensus 255 ~~~~a~~all~Yl~~ 269 (920) +...|+.|||.-|++ T Consensus 131 Lt~~A~NALLKtLEE 145 (560) T PRK06647 131 LSNSAFNALLKTIEE 145 (560) T ss_pred CCHHHHHHHHHHHHC T ss_conf 599999999998634 No 393 >PRK06893 DNA replication initiation factor; Validated Probab=87.24 E-value=2.1 Score=21.53 Aligned_cols=112 Identities=21% Similarity=0.332 Sum_probs=59.6 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHH-CCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 43999967784407899999999999997-19853035320682210567652376611385328999999999999589 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQ-MGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQAT 728 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQ-iG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at 728 (920) .+++.|.||-.+|||=+|..+|--..-+. -..|||++.+.- |. ..++.+.. T Consensus 39 ~~~l~i~G~~gsGKTHLLqa~~~~~~~~~~~~~yi~~~~~~~----------------------~~------~~~l~~l~ 90 (229) T PRK06893 39 QPFFYIWGGKSSGKSHLLKAVSNHYLLNQRTAIYIPLSKSQY----------------------FS------PAVLENLE 90 (229) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEHHHHHH----------------------CC------HHHHHHHH T ss_conf 987999899999889999999999997189859997377564----------------------06------99998765 Q ss_pred CCCEEEEECCCC--CCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH--H----HHHHHHHCCCEEEEEEE Q ss_conf 985699932588--988056799999999999972698499974875--7----97664306885899999 Q gi|254780750|r 729 NQSFVILDEIGR--GTATLDGLSIAWATIEYLHETNRCRGLLATHFH--E----LTDLSKSLKRFHNATLQ 791 (920) Q Consensus 729 ~~SLVllDElGr--GTst~DG~aiA~aile~l~~~~~~~~lfaTHy~--e----L~~l~~~~~~v~n~~~~ 791 (920) .-.+|++|-+=- |....+ -.-+-+...+.+..++..+++.+.+ + +.+|...+.....+++. T Consensus 91 ~~d~l~iDDi~~i~g~~~~e--~~lF~l~N~l~~~~~~~ll~ss~~~p~~l~~~l~DL~SRl~~~~~~~i~ 159 (229) T PRK06893 91 QQDLVCLDDLQAVIGNEEWE--LAIFDLFNRIKESGKTLLLISANQSPHALQIKLPDLASRLTWGEIYQLN 159 (229) T ss_pred HCCEEEEECHHHHCCCHHHH--HHHHHHHHHHHHCCCCEEEEECCCCHHHHCCHHHHHHHHHHCCEEEEEC T ss_conf 47979996723424883899--9999999999975991799857988332210026799999688369966 No 394 >TIGR02868 CydC ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydC; InterPro: IPR014223 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents CydC, a member of a heterodimeric ATP-binding cassette-type transporter (ABC transporter). It is involved in the export of glutathione from the cytoplasm to the periplasm and is required for the assembly of both cytochrome c and cytochrome bd , , .; GO: 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0016021 integral to membrane. Probab=87.12 E-value=0.74 Score=24.83 Aligned_cols=144 Identities=25% Similarity=0.330 Sum_probs=83.0 Q ss_pred CCCCEEECC--CCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH-HCCCC-- Q ss_conf 787140000--4680588763102877044434563587777664399996778440789999999999999-71985-- Q gi|254780750|r 608 NSTNFIVKD--GRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMA-QMGSY-- 682 (920) Q Consensus 608 ~~~~l~i~~--gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilA-QiG~f-- 682 (920) +++.+.+++ .+|| .+..+|=++|+|.-+ .++=.-|+||=-+||||++....= ++. +-|-+ T Consensus 356 ~~p~L~~~~ls~~~p---------g~~~~vl~~V~L~l~----~G~r~Ai~G~SG~GKsTLL~~L~G--~l~P~~G~vtl 420 (566) T TIGR02868 356 DKPTLELRDLSVGYP---------GAPPNVLDGVSLDLP----PGERVAILGPSGSGKSTLLATLAG--LLDPLQGEVTL 420 (566) T ss_pred CCCEEEEEEEEEECC---------CCCHHHHCCCCCCCC----CCCCEEEECCCCCCHHHHHHHHHH--HCCCCCCCEEE T ss_conf 875078987765269---------873465427864113----886089866887657899999984--02899991787 Q ss_pred --CCHHH-------CCCCCCC---EEEE-------EEE---C----CC-----------------CCCCCCCHHHHHHHH Q ss_conf --30353-------2068221---0567-------652---3----76-----------------611385328999999 Q gi|254780750|r 683 --VPASY-------AHIGIVD---KLFS-------RVG---S----AD-----------------NLASGRSTFMVEMIE 719 (920) Q Consensus 683 --VPA~~-------a~i~~~D---~Ift-------RiG---a----~D-----------------~l~~g~STF~vEm~e 719 (920) ||..+ ..++.+. ++|. ||| + +| .|-.|.+|=|.|+-. T Consensus 421 ~G~~~~~~~~~evrr~v~~~aQ~aHlF~ttvr~NLrlarpdaaaGDtdeE~~~aL~~vgL~~~~~~LP~Gl~T~~ge~G~ 500 (566) T TIGR02868 421 DGVPVSSLSEDEVRRRVSVFAQDAHLFDTTVRENLRLARPDAAAGDTDEELLAALERVGLADWLRALPDGLDTVLGEGGA 500 (566) T ss_pred CCCCHHHCCCCHHEHHEEECCCCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCC T ss_conf 77324325731100000312788621105478788731888899888899999999715802386385767853035643 Q ss_pred -----------HHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHH-HHC-CCE-----EEEECCC Q ss_conf -----------9999995899856999325889880567999999999999-726-984-----9997487 Q gi|254780750|r 720 -----------TASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLH-ETN-RCR-----GLLATHF 772 (920) Q Consensus 720 -----------~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~-~~~-~~~-----~lfaTHy 772 (920) .|.+| .++--++||||-.=+=+...+ ..+++-|. +-. +=. ++..||- T Consensus 501 ~lSGGeRqRLALARaL--l~~ap~llLDEPTehLDa~t~----~~ll~dL~t~a~~g~t~~~R~vvliTH~ 565 (566) T TIGR02868 501 RLSGGERQRLALARAL--LADAPILLLDEPTEHLDAETA----DELLEDLLTAALSGRTESARAVVLITHH 565 (566) T ss_pred CCCHHHHHHHHHHHHH--HCCCCEEEECCCCCCCCHHHH----HHHHHHHHCCCCCCCCCCCCEEEEEEEC T ss_conf 0104899999999997--379988986088666787679----9999998505544655898748987506 No 395 >pfam00437 GSPII_E Type II/IV secretion system protein. This family contains both type II and type IV pathway secretion proteins from bacteria. VirB11 ATPase is a subunit of the Agrobacterium tumefaciens transfer DNA (T-DNA) transfer system, a type IV secretion pathway required for delivery of T-DNA and effector proteins to plant cells during infection. Probab=87.02 E-value=0.65 Score=25.24 Aligned_cols=22 Identities=27% Similarity=0.250 Sum_probs=18.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHH Q ss_conf 4399996778440789999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqva 671 (920) ...++|+||=.+||||+|+... T Consensus 139 ~~~ilIsG~TGSGKTT~l~all 160 (283) T pfam00437 139 RGNILVSGGTGSGKTTLLYALL 160 (283) T ss_pred CCEEEEECCCCCCHHHHHHHHH T ss_conf 9759998899998899999999 No 396 >COG1484 DnaC DNA replication protein [DNA replication, recombination, and repair] Probab=86.89 E-value=2.1 Score=21.39 Aligned_cols=97 Identities=16% Similarity=0.275 Sum_probs=57.7 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHH-------HHHH Q ss_conf 999967784407899999999999997198530353206822105676523766113853289999999-------9999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIET-------ASIL 724 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~-------~~IL 724 (920) -++++||=..|||-+.=.+|.-.+ +-|-- -+|+.+ +-|+-++.+. ..++ T Consensus 107 nl~l~G~~G~GKthLa~Ai~~~l~--~~g~s------------v~f~~~----------~el~~~Lk~~~~~~~~~~~l~ 162 (254) T COG1484 107 NLVLLGPPGVGKTHLAIAIGNELL--KAGIS------------VLFITA----------PDLLSKLKAAFDEGRLEEKLL 162 (254) T ss_pred CEEEECCCCCCHHHHHHHHHHHHH--HCCCE------------EEEEEH----------HHHHHHHHHHHHCCCHHHHHH T ss_conf 289989999879999999999999--83984------------999885----------999999999874552689999 Q ss_pred HHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 95899856999325889880567999999999999726984999748757 Q gi|254780750|r 725 NQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHE 774 (920) Q Consensus 725 ~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~e 774 (920) +....=.|.||||||-=+.+.-+..+..-++...... +.. +|+|.+.. T Consensus 163 ~~l~~~dlLIiDDlG~~~~~~~~~~~~~q~I~~r~~~-~~~-~~tsN~~~ 210 (254) T COG1484 163 RELKKVDLLIIDDIGYEPFSQEEADLLFQLISRRYES-RSL-IITSNLSF 210 (254) T ss_pred HHHHHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHH-CCC-CEEECCCH T ss_conf 8875289899823677668815587999999999973-054-20205882 No 397 >COG2256 MGS1 ATPase related to the helicase subunit of the Holliday junction resolvase [DNA replication, recombination, and repair] Probab=86.88 E-value=1.9 Score=21.74 Aligned_cols=14 Identities=14% Similarity=0.180 Sum_probs=5.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 11345666433464 Q gi|254780750|r 535 FTTLELIDLENRIT 548 (920) Q Consensus 535 f~t~eL~~l~~~i~ 548 (920) |..|....++.++. T Consensus 408 yy~p~~~g~E~~i~ 421 (436) T COG2256 408 YYQPTNRGFEKKIG 421 (436) T ss_pred CCCCCCCCHHHHHH T ss_conf 20699850799999 No 398 >cd01123 Rad51_DMC1_radA Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57 . Probab=86.82 E-value=2.2 Score=21.37 Aligned_cols=116 Identities=19% Similarity=0.291 Sum_probs=59.2 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHH-CC------CCCCHHHCCCC-----------------CCCEEEEEEECCC Q ss_conf 643999967784407899999999999997-19------85303532068-----------------2210567652376 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-MG------SYVPASYAHIG-----------------IVDKLFSRVGSAD 704 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQ-iG------~fVPA~~a~i~-----------------~~D~IftRiGa~D 704 (920) .+.+..|.||-.+|||++.=|+|+.+.+.. .| .||-++.. ++ +.|+|+.-- +. T Consensus 18 ~G~itEi~G~~GsGKTql~lqla~~~~~~~~~~g~~~~vvyIdtE~~-f~~~Rl~qia~~~~~~~~~~l~~i~~~~-~~- 94 (235) T cd01123 18 TGSITEIFGEFGSGKTQLCHQLAVTVQLPIELGGLEGKAVYIDTEGT-FRPERLVQIAERFGLDPEEVLDNIYVAR-AY- 94 (235) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC-CCHHHHHHHHHHHCCCHHHHHCCEEEEC-CC- T ss_conf 78799999999984999999999998424753678962999953677-5889999999971347245422547963-79- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECC---------CCCCCHHHHHHHHH---HHHHHHHHHCCCEEEEECCC Q ss_conf 6113853289999999999995899856999325---------88988056799999---99999997269849997487 Q gi|254780750|r 705 NLASGRSTFMVEMIETASILNQATNQSFVILDEI---------GRGTATLDGLSIAW---ATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 705 ~l~~g~STF~vEm~e~~~IL~~at~~SLVllDEl---------GrGTst~DG~aiA~---aile~l~~~~~~~~lfaTHy 772 (920) |. ..++--+.+....+....+-.||++|=+ |++ +..+....-. ..|..|..+-++-++++-|- T Consensus 95 ~~----~~l~~~l~~l~~~l~~~~~v~LvVIDSia~l~r~e~~~~~-~~~~r~~~l~~~~~~L~~lA~~~~~aVvvtNqv 169 (235) T cd01123 95 NS----DHQLQLLEELEAILIESSRIKLVIVDSVTALFRAEFDGRG-ELAERQQHLAKLLRTLKRLADEFNVAVVITNQV 169 (235) T ss_pred CH----HHHHHHHHHHHHHHHCCCCCEEEEEEECHHHHHHHHCCCC-CHHHHHHHHHHHHHHHHHHHHHCCCEEEEECEE T ss_conf 99----9999999999998730377239999610455566644886-447899999999999999999809799996806 No 399 >KOG0250 consensus Probab=86.79 E-value=0.67 Score=25.14 Aligned_cols=11 Identities=9% Similarity=0.184 Sum_probs=5.4 Q ss_pred HHHHHHHHCCC Q ss_conf 99999972698 Q gi|254780750|r 754 TIEYLHETNRC 764 (920) Q Consensus 754 ile~l~~~~~~ 764 (920) |+.+|++..+| T Consensus 586 V~N~LID~s~i 596 (1074) T KOG0250 586 VLNVLIDKSGI 596 (1074) T ss_pred HHHHHHHHCCC T ss_conf 88886432052 No 400 >COG1419 FlhF Flagellar GTP-binding protein [Cell motility and secretion] Probab=86.61 E-value=2.2 Score=21.29 Aligned_cols=118 Identities=24% Similarity=0.345 Sum_probs=66.9 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHH-------------HH Q ss_conf 64399996778440789999999999999719853035320682210567652376611385328-------------99 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTF-------------MV 715 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF-------------~v 715 (920) .++++.+-||-.-||+|.|=-.|-.-.|- ..+-+++++.-===||||..-| +|+ .. T Consensus 202 ~~~vi~LVGPTGVGKTTTlAKLAar~~~~-------~~~~kVaiITtDtYRIGA~EQL----k~Ya~im~vp~~vv~~~~ 270 (407) T COG1419 202 QKRVIALVGPTGVGKTTTLAKLAARYVML-------KKKKKVAIITTDTYRIGAVEQL----KTYADIMGVPLEVVYSPK 270 (407) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHH-------CCCCCEEEEEECCCHHHHHHHH----HHHHHHHCCCEEEECCHH T ss_conf 68579998998875887999999999753-------2576068997144115289999----999998699559963999 Q ss_pred HHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC----CCEEEEEC-CCHHHHHHHHHCCC Q ss_conf 99999999995899856999325889880567999999999999726----98499974-87579766430688 Q gi|254780750|r 716 EMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETN----RCRGLLAT-HFHELTDLSKSLKR 784 (920) Q Consensus 716 Em~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~----~~~~lfaT-Hy~eL~~l~~~~~~ 784 (920) |+.+.-.-|+++ -+||+|=.|| |.+|+.-++. ++.+++.. -+++|=|| -|..|.+..+.+.. T Consensus 271 el~~ai~~l~~~---d~ILVDTaGr--s~~D~~~i~e--l~~~~~~~~~i~~~Lvlsat~K~~dlkei~~~f~~ 337 (407) T COG1419 271 ELAEAIEALRDC---DVILVDTAGR--SQYDKEKIEE--LKELIDVSHSIEVYLVLSATTKYEDLKEIIKQFSL 337 (407) T ss_pred HHHHHHHHHHCC---CEEEEECCCC--CCCCHHHHHH--HHHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHCC T ss_conf 999999985318---8899968998--8337899999--99997035662179998457646889999997245 No 401 >KOG1533 consensus Probab=86.40 E-value=0.58 Score=25.64 Aligned_cols=37 Identities=16% Similarity=0.173 Sum_probs=24.2 Q ss_pred CCCCEEEECCHHHCCEEEEECCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHH Q ss_conf 74540142513320154541278776412787630123433789999863100 Q gi|254780750|r 285 DIQSTLFIDSAARSNLEILRTLSGSREQSLLKTIDYSITGAGGRLFAERIASP 337 (920) Q Consensus 285 ~~~~~m~LD~~Tl~nLEI~~~~~g~~~gSL~~~Ln~t~T~~G~RlLr~wL~~P 337 (920) ..+.|+..|-- -+.|+|. ..+||.+++ |.|++|=.++ T Consensus 95 ~~~~Y~lFDcP--GQVELft-----~h~~l~~I~---------~~Lek~~~rl 131 (290) T KOG1533 95 LTDHYVLFDCP--GQVELFT-----HHDSLNKIF---------RKLEKLDYRL 131 (290) T ss_pred CCCCEEEEECC--CCEEEEE-----CCCHHHHHH---------HHHHHCCCEE T ss_conf 34748999579--8279874-----256099999---------9999769537 No 402 >TIGR00763 lon ATP-dependent protease La; InterPro: IPR004815 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This signature defines the bacterial and eukaryotic lon proteases, which are ATP-dependent serine peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SF). This family of sequences does not include the archaeal lon homologs, IPR004663 from INTERPRO. In the eukaryotes the majority of the proteins are located in the mitochondrial matrix , . In yeast, Pim1, is located in the mitochondrial matrix, is required for mitochondrial function, is constitutively expressed but is increased after thermal stress, suggesting that Pim1 may play a role in the heat shock response .; GO: 0004176 ATP-dependent peptidase activity, 0005524 ATP binding, 0006510 ATP-dependent proteolysis. Probab=86.28 E-value=1.4 Score=22.76 Aligned_cols=104 Identities=30% Similarity=0.469 Sum_probs=67.4 Q ss_pred CCE-EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEE---ECCC--CCCCCCCHHHH----HHH Q ss_conf 643-9999677844078999999999999971985303532068221056765---2376--61138532899----999 Q gi|254780750|r 649 SGK-LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRV---GSAD--NLASGRSTFMV----EMI 718 (920) Q Consensus 649 ~~~-~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRi---Ga~D--~l~~g~STF~v----Em~ 718 (920) .+. ++-+-||=.=|||++=|+||= -..|=|.|| |-.| .|=..+=|.-- =|. T Consensus 448 ~GpqIlClvGPPGVGKTSlg~SIA~-------------------ALnRkFvR~SlGG~~DeAEIrGHRRTYvGAMPGrii 508 (941) T TIGR00763 448 KGPQILCLVGPPGVGKTSLGKSIAK-------------------ALNRKFVRFSLGGVRDEAEIRGHRRTYVGAMPGRII 508 (941) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHH-------------------HHCCEEEEEEECCCEEHHHCCCCCCCCCCCCHHHHH T ss_conf 8876787207269542227899999-------------------968804999526722031127864320346725789 Q ss_pred HHHHHHHHC-CCCCEEEEECC---C--CCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHH Q ss_conf 999999958-99856999325---8--8988056799999999999972698499974875797-6643 Q gi|254780750|r 719 ETASILNQA-TNQSFVILDEI---G--RGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT-DLSK 780 (920) Q Consensus 719 e~~~IL~~a-t~~SLVllDEl---G--rGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~-~l~~ 780 (920) .-|+.| |.|=|+||||| | +|---.= |.|.||=| +=.+- .=|.=||-++. +|++ T Consensus 509 ---Q~lk~~~t~NPl~LlDEIDK~~~~~~~~GDP----aSALLEvL-DPEQN-~~F~DHYldvp~DLS~ 568 (941) T TIGR00763 509 ---QGLKKAKTKNPLILLDEIDKIGLKSSFRGDP----ASALLEVL-DPEQN-NAFSDHYLDVPFDLSK 568 (941) T ss_pred ---HHHHHCCCCCCEEEEEEEEEECCCCCCCCCH----HHHHHHHC-CHHHC-CCCCCCCCCCCCCHHH T ss_conf ---9987604158806862022001678865563----78886412-86436-0425530023400420 No 403 >COG1124 DppF ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=86.21 E-value=2.3 Score=21.14 Aligned_cols=133 Identities=25% Similarity=0.252 Sum_probs=67.1 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHC------CCCCCHHH---CCCCCCCEEEEEEECC-- Q ss_conf 443456358777766439999677844078999999999999971------98530353---2068221056765237-- Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQM------GSYVPASY---AHIGIVDKLFSRVGSA-- 703 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQi------G~fVPA~~---a~i~~~D~IftRiGa~-- 703 (920) +=|++++.. ..+-.+=|-||--+||||+.|.++-+.== .. |..++-.. +...+|--||-.-..+ T Consensus 22 ~l~~VS~~i----~~Ge~lgivGeSGsGKSTL~r~l~Gl~~p-~~G~I~~~G~~~~~~~~~~~~~~~VQmVFQDp~~SLn 96 (252) T COG1124 22 ALNNVSLEI----ERGETLGIVGESGSGKSTLARLLAGLEKP-SSGSILLDGKPLAPKKRAKAFYRPVQMVFQDPYSSLN 96 (252) T ss_pred HHCCEEEEE----CCCCEEEEECCCCCCHHHHHHHHHCCCCC-CCCEEEECCCCCCCCCCCHHHCCCEEEEECCCCCCCC T ss_conf 414325996----48978999848989888999999565678-8862898884057665303330450699518722468 Q ss_pred ------CCCC-----CCCCHH---HHHHH-----------------------HHHHHHH-HCCCCCEEEEECCCCCCCHH Q ss_conf ------6611-----385328---99999-----------------------9999999-58998569993258898805 Q gi|254780750|r 704 ------DNLA-----SGRSTF---MVEMI-----------------------ETASILN-QATNQSFVILDEIGRGTATL 745 (920) Q Consensus 704 ------D~l~-----~g~STF---~vEm~-----------------------e~~~IL~-~at~~SLVllDElGrGTst~ 745 (920) +-|. .|.+.= ..|+. .. .|.| -+.+-.++|+||- ||.. T Consensus 97 P~~tv~~~l~Epl~~~~~~~~~~~i~~~L~~VgL~~~~l~R~P~eLSGGQ~QRi-aIARAL~~~PklLIlDEp---tSaL 172 (252) T COG1124 97 PRRTVGRILSEPLRPHGLSKSQQRIAELLDQVGLPPSFLDRRPHELSGGQRQRI-AIARALIPEPKLLILDEP---TSAL 172 (252) T ss_pred CCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCHHHHHCCCHHCCHHHHHHH-HHHHHHCCCCCEEEECCC---HHHH T ss_conf 410198997424303775378999999999849998998539421281689999-999986368887995382---3441 Q ss_pred HHHHHHHHHHHHHHH---HCCCEEEEECCCHHHHH Q ss_conf 679999999999997---26984999748757976 Q gi|254780750|r 746 DGLSIAWATIEYLHE---TNRCRGLLATHFHELTD 777 (920) Q Consensus 746 DG~aiA~aile~l~~---~~~~~~lfaTHy~eL~~ 777 (920) |- ++-..|+..|.+ .-+--.||.||.-.+.+ T Consensus 173 D~-siQa~IlnlL~~l~~~~~lt~l~IsHdl~~v~ 206 (252) T COG1124 173 DV-SVQAQILNLLLELKKERGLTYLFISHDLALVE 206 (252) T ss_pred CH-HHHHHHHHHHHHHHHHCCCEEEEEECCHHHHH T ss_conf 58-89999999999999861945999967299999 No 404 >cd01131 PilT Pilus retraction ATPase PilT. PilT is a nucleotide binding protein responsible for the retraction of type IV pili, likely by pili disassembly. This retraction provides the force required for travel of bacteria in low water environments by a mechanism known as twitching motility. Probab=86.18 E-value=0.71 Score=24.99 Aligned_cols=19 Identities=53% Similarity=0.607 Sum_probs=17.0 Q ss_pred EEEEECCCCCCHHHHHHHH Q ss_conf 9999677844078999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqv 670 (920) +.|||||-.+||||.|... T Consensus 3 liLitG~TGSGKTTtl~al 21 (198) T cd01131 3 LVLVTGPTGSGKSTTLAAM 21 (198) T ss_pred EEEEECCCCCCHHHHHHHH T ss_conf 8999899999799999999 No 405 >TIGR03263 guanyl_kin guanylate kinase. Members of this family are the enzyme guanylate kinase, also called GMP kinase. This enzyme transfers a phosphate from ATP to GMP, yielding ADP and GDP. Probab=86.15 E-value=0.84 Score=24.44 Aligned_cols=22 Identities=36% Similarity=0.572 Sum_probs=18.9 Q ss_pred CEEEEEECCCCCCHHHHHHHHH Q ss_conf 4399996778440789999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqva 671 (920) +++++|+||-.+||+|+++.+. T Consensus 1 G~livl~GpsG~GK~tl~~~l~ 22 (180) T TIGR03263 1 GLLIVISGPSGVGKSTLVKALL 22 (180) T ss_pred CEEEEEECCCCCCHHHHHHHHH T ss_conf 9399998999889999999999 No 406 >pfam05621 TniB Bacterial TniB protein. This family consists of several bacterial TniB NTP-binding proteins. TniB is a probable ATP-binding protein which is involved in Tn5053 mercury resistance transposition. Probab=86.06 E-value=1.2 Score=23.34 Aligned_cols=48 Identities=15% Similarity=0.173 Sum_probs=29.5 Q ss_pred HHHHHHCCCCCEEEEECC---CCCCCHHHHHHHHHHHHHHHHHHCC-CEEEEECC Q ss_conf 999995899856999325---8898805679999999999997269-84999748 Q gi|254780750|r 721 ASILNQATNQSFVILDEI---GRGTATLDGLSIAWATIEYLHETNR-CRGLLATH 771 (920) Q Consensus 721 ~~IL~~at~~SLVllDEl---GrGTst~DG~aiA~aile~l~~~~~-~~~lfaTH 771 (920) -..|+....+ ++|+||+ -+||...-- -..+.+.+|.+..+ +.+.+-|. T Consensus 138 ~~ll~~~~vr-mLIIDEiHnlL~Gs~~~qr--~~ln~LK~L~Nel~IpiV~vGt~ 189 (302) T pfam05621 138 LALLRKVGVR-MLVIDELHNVLAGNSVNRR--EFLNLLRFLGNELRIPLVGVGTR 189 (302) T ss_pred HHHHHHCCCC-EEEEECHHHHCCCCHHHHH--HHHHHHHHHHHCCCCCEEEECCH T ss_conf 9999974987-8998543656048688999--99999999863658786995319 No 407 >pfam06745 KaiC KaiC. This family represents a conserved region within bacterial and archaeal proteins, most of which are hypothetical. More than one copy is sometimes found in each protein. This family includes KaiC, which is one of the Kai proteins among which direct protein-protein association may be a critical process in the generation of circadian rhythms in cyanobacteria. Probab=86.03 E-value=2.4 Score=21.08 Aligned_cols=122 Identities=16% Similarity=0.152 Sum_probs=61.8 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHH--HCCCCCCHHHC---------CCCC-CCE------E-EEE-----EEC-C Q ss_conf 64399996778440789999999999999--71985303532---------0682-210------5-676-----523-7 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMA--QMGSYVPASYA---------HIGI-VDK------L-FSR-----VGS-A 703 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilA--QiG~fVPA~~a---------~i~~-~D~------I-ftR-----iGa-~ 703 (920) .+++.+|+||--+|||++.-|.+..-.+- .-|+||-.+.- .+++ ++. + +.. .+. . T Consensus 18 ~gs~~LI~G~pGsGKT~la~qfl~~ga~~~ge~~lYis~ee~~~~l~~~~~~~g~~~~~~~~~g~l~i~d~~~~~~~~~~ 97 (231) T pfam06745 18 EGRVVLITGGPGTGKTIFGLQFLYNGALEYGEPGVYVTLEEPPEDLRENAKSFGWDLEKLEEEGKLAIIDASTSGIGIAE 97 (231) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHCCCCHHHHHHCCCEEEEECCCCCCCCCC T ss_conf 99699998589725999999999999986589689998137999999999982998589864696789862544222100 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCC---CCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 6611385328999999999999589985699932588---98805679999999999997269849997487579 Q gi|254780750|r 704 DNLASGRSTFMVEMIETASILNQATNQSFVILDEIGR---GTATLDGLSIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 704 D~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGr---GTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) +....+...|...+ ...++.- +-..|++|=+.- -.+...-...-+....++. ..+|.+||+.|.+.. T Consensus 98 ~~~~~~~~~~~~~i---~~~i~~~-~~~~vVIDsit~l~~~~~~~~~r~~l~~l~~~lk-~~g~t~l~t~e~~~~ 167 (231) T pfam06745 98 VKGRFDLEELIERL---REAIREI-GAKRVVIDSITTLFYLLKPAMAREILRRLKRVLK-KLGVTAIFTSEKPSG 167 (231) T ss_pred CCCCCCHHHHHHHH---HHHHHHH-CCCEEEEECCHHHHHCCCHHHHHHHHHHHHHHHH-HCCCEEEEEEEECCC T ss_conf 11227999999999---9999971-9988999764164005889999999999999999-769919999821257 No 408 >COG4618 ArpD ABC-type protease/lipase transport system, ATPase and permease components [General function prediction only] Probab=85.78 E-value=2.4 Score=20.99 Aligned_cols=53 Identities=13% Similarity=0.107 Sum_probs=35.2 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHC Q ss_conf 9985699932588988056799999999999972698499974875797664306 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSL 782 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~ 782 (920) ..--||++||--.--+..--.|++.|+.+ + +.-|+.+++.||-+.+-...+.. T Consensus 489 G~P~lvVLDEPNsNLD~~GE~AL~~Ai~~-~-k~rG~~vvviaHRPs~L~~~Dki 541 (580) T COG4618 489 GDPFLVVLDEPNSNLDSEGEAALAAAILA-A-KARGGTVVVIAHRPSALASVDKI 541 (580) T ss_pred CCCCEEEECCCCCCCCHHHHHHHHHHHHH-H-HHCCCEEEEEECCHHHHHHCCEE T ss_conf 89708995589877652679999999999-9-97698799992677788515534 No 409 >cd01130 VirB11-like_ATPase Type IV secretory pathway component VirB11, and related ATPases. The homohexamer, VirB11 is one of eleven Vir proteins, which are required for T-pilus biogenesis and virulence in the transfer of T-DNA from the Ti (tumor-inducing) plasmid of bacterial to plant cells. The pilus is a fibrous cell surface organelle, which mediates adhesion between bacteria during conjugative transfer or between bacteria and host eukaryotic cells during infection. VirB11- related ATPases include the archaeal flagella biosynthesis protein and the pilus assembly proteins CpaF/TadA and TrbB. This alignment contains the C-terminal domain, which is the ATPase. Probab=85.73 E-value=0.75 Score=24.81 Aligned_cols=105 Identities=21% Similarity=0.335 Sum_probs=48.0 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH-HCCCCC----CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 99996778440789999999999999719853035-320682----2105676523766113853289999999999995 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS-YAHIGI----VDKLFSRVGSADNLASGRSTFMVEMIETASILNQ 726 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~-~a~i~~----~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~ 726 (920) -++|+||-.+||||+|+.+.-.+ -.. --.|-.+ ..++.+ +-++++|... ..+...+ - .+.+++. T Consensus 27 nIlIsG~tGSGKTTll~al~~~i-~~~-~rivtiEd~~El~l~~~~~v~l~~~~~~----~~~~~~~--~---~~~li~~ 95 (186) T cd01130 27 NILISGGTGSGKTTLLNALLAFI-PPD-ERIITIEDTAELQLPHPNWVRLVTRPGN----VEGSGEV--T---MADLLRS 95 (186) T ss_pred CEEEECCCCCCHHHHHHHHHHHC-CCC-CCEEEECCCHHHCCCCCCEEEEEEECCC----CCCCCEE--C---HHHHHHH T ss_conf 89998999998999999999613-345-6459841535404777756888860464----5786503--4---9999887 Q ss_pred C--CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 8--9985699932588988056799999999999972698499974875797 Q gi|254780750|r 727 A--TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELT 776 (920) Q Consensus 727 a--t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~ 776 (920) | -.-..|++-|+ ||- + |++.++.+. ....-++.+.|=.... T Consensus 96 aLR~~pd~iivGEi-R~~---E----a~~~l~a~~-tGh~g~ltTiHa~s~~ 138 (186) T cd01130 96 ALRMRPDRIIVGEV-RGG---E----ALDLLQAMN-TGHPGGMTTIHANSAE 138 (186) T ss_pred HCCCCCCEEECCCC-CCH---H----HHHHHHHHH-CCCCCCCCCCCCCCHH T ss_conf 36689973731756-839---9----999999997-4898603031589999 No 410 >COG2804 PulE Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB [Cell motility and secretion / Intracellular trafficking and secretion] Probab=85.33 E-value=0.81 Score=24.55 Aligned_cols=20 Identities=45% Similarity=0.572 Sum_probs=17.6 Q ss_pred EEEEEECCCCCCHHHHHHHH Q ss_conf 39999677844078999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqv 670 (920) -++++|||-.|||||.|-+. T Consensus 259 GliLvTGPTGSGKTTTLY~~ 278 (500) T COG2804 259 GLILVTGPTGSGKTTTLYAA 278 (500) T ss_pred EEEEEECCCCCCHHHHHHHH T ss_conf 08999689999889999999 No 411 >COG4185 Uncharacterized protein conserved in bacteria [Function unknown] Probab=85.22 E-value=0.33 Score=27.49 Aligned_cols=18 Identities=44% Similarity=0.650 Sum_probs=15.1 Q ss_pred CEEEEEECCCCCCHHHHH Q ss_conf 439999677844078999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFL 667 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~l 667 (920) .+++|+-|||.+||||+- T Consensus 2 ~~l~IvaG~NGsGKstv~ 19 (187) T COG4185 2 KRLDIVAGPNGSGKSTVY 19 (187) T ss_pred CEEEEEECCCCCCCEEEE T ss_conf 658999668888732543 No 412 >COG0194 Gmk Guanylate kinase [Nucleotide transport and metabolism] Probab=84.97 E-value=0.94 Score=24.05 Aligned_cols=110 Identities=20% Similarity=0.230 Sum_probs=52.1 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 64399996778440789999999999999719853035320682210567652376611385328999999999999589 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQAT 728 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at 728 (920) .+.+.+|+||-..||||.+|.+ +++-+ +-+.-| -=||- -...=..|...|++--.|-...+.+ T Consensus 3 ~G~l~vlsgPSG~GKsTl~k~L-----~~~~~-l~~SVS--------~TTR~-pR~gEv~G~dY~Fvs~~EF~~~i~~-- 65 (191) T COG0194 3 KGLLIVLSGPSGVGKSTLVKAL-----LEDDK-LRFSVS--------ATTRK-PRPGEVDGVDYFFVTEEEFEELIER-- 65 (191) T ss_pred CCEEEEEECCCCCCHHHHHHHH-----HHHCC-EEEEEE--------ECCCC-CCCCCCCCCEEEECCHHHHHHHHHC-- T ss_conf 8639999899888889999999-----86349-379998--------52679-9998757802475779999998756-- Q ss_pred CCCEEEEECC-CCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCC Q ss_conf 9856999325-889880567999999999999726984999748757976643068 Q gi|254780750|r 729 NQSFVILDEI-GRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLK 783 (920) Q Consensus 729 ~~SLVllDEl-GrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~ 783 (920) .-++=--|+ | .+-|.+.. .|-+.+ .. |--+++-.-+.--...-+..| T Consensus 66 -~~fLE~a~~~g----nyYGT~~~-~ve~~~-~~-G~~vildId~qGa~qvk~~~p 113 (191) T COG0194 66 -DEFLEWAEYHG----NYYGTSRE-PVEQAL-AE-GKDVILDIDVQGALQVKKKMP 113 (191) T ss_pred -CCCEEEEEECC----CCCCCCHH-HHHHHH-HC-CCEEEEEEEHHHHHHHHHHCC T ss_conf -87478877719----73248688-999998-66-990899985399999997499 No 413 >PRK08118 topology modulation protein; Reviewed Probab=84.69 E-value=0.89 Score=24.24 Aligned_cols=62 Identities=23% Similarity=0.259 Sum_probs=35.2 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCE Q ss_conf 99967784407899999999999997198530353206822105676523766113853289999999999995899856 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSF 732 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SL 732 (920) .+|-|+..|||||+-|++| +..|.+| --.|+||=.=| -.......|..- ++.++. ..+ T Consensus 4 I~IiG~~GsGKSTlAr~L~-----~~~~ip~-------~~LD~l~w~~~---w~~~~~~e~~~~---~~~~~~---~~~- 61 (167) T PRK08118 4 IILIGSGGSGKSTLARQLG-----EKLNIPV-------HHLDALFWKPN---WEGVPKEEQRTV---QNELVK---EDE- 61 (167) T ss_pred EEEECCCCCCHHHHHHHHH-----HHHCCCE-------EECCCEEECCC---CCCCCHHHHHHH---HHHHHH---CCC- T ss_conf 9998899987999999999-----9889697-------96443476689---946888999999---999983---898- Q ss_pred EEEE Q ss_conf 9993 Q gi|254780750|r 733 VILD 736 (920) Q Consensus 733 VllD 736 (920) =|+| T Consensus 62 WIid 65 (167) T PRK08118 62 WIID 65 (167) T ss_pred EEEE T ss_conf 7994 No 414 >COG4148 ModC ABC-type molybdate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=84.51 E-value=2.7 Score=20.59 Aligned_cols=149 Identities=22% Similarity=0.327 Sum_probs=81.4 Q ss_pred EEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--------------HHHHCCCCCCHHHCCCCCC---CEEEE Q ss_conf 434563587777664399996778440789999999999--------------9997198530353206822---10567 Q gi|254780750|r 636 ANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIV--------------IMAQMGSYVPASYAHIGIV---DKLFS 698 (920) Q Consensus 636 pNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~v--------------ilAQiG~fVPA~~a~i~~~---D~Ift 698 (920) .-|+.+.. ....+.-|.|+--+||||+++.||=+. .=++-|.|+|++.-+||-| -|+|- T Consensus 14 ~l~a~~~~----p~~GvTAlFG~SGsGKTslin~IaGL~rPdeG~I~lngr~L~Ds~k~i~lp~~~RriGYVFQDARLFp 89 (352) T COG4148 14 ALDANFTL----PARGITALFGPSGSGKTSLINMIAGLTRPDEGRIELNGRVLVDAEKGIFLPPEKRRIGYVFQDARLFP 89 (352) T ss_pred EEEEECCC----CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCEEEECCCCCCCCCHHHHEEEEEEECCCCCC T ss_conf 79874368----77856999647888716189897434776661899898886515667446754611356740020166 Q ss_pred EEECCCCCCCCC--C---HH--HHHHHHHHHHHHH-------------------CCCCCEEEEECCCCCC-CHHHHHHHH Q ss_conf 652376611385--3---28--9999999999995-------------------8998569993258898-805679999 Q gi|254780750|r 699 RVGSADNLASGR--S---TF--MVEMIETASILNQ-------------------ATNQSFVILDEIGRGT-ATLDGLSIA 751 (920) Q Consensus 699 RiGa~D~l~~g~--S---TF--~vEm~e~~~IL~~-------------------at~~SLVllDElGrGT-st~DG~aiA 751 (920) -+--.-||.-|. | .| .|.|.-+...|+. -|.--|.||||--..- .+.-+-- T Consensus 90 H~tVrgNL~YG~~~~~~~~fd~iv~lLGI~hLL~R~P~~LSGGEkQRVAIGRALLt~P~LLLmDEPLaSLD~~RK~Ei-- 167 (352) T COG4148 90 HYTVRGNLRYGMWKSMRAQFDQLVALLGIEHLLDRYPGTLSGGEKQRVAIGRALLTAPELLLMDEPLASLDLPRKREI-- 167 (352) T ss_pred CEEEECCHHHHCCCCCHHHHHHHHHHHCCHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCHHHCCCCHHHHH-- T ss_conf 517732200101435367799999984848678508775676155677788887549773430684032365104677-- Q ss_pred HHHHHHHHHHCCCEEEEECCC-HHHHHHHHHCCCEEEEEE Q ss_conf 999999997269849997487-579766430688589999 Q gi|254780750|r 752 WATIEYLHETNRCRGLLATHF-HELTDLSKSLKRFHNATL 790 (920) Q Consensus 752 ~aile~l~~~~~~~~lfaTHy-~eL~~l~~~~~~v~n~~~ 790 (920) .--+|.|.+..+--.++.||- .|+..|++..--..|..+ T Consensus 168 lpylERL~~e~~IPIlYVSHS~~Ev~RLAd~vV~le~GkV 207 (352) T COG4148 168 LPYLERLRDEINIPILYVSHSLDEVLRLADRVVVLENGKV 207 (352) T ss_pred HHHHHHHHHHCCCCEEEEECCHHHHHHHHHEEEEECCCEE T ss_conf 8999998762188789993689999965423898208848 No 415 >cd01129 PulE-GspE PulE/GspE The type II secretory pathway is the main terminal branch of the general secretory pathway (GSP). It is responsible for the export the majority of Gram-negative bacterial exoenzymes and toxins. PulE is a cytoplasmic protein of the GSP, which contains an ATP binding site and a tetracysteine motif. This subgroup also includes PillB and HofB. Probab=84.48 E-value=0.93 Score=24.08 Aligned_cols=19 Identities=42% Similarity=0.593 Sum_probs=16.5 Q ss_pred EEEEEECCCCCCHHHHHHH Q ss_conf 3999967784407899999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQ 669 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRq 669 (920) -+.|||||..+||||.|-. T Consensus 81 GlilitGptGSGKtTtl~a 99 (264) T cd01129 81 GIILVTGPTGSGKTTTLYS 99 (264) T ss_pred CEEEEECCCCCCHHHHHHH T ss_conf 8899978999977999999 No 416 >COG2805 PilT Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion] Probab=84.26 E-value=0.74 Score=24.85 Aligned_cols=63 Identities=27% Similarity=0.423 Sum_probs=37.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 39999677844078999999999999971985303532068221056765237661138532899999999999958998 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQ 730 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~ 730 (920) -+.++|||-.|||||.|-. ++-..-+- ---.|.|- .|-|. .-+.+.+ T Consensus 126 GLILVTGpTGSGKSTTlAa--mId~iN~~------------~~~HIlTI---EDPIE----------------~vh~skk 172 (353) T COG2805 126 GLILVTGPTGSGKSTTLAA--MIDYINKH------------KAKHILTI---EDPIE----------------YVHESKK 172 (353) T ss_pred CEEEEECCCCCCHHHHHHH--HHHHHHCC------------CCCCEEEE---CCCHH----------------HHHCCHH T ss_conf 6699867999967879999--99998414------------77516872---37468----------------6504327 Q ss_pred CEEEEECCCCCCCHHH Q ss_conf 5699932588988056 Q gi|254780750|r 731 SFVILDEIGRGTATLD 746 (920) Q Consensus 731 SLVllDElGrGTst~D 746 (920) |||-=-|+|+-|-+++ T Consensus 173 slI~QREvG~dT~sF~ 188 (353) T COG2805 173 SLINQREVGRDTLSFA 188 (353) T ss_pred HHHHHHHHCCCHHHHH T ss_conf 6666877454278899 No 417 >PRK10436 hypothetical protein; Provisional Probab=84.06 E-value=0.99 Score=23.90 Aligned_cols=74 Identities=24% Similarity=0.350 Sum_probs=35.8 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEE-ECCC---CCCCCCCHHHHHHHHHHHHHHH Q ss_conf 39999677844078999999999999971985303532068221056765-2376---6113853289999999999995 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRV-GSAD---NLASGRSTFMVEMIETASILNQ 726 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRi-Ga~D---~l~~g~STF~vEm~e~~~IL~~ 726 (920) -+.++|||--|||||.|-. .|.++.. .+.-.+++=|-|=-++ |... |-..| -||..- ...+|++ T Consensus 216 GliLvtGPTGSGKTTTLya-----~L~~l~~---~~~~I~TiEDPVE~~l~gi~Q~~vn~~~g-~tfa~~---lrs~LRq 283 (461) T PRK10436 216 GLVLVTGPTGSGKTVTLYS-----ALQTLNT---AQINICSVEDPVEIPLAGINQTQIHPRAG-LTFQRV---LRALLRQ 283 (461) T ss_pred CEEEEECCCCCCHHHHHHH-----HHHHHCC---CCCEEEEEECCCCCCCCCCCCCEECCCCC-HHHHHH---HHHHHCC T ss_conf 7799978999956999999-----9974346---77169996077435546754523132213-139999---9998746 Q ss_pred CCCCCEEEEECC Q ss_conf 899856999325 Q gi|254780750|r 727 ATNQSFVILDEI 738 (920) Q Consensus 727 at~~SLVllDEl 738 (920) +++ -+++.|+ T Consensus 284 -DPD-VImvGEI 293 (461) T PRK10436 284 -DPD-VIMVGEI 293 (461) T ss_pred -CCC-EEEECCC T ss_conf -999-9986577 No 418 >PRK01184 hypothetical protein; Provisional Probab=83.94 E-value=1.4 Score=22.86 Aligned_cols=122 Identities=18% Similarity=0.187 Sum_probs=55.5 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCH-HHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 39999677844078999999999999971985303-53206822105676523766113853289999999999995899 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPA-SYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATN 729 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA-~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~ 729 (920) .++-|||++.+||||.-. ++...|.+|=- .. ++.+...+ .|..-.-..+...+.-|+. T Consensus 2 ~iIGlTG~iGSGKstva~------i~~e~G~~vi~~~D----ivr~~v~~--------~g~~~~~~~~~~~~~~lR~--- 60 (183) T PRK01184 2 MIIIVTGMPGSGKGEFSK------IARELGIPVVVMGD----VIREEVKK--------RGLPPTDENIGKVATDLRK--- 60 (183) T ss_pred EEEEEECCCCCCHHHHHH------HHHHCCCEEEECCH----HHHHHHHH--------CCCCCCHHHHHHHHHHHHH--- T ss_conf 399996899887899999------99977993998607----78999998--------3899977899999999998--- Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEE--CCCHHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEE Q ss_conf 8569993258898805679999999999997269849997--48757976643068858999999960992778777744 Q gi|254780750|r 730 QSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLA--THFHELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIP 807 (920) Q Consensus 730 ~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfa--THy~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~ 807 (920) ++|. -++|.-+++++.+.....+++- -|..|+..+...++++. +-+-....++.|-+-+.. T Consensus 61 -------~~G~-------~~~a~~~~~~i~~~~~~~~vidgir~~~E~~~~~~~~~~~~---li~V~A~~~~R~eRl~~R 123 (183) T PRK01184 61 -------ELGM-------DAVAIRTVPKIRELGSELVVVDGVRGDAEVEYFRKEFEDFI---LVAIHAPPETRFERLKKR 123 (183) T ss_pred -------HHCC-------HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCEE---EEEEECCHHHHHHHHHHC T ss_conf -------7195-------58999999999703798289816787899999997469849---999989888999999846 Q ss_pred CCC Q ss_conf 789 Q gi|254780750|r 808 GIA 810 (920) Q Consensus 808 G~~ 810 (920) |-+ T Consensus 124 ~r~ 126 (183) T PRK01184 124 GRS 126 (183) T ss_pred CCC T ss_conf 998 No 419 >pfam00625 Guanylate_kin Guanylate kinase. Probab=83.90 E-value=1.1 Score=23.48 Aligned_cols=110 Identities=19% Similarity=0.154 Sum_probs=49.1 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHH-HCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 399996778440789999999999999-7198530353206822105676523766113853289999999999995899 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMA-QMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATN 729 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilA-QiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~ 729 (920) +++||.||-.+||+|+++.+.-. .. +.+.-||+..-...+- -+...|-.+-....|. +|.+....|..+.- T Consensus 2 klivl~GPSG~GK~tl~~~L~~~--~~~~~~~~vs~TTR~~R~~-----E~~G~dY~Fvs~~~F~-~~i~~~~FlE~~~~ 73 (182) T pfam00625 2 RPIVLSGPSGVGKSHIKKALLDE--YPEKFGYSVSHTTRPPRPG-----EVDGKDYHFVSKEEME-NDISANEFLEHAEF 73 (182) T ss_pred CEEEEECCCCCCHHHHHHHHHHH--CCCCEEEEEEEECCCCCCC-----CCCCCEEEEECHHHHH-HHHHHHHHHHHHHH T ss_conf 86999898999999999999984--8667344576554799987-----8789657996589999-98754377762640 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCC Q ss_conf 856999325889880567999999999999726984999748757976643068 Q gi|254780750|r 730 QSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLK 783 (920) Q Consensus 730 ~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~ 783 (920) +. --=||+- .-++..++. +..+++-....-+..+.+..+ T Consensus 74 ~g------~~YGt~~--------~~I~~~~~~-g~~vvl~id~~g~~~lk~~~~ 112 (182) T pfam00625 74 NG------NYYGTSK--------EAIEQIAES-GKICILDVDIQGVKQLRKAEL 112 (182) T ss_pred CC------CCEEECH--------HHHHHHHHC-CCEEEEEECHHHHHHHHHHCC T ss_conf 79------7256402--------777999867-996999972899999987495 No 420 >cd03273 ABC_SMC2_euk Eukaryotic SMC2 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (fo Probab=83.89 E-value=0.99 Score=23.88 Aligned_cols=22 Identities=32% Similarity=0.391 Sum_probs=17.5 Q ss_pred EEEEEECCCCCCHHHHHHHHHH Q ss_conf 3999967784407899999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNAL 672 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval 672 (920) .+-.|+|||-+|||-.+-.+.- T Consensus 26 g~taIVGPNGsGKSNI~DAIrw 47 (251) T cd03273 26 QFNAITGLNGSGKSNILDAICF 47 (251) T ss_pred CCEEEECCCCCCHHHHHHHHHH T ss_conf 9569999998866889999999 No 421 >PRK00300 gmk guanylate kinase; Provisional Probab=83.82 E-value=1.2 Score=23.30 Aligned_cols=23 Identities=39% Similarity=0.579 Sum_probs=20.4 Q ss_pred CCCEEEEEECCCCCCHHHHHHHH Q ss_conf 66439999677844078999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQN 670 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqv 670 (920) ..++++||+||-.+||+|+++.+ T Consensus 5 ~~g~livisGPSG~GK~tl~~~L 27 (208) T PRK00300 5 RRGLLIVLSAPSGAGKSTLVRAL 27 (208) T ss_pred CCCCEEEEECCCCCCHHHHHHHH T ss_conf 18838999999988999999999 No 422 >PRK13768 GTPase; Provisional Probab=83.67 E-value=1.2 Score=23.39 Aligned_cols=30 Identities=23% Similarity=0.323 Sum_probs=13.3 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 99996778440789999999999999719853 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV 683 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fV 683 (920) .+++.||-.+|||||.+. +...+.+.|-.| T Consensus 4 ~~~ViGpaGSGKsT~~~~--l~~~l~~~~r~~ 33 (253) T PRK13768 4 IVFFLGTAGSGKTTLVGA--LSDWLEEQGYDV 33 (253) T ss_pred EEEEECCCCCCHHHHHHH--HHHHHHHCCCCE T ss_conf 899989999988999999--999999769975 No 423 >smart00072 GuKc Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions. Probab=83.61 E-value=1.2 Score=23.17 Aligned_cols=22 Identities=32% Similarity=0.443 Sum_probs=19.1 Q ss_pred CEEEEEECCCCCCHHHHHHHHH Q ss_conf 4399996778440789999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqva 671 (920) .++.||.||..+||+|+++++. T Consensus 2 ~r~iil~Gpsg~GK~tl~~~l~ 23 (184) T smart00072 2 RRPIVLSGPSGVGKGTLLAELI 23 (184) T ss_pred CCEEEEECCCCCCHHHHHHHHH T ss_conf 9779999999999999999998 No 424 >PRK11519 tyrosine kinase; Provisional Probab=83.55 E-value=2 Score=21.60 Aligned_cols=61 Identities=15% Similarity=0.236 Sum_probs=38.5 Q ss_pred EEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCC-CHHHHHHHHHHHHHHHHCCCCC Q ss_conf 40000468058876310287704443456358777766439999677844-0789999999999999719853 Q gi|254780750|r 612 FIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMG-GKSTFLRQNALIVIMAQMGSYV 683 (920) Q Consensus 612 l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmg-GKSt~lRqval~vilAQiG~fV 683 (920) +.+.+-+-|+.|.+-. +--++.+.. .....++++||++..| ||||.- +.|+..|||.|.=| T Consensus 497 ~~~~~p~s~~~Ea~R~-------lrt~l~~~~--~~~~~~vi~vTS~~pgEGKSt~a--~nLA~~~A~~G~rv 558 (720) T PRK11519 497 LAVGNPTDLAIEAIRS-------LRTSLHFAM--MQAQNNVLMMTGVSPSIGKTFVC--ANLAAVISQTNKRV 558 (720) T ss_pred EEECCCCCHHHHHHHH-------HHHHHHHHC--CCCCCCEEEEEECCCCCCHHHHH--HHHHHHHHHCCCEE T ss_conf 3435998768999999-------999998744--68887679997089999789999--99999998379919 No 425 >KOG0085 consensus Probab=83.49 E-value=0.89 Score=24.24 Aligned_cols=11 Identities=18% Similarity=0.341 Sum_probs=4.5 Q ss_pred CCCCCCEEECC Q ss_conf 04787140000 Q gi|254780750|r 606 IDNSTNFIVKD 616 (920) Q Consensus 606 i~~~~~l~i~~ 616 (920) |.+++.|-.-+ T Consensus 264 F~nssVIlFLN 274 (359) T KOG0085 264 FQNSSVILFLN 274 (359) T ss_pred CCCCCEEEEEC T ss_conf 05884699963 No 426 >pfam00308 Bac_DnaA Bacterial dnaA protein. Probab=83.48 E-value=3 Score=20.29 Aligned_cols=97 Identities=16% Similarity=0.200 Sum_probs=53.4 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHC-C---CCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 9999677844078999999999999971-9---85303532068221056765237661138532899999999999958 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQM-G---SYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQi-G---~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a 727 (920) .+.|.||-.+|||-+|..+|-- +.++. | .|+||+...=..++.+ ..+. +.+...-++++ T Consensus 36 pl~i~G~~G~GKTHLLqA~~~~-~~~~~~~~~v~yl~~~~~~~~~~~~l----------~~~~------~~~f~~~l~~~ 98 (219) T pfam00308 36 PLFIYGGVGLGKTHLLHAIGNY-ALRNFPNLRVVYLTSEEFLNDFVDAL----------RDNK------IEAFKKSYRNV 98 (219) T ss_pred CEEEECCCCCCHHHHHHHHHHH-HHHHCCCCEEEEEEHHHHHHHHHHHH----------HCCC------HHHHHHHHHHC T ss_conf 2699889999888999999999-99849998288843999999889999----------8188------88999997632 Q ss_pred CCCCEEEEECCCC---CCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 9985699932588---98805679999999999997269849997487 Q gi|254780750|r 728 TNQSFVILDEIGR---GTATLDGLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 728 t~~SLVllDElGr---GTst~DG~aiA~aile~l~~~~~~~~lfaTHy 772 (920) .++++|.+=. -+.+.+. .+.++.++.+. +...+++... T Consensus 99 ---d~l~iDDi~~l~~~~~~ee~---lf~l~N~~~~~-~~~lllts~~ 139 (219) T pfam00308 99 ---DLLLIDDIQFLAGKEKTQEE---FFHTFNALHEN-NKQIVLTSDR 139 (219) T ss_pred ---CHHHHCCHHHHCCCHHHHHH---HHHHHHHHHHC-CCEEEEECCC T ss_conf ---33652236765686478999---99999999972-9869997799 No 427 >PRK07261 topology modulation protein; Provisional Probab=83.39 E-value=1.1 Score=23.59 Aligned_cols=63 Identities=19% Similarity=0.195 Sum_probs=37.3 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCE Q ss_conf 99967784407899999999999997198530353206822105676523766113853289999999999995899856 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQSF 732 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~SL 732 (920) .+|-|+..|||||+-|++| ++.|.+| --.|.||=.= +-.....-.|... ++.++.. .+ T Consensus 3 I~IiG~sGsGKSTlAr~L~-----~~~~ip~-------~~LD~l~w~p---~w~~~~~~e~~~~---~~~~~~~---~~- 60 (171) T PRK07261 3 IAIIGYSGSGKSTLARFLG-----QHYNCPV-------LHLDQLHFSS---NWQERDDDDMIAD---ISNFLLK---QD- 60 (171) T ss_pred EEEECCCCCCHHHHHHHHH-----HHHCCCE-------EEECCEEECC---CCEECCHHHHHHH---HHHHHHC---CC- T ss_conf 9998899986899999999-----9879797-------9702278889---9988889999999---9999848---98- Q ss_pred EEEEC Q ss_conf 99932 Q gi|254780750|r 733 VILDE 737 (920) Q Consensus 733 VllDE 737 (920) =|+|= T Consensus 61 WIiDG 65 (171) T PRK07261 61 WIIEG 65 (171) T ss_pred EEEEC T ss_conf 79947 No 428 >COG4175 ProV ABC-type proline/glycine betaine transport system, ATPase component [Amino acid transport and metabolism] Probab=83.18 E-value=1.3 Score=22.99 Aligned_cols=56 Identities=25% Similarity=0.326 Sum_probs=30.1 Q ss_pred EEE--ECCCCCCCCHHHHHHCCCCCCH-HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 454--1278776412787630123433-7899998631002345678989999997402 Q gi|254780750|r 301 EIL--RTLSGSREQSLLKTIDYSITGA-GGRLFAERIASPLTDSTKINTRLDSINFFIQ 356 (920) Q Consensus 301 EI~--~~~~g~~~gSL~~~Ln~t~T~~-G~RlLr~wL~~PL~d~~~I~~R~daVe~l~~ 356 (920) ||| --++|+.|+||...+|+-.-|- |+=++...=...+...+-++-|..-+.-..+ T Consensus 55 eIfViMGLSGSGKSTLvR~~NrLiept~G~ilv~g~di~~~~~~~Lr~~Rr~~~sMVFQ 113 (386) T COG4175 55 EIFVIMGLSGSGKSTLVRLLNRLIEPTRGEILVDGKDIAKLSAAELRELRRKKISMVFQ 113 (386) T ss_pred EEEEEEECCCCCHHHHHHHHHCCCCCCCCEEEECCCCHHCCCHHHHHHHHHHHHHHHHH T ss_conf 08999935878788999997356788774389888432018999999998643213344 No 429 >KOG1969 consensus Probab=83.11 E-value=3.1 Score=20.18 Aligned_cols=42 Identities=14% Similarity=0.124 Sum_probs=17.4 Q ss_pred EECCCH---HHHHHHHHCCCEEEEEEEEEE-ECCEEEEEEEEEECC Q ss_conf 974875---797664306885899999996-099277877774478 Q gi|254780750|r 768 LATHFH---ELTDLSKSLKRFHNATLQVSD-SNEGIIFLHKVIPGI 809 (920) Q Consensus 768 faTHy~---eL~~l~~~~~~v~n~~~~~~~-~~~~i~flykl~~G~ 809 (920) |.+|.. +...++....-...|.+.+.. ..++..+-|++.|-+ T Consensus 711 ~~~~l~~~re~aql~~lV~im~sY~Lty~q~~~~d~~~~~rldP~i 756 (877) T KOG1969 711 VNKSLYSKREKAQLEELVQIMCSYSLTYIQNRVEDGQYGLRLDPPI 756 (877) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCCEEEEECCCH T ss_conf 5666443657999999999998603135754348852367746864 No 430 >pfam00448 SRP54 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins. Probab=83.08 E-value=3.1 Score=20.18 Aligned_cols=163 Identities=21% Similarity=0.199 Sum_probs=74.5 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCC---CCCHHHCCCCCCCEEEEEEECCCCCCC-----CCCHHHH-HHHHHH Q ss_conf 3999967784407899999999999997198---530353206822105676523766113-----8532899-999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGS---YVPASYAHIGIVDKLFSRVGSADNLAS-----GRSTFMV-EMIETA 721 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~---fVPA~~a~i~~~D~IftRiGa~D~l~~-----g~STF~v-Em~e~~ 721 (920) +++.+-||+-.||||.+=-.|.-. . .-|- +|-|+.. |+||.+-|.. |...+-+ +-.+.. T Consensus 2 ~vi~lvGptGvGKTTTiaKLAa~~-~-~~~~~V~lit~Dt~----------R~gA~eQL~~ya~~l~v~~~~~~~~~d~~ 69 (196) T pfam00448 2 NVILLVGLQGSGKTTTIAKLAAYL-K-KQGKKVLLVAADTF----------RAAAIEQLKQLAERLGVPVFGSGTGSDPA 69 (196) T ss_pred EEEEEECCCCCCHHHHHHHHHHHH-H-HCCCEEEEEECCCC----------CHHHHHHHHHHHHHCCCEEEECCCCCCHH T ss_conf 699998999998899999999999-9-77992899975877----------68899999999986398178148777878 Q ss_pred HHHH----H--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC----EEEEE-CCCHHHHHHHHHCCCEEEEEE Q ss_conf 9999----5--89985699932588988056799999999999972698----49997-487579766430688589999 Q gi|254780750|r 722 SILN----Q--ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRC----RGLLA-THFHELTDLSKSLKRFHNATL 790 (920) Q Consensus 722 ~IL~----~--at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~----~~lfa-THy~eL~~l~~~~~~v~n~~~ 790 (920) .+++ . .....+||+|=-||. ..|..-+.+ ++.+.+..++ ++|=| |++..+......+..+. T Consensus 70 ~~~~~~l~~~~~~~~D~IlIDTaGr~--~~d~~~~~e--l~~~~~~~~~~~~~LVl~a~~~~~~~~~~~~f~~~~~---- 141 (196) T pfam00448 70 AVAFDAVEKAKAENYDVVLVDTAGRL--QNDKNLMDE--LKKIKRVIAPDEVLLVLDATTGQNALNQAKAFNEAVG---- 141 (196) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCC--CCCHHHHHH--HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCC---- T ss_conf 99999999988468999999899987--476778999--9999852287302899856778213789998760047---- Q ss_pred EEEEECCEEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 999609927787777447898877899999829998999999999999987 Q gi|254780750|r 791 QVSDSNEGIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFEK 841 (920) Q Consensus 791 ~~~~~~~~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le~ 841 (920) -+.++|+ ||=+ .+.----++++...|+|-.-+-.-+.+ +.||. T Consensus 142 -----~~~~I~T-KlDe-t~~~G~~l~~~~~~~~Pi~~~t~Gq~v-~Dl~~ 184 (196) T pfam00448 142 -----ITGVILT-KLDG-DAKGGAALSIAAETGKPIKFIGVGEKI-DDLEP 184 (196) T ss_pred -----CCCEEEE-EECC-CCCCCHHHHHHHHHCCCEEEEECCCCH-HHCCC T ss_conf -----7626888-4057-887529998999989697999679981-20634 No 431 >TIGR01832 kduD 2-deoxy-D-gluconate 3-dehydrogenase; InterPro: IPR011286 This entry describes 2-deoxy-D-gluconate 3-dehydrogenase (also called 2-keto-3-deoxygluconate oxidoreductase), a member of the family of short-chain-alcohol dehydrogenases (IPR002198 from INTERPRO). This protein has been characterised in Erwinia chrysanthemi as an enzyme of pectin degradation .; GO: 0008678 2-deoxy-D-gluconate 3-dehydrogenase activity, 0051287 NAD binding. Probab=83.06 E-value=1.3 Score=23.02 Aligned_cols=13 Identities=23% Similarity=0.394 Sum_probs=4.9 Q ss_pred CHHHHHHHHHHHH Q ss_conf 4567898999999 Q gi|254780750|r 340 DSTKINTRLDSIN 352 (920) Q Consensus 340 d~~~I~~R~daVe 352 (920) |=.-|-+|.|+++ T Consensus 87 NNAGIIrR~da~~ 99 (249) T TIGR01832 87 NNAGIIRREDAVE 99 (249) T ss_pred ECCCHHHHHHHHH T ss_conf 0742210457751 No 432 >COG1132 MdlB ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms] Probab=83.00 E-value=1.5 Score=22.56 Aligned_cols=122 Identities=24% Similarity=0.313 Sum_probs=61.6 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHH---------------------HHHHHHHHCCCCCCHHHC--CCC Q ss_conf 443456358777766439999677844078999999---------------------999999971985303532--068 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQN---------------------ALIVIMAQMGSYVPASYA--HIG 691 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqv---------------------al~vilAQiG~fVPA~~a--~i~ 691 (920) |=+|+++.. ..+...-|.||+.+||||+++-+ .+..+-.++| +||=+.. .-+ T Consensus 344 vl~~is~~i----~~Ge~vaiVG~sGsGKSTl~~LL~r~~~~~~G~I~idg~dI~~i~~~~lr~~i~-~V~Qd~~LF~~T 418 (567) T COG1132 344 VLKDISFSI----EPGEKVAIVGPSGSGKSTLIKLLLRLYDPTSGEILIDGIDIRDISLDSLRKRIG-IVSQDPLLFSGT 418 (567) T ss_pred CCCCCEEEE----CCCCEEEEECCCCCCHHHHHHHHHHHCCCCCCEEEECCEEHHHCCHHHHHHHEE-EECCCCHHHHHH T ss_conf 110522775----489878885588885789999998615888836989999777538567887823-546642777766 Q ss_pred CCC--------------------------------EEEEEEECC-CCCCCCCCHHHHHHHH-HHHHHHHCCCCCEEEEEC Q ss_conf 221--------------------------------056765237-6611385328999999-999999589985699932 Q gi|254780750|r 692 IVD--------------------------------KLFSRVGSA-DNLASGRSTFMVEMIE-TASILNQATNQSFVILDE 737 (920) Q Consensus 692 ~~D--------------------------------~IftRiGa~-D~l~~g~STF~vEm~e-~~~IL~~at~~SLVllDE 737 (920) +.| ...|.+|.+ .+++.||= -..- ...+|+++ .++|+|| T Consensus 419 I~~NI~~g~~~at~eei~~a~k~a~~~d~I~~lp~g~dt~vge~G~~LSgGQr----QriaiARall~~~---~iLILDE 491 (567) T COG1132 419 IRENIALGRPDATDEEIEEALKLANAHEFIANLPDGYDTIVGERGVNLSGGQR----QRLAIARALLRNP---PILILDE 491 (567) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHCCCCCCCEECCCCCCCCHHHH----HHHHHHHHHHHCC---CEEEECC T ss_conf 99997457889999999999999486378984766678360578871889999----9999999970369---8688723 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCC-CEEEEECCC Q ss_conf 58898805679999999999997269-849997487 Q gi|254780750|r 738 IGRGTATLDGLSIAWATIEYLHETNR-CRGLLATHF 772 (920) Q Consensus 738 lGrGTst~DG~aiA~aile~l~~~~~-~~~lfaTHy 772 (920) - ||..|-..= ..+.+.+.+..+ ..++..+|= T Consensus 492 a---TS~lD~~tE-~~I~~~l~~l~~~rT~iiIaHR 523 (567) T COG1132 492 A---TSALDTETE-ALIQDALKKLLKGRTTLIIAHR 523 (567) T ss_pred H---HHCCCHHHH-HHHHHHHHHHHCCCEEEEEECC T ss_conf 1---102677569-9999999998269879998667 No 433 >PRK06731 flhF flagellar biosynthesis regulator FlhF; Validated Probab=82.91 E-value=3.2 Score=20.13 Aligned_cols=171 Identities=18% Similarity=0.294 Sum_probs=82.9 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC---CCCCHHHCCCCCCCEE--EEE-EECCCCCCCCCCHHHHHHHHHHH Q ss_conf 64399996778440789999999999999719---8530353206822105--676-52376611385328999999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG---SYVPASYAHIGIVDKL--FSR-VGSADNLASGRSTFMVEMIETAS 722 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG---~fVPA~~a~i~~~D~I--ftR-iGa~D~l~~g~STF~vEm~e~~~ 722 (920) ..++.++-|||.+||||.+==+|.- +..-| ..|.|+..+.+-+|++ |++ +|..=-...+. .++.+... T Consensus 74 ~~~vI~lvG~~G~GKTTT~AKLA~~--~~~~~~kV~lia~DtyR~aA~eQLk~~a~~l~v~~~~~~~~----~~~~~~~~ 147 (270) T PRK06731 74 EVQTIALIGPTGVGKTTTLAKMAWQ--FHGKKKTVGFITTDHSRIGTVQQLQDYVKTIGFEVIAVRDE----AAMTRALT 147 (270) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHH--HHHCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCEECCCCH----HHHHHHHH T ss_conf 9818999888989889999999999--98679908999838888889999999999819953545887----89999999 Q ss_pred HHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCC----EEEEE-CCCHHHHHHHHHCCCEEEEEEEEEEECC Q ss_conf 999589985699932588988056799999999999972698----49997-4875797664306885899999996099 Q gi|254780750|r 723 ILNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRC----RGLLA-THFHELTDLSKSLKRFHNATLQVSDSNE 797 (920) Q Consensus 723 IL~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~----~~lfa-THy~eL~~l~~~~~~v~n~~~~~~~~~~ 797 (920) .+......-+||+|=-||.- .|-.-+. -..+ +.+...+ +++=+ ++..++.+....+..+. -+ T Consensus 148 ~~~~~~~~DvilIDTAGR~~--~d~~lm~-el~~-~~~~~~p~~~~Lvldas~~~~~~~~~~~~f~~~~---------i~ 214 (270) T PRK06731 148 YFKEEARVDYILIDTAGKNY--RASETVE-EMIE-TMGQVEPDYICLTLSASMKSKDMIEIITNFKDIH---------ID 214 (270) T ss_pred HHHHHCCCCEEEEECCCCCC--CCHHHHH-HHHH-HHCCCCCCEEEEEEECCCCHHHHHHHHHHHCCCC---------CC T ss_conf 99997699999997999871--4699999-9999-8606389879999868777699999999807799---------98 Q ss_pred EEEEEEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 2778777744789887789999982999899999999999998 Q gi|254780750|r 798 GIIFLHKVIPGIADHSYGIQVGKLAGLPNTVISRAYDILKTFE 840 (920) Q Consensus 798 ~i~flykl~~G~~~~Sygi~vA~laG~p~~vi~~A~~~~~~le 840 (920) +++++ |+ +|-+.---.+.++...|+|=..+-.-+.+-+.+| T Consensus 215 gvIlT-Kl-D~ta~gG~als~~~~~~~PI~fig~Ge~VpeDi~ 255 (270) T PRK06731 215 GIVFT-KF-DETASSGELLKIPAVSSAPIVLMTDGQDVKKNIH 255 (270) T ss_pred EEEEE-CC-CCCCCCCHHHHHHHHHCCCEEEEECCCCCHHHHH T ss_conf 89996-53-5899772999999998859799945999702141 No 434 >TIGR00634 recN DNA repair protein RecN; InterPro: IPR004604 DNA repair protein RecN is thought to be DNA damage inducible and involved in recombinational processes. The N-terminal region of most of the bacterial RecN proteins sequenced to date contains an ATP/GTP binding domain within an SMC-like motif. SMC-like domains are involved in chromosomal scaffolding and segregation. It is possible that the function of RecN in homologous recombination is either structural or enzymatic or both. RecN may be involved in the proper positioning of the recombining segments of DNA, ensuring normal recombination. The observation that inactivation of this gene leads to a decreased transformation efficiency, as well as increased sensitivity to DNA-damaging agents, may be due to some defect in chromosomal partitioning or positioning during these recombination-dependent processes . The protein may function presynaptically to process double-stranded breaks to produce 3 single-stranded DNA intermediates during recombination. ; GO: 0005524 ATP binding, 0006281 DNA repair, 0006310 DNA recombination. Probab=82.80 E-value=3.2 Score=20.10 Aligned_cols=119 Identities=18% Similarity=0.297 Sum_probs=66.8 Q ss_pred CCHHHHHHHHHHHHHHHC--CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECCCHHHHHHHHHCCCEE Q ss_conf 532899999999999958--9985699932588988056799999999999972-6984999748757976643068858 Q gi|254780750|r 710 RSTFMVEMIETASILNQA--TNQSFVILDEIGRGTATLDGLSIAWATIEYLHET-NRCRGLLATHFHELTDLSKSLKRFH 786 (920) Q Consensus 710 ~STF~vEm~e~~~IL~~a--t~~SLVllDElGrGTst~DG~aiA~aile~l~~~-~~~~~lfaTHy~eL~~l~~~~~~v~ 786 (920) .|-|| .=++.||... .+-.=.|+||+=-|=|-.- |.+|-++|.+- .++=+|+.||.+.++.+++.+=.|. T Consensus 471 LSRv~---LAL~~~ls~~~~~~~~TliFDEvDvGvsG~~----A~~ia~~L~~LS~~~QV~cvTHlp~vAA~AD~h~~v~ 543 (605) T TIGR00634 471 LSRVM---LALKVVLSSSDPAAVTTLIFDEVDVGVSGKT----AQAIAKKLAQLSERKQVLCVTHLPQVAAHADAHLKVE 543 (605) T ss_pred HHHHH---HHHHHHCCCCCCCCCCCEEEEEEECCCCHHH----HHHHHHHHHHHHHHCCEEEECCCHHHHHHCCCCEEEE T ss_conf 99999---9999854234656787127631641764689----9999999999853095899818379897248814677 Q ss_pred EEEEE---EEEE-----CCEEEEEEEEEECCCCC-CHHHHHHHHC-C--CCHHHHHHHHHHHH Q ss_conf 99999---9960-----99277877774478988-7789999982-9--99899999999999 Q gi|254780750|r 787 NATLQ---VSDS-----NEGIIFLHKVIPGIADH-SYGIQVGKLA-G--LPNTVISRAYDILK 837 (920) Q Consensus 787 n~~~~---~~~~-----~~~i~flykl~~G~~~~-Sygi~vA~la-G--~p~~vi~~A~~~~~ 837 (920) -.... .+.. .|+.|. =++++ .+.. -===+||||. | .-+.-+..|+++++ T Consensus 544 K~~~neniadPtnPqldeD~~T~-~~v~~-L~~~G~rV~ElARllaG~~~s~~~la~A~ELL~ 604 (605) T TIGR00634 544 KELLNENIADPTNPQLDEDGRTA-TRVRP-LSSEGERVAELARLLAGLEKSDLALAHAQELLE 604 (605) T ss_pred EEEECCCCCCCCCCCCCCCCCCC-CEEEC-HHCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHC T ss_conf 54200114677887667878862-00220-000263689999972576014888999998740 No 435 >KOG0734 consensus Probab=82.67 E-value=3.2 Score=20.06 Aligned_cols=43 Identities=37% Similarity=0.451 Sum_probs=19.2 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEEC Q ss_conf 99967784407899999999999997198530353206822105676523 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGS 702 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa 702 (920) .++|||-.-||+-+-|.||= - .-||==+|.=+-||-+|.-+|| T Consensus 340 VLLvGPPGTGKTlLARAvAG-----E--A~VPFF~~sGSEFdEm~VGvGA 382 (752) T KOG0734 340 VLLVGPPGTGKTLLARAVAG-----E--AGVPFFYASGSEFDEMFVGVGA 382 (752) T ss_pred EEEECCCCCCHHHHHHHHHC-----C--CCCCEEECCCCCHHHHHHCCCH T ss_conf 38768999755699998605-----5--6897474166204454220148 No 436 >COG4167 SapF ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms] Probab=82.31 E-value=1.9 Score=21.70 Aligned_cols=24 Identities=25% Similarity=0.407 Sum_probs=12.9 Q ss_pred CCCCCCHHHHHHCCCCCCHHHHHH Q ss_conf 877641278763012343378999 Q gi|254780750|r 307 SGSREQSLLKTIDYSITGAGGRLF 330 (920) Q Consensus 307 ~g~~~gSL~~~Ln~t~T~~G~RlL 330 (920) +|+.+.+|-..|---.-|-|.|.| T Consensus 48 NGSGKSTLakMlaGmi~PTsG~il 71 (267) T COG4167 48 NGSGKSTLAKMLAGMIEPTSGEIL 71 (267) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEE T ss_conf 997475899998355589885487 No 437 >pfam08497 Radical_SAM_N Radical SAM N-terminal. This domain tends to occur to the N-terminus of the pfam04055 domain in hypothetical bacterial proteins. Probab=82.01 E-value=2 Score=21.57 Aligned_cols=41 Identities=24% Similarity=0.401 Sum_probs=28.5 Q ss_pred HHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHH-HHHCCCEEEEEEECC Q ss_conf 8618189107889988876265617669999999-998898599995028 Q gi|254780750|r 64 RCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQK-LIKIGHRIAICEQIE 112 (920) Q Consensus 64 ~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~-Lv~~GykVaiveQ~E 112 (920) ..||+.|...+.+.++ |..|... +.| |..+||||||+-|=. T Consensus 15 d~lDvilVtGDAYVDH--PsFG~Ai------IgR~Le~~GyrVgIIaQPd 56 (298) T pfam08497 15 DQLDVILVTGDAYVDH--PSFGMAI------IGRVLEAQGFRVGIIAQPD 56 (298) T ss_pred CCCCEEEEECCCCCCC--CCHHHHH------HHHHHHHCCCEEEEEECCC T ss_conf 7578899947622568--5306899------9999998696598971899 No 438 >COG3172 NadR Predicted ATPase/kinase involved in NAD metabolism [Coenzyme metabolism] Probab=81.84 E-value=1.3 Score=23.02 Aligned_cols=23 Identities=35% Similarity=0.364 Sum_probs=19.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHH Q ss_conf 39999677844078999999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~ 673 (920) +...|+||-.+||||+.+++|-+ T Consensus 9 K~VailG~ESsGKStLv~kLA~~ 31 (187) T COG3172 9 KTVAILGGESSGKSTLVNKLANI 31 (187) T ss_pred EEEEEECCCCCCHHHHHHHHHHH T ss_conf 13556457665717999999999 No 439 >PRK06217 hypothetical protein; Validated Probab=81.68 E-value=1.4 Score=22.83 Aligned_cols=21 Identities=24% Similarity=0.154 Sum_probs=17.7 Q ss_pred EEEECCCCCCHHHHHHHHHHH Q ss_conf 999677844078999999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~ 673 (920) ++|+|.+.+||||+-|++|-. T Consensus 4 I~i~G~sGsGkSTla~~La~~ 24 (185) T PRK06217 4 IHITGASGSGTTTLGAALAEA 24 (185) T ss_pred EEEECCCCCCHHHHHHHHHHH T ss_conf 999789988789999999997 No 440 >COG1137 YhbG ABC-type (unclassified) transport system, ATPase component [General function prediction only] Probab=81.58 E-value=1.1 Score=23.54 Aligned_cols=33 Identities=42% Similarity=0.570 Sum_probs=25.0 Q ss_pred CCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHH Q ss_conf 7704443456358777766439999677844078999 Q gi|254780750|r 631 SKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFL 667 (920) Q Consensus 631 ~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~l 667 (920) .+..|-||++|.. ..+-+.=+-|||-+||||.- T Consensus 15 ~kr~Vv~~Vsl~v----~~GEiVGLLGPNGAGKTT~F 47 (243) T COG1137 15 KKRKVVNDVSLEV----NSGEIVGLLGPNGAGKTTTF 47 (243) T ss_pred CCEEEEEEEEEEE----CCCCEEEEECCCCCCCEEEE T ss_conf 8705551156887----27858988778888851589 No 441 >PRK00955 hypothetical protein; Provisional Probab=81.28 E-value=2.2 Score=21.30 Aligned_cols=42 Identities=21% Similarity=0.237 Sum_probs=30.1 Q ss_pred HHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 8618189107889988876265617669999999998898599995028 Q gi|254780750|r 64 RCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQKLIKIGHRIAICEQIE 112 (920) Q Consensus 64 ~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~Lv~~GykVaiveQ~E 112 (920) ..||+.+...+.+.++ |..|.+.- -+-|..+||||||+-|=. T Consensus 12 d~lDvIlVtGDAYVDH--PsFG~AiI-----gR~Le~~GyrVgIIaQPd 53 (599) T PRK00955 12 DELDFILVTGDAYVDH--PSFGTAII-----GRVLEAEGFRVGIIAQPN 53 (599) T ss_pred CCCCEEEEECCCCCCC--CCCHHHHH-----HHHHHHCCCEEEEEECCC T ss_conf 7477899947612568--64048999-----999997696598972899 No 442 >TIGR00956 3a01205 Pleiotropic Drug Resistance (PDR) Family protein; InterPro: IPR005285 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family includes transporters, whose physiological function is not yet established. These proteins are thought to confer resistance to the chemicals cycloheximide and sulphomethuron methyl, BFA, azole antifungal agents, other antifungal agents: amorolfine and terbinafine. Some of them could serve as an efflux pump of various antibiotics.. Probab=81.25 E-value=1.4 Score=22.80 Aligned_cols=108 Identities=30% Similarity=0.432 Sum_probs=70.3 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC------------CCEEEEEE-E--CCCCCCCCCCH------ Q ss_conf 996778440789999999999999719853035320682------------21056765-2--37661138532------ Q gi|254780750|r 654 LLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI------------VDKLFSRV-G--SADNLASGRST------ 712 (920) Q Consensus 654 iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~------------~D~IftRi-G--a~D~l~~g~ST------ 712 (920) =|=|.=-+||||+|=. | |+..+.|+ .|.=|.|. | =+.||.-.+|| T Consensus 857 ALMG~SGAGKTTLLn~------L--------a~R~t~GvIT~G~~lVNG~~lD~sF~R~iGYvQQQD~Hl~~~TVREaL~ 922 (1466) T TIGR00956 857 ALMGASGAGKTTLLNV------L--------AERVTTGVITDGDRLVNGRPLDSSFQRSIGYVQQQDLHLETSTVREALR 922 (1466) T ss_pred ECCCCCCCCHHHHHHH------H--------HCCCCCCEEECCEEEECCCCCCCCCCCCCCCEEECCEECCCCHHHHHHH T ss_conf 5157887635788864------4--------3304321770571550884557555744132454221224202889999 Q ss_pred H------------------------HHHHHHHHHHH---------H-------------HCCCCCEEEEECCCCCCCHHH Q ss_conf 8------------------------99999999999---------9-------------589985699932588988056 Q gi|254780750|r 713 F------------------------MVEMIETASIL---------N-------------QATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 713 F------------------------~vEm~e~~~IL---------~-------------~at~~SLVllDElGrGTst~D 746 (920) | .+||.+-|..+ + -|-|++||-|||- ||-.| T Consensus 923 FSA~LRqP~~vs~~EK~~YVe~vI~lLEMe~YaDAvVG~pG~tGLNVEQRKRLTIGVELvAkPkLL~FLDEP---TSGLD 999 (1466) T TIGR00956 923 FSAYLRQPKSVSKSEKMEYVEEVIKLLEMESYADAVVGVPGETGLNVEQRKRLTIGVELVAKPKLLLFLDEP---TSGLD 999 (1466) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHCCHHHCCEEEECCCCCCCCHHHHHHHHHHHHHHHCCCHHEECCCC---CCCHH T ss_conf 999845985678653246888887664101022113515677888732321010334322163200021588---97055 Q ss_pred HHHHHHHHHHHH---HHHCCCEEEEECCCHHHHHHHH Q ss_conf 799999999999---9726984999748757976643 Q gi|254780750|r 747 GLSIAWATIEYL---HETNRCRGLLATHFHELTDLSK 780 (920) Q Consensus 747 G~aiA~aile~l---~~~~~~~~lfaTHy~eL~~l~~ 780 (920) --+ ||||+.-+ .+..++ .|.+=|=+...-+++ T Consensus 1000 SQt-AWsi~~l~RKLad~GQa-ILCTIHQPSA~L~~e 1034 (1466) T TIGR00956 1000 SQT-AWSICKLLRKLADHGQA-ILCTIHQPSAILFEE 1034 (1466) T ss_pred HHH-HHHHHHHHHHHHHCCCE-EEECCCCHHHHHHHH T ss_conf 899-99999999988755983-886043024899998 No 443 >pfam03205 MobB Molybdopterin guanine dinucleotide synthesis protein B. This protein contains a P-loop. Probab=81.17 E-value=1.6 Score=22.31 Aligned_cols=30 Identities=30% Similarity=0.496 Sum_probs=22.4 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 99996778440789999999999999719853 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV 683 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fV 683 (920) ..+|.||.-+||||++|.++ -.+...|.-| T Consensus 2 ~v~i~G~~~sGKttl~~~L~--~~~~~~g~~~ 31 (122) T pfam03205 2 IVLVVGPKDSGKTTLIRKLL--NYLKRRGYRV 31 (122) T ss_pred EEEEECCCCCCHHHHHHHHH--HHHHHCCCCE T ss_conf 79999489998999999999--9999879944 No 444 >COG4938 Uncharacterized conserved protein [Function unknown] Probab=81.15 E-value=2.4 Score=21.08 Aligned_cols=47 Identities=13% Similarity=0.164 Sum_probs=26.6 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 8998569993258898805679999999999997269849997487579 Q gi|254780750|r 727 ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 727 at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) .-+.+||||.+--.--.| .|-+.-.-.+-.+... |-.++..||--.+ T Consensus 258 ~K~~~lIIIEnPEAHLHP-~gQs~lg~llA~~a~~-gvqvVveTHSdhi 304 (374) T COG4938 258 VKKKYLIIIENPEAHLHP-EGQSKLGDLLAELAAR-GVQVVVETHSDHI 304 (374) T ss_pred ECCCCEEEECCCHHHCCC-HHHHHHHHHHHHHHHC-CCEEEEEECHHHH T ss_conf 147708997182451492-1677999999999855-8689998456887 No 445 >CHL00181 cbbX CbbX; Provisional Probab=80.97 E-value=3.7 Score=19.64 Aligned_cols=115 Identities=17% Similarity=0.205 Sum_probs=66.4 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCC--CCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 999677844078999999999999971985303532068221056765237661--138532899999999999958998 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNL--ASGRSTFMVEMIETASILNQATNQ 730 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l--~~g~STF~vEm~e~~~IL~~at~~ 730 (920) ++.+||---||||.-|-+|= |++.+|. +|-. -|.-+..+|=+ .-|++- .-|..++..| .. T Consensus 62 ~vF~GnPGTGKTTVARl~a~--il~~lG~-L~~g---------~vve~~r~dLvg~yvG~Ta-----~kt~~~i~~a-~G 123 (287) T CHL00181 62 MSFTGSPGTGKTTVALKMAD--ILYRLGY-IKKG---------HLITVTRDDLVGQYIGHTA-----PKTKEVLKKA-MG 123 (287) T ss_pred EEEECCCCCCHHHHHHHHHH--HHHHCCC-CCCC---------EEEEECHHHHCCCCCCCCH-----HHHHHHHHHC-CC T ss_conf 88878998679999999999--9998699-5589---------5899535884163535216-----9999999964-59 Q ss_pred CEEEEEC---CCCCCCHHH-HHHHHHHHHHHHHHHCCCEEEEECCCH-HHHHHHHHCCCE Q ss_conf 5699932---588988056-799999999999972698499974875-797664306885 Q gi|254780750|r 731 SFVILDE---IGRGTATLD-GLSIAWATIEYLHETNRCRGLLATHFH-ELTDLSKSLKRF 785 (920) Q Consensus 731 SLVllDE---lGrGTst~D-G~aiA~aile~l~~~~~~~~lfaTHy~-eL~~l~~~~~~v 785 (920) -..+||| |.++.+..| |...--.+++.+-+..+.++++...|. ++-.+....|++ T Consensus 124 GVLfIDEAY~L~~~~~~~dfg~eaidtLl~~me~~~~~lvvI~AGY~~eM~~fl~~NpGL 183 (287) T CHL00181 124 GVLFIDEAYYLYKPDNERDYGAEAIEILLQVMENQRDDLVVIFAGYKDRMDKFYESNPGL 183 (287) T ss_pred CEEEEECHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCCH T ss_conf 879982446535788999837999999999987079988999846789999999859047 No 446 >PRK05917 DNA polymerase III subunit delta'; Validated Probab=80.81 E-value=3.7 Score=19.60 Aligned_cols=105 Identities=20% Similarity=0.223 Sum_probs=54.2 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCC----CCCCCEEEEEEECCCCCCCCCCH-HHHHHHHHHHHH Q ss_conf 4399996778440789999999999999719853035320----68221056765237661138532-899999999999 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAH----IGIVDKLFSRVGSADNLASGRST-FMVEMIETASIL 724 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~----i~~~D~IftRiGa~D~l~~g~ST-F~vEm~e~~~IL 724 (920) .+.+|++||...||+++-+..|-..+ .+ |. |+.... .-| |=. .+- -.|.+. .-+|- ...+. T Consensus 19 ~HAyLf~Gp~G~GK~~~A~~~A~~LL-c~-~~--p~~~~~i~~~~HP-D~~--~i~-----pe~k~~~~~Id~--iR~l~ 84 (290) T PRK05917 19 PSAILLHGQDLSNLSQYAYELASLIL-LE-SS--PEAQYKISQKIHP-DIH--EFF-----PEGKGRLHSIET--PRAIK 84 (290) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHH-CC-CC--CCHHHHHHHCCCC-CEE--EEC-----CCCCCCCCCHHH--HHHHH T ss_conf 60687689998659999999999985-78-99--6168898746899-859--961-----577788786789--99999 Q ss_pred HH------CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECCCH Q ss_conf 95------89985699932588988056799999999999972-698499974875 Q gi|254780750|r 725 NQ------ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET-NRCRGLLATHFH 773 (920) Q Consensus 725 ~~------at~~SLVllDElGrGTst~DG~aiA~aile~l~~~-~~~~~lfaTHy~ 773 (920) +. .+.+=.+|+||.-+=|.. =|-|.|.-|-+- ..+..++.||.. T Consensus 85 ~~i~~~p~~g~~KV~IId~Ad~Mn~~-----AaNALLKtLEEPP~~tvfILit~~~ 135 (290) T PRK05917 85 KQIWIHPYEANYKIYIIHEADRMTLD-----AISAFLKVLEDPPKHSVIILTSAKP 135 (290) T ss_pred HHHCCCCCCCCCEEEEECCHHHHCHH-----HHHHHHHHHCCCCCCEEEEEEECCH T ss_conf 99641864688269997567763899-----9999999734798785999986992 No 447 >PRK08451 DNA polymerase III subunits gamma and tau; Validated Probab=80.76 E-value=3.7 Score=19.59 Aligned_cols=12 Identities=25% Similarity=0.373 Sum_probs=5.8 Q ss_pred CCCHHHHHHHHH Q ss_conf 882220079999 Q gi|254780750|r 49 GDFYELFFDDAL 60 (920) Q Consensus 49 G~FYE~f~~DA~ 60 (920) -.|=|+-|.+.. T Consensus 11 ~~F~evIGQe~i 22 (523) T PRK08451 11 KHFDELIGQESV 22 (523) T ss_pred CCHHHCCCCHHH T ss_conf 965440494999 No 448 >COG4639 Predicted kinase [General function prediction only] Probab=80.72 E-value=2.3 Score=21.10 Aligned_cols=77 Identities=18% Similarity=0.239 Sum_probs=41.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCC-----HHHHHHHHHCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHH Q ss_conf 5679999999999997269849997487-----57976643068858999999960992778777744789887789999 Q gi|254780750|r 745 LDGLSIAWATIEYLHETNRCRGLLATHF-----HELTDLSKSLKRFHNATLQVSDSNEGIIFLHKVIPGIADHSYGIQVG 819 (920) Q Consensus 745 ~DG~aiA~aile~l~~~~~~~~lfaTHy-----~eL~~l~~~~~~v~n~~~~~~~~~~~i~flykl~~G~~~~Sygi~vA 819 (920) .|-..+++..++..+.+.+-..+=|||- ..|.+|+..+.- .-+.. .| .+ +++.--| T Consensus 52 ~~~~~~l~~~l~qrl~~Gk~tiidAtn~rr~~r~~l~~La~~y~~-~~~~i---------vf--dt-------p~~~c~a 112 (168) T COG4639 52 ELVWDILYKQLEQRLRRGKFTIIDATNLRREDRRKLIDLAKAYGY-KIYAI---------VF--DT-------PLELCLA 112 (168) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHCC-EEEEE---------EE--EC-------CHHHHHH T ss_conf 999999999999999739768997545899999999999998397-68999---------98--18-------8999987 Q ss_pred H----HCCCCHHHHHHHHHH-HHHHH Q ss_conf 9----829998999999999-99998 Q gi|254780750|r 820 K----LAGLPNTVISRAYDI-LKTFE 840 (920) Q Consensus 820 ~----laG~p~~vi~~A~~~-~~~le 840 (920) | -.-+|++||.|+-+. ...+| T Consensus 113 RNk~~~Rqv~~~VI~r~~r~~~~~l~ 138 (168) T COG4639 113 RNKLRERQVPEEVIPRMLRETIDLLE 138 (168) T ss_pred HHHCCCHHCCHHHHHHHHHHHHHHCC T ss_conf 74044001798999999998775405 No 449 >COG3911 Predicted ATPase [General function prediction only] Probab=80.66 E-value=2.1 Score=21.38 Aligned_cols=91 Identities=25% Similarity=0.286 Sum_probs=42.6 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHH--HHHC Q ss_conf 43999967784407899999999999997198530353206822105676523766113853289999999999--9958 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASI--LNQA 727 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~I--L~~a 727 (920) .++.||||.-.+||||+|+. ||-.|.-+- +.+-=.++ ..=++.|..+-=... -+|..|+.-.... -+++ T Consensus 9 ~~~fIltGgpGaGKTtLL~a------La~~Gfatv-ee~~r~ii-~~es~~gg~~lPW~D-~~afael~~~~~l~q~r~~ 79 (183) T COG3911 9 HKRFILTGGPGAGKTTLLAA------LARAGFATV-EEAGRDII-ALESAQGGTALPWTD-PGAFAELVGLQRLRQTRSA 79 (183) T ss_pred CEEEEEECCCCCCHHHHHHH------HHHCCCEEE-CCCHHHHH-HHHHHCCCCCCCCCC-HHHHHHHHHHHHHHHHHCC T ss_conf 33899837999768999999------997586350-13416689-998754897677647-6899999999999986401 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9985699932588988056799999999999 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYL 758 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l 758 (920) -..++|.+| | +.-|. .|-+.+| T Consensus 80 ~~~~~vFfD---R--~~~da----~a~l~~l 101 (183) T COG3911 80 AVGGRVFFD---R--GPPDA----LAYLRFL 101 (183) T ss_pred CCCCCEEEC---C--CCHHH----HHHHHHH T ss_conf 336713541---5--72779----9999984 No 450 >PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional Probab=80.58 E-value=3.1 Score=20.17 Aligned_cols=35 Identities=23% Similarity=0.382 Sum_probs=28.8 Q ss_pred CCCEEEEEECCCCC-CHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 66439999677844-07899999999999997198530 Q gi|254780750|r 648 NSGKLWLLTGPNMG-GKSTFLRQNALIVIMAQMGSYVP 684 (920) Q Consensus 648 ~~~~~~iiTGpNmg-GKSt~lRqval~vilAQiG~fVP 684 (920) ...++++||++..| ||||. .+.|...+||+|.-|= T Consensus 529 ~~~kvi~vTS~~pgEGKSt~--a~nLA~~~A~~G~rvL 564 (726) T PRK09841 529 TENNILMITGATPDSGKTFV--SSTLAAVIAQSDQKVL 564 (726) T ss_pred CCCCEEEEECCCCCCCHHHH--HHHHHHHHHHCCCEEE T ss_conf 88868999779999977999--9999999984799599 No 451 >TIGR01187 potA polyamine ABC transporter, ATP-binding protein; InterPro: IPR005893 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the spermidine/putrescine ABC transporter, ATP binding subunit in bacteria and its equivalents in archaea. This transport system belongs to the larger ATP-Binding Cassette (ABC) transporter superfamily. Polyamines like spermidine and putrescine play a vital role in cell proliferation, differentiation, and ion homeostasis. The concentration of polyamines within the cell are regulated by biosynthesis, degradation and transport (uptake and efflux included).; GO: 0015417 polyamine-transporting ATPase activity, 0015846 polyamine transport, 0016020 membrane. Probab=80.16 E-value=1 Score=23.72 Aligned_cols=16 Identities=13% Similarity=0.233 Sum_probs=7.3 Q ss_pred CHHHHHHHHHHHHHHC Q ss_conf 4567898999999740 Q gi|254780750|r 340 DSTKINTRLDSINFFI 355 (920) Q Consensus 340 d~~~I~~R~daVe~l~ 355 (920) +.++|..|-..+=.++ T Consensus 73 ~~~ei~~RV~e~L~~V 88 (331) T TIGR01187 73 PRAEIKPRVKEALRLV 88 (331) T ss_pred CHHHHHHHHHHHHHHC T ss_conf 8566889999997421 No 452 >PRK13853 type IV secretion system protein VirB4; Provisional Probab=80.06 E-value=1.1 Score=23.60 Aligned_cols=41 Identities=24% Similarity=0.338 Sum_probs=25.1 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCE Q ss_conf 64399996778440789999999999999719853035320682210 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDK 695 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~ 695 (920) -++. +|.||.-+||||+ +..+|||..=|-++..+++=.||+ T Consensus 426 ~GHt-~I~G~TGsGKTtl-----l~fL~aq~~ky~~~~~~~~~~fDk 466 (789) T PRK13853 426 VGMT-AIFGPIGRGKTTL-----MTFILAMLEQSMVDRAGAVVFFDK 466 (789) T ss_pred CCCE-EEECCCCCCHHHH-----HHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 7744-8878999988999-----999999999742235770899958 No 453 >cd03274 ABC_SMC4_euk Eukaryotic SMC4 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (fo Probab=80.00 E-value=1.5 Score=22.51 Aligned_cols=58 Identities=19% Similarity=0.307 Sum_probs=38.1 Q ss_pred HHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECCCHHHHHHHHHCCCE Q ss_conf 99589985699932588988056799999999999972-698499974875797664306885 Q gi|254780750|r 724 LNQATNQSFVILDEIGRGTATLDGLSIAWATIEYLHET-NRCRGLLATHFHELTDLSKSLKRF 785 (920) Q Consensus 724 L~~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~-~~~~~lfaTHy~eL~~l~~~~~~v 785 (920) +...-|-.+.++||+- .+.|-.-+ .-+.+++-+. ..+-.++.||-...-+.++.+=+| T Consensus 144 i~~~kPsPFyvLDEVD---AALD~~Nv-~r~a~~ike~s~~aQFIVIThr~~tme~AD~L~GV 202 (212) T cd03274 144 LHHYKPTPLYVMDEID---AALDFRNV-SIVANYIKERTKNAQFIVISLRNNMFELADRLVGI 202 (212) T ss_pred HHCCCCCCEEEEECCC---CCCCHHHH-HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHEEEE T ss_conf 9642899558973564---24678889-99999999847797699998588999851517989 No 454 >PRK10416 cell division protein FtsY; Provisional Probab=79.93 E-value=4 Score=19.40 Aligned_cols=23 Identities=30% Similarity=0.284 Sum_probs=17.6 Q ss_pred CCEEEEEECCCCCCHHHHHHHHH Q ss_conf 64399996778440789999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqva 671 (920) ...++++.|.|.+||||.+--+| T Consensus 294 ~P~VIl~vGvNG~GKTTTigKLA 316 (499) T PRK10416 294 TPFVILMVGVNGVGKTTTIGKLA 316 (499) T ss_pred CCEEEEEECCCCCCHHHHHHHHH T ss_conf 98799997478787898999999 No 455 >CHL00176 ftsH cell division protein; Validated Probab=79.77 E-value=4 Score=19.36 Aligned_cols=16 Identities=25% Similarity=0.277 Sum_probs=9.2 Q ss_pred EEECCCCCHHHHHHHH Q ss_conf 4000046805887631 Q gi|254780750|r 612 FIVKDGRHPIVEKTLK 627 (920) Q Consensus 612 l~i~~gRHPviE~~l~ 627 (920) +.+.++-|-++-.+++ T Consensus 435 vAyHEAGHAlVa~ll~ 450 (631) T CHL00176 435 IAYHEVGHAIVGTLLP 450 (631) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9999999999999778 No 456 >TIGR02788 VirB11 P-type DNA transfer ATPase VirB11; InterPro: IPR014155 This entry contains VirB11, a protein that is found in the vir locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for DNA transfer . VirB11 is believed to be an ATPase and is a homologue of the P-like conjugation system TrbB protein and the Flp pilus system protein TadA .. Probab=79.70 E-value=1.7 Score=22.14 Aligned_cols=32 Identities=28% Similarity=0.463 Sum_probs=22.2 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCC Q ss_conf 99996778440789999999999999719853035320682 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGI 692 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~ 692 (920) -+||.|-=.||||||+|++ +-+ +|.++=-|+| T Consensus 160 nIii~GGTgSGKTTf~kal-----~~~----IP~~ER~iTI 191 (328) T TIGR02788 160 NIIISGGTGSGKTTFLKAL-----VKE----IPKDERLITI 191 (328) T ss_pred EEEEEECCCCHHHHHHHHH-----HHC----CCCCCCEEEE T ss_conf 1999906897189999999-----732----7622527888 No 457 >TIGR00958 3a01208 antigen peptide transporter 2; InterPro: IPR005293 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Proteins of this family are involved in the transport of antigens from the cytoplasm to a membrane-bound compartment for association with MHC class I molecules.; GO: 0005215 transporter activity, 0006810 transport. Probab=79.25 E-value=4.1 Score=19.25 Aligned_cols=75 Identities=24% Similarity=0.380 Sum_probs=40.2 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH------HCCCCCCCEEE--EEEE------------CCCCCCCC Q ss_conf 4399996778440789999999999999719853035------32068221056--7652------------37661138 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS------YAHIGIVDKLF--SRVG------------SADNLASG 709 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~------~a~i~~~D~If--tRiG------------a~D~l~~g 709 (920) +.+-=|=|||-|||||.. .+|-+ .|.|-. .--|.=+|+-| +.|+ -.|||.=| T Consensus 559 G~vvALVGPsGsGKStva------aLL~n--~Y~Pt~G~vLlDg~Pl~~y~H~YLH~~V~~VgQEPvLf~gSvr~NIaYG 630 (770) T TIGR00958 559 GEVVALVGPSGSGKSTVA------ALLQN--LYQPTGGQVLLDGVPLSQYDHHYLHRQVALVGQEPVLFSGSVRENIAYG 630 (770) T ss_pred CEEEEEECCCCCCHHHHH------HHHHH--CCCCCCCEEEECCCCHHHHCCEECCCEEEEEECCCEECCCCHHHHHHHC T ss_conf 625998658998399999------99985--5789865687768461332650100025433205430246316656516 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 532899999999999958998569 Q gi|254780750|r 710 RSTFMVEMIETASILNQATNQSFV 733 (920) Q Consensus 710 ~STF~vEm~e~~~IL~~at~~SLV 733 (920) ...-- -|.|+.+.=+.|..+.+| T Consensus 631 L~~~~-T~~~~~aaA~~a~ahdFI 653 (770) T TIGR00958 631 LTKTP-TDEEVTAAAKAANAHDFI 653 (770) T ss_pred CCCCC-CHHHHHHHHHHHCCCCCC T ss_conf 89998-278999999861376443 No 458 >cd00227 CPT Chloramphenicol (Cm) phosphotransferase (CPT). Cm-inactivating enzyme; modifies the primary (C-3) hydroxyl of the antibiotic. Related structurally to shikimate kinase II. Probab=78.93 E-value=2.2 Score=21.25 Aligned_cols=23 Identities=30% Similarity=0.562 Sum_probs=19.7 Q ss_pred CCEEEEEECCCCCCHHHHHHHHH Q ss_conf 64399996778440789999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqva 671 (920) +++++++.|+-.+||||.-|.+- T Consensus 1 ~G~II~LNG~SSSGKSsiAraLQ 23 (175) T cd00227 1 TGRIIILNGGSSAGKSSIARALQ 23 (175) T ss_pred CCEEEEEECCCCCCHHHHHHHHH T ss_conf 97499986899898899999999 No 459 >PRK11331 5-methylcytosine-specific restriction enzyme subunit McrB; Provisional Probab=78.80 E-value=2.5 Score=20.92 Aligned_cols=84 Identities=23% Similarity=0.324 Sum_probs=39.5 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEE--------EEEC-CCCCCCCCCHHHHHHHHHH Q ss_conf 399996778440789999999999999719853035320682210567--------6523-7661138532899999999 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFS--------RVGS-ADNLASGRSTFMVEMIETA 721 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~Ift--------RiGa-~D~l~~g~STF~vEm~e~~ 721 (920) ...|++||-..|||=.-|-.|-.. |+. =-| +..++-.|..-++ |-+. ...+.. -.|+ ++.+-| T Consensus 195 knvIL~G~pGtGKT~lAk~lA~~l-~g~---~~~-~rv~~VqfhpsysYEDfi~Gyrp~~~gf~~~~--G~f~-~~~~~A 266 (459) T PRK11331 195 KNIILQGPPGVGKTFVARRLAYLL-TGE---KAP-QRVNMVQFHQSYSYEDFIQGYRPNGVGFRRKD--GIFY-NFCQQA 266 (459) T ss_pred CCEEEECCCCCCHHHHHHHHHHHH-HCC---CCC-CCEEEEEECCCCCHHHHHCCCCCCCCCCEECC--CHHH-HHHHHH T ss_conf 827965899988789999999997-078---877-84689983588661787646056888613268--3699-999999 Q ss_pred HHHHHCCCCCEEEEECCCCCCCH Q ss_conf 99995899856999325889880 Q gi|254780750|r 722 SILNQATNQSFVILDEIGRGTAT 744 (920) Q Consensus 722 ~IL~~at~~SLVllDElGrGTst 744 (920) . ++....-+++|||+-||-=. T Consensus 267 ~--~~p~~~y~~iideinr~~~~ 287 (459) T PRK11331 267 K--EQPEKKYVFIIDEINRANLS 287 (459) T ss_pred H--HCCCCCEEEEEEHHHCCCHH T ss_conf 8--49898769998432033889 No 460 >pfam06414 Zeta_toxin Zeta toxin. This family consists of several bacterial zeta toxin proteins. Zeta toxin is thought to be part of a postregulational killing system in bacteria. It relies on antitoxin/toxin systems that secure stable inheritance of low and medium copy number plasmids during cell division and kill cells that have lost the plasmid. Probab=78.74 E-value=2.6 Score=20.75 Aligned_cols=24 Identities=25% Similarity=0.214 Sum_probs=20.9 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 664399996778440789999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +.+...++-|++.+|||+..+++- T Consensus 10 ~~Pkai~laG~pGAGKS~~~~~~~ 33 (191) T pfam06414 10 ERPVAVLLGGQPGAGKTELARALL 33 (191) T ss_pred CCCEEEEEECCCCCCHHHHHHHHH T ss_conf 698799995799888899999998 No 461 >TIGR00630 uvra excinuclease ABC, A subunit; InterPro: IPR004602 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products . The UvrC protein contain 4 conserved regions: a central region which interact with UvrB (Uvr domain), a Helix hairpin Helix (HhH) domain important for 5 prime incision of damage DNA and the homology regions 1 and 2 of unknown function. UvrC homology region 2 is specific for UvrC proteins, whereas UvrC homology region 1 is also shared by few other nucleases.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0009380 excinuclease repair complex. Probab=78.73 E-value=2.7 Score=20.68 Aligned_cols=47 Identities=28% Similarity=0.382 Sum_probs=27.2 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH Q ss_conf 14000046805887631028770444345635877776643999967784407899999999999 Q gi|254780750|r 611 NFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVI 675 (920) Q Consensus 611 ~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vi 675 (920) .|.++++|. +-|++ |.-+|=| +.|-.|||+=-|||||++-+|=.-++ T Consensus 644 ~l~~~gA~~----nNLK~------i~v~iPL--------G~~t~iTGVSGSGKSTLind~L~~~~ 690 (956) T TIGR00630 644 VLTLKGARE----NNLKN------ITVSIPL--------GLFTCITGVSGSGKSTLINDTLYPAL 690 (956) T ss_pred EEEEEECCC----CCCCC------CEEEECC--------CEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 999842010----56402------1177407--------71799974458745777999999999 No 462 >PRK07078 hypothetical protein; Validated Probab=78.62 E-value=4.3 Score=19.12 Aligned_cols=80 Identities=24% Similarity=0.301 Sum_probs=44.7 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 66439999677844078999999999999971985303532068221056765237661138532899999999999958 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a 727 (920) .++.+.++.|+-.-||||++..+.- ||..-+.-+|++... .. .+++ .+| | ++. |+ T Consensus 240 ~Eq~lf~l~G~G~NGKStf~~vl~~--lLGdYa~t~~~~tl~--------~~--~~~~----~~~---d---lA~-L~-- 294 (510) T PRK07078 240 SEHALFFLYGTGANGKSVFVNTLAT--ILGDYAANAAMDTFM--------ET--RGDR----HPT---D---LAG-LR-- 294 (510) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHH--HHHHHHCCCCHHHHH--------HC--CCCC----CCH---H---HHH-CC-- T ss_conf 7338999975898847899999999--865663048989996--------34--6899----987---8---883-37-- Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99856999325889880567999999999999 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLSIAWATIEYLH 759 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~aiA~aile~l~ 759 (920) -.-||+.-|.- +|..++.|.+..|. T Consensus 295 -GaRlV~asE~e------eG~rl~Ea~IKqLT 319 (510) T PRK07078 295 -GARFVSAIETE------QGRRWAESKVKQLT 319 (510) T ss_pred -CCEEEEEECCC------CCCCHHHHHHHEEC T ss_conf -98189965488------65004477454301 No 463 >pfam02492 cobW CobW/HypB/UreG, nucleotide-binding domain. This domain is found in HypB, a hydrogenase expression / formation protein, and UreG a urease accessory protein. Both these proteins contain a P-loop nucleotide binding motif. HypB has GTPase activity and is a guanine nucleotide binding protein. It is not known whether UreG binds GTP or some other nucleotide. Both enzymes are involved in nickel binding. HypB can store nickel and is required for nickel dependent hydrogenase expression. UreG is required for functional incorporation of the urease nickel metallocenter. GTP hydrolysis may required by these proteins for nickel incorporation into other nickel proteins. This family of domains also contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans. Probab=78.26 E-value=2 Score=21.57 Aligned_cols=20 Identities=35% Similarity=0.343 Sum_probs=17.3 Q ss_pred EEEEECCCCCCHHHHHHHHH Q ss_conf 99996778440789999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqva 671 (920) +.||||+=-+||||+|+.+. T Consensus 2 v~iitGFLGsGKTTll~~ll 21 (174) T pfam02492 2 VTVLTGFLGSGKTTLLEHLL 21 (174) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 69993488788999999999 No 464 >cd02019 NK Nucleoside/nucleotide kinase (NK) is a protein superfamily consisting of multiple families of enzymes that share structural similarity and are functionally related to the catalysis of the reversible phosphate group transfer from nucleoside triphosphates to nucleosides/nucleotides, nucleoside monophosphates, or sugars. Members of this family play a wide variety of essential roles in nucleotide metabolism, the biosynthesis of coenzymes and aromatic compounds, as well as the metabolism of sugar and sulfate. Probab=78.05 E-value=1.7 Score=22.21 Aligned_cols=22 Identities=32% Similarity=0.292 Sum_probs=18.1 Q ss_pred EEEECCCCCCHHHHHHHHHHHH Q ss_conf 9996778440789999999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIV 674 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~v 674 (920) ..|+||-.+||||+.|.++-.- T Consensus 2 I~I~G~~gsGKsT~a~~l~~~l 23 (69) T cd02019 2 IAITGGSGSGKSTVAKKLAEQL 23 (69) T ss_pred EEEECCCCCCHHHHHHHHHHHH T ss_conf 7883688887258999999995 No 465 >PRK07132 DNA polymerase III subunit delta'; Validated Probab=78.04 E-value=4.5 Score=19.00 Aligned_cols=110 Identities=12% Similarity=0.046 Sum_probs=57.0 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHH-H- Q ss_conf 64399996778440789999999999999719853035320682210567652376611385328999999999999-5- Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILN-Q- 726 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~-~- 726 (920) -.+..|++||...||++..+..|-...=-+.|.-.++.. +.|-.+.... .+++...+ .+|..+ .+.. . T Consensus 19 lsHAYLF~G~~G~Gk~~~a~~~a~~l~~~~~~~~~~~~~----~~~~~~id~~-~~~i~~~~---i~~~i~--~~~~~~~ 88 (303) T PRK07132 19 ISHSFLLKSNYNNDLDESILYFLNKFNNLQSLNLNLEEL----PYNIFLFDIF-DEDLSKEE---FLSAIE--KFSFSSF 88 (303) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCC----CCCCEEECCC-CCCHHHHH---HHHHHH--HHHHCCC T ss_conf 761688678998679999999999972998788875456----5323041332-22001688---999999--9973665 Q ss_pred -CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC-CCEEEEECCCH Q ss_conf -899856999325889880567999999999999726-98499974875 Q gi|254780750|r 727 -ATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETN-RCRGLLATHFH 773 (920) Q Consensus 727 -at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~-~~~~lfaTHy~ 773 (920) .++.-.+|+||.-+-|... |-|.|..|-+-. ....+++|+.. T Consensus 89 ~~~~~Kv~IIdea~~lt~~A-----~NaLLKtLEEPp~~~~fil~t~~~ 132 (303) T PRK07132 89 VSNQKKILIIKNIEKTSNSS-----LNALLKTIEEPSKNTYFLLTTKNI 132 (303) T ss_pred CCCCEEEEEEECHHHCCHHH-----HHHHHHHCCCCCCCEEEEEEECCH T ss_conf 56870699981655339999-----999998703898684899972882 No 466 >cd01124 KaiC KaiC is a circadian clock protein primarily found in cyanobacteria KaiC is a RecA-like ATPase, having both Walker A and Walker B motifs. A related protein is found in archaea. Probab=78.01 E-value=4.5 Score=18.99 Aligned_cols=119 Identities=18% Similarity=0.173 Sum_probs=55.6 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHH-HCCCCCCHHHC---------CCCCC-C------E-EEEEEECCCCCCCCCCHHH Q ss_conf 9996778440789999999999999-71985303532---------06822-1------0-5676523766113853289 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMA-QMGSYVPASYA---------HIGIV-D------K-LFSRVGSADNLASGRSTFM 714 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilA-QiG~fVPA~~a---------~i~~~-D------~-IftRiGa~D~l~~g~STF~ 714 (920) -||+||--+||||+.-|.+....-. .-|.||-.++. .++.- + . .+..+.. +....+..... T Consensus 2 tLi~G~pGsGKT~~a~qfl~~~a~~ge~~lyis~eE~~~~l~~~~~~~g~d~~~~~~~g~l~i~~~~~-~~~~~~~~~~~ 80 (187) T cd01124 2 TLLSGGPGTGKTTFALQFLYAGLARGEPGLYVTLEESPEELIENAESLGWDLERLEDEGLLAIVDADP-DEIGPAESSLR 80 (187) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHCCCCHHHHHHCCCEEEEECCC-CCCCCHHHHHH T ss_conf 15876899999999999999998769978999950799999999998399858986458568996262-00220333236 Q ss_pred HHH-HHHHHHHHHCCCCCEEEEECCC---CCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 999-9999999958998569993258---898805679999999999997269849997487579 Q gi|254780750|r 715 VEM-IETASILNQATNQSFVILDEIG---RGTATLDGLSIAWATIEYLHETNRCRGLLATHFHEL 775 (920) Q Consensus 715 vEm-~e~~~IL~~at~~SLVllDElG---rGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL 775 (920) .+. .+...+++. ..-..|++|=+. .+. ..+-....+..+.+|.+ .+|.++|+++.+.. T Consensus 81 ~~l~~~i~~~i~~-~~~~~vVIDSi~~l~~~~-~~~~~~~~~~l~~~l~~-~~~t~ll~~e~~~~ 142 (187) T cd01124 81 LELIQRLKDAIEE-FKAKRVVIDSVSGLLLME-QSTARLEIRRLLFALKR-FGVTTLLTSEQSGL 142 (187) T ss_pred HHHHHHHHHHHHH-HCCCEEEECCHHHHHHCC-HHHHHHHHHHHHHHHHH-CCCCEEEEEEECCC T ss_conf 7899999999998-499899994868875256-66689999999999997-69968999974256 No 467 >TIGR03574 selen_PSTK L-seryl-tRNA(Sec) kinase, archaeal. Members of this protein are L-seryl-tRNA(Sec) kinase. This enzyme is part of a two-step pathway in Eukaryota and Archaea for performing selenocysteine biosynthesis by changing serine misacylated on selenocysteine-tRNA to selenocysteine. This enzyme performs the first step, phosphorylation of the OH group of the serine side chain. This family represents archaeal proteins with this activity. Probab=77.74 E-value=4.6 Score=18.94 Aligned_cols=29 Identities=31% Similarity=0.279 Sum_probs=21.7 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9996778440789999999999999719853 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYV 683 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fV 683 (920) ++|||--.+||||+-|+ |.-.+.+-|.-| T Consensus 2 ivl~GlP~SGKSt~a~~--L~~~l~~~~~~~ 30 (249) T TIGR03574 2 IILTGLPGVGKSTFSKE--LSKKLSEKNIDN 30 (249) T ss_pred EEEECCCCCCHHHHHHH--HHHHHHHHCCCE T ss_conf 78967899989999999--999999829965 No 468 >COG4650 RtcR Sigma54-dependent transcription regulator containing an AAA-type ATPase domain and a DNA-binding domain [Transcription / Signal transduction mechanisms] Probab=77.68 E-value=3.8 Score=19.51 Aligned_cols=72 Identities=33% Similarity=0.474 Sum_probs=46.6 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHH-H-CCCCCC----------HHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 99996778440789999999999999-7-198530----------35320682210567652376611385328999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMA-Q-MGSYVP----------ASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIE 719 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilA-Q-iG~fVP----------A~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e 719 (920) -+++.||..+|||-+-|-|--+-.-- | .|-||- |-+|-+|-|..-|| ||.. + T Consensus 210 p~ll~gptgagksflarriyelk~arhq~sg~fvevncatlrgd~amsalfghvkgaft--ga~~--------------~ 273 (531) T COG4650 210 PILLNGPTGAGKSFLARRIYELKQARHQFSGAFVEVNCATLRGDTAMSALFGHVKGAFT--GARE--------------S 273 (531) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCHHHHHHHHHHHCCCC--CCHH--------------H T ss_conf 76864688743668999999999888753786278863000475288998764001314--6334--------------5 Q ss_pred HHHHHHHCCCCCEEEEECCCC Q ss_conf 999999589985699932588 Q gi|254780750|r 720 TASILNQATNQSFVILDEIGR 740 (920) Q Consensus 720 ~~~IL~~at~~SLVllDElGr 740 (920) -+-.|+.|+ --+..+||+|- T Consensus 274 r~gllrsad-ggmlfldeige 293 (531) T COG4650 274 REGLLRSAD-GGMLFLDEIGE 293 (531) T ss_pred HHHHHCCCC-CCEEEHHHHHH T ss_conf 544243377-86575674332 No 469 >cd03114 ArgK-like The function of this protein family is unkown. The protein sequences are similar to the ArgK protein in E. coli. ArgK protein is a membrane ATPase which is required for transporting arginine, ornithine and lysine into the cells by the arginine and ornithine (AO system) and lysine, arginine and ornithine (LAO) transport systems. Probab=77.65 E-value=4.6 Score=18.92 Aligned_cols=87 Identities=23% Similarity=0.295 Sum_probs=50.9 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHCCCCCC--H--------------HHCC---CCCCCEEEEEEECCCCCCCCCCHHH Q ss_conf 9967784407899999999999997198530--3--------------5320---6822105676523766113853289 Q gi|254780750|r 654 LLTGPNMGGKSTFLRQNALIVIMAQMGSYVP--A--------------SYAH---IGIVDKLFSRVGSADNLASGRSTFM 714 (920) Q Consensus 654 iiTGpNmgGKSt~lRqval~vilAQiG~fVP--A--------------~~a~---i~~~D~IftRiGa~D~l~~g~STF~ 714 (920) =||||--+||||++-..+-.. .+-|--|. | +..+ ++.-+++|.|=-++-....|.|-.. T Consensus 3 GitG~pGaGKStLi~~l~~~~--~~~g~~VaVlavDPsS~~sgGalLGDRiRm~~~~~~~~vfiRs~atrg~~ggla~~~ 80 (148) T cd03114 3 GITGVPGAGKSTLIDALITAL--RARGKRVAVLAIDPSSPFSGGAILGDRIRMERHASDPGVFIRSLATRGFLGGLSRAT 80 (148) T ss_pred EECCCCCCCHHHHHHHHHHHH--HHCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHH T ss_conf 625899787899999999999--978983799996888786686203235453441579983686346666542046889 Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCCCCHHH Q ss_conf 99999999999589985699932588988056 Q gi|254780750|r 715 VEMIETASILNQATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 715 vEm~e~~~IL~~at~~SLVllDElGrGTst~D 746 (920) .|+. ..|+. ..--+|++.-.|-|.|..+ T Consensus 81 ~~~i---~~l~~-~g~D~IiIETvGvGQse~~ 108 (148) T cd03114 81 PEVI---RVLDA-AGFDVIIVETVGVGQSEVD 108 (148) T ss_pred HHHH---HHHHH-CCCCEEEEECCCCCCCHHH T ss_conf 9999---99997-5999899974877756026 No 470 >PRK08727 hypothetical protein; Validated Probab=77.62 E-value=4.6 Score=18.92 Aligned_cols=94 Identities=17% Similarity=0.164 Sum_probs=52.0 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHH-CCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 643999967784407899999999999997-1985303532068221056765237661138532899999999999958 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQ-MGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQA 727 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQ-iG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~a 727 (920) ..+.+.|.||-.+|||-++..+|-....+. .-.|+|++.+.-.. . .+|... T Consensus 40 ~~~~lyl~G~~GsGKTHLl~a~~~~~~~~~~~~~yl~l~~~~~~~----------------------~------~~l~~l 91 (233) T PRK08727 40 SSDWLYLSGPAGTGKTHLALSLCAAAEQAGRSSAYLPLQAAAGRL----------------------R------DALEAL 91 (233) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHH----------------------H------HHHHHH T ss_conf 889899989999988999999999998279972884478853202----------------------5------677531 Q ss_pred CCCCEEEEECCCCCCCHHH-HHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 9985699932588988056-79999999999997269849997487 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLD-GLSIAWATIEYLHETNRCRGLLATHF 772 (920) Q Consensus 728 t~~SLVllDElGrGTst~D-G~aiA~aile~l~~~~~~~~lfaTHy 772 (920) ...++|.+|-+=.=..-.+ =.+ .+.....+.+. ++..+++..- T Consensus 92 e~~~ll~iDDid~i~g~~~~e~a-LFhL~N~~~~~-~~~ll~ts~~ 135 (233) T PRK08727 92 EGRSLVALDGVDSIAGQREDEVA-LFDFHNRARAA-GITLLYTARQ 135 (233) T ss_pred CCCCEEEEECCHHCCCCHHHHHH-HHHHHHHHHHC-CCEEEEECCC T ss_conf 03897898550112698279999-99999999861-9838997798 No 471 >COG4598 HisP ABC-type histidine transport system, ATPase component [Amino acid transport and metabolism] Probab=77.61 E-value=2.3 Score=21.16 Aligned_cols=26 Identities=38% Similarity=0.595 Sum_probs=19.9 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 66439999677844078999999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqval~ 673 (920) ..+.++-|-|.--+|||||||-+-++ T Consensus 30 ~~GdVisIIGsSGSGKSTfLRCiN~L 55 (256) T COG4598 30 NAGDVISIIGSSGSGKSTFLRCINFL 55 (256) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHH T ss_conf 78988999658998626899999863 No 472 >COG4088 Predicted nucleotide kinase [Nucleotide transport and metabolism] Probab=77.39 E-value=2.1 Score=21.50 Aligned_cols=85 Identities=21% Similarity=0.200 Sum_probs=50.7 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHC--CCCCCCEEEEEEECCCCCCCC-CCHHHHHHHHHHHHHHHC Q ss_conf 39999677844078999999999999971985303532--068221056765237661138-532899999999999958 Q gi|254780750|r 651 KLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYA--HIGIVDKLFSRVGSADNLASG-RSTFMVEMIETASILNQA 727 (920) Q Consensus 651 ~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a--~i~~~D~IftRiGa~D~l~~g-~STF~vEm~e~~~IL~~a 727 (920) +++|+||+-.+||||+-|-. .-+|-|=|--||--++ ..++. . --++.+.+. .-+|.-|-.+ ..|..| T Consensus 2 pLiIlTGyPgsGKTtfakeL--ak~L~~~i~~vi~l~kdy~~~i~----~--DEslpi~ke~yres~~ks~~--rlldSa 71 (261) T COG4088 2 PLIILTGYPGSGKTTFAKEL--AKELRQEIWRVIHLEKDYLRGIL----W--DESLPILKEVYRESFLKSVE--RLLDSA 71 (261) T ss_pred CEEEEECCCCCCCHHHHHHH--HHHHHHHHHHCCCCCHHHHHHEE----C--CCCCCHHHHHHHHHHHHHHH--HHHHHH T ss_conf 56998269998801789999--99999720011213201454123----3--13240379999999988899--999998 Q ss_pred CCCCEEEEECCCCCCCHHHHHH Q ss_conf 9985699932588988056799 Q gi|254780750|r 728 TNQSFVILDEIGRGTATLDGLS 749 (920) Q Consensus 728 t~~SLVllDElGrGTst~DG~a 749 (920) -.+-+||.|-+ +-+-|+- T Consensus 72 lkn~~VIvDdt----NYyksmR 89 (261) T COG4088 72 LKNYLVIVDDT----NYYKSMR 89 (261) T ss_pred HCCEEEEEECC----CHHHHHH T ss_conf 63649997063----2888999 No 473 >TIGR01193 bacteriocin_ABC ABC-type bacteriocin transporter; InterPro: IPR005897 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains ABC-type bacteriocin transporter. In general, bacteriocins are agents which are responsible for killing or inhibiting the closely related species or even different strains of the same species. Bacteriocins are encoded by bacterial plasmids. Bacteriocins are named after the species and hence in literature one encounters various names e.g., leucocin from Leuconostic geldium; pedicocin from Pedicoccus acidilactici; sakacin from Lactobacillus sake etc. Peptide bacteriocins are exported across the cytoplasmic membrane by a dedicated ATP-binding cassette (ABC) transporter. These ABC-transporters have an N-terminal peptidase domain that belong to MEROPS peptidase family C39 (clan CA); a central multi-pass transmembrane region and a C-terminal ABC transporter domain. These transporters have dual function: (i) they remove the N-terminal leader peptide from its bacteriocin precursor by cleavage at a Gly-Gly bond and (ii) transport the mature bacteriocin across the cytoplasmic membrane. This represents a novel strategy for secretion of bacterial proteins . Many bacteria are known to regulate diverse physiological processes through this system, such as bioluminescence, regulation of sporulation, virulence factor expression, antibiotics production, competence for genetic transformation, and activation of biofilm formation .; GO: 0008234 cysteine-type peptidase activity, 0019534 toxin transporter activity, 0006810 transport, 0016020 membrane. Probab=77.38 E-value=2.2 Score=21.36 Aligned_cols=141 Identities=23% Similarity=0.336 Sum_probs=75.2 Q ss_pred HHHHHHHCCCCCEECCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 88877525885121047871400004680588763102877044434563587777664399996778440789999999 Q gi|254780750|r 592 LAILAKEQNYCRPIIDNSTNFIVKDGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 592 lA~~a~~~~y~rP~i~~~~~l~i~~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqva 671 (920) +-.++..||||+|.++|= .|.|+.. .-.=|=||=.|||||+.|-. T Consensus 476 ~~~~sy~YG~g~~~L~di-~L~I~~n---------------------------------~k~tiVGmSGSGKsTLaKLL- 520 (710) T TIGR01193 476 VNDVSYSYGYGSNVLSDI-SLTIKKN---------------------------------EKITIVGMSGSGKSTLAKLL- 520 (710) T ss_pred EEEEEEECCCCCCCCCCC-EEEEECC---------------------------------CEEEEECCCCCCHHHHHHHH- T ss_conf 743466447887320264-2365078---------------------------------54899736797489999875- Q ss_pred HHHHHHHCCCCCC-HHHCCCCC-------CCE---------------EEE-------EEECCCCCCCCCCHHHHHHHHHH Q ss_conf 9999997198530-35320682-------210---------------567-------65237661138532899999999 Q gi|254780750|r 672 LIVIMAQMGSYVP-ASYAHIGI-------VDK---------------LFS-------RVGSADNLASGRSTFMVEMIETA 721 (920) Q Consensus 672 l~vilAQiG~fVP-A~~a~i~~-------~D~---------------Ift-------RiGa~D~l~~g~STF~vEm~e~~ 721 (920) +|.|=| |++-+|.+ +|| ||+ =+||..+..+..=-=.+|..|.+ T Consensus 521 -------V~FfePQ~~sG~I~Lng~~l~~iD~h~LRq~INYlPQeP~IF~GsILeNLLlGak~~~~~~~i~~A~~iAEIk 593 (710) T TIGR01193 521 -------VGFFEPQAESGEILLNGISLKDIDRHELRQFINYLPQEPYIFSGSILENLLLGAKENVSQDEILKAVEIAEIK 593 (710) T ss_pred -------HCCCCCCCCCCEEEECCCCHHHCCHHHHCCCCCCCCCCCEEECCCHHHHHHHCCCCCCCHHHHHHHHHHCCCH T ss_conf -------2035899887736527824453373444123355688784512317887650378998989999884020112 Q ss_pred HH----------------------------HHHC--CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 99----------------------------9958--99856999325889880567999999999999726984999748 Q gi|254780750|r 722 SI----------------------------LNQA--TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATH 771 (920) Q Consensus 722 ~I----------------------------L~~a--t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTH 771 (920) .= |-+| |+---.|+||- ||.-|=+- =.=|++-|+....=..+|.-| T Consensus 594 ~DIe~mp~Gy~T~LS~~~~~iSGGQKQRialARaLL~~s~vLilDEs---TSnLD~it-E~KI~~nLl~l~dKTIIFvAH 669 (710) T TIGR01193 594 DDIEKMPLGYQTELSEEGSSISGGQKQRIALARALLTDSKVLILDES---TSNLDTIT-EKKIVENLLNLKDKTIIFVAH 669 (710) T ss_pred HHHHHCCCCCCEEECCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECC---CCHHHHHH-HHHHHHHCCCCCCCCEEEEEE T ss_conf 45831888766000103741151589999999986188867886244---34003788-999998424666762687500 Q ss_pred CHHHHHH Q ss_conf 7579766 Q gi|254780750|r 772 FHELTDL 778 (920) Q Consensus 772 y~eL~~l 778 (920) =-+.+.. T Consensus 670 RL~vA~~ 676 (710) T TIGR01193 670 RLSVAKQ 676 (710) T ss_pred CCCHHHC T ss_conf 0324003 No 474 >cd00071 GMPK Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines. Probab=77.00 E-value=2.1 Score=21.47 Aligned_cols=107 Identities=21% Similarity=0.236 Sum_probs=47.8 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHC-CCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 9999677844078999999999999971-985303532068221056765237661138532899999999999958998 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQM-GSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQATNQ 730 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQi-G~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~at~~ 730 (920) +.+|+||-.+||+|+++.+ +... ..|.-+ =+-=||= ..++=..|...|.|-..+-...++ .+ T Consensus 1 livi~GPSG~GK~tl~~~L-----~~~~~~~~~~~--------vs~TTR~-~r~~E~~G~dY~Fvs~~~F~~~i~---~g 63 (137) T cd00071 1 LIVLSGPSGVGKSTLLKRL-----LEEFDPNFGFS--------VSHTTRK-PRPGEVDGVDYHFVSKEEFERLIE---NG 63 (137) T ss_pred CEEEECCCCCCHHHHHHHH-----HHCCCCCEEEE--------EEEECCC-CCCCCCCCCEEEEECHHHHHHHHH---CC T ss_conf 9999999988999999999-----85198776875--------6603789-988877896789867999999986---69 Q ss_pred CEEEEECC---CCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCC Q ss_conf 56999325---8898805679999999999997269849997487579766430688 Q gi|254780750|r 731 SFVILDEI---GRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKR 784 (920) Q Consensus 731 SLVllDEl---GrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~ 784 (920) -++---+. --||+- .-++..++. +-.++|-....-+..+-...+. T Consensus 64 ~f~E~~~~~g~~YGt~~--------~~I~~~~~~-G~~vil~id~~g~~~lk~~~~~ 111 (137) T cd00071 64 EFLEWAEFHGNYYGTSK--------AAVEEALAE-GKIVILEIDVQGARQVKKSYPD 111 (137) T ss_pred EEEEEEEECCCCCCCCH--------HHHHHHHHC-CCEEEEEEEHHHHHHHHHCCCC T ss_conf 14999999880662789--------999999963-9949999748999999970998 No 475 >KOG2373 consensus Probab=76.99 E-value=1.9 Score=21.80 Aligned_cols=138 Identities=30% Similarity=0.434 Sum_probs=66.0 Q ss_pred CCCCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH----CCCCC-CH----- Q ss_conf 046805887631028770444345635877776643999967784407899999999999997----19853-03----- Q gi|254780750|r 616 DGRHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQ----MGSYV-PA----- 685 (920) Q Consensus 616 ~gRHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQ----iG~fV-PA----- 685 (920) =-|-|++.++|+... .+-+-|+|||-.+||+|||---+|-..|-- -|||- |- T Consensus 257 WkRFpvLNk~LkGhR------------------~GElTvlTGpTGsGKTTFlsEYsLDL~~QGVnTLwgSFEi~n~rla~ 318 (514) T KOG2373 257 WKRFPVLNKYLKGHR------------------PGELTVLTGPTGSGKTTFLSEYSLDLFTQGVNTLWGSFEIPNKRLAH 318 (514) T ss_pred EHHHHHHHHHHCCCC------------------CCCEEEEECCCCCCCEEEEHHHHHHHHHHHHHHEEEEEECCHHHHHH T ss_conf 002068899853678------------------77269985688898236756765899863224406500436089999 Q ss_pred ----HHCCCCCCCEEEEEEECCCCCCCCCCHHHH--------HHHHHHHHHHHCCCCCEEEEEC----CCCCCCHHHHHH Q ss_conf ----532068221056765237661138532899--------9999999999589985699932----588988056799 Q gi|254780750|r 686 ----SYAHIGIVDKLFSRVGSADNLASGRSTFMV--------EMIETASILNQATNQSFVILDE----IGRGTATLDGLS 749 (920) Q Consensus 686 ----~~a~i~~~D~IftRiGa~D~l~~g~STF~v--------Em~e~~~IL~~at~~SLVllDE----lGrGTst~DG~a 749 (920) ..|..+.-||.-.==-=.|.-..--+.||. -..++-.+--..-.=--||||- +|-||-..|-.+ T Consensus 319 ~mL~Qyagyrl~drl~~y~HWadrFErlplyfmtfhgqq~~~~vi~~i~ha~yV~di~HViIDNLQFmmg~~~~~~Drf~ 398 (514) T KOG2373 319 WMLVQYAGYRLLDRLNSYKHWADRFERLPLYFMTFHGQQFMEKVINEIAHAIYVEDIQHVIIDNLQFMMGQGMMALDRFH 398 (514) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHCCCHHH T ss_conf 99999716723755545667899875263676102461068889999989998875464445557887464301000456 Q ss_pred HHHHHHHHH---HHHCCCEEEEECC Q ss_conf 999999999---9726984999748 Q gi|254780750|r 750 IAWATIEYL---HETNRCRGLLATH 771 (920) Q Consensus 750 iA~aile~l---~~~~~~~~lfaTH 771 (920) .--.++-|+ ...+.|-+-..-| T Consensus 399 ~QD~iig~fR~fAT~nn~HvTlVvH 423 (514) T KOG2373 399 LQDRIIGYFRQFATQNNIHVTLVVH 423 (514) T ss_pred HHHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 6778999999875426606999965 No 476 >cd02021 GntK Gluconate kinase (GntK) catalyzes the phosphoryl transfer from ATP to gluconate. The resulting product gluconate-6-phoshate is an important precursor of gluconate metabolism. GntK acts as a dimmer composed of two identical subunits. Probab=76.76 E-value=2.1 Score=21.48 Aligned_cols=20 Identities=30% Similarity=0.313 Sum_probs=17.8 Q ss_pred EEEEECCCCCCHHHHHHHHH Q ss_conf 99996778440789999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqva 671 (920) ++|+.|+..+||||.-+.+| T Consensus 1 liiv~GvsGsGKSTia~~La 20 (150) T cd02021 1 IIVVMGVSGSGKSTVGKALA 20 (150) T ss_pred CEEEECCCCCCHHHHHHHHH T ss_conf 98999189999999999999 No 477 >TIGR02755 TraX_Ftype type-F conjugative transfer system pilin acetylase TraX; InterPro: IPR014125 TraX is responsible for the acetylation of the F-pilin TraA during conjugative plasmid transfer. The purpose of this acetylation is unclear, but the reported transcriptional regulation of TraX may indicate that it is involved in the process of pilu extension/retraction , , .. Probab=76.73 E-value=0.67 Score=25.16 Aligned_cols=56 Identities=23% Similarity=0.283 Sum_probs=42.2 Q ss_pred EEEEECCCCCCHHHHHHHHHHH-----HHHHHCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHH Q ss_conf 9999677844078999999999-----99997198530353206822105676523766113853289999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALI-----VIMAQMGSYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIET 720 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~-----vilAQiG~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~ 720 (920) +-++-|-||+|| |-+||-+++ .++||.|.|.--...=++-++ -|| .=||.|-|.-. T Consensus 47 FALV~G~Nl~~~-~~~RQ~a~NRLW~~a~~AQ~gY~LAfrdvG~~Wy~---------~NI---LF~FAV~~Qv~ 107 (232) T TIGR02755 47 FALVWGLNLSRH-AHIRQHAINRLWGWALIAQPGYYLAFRDVGLPWYE---------GNI---LFAFAVVAQVL 107 (232) T ss_pred HHHHHHHCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH---------HHH---HHHHHHHHHHH T ss_conf 999984100256-52116788899999999977899998743621578---------889---99999999999 No 478 >PRK01254 hypothetical protein; Provisional Probab=76.62 E-value=3.8 Score=19.55 Aligned_cols=41 Identities=20% Similarity=0.365 Sum_probs=29.5 Q ss_pred HHHCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHH-HHCCCEEEEEEECC Q ss_conf 86181891078899888762656176699999999-98898599995028 Q gi|254780750|r 64 RCLGITLTKRGKHLGKDIPMCGVPVHTANHYIQKL-IKIGHRIAICEQIE 112 (920) Q Consensus 64 ~~L~i~lt~r~~~~~~~~pmaGvP~~~l~~yl~~L-v~~GykVaiveQ~E 112 (920) ..||+.+...+.+.++ |..|... +.|+ ..+||||||+-|=. T Consensus 37 d~~DiIlVTGDAYVDH--PSFG~Ai------IgR~LE~~GfrVGIIaQPd 78 (742) T PRK01254 37 DSCDIIIVTGDAYVDH--PSFGMAI------IGRMLEAQGFRVGIIAQPD 78 (742) T ss_pred CCCCEEEEECCCCCCC--CCHHHHH------HHHHHHHCCCEEEEEECCC T ss_conf 7477899947612567--5305899------9999998696598971799 No 479 >PRK08084 DNA replication initiation factor; Provisional Probab=76.53 E-value=4.9 Score=18.71 Aligned_cols=95 Identities=22% Similarity=0.230 Sum_probs=54.1 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCC---CCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 643999967784407899999999999997198---53035320682210567652376611385328999999999999 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGS---YVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILN 725 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~---fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~ 725 (920) ..+.+.|.||=.+|||-+|..+|--.. +.|. |||++. ...|..|+.| T Consensus 44 ~~~~l~l~G~~G~GKTHLLqA~~~~~~--~~~~~~~yl~~~~----------------------~~~~~~~~l~------ 93 (235) T PRK08084 44 HSGYIYLWGREGAGRSHLLHAACAELS--QRGDAVGYVPLDK----------------------RAWFVPEVLE------ 93 (235) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHH--CCCCCEEEEEHHH----------------------HHHHHHHHHH------ T ss_conf 987699989999888999999999997--0798579987798----------------------6651799998------ Q ss_pred HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 589985699932588988056799999999999972698499974875 Q gi|254780750|r 726 QATNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFH 773 (920) Q Consensus 726 ~at~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~ 773 (920) ....-.+|++|-+=.=..-.+----.+-+...+.+..+|..+++.+.+ T Consensus 94 ~l~~~dll~iDDi~~i~g~~~~ee~lF~l~N~~~~~g~~~ll~ts~~~ 141 (235) T PRK08084 94 GMEQLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLITGDRP 141 (235) T ss_pred HHHHCCEEEEECHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 764189899827455469978999999999999984896699967988 No 480 >TIGR03029 EpsG chain length determinant protein tyrosine kinase EpsG. The proteins in this family are homologs of the EpsG protein found in Methylobacillus strain 12S and are generally found in operons with other Eps homologs. The protein is believed to function as the protein tyrosine kinase component of the chain length regulator (along with the transmembrane component EpsF). Probab=76.36 E-value=4.9 Score=18.67 Aligned_cols=88 Identities=15% Similarity=0.196 Sum_probs=45.3 Q ss_pred CCEEEEEECCCCC-CHHHHHHHHHHHHHHHHCCCCCCHH--HCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 6439999677844-0789999999999999719853035--320682210567652376611385328999999999999 Q gi|254780750|r 649 SGKLWLLTGPNMG-GKSTFLRQNALIVIMAQMGSYVPAS--YAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILN 725 (920) Q Consensus 649 ~~~~~iiTGpNmg-GKSt~lRqval~vilAQiG~fVPA~--~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~ 725 (920) ..+.+.||+|..| ||||.- .-|.+.+||.|.=|=== -++=|-..++|- + ++ ..|.|.+...-.....|.+ T Consensus 102 ~~~~LaItS~~pGEGKS~vA--aNLA~~~Aq~G~RvLLVDaDLRrP~lh~~f~-l---~~-~~GLs~vL~g~~~l~~i~~ 174 (274) T TIGR03029 102 GRKALAVVSAKSGEGCSYIA--ANLAIVFSQLGEKTLLIDANLRDPVQHRNFK-L---SE-QRGLSDILAGRSDLEVITH 174 (274) T ss_pred CCCEEEEECCCCCCCHHHHH--HHHHHHHHHCCCEEEEEECCCCCCHHHHHHC-C---CC-CCCHHHHHCCCCCHHHEEE T ss_conf 88389996899999899999--9999999967991999958888844779759-9---99-7687888459998899051 Q ss_pred HCCCCCEEEEECCCCCCCHHH Q ss_conf 589985699932588988056 Q gi|254780750|r 726 QATNQSFVILDEIGRGTATLD 746 (920) Q Consensus 726 ~at~~SLVllDElGrGTst~D 746 (920) ...-.. +|=+..|+.+.. T Consensus 175 ~~~~~n---L~VLpaG~~ppn 192 (274) T TIGR03029 175 IPALEN---LSVLPAGAIPPN 192 (274) T ss_pred CCCCCC---EEEEECCCCCCC T ss_conf 589899---789969999989 No 481 >TIGR02633 xylG D-xylose ABC transporter, ATP-binding protein; InterPro: IPR013455 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Several bacterial species use the enzymes xylose isomerase and xylulokinase for xylose utilisation. This entry represents the ATP-binding cassette (ABC) subunit of the known, or predicted, high-affinity xylose ABC transporter used for xylose import. The genes encoding these proteins, which closely resemble other sugar transport ABC transporter genes, are typically found near xylose utilisation enzymes and regulatory proteins. . Probab=76.33 E-value=3.1 Score=20.14 Aligned_cols=79 Identities=19% Similarity=0.240 Sum_probs=45.4 Q ss_pred HHHHHHHC--CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEECC Q ss_conf 99999958--9985699932588988056799999999999972698499974875797664306885899999996099 Q gi|254780750|r 720 TASILNQA--TNQSFVILDEIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSNE 797 (920) Q Consensus 720 ~~~IL~~a--t~~SLVllDElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~~ 797 (920) =+.||-.+ +.--..||||--||-+---=.- -|-.+..|....=+..+++.-..|.-.+++..=-++...+..+--|+ T Consensus 411 QKA~Lak~Ll~~P~~LILDEPTRG~DvGAKyE-IYkL~~~La~~G~a~iv~SSEL~EVLG~sDRVLV~~eG~l~~~l~N~ 489 (501) T TIGR02633 411 QKAVLAKMLLLNPRVLILDEPTRGVDVGAKYE-IYKLINQLAQEGVAIIVVSSELAEVLGLSDRVLVIGEGKLKADLVNH 489 (501) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCCCHHH-HHHHHHHHHHCCCEEEEECCCCHHHCCCCCEEEEEECCEEEEEEECC T ss_conf 79999998743994899747888610364078-99999999847958999834310030552237887367788542035 Q ss_pred EE Q ss_conf 27 Q gi|254780750|r 798 GI 799 (920) Q Consensus 798 ~i 799 (920) .+ T Consensus 490 ~L 491 (501) T TIGR02633 490 EL 491 (501) T ss_pred CC T ss_conf 24 No 482 >PRK05416 hypothetical protein; Provisional Probab=76.31 E-value=2.8 Score=20.52 Aligned_cols=11 Identities=27% Similarity=0.350 Sum_probs=5.2 Q ss_pred EEECCCCCHHH Q ss_conf 40000468058 Q gi|254780750|r 612 FIVKDGRHPIV 622 (920) Q Consensus 612 l~i~~gRHPvi 622 (920) +-..+|||=-| T Consensus 253 iGCTGG~HRSV 263 (292) T PRK05416 253 IGCTGGQHRSV 263 (292) T ss_pred ECCCCCCCCHH T ss_conf 75889871799 No 483 >PRK13830 conjugal transfer protein TrbE; Provisional Probab=76.11 E-value=2.2 Score=21.30 Aligned_cols=29 Identities=45% Similarity=0.597 Sum_probs=22.3 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 9996778440789999999999999719853035 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPAS 686 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~ 686 (920) .+|.||.-+||||+| ..++||..=|-+|. T Consensus 459 tlIiGpTGsGKTvll-----~fl~aq~~ky~~~~ 487 (818) T PRK13830 459 TLIFGPTGSGKSTLL-----ALIAAQFRRYAGAQ 487 (818) T ss_pred EEEECCCCCCHHHHH-----HHHHHHHHHCCCCE T ss_conf 589899999889999-----99999986427983 No 484 >PRK04690 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=76.02 E-value=4.5 Score=18.97 Aligned_cols=17 Identities=35% Similarity=0.534 Sum_probs=9.7 Q ss_pred HHHHHHHCCCCHHHHHH Q ss_conf 89999982999899999 Q gi|254780750|r 815 GIQVGKLAGLPNTVISR 831 (920) Q Consensus 815 gi~vA~laG~p~~vi~~ 831 (920) |=+-.+|||+.+.-|++ T Consensus 445 G~~Fk~L~~~~~~~~~~ 461 (468) T PRK04690 445 GRHFAQLAGFDPAAISA 461 (468) T ss_pred HHHHHHHHCCCHHHHHH T ss_conf 99999987808777852 No 485 >pfam07724 AAA_2 AAA domain (Cdc48 subfamily). This Pfam entry includes some of the AAA proteins not detected by the pfam00004 model. Probab=75.88 E-value=4.7 Score=18.83 Aligned_cols=23 Identities=26% Similarity=0.433 Sum_probs=18.6 Q ss_pred EEEEECCCCCCHHHHHHHHHHHH Q ss_conf 99996778440789999999999 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIV 674 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~v 674 (920) .++++||--.|||..-|++|=.. T Consensus 5 ~~l~~GPsGvGKT~lAk~la~~l 27 (168) T pfam07724 5 SFLFLGPTGVGKTELAKALAELL 27 (168) T ss_pred EEEEECCCCCCHHHHHHHHHHHH T ss_conf 99988989989999999999996 No 486 >KOG0962 consensus Probab=75.81 E-value=2.5 Score=20.94 Aligned_cols=18 Identities=28% Similarity=0.385 Sum_probs=11.1 Q ss_pred EEEEEEEEECCCCEECCH Q ss_conf 554589997753030201 Q gi|254780750|r 126 VRRNVVRLVTPGTLTEDQ 143 (920) Q Consensus 126 v~R~Vt~IiTpGT~~d~~ 143 (920) -.|.+..-++|=|++.+. T Consensus 18 ~d~~~i~F~sPlTLIvG~ 35 (1294) T KOG0962 18 KDRNTIEFFSPLTLIVGA 35 (1294) T ss_pred CCCCEEEECCCEEEEECC T ss_conf 665536511770567668 No 487 >PRK04301 radA DNA repair and recombination protein RadA; Validated Probab=75.69 E-value=5.1 Score=18.55 Aligned_cols=116 Identities=16% Similarity=0.244 Sum_probs=58.6 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHC--------------CCCCCH------HHCCCC---CCCEEEE-EEECCC Q ss_conf 6439999677844078999999999999971--------------985303------532068---2210567-652376 Q gi|254780750|r 649 SGKLWLLTGPNMGGKSTFLRQNALIVIMAQM--------------GSYVPA------SYAHIG---IVDKLFS-RVGSAD 704 (920) Q Consensus 649 ~~~~~iiTGpNmgGKSt~lRqval~vilAQi--------------G~fVPA------~~a~i~---~~D~Ift-RiGa~D 704 (920) .+.+-=|.||.-+|||.+.-|.|+.+-|..- |.|=|- +...+. +.|+|+. |.-..| T Consensus 102 ~g~ITEi~Ge~gsGKTQlc~qLav~~qlp~~~GGl~g~vvYIDTEgtF~peRi~qia~~~g~d~~~~L~nI~v~r~~~~~ 181 (318) T PRK04301 102 TQSITEFYGEFGSGKTQICHQLAVNVQLPEEEGGLEGKAVYIDTENTFRPERIEQMAEGLGLDPDEVLDNIHVARAYNSD 181 (318) T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHEECCHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHCCCHHHHHHCEEEEECCCHH T ss_conf 67078886688787035667767653376777898863799956898697999999998499978986402686139989 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEC--------C-CCCCCHHHHHHHHH--HHHHHHHHHCCCEEEEECC Q ss_conf 611385328999999999999589985699932--------5-88988056799999--9999999726984999748 Q gi|254780750|r 705 NLASGRSTFMVEMIETASILNQATNQSFVILDE--------I-GRGTATLDGLSIAW--ATIEYLHETNRCRGLLATH 771 (920) Q Consensus 705 ~l~~g~STF~vEm~e~~~IL~~at~~SLVllDE--------l-GrGTst~DG~aiA~--aile~l~~~~~~~~lfaTH 771 (920) . .|.-+.+...++....+=.||++|- + |||+-..--..+.+ +.|..|.+.-.+-++++-| T Consensus 182 ~-------q~~~~~~~~~~~~~~~~v~LvVvDSi~alfR~e~~grg~l~~Rq~~L~~~l~~L~~lA~~~niaVvvTNQ 252 (318) T PRK04301 182 H-------QMLLAEKAEELIKEGNNIKLVIVDSLTAHFRAEYVGRGNLAERQQKLNKHLHDLLRLADLYNAAVVVTNQ 252 (318) T ss_pred H-------HHHHHHHHHHHHHHCCCCEEEEEECCHHHEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 9-------9999999999996278804999943423212104685309999999999999999999985957999613 No 488 >COG4608 AppF ABC-type oligopeptide transport system, ATPase component [Amino acid transport and metabolism] Probab=75.64 E-value=5.2 Score=18.54 Aligned_cols=120 Identities=27% Similarity=0.265 Sum_probs=63.5 Q ss_pred EEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCC----------------CCCEEEE Q ss_conf 443456358777766439999677844078999999999999971985303532068----------------2210567 Q gi|254780750|r 635 IANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIG----------------IVDKLFS 698 (920) Q Consensus 635 VpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~----------------~~D~Ift 698 (920) .=+||++.. ..+.++-|-|.+.+||||+-|.+. |..=|.+- ++- -+..++- T Consensus 28 avd~Vsf~i----~~ge~~glVGESG~GKSTlgr~i~--------~L~~pt~G-~i~f~g~~i~~~~~~~~~~~v~elL~ 94 (268) T COG4608 28 AVDGVSFSI----KEGETLGLVGESGCGKSTLGRLIL--------GLEEPTSG-EILFEGKDITKLSKEERRERVLELLE 94 (268) T ss_pred EECCEEEEE----CCCCEEEEEECCCCCHHHHHHHHH--------CCCCCCCC-EEEECCCCHHHCCHHHHHHHHHHHHH T ss_conf 751156897----589878999368887787999997--------28388872-69986853111366679999999999 Q ss_pred EEECCC--------CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHH---HHHHHHHHCCCEEE Q ss_conf 652376--------6113853289999999999995899856999325889880567999999---99999972698499 Q gi|254780750|r 699 RVGSAD--------NLASGRSTFMVEMIETASILNQATNQSFVILDEIGRGTATLDGLSIAWA---TIEYLHETNRCRGL 767 (920) Q Consensus 699 RiGa~D--------~l~~g~STF~vEm~e~~~IL~~at~~SLVllDElGrGTst~DG~aiA~a---ile~l~~~~~~~~l 767 (920) ++|-.. -++.||=-. .-.|.. -|.+-+|++.||. ||..|- ++-.- .+..|-+..+...+ T Consensus 95 ~Vgl~~~~~~ryPhelSGGQrQR----i~IARA--Lal~P~liV~DEp---vSaLDv-SiqaqIlnLL~~lq~~~~lt~l 164 (268) T COG4608 95 KVGLPEEFLYRYPHELSGGQRQR----IGIARA--LALNPKLIVADEP---VSALDV-SVQAQILNLLKDLQEELGLTYL 164 (268) T ss_pred HHCCCHHHHHCCCCCCCCHHHHH----HHHHHH--HHHCCCEEEECCC---HHHCCH-HHHHHHHHHHHHHHHHHCCEEE T ss_conf 80988778633883037313356----999999--8509867974372---110124-6799999999999998597589 Q ss_pred EECCCHHHHH Q ss_conf 9748757976 Q gi|254780750|r 768 LATHFHELTD 777 (920) Q Consensus 768 faTHy~eL~~ 777 (920) |.||.-.... T Consensus 165 FIsHDl~vv~ 174 (268) T COG4608 165 FISHDLSVVR 174 (268) T ss_pred EEEEEHHHHH T ss_conf 9987777666 No 489 >TIGR01087 murD UDP-N-acetylmuramoylalanine--D-glutamate ligase; InterPro: IPR005762 The bacterial cell wall provides strength and rigidity to counteract internal osmotic pressure, and protection against the environment. The peptidoglycan layer gives the cell wall its strength, and helps maintain the overall shape of the cell. The basic peptidoglycan structure of both Gram-positive and Gram-negative bacteria is comprised of a sheet of glycan chains connected by short cross-linking polypeptides. Biosynthesis of peptidoglycan is a multi-step (11-12 steps) process comprising three main stages: (1) formation of UDP-N-acetylmuramic acid (UDPMurNAc) from N-acetylglucosamine (GlcNAc). (2) addition of a short polypeptide chain to the UDPMurNAc. (3) addition of a second GlcNAc to the disaccharide-pentapeptide building block and transport of this unit through the cytoplasmic membrane and incorporation into the growing peptidoglycan layer. Stage two involves four key Mur ligase enzymes: MurC (6.3.2.8 from EC) , MurD (6.3.2.9 from EC) , MurE (6.3.2.13 from EC) and MurF (6.3.2.10 from EC) . These four Mur ligases are responsible for the successive additions of L-alanine, D-glutamate, meso-diaminopimelate or L-lysine, and D-alanyl-D-alanine to UDP-N-acetylmuramic acid. All four Mur ligases are topologically similar to one another, even though they display low sequence identity. They are each composed of three domains: an N-terminal Rossmann-fold domain responsible for binding the UDPMurNAc substrate; a central domain (similar to ATP-binding domains of several ATPases and GTPases); and a C-terminal domain (similar to dihydrofolate reductase fold) that appears to be associated with binding the incoming amino acid. The conserved sequence motifs found in the four Mur enzymes also map to other members of the Mur ligase family, including folylpolyglutamate synthetase, cyanophycin synthetase and the capB enzyme from Bacillales . This entry represents UDP-N-acetylmuramoylalanine-D-glutamate ligase (MurD). MurD catalyses the addition of d-glutamate to the nucleotide precursor UDP-N-acetylmuramoyl-l-alanine. ; GO: 0005524 ATP binding, 0008764 UDP-N-acetylmuramoylalanine-D-glutamate ligase activity, 0008360 regulation of cell shape, 0009252 peptidoglycan biosynthetic process, 0009273 peptidoglycan-based cell wall biogenesis, 0051301 cell division, 0005737 cytoplasm. Probab=75.64 E-value=3.3 Score=20.03 Aligned_cols=21 Identities=14% Similarity=0.147 Sum_probs=15.5 Q ss_pred HHHHHHCCCEEEEEEECCCHH Q ss_conf 999998898599995028978 Q gi|254780750|r 95 IQKLIKIGHRIAICEQIESPL 115 (920) Q Consensus 95 l~~Lv~~GykVaiveQ~E~p~ 115 (920) ++-|.++|..|.++++-..+. T Consensus 17 a~~L~~~g~~~~~~D~~~~~~ 37 (476) T TIGR01087 17 ARFLKKKGAEVTVTDDDPEEE 37 (476) T ss_pred HHHHHHCCCEEEEEECCCCCC T ss_conf 999997298799998452213 No 490 >COG0771 MurD UDP-N-acetylmuramoylalanine-D-glutamate ligase [Cell envelope biogenesis, outer membrane] Probab=75.57 E-value=3.7 Score=19.64 Aligned_cols=20 Identities=20% Similarity=0.546 Sum_probs=15.7 Q ss_pred HHHHHHCCCEEEEEEECCCH Q ss_conf 99999889859999502897 Q gi|254780750|r 95 IQKLIKIGHRIAICEQIESP 114 (920) Q Consensus 95 l~~Lv~~GykVaiveQ~E~p 114 (920) ++-|.+.|++|.++|+-+.| T Consensus 23 a~~L~~~G~~v~v~D~~~~~ 42 (448) T COG0771 23 ARFLLKLGAEVTVSDDRPAP 42 (448) T ss_pred HHHHHHCCCEEEEECCCCCC T ss_conf 99999779869998389876 No 491 >TIGR02857 CydD ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydD; InterPro: IPR014216 The gene pair cydCD encodes an ABC-family transporter in which each gene contains an N-terminal membrane-spanning domain and a C-terminal ATP-binding domain. In Escherichia coli these genes were discovered as mutants which caused the terminal heme-copper oxidase complex cytochrome bd to fail to assemble. Recent work has shown that the transporter is involved in the export of redox-active thiol compounds such as cysteine and glutathione , . The linkage to assembly of the cytochrome bd complex is further supported by the conserved operon structure found outside the gammaproteobacteria (cydABCD) containing both the transporter and oxidase genes components. The genes used as the seed members for this entry are all either found in the gammaproteobacterial context, or the CydABCD context. All members of this entry scoring above the trusted cut off at the time of its creation were from genomes which encode a cytochrome bd complex. Unfortunately, the gene symbol nomenclature, based on this operon in Bacillus subtilis, assigns cydC to the third gene in the operon, where this gene is actually homologous to the E. coli cydD gene. We have chosen to name all homologues in this family in accordance with the precedence of publication of the E. coli name, CydD.. Probab=75.30 E-value=3.5 Score=19.82 Aligned_cols=61 Identities=28% Similarity=0.362 Sum_probs=0.0 Q ss_pred HHHHHHHHCCCCCEECCCCCCEEEC-CCCC-HHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHH Q ss_conf 7888775258851210478714000-0468-0588763102877044434563587777664399996778440789999 Q gi|254780750|r 591 ALAILAKEQNYCRPIIDNSTNFIVK-DGRH-PIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLR 668 (920) Q Consensus 591 SlA~~a~~~~y~rP~i~~~~~l~i~-~gRH-PviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lR 668 (920) +..+.|.. +.. |.+.=++.=..- .||- | ++-++++.. +.+...-|+||=-+||||+|. T Consensus 337 ~~~~~a~D-G~~-P~l~~~~v~~~y~~G~~~p--------------a~~~~sf~~----~pG~~vAl~G~SGaGKSTLL~ 396 (570) T TIGR02857 337 ALGRLAAD-GAD-PSLEFEGVSVAYAPGRDEP--------------ALRPVSFTV----EPGERVALVGPSGAGKSTLLN 396 (570) T ss_pred HHCCCCCC-CCC-CEEEEEEEEEECCCCCHHH--------------CCCCCCEEE----CCCCEEEEEECCCCCHHHHHH T ss_conf 10010036-998-4078875168627888010--------------378854166----387048886279997889999 Q ss_pred HHH Q ss_conf 999 Q gi|254780750|r 669 QNA 671 (920) Q Consensus 669 qva 671 (920) -.. T Consensus 397 lLL 399 (570) T TIGR02857 397 LLL 399 (570) T ss_pred HHH T ss_conf 997 No 492 >PRK00081 coaE dephospho-CoA kinase; Reviewed Probab=75.26 E-value=3.7 Score=19.62 Aligned_cols=26 Identities=42% Similarity=0.441 Sum_probs=0.0 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 99996778440789999999999999719853 Q gi|254780750|r 652 LWLLTGPNMGGKSTFLRQNALIVIMAQMGSYV 683 (920) Q Consensus 652 ~~iiTGpNmgGKSt~lRqval~vilAQiG~fV 683 (920) +.-|||+-.+||||..+ ++...|++| T Consensus 4 ~IgiTG~igsGKStv~~------~l~~~G~~v 29 (199) T PRK00081 4 IIGLTGGIGSGKSTVAN------IFAELGVPV 29 (199) T ss_pred EEEEECCCCCCHHHHHH------HHHHCCCCE T ss_conf 99957888777999999------999889939 No 493 >PRK13900 type IV secretion system ATPase VirB11; Provisional Probab=75.25 E-value=2.8 Score=20.52 Aligned_cols=19 Identities=26% Similarity=0.464 Sum_probs=0.0 Q ss_pred EEEECCCCCCHHHHHHHHH Q ss_conf 9996778440789999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqva 671 (920) ++|.|+--+||||+|+..+ T Consensus 163 ilI~G~TgSGKTTll~aL~ 181 (332) T PRK13900 163 IIISGGTSTGKTTFTNAAL 181 (332) T ss_pred EEEECCCCCCHHHHHHHHH T ss_conf 9998888988999999998 No 494 >cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules. Probab=75.25 E-value=5.3 Score=18.47 Aligned_cols=156 Identities=18% Similarity=0.225 Sum_probs=0.0 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCC---CCCCHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 4399996778440789999999999999719---8530353206822105676523766113853289999999999995 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMG---SYVPASYAHIGIVDKLFSRVGSADNLASGRSTFMVEMIETASILNQ 726 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG---~fVPA~~a~i~~~D~IftRiGa~D~l~~g~STF~vEm~e~~~IL~~ 726 (920) +.+.||.|.---||||+|=|+|-. ||+-| .||-+++..--+--|= -|+| ..+..-|..-=.++..|+.. T Consensus 82 GSvvLlgGePGIGKSTLLLQia~~--la~~~~~vLYvSGEES~~QIk~RA-~RLg-----~~~~~l~l~set~le~Il~~ 153 (372) T cd01121 82 GSVILIGGDPGIGKSTLLLQVAAR--LAKRGGKVLYVSGEESPEQIKLRA-DRLG-----ISTENLYLLAETNLEDILAS 153 (372) T ss_pred CEEEEEECCCCCCHHHHHHHHHHH--HHHCCCCEEEEECHHHHHHHHHHH-HHHC-----CCCCCCEEEECCCHHHHHHH T ss_conf 717998259988688999999999--986399389982456789999899-9858-----78877278843569999999 Q ss_pred C--CCCCEEEEE--------CCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCEEEEEEEEEEEC Q ss_conf 8--998569993--------258898805679999999999997269849997487579766430688589999999609 Q gi|254780750|r 727 A--TNQSFVILD--------EIGRGTATLDGLSIAWATIEYLHETNRCRGLLATHFHELTDLSKSLKRFHNATLQVSDSN 796 (920) Q Consensus 727 a--t~~SLVllD--------ElGrGTst~DG~aiA~aile~l~~~~~~~~lfaTHy~eL~~l~~~~~~v~n~~~~~~~~~ 796 (920) . .+-+++++| ++..--.+--=+-=..+.+-.+.+..+.-+++.-|-.-=..++--.---|+...-..-++ T Consensus 154 i~~~kP~~lIIDSIQT~~~~~~~s~pGsvsQVReca~~L~~~AK~~~i~~~lVGHVTK~G~iAGPkvLEHmVDtVl~fEg 233 (372) T cd01121 154 IEELKPDLVIIDSIQTVYSSELTSAPGSVSQVRECTAELMRFAKERNIPIFIVGHVTKEGSIAGPKVLEHMVDTVLYFEG 233 (372) T ss_pred HHHHCCCEEEEECHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCHHHEEEEEEEEEECC T ss_conf 99719988999562202037767799878999999999999998619739999876268863774031002136887515 Q ss_pred CEEEEEEEEEECCCCCCHH Q ss_conf 9277877774478988778 Q gi|254780750|r 797 EGIIFLHKVIPGIADHSYG 815 (920) Q Consensus 797 ~~i~flykl~~G~~~~Syg 815 (920) +. ...||+...+=++ || T Consensus 234 d~-~~~~R~LR~~KNR-FG 250 (372) T cd01121 234 DR-HSEYRILRSVKNR-FG 250 (372) T ss_pred CC-CCCEEEEEEECCC-CC T ss_conf 77-6550356741156-77 No 495 >KOG0082 consensus Probab=74.99 E-value=3 Score=20.34 Aligned_cols=21 Identities=38% Similarity=0.594 Sum_probs=0.0 Q ss_pred EEEECCCCCCHHHHHHHHHHH Q ss_conf 999677844078999999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNALI 673 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqval~ 673 (920) +++=|+--|||||.+||+=++ T Consensus 36 lLLLGageSGKSTI~KQmkil 56 (354) T KOG0082 36 LLLLGAGESGKSTIVKQMKIL 56 (354) T ss_pred EEEECCCCCCHHHHHHHHHHH T ss_conf 556538987457899999999 No 496 >PRK11823 DNA repair protein RadA; Provisional Probab=74.93 E-value=1.6 Score=22.43 Aligned_cols=80 Identities=29% Similarity=0.280 Sum_probs=0.0 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCEEEEEEEC-CCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 43999967784407899999999999997198530353206822105676523-76611385328999999999999589 Q gi|254780750|r 650 GKLWLLTGPNMGGKSTFLRQNALIVIMAQMGSYVPASYAHIGIVDKLFSRVGS-ADNLASGRSTFMVEMIETASILNQAT 728 (920) Q Consensus 650 ~~~~iiTGpNmgGKSt~lRqval~vilAQiG~fVPA~~a~i~~~D~IftRiGa-~D~l~~g~STF~vEm~e~~~IL~~at 728 (920) +.+.||.|.---||||+|=|+|-..--..-=.||.+++..--+--|- -|+|. +|||.---.|- +..|+..+. T Consensus 90 GS~iLlgGePGIGKSTLlLQ~a~~la~~~~vLYvSGEES~~Qik~RA-~RLg~~~~~l~l~~et~------l~~Il~~i~ 162 (454) T PRK11823 90 GSVVLIGGDPGIGKSTLLLQVAAALAAGGKVLYVSGEESLQQIKLRA-ERLGLPSDNLYLLAETN------LEDILATIE 162 (454) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHH-HHHCCCCCCCEEEECCC------HHHHHHHHH T ss_conf 64899507998889999999999985599579981501578999999-97588888737885368------999999998 Q ss_pred CC--CEEEEE Q ss_conf 98--569993 Q gi|254780750|r 729 NQ--SFVILD 736 (920) Q Consensus 729 ~~--SLVllD 736 (920) .. +|+++| T Consensus 163 ~~~P~~lIID 172 (454) T PRK11823 163 EEKPDLVVID 172 (454) T ss_pred HHCCCEEEEE T ss_conf 6099889994 No 497 >PRK03806 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=74.59 E-value=5.4 Score=18.38 Aligned_cols=54 Identities=26% Similarity=0.397 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 680588763102877044434563587777664399996778440789999999999999719 Q gi|254780750|r 618 RHPIVEKTLKQQSSKPFIANDCDLSCPNDKNSGKLWLLTGPNMGGKSTFLRQNALIVIMAQMG 680 (920) Q Consensus 618 RHPviE~~l~~~~~~~fVpNdi~l~~~~~~~~~~~~iiTGpNmgGKSt~lRqval~vilAQiG 680 (920) .||.+......+.. |-.++.|-... ...++.-|||.| ||||.-..++ .||.+.| T Consensus 78 ~~p~~~~a~~~~i~---i~seiel~~~~--~~~~iIaVTGTn--GKTTTt~li~--~iL~~~g 131 (438) T PRK03806 78 AHPSLSAAADAGVE---IVGDIELFCRE--AQAPIVAITGSN--GKSTVTTLVG--EMAKAAG 131 (438) T ss_pred CCHHHHHHHHCCCC---EEEHHHHHHCC--CCCCEEEEECCC--CCHHHHHHHH--HHHHHCC T ss_conf 89899999987994---77699997222--799889994899--8489999999--9998659 No 498 >PRK13851 type IV secretion system protein VirB11; Provisional Probab=74.58 E-value=3 Score=20.31 Aligned_cols=19 Identities=37% Similarity=0.532 Sum_probs=0.0 Q ss_pred EEEECCCCCCHHHHHHHHH Q ss_conf 9996778440789999999 Q gi|254780750|r 653 WLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 653 ~iiTGpNmgGKSt~lRqva 671 (920) ++|.|+-.+||||||+... T Consensus 165 IlIsGgTGSGKTTllnALl 183 (343) T PRK13851 165 MLLCGPTGSGKTTMSKTLI 183 (343) T ss_pred EEEECCCCCCHHHHHHHHH T ss_conf 9998889861999999999 No 499 >pfam09439 SRPRB Signal recognition particle receptor beta subunit. The beta subunit of the signal recognition particle receptor (SRP) is a transmembrane GTPase which anchors the alpha subunit to the endoplasmic reticulum membrane. Probab=74.40 E-value=3.6 Score=19.67 Aligned_cols=24 Identities=17% Similarity=0.230 Sum_probs=0.0 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 664399996778440789999999 Q gi|254780750|r 648 NSGKLWLLTGPNMGGKSTFLRQNA 671 (920) Q Consensus 648 ~~~~~~iiTGpNmgGKSt~lRqva 671 (920) .+.+..+|.||+-+|||+++-+.. T Consensus 1 ~~~ptvLllGl~~sGKT~Lf~~L~ 24 (181) T pfam09439 1 SSQPAVIIAGLCDSGKTSLFTLLT 24 (181) T ss_pred CCCCEEEEECCCCCCHHHHHHHHH T ss_conf 998869998689998999999997 No 500 >TIGR00017 cmk cytidylate kinase; InterPro: IPR003136 This family consists of cytidylate kinase (2.7.4.14 from EC), which catalyzes the phosphorylation of cytidine 5-monophosphate (dCMP) to cytidine 5 -diphosphate (dCDP) in the presence of ATP or GTP. UMP and dCMP can also act as acceptors .; GO: 0004127 cytidylate kinase activity, 0005524 ATP binding, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=74.28 E-value=2.2 Score=21.31 Aligned_cols=17 Identities=41% Similarity=0.608 Sum_probs=0.0 Q ss_pred EECCCCCCHHHHHHHHH Q ss_conf 96778440789999999 Q gi|254780750|r 655 LTGPNMGGKSTFLRQNA 671 (920) Q Consensus 655 iTGpNmgGKSt~lRqva 671 (920) |=||=.+||||.-|++| T Consensus 7 IDGPs~aGKStvak~~A 23 (223) T TIGR00017 7 IDGPSGAGKSTVAKAVA 23 (223) T ss_pred EECCCCCCHHHHHHHHH T ss_conf 23776465578999999 Done!