Query gi|254780293|ref|YP_003064706.1| 30S ribosomal protein S9 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 170 No_of_seqs 129 out of 946 Neff 5.2 Searched_HMMs 39220 Date Sun May 29 14:35:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780293.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00132 rpsI 30S ribosomal pr 100.0 0 0 356.0 14.2 128 43-170 3-130 (130) 2 CHL00079 rps9 ribosomal protei 100.0 0 0 343.3 14.4 127 44-170 7-134 (134) 3 COG0103 RpsI Ribosomal protein 100.0 0 0 340.2 13.5 126 45-170 5-130 (130) 4 pfam00380 Ribosomal_S9 Ribosom 100.0 0 0 339.5 13.3 121 50-170 1-121 (121) 5 PRK00474 rps9p 30S ribosomal p 100.0 0 0 325.7 13.4 125 45-170 3-132 (132) 6 TIGR03627 arch_S9P archaeal ri 100.0 0 0 323.3 13.3 124 46-170 2-130 (130) 7 KOG1697 consensus 100.0 0 0 327.8 9.0 136 35-170 136-275 (275) 8 PTZ00086 40S ribosomal protein 100.0 0 0 321.1 13.1 127 43-170 8-147 (147) 9 KOG1753 consensus 99.9 8.2E-26 2.1E-30 179.8 9.8 125 45-170 8-145 (145) 10 TIGR02903 spore_lon_C ATP-depe 93.9 0.099 2.5E-06 31.0 4.3 84 42-128 446-542 (616) 11 TIGR01193 bacteriocin_ABC ABC- 93.4 0.048 1.2E-06 33.0 2.0 77 50-129 510-630 (710) 12 TIGR00958 3a01208 antigen pept 92.2 0.059 1.5E-06 32.4 1.2 79 49-129 568-687 (770) 13 KOG0058 consensus 89.3 0.49 1.2E-05 26.8 3.6 78 49-129 503-621 (716) 14 TIGR01846 type_I_sec_HlyB type 86.6 0.63 1.6E-05 26.1 2.8 85 42-129 489-618 (703) 15 COG4988 CydD ABC-type transpor 85.2 0.45 1.1E-05 27.0 1.5 80 49-130 356-474 (559) 16 PRK11176 lipid transporter ATP 84.9 2 5.2E-05 22.9 4.8 86 43-129 367-496 (581) 17 PTZ00265 multidrug resistance 84.8 1.3 3.4E-05 24.1 3.8 65 63-129 1275-1376(1467) 18 PRK11174 cysteine/glutathione 84.1 1.7 4.3E-05 23.4 4.1 79 49-129 385-502 (588) 19 PRK13657 cyclic beta-1,2-gluca 84.0 0.33 8.4E-06 27.8 0.4 86 43-130 360-489 (585) 20 TIGR02211 LolD_lipo_ex lipopro 83.4 1.6 4E-05 23.6 3.7 67 63-130 57-159 (221) 21 PRK11160 cysteine/glutathione 81.8 2.3 5.9E-05 22.6 4.0 79 49-129 376-493 (575) 22 cd03229 ABC_Class3 This class 80.1 0.8 2E-05 25.4 1.2 75 49-129 35-117 (178) 23 TIGR02857 CydD ABC transporter 79.2 0.89 2.3E-05 25.1 1.2 86 44-131 356-514 (570) 24 COG4608 AppF ABC-type oligopep 78.4 0.95 2.4E-05 25.0 1.2 71 49-128 48-125 (268) 25 COG3842 PotA ABC-type spermidi 78.3 0.58 1.5E-05 26.3 0.0 48 49-96 40-106 (352) 26 cd03214 ABC_Iron-Siderophores_ 78.2 0.59 1.5E-05 26.3 0.0 76 49-129 34-114 (180) 27 KOG3288 consensus 77.9 1.5 3.7E-05 23.8 2.0 32 99-130 273-304 (307) 28 cd03252 ABCC_Hemolysin The ABC 77.9 5.5 0.00014 20.3 4.9 80 49-129 37-155 (237) 29 COG1132 MdlB ABC-type multidru 77.6 5.8 0.00015 20.1 5.0 86 43-130 354-483 (567) 30 cd03244 ABCC_MRP_domain2 Domai 77.2 4.2 0.00011 21.0 4.1 80 49-129 39-156 (221) 31 TIGR03375 type_I_sec_LssB type 77.1 0.93 2.4E-05 25.0 0.8 80 49-129 500-618 (694) 32 TIGR01192 chvA glucan exporter 76.8 6.3 0.00016 19.9 5.5 84 49-134 370-493 (592) 33 cd03288 ABCC_SUR2 The SUR doma 76.7 6.3 0.00016 19.9 5.6 79 49-129 56-173 (257) 34 PRK10789 putative multidrug tr 76.1 1.8 4.7E-05 23.2 2.1 86 43-129 340-468 (569) 35 cd03248 ABCC_TAP TAP, the Tran 75.9 3.8 9.6E-05 21.3 3.6 87 42-129 38-167 (226) 36 COG1126 GlnQ ABC-type polar am 75.8 2.7 6.9E-05 22.2 2.9 74 50-128 38-152 (240) 37 cd03249 ABC_MTABC3_MDL1_MDL2 M 74.9 7 0.00018 19.6 4.9 80 49-129 38-156 (238) 38 cd03251 ABCC_MsbA MsbA is an e 71.4 4.9 0.00013 20.6 3.3 80 49-129 37-155 (234) 39 PRK10790 putative multidrug tr 71.0 1.7 4.3E-05 23.4 0.9 79 49-129 376-493 (593) 40 pfam02120 Flg_hook Flagellar h 70.8 2.9 7.3E-05 22.0 2.0 52 97-148 23-74 (85) 41 cd03254 ABCC_Glucan_exporter_l 70.5 5.7 0.00015 20.2 3.5 80 49-129 38-156 (229) 42 PRK10522 multidrug transporter 67.7 8.5 0.00022 19.1 3.9 37 49-86 358-397 (547) 43 cd03253 ABCC_ATM1_transporter 65.1 11 0.00029 18.3 4.1 80 49-129 36-154 (236) 44 COG1750 Archaeal serine protea 63.1 13 0.00032 18.1 7.1 72 58-131 55-133 (579) 45 cd03261 ABC_Org_Solvent_Resist 62.5 4.7 0.00012 20.7 1.8 76 49-129 35-153 (235) 46 cd03298 ABC_ThiQ_thiamine_tran 62.4 4.2 0.00011 21.0 1.5 76 49-129 33-145 (211) 47 cd03246 ABCC_Protease_Secretio 62.1 0.87 2.2E-05 25.2 -2.0 70 49-129 37-113 (173) 48 PRK11432 fbpC ferric transport 61.8 6.3 0.00016 19.9 2.3 80 49-129 41-153 (351) 49 cd03213 ABCG_EPDR ABCG transpo 61.3 9.2 0.00023 18.9 3.1 65 49-129 44-128 (194) 50 cd03369 ABCC_NFT1 Domain 2 of 61.1 4.3 0.00011 20.9 1.4 80 49-129 43-142 (207) 51 COG2274 SunT ABC-type bacterio 59.6 14 0.00037 17.7 5.2 77 50-129 509-626 (709) 52 TIGR03608 L_ocin_972_ABC putat 58.8 7.6 0.00019 19.4 2.3 76 49-129 33-151 (206) 53 COG1124 DppF ABC-type dipeptid 58.3 5.9 0.00015 20.1 1.7 81 42-128 31-157 (252) 54 cd03247 ABCC_cytochrome_bd The 57.2 1.7 4.3E-05 23.5 -1.3 72 49-129 37-115 (178) 55 cd00267 ABC_ATPase ABC (ATP-bi 56.8 4 0.0001 21.1 0.6 62 49-130 34-98 (157) 56 cd03228 ABCC_MRP_Like The MRP 56.8 1.8 4.5E-05 23.3 -1.2 70 49-129 37-113 (171) 57 COG4525 TauB ABC-type taurine 56.7 14 0.00037 17.7 3.4 76 49-129 40-149 (259) 58 TIGR02982 heterocyst_DevA ABC 56.5 16 0.00041 17.4 3.6 53 73-130 107-159 (220) 59 TIGR01989 COQ6 Ubiquinone bios 56.3 8.3 0.00021 19.2 2.2 25 101-127 1-25 (481) 60 TIGR02673 FtsE cell division A 54.7 13 0.00033 18.0 2.9 74 52-130 40-155 (215) 61 pfam00016 RuBisCO_large Ribulo 54.7 17 0.00044 17.2 4.2 36 102-138 244-306 (309) 62 TIGR00416 sms DNA repair prote 51.8 19 0.00049 16.9 4.3 79 82-161 360-453 (481) 63 PRK00286 xseA exodeoxyribonucl 51.7 19 0.00048 16.9 3.4 33 83-115 176-208 (443) 64 TIGR01842 type_I_sec_PrtD type 51.2 14 0.00035 17.8 2.6 77 49-127 365-481 (556) 65 TIGR03265 PhnT2 putative 2-ami 51.1 12 0.0003 18.2 2.3 48 49-96 39-105 (353) 66 PRK06996 hypothetical protein; 50.6 12 0.0003 18.2 2.2 23 52-74 78-102 (397) 67 TIGR02315 ABC_phnC phosphonate 50.2 10 0.00027 18.5 1.9 49 75-129 124-172 (253) 68 COG3840 ThiQ ABC-type thiamine 49.8 21 0.00053 16.7 4.4 17 60-76 48-64 (231) 69 PRK05446 imidazole glycerol-ph 46.9 23 0.00059 16.4 4.8 44 87-130 296-339 (354) 70 cd03215 ABC_Carb_Monos_II This 44.8 1.6 4.1E-05 23.6 -3.0 39 42-80 24-69 (182) 71 COG3839 MalK ABC-type sugar tr 44.5 21 0.00054 16.6 2.7 76 49-129 38-150 (338) 72 cd03223 ABCD_peroxisomal_ALDP 44.1 12 0.00032 18.1 1.5 64 49-129 36-108 (166) 73 TIGR02023 BchP-ChlP geranylger 40.5 23 0.00059 16.4 2.4 15 108-122 6-20 (408) 74 COG1561 Uncharacterized stress 40.2 29 0.00075 15.8 3.2 74 54-128 6-89 (290) 75 TIGR03378 glycerol3P_GlpB glyc 39.0 23 0.00058 16.5 2.2 14 67-80 140-154 (419) 76 cd03216 ABC_Carb_Monos_I This 38.8 7.2 0.00018 19.5 -0.4 62 49-129 35-99 (163) 77 COG1850 RbcL Ribulose 1,5-bisp 38.7 31 0.00079 15.6 3.7 40 102-142 370-424 (429) 78 TIGR00020 prfB peptide chain r 38.4 17 0.00044 17.2 1.5 28 49-76 216-247 (373) 79 COG1148 HdrA Heterodisulfide r 36.4 23 0.00059 16.4 1.9 51 82-134 106-159 (622) 80 pfam05488 PAAR_motif PAAR moti 36.2 13 0.00034 17.9 0.6 16 61-76 1-16 (26) 81 COG2884 FtsE Predicted ATPase 35.6 35 0.00088 15.3 3.2 76 50-130 38-155 (223) 82 TIGR02203 MsbA_lipidA lipid A 34.0 23 0.00059 16.4 1.6 92 40-134 384-523 (603) 83 cd03291 ABCC_CFTR1 The CFTR su 32.8 38 0.00098 15.0 4.6 79 49-129 72-176 (282) 84 PRK08948 consensus 32.3 35 0.00089 15.3 2.2 21 55-75 75-97 (392) 85 PRK08010 pyridine nucleotide-d 32.2 39 0.001 15.0 2.5 40 104-143 160-214 (441) 86 KOG2495 consensus 32.2 39 0.001 15.0 4.8 85 55-139 156-257 (491) 87 TIGR02138 phosphate_pstC phosp 31.9 35 0.00089 15.3 2.2 19 109-127 219-237 (317) 88 pfam05491 RuvB_C Holliday junc 31.8 35 0.0009 15.3 2.2 41 109-150 21-61 (75) 89 cd03289 ABCC_CFTR2 The CFTR su 31.7 40 0.001 14.9 4.2 80 49-129 39-155 (275) 90 PRK05589 peptide chain release 31.6 25 0.00063 16.2 1.4 29 48-76 168-199 (325) 91 COG4138 BtuD ABC-type cobalami 31.4 36 0.00091 15.3 2.2 34 54-87 40-74 (248) 92 PRK06292 dihydrolipoamide dehy 30.3 42 0.0011 14.8 2.5 18 104-121 171-188 (460) 93 COG1127 Ttg2A ABC-type transpo 30.2 43 0.0011 14.8 2.9 75 49-128 43-161 (263) 94 cd03296 ABC_CysA_sulfate_impor 29.9 33 0.00085 15.4 1.8 47 49-95 37-102 (239) 95 PRK10851 sulfate/thiosulfate t 29.8 24 0.0006 16.4 1.0 46 49-94 37-101 (352) 96 PRK07334 threonine dehydratase 29.5 35 0.00088 15.3 1.8 36 91-133 159-195 (399) 97 COG3179 Predicted chitinase [G 29.4 22 0.00056 16.5 0.8 71 56-129 110-202 (206) 98 cd03250 ABCC_MRP_domain1 Domai 29.0 45 0.0011 14.6 2.4 80 49-129 40-144 (204) 99 PRK13538 cytochrome c biogenes 28.5 38 0.00097 15.1 1.9 27 50-76 37-66 (204) 100 KOG2665 consensus 28.4 39 0.001 15.0 2.0 13 100-114 48-60 (453) 101 PRK10752 sulfate transporter s 28.1 46 0.0012 14.6 3.0 84 67-153 21-106 (329) 102 pfam10865 DUF2703 Protein of u 27.9 34 0.00086 15.4 1.5 17 64-80 61-77 (120) 103 COG4987 CydC ABC-type transpor 27.7 47 0.0012 14.5 4.6 80 49-130 373-492 (573) 104 COG1608 Predicted archaeal kin 26.1 50 0.0013 14.3 6.6 78 59-143 3-95 (252) 105 COG1116 TauB ABC-type nitrate/ 25.8 51 0.0013 14.3 3.4 75 50-129 39-147 (248) 106 cd03231 ABC_CcmA_heme_exporter 25.7 51 0.0013 14.3 2.5 75 50-129 36-142 (201) 107 COG1125 OpuBA ABC-type proline 25.6 45 0.0012 14.6 1.9 29 50-78 37-68 (309) 108 smart00798 AICARFT_IMPCHas AIC 25.1 41 0.0011 14.9 1.6 18 109-126 294-311 (311) 109 PRK00951 hisB imidazoleglycero 25.0 53 0.0013 14.2 3.6 43 87-129 134-177 (195) 110 cd03295 ABC_OpuCA_Osmoprotecti 25.0 41 0.001 14.9 1.5 80 49-129 36-152 (242) 111 TIGR01277 thiQ thiamine ABC tr 24.9 29 0.00074 15.8 0.8 16 60-75 47-62 (213) 112 COG0410 LivF ABC-type branched 24.6 24 0.00061 16.3 0.3 77 52-129 41-153 (237) 113 cd03301 ABC_MalK_N The N-termi 23.9 53 0.0014 14.2 2.0 80 49-129 35-147 (213) 114 COG1224 TIP49 DNA helicase TIP 23.9 55 0.0014 14.1 2.5 10 118-127 80-89 (450) 115 PRK07342 peptide chain release 23.9 48 0.0012 14.5 1.7 29 48-76 154-185 (322) 116 PRK07818 dihydrolipoamide dehy 23.8 55 0.0014 14.1 2.6 40 104-143 174-228 (467) 117 KOG2158 consensus 23.5 56 0.0014 14.0 2.1 41 64-106 239-280 (565) 118 cd03293 ABC_NrtD_SsuB_transpor 23.0 51 0.0013 14.3 1.7 80 49-129 39-148 (220) 119 COG0381 WecB UDP-N-acetylgluco 22.9 58 0.0015 14.0 3.2 48 86-133 46-94 (383) 120 TIGR02734 crtI_fam phytoene de 22.9 52 0.0013 14.2 1.7 13 109-121 5-17 (526) 121 pfam03971 IDH Monomeric isocit 22.8 53 0.0014 14.2 1.8 46 61-113 528-574 (735) 122 KOG1251 consensus 22.6 59 0.0015 13.9 2.3 23 91-113 164-187 (323) 123 TIGR01811 sdhA_Bsu succinate d 22.6 54 0.0014 14.1 1.8 14 102-115 116-129 (620) 124 pfam02601 Exonuc_VII_L Exonucl 22.5 59 0.0015 13.9 3.5 32 84-115 56-90 (295) 125 PRK05714 2-octaprenyl-3-methyl 22.3 59 0.0015 13.9 2.2 10 64-73 89-98 (405) 126 COG4559 ABC-type hemin transpo 22.1 41 0.001 14.9 1.1 28 60-87 50-77 (259) 127 PRK07245 consensus 21.7 56 0.0014 14.0 1.7 29 48-76 178-209 (337) 128 COG0644 FixC Dehydrogenases (f 21.3 62 0.0016 13.8 2.5 10 56-65 197-206 (396) 129 PRK11614 livF leucine/isoleuci 21.2 6.6 0.00017 19.8 -3.1 28 50-77 41-71 (237) 130 PRK00591 prfA peptide chain re 20.9 60 0.0015 13.8 1.7 27 49-75 196-224 (360) 131 TIGR01988 Ubi-OHases Ubiquinon 20.8 64 0.0016 13.7 2.1 17 106-122 3-19 (445) 132 cd07914 IGPD Imidazoleglycerol 20.8 64 0.0016 13.7 4.0 45 86-130 129-174 (190) 133 pfam01808 AICARFT_IMPCHas AICA 20.7 58 0.0015 13.9 1.6 18 110-127 172-189 (315) 134 PRK06467 dihydrolipoamide dehy 20.7 64 0.0016 13.7 2.6 17 104-120 176-192 (472) 135 cd03290 ABCC_SUR1_N The SUR do 20.4 65 0.0017 13.6 3.7 28 49-76 36-66 (218) 136 PRK11247 ssuB aliphatic sulfon 20.2 66 0.0017 13.6 2.4 76 49-129 47-150 (257) 137 PRK05976 dihydrolipoamide dehy 20.0 66 0.0017 13.6 2.6 20 103-122 176-195 (464) No 1 >PRK00132 rpsI 30S ribosomal protein S9; Reviewed Probab=100.00 E-value=0 Score=356.04 Aligned_cols=128 Identities=52% Similarity=0.884 Sum_probs=125.6 Q ss_pred CCCCEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHH Q ss_conf 35500131112108999998189739994100888446989999999999873856641079998259830188899999 Q gi|254780293|r 43 WQRSYATGKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHG 122 (170) Q Consensus 43 ~g~~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~Airla 122 (170) ....|||||||||+|+|||.||+|+|+|||+|+++||+++.+++.+++||.+++..++|||+|+|+|||+||||+||||| T Consensus 3 ~~~~~~~GrRKtaiArv~l~~G~G~i~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~DI~i~V~GGG~sGQa~Airla 82 (130) T PRK00132 3 QVQYYGTGRRKSAVARVRLKPGSGKITVNGRDLEEYFPRETLRMVVRQPLELTETEGKYDIKVNVKGGGLSGQAGAIRHG 82 (130) T ss_pred CCEEEEECCCCCEEEEEEEECCCCEEEECCEEHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCHHHHHHHHHHH T ss_conf 42899855760589999998688579995812999749999999998589874745640389999798898999999999 Q ss_pred HHHHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 999997657634487997798335755416759528984335430579 Q gi|254780293|r 123 VAKALTYFQPDLRPQIKKGGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 123 IarAL~~~~p~~r~~Lk~~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) |||||+.++|++++.|++.||||+|+|++||||||++||||+|||||| T Consensus 83 IaRaL~~~~~~~r~~Lk~~glLtrD~R~~ErKK~G~~kARk~~Q~skR 130 (130) T PRK00132 83 IARALLEYDPELRPALKKAGFLTRDARVVERKKYGLKKARRAPQFSKR 130 (130) T ss_pred HHHHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 999999979787999998789346861115576897566656466789 No 2 >CHL00079 rps9 ribosomal protein S9 Probab=100.00 E-value=0 Score=343.30 Aligned_cols=127 Identities=45% Similarity=0.705 Sum_probs=122.8 Q ss_pred CCCEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCC-CHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHH Q ss_conf 5500131112108999998189739994100888446-989999999999873856641079998259830188899999 Q gi|254780293|r 44 QRSYATGKRKTSIARVWIKSGSGKFTINYVDISKYFT-QDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHG 122 (170) Q Consensus 44 g~~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~-~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~Airla 122 (170) .-.|||||||||+|+|||++|+|+|+|||+|+++||+ ++.+++.+++||.++++.++|||+|+|+|||+||||+||||| T Consensus 7 ~~~y~tGkRKtsiArv~i~~GsG~i~VN~k~l~~Yf~~~~~~~~~v~~Pl~~~~~~~k~DI~i~V~GGG~sgQa~Air~a 86 (134) T CHL00079 7 ILYYGTGRRKCAVAQVRLVPGTGEIIINGKPGEEYLQYNPNYLKAIKAPLEVLGLENQYDIIVKVKGGGLTGQAEAIKLG 86 (134) T ss_pred EEEEEECCCCCEEEEEEEECCCCEEEEECCCHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCHHHHHHHHHHH T ss_conf 79998668705899999986983699959709996586699999999799986878735699999788754399999999 Q ss_pred HHHHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 999997657634487997798335755416759528984335430579 Q gi|254780293|r 123 VAKALTYFQPDLRPQIKKGGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 123 IarAL~~~~p~~r~~Lk~~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) |||||+.++|++++.|+++||||+|+|++||||||++||||+|||||| T Consensus 87 IaRaL~~~~~~~r~~Lk~~glLtrD~R~~ErKK~G~~kARk~~q~skR 134 (134) T CHL00079 87 IARALCSMNPENRKSLKSEGFLTRDARIKERKKYGLKKARKAPQFSKR 134 (134) T ss_pred HHHHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 999999979776799987779126731215586897455647365779 No 3 >COG0103 RpsI Ribosomal protein S9 [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=340.17 Aligned_cols=126 Identities=51% Similarity=0.809 Sum_probs=124.7 Q ss_pred CCEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 50013111210899999818973999410088844698999999999987385664107999825983018889999999 Q gi|254780293|r 45 RSYATGKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVA 124 (170) Q Consensus 45 ~~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIa 124 (170) ..|+|||||+|+|||||.+|+|+|+|||+|+++||+++.+++.|++||.+++..++|||.++|+|||+||||+||||||| T Consensus 5 ~~~~tGkRK~AvArv~l~~g~G~i~vNg~~~e~yf~~e~~r~~i~~Pl~l~~~~~~~Di~v~V~GGG~~GQA~AiR~gIa 84 (130) T COG0103 5 QVYTTGKRKSAVARVRLVPGKGKITVNGRPLELYFPRETLRMKIMQPLLLTGTVGKFDIDVTVKGGGISGQAGAIRHGIA 84 (130) T ss_pred EEEEECCCCCEEEEEEEECCCCEEEECCCCHHHHCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCHHHHHHHHHHHH T ss_conf 24640554326899999768758999891899966409999998766998674245547999966984689999999999 Q ss_pred HHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9997657634487997798335755416759528984335430579 Q gi|254780293|r 125 KALTYFQPDLRPQIKKGGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 125 rAL~~~~p~~r~~Lk~~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) |||+.|+|++++.|++.||||+|||++||||||++|||++|||||| T Consensus 85 RAL~~~~~~lr~~lk~~g~LtrD~R~~ErKK~G~~kARr~~Q~SkR 130 (130) T COG0103 85 RALVEYDPELRPALKKAGLLTRDPRRVERKKYGLKKARRRPQFSKR 130 (130) T ss_pred HHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999979898999987788015764324443576332115344569 No 4 >pfam00380 Ribosomal_S9 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes. Probab=100.00 E-value=0 Score=339.54 Aligned_cols=121 Identities=51% Similarity=0.849 Sum_probs=120.2 Q ss_pred EECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 11121089999981897399941008884469899999999998738566410799982598301888999999999976 Q gi|254780293|r 50 GKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 50 GrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ||||||+|+|||++|+|+|+|||+|+++||+++.+++.+++||.+++..++|||+|+|+|||+||||+||||||||||+. T Consensus 1 GrRKtaiArv~l~~G~G~i~INg~~~~~yf~~~~~r~~v~~Pl~~~~~~~~~Di~i~V~GGG~sgQa~AirlaIaRaLv~ 80 (121) T pfam00380 1 GRRKTAVARVWLKPGSGKITINGKPLEEYFPNETLRMKILEPLELTGTLGKFDIVVTVKGGGISGQAGAIRLAIARALVA 80 (121) T ss_pred CCCCCEEEEEEEEECCCEEEECCCCHHHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHH T ss_conf 99845899999980776799989328886698999999997999860046642699996888858999999999999999 Q ss_pred HCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 57634487997798335755416759528984335430579 Q gi|254780293|r 130 FQPDLRPQIKKGGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 130 ~~p~~r~~Lk~~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) ++|++++.|++.||||+|+|++||||||++||||+|||||| T Consensus 81 ~~~~~r~~Lk~~glLt~D~R~~ErKK~G~~kARk~~q~skR 121 (121) T pfam00380 81 YDPELRPELKKAGLLTRDPRRKERKKYGLKKARKRPQFSKR 121 (121) T ss_pred HCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 79787899998869015751115586897566746355679 No 5 >PRK00474 rps9p 30S ribosomal protein S9P; Reviewed Probab=100.00 E-value=0 Score=325.68 Aligned_cols=125 Identities=35% Similarity=0.461 Sum_probs=118.2 Q ss_pred CCEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHC-CCCCEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 5001311121089999981897399941008884469899999999998738-566410799982598301888999999 Q gi|254780293|r 45 RSYATGKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVS-QDNMYDVFATVSGGGLSGQASAICHGV 123 (170) Q Consensus 45 ~~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~-~~~~~Di~~~V~GGG~sgQa~AirlaI 123 (170) ..++|||||||+|+|||+||+|+|+|||+|++.| ++..+++.+++||.+++ ..++|||+|+|+|||+||||+|||||| T Consensus 3 ~i~atGrRKtaiArv~l~~G~G~i~INgk~~~~y-~~~~~~~~i~~Pl~~~~~~~~~~Di~i~V~GGG~sgQA~AirlaI 81 (132) T PRK00474 3 VVITSGKRKTAIARATIKEGKGRVRINGVPLELY-EPELARMKIMEPLILAGDIASKVDIDVDVSGGGIMGQADAVRTAI 81 (132) T ss_pred EEEEECCCCCEEEEEEEECCCCEEEECCEEHHHH-CCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHH T ss_conf 5887678766799999982873598898758996-899999999855999850034631799997787259999999999 Q ss_pred HHHHHHH--CHHHHHHHHH--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999765--7634487997--798335755416759528984335430579 Q gi|254780293|r 124 AKALTYF--QPDLRPQIKK--GGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 124 arAL~~~--~p~~r~~Lk~--~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) ||||+.+ +|++++.|+. .||||+|||++||||||++||||+|||||| T Consensus 82 aRAL~~~~~~~~~~~~~~~~d~glLtrD~R~~ERKK~G~~kARk~~Q~SkR 132 (132) T PRK00474 82 ARGLVEWTGDMELKDAFLEYDRTLLVGDPRRTEPKKPGGKGARAKRQKSYR 132 (132) T ss_pred HHHHHHHCCCHHHHHHHHHCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCC T ss_conf 999998624877899998533786214635433577898554435255569 No 6 >TIGR03627 arch_S9P archaeal ribosomal protein S9P. This model describes exclusively the archaeal ribosomal protein S9P. Homologous eukaryotic and bacterial ribosomal proteins are excluded from this model. Probab=100.00 E-value=0 Score=323.34 Aligned_cols=124 Identities=36% Similarity=0.480 Sum_probs=117.4 Q ss_pred CEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHH-CCCCCEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 00131112108999998189739994100888446989999999999873-85664107999825983018889999999 Q gi|254780293|r 46 SYATGKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTV-SQDNMYDVFATVSGGGLSGQASAICHGVA 124 (170) Q Consensus 46 ~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~-~~~~~~Di~~~V~GGG~sgQa~AirlaIa 124 (170) .++|||||||+|+|||+||+|+|+|||+|++. |+++.+++.+++||.++ +..++|||+|+|+|||+||||+||||||| T Consensus 2 i~~tGrRKtaiA~v~l~~G~G~i~VNg~~~~~-~~~~~~~~~i~~Pl~~~~~~~~~~di~i~V~GGG~sgQa~AirlaIa 80 (130) T TIGR03627 2 VITSGKRKTAIARATIREGKGRVRINGVPVEL-YPPELARLKIMEPLILAGDIAKEVDIDVKVSGGGIMGQADAARTAIA 80 (130) T ss_pred EEEECCCCCEEEEEEEECCCEEEEECCEEHHH-HCCHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCEEHHHHHHHHHHH T ss_conf 15567680779999999171799999987999-49999999998509998451558508999982883068879999999 Q ss_pred HHHHHH--CHHHHHHHHH--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 999765--7634487997--798335755416759528984335430579 Q gi|254780293|r 125 KALTYF--QPDLRPQIKK--GGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 125 rAL~~~--~p~~r~~Lk~--~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) |||+.| +|++++.|+. .||||+|||++||||||++||||+|||||| T Consensus 81 raL~~~~~~~~l~~~l~~~d~glLtrD~R~~ERKK~G~~kARk~~Q~SkR 130 (130) T TIGR03627 81 RGLVEFTGDKELRDAFRAYDRTLLVNDPRRKEPKKPGGRGARAKRQKSYR 130 (130) T ss_pred HHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99998635878899998553887466743323476898664737265679 No 7 >KOG1697 consensus Probab=100.00 E-value=0 Score=327.75 Aligned_cols=136 Identities=43% Similarity=0.616 Sum_probs=132.6 Q ss_pred CCCCCCCCCC----CCEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECC Q ss_conf 2346545435----500131112108999998189739994100888446989999999999873856641079998259 Q gi|254780293|r 35 IHSRKVDQWQ----RSYATGKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGG 110 (170) Q Consensus 35 ~~~~~~d~~g----~~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GG 110 (170) .....+|..| +++++||||+|+|+|++.+|+|+|.|||+++++||+...+|+.++.||..++.+++|||+|+|+|| T Consensus 136 ~~~k~~~~~g~~~~~~~~~g~rK~a~A~V~v~~GtGk~~vNg~~~~~yF~~~~~Re~ll~Pl~~~~~lg~~dv~atv~GG 215 (275) T KOG1697 136 RFRKPLDESGKPEVRIIAVGRRKCARATVKVQPGTGKFDVNGRDLDVYFQHLQHREQLLYPLAVSESLGKYDVTATVSGG 215 (275) T ss_pred HHHHCCCCCCCCCEEEEECCCEECCEEEEEEECCCEEEEECCHHHHHHHCCCHHHHHHHHHHHHHCCCCCEEEEEEECCC T ss_conf 98730254688751466125400011379995684048765602776751340388875018986014530379995389 Q ss_pred CCCHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 830188899999999997657634487997798335755416759528984335430579 Q gi|254780293|r 111 GLSGQASAICHGVAKALTYFQPDLRPQIKKGGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 111 G~sgQa~AirlaIarAL~~~~p~~r~~Lk~~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) |+|||++||+||||+||+.++|++++.|+++||||+|+|++||||||++||||+|||+|| T Consensus 216 G~sgqagAI~~gia~aL~~~~p~~~~~lr~aGlLTrD~R~vERKK~gq~kARkk~tW~KR 275 (275) T KOG1697 216 GPSGQAGAIRLGIAKALASFEPDLIEPLRLAGLLTRDPRVVERKKPGQPKARKKPTWKKR 275 (275) T ss_pred CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 855416789999999998539877648886475005627665135786011136631369 No 8 >PTZ00086 40S ribosomal protein S16; Provisional Probab=100.00 E-value=0 Score=321.09 Aligned_cols=127 Identities=29% Similarity=0.402 Sum_probs=116.8 Q ss_pred CCCCEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHC--CCCCEEEEEEEECCCCCHHHHHHH Q ss_conf 355001311121089999981897399941008884469899999999998738--566410799982598301888999 Q gi|254780293|r 43 WQRSYATGKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVS--QDNMYDVFATVSGGGLSGQASAIC 120 (170) Q Consensus 43 ~g~~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~--~~~~~Di~~~V~GGG~sgQa~Air 120 (170) ....+++||||||+|+|||++|+|+|+|||+|+++||+ ..+++.|++||.+++ ..++|||+|+|+|||+||||+||| T Consensus 8 ~k~i~~~GrRKtAvArv~l~~G~G~I~IN~k~~~~~~~-~~~~~~i~~Pl~~~~~~~~~~~Di~i~V~GGG~sgQa~Air 86 (147) T PTZ00086 8 SKQVQTFGKKKTAVAVATVTKGKGLIKVNGVPLDLILP-ETLRAKIMEPLLVVGSKRFSRLDIRVRVRGGGQVAQIYAIR 86 (147) T ss_pred CCEEEEECCCCCEEEEEEEECCCCEEEECCEEHHHHCC-HHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCHHHHHHHHH T ss_conf 71779775474279999998897509998914999689-99999996339774503135510899996787989999999 Q ss_pred HHHHHHHHHH-----CHHHHHHHHHC------CCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999999765-----76344879977------98335755416759528984335430579 Q gi|254780293|r 121 HGVAKALTYF-----QPDLRPQIKKG------GFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 121 laIarAL~~~-----~p~~r~~Lk~~------glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) |||||||+.| ++..++.|++. ||||+|||++||||||++||||+|||||| T Consensus 87 ~aIaRALv~~~~~~~d~~~~~~Lk~~~~~yd~glLtrD~R~~ErKK~G~~kARk~~Q~SkR 147 (147) T PTZ00086 87 QAIAKGIVAYYQKYVDEAEKAELKDKLLSYDRSLLIADPRRCEPKKFGGHGARARFQKSYR 147 (147) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999986111168667599999986464876346753225476898654636366679 No 9 >KOG1753 consensus Probab=99.93 E-value=8.2e-26 Score=179.79 Aligned_cols=125 Identities=30% Similarity=0.393 Sum_probs=111.4 Q ss_pred CCEEEEECCCEEEEEEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCC--CCEEEEEEEECCCCCHHHHHHHHH Q ss_conf 500131112108999998189739994100888446989999999999873856--641079998259830188899999 Q gi|254780293|r 45 RSYATGKRKTSIARVWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQD--NMYDVFATVSGGGLSGQASAICHG 122 (170) Q Consensus 45 ~~~~tGrRKtsvArV~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~--~~~Di~~~V~GGG~sgQa~Airla 122 (170) .....|++|+|+|.+..+.|+|.|.|||.|++.+ ....++.++++|+.+++.+ ..+||.+.|+|||..+|..|||++ T Consensus 8 Svq~fG~KK~AtAva~ck~G~glikvNg~ple~i-e~~~L~~Kl~Epvlllgk~rfa~vdi~v~~~ggghvsqiyairqa 86 (145) T KOG1753 8 SVQVFGRKKTATAVAHCKHGSGLIKVNGRPLELI-EPEILRYKLLEPVLLLGKERFAGVDIRVRVKGGGHVSQIYAIRQA 86 (145) T ss_pred CEEEECCCCCEEEEEEEECCCEEEEECCCCHHHC-CHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHH T ss_conf 1255436530478887503745799878136762-479999987425764306550787569998358508889999987 Q ss_pred HHHHHHHHCH-----HHHHHHHH------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999976576-----34487997------798335755416759528984335430579 Q gi|254780293|r 123 VAKALTYFQP-----DLRPQIKK------GGFLTRDSRIVERKKYGKAKARRSFQFSKR 170 (170) Q Consensus 123 IarAL~~~~p-----~~r~~Lk~------~glLt~D~R~~ErKK~G~~kARk~~q~skR 170 (170) ||+||+.+.. ..+..+|+ .-||..|||+.|+||||.++||.++|.|+| T Consensus 87 ~~kalvayyqkyvDE~skkeiKd~li~yDrtlLVADprr~esKKFGGpGAraryQksyr 145 (145) T KOG1753 87 IAKALVAYYQKYVDEQSKKEIKDILIQYDRTLLVADPRRCESKKFGGPGARARYQKSYR 145 (145) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCHHHHHHHHCC T ss_conf 36779999999888888999999998578568973766567645689616777654139 No 10 >TIGR02903 spore_lon_C ATP-dependent protease, Lon family; InterPro: IPR014252 This entry shows some relation to the widely distributed ATP-dependent protease La, also called Lon or LonA (IPR004815 from INTERPRO), but is more closely related to LonB (IPR014251 from INTERPRO), a LonA paralog found only in endospore-forming bacteria. Proteins in this entry are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ). They are restricted to a subset of endospore-forming species, and probably participate in the program of endospore formation. We propose the designation LonC.. Probab=93.88 E-value=0.099 Score=31.04 Aligned_cols=84 Identities=21% Similarity=0.238 Sum_probs=50.2 Q ss_pred CCCCCEEEEE------CCCEEEEEEE--ECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCC-CCEEEEEEEECCC- Q ss_conf 4355001311------1210899999--8189739994100888446989999999999873856-6410799982598- Q gi|254780293|r 42 QWQRSYATGK------RKTSIARVWI--KSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQD-NMYDVFATVSGGG- 111 (170) Q Consensus 42 ~~g~~~~tGr------RKtsvArV~l--~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~-~~~Di~~~V~GGG- 111 (170) +-|+.||-|= ==.==|.++= .||.|.|.+|.---.. .-.+..+.+---=.+|+.. ..|||++||-||| T Consensus 446 EvG~vFGLGV~gy~GS~lEIEa~aF~A~~~GkG~~RfNdTAGSM--aKDSvFNAasviRk~T~~D~~~yD~HVNViGGG~ 523 (616) T TIGR02903 446 EVGHVFGLGVSGYLGSVLEIEAVAFEAKEKGKGTVRFNDTAGSM--AKDSVFNAASVIRKITGKDLKDYDIHVNVIGGGR 523 (616) T ss_pred CEEEEEECCCCCCCCCEEEEEEEEECCCCCCCEEEEEECCCCCC--HHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCC T ss_conf 30468704212100233355504423778995058861565530--3577898899986530468341651788852770 Q ss_pred ---CCHHHHHHHHHHHHHHH Q ss_conf ---30188899999999997 Q gi|254780293|r 112 ---LSGQASAICHGVAKALT 128 (170) Q Consensus 112 ---~sgQa~AirlaIarAL~ 128 (170) .|+ --||-+.|-=||. T Consensus 524 IDGPSA-G~Ai~~~~~SA~~ 542 (616) T TIGR02903 524 IDGPSA-GAAITLCIISAIL 542 (616) T ss_pred CCCCCH-HHHHHHHHHHHHC T ss_conf 175325-7999999999870 No 11 >TIGR01193 bacteriocin_ABC ABC-type bacteriocin transporter; InterPro: IPR005897 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains ABC-type bacteriocin transporter. In general, bacteriocins are agents which are responsible for killing or inhibiting the closely related species or even different strains of the same species. Bacteriocins are encoded by bacterial plasmids. Bacteriocins are named after the species and hence in literature one encounters various names e.g., leucocin from Leuconostic geldium; pedicocin from Pedicoccus acidilactici; sakacin from Lactobacillus sake etc. Peptide bacteriocins are exported across the cytoplasmic membrane by a dedicated ATP-binding cassette (ABC) transporter. These ABC-transporters have an N-terminal peptidase domain that belong to MEROPS peptidase family C39 (clan CA); a central multi-pass transmembrane region and a C-terminal ABC transporter domain. These transporters have dual function: (i) they remove the N-terminal leader peptide from its bacteriocin precursor by cleavage at a Gly-Gly bond and (ii) transport the mature bacteriocin across the cytoplasmic membrane. This represents a novel strategy for secretion of bacterial proteins . Many bacteria are known to regulate diverse physiological processes through this system, such as bioluminescence, regulation of sporulation, virulence factor expression, antibiotics production, competence for genetic transformation, and activation of biofilm formation .; GO: 0008234 cysteine-type peptidase activity, 0019534 toxin transporter activity, 0006810 transport, 0016020 membrane. Probab=93.40 E-value=0.048 Score=33.01 Aligned_cols=77 Identities=23% Similarity=0.365 Sum_probs=52.6 Q ss_pred EECCCEEEE--E-EE--ECCCCEEEEECCCHHHHCCCHHHHHHHH----HHHHHHCC----------------------- Q ss_conf 111210899--9-99--8189739994100888446989999999----99987385----------------------- Q gi|254780293|r 50 GKRKTSIAR--V-WI--KSGSGKFTINYVDISKYFTQDLLVLNIK----RPFNTVSQ----------------------- 97 (170) Q Consensus 50 GrRKtsvAr--V-~l--~~GsG~I~INgk~~~~Yf~~~~~r~~i~----~Pl~~~~~----------------------- 97 (170) |==||.-|. | +- +|++|.|.+||.++.+ +.+-.+|..|. +|+..+|. T Consensus 510 GSGKsTLaKLLV~FfePQ~~sG~I~Lng~~l~~-iD~h~LRq~INYlPQeP~IF~GsILeNLLlGak~~~~~~~i~~A~~ 588 (710) T TIGR01193 510 GSGKSTLAKLLVGFFEPQAESGEILLNGISLKD-IDRHELRQFINYLPQEPYIFSGSILENLLLGAKENVSQDEILKAVE 588 (710) T ss_pred CCCHHHHHHHHHCCCCCCCCCCEEEECCCCHHH-CCHHHHCCCCCCCCCCCEEECCCHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 974899998752035899887736527824453-3734441233556887845123178876503789989899998840 Q ss_pred ----C-------CCEEEEEEEECCCCCH-HHHHHHHHHHHHHHH Q ss_conf ----6-------6410799982598301-888999999999976 Q gi|254780293|r 98 ----D-------NMYDVFATVSGGGLSG-QASAICHGVAKALTY 129 (170) Q Consensus 98 ----~-------~~~Di~~~V~GGG~sg-Qa~AirlaIarAL~~ 129 (170) . -.|+-...++|+-+|| |-+ |+||||||+. T Consensus 589 iAEIk~DIe~mp~Gy~T~LS~~~~~iSGGQKQ--RialARaLL~ 630 (710) T TIGR01193 589 IAEIKDDIEKMPLGYQTELSEEGSSISGGQKQ--RIALARALLT 630 (710) T ss_pred HCCCHHHHHHCCCCCCEEECCCCCCCCCHHHH--HHHHHHHHCC T ss_conf 20112458318887660001037411515899--9999998618 No 12 >TIGR00958 3a01208 antigen peptide transporter 2; InterPro: IPR005293 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Proteins of this family are involved in the transport of antigens from the cytoplasm to a membrane-bound compartment for association with MHC class I molecules.; GO: 0005215 transporter activity, 0006810 transport. Probab=92.19 E-value=0.059 Score=32.42 Aligned_cols=79 Identities=24% Similarity=0.305 Sum_probs=55.3 Q ss_pred EEECCCEEEEE--EE-ECCCCEEEEECCCHHHHCCCHHHHHHH---HHHHHHH--------------------------- Q ss_conf 31112108999--99-818973999410088844698999999---9999873--------------------------- Q gi|254780293|r 49 TGKRKTSIARV--WI-KSGSGKFTINYVDISKYFTQDLLVLNI---KRPFNTV--------------------------- 95 (170) Q Consensus 49 tGrRKtsvArV--~l-~~GsG~I~INgk~~~~Yf~~~~~r~~i---~~Pl~~~--------------------------- 95 (170) .|.=|++||-- .+ .|-+|.++|.|+|+.+|=-+..|+... -||.... T Consensus 568 sGsGKStvaaLL~n~Y~Pt~G~vLlDg~Pl~~y~H~YLH~~V~~VgQEPvLf~gSvr~NIaYGL~~~~T~~~~~aaA~~a 647 (770) T TIGR00958 568 SGSGKSTVAALLQNLYQPTGGQVLLDGVPLSQYDHHYLHRQVALVGQEPVLFSGSVRENIAYGLTKTPTDEEVTAAAKAA 647 (770) T ss_pred CCCCHHHHHHHHHHCCCCCCCEEEECCCCHHHHCCEECCCEEEEEECCCEECCCCHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 99839999999985578986568776846133265010002543320543024631665651689998278999999861 Q ss_pred -------CCCCCEEEEEEEECCCC-CHHHHHHHHHHHHHHHH Q ss_conf -------85664107999825983-01888999999999976 Q gi|254780293|r 96 -------SQDNMYDVFATVSGGGL-SGQASAICHGVAKALTY 129 (170) Q Consensus 96 -------~~~~~~Di~~~V~GGG~-sgQa~AirlaIarAL~~ 129 (170) +....||-.|--+|.=+ -||-++ .||||||+. T Consensus 648 ~ahdFI~~~~~gydT~vGE~G~qlSgGQKQR--iAiARALvR 687 (770) T TIGR00958 648 NAHDFISELPNGYDTEVGEKGSQLSGGQKQR--IAIARALVR 687 (770) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH--HHHHHHHHH T ss_conf 3764435788986754089787523038999--999988751 No 13 >KOG0058 consensus Probab=89.27 E-value=0.49 Score=26.77 Aligned_cols=78 Identities=26% Similarity=0.315 Sum_probs=53.7 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHHHHH------------------------------HHHH--- Q ss_conf 3111210899999---818973999410088844698999999------------------------------9999--- Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLVLNI------------------------------KRPF--- 92 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r~~i------------------------------~~Pl--- 92 (170) .|.=|+++|..-+ .|-+|.|.++|+|+.+|=+ .++|..| ...- T Consensus 503 SGsGKSTiasLL~rfY~PtsG~IllDG~~i~~~~~-~~lr~~Ig~V~QEPvLFs~sI~eNI~YG~~~~t~e~i~~AAk~A 581 (716) T KOG0058 503 SGSGKSTIASLLLRFYDPTSGRILLDGVPISDINH-KYLRRKIGLVGQEPVLFSGSIRENIAYGLDNATDEEIEAAAKMA 581 (716) T ss_pred CCCCHHHHHHHHHHHCCCCCCEEEECCEEHHHCCH-HHHHHHEEEEECCCEEECCCHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 99888999999997368888738778834564187-99987714310265550451787775278888899999999984 Q ss_pred ----HHHCCCCCEEEEEEEECCCCCH-HHHHHHHHHHHHHHH Q ss_conf ----8738566410799982598301-888999999999976 Q gi|254780293|r 93 ----NTVSQDNMYDVFATVSGGGLSG-QASAICHGVAKALTY 129 (170) Q Consensus 93 ----~~~~~~~~~Di~~~V~GGG~sg-Qa~AirlaIarAL~~ 129 (170) .+.+....||-.|--+|-=+|| |-+ |.||||||+. T Consensus 582 Nah~FI~~~p~gY~T~VGEkG~qLSGGQKQ--RIAIARALlr 621 (716) T KOG0058 582 NAHEFITNFPDGYNTVVGEKGSQLSGGQKQ--RIAIARALLR 621 (716) T ss_pred CHHHHHHHCCCCCCCCCCCCCCCCCCHHHH--HHHHHHHHHH T ss_conf 738899828765556547754606631889--9999999850 No 14 >TIGR01846 type_I_sec_HlyB type I secretion system ATPase; InterPro: IPR010132 Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This entry contains one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N terminus, but rather carry signals located toward the extreme C terminus to direct type I secretion.; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane. Probab=86.62 E-value=0.63 Score=26.09 Aligned_cols=85 Identities=18% Similarity=0.221 Sum_probs=56.2 Q ss_pred CCCCCEE-E---EECCCEEEEE--EE-ECCCCEEEEECCCHHHHCCCHHHHHH---HHH--------------------- Q ss_conf 4355001-3---1112108999--99-81897399941008884469899999---999--------------------- Q gi|254780293|r 42 QWQRSYA-T---GKRKTSIARV--WI-KSGSGKFTINYVDISKYFTQDLLVLN---IKR--------------------- 90 (170) Q Consensus 42 ~~g~~~~-t---GrRKtsvArV--~l-~~GsG~I~INgk~~~~Yf~~~~~r~~---i~~--------------------- 90 (170) ..|...| | |==|+..++- ++ .|-.|.|.|.|.|+-..=|+ .+|-. ||+ T Consensus 489 ~~Ge~IGIvGpSGSGKSTLTKL~QRLYtP~~GqVLVDG~DLA~~DP~-wLRRQ~gVVLQEn~LFsrSirdNI~l~~P~~~ 567 (703) T TIGR01846 489 KPGEVIGIVGPSGSGKSTLTKLLQRLYTPEHGQVLVDGVDLAIADPA-WLRRQVGVVLQENVLFSRSIRDNIALCNPGAS 567 (703) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCC-HHHHCCEEEEECCHHHCCCHHHHHHHCCCCCC T ss_conf 78657998727898678999998861488887477703000101852-01022302340002101227889851777769 Q ss_pred ------HHHHH-------CCCCCEEEEEEEECCCCCH-HHHHHHHHHHHHHHH Q ss_conf ------99873-------8566410799982598301-888999999999976 Q gi|254780293|r 91 ------PFNTV-------SQDNMYDVFATVSGGGLSG-QASAICHGVAKALTY 129 (170) Q Consensus 91 ------Pl~~~-------~~~~~~Di~~~V~GGG~sg-Qa~AirlaIarAL~~ 129 (170) -=.++ ++-.-||-.+--+||++|| |.+ |.||||||+. T Consensus 568 ~E~Vi~AA~LAGA~dFI~~Lp~GY~T~vgE~G~~LSGGQRQ--RIAiARALv~ 618 (703) T TIGR01846 568 VEHVIAAAKLAGAHDFISELPQGYNTEVGEKGANLSGGQRQ--RIAIARALVA 618 (703) T ss_pred HHHHHHHHHHCCHHHHHHHCCCCCCCCCEEECCCCCCHHHH--HHHHHHHHHC T ss_conf 89999998621858999736376774001241566733789--9999999735 No 15 >COG4988 CydD ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=85.19 E-value=0.45 Score=27.00 Aligned_cols=80 Identities=20% Similarity=0.303 Sum_probs=49.5 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHHH------------------------------HHHHHHHHH Q ss_conf 3111210899999---8189739994100888446989999------------------------------999999873 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLVL------------------------------NIKRPFNTV 95 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r~------------------------------~i~~Pl~~~ 95 (170) .|-=|++...+.+ .|.+|.|+|||.++.+ +....++. .+.+-+..+ T Consensus 356 SGaGKSTLl~lL~G~~~~~~G~I~vng~~l~~-l~~~~~~k~i~~v~Q~p~lf~gTireNi~l~~~~~s~e~i~~al~~a 434 (559) T COG4988 356 SGAGKSTLLNLLLGFLAPTQGEIRVNGIDLRD-LSPEAWRKQISWVSQNPYLFAGTIRENILLARPDASDEEIIAALDQA 434 (559) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCCCCCC-CCHHHHHHHEEEECCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 99978999999847577778448889931000-68778886724627998405641887773168767899999999982 Q ss_pred CCC------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 856------64107999825983018889999999999765 Q gi|254780293|r 96 SQD------NMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 96 ~~~------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) +.. ...|-.+--.|-|+|| -+|-|+++||||+.- T Consensus 435 ~l~~~v~~p~GLdt~ige~G~~LSg-GQ~QRlaLARAll~~ 474 (559) T COG4988 435 GLLEFVPKPDGLDTVIGEGGAGLSG-GQAQRLALARALLSP 474 (559) T ss_pred CHHHHHCCCCCCCCCCCCCCCCCCH-HHHHHHHHHHHHCCC T ss_conf 5787616777643302158887789-999999999985588 No 16 >PRK11176 lipid transporter ATP-binding/permease protein; Provisional Probab=84.94 E-value=2 Score=22.93 Aligned_cols=86 Identities=22% Similarity=0.246 Sum_probs=53.2 Q ss_pred CCCCEE----EEECCCEEEEEEE---ECCCCEEEEECCCHHHH------------------CC-----------C-HHHH Q ss_conf 355001----3111210899999---81897399941008884------------------46-----------9-8999 Q gi|254780293|r 43 WQRSYA----TGKRKTSIARVWI---KSGSGKFTINYVDISKY------------------FT-----------Q-DLLV 85 (170) Q Consensus 43 ~g~~~~----tGrRKtsvArV~l---~~GsG~I~INgk~~~~Y------------------f~-----------~-~~~r 85 (170) .|..+| +|-=||..++.-+ .|-+|.|.|||.|+.+| |. . .... T Consensus 367 ~G~~vaiVG~SGsGKSTL~~LL~gly~p~~G~I~idg~di~~~~~~~lr~~i~~V~Q~~~lF~~TI~eNi~~~~~~~~~~ 446 (581) T PRK11176 367 AGKTVALVGRSGSGKSTIANLLTRFYDIDEGEILLDGHDLRDYTLASLRNQVALVSQNVHLFNDTVANNIAYARTEQYSR 446 (581) T ss_pred CCCEEECCCCCCCCHHHHHHHHHHHCCCCCCEEEECCEEHHHCCHHHHHCCCCCCCCCCEEECCCHHHHHHCCCCCCCCH T ss_conf 99443122899986789999998536678874878988512147666503455607776110772999972267432456 Q ss_pred HHHHHHHHHHCC-------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 999999987385-------66410799982598301888999999999976 Q gi|254780293|r 86 LNIKRPFNTVSQ-------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 86 ~~i~~Pl~~~~~-------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) +.+.+.+..++. .+.||-.+.-.|..+| .-+.=|++|||||+. T Consensus 447 ~~i~~a~~~a~~~~~i~~lp~gldt~vge~G~~LS-gGQrQRialARall~ 496 (581) T PRK11176 447 EQIEEAARMAYAMDFINKMDNGLDTVIGENGVLLS-GGQRQRIAIARALLR 496 (581) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 89999999877778998655411450048989789-999999999999967 No 17 >PTZ00265 multidrug resistance protein (mdr1); Provisional Probab=84.80 E-value=1.3 Score=24.08 Aligned_cols=65 Identities=25% Similarity=0.311 Sum_probs=39.6 Q ss_pred CCCCEEEEECCCHHHHCCCHHHH------------------------------HHHHHHHHHHC-------CCCCEEEEE Q ss_conf 18973999410088844698999------------------------------99999998738-------566410799 Q gi|254780293|r 63 SGSGKFTINYVDISKYFTQDLLV------------------------------LNIKRPFNTVS-------QDNMYDVFA 105 (170) Q Consensus 63 ~GsG~I~INgk~~~~Yf~~~~~r------------------------------~~i~~Pl~~~~-------~~~~~Di~~ 105 (170) |.+|.|.|+|.++.++ +...+| +.|.+-...++ +...||-.+ T Consensus 1275 p~sG~I~IDG~DI~~i-~L~~LRs~iaiVpQeP~LF~GTIReNI~yg~~d~t~eeI~~A~k~Ani~dfI~sLP~G~dT~V 1353 (1467) T PTZ00265 1275 KNNGEILLDDIDICDY-NLKDLRNLFSIVSQEPMLFNMSIYENIKFGKEDATLEDVKRACKFAAIDEFIESLPNKYDTNV 1353 (1467) T ss_pred CCCCEEEECCEECCCC-CHHHHHHCCCEECCCCCEECCCHHHHHCCCCCCCCHHHHHHHHHHCCCHHHHHHCCCCCCCEE T ss_conf 8883699899872307-999998578578989802274889980889999999999999987488889863666558862 Q ss_pred EEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 982598301888999999999976 Q gi|254780293|r 106 TVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 106 ~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .-.|.-+| .-+-=|++|||||+. T Consensus 1354 ge~G~nLS-gGQKQrIaIARALLR 1376 (1467) T PTZ00265 1354 GPYGKSLS-GGQKQRIAIARALLR 1376 (1467) T ss_pred CCCCCCCC-HHHHHHHHHHHHHHH T ss_conf 69993369-999999999999976 No 18 >PRK11174 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=84.06 E-value=1.7 Score=23.41 Aligned_cols=79 Identities=19% Similarity=0.175 Sum_probs=50.2 Q ss_pred EEECCCEEEEEE--EECCCCEEEEECCCHHHHCCCHHHH------------------------------HHHHHHHHHHC Q ss_conf 311121089999--9818973999410088844698999------------------------------99999998738 Q gi|254780293|r 49 TGKRKTSIARVW--IKSGSGKFTINYVDISKYFTQDLLV------------------------------LNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~--l~~GsG~I~INgk~~~~Yf~~~~~r------------------------------~~i~~Pl~~~~ 96 (170) +|-=||..++.- ..|-+|.|.|||.|+.+|=. ..+| +.+.+-+..++ T Consensus 385 SGsGKSTl~~LL~g~~p~~G~I~i~g~di~~i~~-~~lr~~i~~V~Q~~~LF~~TI~eNI~~g~~~atdeei~~A~~~a~ 463 (588) T PRK11174 385 SGAGKTSLLNALLGFLPYQGSLKINGIELRELDP-ESWRKHLSWVGQNPQLFHGTLRDNVLLANPDASDEQLQQALENAW 463 (588) T ss_pred CCCCHHHHHHHHHHHCCCCCEEEECCEECCCCCH-HHHHHHEEEECCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf 9864999999998728988389999986030899-999966035166677776629986533585433457999999862 Q ss_pred C-------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 5-------66410799982598301888999999999976 Q gi|254780293|r 97 Q-------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~-------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) . .+.||-.+.-.|.++|| -+.=|++|||||+. T Consensus 464 ~~~~I~~Lp~GldT~vge~G~~LSG-GQrQRiaiARAll~ 502 (588) T PRK11174 464 VSEFVPLLPQGLDTPIGDQAAGLSV-GQAQRLALARALLQ 502 (588) T ss_pred HHHHHHHCCCCCCCHHCCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 4789984513223632288887799-99999999999837 No 19 >PRK13657 cyclic beta-1,2-glucan ABC transporter; Provisional Probab=83.95 E-value=0.33 Score=27.81 Aligned_cols=86 Identities=22% Similarity=0.246 Sum_probs=52.8 Q ss_pred CCCCEE----EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHHH----------------------------- Q ss_conf 355001----3111210899999---8189739994100888446989999----------------------------- Q gi|254780293|r 43 WQRSYA----TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLVL----------------------------- 86 (170) Q Consensus 43 ~g~~~~----tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r~----------------------------- 86 (170) .|..+| +|-=||+.++.-+ .|-+|.|.|||.|+.++ +...+|. T Consensus 360 ~Ge~vaiVG~SGsGKSTL~~LL~gly~p~~G~I~idg~di~~i-~~~~lR~~i~~V~Q~~~LF~gTI~eNI~~g~~~~sd 438 (585) T PRK13657 360 PGQTVAIVGPTGAGKSTLINLLHRVFDPQSGRIRIDGTDIRTV-TRASLRRNIGVVFQEAGLFNRSIEDNLRVGRPDATD 438 (585) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHCCCCCCCEEECCEECHHC-CHHHHHHHCCEECCCCCCCCHHHHHHHHCCCCCCCH T ss_conf 9988999889898699999998601578879675898961016-899998525221667635476599887527999986 Q ss_pred -HHHHHHHHHCC-------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf -99999987385-------664107999825983018889999999999765 Q gi|254780293|r 87 -NIKRPFNTVSQ-------DNMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 87 -~i~~Pl~~~~~-------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) .+.+-+..++. .+.||-.+.-.|-.+| .-+.=|+||||||+.- T Consensus 439 ~ei~~a~~~a~~~~~I~~lp~G~dT~vge~G~~LS-GGQrQRialARAll~~ 489 (585) T PRK13657 439 EEMRAAAERAQALDFIERKEDGYDTVVGERGRQLS-GGERQRLAIARALLKD 489 (585) T ss_pred HHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCCCC-HHHHHHHHHHHHHHCC T ss_conf 88999999843279997175666680137888689-9999999999999469 No 20 >TIGR02211 LolD_lipo_ex lipoprotein releasing system, ATP-binding protein; InterPro: IPR011924 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents LolD, a member of the ABC transporter family. LolD is involved in localization of lipoproteins in some bacteria. It works with a transmembrane protein LolC, which in some species is a paralogous pair LolC and LolE. Depending on the residue immediately following the modified N-terminal Cys residue, the nascent lipoprotein may be carried further by LolA and LolB to the outer membrane, or remain at the inner membrane. Excluded from this entry are homologs from the archaeal genus Methanosarcina .; GO: 0005524 ATP binding, 0006810 transport, 0016020 membrane. Probab=83.43 E-value=1.6 Score=23.63 Aligned_cols=67 Identities=16% Similarity=0.125 Sum_probs=42.5 Q ss_pred CCCCEEEEECCCH----------------------HHHCCCHHHHHHHHHHHHHHCCC------------CCEEE--EEE Q ss_conf 1897399941008----------------------88446989999999999873856------------64107--999 Q gi|254780293|r 63 SGSGKFTINYVDI----------------------SKYFTQDLLVLNIKRPFNTVSQD------------NMYDV--FAT 106 (170) Q Consensus 63 ~GsG~I~INgk~~----------------------~~Yf~~~~~r~~i~~Pl~~~~~~------------~~~Di--~~~ 106 (170) |-+|.|.+||+++ +.-.+..+.-+.|+-|+.+-+.. .++.. .++ T Consensus 57 PT~G~v~f~G~~l~~lS~~~~~~LRN~~LGFiYQFHHLL~dFtaLENVaMP~LIg~~s~~ea~~~A~~mL~~VgL~~R~~ 136 (221) T TIGR02211 57 PTSGEVLFNGQSLSKLSSNERAKLRNKKLGFIYQFHHLLPDFTALENVAMPLLIGKKSKKEAKERAYEMLEKVGLEHRIN 136 (221) T ss_pred CCCCEEEECCCCHHHCCHHHHHHHHHHHCCCEEEHHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 96315897063234404467887512225844432020300002688777775358998899999999988607334455 Q ss_pred EECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 825983018889999999999765 Q gi|254780293|r 107 VSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 107 V~GGG~sgQa~AirlaIarAL~~~ 130 (170) =+=+=+||= +==|-||||||+.- T Consensus 137 h~PSELSGG-ERQRvAIARALvN~ 159 (221) T TIGR02211 137 HRPSELSGG-ERQRVAIARALVNQ 159 (221) T ss_pred CCCCCCCCC-HHHHHHHHHHHHCC T ss_conf 577734563-37999999986189 No 21 >PRK11160 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=81.83 E-value=2.3 Score=22.59 Aligned_cols=79 Identities=24% Similarity=0.325 Sum_probs=50.5 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHH------------------------------HHHHHHHHHH Q ss_conf 3111210899999---818973999410088844698999------------------------------9999999873 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLV------------------------------LNIKRPFNTV 95 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r------------------------------~~i~~Pl~~~ 95 (170) +|-=||..+..-+ .|-+|.|.|||.|+.+| +...+| +.+.+-+..+ T Consensus 376 SGsGKSTL~~LL~r~ydp~~G~I~idG~di~~~-~~~~lr~~i~~V~Q~~~LF~~TI~eNi~~g~~~atdee~~~al~~a 454 (575) T PRK11160 376 TGCGKSTLLQLLTRAWDPQQGEILLNGQPIASY-SEAALRQAISVVSQRVHLFSGTLRDNLLLAAPNATDEALIEVLQQV 454 (575) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCC-CHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 997599999998623678998899998975638-8899987613567776025886789987258987899999999753 Q ss_pred CC------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 85------66410799982598301888999999999976 Q gi|254780293|r 96 SQ------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 96 ~~------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) +. ...||-.+--.|..+|| -+.=|++|||||+. T Consensus 455 ~l~~~~~~p~GldT~vGe~G~~LSG-GQrQRiaiARAll~ 493 (575) T PRK11160 455 GLEKLLEGDKGLDAWLGEGGRQLSG-GEQRRLGIARALLH 493 (575) T ss_pred CCHHHHHCCCCCCCCCCCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 7064563676534503788995999-99999999999945 No 22 >cd03229 ABC_Class3 This class is comprised of all BPD (Binding Protein Dependent) systems that are largely represented in archaea and eubacteria and are primarily involved in scavenging solutes from the environment. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=80.09 E-value=0.8 Score=25.45 Aligned_cols=75 Identities=21% Similarity=0.281 Sum_probs=35.8 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHHCCC-HHHHHHHHHHHH---HHCCCCCE-EEEEEEECCCCCHHHHHHH Q ss_conf 31112108999---9981897399941008884469-899999999998---73856641-0799982598301888999 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKYFTQ-DLLVLNIKRPFN---TVSQDNMY-DVFATVSGGGLSGQASAIC 120 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Yf~~-~~~r~~i~~Pl~---~~~~~~~~-Di~~~V~GGG~sgQa~Air 120 (170) .|-=||...++ .+.|-+|.|.+||+++...-.. ...+..+-.-|. +.....-+ +|..-.+|| +.=| T Consensus 35 SG~GKSTllr~i~Gl~~p~~G~I~~~g~~i~~~~~~~~~~rr~ig~vFQ~~~L~p~~tv~eNv~~~LSGG------q~QR 108 (178) T cd03229 35 SGSGKSTLLRCIAGLEEPDSGSILIDGEDLTDLEDELPPLRRRIGMVFQDFALFPHLTVLENIALGLSGG------QQQR 108 (178) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCCHHHHHCCCEEEEECCCCCCCCCCHHHHHCEECCCH------HHHH T ss_conf 9983999999998599999639999999998886102454177599926998899892899600817726------8899 Q ss_pred HHHHHHHHH Q ss_conf 999999976 Q gi|254780293|r 121 HGVAKALTY 129 (170) Q Consensus 121 laIarAL~~ 129 (170) .||||||+. T Consensus 109 vaIARAL~~ 117 (178) T cd03229 109 VALARALAM 117 (178) T ss_pred HHHHHHHHC T ss_conf 999999852 No 23 >TIGR02857 CydD ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydD; InterPro: IPR014216 The gene pair cydCD encodes an ABC-family transporter in which each gene contains an N-terminal membrane-spanning domain and a C-terminal ATP-binding domain. In Escherichia coli these genes were discovered as mutants which caused the terminal heme-copper oxidase complex cytochrome bd to fail to assemble. Recent work has shown that the transporter is involved in the export of redox-active thiol compounds such as cysteine and glutathione , . The linkage to assembly of the cytochrome bd complex is further supported by the conserved operon structure found outside the gammaproteobacteria (cydABCD) containing both the transporter and oxidase genes components. The genes used as the seed members for this entry are all either found in the gammaproteobacterial context, or the CydABCD context. All members of this entry scoring above the trusted cut off at the time of its creation were from genomes which encode a cytochrome bd complex. Unfortunately, the gene symbol nomenclature, based on this operon in Bacillus subtilis, assigns cydC to the third gene in the operon, where this gene is actually homologous to the E. coli cydD gene. We have chosen to name all homologues in this family in accordance with the precedence of publication of the E. coli name, CydD.. Probab=79.22 E-value=0.89 Score=25.14 Aligned_cols=86 Identities=24% Similarity=0.289 Sum_probs=60.0 Q ss_pred CCCEEEEECCCEEEEEEE--ECC---------------------------CCEEEE------ECCCHHHHCCCH-HHHHH Q ss_conf 550013111210899999--818---------------------------973999------410088844698-99999 Q gi|254780293|r 44 QRSYATGKRKTSIARVWI--KSG---------------------------SGKFTI------NYVDISKYFTQD-LLVLN 87 (170) Q Consensus 44 g~~~~tGrRKtsvArV~l--~~G---------------------------sG~I~I------Ngk~~~~Yf~~~-~~r~~ 87 (170) .-+|+-||++.++.-|-+ .|| .|.|.| ||.|+.+ |... .++.. T Consensus 356 ~~~y~~G~~~pa~~~~sf~~~pG~~vAl~G~SGaGKSTLL~lLLGf~~P~~G~i~v~~~Gg~~G~~L~~-~~~~W~W~~~ 434 (570) T TIGR02857 356 SVAYAPGRDEPALRPVSFTVEPGERVALVGPSGAGKSTLLNLLLGFVEPTEGAIVVERDGGINGVPLAE-ADADWSWRDQ 434 (570) T ss_pred EEECCCCCHHHCCCCCCEEECCCCEEEEEECCCCCHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCC-CCHHHHHHHC T ss_conf 686278880103788541663870488862799978899999971576446468874488853976421-1310225541 Q ss_pred HHH----HHHHHCCC---------------------------------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 999----99873856---------------------------------64107999825983018889999999999765 Q gi|254780293|r 88 IKR----PFNTVSQD---------------------------------NMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 88 i~~----Pl~~~~~~---------------------------------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) |-. |+..-++. .-+|-.+-=+|-|+|| =++=|+|+||||+.- T Consensus 435 iaWv~Q~P~l~~gtiaeNi~La~~~A~~~~~~~A~~~agl~~~~~~LP~Gl~t~~Ge~G~GLSG-Gq~QRlALARafl~~ 513 (570) T TIGR02857 435 IAWVPQHPFLFAGTIAENIRLARPDASDAEIREALERAGLDEFVAALPQGLDTPIGEGGAGLSG-GQAQRLALARAFLRD 513 (570) T ss_pred EEEECCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCH-HHHHHHHHHHHHHCH T ss_conf 3430566601364799999734899998999999999728999997111313435888884779-999999999996061 Q ss_pred C Q ss_conf 7 Q gi|254780293|r 131 Q 131 (170) Q Consensus 131 ~ 131 (170) . T Consensus 514 ~ 514 (570) T TIGR02857 514 A 514 (570) T ss_pred H T ss_conf 1 No 24 >COG4608 AppF ABC-type oligopeptide transport system, ATPase component [Amino acid transport and metabolism] Probab=78.41 E-value=0.95 Score=24.97 Aligned_cols=71 Identities=25% Similarity=0.353 Sum_probs=50.4 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCC----CCEEEEEEEECCCCCHHHHHHHH Q ss_conf 3111210899999---8189739994100888446989999999999873856----64107999825983018889999 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQD----NMYDVFATVSGGGLSGQASAICH 121 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~----~~~Di~~~V~GGG~sgQa~Airl 121 (170) .|-=||..||+.+ .|-+|.|..+|+++.. +.....+..+.+-|..++.. .+|+-.. +|| |-+ |. T Consensus 48 SG~GKSTlgr~i~~L~~pt~G~i~f~g~~i~~-~~~~~~~~~v~elL~~Vgl~~~~~~ryPhel--SGG----QrQ--Ri 118 (268) T COG4608 48 SGCGKSTLGRLILGLEEPTSGEILFEGKDITK-LSKEERRERVLELLEKVGLPEEFLYRYPHEL--SGG----QRQ--RI 118 (268) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCCCHHH-CCHHHHHHHHHHHHHHHCCCHHHHHCCCCCC--CCH----HHH--HH T ss_conf 88877879999972838887269986853111-3666799999999998098877863388303--731----335--69 Q ss_pred HHHHHHH Q ss_conf 9999997 Q gi|254780293|r 122 GVAKALT 128 (170) Q Consensus 122 aIarAL~ 128 (170) +|||||. T Consensus 119 ~IARALa 125 (268) T COG4608 119 GIARALA 125 (268) T ss_pred HHHHHHH T ss_conf 9999985 No 25 >COG3842 PotA ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism] Probab=78.35 E-value=0.58 Score=26.28 Aligned_cols=48 Identities=23% Similarity=0.371 Sum_probs=31.8 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHH--------------HH--CCCHHHHHHHHHHHHHHC Q ss_conf 31112108999---99818973999410088--------------84--469899999999998738 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDIS--------------KY--FTQDLLVLNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~--------------~Y--f~~~~~r~~i~~Pl~~~~ 96 (170) .|-=||..=|. ...|.+|.|.+||+++. +| ||+.+-...|-.||.+-. T Consensus 40 SGcGKTTlLR~IAGfe~p~~G~I~l~g~~i~~lpp~kR~ig~VFQ~YALFPHmtV~~NVafGLk~~~ 106 (352) T COG3842 40 SGCGKTTLLRMIAGFEQPSSGEILLDGEDITDVPPEKRPIGMVFQSYALFPHMTVEENVAFGLKVRK 106 (352) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHCC T ss_conf 8888899999996777888865999999988899422652326067666888858997553331057 No 26 >cd03214 ABC_Iron-Siderophores_B12_Hemin ABC transporters, involved in the uptake of siderophores, heme, and vitamin B12, are widely conserved in bacteria and archaea. Only very few species lack representatives of the siderophore family transporters. The E. coli BtuCD protein is an ABC transporter mediating vitamin B12 uptake. The two ATP-binding cassettes (BtuD) are in close contact with each other, as are the two membrane-spanning subunits (BtuC); this arrangement is distinct from that observed for the E. coli lipid flippase MsbA. The BtuC subunits provide 20 transmembrane helices grouped around a translocation pathway that is closed to the cytoplasm by a gate region, whereas the dimer arrangement of the BtuD subunits resembles the ATP-bound form of the Rad50 DNA repair enzyme. A prominent cytoplasmic loop of BtuC forms the contact region with the ATP-binding cassette and represent a conserved motif among the ABC transporters. Probab=78.23 E-value=0.59 Score=26.27 Aligned_cols=76 Identities=17% Similarity=0.258 Sum_probs=44.7 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHHCCCHHHHH--HHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 31112108999---998189739994100888446989999--9999998738566410799982598301888999999 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKYFTQDLLVL--NIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGV 123 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Yf~~~~~r~--~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaI 123 (170) .|-=||+..++ .+.|-+|.|.+||+++..+-+....+. .+.+-|..++....-+=.+.--.|| |-+ |.+| T Consensus 34 nGsGKTTLl~~i~G~~~~~~G~I~~~g~~i~~~~~~~~~~~i~~v~Q~l~~~~l~~~~~~~~~~LSGG---qkQ--rv~i 108 (180) T cd03214 34 NGAGKSTLLKTLAGLLKPSSGEILLDGKDLASLSPKELARKIAYVPQALELLGLAHLADRPFNELSGG---ERQ--RVLL 108 (180) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHCCCCHHHHHHHHCCCHHHHCCCHHHCCHH---HHH--HHHH T ss_conf 99889999999957989987289999999896999999554649999999859977864991037999---999--9999 Q ss_pred HHHHHH Q ss_conf 999976 Q gi|254780293|r 124 AKALTY 129 (170) Q Consensus 124 arAL~~ 129 (170) ||||+. T Consensus 109 A~aL~~ 114 (180) T cd03214 109 ARALAQ 114 (180) T ss_pred HHHHHH T ss_conf 999986 No 27 >KOG3288 consensus Probab=77.94 E-value=1.5 Score=23.84 Aligned_cols=32 Identities=28% Similarity=0.311 Sum_probs=27.8 Q ss_pred CCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 64107999825983018889999999999765 Q gi|254780293|r 99 NMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 99 ~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) .+|.+.|.|.+-|+.||.+|.+||-+-.-+.| T Consensus 273 ~~ftlRC~~Cq~glvGq~ea~eHA~~TGH~nF 304 (307) T KOG3288 273 AKFTLRCMVCQMGLVGQKEAAEHAKATGHVNF 304 (307) T ss_pred CCEEEEEEECCCCEEEHHHHHHHHHHCCCCCC T ss_conf 11489864215420208999998886087764 No 28 >cd03252 ABCC_Hemolysin The ABC-transporter hemolysin B is a central component of the secretion machinery that translocates the toxin, hemolysin A, in a Sec-independent fashion across both membranes of E. coli. The hemolysin A (HlyA) transport machinery is composed of the ATP-binding cassette (ABC) transporter HlyB located in the inner membrane, hemolysin D (HlyD), also anchored in the inner membrane, and TolC, which resides in the outer membrane. HlyD apparently forms a continuous channel that bridges the entire periplasm, interacting with TolC and HlyB. This arrangement prevents the appearance of periplasmic intermediates of HlyA during substrate transport. Little is known about the molecular details of HlyA transport, but it is evident that ATP-hydrolysis by the ABC-transporter HlyB is a necessary source of energy. Probab=77.85 E-value=5.5 Score=20.26 Aligned_cols=80 Identities=16% Similarity=0.223 Sum_probs=47.3 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCC-----------------CHHHH------------HHHHHHHHHHC Q ss_conf 3111210899999---8189739994100888446-----------------98999------------99999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFT-----------------QDLLV------------LNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~-----------------~~~~r------------~~i~~Pl~~~~ 96 (170) +|-=||+.+++.+ .|-+|.|.|||.++.++=+ +...+ ..+..-+..++ T Consensus 37 sGsGKSTll~ll~gl~~p~~G~I~i~g~di~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNi~~g~~~~~~~~i~~a~~~~~ 116 (237) T cd03252 37 SGSGKSTLTKLIQRFYVPENGRVLVDGHDLALADPAWLRRQVGVVLQENVLFNRSIRDNIALADPGMSMERVIEAAKLAG 116 (237) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHHCEEEECCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHC T ss_conf 99859999999967765798789999999551899999860189958771557828988723899999999999999968 Q ss_pred CC-------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 56-------6410799982598301888999999999976 Q gi|254780293|r 97 QD-------NMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~~-------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .. ..+|-.+.-.|..+| .-+.-|++|||||+. T Consensus 117 l~~~i~~l~~g~~t~i~~~g~~LS-gGqkQRlalARall~ 155 (237) T cd03252 117 AHDFISELPEGYDTIVGEQGAGLS-GGQRQRIAIARALIH 155 (237) T ss_pred CHHHHHHCCCCCCEEECCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 544466366666205579788489-999999999999966 No 29 >COG1132 MdlB ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms] Probab=77.61 E-value=5.8 Score=20.14 Aligned_cols=86 Identities=20% Similarity=0.215 Sum_probs=50.5 Q ss_pred CCCCEE----EEECCCEEEEE--EE-ECCCCEEEEECCCHHHHCCCHHHHHH---------------------------- Q ss_conf 355001----31112108999--99-81897399941008884469899999---------------------------- Q gi|254780293|r 43 WQRSYA----TGKRKTSIARV--WI-KSGSGKFTINYVDISKYFTQDLLVLN---------------------------- 87 (170) Q Consensus 43 ~g~~~~----tGrRKtsvArV--~l-~~GsG~I~INgk~~~~Yf~~~~~r~~---------------------------- 87 (170) .|..+| +|=-|+..+.. .+ .|-+|+|.|||.++.++.. ..+|.. T Consensus 354 ~Ge~vaiVG~sGsGKSTl~~LL~r~~~~~~G~I~idg~dI~~i~~-~~lr~~i~~V~Qd~~LF~~TI~~NI~~g~~~at~ 432 (567) T COG1132 354 PGEKVAIVGPSGSGKSTLIKLLLRLYDPTSGEILIDGIDIRDISL-DSLRKRIGIVSQDPLLFSGTIRENIALGRPDATD 432 (567) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHCCCCCCEEEECCEEHHHCCH-HHHHHHEEEECCCCHHHHHHHHHHHHCCCCCCCH T ss_conf 898788855888857899999986158888369899997775385-6788782354664277776699997457889999 Q ss_pred --HHHHHHHHCC-------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf --9999987385-------664107999825983018889999999999765 Q gi|254780293|r 88 --IKRPFNTVSQ-------DNMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 88 --i~~Pl~~~~~-------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) +.+-+..++. ...||-.+.- +|..-+.-+--|++|||||+.- T Consensus 433 eei~~a~k~a~~~d~I~~lp~g~dt~vge-~G~~LSgGQrQriaiARall~~ 483 (567) T COG1132 433 EEIEEALKLANAHEFIANLPDGYDTIVGE-RGVNLSGGQRQRLAIARALLRN 483 (567) T ss_pred HHHHHHHHHHCCHHHHHHCCCCCCCEECC-CCCCCCHHHHHHHHHHHHHHHC T ss_conf 99999999948637898476667836057-8871889999999999997036 No 30 >cd03244 ABCC_MRP_domain2 Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=77.20 E-value=4.2 Score=21.02 Aligned_cols=80 Identities=25% Similarity=0.231 Sum_probs=46.1 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHHHC------------CCH-----HHHH-----------HHHHHHHHHCC Q ss_conf 311121089999---9818973999410088844------------698-----9999-----------99999987385 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISKYF------------TQD-----LLVL-----------NIKRPFNTVSQ 97 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~Yf------------~~~-----~~r~-----------~i~~Pl~~~~~ 97 (170) .|-=||..++.. +.|-+|.|.|||+++.++- +++ ..++ .+...+..++. T Consensus 39 sGsGKSTLl~ll~gl~~p~~G~I~i~g~~i~~~~~~~~r~~i~~v~Q~~~lf~~Tv~eNi~~~~~~~~~~i~~~l~~~~l 118 (221) T cd03244 39 TGSGKSSLLLALFRLVELSSGSILIDGVDISKIGLHDLRSRISIIPQDPVLFSGTIRSNLDPFGEYSDEELWQALERVGL 118 (221) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHHEEEEECCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHCH T ss_conf 99989999999967971898489999999661999999740799930352356008875071023799999998875160 Q ss_pred -------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf -------66410799982598301888999999999976 Q gi|254780293|r 98 -------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 98 -------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ...+|..+.-.|..+|| -+.-|++|||||+. T Consensus 119 ~~~~~~~~~~l~~~~~~~~~~LSg-GqkQrv~lARal~~ 156 (221) T cd03244 119 KEFVESLPGGLDTVVEEGGENLSV-GQRQLLCLARALLR 156 (221) T ss_pred HHHHHHCCCCCCCEECCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 899974104445470366677999-99999999999965 No 31 >TIGR03375 type_I_sec_LssB type I secretion system ATPase, LssB family. Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N-terminus, but rather carry signals located toward the extreme C-terminus to direct type I secretion. This model is related to models TIGR01842 and TIGR01846, and to bacteriocin ABC transporters that cleave their substrates during export. Probab=77.08 E-value=0.93 Score=25.04 Aligned_cols=80 Identities=20% Similarity=0.210 Sum_probs=50.3 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCH-----------------------------HHHHHHHHHHHHHC Q ss_conf 3111210899999---818973999410088844698-----------------------------99999999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQD-----------------------------LLVLNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~-----------------------------~~r~~i~~Pl~~~~ 96 (170) +|-=||+.+++-. .|-+|.|.+||.++.+|=+.. ...+.+.+-+..++ T Consensus 500 sGsGKSTL~kll~Gl~~p~~G~i~idg~~~~~~~~~~~r~~i~~v~Q~~~lf~gTi~eNi~~~~~~~~~~~i~~a~~~a~ 579 (694) T TIGR03375 500 IGSGKSTLLKLLLGLYQPTEGSVLLDGVDIRQIDPADLRRNIGYVPQDPRLFYGTLRDNIALGAPYADDEEILRAAELAG 579 (694) T ss_pred CCCCHHHHHHHHCCCCCCCCCEEEECCEECCCCCHHHHHHHCEEECCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHC T ss_conf 89878899998556758998879989854254999999730213576771107469999841699999999999999819 Q ss_pred CC-------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 56-------6410799982598301888999999999976 Q gi|254780293|r 97 QD-------NMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~~-------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .. ..||-.+.-.|..+| .-+-=|++|||||+. T Consensus 580 l~~~I~~lp~g~~t~i~e~G~~LS-gGqrQri~lARAl~~ 618 (694) T TIGR03375 580 VTEFVRRHPDGLDMQIGERGRSLS-GGQRQAVALARALLR 618 (694) T ss_pred CHHHHHHCCCCCCCEECCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 799997185667877468999468-999999999999957 No 32 >TIGR01192 chvA glucan exporter ATP-binding protein; InterPro: IPR005896 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains glucan exporter ATP binding protein in bacteria. It belongs to the larger ABC transporter superfamily with the characteristic ATP binding motif. In general, this protein is in some ways implicated in osmo-regulation and is suggested to participate in the export of glucan from the cytoplasm to periplasm. The cyclic beta-1,2-glucan in the bacterial periplasmic space is suggested to confer the property of high osmolority. It has also been demonstrated that mutants in this loci have lost functions of virulence and motility. It is unclear as to how virulence and osmo-adaptaion are related. ; GO: 0005215 transporter activity, 0006810 transport. Probab=76.82 E-value=6.3 Score=19.92 Aligned_cols=84 Identities=27% Similarity=0.358 Sum_probs=61.8 Q ss_pred EEECCCEEEEE--EE-ECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCC--------------------------- Q ss_conf 31112108999--99-8189739994100888446989999999999873856--------------------------- Q gi|254780293|r 49 TGKRKTSIARV--WI-KSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQD--------------------------- 98 (170) Q Consensus 49 tGrRKtsvArV--~l-~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~--------------------------- 98 (170) ||==||..-.. ++ .|-.|.|.|+|.++.. ..+.++|.-+-.-|.=.|+. T Consensus 370 TGAGKTTLiNLLQRVydP~~G~I~IDG~~I~~-v~r~SLR~s~ATVFQDAGL~~R~Ir~NI~lG~~~At~~E~~~Aa~aA 448 (592) T TIGR01192 370 TGAGKTTLINLLQRVYDPKVGQILIDGIDINT-VTRESLRKSIATVFQDAGLLNRSIRENIRLGREGATDEEVVEAAKAA 448 (592) T ss_pred CCCCHHHHHHHHHHHCCCCCCEEEECCEEHHH-HCHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCCCCHHHHHHHHHHH T ss_conf 99717899887753106986548872424001-03477887887787310200131233200377778778999999754 Q ss_pred ----------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCHHH Q ss_conf ----------641079998259830188899999999997657634 Q gi|254780293|r 99 ----------NMYDVFATVSGGGLSGQASAICHGVAKALTYFQPDL 134 (170) Q Consensus 99 ----------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~~p~~ 134 (170) +.||-.+--+|.-+||= +==|+|||||+++--|-+ T Consensus 449 aA~DFI~~~~~~Y~T~VGERG~~LSGG-ERQR~AIARAILKNAPIL 493 (592) T TIGR01192 449 AASDFIEKRLNGYDTLVGERGNRLSGG-ERQRLAIARAILKNAPIL 493 (592) T ss_pred HHHHHHHHHHCCCCEEECCCCCCCCCC-HHHHHHHHHHHHHCCCEE T ss_conf 366778864134440011467747772-378999999997238846 No 33 >cd03288 ABCC_SUR2 The SUR domain 2. The sulfonylurea receptor SUR is an ATP binding cassette (ABC) protein of the ABCC/MRP family. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=76.68 E-value=6.3 Score=19.90 Aligned_cols=79 Identities=23% Similarity=0.212 Sum_probs=48.0 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHHCCCHHHH-----------------------------HHHHHHHHHHC Q ss_conf 31112108999---99818973999410088844698999-----------------------------99999998738 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKYFTQDLLV-----------------------------LNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Yf~~~~~r-----------------------------~~i~~Pl~~~~ 96 (170) +|-=||+.++. .+.|-+|.|.|||.++.+|-. ..+| ..+.+.+..++ T Consensus 56 sGsGKSTL~~ll~gl~~p~~G~I~idg~di~~~~~-~~lr~~i~~v~Q~~~lf~~Ti~~Ni~~~~~~~~~~i~~al~~~~ 134 (257) T cd03288 56 TGSGKSSLSLAFFRMVDIFDGKIVIDGIDISKLPL-HTLRSRLSIILQDPILFSGSIRFNLDPECKCTDDRLWEALEIAQ 134 (257) T ss_pred CCCCHHHHHHHHHHCCCCCCCEEEECCEEHHHCCH-HHHHHHEEEEECCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 99819999999960566788889999899687999-99975057994567343613554137566768999999999962 Q ss_pred C-------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 5-------66410799982598301888999999999976 Q gi|254780293|r 97 Q-------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~-------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) . .+.+|-.+.-.|..+|| -+.-|++|||||+. T Consensus 135 l~~~i~~lp~gl~t~l~~~g~~LSg-GQrQri~lARAll~ 173 (257) T cd03288 135 LKNMVKSLPGGLDAVVTEGGENFSV-GQRQLFCLARAFVR 173 (257) T ss_pred HHHHHHHCCCCCCEEECCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 3799973758985174687683699-99999999999955 No 34 >PRK10789 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=76.06 E-value=1.8 Score=23.22 Aligned_cols=86 Identities=19% Similarity=0.162 Sum_probs=52.1 Q ss_pred CCCCEE----EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCC-----------------CHHHHH------------ Q ss_conf 355001----3111210899999---8189739994100888446-----------------989999------------ Q gi|254780293|r 43 WQRSYA----TGKRKTSIARVWI---KSGSGKFTINYVDISKYFT-----------------QDLLVL------------ 86 (170) Q Consensus 43 ~g~~~~----tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~-----------------~~~~r~------------ 86 (170) .|..++ +|-=||+.++.-+ .|-+|.|.|||.|+.+|=. +.+.++ T Consensus 340 ~Ge~vaIVG~SGsGKSTLl~LL~g~y~p~~G~I~idg~di~~i~~~~lR~~I~~V~Q~~~LF~~TI~eNI~lg~~~~~~e 419 (569) T PRK10789 340 PGQMLGICGPTGSGKSTLLSLIQRHFDVSEGDIRFHDIPLTKLQLDSWRSRLAVVSQTPFLFSDTVANNIALGRPDATQQ 419 (569) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEECEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCHH T ss_conf 99789987999998799999999776426787465010134257688863147658875025662999986579776545 Q ss_pred HHHHHHHHHC-------CCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 9999998738-------566410799982598301888999999999976 Q gi|254780293|r 87 NIKRPFNTVS-------QDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 87 ~i~~Pl~~~~-------~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .+.+-+..++ +...||-.+.-.|..+| .-+-=|++|||||+. T Consensus 420 ei~~a~~~a~l~~~i~~lp~G~dT~ige~G~~LS-GGQrQRialARAll~ 468 (569) T PRK10789 420 EIEHVARLASVHEDILRLPQGYDTEVGERGVMLS-GGQKQRISIARALLL 468 (569) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 8999999855568764375532371268889969-999999999999954 No 35 >cd03248 ABCC_TAP TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes. Probab=75.91 E-value=3.8 Score=21.28 Aligned_cols=87 Identities=22% Similarity=0.205 Sum_probs=49.9 Q ss_pred CCCCCEE----EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCC-----------------CHHHHHHH--------- Q ss_conf 4355001----3111210899999---8189739994100888446-----------------98999999--------- Q gi|254780293|r 42 QWQRSYA----TGKRKTSIARVWI---KSGSGKFTINYVDISKYFT-----------------QDLLVLNI--------- 88 (170) Q Consensus 42 ~~g~~~~----tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~-----------------~~~~r~~i--------- 88 (170) ..|..++ +|-=||..++..+ .|-+|.|.|||.++.+|-. +...++.+ T Consensus 38 ~~Ge~vaIvG~sGsGKSTL~~ll~gl~~p~~G~I~idg~~i~~~~~~~lr~~i~~v~Q~~~lf~~ti~eNi~~g~~~~~~ 117 (226) T cd03248 38 HPGEVTALVGPSGSGKSTVVALLENFYQPQGGQVLLDGKPISQYEHKYLHSKVSLVGQEPVLFARSLQDNIAYGLQSCSF 117 (226) T ss_pred CCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEECCEEHHHCCHHHHHHCEEEEECCCEECCCCHHHHHHCCCCCCCH T ss_conf 29999999999998499999999645467887899999993448999997326999247957677356666327899999 Q ss_pred ---HHHHHHHC-------CCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf ---99998738-------566410799982598301888999999999976 Q gi|254780293|r 89 ---KRPFNTVS-------QDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 89 ---~~Pl~~~~-------~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .+-+..++ ....+|-.+.-.|..+| .-+.-|++|||||+. T Consensus 118 ~~i~~~~~~~~~~~~i~~l~~gl~t~i~~~g~~LS-gGqkQRialARal~~ 167 (226) T cd03248 118 ECVKEAAQKAHAHSFISELASGYDTEVGEKGSQLS-GGQKQRVAIARALIR 167 (226) T ss_pred HHHHHHHHHHCHHHHHHHCCCCCCCEECCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 99999999966146776263666406168488769-999999999999975 No 36 >COG1126 GlnQ ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism] Probab=75.81 E-value=2.7 Score=22.17 Aligned_cols=74 Identities=22% Similarity=0.312 Sum_probs=42.4 Q ss_pred EECCCEEEEE--EE-ECCCCEEEEECCCHH-------------------HHCCCHH-HHHHHHHHHHHH----------- Q ss_conf 1112108999--99-818973999410088-------------------8446989-999999999873----------- Q gi|254780293|r 50 GKRKTSIARV--WI-KSGSGKFTINYVDIS-------------------KYFTQDL-LVLNIKRPFNTV----------- 95 (170) Q Consensus 50 GrRKtsvArV--~l-~~GsG~I~INgk~~~-------------------~Yf~~~~-~r~~i~~Pl~~~----------- 95 (170) |-=|+..-|+ .| .+-+|.|.|+|.++. +.||+.. +.+..+.|..+. T Consensus 38 GSGKSTlLRclN~LE~~~~G~I~i~g~~~~~~~~~~~~R~~vGmVFQ~fnLFPH~TvleNv~lap~~v~~~~k~eA~~~A 117 (240) T COG1126 38 GSGKSTLLRCLNGLEEPDSGSITVDGEDVGDKKDILKLRRKVGMVFQQFNLFPHLTVLENVTLAPVKVKKLSKAEAREKA 117 (240) T ss_pred CCCHHHHHHHHHCCCCCCCCEEEECCEECCCHHHHHHHHHHCCEECCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 99888999999778688786499998722545469999985576624665465532988877753997298999999999 Q ss_pred -------CCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHH Q ss_conf -------856641079998259830188899999999997 Q gi|254780293|r 96 -------SQDNMYDVFATVSGGGLSGQASAICHGVAKALT 128 (170) Q Consensus 96 -------~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~ 128 (170) ++..+.|.+-.--.||.- =|-||||||+ T Consensus 118 ~~lL~~VGL~dka~~yP~qLSGGQq-----QRVAIARALa 152 (240) T COG1126 118 LELLEKVGLADKADAYPAQLSGGQQ-----QRVAIARALA 152 (240) T ss_pred HHHHHHCCCHHHHHHCCCCCCCHHH-----HHHHHHHHHC T ss_conf 9999986955666539510480788-----9999999871 No 37 >cd03249 ABC_MTABC3_MDL1_MDL2 MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another. Probab=74.91 E-value=7 Score=19.61 Aligned_cols=80 Identities=20% Similarity=0.274 Sum_probs=48.1 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHC------------CC-----HHHHHH------------HHHHHHHHC Q ss_conf 3111210899999---818973999410088844------------69-----899999------------999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYF------------TQ-----DLLVLN------------IKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf------------~~-----~~~r~~------------i~~Pl~~~~ 96 (170) +|-=||+.+...+ .|-+|.|.|||.++.+|- ++ -+.++. +.+.+..++ T Consensus 38 sGsGKSTLl~ll~gl~~p~~G~I~idg~~i~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNi~~g~~~~~~~~~~~a~~~~~ 117 (238) T cd03249 38 SGCGKSTVVSLLERFYDPTSGEILLDGVDIRDLNLRWLRSQIGLVSQEPVLFDGTIAENIRYGKPDATDEEVEEAAKKAN 117 (238) T ss_pred CCCCHHHHHHHHHHCCCCCCCEEEECCEECCCCCHHHHHHCEEEECCCCEECCCCHHHHHHCCCCCCCHHHHHHHHHHHC T ss_conf 99989999999823861885189999999231899999740699915896727529999633687899999999999978 Q ss_pred C-------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 5-------66410799982598301888999999999976 Q gi|254780293|r 97 Q-------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~-------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) . ...+|-.+.-.|..+| .-+.=|++|||||.. T Consensus 118 l~~~i~~lp~gl~t~ige~G~~LS-gGQ~QRialARAl~~ 156 (238) T cd03249 118 IHDFIMSLPDGYDTLVGERGSQLS-GGQKQRIAIARALLR 156 (238) T ss_pred CCHHHHHCCCCCCEEECCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 524355186877225078678589-999999999999965 No 38 >cd03251 ABCC_MsbA MsbA is an essential ABC transporter, closely related to eukaryotic MDR proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=71.35 E-value=4.9 Score=20.55 Aligned_cols=80 Identities=20% Similarity=0.291 Sum_probs=47.9 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHH----------------------------C-CCHHHHHHHHHHHHHHC Q ss_conf 3111210899999---81897399941008884----------------------------4-69899999999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKY----------------------------F-TQDLLVLNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Y----------------------------f-~~~~~r~~i~~Pl~~~~ 96 (170) +|--||+..+..+ .|-+|.|.|||.++.++ | ......+.+.+-+..++ T Consensus 37 sGsGKSTLl~ll~gl~~p~~G~I~i~g~~i~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNI~~g~~~~~~~~i~~~l~~~~ 116 (234) T cd03251 37 SGSGKSTLVNLIPRFYDVDSGRILIDGHDVRDYTLASLRRQIGLVSQDVFLFNDTVAENIAYGRPGATREEVEEAARAAN 116 (234) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHCEEEECCCCEECCCCHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf 99829999999966766788689999999660899999731799936894716459999614798999999999999863 Q ss_pred C-------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 5-------66410799982598301888999999999976 Q gi|254780293|r 97 Q-------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~-------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) . ...+|-.+.-.|-.+|| -+.=|++|||||+. T Consensus 117 l~~~i~~l~~gl~t~i~~~g~~LSg-GQ~QRi~lARal~~ 155 (234) T cd03251 117 AHEFIMELPEGYDTVIGERGVKLSG-GQRQRIAIARALLK 155 (234) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 4554202778875563887895899-99999999999964 No 39 >PRK10790 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=70.99 E-value=1.7 Score=23.41 Aligned_cols=79 Identities=18% Similarity=0.178 Sum_probs=50.0 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHHH-----------------------------HHHHHHHHHC Q ss_conf 3111210899999---8189739994100888446989999-----------------------------9999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLVL-----------------------------NIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r~-----------------------------~i~~Pl~~~~ 96 (170) +|-=||+.++.-+ .|.+|.|.|||.|+.++-. ..+|. .+.+-+..++ T Consensus 376 SGsGKSTL~~LL~rly~p~~G~I~idG~di~~i~~-~~lR~~i~~V~Q~~~LF~gTI~eNi~~g~~~~~~~i~~a~~~a~ 454 (593) T PRK10790 376 TGSGKSTLASLLMGYYPLTEGEIRLDGRPLSSLSH-SVLRQGVAMVQQDPVVLADTFLANVTLGRDISEEQVWQALETVQ 454 (593) T ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCCCEECCCCCH-HHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 98868999999998556789941659932442468-88863157516665145652999977600236799999999977 Q ss_pred -------CCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf -------566410799982598301888999999999976 Q gi|254780293|r 97 -------QDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 -------~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) +...||-.+.-.|..+|| -+.=|++|||||+. T Consensus 455 l~~~i~~lp~G~dT~vge~G~~LSg-GQrQRiaiARall~ 493 (593) T PRK10790 455 LAELARSLSDGLYTPLGEQGNTLSV-GQKQLLALARVLVE 493 (593) T ss_pred HHHHHHHCCCHHCCHHCCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 8999985742010442387688799-99999999999955 No 40 >pfam02120 Flg_hook Flagellar hook-length control protein FliK. This is the C terminal domain of FliK. FliK controls the length of the flagellar hook by directly measuring the hook length as a molecular ruler. This family also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. Probab=70.80 E-value=2.9 Score=22.01 Aligned_cols=52 Identities=23% Similarity=0.204 Sum_probs=45.5 Q ss_pred CCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCC Q ss_conf 5664107999825983018889999999999765763448799779833575 Q gi|254780293|r 97 QDNMYDVFATVSGGGLSGQASAICHGVAKALTYFQPDLRPQIKKGGFLTRDS 148 (170) Q Consensus 97 ~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~~p~~r~~Lk~~glLt~D~ 148 (170) .+|.++|.+.+.+|+++.+-.|-.......|...-+.++..|...||-..+. T Consensus 23 ~LG~v~i~l~~~~~~l~v~~~~~~~~~~~~L~~~~~~L~~~L~~~G~~~~~~ 74 (85) T pfam02120 23 ELGPVEVRLSLDGGQLSVRFTAENPETRELLEQNLPRLREALAAQGLELADL 74 (85) T ss_pred HCCCEEEEEEEECCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 8295899999968979999997878999999998999999999869984157 No 41 >cd03254 ABCC_Glucan_exporter_like Glucan exporter ATP-binding protein. In A. tumefaciens cyclic beta-1, 2-glucan must be transported into the periplasmic space to exert its action as a virluence factor. This subfamily belongs to the MRP-like family and is involved in drug, peptide, and lipid export. The MRP-like family, similar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains each composed of six transmembrane (TM) helices and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=70.51 E-value=5.7 Score=20.16 Aligned_cols=80 Identities=23% Similarity=0.293 Sum_probs=47.2 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHH-----------------------------CCCHHHHHHHHHHHHHHC Q ss_conf 3111210899999---81897399941008884-----------------------------469899999999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKY-----------------------------FTQDLLVLNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Y-----------------------------f~~~~~r~~i~~Pl~~~~ 96 (170) +|-=||+..++.+ .|-+|.|.|||.++.++ +........+.+-+..++ T Consensus 38 sGsGKSTLl~ll~gl~~p~~G~I~idg~~i~~~~~~~~r~~i~~v~Q~~~lf~~Ti~~Ni~~~~~~~~~~~~~~~l~~~~ 117 (229) T cd03254 38 TGAGKTTLINLLMRFYDPQKGQILIDGIDIRDISRKSLRSMIGVVLQDTFLFSGTIMENIRLGRPNATDEEVIEAAKEAG 117 (229) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHCEEEECCCCEECCCCHHHHHHCCCCCCCHHHHHHHHHHHC T ss_conf 99809999999966866787389999999541899999632899903898757459998405899999999999999840 Q ss_pred CC-------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 56-------6410799982598301888999999999976 Q gi|254780293|r 97 QD-------NMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~~-------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .. ..+|-.+.-.|..+| .-+.-|++|||||+. T Consensus 118 l~~~i~~l~~gl~t~i~~~g~~LS-gGq~Qri~lARal~~ 156 (229) T cd03254 118 AHDFIMKLPNGYDTVLGENGGNLS-QGERQLLAIARAMLR 156 (229) T ss_pred CHHHHHHCCCCCCCCCCCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 045887584645786157864499-999999999999951 No 42 >PRK10522 multidrug transporter membrane component/ATP-binding component; Provisional Probab=67.68 E-value=8.5 Score=19.10 Aligned_cols=37 Identities=14% Similarity=0.175 Sum_probs=24.8 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHHH Q ss_conf 3111210899999---8189739994100888446989999 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLVL 86 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r~ 86 (170) +|-=|+..+..-+ .|-+|.|.|||.|+.+|-. ..+|. T Consensus 358 SGsGKSTL~~LL~r~y~p~~G~I~idG~di~~~~~-~~lR~ 397 (547) T PRK10522 358 NGSGKSTLAMLLTGLYQPQSGEILLDGKPVTAEQP-EDYRK 397 (547) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCH-HHHHH T ss_conf 99977999999828966999869899999996899-99985 No 43 >cd03253 ABCC_ATM1_transporter ATM1 is an ABC transporter that is expressed in the mitochondria. Although the specific function of ATM1 is unknown, its disruption results in the accumulation of excess mitochondrial iron, loss of mitochondrial cytochromes, oxidative damage to mitochondrial DNA, and decreased levels of cytosolic heme proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=65.06 E-value=11 Score=18.31 Aligned_cols=80 Identities=23% Similarity=0.275 Sum_probs=47.4 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHHC------------C-----CHHHHHH------------HHHHHHHHC Q ss_conf 3111210899999---818973999410088844------------6-----9899999------------999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKYF------------T-----QDLLVLN------------IKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Yf------------~-----~~~~r~~------------i~~Pl~~~~ 96 (170) +|-=||+.+++.+ .|-+|.|.|||.++.++= + +.+.++. +.+-+..++ T Consensus 36 sGsGKSTLl~ll~gl~~p~~G~I~idg~di~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNi~~g~~~~~~~~i~~~~~~~~ 115 (236) T cd03253 36 SGSGKSTILRLLFRFYDVSSGSILIDGQDIREVTLDSLRRAIGVVPQDTVLFNDTIGYNIRYGRPDATDEEVIEAAKAAQ 115 (236) T ss_pred CCCCHHHHHHHHCCCCCCCCCEEEECCEECCCCCHHHHHHCEEEEECCCEEECCCHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 99989999999743854887489999999231899999723789950780116758998865677799999999999850 Q ss_pred CC-------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 56-------6410799982598301888999999999976 Q gi|254780293|r 97 QD-------NMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 97 ~~-------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .. ..+|-.+.-.|-.+| .-+-=|++|||||+. T Consensus 116 l~~~i~~lp~gl~t~ig~~g~~LS-gGQ~QrialARal~~ 154 (236) T cd03253 116 IHDKIMRFPDGYDTIVGERGLKLS-GGEKQRVAIARAILK 154 (236) T ss_pred HHHHHHCCCCCCCCEECCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 024554487666766278888269-999999999999964 No 44 >COG1750 Archaeal serine proteases [General function prediction only] Probab=63.08 E-value=13 Score=18.06 Aligned_cols=72 Identities=18% Similarity=0.286 Sum_probs=54.4 Q ss_pred EEEEECCCCEEEEECCCHHHH-CCCHHHHHHHHHHHHHHCCC-CCEEEEEEEEC-----CCCCHHHHHHHHHHHHHHHHH Q ss_conf 999981897399941008884-46989999999999873856-64107999825-----983018889999999999765 Q gi|254780293|r 58 RVWIKSGSGKFTINYVDISKY-FTQDLLVLNIKRPFNTVSQD-NMYDVFATVSG-----GGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 58 rV~l~~GsG~I~INgk~~~~Y-f~~~~~r~~i~~Pl~~~~~~-~~~Di~~~V~G-----GG~sgQa~AirlaIarAL~~~ 130 (170) .|.+.||.|...|-+-|++.- +. .+.|.....-+.++|.. .+||+++.|.- ||.| -.+++-+|+--++..+ T Consensus 55 ~vtv~pG~G~v~v~t~P~t~~d~~-~SArvAa~~A~~~~Gvd~ssyd~~i~v~a~~pVVGgPS-agg~mtva~~~~~~~~ 132 (579) T COG1750 55 SVTVTPGDGRVYVATFPYTQIDMQ-GSARVAAGVALRLAGVDMSSYDVYIAVEADSPVVGGPS-AGGYMTVAIYAALMGW 132 (579) T ss_pred EEEECCCCCEEEEECCCCCHHCCC-HHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCEECCCC-CCHHHHHHHHHHHHCC T ss_conf 666528986388636777400221-34578888778862787532258999943797435865-1007599999999578 Q ss_pred C Q ss_conf 7 Q gi|254780293|r 131 Q 131 (170) Q Consensus 131 ~ 131 (170) + T Consensus 133 ~ 133 (579) T COG1750 133 S 133 (579) T ss_pred C T ss_conf 8 No 45 >cd03261 ABC_Org_Solvent_Resistant ABC (ATP-binding cassette) transport system involved in resistant to organic solvents; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=62.49 E-value=4.7 Score=20.70 Aligned_cols=76 Identities=24% Similarity=0.356 Sum_probs=43.3 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHH-------------------H--CCCHHHHHHHHHHHHHHCC------- Q ss_conf 311121089999---98189739994100888-------------------4--4698999999999987385------- Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISK-------------------Y--FTQDLLVLNIKRPFNTVSQ------- 97 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~-------------------Y--f~~~~~r~~i~~Pl~~~~~------- 97 (170) .|-=||...+.. +.|-+|.|.+||+++.. + ||+..-.+.|..||...+. T Consensus 35 SGsGKSTll~~i~gL~~p~~G~I~~~g~~i~~~~~~~~~~~r~~ig~vfQ~~~Lf~~lTv~eNv~~~l~~~~~~~~~~~~ 114 (235) T cd03261 35 SGSGKSTLLRLIVGLLRPDSGEVLIDGEDISGLSEAELYRLRRRMGMLFQSGALFDSLTVFENVAFPLREHTRLSEEEIR 114 (235) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCHHHHH T ss_conf 99729999999975999898589999999998998899997578299704986589996999999999995799999999 Q ss_pred ------------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf ------------66410799982598301888999999999976 Q gi|254780293|r 98 ------------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 98 ------------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ...-+-+..--.|| .-=|.+|||||+. T Consensus 115 ~r~~~~L~~vgL~~~~~~~p~~LSGG-----q~QRvaIARALv~ 153 (235) T cd03261 115 EIVLEKLEAVGLRGAEDLYPAELSGG-----MKKRVALARALAL 153 (235) T ss_pred HHHHHHHHHCCCCHHHHCCCCCCCHH-----HHHHHHHHHHHHC T ss_conf 99999998679925764784106999-----9999999999854 No 46 >cd03298 ABC_ThiQ_thiamine_transporter ABC-type thiamine tranport system; part of the binding-protein-dependent transport system tbpA-thiPQ for thiamine and TPP. Probably responsible for the translocation of thiamine across the membrane. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=62.42 E-value=4.2 Score=21.00 Aligned_cols=76 Identities=20% Similarity=0.289 Sum_probs=41.1 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHH----------------HCCCHHHHHHHHHHHHH--------------- Q ss_conf 31112108999---998189739994100888----------------44698999999999987--------------- Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISK----------------YFTQDLLVLNIKRPFNT--------------- 94 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~----------------Yf~~~~~r~~i~~Pl~~--------------- 94 (170) .|-=||..-+. .+.|-+|+|.|||+++.. .||+..-.+.|..|+.. T Consensus 33 SGsGKSTLl~li~Gl~~p~sG~I~i~G~di~~~~~~~r~ig~vfQ~~~L~p~ltV~eNi~~~l~~~~~~~~~~~~~v~~~ 112 (211) T cd03298 33 SGSGKSTLLNLIAGFETPQSGRVLINGVDVTAAPPADRPVSMLFQENNLFAHLTVEQNVGLGLSPGLKLTAEDRQAIEVA 112 (211) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 99559999999976999885299999999999998898679995388668999499998758864688829999999999 Q ss_pred ---HCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf ---38566410799982598301888999999999976 Q gi|254780293|r 95 ---VSQDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 95 ---~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ++.....|-+..--.|| +.=|.+|||||+. T Consensus 113 l~~~gl~~~~~~~p~~LSGG-----qkQRvaiARAL~~ 145 (211) T cd03298 113 LARVGLAGLEKRLPGELSGG-----ERQRVALARVLVR 145 (211) T ss_pred HHHCCCHHHHHCCHHHCCHH-----HHHHHHHHHHHHC T ss_conf 98769987872894558989-----9999999999865 No 47 >cd03246 ABCC_Protease_Secretion This family represents the ABC component of the protease secretion system PrtD, a 60-kDa integral membrane protein sharing 37% identity with HlyB, the ABC component of the alpha-hemolysin secretion pathway, in the C-terminal domain. They export degradative enzymes by using a type I protein secretion system and lack an N-terminal signal peptide, but contain a C-terminal secretion signal. The Type I secretion apparatus is made up of three components, an ABC transporter, a membrane fusion protein (MFP), and an outer membrane protein (OMP). For the HlyA transporter complex, HlyB (ABC transporter) and HlyD (MFP) reside in the inner membrane of E. coli. The OMP component is TolC, which is thought to interact with the MFP to form a continuous channel across the periplasm from the cytoplasm to the exterior. HlyB belongs to the family of ABC transporters, which are ubiquitous, ATP-dependent transmembrane pumps or channels. The spectrum of transport substra Probab=62.10 E-value=0.87 Score=25.22 Aligned_cols=70 Identities=23% Similarity=0.277 Sum_probs=40.8 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHHCCCHHHHHHHHH----HHHHHCCCCCEEEEEEEECCCCCHHHHHHHH Q ss_conf 31112108999---9981897399941008884469899999999----9987385664107999825983018889999 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKYFTQDLLVLNIKR----PFNTVSQDNMYDVFATVSGGGLSGQASAICH 121 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~----Pl~~~~~~~~~Di~~~V~GGG~sgQa~Airl 121 (170) +|-=||+.+++ .+.|-+|.|.|||.++.+|-+ ..++..+-. |..+.++. .-+| .+| -+.=|. T Consensus 37 sGsGKSTLl~ll~gl~~p~~G~i~i~g~~~~~~~~-~~~~~~i~~v~Q~~~lf~~ti-~eNi---LSG------GQkQRv 105 (173) T cd03246 37 SGSGKSTLARLILGLLRPTSGRVRLDGADISQWDP-NELGDHVGYLPQDDELFSGSI-AENI---LSG------GQRQRL 105 (173) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCH-HHHHCCEEEEECCCEECCCCH-HHHC---CCH------HHHHHH T ss_conf 99809999999966666799989999999332899-898420899908883677758-9976---769------999999 Q ss_pred HHHHHHHH Q ss_conf 99999976 Q gi|254780293|r 122 GVAKALTY 129 (170) Q Consensus 122 aIarAL~~ 129 (170) +|||||.. T Consensus 106 alARal~~ 113 (173) T cd03246 106 GLARALYG 113 (173) T ss_pred HHHHHHHC T ss_conf 99999827 No 48 >PRK11432 fbpC ferric transporter ATP-binding subunit; Provisional Probab=61.80 E-value=6.3 Score=19.88 Aligned_cols=80 Identities=23% Similarity=0.262 Sum_probs=46.4 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHH--------------H--HCCCHHHHHHHHHHHHHHCCCC---------- Q ss_conf 31112108999---99818973999410088--------------8--4469899999999998738566---------- Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDIS--------------K--YFTQDLLVLNIKRPFNTVSQDN---------- 99 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~--------------~--Yf~~~~~r~~i~~Pl~~~~~~~---------- 99 (170) .|-=||...|. ...|-+|.|.+||+++. + .||+..-...|..||.+.+... T Consensus 41 SG~GKTTlLr~iaGl~~p~~G~I~~~g~~v~~~~~~~R~i~~VfQ~~aLfPh~tV~eNi~~~l~~~~~~~~e~~~rv~e~ 120 (351) T PRK11432 41 SGCGKTTVLRLVAGLEKPTSGQIFIDGEDVTKRSIQQRDICMVFQSYALFPHMSLGENVGYGLKMLGVPKEERKQRVKEA 120 (351) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 96499999999976999883699999999999995458869994488767668099999779987599999999999999 Q ss_pred --CEEEEE--EEECCCCCHHHHHHHHHHHHHHHH Q ss_conf --410799--982598301888999999999976 Q gi|254780293|r 100 --MYDVFA--TVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 100 --~~Di~~--~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .+++.- .-.-.-+|| -+.=|.||||||+. T Consensus 121 l~~v~L~~~~~r~P~~LSG-Gq~QRValARAL~~ 153 (351) T PRK11432 121 LELVDLAGFEDRYVDQISG-GQQQRVALARALVL 153 (351) T ss_pred HHHCCCCCHHHCCHHHCCH-HHHHHHHHHHHHHC T ss_conf 9764996614589557899-89999999999844 No 49 >cd03213 ABCG_EPDR ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus. Probab=61.25 E-value=9.2 Score=18.90 Aligned_cols=65 Identities=22% Similarity=0.212 Sum_probs=34.6 Q ss_pred EEECCCEEEEEEE---E--CCCCEEEEECCCHHH---------------HCCCHHHHHHHHHHHHHHCCCCCEEEEEEEE Q ss_conf 3111210899999---8--189739994100888---------------4469899999999998738566410799982 Q gi|254780293|r 49 TGKRKTSIARVWI---K--SGSGKFTINYVDISK---------------YFTQDLLVLNIKRPFNTVSQDNMYDVFATVS 108 (170) Q Consensus 49 tGrRKtsvArV~l---~--~GsG~I~INgk~~~~---------------Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~ 108 (170) .|==||...++.. . +.+|.|.+||+++.. .|++..-++.+... +.++ T Consensus 44 nGaGKSTLl~~l~Gl~~~~~~~G~i~~~g~~~~~~~~~~~ig~v~Q~~~l~~~ltv~e~l~~~-------------a~l~ 110 (194) T cd03213 44 SGAGKSTLLNALAGRRTGLGVSGEVLINGRPLDKRSFRKIIGYVPQDDILHPTLTVRETLMFA-------------AKLR 110 (194) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEEEECCEECCHHHHHHEEEEECCCCCCCCCCCHHHHHHHH-------------HHHC T ss_conf 995199999998577778996289999999997578431289984665237768499999999-------------8726 Q ss_pred CCCCCHHHHHHHHHHHHHHHH Q ss_conf 598301888999999999976 Q gi|254780293|r 109 GGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 109 GGG~sgQa~AirlaIarAL~~ 129 (170) |+|| -+-=|++|||||+. T Consensus 111 --~LSg-GqrqRv~iA~aL~~ 128 (194) T cd03213 111 --GLSG-GERKRVSIALELVS 128 (194) T ss_pred --CCCH-HHHHHHHHHHHHHC T ss_conf --9888-99999999999963 No 50 >cd03369 ABCC_NFT1 Domain 2 of NFT1 (New full-length MRP-type transporter 1). NFT1 belongs to the MRP (mulrtidrug resisitance-associated protein) family of ABC transporters. Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions such as glutathione, glucuronate, and sulfate. Probab=61.12 E-value=4.3 Score=20.95 Aligned_cols=80 Identities=24% Similarity=0.302 Sum_probs=42.6 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHHCCCHHHHHHHHH----HHHHHCCC----CCEE------EE--EEE-E Q ss_conf 31112108999---9981897399941008884469899999999----99873856----6410------79--998-2 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKYFTQDLLVLNIKR----PFNTVSQD----NMYD------VF--ATV-S 108 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~----Pl~~~~~~----~~~D------i~--~~V-~ 108 (170) .|-=||..+++ .+.|-+|.|.|||+++.++ +...++..+-. |....++. ..++ +. ..+ . T Consensus 43 sGsGKSTLl~ll~g~~~p~~G~I~idg~di~~~-~~~~~r~~i~~v~Q~~~lf~~ti~~Nl~~~~~~~~~~i~~~l~~~~ 121 (207) T cd03369 43 TGAGKSTLILALFRFLEAEEGKIEIDGIDISTI-PLEDLRSSLTIIPQDPTLFSGTIRSNLDPFDEYSDEEIYGALRVSE 121 (207) T ss_pred CCCCHHHHHHHHHHHCCCCCCEEEECCEECCCC-CHHHHHHHCEEEECCCEECCCCHHHHCCCCCCCCHHHHHHHHHHCC T ss_conf 998799999999987288887899999995407-9999995153770356332754998738033358999999986047 Q ss_pred CCCCCHHHHHHHHHHHHHHHH Q ss_conf 598301888999999999976 Q gi|254780293|r 109 GGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 109 GGG~sgQa~AirlaIarAL~~ 129 (170) +|-.-+.-+.=|++|||||+. T Consensus 122 ~g~~LSgGqkQrl~iARal~~ 142 (207) T cd03369 122 GGLNLSQGQRQLLCLARALLK 142 (207) T ss_pred CCCCCCHHHHHHHHHHHHHHC T ss_conf 888869899999999999970 No 51 >COG2274 SunT ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms] Probab=59.60 E-value=14 Score=17.68 Aligned_cols=77 Identities=23% Similarity=0.302 Sum_probs=50.2 Q ss_pred EECCCEEEEEEE---ECCCCEEEEECCCHHHHCCCHHHH------------------------------HHHHHHHHHHC Q ss_conf 111210899999---818973999410088844698999------------------------------99999998738 Q gi|254780293|r 50 GKRKTSIARVWI---KSGSGKFTINYVDISKYFTQDLLV------------------------------LNIKRPFNTVS 96 (170) Q Consensus 50 GrRKtsvArV~l---~~GsG~I~INgk~~~~Yf~~~~~r------------------------------~~i~~Pl~~~~ 96 (170) |-=|+..++.-. .|-.|.|.+||.++.++ ....+| +.+.+-..+++ T Consensus 509 GsGKSTL~KLL~gly~p~~G~I~~dg~dl~~i-~~~~lR~~ig~V~Q~~~Lf~gSI~eNi~l~~p~~~~e~i~~A~~~ag 587 (709) T COG2274 509 GSGKSTLLKLLLGLYKPQQGRILLDGVDLNDI-DLASLRRQVGYVLQDPFLFSGSIRENIALGNPEATDEEIIEAAQLAG 587 (709) T ss_pred CCCHHHHHHHHHCCCCCCCCEEEECCEEHHHC-CHHHHHHHEEEECCCCHHHCCCHHHHHHCCCCCCCHHHHHHHHHHHC T ss_conf 99889999998367888885599998727866-99999865468746653204739879746899999799999999837 Q ss_pred C-------CCCEEEEEEEECCCCCH-HHHHHHHHHHHHHHH Q ss_conf 5-------66410799982598301-888999999999976 Q gi|254780293|r 97 Q-------DNMYDVFATVSGGGLSG-QASAICHGVAKALTY 129 (170) Q Consensus 97 ~-------~~~~Di~~~V~GGG~sg-Qa~AirlaIarAL~~ 129 (170) . -..||-.+.-.|.++|| |-+ |++|||||+. T Consensus 588 ~~~fI~~lP~gy~t~v~E~G~~LSGGQrQ--rlalARaLl~ 626 (709) T COG2274 588 AHEFIENLPMGYDTPVGEGGANLSGGQRQ--RLALARALLS 626 (709) T ss_pred CHHHHHHCCCCCCCCCCCCCCCCCHHHHH--HHHHHHHHCC T ss_conf 68999836054562320489888888999--9999998546 No 52 >TIGR03608 L_ocin_972_ABC putative bacteriocin export ABC transporter, lactococcin 972 group. A gene pair with a fairly wide distribution consists of a polypeptide related to the lactococcin 972 (see TIGR01653) and multiple-membrane-spanning putative immunity protein (see TIGR01654). This model represents a small clade within the ABC transporters that regularly are found adjacent to these bacteriocin system gene pairs and are likely serve as export proteins. Probab=58.77 E-value=7.6 Score=19.40 Aligned_cols=76 Identities=14% Similarity=0.167 Sum_probs=42.8 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHH----------------------CCCHHHHHHHHHHHHHHCCCC---- Q ss_conf 31112108999---9981897399941008884----------------------469899999999998738566---- Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKY----------------------FTQDLLVLNIKRPFNTVSQDN---- 99 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Y----------------------f~~~~~r~~i~~Pl~~~~~~~---- 99 (170) .|-=||..-+. ...|-+|.|.+||+++..+ |++..-.+.|..|+...+... T Consensus 33 SGsGKSTLL~~i~gl~~p~sG~i~~~g~~~~~~~~~~~~~~rr~~iG~VfQ~~~L~~~ltV~eNi~l~l~~~~~~~~~~~ 112 (206) T TIGR03608 33 SGSGKSTLLNIIGLLEKPDSGQVYLNGQETPPINSKKASKFRREKLGYLFQNFALIENETVEENLDLALKYSKLSKKEKR 112 (206) T ss_pred CCCCHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHHHHCCEEEECCCCCCCCCCCHHHHHHHHHHHCCCCHHHHH T ss_conf 99709999999975999897599999999998998899999865889985798767989199999999986599999999 Q ss_pred --------CE------EEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf --------41------0799982598301888999999999976 Q gi|254780293|r 100 --------MY------DVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 100 --------~~------Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .+ +-.+.--.|| +.=|.||||||+. T Consensus 113 ~~~~~~L~~vgl~~~~~~~p~~LSGG-----e~QRVAIARAL~~ 151 (206) T TIGR03608 113 EKKKEALEKVGLNLKLKQKIYELSGG-----EQQRVALARAILK 151 (206) T ss_pred HHHHHHHHHCCCCHHHHCCCHHCCHH-----HHHHHHHHHHHHC T ss_conf 99999999869905652992444869-----9999999999824 No 53 >COG1124 DppF ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=58.29 E-value=5.9 Score=20.06 Aligned_cols=81 Identities=21% Similarity=0.300 Sum_probs=49.1 Q ss_pred CCCCCEE----EEECCCEEEEE---EEECCCCEEEEECCCHHH-------------HC--------CCHHHHHHHHHHHH Q ss_conf 4355001----31112108999---998189739994100888-------------44--------69899999999998 Q gi|254780293|r 42 QWQRSYA----TGKRKTSIARV---WIKSGSGKFTINYVDISK-------------YF--------TQDLLVLNIKRPFN 93 (170) Q Consensus 42 ~~g~~~~----tGrRKtsvArV---~l~~GsG~I~INgk~~~~-------------Yf--------~~~~~r~~i~~Pl~ 93 (170) ..|..+| .|-=|+..||+ +.+|-+|.|.++|+++.. .| |+-.-...+.+|+. T Consensus 31 ~~Ge~lgivGeSGsGKSTL~r~l~Gl~~p~~G~I~~~G~~~~~~~~~~~~~~~VQmVFQDp~~SLnP~~tv~~~l~Epl~ 110 (252) T COG1124 31 ERGETLGIVGESGSGKSTLARLLAGLEKPSSGSILLDGKPLAPKKRAKAFYRPVQMVFQDPYSSLNPRRTVGRILSEPLR 110 (252) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHC T ss_conf 48978999848989888999999565678886289888405766530333045069951872246841019899742430 Q ss_pred H----------HCCCCCE-------EEEE-EEECCCCCHHHHHHHHHHHHHHH Q ss_conf 7----------3856641-------0799-98259830188899999999997 Q gi|254780293|r 94 T----------VSQDNMY-------DVFA-TVSGGGLSGQASAICHGVAKALT 128 (170) Q Consensus 94 ~----------~~~~~~~-------Di~~-~V~GGG~sgQa~AirlaIarAL~ 128 (170) . .+.+..+ |=+. ..+|| | -=|.||||||+ T Consensus 111 ~~~~~~~~~~i~~~L~~VgL~~~~l~R~P~eLSGG----Q--~QRiaIARAL~ 157 (252) T COG1124 111 PHGLSKSQQRIAELLDQVGLPPSFLDRRPHELSGG----Q--RQRIAIARALI 157 (252) T ss_pred CCCCCHHHHHHHHHHHHCCCCHHHHHCCCHHCCHH----H--HHHHHHHHHHC T ss_conf 37753789999999998499989985394212816----8--99999999863 No 54 >cd03247 ABCC_cytochrome_bd The CYD subfamily implicated in cytochrome bd biogenesis. The CydC and CydD proteins are important for the formation of cytochrome bd terminal oxidase of E. coli and it has been proposed that they were necessary for biosynthesis of the cytochrome bd quinol oxidase and for periplasmic c-type cytochromes. CydCD were proposed to determine a heterooligomeric complex important for heme export into the periplasm or to be involved in the maintenance of the proper redox state of the periplasmic space. In Bacillus subtilius, the absence of CydCD does not affect the presence of halo-cytochrome c in the membrane and this observation suggests that CydCD proteins are not involved in the export of heme in this organism. Probab=57.16 E-value=1.7 Score=23.46 Aligned_cols=72 Identities=21% Similarity=0.277 Sum_probs=41.2 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHHHCCCHHHHHHHHH----HHHHHCCCCCEEEEEEEECCCCCHHHHHHHH Q ss_conf 311121089999---981897399941008884469899999999----9987385664107999825983018889999 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISKYFTQDLLVLNIKR----PFNTVSQDNMYDVFATVSGGGLSGQASAICH 121 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~----Pl~~~~~~~~~Di~~~V~GGG~sgQa~Airl 121 (170) .|-=||+..+.. +.|-+|.|.|||.++.++- ..++..+-. |...-... .-++ |-.+|| -+.=|+ T Consensus 37 sGsGKSTLl~~l~G~~~p~~G~i~i~g~~i~~~~--~~~~~~i~~v~Q~~~lf~~ti-~~nl-----g~~LSg-GqkqRv 107 (178) T cd03247 37 SGSGKSTLLQLLTGDLKPQQGEITLDGVPVSDLE--KALSSLISVLNQRPYLFDTTL-RNNL-----GRRFSG-GERQRL 107 (178) T ss_pred CCCHHHHHHHHHHHCCCCCCCEEEECCEEHHHHH--HHHHHHEEEECCCCCCCCCHH-HHHC-----CCCCCH-HHHHHH T ss_conf 9875999999998617667886999999889978--999972089835563645419-9862-----888899-999999 Q ss_pred HHHHHHHH Q ss_conf 99999976 Q gi|254780293|r 122 GVAKALTY 129 (170) Q Consensus 122 aIarAL~~ 129 (170) +|||||.. T Consensus 108 ~iAral~~ 115 (178) T cd03247 108 ALARILLQ 115 (178) T ss_pred HHHHHHHH T ss_conf 99999964 No 55 >cd00267 ABC_ATPase ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=56.79 E-value=4 Score=21.14 Aligned_cols=62 Identities=29% Similarity=0.306 Sum_probs=38.2 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHH Q ss_conf 311121089999---98189739994100888446989999999999873856641079998259830188899999999 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAK 125 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIar 125 (170) .|-=||...++. +.|-+|.|.+||.++..+.. ...+..+-.-+. .+|| +-=|++||| T Consensus 34 nGaGKSTLl~~l~gl~~~~~G~i~~~g~~~~~~~~-~~~~~~i~~v~Q-------------LSgG------qkqrv~iA~ 93 (157) T cd00267 34 NGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPL-EELRRRIGYVPQ-------------LSGG------QRQRVALAR 93 (157) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCH-HHHHHCCCCCCC-------------CCHH------HHHHHHHHH T ss_conf 89998999999958847996289999999997999-999940608766-------------8869------999999999 Q ss_pred HHHHH Q ss_conf 99765 Q gi|254780293|r 126 ALTYF 130 (170) Q Consensus 126 AL~~~ 130 (170) ||+.- T Consensus 94 al~~~ 98 (157) T cd00267 94 ALLLN 98 (157) T ss_pred HHHCC T ss_conf 99709 No 56 >cd03228 ABCC_MRP_Like The MRP (Mutidrug Resistance Protein)-like transporters are involved in drug, peptide, and lipid export. They belong to the subfamily C of the ATP-binding cassette (ABC) superfamily of transport proteins. The ABCC subfamily contains transporters with a diverse functional spectrum that includes ion transport, cell surface receptor, and toxin secretion activities. The MRP-like family, simlar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains, each composed of six transmembrane (TM) helices, and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=56.75 E-value=1.8 Score=23.32 Aligned_cols=70 Identities=26% Similarity=0.340 Sum_probs=40.0 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHHHCCCHHHHHHHH----HHHHHHCCCCCEEEEEEEECCCCCHHHHHHHH Q ss_conf 311121089999---98189739994100888446989999999----99987385664107999825983018889999 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISKYFTQDLLVLNIK----RPFNTVSQDNMYDVFATVSGGGLSGQASAICH 121 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~Yf~~~~~r~~i~----~Pl~~~~~~~~~Di~~~V~GGG~sgQa~Airl 121 (170) +|-=||+..++. +.|-+|.|.|||.++..+-+ ..++..+- .|..+.++. .-++ .+|| +-=|+ T Consensus 37 sGsGKSTLl~ll~gl~~p~~G~I~i~g~~i~~~~~-~~~~~~i~~v~Q~~~lf~~tv-~eNi---LSgG------q~Qri 105 (171) T cd03228 37 SGSGKSTLLKLLLRLYDPTSGEILIDGVDLRDLDL-ESLRKNIAYVPQDPFLFSGTI-RENI---LSGG------QRQRI 105 (171) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCH-HHHHHHEEEECCCCCCCCCCH-HHHC---CCHH------HHHHH T ss_conf 99839999999976775897489999999885998-998631899966684375779-9977---4488------99999 Q ss_pred HHHHHHHH Q ss_conf 99999976 Q gi|254780293|r 122 GVAKALTY 129 (170) Q Consensus 122 aIarAL~~ 129 (170) +|||||+. T Consensus 106 ~lARal~~ 113 (171) T cd03228 106 AIARALLR 113 (171) T ss_pred HHHHHHHH T ss_conf 99999974 No 57 >COG4525 TauB ABC-type taurine transport system, ATPase component [Inorganic ion transport and metabolism] Probab=56.72 E-value=14 Score=17.70 Aligned_cols=76 Identities=18% Similarity=0.236 Sum_probs=42.9 Q ss_pred EEECCCEEE---EEEEECCCCEEEEECCCHH-------HHCCCHH------------------------HHHHHHHHHHH Q ss_conf 311121089---9999818973999410088-------8446989------------------------99999999987 Q gi|254780293|r 49 TGKRKTSIA---RVWIKSGSGKFTINYVDIS-------KYFTQDL------------------------LVLNIKRPFNT 94 (170) Q Consensus 49 tGrRKtsvA---rV~l~~GsG~I~INgk~~~-------~Yf~~~~------------------------~r~~i~~Pl~~ 94 (170) .|--||+.- --++.|-.|.|.+||++++ ..|+++. -+..+.+-|.+ T Consensus 40 SGcGKTTLLnl~AGf~~P~~G~i~l~~r~i~gPgaergvVFQ~~~LlPWl~~~dNvafgL~l~Gi~k~~R~~~a~q~l~~ 119 (259) T COG4525 40 SGCGKTTLLNLIAGFVTPSRGSIQLNGRRIEGPGAERGVVFQNEALLPWLNVIDNVAFGLQLRGIEKAQRREIAHQMLAL 119 (259) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 88657889999862758566648889986568874323473267633046778888877776488788999999999998 Q ss_pred HCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 38566410799982598301888999999999976 Q gi|254780293|r 95 VSQDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 95 ~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) +++.+--+-++---.||+- -|.||||||.. T Consensus 120 VgL~~~~~~~i~qLSGGmr-----QRvGiARALa~ 149 (259) T COG4525 120 VGLEGAEHKYIWQLSGGMR-----QRVGIARALAV 149 (259) T ss_pred HCCCCCCCCCEEEECCHHH-----HHHHHHHHHHC T ss_conf 3734324322575065088-----87778887604 No 58 >TIGR02982 heterocyst_DevA ABC exporter ATP-binding subunit, DevA family; InterPro: IPR014324 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP-binding subunit DevA, found mostly in the Cyanobacteria, but also in the Planctomycetes. Cyanobacterial examples are involved in heterocyst formation, by which some fraction of members of the colony undergo a developmental change and become capable of nitrogen fixation. The ABC transporter encoded by the devBCA operon is induced by nitrogen deficiency and is necessary for the formation of the laminated layer which envelops heterocysts , . It is thought to be involved in the export of either the heterocyst-specific glycolipids found in the laminated layer, or an enzyme essential for their formation.. Probab=56.51 E-value=16 Score=17.42 Aligned_cols=53 Identities=13% Similarity=0.183 Sum_probs=38.4 Q ss_pred CCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 0088844698999999999987385664107999825983018889999999999765 Q gi|254780293|r 73 VDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 73 k~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) .++...|.....+....+-|..+++..++|-+..==.||. -+ |-||||||+.. T Consensus 107 ~eL~~~~~~~~~~~~a~~~L~~VGL~~~~~y~P~~LSGGQ---KQ--RVAIARALv~~ 159 (220) T TIGR02982 107 LELQPNLSAQEAREKARAMLEAVGLGDRLDYYPENLSGGQ---KQ--RVAIARALVAR 159 (220) T ss_pred HHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHCCCCCCCCC---HH--HHHHHHHHHCC T ss_conf 9887611688999999999986060125540524367861---68--99999997338 No 59 >TIGR01989 COQ6 Ubiquinone biosynthesis mono0xygenase COQ6; InterPro: IPR000689 Ubiquinone (Q) functions as an electron carrier in the respiratory chain in mitochondria. Q biosynthesis involves a series of enzymatic steps, which are catalysed by the enzymes COQ1-COQ8 in Saccharomyces cerevisiae. COQ6, or ubiquinone biosynthesis monooxygenase, is a flavin-dependent enzyme localised to the matrix side of the inner mitochondrial membrane that is required for one or more steps in Q biosynthesis , .; GO: 0004497 monooxygenase activity. Probab=56.34 E-value=8.3 Score=19.15 Aligned_cols=25 Identities=44% Similarity=0.615 Sum_probs=18.9 Q ss_pred EEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 107999825983018889999999999 Q gi|254780293|r 101 YDVFATVSGGGLSGQASAICHGVAKAL 127 (170) Q Consensus 101 ~Di~~~V~GGG~sgQa~AirlaIarAL 127 (170) |||.| -|||+.|-|=|+.|+-...+ T Consensus 1 ~DvvI--vGGG~VG~alAaaL~~~~~~ 25 (481) T TIGR01989 1 FDVVI--VGGGLVGLALAAALGNNPLL 25 (481) T ss_pred CCEEE--ECCCHHHHHHHHHHHCCCCC T ss_conf 92898--88857899999997318732 No 60 >TIGR02673 FtsE cell division ATP-binding protein FtsE; InterPro: IPR005286 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). FtsE is an ABC transporter ATP-binding protein. This protein and its permease partner FtsX, localize to the cell division site. In a number of species, the ftsEX gene pair is located next to ftsY, which encodes the signal recognition particle-docki ng protein.; GO: 0005524 ATP binding, 0051301 cell division. Probab=54.69 E-value=13 Score=17.99 Aligned_cols=74 Identities=16% Similarity=0.249 Sum_probs=45.3 Q ss_pred CCCEEEEE---EEECCCCEEEEECCCHHHHC---------------------------------------CCHHHHHHHH Q ss_conf 12108999---99818973999410088844---------------------------------------6989999999 Q gi|254780293|r 52 RKTSIARV---WIKSGSGKFTINYVDISKYF---------------------------------------TQDLLVLNIK 89 (170) Q Consensus 52 RKtsvArV---~l~~GsG~I~INgk~~~~Yf---------------------------------------~~~~~r~~i~ 89 (170) =|||.=+- .+.|-.|.|.|||.+++.+- +....+..|. T Consensus 40 GKttLLKLl~~~~~P~~G~v~~~G~~~~~l~~~~~P~LRR~iGvvFQDf~LL~~rTv~eNVAl~L~V~G~~~~~I~~rV~ 119 (215) T TIGR02673 40 GKTTLLKLLYGALTPSRGQVRVAGEDVSRLRGRQLPLLRRRIGVVFQDFRLLPDRTVYENVALPLEVRGKKKREIQRRVE 119 (215) T ss_pred CHHHHHHHHHHHCCCCCCEEEECCEECCCCCCCCCHHHHCCCCEEEECCCCCCCCCHHHHCCHHHHCCCCCCHHHHHHHH T ss_conf 17899999985269875808888740466775643122131543784221101166134112101113888033678999 Q ss_pred HHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99987385664107999825983018889999999999765 Q gi|254780293|r 90 RPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 90 ~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) .-|..+|+..|.|.+...=.||=- =|-||||||+.- T Consensus 120 ~~L~~vGL~~K~~~~P~~LSGGEQ-----QRvaIARAiv~~ 155 (215) T TIGR02673 120 AALRLVGLEHKADAFPEQLSGGEQ-----QRVAIARAIVNS 155 (215) T ss_pred HHHHHCCCHHHHHCCCCCCCCCHH-----HHHHHHHHHHCC T ss_conf 999852863254257210047257-----888887653048 No 61 >pfam00016 RuBisCO_large Ribulose bisphosphate carboxylase large chain, catalytic domain. The C-terminal domain of RuBisCO large chain is the catalytic domain adopting a TIM barrel fold. Probab=54.68 E-value=17 Score=17.18 Aligned_cols=36 Identities=31% Similarity=0.452 Sum_probs=20.5 Q ss_pred EEEEEEECCCCCHH-------HHHHHHHH--------------------HHHHHHHCHHHHHHH Q ss_conf 07999825983018-------88999999--------------------999976576344879 Q gi|254780293|r 102 DVFATVSGGGLSGQ-------ASAICHGV--------------------AKALTYFQPDLRPQI 138 (170) Q Consensus 102 Di~~~V~GGG~sgQ-------a~AirlaI--------------------arAL~~~~p~~r~~L 138 (170) |+.... |||+.|- +-|+|.|+ .+++.+|.|+++..| T Consensus 244 D~il~a-GGGi~GHP~G~aAGa~A~RqA~eA~~~g~~~~~~yA~eh~El~~a~a~~~~~l~~al 306 (309) T pfam00016 244 DVVLQF-GGGTLGHPDGNAAGATANRVALEACVQARNEGRDYAREGPEILRAAAKFSGELAAAL 306 (309) T ss_pred EEEEEC-CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHCHHHHHHHHHCCHHHHHHH T ss_conf 079976-986217998617889999999999998355068898757899999984198999987 No 62 >TIGR00416 sms DNA repair protein RadA; InterPro: IPR004504 RadA/Sms is a highly conserved eubacterial protein that shares sequence similarity with both RecA strand transferase and lon protease. The RadA/Sms family are probable ATP-dependent proteases involved in both DNA repair and degradation of proteins, peptides, glycopeptides. They are classified in as non-peptidase homologues and unassigned peptidases in MEROPS peptidase family S16 (lon protease family, clan SJ). RadA/Sms is involved in recombination and recombinational repair, most likely involving the stabilisation or processing of branched DNA molecules or blocked replication forks because of its genetic redundancy with RecG and RuvABC .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0006281 DNA repair. Probab=51.85 E-value=19 Score=16.89 Aligned_cols=79 Identities=24% Similarity=0.263 Sum_probs=48.2 Q ss_pred HHHHHH-HHHHHHHH-C-CCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCHHH--HH---HHHHCCC---C----CC Q ss_conf 899999-99999873-8-56641079998259830188899999999997657634--48---7997798---3----35 Q gi|254780293|r 82 DLLVLN-IKRPFNTV-S-QDNMYDVFATVSGGGLSGQASAICHGVAKALTYFQPDL--RP---QIKKGGF---L----TR 146 (170) Q Consensus 82 ~~~r~~-i~~Pl~~~-~-~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~~p~~--r~---~Lk~~gl---L----t~ 146 (170) +.+|.. +++-|.-- + .+.+.|++++|.|| +-=.=-|+=||+.=||+.-.-+- .+ .|=.=|| . .- T Consensus 360 d~NRL~~LlAvLek~~Gl~l~~~Dvf~~V~GG-vkv~Epa~DLA~~~a~~SSFrdr~~~~~~~~lGEVGL~G~ir~vp~~ 438 (481) T TIGR00416 360 DQNRLALLLAVLEKRLGLPLADQDVFLNVAGG-VKVSEPAVDLALLIALVSSFRDRPLDPDLVILGEVGLAGEIRPVPSL 438 (481) T ss_pred HHHHHHHHHHHHHHHCCCCHHHCCEEEEEECC-EEECCHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCEEEECCCH T ss_conf 25689999999876406611117347986235-05410578899999999875178878563888764377704007763 Q ss_pred CCCCCCCCCCCCCCC Q ss_conf 755416759528984 Q gi|254780293|r 147 DSRIVERKKYGKAKA 161 (170) Q Consensus 147 D~R~~ErKK~G~~kA 161 (170) ..|.+|--|+|-++| T Consensus 439 ~~R~kEaak~GFkra 453 (481) T TIGR00416 439 EERLKEAAKLGFKRA 453 (481) T ss_pred HHHHHHHHHCCCCEE T ss_conf 167999984686344 No 63 >PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed Probab=51.70 E-value=19 Score=16.95 Aligned_cols=33 Identities=18% Similarity=0.220 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHH Q ss_conf 999999999987385664107999825983018 Q gi|254780293|r 83 LLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQ 115 (170) Q Consensus 83 ~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQ 115 (170) ..-..|..-|...+..+.+|+.+-++|||-..- T Consensus 176 ~A~~~I~~ai~~~~~~~~~DvIIi~RGGGS~eD 208 (443) T PRK00286 176 GAAASIVEAIERANARGEVDVLIVARGGGSLED 208 (443) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHH T ss_conf 479999999999852248888999368788889 No 64 >TIGR01842 type_I_sec_PrtD type I secretion system ATPase; InterPro: IPR010128 Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N terminus, but rather carry signals located toward the extreme C terminus to direct type I secretion.; GO: 0005524 ATP binding, 0015031 protein transport, 0016021 integral to membrane. Probab=51.21 E-value=14 Score=17.84 Aligned_cols=77 Identities=25% Similarity=0.368 Sum_probs=53.6 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCCHHHH-----------------------------CCCHHHHHHHHHHHHHHC Q ss_conf 3111210899999---81897399941008884-----------------------------469899999999998738 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVDISKY-----------------------------FTQDLLVLNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~~~~Y-----------------------------f~~~~~r~~i~~Pl~~~~ 96 (170) .|==|+|-||+-+ .+-+|++...|-++++| |.....-++|.+-=.+++ T Consensus 365 SgSGKStLaR~~vG~W~~~~G~VRLDGadl~qWD~e~lG~~iGYLPQdvELF~GTva~NIARF~en~d~~~iieAAklAG 444 (556) T TIGR01842 365 SGSGKSTLARILVGIWPPASGSVRLDGADLKQWDRETLGKHIGYLPQDVELFSGTVAENIARFGENADPEKIIEAAKLAG 444 (556) T ss_pred CCCCHHHHHHHHHHCCCCCCCCEEEEHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHHC T ss_conf 86525889878872101356533640334402375365880154798505076767640244688788789999997603 Q ss_pred -------CCCCEEEEEEEECCCCCH-HHHHHHHHHHHHH Q ss_conf -------566410799982598301-8889999999999 Q gi|254780293|r 97 -------QDNMYDVFATVSGGGLSG-QASAICHGVAKAL 127 (170) Q Consensus 97 -------~~~~~Di~~~V~GGG~sg-Qa~AirlaIarAL 127 (170) ...-||-.+-..|.++|| |-+ |.|+|||| T Consensus 445 vHElIl~lP~GYDT~iG~~G~~LSGGQRQ--RIaLARAl 481 (556) T TIGR01842 445 VHELILRLPDGYDTDIGEGGATLSGGQRQ--RIALARAL 481 (556) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCHHHH--HHHHHHHH T ss_conf 03575169688544313777778614689--99999987 No 65 >TIGR03265 PhnT2 putative 2-aminoethylphosphonate ABC transport system, ATP-binding protein component. This ABC transporter ATP-binding protein is found in a number of genomes in operon-like contexts strongly suggesting a substrate specificity for 2-aminoethylphosphonate (2-AEP). The characterized PhnSTUV system is absent in the genomes in which this system is found. These genomes encode systems for the catabolism of 2-AEP, making the need for a 2-AEP-specific transporter likely. Probab=51.06 E-value=12 Score=18.20 Aligned_cols=48 Identities=21% Similarity=0.238 Sum_probs=33.1 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHH--------------H--HCCCHHHHHHHHHHHHHHC Q ss_conf 311121089999---9818973999410088--------------8--4469899999999998738 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDIS--------------K--YFTQDLLVLNIKRPFNTVS 96 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~--------------~--Yf~~~~~r~~i~~Pl~~~~ 96 (170) .|-=||..-|.. ..|-+|.|.+||+++. + .||+..-.+.|..||...+ T Consensus 39 SG~GKtTlLr~iaGl~~p~~G~I~~~g~~v~~~~p~~R~ig~VfQ~~aLfPh~tV~eNiafgl~~~~ 105 (353) T TIGR03265 39 SGCGKTTLLRIIAGLERQTAGTIYQGGRDITRLPPQKRDYGIVFQSYALFPNLTVADNIAYGLKNRG 105 (353) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCC T ss_conf 9535999999997699998739999999999999525885999788854678929999988998769 No 66 >PRK06996 hypothetical protein; Provisional Probab=50.60 E-value=12 Score=18.20 Aligned_cols=23 Identities=22% Similarity=0.299 Sum_probs=10.2 Q ss_pred CCCEEEEEEEE--CCCCEEEEECCC Q ss_conf 12108999998--189739994100 Q gi|254780293|r 52 RKTSIARVWIK--SGSGKFTINYVD 74 (170) Q Consensus 52 RKtsvArV~l~--~GsG~I~INgk~ 74 (170) +.+.+-++.+. ...|.+.++..+ T Consensus 78 ~~~~i~~~~v~~~~~~g~~~~~~~~ 102 (397) T PRK06996 78 DATPIEHIHVSQRGHFGRTLIDRDD 102 (397) T ss_pred CCCCEEEEEEEECCCCCCEECCHHH T ss_conf 6874259999405776621126544 No 67 >TIGR02315 ABC_phnC phosphonate ABC transporter, ATP-binding protein; InterPro: IPR012693 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Phosphonates are a class of phosphorus-containing organic compound with a stable direct C-P bond rather than a C-O-P linkage. A number of bacterial species have operons, typically about 14 genes in size, with genes for ATP-dependent transport of phosphonates, degradation, and regulation of the expression of the system. Members of this protein family are the ATP-binding cassette component of tripartite ABC transporters of phosphonates.; GO: 0005524 ATP binding, 0015416 phosphonate transmembrane-transporting ATPase activity, 0015716 phosphonate transport, 0016020 membrane. Probab=50.24 E-value=10 Score=18.54 Aligned_cols=49 Identities=18% Similarity=0.083 Sum_probs=37.9 Q ss_pred HHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 8884469899999999998738566410799982598301888999999999976 Q gi|254780293|r 75 ISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 75 ~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) +-..|+ +..++..++-|.-+|....+.-.|.-=-||.- | |-||||||+. T Consensus 124 ~~~~F~-~~dk~~Al~~L~rVGl~~~A~~RaD~LSGGQQ-Q----RVaIARAL~Q 172 (253) T TIGR02315 124 LLGRFS-EEDKERALSLLERVGLADKAYQRADQLSGGQQ-Q----RVAIARALAQ 172 (253) T ss_pred HHCCCC-HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCH-H----HHHHHHHHCC T ss_conf 632589-98999999876523878898652132158511-6----8999986368 No 68 >COG3840 ThiQ ABC-type thiamine transport system, ATPase component [Coenzyme metabolism] Probab=49.84 E-value=21 Score=16.70 Aligned_cols=17 Identities=35% Similarity=0.448 Sum_probs=13.9 Q ss_pred EEECCCCEEEEECCCHH Q ss_conf 99818973999410088 Q gi|254780293|r 60 WIKSGSGKFTINYVDIS 76 (170) Q Consensus 60 ~l~~GsG~I~INgk~~~ 76 (170) +..|.+|.|.|||.+.. T Consensus 48 F~~P~~G~i~i~g~d~t 64 (231) T COG3840 48 FETPASGEILINGVDHT 64 (231) T ss_pred CCCCCCCEEEECCEECC T ss_conf 24778745898572147 No 69 >PRK05446 imidazole glycerol-phosphate dehydratase/histidinol phosphatase; Provisional Probab=46.88 E-value=23 Score=16.42 Aligned_cols=44 Identities=11% Similarity=0.143 Sum_probs=30.7 Q ss_pred HHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99999987385664107999825983018889999999999765 Q gi|254780293|r 87 NIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 87 ~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) .+..=|.-.-......+++.|.|.-..-.++|+--|+||||-.. T Consensus 296 l~~~Ff~s~a~~~~~tlHi~~~G~n~HH~~Ea~FKa~a~Al~~A 339 (354) T PRK05446 296 MVEHFFRSLSDAMGCTLHLKTKGKNDHHKVESLFKAFGRALRQA 339 (354) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHH T ss_conf 79999999996078889998413870889999999999999998 No 70 >cd03215 ABC_Carb_Monos_II This family represents domain II of the carbohydrate uptake proteins that transport only monosaccharides (Monos). The Carb_Monos family is involved in the uptake of monosaccharides, such as pentoses (such as xylose, arabinose, and ribose) and hexoses (such as xylose, arabinose, and ribose), that cannot be broken down to simple sugars by hydrolysis. In members of Carb_Monos family the single hydrophobic gene product forms a homodimer, while the ABC protein represents a fusion of two nucleotide-binding domains. However, it is assumed that two copies of the ABC domains are present in the assembled transporter. Probab=44.78 E-value=1.6 Score=23.58 Aligned_cols=39 Identities=15% Similarity=0.201 Sum_probs=27.6 Q ss_pred CCCCCEE----EEECCCEEEEEE---EECCCCEEEEECCCHHHHCC Q ss_conf 4355001----311121089999---98189739994100888446 Q gi|254780293|r 42 QWQRSYA----TGKRKTSIARVW---IKSGSGKFTINYVDISKYFT 80 (170) Q Consensus 42 ~~g~~~~----tGrRKtsvArV~---l~~GsG~I~INgk~~~~Yf~ 80 (170) ..|...+ .|==||...++. +.|-+|.|.+||+++..+-+ T Consensus 24 ~~GEi~~liG~nGaGKSTll~~l~G~~~p~~G~I~~~G~~~~~~~~ 69 (182) T cd03215 24 RAGEIVGIAGLVGNGQTELAEALFGLRPPASGEITLDGKPVTRRSP 69 (182) T ss_pred CCCCEEEEECCCCCCCCHHHHHHCCCCCCCCCEEEECCEECCCCCH T ss_conf 5996999988899992637787669867887759999999886499 No 71 >COG3839 MalK ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism] Probab=44.54 E-value=21 Score=16.63 Aligned_cols=76 Identities=20% Similarity=0.236 Sum_probs=43.7 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHH----------------HCCCHHHHHHHHHHHHHHCCC----------- Q ss_conf 31112108999---998189739994100888----------------446989999999999873856----------- Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISK----------------YFTQDLLVLNIKRPFNTVSQD----------- 98 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~----------------Yf~~~~~r~~i~~Pl~~~~~~----------- 98 (170) .|-=||..=|. ...+-+|.|.|+|++.++ .||+.+-+..+..||...+.. T Consensus 38 SGcGKSTlLr~IAGLe~p~~G~I~i~g~~vt~l~P~~R~iamVFQ~yALyPhMtV~~Niaf~Lk~~~~~k~ei~~rV~ev 117 (338) T COG3839 38 SGCGKSTLLRMIAGLEEPTSGEILIDGRDVTDLPPEKRGIAMVFQNYALYPHMTVYENIAFGLKLRGVPKAEIDKRVKEV 117 (338) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 98888999999968877887159999999998995578889993783015787699973416644799568889999999 Q ss_pred -------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf -------6410799982598301888999999999976 Q gi|254780293|r 99 -------NMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 99 -------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .-.|=...--.|| |- =|.||+|||+. T Consensus 118 a~~L~l~~lL~r~P~~LSGG---Qr--QRVAlaRAlVr 150 (338) T COG3839 118 AKLLGLEHLLNRKPLQLSGG---QR--QRVALARALVR 150 (338) T ss_pred HHHCCCHHHHHCCCCCCCHH---HH--HHHHHHHHHHC T ss_conf 98739866773590117725---67--89999877754 No 72 >cd03223 ABCD_peroxisomal_ALDP Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asympt Probab=44.10 E-value=12 Score=18.09 Aligned_cols=64 Identities=23% Similarity=0.328 Sum_probs=38.3 Q ss_pred EEECCCEEEEEEE---ECCCCEEEEECCC------HHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHH Q ss_conf 3111210899999---8189739994100------888446989999999999873856641079998259830188899 Q gi|254780293|r 49 TGKRKTSIARVWI---KSGSGKFTINYVD------ISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAI 119 (170) Q Consensus 49 tGrRKtsvArV~l---~~GsG~I~INgk~------~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~Ai 119 (170) .|-=||...++-. .+.+|.|.+++.. -+.|++.-..++.+..|+. -..+|| +-- T Consensus 36 sGsGKSTLl~~l~Gl~~~~~G~i~~~~~~~i~~v~Q~~~l~~~tl~e~l~~p~~-----------~~LSGG------qkQ 98 (166) T cd03223 36 SGTGKSSLFRALAGLWPWGSGRIGMPEGEDLLFLPQRPYLPLGTLREQLIYPWD-----------DVLSGG------EQQ 98 (166) T ss_pred CCCCHHHHHHHHCCCCCCCCCEEEECCCCCEEEECCCCCCCCCCHHHHHHCCCC-----------CCCCHH------HHH T ss_conf 999889999998698769986799769987999856466588759999636154-----------678999------999 Q ss_pred HHHHHHHHHH Q ss_conf 9999999976 Q gi|254780293|r 120 CHGVAKALTY 129 (170) Q Consensus 120 rlaIarAL~~ 129 (170) |++|||||+. T Consensus 99 RvalARal~~ 108 (166) T cd03223 99 RLAFARLLLH 108 (166) T ss_pred HHHHHHHHHC T ss_conf 9999999964 No 73 >TIGR02023 BchP-ChlP geranylgeranyl reductase; InterPro: IPR010253 This entry represents a group of geranylgeranyl reductases specific for the biosyntheses of bacteriochlorophyll and chlorophyll . It is unclear whether the processes of isoprenoid ligation to the chlorin ring and reduction of the geranylgeranyl chain to a phytyl chain are necessarily ordered the same way in all species (see introduction to ).; GO: 0045550 geranylgeranyl reductase activity, 0015979 photosynthesis, 0015995 chlorophyll biosynthetic process. Probab=40.47 E-value=23 Score=16.44 Aligned_cols=15 Identities=47% Similarity=0.570 Sum_probs=6.8 Q ss_pred ECCCCCHHHHHHHHH Q ss_conf 259830188899999 Q gi|254780293|r 108 SGGGLSGQASAICHG 122 (170) Q Consensus 108 ~GGG~sgQa~Airla 122 (170) -|||.||=+-|.-|| T Consensus 6 iGGGPsGA~AAe~LA 20 (408) T TIGR02023 6 IGGGPSGAAAAETLA 20 (408) T ss_pred EECCCCHHHHHHHHH T ss_conf 816850689999998 No 74 >COG1561 Uncharacterized stress-induced protein [Function unknown] Probab=40.21 E-value=29 Score=15.78 Aligned_cols=74 Identities=14% Similarity=0.028 Sum_probs=37.4 Q ss_pred CEEEEEEEECCCCEEE-----EECCCHHHHCCCHHHHHHHHHHHHH----HC-CCCCEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 1089999981897399-----9410088844698999999999987----38-566410799982598301888999999 Q gi|254780293|r 54 TSIARVWIKSGSGKFT-----INYVDISKYFTQDLLVLNIKRPFNT----VS-QDNMYDVFATVSGGGLSGQASAICHGV 123 (170) Q Consensus 54 tsvArV~l~~GsG~I~-----INgk~~~~Yf~~~~~r~~i~~Pl~~----~~-~~~~~Di~~~V~GGG~sgQa~AirlaI 123 (170) |..||+-..-|.+.+. ||++-++.+|.-|... .-++|..- .. .-|++++++++.+--.+..-=.|--.. T Consensus 6 Tgfar~e~~~~~~~~~~ElrSVN~r~Le~~~RLP~~l-~~lE~~lR~~i~~~isRGkie~~l~~~~~~~~~~~l~in~~l 84 (290) T COG1561 6 TGFARAEKEGGWYSAAVELRSVNGRYLETRFRLPEQL-RGLEPALRELISSRISRGKVECTLRIEREESAQKELEINEDL 84 (290) T ss_pred CHHHHCCCCCCCEEEEEEEEECCCCCHHHHHCCCHHH-HHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCEEECHHH T ss_conf 0231134457871799998604554126763487766-775199999999981477269999960012456651248999 Q ss_pred HHHHH Q ss_conf 99997 Q gi|254780293|r 124 AKALT 128 (170) Q Consensus 124 arAL~ 128 (170) +++++ T Consensus 85 l~~~~ 89 (290) T COG1561 85 LKQLV 89 (290) T ss_pred HHHHH T ss_conf 99999 No 75 >TIGR03378 glycerol3P_GlpB glycerol-3-phosphate dehydrogenase, anaerobic, B subunit. Members of this protein family are the B subunit, product of the glpB gene, of a three-subunit, membrane-anchored, FAD-dependent anaerobic glycerol-3-phosphate dehydrogenase. Probab=39.01 E-value=23 Score=16.48 Aligned_cols=14 Identities=0% Similarity=0.029 Sum_probs=6.5 Q ss_pred EEEEEC-CCHHHHCC Q ss_conf 399941-00888446 Q gi|254780293|r 67 KFTINY-VDISKYFT 80 (170) Q Consensus 67 ~I~INg-k~~~~Yf~ 80 (170) +|.|=| .-+.||.| T Consensus 140 ~i~vvgieG~~DF~p 154 (419) T TIGR03378 140 RILLVGIEGFRDFQP 154 (419) T ss_pred CEEEEECCCCCCCCH T ss_conf 369996564578898 No 76 >cd03216 ABC_Carb_Monos_I This family represents the domain I of the carbohydrate uptake proteins that transport only monosaccharides (Monos). The Carb_Monos family is involved in the uptake of monosaccharides, such as pentoses (such as xylose, arabinose, and ribose) and hexoses (such as xylose, arabinose, and ribose), that cannot be broken down to simple sugars by hydrolysis. Pentoses include xylose, arabinose, and ribose. Important hexoses include glucose, galactose, and fructose. In members of the Carb_monos family, the single hydrophobic gene product forms a homodimer while the ABC protein represents a fusion of two nucleotide-binding domains. However, it is assumed that two copies of the ABC domains are present in the assembled transporter. Probab=38.79 E-value=7.2 Score=19.53 Aligned_cols=62 Identities=19% Similarity=0.233 Sum_probs=34.3 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHH Q ss_conf 31112108999---998189739994100888446989999999999873856641079998259830188899999999 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAK 125 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIar 125 (170) .|-=||...+. .+.|-+|.|.+||+++..+-+....+.-+ ...-..+| -+-=|++||| T Consensus 35 nGaGKSTl~~~i~Gl~~p~~G~i~i~G~~i~~~~~~~~~~~gi-------------~~v~qLSg------G~~Qrv~iar 95 (163) T cd03216 35 NGAGKSTLMKILSGLYKPDSGEILVDGKEVSFASPRDARRAGI-------------AMVYQLSV------GERQMVEIAR 95 (163) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHCCC-------------CEECCCCH------HHHHHHHHHH T ss_conf 9989999999995776898578999999999999999998799-------------48946998------9999999999 Q ss_pred HHHH Q ss_conf 9976 Q gi|254780293|r 126 ALTY 129 (170) Q Consensus 126 AL~~ 129 (170) ||.. T Consensus 96 al~~ 99 (163) T cd03216 96 ALAR 99 (163) T ss_pred HHHH T ss_conf 9972 No 77 >COG1850 RbcL Ribulose 1,5-bisphosphate carboxylase, large subunit [Carbohydrate transport and metabolism] Probab=38.75 E-value=31 Score=15.64 Aligned_cols=40 Identities=30% Similarity=0.613 Sum_probs=25.0 Q ss_pred EEEEEEECCCCCHH-------HHHHHHHHHHHHHH--------HCHHHHHHHHHCC Q ss_conf 07999825983018-------88999999999976--------5763448799779 Q gi|254780293|r 102 DVFATVSGGGLSGQ-------ASAICHGVAKALTY--------FQPDLRPQIKKGG 142 (170) Q Consensus 102 Di~~~V~GGG~sgQ-------a~AirlaIarAL~~--------~~p~~r~~Lk~~g 142 (170) |+.+.. |||+.|- +.|+|.||--|+.. .+|+|...|.+-| T Consensus 370 D~viq~-GGGv~gHP~G~~aGarA~RqAieA~v~g~~L~e~Ak~~~EL~~ALekwg 424 (429) T COG1850 370 DLVIQA-GGGVHGHPDGPAAGARAVRQAIEAAVEGIPLREYAKWHPELAAALEKWG 424 (429) T ss_pred HHEEEC-CCCEECCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHCHHHHHHHHHHH T ss_conf 316743-8834248988612389999999999718878999753999999999873 No 78 >TIGR00020 prfB peptide chain release factor 2; InterPro: IPR004374 In many but not all taxa, there is a conserved real translational frameshift at a TGA codon. RF-2 helps terminate translation at TGA codons and can therefore regulate its own production by readthrough when RF-2 is insufficient. There is a superfamily IPR000352 from INTERPRO of RF-1, RF-2, mitochondrial, RF-H, etc proteins.; GO: 0016149 translation release factor activity codon specific, 0006415 translational termination, 0005737 cytoplasm. Probab=38.44 E-value=17 Score=17.23 Aligned_cols=28 Identities=25% Similarity=0.247 Sum_probs=20.3 Q ss_pred EEECCCEEEEEEEECCC----CEEEEECCCHH Q ss_conf 31112108999998189----73999410088 Q gi|254780293|r 49 TGKRKTSIARVWIKSGS----GKFTINYVDIS 76 (170) Q Consensus 49 tGrRKtsvArV~l~~Gs----G~I~INgk~~~ 76 (170) -|||-||.|.|.+.|-- =.|.|+-.|+- T Consensus 216 ~~rRHTSFasv~V~P~~Dd~tI~IeI~~~D~r 247 (373) T TIGR00020 216 NARRHTSFASVEVMPEVDDDTIDIEIKPEDVR 247 (373) T ss_pred CCCCCCEEEEEEECCCCCCCCCCCCCCCCCEE T ss_conf 69533226888862403574451106887546 No 79 >COG1148 HdrA Heterodisulfide reductase, subunit A and related polyferredoxins [Energy production and conversion] Probab=36.36 E-value=23 Score=16.43 Aligned_cols=51 Identities=18% Similarity=0.213 Sum_probs=28.7 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHH---HHHHHHCHHH Q ss_conf 8999999999987385664107999825983018889999999---9997657634 Q gi|254780293|r 82 DLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVA---KALTYFQPDL 134 (170) Q Consensus 82 ~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIa---rAL~~~~p~~ 134 (170) ...+...++||.-... ..+=.+-|-|||++|-.-|+-||=. --|+.-+|.+ T Consensus 106 avakar~le~le~~~~--~v~~svLVIGGGvAGitAAl~La~~G~~v~LVEKepsi 159 (622) T COG1148 106 AVAKARKLEPLEEIKV--EVSKSVLVIGGGVAGITAALELADMGFKVYLVEKEPSI 159 (622) T ss_pred HHHHHHHCCCHHHHHH--HHCCCEEEECCCHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 9998764388056777--53053599868489899999999759808999417864 No 80 >pfam05488 PAAR_motif PAAR motif. This motif is found usually in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in a another family of hypothetical proteins. Probab=36.18 E-value=13 Score=17.93 Aligned_cols=16 Identities=31% Similarity=0.252 Sum_probs=13.2 Q ss_pred EECCCCEEEEECCCHH Q ss_conf 9818973999410088 Q gi|254780293|r 61 IKSGSGKFTINYVDIS 76 (170) Q Consensus 61 l~~GsG~I~INgk~~~ 76 (170) |..|+..++|||+|.. T Consensus 1 I~~Gs~~v~inG~PAA 16 (26) T pfam05488 1 IIEGSPNVKINGKPAA 16 (26) T ss_pred CCCCCCCCEECCCCHH T ss_conf 9247887118786065 No 81 >COG2884 FtsE Predicted ATPase involved in cell division [Cell division and chromosome partitioning] Probab=35.59 E-value=35 Score=15.33 Aligned_cols=76 Identities=16% Similarity=0.223 Sum_probs=42.5 Q ss_pred EECCCEEEEEE---EECCCCEEEEECCCHHHHC---------------------CC------------------HHHHHH Q ss_conf 11121089999---9818973999410088844---------------------69------------------899999 Q gi|254780293|r 50 GKRKTSIARVW---IKSGSGKFTINYVDISKYF---------------------TQ------------------DLLVLN 87 (170) Q Consensus 50 GrRKtsvArV~---l~~GsG~I~INgk~~~~Yf---------------------~~------------------~~~r~~ 87 (170) |-=|++.-+.. .+|-.|+|.+||.++..+- ++ ...+.. T Consensus 38 GAGKSTllkLi~~~e~pt~G~i~~~~~dl~~l~~~~iP~LRR~IGvVFQD~rLL~~~tvyeNVA~pL~v~G~~~~~i~~r 117 (223) T COG2884 38 GAGKSTLLKLIYGEERPTRGKILVNGHDLSRLKGREIPFLRRQIGVVFQDFRLLPDRTVYENVALPLRVIGKPPREIRRR 117 (223) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEECCEECCCCCCCCCCHHHHEEEEEEEECCCCCCCHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 88789999999841367876599888500325545360554412058641223465306654212010017887899999 Q ss_pred HHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9999987385664107999825983018889999999999765 Q gi|254780293|r 88 IKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 88 i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) +.+-|..+++.++-+..-..-.||- -=|.|||||++.- T Consensus 118 V~~~L~~VgL~~k~~~lP~~LSGGE-----QQRvaIARAiV~~ 155 (223) T COG2884 118 VSEVLDLVGLKHKARALPSQLSGGE-----QQRVAIARAIVNQ 155 (223) T ss_pred HHHHHHHHCCHHHHHCCCCCCCCHH-----HHHHHHHHHHCCC T ss_conf 9999988062025443960158237-----8899999998169 No 82 >TIGR02203 MsbA_lipidA lipid A export permease/ATP-binding protein MsbA; InterPro: IPR011917 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of a single polypeptide chain transporter in the ATP-binding cassette (ABC) transporter family, MsbA, which exports lipid A. It may also act in multidrug resistance. Lipid A, a part of lipopolysaccharide, is found in the outer leaflet of the outer membrane of most Gram-negative bacteria. Members of this family are restricted to the Proteobacteria (although lipid A is more broadly distributed) and often are clustered with lipid A biosynthesis genes .; GO: 0005524 ATP binding, 0006869 lipid transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=34.02 E-value=23 Score=16.41 Aligned_cols=92 Identities=23% Similarity=0.263 Sum_probs=58.5 Q ss_pred CCCCCCCEE-EEE---CCCEEEEE---EEECCCCEEEEECCCHHHHCCCHHHHHHHHH---------------------- Q ss_conf 454355001-311---12108999---9981897399941008884469899999999---------------------- Q gi|254780293|r 40 VDQWQRSYA-TGK---RKTSIARV---WIKSGSGKFTINYVDISKYFTQDLLVLNIKR---------------------- 90 (170) Q Consensus 40 ~d~~g~~~~-tGr---RKtsvArV---~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~---------------------- 90 (170) ....|...| +|| =||+++.- +..|-+|.|.+.|.|+.||-- ..+|.++-. T Consensus 384 ~~~~G~~vALVGRSGSGKsTlv~LlPRFy~p~~G~IllDG~~l~d~~L-~~LR~q~alVsQ~V~LFdDTiA~NvaYg~~d 462 (603) T TIGR02203 384 VVEPGETVALVGRSGSGKSTLVNLLPRFYEPDSGQILLDGVDLQDYTL-ADLRRQVALVSQDVVLFDDTIANNVAYGRLD 462 (603) T ss_pred EECCCCEEEEECCCCCHHHHHHHHCCCCCCCCCCCEEECCCHHHHCCH-HHHHHHHHHHCCCEEECCCCHHHHHHHCCCC T ss_conf 115873599870688538999855236604588856527840443026-6356230002152372053267764404234 Q ss_pred -----------HHHHH------C-CCCCEEEEEEEECCCCCH-HHHHHHHHHHHHHHHHCHHH Q ss_conf -----------99873------8-566410799982598301-88899999999997657634 Q gi|254780293|r 91 -----------PFNTV------S-QDNMYDVFATVSGGGLSG-QASAICHGVAKALTYFQPDL 134 (170) Q Consensus 91 -----------Pl~~~------~-~~~~~Di~~~V~GGG~sg-Qa~AirlaIarAL~~~~p~~ 134 (170) -+.-+ + +..-+|-.|-.+|+=+|| |=+ |+||||||.+=-|-| T Consensus 463 a~~~~~a~v~~A~~~Ayl~dfvd~lP~Gldt~vG~NG~~LSGGQRQ--RlAIARAllKdAPiL 523 (603) T TIGR02203 463 AEQVDRAEVERALAAAYLQDFVDKLPLGLDTPVGENGSRLSGGQRQ--RLAIARALLKDAPIL 523 (603) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHH--HHHHHHHHHHCCCHH T ss_conf 0017878999999986468887147888563424132237760458--999999997426623 No 83 >cd03291 ABCC_CFTR1 The CFTR subfamily domain 1. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits, or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=32.81 E-value=38 Score=15.05 Aligned_cols=79 Identities=18% Similarity=0.095 Sum_probs=43.0 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCC----HHHH-CCCHHHHHHHHH-----------HHHHH-------CCCCCEE Q ss_conf 311121089999---98189739994100----8884-469899999999-----------99873-------8566410 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVD----ISKY-FTQDLLVLNIKR-----------PFNTV-------SQDNMYD 102 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~----~~~Y-f~~~~~r~~i~~-----------Pl~~~-------~~~~~~D 102 (170) +|-=||+..+.. +.|-+|.|.++|+- -+.| |+ ...++.+.. .+..+ .....+| T Consensus 72 sGSGKSTLl~lL~gl~~p~~G~I~~~g~i~~v~Q~~~lf~-~TireNI~~g~~~~~~~~~~~~~~~~l~~~i~~lp~g~~ 150 (282) T cd03291 72 TGSGKTSLLMLILGELEPSEGKIKHSGRISFSSQFSWIMP-GTIKENIIFGVSYDEYRYKSVVKACQLEEDITKFPEKDN 150 (282) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEEEEECCCCCCCC-CCHHHHHHCCCCCCHHHHHHHHHHHCCHHHHHHCCCCCC T ss_conf 9981999999995787278658999999998657442267-109999751688688999999998514999984634255 Q ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 799982598301888999999999976 Q gi|254780293|r 103 VFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 103 i~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ..+.-.|..+| .-+.=|++|||||+. T Consensus 151 t~vge~G~~LS-GGQkQRlaiARALl~ 176 (282) T cd03291 151 TVLGEGGITLS-GGQRARISLARAVYK 176 (282) T ss_pred CEECCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 23003677589-999999999999842 No 84 >PRK08948 consensus Probab=32.27 E-value=35 Score=15.32 Aligned_cols=21 Identities=24% Similarity=0.327 Sum_probs=9.9 Q ss_pred EEEEEEEE--CCCCEEEEECCCH Q ss_conf 08999998--1897399941008 Q gi|254780293|r 55 SIARVWIK--SGSGKFTINYVDI 75 (170) Q Consensus 55 svArV~l~--~GsG~I~INgk~~ 75 (170) .+-.+.+. .+.|.+.++..++ T Consensus 75 pi~~i~v~d~~~~~~~~~~~~~~ 97 (392) T PRK08948 75 AITHVHVSDRGHAGFVTLAAEDY 97 (392) T ss_pred CCCEEEEEECCCCCEEECCHHHC T ss_conf 30078984078776254376660 No 85 >PRK08010 pyridine nucleotide-disulfide oxidoreductase; Provisional Probab=32.21 E-value=39 Score=14.99 Aligned_cols=40 Identities=20% Similarity=0.260 Sum_probs=20.5 Q ss_pred EEEEECCCCCHHHHHHHHH-----------HHHHHHHHCHHHH----HHHHHCCC Q ss_conf 9998259830188899999-----------9999976576344----87997798 Q gi|254780293|r 104 FATVSGGGLSGQASAICHG-----------VAKALTYFQPDLR----PQIKKGGF 143 (170) Q Consensus 104 ~~~V~GGG~sgQa~Airla-----------IarAL~~~~p~~r----~~Lk~~gl 143 (170) .+-|-|||..|---|-.++ -.+-|-.+|+++. ..|++.|. T Consensus 160 ~l~IiGgG~ig~E~A~~~~~lG~~Vtiie~~~~il~~~D~~~~~~l~~~l~~~Gi 214 (441) T PRK08010 160 HLGILGGGYIGVEFASMFANFGSKVTILEAASLFLPREDRDIADNIATILRDQGV 214 (441) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCE T ss_conf 7999898589999999999759878897046732663114689999999987885 No 86 >KOG2495 consensus Probab=32.16 E-value=39 Score=14.98 Aligned_cols=85 Identities=20% Similarity=0.266 Sum_probs=47.6 Q ss_pred EEEEEEEECCCC----EEEEECCCHHHHCCC-----HHHHHHHHHHHHHHCCC------CCEEEEEEEECCCCCHHHHH- Q ss_conf 089999981897----399941008884469-----89999999999873856------64107999825983018889- Q gi|254780293|r 55 SIARVWIKSGSG----KFTINYVDISKYFTQ-----DLLVLNIKRPFNTVSQD------NMYDVFATVSGGGLSGQASA- 118 (170) Q Consensus 55 svArV~l~~GsG----~I~INgk~~~~Yf~~-----~~~r~~i~~Pl~~~~~~------~~~Di~~~V~GGG~sgQa~A- 118 (170) .++.=+|.-..| .+-|+|.--+-+|-. ...|..++.-|+.+... .+==.++.|-|||.||--=| T Consensus 156 ~i~YDyLViA~GA~~~TFgipGV~e~~~FLKEv~dAqeIR~~~~~~le~a~~~~l~~eerkRlLh~VVVGGGPTGVEFAa 235 (491) T KOG2495 156 VIGYDYLVIAVGAEPNTFGIPGVEENAHFLKEVEDAQEIRRKVIDNLEKAELPGLSDEERKRLLHFVVVGGGPTGVEFAA 235 (491) T ss_pred EECCCEEEEECCCCCCCCCCCCHHHCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHEEEEEEECCCCCCEEEHH T ss_conf 60266899962677777899751101046664367999999999888875258998577540578999888876224567 Q ss_pred -HHHHHHHHHHHHCHHHHHHHH Q ss_conf -999999999765763448799 Q gi|254780293|r 119 -ICHGVAKALTYFQPDLRPQIK 139 (170) Q Consensus 119 -irlaIarAL~~~~p~~r~~Lk 139 (170) +.-=|-.-|..+.|++...++ T Consensus 236 EL~Dfi~~Dl~k~yp~l~~~i~ 257 (491) T KOG2495 236 ELADFIPEDLRKIYPELKKDIK 257 (491) T ss_pred HHHHHHHHHHHHHHHCCHHHEE T ss_conf 7787787889986041320348 No 87 >TIGR02138 phosphate_pstC phosphate ABC transporter, permease protein PstC; InterPro: IPR011864 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). The typical operon for the high affinity inorganic phosphate ABC transporter encodes an ATP-binding protein, a phosphate-binding protein, and two permease proteins. This family describes PstC, which is homologous to PstA (IPR005672 from INTERPRO). In the Escherichia coli, this transport system is induced when the concentration of extrallular inorganic phosphate is low. A constitutive, lower affinity transporter operates otherwise.; GO: 0005315 inorganic phosphate transmembrane transporter activity, 0006817 phosphate transport, 0016021 integral to membrane. Probab=31.89 E-value=35 Score=15.32 Aligned_cols=19 Identities=32% Similarity=0.461 Sum_probs=16.7 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 5983018889999999999 Q gi|254780293|r 109 GGGLSGQASAICHGVAKAL 127 (170) Q Consensus 109 GGG~sgQa~AirlaIarAL 127 (170) ==+.+|-.+|+.||++||| T Consensus 219 P~a~~GIv~a~~LglGRAl 237 (317) T TIGR02138 219 PAARSGIVGAVVLGLGRAL 237 (317) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 0002189999999999999 No 88 >pfam05491 RuvB_C Holliday junction DNA helicase ruvB C-terminus. The RuvB protein makes up part of the RuvABC revolvasome which catalyses the resolution of Holliday junctions that arise during genetic recombination and DNA repair. Branch migration is catalysed by the RuvB protein that is targeted to the Holliday junction by the structure specific RuvA protein. This family consists of the C-terminal region of the RuvB protein which is thought to be helicase DNA-binding domain. Probab=31.76 E-value=35 Score=15.27 Aligned_cols=41 Identities=22% Similarity=0.429 Sum_probs=26.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCCC Q ss_conf 598301888999999999976576344879977983357554 Q gi|254780293|r 109 GGGLSGQASAICHGVAKALTYFQPDLRPQIKKGGFLTRDSRI 150 (170) Q Consensus 109 GGG~sgQa~AirlaIarAL~~~~p~~r~~Lk~~glLt~D~R~ 150 (170) +||..|- ++|..+++-.--.....+.|.|-+.||+.|-||= T Consensus 21 ~ggPvGl-~tla~~l~e~~~TiE~viEPyLi~~G~i~RTprG 61 (75) T pfam05491 21 GGGPVGL-DTLAAALGEERDTIEDVIEPYLIQIGFIQRTPRG 61 (75) T ss_pred CCCCCCH-HHHHHHHCCCHHHHHHHHHHHHHHHCCHHCCCCH T ss_conf 8997129-9999998687408999843999995443208854 No 89 >cd03289 ABCC_CFTR2 The CFTR subfamily domain 2. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=31.66 E-value=40 Score=14.93 Aligned_cols=80 Identities=18% Similarity=0.141 Sum_probs=41.3 Q ss_pred EEECCCEEEEE--EEECCCCEEEEECCCHHHHCC-----------------CHHHHHH-----------HHHHHHHHCC- Q ss_conf 31112108999--998189739994100888446-----------------9899999-----------9999987385- Q gi|254780293|r 49 TGKRKTSIARV--WIKSGSGKFTINYVDISKYFT-----------------QDLLVLN-----------IKRPFNTVSQ- 97 (170) Q Consensus 49 tGrRKtsvArV--~l~~GsG~I~INgk~~~~Yf~-----------------~~~~r~~-----------i~~Pl~~~~~- 97 (170) +|-=||...+. .+.+-.|.|.|||.++..+-. +...++. +...+..++. T Consensus 39 sGsGKSTLl~lL~gl~~~~G~I~idg~~i~~~~~~~~r~~i~~vpQ~~~lf~~Ti~eNl~~~~~~~~~~i~~~~~~~~l~ 118 (275) T cd03289 39 TGSGKSTLLSAFLRLLNTEGDIQIDGVSWNSVPLQKWRKAFGVIPQKVFIFSGTFRKNLDPYGKWSDEEIWKVAEEVGLK 118 (275) T ss_pred CCCCHHHHHHHHHHHCCCCCEEEECCEECCCCCHHHHHHHEEEECCCCCCCCCCHHHHCCCCCCCCHHHHHHHHHHHCHH T ss_conf 99979999999960357895399998806736899997638996655632674199970321222889999999997669 Q ss_pred ------CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf ------66410799982598301888999999999976 Q gi|254780293|r 98 ------DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 98 ------~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .+.+|..+.-.|..+| .-+.=|++|||||+. T Consensus 119 ~~i~~lp~~ld~~~~~~g~~LS-gGqkQrl~lARaLl~ 155 (275) T cd03289 119 SVIEQFPGQLDFVLVDGGCVLS-HGHKQLMCLARSVLS 155 (275) T ss_pred HHHHHCCCCCCEEECCCCCCCC-HHHHHHHHHHHHHHC T ss_conf 9998573667403268887239-999999999999951 No 90 >PRK05589 peptide chain release factor 2; Provisional Probab=31.60 E-value=25 Score=16.24 Aligned_cols=29 Identities=34% Similarity=0.409 Sum_probs=20.1 Q ss_pred EEEECCCEEEEEEEECC---CCEEEEECCCHH Q ss_conf 13111210899999818---973999410088 Q gi|254780293|r 48 ATGKRKTSIARVWIKSG---SGKFTINYVDIS 76 (170) Q Consensus 48 ~tGrRKtsvArV~l~~G---sG~I~INgk~~~ 76 (170) +-|||-||.|.|.+.|- .-.|.||..++. T Consensus 168 s~gR~hTS~a~V~VlP~~~~~~~i~i~~~dl~ 199 (325) T PRK05589 168 ANGKRQTSFASVEVLPELTDDQDIEIRSEDLK 199 (325) T ss_pred CCCCEEEEEEEEEEECCCCCCCCCCCCCCCCE T ss_conf 78854877578899457666544777735642 No 91 >COG4138 BtuD ABC-type cobalamin transport system, ATPase component [Coenzyme metabolism] Probab=31.43 E-value=36 Score=15.26 Aligned_cols=34 Identities=24% Similarity=0.282 Sum_probs=24.1 Q ss_pred CEEEE-EEEECCCCEEEEECCCHHHHCCCHHHHHH Q ss_conf 10899-99981897399941008884469899999 Q gi|254780293|r 54 TSIAR-VWIKSGSGKFTINYVDISKYFTQDLLVLN 87 (170) Q Consensus 54 tsvAr-V~l~~GsG~I~INgk~~~~Yf~~~~~r~~ 87 (170) |-.|+ +-+.||+|.|.+.|+|++.|-..+..|.. T Consensus 40 TLLA~lAGm~~~sGsi~~~G~~l~~~~~~eLArhR 74 (248) T COG4138 40 TLLARMAGMTSGSGSIQFAGQPLEAWSATELARHR 74 (248) T ss_pred HHHHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHH T ss_conf 99999847788884279888465677777899999 No 92 >PRK06292 dihydrolipoamide dehydrogenase; Validated Probab=30.30 E-value=42 Score=14.79 Aligned_cols=18 Identities=28% Similarity=0.337 Sum_probs=11.1 Q ss_pred EEEEECCCCCHHHHHHHH Q ss_conf 999825983018889999 Q gi|254780293|r 104 FATVSGGGLSGQASAICH 121 (170) Q Consensus 104 ~~~V~GGG~sgQa~Airl 121 (170) .+.|-|||..|---|.-+ T Consensus 171 ~v~VIGgG~ig~E~A~~l 188 (460) T PRK06292 171 SLAVIGGGVIGLELGQAL 188 (460) T ss_pred EEEEECCCHHHHHHHHHH T ss_conf 699999867887999999 No 93 >COG1127 Ttg2A ABC-type transport system involved in resistance to organic solvents, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=30.21 E-value=43 Score=14.78 Aligned_cols=75 Identities=19% Similarity=0.351 Sum_probs=44.5 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHH--------------------------------------H--CCCHHHH Q ss_conf 31112108999---998189739994100888--------------------------------------4--4698999 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISK--------------------------------------Y--FTQDLLV 85 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~--------------------------------------Y--f~~~~~r 85 (170) .|-=||..-|. .+.|-+|+|.|.|.++.. | +|....+ T Consensus 43 SGsGKStlLr~I~Gll~P~~GeI~i~G~~i~~ls~~~~~~ir~r~GvlFQ~gALFssltV~eNVafplre~~~lp~~~i~ 122 (263) T COG1127 43 SGSGKSTLLRLILGLLRPDKGEILIDGEDIPQLSEEELYEIRKRMGVLFQQGALFSSLTVFENVAFPLREHTKLPESLIR 122 (263) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCCCHHCCCHHHHHHHHHHEEEEEECCCCCCCCCHHHHHHEEHHHHCCCCHHHHH T ss_conf 88689999999856578988759984764110498899999843517860561235565457310006864259999999 Q ss_pred HHHHHHHHHHCCCCC-EEEEEEEECCCCCHHHHHHHHHHHHHHH Q ss_conf 999999987385664-1079998259830188899999999997 Q gi|254780293|r 86 LNIKRPFNTVSQDNM-YDVFATVSGGGLSGQASAICHGVAKALT 128 (170) Q Consensus 86 ~~i~~Pl~~~~~~~~-~Di~~~V~GGG~sgQa~AirlaIarAL~ 128 (170) +.++.-|..+++-+. .|.+-.--.||+. =|-|+||||+ T Consensus 123 ~lv~~KL~~VGL~~~~~~~~PsELSGGM~-----KRvaLARAia 161 (263) T COG1127 123 ELVLMKLELVGLRGAAADLYPSELSGGMR-----KRVALARAIA 161 (263) T ss_pred HHHHHHHHHCCCCHHHHHHCCHHHCCHHH-----HHHHHHHHHH T ss_conf 99999998649985666639332043588-----9999999986 No 94 >cd03296 ABC_CysA_sulfate_importer Part of the ABC transporter complex cysAWTP involved in sulfate import. Responsible for energy coupling to the transport system. The complex is composed of two ATP-binding proteins (cysA), two transmembrane proteins (cysT and cysW), and a solute-binding protein (cysP). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=29.91 E-value=33 Score=15.45 Aligned_cols=47 Identities=19% Similarity=0.251 Sum_probs=32.3 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHH----------------HCCCHHHHHHHHHHHHHH Q ss_conf 311121089999---98189739994100888----------------446989999999999873 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISK----------------YFTQDLLVLNIKRPFNTV 95 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~----------------Yf~~~~~r~~i~~Pl~~~ 95 (170) .|-=||...++. ..|-+|.|.+||+++.. .||+..-.+.|..||.+- T Consensus 37 SGsGKSTll~~i~Gl~~p~~G~I~~~g~~i~~~~~~~r~ig~vfQ~~~Lfp~ltV~eNi~~~l~~~ 102 (239) T cd03296 37 SGSGKTTLLRLIAGLERPDSGTILFGGEDATDVPVQERNVGFVFQHYALFRHMTVFDNVAFGLRVK 102 (239) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEECCCCCCCCCCCHHHHHHHHHHHC T ss_conf 997799999999769999863999999999999965677679817821067996999998799733 No 95 >PRK10851 sulfate/thiosulfate transporter subunit; Provisional Probab=29.80 E-value=24 Score=16.37 Aligned_cols=46 Identities=24% Similarity=0.340 Sum_probs=26.0 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHH----------------HCCCHHHHHHHHHHHHH Q ss_conf 311121089999---98189739994100888----------------44698999999999987 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISK----------------YFTQDLLVLNIKRPFNT 94 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~----------------Yf~~~~~r~~i~~Pl~~ 94 (170) .|-=||...+.. ..|-+|.|.++|+++.. .|++..-.+.+..|+.+ T Consensus 37 SGsGKSTLlr~iaGL~~p~sG~I~~~G~dv~~~~~~~r~Ig~vfQ~~~L~p~ltV~eNi~~gl~~ 101 (352) T PRK10851 37 SGSGKTTLLRIIAGLEHQTSGHIRFHGTDVSRLHARDRKVGFVFQHYALFRHMTVFDNIAFGLTV 101 (352) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCCCCCCEEEEECCCHHCCCCCHHHHHHHHHHH T ss_conf 98469999999976999995699999999998993008489994071214588099999877754 No 96 >PRK07334 threonine dehydratase; Provisional Probab=29.50 E-value=35 Score=15.33 Aligned_cols=36 Identities=28% Similarity=0.566 Sum_probs=18.5 Q ss_pred HHHHHCCCCCEEEEEE-EECCCCCHHHHHHHHHHHHHHHHHCHH Q ss_conf 9987385664107999-825983018889999999999765763 Q gi|254780293|r 91 PFNTVSQDNMYDVFAT-VSGGGLSGQASAICHGVAKALTYFQPD 133 (170) Q Consensus 91 Pl~~~~~~~~~Di~~~-V~GGG~sgQa~AirlaIarAL~~~~p~ 133 (170) -+++.+....+|..+- |-|||+.+ |||.++-..+|+ T Consensus 159 glEileq~p~~D~vvvpvGGGGLis-------Gia~a~K~~~P~ 195 (399) T PRK07334 159 ALEMLEDAPDLDTLVVPIGGGGLIS-------GIATAAKALKPD 195 (399) T ss_pred HHHHHHHCCCCCEEEEECCCCHHHH-------HHHHHHHCCCCC T ss_conf 9999985789998999278604799-------999999604999 No 97 >COG3179 Predicted chitinase [General function prediction only] Probab=29.38 E-value=22 Score=16.53 Aligned_cols=71 Identities=14% Similarity=0.112 Sum_probs=41.2 Q ss_pred EEEEEEECCCCEEEEECCCHHHHCCCH---------HHH----HHHHHHHHH--HCCC-------CCEEEEEEEECCCCC Q ss_conf 899999818973999410088844698---------999----999999987--3856-------641079998259830 Q gi|254780293|r 56 IARVWIKSGSGKFTINYVDISKYFTQD---------LLV----LNIKRPFNT--VSQD-------NMYDVFATVSGGGLS 113 (170) Q Consensus 56 vArV~l~~GsG~I~INgk~~~~Yf~~~---------~~r----~~i~~Pl~~--~~~~-------~~~Di~~~V~GGG~s 113 (170) .-.-|+.+|+|.|-|-|++=..++... -.- ...+.+.-. +.-+ ...-|+..+.| |.. T Consensus 110 sgDGw~yRgrg~iQiTGrdNY~~~g~alg~dlv~~P~~~~~~~~~a~~a~wyw~~~g~n~~aD~gd~~~VT~~ING-G~N 188 (206) T COG3179 110 SGDGWLYRGRGLIQITGRDNYRRCGRALGLDLVANPGQLELDPHAARVAAWYWKTRGLNALADEGDLVRVTRKING-GLN 188 (206) T ss_pred CCCCCEECCCCCEEEECCHHHHHHHHHHCCCCCCCHHHHCCCHHHHHHHHHHHHHCCHHHHHCCCCHHHHHHHHCC-CCC T ss_conf 8974020367630320522788888860898667944432571777788999986157776116773343332057-632 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 1888999999999976 Q gi|254780293|r 114 GQASAICHGVAKALTY 129 (170) Q Consensus 114 gQa~AirlaIarAL~~ 129 (170) |+.+ |++..+++.. T Consensus 189 G~~d--R~~r~~~a~~ 202 (206) T COG3179 189 GLDD--RLARYRRASG 202 (206) T ss_pred CHHH--HHHHHHHHHC T ss_conf 3889--9999999745 No 98 >cd03250 ABCC_MRP_domain1 Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=28.99 E-value=45 Score=14.65 Aligned_cols=80 Identities=16% Similarity=0.131 Sum_probs=40.2 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCH----HHHCCCHHHHHHHH-----------HHHHHHCCC-------CCEEE Q ss_conf 311121089999---981897399941008----88446989999999-----------999873856-------64107 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDI----SKYFTQDLLVLNIK-----------RPFNTVSQD-------NMYDV 103 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~----~~Yf~~~~~r~~i~-----------~Pl~~~~~~-------~~~Di 103 (170) +|-=||+..++. +.|-+|.|.+||+-. +.++.+.+.++.+. ..+..++.. ...+. T Consensus 40 sGsGKSTLl~~i~G~~~~~~G~I~~~g~i~~v~Q~~~l~~~Tv~eNi~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 119 (204) T cd03250 40 VGSGKSSLLSALLGELEKLSGSVSVPGSIAYVSQEPWIQNGTIRENILFGKPFDEERYEKVIKACALEPDLEILPDGDLT 119 (204) T ss_pred CCCCHHHHHHHHCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHCCCCC T ss_conf 99858999999818952568952258988999587756775099997446866889999999997619889851353524 Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 99982598301888999999999976 Q gi|254780293|r 104 FATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 104 ~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .+.-.|.-+|| -+.=|++|||||+. T Consensus 120 ~i~~~~~~LSg-GQkQRvalARal~~ 144 (204) T cd03250 120 EIGEKGINLSG-GQKQRISLARAVYS 144 (204) T ss_pred EECCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 52378875799-99999999999953 No 99 >PRK13538 cytochrome c biogenesis protein CcmA; Provisional Probab=28.51 E-value=38 Score=15.09 Aligned_cols=27 Identities=26% Similarity=0.347 Sum_probs=18.6 Q ss_pred EECCCEEEEE---EEECCCCEEEEECCCHH Q ss_conf 1112108999---99818973999410088 Q gi|254780293|r 50 GKRKTSIARV---WIKSGSGKFTINYVDIS 76 (170) Q Consensus 50 GrRKtsvArV---~l~~GsG~I~INgk~~~ 76 (170) |-=||+..++ .+.|-+|.|.+||+++. T Consensus 37 GaGKSTLl~~l~Gl~~p~~G~i~~~g~~i~ 66 (204) T PRK13538 37 GAGKTSLLRILAGLARPDAGEVLWQGEPIR 66 (204) T ss_pred CCCHHHHHHHHHCCCCCCCCEEEECCEECC T ss_conf 985999999997688888737999999976 No 100 >KOG2665 consensus Probab=28.41 E-value=39 Score=14.99 Aligned_cols=13 Identities=46% Similarity=1.025 Sum_probs=5.4 Q ss_pred CEEEEEEEECCCCCH Q ss_conf 410799982598301 Q gi|254780293|r 100 MYDVFATVSGGGLSG 114 (170) Q Consensus 100 ~~Di~~~V~GGG~sg 114 (170) +||+.+ -|||+.| T Consensus 48 ~~D~Vv--vGgGiVG 60 (453) T KOG2665 48 RYDLVV--VGGGIVG 60 (453) T ss_pred CCCEEE--ECCCEEE T ss_conf 554799--8884322 No 101 >PRK10752 sulfate transporter subunit; Provisional Probab=28.11 E-value=46 Score=14.55 Aligned_cols=84 Identities=17% Similarity=0.181 Sum_probs=48.3 Q ss_pred EEEEECCCHHHHCCCHHHHHHHHHHHHH-HCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHC-HHHHHHHHHCCCC Q ss_conf 3999410088844698999999999987-3856641079998259830188899999999997657-6344879977983 Q gi|254780293|r 67 KFTINYVDISKYFTQDLLVLNIKRPFNT-VSQDNMYDVFATVSGGGLSGQASAICHGVAKALTYFQ-PDLRPQIKKGGFL 144 (170) Q Consensus 67 ~I~INgk~~~~Yf~~~~~r~~i~~Pl~~-~~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~~-p~~r~~Lk~~glL 144 (170) .+++-+.+++. .++.+.. +..-|.- -.....-+|.++-+=||.+.||.|+--|+-=-++.+. +..-..|-+.||+ T Consensus 21 ~~~llnvSYd~--tRely~~-~n~~F~~~wk~~tG~~V~i~qShGGSg~QARaVi~Gl~ADVVtlal~~Did~l~~~GlI 97 (329) T PRK10752 21 DIQLLNVSYDP--TRELYEQ-YNKAFSAHWKQQTGDNVVIRQSHGGSGKQATSVINGIEADVVTLALAYDVDAIAERGRI 97 (329) T ss_pred CEEEEEECCHH--HHHHHHH-HHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHCCCCCCEEEECCHHHHHHHHHCCCC T ss_conf 52699614075--8999999-98999999988329903898557876278999847875677875142136899867999 Q ss_pred CCCCCCCCC Q ss_conf 357554167 Q gi|254780293|r 145 TRDSRIVER 153 (170) Q Consensus 145 t~D~R~~Er 153 (170) ..|.+..-+ T Consensus 98 ~~dW~~~~P 106 (329) T PRK10752 98 DKNWIKRLP 106 (329) T ss_pred CCCHHHHCC T ss_conf 914565488 No 102 >pfam10865 DUF2703 Protein of unknown function (DUF2703). This family of protein has no known function. Probab=27.87 E-value=34 Score=15.40 Aligned_cols=17 Identities=18% Similarity=0.243 Sum_probs=14.3 Q ss_pred CCCEEEEECCCHHHHCC Q ss_conf 89739994100888446 Q gi|254780293|r 64 GSGKFTINYVDISKYFT 80 (170) Q Consensus 64 GsG~I~INgk~~~~Yf~ 80 (170) -|-.|.|||+|+++.+. T Consensus 61 eSn~I~inG~plE~wl~ 77 (120) T pfam10865 61 ESNRIWINGRPLEEWLG 77 (120) T ss_pred CCCEEEECCEEHHHHHC T ss_conf 58758999986899828 No 103 >COG4987 CydC ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=27.72 E-value=47 Score=14.51 Aligned_cols=80 Identities=21% Similarity=0.252 Sum_probs=48.0 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHHHHCCCHHHHHHH------------------------------HHHHHHH Q ss_conf 311121089999---9818973999410088844698999999------------------------------9999873 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDISKYFTQDLLVLNI------------------------------KRPFNTV 95 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~~Yf~~~~~r~~i------------------------------~~Pl~~~ 95 (170) +|-=|++..... ..|-+|+|+.||.|+..|=+ ...++.+ -+-|..+ T Consensus 373 SGsGKSTllqLl~~~~~~~~G~i~~~g~~~~~l~~-~~~~e~i~vl~Qr~hlF~~Tlr~NL~lA~~~AsDEel~~aL~qv 451 (573) T COG4987 373 SGSGKSTLLQLLAGAWDPQQGSITLNGVEIASLDE-QALRETISVLTQRVHLFSGTLRDNLRLANPDASDEELWAALQQV 451 (573) T ss_pred CCCCHHHHHHHHHHCCCCCCCEEEECCCCHHHCCH-HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHC T ss_conf 99878999999972358788736578867331883-66899875412321777778998773059999889999999981 Q ss_pred CCC-------CCEEEEEEEECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 856-------64107999825983018889999999999765 Q gi|254780293|r 96 SQD-------NMYDVFATVSGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 96 ~~~-------~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~ 130 (170) ++. ..++-++--.|-++|| -+.=|+||||+|+.- T Consensus 452 gL~~l~~~~p~gl~t~lge~G~~LSG-GE~rRLAlAR~LL~d 492 (573) T COG4987 452 GLEKLLESAPDGLNTWLGEGGRRLSG-GERRRLALARALLHD 492 (573) T ss_pred CHHHHHHHCHHHHHCHHCCCCCCCCC-HHHHHHHHHHHHHCC T ss_conf 87999874735342222259974781-389999999999707 No 104 >COG1608 Predicted archaeal kinase [General function prediction only] Probab=26.11 E-value=50 Score=14.33 Aligned_cols=78 Identities=18% Similarity=0.181 Sum_probs=45.3 Q ss_pred EEEECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHH--------------- Q ss_conf 99981897399941008884469899999999998738566410799982598301888999999--------------- Q gi|254780293|r 59 VWIKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNMYDVFATVSGGGLSGQASAICHGV--------------- 123 (170) Q Consensus 59 V~l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~V~GGG~sgQa~AirlaI--------------- 123 (170) +.|+-|-..|+=-++|.. |....++..+.+- .+ ++.+=.+-|||||.=|...|-.+++ T Consensus 3 ~IlKlGGSvITdK~~p~t--~r~~~l~ria~eI---~~--~~~~~livVHGgGSFGHp~Ak~~~~~~~~~~~s~~G~~~~ 75 (252) T COG1608 3 IILKLGGSVITDKDKPRT--VREDRLRRIAREI---SN--GKPEKLIVVHGGGSFGHPAAKEFGLEGLKNYLSPLGFSLT 75 (252) T ss_pred EEEEECCEEEECCCCCCH--HHHHHHHHHHHHH---HC--CCCCCEEEEECCCCCCCHHHHHHCCCCCCCCCCCCCHHHH T ss_conf 899955616525887321--1189999999998---45--9954389994586656788997076544466674343999 Q ss_pred HHHHHHHCHHHHHHHHHCCC Q ss_conf 99997657634487997798 Q gi|254780293|r 124 AKALTYFQPDLRPQIKKGGF 143 (170) Q Consensus 124 arAL~~~~p~~r~~Lk~~gl 143 (170) .+|...++...-..|-++|. T Consensus 76 ~~am~~L~~~V~~~l~~~Gv 95 (252) T COG1608 76 HLAMLELNSIVVDALLDAGV 95 (252) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999999986698 No 105 >COG1116 TauB ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=25.76 E-value=51 Score=14.29 Aligned_cols=75 Identities=17% Similarity=0.230 Sum_probs=39.4 Q ss_pred EECCCEEEEE---EEECCCCEEEEECCCHH-----------H--HCCCHHHHHHHHHHHHHH------------------ Q ss_conf 1112108999---99818973999410088-----------8--446989999999999873------------------ Q gi|254780293|r 50 GKRKTSIARV---WIKSGSGKFTINYVDIS-----------K--YFTQDLLVLNIKRPFNTV------------------ 95 (170) Q Consensus 50 GrRKtsvArV---~l~~GsG~I~INgk~~~-----------~--Yf~~~~~r~~i~~Pl~~~------------------ 95 (170) |==|+..=|+ ...|.+|.|.++|+++. + .||-.+-+..|..||.+. T Consensus 39 GcGKSTLLriiAGL~~p~~G~V~~~g~~v~~p~~~~~~vFQ~~~LlPW~Tv~~NV~l~l~~~~~~~~e~~~~a~~~L~~V 118 (248) T COG1116 39 GCGKSTLLRLIAGLEKPTSGEVLLDGRPVTGPGPDIGYVFQEDALLPWLTVLDNVALGLELRGKSKAEARERAKELLELV 118 (248) T ss_pred CCCHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEECCCCCCCHHHHHHHHEEHHHCCCCCHHHHHHHHHHHHHHC T ss_conf 78899999999687877775599888215789987799926676451466884435044125656176899999999975 Q ss_pred CCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 8566410799982598301888999999999976 Q gi|254780293|r 96 SQDNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 96 ~~~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ++.+--|-+..--.||+- =|-||||||+. T Consensus 119 gL~~~~~~~P~qLSGGMr-----QRVaiARAL~~ 147 (248) T COG1116 119 GLAGFEDKYPHQLSGGMR-----QRVAIARALAT 147 (248) T ss_pred CCCCHHHCCCCCCCHHHH-----HHHHHHHHHHC T ss_conf 974210169600184799-----99999999714 No 106 >cd03231 ABC_CcmA_heme_exporter CcmA, the ATP-binding component of the bacterial CcmAB transporter. The CCM family is involved in bacterial cytochrome c biogenesis. Cytochrome c maturation in E. coli requires the ccm operon, which encodes eight membrane proteins (CcmABCDEFGH). CcmE is a periplasmic heme chaperone that binds heme covalently and transfers it onto apocytochrome c in the presence of CcmF, CcmG, and CcmH. The CcmAB proteins represent an ABC transporter and the CcmCD proteins participate in heme transfer to CcmE. Probab=25.67 E-value=51 Score=14.28 Aligned_cols=75 Identities=17% Similarity=0.218 Sum_probs=38.0 Q ss_pred EECCCEEEEE---EEECCCCEEEEECCCHH-----------------------------HHCCCHHHHHHHHHHHHHHCC Q ss_conf 1112108999---99818973999410088-----------------------------844698999999999987385 Q gi|254780293|r 50 GKRKTSIARV---WIKSGSGKFTINYVDIS-----------------------------KYFTQDLLVLNIKRPFNTVSQ 97 (170) Q Consensus 50 GrRKtsvArV---~l~~GsG~I~INgk~~~-----------------------------~Yf~~~~~r~~i~~Pl~~~~~ 97 (170) |-=||+..++ .+.|-+|.|.+||+++. .+|.....+..+.+-|..++. T Consensus 36 GsGKSTLlk~i~Gl~~p~sG~i~~~g~~~~~~~~~~~~~i~~~~~~~~~~~~ltv~en~~~~~~~~~~~~~~~~L~~~~l 115 (201) T cd03231 36 GSGKTTLLRILAGLSPPLAGRVLLNGGPLDFQRDSIARGLLYLGHAPGIKTTLSVLENLRFWHADHSDEQVEEALARVGL 115 (201) T ss_pred CCCHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHEEEECCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCC T ss_conf 99999999999667788852999999983314877671179978765455787899999875322369999999998599 Q ss_pred CCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 66410799982598301888999999999976 Q gi|254780293|r 98 DNMYDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 98 ~~~~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ....|..+. .+| .-+-=|++|||||+. T Consensus 116 ~~~~~~~~~----~LS-gGqkqRv~lA~al~~ 142 (201) T cd03231 116 NGFEDRPVA----QLS-AGQQRRVALARLLLS 142 (201) T ss_pred CHHHCCCHH----HCC-HHHHHHHHHHHHHHC T ss_conf 103258823----479-999999999999974 No 107 >COG1125 OpuBA ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism] Probab=25.58 E-value=45 Score=14.60 Aligned_cols=29 Identities=34% Similarity=0.468 Sum_probs=20.0 Q ss_pred EECCCEEEEE---EEECCCCEEEEECCCHHHH Q ss_conf 1112108999---9981897399941008884 Q gi|254780293|r 50 GKRKTSIARV---WIKSGSGKFTINYVDISKY 78 (170) Q Consensus 50 GrRKtsvArV---~l~~GsG~I~INgk~~~~Y 78 (170) |--||..-+. .+.|-+|.|.|||+++.+| T Consensus 37 GsGKTTtLkMINrLiept~G~I~i~g~~i~~~ 68 (309) T COG1125 37 GSGKTTTLKMINRLIEPTSGEILIDGEDISDL 68 (309) T ss_pred CCCHHHHHHHHHCCCCCCCCEEEECCEECCCC T ss_conf 97578799999605588885389899044658 No 108 >smart00798 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=25.11 E-value=41 Score=14.85 Aligned_cols=18 Identities=22% Similarity=0.285 Sum_probs=9.4 Q ss_pred CCCCCHHHHHHHHHHHHH Q ss_conf 598301888999999999 Q gi|254780293|r 109 GGGLSGQASAICHGVAKA 126 (170) Q Consensus 109 GGG~sgQa~AirlaIarA 126 (170) |.|.++..+|+++||.|| T Consensus 294 GaGQ~sRvda~~~A~~kA 311 (311) T smart00798 294 GAGQMSRVDSARIAAEKA 311 (311) T ss_pred CCCCCCHHHHHHHHHHHC T ss_conf 898555999999999609 No 109 >PRK00951 hisB imidazoleglycerol-phosphate dehydratase; Validated Probab=24.98 E-value=53 Score=14.20 Aligned_cols=43 Identities=16% Similarity=0.212 Sum_probs=31.6 Q ss_pred HHHHHHHHHCCCCCEEEEEEE-ECCCCCHHHHHHHHHHHHHHHH Q ss_conf 999999873856641079998-2598301888999999999976 Q gi|254780293|r 87 NIKRPFNTVSQDNMYDVFATV-SGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 87 ~i~~Pl~~~~~~~~~Di~~~V-~GGG~sgQa~AirlaIarAL~~ 129 (170) .+..=|.-.-...+..+++.+ .|.-..-.++|+--|++|||-. T Consensus 134 ~~~hFf~s~a~~a~~tlHi~~~~G~N~HH~~Ea~FKa~g~Alr~ 177 (195) T PRK00951 134 LVREFFQAFANNAGITLHIDVLYGRNAHHIIEALFKAFARALRM 177 (195) T ss_pred HHHHHHHHHHHHCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHH T ss_conf 59999999984086389999614677389999999999999999 No 110 >cd03295 ABC_OpuCA_Osmoprotection OpuCA is a the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment. ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition, to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=24.96 E-value=41 Score=14.87 Aligned_cols=80 Identities=26% Similarity=0.242 Sum_probs=44.8 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHH------------------CCCHHHHHHHHHHHHHHCCC--------- Q ss_conf 31112108999---9981897399941008884------------------46989999999999873856--------- Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKY------------------FTQDLLVLNIKRPFNTVSQD--------- 98 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Y------------------f~~~~~r~~i~~Pl~~~~~~--------- 98 (170) .|-=||...+. .+.|-+|.|.|||+++..+ ||+..-.+.|..|+.+.+.. T Consensus 36 SG~GKSTllr~i~gl~~p~~G~I~i~g~~i~~~~~~~lrr~ig~vfQ~~~Lfp~ltV~eNi~~~~~~~~~~~~~~~~rv~ 115 (242) T cd03295 36 SGSGKTTTMKMINRLIEPTSGEIFIDGEDIREQDPVELRRKIGYVIQQIGLFPHMTVEENIALVPKLLKWPKEKIRERAD 115 (242) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99569999999975999981599999999999997897388679917997588882999999999975999999999999 Q ss_pred ---CCEEEE----EEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf ---641079----9982598301888999999999976 Q gi|254780293|r 99 ---NMYDVF----ATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 99 ---~~~Di~----~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) ..+++. ..-...-+|| -+.=|.+|||||+. T Consensus 116 ell~~v~L~~~~~~~~~p~~LSG-GqkQRvaiARAl~~ 152 (242) T cd03295 116 ELLALVGLDPAEFADRYPHELSG-GQQQRVGVARALAA 152 (242) T ss_pred HHHHHCCCCCHHHCCCCHHHCCH-HHHHHHHHHHHHHC T ss_conf 99987499930110079566899-99999999999962 No 111 >TIGR01277 thiQ thiamine ABC transporter, ATP-binding protein; InterPro: IPR005968 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Thiamine pyrophosphate (TPP) is a required cofactor synthesized de novo in Salmonella typhimurium. The primary role for TPP is in central metabolism as an electron carrier and nucleophile for such enzymes as pyruvate dehydrogenase (1.2.4.1 from EC), acetolactate synthase (4.1.3.18 from EC), and alpha-ketoglutarate dehydrogenase (1.2.4.2 from EC). Despite its importance in cellular physiology, neither the de novo biosynthetic pathway nor the salvage systems for thiamine are fully understood in any organism. The model describes thiamine ABC transporter, ATP-binding protein, believed to be involved in the specific translocation of thiamine and its phosphoesters across the inner membrane The protein belongs to the larger ABC transport system which consists of at least three components: the inner membrane permease; thiamine binding protein and an ATP-binding subunit. This protein is found so far only in Proteobacteria, and is found in complete genomes only if the ThiB and ThiP subunits are also found. It has been experimentally demonstrated that mutants in the various steps in the de novo synthesis of thiamine and its biologically active form, namely thiamine pyrophosphate can be exogenously supplemented with thiamine, thiamine monophosphate or thiamine pyrophosphate. ; GO: 0005524 ATP binding, 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=24.94 E-value=29 Score=15.79 Aligned_cols=16 Identities=25% Similarity=0.511 Sum_probs=13.5 Q ss_pred EEECCCCEEEEECCCH Q ss_conf 9981897399941008 Q gi|254780293|r 60 WIKSGSGKFTINYVDI 75 (170) Q Consensus 60 ~l~~GsG~I~INgk~~ 75 (170) ++.|-+|.|+||+++. T Consensus 47 F~~PasG~i~~nd~~~ 62 (213) T TIGR01277 47 FLEPASGEIKVNDKDH 62 (213) T ss_pred HCCCCCEEEEECCCCC T ss_conf 0477640588778012 No 112 >COG0410 LivF ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism] Probab=24.57 E-value=24 Score=16.31 Aligned_cols=77 Identities=22% Similarity=0.330 Sum_probs=41.5 Q ss_pred CCCEEEEEE---EECCCCEEEEECCCHHH-------------------HCCCHHHHHHHHHHHHHHC--------CCCCE Q ss_conf 121089999---98189739994100888-------------------4469899999999998738--------56641 Q gi|254780293|r 52 RKTSIARVW---IKSGSGKFTINYVDISK-------------------YFTQDLLVLNIKRPFNTVS--------QDNMY 101 (170) Q Consensus 52 RKtsvArV~---l~~GsG~I~INgk~~~~-------------------Yf~~~~~r~~i~~Pl~~~~--------~~~~~ 101 (170) =||+.-+.. +.+-+|.|..+|+++.. .|++.+-.+.++-...... .+.-| T Consensus 41 GKTTlLkti~Gl~~~~~G~I~~~G~dit~~p~~~r~r~Gi~~VPegR~iF~~LTVeENL~~g~~~~~~~~~~~~~~e~v~ 120 (237) T COG0410 41 GKTTLLKTIMGLVRPRSGRIIFDGEDITGLPPHERARLGIAYVPEGRRIFPRLTVEENLLLGAYARRDKEAQERDLEEVY 120 (237) T ss_pred CHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHHHHCCEEECCCCCCCHHHCCHHHHHHHHHHCCCCCCCCCCCHHHHH T ss_conf 88999999858987887069989835677997899857768675213610007599987423102453100011389999 Q ss_pred EEEEE------EECCCCCHHHHHHHHHHHHHHHH Q ss_conf 07999------82598301888999999999976 Q gi|254780293|r 102 DVFAT------VSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 102 Di~~~------V~GGG~sgQa~AirlaIarAL~~ 129 (170) +.|-. -.||-+|| -+-=++||||||.. T Consensus 121 ~lFP~Lker~~~~aG~LSG-GEQQMLAiaRALm~ 153 (237) T COG0410 121 ELFPRLKERRNQRAGTLSG-GEQQMLAIARALMS 153 (237) T ss_pred HHCHHHHHHHHCCCCCCCH-HHHHHHHHHHHHHC T ss_conf 9786389884084467781-99999999999961 No 113 >cd03301 ABC_MalK_N The N-terminal ATPase domain of the maltose transporter, MalK. ATP binding cassette (ABC) proteins function from bacteria to human, mediating the translocation of substances into and out of cells or organelles. ABC transporters contain two transmembrane-spanning domains (TMDs) or subunits and two nucleotide binding domains (NBDs) or subunits that couple transport to the hydrolysis of ATP. In the maltose transport system, the periplasmic maltose binding protein (MBP) stimulates the ATPase activity of the membrane-associated transporter, which consists of two transmembrane subunits, MalF and MalG, and two copies of the ATP binding subunit, MalK, and becomes tightly bound to the transporter in the catalytic transition state, ensuring that maltose is passed to the transporter as ATP is hydrolyzed. Probab=23.92 E-value=53 Score=14.17 Aligned_cols=80 Identities=18% Similarity=0.179 Sum_probs=45.7 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHH----------------HHCCCHHHHHHHHHHHHHHCCCC---------- Q ss_conf 31112108999---99818973999410088----------------84469899999999998738566---------- Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDIS----------------KYFTQDLLVLNIKRPFNTVSQDN---------- 99 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~----------------~Yf~~~~~r~~i~~Pl~~~~~~~---------- 99 (170) .|-=||...+. ...|-+|.|.+||+++. ..||+......|..|+.+-+... T Consensus 35 SG~GKSTlLr~iaGl~~p~~G~I~~~g~~i~~~~~~~R~ig~VfQ~~~LfP~ltV~eNI~~~l~~~~~~~~e~~~~v~~~ 114 (213) T cd03301 35 SGCGKTTTLRMIAGLEEPTSGRIYIGGRDVTDLPPKDRDIAMVFQNYALYPHMTVYDNIAFGLKLRKVPKDEIDERVREV 114 (213) T ss_pred CCCHHHHHHHHHHCCCCCCCEEEEECCEECCCCCHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 98809999999976999986399999999999997678878994587646547099999989998599989999999999 Q ss_pred --CEEEE--EEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf --41079--9982598301888999999999976 Q gi|254780293|r 100 --MYDVF--ATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 100 --~~Di~--~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .+++. ..-.-.-+|| -+.=|-||||||+. T Consensus 115 l~~~gl~~~~~~~P~~LSG-GqkQRVaiARAl~~ 147 (213) T cd03301 115 AELLQIEHLLDRKPKQLSG-GQRQRVALGRAIVR 147 (213) T ss_pred HHHCCCHHHHCCCHHHCCH-HHHHHHHHHHHHHC T ss_conf 9875992465099556999-99999999999875 No 114 >COG1224 TIP49 DNA helicase TIP49, TBP-interacting protein [Transcription] Probab=23.89 E-value=55 Score=14.08 Aligned_cols=10 Identities=40% Similarity=0.773 Sum_probs=4.9 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780293|r 118 AICHGVAKAL 127 (170) Q Consensus 118 AirlaIarAL 127 (170) |+.+|||+.| T Consensus 80 AlA~gIa~eL 89 (450) T COG1224 80 ALAMGIAREL 89 (450) T ss_pred HHHHHHHHHH T ss_conf 9999999985 No 115 >PRK07342 peptide chain release factor 2; Provisional Probab=23.88 E-value=48 Score=14.45 Aligned_cols=29 Identities=24% Similarity=0.452 Sum_probs=20.7 Q ss_pred EEEECCCEEEEEEEECCC---CEEEEECCCHH Q ss_conf 131112108999998189---73999410088 Q gi|254780293|r 48 ATGKRKTSIARVWIKSGS---GKFTINYVDIS 76 (170) Q Consensus 48 ~tGrRKtsvArV~l~~Gs---G~I~INgk~~~ 76 (170) +-|||-||.|.|.+.|-. -.|.||..++. T Consensus 154 s~gr~hTS~asV~VlP~~~d~~~i~i~~~Dlr 185 (322) T PRK07342 154 SNARRHTSFASIWVYPVIDDNIEVDVNESDVR 185 (322) T ss_pred CCCCEEEEEEEEEEEECCCCCCCCCCCHHHCC T ss_conf 88866876679999824466433569833612 No 116 >PRK07818 dihydrolipoamide dehydrogenase; Reviewed Probab=23.82 E-value=55 Score=14.07 Aligned_cols=40 Identities=25% Similarity=0.283 Sum_probs=21.5 Q ss_pred EEEEECCCCCHHHHHHHHH-----------HHHHHHHHCHHHHH----HHHHCCC Q ss_conf 9998259830188899999-----------99999765763448----7997798 Q gi|254780293|r 104 FATVSGGGLSGQASAICHG-----------VAKALTYFQPDLRP----QIKKGGF 143 (170) Q Consensus 104 ~~~V~GGG~sgQa~Airla-----------IarAL~~~~p~~r~----~Lk~~gl 143 (170) .+.|-|||..|---|..+. -.|.|-.+++++.. .|++.|. T Consensus 174 ~v~IiGgG~ig~E~A~~~~~lG~~Vtiv~~~~~~l~~~d~~~~~~l~~~l~~~Gi 228 (467) T PRK07818 174 SIVIAGAGAIGMEFAYVLKNYGVDVTIVEFLDRALPNEDAEVSKEIEKQYKKLGV 228 (467) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 6999998799999999998329877776211313544226789999999986698 No 117 >KOG2158 consensus Probab=23.53 E-value=56 Score=14.03 Aligned_cols=41 Identities=17% Similarity=0.286 Sum_probs=29.4 Q ss_pred CCCEEEEECC-CHHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 8973999410-088844698999999999987385664107999 Q gi|254780293|r 64 GSGKFTINYV-DISKYFTQDLLVLNIKRPFNTVSQDNMYDVFAT 106 (170) Q Consensus 64 GsG~I~INgk-~~~~Yf~~~~~r~~i~~Pl~~~~~~~~~Di~~~ 106 (170) |+|...||.. ...+| ++...++.|..||.+.+.. |||+.+. T Consensus 239 g~giylisDir~~g~~-Q~~~vQeyV~~pLli~dky-Kfd~rvy 280 (565) T KOG2158 239 GSGIYLISDIREKGEY-QNKKVQEYVTYPLLISDKY-KFDQRVY 280 (565) T ss_pred CCCEEEECHHHHHHHH-HHHHHHHHHCCCCCCCCCC-EEEEEEE T ss_conf 7640552022221077-8788888832433136652-1465541 No 118 >cd03293 ABC_NrtD_SsuB_transporters NrtD and SsuB are the ATP-binding subunits of the bacterial ABC-type nitrate and sulfonate transport systems, respectively. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=23.01 E-value=51 Score=14.29 Aligned_cols=80 Identities=15% Similarity=0.168 Sum_probs=43.0 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHH-------------HHCCCHHHHHHHHHHHHHHCCC------------CC Q ss_conf 31112108999---99818973999410088-------------8446989999999999873856------------64 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDIS-------------KYFTQDLLVLNIKRPFNTVSQD------------NM 100 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~-------------~Yf~~~~~r~~i~~Pl~~~~~~------------~~ 100 (170) .|-=||...+. .+.|-+|.|.+||+++. ..||+..-.+.+..|+.+.+.. .. T Consensus 39 sGsGKSTLl~~i~Gl~~p~~G~I~~~G~~i~~~~~~ig~vfQ~~~L~p~~tv~eni~~~l~~~~~~~~~~~~~v~~~l~~ 118 (220) T cd03293 39 SGCGKSTLLRIIAGLERPTSGEVLVDGEPVTGPGPDRGYVFQQDALLPWLTVLDNVALGLELQGVPKAEARERAEELLEL 118 (220) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 99579999999975999887389999996788898879992488537788799999889986599989999999999998 Q ss_pred EEEE--EEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 1079--9982598301888999999999976 Q gi|254780293|r 101 YDVF--ATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 101 ~Di~--~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) +++. ..-.-.-+|| -+.=|.+|||||+. T Consensus 119 ~gL~~~~~~~p~~LSG-GqkQRvaiARaL~~ 148 (220) T cd03293 119 VGLSGFENAYPHQLSG-GMRQRVALARALAV 148 (220) T ss_pred CCCHHHHHCCHHHCCH-HHHHHHHHHHHHHC T ss_conf 7895476189312999-99999999999866 No 119 >COG0381 WecB UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane] Probab=22.89 E-value=58 Score=13.95 Aligned_cols=48 Identities=21% Similarity=0.256 Sum_probs=30.0 Q ss_pred HHHHHHHHHHCCC-CCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCHH Q ss_conf 9999999873856-64107999825983018889999999999765763 Q gi|254780293|r 86 LNIKRPFNTVSQD-NMYDVFATVSGGGLSGQASAICHGVAKALTYFQPD 133 (170) Q Consensus 86 ~~i~~Pl~~~~~~-~~~Di~~~V~GGG~sgQa~AirlaIarAL~~~~p~ 133 (170) +....-|.+.+.. -.||..+.-.|+.++.+.+-+..++.+.|....|+ T Consensus 46 em~~~~le~~~i~~pdy~L~i~~~~~tl~~~t~~~i~~~~~vl~~~kPD 94 (383) T COG0381 46 EMLDQVLELFGIRKPDYDLNIMKPGQTLGEITGNIIEGLSKVLEEEKPD 94 (383) T ss_pred HHHHHHHHHHCCCCCCCCHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 7899999982898888313216668888999999999999999862999 No 120 >TIGR02734 crtI_fam phytoene desaturase; InterPro: IPR014105 Phytoene is converted to lycopene by desaturation at four (two symmetrical pairs of) sites. This is achieved by two enzymes (crtP and crtQ) in cyanobacteria (Gloeobacter being an exception) and plants, but by a single enzyme in most other bacteria and in fungi. This single enzyme is called the bacterial-type phytoene desaturase, or CrtI. Most members of this family are CrtI and are part of flavin containing amine oxidoreductase family. It is likely that all members act on either phytoene or on related compounds such as dehydrosqualene, for carotenoid biosynthesis.. Probab=22.86 E-value=52 Score=14.23 Aligned_cols=13 Identities=46% Similarity=0.608 Sum_probs=4.1 Q ss_pred CCCCCHHHHHHHH Q ss_conf 5983018889999 Q gi|254780293|r 109 GGGLSGQASAICH 121 (170) Q Consensus 109 GGG~sgQa~Airl 121 (170) |+|.-|-|=|+|| T Consensus 5 GAG~gGLA~A~rL 17 (526) T TIGR02734 5 GAGFGGLALAIRL 17 (526) T ss_pred CCCHHHHHHHHHH T ss_conf 7776899999999 No 121 >pfam03971 IDH Monomeric isocitrate dehydrogenase. NADP(+)-dependent isocitrate dehydrogenase (ICD) is an important enzyme of the intermediary metabolism, as it controls the carbon flux within the citric acid cycle and supplies the cell with 2-oxoglutarate EC:1.1.1.42 and NADPH for biosynthetic purposes. Probab=22.83 E-value=53 Score=14.17 Aligned_cols=46 Identities=17% Similarity=0.382 Sum_probs=33.5 Q ss_pred EECCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHCCCCC-EEEEEEEECCCCC Q ss_conf 9818973999410088844698999999999987385664-1079998259830 Q gi|254780293|r 61 IKSGSGKFTINYVDISKYFTQDLLVLNIKRPFNTVSQDNM-YDVFATVSGGGLS 113 (170) Q Consensus 61 l~~GsG~I~INgk~~~~Yf~~~~~r~~i~~Pl~~~~~~~~-~Di~~~V~GGG~s 113 (170) ++.|...|.|-|--+-||+.. |.|+.-+|+..| .-|.--..|||+- T Consensus 528 ~~~G~dtIsvTGNVLRDYLTD-------LFPILElGTSAKMLSIVPLm~GGGlf 574 (735) T pfam03971 528 LKRGQDTISVTGNVLRDYLTD-------LFPILELGTSAKMLSIVPLMAGGGMF 574 (735) T ss_pred HHCCCCEEEEECHHHHHHHCC-------HHHHHHCCCCCCCEEEEEECCCCCEE T ss_conf 975898388621467765411-------12343203432110222320478323 No 122 >KOG1251 consensus Probab=22.61 E-value=59 Score=13.92 Aligned_cols=23 Identities=35% Similarity=0.589 Sum_probs=14.4 Q ss_pred HHHHHCCCCCEE-EEEEEECCCCC Q ss_conf 998738566410-79998259830 Q gi|254780293|r 91 PFNTVSQDNMYD-VFATVSGGGLS 113 (170) Q Consensus 91 Pl~~~~~~~~~D-i~~~V~GGG~s 113 (170) .|++++..+..| +++-|-|||+- T Consensus 164 A~ElleqVg~iDalfvpvgGGGll 187 (323) T KOG1251 164 ALELLEQVGEIDALFVPVGGGGLL 187 (323) T ss_pred HHHHHHHHCCCCEEEEEECCCCHH T ss_conf 999998507653699960686046 No 123 >TIGR01811 sdhA_Bsu succinate dehydrogenase or fumarate reductase, flavoprotein subunit; InterPro: IPR011280 This entry represents the succinate dehydrogenase flavoprotein subunit as found in the low-GC Gram-positive bacteria and a few other lineages. This enzyme may act in a complete or partial TCA cycle, or act in the opposite direction as fumarate reductase. In some but not all species, succinate dehydrogenase and fumarate reductase may be encoded as separate isozymes.. Probab=22.60 E-value=54 Score=14.13 Aligned_cols=14 Identities=36% Similarity=0.404 Sum_probs=7.6 Q ss_pred EEEEEEECCCCCHH Q ss_conf 07999825983018 Q gi|254780293|r 102 DVFATVSGGGLSGQ 115 (170) Q Consensus 102 Di~~~V~GGG~sgQ 115 (170) -+.=+.--+|.||| T Consensus 116 qvsRT~Yarg~TGQ 129 (620) T TIGR01811 116 QVSRTFYARGQTGQ 129 (620) T ss_pred EEEEEEECCCCHHH T ss_conf 00121341776278 No 124 >pfam02601 Exonuc_VII_L Exonuclease VII, large subunit. This family consist of exonuclease VII, large subunit EC:3.1.11.6 This enzyme catalyses exonucleolytic cleavage in either 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. This exonuclease VII enzyme is composed of one large subunit and 4 small ones. Probab=22.47 E-value=59 Score=13.90 Aligned_cols=32 Identities=22% Similarity=0.373 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHCC---CCCEEEEEEEECCCCCHH Q ss_conf 99999999987385---664107999825983018 Q gi|254780293|r 84 LVLNIKRPFNTVSQ---DNMYDVFATVSGGGLSGQ 115 (170) Q Consensus 84 ~r~~i~~Pl~~~~~---~~~~Di~~~V~GGG~sgQ 115 (170) .-..|..-|...+. ...||+.+-++|||-.-- T Consensus 56 a~~~I~~ai~~~~~~~~~~~~Dviii~RGGGS~eD 90 (295) T pfam02601 56 AAESIVSALERANERETALDYDVIVIIRGGGSKED 90 (295) T ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCHHH T ss_conf 89999999999984689899838999578688898 No 125 >PRK05714 2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol hydroxylase; Provisional Probab=22.33 E-value=59 Score=13.89 Aligned_cols=10 Identities=20% Similarity=0.557 Sum_probs=4.1 Q ss_pred CCCEEEEECC Q ss_conf 8973999410 Q gi|254780293|r 64 GSGKFTINYV 73 (170) Q Consensus 64 GsG~I~INgk 73 (170) +.|.+..+.. T Consensus 89 ~~~~~~f~~~ 98 (405) T PRK05714 89 GTGQIHFSAA 98 (405) T ss_pred CCCEEEECCC T ss_conf 8622764410 No 126 >COG4559 ABC-type hemin transport system, ATPase component [Inorganic ion transport and metabolism] Probab=22.14 E-value=41 Score=14.90 Aligned_cols=28 Identities=21% Similarity=0.297 Sum_probs=20.9 Q ss_pred EEECCCCEEEEECCCHHHHCCCHHHHHH Q ss_conf 9981897399941008884469899999 Q gi|254780293|r 60 WIKSGSGKFTINYVDISKYFTQDLLVLN 87 (170) Q Consensus 60 ~l~~GsG~I~INgk~~~~Yf~~~~~r~~ 87 (170) .+.|.+|++++||.+++.+-+.+..+.. T Consensus 50 el~p~~G~v~~~g~~l~~~~~~~lA~~r 77 (259) T COG4559 50 ELSPDSGEVTLNGVPLNSWPPEELARHR 77 (259) T ss_pred CCCCCCCEEEECCCCHHHCCHHHHHHHH T ss_conf 6378887676679281237999998775 No 127 >PRK07245 consensus Probab=21.73 E-value=56 Score=14.03 Aligned_cols=29 Identities=24% Similarity=0.209 Sum_probs=20.1 Q ss_pred EEEECCCEEEEEEEECCC---CEEEEECCCHH Q ss_conf 131112108999998189---73999410088 Q gi|254780293|r 48 ATGKRKTSIARVWIKSGS---GKFTINYVDIS 76 (170) Q Consensus 48 ~tGrRKtsvArV~l~~Gs---G~I~INgk~~~ 76 (170) +-|||-||.|.|++.|-- -.|.||..++. T Consensus 178 s~gRrhTS~asV~VlP~~dd~i~i~i~~~Dlr 209 (337) T PRK07245 178 SAKRRHTSFASVEVMPELDDTIEVEVRDDDIK 209 (337) T ss_pred CCCCEEEEEEEEEEECCCCCCCCCCCCCCCEE T ss_conf 88866656789999535576545678875638 No 128 >COG0644 FixC Dehydrogenases (flavoproteins) [Energy production and conversion] Probab=21.31 E-value=62 Score=13.76 Aligned_cols=10 Identities=20% Similarity=0.076 Sum_probs=4.7 Q ss_pred EEEEEEECCC Q ss_conf 8999998189 Q gi|254780293|r 56 IARVWIKSGS 65 (170) Q Consensus 56 vArV~l~~Gs 65 (170) ..-.|+.|-. T Consensus 197 ~Gy~wifP~~ 206 (396) T COG0644 197 GGYGWIFPLG 206 (396) T ss_pred CCEEEEEECC T ss_conf 8579999889 No 129 >PRK11614 livF leucine/isoleucine/valine transporter ATP-binding subunit; Provisional Probab=21.23 E-value=6.6 Score=19.80 Aligned_cols=28 Identities=25% Similarity=0.395 Sum_probs=17.6 Q ss_pred EECCCEEEEEE---EECCCCEEEEECCCHHH Q ss_conf 11121089999---98189739994100888 Q gi|254780293|r 50 GKRKTSIARVW---IKSGSGKFTINYVDISK 77 (170) Q Consensus 50 GrRKtsvArV~---l~~GsG~I~INgk~~~~ 77 (170) |==||...+.. +.|-+|.|++||+++.. T Consensus 41 GaGKSTLl~~i~G~~~~~~G~I~~~G~~i~~ 71 (237) T PRK11614 41 GAGKTTLLGTLCGDPRATSGRIVFDGKDITD 71 (237) T ss_pred CCCHHHHHHHHHCCCCCCCEEEEECCEECCC T ss_conf 9759999999967998896289999998887 No 130 >PRK00591 prfA peptide chain release factor 1; Validated Probab=20.86 E-value=60 Score=13.84 Aligned_cols=27 Identities=30% Similarity=0.322 Sum_probs=17.5 Q ss_pred EEECCCEEEEEEEECCCCE--EEEECCCH Q ss_conf 3111210899999818973--99941008 Q gi|254780293|r 49 TGKRKTSIARVWIKSGSGK--FTINYVDI 75 (170) Q Consensus 49 tGrRKtsvArV~l~~GsG~--I~INgk~~ 75 (170) -|||-||.|.|++.|---. |.||..++ T Consensus 196 ~gR~HTSfasV~V~P~~~~~~i~i~~~dl 224 (360) T PRK00591 196 QGRIHTSAATVAVLPEAEEVEVEINPKDL 224 (360) T ss_pred CCCEEEEEEEEEEECCCCCCCCCCCCCCE T ss_conf 78566400478970676431025685402 No 131 >TIGR01988 Ubi-OHases Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family; InterPro: IPR010971 This entry represents a family of FAD-dependent hydroxylases (monooxygenases) which are all believed to act in the aerobic ubiquinone biosynthesis pathway . A separate set of hydroxylases, as yet undiscovered, are believed to be active under anaerobic conditions . In Escherichia coli, three enzyme activities have been described: UbiB (which acts first at position 6, see IPR010232 from INTERPRO), UbiH (which acts at position 4, ) and UbiF (which acts at position 5, ). UbiH and UbiF are similar to one another and form the basis of this subfamily. Interestingly, E. coli contains another hydroxylase gene, called visC, that is highly similar to UbiF, adjacent to UbiH and, when mutated, results in a phenotype similar to that of UbiH (which has also been named visB) . Several other species appear to have three homologs in this family, although they assort themselves differently on phylogenetic trees (e.g. Xylella and Mesorhizobium) making it difficult to ascribe a specific activity to each one. Eukaryotes appear to have only a single homologue in this subfamily (COQ6, ) which complements UbiH, but also possess a non-orthologous gene, COQ7 which complements UbiF.; GO: 0016709 oxidoreductase activity acting on paired donors with incorporation or reduction of molecular oxygen NADH or NADPH as one donor and incorporation of one atom of oxygen, 0050660 FAD binding, 0006744 ubiquinone biosynthetic process. Probab=20.79 E-value=64 Score=13.69 Aligned_cols=17 Identities=35% Similarity=0.653 Sum_probs=11.6 Q ss_pred EEECCCCCHHHHHHHHH Q ss_conf 98259830188899999 Q gi|254780293|r 106 TVSGGGLSGQASAICHG 122 (170) Q Consensus 106 ~V~GGG~sgQa~Airla 122 (170) -+-|||+.|-+=|+.|+ T Consensus 3 ~IvGgG~VG~~lA~aL~ 19 (445) T TIGR01988 3 VIVGGGMVGLALALALA 19 (445) T ss_pred EEECCCHHHHHHHHHHH T ss_conf 88847388999999984 No 132 >cd07914 IGPD Imidazoleglycerol-phosphate dehydratase. Imidazoleglycerol-phosphate dehydratase (IGPD; EC 4.2.1.19) catalyzes the dehydration of imidazole glycerol phosphate to imidazole acetol phosphate, the sixth step of histidine biosynthesis in plants and microorganisms where the histidine is synthesized de novo. There is an internal repeat in the protein domain that is related by pseudo-dyad symmetry, perhaps as a result of an ancient gene duplication. The apo-form of IGPD exists as a catalytically inactive trimer which, in the presence of specific divalent metal cations such as manganese (Mn2+), cobalt (Co2+), cadmium (Cd2+), nickel (Ni2+), iron (Fe2+) and zinc (Zn2+), assembles to form a biologically active high molecular weight metalloenzyme; a 24-mer with 4-3-2 symmetry. Each 24-mer has 24 active sites, and contains around 1.5 metal ions per monomer, each monomer contributing residues to three separate active sites. IGPD enzymes are monofunctional in fungi, plants, archaea and s Probab=20.78 E-value=64 Score=13.69 Aligned_cols=45 Identities=18% Similarity=0.133 Sum_probs=33.3 Q ss_pred HHHHHHHHHHCCCCCEEEEEEE-ECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9999999873856641079998-25983018889999999999765 Q gi|254780293|r 86 LNIKRPFNTVSQDNMYDVFATV-SGGGLSGQASAICHGVAKALTYF 130 (170) Q Consensus 86 ~~i~~Pl~~~~~~~~~Di~~~V-~GGG~sgQa~AirlaIarAL~~~ 130 (170) +.+..=|.-.-...+..+++.+ .|.-..-.++|+--|.+|||-.. T Consensus 129 el~~hFf~s~a~~a~~tlHi~~~~G~N~HH~iEa~FKalg~ALr~A 174 (190) T cd07914 129 ELVEEFFRSFANNAGITLHIRVLYGRNDHHIIEAIFKAFARALRQA 174 (190) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 5599999999836854999994457874899999999999999998 No 133 >pfam01808 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase EC:2.1.2.3 (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase EC:3.5.4.10 (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=20.73 E-value=58 Score=13.94 Aligned_cols=18 Identities=22% Similarity=0.350 Sum_probs=7.3 Q ss_pred CCCCHHHHHHHHHHHHHH Q ss_conf 983018889999999999 Q gi|254780293|r 110 GGLSGQASAICHGVAKAL 127 (170) Q Consensus 110 GG~sgQa~AirlaIarAL 127 (170) ||+.+--.-+-..+|+.| T Consensus 172 GGIva~N~~vd~~~A~~i 189 (315) T pfam01808 172 GGIIALNDEVDVETAKEI 189 (315) T ss_pred CCEEEECCHHHHHHHHHH T ss_conf 868986570008899999 No 134 >PRK06467 dihydrolipoamide dehydrogenase; Reviewed Probab=20.65 E-value=64 Score=13.67 Aligned_cols=17 Identities=29% Similarity=0.405 Sum_probs=10.6 Q ss_pred EEEEECCCCCHHHHHHH Q ss_conf 99982598301888999 Q gi|254780293|r 104 FATVSGGGLSGQASAIC 120 (170) Q Consensus 104 ~~~V~GGG~sgQa~Air 120 (170) .+-|-|||..|---|.. T Consensus 176 ~v~IiGgG~ig~E~A~~ 192 (472) T PRK06467 176 RLLVMGGGIIGLEMGTV 192 (472) T ss_pred EEEEECCCHHHHHHHHH T ss_conf 59999564728999999 No 135 >cd03290 ABCC_SUR1_N The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=20.36 E-value=65 Score=13.63 Aligned_cols=28 Identities=21% Similarity=0.192 Sum_probs=17.3 Q ss_pred EEECCCEEEEEE---EECCCCEEEEECCCHH Q ss_conf 311121089999---9818973999410088 Q gi|254780293|r 49 TGKRKTSIARVW---IKSGSGKFTINYVDIS 76 (170) Q Consensus 49 tGrRKtsvArV~---l~~GsG~I~INgk~~~ 76 (170) +|-=|++..++. +.+-+|.|.+||++.. T Consensus 36 sGsGKSTLl~~l~g~~~~~~G~I~~~~~~~~ 66 (218) T cd03290 36 VGCGKSSLLLAILGEMQTLEGKVHWSNKNES 66 (218) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEECCEECC T ss_conf 9980999999985556567764998986668 No 136 >PRK11247 ssuB aliphatic sulfonates transport ATP-binding subunit; Provisional Probab=20.16 E-value=66 Score=13.61 Aligned_cols=76 Identities=13% Similarity=0.182 Sum_probs=39.6 Q ss_pred EEECCCEEEEE---EEECCCCEEEEECCCHHHH-------------CC------------CHHHHHHHHHHHHHHCCCCC Q ss_conf 31112108999---9981897399941008884-------------46------------98999999999987385664 Q gi|254780293|r 49 TGKRKTSIARV---WIKSGSGKFTINYVDISKY-------------FT------------QDLLVLNIKRPFNTVSQDNM 100 (170) Q Consensus 49 tGrRKtsvArV---~l~~GsG~I~INgk~~~~Y-------------f~------------~~~~r~~i~~Pl~~~~~~~~ 100 (170) .|==||...+. .+.|-+|.|.+++.++... |+ ...++..+.+-|..++.... T Consensus 47 nGaGKSTLlr~i~Gl~~p~~G~I~~~~~~i~~~~~~i~~vfQ~~~l~~~~tV~eni~~gl~~~~~~~~~e~l~~vgL~~~ 126 (257) T PRK11247 47 SGCGKSTLLRLLAGLETPTAGDLLAGTAPLAEAQEDTRLMFQDARLLPWKKVIDNVGLGLKGQWRDAALQALAAVGLADR 126 (257) T ss_pred CCCHHHHHHHHHHCCCCCCCCEEEECCEEHHHHHHCEEEEECCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCCCHH T ss_conf 98889999999965898888708989875544311007993256447677899998632141069999999998599135 Q ss_pred EEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 10799982598301888999999999976 Q gi|254780293|r 101 YDVFATVSGGGLSGQASAICHGVAKALTY 129 (170) Q Consensus 101 ~Di~~~V~GGG~sgQa~AirlaIarAL~~ 129 (170) .|-.+.--.|| +-=|++|||||+. T Consensus 127 ~~~~p~~LSGG-----qkQRvaiAraL~~ 150 (257) T PRK11247 127 ANEWPAALSGG-----QKQRVALARALIH 150 (257) T ss_pred HHCCHHHCCHH-----HHHHHHHHHHHHC T ss_conf 53694448999-----9999999999845 No 137 >PRK05976 dihydrolipoamide dehydrogenase; Validated Probab=20.01 E-value=66 Score=13.59 Aligned_cols=20 Identities=25% Similarity=0.190 Sum_probs=13.3 Q ss_pred EEEEEECCCCCHHHHHHHHH Q ss_conf 79998259830188899999 Q gi|254780293|r 103 VFATVSGGGLSGQASAICHG 122 (170) Q Consensus 103 i~~~V~GGG~sgQa~Airla 122 (170) =.+.|-|||..|---|..+. T Consensus 176 k~v~ViGgG~ig~E~A~~~~ 195 (464) T PRK05976 176 KSLVVVGGGYIGLEWGSMLR 195 (464) T ss_pred CEEEEECCCHHHHHHHHHHH T ss_conf 55999899689999999999 Done!