Query 537021.9.peg.789_1 Match_columns 143 No_of_seqs 109 out of 6853 Neff 8.5 Searched_HMMs 39220 Date Wed May 25 02:39:38 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_789.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR02212 lolCE lipoprotein re 100.0 6.2E-42 0 231.7 8.1 143 1-143 272-414 (414) 2 PRK11146 outer membrane-specif 100.0 2.2E-37 5.6E-42 208.5 12.5 143 1-143 270-412 (412) 3 TIGR02213 lolE_release lipopro 100.0 2.4E-35 6E-40 198.2 5.7 143 1-143 274-416 (416) 4 COG4591 LolE ABC-type transpor 100.0 8E-34 2.1E-38 190.4 12.8 137 1-143 272-408 (408) 5 PRK10535 macrolide transporter 100.0 3E-32 7.8E-37 182.4 11.7 122 3-143 527-648 (648) 6 PRK10814 outer membrane-specif 100.0 5.3E-32 1.3E-36 181.1 11.9 130 1-140 267-396 (396) 7 TIGR03434 ADOP Acidobacterial 99.9 4.5E-25 1.1E-29 145.9 12.9 118 6-143 686-803 (803) 8 TIGR03434 ADOP Acidobacterial 99.9 7.8E-23 2E-27 134.5 12.7 126 3-142 276-401 (803) 9 COG0577 SalY ABC-type antimicr 99.9 5.2E-21 1.3E-25 125.2 12.8 129 3-143 291-419 (419) 10 pfam02687 FtsX Predicted perme 99.8 4E-19 1E-23 115.6 12.3 119 4-136 56-174 (174) 11 PRK11026 ftsX cell division pr 99.5 2.6E-13 6.7E-18 86.0 11.5 118 9-136 190-308 (309) 12 COG2177 FtsX Cell division pro 99.2 3.5E-10 9E-15 70.1 11.2 66 4-69 173-238 (297) 13 TIGR01185 devC DevC protein; I 99.2 5E-11 1.3E-15 74.4 6.7 110 10-139 271-380 (381) 14 COG3127 Predicted ABC-type tra 99.2 5.4E-10 1.4E-14 69.2 10.6 119 7-141 263-382 (829) 15 TIGR00439 ftsX putative protei 98.9 5.9E-09 1.5E-13 63.9 8.6 119 5-133 189-308 (314) 16 COG3127 Predicted ABC-type tra 98.7 2.4E-07 6.1E-12 55.7 11.2 113 7-142 714-828 (829) 17 COG0795 Predicted permeases [G 92.1 0.31 8E-06 24.6 4.8 49 18-66 74-122 (364) 18 pfam03739 YjgP_YjgQ Predicted 90.8 0.77 2E-05 22.6 5.7 49 18-66 68-116 (353) 19 pfam03649 UPF0014 Uncharacteri 84.9 2.3 5.9E-05 20.2 8.8 43 21-63 149-191 (250) 20 TIGR01654 bact_immun_7tm bacte 84.3 2.5 6.3E-05 20.0 10.8 99 24-142 221-319 (745) 21 TIGR00837 araaP aromatic amino 80.6 3.5 9E-05 19.2 5.4 101 32-138 262-372 (391) 22 TIGR00801 ncs2 uracil-xanthine 75.4 5.1 0.00013 18.4 4.2 105 30-134 200-309 (480) 23 COG0390 ABC-type uncharacteriz 67.1 7.8 0.0002 17.5 3.6 43 19-61 145-187 (256) 24 pfam07242 DUF1430 Protein of u 57.9 12 0.00031 16.5 11.4 49 14-62 9-57 (100) 25 TIGR02138 phosphate_pstC phosp 45.9 20 0.0005 15.4 6.2 44 26-72 196-239 (317) 26 TIGR02139 permease_CysT sulfat 43.5 22 0.00055 15.2 3.1 81 35-122 167-250 (265) 27 PRK06298 type III secretion sy 39.8 25 0.00063 14.9 5.5 10 130-139 167-176 (360) 28 COG4149 ModC ABC-type molybdat 34.8 30 0.00077 14.5 6.4 99 32-133 122-223 (225) 29 pfam12072 DUF3552 Domain of un 29.2 34 0.00087 14.2 1.8 17 54-70 4-20 (201) 30 TIGR01666 YCCS hypothetical me 29.2 29 0.00075 14.6 1.4 20 57-76 438-457 (713) 31 TIGR02230 ATPase_gene1 F0F1-AT 27.1 42 0.0011 13.8 3.3 16 99-114 69-84 (100) 32 pfam12273 RCR Chitin synthesis 20.0 58 0.0015 13.0 3.0 13 20-32 18-30 (124) No 1 >TIGR02212 lolCE lipoprotein releasing system, transmembrane protein, LolC/E family; InterPro: IPR011925 This entry describes the LolC protein, and its paralog LolE found in some species. These proteins are homologous to permease proteins of ABC transporters. In some species, two paralogs occur, designated LolC and LolE. In others, a single form is found and tends to be designated LolC .; GO: 0006810 transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=100.00 E-value=6.2e-42 Score=231.66 Aligned_cols=143 Identities=47% Similarity=0.804 Sum_probs=141.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 98999999999999999999999999999999999807552222036888987653233456678875310000000012 Q 537021.9.peg.7 1 MFVILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKF 80 (143) Q Consensus 1 m~ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~ 80 (143) ||+++++++.||||||++++.|.++||+.||||||++|+++++|.++|+++++..|++|+++|+++|+.+++.+.....+ T Consensus 272 Mfl~L~LIi~VAaFNIvS~L~M~V~dK~~dIAILRT~Ga~~~~im~IF~~~G~~~G~~GtllG~i~Gv~~~lN~~~i~~~ 351 (414) T TIGR02212 272 MFLILSLIIAVAAFNIVSTLVMAVKDKQGDIAILRTLGATPGQIMRIFIVQGLLIGVIGTLLGVILGVLLALNLTTIINF 351 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHHEEEECCCCCHHHHHHCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999788542034423752006665337987751111447879999999999999999999989999999 Q ss_pred CCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 000268513680022331057044069999999999999999999999998299999982187 Q 537021.9.peg.7 81 FLHTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRGE 143 (143) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~e 143 (143) .+...+...++++.|+.+.+|+.++|.....+...+++++++|++||||||+|.+|+|+|||| T Consensus 352 ~e~~~G~~~l~~~iYf~~~LPs~l~~~dv~~i~~~al~LSllATlYPA~RAa~~~PA~aLRYE 414 (414) T TIGR02212 352 IEWLIGVKFLNADIYFLDFLPSKLDPLDVLLIIVAALVLSLLATLYPAWRAAKLQPAEALRYE 414 (414) T ss_pred HHHHHCCEEECCCCHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCC T ss_conf 988749353142101101075201078899999999999999888899998602804650389 No 2 >PRK11146 outer membrane-specific lipoprotein transporter subunit LolE; Provisional Probab=100.00 E-value=2.2e-37 Score=208.52 Aligned_cols=143 Identities=38% Similarity=0.721 Sum_probs=134.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 98999999999999999999999999999999999807552222036888987653233456678875310000000012 Q 537021.9.peg.7 1 MFVILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKF 80 (143) Q Consensus 1 m~ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~ 80 (143) |++++.+++++++||+++++.+++.||+||||+|||+|+++++|+++|++|+++++++|+++|+++|..++..+...... T Consensus 270 ~~~il~li~~va~fnIi~~l~m~V~eK~reIgiLkalG~~~~~I~~iFl~eg~~ig~~G~~~G~~lG~~l~~~l~~~~~~ 349 (412) T PRK11146 270 MYLAMVLVIGVACFNIVSTLVMAVKDKSGDIAILRTLGAKDGLIRAIFVWYGLLAGLKGSLIGVVIGVVVSLNLTPIIKG 349 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999998887999999948998899999999999999999999999999999999999999 Q ss_pred CCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 000268513680022331057044069999999999999999999999998299999982187 Q 537021.9.peg.7 81 FLHTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRGE 143 (143) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~e 143 (143) ....++...++++.|+.+.+|+.+.|..++.+.+.+++++++++++||||++|++|+|+||+| T Consensus 350 ~~~~~~~~~~~~~iy~i~~lP~~i~~~~v~~i~~~~~i~~~las~~PA~rAa~l~P~eaLr~e 412 (412) T PRK11146 350 IEKLIGHQFLSGDIYFIDFLPSELHWLDVAYVLVTALLLSLLASWYPARRASKLDPARVLSGQ 412 (412) T ss_pred HHHHHCCCCCCCCCEEECCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC T ss_conf 999837555897520120355171399999999999999999999999999787999983599 No 3 >TIGR02213 lolE_release lipoprotein releasing system, transmembrane protein LolE; InterPro: IPR011926 This protein is part of an unusual ABC transporter complex that releases lipoproteins from the periplasmic side of the bacterial inner membrane, rather than transport any substrate across the inner membrane. In some species, the permease-like transmembrane protein is represented by two paralogs, LolC and LolE, both in the LolCDE complex. This family consists of LolE, as found in Escherichia coli and related species.; GO: 0006810 transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=100.00 E-value=2.4e-35 Score=198.17 Aligned_cols=143 Identities=35% Similarity=0.666 Sum_probs=140.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 98999999999999999999999999999999999807552222036888987653233456678875310000000012 Q 537021.9.peg.7 1 MFVILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKF 80 (143) Q Consensus 1 m~ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~ 80 (143) ||+.+.+++-||||||++|+.|.+++|..||++||++||.+.=|+++|.++++.-|+.|+++|.++|+.+++.+..+..+ T Consensus 274 MYlaMVLVIGVACFNIVSTL~MAVKDk~gdIAilRTlGA~dglIkrIFvWyGl~aG~~Gsl~G~v~Gv~vaLnLTt~I~g 353 (416) T TIGR02213 274 MYLAMVLVIGVACFNIVSTLVMAVKDKAGDIAILRTLGAKDGLIKRIFVWYGLLAGLLGSLLGVVVGVVVALNLTTLIKG 353 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHEECCCCCEEHHHHCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 99988888655556467664531004776530210126576523202425547863546665443224534011147889 Q ss_pred CCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 000268513680022331057044069999999999999999999999998299999982187 Q 537021.9.peg.7 81 FLHTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRGE 143 (143) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~e 143 (143) .+...++...++|+||.|++|++++|.+.+.+...+++++++++||||+||+|++|++.|=++ T Consensus 354 iE~vIG~~fLSgDiYFvDFLPSeLhwlDVl~VlvtAl~LSl~AswYPA~RAakl~PA~VLSs~ 416 (416) T TIGR02213 354 IEKVIGHQFLSGDIYFVDFLPSELHWLDVLLVLVTALVLSLLASWYPASRAAKLKPARVLSSK 416 (416) T ss_pred HHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCC T ss_conf 998824100124265673277302478888889999999999864367663037831112689 No 4 >COG4591 LolE ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=8e-34 Score=190.38 Aligned_cols=137 Identities=49% Similarity=0.801 Sum_probs=124.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 98999999999999999999999999999999999807552222036888987653233456678875310000000012 Q 537021.9.peg.7 1 MFVILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKF 80 (143) Q Consensus 1 m~ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~ 80 (143) |++++.+++++++||+++++.+.+.||++|||+||++|+++++|.++|++|++.+|++|+++|+++|+.+++.+..+... T Consensus 272 m~~il~LIi~VA~fnI~ssL~m~V~ek~~eIAILrtmGa~~~~I~~iFl~~G~~iG~iG~llG~iLG~~~~~~i~~~~~~ 351 (408) T COG4591 272 MFIILVLIILVAAFNIISSLTMLVKEKTREIAILRAMGASPSHIMRIFLLQGLIIGLIGALLGVILGVLLALNLNSIIIF 351 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999999899889987664555789999829986789999999989999999999999999999998778740 Q ss_pred CCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 000268513680022331057044069999999999999999999999998299999982187 Q 537021.9.peg.7 81 FLHTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRGE 143 (143) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~e 143 (143) .....+.. .+.+.+|.+++|.....+.+.+++++++++++|||||+|.+|+|+||+| T Consensus 352 ~~~~~~~~------~~~~~lP~~~~~~di~~v~~~al~ls~lAtlyPA~rAskl~P~eaLr~e 408 (408) T COG4591 352 IEPLLGHT------FGISTLPIELSLLDVVLVLVFALLLSLLATLYPARRASKLDPAEALRYE 408 (408) T ss_pred CCCCCCCE------ECCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCC T ss_conf 56534410------0112477350099999999999999999999788888668988885079 No 5 >PRK10535 macrolide transporter ATP-binding /permease protein; Provisional Probab=99.98 E-value=3e-32 Score=182.35 Aligned_cols=122 Identities=21% Similarity=0.407 Sum_probs=104.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999999999999999999999999999980755222203688898765323345667887531000000001200 Q 537021.9.peg.7 3 VILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFL 82 (143) Q Consensus 3 ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~ 82 (143) .+..+.++++++++.|.+++++.||+||||++||+|+++++|..+|+.|+++++++|+++|+++|+.+...+..... T Consensus 527 ~Ia~ISLlVggIGImNIMlvSVtERTrEIGIRKAlGA~~~dI~~qFL~Ea~~l~~iGGiiGi~lg~~i~~~i~~~l~--- 603 (648) T PRK10535 527 LVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSLLIAFTLQLFLP--- 603 (648) T ss_pred HHHHHHHHHHHHHHHHHHEEEHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--- T ss_conf 99999999999999967322017888899999983898889999999999999999999999999999999998566--- Q ss_pred CCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 0268513680022331057044069999999999999999999999998299999982187 Q 537021.9.peg.7 83 HTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRGE 143 (143) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~e 143 (143) ..+..++++..+..+.++.+++++++++|||||+|++|+||||+| T Consensus 604 ----------------~~~~~~~~~~i~~a~~~s~~iGvlfGlyPA~kAArLdPIeALR~E 648 (648) T PRK10535 604 ----------------GWEIGFSPLALLSAFLCSTVTGILFGWLPARNAARLDPVDALARE 648 (648) T ss_pred ----------------CCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC T ss_conf ----------------771426889999999999999999999999988778998982179 No 6 >PRK10814 outer membrane-specific lipoprotein transporter subunit LolC; Provisional Probab=99.98 E-value=5.3e-32 Score=181.14 Aligned_cols=130 Identities=36% Similarity=0.562 Sum_probs=116.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 98999999999999999999999999999999999807552222036888987653233456678875310000000012 Q 537021.9.peg.7 1 MFVILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKF 80 (143) Q Consensus 1 m~ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~ 80 (143) |++++.+++++|+||+++++.+.+.||+||||+|||+|+++++|.++|++|+++++++|+++|..+|..++..+..... T Consensus 267 ~~iIl~lIilVA~FnIi~~l~m~V~eK~~eI~iL~alGa~~~~I~~iFl~~g~~ig~~G~~~G~~lG~~l~~~~~~~~~- 345 (396) T PRK10814 267 MGLLLSLIVAVAAFNIITSLGLMVMEKQGEVAILQTQGLTPRQIMMVFMVQGASAGIIGALLGAALGALLASQLNNLMP- 345 (396) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- T ss_conf 9999999999999999999999999777689999983899355999999999999999999999999999999999985- Q ss_pred CCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 000268513680022331057044069999999999999999999999998299999982 Q 537021.9.peg.7 81 FLHTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVL 140 (143) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~l 140 (143) ..+..+..+.+|+.++|..++.+.+.++++|++++++|||||+|++|+|+| T Consensus 346 ---------~~~~~~~~~~~Pv~i~~~~v~~v~~~~~~i~~la~~~PA~~Aa~l~PveaL 396 (396) T PRK10814 346 ---------IIGVLLDGAALPVAIEPLQVIVIALVAMAIALLSTLYPSWRAAATQPAEAL 396 (396) T ss_pred ---------HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHC T ss_conf ---------432323664356895299999999999999999999999999587956649 No 7 >TIGR03434 ADOP Acidobacterial duplicated orphan permease. Members of this protein family are found, so far, only in three species of Acidobacteria, namely Acidobacteria bacterium Ellin345, Acidobacterium capsulatum ATCC 51196, and Solibacter usitatus Ellin6076, where they form large paralogous families. Each protein contains two copies of a domain called the efflux ABC transporter permease protein (pfam02687). However, unlike other members of that family (including LolC, FtsX, and MacB), genes for these proteins are essentially never found fused or adjacent to ABC transporter ATP-binding protein (pfam00005) genes. We name this family ADOP, for Acidobacterial Duplicated Orphan Permease, to reflect the restricted lineage, internal duplication, lack of associated ATP-binding cassette proteins, and permease homology. The function is unknown. Probab=99.93 E-value=4.5e-25 Score=145.88 Aligned_cols=118 Identities=27% Similarity=0.484 Sum_probs=93.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 99999999999999999999999999999980755222203688898765323345667887531000000001200026 Q 537021.9.peg.7 6 ALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTL 85 (143) Q Consensus 6 ~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~ 85 (143) .+.++++++|++++...++.||+||||++|++|+++++|.+++..|.+.+..+|.++|..+|+.+...+.... + T Consensus 686 ~lailia~lGl~gl~~~~v~~R~kEIgIRkalGAs~~~I~~ll~~e~l~l~~ig~~ig~~la~~~~~~l~~~~------~ 759 (803) T TIGR03434 686 ALALLLAAIGLYGVLAYSVAQRTREIGIRMALGAQRGDVLRLVLRQGLRLAAAGLAIGLAAALALARLLASLL------F 759 (803) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC------C T ss_conf 9999999999999999999999999998274289888999999999999999999999999999999999871------6 Q ss_pred CCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 8513680022331057044069999999999999999999999998299999982187 Q 537021.9.peg.7 86 GVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRGE 143 (143) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~e 143 (143) . . ...+|+.++...+..++++++++++|+||++|.||+|+||+| T Consensus 760 ~-------------~-~~~~~~~f~~~~~~~~~i~lla~~~pa~rAa~~nPv~aLr~E 803 (803) T TIGR03434 760 G-------------V-SPTDPLTFAAVAALLLAVALLACYLPARRAARVDPMIALRAE 803 (803) T ss_pred C-------------C-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCC T ss_conf 6-------------8-877779999999999999999999999999853999896278 No 8 >TIGR03434 ADOP Acidobacterial duplicated orphan permease. Members of this protein family are found, so far, only in three species of Acidobacteria, namely Acidobacteria bacterium Ellin345, Acidobacterium capsulatum ATCC 51196, and Solibacter usitatus Ellin6076, where they form large paralogous families. Each protein contains two copies of a domain called the efflux ABC transporter permease protein (pfam02687). However, unlike other members of that family (including LolC, FtsX, and MacB), genes for these proteins are essentially never found fused or adjacent to ABC transporter ATP-binding protein (pfam00005) genes. We name this family ADOP, for Acidobacterial Duplicated Orphan Permease, to reflect the restricted lineage, internal duplication, lack of associated ATP-binding cassette proteins, and permease homology. The function is unknown. Probab=99.90 E-value=7.8e-23 Score=134.49 Aligned_cols=126 Identities=20% Similarity=0.309 Sum_probs=101.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999999999999999999999999999980755222203688898765323345667887531000000001200 Q 537021.9.peg.7 3 VILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFL 82 (143) Q Consensus 3 ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~ 82 (143) .+..+++++||+|..|.......+|.||+|++|++|++++|++++++.|+++++++|+++|+.++.++...+..... T Consensus 276 ~~~~liLliA~iN~~NL~~ar~~~R~rEigirkalGA~r~ql~~q~l~Es~ll~~~a~~lgl~l~~~~~~~~~~~~~--- 352 (803) T TIGR03434 276 GAVGLVLLIACANVANLLLARAAARQREIAVRLALGAGRGRLVRQLLTESLLLALAGGALGLLLAYWGLRLLLALLP--- 352 (803) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- T ss_conf 99999999999999999999999999999999997969899999999999999999999999999999999999742--- Q ss_pred CCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCC Q ss_conf 026851368002233105704406999999999999999999999999829999998218 Q 537021.9.peg.7 83 HTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRG 142 (143) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~ 142 (143) ...+. . ....++|..++...+++++++++++++||++++|.+|+++||+ T Consensus 353 ~~~~~---------~--~~~~~~~~~l~~~~~~~l~~~llag~~PA~~~s~~~p~~~Lk~ 401 (803) T TIGR03434 353 ASLPR---------L--AEISLDGRVLLFALALSLLTGLLFGLAPALQATRSDLAEALKE 401 (803) T ss_pred HCCCC---------H--HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 20431---------1--0046678999999999999999999999999816788999875 No 9 >COG0577 SalY ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms] Probab=99.86 E-value=5.2e-21 Score=125.21 Aligned_cols=129 Identities=28% Similarity=0.507 Sum_probs=109.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999999999999999999999999999980755222203688898765323345667887531000000001200 Q 537021.9.peg.7 3 VILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFL 82 (143) Q Consensus 3 ii~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~ 82 (143) .+......++++++.+.+.+++.||+||+|++|++|++++++..+++.|+++++.+|.++|..+|......+..... T Consensus 291 ~~~~~~~~v~~i~i~~~~~~~v~er~~eigi~ka~G~~~~~i~~~~~~e~~~~~~~g~~~g~~~~~~~~~~i~~~~~--- 367 (419) T COG0577 291 AIAVISLIVGAIGIYNILLVSVLERTREIGILKALGATRREILLQFLLEALILGLIGGLLGILLGLGLSLLLALLLI--- 367 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- T ss_conf 89999999999999999999999987889999970998487999999999999999999999999999999999987--- Q ss_pred CCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCC Q ss_conf 0268513680022331057044069999999999999999999999998299999982187 Q 537021.9.peg.7 83 HTLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRGE 143 (143) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~e 143 (143) .. ......++..+++.........+++++++++++|++|++|++|+|++|+| T Consensus 368 ---~~------~~~~~~~~~~~~~~~~~~~~~~~~~i~~i~~~~pa~~~~k~~~~~~l~~~ 419 (419) T COG0577 368 ---AS------LFFLLALPILLSPLLILLALIVALLVGVIAGLLPARKAAKLDPIEALRKE 419 (419) T ss_pred ---HH------HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf ---66------66566777752799999999999999999999999999876299996359 No 10 >pfam02687 FtsX Predicted permease. This is a family of predicted permeases and hypothetical transmembrane proteins. The lipoprotein-releasing system transmembrane protein lolC from Buchnera aphidicola has been shown to transport lipids targeted to the outer membrane across the inner membrane. Both lolC and an ATP-binding cassette transporter-like protein from Streptococcus cristatus have been shown to require ATP. Probab=99.81 E-value=4e-19 Score=115.62 Aligned_cols=119 Identities=25% Similarity=0.522 Sum_probs=100.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99999999999999999999999999999999807552222036888987653233456678875310000000012000 Q 537021.9.peg.7 4 ILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLH 83 (143) Q Consensus 4 i~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~ 83 (143) +..++++++.+++.+++..++.+|+||+|++|++|++++++.++++.|++.++++|.++|..+|......+.... T Consensus 56 ~~~~~~~~~~~~i~~~~~~~i~~r~~ei~ilka~G~~~~~i~~~~~~e~~~~~~~~~~~g~~~~~~~~~~~~~~~----- 130 (174) T pfam02687 56 LAVLLLLLALLILLNLLSISIAERRREIGILKALGASKKQIFKLLLLEALLLGLIGSLLGLLLGFLLAYLLSSAI----- 130 (174) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- T ss_conf 999999999999999999999999999999999299999999999999999999999999999999999999999----- Q ss_pred CCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 26851368002233105704406999999999999999999999999829999 Q 537021.9.peg.7 84 TLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDP 136 (143) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p 136 (143) ..+.....+..++|..+....+..++++++++++|++|+.|.+| T Consensus 131 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 174 (174) T pfam02687 131 ---------GYFSGFSLPLVIDPIVILLVLLLILLIALLASLLPARRILKISP 174 (174) T ss_pred ---------HHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf ---------98614343400189999999999999999999999999986998 No 11 >PRK11026 ftsX cell division protein FtsX; Provisional Probab=99.51 E-value=2.6e-13 Score=86.00 Aligned_cols=118 Identities=19% Similarity=0.311 Sum_probs=83.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 99999999999999999999999999980755222203688898765323345667887531000000001200026851 Q 537021.9.peg.7 9 VLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLGVV 88 (143) Q Consensus 9 ~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~~~ 88 (143) .+.+.+.+.||+-+++..||+||.+||-+|+|++-|++.|++|+++.|++|+++++++.......+..........++ T Consensus 190 ~~a~~liI~NTIRL~i~sRr~EI~ImkLVGAT~~FIr~PFl~eG~~~Gl~G~~la~~l~~~~~~~~~~~~~~l~~ly~-- 267 (309) T PRK11026 190 VAAVFLVIGNSVRLSIFSRRDTINVMKLIGATDGFILRPFLYGGALLGFSGALLSLILSEILVLYLSSAVKYVADVFG-- 267 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-- T ss_conf 999999999999999998799999999938887889760999999999999999999999999999989999999836-- Q ss_pred CCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCH Q ss_conf 368002233105704406999999999999999999999999829-999 Q 537021.9.peg.7 89 IFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASR-IDP 136 (143) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k-~~p 136 (143) .+.-...+.+.......+.+..++.+++++..+|-.| ++| T Consensus 268 --------~~~~l~~~~~~~~~~ll~~g~~lG~lgS~~sv~r~Lr~i~p 308 (309) T PRK11026 268 --------TKFDLNGLSFDECLLLLLVCSMIGWIAAWLATVQHLRHFTP 308 (309) T ss_pred --------CCCHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf --------43202344199999999999999999999999999875599 No 12 >COG2177 FtsX Cell division protein [Cell division and chromosome partitioning] Probab=99.19 E-value=3.5e-10 Score=70.10 Aligned_cols=66 Identities=29% Similarity=0.468 Sum_probs=56.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999998075522220368889876532334566788753 Q 537021.9.peg.7 4 ILALIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGIL 69 (143) Q Consensus 4 i~~~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~ 69 (143) +..++.+.+.+.+.|++-..+..||+||.+||-+|++++-|.+.|++|++.+|+.|++++..+.-+ T Consensus 173 ~~~ll~~~~vllI~NtiR~~i~sRr~eIeVmklvGAt~~fI~~PFl~~g~~~gl~Ga~~~~~l~~~ 238 (297) T COG2177 173 LSILLALAAVLLIGNTIRLAIFSRRREIEVMKLVGATDSFIRRPFLYEGMLIGLLGALIALALAAL 238 (297) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999953237999998264118887199999999999999999999999 No 13 >TIGR01185 devC DevC protein; InterPro: IPR005891 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains a predicted membrane subunit, DevC, of an ABC transporter known so far from two species of cyanobacteria. Some experimental data from mutational analysis { suggest that this protein, along with DevA and DevB encoded in the same operon, may be involved in the transport/export of glycolipids.; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane. Probab=99.19 E-value=5e-11 Score=74.40 Aligned_cols=110 Identities=15% Similarity=0.263 Sum_probs=94.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 99999999999999999999999999807552222036888987653233456678875310000000012000268513 Q 537021.9.peg.7 10 LVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLGVVI 89 (143) Q Consensus 10 ~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~~~~ 89 (143) +|+...++-.++..+.+.-.|.++|||+|+++|-+.+..+.|+++++++|.+-|+.+|..+= . T Consensus 271 IVGTVIVYQILYT~V~DHL~EYATLKAIGY~~~YLLGV~lQEALLLA~~GyLP~~~l~i~LY------~----------- 333 (381) T TIGR01185 271 IVGTVIVYQILYTEVADHLSEYATLKAIGYTDKYLLGVVLQEALLLAVLGYLPSFVLAILLY------T----------- 333 (381) T ss_pred HEEEHEEEHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------H----------- T ss_conf 10101032013421431001245653203442246788999999999987516899999998------7----------- Q ss_pred CCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 68002233105704406999999999999999999999999829999998 Q 537021.9.peg.7 90 FDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKV 139 (143) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~ 139 (143) +....+.+|.-.+..-.+.+...+++.|++++-+..+|.+.-+|+|+ T Consensus 334 ---~~~~AT~LP~~M~~~RA~TVLIlT~~MC~~SG~IAV~~LR~ADPADI 380 (381) T TIGR01185 334 ---TARRATLLPLIMSYDRALTVLILTIIMCFVSGAIAVRKLRAADPADI 380 (381) T ss_pred ---HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf ---65776237147566789999999999997757888644206874457 No 14 >COG3127 Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.15 E-value=5.4e-10 Score=69.17 Aligned_cols=119 Identities=22% Similarity=0.380 Sum_probs=90.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 99999999999999999999999999999807552222036888987653233456678875310000000012000268 Q 537021.9.peg.7 7 LIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLG 86 (143) Q Consensus 7 ~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~ 86 (143) ..+++++.++.+........|+.-++++||+|++++++++.|..|-++++.+|..+|..+|....+.+....... .+ T Consensus 263 ~all~agv~VA~A~~~Y~~~r~~~iA~lK~LGA~~~~~~~l~l~Qil~v~~lgiaiG~vlG~l~~~~l~~~L~~~---LP 339 (829) T COG3127 263 LALLLAGVAVANAVRHYLDSRYDAIAILKCLGASRGQLRLLYLLQILMVLLLGIAIGLVLGALAPLVLMALLASL---LP 339 (829) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---CC T ss_conf 999999999999999997235248999997177566899999999999999999999999998899999984101---78 Q ss_pred CCCCCHHHHHHHCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHC Q ss_conf 513680022331057-0440699999999999999999999999982999999821 Q 537021.9.peg.7 87 VVIFDTEAYLLTELP-SKISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLR 141 (143) Q Consensus 87 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr 141 (143) .. .| ..+.|+. +.....++++...++.|-.++.+.+|...+| T Consensus 340 v~-----------~p~~~l~P~~--~alAa~fl~~l~fal~PL~rl~~vpp~av~R 382 (829) T COG3127 340 VP-----------LPAGGLWPWA--LALAALFLIALAFALLPLGRLRRVPPLAVLR 382 (829) T ss_pred CC-----------CCCCCCCHHH--HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 88-----------7766633589--9999999999996100068762588188873 No 15 >TIGR00439 ftsX putative protein insertion permease FtsX; InterPro: IPR004513 FtsX is an integral membrane protein encoded in the same operon as signal recognition particle docking protein FtsY and FtsE. FtsE is a hydrophilic nucleotide-binding protein that associates with the inner membrane by means of association with FtsX; FtsE mutants are viable only in high salt, supporting possible roles in cell division, as previously indicated, or in transport.; GO: 0007049 cell cycle, 0016021 integral to membrane. Probab=98.94 E-value=5.9e-09 Score=63.87 Aligned_cols=119 Identities=16% Similarity=0.257 Sum_probs=91.2 Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99999999999-99999999999999999999807552222036888987653233456678875310000000012000 Q 537021.9.peg.7 5 LALIVLVAALN-IISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLH 83 (143) Q Consensus 5 ~~~i~~i~~~~-i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~ 83 (143) .++.+.+|.+. +-|++-..+..||.||-+.|-+||+++-|.|-|++++++.++.|+.+..+++-++...+......... T Consensus 189 ~~vlm~~AvfLVigNsiRl~i~srr~~I~v~KLlGAtd~FIlRPfLY~gm~~~~~galvslils~~l~s~~~saV~~v~~ 268 (314) T TIGR00439 189 LSVLMGIAVFLVIGNSIRLDILSRRESIEVTKLLGATDSFILRPFLYQGMLLSLLGALVSLILSEFLVSALRSAVDAVAK 268 (314) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHCCCCCHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999988778612113332321102000012117982211121579999999999999999999999976288998642 Q ss_pred CCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 26851368002233105704406999999999999999999999999829 Q 537021.9.peg.7 84 TLGVVIFDTEAYLLTELPSKISWVEVSWIISMALALSLLATIFPSWKASR 133 (143) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k 133 (143) .++....-.. +...-+......+..++.+++|+.+-+..+ T Consensus 269 ~fg~~f~~~g----------l~vgE~~lll~~~~~~G~vgAWlAttqHl~ 308 (314) T TIGR00439 269 VFGLKFDLNG----------LSVGELVLLLVVCIILGYVGAWLATTQHLL 308 (314) T ss_pred HHHHEEECCC----------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4300210131----------006789999999999999999999999997 No 16 >COG3127 Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=98.73 E-value=2.4e-07 Score=55.68 Aligned_cols=113 Identities=20% Similarity=0.264 Sum_probs=78.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 99999999999999999999999999999807552222036888987653233456678875310000000012000268 Q 537021.9.peg.7 7 LIVLVAALNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLG 86 (143) Q Consensus 7 ~i~~i~~~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~ 86 (143) +.++.+.+.+...+.....+|.+|..++|++|++++.+.+-+..|..+++..+.+.+.+-|.....++.... + T Consensus 714 ~~l~AgvLvL~~alaag~r~R~~eavvlrtLGAtr~~l~~t~~~Ey~llGlvagl~A~~~~~aa~w~v~~~v------f- 786 (829) T COG3127 714 LALAAGVLVLAAALAAGNRARVQEAVVLRTLGATRALLRTTLWAEYALLGLVAGLVAAIGAEAAAWVLVAKV------F- 786 (829) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------H- T ss_conf 999999999999998757888888899998301089999999999999999999999985458999999998------5- Q ss_pred CCCCCHHHHHHHCCCHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCC Q ss_conf 51368002233105704406999--999999999999999999999829999998218 Q 537021.9.peg.7 87 VVIFDTEAYLLTELPSKISWVEV--SWIISMALALSLLATIFPSWKASRIDPVKVLRG 142 (143) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~ 142 (143) ++|...+|... +.....+...+++++ |++.+.+|...+|. T Consensus 787 ------------~lp~~pd~al~~~v~~l~~~~g~~l~G~----w~vl~~k~a~vlR~ 828 (829) T COG3127 787 ------------DLPWSPDWALWTIVLALVGAVGLGLAGG----WLVLGQKAAPVLRQ 828 (829) T ss_pred ------------CCCCCCCHHHHHHHHHHHHHHHHHHHHH----HHHHCCCCCHHHCC T ss_conf ------------5889875689999999999999998778----99853665053205 No 17 >COG0795 Predicted permeases [General function prediction only] Probab=92.08 E-value=0.31 Score=24.57 Aligned_cols=49 Identities=12% Similarity=0.356 Sum_probs=38.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999998075522220368889876532334566788 Q 537021.9.peg.7 18 SSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIV 66 (143) Q Consensus 18 ~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~l 66 (143) ......-..|++|+=+++|.|.|++++.+..+.=++.++++....+--+ T Consensus 74 ~l~~l~~L~~~sEl~vl~A~GvS~~~li~p~l~~al~~~~l~~~~~~~l 122 (364) T COG0795 74 ALLTLGRLSRESELVVLRASGVSLKRLLRPVLILALLLSLLSFVLGLWL 122 (364) T ss_pred HHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 9999999750371999988289889999999999999999999983843 No 18 >pfam03739 YjgP_YjgQ Predicted permease YjgP/YjgQ family. Members of this family are predicted integral membrane proteins of unknown function. They are about 350 amino acids long and contain about 6 transmembrane regions. They are predicted to be permeases although there is no verification of this. Probab=90.81 E-value=0.77 Score=22.58 Aligned_cols=49 Identities=10% Similarity=0.236 Sum_probs=39.1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999998075522220368889876532334566788 Q 537021.9.peg.7 18 SSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIV 66 (143) Q Consensus 18 ~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~l 66 (143) ..+...-..|.+|+-.+||.|.|.+++.+.++.-+++++++....+-.+ T Consensus 68 ~l~~~~~L~~~~El~a~~a~GiS~~~i~~p~l~~~~~~~~~~~~~~~~v 116 (353) T pfam03739 68 VLLTLGRLARDSELTALRAAGISLKRLLRPVLILALLLSLLSFLLSEYL 116 (353) T ss_pred HHHHHHHHHHCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCHH T ss_conf 9999999984264899998389906678999999999999999985403 No 19 >pfam03649 UPF0014 Uncharacterized protein family (UPF0014). Probab=84.94 E-value=2.3 Score=20.15 Aligned_cols=43 Identities=16% Similarity=0.059 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999998075522220368889876532334566 Q 537021.9.peg.7 21 VMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMG 63 (143) Q Consensus 21 ~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G 63 (143) ....++|+.|+-...|+|+|++|-.+.+..|++--+.+.++-. T Consensus 149 ~~~l~~~~~~iE~~LaLGAt~~~A~~~~~r~Air~aliP~in~ 191 (250) T pfam03649 149 IDMVKSERDIIWGYLALGATPIQALAPFIRNAIRAALIPTVNR 191 (250) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999987999999998799989999999999999985626999 No 20 >TIGR01654 bact_immun_7tm bacteriocin-associated integral membrane protein; InterPro: IPR006541 These sequences represent a family of integral membrane proteins, most of which are about 650 residues in size and predicted to span the membrane seven times. Nearly half of the members of this family are found in association with a member of the lactococcin 972 family of bacteriocins (IPR006540 from INTERPRO) . Others may be associated with uncharacterised proteins that may also act as bacteriocins. Although this protein is suggested to be an immunity protein, and the bacteriocin is suggested to be exported by a Sec-dependent process, the role of this protein is unclear. . Probab=84.28 E-value=2.5 Score=19.99 Aligned_cols=99 Identities=13% Similarity=0.157 Sum_probs=58.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCHH Q ss_conf 99999999999980755222203688898765323345667887531000000001200026851368002233105704 Q 537021.9.peg.7 24 VQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLGVVIFDTEAYLLTELPSK 103 (143) Q Consensus 24 i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (143) .....|..||-|-.|.|.++|....+...+..-+++++.-.+.+..+........ .. ... T Consensus 221 ~~~n~K~~~v~~l~G~s~~~i~~~~~~k~~~~~l~~~l~~~l~~~~~l~~~gL~~---------------~~-----~~~ 280 (745) T TIGR01654 221 LIINMKKVAVKRLNGFSLLKILFRLFKKELTYLLISALLILLLSSLVLFIKGLLY---------------TI-----LLE 280 (745) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------HH-----HHH T ss_conf 9875689999875061089999985233277898999999999999998656778---------------98-----999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCC Q ss_conf 406999999999999999999999999829999998218 Q 537021.9.peg.7 104 ISWVEVSWIISMALALSLLATIFPSWKASRIDPVKVLRG 142 (143) Q Consensus 104 ~~~~~~~~~~~~~~~i~~~~~~~pa~~~~k~~p~~~lr~ 142 (143) .-+..+......-+.+.++-+++-...+.|.+.++.+|+ T Consensus 281 ~~~~~~~l~~~~lll~~~~~~~l~~l~~~~~~l~~~ik~ 319 (745) T TIGR01654 281 LFIFALSLLVLLLLLLLLIISFLTLLLLSKLSLANVIKG 319 (745) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 999999999999999999999999999988778887416 No 21 >TIGR00837 araaP aromatic amino acid transport protein; InterPro: IPR013059 Amino acid permeases are integral membrane proteins involved in the transport of amino acids into the cell. A number of such proteins have been found to be evolutionary related , , . Aromatic amino acids are concentrated in the cytoplasm of Escherichia coli by 4 distinct transport systems: a general aromatic amino acid permease, and a specific permease for each of the 3 types (Phe, Tyr and Trp) . It has been shown that some permeases in E. coli and related bacteria are evolutionary related. These permeases are proteins of about 400 to 420 amino acids and are located in the cytoplasmic membrane and, like bacterial sugar/cation transporters, are thought to contain 12 transmembrane (TM) regions - hydropathy analysis, however, is inconclusive, suggesting the possibility of 10 to 12 membrane-spanning domains . The best conserved domain is a stretch of 20 residues which seems to be located in a cytoplasmic loop between the first and second transmembrane region. This family is specific for aromatic amino acid transporters and includes the tyrosine permease, TyrP (P18199 from SWISSPROT), and the tryptophan transporters TnaB (P23173 from SWISSPROT) and Mtr (P22306 from SWISSPROT) of E. coli. . Probab=80.60 E-value=3.5 Score=19.24 Aligned_cols=101 Identities=22% Similarity=0.289 Sum_probs=52.0 Q ss_pred HHHHHCCCCCCCCCHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHH----HH Q ss_conf 99998075522220368889-87653233456678875310000000012000268513680022331057044----06 Q 537021.9.peg.7 32 AILRTMGARISSIMSIFFMI-GAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLGVVIFDTEAYLLTELPSKI----SW 106 (143) Q Consensus 32 gilkalG~s~~~I~~~~~~E-~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 106 (143) |..+|++...+.-..-...| =...+++.+.+|+.+|.. .++.....+.+...+.. +....+.+|+-+ -| T Consensus 262 ~Lv~Al~~~~~s~~~~~~~~~Fs~~Al~sSFLGV~LGLF--DylaDlFkf~d~~~GR~----k~~~ltFlPPLv~alfyP 335 (391) T TIGR00837 262 GLVEALQGVLKSSAIELALELFSNLALASSFLGVTLGLF--DYLADLFKFDDSKKGRL----KTGLLTFLPPLVLALFYP 335 (391) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHCCCCCCE----EEEEHHHHHHHHHHHHHH T ss_conf 799999998711388889999999999999999999999--99999863312567737----663046788999999800 Q ss_pred HHHHHHHH-HH-HHHHHHHHHHH---HHHHHCCCHHH Q ss_conf 99999999-99-99999999999---99982999999 Q 537021.9.peg.7 107 VEVSWIIS-MA-LALSLLATIFP---SWKASRIDPVK 138 (143) Q Consensus 107 ~~~~~~~~-~~-~~i~~~~~~~p---a~~~~k~~p~~ 138 (143) ..++.... .+ ...++.+.++| +||++|.+|.. T Consensus 336 ~GFl~AigYAG~l~~tiwa~I~P~~La~~~Rk~~p~~ 372 (391) T TIGR00837 336 EGFLYAIGYAGALAATIWAVIIPALLAWKARKKFPNA 372 (391) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 7998898888899999999998899999987623799 No 22 >TIGR00801 ncs2 uracil-xanthine permease; InterPro: IPR006042 A number of transport proteins which are involved in the uptake of xanthine or uracil are evolutionary related . They are proteins of from 430 to 595 residues that seem to contain 12 transmembrane domains.; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane. Probab=75.37 E-value=5.1 Score=18.40 Aligned_cols=105 Identities=8% Similarity=0.102 Sum_probs=60.2 Q ss_pred HHHHHHHCCCCCCCCCHHHHHHH---HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHH Q ss_conf 99999980755222203688898---765323345667887531000000001200026851368002233105704406 Q 537021.9.peg.7 30 DIAILRTMGARISSIMSIFFMIG---AFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLGVVIFDTEAYLLTELPSKISW 106 (143) Q Consensus 30 eigilkalG~s~~~I~~~~~~E~---~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (143) |+|=++.++.+=--.....+.+. -++..++.++|+..|++.++....+.....+......++.-.++-.+.+..++| T Consensus 200 ~~g~~~~~~~a~~vL~~iil~~~f~~G~~~~~~iliGi~~Gy~~a~~~G~vd~~~v~~A~wf~~P~~~~fg~~Y~~~f~~ 279 (480) T TIGR00801 200 TYGSLENLLVAFVVLALIILLNRFAKGFLKSISILIGIVVGYLLALAMGLVDFSPVKEAPWFSLPKPFTFGVDYAPSFEL 279 (480) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCCCCCCCCCCCCEEEEH T ss_conf 88767899999999999999998622078899999999999999998187779971025031267412689754113408 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHCC Q ss_conf 999999999999999--9999999998299 Q 537021.9.peg.7 107 VEVSWIISMALALSL--LATIFPSWKASRI 134 (143) Q Consensus 107 ~~~~~~~~~~~~i~~--~~~~~pa~~~~k~ 134 (143) ..+.....+.++... .+.+..-.++.|. T Consensus 280 ~~~l~~~~v~~v~~~E~~G~~~A~~~v~~r 309 (480) T TIGR00801 280 PAILTMLPVAIVTLVESIGDLTATADVTGR 309 (480) T ss_pred HHHHHHHHHHHHHHHHHHHCEEEEEEEECC T ss_conf 569999999999997401002664221134 No 23 >COG0390 ABC-type uncharacterized transport system, permease component [General function prediction only] Probab=67.12 E-value=7.8 Score=17.48 Aligned_cols=43 Identities=14% Similarity=0.094 Sum_probs=34.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999980755222203688898765323345 Q 537021.9.peg.7 19 SLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTG 61 (143) Q Consensus 19 ~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i 61 (143) .+...+.++++|+-..-|+|+|++|-.+.+.-+++=-+.+.++ T Consensus 145 ~l~~~i~~~~~eie~~LsLGaTp~~A~~~~~r~Air~aliPti 187 (256) T COG0390 145 RLVSEIISEKDEIEAKLSLGATPKEASRPYIRSAIRAALIPTI 187 (256) T ss_pred HHHHHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 8899974038999999852898899979999998888640550 No 24 >pfam07242 DUF1430 Protein of unknown function (DUF1430). This family represents the C-terminus (approximately 120 residues) of a number of hypothetical bacterial proteins of unknown function. These are possibly membrane proteins involved in immunity. Probab=57.90 E-value=12 Score=16.48 Aligned_cols=49 Identities=14% Similarity=0.259 Sum_probs=36.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999807552222036888987653233456 Q 537021.9.peg.7 14 LNIISSLVMLVQERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGM 62 (143) Q Consensus 14 ~~i~~~~~~~i~~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~ 62 (143) +....+....-+|-||++.+.|--|++.-+..+.++.......++|... T Consensus 9 ~l~~~~~~~YFeq~kk~i~Ik~i~Gys~f~~~k~yl~~~l~~~~~~~~~ 57 (100) T pfam07242 9 LLFYQMVLLYFEQNKKKILIKRIHGYSFFKRHKTYLIIQLIVWVFGLIL 57 (100) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999989999999889999999917249999999999999999999999 No 25 >TIGR02138 phosphate_pstC phosphate ABC transporter, permease protein PstC; InterPro: IPR011864 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). The typical operon for the high affinity inorganic phosphate ABC transporter encodes an ATP-binding protein, a phosphate-binding protein, and two permease proteins. This family describes PstC, which is homologous to PstA (IPR005672 from INTERPRO). In the Escherichia coli, this transport system is induced when the concentration of extrallular inorganic phosphate is low. A constitutive, lower affinity transporter operates otherwise.; GO: 0005315 inorganic phosphate transmembrane transporter activity, 0006817 phosphate transport, 0016021 integral to membrane. Probab=45.90 E-value=20 Score=15.43 Aligned_cols=44 Identities=16% Similarity=0.208 Sum_probs=32.8 Q ss_pred HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999998075522220368889876532334566788753100 Q 537021.9.peg.7 26 ERRRDIAILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISC 72 (143) Q Consensus 26 ~R~~eigilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~ 72 (143) +..||=+ -|+|+||+|..+...+-...-|++|+.+ +++|-+++. T Consensus 196 ~~~kea~--~ALGAT~wetI~~v~lP~a~~GIv~a~~-LglGRAlGE 239 (317) T TIGR02138 196 RSYKEAS--LALGATKWETIRKVILPAARSGIVGAVV-LGLGRALGE 239 (317) T ss_pred HHHHHHH--HHHCCCHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH T ss_conf 6788898--7706860433555332000218999999-999999999 No 26 >TIGR02139 permease_CysT sulfate ABC transporter, permease protein CysT; InterPro: IPR011865 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents CysT, one of two homologous, tandem permeases in the sulphate ABC transporter system; the other is CysW (IPR011866 from INTERPRO). The sulphate transporter has been described in Escherichia coli as transporting sulphate, thiosulphate, selenate, and selenite. Sulphate transporters may also transport molybdate ion if a specific molybdate transporter is not present.; GO: 0015116 sulfate transmembrane transporter activity, 0008272 sulfate transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=43.45 E-value=22 Score=15.23 Aligned_cols=81 Identities=12% Similarity=0.134 Sum_probs=43.6 Q ss_pred HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHH---HHCCCHHHHHHHHHH Q ss_conf 98075522220368889876532334566788753100000000120002685136800223---310570440699999 Q 537021.9.peg.7 35 RTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCNVEAIRKFFLHTLGVVIFDTEAYL---LTELPSKISWVEVSW 111 (143) Q Consensus 35 kalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 111 (143) -|+|+|+.|+++..++=.+.=++ +.|..++.. ..+-++.....-.-| .++..|+.. ...++ ++|+...-. T Consensus 167 ~~LGAS~w~tF~~vilP~L~PAL---LTG~ALaFA--Ra~GEYGSViFIAGN-~P~~tEI~PLLI~~kLE-qYDY~gATa 239 (265) T TIGR02139 167 ASLGASRWQTFWKVILPALLPAL---LTGFALAFA--RAVGEYGSVIFIAGN-LPMKTEIAPLLIFIKLE-QYDYAGATA 239 (265) T ss_pred HHHCCCCHHEEEEEECCCCCCHH---HHHHHHHHH--HHHHHHHCEEEECCC-CCCHHHHHHHHHHHHHH-HHHHHHHHH T ss_conf 65168820002223314440069---997899887--543233300022268-88067889999977754-412378999 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q 537021.9.peg.7 112 IISMALALSLL 122 (143) Q Consensus 112 ~~~~~~~i~~~ 122 (143) ++.+.+.+++. T Consensus 240 IA~vmL~~SF~ 250 (265) T TIGR02139 240 IAVVMLLISFV 250 (265) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 27 >PRK06298 type III secretion system protein; Validated Probab=39.83 E-value=25 Score=14.92 Aligned_cols=10 Identities=10% Similarity=0.444 Sum_probs=4.3 Q ss_pred HHHCCCHHHH Q ss_conf 9829999998 Q 537021.9.peg.7 130 KASRIDPVKV 139 (143) Q Consensus 130 ~~~k~~p~~~ 139 (143) .....+|.++ T Consensus 167 ~~~~~~~~~~ 176 (360) T PRK06298 167 ETAGVPPIVT 176 (360) T ss_pred HHHCCCHHHH T ss_conf 8825899999 No 28 >COG4149 ModC ABC-type molybdate transport system, permease component [Inorganic ion transport and metabolism] Probab=34.79 E-value=30 Score=14.49 Aligned_cols=99 Identities=11% Similarity=0.063 Sum_probs=48.0 Q ss_pred HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCCCCCCCCCCCHHHHHHH-CCCHHHHHHH Q ss_conf 999980755222203688898765323345667887531000--0000012000268513680022331-0570440699 Q 537021.9.peg.7 32 AILRTMGARISSIMSIFFMIGAFIGIAGTGMGMIVGILISCN--VEAIRKFFLHTLGVVIFDTEAYLLT-ELPSKISWVE 108 (143) Q Consensus 32 gilkalG~s~~~I~~~~~~E~~~l~~ig~i~G~~lg~~l~~~--i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 108 (143) ..=+++|+++.+++.-..+ =++.-|.+.|..++..-+.. -......-+..-..+..+...|... .-+.+..+.. T Consensus 122 ~aA~tlGas~~~vf~~i~L---PLalpGIlag~iLsFAralGEFGAtlm~aGNIpG~T~Tlp~aIY~~~q~g~~~~A~~l 198 (225) T COG4149 122 EAARTLGASRWEVFFRITL---PLALPGILAGVILSFARALGEFGATLMFAGNIPGETQTLPLAIYTAVQAGNYALAHRL 198 (225) T ss_pred HHHHHCCCCHHHHHEEEEC---CCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCEEEHHHHHHHHHCCCHHHHHHH T ss_conf 9999818880213100140---2210679999999999986205707987547888531106999999981878899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999999999829 Q 537021.9.peg.7 109 VSWIISMALALSLLATIFPSWKASR 133 (143) Q Consensus 109 ~~~~~~~~~~i~~~~~~~pa~~~~k 133 (143) ......+++..-+...+..-++..| T Consensus 199 ~li~v~is~~~L~~~~~l~~~~~~~ 223 (225) T COG4149 199 VLILVVISLLVLFVSEWLARRSSKR 223 (225) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999999998764201 No 29 >pfam12072 DUF3552 Domain of unknown function (DUF3552). This presumed domain is functionally uncharacterized. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with pfam00013, pfam01966. This domain has a single completely conserved residue A that may be functionally important. Probab=29.16 E-value=34 Score=14.21 Aligned_cols=17 Identities=18% Similarity=0.520 Sum_probs=8.0 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 65323345667887531 Q 537021.9.peg.7 54 FIGIAGTGMGMIVGILI 70 (143) Q Consensus 54 ~l~~ig~i~G~~lg~~l 70 (143) +++++|.++|.++|+++ T Consensus 4 i~~i~~~~iG~~~G~~~ 20 (201) T pfam12072 4 ILAIIALVVGFAIGYFV 20 (201) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999999 No 30 >TIGR01666 YCCS hypothetical membrane protein, TIGR01666; InterPro: IPR010019 This entry identifies a clade of sequences from gamma and beta proteobacteria. These proteins are more than 700 amino acids long and many have been annotated as putative membrane proteins. The gene from Salmonella has been annotated as a putative efflux transporter. The gene from Escherichia coli has been annotated as yccS.. Probab=29.15 E-value=29 Score=14.55 Aligned_cols=20 Identities=30% Similarity=0.541 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 23345667887531000000 Q 537021.9.peg.7 57 IAGTGMGMIVGILISCNVEA 76 (143) Q Consensus 57 ~ig~i~G~~lg~~l~~~i~~ 76 (143) ++|+++|+++|+++.+.... T Consensus 438 IIGTl~G~~iG~pll~f~Ps 457 (713) T TIGR01666 438 IIGTLVGVVIGVPLLYFVPS 457 (713) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 99999999998778741454 No 31 >TIGR02230 ATPase_gene1 F0F1-ATPase subunit, putative; InterPro: IPR011744 This model represents a protein found encoded in F1F0-ATPase operons in several genomes, including Methanosarcina barkeri (archaeal) and Chlorobium tepidum (bacterial). It is a small protein (about 100 amino acids) with long hydrophic stretches and is presumed to be a subunit of the enzyme .. Probab=27.10 E-value=42 Score=13.78 Aligned_cols=16 Identities=31% Similarity=0.632 Sum_probs=10.7 Q ss_pred CCCHHHHHHHHHHHHH Q ss_conf 0570440699999999 Q 537021.9.peg.7 99 ELPSKISWVEVSWIIS 114 (143) Q Consensus 99 ~~~~~~~~~~~~~~~~ 114 (143) ..|..++|+...++.. T Consensus 69 hyPs~fSWTLtmli~G 84 (100) T TIGR02230 69 HYPSSFSWTLTMLIVG 84 (100) T ss_pred CCCCCCHHHHHHHHHH T ss_conf 6898703899999999 No 32 >pfam12273 RCR Chitin synthesis regulation, resistance to Congo red. RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5. Probab=20.02 E-value=58 Score=13.02 Aligned_cols=13 Identities=31% Similarity=0.442 Sum_probs=4.6 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999999 Q 537021.9.peg.7 20 LVMLVQERRRDIA 32 (143) Q Consensus 20 ~~~~i~~R~~eig 32 (143) ++.-++.|||.-| T Consensus 18 ~~~~~n~RRrr~G 30 (124) T pfam12273 18 LTARINRRRRRRG 30 (124) T ss_pred HHHHHHHHHHHCC T ss_conf 9998739998758 Done!