HHsearch alignment for GI: 254780724 and conserved domain: TIGR01271
>TIGR01271 CFTR_protein cystic fibrosis transmembrane conductor regulator (CFTR); InterPro: IPR005291 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). These proteins are integral membrane proteins and they are involved in the transport of chloride ions. Many of these proteins are the cystis fibrosis transmembrane conductor regulators (CFTR) in eukaryotes. The principal role of this protein is chloride ion conductance. The protein is predicted to consist of 12 transmembrane domains. Mutations or lesions in the genetic loci have been linked to the aetiology of asthma, bronchiectasis, chronic obstructive pulmonary disease etc. Disease-causing mutations have been studied by 36Cl efflux assays in vitro cell cultures and electrophysiology, all of which point to the impairment of chloride channel stability and not the biosynthetic processing per se.; GO: 0005254 chloride channel activity, 0006811 ion transport, 0016020 membrane.
Probab=95.72 E-value=0.0087 Score=38.11 Aligned_cols=171 Identities=21% Similarity=0.269 Sum_probs=90.9
Q ss_pred CCHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHCCCCCCEE---EEECCHHHC--------CCCCCEEEEEEEC
Q ss_conf 98899999898622588489981888888899999983038767679---996421312--------5678756778741
Q gi|254780724|r 235 VTAEGARLLQIIGRIRCNVLISGGTGSGKTTLLNCLTRYIDKDERIV---TCEDTAELQ--------LQQPHVVRLETRP 303 (483)
Q Consensus 235 ~~~~~~~~l~~~v~~~~nilVsG~TGSGKTT~L~al~~~i~~~~riv---tIED~~El~--------l~~~~~v~~~~~~ 303 (483)
T Consensus 1271 ~G~avL~dlSFsv~~GQ~VGlLGRTGsGKSTLLSAlLRL~~T~GEI~IDGvSW~SvtLQ~WRKAFGViPQKvFi~sGTFR 1350 (1534)
T TIGR01271 1271 AGRAVLQDLSFSVEAGQRVGLLGRTGSGKSTLLSALLRLLSTEGEIQIDGVSWNSVTLQKWRKAFGVIPQKVFIFSGTFR 1350 (1534)
T ss_pred CCHHHHHHCCEEECCCCEEEEEECCCCCHHHHHHHHHHHCCCCCCEEECCEEECCCCHHHHHHHCCCCCCEEEEECCCCC
T ss_conf 20555641341443883577530268767899999999607798167623350521220034441315634788315511
Q ss_pred CCCCCCC--------CCHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHCCCCEEEHHHCCCCHHHHHHHHHHHHHH
Q ss_conf 6666530--------00189999875205998899657587999999999873970102000478888899999975530
Q gi|254780724|r 304 PNIEGEG--------EITMRDLVKNCLRMRPERIILGEVRGPEVLDLLQAMNTGHDGSMGTIHANNARESFGRMEAMIAM 375 (483)
Q Consensus 304 ~~~e~~~--------~~t~~~ll~~aLR~~PD~IiVGEiRg~Ea~~~l~A~~TGH~G~ltTlHa~s~~~ai~RL~~m~~~ 375 (483)
T Consensus 1351 ~NLDPy~~~SD~E~WkVaeEVGLkSvIEQFPdKLdF~L~DGGyv------LS~GHKQLMCL-----ARSiLSKAkILLLD 1419 (1534)
T TIGR01271 1351 KNLDPYEQWSDEEIWKVAEEVGLKSVIEQFPDKLDFVLVDGGYV------LSNGHKQLMCL-----ARSILSKAKILLLD 1419 (1534)
T ss_pred CCCCHHHHCCHHHHHHHHHHCCCEEEEECCCCCCCEEEECCCEE------EECCHHHHHHH-----HHHHHHHHHHHHCC
T ss_conf 36881342260356666543154311000888412488628678------31641689999-----98888653322214
Q ss_pred CC-CC---CCHHHHHHHHHHHCCE--EEEEEECCCCCEEEEEEEEEEEE---ECCE
Q ss_conf 46-79---9989999999742358--89998769998789999999841---4988
Q gi|254780724|r 376 GG-FT---LPSQMVREIITSSLDV--IVQTQRLRDGSRRITNICEIVGM---EGNV 422 (483)
Q Consensus 376 ~~-~~---~~~~~~~~~ia~avd~--iV~~~r~~dG~Rrv~~I~Ev~g~---e~~~ 422 (483)
T Consensus 1420 EPsA~LDPvT~Qi~RkTLK~~Fs~CTVILs------EHRvEalLECQ~FL~IE~~~ 1469 (1534)
T TIGR01271 1420 EPSAHLDPVTLQIIRKTLKQSFSNCTVILS------EHRVEALLECQQFLVIEGSS 1469 (1534)
T ss_pred CCCHHCCHHHHHHHHHHHHHHHCCCEEEEE------CCCHHHHHHHCCEEEEECCC
T ss_conf 871010316899999998532215748751------12222466403101442564