RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 003801
         (794 letters)



>gnl|CDD|239657 cd03685, ClC_6_like, ClC-6-like chloride channel proteins. This CD
           includes ClC-6, ClC-7 and ClC-B, C, D in plants.
           Proteins in this family are ubiquitous in eukarotes and
           their functions are unclear. They are expressed in
           intracellular organelles membranes.  This family belongs
           to the ClC superfamily of chloride ion channels, which
           share the unique double-barreled architecture and
           voltage-dependent gating mechanism. The gating is
           conferred by the permeating anion itself, acting as the
           gating charge. ClC chloride ion channel superfamily
           perform a variety of functions including cellular
           excitability regulation, cell volume regulation,
           membrane potential stabilization, acidification of
           intracellular organelles, signal transduction, and
           transepithelial transport in animals.
          Length = 466

 Score =  470 bits (1212), Expect = e-159
 Identities = 175/335 (52%), Positives = 234/335 (69%)

Query: 65  ESLDYEIAENDFFKEDWRTRGRNQMIQYIFMKWSLCFLIGLIVGLIGFLNNLAVENIAGT 124
           ESLDYE+ END F+E+WR R + Q++QY F+KW +C LIG+  GL+ +  +LAVEN+AG 
Sbjct: 1   ESLDYEVIENDLFREEWRKRKKKQVLQYEFLKWIICLLIGIFTGLVAYFIDLAVENLAGL 60

Query: 125 KFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVTATVAPVAAGSGIPEVKAYLNGVDA 184
           KF+V  N +   +   AFL +   NLVL L A+++ A +AP AAGSGIPEVK YLNGV  
Sbjct: 61  KFLVVKNYIEKGRLFTAFLVYLGLNLVLVLVAALLVAYIAPTAAGSGIPEVKGYLNGVKI 120

Query: 185 PGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTGACVASLLGQGGSKKYGLTWKWLRF 244
           P IL  +TL++KI+G I +VS  L +GK GPM+H GAC+A+ L QGGS    L ++W R+
Sbjct: 121 PHILRLKTLLVKIVGVILSVSGGLALGKEGPMIHIGACIAAGLSQGGSTSLRLDFRWFRY 180

Query: 245 FKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEEMASWWRSALLWRAFFTTAIVAILL 304
           F+NDRD+RDFVTCG+AAG+AAAF APVGG+LF++EE+AS+W  AL WR FF++ IV   L
Sbjct: 181 FRNDRDKRDFVTCGAAAGVAAAFGAPVGGVLFSLEEVASFWNQALTWRTFFSSMIVTFTL 240

Query: 305 RAFIDICKSGKCGLFGTGGLIMYDVYSADASYHLADVPLVLLLGVVGGILGSLYNFLLDK 364
             F+  C SGKCGLFG GGLIM+D  S    Y   ++   +L+GV+GG+LG+L+N L  K
Sbjct: 241 NFFLSGCNSGKCGLFGPGGLIMFDGSSTKYLYTYFELIPFMLIGVIGGLLGALFNHLNHK 300

Query: 365 VLRIYNFINEKGIAPKIFLACTISILTSCLLFGLP 399
           V R    IN KG   K+  A  +S++TS + F   
Sbjct: 301 VTRFRKRINHKGKLLKVLEALLVSLVTSVVAFPQT 335



 Score =  175 bits (447), Expect = 9e-48
 Identities = 73/126 (57%), Positives = 98/126 (77%), Gaps = 3/126 (2%)

Query: 464 SSMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGS---HSNLNHGLYA 520
            ++LIFFV  + L+  ++GI  P+GLF+P I+ GA+YGR VG+L+GS    ++++ GLYA
Sbjct: 334 QTLLIFFVLYYFLACWTFGIAVPSGLFIPMILIGAAYGRLVGILLGSYFGFTSIDPGLYA 393

Query: 521 VLGAASFLGGSMRMTVSLCVIILELTNNLLLLPMIMLVLLISKTVADAFNGNIYDLIMKA 580
           +LGAA+FLGG MRMTVSL VI+LELTNNL  LP IMLVL+I+K V D FN  IYD+I++ 
Sbjct: 394 LLGAAAFLGGVMRMTVSLTVILLELTNNLTYLPPIMLVLMIAKWVGDYFNEGIYDIIIQL 453

Query: 581 KGFPYL 586
           KG P+L
Sbjct: 454 KGVPFL 459


>gnl|CDD|239656 cd03684, ClC_3_like, ClC-3-like chloride channel proteins.  This CD
            includes ClC-3, ClC-4, ClC-5 and ClC-Y1. ClC-3 was
           initially cloned from rat kidney. Expression of ClC-3
           produces outwardly-rectifying Cl currents that are
           inhibited by protein kinase C activation. It has been
           suggested that ClC-3 may be a ubiquitous
           swelling-activated Cl channel that has very similar
           characteristics to those of native volume-regulated Cl
           currents. The function of ClC-4 is unclear. Studies of
           human ClC-4 have revealed that it gives rise to Cl
           currents that rapidly activate at positive voltages, and
           are sensitive to extracellular pH, with currents
           decreasing when pH falls below 6.5. ClC-4 is broadly
           distributed, especially in brain and heart.   ClC-5 is
           predominantly expressed in the kidney, but can be found
           in the brain and liver. Mutations in the ClC-5 gene
           cause certain hereditary diseases, including Dent's
           disease, an X-chromosome linked syndrome characterised
           by proteinuria, hypercalciuria, and kidney stones
           (nephrolithiasis), leading to progressive renal failure.
             These proteins belong to the ClC superfamily of
           chloride ion channels, which share the unique
           double-barreled architecture and voltage-dependent
           gating mechanism. The gating is conferred by the
           permeating anion itself, acting as the gating charge.
           This domain is found in the eukaryotic halogen ion (Cl-
           and I-) channel proteins, that perform a variety of
           functions including cell volume regulation, the membrane
           potential stabilization, transepithelial chloride
           transport and charge compensation necessary for the
           acidification of intracellular organelles.
          Length = 445

 Score =  261 bits (668), Expect = 5e-79
 Identities = 150/512 (29%), Positives = 245/512 (47%), Gaps = 96/512 (18%)

Query: 104 GLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVTATV 163
           G+ +GLI  L ++    ++  K    + ++        ++  +   L+    A ++   V
Sbjct: 1   GIAIGLIAGLIDIIASWLSDLKEGYCNYII--------YVLLA---LLFAFIAVLLVKVV 49

Query: 164 APVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTGACV 223
           AP AAGSGIPE+K  L+G    G L   TL+IK +G + AV+S L +GK GP+VH   CV
Sbjct: 50  APYAAGSGIPEIKTILSGFIIRGFLGKWTLLIKSVGLVLAVASGLSLGKEGPLVHIATCV 109

Query: 224 ASLLGQGGSKKYGLTWKWL-RFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEEMA 282
            +++            +   ++ +N+  RR+ ++  +AAG+A AF AP+GG+LF++EE++
Sbjct: 110 GNIIS-----------RLFPKYRRNEAKRREILSAAAAAGVAVAFGAPIGGVLFSLEEVS 158

Query: 283 SWWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVYSADASYHLADVP 342
            ++    LWR+FF  A+VA      ++         FGTG L++++V   D  +H  ++ 
Sbjct: 159 YYFPLKTLWRSFF-CALVAAFTLKSLNP--------FGTGRLVLFEV-EYDRDWHYFELI 208

Query: 343 LVLLLGVVGGILGSLYNFLLDKVLRIYNFINEKGIAPKIFLACTISILTSCLLFGLPWLA 402
             +LLG+ GG+ G+ +     K  R       K     +     ++++T+ + F  P+  
Sbjct: 209 PFILLGIFGGLYGAFFIKANIKWARFRKKSLLKRYP--VLEVLLVALITALISFPNPYTR 266

Query: 403 SC---------RPC-PSDASEACPTIGRSGNYKKFQCAPGYYNDLASLIFNTNDDAIRNL 452
                        C P D +  C        Y+      G Y  L SL+           
Sbjct: 267 LDMTELLELLFNECEPGDDNSLC-------CYRDPPAGDGVYKALWSLLLAL-------- 311

Query: 453 FSKDTDKEFQHSSMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSHS 512
                        ++   +T F     ++GI  PAG+FVP++  GA +GR VG+LV   +
Sbjct: 312 -------------IIKLLLTIF-----TFGIKVPAGIFVPSMAVGALFGRIVGILVEQLA 353

Query: 513 NLNH-----------------GLYAVLGAASFLGGSMRMTVSLCVIILELTNNLLLLPMI 555
                                GLYA++GAA+FLGG  RMTVSL VI+ ELT  L  +  +
Sbjct: 354 YSYPDSIFFACCTAGPSCITPGLYAMVGAAAFLGGVTRMTVSLVVIMFELTGALNYILPL 413

Query: 556 MLVLLISKTVADAFN-GNIYDLIMKAKGFPYL 586
           M+ +++SK VADA     IYD  +   G+P+L
Sbjct: 414 MIAVMVSKWVADAIGKEGIYDAHIHLNGYPFL 445


>gnl|CDD|216046 pfam00654, Voltage_CLC, Voltage gated chloride channel.  This
           family of ion channels contains 10 or 12 transmembrane
           helices. Each protein forms a single pore. It has been
           shown that some members of this family form homodimers.
           In terms of primary structure, they are unrelated to
           known cation channels or other types of anion channels.
           Three ClC subfamilies are found in animals. ClC-1 is
           involved in setting and restoring the resting membrane
           potential of skeletal muscle, while other channels play
           important parts in solute concentration mechanisms in
           the kidney. These proteins contain two pfam00571
           domains.
          Length = 345

 Score =  235 bits (601), Expect = 1e-70
 Identities = 118/421 (28%), Positives = 187/421 (44%), Gaps = 79/421 (18%)

Query: 151 VLTLFASIVTATVAPVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLV 210
           +  L A ++       AAGSGIPEV A L+GV   G L  R L++K +G++  + S   +
Sbjct: 1   IGGLLAGLLVKRFPE-AAGSGIPEVIAALHGVK--GPLPLRVLLVKFLGTLLTLGSGGSL 57

Query: 211 GKAGPMVHTGACVASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAP 270
           G+ GP V  GA + S LG+             R   +  DRR  +  G+AAGIAAAF AP
Sbjct: 58  GREGPSVQIGAAIGSGLGR-------------RLKLSRNDRRRLIAAGAAAGIAAAFNAP 104

Query: 271 VGGLLFAIEEMASWWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVY 330
           + G+LFA+EE++  +    L      + + A++ R                 G       
Sbjct: 105 LAGVLFALEELSRSFSYRALLPVLVASVVAALVSRLL--------------FGNEPLFEV 150

Query: 331 SADASYHLADVPLVLLLGVVGGILGSLYNFLLDKVLRIYNFINEKGIAPKIFLACTISIL 390
            +     L ++PL +LLG++ G+LG+L+  LL KV R++  +      P I       +L
Sbjct: 151 PSLPPLSLLELPLFILLGILCGLLGALFVRLLLKVERLFRRLK---KLPPILRPALGGLL 207

Query: 391 TSCLLFGLPWLASCRPCPSDASEACPTIGRSGNYKKFQCAPGYYNDLASLIFNTNDDAIR 450
              L   LP +                    G         GY   L  L+ N +  ++ 
Sbjct: 208 VGLLGLFLPEV-------------------LGG--------GY--GLIQLLLNGSTLSL- 237

Query: 451 NLFSKDTDKEFQHSSMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGS 510
                          +L+  +     + LS G  AP G+F P++  GA+ GR +G+L+  
Sbjct: 238 -------------LLLLLLLLLKLLATALSLGSGAPGGIFAPSLFIGAALGRLLGLLLPL 284

Query: 511 ---HSNLNHGLYAVLGAASFLGGSMRMTVSLCVIILELTNNLLLLPMIMLVLLISKTVAD 567
                  + G +A+LG A+FL G  R  ++  V++ ELT +  LL  +ML +LI+  V+ 
Sbjct: 285 LFPGIAPSPGAFALLGMAAFLAGVTRAPLTAIVLVFELTGSYSLLLPLMLAVLIAYLVSR 344

Query: 568 A 568
            
Sbjct: 345 L 345



 Score = 39.8 bits (94), Expect = 0.004
 Identities = 38/179 (21%), Positives = 61/179 (34%), Gaps = 22/179 (12%)

Query: 101 FLIGLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVT 160
            L+G++ GL+G L              V   + +   F        I    L      + 
Sbjct: 165 ILLGILCGLLGAL-------------FVRLLLKVERLFRRLKKLPPILRPALGGLLVGLL 211

Query: 161 ATVAPVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTG 220
               P   G G   ++  LNG     +L    L++K++ +  ++ S    G   P +  G
Sbjct: 212 GLFLPEVLGGGYGLIQLLLNGSTLSLLLLLLLLLLKLLATALSLGSGAPGGIFAPSLFIG 271

Query: 221 ACVASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIE 279
           A +  LLG             L F         F   G AA +A   RAP+  ++   E
Sbjct: 272 AALGRLLGLLLP---------LLFPGIAPSPGAFALLGMAAFLAGVTRAPLTAIVLVFE 321


>gnl|CDD|238507 cd01036, ClC_euk, Chloride channel, ClC.  These domains are found
           in the eukaryotic halogen ion (Cl-, Br- and I-) channel
           proteins that perform a variety of functions including
           cell volume regulation, membrane potential
           stabilization, charge compensation necessary for the
           acidification of intracellular organelles, signal
           transduction and transepithelial transport.  They are
           also involved in many pathophysiological processes and
           are responsible for a number of human diseases.  These
           proteins belong to the ClC superfamily of chloride ion
           channels, which share the unique double-barreled
           architecture and voltage-dependent gating mechanism.
           The gating is conferred by the permeating anion itself,
           acting as the gating charge.  Some proteins possess long
           C-terminal cytoplasmic regions containing two CBS
           (cystathionine beta synthase) domains of putative
           regulatory function.
          Length = 416

 Score =  236 bits (605), Expect = 3e-70
 Identities = 97/294 (32%), Positives = 160/294 (54%), Gaps = 3/294 (1%)

Query: 104 GLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVTATV 163
           GL++GL+  + + AVE+       +     +   + + +L + + ++VL L +S +    
Sbjct: 1   GLLMGLVAVVLDYAVESSLDAGQWLLRR--IPGSYLLGYLMWVLWSVVLVLISSGICLYF 58

Query: 164 APVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTGACV 223
           AP AAGSGIPEV AYLNGV  P  LS RTL+ K I  I AV+S L +GK GP+VH GA +
Sbjct: 59  APQAAGSGIPEVMAYLNGVHLPMYLSIRTLIAKTISCICAVASGLPLGKEGPLVHLGAMI 118

Query: 224 ASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEEMAS 283
            + L QG S+  G      + F+N RDRRDF+  G+AAG+A+AF AP+GGLLF +EE+++
Sbjct: 119 GAGLLQGRSRTLGCHVHLFQLFRNPRDRRDFLVAGAAAGVASAFGAPIGGLLFVLEEVST 178

Query: 284 WWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVYSADASYHLADVPL 343
           ++   L WR FF   + A +++ +       +     +   +   V+      +L +   
Sbjct: 179 FFPVRLAWRVFFAALVSAFVIQIYNSFNSGFELLDRSSAMFLSLTVFELHVPLNLYEFIP 238

Query: 344 VLLLGVVGGILGSLYNFLLDKVLR-IYNFINEKGIAPKIFLACTISILTSCLLF 396
            +++GV+ G+L +L+  L    LR     +  K    ++      +++ S + +
Sbjct: 239 TVVIGVICGLLAALFVRLSIIFLRWRRRLLFRKTARYRVLEPVLFTLIYSTIHY 292



 Score =  115 bits (291), Expect = 1e-27
 Identities = 51/122 (41%), Positives = 81/122 (66%), Gaps = 11/122 (9%)

Query: 465 SMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLV-----------GSHSN 513
           ++L+F +  F +S L++GI  P G F+P++V GA+ GR VG+LV            +   
Sbjct: 295 TLLLFLLIYFWMSALAFGIAVPGGTFIPSLVIGAAIGRLVGLLVHRIAVAGIGAESATLW 354

Query: 514 LNHGLYAVLGAASFLGGSMRMTVSLCVIILELTNNLLLLPMIMLVLLISKTVADAFNGNI 573
            + G+YA++GAA+FLGG+ R+T S+CVI++ELT +L  L  +M+ +LI+K VADAF  ++
Sbjct: 355 ADPGVYALIGAAAFLGGTTRLTFSICVIMMELTGDLHHLLPLMVAILIAKAVADAFCESL 414

Query: 574 YD 575
           Y 
Sbjct: 415 YH 416


>gnl|CDD|238504 cd01031, EriC, ClC chloride channel EriC.  This domain is found in
           the EriC chloride transporters that mediate the extreme
           acid resistance response in eubacteria and archaea. This
           response allows bacteria to survive in the acidic
           environments by decarboxylation-linked proton
           utilization. As shown for Escherichia coli EriC, these
           channels can counterbalance the electric current
           produced by the outwardly directed virtual proton pump
           linked to amino acid decarboxylation.  The EriC proteins
           belong to the ClC superfamily of chloride ion channels,
           which share a unique double-barreled architecture and
           voltage-dependent gating mechanism.  The
           voltage-dependent gating is conferred by the permeating
           anion itself, acting as the gating charge. In
           Escherichia coli EriC, a glutamate residue that
           protrudes into the pore is thought to participate in
           gating by binding to a Cl- ion site within the
           selectivity filter.
          Length = 402

 Score =  177 bits (452), Expect = 5e-49
 Identities = 129/480 (26%), Positives = 202/480 (42%), Gaps = 85/480 (17%)

Query: 103 IGLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVTAT 162
           IGL+ GL+  L  L ++ +      ++     AN   +  +   I   VL L A  +   
Sbjct: 1   IGLLAGLVAVLFRLGIDKL--GNLRLSLYDFAANNPPLLLVLPLIS-AVLGLLAGWLVKK 57

Query: 163 VAPVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTGAC 222
            AP A GSGIP+V+  L G+       +R L +K +G + A+ S L +G+ GP V  GA 
Sbjct: 58  FAPEAKGSGIPQVEGVLAGL--LPPNWWRVLPVKFVGGVLALGSGLSLGREGPSVQIGAA 115

Query: 223 VASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEEMA 282
           +    GQG SK          F  +  +RR  +  G+AAG+AAAF AP+ G+LF +EE+ 
Sbjct: 116 I----GQGVSKW---------FKTSPEERRQLIAAGAAAGLAAAFNAPLAGVLFVLEELR 162

Query: 283 SWWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVYSADASYHLADVP 342
             +    L  A   +     + R F           FG  G ++           L    
Sbjct: 163 HSFSPLALLTALVASIAADFVSRLF-----------FGL-GPVLSIPPLPALP--LKSYW 208

Query: 343 LVLLLGVVGGILGSLYNFLLDKVLRIYNFINEKGIAPKIFLACTISILTSCLLFGLPWLA 402
           L+LLLG++ G+LG L+N  L K   +Y  + +     ++ L   +      LL       
Sbjct: 209 LLLLLGIIAGLLGYLFNRSLLKSQDLYRKLKKLPRELRVLLPGLLIGPLGLLL------- 261

Query: 403 SCRPCPSDASEACPTIGRSGNYKKFQCAPGYYNDLASLIFNTNDDAIRNLFSKDTDKEFQ 462
                        P     G+                LI +                 F 
Sbjct: 262 -------------PEALGGGH---------------GLILSL------------AGGNFS 281

Query: 463 HSSMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSHSNLNHGL---Y 519
            S +L+ FV  F  ++LSYG  AP G+F P +  GA  G   G ++     +       +
Sbjct: 282 ISLLLLIFVLRFIFTMLSYGSGAPGGIFAPMLALGALLGLLFGTILVQLGPIPISAPATF 341

Query: 520 AVLGAASFLGGSMRMTVSLCVIILELTNNL-LLLPMIMLVLLISKTVADAFNGN-IYDLI 577
           A+ G A+F    +R  ++  +++ E+T N  LLLP+ M+V L++  VAD   G  IY+ +
Sbjct: 342 AIAGMAAFFAAVVRAPITAIILVTEMTGNFNLLLPL-MVVCLVAYLVADLLGGKPIYEAL 400


>gnl|CDD|223116 COG0038, EriC, Chloride channel protein EriC [Inorganic ion
           transport and metabolism].
          Length = 443

 Score =  171 bits (436), Expect = 2e-46
 Identities = 117/478 (24%), Positives = 185/478 (38%), Gaps = 83/478 (17%)

Query: 102 LIGLIVGLIGFLNNLAVENI-AGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVT 160
            I   +G+I  L  +A++        +    +  A   G   L        L + A +V 
Sbjct: 24  GIAAALGVIVGLFAVALDLALLLLGRLRGGLLSAAQAPGPWLLPLVPALGGLLVGALLV- 82

Query: 161 ATVAPVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTG 220
              AP A GSGIP+    L+G      +S R L +K++ ++  + S   +G+ GP V  G
Sbjct: 83  YKFAPEARGSGIPQAIEALHGRKGR--ISPRVLPVKLVATLLTIGSGASLGREGPSVQIG 140

Query: 221 ACVASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEE 280
           A + SLLG+                 +  DRR  +  G+AAG+AAAF AP+ G LFAIE 
Sbjct: 141 AAIGSLLGR-------------LLKLSREDRRILLAAGAAAGLAAAFNAPLAGALFAIEV 187

Query: 281 MASW-WRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVYSADASYHLA 339
           +         L       A+VA+L+                 GG              L 
Sbjct: 188 LYGRFLEYRALVPVLV-AAVVALLVAGLF-------------GGPHFLLPIVTTPHMSLW 233

Query: 340 DVPLVLLLGVVGGILGSLYNFLLDKVLRIYNFINEKGIAPKIFLACTISILTSCLLFGLP 399
           D+ L L+LG++ G+ G L + LL  + R +          +  L   +      L    P
Sbjct: 234 DLLLYLVLGIIAGLFGVLLSRLL-ALSRRFFRRLPLPPLLRPALGGLLVGA---LGLLFP 289

Query: 400 WLASCRPCPSDASEACPTIGRSGNYKKFQCAPGYYNDLASLIFNTNDDAIRNLFSKDTDK 459
                             +G      +   A                             
Sbjct: 290 E----------------VLGNGYGLIQLALAG---------------------------- 305

Query: 460 EFQHSSMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSHSNLNH--- 516
           E     +L+ F+     ++LSYG  AP G+F P++  GA+ G   G L+G     +    
Sbjct: 306 EGGLLVLLLLFLLKLLATLLSYGSGAPGGIFAPSLFIGAALGLAFGALLGLLFPPSILEP 365

Query: 517 GLYAVLGAASFLGGSMRMTVSLCVIILELTNNLLLLPMIMLVLLISKTVADAFNGNIY 574
           GL+A+LG A+FL  + R  ++  V++LE+T N  LL  +++  LI+  V+    G   
Sbjct: 366 GLFALLGMAAFLAATTRAPLTAIVLVLEMTGNYQLLLPLLIACLIAYLVSRLLGGRPI 423


>gnl|CDD|239655 cd03683, ClC_1_like, ClC-1-like chloride channel proteins. This CD
           includes isoforms ClC-0, ClC-1, ClC-2 and ClC_K. ClC-1
           is expressed in skeletal muscle and its mutation leads
           to both recessively and dominantly-inherited forms of
           muscle stiffness or myotonia. ClC-K is exclusively
           expressed in kidney. Similarly, mutation of ClC-K leads
           to nephrogenic diabetes insipidus in mice and Bartter's
           syndrome in human. These proteins belong to the ClC
           superfamily of chloride ion channels, which share the
           unique double-barreled architecture and
           voltage-dependent gating mechanism.  The gating is
           conferred by the permeating anion itself, acting as the
           gating charge. This domain is found in the eukaryotic
           halogen ion (Cl-, Br- and I-) channel proteins, that
           perform a variety of functions including cell volume
           regulation, regulation of intracelluar chloride
           concentration, membrane potential stabilization, charge
           compensation necessary for the acidification of
           intracellular organelles and transepithelial chloride
           transport.
          Length = 426

 Score =  164 bits (418), Expect = 4e-44
 Identities = 88/306 (28%), Positives = 152/306 (49%), Gaps = 23/306 (7%)

Query: 97  WSLCFLIGLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFA 156
           W    L+G+++ LI    + AVE +           +L     + +L +    + L LF+
Sbjct: 2   WLFLALLGILMALISIAMDFAVEKL--LNARRWLYSLLTGNSLLQYLVWVAYPVALVLFS 59

Query: 157 SIVTATVAPVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPM 216
           ++    ++P A GSGIPE+K  L GV  P  L+F+TLV K+IG   A+ S L +GK GP 
Sbjct: 60  ALFCKYISPQAVGSGIPEMKTILRGVVLPEYLTFKTLVAKVIGLTCALGSGLPLGKEGPF 119

Query: 217 VHTGACVASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLF 276
           VH  + VA+LL +  +   G        ++N+  R + +    A G+A  F AP+GG+LF
Sbjct: 120 VHISSIVAALLSKLTTFFSG-------IYENESRRMEMLAAACAVGVACTFGAPIGGVLF 172

Query: 277 AIEEMASWWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVYSADASY 336
           +IE  ++++     WR FF     A   R         +     T   +    +  D  +
Sbjct: 173 SIEVTSTYFAVRNYWRGFFAATCGAFTFRLLAVFFSDQE-----TITALFKTTFFVDFPF 227

Query: 337 HLADVPLVLLLGVVGGILGSLYNFLLDKVL------RIYNFINEKGIAPKIFLACTISIL 390
            + ++P+  LLG++ G+LG+L+ FL  K++      R+++       +P ++ A  +++L
Sbjct: 228 DVQELPIFALLGIICGLLGALFVFLHRKIVRFRRKNRLFSKF--LKRSPLLYPAI-VALL 284

Query: 391 TSCLLF 396
           T+ L F
Sbjct: 285 TAVLTF 290



 Score = 99.6 bits (249), Expect = 3e-22
 Identities = 52/133 (39%), Positives = 75/133 (56%), Gaps = 13/133 (9%)

Query: 465 SMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSH----------SNL 514
           ++ +F V  F L+ L+  +  PAG+F+P  V GA+ GR VG ++             + +
Sbjct: 294 TLFLFIVVKFVLTALAITLPVPAGIFMPVFVIGAALGRLVGEIMAVLFPEGIRGGISNPI 353

Query: 515 NHGLYAVLGAASFLGGSMRMTVSLCVIILELTNNL-LLLPMIMLVLLISKTVADAFNGNI 573
             G YAV+GAA+F G     TVS+ VII ELT  +  LLP+ ++ +LIS  VA     +I
Sbjct: 354 GPGGYAVVGAAAFSGAVTH-TVSVAVIIFELTGQISHLLPV-LIAVLISNAVAQFLQPSI 411

Query: 574 YDLIMKAKGFPYL 586
           YD I+K K  PYL
Sbjct: 412 YDSIIKIKKLPYL 424


>gnl|CDD|238233 cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel. The
           ClC chloride channels catalyse the selective flow of Cl-
           ions across cell membranes, thereby regulating
           electrical excitation in skeletal muscle and the flow of
           salt and water across epithelial barriers. This domain
           is found in the halogen ions (Cl-, Br- and I-) transport
           proteins of the ClC family.  The ClC channels are found
           in all three kingdoms of life and perform a variety of
           functions including cellular excitability regulation,
           cell volume regulation, membrane potential
           stabilization, acidification of intracellular
           organelles, signal transduction, transepithelial
           transport in animals, and the extreme acid resistance
           response in eubacteria.  They lack any structural or
           sequence similarity to other known ion channels and
           exhibit unique properties of ion permeation and gating. 
           Unlike cation-selective ion channels, which form
           oligomers containing a single pore along the axis of
           symmetry, the ClC channels form two-pore homodimers with
           one pore per subunit without axial symmetry.  Although
           lacking the typical voltage-sensor found in cation
           channels, all studied ClC channels are gated (opened and
           closed) by transmembrane voltage. The gating is
           conferred by the permeating ion itself, acting as the
           gating charge.  In addition, eukaryotic and some
           prokaryotic ClC channels have two additional C-terminal
           CBS (cystathionine beta synthase) domains of putative
           regulatory function.
          Length = 383

 Score =  153 bits (389), Expect = 1e-40
 Identities = 115/463 (24%), Positives = 182/463 (39%), Gaps = 85/463 (18%)

Query: 104 GLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVTATV 163
           G++ GL   L  L +E +    F      + A      ++   +P  V+      +   +
Sbjct: 1   GVLSGLGAVLFRLLIELLQNLLFGGLPGELAAGSLSPLYILL-VP--VIGGLLVGLLVRL 57

Query: 164 APVAAGSGIPEV-KAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTGAC 222
              A G GIPEV +A   G    G L  R  ++K + S   + S   VG+ GP+V  GA 
Sbjct: 58  LGPARGHGIPEVIEAIALG---GGRLPLRVALVKFLASALTLGSGGSVGREGPIVQIGAA 114

Query: 223 VASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEEMA 282
           + S LG+             R   +  DRR  V CG+AAGIAAAF AP+ G LFAIE + 
Sbjct: 115 IGSWLGR-------------RLRLSRNDRRILVACGAAAGIAAAFNAPLAGALFAIEVLL 161

Query: 283 SWWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVYSADASYHLADVP 342
             +  A L      +   A++ R            LFG        +Y       L ++P
Sbjct: 162 GEYSVASLIPVLLASVAAALVSRL-----------LFGAEPAFGVPLYD---PLSLLELP 207

Query: 343 LVLLLGVVGGILGSLYNFLLDKVLRIYNFINEKGIAPKIFLACTISILTSCLLFGLPWLA 402
           L LLLG++ G++G L+  LL K+ R++     +   P         +L   L   LP   
Sbjct: 208 LYLLLGLLAGLVGVLFVRLLYKIERLF----RRLPIPPWLRPALGGLLLGLLGLFLPQ-- 261

Query: 403 SCRPCPSDASEACPTIGRSGNYKKFQCAPGYYNDLASLIFNTNDDAIRNLFSKDTDKEFQ 462
                                        G       L                      
Sbjct: 262 ---------------------------VLGSGYGAILLALAGELSL-------------- 280

Query: 463 HSSMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSHSNLNH---GLY 519
              +L+  +     + L+ G   P G+F P++  GA+ G   G+L+ +         G Y
Sbjct: 281 -LLLLLLLLLKLLATALTLGSGFPGGVFAPSLFIGAALGAAFGLLLPALFPGLVASPGAY 339

Query: 520 AVLGAASFLGGSMRMTVSLCVIILELTNNLLLLPMIMLVLLIS 562
           A++G A+ L   +R  ++  +++LELT +  LL  +ML ++I+
Sbjct: 340 ALVGMAALLAAVLRAPLTAILLVLELTGDYSLLLPLMLAVVIA 382



 Score = 36.4 bits (85), Expect = 0.056
 Identities = 38/184 (20%), Positives = 63/184 (34%), Gaps = 30/184 (16%)

Query: 99  LCFLIGLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASI 158
           L  L+GL+ GL+G L    +  I                    F    IP  +      +
Sbjct: 208 LYLLLGLLAGLVGVLFVRLLYKIERL-----------------FRRLPIPPWLRPALGGL 250

Query: 159 VTATVA---PVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGP 215
           +   +    P   GSG   +   L G  +  +L    L++K++ +   + S    G   P
Sbjct: 251 LLGLLGLFLPQVLGSGYGAILLALAGELSLLLL-LLLLLLKLLATALTLGSGFPGGVFAP 309

Query: 216 MVHTGACVASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLL 275
            +  GA + +  G               F         +   G AA +AA  RAP+  +L
Sbjct: 310 SLFIGAALGAAFGLLLP---------ALFPGLVASPGAYALVGMAALLAAVLRAPLTAIL 360

Query: 276 FAIE 279
             +E
Sbjct: 361 LVLE 364


>gnl|CDD|235385 PRK05277, PRK05277, chloride channel protein; Provisional.
          Length = 438

 Score =  129 bits (326), Expect = 6e-32
 Identities = 116/474 (24%), Positives = 198/474 (41%), Gaps = 106/474 (22%)

Query: 102 LIGLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVTA 161
           ++G + GL+G    LAV+ +   +              +  L+    N +L    + + +
Sbjct: 6   VVGTLTGLVGVAFELAVDWVQNQR--------------LGLLASVADNGLLLWIVAFLIS 51

Query: 162 TV------------APVAAGSGIPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLL 209
            V            AP A GSGIPE++  L G+    +  +R L +K  G +  + S ++
Sbjct: 52  AVLAMIGYFLVRRFAPEAGGSGIPEIEGALEGLR--PVRWWRVLPVKFFGGLGTLGSGMV 109

Query: 210 VGKAGPMVHTGACVASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFRA 269
           +G+ GP V  G  +  ++               R  ++D  R   +  G+AAG+AAAF A
Sbjct: 110 LGREGPTVQMGGNIGRMVLD-----------IFRL-RSDEARHTLLAAGAAAGLAAAFNA 157

Query: 270 PVGGLLFAIEEMASWWRSALL-WRAFFTTAIVA-ILLRAFIDICKSGKCGLFGTGGLIMY 327
           P+ G+LF IEEM   +R +L+  +A F   I+A I+ R F            G   +I  
Sbjct: 158 PLAGILFVIEEMRPQFRYSLISIKAVFIGVIMATIVFRLFN-----------GEQAVIEV 206

Query: 328 DVYSADASYHLADVPLVLLLGVVGGILGSLYNFLLDKVLRIYNFINEKGIAPKIFLACTI 387
             +SA     L  + L LLLG++ GI G L+N LL +   +++ ++       + +   +
Sbjct: 207 GKFSAPP---LNTLWLFLLLGIIFGIFGVLFNKLLLRTQDLFDRLHGGNKKRWVLMGGAV 263

Query: 388 SILTSCLLFG-LPWLASCRPCPSDASEACPTIGRSGNYKKFQCAPGYYNDLASLIFNTND 446
                  L G L  LA               +G  G +              +LI     
Sbjct: 264 -----GGLCGLLGLLAP------------AAVG--GGF--------------NLIPIA-- 288

Query: 447 DAIRNLFSKDTDKEFQHSSMLIFFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGM 506
                         F    +L  FV  F  ++L +G  AP G+F P +  G   G   GM
Sbjct: 289 ----------LAGNFSIGMLLFIFVARFITTLLCFGSGAPGGIFAPMLALGTLLGLAFGM 338

Query: 507 LVG---SHSNLNHGLYAVLGAASFLGGSMRMTVSLCVIILELTNN-LLLLPMIM 556
           +        ++  G +A+ G  +    ++R  ++  V++LE+T+N  L+LP+I+
Sbjct: 339 VAAALFPQYHIEPGTFAIAGMGALFAATVRAPLTGIVLVLEMTDNYQLILPLII 392


>gnl|CDD|238506 cd01034, EriC_like, ClC chloride channel family. These protein
           sequences, closely related to the ClC Eric family, are
           putative halogen ion (Cl-, Br- and I-) transport
           proteins found in eubacteria. They belong to the ClC
           superfamily of chloride ion channels, which share a
           unique double-barreled architecture and
           voltage-dependent gating mechanism.  This superfamily
           lacks any structural or sequence similarity to other
           known ion channels and exhibit unique properties of ion
           permeation and gating.  The voltage-dependent gating is
           conferred by the permeating anion itself, acting as the
           gating charge.
          Length = 390

 Score = 94.6 bits (236), Expect = 1e-20
 Identities = 73/225 (32%), Positives = 100/225 (44%), Gaps = 33/225 (14%)

Query: 150 LVLTL--FASIV--TATVAPVAAGSGIPEVKAYL---NGVDAPGILSFRTLVIKIIGSIS 202
           L+LT   FA I   T    P AAGSGIP+V A L   +      +LS RT V KI+ ++ 
Sbjct: 30  LLLTPAGFALIAWLTRRFFPGAAGSGIPQVIAALELPSAAARRRLLSLRTAVGKILLTLL 89

Query: 203 AVSSSLLVGKAGPMVHTGACVASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAG 262
            +     VG+ GP V  GA V   +G+   K  GL+             R  +  G AAG
Sbjct: 90  GLLGGASVGREGPSVQIGAAVMLAIGRRLPKWGGLS------------ERGLILAGGAAG 137

Query: 263 IAAAFRAPVGGLLFAIEEMASWWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTG 322
           +AAAF  P+ G++FAIEE++         R F       +LL        +G   L   G
Sbjct: 138 LAAAFNTPLAGIVFAIEELS---------RDFELRFSGLVLLAVIA----AGLVSLAVLG 184

Query: 323 GLIMYDVYSADASYHLADVPLVLLLGVVGGILGSLYNFLLDKVLR 367
               + V +           LVL+ GVVGG+ G L+  LL  +  
Sbjct: 185 NYPYFGVAAVALPLG-EAWLLVLVCGVVGGLAGGLFARLLVALSS 228



 Score = 51.1 bits (123), Expect = 1e-06
 Identities = 31/98 (31%), Positives = 53/98 (54%), Gaps = 3/98 (3%)

Query: 469 FFVTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSHSNLNHGLYAVLGAASFL 528
           F +  F  ++LSY    P GLF P++  GA  G  +  L+GS S    G   +LG A+FL
Sbjct: 286 FGLLKFLATLLSYWSGIPGGLFAPSLAVGAGLGSLLAALLGSVS---QGALVLLGMAAFL 342

Query: 529 GGSMRMTVSLCVIILELTNNLLLLPMIMLVLLISKTVA 566
            G  +  ++  VI++E+T +  +L  ++   L++  V+
Sbjct: 343 AGVTQAPLTAFVIVMEMTGDQQMLLPLLAAALLASGVS 380


>gnl|CDD|239964 cd04591, CBS_pair_EriC_assoc_euk_bac, This cd contains two tandem
           repeats of the cystathionine beta-synthase (CBS pair)
           domains in the EriC CIC-type chloride channels in
           eukaryotes and bacteria. These ion channels are proteins
           with a seemingly simple task of allowing the passive
           flow of chloride ions across biological membranes.
           CIC-type chloride channels come from all kingdoms of
           life, have several gene families, and can be gated by
           voltage. The members of the CIC-type chloride channel
           are double-barreled: two proteins forming homodimers at
           a broad interface formed by four helices from each
           protein. The two pores are not found at this interface,
           but are completely contained within each subunit, as
           deduced from the mutational analyses, unlike many other
           channels, in which four or five identical or
           structurally related subunits jointly form one pore. CBS
           is a small domain originally identified in cystathionine
           beta-synthase and subsequently found in a wide range of
           different proteins. CBS domains usually come in tandem
           repeats, which associate to form a so-called Bateman
           domain or a CBS pair which is reflected in this model.
           The interface between the two CBS domains forms a cleft
           that is a potential ligand binding site. The CBS pair
           coexists with a variety of other functional domains. It
           has been proposed that the CBS domain may play a
           regulatory role, although its exact function is unknown.
           Mutations of conserved residues within this domain in
           CLC chloride channel family members have been associated
           with classic Bartter syndrome, Osteopetrosis, Dent's
           disease, idiopathic generalized epilepsy, and myotonia.
          Length = 105

 Score = 81.9 bits (203), Expect = 9e-19
 Identities = 44/148 (29%), Positives = 60/148 (40%), Gaps = 58/148 (39%)

Query: 617 GNVVHVLRTTRHNGFPVIDENPLSETPILYGLILRAHLITLLKKKAFLPTPNPTVNDAFS 676
            ++  +L TT HNGFPV+D    S  P L G ILR+ L+  LK                 
Sbjct: 14  EDLESLLSTTSHNGFPVVDSTEES--PRLVGYILRSQLVVALKN---------------- 55

Query: 677 QFSAVDFAKRGSGNGDKIEDIELSEEEMEMYVDLHPFANASPYTVVETMSLAKALILFRE 736
                                         Y+D       SP+TV    SL K   LFR+
Sbjct: 56  ------------------------------YIDP------SPFTVSPRTSLEKVHQLFRK 79

Query: 737 VGLRHLLVIPKISNGSPVMGILTRHDFM 764
           +GLRHLLV+    +   ++GI+TR D +
Sbjct: 80  LGLRHLLVV----DEGRLVGIITRKDLL 103


>gnl|CDD|238505 cd01033, ClC_like, Putative ClC chloride channel.  Clc proteins are
           putative halogen ion (Cl-, Br- and I-) transporters
           found in eubacteria. They belong to the ClC superfamily
           of halogen ion channels, which share a unique
           double-barreled architecture and voltage-dependent
           gating mechanism.  This superfamily lacks any structural
           or sequence similarity to other known ion channels and
           exhibit unique properties of ion permeation and gating. 
           The voltage-dependent gating is conferred by the
           permeating anion itself, acting as the gating charge.
          Length = 388

 Score = 58.1 bits (141), Expect = 9e-09
 Identities = 67/277 (24%), Positives = 101/277 (36%), Gaps = 63/277 (22%)

Query: 104 GLIVGLIGFLNNL---AVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVT 160
           G+  GL G L  L    V+++A               FG +  SF      ++     ++
Sbjct: 1   GVGAGLGGGLLTLLLHGVQHLA---------------FGYSEGSFLTGVAAVSPIRRALS 45

Query: 161 ATVAPVAAGSG----------IPEVKAYLNGVDAPGILSFRTLVIKIIGSISAVSSSLLV 210
            TV  + AG G          +  +K     V     + F   +I       AV   + V
Sbjct: 46  LTVGGLIAGLGWYLLRRKGKKLVSIK---QAVRGKKRMPFWETII------HAVLQIVTV 96

Query: 211 GKAGPMVHTGAC--VASLLGQGGSKKYGLTWKWLRFFKNDRDRRDFVTCGSAAGIAAAFR 268
           G   P+    A   V +LL Q  S   GLT           DRR  V C + AG+AA + 
Sbjct: 97  GLGAPLGREVAPREVGALLAQRFSDWLGLT---------VADRRLLVACAAGAGLAAVYN 147

Query: 269 APVGGLLFAIEEMASWWRSALLWRAFFTTAIVAILLRAFIDICKSGKCGLFGTGGLIMYD 328
            P+ G LFA+E         +L R     ++VA L  + I    +        G   +YD
Sbjct: 148 VPLAGALFALE---------ILLRTISLRSVVAALATSAIAAAVAS----LLKGDHPIYD 194

Query: 329 VYSADASYHLADVPLVLLLGVVGGILGSLYNFLLDKV 365
           +     S  L  +   LL G V G++ + +  L    
Sbjct: 195 IPPMQLSTPL--LIWALLAGPVLGVVAAGFRRLSQAA 229



 Score = 40.7 bits (96), Expect = 0.003
 Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 8/92 (8%)

Query: 471 VTCFSLSVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSH-SNLNHGLYAVLGAASFLG 529
            T  +L   +YG     GL  P++  GA  G  +G++  +    L+   +A++GAA+FL 
Sbjct: 291 ATLLALRAGAYG-----GLLTPSLALGALLGALLGIVWNALLPPLSIAAFALIGAAAFLA 345

Query: 530 GSMRMTVSLCVIILELT--NNLLLLPMIMLVL 559
            + +  ++  +++LE T  N L L+P+++ V 
Sbjct: 346 ATQKAPLTALILVLEFTRQNPLFLIPLMLAVA 377


>gnl|CDD|234987 PRK01862, PRK01862, putative voltage-gated ClC-type chloride
           channel ClcB; Provisional.
          Length = 574

 Score = 51.3 bits (123), Expect = 2e-06
 Identities = 31/95 (32%), Positives = 46/95 (48%), Gaps = 15/95 (15%)

Query: 186 GILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTGACVASLLGQGGSKKYGLTWKWLRFF 245
           G++  R  + +   S+  + S   +G+ GPMV   A  ASL+G           ++  F 
Sbjct: 111 GVVPVRQSLWRSASSLLTIGSGGSIGREGPMVQLAALAASLVG-----------RFAHF- 158

Query: 246 KNDRDR-RDFVTCGSAAGIAAAFRAPVGGLLFAIE 279
             D  R R  V CG+AAGI +A+ AP+ G  F  E
Sbjct: 159 --DPPRLRLLVACGAAAGITSAYNAPIAGAFFVAE 191



 Score = 37.4 bits (87), Expect = 0.033
 Identities = 25/95 (26%), Positives = 44/95 (46%), Gaps = 11/95 (11%)

Query: 477 SVLSYGIVAPAGLFVPAIVTGASYGRFVGMLVGSHSNLNHG-----LYAVLGAASFLGGS 531
           +  + G  A  G+F P +  GA  G   G+    H+           YA++G  +FL G+
Sbjct: 327 TAATAGSGAVGGVFTPTLFVGAVVGSLFGL--AMHALWPGHTSAPFAYAMVGMGAFLAGA 384

Query: 532 MRMTVSLCVIILELTNNLLLLPMIMLVLLISKTVA 566
            +  +   ++I E+T    L   ++L L++S  VA
Sbjct: 385 TQAPLMAILMIFEMT----LSYQVVLPLMVSCVVA 415


>gnl|CDD|234963 PRK01610, PRK01610, putative voltage-gated ClC-type chloride
           channel ClcB; Provisional.
          Length = 418

 Score = 50.9 bits (122), Expect = 2e-06
 Identities = 41/187 (21%), Positives = 81/187 (43%), Gaps = 29/187 (15%)

Query: 178 YLNGVDAPGILSFRTLVIKIIGSISAVSSSLLVGKAGPMVHTGACVASLLGQGGSKKYGL 237
           Y+  +   G   +   ++K + S+  V+S   +G+ G M+   A  AS   Q  + +   
Sbjct: 85  YMEALQTDGQFDYAASLVKSLASLLVVTSGSAIGREGAMILLAALAASCFAQRFTPRQ-- 142

Query: 238 TWK-WLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEEMASWWRSALLWRAFFT 296
            WK W             + CG+AAG+A+A+ AP+ G LF  E         +L+     
Sbjct: 143 EWKLW-------------IACGAAAGMASAYHAPLAGSLFIAE---------ILFGTLML 180

Query: 297 TAIVAILLRAFIDICKSGKCGLFGTGGLIMYDVYSADASYHLADVPLVLLLGVVGGILGS 356
            ++  +++ A + +  +    L      ++Y+V     +    D  L++  G++ G+ G 
Sbjct: 181 ASLGPVVISAVVALLTT---NLLNGSDALLYNVQ-LSVTVQARDYALIISTGLLAGLCGP 236

Query: 357 LYNFLLD 363
           L   L++
Sbjct: 237 LLLTLMN 243



 Score = 37.1 bits (86), Expect = 0.035
 Identities = 26/98 (26%), Positives = 45/98 (45%), Gaps = 4/98 (4%)

Query: 473 CFSLSVL-SYGIVAPAGLFVPAIVTGASYGRFVGMLVGSHSNLNHG---LYAVLGAASFL 528
           C  L+VL S G  AP G+F P +  G + G   G  +G           L  + G A+ L
Sbjct: 304 CKLLAVLASSGSGAPGGVFTPTLFVGLAIGMLYGRSLGLWLPDGEEITLLLGLTGMATLL 363

Query: 529 GGSMRMTVSLCVIILELTNNLLLLPMIMLVLLISKTVA 566
             +    +   ++I E+T    LLP +++  +I+  ++
Sbjct: 364 AATTHAPIMSTLMICEMTGEYQLLPGLLIACVIASVIS 401


>gnl|CDD|239984 cd04611, CBS_pair_PAS_GGDEF_DUF1_assoc, This cd contains two tandem
           repeats of the cystathionine beta-synthase (CBS pair)
           domains in association with a PAS domain, a GGDEF
           (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain
           downstream. PAS domains have been found to bind ligands,
           and to act as sensors for light and oxygen in signal
           transduction. The GGDEF domain has been suggested to be
           homologous to the adenylyl cyclase catalytic domain and
           is thought to be involved in regulating cell surface
           adhesiveness in bacteria. CBS is a small domain
           originally identified in cystathionine beta-synthase and
           subsequently found in a wide range of different
           proteins. CBS domains usually come in tandem repeats,
           which associate to form a so-called Bateman domain or a
           CBS pair which is reflected in this model. The interface
           between the two CBS domains forms a cleft that is a
           potential ligand binding site. The CBS pair coexists
           with a variety of other functional domains.  It has been
           proposed that the CBS domain may play a regulatory role,
           although its exact function is unknown.
          Length = 111

 Score = 39.5 bits (93), Expect = 7e-04
 Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 4/49 (8%)

Query: 717 SP-YTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFM 764
           SP  TV    SL  A  L RE G+RHL+V+    +G  + G+L++ D +
Sbjct: 64  SPLLTVPADTSLYDARQLMREHGIRHLVVVD--DDGELL-GLLSQTDLL 109


>gnl|CDD|201313 pfam00571, CBS, CBS domain.  CBS domains are small intracellular
           modules that pair together to form a stable globular
           domain. This family represents a single CBS domain.
           Pairs of these domains have been termed a Bateman
           domain. CBS domains have been shown to bind ligands with
           an adenosyl group such as AMP, ATP and S-AdoMet. CBS
           domains are found attached to a wide range of other
           protein domains suggesting that CBS domains may play a
           regulatory role making proteins sensitive to adenosyl
           carrying ligands. The region containing the CBS domains
           in Cystathionine-beta synthase is involved in regulation
           by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP.
           The CBS domains from IMPDH and the chloride channel CLC2
           bind ATP.
          Length = 57

 Score = 36.8 bits (86), Expect = 0.002
 Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 713 FANASPYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFM 764
                  TV    SL +AL L RE G+  L V+ +  +G  V GI+T  D +
Sbjct: 4   IMTPDVVTVPPDTSLEEALELMRENGISRLPVVDE--DGKLV-GIVTLRDLL 52



 Score = 28.3 bits (64), Expect = 1.9
 Identities = 11/62 (17%), Positives = 25/62 (40%), Gaps = 5/62 (8%)

Query: 599 VGDVVTAPLQLFHGIEKAGNVVHVLRTTRHNGFPVIDENPLSETPILYGLILRAHLITLL 658
           V D++T  +            + ++R    +  PV+DE+      +  G++    L+  L
Sbjct: 1   VKDIMTPDVVTVPPDTSLEEALELMRENGISRLPVVDED---GKLV--GIVTLRDLLRAL 55

Query: 659 KK 660
            +
Sbjct: 56  LE 57


>gnl|CDD|239654 cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins.
           This ClC family presents in bacteria, where it
           facilitates acid resistance in acidic soil. Mutation of
           this gene (sycA) in Rhizobium tropici CIAT899 causes
           serious deficiencies in nodule development, nodulation
           competitiveness, and N2 fixation on Phaseolus vulgaris
           plants, due to its reduced ability for acid resistance. 
           This family is part of the ClC chloride channel
           superfamiy. These proteins catalyse the selective flow
           of Cl- ions across cell membranes and Cl-/H+ exchange
           transport. These proteins share two characteristics that
           are apparently inherent to the entire ClC chloride
           channel superfamily: a unique double-barreled
           architecture and voltage-dependent gating mechanism. The
           gating is conferred by the permeating anion itself,
           acting as the gating charge.
          Length = 378

 Score = 40.3 bits (95), Expect = 0.003
 Identities = 24/71 (33%), Positives = 31/71 (43%), Gaps = 15/71 (21%)

Query: 210 VGKAGPMVHTGACVASLLGQGGSKKYGLTWKWLRFFK-NDRDRRDFVTCGSAAGIAAAFR 268
            G+ G  V  G  +A   G              R FK  + DRR  +  G AAG AA F 
Sbjct: 95  AGREGTAVQMGGSLADAFG--------------RVFKLPEEDRRILLIAGIAAGFAAVFG 140

Query: 269 APVGGLLFAIE 279
            P+ G +FA+E
Sbjct: 141 TPLAGAIFALE 151


>gnl|CDD|239067 cd02205, CBS_pair, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 113

 Score = 36.3 bits (84), Expect = 0.012
 Identities = 30/146 (20%), Positives = 46/146 (31%), Gaps = 49/146 (33%)

Query: 619 VVHVLRTTRHNGFPVIDENPLSETPILYGLILRAHLITLLKKKAFLPTPNPTVNDAFSQF 678
            + ++     +G PV+D++       L G++    L+  L +      P  TV D     
Sbjct: 15  ALRLMLEHGISGLPVVDDDG-----RLVGIVTERDLLRALAEG--GLDPLVTVGD----- 62

Query: 679 SAVDFAKRGSGNGDKIEDIELSEEEMEMYVDLHPFANASPYTVVETMSLAKALILFREVG 738
                                                    TV    SL +A  L  E G
Sbjct: 63  ----------------------------------VMTRDVVTVSPDTSLEEAAELMLEHG 88

Query: 739 LRHLLVIPKISNGSPVMGILTRHDFM 764
           +R L V+     G  V GI+TR D +
Sbjct: 89  IRRLPVV--DDEGRLV-GIVTRSDIL 111



 Score = 34.3 bits (79), Expect = 0.050
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 718 PYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFM 764
             TV    ++A+AL L  E G+  L V   + +   ++GI+T  D +
Sbjct: 3   VVTVSPDDTVAEALRLMLEHGISGLPV---VDDDGRLVGIVTERDLL 46


>gnl|CDD|240116 cd04803, CBS_pair_15, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 122

 Score = 35.7 bits (83), Expect = 0.021
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 3/47 (6%)

Query: 718 PYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFM 764
             T+ E  SLA A  L RE  +RHL   P ++    ++G+LT+ D +
Sbjct: 3   VVTLSEDDSLADAEELMREHRIRHL---PVVNEDGKLVGLLTQRDLL 46


>gnl|CDD|239973 cd04600, CBS_pair_HPP_assoc, This cd contains two tandem repeats of
           the cystathionine beta-synthase (CBS pair) domains
           associated with the HPP motif domain. These proteins are
           integral membrane proteins with four transmembrane
           spanning helices. The function of these proteins is
           uncertain, but they are thought to be transporters. CBS
           is a small domain originally identified in cystathionine
           beta-synthase and subsequently found in a wide range of
           different proteins. CBS domains usually come in tandem
           repeats, which associate to form a so-called Bateman
           domain or a CBS pair which is reflected in this model.
           The interface between the two CBS domains forms a cleft
           that is a potential ligand binding site. The CBS pair
           coexists with a variety of other functional domains.  It
           has been proposed that the CBS domain may play a
           regulatory role, although its exact function is unknown.
          Length = 124

 Score = 35.3 bits (82), Expect = 0.031
 Identities = 19/72 (26%), Positives = 28/72 (38%), Gaps = 7/72 (9%)

Query: 718 PYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFMPEHILALNPLLAR 777
             TV    SL +A  L R      +  +P +     ++GI+T+ D +        P   R
Sbjct: 4   VVTVTPDTSLEEAWALLRR---HRIKALPVVDGDRRLVGIVTQRDLL----RHARPDGRR 56

Query: 778 SRWKRLRIRFPP 789
               RLR R  P
Sbjct: 57  PLRGRLRGRDKP 68


>gnl|CDD|239961 cd04588, CBS_pair_CAP-ED_DUF294_assoc_arch, This cd contains two
           tandem repeats of the cystathionine beta-synthase (CBS
           pair) domains associated with the archaeal CAP_ED (cAMP
           receptor protein effector domain) family of
           transcription factors and the DUF294 domain.  Members of
           CAP_ED, include CAP which binds cAMP, FNR (fumarate and
           nitrate reductase) which uses an iron-sulfur cluster to
           sense oxygen, and CooA a heme containing CO sensor. In
           all cases binding of the effector leads to
           conformational changes and the ability to activate
           transcription. DUF294 is a putative
           nucleotidyltransferase with a conserved DxD motif. CBS
           is a small domain originally identified in cystathionine
           beta-synthase and subsequently found in a wide range of
           different proteins. CBS domains usually come in tandem
           repeats, which associate to form a so-called Bateman
           domain or a CBS pair which is reflected in this model.
           The interface between the two CBS domains forms a cleft
           that is a potential ligand binding site. The CBS pair
           coexists with a variety of other functional domains. It
           has been proposed that the CBS domain may play a
           regulatory role, although its exact function is unknown.
          Length = 110

 Score = 33.4 bits (77), Expect = 0.089
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 719 YTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHD 762
            T+ E   L  A+ L  +  +  L+V      G PV GI+TR D
Sbjct: 66  ITIDEDEQLYDAIRLMNKHNVGRLIVT--DDEGRPV-GIITRTD 106


>gnl|CDD|239958 cd04585, CBS_pair_ACT_assoc2, This cd contains two tandem repeats
           of the cystathionine beta-synthase (CBS pair) domains in
            the acetoin utilization proteins in bacteria. Acetoin
           is a product of fermentative metabolism in many
           prokaryotic and eukaryotic microorganisms.  They produce
           acetoin as an external carbon storage compound and then
           later reuse it as a carbon and energy source during
           their stationary phase and sporulation. In addition
           these CBS domains are associated with a downstream ACT
           domain, which is linked to a wide range of metabolic
           enzymes that are regulated by amino acid concentration.
           Pairs of ACT domains bind specifically to a particular
           amino acid leading to regulation of the linked enzyme.
           CBS is a small domain originally identified in
           cystathionine beta-synthase and subsequently found in a
           wide range of different proteins. CBS domains usually
           come in tandem repeats, which associate to form a
           so-called Bateman domain or a CBS pair which is
           reflected in this model. The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains.  It has been proposed that the
           CBS domain may play a regulatory role, although its
           exact function is unknown.
          Length = 122

 Score = 33.0 bits (76), Expect = 0.17
 Identities = 20/73 (27%), Positives = 31/73 (42%), Gaps = 7/73 (9%)

Query: 717 SPYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDF---MPEHILALNP 773
           +P TV    SL +AL L +E  +R L   P +  G  V GI+T  D     P     L+ 
Sbjct: 2   NPITVTPDTSLMEALKLMKENSIRRL---PVVDRGKLV-GIVTDRDLKLASPSKATTLDI 57

Query: 774 LLARSRWKRLRIR 786
                   ++++ 
Sbjct: 58  WELYYLLSKIKVS 70


>gnl|CDD|214522 smart00116, CBS, Domain in cystathionine beta-synthase and other
           proteins.  Domain present in all 3 forms of cellular
           life. Present in two copies in inosine monophosphate
           dehydrogenase, of which one is disordered in the crystal
           structure. A number of disease states are associated
           with CBS-containing proteins including homocystinuria,
           Becker's and Thomsen disease.
          Length = 49

 Score = 30.2 bits (69), Expect = 0.31
 Identities = 19/46 (41%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 718 PYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDF 763
             TV    +L +AL L RE G+R L V+     G  V GI+TR D 
Sbjct: 2   VVTVSPDTTLEEALELLRENGIRRLPVVD--EEGRLV-GIVTRRDI 44



 Score = 27.5 bits (62), Expect = 2.9
 Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 5/41 (12%)

Query: 619 VVHVLRTTRHNGFPVIDENPLSETPILYGLILRAHLITLLK 659
            + +LR       PV+DE        L G++ R  +I  L 
Sbjct: 14  ALELLRENGIRRLPVVDEE-----GRLVGIVTRRDIIKALA 49


>gnl|CDD|240003 cd04632, CBS_pair_19, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 128

 Score = 32.5 bits (74), Expect = 0.31
 Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 720 TVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHD 762
           TV E  S+ KA+ + RE G+  L   P + +   + GI+TRHD
Sbjct: 5   TVREDDSVGKAINVLREHGISRL---PVVDDNGKLTGIVTRHD 44


>gnl|CDD|225457 COG2905, COG2905, Predicted signal-transduction protein containing
           cAMP-binding and CBS domains [Signal transduction
           mechanisms].
          Length = 610

 Score = 33.9 bits (78), Expect = 0.34
 Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 6/104 (5%)

Query: 672 NDAFSQFSAVDFAKRGSGNGDKIEDIELSEEEMEMYVDLHPFANASPYTVVETMSLAKAL 731
           N  F+ F     AKR     D++ +   SE  +    ++         TV    S+  A 
Sbjct: 116 NPEFADFFLRSLAKRLRDIADRLAEQGESEFILTRVGEVK---TLPAVTVSPQASIQDAA 172

Query: 732 ILFREVGLRHLLVIPKISNGSPVMGILTRHDFMPEHILALNPLL 775
              ++ G+  L+V+   S   P++GI+TR D     I       
Sbjct: 173 RKMKDEGVSSLVVL-DDSG--PLLGIVTRKDLRSRVIADGRSKT 213


>gnl|CDD|233128 TIGR00792, gph, sugar (Glycoside-Pentoside-Hexuronide) transporter.
            The Glycoside-Pentoside-Hexuronide (GPH):Cation
           Symporter Family (TC 2.A.2) GPH:cation symporters
           catalyze uptake of sugars in symport with a monovalent
           cation (H+ or Na+). Members of this family includes
           transporters for melibiose, lactose, raffinose,
           glucuronides, pentosides and isoprimeverose. Mutants of
           two groups of these symporters (the melibiose permeases
           of enteric bacteria, and the lactose permease of
           Streptococcus thermophilus) have been isolated in which
           altered cation specificity is observed or in which sugar
           transport is uncoupled from cation symport (i.e.,
           uniport is catalyzed). The various members of the family
           can use Na+, H+ or Li, Na+ or Li+, H+ or Li+, or only H+
           as the symported cation. All of these proteins possess
           twelve putative transmembrane a-helical spanners
           [Transport and binding proteins, Carbohydrates, organic
           alcohols, and acids].
          Length = 437

 Score = 33.4 bits (77), Expect = 0.46
 Identities = 29/129 (22%), Positives = 52/129 (40%), Gaps = 23/129 (17%)

Query: 101 FLIGLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVT 160
           F  G+++ ++G+L            F   SN+ L     +  L+    N V  L  ++V 
Sbjct: 290 FAGGILLMVLGYLIF----------FFAGSNLPLI--LVLIILAGFGQNFVTGLVWALVA 337

Query: 161 ATVAPVAAGSGIPEVKAYLNGVDAPGI-LSFRTLVIKIIGSISAVSSSLLVGKAGPMVHT 219
            TV          +   +  GV A G+  S RT V K+  +++     L++G  G + + 
Sbjct: 338 DTV----------DYGEWKTGVRAEGLVYSVRTFVRKLGQALAGFLVGLILGIIGYVANA 387

Query: 220 GACVASLLG 228
                +L G
Sbjct: 388 AQSPITLNG 396


>gnl|CDD|237843 PRK14869, PRK14869, putative manganese-dependent inorganic
           pyrophosphatase; Provisional.
          Length = 546

 Score = 33.3 bits (77), Expect = 0.53
 Identities = 13/39 (33%), Positives = 24/39 (61%), Gaps = 5/39 (12%)

Query: 626 TRHNGFPVIDENPLSETPILYGLILRAHLITLLKKKAFL 664
           +R+  +PV+DE+       + G+I R HL++ ++KK  L
Sbjct: 276 SRYRSYPVVDEDG-----KVVGVISRYHLLSPVRKKVIL 309


>gnl|CDD|217334 pfam03041, Baculo_LEF-2, lef-2.  The lef-2 gene (for late
           expression factor 2) from baculovirus is required for
           expression of late genes. This gene has been shown to be
           specifically required for expression from the vp39 and
           polh promoters. LEF-1 is a DNA primase and there is some
           evidence to suggest that LEF-2 may bind to both DNA and
           LEF-1.
          Length = 164

 Score = 31.9 bits (73), Expect = 0.57
 Identities = 31/109 (28%), Positives = 34/109 (31%), Gaps = 33/109 (30%)

Query: 707 YVDLHPF--ANASPYTVVETMSLAKALILFREVGLR--HLLVIPK--------------- 747
            VD   F     SPYTV E   L       R  GLR  +LL  P                
Sbjct: 21  LVDPFDFDWLVLSPYTVFEPGGL-----YVRVSGLRLYNLLKNPPTFEESPSVNFSKKSK 75

Query: 748 --------ISNGSPVMGILTRHDFMPEHI-LALNPLLARSRWKRLRIRF 787
                         V+  L +    P  I  ALN L  R R  R R RF
Sbjct: 76  RNVCFKDCSGGRKSVIDKLRKLLNPPPCITKALNDLCVRPRGGRYRKRF 124


>gnl|CDD|240001 cd04630, CBS_pair_17, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 114

 Score = 30.7 bits (70), Expect = 1.1
 Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 2/44 (4%)

Query: 719 YTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHD 762
            T+    ++A+AL L +E G+  L+V  +    S   GI+T  D
Sbjct: 4   VTIDGLATVAEALQLMKEHGVSSLVVEKR--RESDAYGIVTMRD 45


>gnl|CDD|222060 pfam13347, MFS_2, MFS/sugar transport protein.  This family is part
           of the major facilitator superfamily of membrane
           transport proteins.
          Length = 425

 Score = 31.4 bits (72), Expect = 1.7
 Identities = 21/115 (18%), Positives = 40/115 (34%), Gaps = 22/115 (19%)

Query: 101 FLIGLIVGLIGFLNNLAVENIAGTKFVVTSNMMLANKFGMAFLSFSIPNLVLTLFASIVT 160
           FL+G+++  IG              F+   ++ L     +  L+     L   L  +++ 
Sbjct: 290 FLLGMLLAAIGL---------VLLFFLPPGSLWLF--LVLVVLAGIGLGLATLLPWAMLA 338

Query: 161 ATVAPVAAGSGIPEVKAYLNGVDAPGI-LSFRTLVIKIIGSISAVSSSLLVGKAG 214
             V          +   +  G    GI  S  +   K+ G++      LL+  AG
Sbjct: 339 DVV----------DYGEWKTGKRREGITFSLYSFADKLGGALGGAIVGLLLAFAG 383


>gnl|CDD|239792 cd04259, AAK_AK-DapDC, AAK_AK-DapDC: Amino Acid Kinase Superfamily
           (AAK), AK-DapDC; this CD includes the N-terminal
           catalytic aspartokinase (AK) domain of the bifunctional
           enzyme AK - DAP decarboxylase (DapDC) found in some
           bacteria. Aspartokinase is the first enzyme in the
           aspartate metabolic pathway, catalyzes the conversion of
           aspartate and ATP to aspartylphosphate and ADP. DapDC,
           which is the lysA gene product, catalyzes the
           decarboxylation of DAP to lysine.
          Length = 295

 Score = 31.4 bits (71), Expect = 1.8
 Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 20/145 (13%)

Query: 181 GVDAPGILS--FRTLVIKIIGSISAVSSSLLVGKAGPMVHTGACVASLLGQGGSKKYGLT 238
            VDA  +L+     L  + +  IS +  +    +A  ++  G  +++ LG    +  GL 
Sbjct: 79  EVDADALLANDLAQLQ-RWLTGISLLKQASPRTRA-EVLALGELMSTRLGAAYLEAQGLK 136

Query: 239 WKWLRFFKNDRDRRDFVTCGSAAGIAAAFRAPVGGLLFAIEEMASWWRSALLWRAFFTTA 298
            KWL       D R+ +T     G        +  L    E       +  L +      
Sbjct: 137 VKWL-------DARELLTATPTLGGE-----TMNYLSARCES----EYADALLQKRLADG 180

Query: 299 IVAILLRAFIDICKSGKCGLFGTGG 323
              I+ + FI     G+  L G GG
Sbjct: 181 AQLIITQGFIARNAHGETVLLGRGG 205


>gnl|CDD|240036 cd04680, Nudix_Hydrolase_21, Members of the Nudix hydrolase
           superfamily catalyze the hydrolysis of NUcleoside
           DIphosphates linked to other moieties, X. Enzymes
           belonging to this superfamily require a divalent cation,
           such as Mg2+ or Mn2+, for their activity and contain a
           highly conserved 23-residue nudix motif
           (GX5EX7REUXEEXGU, where U = I, L or V), which functions
           as a metal binding and catalytic site. Substrates of
           nudix hydrolases include intact and oxidatively damaged
           nucleoside triphosphates, dinucleoside polyphosphates,
           nucleotide-sugars and dinucleotide enzymes. These
           substrates are metabolites or cell signaling molecules
           that require regulation during different stages of the
           cell cycle or during periods of stress. In general, the
           role of the nudix hydrolase is to sanitize the
           nucleotide pools and to maintain cell viability, thereby
           serving as surveillance & "house-cleaning" enzymes.
           Substrate specificity is used to define families within
           the superfamily. Differences in substrate specificity
           are determined by the N-terminal extension or by
           residues in variable loop regions. Mechanistically,
           substrate hydrolysis occurs by a nucleophilic
           substitution reaction, with variation in the numbers and
           roles of divalent cations required.
          Length = 120

 Score = 30.0 bits (68), Expect = 1.9
 Identities = 21/72 (29%), Positives = 26/72 (36%), Gaps = 5/72 (6%)

Query: 6   PTEKMATAAAFENLNTAGVEGGQDPESLTVPLLSHQRSRSSISNSTSQVALVGADVCPIE 65
             E  A AA  E L   G+      E L V   S   S   +      V    AD  P+ 
Sbjct: 34  RGETFAEAARRELLEELGIRLAVVAELLGVYYHSASGSWDHVI-----VFRARADTQPVI 88

Query: 66  SLDYEIAENDFF 77
              +EI+E  FF
Sbjct: 89  RPSHEISEARFF 100


>gnl|CDD|239962 cd04589, CBS_pair_CAP-ED_DUF294_assoc_bac, This cd contains two
           tandem repeats of the cystathionine beta-synthase (CBS
           pair) domains associated with the bacterial CAP_ED (cAMP
           receptor protein effector domain) family of
           transcription factors and the DUF294 domain.  Members of
           CAP_ED, include CAP which binds cAMP, FNR (fumarate and
           nitrate reductase) which uses an iron-sulfur cluster to
           sense oxygen, and CooA a heme containing CO sensor. In
           all cases binding of the effector leads to
           conformational changes and the ability to activate
           transcription. DUF294 is a putative
           nucleotidyltransferase with a conserved DxD motif. CBS
           is a small domain originally identified in cystathionine
           beta-synthase and subsequently found in a wide range of
           different proteins. CBS domains usually come in tandem
           repeats, which associate to form a so-called Bateman
           domain or a CBS pair which is reflected in this model.
           CBS domains usually come in tandem repeats, which
           associate to form a so-called Bateman domain or a CBS
           pair which is reflected in this model.  The interface
           between the two CBS domains forms a cleft that is a
           potential ligand binding site. The CBS pair coexists
           with a variety of other functional domains. It has been
           proposed that the CBS domain may play a regulatory role,
           although its exact function is unknown.
          Length = 111

 Score = 29.5 bits (67), Expect = 2.0
 Identities = 20/57 (35%), Positives = 26/57 (45%), Gaps = 4/57 (7%)

Query: 718 PYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFMPEHILALNPL 774
           P  V  + S+  A  L RE G   LLV      G P +GI+TR D +   +L   P 
Sbjct: 3   PLIVDASTSIRDAARLMREHGADALLVRD----GDPRLGIVTRTDLLDAVLLDGLPS 55


>gnl|CDD|239983 cd04610, CBS_pair_ParBc_assoc, This cd contains two tandem repeats
           of the cystathionine beta-synthase (CBS pair) domains
           associated with a ParBc (ParB-like nuclease) domain
           downstream. CBS is a small domain originally identified
           in cystathionine beta-synthase and subsequently found in
           a wide range of different proteins. CBS domains usually
           come in tandem repeats, which associate to form a
           so-called Bateman domain or a CBS pair which is
           reflected in this model. The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains.  It has been proposed that the
           CBS domain may play a regulatory role, although its
           exact function is unknown.
          Length = 107

 Score = 29.3 bits (66), Expect = 2.4
 Identities = 8/19 (42%), Positives = 14/19 (73%)

Query: 619 VVHVLRTTRHNGFPVIDEN 637
           V+ +++ T H+GFPV+D  
Sbjct: 16  VIKLIKETGHDGFPVVDNG 34


>gnl|CDD|224902 COG1991, COG1991, Uncharacterized conserved protein [Function
           unknown].
          Length = 131

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 18/70 (25%), Positives = 26/70 (37%), Gaps = 6/70 (8%)

Query: 131 NMMLANKFGMAFLSFSIPNLVLTLFASIVTATVAPVAAGSGIPEVKA------YLNGVDA 184
             ++ +  G   L FS+  L + L ASI  A V      S   E         ++    A
Sbjct: 6   TKIILSNKGQISLEFSLLLLAIVLAASIAGAYVLISNFNSAAIEAYNAAKNAIWVYSNAA 65

Query: 185 PGILSFRTLV 194
              LS  T+V
Sbjct: 66  KDTLSNVTVV 75


>gnl|CDD|239988 cd04615, CBS_pair_2, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 113

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 13/47 (27%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 718 PYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFM 764
           P  VV    +A+A+      G R    +P + +   ++GI+TR+D +
Sbjct: 3   PSCVVLNTDIARAVAEMYTSGSR---ALPVVDDKKRLVGIITRYDVL 46


>gnl|CDD|240113 cd04800, CBS_pair_CAP-ED_DUF294_PBI_assoc2, This cd contains two
           tandem repeats of the cystathionine beta-synthase (CBS
           pair) domains associated with either the CAP_ED (cAMP
           receptor protein effector domain) family of
           transcription factors and the DUF294 domain or the PB1
           (Phox and Bem1p) domain.  Members of CAP_ED, include CAP
           which binds cAMP, FNR (fumarate and nitrate reductase)
           which uses an iron-sulfur cluster to sense oxygen, and
           CooA a heme containing CO sensor. In all cases binding
           of the effector leads to conformational changes and the
           ability to activate transcription. DUF294 is a putative
           nucleotidyltransferase with a conserved DxD motif. The
           PB1 domain adopts a beta-grasp fold, similar to that
           found in ubiquitin and Ras-binding domains. A motif,
           variously termed OPR, PC and AID, represents the most
           conserved region of the majority of PB1 domains, and is
           necessary for PB1 domain function. This function is the
           formation of PB1 domain heterodimers, although not all
           PB1 domain pairs associate. CBS is a small domain
           originally identified in cystathionine beta-synthase and
           subsequently found in a wide range of different
           proteins. CBS domains usually come in tandem repeats,
           which associate to form a so-called Bateman domain or a
           CBS pair which is reflected in this model. The interface
           between the two CBS domains forms a cleft that is a
           potential ligand binding site. The CBS pair coexists
           with a variety of other functional domains. It has been
           proposed that the CBS domain may play a regulatory role,
           although its exact function is unknown.
          Length = 111

 Score = 29.4 bits (67), Expect = 2.6
 Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 4/49 (8%)

Query: 716 ASPYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFM 764
           A P T+    ++ +AL+L  E G+ HL V   + +G  V G+++  D +
Sbjct: 65  APPITIPPDATVFEALLLMLERGIHHLPV---VDDGRLV-GVISATDLL 109


>gnl|CDD|240011 cd04640, CBS_pair_27, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 126

 Score = 29.6 bits (67), Expect = 2.7
 Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 3/52 (5%)

Query: 718 PYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDFMPEHIL 769
           P  +    S+ +AL L  + G+R LLV+    N    +G++T  D + E  +
Sbjct: 3   PIVIPADTSIDEALELMIKHGVRLLLVVDSDDN---FIGVITAVDLLGEEPI 51


>gnl|CDD|239980 cd04607, CBS_pair_NTP_transferase_assoc, This cd contains two
           tandem repeats of the cystathionine beta-synthase (CBS
           pair) domain associated with the NTP (Nucleotidyl
           transferase) domain downstream.  CBS is a small domain
           originally identified in cystathionine beta-synthase and
           subsequently found in a wide range of different
           proteins. CBS domains usually come in tandem repeats,
           which associate to form a so-called Bateman domain or a
           CBS pair which is reflected in this model. The interface
           between the two CBS domains forms a cleft that is a
           potential ligand binding site. The CBS pair coexists
           with a variety of other functional domains.  It has been
           proposed that the CBS domain may play a regulatory role,
           although its exact function is unknown.
          Length = 113

 Score = 29.0 bits (66), Expect = 3.3
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 714 ANASPYTVVETMSLAKALILFREVGLRHLLVIPKISNGSPVMGILTRHDF 763
            N +P T     S  + L L RE  +RH   +P +     V+G+ T  D 
Sbjct: 64  MNRNPITAKVGSSREEILALMRERSIRH---LPILDEEGRVVGLATLDDL 110


>gnl|CDD|130176 TIGR01106, ATPase-IIC_X-K, sodium or proton efflux -- potassium
           uptake antiporter, P-type ATPase, alpha subunit.  This
           model describes the P-type ATPases responsible for the
           exchange of either protons or sodium ions for potassium
           ions across the plasma membranes of eukaryotes. Unlike
           most other P-type ATPases, members of this subfamily
           require a beta subunit for activity. This model
           encompasses eukaryotes and consists of two functional
           types, a Na/K antiporter found widely distributed in
           eukaryotes and a H/K antiporter found only in
           vertebrates. The Na+ or H+/K+ antiporter P-type ATPases
           have been characterized as Type IIC based on a published
           phylogenetic analysis. Sequences from Blastocladiella
           emersonii (GP|6636502, GP|6636502 and PIR|T43025), C.
           elegans (GP|2315419, GP|6671808 and PIR|T31763) and
           Drosophila melanogaster (GP|7291424) score below trusted
           cutoff, apparently due to long branch length (excessive
           divergence from the last common ancestor) as evidenced
           by a phylogenetic tree. Experimental evidence is needed
           to determine whether these sequences represent ATPases
           with conserved function. Aside from fragments, other
           sequences between trusted and noise appear to be
           bacterial ATPases of unclear lineage, but most likely
           calcium pumps [Energy metabolism, ATP-proton motive
           force interconversion].
          Length = 997

 Score = 30.9 bits (70), Expect = 3.3
 Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 57  VGADVCPIESLDYEIAENDFFKEDWRTRGRNQMI--QYIFMKWSLCFLIGLIVGLIGFL 113
           +G D+ P  SL YE AE+D  K   R    ++++  + I M +     IG+I  L GF 
Sbjct: 786 LGTDMVPAISLAYEKAESDIMKRQPRNPKTDKLVNERLISMAY---GQIGMIQALGGFF 841


>gnl|CDD|227821 COG5534, COG5534, Plasmid replication initiator protein [DNA
           replication, recombination, and repair].
          Length = 383

 Score = 30.4 bits (68), Expect = 3.7
 Identities = 8/36 (22%), Positives = 17/36 (47%)

Query: 59  ADVCPIESLDYEIAENDFFKEDWRTRGRNQMIQYIF 94
            D   ++ L  E+   D+ +E++R  G   ++  I 
Sbjct: 177 TDARALKPLGEELILPDWLREEYRGVGDTALVLTID 212


>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
           unknown].
          Length = 543

 Score = 30.5 bits (69), Expect = 3.9
 Identities = 17/69 (24%), Positives = 33/69 (47%), Gaps = 2/69 (2%)

Query: 663 FLPTPNPTVNDAFSQFSAVDFAKRGSGNGDKIEDIELSEEEMEMYVDLH-PFANASPYTV 721
           F     P +N+A S +    + +      DK+ ++++SE+ ME  ++   P    +P T 
Sbjct: 45  FQDIEKPRINNATSPYKMNAYFEISPAE-DKVYEVKISEDSMEATLESTVPGVEGAPVTK 103

Query: 722 VETMSLAKA 730
            E ++  KA
Sbjct: 104 DEILNALKA 112


>gnl|CDD|222537 pfam14093, DUF4271, Domain of unknown function (DUF4271).  This
           family of integral membrane proteins is functionally
           uncharacterized. This family of proteins is found in
           bacteria. Proteins in this family are typically between
           221 and 326 amino acids in length.
          Length = 207

 Score = 29.5 bits (67), Expect = 4.6
 Identities = 18/77 (23%), Positives = 28/77 (36%), Gaps = 6/77 (7%)

Query: 433 YYNDLASLIFNTNDDAIRNLFSKDTDKEFQHSSMLIFFVTCFSLSVLSYGIVAPAGLFVP 492
            + D   L F          F  +T KE +   + +F  TC  LS+  Y  +   GL + 
Sbjct: 26  RFKDFFLLKFR------DKYFIIETGKENRPFQLFLFLQTCLLLSLFIYLYLRDFGLELF 79

Query: 493 AIVTGASYGRFVGMLVG 509
             V    +G   G +  
Sbjct: 80  FSVQLIVFGLIFGGVFL 96


>gnl|CDD|239986 cd04613, CBS_pair_SpoIVFB_EriC_assoc2, This cd contains two tandem
           repeats of the cystathionine beta-synthase (CBS pair)
           domains in association with either the SpoIVFB domain
           (sporulation protein, stage IV cell wall formation, F
           locus, promoter-distal B) or the chloride channel
           protein EriC.  SpoIVFB is one of 4 proteins involved in
           endospore formation; the others are SpoIVFA (sporulation
           protein, stage IV cell wall formation, F locus,
           promoter-proximal A), BofA (bypass-of-forespore A ), and
           SpoIVB (sporulation protein, stage IV cell wall
           formation, B locus).  SpoIVFB is negatively regulated by
           SpoIVFA and BofA and activated by SpoIVB.  It is thought
           that SpoIVFB, SpoIVFA, and BofA are located in the
           mother-cell membrane that surrounds the forespore and
           that SpoIVB is secreted from the forespore into the
           space between the two where it activates SpoIVFB. EriC
           is involved in inorganic ion transport and metabolism.
           CBS is a small domain originally identified in
           cystathionine beta-synthase and subsequently found in a
           wide range of different proteins. CBS domains usually
           come in tandem repeats, which associate to form a
           so-called Bateman domain or a CBS pair which is
           reflected in this model. The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains.  It has been proposed that the
           CBS domain may play a regulatory role, although its
           exact function is unknown.
          Length = 114

 Score = 28.7 bits (65), Expect = 4.8
 Identities = 24/148 (16%), Positives = 43/148 (29%), Gaps = 48/148 (32%)

Query: 617 GNVVHVLRTTRHNGFPVIDENPLSETPILYGLILRAHLITLLKKKAFLPTPNPTVNDAFS 676
             ++ V+  +  N FPV+D++       L G++    +  +L                  
Sbjct: 13  NELLDVIAHSPENNFPVVDDDGR-----LVGIVSLDDIREILF----------------- 50

Query: 677 QFSAVDFAKRGSGNGDKIEDIELSEEEMEMYVDLHPFANASPYTVVETMSLAKALILFRE 736
                                   +  +   V         P  V    SL  AL  F +
Sbjct: 51  ------------------------DPSLYDLVVASDIMTKPPVVVYPEDSLEDALKKFED 86

Query: 737 VGLRHLLVIPKISNGSPVMGILTRHDFM 764
                L V+        ++GIL+R D +
Sbjct: 87  SDYEQLPVVDDDPGK--LLGILSRSDLL 112


>gnl|CDD|179591 PRK03545, PRK03545, putative arabinose transporter; Provisional.
          Length = 390

 Score = 29.5 bits (67), Expect = 8.6
 Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 14/60 (23%)

Query: 341 VPLVLLLGVVGGILGS-LYNFLLDKVLRIYNFINEKGIAPKIFLACTISILTSCLLFGLP 399
             L+LLL    GI+GS L++ L ++              P  FL   I++L  CLL  LP
Sbjct: 243 ATLLLLLFGGAGIIGSVLFSRLGNR-------------HPSGFLLIAIALLLVCLLLLLP 289


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.139    0.419 

Gapped
Lambda     K      H
   0.267   0.0828    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,539,563
Number of extensions: 4278919
Number of successful extensions: 6686
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6587
Number of HSP's successfully gapped: 179
Length of query: 794
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 689
Effective length of database: 6,280,432
Effective search space: 4327217648
Effective search space used: 4327217648
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 63 (27.9 bits)