RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6631
         (704 letters)



>gnl|CDD|239655 cd03683, ClC_1_like, ClC-1-like chloride channel proteins. This CD
           includes isoforms ClC-0, ClC-1, ClC-2 and ClC_K. ClC-1
           is expressed in skeletal muscle and its mutation leads
           to both recessively and dominantly-inherited forms of
           muscle stiffness or myotonia. ClC-K is exclusively
           expressed in kidney. Similarly, mutation of ClC-K leads
           to nephrogenic diabetes insipidus in mice and Bartter's
           syndrome in human. These proteins belong to the ClC
           superfamily of chloride ion channels, which share the
           unique double-barreled architecture and
           voltage-dependent gating mechanism.  The gating is
           conferred by the permeating anion itself, acting as the
           gating charge. This domain is found in the eukaryotic
           halogen ion (Cl-, Br- and I-) channel proteins, that
           perform a variety of functions including cell volume
           regulation, regulation of intracelluar chloride
           concentration, membrane potential stabilization, charge
           compensation necessary for the acidification of
           intracellular organelles and transepithelial chloride
           transport.
          Length = 426

 Score =  382 bits (983), Expect = e-126
 Identities = 161/313 (51%), Positives = 208/313 (66%), Gaps = 50/313 (15%)

Query: 98  GALYKGVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPTNFTM 157
           GA   GVLFSIEVT+ YFA+RNYWRGFF A CGA  FRLLAV+F ++ET+ A F T F +
Sbjct: 164 GAPIGGVLFSIEVTSTYFAVRNYWRGFFAATCGAFTFRLLAVFFSDQETITALFKTTFFV 223

Query: 158 DFPFDPQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLYPGIVVL 217
           DFPFD QEL +FAL+G++CG  GA +V+ HR+ V F R+N+  + FL+++  LYP IV L
Sbjct: 224 DFPFDVQELPIFALLGIICGLLGALFVFLHRKIVRFRRKNRLFSKFLKRSPLLYPAIVAL 283

Query: 218 LATSVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVFVS 277
           L   ++FP                                                 F++
Sbjct: 284 LTAVLTFP-------------------------------------------------FLT 294

Query: 278 LACFMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPI 337
           L  F++  ++ + +A T+PVP+G F+PVF IGAA GR++GEI+ + FP+GI  G    PI
Sbjct: 295 LFLFIVVKFVLTALAITLPVPAGIFMPVFVIGAALGRLVGEIMAVLFPEGIRGGI-SNPI 353

Query: 338 IPGGYATVGAAAFSGAVTHTISVSVIMFEMTGQITHIIPVMIAVLISNAVAALLQPSLYD 397
            PGGYA VGAAAFSGAVTHT+SV+VI+FE+TGQI+H++PV+IAVLISNAVA  LQPS+YD
Sbjct: 354 GPGGYAVVGAAAFSGAVTHTVSVAVIIFELTGQISHLLPVLIAVLISNAVAQFLQPSIYD 413

Query: 398 SIILIKKLPYLPD 410
           SII IKKLPYLPD
Sbjct: 414 SIIKIKKLPYLPD 426



 Score = 52.6 bits (127), Expect = 4e-07
 Identities = 21/29 (72%), Positives = 25/29 (86%)

Query: 458 LVESSVAALLQPSLYDSIILIKKLPYLPD 486
           L+ ++VA  LQPS+YDSII IKKLPYLPD
Sbjct: 398 LISNAVAQFLQPSIYDSIIKIKKLPYLPD 426


>gnl|CDD|238507 cd01036, ClC_euk, Chloride channel, ClC.  These domains are found
           in the eukaryotic halogen ion (Cl-, Br- and I-) channel
           proteins that perform a variety of functions including
           cell volume regulation, membrane potential
           stabilization, charge compensation necessary for the
           acidification of intracellular organelles, signal
           transduction and transepithelial transport.  They are
           also involved in many pathophysiological processes and
           are responsible for a number of human diseases.  These
           proteins belong to the ClC superfamily of chloride ion
           channels, which share the unique double-barreled
           architecture and voltage-dependent gating mechanism.
           The gating is conferred by the permeating anion itself,
           acting as the gating charge.  Some proteins possess long
           C-terminal cytoplasmic regions containing two CBS
           (cystathionine beta synthase) domains of putative
           regulatory function.
          Length = 416

 Score =  211 bits (538), Expect = 2e-61
 Identities = 85/306 (27%), Positives = 129/306 (42%), Gaps = 58/306 (18%)

Query: 98  GALYKGVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETM-----RAFFP 152
           GA   G+LF +E  + +F +R  WR FF A+  A V ++   +    E +          
Sbjct: 163 GAPIGGLLFVLEEVSTFFPVRLAWRVFFAALVSAFVIQIYNSFNSGFELLDRSSAMFLSL 222

Query: 153 TNFTMDFPFDPQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLYP 212
           T F +  P +  E     +IGV+CG   A +V     ++ + RR         + R L P
Sbjct: 223 TVFELHVPLNLYEFIPTVVIGVICGLLAALFVRLSIIFLRWRRRLLF--RKTARYRVLEP 280

Query: 213 GIVVLLATSVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNT 272
            +  L+ +++ +                                                
Sbjct: 281 VLFTLIYSTIHY------------------------------------------------ 292

Query: 273 DVFVSLACFMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGK 332
               +L  F+L  +  S +A  I VP G+FIP   IGAA GR++G ++      GI    
Sbjct: 293 --APTLLLFLLIYFWMSALAFGIAVPGGTFIPSLVIGAAIGRLVGLLVHRIAVAGIGAES 350

Query: 333 FIAPIIPGGYATVGAAAFSGAVT-HTISVSVIMFEMTGQITHIIPVMIAVLISNAVAALL 391
                 PG YA +GAAAF G  T  T S+ VIM E+TG + H++P+M+A+LI+ AVA   
Sbjct: 351 ATLWADPGVYALIGAAAFLGGTTRLTFSICVIMMELTGDLHHLLPLMVAILIAKAVADAF 410

Query: 392 QPSLYD 397
             SLY 
Sbjct: 411 CESLYH 416


>gnl|CDD|216046 pfam00654, Voltage_CLC, Voltage gated chloride channel.  This
           family of ion channels contains 10 or 12 transmembrane
           helices. Each protein forms a single pore. It has been
           shown that some members of this family form homodimers.
           In terms of primary structure, they are unrelated to
           known cation channels or other types of anion channels.
           Three ClC subfamilies are found in animals. ClC-1 is
           involved in setting and restoring the resting membrane
           potential of skeletal muscle, while other channels play
           important parts in solute concentration mechanisms in
           the kidney. These proteins contain two pfam00571
           domains.
          Length = 345

 Score =  156 bits (397), Expect = 3e-42
 Identities = 75/291 (25%), Positives = 117/291 (40%), Gaps = 59/291 (20%)

Query: 103 GVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPTNFTMDFPFD 162
           GVLF++E  +  F+ R        +V  A V RLL            F   +     P  
Sbjct: 107 GVLFALEELSRSFSYRALLPVLVASVVAALVSRLLFGNE------PLFEVPSLP---PLS 157

Query: 163 PQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLYPGIVVLLATSV 222
             EL +F L+G++CG  GA +V    +     RR KK          L P +  LL   +
Sbjct: 158 LLELPLFILLGILCGLLGALFVRLLLKVERLFRRLKK------LPPILRPALGGLLVGLL 211

Query: 223 SFP----LGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVFVSL 278
                  LG G         +  +  L +                       +T   + L
Sbjct: 212 GLFLPEVLGGG---------YGLIQLLLNG----------------------STLSLLLL 240

Query: 279 ACFMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPII 338
              +L   + + ++     P G F P   IGAA GR++G ++ L FP             
Sbjct: 241 LLLLLLKLLATALSLGSGAPGGIFAPSLFIGAALGRLLGLLLPLLFPGI--------APS 292

Query: 339 PGGYATVGAAAFSGAVTHT-ISVSVIMFEMTGQITHIIPVMIAVLISNAVA 388
           PG +A +G AAF   VT   ++  V++FE+TG  + ++P+M+AVLI+  V+
Sbjct: 293 PGAFALLGMAAFLAGVTRAPLTAIVLVFELTGSYSLLLPLMLAVLIAYLVS 343


>gnl|CDD|239656 cd03684, ClC_3_like, ClC-3-like chloride channel proteins.  This CD
            includes ClC-3, ClC-4, ClC-5 and ClC-Y1. ClC-3 was
           initially cloned from rat kidney. Expression of ClC-3
           produces outwardly-rectifying Cl currents that are
           inhibited by protein kinase C activation. It has been
           suggested that ClC-3 may be a ubiquitous
           swelling-activated Cl channel that has very similar
           characteristics to those of native volume-regulated Cl
           currents. The function of ClC-4 is unclear. Studies of
           human ClC-4 have revealed that it gives rise to Cl
           currents that rapidly activate at positive voltages, and
           are sensitive to extracellular pH, with currents
           decreasing when pH falls below 6.5. ClC-4 is broadly
           distributed, especially in brain and heart.   ClC-5 is
           predominantly expressed in the kidney, but can be found
           in the brain and liver. Mutations in the ClC-5 gene
           cause certain hereditary diseases, including Dent's
           disease, an X-chromosome linked syndrome characterised
           by proteinuria, hypercalciuria, and kidney stones
           (nephrolithiasis), leading to progressive renal failure.
             These proteins belong to the ClC superfamily of
           chloride ion channels, which share the unique
           double-barreled architecture and voltage-dependent
           gating mechanism. The gating is conferred by the
           permeating anion itself, acting as the gating charge.
           This domain is found in the eukaryotic halogen ion (Cl-
           and I-) channel proteins, that perform a variety of
           functions including cell volume regulation, the membrane
           potential stabilization, transepithelial chloride
           transport and charge compensation necessary for the
           acidification of intracellular organelles.
          Length = 445

 Score =  150 bits (381), Expect = 3e-39
 Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 47/325 (14%)

Query: 103 GVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAF-FPTNFTMDFPF 161
           GVLFS+E  + YF ++  WR FF A+  A   + L  +     T R   F   +  D+ +
Sbjct: 149 GVLFSLEEVSYYFPLKTLWRSFFCALVAAFTLKSLNPF----GTGRLVLFEVEYDRDWHY 204

Query: 162 DPQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLYPGI-----VV 216
              EL  F L+G+  G  GA           F++ N K   F +K+      +     V 
Sbjct: 205 --FELIPFILLGIFGGLYGA----------FFIKANIKWARFRKKSLLKRYPVLEVLLVA 252

Query: 217 LLATSVSFPLGLGKYMAGDLNTHDQLSSLF-----SNFTWTKGHFTVEEQEVLKHWTTRN 271
           L+   +SFP    +     L+  + L  LF      +      +                
Sbjct: 253 LITALISFPNPYTR-----LDMTELLELLFNECEPGDDNSLCCYRDPPAG-------DGV 300

Query: 272 TDVFVSLACFMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIG---EIIFLRFPDGI 328
                SL   ++   + +I    I VP+G F+P   +GA FGR++G   E +   +PD I
Sbjct: 301 YKALWSLLLALIIKLLLTIFTFGIKVPAGIFVPSMAVGALFGRIVGILVEQLAYSYPDSI 360

Query: 329 ---THGKFIAPIIPGGYATVGAAAFSGAVTH-TISVSVIMFEMTGQITHIIPVMIAVLIS 384
                    + I PG YA VGAAAF G VT  T+S+ VIMFE+TG + +I+P+MIAV++S
Sbjct: 361 FFACCTAGPSCITPGLYAMVGAAAFLGGVTRMTVSLVVIMFELTGALNYILPLMIAVMVS 420

Query: 385 NAVAALLQP-SLYDSIILIKKLPYL 408
             VA  +    +YD+ I +   P+L
Sbjct: 421 KWVADAIGKEGIYDAHIHLNGYPFL 445


>gnl|CDD|239657 cd03685, ClC_6_like, ClC-6-like chloride channel proteins. This CD
           includes ClC-6, ClC-7 and ClC-B, C, D in plants.
           Proteins in this family are ubiquitous in eukarotes and
           their functions are unclear. They are expressed in
           intracellular organelles membranes.  This family belongs
           to the ClC superfamily of chloride ion channels, which
           share the unique double-barreled architecture and
           voltage-dependent gating mechanism. The gating is
           conferred by the permeating anion itself, acting as the
           gating charge. ClC chloride ion channel superfamily
           perform a variety of functions including cellular
           excitability regulation, cell volume regulation,
           membrane potential stabilization, acidification of
           intracellular organelles, signal transduction, and
           transepithelial transport in animals.
          Length = 466

 Score =  126 bits (320), Expect = 4e-31
 Identities = 87/327 (26%), Positives = 129/327 (39%), Gaps = 87/327 (26%)

Query: 98  GALYKGVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPTNFTM 157
           GA   GVLFS+E    ++     WR FF+++              N      F P    M
Sbjct: 204 GAPVGGVLFSLEEVASFWNQALTWRTFFSSMIVTFTLNFFLSG-CNSGKCGLFGPGGLIM 262

Query: 158 DFPFDPQ------ELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLY 211
                 +      EL  F LIGV+ G  GA +             N K+  F  + R  +
Sbjct: 263 FDGSSTKYLYTYFELIPFMLIGVIGGLLGALFNH----------LNHKVTRF--RKRINH 310

Query: 212 PG---------IVVLLATSVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQE 262
            G         +V L+ + V+FP                                     
Sbjct: 311 KGKLLKVLEALLVSLVTSVVAFP------------------------------------- 333

Query: 263 VLKHWTTRNTDVFVSLACFMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFL 322
                         +L  F +  Y  +     I VPSG FIP+  IGAA+GR++G  I L
Sbjct: 334 -------------QTLLIFFVLYYFLACWTFGIAVPSGLFIPMILIGAAYGRLVG--ILL 378

Query: 323 RFPDGITHGKFIAPIIPGGYATVGAAAF-SGAVTHTISVSVIMFEMTGQITHIIPVMIAV 381
               G T       I PG YA +GAAAF  G +  T+S++VI+ E+T  +T++ P+M+ +
Sbjct: 379 GSYFGFTS------IDPGLYALLGAAAFLGGVMRMTVSLTVILLELTNNLTYLPPIMLVL 432

Query: 382 LISNAVAALLQPSLYDSIILIKKLPYL 408
           +I+  V       +YD II +K +P+L
Sbjct: 433 MIAKWVGDYFNEGIYDIIIQLKGVPFL 459


>gnl|CDD|238233 cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel. The
           ClC chloride channels catalyse the selective flow of Cl-
           ions across cell membranes, thereby regulating
           electrical excitation in skeletal muscle and the flow of
           salt and water across epithelial barriers. This domain
           is found in the halogen ions (Cl-, Br- and I-) transport
           proteins of the ClC family.  The ClC channels are found
           in all three kingdoms of life and perform a variety of
           functions including cellular excitability regulation,
           cell volume regulation, membrane potential
           stabilization, acidification of intracellular
           organelles, signal transduction, transepithelial
           transport in animals, and the extreme acid resistance
           response in eubacteria.  They lack any structural or
           sequence similarity to other known ion channels and
           exhibit unique properties of ion permeation and gating. 
           Unlike cation-selective ion channels, which form
           oligomers containing a single pore along the axis of
           symmetry, the ClC channels form two-pore homodimers with
           one pore per subunit without axial symmetry.  Although
           lacking the typical voltage-sensor found in cation
           channels, all studied ClC channels are gated (opened and
           closed) by transmembrane voltage. The gating is
           conferred by the permeating ion itself, acting as the
           gating charge.  In addition, eukaryotic and some
           prokaryotic ClC channels have two additional C-terminal
           CBS (cystathionine beta synthase) domains of putative
           regulatory function.
          Length = 383

 Score =  114 bits (288), Expect = 2e-27
 Identities = 61/286 (21%), Positives = 104/286 (36%), Gaps = 57/286 (19%)

Query: 103 GVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPTNFTMDFPFD 162
           G LF+IEV    +++ +       +V  A V RLL           A       +  P  
Sbjct: 152 GALFAIEVLLGEYSVASLIPVLLASVAAALVSRLL---------FGAEPAFGVPLYDPLS 202

Query: 163 PQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLYPGI--VVLLAT 220
             EL ++ L+G++ G  G  +V          R  +++        +L P +  ++L   
Sbjct: 203 LLELPLYLLLGLLAGLVGVLFVRL---LYKIERLFRRLP----IPPWLRPALGGLLLGLL 255

Query: 221 SVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVFVSLAC 280
            +  P  LG                        G+  +                 + L  
Sbjct: 256 GLFLPQVLGS-----------------------GYGAI-------LLALAGELSLLLLLL 285

Query: 281 FMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPIIPG 340
            +L   + + +      P G F P   IGAA G   G ++   FP             PG
Sbjct: 286 LLLLKLLATALTLGSGFPGGVFAPSLFIGAALGAAFGLLLPALFPG--------LVASPG 337

Query: 341 GYATVGAAAFSGAVTHT-ISVSVIMFEMTGQITHIIPVMIAVLISN 385
            YA VG AA   AV    ++  +++ E+TG  + ++P+M+AV+I+ 
Sbjct: 338 AYALVGMAALLAAVLRAPLTAILLVLELTGDYSLLLPLMLAVVIAY 383


>gnl|CDD|238504 cd01031, EriC, ClC chloride channel EriC.  This domain is found in
           the EriC chloride transporters that mediate the extreme
           acid resistance response in eubacteria and archaea. This
           response allows bacteria to survive in the acidic
           environments by decarboxylation-linked proton
           utilization. As shown for Escherichia coli EriC, these
           channels can counterbalance the electric current
           produced by the outwardly directed virtual proton pump
           linked to amino acid decarboxylation.  The EriC proteins
           belong to the ClC superfamily of chloride ion channels,
           which share a unique double-barreled architecture and
           voltage-dependent gating mechanism.  The
           voltage-dependent gating is conferred by the permeating
           anion itself, acting as the gating charge. In
           Escherichia coli EriC, a glutamate residue that
           protrudes into the pore is thought to participate in
           gating by binding to a Cl- ion site within the
           selectivity filter.
          Length = 402

 Score = 97.2 bits (243), Expect = 1e-21
 Identities = 65/305 (21%), Positives = 109/305 (35%), Gaps = 63/305 (20%)

Query: 103 GVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPTNFTMDFP-- 160
           GVLF +E     F+          ++    V RL             FF     +  P  
Sbjct: 153 GVLFVLEELRHSFSPLALLTALVASIAADFVSRL-------------FFGLGPVLSIPPL 199

Query: 161 --FDPQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLYPGIVVLL 218
                +   +  L+G++ G  G  +  S    +L  +         +  R L   +  LL
Sbjct: 200 PALPLKSYWLLLLLGIIAGLLGYLFNRS----LLKSQ--DLYRKLKKLPRELRVLLPGLL 253

Query: 219 ATSVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVFVSL 278
                 PLGL             L           GH  +              +  +SL
Sbjct: 254 ----IGPLGL------------LLPEALGG-----GHGLILSLA--------GGNFSISL 284

Query: 279 ACFMLYT-YIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPI 337
              +    +IF++++     P G F P+  +GA  G + G I+    P        I   
Sbjct: 285 LLLIFVLRFIFTMLSYGSGAPGGIFAPMLALGALLGLLFGTILVQLGP--------IPIS 336

Query: 338 IPGGYATVGAAAFSGAVTHT-ISVSVIMFEMTGQITHIIPVMIAVLISNAVAALLQ-PSL 395
            P  +A  G AAF  AV    I+  +++ EMTG    ++P+M+  L++  VA LL    +
Sbjct: 337 APATFAIAGMAAFFAAVVRAPITAIILVTEMTGNFNLLLPLMVVCLVAYLVADLLGGKPI 396

Query: 396 YDSII 400
           Y++++
Sbjct: 397 YEALL 401


>gnl|CDD|223116 COG0038, EriC, Chloride channel protein EriC [Inorganic ion
           transport and metabolism].
          Length = 443

 Score = 94.3 bits (235), Expect = 2e-20
 Identities = 68/317 (21%), Positives = 110/317 (34%), Gaps = 67/317 (21%)

Query: 103 GVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPTNFTMDFPFD 162
           G LF+IEV       R         V  A V  LL            F   +F +     
Sbjct: 180 GALFAIEV----LYGRFLEYRALVPVLVAAVVALL--------VAGLFGGPHFLLPIVTT 227

Query: 163 ----PQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQK---NRFLYPGIV 215
                 +L ++ ++G++ G  G          VL  R       F ++      L P + 
Sbjct: 228 PHMSLWDLLLYLVLGIIAGLFG----------VLLSRLLALSRRFFRRLPLPPLLRPALG 277

Query: 216 VLLATSVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVF 275
            LL  ++             L       +L                            + 
Sbjct: 278 GLLVGALGLLFPEVLGNGYGLI----QLALAGEGGLL-------------------VLLL 314

Query: 276 VSLACFMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIA 335
           + L        + ++++     P G F P   IGAA G   G ++ L FP  I       
Sbjct: 315 LFLLKL-----LATLLSYGSGAPGGIFAPSLFIGAALGLAFGALLGLLFPPSIL------ 363

Query: 336 PIIPGGYATVGAAAFSGAVTHT-ISVSVIMFEMTGQITHIIPVMIAVLISNAVAALLQP- 393
              PG +A +G AAF  A T   ++  V++ EMTG    ++P++IA LI+  V+ LL   
Sbjct: 364 --EPGLFALLGMAAFLAATTRAPLTAIVLVLEMTGNYQLLLPLLIACLIAYLVSRLLGGR 421

Query: 394 SLYDSIILIKKLPYLPD 410
            +Y  ++  +  P L +
Sbjct: 422 PIYTQLLARRGAPILRE 438



 Score = 30.4 bits (69), Expect = 4.1
 Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 17/91 (18%)

Query: 308 IGAAFGRMIGEIIFLRFPDGITHGKFIAPIIPGGYATVGAAAFSGAVTHT-ISVSVIMFE 366
           IGAA G ++G ++ L   D                   GAAA   A  +  ++ ++   E
Sbjct: 139 IGAAIGSLLGRLLKLSREDRRI------------LLAAGAAAGLAAAFNAPLAGALFAIE 186

Query: 367 MTGQITH----IIPVMIAVLISNAVAALLQP 393
           +          ++PV++A +++  VA L   
Sbjct: 187 VLYGRFLEYRALVPVLVAAVVALLVAGLFGG 217


>gnl|CDD|234987 PRK01862, PRK01862, putative voltage-gated ClC-type chloride
           channel ClcB; Provisional.
          Length = 574

 Score = 59.0 bits (143), Expect = 5e-09
 Identities = 60/253 (23%), Positives = 97/253 (38%), Gaps = 61/253 (24%)

Query: 159 FPFD-PQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRF--LYPGIV 215
           FP     E+ +F  +GV+CG               F+R          KN+F  L   + 
Sbjct: 231 FPAVTGWEVLLFVALGVLCGAAAP----------QFLRL-----LDASKNQFKRLPVPLP 275

Query: 216 VLLATSVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLK-HWTTRNTDV 274
           V LA        LG  + G +       S++    W  G+  V    +L   WT      
Sbjct: 276 VRLA--------LGGLLVGVI-------SVWVPEVWGNGYSVV--NTILHAPWTW----- 313

Query: 275 FVSLACFMLYTYIFSIIASTIPVPSGS----FIPVFKIGAAFGRMIGEIIFLRFPDGITH 330
             +L   +    +  +IA+     SG+    F P   +GA  G + G  +   +P   + 
Sbjct: 314 -QALVAVL----VAKLIATAATAGSGAVGGVFTPTLFVGAVVGSLFGLAMHALWPGHTSA 368

Query: 331 GKFIAPIIPGGYATVGAAAFSGAVTHTISVSVIM-FEMTGQITHIIPVMIAVLISNAVAA 389
                   P  YA VG  AF    T    ++++M FEMT     ++P+M++ +++   A 
Sbjct: 369 --------PFAYAMVGMGAFLAGATQAPLMAILMIFEMTLSYQVVLPLMVSCVVAYFTAR 420

Query: 390 LLQP-SLYDSIIL 401
            L   S+Y  I L
Sbjct: 421 ALGTTSMY-EITL 432


>gnl|CDD|238506 cd01034, EriC_like, ClC chloride channel family. These protein
           sequences, closely related to the ClC Eric family, are
           putative halogen ion (Cl-, Br- and I-) transport
           proteins found in eubacteria. They belong to the ClC
           superfamily of chloride ion channels, which share a
           unique double-barreled architecture and
           voltage-dependent gating mechanism.  This superfamily
           lacks any structural or sequence similarity to other
           known ion channels and exhibit unique properties of ion
           permeation and gating.  The voltage-dependent gating is
           conferred by the permeating anion itself, acting as the
           gating charge.
          Length = 390

 Score = 56.1 bits (136), Expect = 3e-08
 Identities = 46/234 (19%), Positives = 79/234 (33%), Gaps = 57/234 (24%)

Query: 171 LIGVVCGFGGAGY----VWSHRQYVLFMRRNKKMNAFLQKNRFLYPGIVVLLATSVSFPL 226
           + GVV G  G  +    V               +  F ++   L+  +  L    +    
Sbjct: 207 VCGVVGGLAGGLFARLLVALSS------GLPGWVRRFRRRRPVLFAALCGLALALIGLVS 260

Query: 227 GLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVFVSLACFMLYTY 286
           G   +  G L     L                     L  W          L  F+    
Sbjct: 261 GGLTFGTGYLQARAALEG----------------GGGLPLWFG--------LLKFL---- 292

Query: 287 IFSIIASTIP-VPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPIIPGGYATV 345
             + + S    +P G F P   +GA  G ++  +        ++ G  +          +
Sbjct: 293 --ATLLSYWSGIPGGLFAPSLAVGAGLGSLLAAL-----LGSVSQGALVL---------L 336

Query: 346 GAAAFSGAVTHT-ISVSVIMFEMTGQITHIIPVMIAVLISNAVAALLQP-SLYD 397
           G AAF   VT   ++  VI+ EMTG    ++P++ A L+++ V+ L+ P  LY 
Sbjct: 337 GMAAFLAGVTQAPLTAFVIVMEMTGDQQMLLPLLAAALLASGVSRLVCPEPLYH 390


>gnl|CDD|234963 PRK01610, PRK01610, putative voltage-gated ClC-type chloride
           channel ClcB; Provisional.
          Length = 418

 Score = 46.7 bits (111), Expect = 3e-05
 Identities = 55/243 (22%), Positives = 92/243 (37%), Gaps = 57/243 (23%)

Query: 163 PQELTVFALIGVVCGFGGAGYVW----SHRQYVLFMRRNKKMNAFLQKNRFLYPGIVVLL 218
            ++  +    G++ G  G   +     SHR              F+             L
Sbjct: 218 ARDYALIISTGLLAGLCGPLLLTLMNASHR-------------GFVS------------L 252

Query: 219 ATSVSFPLGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVFVSL 278
             +  + L LG  + G       L SLF+   W  G+  V  Q  L           + L
Sbjct: 253 KLAPPWQLALGGLIVG-------LLSLFTPAVWGNGYSVV--QSFL------TAPPLLML 297

Query: 279 ACFMLYTYIFSIIAST-IPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPI 337
              +    + +++AS+    P G F P   +G A G + G  + L  PD    G+ I  +
Sbjct: 298 IAGIFLCKLLAVLASSGSGAPGGVFTPTLFVGLAIGMLYGRSLGLWLPD----GEEITLL 353

Query: 338 IPGGYATVGAAAFSGAVTHTISVSVIM-FEMTGQITHIIPVMIAVLISNAVAALLQPSLY 396
           +       G A    A TH   +S +M  EMTG+   +  ++IA +I++ ++  L+    
Sbjct: 354 L----GLTGMATLLAATTHAPIMSTLMICEMTGEYQLLPGLLIACVIASVISRTLRR--- 406

Query: 397 DSI 399
           DSI
Sbjct: 407 DSI 409


>gnl|CDD|235385 PRK05277, PRK05277, chloride channel protein; Provisional.
          Length = 438

 Score = 42.6 bits (101), Expect = 6e-04
 Identities = 28/106 (26%), Positives = 46/106 (43%), Gaps = 12/106 (11%)

Query: 297 VPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPIIPGGYATVGAAA-FSGAVT 355
            P G F P+  +G   G   G +    FP             PG +A  G  A F+  V 
Sbjct: 317 APGGIFAPMLALGTLLGLAFGMVAAALFPQYHIE--------PGTFAIAGMGALFAATVR 368

Query: 356 HTISVSVIMFEMTGQITHIIPVMIAVLISNAVAALL--QPSLYDSI 399
             ++  V++ EMT     I+P++I  L +  +A  L  +P +Y ++
Sbjct: 369 APLTGIVLVLEMTDNYQLILPLIITCLGATLLAQFLGGKP-IYSAL 413


>gnl|CDD|238505 cd01033, ClC_like, Putative ClC chloride channel.  Clc proteins are
           putative halogen ion (Cl-, Br- and I-) transporters
           found in eubacteria. They belong to the ClC superfamily
           of halogen ion channels, which share a unique
           double-barreled architecture and voltage-dependent
           gating mechanism.  This superfamily lacks any structural
           or sequence similarity to other known ion channels and
           exhibit unique properties of ion permeation and gating. 
           The voltage-dependent gating is conferred by the
           permeating anion itself, acting as the gating charge.
          Length = 388

 Score = 40.7 bits (96), Expect = 0.002
 Identities = 59/298 (19%), Positives = 104/298 (34%), Gaps = 80/298 (26%)

Query: 103 GVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPTNFTMDFPFD 162
           G LF++E+     ++R+       A    +                      + +     
Sbjct: 152 GALFALEILLRTISLRS-----VVAALATSAIAAAVASLLKG------DHPIYDIPPMQL 200

Query: 163 PQELTVFA-LIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKNRFLYPGIVVLLAT- 220
              L ++A L G V G   AG+    R+     R  +         R L+   +  L   
Sbjct: 201 STPLLIWALLAGPVLGVVAAGF----RRLSQAARAKRPKGK-----RILWQMPLAFLVIG 251

Query: 221 --SVSFP--LGLGKYMAGDLNTHDQLSSLFSNFTWTKGHFTVEEQEVLKHWTTRNTDVFV 276
             S+ FP  LG G+ +A                           Q         +T + +
Sbjct: 252 LLSIFFPQILGNGRALA---------------------------QLAF------STTLTL 278

Query: 277 SLACFMLYTYIFSIIASTIPVPSGSF----IPVFKIGAAFGRMIGEIIFLRFPDGITHGK 332
           SL   +L   +  I+A+ + + +G++     P   +GA  G ++G +     P       
Sbjct: 279 SLLLILL---VLKIVATLLALRAGAYGGLLTPSLALGALLGALLGIVWNALLP------- 328

Query: 333 FIAPIIPGGYATVGAAAFSGAVTHTISVSVIMF--EMTGQI-THIIPVMIAVLISNAV 387
               I    +A +GAAAF  A T    ++ ++   E T Q    +IP+M+AV  + AV
Sbjct: 329 -PLSI--AAFALIGAAAF-LAATQKAPLTALILVLEFTRQNPLFLIPLMLAVAGAVAV 382


>gnl|CDD|213779 TIGR03165, F1F0_chp_2, F1/F0 ATPase, Methanosarcina type, subunit
           2.  Members of this protein family are uncharacterized,
           highly hydrophobic proteins encoded in the middle of
           apparent F1/F0 ATPase operons. We note, however, that
           this protein is both broadly and sparsely distributed.
           It is found in about only about two percent of microbial
           genomes sequenced, with the first ten examples found
           coming from the Euryarchaeota, Chlorobia,
           Betaproteobacteria, Deltaproteobacteria, and
           Planctomycetes. In most of these species, surrounding
           operon appears to represent a second F1/F0 ATPase
           system, and the member proteins belong to subfamilies
           with the same phylogenetic distribution as the current
           protein family.
          Length = 83

 Score = 31.5 bits (72), Expect = 0.20
 Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 5/60 (8%)

Query: 82  KFPEGRAPLVVYEFLKGALYKGVLFSIEVTTVYFAIRNYWRGFFTAVCGATVFRLLAVWF 141
           K    R P     FL   L   +  +I +   Y   R  WR    A+ G    R+LAV  
Sbjct: 27  KGLASRRP--ALLFLGSLL---LRTAIVLAGFYLLARGGWRRLLAALLGFFAARVLAVRL 81


>gnl|CDD|201313 pfam00571, CBS, CBS domain.  CBS domains are small intracellular
           modules that pair together to form a stable globular
           domain. This family represents a single CBS domain.
           Pairs of these domains have been termed a Bateman
           domain. CBS domains have been shown to bind ligands with
           an adenosyl group such as AMP, ATP and S-AdoMet. CBS
           domains are found attached to a wide range of other
           protein domains suggesting that CBS domains may play a
           regulatory role making proteins sensitive to adenosyl
           carrying ligands. The region containing the CBS domains
           in Cystathionine-beta synthase is involved in regulation
           by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP.
           The CBS domains from IMPDH and the chloride channel CLC2
           bind ATP.
          Length = 57

 Score = 31.0 bits (71), Expect = 0.20
 Identities = 12/43 (27%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 499 VEDFMVRDVKYIWNNMTYRDLKNLLKENRSLRVFPLVESSGKF 541
           V+D M  DV  +  + +  +   L++EN  +   P+V+  GK 
Sbjct: 1   VKDIMTPDVVTVPPDTSLEEALELMRENG-ISRLPVVDEDGKL 42



 Score = 28.3 bits (64), Expect = 1.8
 Identities = 9/34 (26%), Positives = 17/34 (50%)

Query: 423 VEDFMVRDVKYIWNNMTYRDLKNLLKENRSLRVF 456
           V+D M  DV  +  + +  +   L++EN   R+ 
Sbjct: 1   VKDIMTPDVVTVPPDTSLEEALELMRENGISRLP 34


>gnl|CDD|223591 COG0517, COG0517, FOG: CBS domain [General function prediction
           only].
          Length = 117

 Score = 31.7 bits (72), Expect = 0.44
 Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 15/115 (13%)

Query: 423 VEDFMVRDVKYIWNNMTYRDLKNLLKENRSLRVFPLVE-SSVAALLQPSLYDSIILIKKL 481
           V+D M +DV  +  + + RD   L+ EN    V P+V+   +  ++  +  D  IL    
Sbjct: 1   VKDIMTKDVITVKPDTSVRDALLLMSENGVSAV-PVVDDGKLVGII--TERD--ILRALA 55

Query: 482 PYLPDLLPSSSGIYNVYVEDFMVRDVKYIWNNMTYRDLKNLLKENRSLRVFPLVE 536
                LLP         V++ M + V  +  +    +   L+ E   +R  P+V+
Sbjct: 56  AGGKRLLP---------VKEVMTKPVVTVDPDTPLEEALELMVERHKIRRLPVVD 101


>gnl|CDD|179668 PRK03893, PRK03893, putative sialic acid transporter; Provisional.
          Length = 496

 Score = 32.7 bits (75), Expect = 0.65
 Identities = 29/131 (22%), Positives = 57/131 (43%), Gaps = 29/131 (22%)

Query: 285 TYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLR--FPDGITHGKFIAPIIPGGY 342
            Y+ S++ S + +     IPVF IG A   ++G ++F +     GI+    + P + GGY
Sbjct: 343 AYVCSLLISQLLI-----IPVFAIGGANVWVLGLLLFFQQMLGQGISG---LLPKLIGGY 394

Query: 343 -------------ATVGA-----AAFSGA-VTHTISVSVIMFEMTGQITHIIPVMIAVLI 383
                          VGA     A   GA +   + +   +  ++  +T ++ ++I + +
Sbjct: 395 FDTEQRAAGLGFTYNVGALGGALAPILGALIAQRLDLGTALASLSFSLTFVVILLIGLDM 454

Query: 384 SNAVAALLQPS 394
            + V   L+P 
Sbjct: 455 PSRVQRWLRPE 465


>gnl|CDD|223419 COG0342, SecD, Preprotein translocase subunit SecD [Intracellular
           trafficking and secretion].
          Length = 506

 Score = 32.7 bits (75), Expect = 0.75
 Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 21/131 (16%)

Query: 272 TDVFVSLACFMLYTYIFSIIASTIPVPSGSFIPVFKIGAAFGRMIGE------IIFLRFP 325
             + + L   ++   + S++ +T+ +P           A     IG       +IF R  
Sbjct: 371 AAIALGLNGVLILA-VLSLLGATLTLPGI---------AGIILTIGMAVDANVLIFERIR 420

Query: 326 DGITHGKFIAPIIPGGYATVGAAAFSGAVTHTISVSVIMFEMTGQI-----THIIPVMIA 380
           + + +GK +   I  G+    +       T  I+ +++    TG +     T I+ ++ +
Sbjct: 421 EELRNGKSVLSAIDAGFKRAFSTILDSNATTLIAAAILFALGTGPVKGFAVTLILGILTS 480

Query: 381 VLISNAVAALL 391
           +  +  V  LL
Sbjct: 481 MFTAIPVTRLL 491


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 31.6 bits (72), Expect = 1.3
 Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 540 KFEQMEIKRREERQRRPSRFEVTPAPDLLRGNP 572
           K E+ E K REE ++     +     ++L GNP
Sbjct: 155 KKERAEEKEREEEEKAAEEEKAREE-EILTGNP 186


>gnl|CDD|183671 PRK12672, PRK12672, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 118

 Score = 29.3 bits (66), Expect = 2.7
 Identities = 27/92 (29%), Positives = 35/92 (38%), Gaps = 23/92 (25%)

Query: 308 IGAAFGRMIGEIIFLRFPDGIT--HGKFIAPIIPGGYATVGA--AAFSGAVTHTISVSVI 363
           IG  F  ++G I   RFPD  T  HG            T G   A     VTH +     
Sbjct: 13  IGVTFN-LLGSIALHRFPDVYTRLHGATKC-------TTFGTIFAVL-AVVTHALY---- 59

Query: 364 MFEMTGQ------ITHIIPVMIAVLISNAVAA 389
              +TG         H    +IA+L++N V A
Sbjct: 60  RLRLTGDPKYLQMALHSFVALIALLLTNPVGA 91


>gnl|CDD|227540 COG5215, KAP95, Karyopherin (importin) beta [Intracellular
           trafficking and secretion].
          Length = 858

 Score = 30.7 bits (69), Expect = 3.6
 Identities = 19/84 (22%), Positives = 39/84 (46%), Gaps = 5/84 (5%)

Query: 397 DSIILIKKLPYLPDLLPSSSGIYNVYVEDFMVRDVKYIWNNMTYRDLKNLLKENRSLRVF 456
           D  +L   +  + DL  +    +N+Y +  M   V+ + +  T+RDLK  +     L VF
Sbjct: 652 DRFVLNSAVGLVGDLANTLGTDFNIYADVLMSSLVQCLSSEATHRDLKPAI-----LSVF 706

Query: 457 PLVESSVAALLQPSLYDSIILIKK 480
             +  ++ A  +  L   ++L ++
Sbjct: 707 GDIALAIGANFESYLDMIMMLFQQ 730


>gnl|CDD|226559 COG4073, COG4073, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 198

 Score = 29.8 bits (67), Expect = 3.8
 Identities = 30/95 (31%), Positives = 41/95 (43%), Gaps = 17/95 (17%)

Query: 443 LKNLLKENRSLRVFPLVESS-VAALLQPSLYDSIILIKKLPYLPDLLPSSSGIYNVYVED 501
           L+ L +EN  LRV P VE+    + L PS Y S +  + L  +  L P       VYVE 
Sbjct: 37  LRVLPEENGVLRVIPKVENGPYISTLDPSYYRSEVGARILRVVEKLRPD------VYVEL 90

Query: 502 FMVRDVKYIWNNMTYRDLKNLLKENRSLRVFPLVE 536
              R            + + L  E RS  V PL++
Sbjct: 91  HCYRP----------ENYRKLTAERRSGGVPPLID 115


>gnl|CDD|233281 TIGR01129, secD, protein-export membrane protein SecD.  Members of
           this family are highly variable in length immediately
           after the well-conserved motif LGLGLXGG at the
           amino-terminal end of this model. Archaeal homologs are
           not included in the seed and score between the trusted
           and noise cutoffs. SecD from Mycobacterium tuberculosis
           has a long Pro-rich insert [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 397

 Score = 30.3 bits (69), Expect = 4.0
 Identities = 27/121 (22%), Positives = 49/121 (40%), Gaps = 15/121 (12%)

Query: 278 LACFMLYTYIFSIIAS------TIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGITHG 331
           +A   L   I  I+A       T+ +P  + + +  IG A    +  +I+ R  + +  G
Sbjct: 275 IAAIALVINIVLILAILSAFGATLTLPGIAGL-ILTIGMAVDANV--LIYERIKEELRLG 331

Query: 332 KFIAPIIPGGYATVGAAAFSGAVTHTISVSVIMFEMTGQI-----THIIPVMIAVLISNA 386
           K +   I  G+    +  F   +T  I+  ++    TG +     T  I + IA L +  
Sbjct: 332 KSVQQAIEAGFERAFSTIFDANITTLIAALILYVFGTGPVKGFAVTLAIGI-IASLFTAL 390

Query: 387 V 387
           V
Sbjct: 391 V 391


>gnl|CDD|240000 cd04629, CBS_pair_16, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 114

 Score = 28.7 bits (65), Expect = 4.3
 Identities = 11/43 (25%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 499 VEDFMVRDVKYIWNNMTYRDLKNLLKENRSLRVFPLVESSGKF 541
           V D M  +V  +  + +  DL  L+ + +  + +P+V+  GK 
Sbjct: 62  VRDIMTTEVLTVSPDDSIVDLAQLMLKAKP-KRYPVVD-DGKL 102


>gnl|CDD|223911 COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms].
          Length = 1009

 Score = 30.3 bits (69), Expect = 4.3
 Identities = 31/128 (24%), Positives = 51/128 (39%), Gaps = 26/128 (20%)

Query: 290 IIASTIPVPSGSFIPVFKIGAAFG------RMIGEI--IFLRFPDGITHGKFIAPIIPG- 340
           I A  +PV   S +  F +  AFG       +   +  I L   D I   + I   + G 
Sbjct: 358 IPALAVPV---SLLGTFAVMYAFGFSLNTLTLFALVLAIGLLVDDAIVVVENIERHMRGL 414

Query: 341 --------GYATVGAAAFSGAVTHTISVSV-IMF--EMTGQITH--IIPVMIAVLISNAV 387
                       +G A  +  +   I+V + + F    TG++     I + +A+L+S  V
Sbjct: 415 PPREAAEKAMGEIGGAVIAITLV-LIAVFLPMAFMGGSTGELFRQFAITIAVAMLLSLLV 473

Query: 388 AALLQPSL 395
           A  L P+L
Sbjct: 474 ALTLTPAL 481


>gnl|CDD|216972 pfam02322, Cyto_ox_2, Cytochrome oxidase subunit II.  This Family
           consists of cytochrome bd type terminal oxidases that
           catalyzes Quinol dependent, Na+ independent oxygen
           uptake. Members of this family are integral membrane
           proteins andi contain a protohaem IX centre B558. One
           member of the family,cydB from Klebsiella pneumoniae, is
           implicated in having an important role in micro-aerobic
           nitrogen fixation in the enteric bacterium Klebsiella
           pneumoniae.
          Length = 327

 Score = 29.8 bits (68), Expect = 4.5
 Identities = 19/60 (31%), Positives = 24/60 (40%), Gaps = 9/60 (15%)

Query: 300 GSFIPVFKIGAAFGRMI-GEIIFLRFPDGITHGKFIAPIIP----GGYATVGAAAFSGAV 354
           GS +  F  G A G ++ G    + F  G   G F   + P    GG A V   A  GA 
Sbjct: 121 GSLLAAFLQGVALGNLVQG----VPFDAGNYAGGFFDLLNPFALLGGLAVVALYALLGAT 176


>gnl|CDD|239654 cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins.
           This ClC family presents in bacteria, where it
           facilitates acid resistance in acidic soil. Mutation of
           this gene (sycA) in Rhizobium tropici CIAT899 causes
           serious deficiencies in nodule development, nodulation
           competitiveness, and N2 fixation on Phaseolus vulgaris
           plants, due to its reduced ability for acid resistance. 
           This family is part of the ClC chloride channel
           superfamiy. These proteins catalyse the selective flow
           of Cl- ions across cell membranes and Cl-/H+ exchange
           transport. These proteins share two characteristics that
           are apparently inherent to the entire ClC chloride
           channel superfamily: a unique double-barreled
           architecture and voltage-dependent gating mechanism. The
           gating is conferred by the permeating anion itself,
           acting as the gating charge.
          Length = 378

 Score = 29.9 bits (68), Expect = 4.7
 Identities = 20/85 (23%), Positives = 35/85 (41%), Gaps = 15/85 (17%)

Query: 300 GSFIPVFKIGAAFGRMIGEIIFLRFPDGITHGKFIAPIIPGGYATVGAAAFSGAVTHTIS 359
           G   P+F IGA  G  +  I+ L           +A +   G+  V    F+GA    ++
Sbjct: 298 GEVTPLFFIGATLGNALAPILGLPVS-------LLAAL---GFVAV----FAGATNTPLA 343

Query: 360 VSVIMFEMTGQITHIIPVMIAVLIS 384
             ++  E+ G   +     IA L++
Sbjct: 344 CIIMGIELFG-AENAPYFFIACLVA 367


>gnl|CDD|224239 COG1320, MnhG, Multisubunit Na+/H+ antiporter, MnhG subunit
           [Inorganic ion transport and metabolism].
          Length = 113

 Score = 28.4 bits (64), Expect = 5.5
 Identities = 26/108 (24%), Positives = 39/108 (36%), Gaps = 27/108 (25%)

Query: 287 IFSIIASTIPVPSGSFIPVFKIGAAFGRMIGEIIFLRFPDGIT--HGKFIAPIIPGGYAT 344
           I  II S I +  G+    F + AA G        LRFPD  T  H    A  +      
Sbjct: 3   ILEIIVS-ILLLIGAL---FSLLAAIG-------LLRFPDVYTRLHAATKATTL------ 45

Query: 345 VGAAAFSGAVTHTISVSVIMFEMTGQI-THIIPVMIAVLISNAVAALL 391
                  GA    + V +    + G     +I + + +L++  V A  
Sbjct: 46  -------GAGLLLLGVIIYFLSLLGFFALKLILLALFILLTAPVGAHA 86


>gnl|CDD|240004 cd04633, CBS_pair_20, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 121

 Score = 28.4 bits (64), Expect = 5.6
 Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 418 IYNVYVEDFMVRDVKYIWNNMTYRD-LKNLLKENRS 452
           I N+ V D M R V  I  + +  D    +L+ N  
Sbjct: 64  IRNLPVSDIMTRPVITIEPDTSVSDVASLMLENNIG 99



 Score = 28.4 bits (64), Expect = 5.6
 Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 494 IYNVYVEDFMVRDVKYIWNNMTYRD-LKNLLKENRS 528
           I N+ V D M R V  I  + +  D    +L+ N  
Sbjct: 64  IRNLPVSDIMTRPVITIEPDTSVSDVASLMLENNIG 99


>gnl|CDD|220304 pfam09594, DUF2029, Protein of unknown function (DUF2029).  This is
           a putative transmembrane protein from bacteria. It is
           likely to be conserved between Mycobacterium species.
          Length = 240

 Score = 29.2 bits (66), Expect = 6.6
 Identities = 9/39 (23%), Positives = 14/39 (35%)

Query: 113 VYFAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFF 151
           +Y  +R  WR    A   A     L++      + R F 
Sbjct: 118 LYLLLRRRWRALAAAAATAAALVALSLLVLGPSSWRYFL 156


>gnl|CDD|187823 cd09692, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily
           protein Cas5.  CRISPR (Clustered Regularly Interspaced
           Short Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Cas5 is a RAMP superfamily protein; Subunit of the
           Cascade complex.
          Length = 189

 Score = 28.6 bits (64), Expect = 7.6
 Identities = 15/63 (23%), Positives = 24/63 (38%), Gaps = 1/63 (1%)

Query: 148 RAFFPTNFTMDFPFDPQELTVFALIGVVCGFGGAGYVWSHRQYVLFMRRNKKMNAFLQKN 207
           R  +     + +PF P   T+  LIG V G+    Y          +   K +   +Q  
Sbjct: 13  RKPYTNTSKLTYPF-PPRTTLLGLIGAVLGYDKDEYYDLFEPVRGRLDLAKPVRKIMQVL 71

Query: 208 RFL 210
           +FL
Sbjct: 72  KFL 74


>gnl|CDD|182513 PRK10519, PRK10519, hypothetical protein; Provisional.
          Length = 151

 Score = 28.4 bits (64), Expect = 8.7
 Identities = 25/96 (26%), Positives = 39/96 (40%), Gaps = 35/96 (36%)

Query: 331 GKFIAPI-----IPGGYATVGAAAF---SGAVTHTIS-----------VSVI---MFEMT 368
           G    P+     +PG  ATV  AA+    GAV    S           V+++   ++ M 
Sbjct: 50  GHIFGPVMALFGLPGEAATVLLAAWMSMGGAVGVAASLFTAGALTGHHVTILLPAIYLMG 109

Query: 369 GQITHI-------------IPVMIAVLISNAVAALL 391
            Q+ ++              P +IAV I NA+ A+L
Sbjct: 110 SQVQYMGRILGVAEVNAKYYPHLIAVCIINALLAML 145


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 29.2 bits (65), Expect = 8.9
 Identities = 32/107 (29%), Positives = 41/107 (38%), Gaps = 26/107 (24%)

Query: 100 LYKGVLFSIEVTTVY------FAIRNYWRGFFTAVCGATVFRLLAVWFYNEETMRAFFPT 153
           L+ G+  SI +T+V        +I + WR     V G     L AV F     M A   T
Sbjct: 790 LFLGLFISIILTSVLELRWSGVSIEDLWRNEQFWVIGGVSAHLFAV-FQGFLKMLAGLDT 848

Query: 154 NFTM------DFPFD-------------PQELTVFALIGVVCGFGGA 181
           NFT+      D  F              P  L +  L+GVV GF  A
Sbjct: 849 NFTVTAKAADDTEFGELYMVKWTTLLIPPTSLLIINLVGVVAGFSDA 895


>gnl|CDD|224450 COG1533, SplB, DNA repair photolyase [DNA replication,
           recombination, and repair].
          Length = 297

 Score = 28.9 bits (65), Expect = 9.4
 Identities = 15/32 (46%), Positives = 18/32 (56%), Gaps = 5/32 (15%)

Query: 327 GITHGKFIAPIIPG-----GYATVGAAAFSGA 353
           GI  G F+APIIPG         + AAA +GA
Sbjct: 182 GIPVGLFVAPIIPGLNDEELERILEAAAEAGA 213


>gnl|CDD|215936 pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain.  This
           family is involved in biosynthesis of guanosine
           nucleotide. Members of this family contain a TIM barrel
           structure. In the inosine monophosphate dehydrogenases 2
           CBS domains pfam00571 are inserted in the TIM barrel.
           This family is a member of the common phosphate binding
           site TIM barrel family.
          Length = 467

 Score = 29.1 bits (66), Expect = 9.8
 Identities = 17/59 (28%), Positives = 23/59 (38%), Gaps = 14/59 (23%)

Query: 601 PKKSILKKTNSFSLKNFSPMLTPNVT--------PYTTVTGAESRIRMAFE---AIFHK 648
           P +S+L   +   L      LT N+T           TVT +   I MA E    + HK
Sbjct: 12  PGRSVLPSRSDVDLSTK---LTRNITLNIPLVSANMDTVTESRMAIAMAREGGIGVIHK 67


>gnl|CDD|233053 TIGR00613, reco, DNA repair protein RecO.  All proteins in this
           family for which functions are known are DNA binding
           proteins that are involved in the initiation of
           recombination or recombinational repair [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 241

 Score = 28.8 bits (65), Expect = 9.8
 Identities = 12/53 (22%), Positives = 23/53 (43%), Gaps = 6/53 (11%)

Query: 246 FSNFTWTK--GHFTVEEQEVLKHWTTRNTDVFVSLACFMLYTYIFSIIASTIP 296
             +F W K  G  T+ + E++  +    +D+F+         YI  +I   +P
Sbjct: 53  LGDFVWYKRSGLSTLNQGELINSFEGIKSDLFLLAYAS----YIAELIDRLLP 101


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.422 

Gapped
Lambda     K      H
   0.267   0.0729    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,970,161
Number of extensions: 3751420
Number of successful extensions: 3534
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3492
Number of HSP's successfully gapped: 83
Length of query: 704
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 600
Effective length of database: 6,324,786
Effective search space: 3794871600
Effective search space used: 3794871600
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 63 (28.0 bits)