RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8107
         (1613 letters)



>gnl|CDD|212040 cd10332, SLC6sbd-B0AT-like, System B(0) neutral amino acid
            transporter AT1, 2 and 3, and related proteins;
            solute-binding domain.  This subgroup includes the
            solute-binding domain of transmembrane transporters,
            which transport, i) neutral amino acids: NTT4 (also
            called XT1), SBAT1 (also called B0AT2, v7-3, NTT7-3), and
            B0AT1 (also called HND); the human genes encoding these
            are SLC6A17, SLC6A15, and SLC6A19 respectively, ii)
            glycine: B0AT3 (also called Xtrp2, XT2), iii) imino
            acids, such as proline, pipecolate, MeAIB, and sarcosine:
            SIT1 (also called XTRP3, XT3, IMINO). The human genes
            encoding B0AT3 and SIT1 are SLC6A18 and SLC6A20
            respectively. Transporters in this subgroup may play a
            role in disorders including major depression, Hartnup
            disorder, increased susceptibility to myocardial
            infarction, and iminoglycinuria. This subgroup belongs to
            the solute carrier 6 (SLC6) transporter family.
          Length = 565

 Score =  633 bits (1636), Expect = 0.0
 Identities = 225/386 (58%), Positives = 274/386 (70%), Gaps = 46/386 (11%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+++ML+LEGIPLF +EL IGQ++R GS+GVWNTI P+LGG+GI+SC+V+F VALY
Sbjct: 38   AFLIPYVIMLVLEGIPLFFLELAIGQRLRKGSIGVWNTISPYLGGIGIASCVVSFLVALY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGT-IIPECELSSETAYFWYRTTLDVSPNIE 996
            YNVII WCF+YLFNSF   LPWSSCPT+ NGT  + EC  SS T YFWYRTTLD+SP+I+
Sbjct: 98   YNVIIAWCFFYLFNSFQYPLPWSSCPTNGNGTGYVEECAKSSPTQYFWYRTTLDISPSID 157

Query: 997  SPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAG 1056
              G   WW+ LCLLLAWIIV+  ++KGI+SSG+VVYFT+ FPYIVL IF IRG+TL GAG
Sbjct: 158  ESGGLNWWMTLCLLLAWIIVYLCMIKGIKSSGKVVYFTATFPYIVLIIFLIRGLTLKGAG 217

Query: 1057 AGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSI 1116
             GL HM+TPK EKLLDP VWLDAATQ+F+S GL FGSLIAF SYN P NNC RD ++VS+
Sbjct: 218  DGLKHMFTPKFEKLLDPQVWLDAATQIFFSLGLGFGSLIAFSSYNPPKNNCHRDAVLVSL 277

Query: 1117 CNALTAIYASVVVFAILGFKAMSNAAE-----------GTGLAFI--------------- 1150
             N  T+I+AS+VVF++LGFKA +                  L                  
Sbjct: 278  INCFTSIFASIVVFSVLGFKATTRYDRCVDSNILILVINFLLNSTEDYVDNYTAINNVTL 337

Query: 1151 -------------------AAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQ 1191
                               A EGTGLAFIVFT+AIV+ PGAPFWS++FF+MLL+LGLGS 
Sbjct: 338  TENQTLGLTNCSLEDELDEAVEGTGLAFIVFTEAIVKFPGAPFWSVLFFLMLLTLGLGSM 397

Query: 1192 IGILEGVLCTIFDIEIFKRIRKPILT 1217
             G LEGVL  +FD +I  ++ K  LT
Sbjct: 398  FGTLEGVLTPLFDSKILPKVPKEYLT 423



 Score =  187 bits (477), Expect = 5e-50
 Identities = 63/135 (46%), Positives = 94/135 (69%), Gaps = 11/135 (8%)

Query: 769 HLVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTN 828
           +L G+VCL+ FL+GLIFT  +G YW+++FDS++GT+ L++IAL EM+AV ++YG ++F +
Sbjct: 421 YLTGLVCLVCFLIGLIFTQRSGNYWVQMFDSYSGTLPLLIIALFEMIAVSYVYGLDRFED 480

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNRA---------- 878
           DI  MTG +PGWYW+ITWR++ P  M  ILV+SI+    K P+YSAWN            
Sbjct: 481 DIEYMTGKRPGWYWKITWRYVSPLAMLVILVASIVQLVTKPPTYSAWNAELASTESLPYP 540

Query: 879 -FLIPFMVMLILEGI 892
            + +  +++LIL  I
Sbjct: 541 GWGLAVIILLILLSI 555


>gnl|CDD|215790 pfam00209, SNF, Sodium:neurotransmitter symporter family. 
          Length = 506

 Score =  444 bits (1145), Expect = e-141
 Identities = 164/341 (48%), Positives = 232/341 (68%), Gaps = 7/341 (2%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+++ LI  GIPLF +EL +GQ  R G++GVW  I P   G+G +S ++ F++ +Y
Sbjct: 38   AFLIPYLIFLITAGIPLFFLELALGQYTREGAIGVWRKICPLFKGIGYASMVIVFYIGIY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPNIES 997
            YNVII W  YYLF+SF++ LPW+SC    N    P CE +S    FW R  L +S  IE 
Sbjct: 98   YNVIIAWALYYLFSSFTTELPWASCNNSWNT---PRCEKTSPVEEFWERKVLKLSDGIED 154

Query: 998  PGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGA 1057
             G  +W + LCLL+AW++V+F + KG++SSG+VVYFT+ FPY+VL +  +RG+TLPGA  
Sbjct: 155  LGGLRWPLTLCLLVAWVVVYFCIWKGVKSSGKVVYFTATFPYVVLLVLLVRGVTLPGAAD 214

Query: 1058 GLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSIC 1117
            G+    TP   KLLDP VW+DAATQ+F+S G+ FG LIAF SYN+ +NNC RD ++VS  
Sbjct: 215  GIDFYLTPDWSKLLDPQVWIDAATQIFFSLGIGFGGLIAFASYNKFHNNCYRDALIVSFI 274

Query: 1118 NALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPFWSI 1177
            N+ T+  A  V+F++LGF A     +G  ++ +A  G GLAFI + +A+  LP +PFWS+
Sbjct: 275  NSATSFLAGFVIFSVLGFMA---QEQGVPISEVAESGPGLAFIAYPEAVTMLPLSPFWSV 331

Query: 1178 IFFMMLLSLGLGSQIGILEGVLCTIFD-IEIFKRIRKPILT 1217
            +FF+ML++LGL SQ G +EG++  + D   I  R R+ + T
Sbjct: 332  LFFLMLITLGLDSQFGGVEGIITALVDEFPILLRKRRELFT 372



 Score =  123 bits (311), Expect = 4e-29
 Identities = 47/133 (35%), Positives = 76/133 (57%), Gaps = 8/133 (6%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND 829
              +VC+ISFL+GLI  T  G Y   LFD +A + GL+ +   E +AV ++YG ++F +D
Sbjct: 371 FTLIVCVISFLIGLILVTEGGIYVFTLFDYYAASFGLLFVVFFECIAVAWVYGLDRFYDD 430

Query: 830 IYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSA-----WNRAFLIPFM 884
           I EM G++PG +W++ W+F+ P I+  + + SI+    K  +Y+      W  A      
Sbjct: 431 IKEMLGFRPGLFWRLCWKFVSPLILLFLFIFSIV--QYKPLTYNNYVYPNWANALGWLMA 488

Query: 885 VMLILEGIPLFLI 897
           +  +L  IPL+ I
Sbjct: 489 LSSMLC-IPLYSI 500


>gnl|CDD|212086 cd11517, SLC6sbd_B0AT3, glycine transporter, B0AT3; solute-binding
            domain.  B0AT3 (also called Xtrp2, XT2) transports
            glycine. Human B0AT3 is encoded by the SLC6A18 gene.
            B0AT3 is expressed in the kidney. Mutations in the
            SLC6A18 gene may contribute to the autosomal recessive
            disorder iminoglycinuria and its related disorder
            hyperglycinuria. SLC6A18 or its neighboring genes are
            associated with increased susceptibility to myocardial
            infarction. This subgroup belongs to the solute carrier 6
            (SLC6) transporter family.
          Length = 576

 Score =  423 bits (1090), Expect = e-132
 Identities = 177/391 (45%), Positives = 252/391 (64%), Gaps = 51/391 (13%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ + L+ EGIPLF +EL IGQ++R GS+GVW TI P+LGGVG+    V+F V+LY
Sbjct: 39   AFLIPYFIALVFEGIPLFHLELAIGQRLRKGSIGVWTTISPYLGGVGLGCVTVSFLVSLY 98

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGT-IIPECELSSETAYFWYRTTLDVSPNIE 996
            YN I+ W  +Y  NSF   LPWSSCP D N T  + EC+ S+  +YFWYR TL+++P+I 
Sbjct: 99   YNTILAWVLWYFLNSFQHPLPWSSCPLDLNRTGFVEECQGSTAVSYFWYRQTLNITPDIN 158

Query: 997  SPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAG 1056
              GS +WW++LCL   W IV+  V++GI+++G+V+YFT++FPY+VLTIF IRG+TLPGA 
Sbjct: 159  DSGSIQWWLLLCLAACWAIVYLCVIRGIETTGKVIYFTALFPYLVLTIFLIRGLTLPGAT 218

Query: 1057 AGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSI 1116
             GL++++TP +  L +P VWLDAATQ+F+S  LAFG  IAF SYN P N+C +D + +++
Sbjct: 219  EGLIYLFTPNMHILQNPRVWLDAATQIFFSLSLAFGGHIAFASYNPPRNDCEKDAVTIAL 278

Query: 1117 CNALTAIYASVVVFAILGFKAMS------------------------------------N 1140
             N++T++YAS+ VF++LGFKA +                                    N
Sbjct: 279  VNSMTSLYASIPVFSVLGFKATNDYEHCLDRNILSLINEFDFPEQSISRDNYPSVLMHLN 338

Query: 1141 AAEGTGLAFI-------------AAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLG 1187
            A     +A +             +A G GLAFIVFT+A++ +PGA  W+++FF ML +LG
Sbjct: 339  ATWPERVAQLPLKACLLEDFLDKSASGPGLAFIVFTEAVLHMPGAQVWAVLFFGMLFTLG 398

Query: 1188 LGSQIGILEGVLCTIFDIEIFKR-IRKPILT 1217
            L S  G +EGV+  + D+ +  R + K  LT
Sbjct: 399  LSSMFGNVEGVITPLLDVGVLPRWVPKEALT 429



 Score =  136 bits (343), Expect = 6e-33
 Identities = 51/107 (47%), Positives = 75/107 (70%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND 829
           L G+VCL+ FL  L FT  +G YWL++FDSFA ++ L++IA  E++ V+++YG ++F +D
Sbjct: 428 LTGLVCLVCFLSALCFTLQSGNYWLEIFDSFAASLNLLVIAFFEVVGVVYVYGIKRFCDD 487

Query: 830 IYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWN 876
           I  MTG +P  YW++TWR + P ++  I V+ II  F K PSY AWN
Sbjct: 488 IEWMTGRRPSLYWRLTWRVVSPLLLLTIFVAYIILLFQKPPSYKAWN 534


>gnl|CDD|212034 cd10324, SLC6sbd, Solute carrier 6 family, neurotransmitter
            transporters; solute-binding domain.  This family
            represents the solute-binding domain of SLC6 proteins
            (also called the sodium- and chloride-dependent
            neurotransmitter transporter family or Na+/Cl--dependent
            transporter family). These use sodium and chloride
            electrochemical gradients to catalyze the
            thermodynamically uphill movement of a variety of
            substrates, and include neurotransmitter transporters
            (NTTs). The latter are Na+/Cl--dependent plasma membrane
            transporters for the monoamine neurotransmitters
            serotonin (5-hydroxytryptamine), dopamine, and
            norepinephrine, and the amino acid neurotransmitters GABA
            and glycine. NTTs are widely expressed in the mammalian
            brain, and are involved in regulating neurotransmitter
            signaling and homeostasis, through facilitating the
            uptake of released neurotransmitters from the
            extracellular space into neurons and glial cells. NTTs
            are the target of a range of therapeutic drugs for the
            treatment of psychiatric diseases, such as major
            depression, anxiety disorders, attention deficit
            hyperactivity disorder and epilepsy. In addition, they
            are the primary targets of cocaine, amphetamines and
            other psychostimulants. This family also includes
            Drosophila Blot which is expressed primarily in
            epithelial tissues of ectodermal origin and in the
            nervous system of the embryo and larvae, but in addition
            found in the developing oocyte and the freshly laid egg.
            A lack or reduction of Blot function during oogenesis
            results in early arrest of embryonic development. 12
            transmembrane helices (TMs) appears to be common for
            eukaryotic and some prokaryotic and archaeal SLC6s, (a
            core inverted topology repeat, TM1-5 and TM6-10, plus
            TMs11-12; TMs numbered to conform to the SLC6 Aquifex
            aeolicus LeuT), although a majority of bacterial, and
            some archaeal SLC6s lack TM12, for example the functional
            Fusobacterium nucleatum tyrosine transporter Tyt1.
          Length = 417

 Score =  406 bits (1047), Expect = e-128
 Identities = 143/337 (42%), Positives = 199/337 (59%), Gaps = 52/337 (15%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+++ML+L GIPLF +EL +GQ    G +G +  I P   GVG +  +++F VA+Y
Sbjct: 37   AFLIPYLIMLLLVGIPLFFLELALGQYTSRGPVGAFWRICPLFKGVGYAQVVISFLVAIY 96

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPNIES 997
            YNVII W  YYLF SF                                            
Sbjct: 97   YNVIIAWTLYYLFASF-------------------------------------------- 112

Query: 998  PGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGA 1057
                 W + LCLLLAWI+V+  + KG++SSG+VVYFT+ FPY+VL I  IRG+TLPGAG 
Sbjct: 113  ---INWELALCLLLAWILVYLCIFKGVKSSGKVVYFTATFPYVVLFILLIRGVTLPGAGD 169

Query: 1058 GLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSIC 1117
            G+++  TP   KLLDP VW  AATQ+F+S  + FG LI + SYN+ +NN  RD ++VSI 
Sbjct: 170  GILYFLTPDWSKLLDPKVWYAAATQIFFSLSIGFGGLITYASYNKFDNNIYRDALIVSIL 229

Query: 1118 NALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPFWSI 1177
            N LT++ A  V+F++LGF  +++   G  +A +   G GLAF+V+ +AI ++P    WS+
Sbjct: 230  NTLTSLLAGFVIFSVLGF--LAHEL-GVPVADVVKGGPGLAFVVYPEAIAKMPVPQLWSV 286

Query: 1178 IFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKP 1214
            +FF ML  LGL SQ  I+E ++  I D   F ++RK 
Sbjct: 287  LFFFMLFLLGLDSQFAIVETIVTAIVD--EFPKLRKR 321



 Score =  126 bits (320), Expect = 9e-31
 Identities = 38/94 (40%), Positives = 59/94 (62%)

Query: 769 HLVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTN 828
            +   VC++ FL+GL + T  G Y L L D +A T  L+++AL E++A+ +IYG ++F  
Sbjct: 324 LVTLGVCVVGFLLGLPYVTQGGIYILDLLDYYAATFSLLVLALFELIAISWIYGVDRFLR 383

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSI 862
           DI  M G  P  YW+I W+F+ P I+  IL+ S+
Sbjct: 384 DIEFMLGIPPSIYWRICWKFITPVILLVILIFSL 417


>gnl|CDD|212084 cd11515, SLC6sbd_NTT4-like, Na(+)-dependent neurotransmitter
            transporter 4, and related proteins; solute-binding
            domain.  This subgroup includes the solute-binding domain
            of NTT4 (also called XT1) and SBAT1 (also called B0AT2,
            v7-3, NTT7-3); both these proteins can transport neutral
            amino acids. Human SBAT1 is encoded by the SLC6A15 gene,
            a susceptibility gene for major depression. SBAT1 is
            expressed in brain, and may have a role in transporting
            neurotransmitter precursors into neurons. Human NTT4 is
            encoded by the SLC6A17 gene. NTT4 is specifically
            expressed in the nervous system, in synaptic vesicles of
            glutamatergic and GABAergic neurons, and may play an
            important role in synaptic transmission. This subgroup
            belongs to the solute carrier 6 (SLC6) transporter
            family.
          Length = 530

 Score =  406 bits (1046), Expect = e-127
 Identities = 175/354 (49%), Positives = 242/354 (68%), Gaps = 17/354 (4%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            A+L+P++++LI+ GIPLF +EL +GQ++R GS+GVWN I P LGG+G +SC+V  FVALY
Sbjct: 38   AYLVPYLILLIVIGIPLFFLELSVGQRIRRGSIGVWNYISPKLGGIGFASCIVCLFVALY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGT---IIPECELSSETAYFWYRTTLDVSPN 994
            YNVII W  +Y   SF   LPW  CP   N +   + PECE SS T Y+WYR  L++S +
Sbjct: 98   YNVIIGWSLFYFSQSFQYPLPWDQCPLVKNASHTFVEPECEKSSATTYYWYREALNISSS 157

Query: 995  IESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPG 1054
            I   G   W + +CLL AW++V   ++KGIQSSG+++YF+S+FPY+VL  F +RG+ L G
Sbjct: 158  ISESGGLNWKMTICLLAAWVMVCLAMIKGIQSSGKIMYFSSLFPYVVLACFLVRGLLLNG 217

Query: 1055 AGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMV 1114
            +  G+ HM+TPK+E +L+P VW +AATQVF++ GL FG +IAF SYN+ +NNC  D ++V
Sbjct: 218  SVDGIRHMFTPKLEIMLEPKVWREAATQVFFALGLGFGGVIAFSSYNKRDNNCHFDAVLV 277

Query: 1115 SICNALTAIYASVVVFAILGFKAMS---NAAEGTGLAFI--------AAEGTGLAFIVFT 1163
            S  N  T++ A++VVFA+LGFK            GL           A +GTGLAFI FT
Sbjct: 278  SFINFFTSVLATLVVFAVLGFKIQKVKEEEFSALGLDSCLIEDELDKAVQGTGLAFIAFT 337

Query: 1164 QAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKPILT 1217
            +A+   P +PFWS++FF+ML++LGLGS  G +EG+   I D   FK +RK ILT
Sbjct: 338  EAMTHFPASPFWSVMFFLMLVNLGLGSMFGTIEGITTPIVD--TFK-VRKEILT 388



 Score =  104 bits (260), Expect = 1e-22
 Identities = 37/103 (35%), Positives = 68/103 (66%)

Query: 773 VVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYE 832
           + CL +FL+GLIF   +G Y++ +FD ++ T+ L+++ ++E +AV +IYG +KF  D+ +
Sbjct: 390 ICCLFAFLIGLIFVQRSGNYFVTMFDDYSATLPLLIVVILENIAVAWIYGIKKFMEDLKD 449

Query: 833 MTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAW 875
           M G++P  Y+   W+++ P  +  +L +SI+   L  P Y+AW
Sbjct: 450 MLGFRPSRYYYYMWKYISPLCLLVLLTASIVQMGLSPPGYNAW 492


>gnl|CDD|212065 cd11496, SLC6sbd-TauT-like, Na(+)- and Cl(-)-dependent taurine
            transporter TauT, and related proteins; solute-binding
            domain.  This subgroup represents the solute-binding
            domain of TauT-like Na(+)- and Cl(-)-dependent
            transporters. Family members include: human TauT which
            transports taurine, human GAT1, GAT2, and GAT3, and BGT1,
            which transport gamma-aminobutyric acid (GABA), and human
            CT1 which transports creatine. This subgroup belongs to
            the solute carrier 6 (SLC6) transporter family.
          Length = 543

 Score =  397 bits (1023), Expect = e-123
 Identities = 150/360 (41%), Positives = 222/360 (61%), Gaps = 24/360 (6%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ + L+L GIP+F +E+ +GQ    G +  W  I P   G+G +S ++ F++ +Y
Sbjct: 38   AFLIPYFIFLVLCGIPIFFLEVALGQYTSQGGITAWK-ICPLFKGIGYASAVIVFWLNIY 96

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCP----TDH--------NGTIIPECELSSETAY--- 982
            Y VI+ W  +YLFNSF+S LPW++C     T+         N + + +   ++  +    
Sbjct: 97   YIVILAWALFYLFNSFTSELPWTTCDNWWNTECCVETYSNNNLSNLTKNCSNNTNSTSPV 156

Query: 983  --FWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYI 1040
              FW R  L +S  IE  GS +W + LCLLLAWII +F + KG++S+G+VVYFT+ FPY+
Sbjct: 157  EEFWERRVLGISDGIEDIGSIRWELALCLLLAWIICYFCIWKGVKSTGKVVYFTATFPYV 216

Query: 1041 VLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSY 1100
            +L I  IRG+TLPGA  G+     P + KLLDP VW+DA TQ+F+S+G+  GSL A GSY
Sbjct: 217  MLIILLIRGVTLPGASDGIYFYLKPDLTKLLDPQVWIDAGTQIFFSYGIGLGSLTALGSY 276

Query: 1101 NQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFI 1160
            N+ NNNC +D I++   N+ T+ +A   +F+ILGF  M+    G  ++ +A  G GLAFI
Sbjct: 277  NKFNNNCYKDSIILCFLNSGTSFFAGFAIFSILGF--MAQEQ-GVPISEVAESGPGLAFI 333

Query: 1161 VFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDI---EIFKRIRKPILT 1217
             + +A+  LP    W+++FF+MLL LGL SQ   +EG +  I D+    +  R R+ I  
Sbjct: 334  AYPRAVSLLPLPQLWAVLFFIMLLLLGLDSQFVGVEGFVTAIVDLYPNVLRLRYRREIFV 393



 Score = 93.8 bits (234), Expect = 2e-19
 Identities = 47/135 (34%), Positives = 73/135 (54%), Gaps = 10/135 (7%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGT-IGLVLIALIEMLAVMFIYGHEKFTN 828
            V +VCL+ FL+GL   T  G Y  +LFD +A + I L+ +A  E +A+ ++YG ++F +
Sbjct: 392 FVAIVCLVCFLIGLPMVTEGGIYVFQLFDYYAASGICLLWLAFFECIAISWVYGADRFYD 451

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIIC----RFLKKPSYSAWNRAF--LIP 882
           +I +M GY+PG +W+  W+FL PAI   I V S++      +     Y  W  A    + 
Sbjct: 452 NIEDMIGYRPGPWWKYCWKFLTPAICLGIFVFSLVKYTPLTYGNTYVYPWWAEAIGWFLA 511

Query: 883 FMVMLILEGIPLFLI 897
              ML    IP + I
Sbjct: 512 LSSMLC---IPGYAI 523


>gnl|CDD|212085 cd11516, SLC6sbd_B0AT1, Na(+)-dependent neutral amino acids
            transporter, B0AT1; solute-binding domain.  B0AT1 (also
            called HND) transports neutral amino acids. Human B0AT1
            is encoded by the SLC6A19 gene. B0AT1 is expressed
            primarily in the kidney and intestine; it requires
            collectrin for expression in the kidney, and
            angiotensin-converting enzyme 2 for expression in the
            intestine. Interaction with these two proteins implicates
            B0AT1 in more complex processes such as glomerular
            structure, exocytosis, and blood pressure control. The
            autosomal recessive disorder, Hartnup disorder, is caused
            by mutations in B0AT1. This subgroup belongs to the
            solute carrier 6 (SLC6) transporter family.
          Length = 581

 Score =  398 bits (1024), Expect = e-123
 Identities = 174/391 (44%), Positives = 243/391 (62%), Gaps = 51/391 (13%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AF+IPF+++L+ EGIPL  +E  IGQ++R GS+GVW+TIHP L GVGI+S  V+F V+LY
Sbjct: 38   AFMIPFLILLVFEGIPLLHLEFAIGQRLRKGSVGVWSTIHPTLKGVGIASMCVSFLVSLY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGT-IIPECELSSETAYFWYRTTLDVSPNIE 996
            YN II W  +Y FNSF   LPWS CP + N T  IPEC  SS   YFWYR TL+ S +I+
Sbjct: 98   YNTIIAWVMWYFFNSFQEPLPWSQCPLNENRTGYIPECAKSSPVDYFWYRETLNTSTSID 157

Query: 997  SPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAG 1056
              G  +WW+VLCL  AW +++   ++GI+++G+ VY TS  PY+VLTIF IRG+TL G+ 
Sbjct: 158  DSGGIQWWMVLCLACAWGVLYVCTIRGIETTGKAVYVTSTLPYLVLTIFLIRGLTLKGSV 217

Query: 1057 AGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSI 1116
             G+  ++TP + +L +P  WLDA  QVFYSF LAFG LI+F SYN  +NNC +D +++S+
Sbjct: 218  NGIKFLFTPDINELANPVTWLDAGAQVFYSFSLAFGGLISFSSYNSVHNNCEKDAVIISV 277

Query: 1117 CNALTAIYASVVVFAILGFKA--------------------------------------- 1137
             N  T+IYA+ V+++I+GF+A                                       
Sbjct: 278  INGFTSIYAATVIYSIIGFRATERFDDCFDGNILTLINAFDLPEGNITQDNYQDMQQSLN 337

Query: 1138 MSNAAEGTGLAFI----------AAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLG 1187
             ++    + L+              EGTGLAFIVFT+AI ++P +P WS++FF+ML  LG
Sbjct: 338  STDPDIISSLSLQTCDLNNFLSEGVEGTGLAFIVFTEAITKMPLSPLWSVLFFIMLFCLG 397

Query: 1188 LGSQIGILEGVLCTIFDIEIF-KRIRKPILT 1217
            L S  G +EGVL  + D+++  K   K I+T
Sbjct: 398  LSSMFGNMEGVLVPLQDLKVIPKSWPKEIIT 428



 Score =  128 bits (324), Expect = 2e-30
 Identities = 57/140 (40%), Positives = 87/140 (62%), Gaps = 12/140 (8%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND 829
           + G++CL+SFL+GLIF  G+G YWL LFDSFAG+I L++IA  EM +V+++YG ++F  D
Sbjct: 427 ITGLICLVSFLIGLIFVLGSGNYWLALFDSFAGSIPLLIIAFCEMFSVVYVYGIDRFNKD 486

Query: 830 IYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWN---RAFLIP---- 882
           I  M G+KP  +WQ TWR + P IM  I +   + +  ++  YSAW+     F       
Sbjct: 487 IEFMIGHKPNIFWQATWRVISPLIMLVIFLFYFVVKVSQELFYSAWDPEYEEFPKSQKIS 546

Query: 883 -----FMVMLILEGIPLFLI 897
                +++++IL G+P   I
Sbjct: 547 YPTWIYVIIVILAGVPSLAI 566


>gnl|CDD|212087 cd11518, SLC6sbd_SIT1, Na(+)- and Cl(-)-dependent imino acid
            transporter SIT1; solute-binding domain.  SIT1 (also
            called XTRP3, XT3, IMINO) transports imino acids, such as
            proline, pipecolate, MeAIB, and sarcosine. It has weak
            affinity for neutral amino acids such as phenylalanine.
            Human SIT1 is encoded by the SLC6A20 gene. SIT1 is
            expressed in brain, kidney, small intestine, thymus,
            spleen, ovary, and lung. SLC6A20 is a candidate gene for
            the rare disorder iminoglycinuria. This subgroup belongs
            to the solute carrier 6 (SLC6) transporter family.
          Length = 576

 Score =  379 bits (976), Expect = e-116
 Identities = 174/397 (43%), Positives = 246/397 (61%), Gaps = 55/397 (13%)

Query: 879  FLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYY 938
            FLIP+++MLILEG+PL  +EL +GQ+MR GS+G W +I P+LGGVG++S +V+FF+++YY
Sbjct: 42   FLIPYLIMLILEGMPLLYLELAVGQRMRQGSIGAWKSISPYLGGVGVASVVVSFFLSMYY 101

Query: 939  NVIITWCFYYLFNSFSSSLPWSSCPTDHNGT-IIPECELSSETAYFWYRTTLDVSPNIES 997
            NVI  W F+YLF+SF   LPWS CP + N T    ECE +S T YFWYR TL++SP+IE 
Sbjct: 102  NVINAWAFWYLFHSFQDPLPWSVCPLNSNRTGYTEECEKASPTQYFWYRETLNISPSIEE 161

Query: 998  PGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGA 1057
             G  +W   LCL+LAW++V+  +++G +S+G+VVY T+  PY+VL I+ IRG+TL GA  
Sbjct: 162  SGGIQWEQALCLILAWLVVYLCILRGTESTGKVVYVTATLPYLVLIIYLIRGLTLHGAAN 221

Query: 1058 GLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSIC 1117
            GL +M+TPK+E+L +P  W++AATQ+F+S GL FGSLIAF SYN+P+NNC R  I+VS+ 
Sbjct: 222  GLKYMFTPKLEQLANPKTWINAATQIFFSLGLGFGSLIAFASYNEPSNNCERHAIIVSLI 281

Query: 1118 NALT---------AIYASVVVF--------AIL----------GFKAMSNAAEGTGL--- 1147
            N+ T         +IY     F         IL          G   MSN  +       
Sbjct: 282  NSGTSIFASIVTFSIYGFKATFNYENCLKKVILLLTNTFDLEEGSITMSNLEDMKKYLNA 341

Query: 1148 AF---------------------IAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSL 1186
            A+                      A +GTGLAFIV+++AI  +  +  WS+++F MLL L
Sbjct: 342  AYPQEYAQIAPQLKNCSLEKELDTAVQGTGLAFIVYSEAIKNMEVSQLWSVLYFFMLLML 401

Query: 1187 GLGSQIGILEGVLCTIFDIEIFKRIRKPILTEALKGM 1223
            G+GS +G    +L  + D    K I   +  EA+ G+
Sbjct: 402  GIGSMLGNTAAILTPLTD---SKIISSHLPKEAISGL 435



 Score =  108 bits (270), Expect = 7e-24
 Identities = 42/106 (39%), Positives = 66/106 (62%), Gaps = 1/106 (0%)

Query: 772 GVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIY 831
           G+VC I+ ++GL+FT  AG YW  +F+ +A T+ L+LI L+E +AV ++YG  +F  D+ 
Sbjct: 434 GLVCFINCIIGLVFTMEAGNYWFDIFNDYAATLSLLLIVLVETIAVCYVYGLRRFEKDLE 493

Query: 832 EMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLK-KPSYSAWN 876
            MTG+KP WYW+I W F  P ++  + +  +    L     Y AW+
Sbjct: 494 AMTGHKPSWYWKIMWAFTSPLLIISLFIFYLTDYILTGTLQYQAWD 539


>gnl|CDD|212091 cd11522, SLC6sbd_SBAT1, Sodium-coupled branched-chain amino-acid
            transporter 1; solute-binding domain.  SBAT1 (also called
            B0AT2, v7-3, NTT7-3) is a high-affinity Na(+)-dependent
            transporter for large neutral amino acids, including
            leucine, isoleucine, valine, proline and methionine.
            Human SBAT1 is encoded by the SLC6A15 gene, a
            susceptibility gene for major depression. SBAT1 is
            expressed in brain, and may have a role in transporting
            neurotransmitter precursors into neurons. This subgroup
            belongs to the solute carrier 6 (SLC6) transporter
            family.
          Length = 580

 Score =  375 bits (964), Expect = e-115
 Identities = 173/404 (42%), Positives = 243/404 (60%), Gaps = 67/404 (16%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            A+L+P++++L++ GIPLF +EL +GQ++R GS+GVWN I P LGG+G +SC+V FFVALY
Sbjct: 38   AYLVPYLILLLVIGIPLFFLELSVGQRIRRGSIGVWNYISPKLGGIGFASCIVCFFVALY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGT---IIPECELSSETAYFWYRTTLDVSPN 994
            YNVII W  +Y   SF   LPW  CP   N +   + PECE SS T Y+WYR  L++S +
Sbjct: 98   YNVIIGWSLFYFSQSFQQPLPWDQCPLVKNASHTFVEPECEKSSATTYYWYREALNISSS 157

Query: 995  IESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPG 1054
            I   G   W + +CLL AW++V   ++KGIQSSG+++YF+S+FPY+VL  F IR + L G
Sbjct: 158  ISESGGLNWKMTICLLAAWVMVCLAMIKGIQSSGKIMYFSSLFPYVVLICFLIRALLLNG 217

Query: 1055 AGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMV 1114
            +  G+ HM+TPK+E +L+P VW +AATQVF++ GL FG +IAF SYN+ +NNC  D ++V
Sbjct: 218  SVDGIRHMFTPKLEIMLEPKVWREAATQVFFALGLGFGGVIAFSSYNKRDNNCHFDAVLV 277

Query: 1115 SICNALTAIYASVVVFA--------------------ILGFKAMSNAAEGTGLAFI---- 1150
            S  N  T++ A++VVFA                    I+    M N ++      I    
Sbjct: 278  SFINFFTSVLATLVVFAVLGFKANVINEKCIIQNSEKIIKLLKMGNLSQDIIPHHINFSS 337

Query: 1151 -------------------------------------AAEGTGLAFIVFTQAIVELPGAP 1173
                                                 A +GTGLAFI FT+A+   P +P
Sbjct: 338  ITAEDYNLVYDIIQKVKEEEFPALGLKSCQIEDELNKAVQGTGLAFIAFTEAMTHFPASP 397

Query: 1174 FWSIIFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKPILT 1217
            FWS++FF+ML++LGLGS  G +EG++  I D   FK +RK ILT
Sbjct: 398  FWSVMFFLMLVNLGLGSMFGTIEGIITPIVD--TFK-VRKEILT 438



 Score =  103 bits (257), Expect = 3e-22
 Identities = 37/107 (34%), Positives = 70/107 (65%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND 829
           L  + CL++F +GLIF   +G Y++ +FD ++ T+ L+++ ++E +AV F+YG +KF  D
Sbjct: 437 LTVICCLLAFCIGLIFVQRSGNYFVTMFDDYSATLPLLIVVILENIAVSFVYGIDKFMED 496

Query: 830 IYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWN 876
           + +M G+ P  Y+   W+++ P ++  +LV+S++   L  P Y+AW 
Sbjct: 497 LKDMLGFTPNRYYYYMWKYISPLVLLSLLVASVVQMGLSPPGYNAWI 543


>gnl|CDD|212066 cd11497, SLC6sbd_SERT-like, Na(+)- and Cl(-)-dependent monoamine
            transporters, SERT, NET, DAT1 and related proteins;
            solute binding domain.  This subgroup represents the
            solute-binding domain of transmembrane transporters that
            transport monoamine neurotransmitters from synaptic
            spaces into presynaptic neurons. Members include: NET
            which transports norepinephrine, SERT which transports
            serotonin, and DAT1 which transports dopamine. These
            transporters may play a role in diseases including
            depression, anxiety disorders, attention-deficit
            hyperactivity disorder, and in the control of human
            behavior and emotional states. This subgroup belongs to
            the solute carrier 6 (SLC6) transporter family.
          Length = 539

 Score =  369 bits (949), Expect = e-113
 Identities = 159/355 (44%), Positives = 219/355 (61%), Gaps = 27/355 (7%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+++MLI  G+PLF +EL +GQ  R G + VW  I P   GVG + C++ F+VA Y
Sbjct: 38   AFLIPYLLMLIFGGLPLFYMELALGQYHRSGCITVWRKICPIFKGVGYAICIIAFYVAFY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSC-----------------PTDHNGTIIPECELSSET 980
            YN II W  YYLF+SF+S LPW+SC                  +  NGT      L S  
Sbjct: 98   YNTIIAWALYYLFSSFTSELPWTSCNNPWNTPNCTDVSNRNNSSSSNGTWT----LHSPA 153

Query: 981  AYFWYRTTLDV--SPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFP 1038
              F+ R  L++  S  I+  G PKW + LCL   +IIV+F + KG++SSG+VV+ T+  P
Sbjct: 154  EEFFERKVLELQKSEGIDDLGGPKWQLALCLFAVFIIVYFSLWKGVKSSGKVVWVTATMP 213

Query: 1039 YIVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFG 1098
            Y+VL+I  IRG+TLPGA  G+ +  TP   KL DP VW+DAATQ+F+S G  FG L+A  
Sbjct: 214  YVVLSILLIRGLTLPGASDGIKYYLTPNWSKLKDPKVWIDAATQIFFSLGPGFGVLLALS 273

Query: 1099 SYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLA 1158
            SYN+ +NNC RD ++ S  N LT+ ++  V+F++LG+ A         +  +A EG GL 
Sbjct: 274  SYNKFHNNCYRDALITSSINCLTSFFSGFVIFSVLGYMAHKQ---NVDIEVVATEGPGLV 330

Query: 1159 FIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRK 1213
            FIV+ +AI  LPG+ FW+IIFF+ML++LGL S  G LE V+  + D E    + K
Sbjct: 331  FIVYPEAIATLPGSTFWAIIFFLMLITLGLDSTFGGLEAVITGLCD-EFPPVLGK 384



 Score =  106 bits (267), Expect = 2e-23
 Identities = 33/108 (30%), Positives = 60/108 (55%), Gaps = 2/108 (1%)

Query: 769 HLVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTN 828
             V  V +I FL+ L   T  G Y + L D +A    ++ +  +E +AV + YG ++F++
Sbjct: 388 LFVLGVVIICFLLALPTVTYGGIYVVTLLDRYAAGTSILFVVFLEAVAVSWFYGVDRFSD 447

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWN 876
           DI EM G++PG YW++ W+F+ P  +  I++ S++    +   Y  + 
Sbjct: 448 DIEEMLGFRPGLYWRVCWKFISPIFLLFIVIFSLV--SYEPLQYQDYV 493


>gnl|CDD|212069 cd11500, SLC6sbd_PROT, Na(+)- and Cl(-)-dependent L-proline
            transporter PROT; solute-binding domain.  PROT is a
            high-affinity L-proline transporter that transports
            L-proline, and may have a role in excitatory
            neurotransmission. Human PROT is encoded by the SLC6A7
            gene, a potential susceptible gene for asthma. PROT is
            expressed in the brain. This subgroup belongs to the
            solute carrier 6 (SLC6) transporter family.
          Length = 544

 Score =  362 bits (931), Expect = e-110
 Identities = 145/343 (42%), Positives = 202/343 (58%), Gaps = 20/343 (5%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ +ML   GIPLF +EL +GQ   LG L VW    P   G+G    LV+  V +Y
Sbjct: 38   AFLIPYFIMLAFCGIPLFFMELSLGQYGSLGPLTVWKCC-PIFKGIGYGMLLVSGLVCIY 96

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSC--------------PTDHNGTIIPECELSSETAYF 983
            YNVII W  +YLF SF+S LPW  C                +     +      S +  +
Sbjct: 97   YNVIIAWTLFYLFASFTSVLPWEHCGNWWNTESCLEHRVKDNMGALPVNLTRTVSPSEEY 156

Query: 984  WYRTTL--DVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIV 1041
            W R  L    S  I  PG  +W + LCLLLAW+IVFF + KG++SSG+VVYFT+ FPYI+
Sbjct: 157  WNRRVLGIQGSSGIGDPGEVRWELALCLLLAWVIVFFCLFKGVKSSGKVVYFTATFPYII 216

Query: 1042 LTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYN 1101
            L +  +RG+TL GA  G+    TP   +L    VW DAA+Q+FYS G+ FG L+   SYN
Sbjct: 217  LIVLLVRGVTLEGALKGIQFYLTPDFHRLASSQVWYDAASQIFYSLGIGFGGLLTMASYN 276

Query: 1102 QPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIV 1161
            + NNN  RD +++++ N +T+++A   +F++LG  A      G  +  +A  G GLAF+ 
Sbjct: 277  KFNNNIYRDTLIITLGNCITSVFAGFAIFSVLGHMAHE---LGVPVKDVADAGPGLAFVA 333

Query: 1162 FTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFD 1204
            + +A+  LP +PFWSI+FF MLL+LGL SQ  +LE ++  + D
Sbjct: 334  YPEALTLLPVSPFWSILFFFMLLTLGLDSQFAMLETIVTAVTD 376



 Score =  105 bits (263), Expect = 4e-23
 Identities = 45/143 (31%), Positives = 77/143 (53%), Gaps = 4/143 (2%)

Query: 769 HLVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTN 828
             +G++C+  +L+GL+  T  G YWL L D ++ + GL+++AL   LA+ ++YG ++F  
Sbjct: 388 WFLGLICVGMYLLGLLLVTDGGMYWLTLMDWYSASFGLMVLALFMCLAISWVYGIQRFCR 447

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPS---YSAWNRAFLIPFMV 885
           DI  M G++P  Y++  W F+ PA++  ILV SI+     +     Y  W     I  M 
Sbjct: 448 DIKMMIGFEPNLYFKACWMFISPALLLFILVYSIVKYQPSEYGSYRYPPWAELLGI-LMG 506

Query: 886 MLILEGIPLFLIELGIGQKMRLG 908
           +L    IP+  I   + ++  L 
Sbjct: 507 LLSCLMIPIGAIVAILREEGSLW 529


>gnl|CDD|212090 cd11521, SLC6sbd_NTT4, Na(+)-dependent neurotransmitter transporter
            4; solute-binding domain.  NTT4 (also called XT1)
            transports the neutral amino acids, proline, glycine,
            leucine, and alanine, and may play an important role in
            synaptic transmission. Human NTT4 is encoded by the
            SLC6A17 gene. NTT4 is specifically expressed in the
            nervous system, in synaptic vesicles of glutamatergic and
            GABAergic neurons. This subgroup belongs to the solute
            carrier 6 (SLC6) transporter family.
          Length = 589

 Score =  353 bits (908), Expect = e-107
 Identities = 148/263 (56%), Positives = 195/263 (74%), Gaps = 3/263 (1%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            A+L+P++V+LI+ GIPLF +EL +GQ++R GS+GVWN I P LGG+G +SCLV FFV LY
Sbjct: 38   AYLVPYLVLLIIIGIPLFFLELAVGQRIRRGSIGVWNYICPRLGGIGYASCLVCFFVGLY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTII---PECELSSETAYFWYRTTLDVSPN 994
            YNVII W  +Y F SF   LPWS CP   NG++     ECE SS T YFWYR  LD+S +
Sbjct: 98   YNVIIGWSIFYFFKSFQYPLPWSECPIVRNGSVAVVEAECEKSSATTYFWYREALDISNS 157

Query: 995  IESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPG 1054
            I   G   W + LCLL+AW IV   ++KGIQSSG+V+YF+S+FPY+VL  F +RG+ L G
Sbjct: 158  ISESGGLNWKMTLCLLVAWSIVGLAMIKGIQSSGKVMYFSSLFPYVVLVCFLVRGLLLRG 217

Query: 1055 AGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMV 1114
            A  G++HM+TPK++K+LDP VW +AATQVF++ GL FG +IAF SYN+ +NNC  D  +V
Sbjct: 218  AVDGILHMFTPKLDKMLDPQVWREAATQVFFALGLGFGGVIAFSSYNKQDNNCHFDAALV 277

Query: 1115 SICNALTAIYASVVVFAILGFKA 1137
            S  N  T++ A++VVFA+LGFKA
Sbjct: 278  SFINFFTSVLATLVVFAVLGFKA 300



 Score = 95.4 bits (237), Expect = 8e-20
 Identities = 39/103 (37%), Positives = 69/103 (66%)

Query: 775 CLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEMT 834
           C+ +FLVGLIF   +G Y++ +FD ++ T+ L ++ ++E +AV +IYG +KF  ++ EM 
Sbjct: 442 CIFAFLVGLIFVQRSGNYFVTMFDDYSATLPLTIVVILENIAVAWIYGTKKFMQELTEML 501

Query: 835 GYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNR 877
           G++P  ++  TW+++ P  M+ ++ +SII   +  P YSAW R
Sbjct: 502 GFRPYSFYFYTWKYVSPICMAVLMTASIIQLGVSPPGYSAWIR 544


>gnl|CDD|212071 cd11502, SLC6sbd_NTT5, Neurotransmitter transporter 5; solute-binding
            domain.  Human NTT5 is encoded by the SLC6A16 gene. NTT5
            is expressed in testis, pancreas, and prostate; its
            expression is predominantly intracellular, indicative of
            a vesicular location. Its substrates are unknown. This
            subgroup belongs to the solute carrier 6 (SLC6)
            transporter family.
          Length = 543

 Score =  327 bits (840), Expect = 2e-98
 Identities = 139/380 (36%), Positives = 208/380 (54%), Gaps = 35/380 (9%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            +FLI + VML L GIPL  +E+ +GQ+++ GS+ VW  I PW GGVG SS L  F VALY
Sbjct: 38   SFLIAYFVMLFLVGIPLLFLEMAVGQRLQQGSIRVWKAISPWFGGVGYSSVLACFLVALY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPNIES 997
            YNV+ +W  +YL +SF   LPW  CP   N +  PEC  ++ + YFWY+ TL  S  IE 
Sbjct: 98   YNVVNSWALFYLGHSFQFPLPWEQCPLVKNSSD-PECARTTPSGYFWYQKTLKASDRIED 156

Query: 998  PGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGA 1057
             GS    + L L +AW +V+  +++G++S+G+ + F  + PY++L    IR + L GA  
Sbjct: 157  GGSLVGHLALSLFVAWCLVYVTMIRGLKSTGKSIIFLVLLPYVILLCLLIRSLFLEGAFF 216

Query: 1058 GLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSIC 1117
            GL+ +   KV  L + ++W  A  QV +S GL FG+LI + SY    NNC+ D  +V++ 
Sbjct: 217  GLLWLLVAKVSSLANLSIWRVAGGQVLFSLGLGFGTLITYASYMPRKNNCLTDAFVVALV 276

Query: 1118 NALTAIYASVVVFAILGF-----------KAMSNAAEGTGLAFIA--------------- 1151
            N LT++ A+ + FA++GF           K + N      L   A               
Sbjct: 277  NLLTSLIATPIFFAVMGFWATVSTHRCCEKNVENLLMLGVLPPEAKPPADLSLDPASIYL 336

Query: 1152 -------AEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFD 1204
                    EG G AF+ F++AI   PG+ FW+I+FF+ LL  GL +  GI++G+   + D
Sbjct: 337  LWLFNLPMEGPGFAFLAFSEAISLAPGSAFWAIVFFLALLLAGLSTMFGIMQGIYTPLQD 396

Query: 1205 -IEIFKRIRKPILTEALKGM 1223
                F++  K +      G+
Sbjct: 397  TFSFFRKHPKLLTVLVCVGL 416



 Score = 65.9 bits (161), Expect = 1e-10
 Identities = 33/122 (27%), Positives = 64/122 (52%), Gaps = 7/122 (5%)

Query: 759 LTDPHCDEHLHLVGV-VCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAV 817
            T     +H  L+ V VC+  FL GL+FT  +G Y+  L D +   + ++ I + E +A+
Sbjct: 396 DTFSFFRKHPKLLTVLVCVGLFLGGLVFTQPSGSYFRSLLDDYLVPLPIIFIVVFENIAL 455

Query: 818 MFIYGHEKFTNDIYEMTGYKP---GWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSA 874
            + YG ++F  D+  + G       W+    W +L P ++  + + ++I  +++ P+Y A
Sbjct: 456 AWAYGAKRFLADLEFLLGGPISSIVWWL---WSYLSPVVLLGLFLWTLIQLYVQPPTYLA 512

Query: 875 WN 876
           W+
Sbjct: 513 WD 514


>gnl|CDD|212094 cd11556, SLC6sbd_SERT-like_u1, uncharacterized subgroup of the
            SERT-like Na(+)- and Cl(-)-dependent monoamine
            transporter subfamily; solute binding domain.  SERT-like
            Na(+)- and Cl(-)-dependent monoamine transporters,
            transport monoamine neurotransmitters from synaptic
            spaces into presynaptic neurons. Members include: the
            norepinephrine transporter NET, the serotonin transporter
            SERT , and the dopamine transporter DAT1. These latter
            may play a role in diseases or disorders including
            depression, anxiety disorders, and attention-deficit
            hyperactivity disorder, and in the control of human
            behavior and emotional states. They belongs to the solute
            carrier 6 (SLC6) transporter family. Members of this
            subgroup are uncharacterized.
          Length = 552

 Score =  327 bits (839), Expect = 4e-98
 Identities = 142/369 (38%), Positives = 209/369 (56%), Gaps = 39/369 (10%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ +ML++ GIPLF +EL +GQ  R G++  W  I P   G+G +  L+ F+V  +
Sbjct: 38   AFLIPYGIMLVVGGIPLFYMELALGQYYRKGAITSWGRICPLFKGIGYAVVLIAFYVDFF 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPEC-------ELSSETAYFW------ 984
            YNVII W  YY F SF+ +LPW+SC    N    P C       + +             
Sbjct: 98   YNVIIAWSLYYFFASFTFNLPWTSCNNSWN---TPNCYEPHWSEDGTVPCRSANQSFSAT 154

Query: 985  -------------------YRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQ 1025
                                   L VS  +   G+ +W I+LCL   ++I +F + KGI 
Sbjct: 155  NISAAEENFTSAALEYFERGVLELHVSRGVTDLGNIRWQILLCLFAVYLICYFSLWKGIH 214

Query: 1026 SSGRVVYFTSMFPYIVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFY 1085
            +SG+VV+FT++FPY+VL I  IRG+TLPG+  G+ +   P  E L    VW+DAATQVF+
Sbjct: 215  TSGKVVWFTALFPYVVLFILLIRGVTLPGSQNGIEYYLRPNFEALKSAEVWVDAATQVFF 274

Query: 1086 SFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGT 1145
            S G  FG L+A+ SYN+ +NN  RD ++ S  N LT+  +  V+F++LG+ A      G 
Sbjct: 275  SLGPGFGVLLAYASYNKFHNNVYRDALVTSSINCLTSFLSGFVIFSVLGYMACR---SGK 331

Query: 1146 GLAFIAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFD- 1204
             +  +A EG GL F+V+ +A+  +PG+ FWS++FF+ML++LGL S  G  E ++  + D 
Sbjct: 332  PIEDVATEGPGLVFVVYPEALSTMPGSTFWSLLFFLMLMTLGLDSSFGGSEAIITALSDE 391

Query: 1205 IEIFKRIRK 1213
              I KR R+
Sbjct: 392  FPIIKRHRE 400



 Score = 84.1 bits (208), Expect = 2e-16
 Identities = 34/93 (36%), Positives = 54/93 (58%), Gaps = 2/93 (2%)

Query: 773 VVCLISF--LVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDI 830
           V CL +F  ++GL   T  G Y+  L D +A    +++  L E +AV +IYG ++F  DI
Sbjct: 403 VGCLFAFYFVIGLAMCTQGGIYFFHLLDRYAAGYSILIAVLFEAVAVSWIYGLDRFCQDI 462

Query: 831 YEMTGYKPGWYWQITWRFLGPAIMSCILVSSII 863
            EM G++PG YW+I W+F+ P  +   +   +I
Sbjct: 463 KEMLGFRPGIYWKICWKFIAPIFLLFNITYGLI 495


>gnl|CDD|212082 cd11513, SLC6sbd_SERT, Na(+)- and Cl(-)-dependent serotonin
            transporter SERT; solute-binding domain.  SERT (also
            called 5-HTT), is a transmembrane transporter that
            transports the neurotransmitter serotonin from synaptic
            spaces into presynaptic neurons. The antiport of a K+ ion
            is believed to follow the transport of serotonin and
            promote the reorientation of SERT for another transport
            cycle. Human SERT is encoded by the SLC6A4 gene. SERT is
            expressed in brain, peripheral nervous system, placenta,
            epithelium, and platelets. SERT may play a role in
            diseases or disorders including anxiety, depression,
            autism, gastrointestinal disorders, premature
            ejaculation, and obesity. It may also have a role in
            social cognition. This subgroup belongs to the solute
            carrier 6 (SLC6) transporter family.
          Length = 537

 Score =  326 bits (837), Expect = 5e-98
 Identities = 142/341 (41%), Positives = 203/341 (59%), Gaps = 18/341 (5%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ +M I  GIPLF +EL +GQ  R G + +W  I P   G+G + C++  +VA Y
Sbjct: 38   AFLIPYTLMAIFGGIPLFYMELALGQYHRTGCISIWRKICPIFKGIGYAICIIALYVASY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTD------------HNGTIIPECELSSETAYFWY 985
            YN II W  YYL +SF+++LPW+SC                N T        +E   F+ 
Sbjct: 98   YNTIIAWALYYLISSFATTLPWTSCDNSWNTGNCTNYFAKDNITWTLHSTSPAEE--FYT 155

Query: 986  RTTLDV--SPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLT 1043
            R  L V  S  +   G   W + LCL L + IV+F + KG+++SG+VV+ T+ FPYIVL 
Sbjct: 156  RHVLQVHRSTGLGDLGGISWQLALCLFLIFTIVYFSIWKGVKTSGKVVWVTATFPYIVLF 215

Query: 1044 IFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQP 1103
            I  IRG TLPGA  G+V+   PK  KLL+  VW+DAA Q+F+S G  FG L+AF SYN+ 
Sbjct: 216  ILLIRGATLPGAWRGVVYYLKPKWHKLLETGVWVDAAAQIFFSLGPGFGVLLAFSSYNKF 275

Query: 1104 NNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFT 1163
            +NNC +D ++ S+ N LT+  +  V+F +LG+  M+       +      G  L FI + 
Sbjct: 276  HNNCYQDALVTSVVNCLTSFVSGFVIFTVLGY--MAEMRNEDVIEVATDAGPSLLFITYA 333

Query: 1164 QAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFD 1204
            +AI  +PG+ F++IIFF+M+++LGL S  G LEGV+  + D
Sbjct: 334  EAIANMPGSTFFAIIFFLMIITLGLDSTFGGLEGVITAVLD 374



 Score = 88.7 bits (220), Expect = 7e-18
 Identities = 36/108 (33%), Positives = 60/108 (55%), Gaps = 8/108 (7%)

Query: 764 CDEHLHLVG------VVCLI--SFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEML 815
            DE  H++G      V+ L+   FL  L   T  G Y +KL + +A    ++ + L+E +
Sbjct: 373 LDEFPHVIGKRREIFVLGLVIVCFLGSLATLTFGGAYVVKLLEEYATGPAVLTVVLLEAI 432

Query: 816 AVMFIYGHEKFTNDIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSII 863
           AV + YG  +F +DI EM G+ PGW+W++ W  + P  +  I+VS ++
Sbjct: 433 AVSWFYGITQFCSDIKEMLGFTPGWFWRVCWVAISPLFLLFIIVSFLM 480


>gnl|CDD|212075 cd11506, SLC6sbd_GAT1, Na(+)- and Cl(-)-dependent GABA transporter 1;
            solute-binding domain.  GAT1 transports
            gamma-aminobutyric acid (GABA). GABA is the main
            inhibitory neurotransmitter within the mammalian CNS.
            Human GAT1 is encoded by the SLC6A1 gene. GAT1 is
            expressed in brain and peripheral nervous system. The
            antiepileptic drug, Tiagabine, inhibits GAT1. This
            subgroup belongs to the solute carrier 6 (SLC6)
            transporter family.
          Length = 598

 Score =  322 bits (827), Expect = 6e-96
 Identities = 152/369 (41%), Positives = 233/369 (63%), Gaps = 13/369 (3%)

Query: 845  TWRFLGPAIMSCILVSSIICRFLKKPSYSAWNR--AFLIPFMVMLILEGIPLFLIELGIG 902
            TW+     +MSC+  +  +    + P     N   AFLIP+ + LI  G+PLFL+E  +G
Sbjct: 45   TWKGKFDFLMSCVGYAIGLGNVWRFPYLCGKNGGGAFLIPYFLTLIFAGVPLFLLECSLG 104

Query: 903  QKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVIITWCFYYLFNSFSSSLPWSSC 962
            Q   +G LGVW  + P   GVG+++ +++F++ +YY VII W  YYL+NSF+++LPW SC
Sbjct: 105  QYTSIGGLGVWK-LAPMFKGVGLAAAVLSFWLNIYYIVIIAWAIYYLYNSFTTTLPWKSC 163

Query: 963  P----TDH---NGTIIPECELSSETAYFWYRTTLDVSPNIESPGSPKWWIVLCLLLAWII 1015
                 TD    N ++     ++S    FW R    ++  +E PG  +W + + L +AW++
Sbjct: 164  DNPWNTDRCFSNYSMANTTNMTSAVVEFWERNMHQMTDGLEKPGQIRWPLAITLAIAWVL 223

Query: 1016 VFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNV 1075
            V+F + KG+  +G+VVYF++ +PYI+L I F RG+TLPGA  G++   TP   KL D  V
Sbjct: 224  VYFCIWKGVGWTGKVVYFSATYPYIMLFILFFRGVTLPGAKEGILFYITPNFSKLSDSEV 283

Query: 1076 WLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGF 1135
            WLDAATQ+F+S+GL  GSLIA GSYN  +NN  RD I+V   N+ T+++A  V+F+I+GF
Sbjct: 284  WLDAATQIFFSYGLGLGSLIALGSYNPFHNNVYRDSIIVCCINSCTSMFAGFVIFSIVGF 343

Query: 1136 KAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGIL 1195
              M++  +   +A +AA G GLAF+ + +A+ +LP +P W+++FF MLL LG+ SQ   +
Sbjct: 344  --MAHVTKRP-IADVAASGPGLAFLAYPEAVTQLPISPLWAVLFFSMLLMLGIDSQFCTV 400

Query: 1196 EGVLCTIFD 1204
            EG +  + D
Sbjct: 401  EGFITALVD 409



 Score = 75.0 bits (184), Expect = 2e-13
 Identities = 31/95 (32%), Positives = 58/95 (61%), Gaps = 1/95 (1%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGT-IGLVLIALIEMLAVMFIYGHEKFTN 828
            + VVC++S+L+GL   T  G Y  KLFD ++ + + L+ +   E +++ + YG  +F +
Sbjct: 422 FIAVVCIVSYLIGLSNITQGGIYVFKLFDYYSASGMSLLFLVFFECVSISWFYGVNRFYD 481

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSII 863
           +I EM GY+P  +W++ W F  P I++ + + S +
Sbjct: 482 NIEEMVGYRPCIWWKLCWSFFTPIIVAGVFLFSAV 516


>gnl|CDD|241286 cd01255, PH2_Tiam1_2, T-lymphoma invasion and metastasis 1 and 2
            Pleckstrin Homology (PH) domain, C-terminal domain.
            Tiam1 activates Rac GTPases to induce membrane ruffling
            and cell motility while Tiam2 (also called STEF (SIF
            (still life) and Tiam1 like-exchange factor) contributes
            to neurite growth. Tiam1/2 are Dbl-family of GEFs that
            possess a Dbl(DH) domain with a PH domain in tandem.
            DH-PH domain catalyzes the GDP/GTP exchange reaction in
            the GTPase cycle and facillitating the switch between
            inactive GDP-bound and active GTP-bound states. The DH
            domain of Tiam1 interacts with Switch regions 1 and 2 of
            Rac1 which blocks magnesium binding and GDP is released.
            Tiam1/2 possess two PH domains, which are often referred
            to as PHn and PHc domains. The DH-PH tandem domain is
            made up of the PHc domain while the PHn is part of a
            novel N-terminal PHCCEx domain which is made up of the
            PHn domain, a coiled coil region(CC), and an extra region
            (Ex). PHCCEx mediates binding to plasma membranes and
            signalling proteins in the activation of Rac GTPases. The
            PH domain resembles the beta-spectrin PH domain,
            suggesting non-canonical phosphatidylinositol binding. CC
            and Ex form a positively charged surface for protein
            binding. There are 2 motifs in Tiam1/2-interacting
            proteins that bind to the PHCCEx domain: Motif-I in CD44,
            ephrinBs, and the NMDA receptor and Motif-II in Par3 and
            JIP2. PH domains have diverse functions, but in general
            are involved in targeting proteins to the appropriate
            cellular location or in the interaction with a binding
            partner. They share little sequence conservation, but all
            have a common fold, which is electrostatically polarized.
            Less than 10% of PH domains bind phosphoinositide
            phosphates (PIPs) with high affinity and specificity. PH
            domains are distinguished from other PIP-binding domains
            by their specific high-affinity binding to PIPs with two
            vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
            PtdIns(3,4,5)P3 which results in targeting some PH domain
            proteins to the plasma membrane. A few display strong
            specificity in lipid binding. Any specificity is usually
            determined by loop regions or insertions in the
            N-terminus of the domain, which are not conserved across
            all PH domains. PH domains are found in cellular
            signaling proteins such as serine/threonine kinase,
            tyrosine kinases, regulators of G-proteins, endocytotic
            GTPases, adaptors, as well as cytoskeletal associated
            molecules and in lipid associated enzymes.
          Length = 172

 Score =  302 bits (775), Expect = 1e-94
 Identities = 88/174 (50%), Positives = 114/174 (65%), Gaps = 2/174 (1%)

Query: 1234 QRIHEEYGAIFDHLFRQHQKSCKQPIDLSPGDLLYYGGVEWLNISDFLGKIKKGLELHAM 1293
            Q+IHEEYGA+FD L R+   + K+  DLS GDLL YG VEWLN    LGK+KK  E    
Sbjct: 1    QKIHEEYGAVFDQLIREQSGTKKEVADLSMGDLLLYGTVEWLNPPSSLGKVKK--EPELA 58

Query: 1294 CFVFKSAVVFLCKERLRQKKKLIGVSNKGCSNEVEIIRYQVLVPVTEVQVRASSAKDMDS 1353
             FVFK+AVV +CKER +QKKKL+G        + +  R++ L+PV+ +QVR S+  D + 
Sbjct: 59   VFVFKTAVVLVCKERSKQKKKLMGSHRASSYEDRDPFRFRHLIPVSALQVRNSNTADTEG 118

Query: 1354 HFLWELIHLRSQIQRRTEKVYVLSNSTAEFRNAFLKTIRQIIRESVRNMSIPMT 1407
              LWELIH +S+++ R EKV+ L  ST EF+NAFLK IR I+RE VR  S    
Sbjct: 119  RCLWELIHTKSELEGRPEKVFQLCCSTPEFKNAFLKVIRSILREKVRRQSSKTE 172


>gnl|CDD|212068 cd11499, SLC6sbd_GlyT2, Na(+)- and Cl(-)-dependent glycine
            transporter GlyT2; solute-binding domain.  GlyT2 (also
            called NET1) is a membrane-bound transporter that
            re-uptakes glycine from the synaptic cleft. Human GlyT2
            is encoded by the SLC6A5 gene. GlyT2 is expressed in
            brain and spinal cord. GlyT2 may play a role in pain, and
            in spasticity. This subgroup belongs to the solute
            carrier 6 (SLC6) transporter family.
          Length = 606

 Score =  316 bits (812), Expect = 7e-94
 Identities = 155/387 (40%), Positives = 227/387 (58%), Gaps = 55/387 (14%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+++ML L GIP+F +E+ +GQ    G + VW  I P L G GI+  +++  +A+Y
Sbjct: 38   AFLIPYLMMLALAGIPIFFLEVSLGQFASQGPVSVWKAI-PALQGCGIAMLIISVLIAIY 96

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECE---------------------- 975
            YN+I+ +  +YLF S    LPW+SC    N    PEC+                      
Sbjct: 97   YNIIMCYTLFYLFASLVEVLPWASCNNPWN---TPECKDKDKLLLDSCIISDHPKIQIKN 153

Query: 976  ------------------------LSSETAYFWYRTTLDVSPNIESPGSPKWWIVLCLLL 1011
                                    +S    YF Y   L +S  IE PG  +W + LCL L
Sbjct: 154  STFCMTAYPNLTMVNFTSVGNKTFVSGSEEYFKY-NVLKISAGIEYPGEIRWPLALCLFL 212

Query: 1012 AWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGAGLVHMYTPKVEKLL 1071
            AW+IV+  + KGI+SSG+VVYFT+ FPY+VL I  IRG+TLPGAGAG+ +  TPK EKL 
Sbjct: 213  AWVIVYASLAKGIKSSGKVVYFTATFPYVVLVILLIRGVTLPGAGAGIWYFITPKWEKLN 272

Query: 1072 DPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFA 1131
            D  VW DAATQ+F+S   A+G LI   SYN+ +NNC RD ++V+  N+ T+I+A  V+F+
Sbjct: 273  DATVWKDAATQIFFSLSAAWGGLITLSSYNKFHNNCYRDTLIVTCTNSATSIFAGFVIFS 332

Query: 1132 ILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQ 1191
            ++GF  M++  +   +  +A +G G+AF+V+ +A+  LP +PFW+IIFF+MLL+LGL + 
Sbjct: 333  VIGF--MAHELK-VNIESVADQGPGIAFVVYPEALTRLPLSPFWAIIFFLMLLTLGLDTM 389

Query: 1192 IGILEGVLCTIFD-IEIFKRIRKPILT 1217
               +E ++ ++ D    + R  KP+ T
Sbjct: 390  FATIETIVTSVSDEFPKYLRTHKPLFT 416



 Score = 81.2 bits (200), Expect = 2e-15
 Identities = 37/97 (38%), Positives = 59/97 (60%), Gaps = 1/97 (1%)

Query: 767 HLHLVGVVCLISFLV-GLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEK 825
           H  L  +VC ISF + G    T  G Y L+L D++A +  LV+IA+ E++ + +IYG ++
Sbjct: 411 HKPLFTLVCCISFFIMGFPMITQGGMYMLQLVDTYAASYSLVIIAIFELVGISYIYGLQR 470

Query: 826 FTNDIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSI 862
           F  DI  M G++P  +W++ W F+ P I++ IL  S 
Sbjct: 471 FCEDIEMMIGFQPNKFWRVCWAFVTPTILTFILCFSF 507


>gnl|CDD|212078 cd11509, SLC6sbd_CT1, Na(+)- and Cl(-)-dependent creatine transporter
            1; solute-binding domain.  CT1 (also called CRTR, CRT)
            transports creatine. Human CT1 is encoded by the SLC6A8
            gene. CT1 is ubiquitously expressed, with highest levels
            found in skeletal muscle and kidney. Creatine is absorbed
            from food or synthesized from arginine and plays an
            important role in energy metabolism. Deficiency in human
            CT1 leads to X-linked cerebral creatine transporter
            deficiency. In males, this disorder is characterized by
            language and speech delays, autistic-like behavior,
            seizures in about 50% of cases, and can also involve
            midfacial hypoplasia, and short stature. In females, it
            is characterized by mild cognitive impairment with
            behavior and learning problems. This subgroup belongs to
            the solute carrier 6 (SLC6) transporter family.
          Length = 592

 Score =  316 bits (810), Expect = 1e-93
 Identities = 147/404 (36%), Positives = 235/404 (58%), Gaps = 37/404 (9%)

Query: 845  TWRFLGPAIMSCILVSSIICRFLKKP--SYSAWNRAFLIPFMVMLILEGIPLFLIELGIG 902
            TW      IMSC+  +  +    + P   Y      FLIP+++++ + GIP+F +E+ +G
Sbjct: 3    TWTRQMDFIMSCVGFAVGLGNVWRFPYLCYKNGGGVFLIPYLLIVFVGGIPVFFLEIALG 62

Query: 903  QKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVIITWCFYYLFNSFSSSLPWSSC 962
            Q M+ G +  WN I P   G+G +S ++ FF   YY +I+ W  YYL +SF+++LPW++C
Sbjct: 63   QFMKQGGINAWN-IAPLFKGLGYASMVIVFFCNTYYIMILVWGLYYLVHSFTNTLPWATC 121

Query: 963  PTD---------------HNGTIIPE---------CE----LSSETAYFWYRTTLDVSPN 994
                               NG++            C+      S    FW    L +S  
Sbjct: 122  GHPWNTENCTEIFRHEDCANGSLANSWSQTFLNLSCDELANKRSPVIEFWENKVLRLSGG 181

Query: 995  IESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPG 1054
            +E PG+  W ++LCLL  W+IV+F V KG++S+G++VYFT++FPY+VL +  + G+TLPG
Sbjct: 182  LEEPGALNWEMILCLLATWVIVYFCVWKGVKSTGKIVYFTALFPYVVLIVLLVHGVTLPG 241

Query: 1055 AGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMV 1114
            A  G+++   P   KL +  VW+DA TQ+F+S+ +  G+L A GSYN+ +NNC RD  ++
Sbjct: 242  ALDGIIYYLKPDWSKLGEAQVWIDAGTQIFFSYAIGLGALTALGSYNRFHNNCYRDAFIL 301

Query: 1115 SICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPF 1174
            ++ N+ T+ +A  VVF++LGF A   + +G  ++ +A  G GLAFI + +A+  +P AP 
Sbjct: 302  ALINSGTSFFAGFVVFSVLGFMA---SEQGVDISKVAESGPGLAFIAYPKAVTLMPVAPL 358

Query: 1175 WSIIFFMMLLSLGLGSQIGILEGVLCTIFDI---EIFKRIRKPI 1215
            W+ +FF MLL LGL SQ   +EG +  I D+   + + R+R+ I
Sbjct: 359  WAALFFFMLLLLGLDSQFVGVEGFITGILDLFPQKYYGRLRREI 402



 Score = 58.0 bits (140), Expect = 3e-08
 Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 1/94 (1%)

Query: 771 VGVVCLISFLVGLIFTTGAGEYWLKLFDSF-AGTIGLVLIALIEMLAVMFIYGHEKFTND 829
             + CL+ FL+ L   T  G Y  +LFD + A  I L+  A  E + + ++YG ++F +D
Sbjct: 404 AALCCLLCFLIDLSMVTQGGMYVFQLFDYYSASGITLLWQAFWECVVIAWVYGADRFMDD 463

Query: 830 IYEMTGYKPGWYWQITWRFLGPAIMSCILVSSII 863
           I  M GY+P  + +  W  + P +   I +  ++
Sbjct: 464 IARMIGYRPLPWMKWCWSVITPLVCVGIFLFHVV 497


>gnl|CDD|212079 cd11510, SLC6sbd_TauT, Na(+)- and Cl(-)-dependent taurine
            transporter; solute-binding domain.  TauT is a Na(+)- and
            Cl(-)-dependent, high-affinity, low-capacity transporter
            of taurine and beta-alanine. Human TauT is encoded by the
            SLC6A6 gene. TauT is expressed in brain, retina, liver,
            kidney, heart, spleen, and pancreas. It may play a part
            in the supply of taurine to the intestinal epithelium and
            in the between-meal-capture of taurine. It may also
            participate in re-absorbing taurine that has been
            deconjugated from bile acids in the distal lumen.
            Functional TauT protects kidney cells from nephrotoxicity
            caused by the chemotherapeutic agent cisplatin; cisplatin
            down-regulates TauT in a p53-dependent manner. In mice,
            TauT has been shown to be important for the maintenance
            of skeletal muscle function and total exercise capacity.
            TauT-/- mice develop additional clinically important
            diseases, some of which are characterized by apoptosis,
            including vision loss, olfactory dysfunction, and chronic
            liver disease. This subgroup belongs to the solute
            carrier 6 (SLC6) transporter family.
          Length = 542

 Score =  312 bits (802), Expect = 3e-93
 Identities = 133/346 (38%), Positives = 202/346 (58%), Gaps = 24/346 (6%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ + L   G+P+F +E+ +GQ    G +  W  + P   G+G +S ++   + +Y
Sbjct: 38   AFLIPYFIFLFGGGLPVFFLEVALGQYTSEGGITCWEKLCPIFTGIGYASIVIVSLLNIY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPEC------------------ELSSE 979
            Y VI+ W  YYLF SF S LPW+ C    N    P C                    +S 
Sbjct: 98   YIVILAWGLYYLFQSFQSELPWAHCNQSWN---TPNCVEDTLRKNKTLWASINATNFTSP 154

Query: 980  TAYFWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPY 1039
               FW R  L +S  IE  G+ KW + LCLLL W+I FF + KG++S+G+VVYFT+ FP+
Sbjct: 155  VTEFWERNVLSLSSGIEHVGTIKWDLALCLLLVWVICFFCIWKGVKSTGKVVYFTATFPF 214

Query: 1040 IVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGS 1099
            ++L +  IRG+TLPGA  G+     P + +L DP VW+DA TQ+F+S+ +  G++ + GS
Sbjct: 215  LMLIVLLIRGVTLPGAAEGIKFYLYPDITRLGDPQVWIDAGTQIFFSYAICLGAMTSLGS 274

Query: 1100 YNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAF 1159
            YN+   NC RD +++   N+ T+  +   +F++LGF A     +G  +A +A  G GLAF
Sbjct: 275  YNKYKYNCYRDCMLLGCLNSGTSFVSGFAIFSVLGFMAQE---QGVDIADVAESGPGLAF 331

Query: 1160 IVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDI 1205
            I + +A+  +P   FW+I+FF+MLL LGL SQ   +EG + ++ D+
Sbjct: 332  IAYPKAVTMMPLPTFWAILFFIMLLLLGLDSQFVEVEGQITSLVDL 377



 Score = 72.9 bits (179), Expect = 8e-13
 Identities = 31/95 (32%), Positives = 56/95 (58%), Gaps = 1/95 (1%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGT-IGLVLIALIEMLAVMFIYGHEKFTN 828
            + +VC IS+L+GL   T  G Y  +LFD +A + + L+ +A  E +AV ++YG + F +
Sbjct: 391 FIAIVCFISYLLGLTMVTEGGMYVFQLFDYYAASGVCLLWVAFFECIAVAWVYGADNFYD 450

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSII 863
            I +M GY+PG + + +W  + P +     + S++
Sbjct: 451 AIEDMIGYRPGPWMKWSWIVITPVLCVGCFIFSLV 485


>gnl|CDD|212070 cd11501, SLC6sbd_ATB0, Na(+)- and Cl(-)-dependent beta-alanine
            transporter ATB0+; solute-binding domain.  ATB0+ (also
            known as the beta-alanine carrier) is a transmembrane
            transporter with a broad substrate specificity; it can
            transport non-alpha-amino acids such as beta-alanine with
            low affinity, and can transport dipolar and cationic
            amino acids such as leucine and lysine, with a higher
            affinity. It may have a role in the absorption of
            essential nutrients and drugs in the distal regions of
            the human gastrointestinal tract. Human ATB0+ is encoded
            by the SLC6A14 gene. ATB0+ is expressed in the lung,
            trachea, salivary gland, mammary gland, stomach, and
            pituitary gland. ATB0+ may play a role in obesity, and
            its upregulation may have a pathogenic role in colorectal
            cancer. This subgroup belongs to the solute carrier 6
            (SLC6) transporter family.
          Length = 601

 Score =  313 bits (804), Expect = 8e-93
 Identities = 152/378 (40%), Positives = 216/378 (57%), Gaps = 45/378 (11%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ +ML L G+PLF +E  +GQ   LG + VW  + P L GVGI+  LV+ FV++Y
Sbjct: 38   AFLIPYTIMLALAGLPLFFLECSLGQFASLGPISVWRAV-PLLQGVGITMVLVSTFVSIY 96

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHN----GTIIPECELS---------------- 977
            YNVII +  YY+F SF S LPWS C +  +     + I  C +S                
Sbjct: 97   YNVIIAYSLYYMFASFQSPLPWSDCSSWADENCSTSPITHCNVSLANNGIHENKSWVDSN 156

Query: 978  ---------------SETAYFWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMK 1022
                             +  +W +  L  S  ++  G   W++ LCLLLAWIIV   + K
Sbjct: 157  NLTCINISIIYQPGQLPSEQYWDKVALQRSSGMDETGPIVWYLALCLLLAWIIVGAALFK 216

Query: 1023 GIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGAGLVHMYTPK---VEKLLDPNVWLDA 1079
            GI+SSG+VVYFT++FPY+VL I  +RG TL GA  G +  Y      + KL++  VW DA
Sbjct: 217  GIKSSGKVVYFTALFPYVVLLILLVRGATLEGAKDG-IEYYIGSQSNLTKLMEAEVWKDA 275

Query: 1080 ATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMS 1139
            ATQ+FYS  +A+G L+A  SYN+ +NNC  D I+V + N LT+++A   +F++LG  A  
Sbjct: 276  ATQIFYSLSVAWGGLVALSSYNKFHNNCFSDAIVVCVTNCLTSVFAGFAIFSVLGHMAH- 334

Query: 1140 NAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVL 1199
                   ++ +   G  LAFI + +A+ +LP +P WSI+FF MLL+LGL SQ   +E + 
Sbjct: 335  --VLKKPVSEVVDSGFDLAFIAYPEALAKLPISPLWSILFFFMLLTLGLDSQFAGIETIT 392

Query: 1200 CTIFDI--EIFKRIRKPI 1215
             TI D   ++ K+ R PI
Sbjct: 393  TTIQDAFPKVMKKKRIPI 410



 Score = 88.7 bits (220), Expect = 1e-17
 Identities = 41/126 (32%), Positives = 67/126 (53%), Gaps = 13/126 (10%)

Query: 774 VCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEM 833
           VC+I FL+GL   T AG YW+ L D F    G++  A++E++ +++IYG  +F  DI  M
Sbjct: 414 VCIILFLLGLPCVTQAGIYWVNLIDHFCAGWGILFAAVLELIGIIWIYGGNRFIEDIEMM 473

Query: 834 TGYKPGW---YWQITWRFLGPAIMSCILVSSIICRFLKKPSYSA-----WNRAF---LIP 882
            G K      +W+  W F+ P ++  IL+ S++      P+Y +     W  A    +I 
Sbjct: 474 IGKKSWIFWLWWRACWFFITPVLLIAILIWSLV--TFHSPTYGSVPYPDWGTALGWCMIA 531

Query: 883 FMVMLI 888
           F ++ I
Sbjct: 532 FCLIWI 537


>gnl|CDD|212080 cd11511, SLC6sbd_BGT1, Na(+)- and Cl(-)-dependent betaine/GABA
            transporter-1, and related proteins; solute-binding
            domain.  BGT1 is a relatively low-affinity transporter of
            gamma-aminobutyric acid (GABA), and can also transport
            betaine. GABA is the main inhibitory neurotransmitter
            within the mammalian CNS. Human BGT1 is encoded by the
            SLC6A12 gene, and is similar to mouse GAT2. Mouse GAT2
            plays a role in transporting GABA across the blood-brain
            barrier. In addition to being expressed in cells of the
            central nervous system, BGT1 is expressed in peripheral
            tissues, including kidney, liver, and heart. An
            association has been shown between the SLC6A12 gene and
            the occurrence of aspirin-intolerant asthma, and BGT1 is
            a drug target for antiepileptic drugs. This subgroup
            belongs to the solute carrier 6 (SLC6) transporter
            family.
          Length = 541

 Score =  306 bits (784), Expect = 8e-91
 Identities = 132/341 (38%), Positives = 202/341 (59%), Gaps = 16/341 (4%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+++ L   GIPLF +E  +GQ    G +  W  I P   G+G +S ++  ++ +Y
Sbjct: 39   AFLIPYLIFLFTCGIPLFFLETALGQYTSQGGVTAWRKICPIFEGIGYASQVIESYLNVY 98

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHN-------------GTIIPECELSSETAYFW 984
            Y +I++W  +YLF+SF+S LPW++C    N                 P    +S    FW
Sbjct: 99   YIIILSWALFYLFSSFTSVLPWATCNNPWNSDLCVDFLNHSGADNRTPPENATSPVMEFW 158

Query: 985  YRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTI 1044
             +  L ++  I   G+ +W + LCLLLAWII +F + KG++S+G+VVYFT+ FPY++L I
Sbjct: 159  EKRVLGLTDGIHKLGTVRWELALCLLLAWIICYFCIWKGVKSTGKVVYFTATFPYLMLVI 218

Query: 1045 FFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPN 1104
              +RG+TLPGA  G++    P + +L DP VW+DA TQ+ +S+ +  G L A GSYN+  
Sbjct: 219  LLVRGVTLPGAAEGIIFYLKPDLSRLADPQVWMDAGTQILFSYAICQGCLTALGSYNKYT 278

Query: 1105 NNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQ 1164
            NNC RD IM+   N+ T+  A   +F++LGF A     +G  ++ +A  G GLAFI +  
Sbjct: 279  NNCYRDCIMLCFLNSATSFVAGFAIFSVLGFMARE---QGVPISEVAESGPGLAFIAYPT 335

Query: 1165 AIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDI 1205
            A+  +P +  WS +FF+ML+ LGL SQ   +E ++    D+
Sbjct: 336  AVTMMPVSQLWSCLFFLMLIFLGLDSQFVCVESMVTASIDM 376



 Score = 67.6 bits (165), Expect = 3e-11
 Identities = 36/137 (26%), Positives = 71/137 (51%), Gaps = 14/137 (10%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGT-IGLVLIALIEMLAVMFIYGHEKFTN 828
           L+  + +I +L+GL+  T  G Y  +LFD +A +   L+ +A+ E++ + ++YG  +F +
Sbjct: 390 LILAIAVICYLLGLLLVTEGGMYIFQLFDYYAASGTCLLFLAIFEVICIGWVYGANRFYD 449

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNRAFLIP------ 882
           +I +M GY+P    +I W    P +   + + S+I     K +   +N +++ P      
Sbjct: 450 NIEDMIGYRPWPLIKICWLVFTPGLCLAVFLFSLI-----KYTPLKYNNSYVYPPWGYVL 504

Query: 883 --FMVMLILEGIPLFLI 897
              M +  +  IPL+ I
Sbjct: 505 GWLMALSSMVCIPLYAI 521


>gnl|CDD|212067 cd11498, SLC6sbd_GlyT1, Na(+)- and Cl(-)-dependent glycine
            transporter GlyT1; solute-binding domain.  GlyT1 is a
            membrane-bound transporter that re-uptakes glycine from
            the synaptic cleft. Human GlyT1 is encoded by the SLC6A9
            gene. GlyT1 is expressed in brain, pancreas, uterus,
            stomach, spleen, liver, and retina. GlyT1 may play a role
            in schizophrenia. This subgroup belongs to the solute
            carrier 6 (SLC6) transporter family.
          Length = 585

 Score =  303 bits (777), Expect = 2e-89
 Identities = 145/355 (40%), Positives = 217/355 (61%), Gaps = 32/355 (9%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AF+ P+ +ML+  GIPLF +EL  GQ    G LGVW  + P   GVG    +V+ ++ +Y
Sbjct: 54   AFMFPYFIMLVFCGIPLFFMELSFGQFASQGCLGVWR-VSPMFKGVGYGMMVVSTYIGIY 112

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSC--------------PTDHNGTIIPECEL------- 976
            YNV+I   FYY F S +  LPW+ C                  N T      +       
Sbjct: 113  YNVVICIAFYYFFMSMTRVLPWTYCNNPWNTPDCAGVLSTNRSNATFANMSAVVSNLTEL 172

Query: 977  -------SSETAYFWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGR 1029
                   +S +  +W R  L++S +I + G  +  ++ CL ++W++VF  +++G++SSG+
Sbjct: 173  LNRTLKRTSPSEEYWRRYVLNISDDIGNFGEVRLPLLGCLGVSWLVVFLCLIRGVKSSGK 232

Query: 1030 VVYFTSMFPYIVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGL 1089
            VVYFT+ FPY+VLTI FIRGITL GA +G+++  TP+ +K+LD  VW DAA+Q+FYS G 
Sbjct: 233  VVYFTATFPYVVLTILFIRGITLEGAISGIMYYLTPQWDKILDAKVWGDAASQIFYSLGC 292

Query: 1090 AFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAF 1149
            A+G LI   SYN+ +NNC RD I++SI N  T++YA  V+F+ILGF  M+N   G  ++ 
Sbjct: 293  AWGGLITMASYNKFHNNCYRDSIIISITNCATSVYAGFVIFSILGF--MANHL-GVDVSE 349

Query: 1150 IAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFD 1204
            +A  G GLAF+ + +A+  LP +P WS++FF ML+ LGLG+Q  +LE ++  + D
Sbjct: 350  VADHGPGLAFVAYPEALTLLPISPLWSLLFFFMLILLGLGTQFCLLETLVTAVVD 404



 Score = 95.4 bits (237), Expect = 7e-20
 Identities = 41/90 (45%), Positives = 61/90 (67%)

Query: 774 VCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEM 833
           V +I FL+G+  TT AG YWL L D++A +  LV+I+ I  +A+M+IYGH  +  DI  M
Sbjct: 422 VAIIGFLLGIPLTTQAGIYWLLLMDNYAASFSLVIISCIMCIAIMYIYGHRNYFKDIEMM 481

Query: 834 TGYKPGWYWQITWRFLGPAIMSCILVSSII 863
            G+ P  ++QI WRF+ PAI+  IL+ ++I
Sbjct: 482 LGFPPPLFFQICWRFVSPAIIFFILIFTVI 511


>gnl|CDD|212076 cd11507, SLC6sbd_GAT2, Na(+)- and Cl(-)-dependent GABA transporter 2;
            solute-binding domain.  This family includes human GAT2
            (hGAT2) which transports gamma-aminobutyric acid (GABA).
            GABA is the main inhibitory neurotransmitter within the
            mammalian CNS. hGAT2 is encoded by the SLC6A13 gene, and
            is similar to mouse GAT-3, and rat GAT2. hGAT2 is
            expressed in brain, kidney, lung, and testis. hGAT2 is a
            potential drug target for treatment of epilepsy. This
            subgroup belongs to the solute carrier 6 (SLC6)
            transporter family.
          Length = 544

 Score =  301 bits (772), Expect = 3e-89
 Identities = 137/350 (39%), Positives = 206/350 (58%), Gaps = 23/350 (6%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AF IP+++ L   GIP+F +E  +GQ    G +  W  I P   G+G +S ++   +  Y
Sbjct: 38   AFFIPYLIFLFTCGIPVFFLETALGQYTSQGGVTCWRKICPLFEGIGYASQVIVALLNFY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSC---------------PTDHNGTIIPECELSSETAY 982
            Y +++ W  +YLFNSF+  LPWSSC                   N T+      +S    
Sbjct: 98   YIIVLAWALFYLFNSFTWELPWSSCNHTWNTENCVEFQKRNYSVNVTLE---NATSPVIE 154

Query: 983  FWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVL 1042
            FW R  L +S  IE  GS +W + LCLLLAW+I +F + KG++S+G+VVYFT+ FPY++L
Sbjct: 155  FWERRVLRISDGIEHIGSLRWELALCLLLAWVICYFCIWKGVKSTGKVVYFTATFPYLML 214

Query: 1043 TIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQ 1102
             +  IRG+TLPGA  G+     P + +L DP VW+DA TQ+F+S+ +  G L A GSYN+
Sbjct: 215  VVLLIRGVTLPGASLGIQFYLYPDLSRLADPQVWMDAGTQIFFSYAICLGCLTALGSYNK 274

Query: 1103 PNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVF 1162
             NNNC RD I +   N+ T+  A   +F+ILGF A     +G  ++ +A  G GLAFI +
Sbjct: 275  YNNNCYRDCIALCFLNSGTSFVAGFAIFSILGFMAYE---QGVPISEVAESGPGLAFIAY 331

Query: 1163 TQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDI--EIFKR 1210
             +A+  +P +P W+  FF+M++ LGL SQ   +E ++  + D+  E+F++
Sbjct: 332  PRAVSMMPFSPLWACFFFIMIVLLGLDSQFVCVESLVTAMVDMYPEVFRK 381



 Score = 75.7 bits (186), Expect = 9e-14
 Identities = 46/135 (34%), Positives = 78/135 (57%), Gaps = 10/135 (7%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGT-IGLVLIALIEMLAVMFIYGHEKFTN 828
           L+  V ++SFLVGLI  T  G Y  +LFD +A + + L+ +A+ E L + ++YG ++F +
Sbjct: 388 LILFVSIVSFLVGLIMLTEGGMYVFQLFDYYAASGMCLLFVAIFETLCIAWVYGADRFYD 447

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSII----CRFLKKPSYSAWNRA--FLIP 882
           +I +M GY+PG Y +  W F+ PA+     + S+I     ++ K+  Y  W  A  +L+ 
Sbjct: 448 NIEDMIGYRPGPYIKYCWLFITPAVCVATFLFSLIKYTPLKYNKEYVYPWWGDAIGWLLA 507

Query: 883 FMVMLILEGIPLFLI 897
              M+    IPL++I
Sbjct: 508 LSSMVC---IPLWII 519


>gnl|CDD|212081 cd11512, SLC6sbd_NET, Na(+)- and Cl(-)-dependent norepinephrine
            transporter NET; solute-binding domain.  NET (also called
            NAT1, NET1), is a transmembrane transporter that
            transports the neurotransmitter norepinephrine from
            synaptic spaces into presynaptic neurons. Human NET is
            encoded by the SLC6A2 gene. NET is expressed in brain,
            peripheral nervous system, adrenal gland, and placenta.
            NET may play a role in diseases or disorders including
            depression, orthostatic intolerance, anorexia nervosa,
            cardiovascular diseases, alcoholism, and
            attention-deficit hyperactivity disorder. This subgroup
            belongs to the solute carrier 6 (SLC6) transporter
            family.
          Length = 560

 Score =  298 bits (763), Expect = 8e-88
 Identities = 144/355 (40%), Positives = 213/355 (60%), Gaps = 23/355 (6%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+ + LI+ G+PLF +EL +GQ  R G+  VW  I P   GVG +  L+  +V  Y
Sbjct: 38   AFLIPYTLFLIIAGMPLFYMELALGQYNREGAATVWK-ICPCFKGVGYAVILIALYVGFY 96

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSC------PTDHNGTIIPECELSSETAYFWYRTT--- 988
            YNVII W  YYLF+SF+S LPW++C      P   +  ++    L + T Y  Y+ T   
Sbjct: 97   YNVIIAWSLYYLFSSFTSELPWTTCGNVWNSPNCTDPKLLNASVLGNGTKYSKYKLTPAA 156

Query: 989  ---------LDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPY 1039
                     L  S  I   G P+W + LCL +  I++FF + KG+++SG+VV+ T+  PY
Sbjct: 157  EFYERGVLHLHESAGIHDLGLPRWQLTLCLFVVVIVLFFSLWKGVKTSGKVVWITATLPY 216

Query: 1040 IVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGS 1099
            +VL +  I G+TLPGA  G+        ++L +  VW+DAATQ+FYS G  FG LIAF S
Sbjct: 217  VVLFVLLIHGVTLPGAFNGINAYLHIDFKRLKEATVWIDAATQIFYSLGAGFGVLIAFAS 276

Query: 1100 YNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAF 1159
            YN+ +NNC RD ++ S  N +T+  +   +F+ILG+ A  +  +   +  +A EG GL F
Sbjct: 277  YNKFDNNCYRDALLTSTINCVTSFISGFAIFSILGYMAHEHKVK---IEDVATEGAGLVF 333

Query: 1160 IVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIF-DIEIFKRIRK 1213
            I++ +AI  L G+ FW+++FF+MLL+LG+ S +G +E V+  +  D  I K+ RK
Sbjct: 334  ILYPEAISTLSGSTFWAVVFFIMLLTLGIDSSMGGMEAVITGLADDFSILKKHRK 388



 Score = 87.3 bits (216), Expect = 2e-17
 Identities = 34/90 (37%), Positives = 57/90 (63%)

Query: 774 VCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEM 833
           V  I+FL+ L   T  G Y L L D+FA    ++   L+E + V + YG ++F++DI +M
Sbjct: 394 VAFITFLLALFCITNGGIYVLTLLDTFAAGTSILFAVLMEAIGVSWFYGVDRFSDDIQQM 453

Query: 834 TGYKPGWYWQITWRFLGPAIMSCILVSSII 863
            G++PG YW++ W+F+ PA +  +++ SII
Sbjct: 454 MGFRPGLYWRLCWKFVSPAFLLFVVIVSII 483


>gnl|CDD|212077 cd11508, SLC6sbd_GAT3, Na(+)- and Cl(-)-dependent GABA transporter 3;
            solute-binding domain.  This family includes human GAT3
            (hGAT3) a high-affinity transporter of gamma-aminobutyric
            acid (GABA). GABA is the main inhibitory neurotransmitter
            within the mammalian CNS. hGAT3 is encoded by the SLC6A11
            gene, and is similar to mouse GAT4, and rat GAT3/GATB.
            GAT3 is expressed primarily in the glia of the brain, and
            is a potential drug target for antiepileptic drugs. This
            subgroup belongs to the solute carrier 6 (SLC6)
            transporter family.
          Length = 542

 Score =  294 bits (754), Expect = 8e-87
 Identities = 135/351 (38%), Positives = 206/351 (58%), Gaps = 23/351 (6%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP++V  I  GIP+F +E  +GQ    G +  W  + P   G+G ++ ++   + +Y
Sbjct: 39   AFLIPYVVFFICCGIPVFFLETALGQFTSEGGITCWRKVCPLFEGIGYATQVIEAHLNVY 98

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSC----------------PTDHNGTIIPECELSSETA 981
            Y +I+ W  +YLFN F++ LPW++C                 ++ +   +     +S   
Sbjct: 99   YIIILAWAIFYLFNCFTTELPWATCGHEWNTENCVEFQKLNSSNCSHVSLQNA--TSPVM 156

Query: 982  YFWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIV 1041
             FW R  L +S  IE  G+ +W + LCLL AW I +F + KG +S+G+VVY T+ FPYI+
Sbjct: 157  EFWERRVLAISDGIEHIGNLRWELALCLLAAWTICYFCIWKGTKSTGKVVYVTATFPYIM 216

Query: 1042 LTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYN 1101
            L I  IRG+TLPGA  G+     P + +L DP VW+DA TQ+F+S+ +  G L A GSYN
Sbjct: 217  LLILLIRGVTLPGASEGIKFYLYPDLSRLSDPQVWVDAGTQIFFSYAICLGCLTALGSYN 276

Query: 1102 QPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIV 1161
              NNNC RD IM+   N+ T+  A   +F++LGF A     +G  +A +A  G GLAFI 
Sbjct: 277  NYNNNCYRDCIMLCCLNSGTSFVAGFAIFSVLGFMAYE---QGVPIAEVAESGPGLAFIA 333

Query: 1162 FTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDI--EIFKR 1210
            + +A+  +P +P W+ +FFMML+ LGL SQ   +E ++  + D+  ++F+R
Sbjct: 334  YPKAVTMMPLSPLWATLFFMMLIFLGLDSQFVCVESLVTAVVDMYPKVFRR 384



 Score = 67.7 bits (165), Expect = 3e-11
 Identities = 34/135 (25%), Positives = 71/135 (52%), Gaps = 10/135 (7%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGT-IGLVLIALIEMLAVMFIYGHEKFTN 828
           L+  + ++S+ +GL+  T  G Y  +LFDS+A + + L+ +A+ E + + ++YG  +F +
Sbjct: 391 LILALSIVSYFLGLVMLTEGGMYIFQLFDSYAASGMCLLFVAIFECICIGWVYGSNRFYD 450

Query: 829 DIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSII----CRFLKKPSYSAWNRA--FLIP 882
           +I +M GY+P    +  W  + P I + I +  ++     ++    +Y  W     +L+ 
Sbjct: 451 NIEDMIGYRPLSLIKWCWMVITPGICAGIFIFFLVKYKPLKYNNVYTYPDWGYGIGWLMA 510

Query: 883 FMVMLILEGIPLFLI 897
              M+    IPL++ 
Sbjct: 511 LSSMIC---IPLWIC 522


>gnl|CDD|212083 cd11514, SLC6sbd_DAT1, Na(+)- and Cl(-)-dependent dopamine
            transporter 1; solute-binding domain.  DAT1 (also called
            DAT), is a plasma membrane transport protein that
            functions at the dopaminergic synapses to transport
            dopamine from the extracellular space back into the
            presynaptic nerve terminal. Human DAT1 is encoded by the
            SLC6A3 gene, and is expressed in the brain. DAT1 may play
            a role in diseases or disorders related to dopaminergic
            neurons, including attention-deficit hyperactivity
            disorder (ADHD), Tourette syndrome, Parkinson's disease,
            alcoholism, drug abuse, schizophrenia, extraversion, and
            risky behavior. This subgroup belongs to the solute
            carrier 6 (SLC6) transporter family.
          Length = 555

 Score =  285 bits (731), Expect = 1e-83
 Identities = 134/356 (37%), Positives = 210/356 (58%), Gaps = 29/356 (8%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFL+P++  +++ G+PLF +EL +GQ  R G+ GVW  I P   GVG +  L++ +V  +
Sbjct: 38   AFLVPYLFFMVIAGMPLFYMELALGQFNREGAAGVWK-ICPIFKGVGFTVILISLYVGFF 96

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFW--------YRTT- 988
            YNVII W  +YLF+SF+  LPW  C    N    P C  +               Y+TT 
Sbjct: 97   YNVIIAWALFYLFSSFTGELPWIHCNNSWNS---PNCSDAHPGDSSGDSSGLNDTYKTTP 153

Query: 989  -----------LDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMF 1037
                       L  S  I   G P+W +  CL++  ++++F + KG+++SG+VV+ T+  
Sbjct: 154  AAEYFERGVLHLHESHGIHDLGPPRWQLTSCLVVVIVLLYFSLWKGVKTSGKVVWITATM 213

Query: 1038 PYIVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAF 1097
            PY+VLT   +RG+TLPGA  G+    +    +L + +VW+DAATQ+ +S G+ FG LIAF
Sbjct: 214  PYVVLTALLLRGVTLPGAIDGIKAYLSVDFLRLCEASVWIDAATQICFSLGVGFGVLIAF 273

Query: 1098 GSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGL 1157
             SYN+  NNC RD I+ +  N+LT+ ++  VVF+ LG+ A  +      +  +A +G GL
Sbjct: 274  SSYNKFTNNCYRDAIITTSINSLTSFFSGFVVFSFLGYMAQKH---NVPIGDVAKDGPGL 330

Query: 1158 AFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRK 1213
             FI++ +AI  LP +  W++IFF+MLL+LG+ S +G +E V+  +  I+ FK + +
Sbjct: 331  IFIIYPEAIATLPLSSVWAVIFFIMLLTLGIDSAMGGMESVITGL--IDEFKFLHR 384



 Score = 76.9 bits (189), Expect = 4e-14
 Identities = 34/102 (33%), Positives = 57/102 (55%), Gaps = 2/102 (1%)

Query: 774 VCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEM 833
           + L +FL+ L   T  G Y   L D FA    ++   LIE + + + YG  +F++DI EM
Sbjct: 393 IVLSTFLISLFCVTNGGIYVFTLLDHFAAGTSILFGVLIEAIGIAWFYGVGRFSDDIEEM 452

Query: 834 TGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAW 875
            G +PG YW++ W+F+ P  +  ++V SI+    + P Y ++
Sbjct: 453 IGQRPGLYWRLCWKFVSPCFLLFVVVVSIVT--FRPPKYGSY 492


>gnl|CDD|212092 cd11554, SLC6sbd_u2, uncharacterized eukaryotic solute carrier 6
            subfamily; solute-binding domain.  SLC6 proteins (also
            called the sodium- and chloride-dependent
            neurotransmitter transporter family or Na+/Cl--dependent
            transporter family) include neurotransmitter transporters
            (NTTs): these are sodium- and chloride-dependent plasma
            membrane transporters for the monoamine neurotransmitters
            serotonin (5-hydroxytryptamine), dopamine, and
            norepinephrine, and the amino acid neurotransmitters GABA
            and glycine. These NTTs are widely expressed in the
            mammalian brain, and are involved in regulating
            neurotransmitter signaling and homeostasis, and are the
            target of a range of therapeutic drugs for the treatment
            of psychiatric diseases. Bacterial members of the SLC6
            family include the LeuT amino acid transporter.
          Length = 406

 Score =  258 bits (660), Expect = 8e-76
 Identities = 102/355 (28%), Positives = 166/355 (46%), Gaps = 59/355 (16%)

Query: 866  FLKKPSYSAWNR--AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGV 923
             L+ PS    N    F IP+++ L+L  IP+ ++EL +GQ  R G +  +N I+    GV
Sbjct: 23   LLRFPSVVYNNGGLQFFIPYLLALVLLAIPILILELALGQAYRGGCVKAFNLINRRAKGV 82

Query: 924  GISSCLVTFFVALYYNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYF 983
            G +     F V  YY  I+ W   Y  +S                               
Sbjct: 83   GAAVVFGGFVVCTYYVPILAWIMVYFRHS------------------------------- 111

Query: 984  WYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLT 1043
                                + V  L   W +V+  V +G+ S+GRVVY T   P +++ 
Sbjct: 112  --------------------FTVGWLAFTWFVVWLCVFRGVGSTGRVVYITMGLPIVLII 151

Query: 1044 IFFIRGITLPGAGAGLVHMYTP-KVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQ 1102
            I   RG +LP AG G+   +   + +KL    +W  AA QVF+S G+ FG + A+ SYN 
Sbjct: 152  ILLGRGASLPNAGDGIRLYFATWRGDKLASGEIWQAAAGQVFFSTGVGFGYMTAYASYNS 211

Query: 1103 PNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVF 1162
              +N V+D ++++  N+L  I A   VF I+G+  M+   EG  L      G  L F+ +
Sbjct: 212  KFSNAVQDALIIAGSNSLFEIIAGFAVFGIVGYLGMTPGVEGEALG-----GFVLGFVTY 266

Query: 1163 TQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKPILT 1217
             +AI ++PG+ F++++FF+ L  LG+ S   +LE V+  I D    +++++  + 
Sbjct: 267  PEAIAQMPGSNFFAVLFFLTLFLLGISSAFALLEAVVTLICDSGWGRKVKRWWIA 321



 Score = 44.6 bits (106), Expect = 4e-04
 Identities = 22/85 (25%), Positives = 38/85 (44%), Gaps = 6/85 (7%)

Query: 773 VVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYE 832
           VVC +SFL+ L + T  G Y L   D++   + LV +   E +A   +Y  E    D+ +
Sbjct: 323 VVCFVSFLLSLPYCTRFGYYLLDAVDAWISNVALVFVVWAECVAATTVYRFE----DVVD 378

Query: 833 MTGYKPGWYWQITWRFLGPAIMSCI 857
             G     Y+ +   + G   +  +
Sbjct: 379 QVGLPAFAYYNVG--YAGGQTLGAV 401


>gnl|CDD|212042 cd10334, SLC6sbd_u1, uncharacterized bacterial and archaeal solute
            carrier 6 subfamily; solute-binding domain.  SLC6
            proteins (also called the sodium- and chloride-dependent
            neurotransmitter transporter family or Na+/Cl--dependent
            transporter family) include neurotransmitter transporters
            (NTTs): these are sodium- and chloride-dependent plasma
            membrane transporters for the monoamine neurotransmitters
            serotonin (5-hydroxytryptamine), dopamine, and
            norepinephrine, and the amino acid neurotransmitters GABA
            and glycine. These NTTs are widely expressed in the
            mammalian brain, involved in regulating neurotransmitter
            signaling and homeostasis, and the target of a range of
            therapeutic drugs for the treatment of psychiatric
            diseases. Bacterial members of the SLC6 family include
            the LeuT amino acid transporter.
          Length = 480

 Score =  260 bits (666), Expect = 1e-75
 Identities = 109/329 (33%), Positives = 174/329 (52%), Gaps = 24/329 (7%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALY 937
            AFLIP+++ L+  GIPL ++E G+G K R  +   +  I+     +G    LV F +A Y
Sbjct: 38   AFLIPYLIALLTAGIPLLILEYGLGHKFRGSAPLAFRRINKKFEWIGWWQVLVAFVIATY 97

Query: 938  YNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPNIES 997
            Y VII W   YLF SF+  L W   P                 A+F +   L +S     
Sbjct: 98   YAVIIAWALSYLFFSFT--LAWGDDPE----------------AFF-FGDFLQLSDGPFD 138

Query: 998  PGSPKWWIVLCLLLAWIIVFFIVMKGIQSS-GRVVYFTSMFPYIVLTIFFIRGITLPGAG 1056
             G   W I++ L+L W+I++FI+ +G++    +         +++  I  IR +TLPGA 
Sbjct: 139  LGGFVWPILIALVLVWLIIWFILYRGVKKGIEKANKIFMPLLFVLFLILVIRALTLPGAA 198

Query: 1057 AGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSI 1116
             GL  ++TP    LLDP VW+ A  Q+F+S  + FG +I + SY    ++   + ++V+ 
Sbjct: 199  DGLNALFTPDWSALLDPKVWIAAYGQIFFSLSVGFGIMITYASYLPKKSDLTNNALIVAF 258

Query: 1117 CNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVEL-PGAPFW 1175
             N+   I A + VF+ILGF  M+ A  G  ++ + + G GLAF+ F +AI  + PG   +
Sbjct: 259  ANSSFEILAGIGVFSILGF--MAQAQ-GVPVSEVVSSGIGLAFVAFPKAINLMPPGGALF 315

Query: 1176 SIIFFMMLLSLGLGSQIGILEGVLCTIFD 1204
             ++FF+ L+  GL S I ++E V+  + D
Sbjct: 316  GVLFFLSLVFAGLTSLISLVEVVISALQD 344



 Score = 88.3 bits (220), Expect = 7e-18
 Identities = 41/127 (32%), Positives = 67/127 (52%), Gaps = 3/127 (2%)

Query: 771 VGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDI 830
           V +VCL+ FLV L+F TGAG Y L + D F    G+VL+ L+E++ V +++G EK    +
Sbjct: 354 VTIVCLVGFLVSLLFATGAGLYLLDIVDHFINNYGIVLVGLVEVIVVGWVFGAEKLREHL 413

Query: 831 YEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKP--SYSAWNRAFLIPFMVMLI 888
             ++ +K G +W    + + P ++  +L+  +I   L +P   Y  W        +V LI
Sbjct: 414 NAVSDFKVGKWWDFLIKVVTPLVLGYMLILGLI-GLLTEPYGGYPVWALLIFGWGIVALI 472

Query: 889 LEGIPLF 895
           L    L 
Sbjct: 473 LVAALLL 479


>gnl|CDD|241264 cd01230, PH1_Tiam1_2, T-lymphoma invasion and metastasis 1 and 2
           Pleckstrin Homology (PH) domain, N-terminal domain.
           Tiam1 activates Rac GTPases to induce membrane ruffling
           and cell motility while Tiam2 (also called STEF (SIF
           (still life) and Tiam1 like-exchange factor) contributes
           to neurite growth. Tiam1/2 are Dbl-family of GEFs that
           possess a Dbl(DH) domain with a PH domain in tandem.
           DH-PH domain catalyzes the GDP/GTP exchange reaction in
           the GTPase cycle and facillitating the switch between
           inactive GDP-bound and active GTP-bound states. Tiam1/2
           possess two PH domains, which are often referred to as
           PHn and PHc domains. The DH-PH tandem domain is made up
           of the PHc domain while the PHn is part of a novel
           N-terminal PHCCEx domain which is made up of the PHn
           domain, a coiled coil region(CC), and an extra region
           (Ex). PHCCEx mediates binding to plasma membranes and
           signalling proteins in the activation of Rac GTPases.
           The PH domain resembles the beta-spectrin PH domain,
           suggesting non-canonical phosphatidylinositol binding.
           CC and Ex form a positively charged surface for protein
           binding. There are 2 motifs in Tiam1/2-interacting
           proteins that bind to the PHCCEx domain: Motif-I in
           CD44, ephrinBs, and the NMDA receptor and Motif-II in
           Par3 and JIP2.Neither of these fall in the PHn domain.
           PH domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 127

 Score =  230 bits (588), Expect = 3e-70
 Identities = 72/120 (60%), Positives = 89/120 (74%), Gaps = 1/120 (0%)

Query: 1   FLSVKKWLLRKKH-QIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVD 59
           +LSVK +L  KK  ++ELA +R WK YWV LKG TLLFY CD R G   ++ PKH + V+
Sbjct: 8   WLSVKNFLTHKKKKKLELASRRKWKKYWVTLKGCTLLFYECDERTGIDDNSTPKHALFVE 67

Query: 60  GAIMQPIPEHPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSACAAAFARHRGKTG 119
           G+I Q +PEHPK+D +FCLS +FGDAYLFQA  Q ELENW+ +IHSACA+AFAR  GK  
Sbjct: 68  GSIAQAVPEHPKKDNVFCLSNSFGDAYLFQATSQTELENWITAIHSACASAFARQHGKED 127


>gnl|CDD|223805 COG0733, COG0733, Na+-dependent transporters of the SNF family
            [General function prediction only].
          Length = 439

 Score =  193 bits (494), Expect = 2e-53
 Identities = 97/327 (29%), Positives = 153/327 (46%), Gaps = 51/327 (15%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHP-----WLGGVGISSCLVTF 932
            AFL+P+++ L+L GIPL L E  IG++ R  ++G +  + P     W+G  G+   L  F
Sbjct: 42   AFLLPYLIFLLLVGIPLLLAEFAIGRRGRKNAVGAFRKLAPKKKWEWIGWFGV---LGGF 98

Query: 933  FVALYYNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVS 992
             +  +Y+VI  W   YL  S + +LP         G          + A  + +T     
Sbjct: 99   LILSFYSVIGGWILSYLVKSLTGALP-------DTGE---------QFAQLFGQT----- 137

Query: 993  PNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSS-GRVVYFTSMFP--YIVLTIFFIRG 1049
              I +P       V+  LL  ++   IV +G++    +      M P  +++  I  IR 
Sbjct: 138  --ISNP----GLAVIFHLLFLVLTALIVSRGVKKGIEKANKI--MMPLLFVLFIILVIRA 189

Query: 1050 ITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVR 1109
            +TLPGA  GL  ++ P   KL DP VWL A  Q F+S  L FG +I + SY    ++ V 
Sbjct: 190  VTLPGAMEGLKFLFKPDFSKLTDPKVWLAALGQAFFSLSLGFGIMITYSSYLSKKSDLVS 249

Query: 1110 DVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVEL 1169
              + + + N L ++ A +V+F  L                 A++G GL FIV      ++
Sbjct: 250  SALSIVLLNTLISLLAGLVIFPALFSFGAD-----------ASQGPGLVFIVLPAVFNQM 298

Query: 1170 PGAPFWSIIFFMMLLSLGLGSQIGILE 1196
            P    + I+FF++LL   L S I +LE
Sbjct: 299  PLGTLFGILFFLLLLFAALTSAISMLE 325



 Score = 55.3 bits (134), Expect = 2e-07
 Identities = 17/91 (18%), Positives = 36/91 (39%), Gaps = 1/91 (1%)

Query: 773 VVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYE 832
           ++ ++ FL+G+      G     L D     I + L AL+  + V ++   E    ++  
Sbjct: 345 LIGILIFLLGIPSILSFGLSIFDLVDFVVSNILMPLGALLIAIFVGWVLKKELLREEL-N 403

Query: 833 MTGYKPGWYWQITWRFLGPAIMSCILVSSII 863
               K    W    R++ P ++  +L+    
Sbjct: 404 AGSAKLFKIWLYLVRYIAPIVILVVLILGFY 434


>gnl|CDD|212043 cd10336, SLC6sbd_Tyt1-Like, solute carrier 6 subfamily, Fusobacterium
            nucleatum Tyt1-like; solute-binding domain.  SLC6
            proteins (also called the sodium- and chloride-dependent
            neurotransmitter transporter family or Na+/Cl--dependent
            transporter family) include neurotransmitter transporters
            (NTTs): these are sodium- and chloride-dependent plasma
            membrane transporters for the monoamine neurotransmitters
            serotonin (5-hydroxytryptamine), dopamine, and
            norepinephrine, and the amino acid neurotransmitters GABA
            and glycine. These NTTs are widely expressed in the
            mammalian brain, involved in regulating neurotransmitter
            signaling and homeostasis, and the target of a range of
            therapeutic drugs for the treatment of psychiatric
            diseases. Bacterial members of the SLC6 family include
            the LeuT amino acid transporter. An arrangement of 12
            transmembrane (TM) helices appears to be as a common
            topological motif for eukaryotic and some prokaryotic and
            archaeal NTTs. However, this subfamily which contains the
            majority of bacterial members and some archaeal members,
            appears to contain only 11 TMs; for example the
            functional Fusobacterium nucleatum tyrosine transporter
            Tyt1.
          Length = 439

 Score =  181 bits (462), Expect = 3e-49
 Identities = 107/341 (31%), Positives = 162/341 (47%), Gaps = 46/341 (13%)

Query: 878  AFLIPFMVMLILEGIPLFLIELGIGQKMR---LGSLGVWNTIHPWLGGVGISSCLVTFFV 934
            AFL+ +++ ++L G+PL + E  IG++ R   +G+             VG    L  F +
Sbjct: 37   AFLLVYLLFVLLIGLPLLIAEFAIGRRTRKNPVGAFAKLAKGGKKWKLVGWLGVLGAFLI 96

Query: 935  ALYYNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPN 994
              +Y+V+  W   YLF S + S           G      + +   A F+          
Sbjct: 97   LSFYSVVGGWVLAYLFKSLTGSF---------AGA-----DAAEVGAAFFG--------F 134

Query: 995  IESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFP--YIVLTIFFIRGITL 1052
            I SP SP  W +L LLL  +IV   V KGI+ + ++     + P  +++L I  IR +TL
Sbjct: 135  IASPLSPLLWHLLFLLLTALIVARGVKKGIERASKI-----LMPLLFVLLIILAIRSLTL 189

Query: 1053 PGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVI 1112
            PGAG GL  ++ P   K L P V L A  Q F+S  L  G +I +GSY   + N  +  +
Sbjct: 190  PGAGEGLKFLFKPDFSK-LTPEVVLAALGQAFFSLSLGMGVMITYGSYLPKDENLPKSAL 248

Query: 1113 MVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGA 1172
            +V+I + L A+ A +V+F  + F      A G GL FI      L   VF Q    +PG 
Sbjct: 249  IVAILDTLVALLAGLVIFPAV-FAFGLEPAAGPGLLFIT-----LP-AVFAQ----MPGG 297

Query: 1173 PFWSIIFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRK 1213
              + I+FF++LL   L S I +LE  +  +  I+ F   RK
Sbjct: 298  RLFGILFFLLLLFAALTSAISLLEVPVAYL--IDEFGWSRK 336



 Score = 47.0 bits (113), Expect = 6e-05
 Identities = 23/93 (24%), Positives = 41/93 (44%), Gaps = 1/93 (1%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND 829
           L+G+   +SF V L   T  G+    L D     I L L  L+  + V ++ G E    +
Sbjct: 348 LLGIPSALSFGV-LSDPTIFGKTIFDLLDFLVSNILLPLGGLLIAIFVGWVLGKEALREE 406

Query: 830 IYEMTGYKPGWYWQITWRFLGPAIMSCILVSSI 862
           + +  G K    W+   R++ P ++  I +  +
Sbjct: 407 LNKGRGKKLFKIWRFLLRYVAPVLILIIFLLGL 439


>gnl|CDD|214619 smart00325, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases.  Guanine nucleotide exchange
           factor for Rho/Rac/Cdc42-like GTPases Also called
           Dbl-homologous (DH) domain. It appears that PH domains
           invariably occur C-terminal to RhoGEF/DH domains.
           Improved coverage.
          Length = 180

 Score =  162 bits (411), Expect = 2e-45
 Identities = 70/177 (39%), Positives = 95/177 (53%), Gaps = 14/177 (7%)

Query: 597 VIIELIETERTYVKNLNGLLENYLEPLKKE-TFLSGAEINALFGNILEIVAFQRQFLQNL 655
           V+ EL++TER YV++L  L+E +L+PLKKE   LS  E+  LFGNI EI  F R FL  L
Sbjct: 1   VLKELLQTERNYVRDLKLLVEVFLKPLKKELKLLSPNELETLFGNIEEIYEFHRDFLDEL 60

Query: 656 VEALENEADFHHFDQSSQFKNILFSIGSAFLYYVNHFKLYSSFCASHSKAQKVLHPNEGN 715
            E +E   D                IG  FL     FK+YS +C++H  A ++L   + N
Sbjct: 61  EERIEEWDDSVE------------RIGDVFLKLEEFFKIYSEYCSNHPDALELLKKLKKN 108

Query: 716 QALQEFLSSRNPRQQHSS-TLESYLIKPIQRILKYPLLLQQLCNLTDPHCDEHLHLV 771
              Q+FL       Q    TLES L+KP+QR+ KYPLLL++L   T    ++   L 
Sbjct: 109 PRFQKFLKEIESSPQCRRLTLESLLLKPVQRLTKYPLLLKELLKHTPEDHEDREDLK 165


>gnl|CDD|238091 cd00160, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous
           (DH) domain. It appears that PH domains invariably occur
           C-terminal to RhoGEF/DH domains.
          Length = 181

 Score =  159 bits (403), Expect = 2e-44
 Identities = 68/179 (37%), Positives = 96/179 (53%), Gaps = 15/179 (8%)

Query: 594 LSKVIIELIETERTYVKNLNGLLENYLEPLKKE-TFLSGAEINALFGNILEIVAFQRQFL 652
             +VI EL++TER YV++L  L+E +L+PL KE   LS  E+  LFGNI EI  F R FL
Sbjct: 1   RQEVIKELLQTERNYVRDLKLLVEVFLKPLDKELLPLSPEEVELLFGNIEEIYEFHRIFL 60

Query: 653 QNLVEALENEADFHHFDQSSQFKNILFSIGSAFLYYVNHFKLYSSFCASHSKAQKVL-HP 711
           ++L E +E                    IG  FL     FK+YS +C++H  A ++L   
Sbjct: 61  KSLEERVEEWDKSG------------PRIGDVFLKLAPFFKIYSEYCSNHPDALELLKKL 108

Query: 712 NEGNQALQEFLSSRNPRQQHSSTLESYLIKPIQRILKYPLLLQQLCNLTDPHCDEHLHL 770
            + N+  QEFL      +     LES L+KP+QR+ KYPLLL++L   T    ++   L
Sbjct: 109 KKFNKFFQEFLEKAE-SECGRLKLESLLLKPVQRLTKYPLLLKELLKHTPDGHEDREDL 166


>gnl|CDD|216028 pfam00621, RhoGEF, RhoGEF domain.  Guanine nucleotide exchange
           factor for Rho/Rac/Cdc42-like GTPases Also called
           Dbl-homologous (DH) domain. It appears that pfam00169
           domains invariably occur C-terminal to RhoGEF/DH
           domains.
          Length = 179

 Score =  151 bits (384), Expect = 8e-42
 Identities = 61/168 (36%), Positives = 93/168 (55%), Gaps = 15/168 (8%)

Query: 597 VIIELIETERTYVKNLNGLLENYLEPLKKETFLSGAEINALFGNILEIVAFQRQFLQNLV 656
           VI EL++TER+YV++L  L+E +L+PL++   LS  EI  +F NI EI+   ++FL    
Sbjct: 1   VIQELLQTERSYVRDLKILVEVFLKPLRESPILSEEEIKTIFSNIEEILELHQEFL---- 56

Query: 657 EALENEADFHHFDQSSQFKNILFSIGSAFLYYVNHFKLYSSFCASHSKAQKVLHPN-EGN 715
           E LE   +     Q          IG  FL +   FK+YS++C+++ +A ++L    + N
Sbjct: 57  EELEERLEEWPDIQ---------RIGDIFLKFAPFFKVYSTYCSNYPRALELLKKLRKKN 107

Query: 716 QALQEFL-SSRNPRQQHSSTLESYLIKPIQRILKYPLLLQQLCNLTDP 762
               +FL             L S+LIKP+QRI +YPLLL++L   T P
Sbjct: 108 PRFAKFLKECEASPLCRGLDLNSFLIKPVQRIPRYPLLLKELLKHTPP 155


>gnl|CDD|212041 cd10333, LeuT-like_sbd, Aquifex aeolicus LeuT and related proteins;
            solute binding domain.  LeuT is a bacterial amino acid
            transporter with specificity for the hydrophobic amino
            acids glycine, alanine, methionine, and leucine. This
            subgroup belongs to the solute carrier 6 (SLC6)
            transporter family; LeuT has been used as a structural
            template for understanding fundamental aspects of SLC6
            function. It has an arrangement of 12 transmembrane
            helices (TMs), which appears to be a common motif for
            eukaryotic and some prokaryotic and archaeal SLC6s: an
            inverted topology repeat: TMs1-5 and TMs6-10, and
            TMs11-12.
          Length = 500

 Score =  137 bits (348), Expect = 6e-34
 Identities = 106/380 (27%), Positives = 161/380 (42%), Gaps = 80/380 (21%)

Query: 866  FLKKPSYSAWNR--AFLIPFMVMLILEGIPLFLIELGIGQKMRLGS-----------LGV 912
            FL+ P  +A N   AF+IP+ +  +L GIPL  +E  +G   R G            L  
Sbjct: 24   FLRFPVQAAQNGGGAFMIPYFIAFLLVGIPLMWVEWAMG---RYGGKHGHGTTPMIFLLA 80

Query: 913  WNTIHPW--LGGVGISSCLVTFFVALYYNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTI 970
               I     LG +G +   +   V  YY  I +W   Y F S +   P            
Sbjct: 81   KRNI-FAKILGAIGFA---IPLLVNSYYVYIESWTLGYAFKSLTGGYPEPI--------- 127

Query: 971  IPECELSSETAYFWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRV 1030
                      A F+Y          +       + ++ L L + I++  V KGI++  ++
Sbjct: 128  ------LRPFAEFFYSYI-----GAKPSSFAYIFFLITLALNFFILYRGVSKGIEAFAKI 176

Query: 1031 VYFTSMFPYIVL--TIFFIRGITLP-------GAGAGLVHMYTPKVEKLLDPNVWLDAAT 1081
                   P + +   I  IR +TL         A  GL  ++ P   KL DP VWL AA 
Sbjct: 177  -----AMPLLYVFAIILAIRVLTLGTPVNPDWTALKGLNFLWNPDFSKLKDPKVWLAAAG 231

Query: 1082 QVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFA--ILGFKAMS 1139
            Q+F++  L  G +I + SY + ++    D+++    ++LTA  AS+  FA  ILG   + 
Sbjct: 232  QIFFTLSLGMGIIINYASYLRKDD----DIVL----SSLTA--ASLNEFAEVILGGSIVI 281

Query: 1140 NAAEG-TGLAFIAAEGT-GLAFI----VFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIG 1193
             AA    G   +A  GT GL FI    VF+Q    +PG  F+  ++F +L   G+ S I 
Sbjct: 282  PAAFAFLGAVAVAKAGTFGLGFIALPNVFSQ----MPGGQFFGFLWFFLLFFAGITSSIA 337

Query: 1194 ILEGVLCTIFDIEIFKRIRK 1213
            +L+      F  E  K  RK
Sbjct: 338  MLQ--PLIAFLEEELKWSRK 355



 Score = 48.4 bits (116), Expect = 2e-05
 Identities = 20/94 (21%), Positives = 40/94 (42%), Gaps = 8/94 (8%)

Query: 778 SFLVGLIFTTGAGEYW--------LKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND 829
           +  +  +                 L   D + GTI LV++ LIE++   +I+G +K   +
Sbjct: 358 ALWIFALVIIVGLPVAFEFIDLKALDELDFWVGTILLVVLGLIEVIVFAWIFGAKKGWEE 417

Query: 830 IYEMTGYKPGWYWQITWRFLGPAIMSCILVSSII 863
           I      K   +++   +++ PA +  +LV   I
Sbjct: 418 INRGALIKVPRFFKYVIKYITPAFLIVVLVFFTI 451


>gnl|CDD|176413 cd01818, TIAM1_RBD, Ubiquitin domain of Tiam1 guanine nucleotide
           exchange factor.  Tiam1 (T lymphoma invasion and
           metastasis 1) a guanine nucleotide exchange factor that
           activates Rac, is an important regulator of Rho GTPase
           functions in tumor cells including regulation of cell
           shape and invasiveness in epithelial cells and
           fibroblasts. TIAM1 has an RBD (Ras-binding domain)
           similar to that of Raf kinase as well as PH (pleckstrin
           homology), PDZ, and RhoGEF domains.
          Length = 77

 Score =  110 bits (277), Expect = 6e-29
 Identities = 35/77 (45%), Positives = 50/77 (64%), Gaps = 2/77 (2%)

Query: 278 KVSLPENQYAHVYLKDSMTVEEFLANACSRKNLNPMEHFVRVKKRREMEDHNYFVPPRGD 337
            V LP+NQ    YL+  M+VE+FL +AC RK L+PMEH++R+K  R ME+H YF  P   
Sbjct: 3   WVCLPDNQPVLTYLRPGMSVEDFLESACKRKQLDPMEHYLRLKFLR-MENHEYFYIPN-P 60

Query: 338 LIETYLSTHEVVEVCAK 354
           L + Y   ++ +E+C K
Sbjct: 61  LEDIYDLLYKEIEICPK 77


>gnl|CDD|176358 cd01760, RBD, Ubiquitin-like domain of RBD-like S/T kinases.  The
           ras-binding domain (RBD) of the serine/threonine kinase
           raf is structurally quite similar to the beta-grasp fold
           of ubiquitin. A raf-like RBD is also present in RGS12
           and other members of a family of GTPase activating
           proteins and TIAM1, a guanine nucleotide exchange
           protein.
          Length = 72

 Score = 72.6 bits (179), Expect = 1e-15
 Identities = 20/78 (25%), Positives = 31/78 (39%), Gaps = 7/78 (8%)

Query: 277 VKVSLPENQYAHVYLKDSMTVEEFLANACSRKNLNPMEHFVRVKKRREMEDHNYFVPPRG 336
            +V LP  Q   V ++  M+V + LA AC ++ LNP    V +    E         P  
Sbjct: 2   CRVYLPNGQRTVVPVRPGMSVRDVLAKACKKRGLNPECCDVFLLGLDE-------KKPLD 54

Query: 337 DLIETYLSTHEVVEVCAK 354
              ++     E +EV   
Sbjct: 55  LDTDSSSLAGEELEVEPL 72


>gnl|CDD|215766 pfam00169, PH, PH domain.  PH stands for pleckstrin homology.
          Length = 101

 Score = 73.3 bits (180), Expect = 2e-15
 Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 10/104 (9%)

Query: 7   WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPI 66
           WLL+K        ++ WK  +  L    LL+Y    +      + PK  I + G  +  +
Sbjct: 6   WLLKKGS----GGRKSWKKRYFVLFDGVLLYYKDSKKS----SSRPKGSIPLSGCQVTKV 57

Query: 67  PEHP--KRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSACA 108
           P+    KR   F + T   + +L QA  + E + WV +I SA  
Sbjct: 58  PDSEDGKRKNCFEIRTGDRETFLLQAESEEERKEWVKAIRSAIR 101



 Score = 33.6 bits (77), Expect = 0.17
 Identities = 18/115 (15%), Positives = 35/115 (30%), Gaps = 25/115 (21%)

Query: 1283 KIKKGLELHAMCFVFKSAVVFLCKERLRQKKKLIGVSNKGCSNEVEIIRYQVLVPVTEVQ 1342
                        FV    V+   K+  +   +  G                  +P++  Q
Sbjct: 11   GSGGRKSWKKRYFVLFDGVLLYYKDSKKSSSRPKGS-----------------IPLSGCQ 53

Query: 1343 VR-ASSAKDMDSHFLWELIHLRSQIQRRTEKVYVLSNSTAEFRNAFLKTIRQIIR 1396
            V     ++D      +E+            + ++L   + E R  ++K IR  IR
Sbjct: 54   VTKVPDSEDGKRKNCFEIRT-------GDRETFLLQAESEEERKEWVKAIRSAIR 101


>gnl|CDD|202147 pfam02196, RBD, Raf-like Ras-binding domain. 
          Length = 71

 Score = 71.2 bits (175), Expect = 3e-15
 Identities = 24/71 (33%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 275 KSVKVSLPENQYAHVYLKDSMTVEEFLANACSRKNLNPMEHFVR---VKKRREMEDHNYF 331
           K+ +V LP NQ + V ++  MTV + LA AC R+ LNP    VR    KK  +++     
Sbjct: 1   KTCRVHLPNNQRSVVEVRPGMTVRDALAKACKRRGLNPSACAVRLSGEKKPLDLDTDISS 60

Query: 332 VPPRGDLIETY 342
           +P     +E  
Sbjct: 61  LPGEELRVELL 71


>gnl|CDD|214574 smart00233, PH, Pleckstrin homology domain.  Domain commonly found
           in eukaryotic signalling proteins. The domain family
           possesses multiple functions including the abilities to
           bind inositol phosphates, and various proteins. PH
           domains have been found to possess inserted domains
           (such as in PLC gamma, syntrophins) and to be inserted
           within other domains. Mutations in Brutons tyrosine
           kinase (Btk) within its PH domain cause X-linked
           agammaglobulinaemia (XLA) in patients. Point mutations
           cluster into the positively charged end of the molecule
           around the predicted binding site for
           phosphatidylinositol lipids.
          Length = 102

 Score = 68.7 bits (168), Expect = 8e-14
 Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 9/104 (8%)

Query: 7   WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPI 66
           WL +K        K+ WK  +  L  +TLL+Y     +       PK  I + G  ++  
Sbjct: 6   WLYKKSGG----GKKSWKKRYFVLFNSTLLYYKSKKDKK---SYKPKGSIDLSGCTVREA 58

Query: 67  PE--HPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSACA 108
           P+    K+ + F + T+     L QA  + E E WV ++  A A
Sbjct: 59  PDPDSSKKPHCFEIKTSDRKTLLLQAESEEEREKWVEALRKAIA 102


>gnl|CDD|128731 smart00455, RBD, Raf-like Ras-binding domain. 
          Length = 70

 Score = 65.4 bits (160), Expect = 4e-13
 Identities = 20/69 (28%), Positives = 32/69 (46%), Gaps = 3/69 (4%)

Query: 277 VKVSLPENQYAHVYLKDSMTVEEFLANACSRKNLNPMEHFVR---VKKRREMEDHNYFVP 333
            KV LP+NQ   V ++   TV + LA A  ++ LNP    VR    KK  ++      + 
Sbjct: 2   CKVHLPDNQRTVVKVRPGKTVRDALAKALKKRGLNPECCVVRLRGEKKPLDLNQPISSLD 61

Query: 334 PRGDLIETY 342
            +  ++E  
Sbjct: 62  GQELVVEEL 70


>gnl|CDD|241449 cd13295, PH_EFA6, Exchange Factor for ARF6 Pleckstrin homology (PH)
           domain.  EFA6 (also called PSD/pleckstrin and Sec7
           domain containing) is an guanine nucleotide exchange
           factor for ADP ribosylation factor 6 (ARF6), which is
           involved in membrane recycling. EFA6 has four
           structurally related polypeptides: EFA6A, EFA6B, EFA6C
           and EFA6D. It consists of a N-terminal proline rich
           region (PR), a SEC7 domain, a PH domain, a PR, a
           coiled-coil region, and a C-terminal PR. The EFA6 PH
           domain regulates its association with the plasma
           membrane. EFA6 activates Arf6 through its Sec7 catalytic
           domain and modulates this activity through its
           C-terminal domain, which rearranges the actin
           cytoskeleton in fibroblastic cell lines. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 126

 Score = 65.8 bits (161), Expect = 2e-12
 Identities = 30/111 (27%), Positives = 58/111 (52%), Gaps = 6/111 (5%)

Query: 5   KKWLLRKKHQ-IELAR----KRGWKGYWVCLKGTTLLFYPCDSR-EGRSVDAAPKHLIIV 58
           K +L+RK H   +  +    KRGWK ++  LKG  L  +  +   + +    + ++ I V
Sbjct: 9   KGYLMRKCHADPDGKKTPRGKRGWKMFYATLKGLVLYLHKDEYGCKKQLSYESLRNAISV 68

Query: 59  DGAIMQPIPEHPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSACAA 109
             ++ +   ++ K+ ++F L TA    +LFQA    E+++W+ +I+   AA
Sbjct: 69  HHSLAEKATDYTKKPHVFRLQTADWREFLFQASDTEEMQSWIEAINLVAAA 119


>gnl|CDD|241307 cd10571, PH_beta_spectrin, Beta-spectrin pleckstrin homology (PH)
           domain.  Beta spectrin binds actin and functions as a
           major component of the cytoskeleton underlying cellular
           membranes. Beta spectrin consists of multiple spectrin
           repeats followed by a PH domain, which binds to
           inositol-1,4,5-trisphosphate. The PH domain of
           beta-spectrin is thought to play a role in the
           association of spectrin with the plasma membrane of
           cells. PH domains have diverse functions, but in general
           are involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 105

 Score = 60.7 bits (148), Expect = 6e-11
 Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 6/103 (5%)

Query: 8   LLRKKHQIELARKRG----WKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIM 63
           L RK H++E   K+     WK  +  LKG  L FY  D +   S   A +  + + GA+ 
Sbjct: 5   LERK-HELESGGKKASNRSWKKVYCVLKGQELSFYK-DQKAAASESYAGEPPLNLSGAVC 62

Query: 64  QPIPEHPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSA 106
           +   ++ K+ ++F L  + G  YLFQA  + E+ +WV  + +A
Sbjct: 63  EVASDYKKKKHVFRLRLSDGSEYLFQAKDEEEMNDWVQKLQAA 105


>gnl|CDD|241455 cd13301, PH1_Pleckstrin_2, Pleckstrin 2 Pleckstrin homology (PH)
           domain, repeat 1.  Pleckstrin is a protein found in
           platelets. This name is derived from platelet and
           leukocyte C kinase substrate and the KSTR string of
           amino acids. Pleckstrin 2 contains two PH domains and a
           DEP (dishvelled, egl-10, and pleckstrin) domain. Unlike
           pleckstrin 1, pleckstrin 2 does not contain obvious
           sites of PKC phosphorylation. Pleckstrin 2 plays a role
           in actin rearrangement, large lamellipodia and
           peripheral ruffle formation, and may help orchestrate
           cytoskeletal arrangement. The PH domains of pleckstrin 2
           are thought to contribute to lamellipodia formation.
           This cd contains the first PH domain repeat. PH domains
           have diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 108

 Score = 54.7 bits (132), Expect = 6e-09
 Identities = 29/93 (31%), Positives = 45/93 (48%), Gaps = 12/93 (12%)

Query: 20  KRG-----WKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGA-IMQPIPEHPKRD 73
           K+G     WK  W  L    L +Y     + ++ D++PK +I++ G  I  P  E+ KR 
Sbjct: 11  KKGHVVNNWKARWFVLLEDKLEYY-----KKKT-DSSPKGMILLKGCTITSPCLEYEKRP 64

Query: 74  YIFCLSTAFGDAYLFQAPCQVELENWVNSIHSA 106
            +F L+TA G  +  QA  + E + W   I  A
Sbjct: 65  LVFKLTTAKGQDHFLQACSREERDAWAKDITKA 97


>gnl|CDD|241284 cd01253, PH_ARHGAP21-like, ARHGAP21 and related proteins pleckstrin
           homology (PH) domain.  ARHGAP family genes encode
           Rho/Rac/Cdc42-like GTPase activating proteins with a
           RhoGAP domain. These proteins functions as a
           GTPase-activating protein (GAP) for RHOA and CDC42.
           ARHGAP21 controls the Arp2/3 complex and F-actin
           dynamics at the Golgi complex by regulating the activity
           of the small GTPase Cdc42. It is recruited to the Golgi
           by to GTPase, ARF1, through its PH domain and its
           helical motif. It is also required for CTNNA1
           recruitment to adherens junctions. ARHGAP21 and it
           related proteins all contains a PH domain and a RhoGAP
           domain. Some of the members have additional N-terminal
           domains including PDZ, SH3, and SPEC. The ARHGAP21 PH
           domain interacts with the GTPbound forms of both ARF1
           and ARF6 ARF-binding domain/ArfBD. The members here
           include: ARHGAP15, ARHGAP21, and ARHGAP23. PH domains
           have diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 113

 Score = 54.2 bits (131), Expect = 1e-08
 Identities = 34/110 (30%), Positives = 53/110 (48%), Gaps = 9/110 (8%)

Query: 7   WLLRKKHQIELARK---RGWKGYWVCLKGTTLLFYPCDSREG-RSVDAAPK--HLIIVDG 60
           WL  K+  +E  ++   R WK  W  L+G +L  Y  D RE   ++ AA      I +  
Sbjct: 5   WLHFKQSVLEKGKRASDRSWKQVWAVLRGHSLYLYK-DKRETSPALSAAEDSEQRIDIRS 63

Query: 61  AIMQPIPE-HPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSACAA 109
            I+  I   + KR  +F L+T+ G  YLFQA  + ++  W+ +I     A
Sbjct: 64  CIVD-IAYSYTKRKNVFRLTTSDGSEYLFQAEDRDDMLGWIKAIQENSNA 112


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 59.1 bits (143), Expect = 2e-08
 Identities = 64/230 (27%), Positives = 100/230 (43%), Gaps = 42/230 (18%)

Query: 584 PSRQLTDAEKLSKVIIELIETERTYVKNLNGLLENYLEPL---------KKETFLSGAEI 634
           P +++    K  + I E+I TER +VK+L  L + +++PL          +  F     I
Sbjct: 479 PKQEI----KRQEAIYEVIYTERDFVKDLEYLRDTWIKPLEESNIIPENARRNF-----I 529

Query: 635 NALFGNILEIVAFQRQFLQNLVEALENEADFHHFDQSSQFKNILFSIGSAFLYYVNHFKL 694
             +F NI EI A   + L    +AL N        Q      I+  I   FL YV  F+ 
Sbjct: 530 KHVFANINEIYAVNSKLL----KALTNR-------QCLS--PIVNGIADIFLDYVPKFEP 576

Query: 695 YSSFCASH------SKAQKVLHPNEGNQALQEFLSSRNPRQQHSSTLESYLIKPIQRILK 748
           +  + AS        + +K ++PN       E       R+     L+ YL KP  R+ +
Sbjct: 577 FIKYGASQPYAKYEFEREKSVNPNFAR-FDHEVERLDESRKLE---LDGYLTKPTTRLAR 632

Query: 749 YPLLLQQLCNLTDPHCDEHLHLVGVVCLI-SFLVGLIFTTGAGEYWLKLF 797
           YPLLL+++   TDP   +   +  V+ ++  FL  L F +G  E    LF
Sbjct: 633 YPLLLEEVLKFTDPDNPDTEDIPKVIDMLREFLSRLNFESGKAENRGDLF 682


>gnl|CDD|223605 COG0531, PotE, Amino acid transporters [Amino acid transport and
            metabolism].
          Length = 466

 Score = 56.7 bits (137), Expect = 6e-08
 Identities = 36/202 (17%), Positives = 77/202 (38%), Gaps = 19/202 (9%)

Query: 991  VSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGI 1050
                      P   I++ L L  ++   + ++GI++S ++    ++   I+L IF I G+
Sbjct: 121  FPGPGLLSIGPLLIILIALALIALL-TLLNLRGIKASAKINSIITILKIIILLIFIILGL 179

Query: 1051 TLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQ-------P 1103
               G   G +             N    +   +  +  LAF +   F +          P
Sbjct: 180  FAFGFSNGNL---------FAPFNPGGGSFGGILAAILLAFFAFTGFEAIATLAEEVKNP 230

Query: 1104 NNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFT 1163
                 R +I+  +   +  I  ++V+  +L    ++ +A    LA  A  G G    +  
Sbjct: 231  KRTIPRAIILSLLIVLILYILGALVIVGVLPAGDLAASAPSAPLALAALFGGGNWGAIII 290

Query: 1164 QAIVELPGAPFWSIIFFMMLLS 1185
              +  L  + F S++ +++ +S
Sbjct: 291  AILALL--SLFGSLLAWILAVS 310



 Score = 30.5 bits (69), Expect = 8.2
 Identities = 26/180 (14%), Positives = 60/180 (33%), Gaps = 30/180 (16%)

Query: 999  GSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVY-------FTSMF---------PYIVL 1042
            G   W  ++  +LA + +F  ++  I +  RV+Y           F         P I L
Sbjct: 281  GGGNWGAIIIAILALLSLFGSLLAWILAVSRVLYAMARDGLLPKFFAKVNPKGRTPVIAL 340

Query: 1043 TIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQ 1102
             +  I  + L      L  + +     L+       A+     ++ L   +L+      +
Sbjct: 341  ILTGIISLIL----LLLFPLSSIAFNALV-----SLASVAFLIAYLLVALALLVL-RRKK 390

Query: 1103 PNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVF 1162
            P+      + +  +   L  +   ++++A+                 + A G  +  +V+
Sbjct: 391  PDLKRPFRLPLAPLIPILGIVAVLLLLYALYASGLPPL----LLGVILIAGGIIIYLLVY 446


>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
           signaling molecules, often in tandem arrangements. May
           be responsible for specific protein-protein
           interactions, as most PDZ domains bind C-terminal
           polypeptides, and binding to internal (non-C-terminal)
           polypeptides and even to lipids has been demonstrated.
           In this subfamily of PDZ domains an N-terminal
           beta-strand forms the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in proteases.
          Length = 82

 Score = 48.7 bits (117), Expect = 4e-07
 Identities = 21/79 (26%), Positives = 31/79 (39%), Gaps = 7/79 (8%)

Query: 356 LYQVELQRTTLDHMWGFSVEAELTENAERQDELCCYVSRVEDKSVAMHNGLIKGDEIMVI 415
           +  V L++       GFS+         +      +VSRVE    A   GL  GD I+ +
Sbjct: 1   VRTVTLRKDPGGG-LGFSLRGG------KDSGGGIFVSRVEPGGPAERGGLRVGDRILEV 53

Query: 416 NGAIVSDLDMMYLESVLQE 434
           NG  V  L       +L+ 
Sbjct: 54  NGVSVEGLTHEEAVELLKN 72


>gnl|CDD|212031 cd06857, SLC5-6-like_sbd, Solute carrier families 5 and 6-like;
            solute binding domain.  This superfamily includes the
            solute-binding domain of SLC5 proteins (also called the
            sodium/glucose cotransporters or solute sodium
            symporters), SLC6 proteins (also called the sodium- and
            chloride-dependent neurotransmitter transporters or
            Na+/Cl--dependent transporters), and
            nucleobase-cation-symport-1 (NCS1) transporters. SLC5s
            co-transport Na+ with sugars, amino acids, inorganic ions
            or vitamins. SLC6s include Na+/Cl--dependent plasma
            membrane transporters for the monoamine neurotransmitters
            serotonin, dopamine, and norepinephrine, and the amino
            acid neurotransmitters GABA and glycine. NCS1s are
            essential components of salvage pathways for nucleobases
            and related metabolites; their known substrates include
            allantoin, uracil, thiamine, and nicotinamide riboside.
            Members of this superfamily are important in human
            physiology and disease. They contain a functional core of
            10 transmembrane helices (TMs): an inverted structural
            repeat, TMs1-5 and TMs6-10; TMs numbered to conform to
            the SLC6 Aquifex aeolicus LeuT.
          Length = 413

 Score = 53.4 bits (128), Expect = 6e-07
 Identities = 29/208 (13%), Positives = 68/208 (32%), Gaps = 6/208 (2%)

Query: 1004 WIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGIT-LPGAGAGLVHM 1062
             I++  L+A +   +  +  I  +  +     +   ++     +  I    G    +   
Sbjct: 117  SILILALIAVVYSIYGGLSAIVWTDVIAVPFLVLGGVMTVYMAVSFIGGFAGVSKMVDAA 176

Query: 1063 YTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTA 1122
                   L  P + +        +    +G +         + + V +     +  A   
Sbjct: 177  PGHFEMILNLPGIAVAIGGLWVANL-YYWGFIQYIIQRTLADKS-VSEAQKGIVFAAFLM 234

Query: 1123 IYASVVVFAILGFKAMSNAAEGTGLAFIAAEGT-GLAFIVFTQAIVELPGAPFWSIIFFM 1181
            + A ++V  I G  A    ++   +A +    +   A   +      LPG+     + F 
Sbjct: 235  LIAPILVGLI-GIAAYVITSDPQLMASLGNLPSAANADKAYPWLTQFLPGSVGVKGLVFA 293

Query: 1182 MLLSLGLGSQIGILEGVLCTIFDIEIFK 1209
             L +  + S   +L     TI  ++I+K
Sbjct: 294  ALAAAIVSSLAAMLNSTA-TILTMDIYK 320


>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2.  Also
           called DHR (Dlg homologous region) or GLGF (relatively
           well conserved tetrapeptide in these domains). Some PDZs
           have been shown to bind C-terminal polypeptides; others
           appear to bind internal (non-C-terminal) polypeptides.
           Different PDZs possess different binding specificities.
          Length = 85

 Score = 46.2 bits (110), Expect = 4e-06
 Identities = 21/80 (26%), Positives = 32/80 (40%), Gaps = 8/80 (10%)

Query: 355 ILYQVELQRTTLDHMWGFSVEAELTENAERQDELCCYVSRVEDKSVAMHNGLIKGDEIMV 414
               VEL++       GFS+         + +     VS V   S A   GL  GD I+ 
Sbjct: 1   EPRLVELEKGGGG--LGFSLVGG------KDEGGGVVVSSVVPGSPAAKAGLRVGDVILE 52

Query: 415 INGAIVSDLDMMYLESVLQE 434
           +NG  V  L  +    +L++
Sbjct: 53  VNGTSVEGLTHLEAVDLLKK 72


>gnl|CDD|241231 cd00821, PH, Pleckstrin homology (PH) domain.  PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 92

 Score = 45.8 bits (108), Expect = 7e-06
 Identities = 22/97 (22%), Positives = 36/97 (37%), Gaps = 9/97 (9%)

Query: 7   WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPI 66
           WL ++  +      + WK  W  L    LL+Y    +  +     PK LI +   +   +
Sbjct: 4   WLKKRGGKG----LKSWKKRWFVLFDDVLLYY----KSKKDSSKKPKGLIPLSDGLEVEL 55

Query: 67  PEHPKRDYIFCLSTAFGD-AYLFQAPCQVELENWVNS 102
                +   F L T      Y  QA  + E E W+ +
Sbjct: 56  VSSSGKPNCFELVTPDRGRTYYLQAESEEEREEWLEA 92


>gnl|CDD|238490 cd00990, PDZ_glycyl_aminopeptidase, PDZ domain associated with
           archaeal and bacterial M61 glycyl-aminopeptidases. May
           be responsible for substrate recognition and/or binding,
           as most PDZ domains bind C-terminal polypeptides, and
           binding to internal (non-C-terminal) polypeptides and
           even to lipids has been demonstrated. In this subfamily
           of protease-associated PDZ domains a C-terminal
           beta-strand is presumed to form the peptide-binding
           groove base, a circular permutation with respect to PDZ
           domains found in Eumetazoan signaling proteins.
          Length = 80

 Score = 43.6 bits (103), Expect = 3e-05
 Identities = 20/72 (27%), Positives = 30/72 (41%)

Query: 385 QDELCCYVSRVEDKSVAMHNGLIKGDEIMVINGAIVSDLDMMYLESVLQEELSLCMMMRS 444
           ++E    V+ V D S A   GL+ GDE++ +NG  V  L     E    + + L +    
Sbjct: 9   KEEGLGKVTFVRDDSPADKAGLVAGDELVAVNGWRVDALQDRLKEYQAGDPVELTVFRDD 68

Query: 445 SRTEPPETAGSG 456
              E P T    
Sbjct: 69  RLIEVPLTLADP 80


>gnl|CDD|241464 cd13310, PH_RalGPS1_2, Ral GEF with PH domain and SH3 binding motif
           1 and 2 Pleckstrin homology (PH) domain.  RalGPS1 (also
           called Ral GEF with PH domain and SH3 binding motif
           1;RALGEF2/ Ral guanine nucleotide exchange factor 2;
           RalA exchange factor RalGPS1; Ral guanine nucleotide
           exchange factor RalGPS1A2; ras-specific guanine
           nucleotide-releasing factor RalGPS1) and RalGPS2 (also
           called Ral GEF with PH domain and SH3 binding motif 2;
           Ral-A exchange factor RalGPS2; ras-specific guanine
           nucleotide-releasing factor RalGPS22). They activate
           small GTPase Ral proteins such as RalA and RalB by
           stimulating the exchange of Ral bound GDP to GTP,
           thereby regulating various downstream cellular
           processes. Structurally they contain an N-terminal
           Cdc25-like catalytic domain, followed by a PXXP motif
           and a C-terminal PH domain. The Cdc25-like catalytic
           domain interacts with Ral and its PH domain ensures the
           correct membrane localization. Its PXXP motif is thought
           to interact with the SH3 domain of Grb2. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 116

 Score = 40.7 bits (96), Expect = 6e-04
 Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 16/111 (14%)

Query: 9   LRKKHQIELARKRG-------WKGYWVCLKGTTLLFYPCDSREG--RS-VDAAPKHLIIV 58
           LR+K       K G       W+ YWV L GT+L++Y   S  G  RS   + P  ++ +
Sbjct: 6   LRRK----TVLKEGRKPTVSSWQRYWVQLWGTSLVYYAPKSLRGNERSDFKSEPCKIVSI 61

Query: 59  DGAIMQPIPEHPKRDYIFCLS-TAFGDAYLFQAPCQVELENWVNSIHSACA 108
            G ++   P+ P+    F L+    G+ Y F+A  +     W+  +  AC 
Sbjct: 62  SGWMVVL-PDDPEHPDSFQLTDPEKGNVYKFRAGSRSNALLWLKHLKDACK 111


>gnl|CDD|241397 cd13243, PH_PLEKHG1_G2_G3, Pleckstrin homology domain-containing
            family G members 1, 2, and 3 pleckstrin homology (PH)
            domain.  PLEKHG1 (also called ARHGEF41), PLEKHG2 (also
            called ARHGEF42 or CLG/common-site lymphoma/leukemia
            guanine nucleotide exchange factor2), and PLEKHG3 (also
            called ARHGEF43) have RhoGEF DH/double-homology domains
            in tandem with a PH domain which is involved in
            phospholipid binding. They function as a guanine
            nucleotide exchange factor (GEF) and are involved in the
            regulation of Rho protein signal transduction. Mutations
            in PLEKHG1 have been associated panic disorder (PD), an
            anxiety disorder characterized by panic attacks and
            anticipatory anxiety. PH domains have diverse functions,
            but in general are involved in targeting proteins to the
            appropriate cellular location or in the interaction with
            a binding partner. They share little sequence
            conservation, but all have a common fold, which is
            electrostatically polarized. Less than 10% of PH domains
            bind phosphoinositide phosphates (PIPs) with high
            affinity and specificity. PH domains are distinguished
            from other PIP-binding domains by their specific
            high-affinity binding to PIPs with two vicinal phosphate
            groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
            which results in targeting some PH domain proteins to the
            plasma membrane. A few display strong specificity in
            lipid binding. Any specificity is usually determined by
            loop regions or insertions in the N-terminus of the
            domain, which are not conserved across all PH domains. PH
            domains are found in cellular signaling proteins such as
            serine/threonine kinase, tyrosine kinases, regulators of
            G-proteins, endocytotic GTPases, adaptors, as well as
            cytoskeletal associated molecules and in lipid associated
            enzymes.
          Length = 147

 Score = 39.6 bits (93), Expect = 0.003
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 1214 PILTEALKGMEKVAEHINEMQRIHEE 1239
             ++ +A+  M +VA HIN+M+R HE 
Sbjct: 2    EVVEDAIDTMTQVAWHINDMKRKHEH 27


>gnl|CDD|222193 pfam13520, AA_permease_2, Amino acid permease. 
          Length = 425

 Score = 41.5 bits (98), Expect = 0.003
 Identities = 47/337 (13%), Positives = 98/337 (29%), Gaps = 73/337 (21%)

Query: 869  KPSYSAWNRAFLIPFMVMLILEGIPLFLI--ELGIGQKMRLGSLGVW--NTIHPWLGGVG 924
             P  ++   A ++   +  ++  + + L+  EL      R G + VW  N     +  + 
Sbjct: 22   TPLIASAGAAAILWGWIAALIFSLAVALVYAELS-SAFPRNGGIYVWLKNAFGKPVAFI- 79

Query: 925  ISSCLVTFFVALYYNVIITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFW 984
                               W  Y L  + SSS+                   +      +
Sbjct: 80   --------------AAWFNWLAYILGLASSSSV------------------AAQYLLSAF 107

Query: 985  YRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTI 1044
                    P++       + I + +L   II   I ++GI+ S ++     +   ++  I
Sbjct: 108  -------FPDLVGNTWLTYGIAIAIL---IIFALINIRGIKESAKIQNILGIVKLLLPLI 157

Query: 1045 FFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQ-- 1102
              I        G  L       +        +      VF    +   S   F S     
Sbjct: 158  LII------LLGLVLALGGGFNLLPNSWTTFFPSGWPGVFLGLLIVLWSFGGFESAANVS 211

Query: 1103 ---PNNNCVRDV-----IMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAE- 1153
                     RDV     I + I   +  +  +++   ++    ++  +    +A +  E 
Sbjct: 212  EEVKK----RDVPKALFIGLLIV-GVLYLLVNILFLGVVPDDEIAKLSNLPSVAALLFEA 266

Query: 1154 ---GTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLG 1187
                 G   +V   A+  L       +    +L +L 
Sbjct: 267  VGGPWGAIIVVILLALSLLGAVNTAIVASSRVLEALA 303


>gnl|CDD|241387 cd13233, PH_ARHGAP9-like, Beta-spectrin pleckstrin homology (PH)
           domain.  ARHGAP family genes encode Rho/Rac/Cdc42-like
           GTPase activating proteins with RhoGAP domain. The
           ARHGAP members here all have a PH domain upstream of
           their C-terminal RhoGAP domain. Some have additional
           N-terminal SH3 and WW domains. The members here include:
           ARHGAP9, ARHGAP12, ARHGAP15, and ARHGAP27. ARHGAP27 and
           ARHGAP12 shared the common-domain structure, consisting
           of SH3, WW, PH, and RhoGAP domains. The PH domain of
           ArhGAP9 employs a non-canonical phosphoinositide binding
           mechanism, a variation of the spectrin-
           Ins(4,5)P2-binding mode, that gives rise to a unique PI
           binding profile, namely a preference for both PI(4,5)P2
           and the PI 3-kinase products PI(3,4,5)P3 and PI(3,4)P2.
           This lipid binding mechanism is also employed by the PH
           domain of Tiam1 and Slm1. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 110

 Score = 38.4 bits (90), Expect = 0.004
 Identities = 28/94 (29%), Positives = 43/94 (45%), Gaps = 10/94 (10%)

Query: 19  RKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKH----LIIVD--GAIMQPIPEHPKR 72
            ++ W   WV L G+ LLFY    ++ +S  A+          VD  GA ++   E   R
Sbjct: 18  LRKNWSTSWVVLTGSHLLFY----KDQKSAAASGGPQSKPESSVDLRGASIEWAKEKSSR 73

Query: 73  DYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSA 106
             +F LST  G  YL Q+    E++ W ++I   
Sbjct: 74  KNVFQLSTVTGTEYLLQSDNDTEIQEWFDAIKRV 107


>gnl|CDD|241452 cd13298, PH1_PH_fungal, Fungal proteins Pleckstrin homology (PH)
           domain, repeat 1.  The functions of these fungal
           proteins are unknown, but they all contain 2 PH domains.
           This cd represents the first PH repeat. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 106

 Score = 38.0 bits (89), Expect = 0.004
 Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 14/101 (13%)

Query: 7   WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLI-IVDGAIMQP 65
           WLL++       + + WK  W  L+   L +Y  D +E        + +I + +   + P
Sbjct: 11  WLLKRSR-----KTKTWKKRWFVLRPCQLSYYK-DEKE-----YKLRRVINLSELTAVAP 59

Query: 66  IPEHPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSA 106
           + +  KR  +F + T     Y FQA  + +   WV +I   
Sbjct: 60  LKDK-KRKNVFAIYTP-SKNYHFQASSEKDANEWVEAIREE 98


>gnl|CDD|241450 cd13296, PH2_MyoX, Myosin X Pleckstrin homology (PH) domain, repeat
           2.  MyoX, a MyTH-FERM myosin, is a molecular motor that
           has crucial functions in the transport and/or tethering
           of integrins in the actin-based extensions known as
           filopodia, microtubule binding, and in netrin-mediated
           axon guidance. It functions as a dimer. MyoX walks on
           bundles of actin, rather than single filaments, unlike
           the other unconventional myosins. MyoX is present in
           organisms ranging from humans to choanoflagellates, but
           not in Drosophila and Caenorhabditis elegans.MyoX
           consists of a N-terminal motor/head region, a neck made
           of 3 IQ motifs, and a tail consisting of a coiled-coil
           domain, a PEST region, 3 PH domains, a myosin tail
           homology 4 (MyTH4), and a FERM domain at its very
           C-terminus. The first PH domain in the MyoX tail is a
           split-PH domain, interupted by the second PH domain such
           that PH 1a and PH 1b flanks PH 2. The third PH domain
           (PH 3) follows the PH 1b domain. This cd contains the
           second PH repeat. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 111

 Score = 37.8 bits (88), Expect = 0.006
 Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 8/104 (7%)

Query: 7   WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPI 66
           WL +K        ++ WK  W  L+ T L +Y  D  EG    A     I +  +  + +
Sbjct: 12  WLYKKGGGSSTLSRKNWKSRWFVLRDTVLKYYENDQ-EG----AKALGTIDI-RSAKEIV 65

Query: 67  PEHPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSACAAA 110
              PK +  F ++T     Y F A    +   W + +    +A 
Sbjct: 66  DNTPK-ENGFDITTP-SRTYHFVAESPEDASQWFSVLTRVHSAT 107


>gnl|CDD|223769 COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT)
            superfamily [Carbohydrate transport and metabolism /
            Amino acid transport and metabolism / General function
            prediction only].
          Length = 292

 Score = 39.5 bits (92), Expect = 0.011
 Identities = 36/236 (15%), Positives = 81/236 (34%), Gaps = 35/236 (14%)

Query: 1001 PKWWIVLCLLLAWIIVFFIVMKGIQ----SSGRVVYFTSMFPYIVLTIFFIRG-----IT 1051
            P   ++L  LL   + F ++   ++    S   ++         +L +  + G     + 
Sbjct: 68   PWLLLLLLALLGLALPFLLLFLALKYTSASVASLIIGLLPLFTALLAVLLLLGERLSLLQ 127

Query: 1052 LPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYS---FGLAFGSLIAFGSYNQPNNNCV 1108
            + G    L  +            + L        S     LA  + + +  Y        
Sbjct: 128  ILGILLALAGVLL----------ILLGGGGGGILSLLGLLLALAAALLWALYTALVKRLS 177

Query: 1109 R-DVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIV 1167
            R   + +++   L      +++F + GF A   +     L ++    TGLA++++  A+ 
Sbjct: 178  RLGPVTLALLLQLLLALLLLLLFFLSGFGAPILSRAWLLLLYLGVFSTGLAYLLWYYALR 237

Query: 1168 ELPGA---------PFWSIIFFMMLL--SLGLGSQIGILEGVLCTIFDIEIFKRIR 1212
             L  +         P ++ +  ++LL   L     +G    V+  +    +  R R
Sbjct: 238  LLGASLVALLSLLEPVFAALLGVLLLGEPLSPAQLLGAA-LVVLGVLLASLRARRR 292


>gnl|CDD|241529 cd13378, PH_RhoGAP2, Rho GTPase activating protein 2 Pleckstrin
           homology (PH) domain.  RhoGAP2 (also called RhoGap22 or
           ArhGap22) are involved in cell polarity, cell morphology
           and cytoskeletal organization. They activate a GTPase
           belonging to the RAS superfamily of small GTP-binding
           proteins. The encoded protein is insulin-responsive, is
           dependent on the kinase Akt, and requires the
           Akt-dependent 14-3-3 binding protein which binds
           sequentially to two serine residues resulting in
           regulation of cell motility. Members here contain an
           N-terminal PH domain followed by a RhoGAP domain and
           either a BAR or TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) domain. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 116

 Score = 36.8 bits (85), Expect = 0.014
 Identities = 32/120 (26%), Positives = 50/120 (41%), Gaps = 22/120 (18%)

Query: 4   VKKWLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAI- 62
           +K   L+K+  I     + W+  W  L+G  L +Y  D  E +     P+  I + G+  
Sbjct: 4   LKAGWLKKQRSI----MKNWQQRWFVLRGDQLFYYK-DEEETK-----PQGCISLQGSQV 53

Query: 63  --MQPIPEHPKRDYIFCLSTAFGD---------AYLFQAPCQVELENWVNSIHSACAAAF 111
             + P PE P +     L    GD         A+L  A  Q ++E+WV +I     A F
Sbjct: 54  NELPPNPEEPGKHLFEILPGGAGDREKVPMNHEAFLLMANSQSDMEDWVKAIRRVIWAPF 113


>gnl|CDD|221739 pfam12730, ABC2_membrane_4, ABC-2 family transporter protein.  This
            family is related to the ABC-2 membrane transporter
            family pfam01061.
          Length = 230

 Score = 38.5 bits (90), Expect = 0.015
 Identities = 42/249 (16%), Positives = 79/249 (31%), Gaps = 44/249 (17%)

Query: 793  WLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEMTGYKPGWYWQITWRFLGPA 852
             LKL  +    I L+L  L+ +LA+  ++G    T             +         P 
Sbjct: 4    LLKLKRTKIFLILLILPLLLALLAL-LLFGVNGDTGLSPGSLLLSSFSFLSA---LFLP- 58

Query: 853  IMSCILVSSIICR----------FLKKPSYSAWNRAFLIPFMVMLILEGIPLFLIELGIG 902
            ++  IL S ++ R               S      A L+  +++ +L  + L L+ L  G
Sbjct: 59   LLIAILASLLVSREFKNGTIKLLLSLPISRGKIFLAKLLVLLILSLLASLLLLLLSLIAG 118

Query: 903  QKMRLGSLGVWNTIHPWLGGVGIS-----SCLVTFFVALYYN----VIITWCFYYLFNSF 953
              +      +   +   L  + +S       L+  F++L        I       L    
Sbjct: 119  LLLGGSGFSLSLLLGALLLLLLLSLLALLLILLILFLSLLLRNSAVAIGIGLLLILLGLL 178

Query: 954  SSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPNIES---PGSPKWWIVLCLL 1010
              SLP           + P         + +    + +S ++            I L ++
Sbjct: 179  LGSLPLWL--------LFP---------WSYLPLIVLLSLSLNGTLLSSGLSIIIALIVI 221

Query: 1011 LAWIIVFFI 1019
            L   I+F I
Sbjct: 222  LILFIIFLI 230


>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
           or GLGF (after a conserved sequence motif). Many PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. Heterodimerization through
           PDZ-PDZ domain interactions adds to the domain's
           versatility, and PDZ domain-mediated interactions may be
           modulated dynamically through target phosphorylation.
           Some PDZ domains play a role in scaffolding
           supramolecular complexes. PDZ domains are found in
           diverse signaling proteins in bacteria, archebacteria,
           and eurkayotes. This CD contains two distinct structural
           subgroups with either a N- or C-terminal beta-strand
           forming the peptide-binding groove base. The circular
           permutation placing the strand on the N-terminus appears
           to be found in Eumetazoa only, while the C-terminal
           variant is found in all three kingdoms of life, and
           seems to co-occur with protease domains. PDZ domains
           have been named after PSD95(post synaptic density
           protein), DlgA (Drosophila disc large tumor suppressor),
           and ZO1, a mammalian tight junction protein.
          Length = 70

 Score = 35.4 bits (82), Expect = 0.017
 Identities = 16/46 (34%), Positives = 24/46 (52%)

Query: 390 CYVSRVEDKSVAMHNGLIKGDEIMVINGAIVSDLDMMYLESVLQEE 435
             V  VE  S A   GL  GD I+ +NG  V +L +  +  +L++E
Sbjct: 15  VVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKE 60


>gnl|CDD|216670 pfam01733, Nucleoside_tran, Nucleoside transporter.  This is a family
            of nucleoside transporters. In mammalian cells nucleoside
            transporters transport nucleoside across the plasma
            membrane and are essential for nucleotide synthesis via
            the salvage pathways for cells that lack their own de
            novo synthesis pathways. Also in this family is mouse and
            human nucleolar protein HNP36, a protein of unknown
            function; although it has been hypothesised to be a
            plasma membrane nucleoside transporter.
          Length = 305

 Score = 38.2 bits (89), Expect = 0.027
 Identities = 31/104 (29%), Positives = 46/104 (44%), Gaps = 10/104 (9%)

Query: 998  PGSPKWWIVL--CLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGA 1055
            PG    W+ +   L L +I +F +     QS    V+F S + +I+L   F    +  G 
Sbjct: 204  PGQDPRWLPVLSILRLLFIPLFLLCNYKPQSRALPVFFESDWLFIILMALF--AFS-NGY 260

Query: 1056 GAGLVHMYTPKVEKLLDPNVWLDAA--TQVFYSFGLAFGSLIAF 1097
             + L  MY PK    + P+    A   T  F   GLA GS+ +F
Sbjct: 261  LSSLAMMYAPKQ---VPPHEAETAGMITVFFLILGLALGSVFSF 301


>gnl|CDD|241389 cd13235, PH2_FARP1-like, FERM, RhoGEF and pleckstrin
           domain-containing protein 1 and related proteins
           Pleckstrin Homology (PH) domain, repeat 2.  Members here
           include FARP1 (also called Chondrocyte-derived
           ezrin-like protein; PH domain-containing family C member
           2), FARP2 (also called FIR/FERM domain including RhoGEF;
           FGD1-related Cdc42-GEF/FRG), and FARP6 (also called Zinc
           finger FYVE domain-containing protein 24). They are
           members of the Dbl family guanine nucleotide exchange
           factors (GEFs) which are upstream positive regulators of
           Rho GTPases. Little is known about FARP1 and FARP6,
           though FARP1 has increased expression in differentiated
           chondrocytes. FARP2 is thought to regulate neurite
           remodeling by mediating the signaling pathways from
           membrane proteins to Rac. It is found in brain, lung,
           and testis, as well as embryonic hippocampal and
           cortical neurons. FARP1 and FARP2 are composed of a
           N-terminal FERM domain, a proline-rich (PR) domain,
           Dbl-homology (DH), and two C-terminal PH domains. FARP6
           is composed of Dbl-homology (DH), and two C-terminal PH
           domains separated by a FYVE domain. This hierarchy
           contains the second PH repeat. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 113

 Score = 35.7 bits (83), Expect = 0.032
 Identities = 29/104 (27%), Positives = 44/104 (42%), Gaps = 18/104 (17%)

Query: 7   WLLRK-KHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAP-KHLIIVDGAIMQ 64
           +LLRK K+        GW+  WV      L FY    +  +  D  P   L ++  ++  
Sbjct: 23  YLLRKFKNS------NGWQKLWVVFTNFCLFFY----KSHQ--DEFPLASLPLLGYSVGL 70

Query: 65  PIPE-HPKRDYIFCLSTAFGD-AYLFQAPCQVELENWVNSIHSA 106
           P    +  +DY+F L   F    Y F+A  +   E W+  I SA
Sbjct: 71  PSEADNIDKDYVFKLQ--FKSHVYFFRAESEYTFERWMEVIRSA 112


>gnl|CDD|233677 TIGR01995, PTS-II-ABC-beta, PTS system, beta-glucoside-specific
           IIABC component.  This model represents a family of PTS
           enzyme II proteins in which all three domains are found
           in the same polypeptide chain and which appear to have a
           broad specificity for beta-glucosides including salicin
           (beta-D-glucose-1-salicylate) and arbutin
           (Hydroquinone-O-beta-D-glucopyranoside). These are
           distinct from the closely related sucrose-specific and
           trehalose-specific PTS transporters.
          Length = 610

 Score = 38.5 bits (90), Expect = 0.034
 Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 18/117 (15%)

Query: 851 PAIMSCILVSSIICRFLKKPSYSAWNRAFLIPFMVMLILEGIPLFLIELG-----IGQKM 905
           P I+  + + S + +FLKK    A  + FL P +VMLI   +PL L+ +G      G+ +
Sbjct: 213 PVIL-AVWLMSYVEKFLKKVIPGAL-KNFLTPLLVMLIT--VPLTLLIIGPLGNYAGEGI 268

Query: 906 RLGSLGVWNTIHPWLGGV---GISSCLVTF-----FVALYYNVIITWCFYYLFNSFS 954
             G L +   + PWL G     +   LV F      + +  N I T  F  L    S
Sbjct: 269 SSGILFL-YEVSPWLAGALLAALWPVLVMFGLHWAMIPIVINNIATQGFDPLILPAS 324


>gnl|CDD|241291 cd01260, PH_CNK_mammalian-like, Connector enhancer of KSR (Kinase
           suppressor of ras) (CNK) pleckstrin homology (PH)
           domain.  CNK family members function as protein
           scaffolds, regulating the activity and the subcellular
           localization of RAS activated RAF. There is a single CNK
           protein present in Drosophila and Caenorhabditis elegans
           in contrast to mammals which have 3 CNK proteins (CNK1,
           CNK2, and CNK3). All of the CNK members contain a
           sterile a motif (SAM), a conserved region in CNK (CRIC)
           domain, and a PSD-95/DLG-1/ZO-1 (PDZ) domain, and, with
           the exception of CNK3, a PH domain. A CNK2 splice
           variant CNK2A also has a PDZ domain-binding motif at its
           C terminus and Drosophila CNK (D-CNK) also has a domain
           known as the Raf-interacting region (RIR) that mediates
           binding of the Drosophila Raf kinase. This cd contains
           CNKs from mammals, chickens, amphibians, fish, and
           crustacea. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 114

 Score = 35.1 bits (81), Expect = 0.052
 Identities = 25/107 (23%), Positives = 43/107 (40%), Gaps = 22/107 (20%)

Query: 7   WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPI 66
           WL +KK       ++ WK YW  LKG++L +Y        +     K     +G I   +
Sbjct: 18  WLWKKKEAKGFFGQK-WKRYWFVLKGSSLYWY--------NNPQDEK----AEGFI--NL 62

Query: 67  PEH-------PKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSA 106
           P+         K+ Y F  S      + F A    ++  W++ + +A
Sbjct: 63  PDFKIERASECKKKYAFKASHPKIKTFYFAAENLDDMNKWLSKLITA 109


>gnl|CDD|223802 COG0730, COG0730, Predicted permeases [General function prediction
            only].
          Length = 258

 Score = 37.0 bits (86), Expect = 0.052
 Identities = 18/86 (20%), Positives = 32/86 (37%), Gaps = 11/86 (12%)

Query: 1127 VVVFAILGFKAMSNAAEGTGLAFIA-AEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLS 1185
            +V   +L        A  T LA I      G A  +F    V+      W +   + + S
Sbjct: 163  IVPALLLLLLLPLKLAVATSLAIILNTASNGAALYLFALGAVD------WPLALLLAVGS 216

Query: 1186 LGLGSQIGILEGVLCTIFDIEIFKRI 1211
            + LG+ +G     L      ++ +R+
Sbjct: 217  I-LGAYLGAR---LARRLSPKVLRRL 238


>gnl|CDD|241391 cd13237, PH2_FGD5_FGD6, FYVE, RhoGEF and PH domain
           containing/faciogenital dysplasia proteins 5 and 6
           pleckstrin homology (PH) domain, C-terminus.  FGD5
           regulates promotes angiogenesis of vascular endothelial
           growth factor (VEGF) in vascular endothelial cells,
           including network formation, permeability, directional
           movement, and proliferation. The specific function of
           FGD6 is unknown. In general, FGDs have a RhoGEF (DH)
           domain, followed by a PH domain, a FYVE domain and a
           C-terminal PH domain. All FGDs are guanine nucleotide
           exchange factors that activate the Rho GTPase Cdc42, an
           important regulator of membrane trafficking. The RhoGEF
           domain is responsible for GEF catalytic activity, while
           the PH domain is involved in intracellular targeting of
           the DH domain. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 90

 Score = 34.3 bits (79), Expect = 0.075
 Identities = 23/97 (23%), Positives = 34/97 (35%), Gaps = 13/97 (13%)

Query: 7   WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPI 66
           +L R+K      +K+ WK  W  LK   L  Y        S D      I + G  + P 
Sbjct: 4   YLYRRK-----RKKKSWKRLWFVLKDKVLYTYK------ASEDVVALESIPLLGYTVVPA 52

Query: 67  PE--HPKRDYIFCLSTAFGDAYLFQAPCQVELENWVN 101
            E        +F L       Y+F+A      + W+ 
Sbjct: 53  KEGFEGDESLVFQLLHKGQLPYIFRADDAETAQRWIE 89


>gnl|CDD|226953 COG4587, COG4587, ABC-type uncharacterized transport system,
           permease component [General function prediction only].
          Length = 268

 Score = 35.5 bits (82), Expect = 0.16
 Identities = 30/187 (16%), Positives = 50/187 (26%), Gaps = 36/187 (19%)

Query: 776 LISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEMTG 835
           ++  L GL+     G  W+ + DS     GL    +       F+               
Sbjct: 27  ILWRLSGLLPLILMGYLWVAVADSSPQINGLTPGDVARYFFATFVVR------------- 73

Query: 836 YKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNRAF-LIPFMVMLILEGIPL 894
                  Q+T  +        +    +  R L+   Y     A  L       +   + L
Sbjct: 74  -------QLTTVWSIWEFEKEVREGELSPRLLRPLDYLFHELAAHLGERASRGLPFLLVL 126

Query: 895 FLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVIITWCFYYLFNSFS 954
            LI   +     L  L  W               L    +AL + +     F   F  F+
Sbjct: 127 LLIFALLYGA-ILQFLSPWTLY------------LFVLALALLFLLRFLIQF--TFGLFA 171

Query: 955 SSLPWSS 961
                +S
Sbjct: 172 FWTERAS 178


>gnl|CDD|241530 cd13379, PH_RhoGap24, Rho GTPase activating protein 24 Pleckstrin
           homology (PH) domain.  RhoGap24 (also called ARHGAP24,
           p73RhoGAp, and Filamin-A-associated RhoGAP) like other
           RhoGAPs are involved in cell polarity, cell morphology
           and cytoskeletal organization. They act as GTPase
           activators for the Rac-type GTPases by converting them
           to an inactive GDP-bound state and control actin
           remodeling by inactivating Rac downstream of Rho leading
           to suppress leading edge protrusion and promotes cell
           retraction to achieve cellular polarity and are able to
           suppress RAC1 and CDC42 activity in vitro.
           Overexpression of these proteins induces cell rounding
           with partial or complete disruption of actin stress
           fibers and formation of membrane ruffles, lamellipodia,
           and filopodia. Members here contain an N-terminal PH
           domain followed by a RhoGAP domain and either a BAR or
           TATA Binding Protein (TBP) Associated Factor 4 (TAF4)
           domain. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 114

 Score = 33.8 bits (77), Expect = 0.18
 Identities = 31/115 (26%), Positives = 45/115 (39%), Gaps = 33/115 (28%)

Query: 19  RKRG-----WKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQP---IPEHP 70
           RK+G     W   W  LKG  L ++  D  E + +           G I  P   + EHP
Sbjct: 10  RKQGGFVKTWHTRWFVLKGDQLYYFK-DEDETKPL-----------GTIFLPGNRVTEHP 57

Query: 71  KRD-----YIFCL--------STAFGDAYLFQAPCQVELENWVNSIHSACAAAFA 112
             +     ++F +         TA  + YL  A  Q ++E+WV SI     A F 
Sbjct: 58  CNEEEPGKFLFEVVPGGDRERMTANHETYLLMASTQNDMEDWVKSIRRVIWAPFG 112


>gnl|CDD|241369 cd13215, PH-GRAM1_AGT26, Autophagy-related protein 26/Sterol
           3-beta-glucosyltransferase Pleckstrin homology (PH)
           domain, repeat 1.  ATG26 (also called
           UGT51/UDP-glycosyltransferase 51), a member of the
           glycosyltransferase 28 family, resulting in the
           biosynthesis of sterol glucoside. ATG26 in decane
           metabolism and autophagy. There are 32 known
           autophagy-related (ATG) proteins, 17 are components of
           the core autophagic machinery essential for all
           autophagy-related pathways and 15 are the additional
           components required only for certain pathways or
           species. The core autophagic machinery includes 1) the
           ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and
           ATG27), 2) the phosphatidylinositol 3-kinase complex
           (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the
           ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7,
           ATG8, ATG10, ATG12, and ATG16). Less is known about how
           the core machinery is adapted or modulated with
           additional components to accommodate the nonselective
           sequestration of bulk cytosol (autophagosome formation)
           or selective sequestration of specific cargos (Cvt
           vesicle, pexophagosome, or bacteria-containing
           autophagosome formation). The pexophagosome-specific
           additions include the ATG30-ATG11-ATG17
           receptor-adaptors complex, the coiled-coil protein
           ATG25, and the sterol glucosyltransferase ATG26. ATG26
           is necessary for the degradation of medium peroxisomes.
           It contains 2 GRAM domains and a single PH domain. PH
           domains are only found in eukaryotes. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. PH domains also have
           diverse functions. They are often involved in targeting
           proteins to the plasma membrane, but few display strong
           specificity in lipid binding. Any specificity is usually
           determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 130

 Score = 33.7 bits (78), Expect = 0.20
 Identities = 23/106 (21%), Positives = 37/106 (34%), Gaps = 32/106 (30%)

Query: 9   LRKKHQIELARKRGWKGYWVCLKGTTL--------LFYP---CDSREGRSVDAAPKHLII 57
           L K+ +    R   +  YW  LKG  L        L++P    D R G S +   K    
Sbjct: 35  LSKRGK----RTPRYNRYWFVLKGDVLSWYRSSTDLYFPHGTIDLRYGISAEITDKD--- 87

Query: 58  VDGAIMQPIPEHPKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSI 103
                     +       F ++T     Y F+A  +   + WV ++
Sbjct: 88  ----------KETTH---FKITTN-SRTYTFKADSEPSAKEWVKAL 119


>gnl|CDD|217759 pfam03845, Spore_permease, Spore germination protein. 
          Length = 320

 Score = 34.9 bits (81), Expect = 0.33
 Identities = 25/154 (16%), Positives = 50/154 (32%), Gaps = 31/154 (20%)

Query: 998  PGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMF---PYIVLTIFFIRGIT--- 1051
            P +P W  ++ L L  + ++ ++ KGI+   R   F         + + IF ++      
Sbjct: 109  PDTPTW--IIILTLLLVSIY-LISKGIEVLAREFCFIIPILLIILLFILIFSLKQAHIKN 165

Query: 1052 -LPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNN---C 1107
              P  G G+  +              L  +      FG     L+ +        +    
Sbjct: 166  LRPVLGNGISPV--------------LKGSKTTSLPFGGFEILLMLYPFMKDKKKSKKAA 211

Query: 1108 VRDVIMVSICNALTAIYASVVVFAILGFKAMSNA 1141
            +  V +  +   LT    + +V  +LG       
Sbjct: 212  LLGVGISGLLYTLT----TFIVIGVLGVDQTKTL 241


>gnl|CDD|223884 COG0814, SdaC, Amino acid permeases [Amino acid transport and
            metabolism].
          Length = 415

 Score = 34.9 bits (81), Expect = 0.35
 Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 17/222 (7%)

Query: 1000 SPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGAGL 1059
             P+    L   L   ++ F+   G  +  ++   TS+  +  +    +  + L       
Sbjct: 126  LPRKLGSLIFAL---VLAFLSWLGTLAVLKI---TSLLVFGKVIYLVLLVVYLIPHWNPA 179

Query: 1060 VHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNA 1119
                 P   +     + L     VF SFG            N    N  + V    +  +
Sbjct: 180  NLFALPSASQSFWKYLLLAIPVFVF-SFG---FHGNIPSLVNYMRKNSKKAVRKAILIGS 235

Query: 1120 LTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPFWSIIF 1179
            L A+    V++ ++GF              +AA+   ++ +     ++  P       IF
Sbjct: 236  LIAL----VLYILVGFFVFGCFGSLVFGNILAAKEQNISLLSALAGVINSPILSIALNIF 291

Query: 1180 FMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKPILTEALK 1221
             +  +     S +G+  G+   + D+      +       L 
Sbjct: 292  ALFAI---ATSFLGVYLGLFEGLADLFKKSNSKPGRKKTGLL 330


>gnl|CDD|241424 cd13270, PH1_TAPP1_2, Tandem PH-domain-containing proteins 1 and 2
           Pleckstrin homology (PH) domain, N-terminal repeat.  The
           binding of TAPP1 (also called PLEKHA1/pleckstrin
           homology domain containing, family A (phosphoinositide
           binding specific) member 1) and TAPP2 (also called
           PLEKHA2) adaptors to PtdIns(3,4)P(2), but not PI(3,4,
           5)P3, function as negative regulators of insulin and
           PI3K signalling pathways (i.e. TAPP/utrophin/syntrophin
           complex). TAPP1 and TAPP2 contain two sequential PH
           domains in which the C-terminal PH domain binds
           PtdIns(3,4)P2. They also contain a C-terminal
           PDZ-binding motif that interacts with several
           PDZ-binding proteins, including PTPN13 (known previously
           as PTPL1 or FAP-1) as well as the scaffolding proteins
           MUPP1 (multiple PDZ-domain-containing protein 1),
           syntrophin and utrophin. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 118

 Score = 32.8 bits (75), Expect = 0.37
 Identities = 12/38 (31%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 70  PKRDYIFCLSTAFGDAYLFQAPCQVELENWVNSIHSAC 107
           PK ++ F ++      Y  QA  Q +LE WV +++ A 
Sbjct: 75  PKAEFCFVINALS-RRYFLQANDQQDLEEWVEALNDAS 111


>gnl|CDD|216682 pfam01757, Acyl_transf_3, Acyltransferase family.  This family
            includes a range of acyltransferase enzymes. This domain
            is found in many as yet uncharacterized C. elegans
            proteins and it is approximately 300 amino acids long.
          Length = 326

 Score = 34.4 bits (79), Expect = 0.49
 Identities = 24/226 (10%), Positives = 52/226 (23%), Gaps = 55/226 (24%)

Query: 838  PGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNRA----------------FLI 881
              +  +   R L P +   +L   +         Y                      FL 
Sbjct: 70   FKFLKKRLLRLLIPYLFWSLLYFLLGLLVGGLSVYGLLLLLLLLGLLFGGGPNGHLWFLP 129

Query: 882  PFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVI 941
               V  +L  +PL L                       L  + + + L+   +     ++
Sbjct: 130  ALFVFYLL--LPLLLR---------------LLRKLKKLLLLLLLALLLLLSLLYILILL 172

Query: 942  ITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPNIESPGSP 1001
            +      L         +                          R    +          
Sbjct: 173  VGLPPTVLNLLIGLLPFFLLG-------------------ALLARYRKRIRSKRL---LL 210

Query: 1002 KWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFI 1047
               I+L L L  +I+  + + G+      +Y       ++L +  +
Sbjct: 211  LIVILLALALLALILLLLFLFGLVYLAPELYGYFSLLLLLLGVLLL 256


>gnl|CDD|148404 pfam06781, UPF0233, Uncharacterized protein family (UPF0233). 
          Length = 87

 Score = 31.5 bits (72), Expect = 0.53
 Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 5/39 (12%)

Query: 986  RTTLDVSPNIESPGSPKWWIVL---CLL--LAWIIVFFI 1019
             T        +   SP W++ L    +L  L W++V+++
Sbjct: 15   PTASRTPVKKKKKPSPVWFVPLMVGLMLLGLLWLVVYYL 53


>gnl|CDD|117544 pfam08978, Reoviridae_Vp9, Reoviridae VP9.  This domain is found in
           various VP9 viral outer-coat proteins. It has no known
           function.
          Length = 280

 Score = 33.9 bits (77), Expect = 0.56
 Identities = 24/100 (24%), Positives = 42/100 (42%), Gaps = 9/100 (9%)

Query: 889 LEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVIITWCFYY 948
           L GI   +  +  G++M LG +      +PW G    +   +T    L+ N I       
Sbjct: 107 LAGINRKITNVMSGERMILGHIAAGTLANPWSGKC--AGLFITNEHNLHDNQIEPGKICV 164

Query: 949 LFN-------SFSSSLPWSSCPTDHNGTIIPECELSSETA 981
           LF        + S+S  +S+C   H      + +L++ET+
Sbjct: 165 LFITSLSTTANKSNSFAYSACSIPHEDWDFKKIKLTAETS 204


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 34.5 bits (79), Expect = 0.62
 Identities = 27/144 (18%), Positives = 45/144 (31%), Gaps = 21/144 (14%)

Query: 1422 RHSSTGKSAQEQSMNKSSLNPGCSQTLEKSKKPPRPPQRHSSGSIEYEQQAIIGQPQQGT 1481
               S+  S    S NK S       +  K ++    P    S S             +  
Sbjct: 775  CSPSSSSSHHHSSSNKESR----KSSRNKEEEMLPSPSSPLSSS-----------SPKPE 819

Query: 1482 QPSYVQEPQSTHSTFRGRNKPLNESQVSQDEGDKEVKHDCDPGTKSEGEEDSITGAVKKT 1541
             PS  +  +   ++    + P + S                   K+EG+  S +   K +
Sbjct: 820  HPSRKRPRRQEDTS--SSSGPFSASSTKSSSKSSSTSKH----RKTEGKGSSTSKEHKGS 873

Query: 1542 LGRTPNHLTLSTTSTLSAGSTGSQ 1565
             G TPN  +      LS GS+  +
Sbjct: 874  SGDTPNKASSFPVPPLSNGSSKPR 897


>gnl|CDD|241417 cd13263, PH_RhoGap25-like, Rho GTPase activating protein 25 and
           related proteins Pleckstrin homology (PH) domain.
           RhoGAP25 (also called ArhGap25) like other RhoGaps are
           involved in cell polarity, cell morphology and
           cytoskeletal organization. They act as GTPase activators
           for the Rac-type GTPases by converting them to an
           inactive GDP-bound state and control actin remodeling by
           inactivating Rac downstream of Rho leading to suppress
           leading edge protrusion and promotes cell retraction to
           achieve cellular polarity and are able to suppress RAC1
           and CDC42 activity in vitro. Overexpression of these
           proteins induces cell rounding with partial or complete
           disruption of actin stress fibers and formation of
           membrane ruffles, lamellipodia, and filopodia. This
           hierarchy contains RhoGAP22, RhoGAP24, and RhoGAP25.
           Members here contain an N-terminal PH domain followed by
           a RhoGAP domain and either a BAR or TATA Binding Protein
           (TBP) Associated Factor 4 (TAF4) domain. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 114

 Score = 31.9 bits (73), Expect = 0.67
 Identities = 25/93 (26%), Positives = 42/93 (45%), Gaps = 16/93 (17%)

Query: 21  RGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPIPEHPKRD--YIFCL 78
           + W+  W  L+G  L +Y  D  E +     P+  I + G  ++ +P +P+    ++F +
Sbjct: 17  KNWQRRWFVLRGDQLYYYK-DEDESK-----PQGCIPLPGNTVKELPFNPEEPGKFLFEI 70

Query: 79  STAFG--------DAYLFQAPCQVELENWVNSI 103
               G        D+YL  A  Q E+E WV  I
Sbjct: 71  IPGDGGTRRSANHDSYLLMANSQAEMEEWVKVI 103


>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
           zinc metalloprotases, presumably membrane-associated or
           integral membrane proteases, which may be involved in
           signalling and regulatory mechanisms. May be responsible
           for substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, and binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 79

 Score = 30.7 bits (70), Expect = 0.91
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query: 391 YVSRVEDKSVAMHNGLIKGDEIMVINGAIVSD 422
            +  V   S A   GL  GD I+ ING  +  
Sbjct: 15  VIGEVVPGSPAAKAGLKAGDRILAINGQKIKS 46


>gnl|CDD|176352 cd00196, UBQ, Ubiquitin-like proteins.  Ubiquitin homologs;
           Includes ubiquitin and ubiquitin-like proteins.
           Ubiquitin-mediated proteolysis is part of the regulated
           turnover of proteins required for controlling cell cycle
           progression. Other family members are protein modifiers
           that perform a wide range of functions. Ubiquitination
           usually results in a covalent bond between the
           C-terminus of ubiquitin and the epsilon-amino group of a
           substrate lysine. The three-step mechanism requires an
           activating enzyme (E1) that forms a thiol ester with the
           C-terminal carboxy group, a conjugating enzyme (E2) that
           transiently carries the activated ubiquitin molecule as
           a thiol ester, and a ligase (E3) that transfers the
           activated ubiquitin from the E2 to the substrate lysine
           residue. In poly-ubiquitination, ubiquitin itself is the
           substrate.
          Length = 69

 Score = 30.2 bits (68), Expect = 1.0
 Identities = 11/75 (14%), Positives = 25/75 (33%), Gaps = 8/75 (10%)

Query: 278 KVSLPENQYAHVYLKDSMTVEEFLANACSRKNLNPMEHFVRVKKRREMEDHNYFVPPRGD 337
           KV L + +   + +    TV +       +  L P +  + V  +         +P    
Sbjct: 1   KVKLNDGKTVELLVPSGTTVADLKEKLAKKLGLPPEQQRLLVNGK--------ILPDSLT 52

Query: 338 LIETYLSTHEVVEVC 352
           L +  L   + + + 
Sbjct: 53  LEDYGLQDGDELVLV 67


>gnl|CDD|217315 pfam03006, HlyIII, Haemolysin-III related.  Members of this family
            are integral membrane proteins. This family includes a
            protein with hemolytic activity from Bacillus cereus. It
            has been proposed that YOL002c encodes a Saccharomyces
            cerevisiae protein that plays a key role in metabolic
            pathways that regulate lipid and phosphate metabolism. In
            eukaryotes, members are seven-transmembrane pass
            molecules found to encode functional receptors with a
            broad range of apparent ligand specificities, including
            progestin and adipoQ receptors, and hence have been named
            PAQR proteins. The mammalian members include progesterone
            binding proteins. Unlike the case with GPCR receptor
            proteins, the evolutionary ancestry of the members of
            this family can be traced back to the Archaea.
          Length = 207

 Score = 32.6 bits (75), Expect = 1.1
 Identities = 15/69 (21%), Positives = 24/69 (34%), Gaps = 3/69 (4%)

Query: 998  PGSPKW-WIVLCLLLAWIIVFFIV--MKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPG 1054
                +W   VL LL+ W+ +  I   +  +   G V+       Y +  IF+      P 
Sbjct: 126  LKRFRWLRTVLYLLMGWLGIIPIKHLILALGGGGLVLLVLGGVLYTLGAIFYALRFPGPF 185

Query: 1055 AGAGLVHMY 1063
               G  H  
Sbjct: 186  DIWGHSHQI 194


>gnl|CDD|241470 cd13316, PH_Boi, Boi family Pleckstrin homology domain.  Yeast Boi
           proteins Boi1 and Boi2 are functionally redundant and
           important for cell growth with Boi mutants displaying
           defects in bud formation and in the maintenance of cell
           polarity.They appear to be linked to Rho-type GTPase,
           Cdc42 and Rho3. Boi1 and Boi2 display two-hybrid
           interactions with the GTP-bound ("active") form of
           Cdc42, while Rho3 can suppress of the lethality caused
           by deletion of Boi1 and Boi2. These findings suggest
           that Boi1 and Boi2 are targets of Cdc42 that promote
           cell growth in a manner that is regulated by Rho3. Boi
           proteins contain a N-terminal SH3 domain, followed by a
           SAM (sterile alpha motif) domain, a proline-rich region,
           which mediates binding to the second SH3 domain of Bem1,
           and C-terminal PH domain. The PH domain is essential for
           its function in cell growth and is important for
           localization to the bud, while the SH3 domain is needed
           for localization to the neck. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 95

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 13/92 (14%)

Query: 19  RKRG-----WKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPIPEHPKR- 72
           +KRG     WK  +  LKGT L +   ++      D+  K LI + G  +     + K  
Sbjct: 6   KKRGERYGTWKTRYFVLKGTRLYYLKSEN------DSKEKGLIDLTGHRVTVDDSNSKPG 59

Query: 73  DYIFCL-STAFGDAYLFQAPCQVELENWVNSI 103
            Y F L   A    + F    +  L  W+ ++
Sbjct: 60  RYGFKLVPPAVEKVHYFAVDEKEVLREWMKAL 91


>gnl|CDD|176490 cd08760, Cyt_b561_FRRS1_like, Eukaryotic cytochrome b(561),
           including the FRRS1 gene product.  Cytochrome b(561), as
           found in eukaryotes, similar to and including the human
           FRRS1 gene product (ferric-chelate reductase 1), also
           called SDR-2 (stromal cell-derived receptor 2). This
           family comprises a variety of domain architectures, many
           of which contain dopamine beta-monooxygenase (DOMON)
           domains. The protein might act as a ferric-chelate
           reductase, catalyzing the reduction of Fe(3+) to Fe(2+),
           such as associated with the transport of iron from the
           endosome to the cytoplasm. It is assumed that this
           protein uses ascorbate as the electron donor. Belongs to
           the cytochrome b(561) family, which are secretory
           vesicle-specific electron transport proteins.
           Cytochromes b(561) are integral membrane proteins that
           bind two heme groups non-covalently, and may have six
           alpha-helical trans-membrane segments.
          Length = 191

 Score = 32.3 bits (74), Expect = 1.2
 Identities = 16/78 (20%), Positives = 28/78 (35%), Gaps = 13/78 (16%)

Query: 844 ITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNRAFLIPFMVMLILEGIPLFLIELGIGQ 903
           I W  L P       + +++ R+        W   F +   + L+   + L +    +G 
Sbjct: 45  IAWGILMP-------IGALLARYFLLGD-PVW---FYLHAGLQLL--AVLLAIAGFVLGI 91

Query: 904 KMRLGSLGVWNTIHPWLG 921
            +  G  G  N  H  LG
Sbjct: 92  VLVQGGGGSLNNAHAILG 109


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large and
            diverse group of secondary transporters that includes
            uniporters, symporters, and antiporters. MFS proteins
            facilitate the transport across cytoplasmic or internal
            membranes of a variety of substrates including ions,
            sugar phosphates, drugs, neurotransmitters, nucleosides,
            amino acids, and peptides. They do so using the
            electrochemical potential of the transported substrates.
            Uniporters transport a single substrate, while symporters
            and antiporters transport two substrates in the same or
            in opposite directions, respectively, across membranes.
            MFS proteins are typically 400 to 600 amino acids in
            length, and the majority contain 12 transmembrane alpha
            helices (TMs) connected by hydrophilic loops. The N- and
            C-terminal halves of these proteins display weak
            similarity and may be the result of a gene
            duplication/fusion event. Based on kinetic studies and
            the structures of a few bacterial superfamily members,
            GlpT (glycerol-3-phosphate transporter), LacY (lactose
            permease), and EmrD (multidrug transporter), MFS proteins
            are thought to function through a single substrate
            binding site, alternating-access mechanism involving a
            rocker-switch type of movement. Bacterial members
            function primarily for nutrient uptake, and as
            drug-efflux pumps to confer antibiotic resistance. Some
            MFS proteins have medical significance in humans such as
            the glucose transporter Glut4, which is impaired in type
            II diabetes, and glucose-6-phosphate transporter (G6PT),
            which causes glycogen storage disease when mutated.
          Length = 352

 Score = 33.1 bits (76), Expect = 1.4
 Identities = 35/219 (15%), Positives = 63/219 (28%), Gaps = 36/219 (16%)

Query: 1000 SPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAGAGL 1059
              +  ++L LLL + +   +                +     L      G   P A A +
Sbjct: 62   GRRRVLLLGLLL-FALGSLL-------LAFASSLWLLLVGRFLLGLGG-GALYPAAAALI 112

Query: 1060 VHMYTPK-VEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICN 1118
               + PK   + L             +  G   G L+            +   ++++I  
Sbjct: 113  AEWFPPKERGRAL-------GLFSAGFGLGALLGPLLGGLLAESLGWRWL--FLILAILG 163

Query: 1119 ALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIV-ELPGAPFWSI 1177
             L A+    ++  +L             LAF          + +    + E+ G      
Sbjct: 164  LLLALLLLFLLRLLLLL----------ALAFFLLSFGYYGLLTYLPLYLQEVLGLSAAEA 213

Query: 1178 IFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKPIL 1216
               + L  LG     GIL G L      +   R R  +L
Sbjct: 214  GLLLSLFGLG-----GIL-GALLGGLLSDRLGRRRLLLL 246


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 33.1 bits (76), Expect = 1.4
 Identities = 21/91 (23%), Positives = 38/91 (41%), Gaps = 6/91 (6%)

Query: 93  QVELENWVNSIHSACAAAFARHRGKTGTLHLLQEEIFRLEKAIDSDNKLKHMADLQMSVL 152
           Q +L+  + SI S      AR    TG    +Q +  RL++ I    +   +      + 
Sbjct: 348 QADLDQ-LPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERD-LEKNNERLA 405

Query: 153 AEPETKQQINEQITQWEENLERLHCEQFRLR 183
           A    +++ + Q    EE+L+ L   Q R +
Sbjct: 406 A---IREEKDRQKAAIEEDLQAL-ESQLRQQ 432


>gnl|CDD|191296 pfam05514, HR_lesion, HR-like lesion-inducing.  Family of plant
            proteins that are associated with the hypersensitive
            response (HR) pathway of defence against plant pathogens.
          Length = 138

 Score = 31.2 bits (71), Expect = 1.6
 Identities = 21/93 (22%), Positives = 33/93 (35%), Gaps = 9/93 (9%)

Query: 1075 VWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILG 1134
            V+L +A Q F  FG   G         +P  N     +   +   +  I    +V A++ 
Sbjct: 15   VFLLSAWQEFSEFGADGGPA---AKELRPKFNMFSKHVSKHLGMQVPHIDVKHIVAALIA 71

Query: 1135 FKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIV 1167
             K +       GL FI     G   ++  QA  
Sbjct: 72   LKGLG------GLLFIFGSSFGAYLLLVYQAFA 98


>gnl|CDD|241435 cd13281, PH_PLEKHD1, Pleckstrin homology (PH) domain containing,
           family D (with coiled-coil domains) member 1 PH domain. 
           Human PLEKHD1 (also called UPF0639, pleckstrin homology
           domain containing, family D (with M protein repeats)
           member 1) is a single transcript and contains a single
           PH domain. PLEKHD1 is conserved in human, chimpanzee, ,
           dog, cow, mouse, chicken, zebrafish, and Caenorhabditis
           elegans. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 139

 Score = 31.2 bits (71), Expect = 1.7
 Identities = 19/95 (20%), Positives = 38/95 (40%), Gaps = 10/95 (10%)

Query: 19  RKRGWKGYWVCLKGTTLLFYPC----DSREGRSVDAAPKHLIIVDGAIMQPIPEHPKRDY 74
           +   W   +  LK   LL+Y      D  + R  +  PK +I + G  ++   +  +   
Sbjct: 26  QSAKWSKRFFVLKECFLLYYAESEKKDFEKTRRFNIHPKGVIPLGGCSIEAGRDPGRP-- 83

Query: 75  IFCLS---TAFGDAYLFQAPCQVELENWVNSIHSA 106
            +C       F  + +  A  + E E+W++ +  A
Sbjct: 84  -YCFLISHPDFKGSIILAADSEEEQESWLDMLREA 117


>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain. 
          Length = 81

 Score = 29.9 bits (68), Expect = 1.8
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 390 CYVSRVEDKSVAMHNGLIKGDEIMVINGAIVSDLD 424
             V  V++ S A   GL  GD I+ I+G  V+ L 
Sbjct: 15  VTVVSVKEGSPAAKAGLKPGDIILSIDGKKVNSLT 49


>gnl|CDD|212052 cd11483, SLC-NCS1sbd_Mhp1-like, nucleobase-cation-symport-1 (NCS1)
           transporter Mhp1-like; solute-binding domain.  This NCS1
           subfamily includes Microbacterium liquefaciens Mhp1, and
           various uncharacterized NCS1s. Mhp1 mediates the uptake
           of indolyl methyl- and benzyl-hydantoins as part of a
           metabolic salvage pathway for their conversion to amino
           acids. Mhp1 has 12 transmembrane (TM) helices (an
           inverted topology repeat: TMs1-5 and TMs6-10, and
           TMs11-12; TMs numbered to conform to the Solute carrier
           6 (SLC6) family Aquifex aeolicus LeuT). NCS1s are
           essential components of salvage pathways for nucleobases
           and related metabolites; their other known substrates
           include allantoin, uracil, thiamine, and nicotinamide
           riboside. NCS1s belong to a superfamily which also
           contains the solute carrier 5 family sodium/glucose
           transporters (SLC5s), and SLC6 neurotransmitter
           transporters.
          Length = 434

 Score = 32.6 bits (75), Expect = 2.0
 Identities = 39/177 (22%), Positives = 58/177 (32%), Gaps = 32/177 (18%)

Query: 773 VVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYE 832
           V+  I F    I  T  G   +K  ++ A         +I +L  MF     K+   I E
Sbjct: 132 VLWFIIFAALQIVNTAYGFKAIKWLENIAA------PVIIAILVYMFYTLLTKYGVSIGE 185

Query: 833 MTGYKPGWYWQITW--RFLGPAIM-------SCILVSSIICRFLKKPS-YSAWN----RA 878
                  W ++ TW   F   AIM       +  L      R LK    Y  +     R 
Sbjct: 186 NL-----WSFEGTWGMPFWA-AIMAFMGGWSTMALNIPDYSRELKPGPNYEGFFKRNKRR 239

Query: 879 FLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVA 935
            +  F+ ++ +           IG  M   + G WN I+     V     LV   + 
Sbjct: 240 GIAQFIALVPVTVFMGL-----IGA-MSGIATGEWNPINVIQEVVDNKFLLVLTLLF 290


>gnl|CDD|217855 pfam04039, MnhB, Domain related to MnhB subunit of Na+/H+ antiporter.
             Possible subunit of Na+/H+ antiporter. Predicted
            integral membrane protein, usually four transmembrane
            regions in this domain. Often found in bacterial NADH
            dehydrogenase subunit.
          Length = 125

 Score = 30.8 bits (70), Expect = 2.0
 Identities = 24/114 (21%), Positives = 40/114 (35%), Gaps = 12/114 (10%)

Query: 1089 LAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLA 1148
            L FG+ +    +  P       +I+      L   +   VV   L               
Sbjct: 16   LVFGAYLFLRGHLSPGGGFQGGLIIAIAFLLLYIAFGIEVVEKFLPT------------T 63

Query: 1149 FIAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILEGVLCTI 1202
            +IA EG GL   + T   + L G PF +  F  ++   GL + +    G+   +
Sbjct: 64   YIALEGIGLLIALLTLVGLLLFGWPFLNAFFLPLVGEAGLITALNFDVGLKVGV 117


>gnl|CDD|234827 PRK00733, hppA, membrane-bound proton-translocating
           pyrophosphatase; Validated.
          Length = 666

 Score = 32.4 bits (75), Expect = 2.1
 Identities = 21/71 (29%), Positives = 30/71 (42%), Gaps = 17/71 (23%)

Query: 768 LHLVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFT 827
           L +V       +L+G          WL LF   A  IGLV+ ALI ++        E +T
Sbjct: 279 LSIVLTYFATYWLLGDGADGFT---WLNLF--GAVLIGLVVGALIGLIT-------EYYT 326

Query: 828 NDIYEMTGYKP 838
           +     T Y+P
Sbjct: 327 S-----TEYRP 332


>gnl|CDD|241461 cd13307, PH2_AFAP, Actin filament associated protein family
           Pleckstrin homology (PH) domain, repeat 2.  There are 3
           members of the AFAP family of adaptor proteins: AFAP1,
           AFAP1L1, and AFAP1L2/XB130. AFAP1 is a cSrc binding
           partner and actin cross-linking protein. AFAP1L1 is
           thought to play a similar role to AFAP1 in terms of
           being an actin cross-linking protein, but it
           preferentially binds to cortactin and not cSrc, thereby
           playing a role in invadosome formation. AFAP1L2 is a
           cSrc binding protein, but does not bind to actin
           filaments. AFAP1L2 acts as an intermediary between the
           RET/PTC kinase and PI-3kinase pathway in the thyroid.
           The AFAPs share a similar structure of a SH3 binding
           motif, 3 SH2 binding motifs, 2 PH domains, a coiled-coil
           region corresponding to the AFAP1 leucine zipper, and an
           actin binding domain. This cd is the second PH domain of
           AFAP. PH domains have diverse functions, but in general
           are involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 101

 Score = 30.4 bits (69), Expect = 2.1
 Identities = 20/88 (22%), Positives = 34/88 (38%), Gaps = 6/88 (6%)

Query: 23  WKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPIPEHPKRDYIFCLSTAF 82
           W+  W C+K   L FY    R        P  L   +  I  P P+HP   Y F +    
Sbjct: 16  WRSRWCCVKDGQLHFY--QDRNKLKSPQQPLPLRGCE-VIPGPDPKHP---YSFRILRNG 69

Query: 83  GDAYLFQAPCQVELENWVNSIHSACAAA 110
            +  + +A    ++  W+  + +   +A
Sbjct: 70  EEVAVLEASSSEDMGRWLGLLLAETGSA 97


>gnl|CDD|219137 pfam06687, SUR7, SUR7/PalI family.  This family consists of several
           fungal-specific SUR7 proteins. Its activity regulates
           expression of RVS161, a homologue of human endophilin,
           suggesting a function for both in endocytosis. The
           protein carries four transmembrane domains and is thus
           likely to act as an anchoring protein for the eisosome
           to the plasma membrane. Eisosomes are the immobile
           protein complexes, that include the proteins Pil1 and
           Lsp1, which co-localise with sites of protein and lipid
           endocytosis at the plasma membrane. SUR7 protein may
           play a role in sporulation. This family also includes
           PalI which is part of a pH signal transduction cascade.
           Based on the similarity of PalI to the yeast Rim9
           meiotic signal transduction component it has been
           suggested that PalI might be a membrane sensor for
           ambient pH.
          Length = 205

 Score = 31.9 bits (73), Expect = 2.1
 Identities = 14/93 (15%), Positives = 31/93 (33%), Gaps = 4/93 (4%)

Query: 768 LHLVGVV-CLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLA-VMFIYGHEK 825
           +H + +   +I+ ++  I    +      L +     +  +   L   L   +F+     
Sbjct: 115 VHPIALFFTVIALILAGILAHFSSPRRGSLVNFLLSLLAFLFTLLAAALDTALFVKARNA 174

Query: 826 FTNDIYEMTGYKPGWYWQITWRFLGPAIMSCIL 858
           F N         P  +    W  +  A++S I 
Sbjct: 175 F-NSNGISASLGPKLFA-FGWLAVAAALLSAIF 205


>gnl|CDD|216972 pfam02322, Cyto_ox_2, Cytochrome oxidase subunit II.  This Family
           consists of cytochrome bd type terminal oxidases that
           catalyzes Quinol dependent, Na+ independent oxygen
           uptake. Members of this family are integral membrane
           proteins andi contain a protohaem IX centre B558. One
           member of the family,cydB from Klebsiella pneumoniae, is
           implicated in having an important role in micro-aerobic
           nitrogen fixation in the enteric bacterium Klebsiella
           pneumoniae.
          Length = 327

 Score = 32.1 bits (74), Expect = 2.1
 Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 30/177 (16%)

Query: 776 LISFLVGLIFTT-------GAGEY---WLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEK 825
           L +FL G+            AG Y   +  L + FA   GL ++AL  +L   ++    K
Sbjct: 124 LAAFLQGVALGNLVQGVPFDAGNYAGGFFDLLNPFALLGGLAVVALYALLGATWL--ILK 181

Query: 826 FTNDIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNRAFLIPFMV 885
              ++ E          ++  +     ++   LVS  +   L  P+   +   F  P+++
Sbjct: 182 TEGELQE-------RARRLAKKLGLVLLVLFALVS--VWVPLAAPAL--FANWFSSPWLL 230

Query: 886 MLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVII 942
           +L    +P+    L +           +        G+     ++   V+L+  ++ 
Sbjct: 231 ILPA--LPVLAALLAVLLLRLRREGLAF-----LASGLAFVLVVLGLAVSLFPYILP 280


>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
            [Transport and binding proteins, Nucleosides, purines and
            pyrimidines].
          Length = 437

 Score = 32.4 bits (74), Expect = 2.2
 Identities = 25/104 (24%), Positives = 44/104 (42%), Gaps = 11/104 (10%)

Query: 998  PGSPKWWIVL--CLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGA 1055
            P     W+ +   L + +I +F +     Q S   V+F     +I+L + F  G +  G 
Sbjct: 333  PDEDSRWLPILSFLRVLFIPLFLLCNY-PQRSRLPVFFPGDAYFIILMLLF--GFS-NGY 388

Query: 1056 GAGLVHMYTPKVEKLLDPNVWLDAA--TQVFYSFGLAFGSLIAF 1097
               L     P+    +DP+    A     +F   GLA G++++F
Sbjct: 389  LGSLSMCLAPRQ---VDPHEREVAGALMVIFLLVGLALGAVLSF 429


>gnl|CDD|162098 TIGR00900, 2A0121, H+ Antiporter protein.  [Transport and binding
            proteins, Cations and iron carrying compounds].
          Length = 365

 Score = 32.3 bits (74), Expect = 2.3
 Identities = 27/183 (14%), Positives = 55/183 (30%), Gaps = 40/183 (21%)

Query: 1041 VLTIFFIRGITLPGAGAGLVHMYTPKV---EKLLDPNVWLDAATQVFYSFGLAF-GSLIA 1096
            V  +  I  I              P +   E+L   N    A   +FY  G    G + A
Sbjct: 95   VYVLAGILAIAQA-FFTPAYQAMLPDLVPEEQLTQANSLSQAVRSLFYIVGPGIGGLMYA 153

Query: 1097 FGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTG 1156
                          +      +A+    +++++ ++   +  ++  +    A +     G
Sbjct: 154  TLG-----------IKWAIWVDAVGFAISALLIVSVRIPELAASEIQALSNAVLRDTREG 202

Query: 1157 LAFI---------VFTQAIVELPGAPFWSIIF--FMM-------------LLSLGLGSQI 1192
            + F+         +    +  L  AP    +F                  L + GLG+ +
Sbjct: 203  IKFVLKNPLLRTLLLLALLFNLVFAPAIVALFPYVQSKYLGRGSTHYGWVLAAFGLGALL 262

Query: 1193 GIL 1195
            G L
Sbjct: 263  GAL 265


>gnl|CDD|241430 cd13276, PH_AtPH1, Arabidopsis thaliana Pleckstrin homolog (PH) 1
           (AtPH1) PH domain.  AtPH1 is expressed in all plant
           tissue and is proposed to be the plant homolog of human
           pleckstrin. Pleckstrin consists of two PH domains
           separated by a linker region, while AtPH has a single PH
           domain with a short N-terminal extension. AtPH1 binds
           PtdIns3P specifically and is thought to be an adaptor
           molecule since it has no obvious catalytic functions. PH
           domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 117

 Score = 30.4 bits (69), Expect = 2.5
 Identities = 18/99 (18%), Positives = 39/99 (39%), Gaps = 7/99 (7%)

Query: 21  RGWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAI-MQPIPEHPKRDYIFCLS 79
           + W+  W  LK   L ++  +       D+ P+ +I +   + ++   E   +++ F +S
Sbjct: 21  KTWRRRWFVLKQGKLFYFKDEDP-----DSEPRGVIDLSDCLTVKSAEEATNKEFAFEVS 75

Query: 80  TAFGDAYLFQAPCQVELENWVNSIHSACAAAFARHRGKT 118
           T     +   A  + E E W+++I  A            
Sbjct: 76  TP-ERTFYLIADSEKEKEEWISAIGRAIVKLSRSKGTID 113


>gnl|CDD|223547 COG0471, CitT, Di- and tricarboxylate transporters [Inorganic ion
            transport and metabolism].
          Length = 461

 Score = 32.3 bits (74), Expect = 2.6
 Identities = 18/74 (24%), Positives = 29/74 (39%), Gaps = 6/74 (8%)

Query: 1119 ALTAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGL-AFIVFTQAIVELPGAPFWSI 1177
            +  A   +++V AI+ +  +  A     LA +      L  FI+   A          S 
Sbjct: 28   SGFAWLFAILVAAIILW--ILEAIPLFALAILVIVLLVLSGFILAISASKLALSGFANST 85

Query: 1178 IFFMM---LLSLGL 1188
            I+ +    LLS GL
Sbjct: 86   IWLIFGLFLLSKGL 99


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 30.9 bits (70), Expect = 2.8
 Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 20/88 (22%)

Query: 121 LHLLQEEIFRLEKAIDSDNKLKHMADLQMSVLAE--PETKQQINEQITQWEENLERLHCE 178
           L LL+E+  RL    + + +L+  A+ ++    +   + K+++ ++I   EEN+ +L   
Sbjct: 52  LELLREDNERLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQL--- 108

Query: 179 QFRLRCYMASLQNGELPNPKSLLTHVSR 206
                         EL   K+L   VSR
Sbjct: 109 --------------ELKA-KNLSDQVSR 121


>gnl|CDD|224372 COG1455, CelB, Phosphotransferase system cellobiose-specific
            component IIC [Carbohydrate transport and metabolism].
          Length = 432

 Score = 32.2 bits (74), Expect = 2.9
 Identities = 40/220 (18%), Positives = 69/220 (31%), Gaps = 70/220 (31%)

Query: 1033 FTSMFPYIVLTIFFIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVF-YSFGLAF 1091
            F +  P  ++   F+    LP                    N W     QV+  +FG+  
Sbjct: 32   FIAAMPLTIVGSIFLLINNLPFLFGA-------DFLGKPFGNGWKGILMQVYNGTFGI-M 83

Query: 1092 GSLIAFG-------SYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMS-NAAE 1143
              L+AFG       SY                  AL A   S+  F I+   +++   A 
Sbjct: 84   SLLVAFGIAYSLAKSYGVD---------------ALAAGLLSLAAFFIVTPLSIAVVPAG 128

Query: 1144 GTGLAFIAAEGTGLAFIVFTQAIV-------------------ELPGA---------PFW 1175
              G A++  +G   A I+   A+V                    +P A         P +
Sbjct: 129  AVGAAWLGGKGLFTAIII---ALVTVEIYTFLVKRNITIKMPESVPPAVSKSFEALIPGF 185

Query: 1176 SIIFFMMLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKPI 1215
             I+    L+++       +L  +        I+  I+ P+
Sbjct: 186  IILSIFGLINI-------LLNSITGGNLITLIYTAIQAPL 218


>gnl|CDD|232788 TIGR00025, Mtu_efflux, ABC transporter efflux protein, DrrB family.
            The seed members for this model are a paralogous family
            of Mycobacterium tuberculosis. Nearly all proteins
            scoring above the noise cutoff are from high-GC
            Gram-positive organisms. The members of this paralogous
            family of efflux proteins are all found in operons with
            ATP-binding chain partners. They are related to a
            putative daunorubicin resistance efflux protein of
            Streptomyces peucetius. This model represents a branch of
            a larger superfamily that also includes NodJ, a part of
            the NodIJ pair of nodulation-triggering signal efflux
            proteins. The members of this branch may all act in
            antibiotic resistance.
          Length = 232

 Score = 31.3 bits (71), Expect = 3.1
 Identities = 20/108 (18%), Positives = 37/108 (34%), Gaps = 3/108 (2%)

Query: 1090 AFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILGFKAMSNAAEGTGL-A 1148
             +G+L   G+   P    +    +  +          +V+  +LGF+    A     L A
Sbjct: 64   RYGALKRLGATPLPRLGILAGRSLAVVARVFLQTLILLVIGFVLGFRFAGGALTALTLGA 123

Query: 1149 FIAAEGTGLAFIVFTQAIVELPGAPFWSIIFFMMLLSLGLGSQIGILE 1196
             I A GT L   +   A   L      ++   +  +   L    G++ 
Sbjct: 124  VIIALGTALFAALGLVAGGTLQAEIVLAVANLVWFIFALLS--AGLVP 169


>gnl|CDD|218348 pfam04956, TrbC, TrbC/VIRB2 family.  Conjugal transfer protein, TrbC
            has been identified as a subunit of the pilus precursor
            in bacteria. The protein undergoes three processing steps
            before gaining its mature cyclic structure. This family
            also contains several VIRB2 type IV secretion proteins.
            The virB2 gene encodes a putative type IV secretion
            system and is known to be a pathogenicity factor in
            Bartonella species.
          Length = 98

 Score = 29.5 bits (67), Expect = 3.2
 Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 7/84 (8%)

Query: 1087 FGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALT---AIYASVVVFAILGFKAMSNAAE 1143
               A   L+A  +  Q + + V  V + +I + LT   A   + +     G  A      
Sbjct: 16   ALAAALLLLASPALAQGSGDPVETV-LQNIVDFLTGPLAKVIATIAIIGAGIMAAFGRLS 74

Query: 1144 GTGLAFIAAEGTGLAFIVFTQAIV 1167
               L  +     G+  I     IV
Sbjct: 75   WRTLVMVVI---GIVIIFGASTIV 95


>gnl|CDD|206188 pfam14018, DUF4234, Domain of unknown function (DUF4234).  This
            presumed integral membrane protein domain is functionally
            uncharacterized. This domain family is found in bacteria
            and archaea, and is approximately 70 amino acids in
            length.
          Length = 75

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 8/42 (19%), Positives = 20/42 (47%), Gaps = 5/42 (11%)

Query: 982  YFWYRTTLDVSP---NIESPGSPKWWIVLCLLLAWIIVFFIV 1020
            Y+ Y+   +++     I+SP S    ++L +L     ++ + 
Sbjct: 20   YWLYKLNRELNRLAGRIDSPRSGGLVLLLSIL--TCGIYGLY 59


>gnl|CDD|218122 pfam04515, Choline_transpo, Plasma-membrane choline transporter.
            This family represents a high-affinity plasma-membrane
            choline transporter in C.elegans which is thought to be
            rate-limiting for ACh synthesis in cholinergic nerve
            terminals.
          Length = 327

 Score = 31.4 bits (72), Expect = 3.5
 Identities = 24/148 (16%), Positives = 45/148 (30%), Gaps = 33/148 (22%)

Query: 1004 WIVLCLLLAWIIVFF-----IVMKGIQSSGRV------VYFTSMFPYIVLTIFFIRGITL 1052
            + VL L+L  +I+F      + +  ++++         +    +  ++ L  F    I  
Sbjct: 6    FTVLTLILLLLIIFLRKRIPLAIALLKTASDAVSKNPSLLLVPLITFLALAAFIALWI-- 63

Query: 1053 PGAGAGLVHMYTPKVEKLLDPNV------WLDAATQVFYSFGLAFGS--LIAFGS----- 1099
                  L     PK      P               V+  FGL + S  ++A        
Sbjct: 64   -VVAVALYTSGDPKYNNGGCPTGGEYKYDSKLIGLLVYLLFGLFWTSEVILAVQHVTIAG 122

Query: 1100 ------YNQPNNNCVRDVIMVSICNALT 1121
                  +     N  +   + S   ALT
Sbjct: 123  VVATWYFAGGKRNMPKFPTLSSFKRALT 150


>gnl|CDD|214978 smart01021, Bac_rhodopsin, Bacteriorhodopsin-like protein.  The
            bacterial opsins are retinal-binding proteins that
            provide light- dependent ion transport and sensory
            functions to a family of halophilic bacteria.. They are
            integral membrane proteins believed to contain seven
            transmembrane (TM) domains, the last of which contains
            the attachment point for retinal (a conserved lysine).
          Length = 233

 Score = 31.1 bits (71), Expect = 3.6
 Identities = 11/41 (26%), Positives = 20/41 (48%)

Query: 1004 WIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTI 1044
            W+V  L+L   +VF ++ + +  S R+ Y  +     V  I
Sbjct: 3    WVVFALMLLGALVFVLLSRRVPDSRRLFYLITALITGVAAI 43


>gnl|CDD|226962 COG4603, COG4603, ABC-type uncharacterized transport system, permease
            component [General function prediction only].
          Length = 356

 Score = 31.4 bits (72), Expect = 3.6
 Identities = 21/90 (23%), Positives = 33/90 (36%), Gaps = 13/90 (14%)

Query: 1121 TAIYASVVVFAILG----FKAMSNAAEGTGLAFIAAEGTGLA-FIVFTQAIVELPGAPFW 1175
              + A+ ++   LG    F+A        GL  I AEG   A  I      +  P  P W
Sbjct: 61   LLVKAAPLILTGLGVAVAFRA--------GLFNIGAEGQFYAGAIAAALVALLFPDLPSW 112

Query: 1176 SIIFFMMLLSLGLGSQIGILEGVLCTIFDI 1205
              +   +L     G    ++ G+L   F +
Sbjct: 113  LALPLALLAGAAGGGLWALIPGLLKARFGV 142


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 31.0 bits (70), Expect = 3.8
 Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 2/53 (3%)

Query: 115 RGKTGTLHLLQEEIFRLEKAIDSDNKLKHMADLQMSVLAEPETKQQINEQITQ 167
           R KT  L  L++EI   E+A++S+ K     D +M  L   E K Q  E++ Q
Sbjct: 7   RNKTDELEKLKDEIKAEEEALESEEKHLKEYDKEMEELL--EEKMQHVEELRQ 57


>gnl|CDD|184529 PRK14133, PRK14133, DNA polymerase IV; Provisional.
          Length = 347

 Score = 31.2 bits (71), Expect = 4.1
 Identities = 36/141 (25%), Positives = 55/141 (39%), Gaps = 28/141 (19%)

Query: 480 ISEEMICDLIVPAPSWSKEHNISEQPPEKVKMPGSLYNRDLL-MSNNNNSSTINNNNVNS 538
           I+E+MI D++ P P  SK H I ++  EK+   G     DLL +S            V  
Sbjct: 161 ITEDMIPDILKPLPI-SKVHGIGKKSVEKLNNIGIYTIEDLLKLSREFLIEYFGKFGV-- 217

Query: 539 NNKGETYEIENLIKSAGEVTGFCRSPVETHRTSPTGSMANQSSLNPSRQLTDAEKL---- 594
               E YE          + G     VE  R     S+  +++L   +   D E+L    
Sbjct: 218 ----EIYE---------RIRGIDYREVEVSRE--RKSIGKETTL--KKDTKDKEELKKYL 260

Query: 595 ---SKVIIELIETERTYVKNL 612
              S +I E ++    Y K +
Sbjct: 261 KDFSNIISEELKKRNLYGKTV 281


>gnl|CDD|216883 pfam02118, Srg, Srg family chemoreceptor. 
          Length = 275

 Score = 31.0 bits (71), Expect = 4.1
 Identities = 31/174 (17%), Positives = 60/174 (34%), Gaps = 49/174 (28%)

Query: 807 VLIALIEMLAVMFIYGHEKFTNDIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICRF 866
           +L+++  M  V+F   HEKF +              +     L   I+  I+  S I   
Sbjct: 100 ILLSVNRMSCVLFPVTHEKFWS--------------RYYKLVL---IIIFIIPFSFIWNI 142

Query: 867 LKKPSY---------------SAWNR--AFLIPFMVMLILEGIPLFLIELGIGQKM--RL 907
           L    Y                 W     F + + +++++  I   ++      K+  R+
Sbjct: 143 LISRVYVNPVNGGFSINYEKAVPWASTSLFQLIYFILILVFTIITSIVTSYKLAKLSKRI 202

Query: 908 GSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVIITWCFYYLFNSFSSSLPWSS 961
            S             V  S  ++T F+++ + ++  W     F  F + LP S 
Sbjct: 203 KS-------------VERSLTIITIFISVVFLLVAAWQVLNSFAFFFNFLPDSK 243


>gnl|CDD|241266 cd01233, PH_KIFIA_KIFIB, KIFIA and KIFIB protein pleckstrin
           homology (PH) domain.  The kinesin-3 family motors KIFIA
           (Caenorhabditis elegans homolog unc-104) and KIFIB
           transport synaptic vesicle precursors that contain
           synaptic vesicle proteins, such as synaptophysin,
           synaptotagmin and the small GTPase RAB3A, but they do
           not transport organelles that contain plasma membrane
           proteins. They have a N-terminal motor domain, followed
           by a coiled-coil domain, and a C-terminal PH domain.
           KIF1A adopts a monomeric form in vitro, but acts as a
           processive dimer in vivo. KIF1B has alternatively
           spliced isoforms distinguished by the presence or
           absence of insertion sequences in the conserved
           amino-terminal region of the protein; this results in
           their different motor activities. KIF1A and KIF1B bind
           to RAB3 proteins through the adaptor protein
           mitogen-activated protein kinase (MAPK) -activating
           death domain (MADD; also calledDENN), which was first
           identified as a RAB3 guanine nucleotide exchange factor
           (GEF). PH domains have diverse functions, but in general
           are involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 111

 Score = 29.5 bits (67), Expect = 4.4
 Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 10/85 (11%)

Query: 22  GWKGYWVCLKGTTLLFYPCDSREGRSVDAAPKHLIIVDGAIMQPIPE---HPKRDYIFCL 78
           GW   WV L+   L  Y    ++G          I +  A ++  P+      R  +F +
Sbjct: 29  GWVRRWVVLRRPYLHIYS-SEKDGDERGV-----INLSTARVEYSPDQEALLGRPNVFAV 82

Query: 79  STAFGDAYLFQAPCQVELENWVNSI 103
            T    +YL QA  + E+ +W+ +I
Sbjct: 83  YTPTN-SYLLQARSEKEMHDWLYAI 106


>gnl|CDD|216801 pfam01943, Polysacc_synt, Polysaccharide biosynthesis protein.
           Members of this family are integral membrane proteins.
           Many members of the family are implicated in production
           of polysaccharide. The family includes RfbX part of the
           O antigen biosynthesis operon. The family includes SpoVB
           from Bacillus subtilis, which is involved in spore
           cortex biosynthesis.
          Length = 272

 Score = 31.1 bits (71), Expect = 4.6
 Identities = 23/145 (15%), Positives = 38/145 (26%), Gaps = 20/145 (13%)

Query: 770 LVGVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND 829
              ++ LI  L+ LI           +    A  I L+              G EK    
Sbjct: 83  GKLLLSLIFLLILLIAAFLGLPDLAVILLILALAILLLPGVSQ--FFSWLFQGLEKMKYI 140

Query: 830 IYEMTGYK-------------PGWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWN 876
              M   K                        L  +++  I+  +++ +           
Sbjct: 141 AISMIIEKLGSLILVFIAVFLGLDLLAAVLILLIASLIGGIIALAVLRKKFLPRFVF--- 197

Query: 877 RAFLIPFMVMLILEGIPLFLIELGI 901
             F +     L+    PLFL  L I
Sbjct: 198 --FSLKLFKELLRFSWPLFLSSLAI 220


>gnl|CDD|203570 pfam07058, Myosin_HC-like, Myosin II heavy chain-like.  This family
            represents a conserved region within a number of myosin
            II heavy chain-like proteins that seem to be specific to
            Arabidopsis thaliana.
          Length = 351

 Score = 31.2 bits (70), Expect = 4.9
 Identities = 35/178 (19%), Positives = 69/178 (38%), Gaps = 19/178 (10%)

Query: 1362 LRSQIQRRTEKVYV---LSNSTAEFRNAFLKTIRQIIRESVRNMSIPMTKPGVATQIVSN 1418
            L+ ++Q+  +K+ +    + S A+ +  F   ++ ++ E ++  +    +P         
Sbjct: 120  LQGEMQQLRDKLAISDRAAKSEAQLKEKFKLRLK-VLEEGLKGPNSSFVRP--------T 170

Query: 1419 TLGRHSSTGKSAQEQSMNKSSLNPGCSQTLEKSKKPPRPPQRHS-SGSIEYEQQAIIGQP 1477
            T+GR  S G + + QS+  +  +P  +     SKK P    R S +G I    +   G  
Sbjct: 171  TVGRSESNGPT-RRQSLGGAETSPKFTSNGGLSKKRPSSQLRGSLTGRISTVLKHAKGTS 229

Query: 1478 QQGTQPSYVQEPQSTHSTFRGRNKPLNESQVSQDEGDKEVKHDCDPGTKSEGEEDSIT 1535
                  S+    +S   +    N P N     + E         D   K+E E+ +  
Sbjct: 230  I-----SFDGGTRSMDRSKILANGPSNFPLNDKHEEGTSRGESPDSERKTEEEDGNAY 282


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 31.3 bits (71), Expect = 5.2
 Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 4/52 (7%)

Query: 124 LQEEIFRLEKAIDSDNKLKHMADLQMSVLAEPETKQQINEQITQWEENLERL 175
            +E    L   I  D   + +  L+ S+ A    K+ + E   + E   + L
Sbjct: 315 AREAAAALLAQIGPDADEEAVESLRPSLAA----KETVTELEKRKEALDQAL 362


>gnl|CDD|217401 pfam03169, OPT, OPT oligopeptide transporter protein.  The OPT family
            of oligopeptide transporters is distinct from the ABC
            pfam00005 and PTR pfam00854 transporter families. OPT
            transporters were first recognised in fungi (Candida
            albicans and Schizosaccharomyces pombe), but this
            alignment also includes orthologues from Arabidopsis
            thaliana. OPT transporters are thought to have 12-14
            transmembrane domains and contain the following motif:
            SPYxEVRxxVxxxDDP.
          Length = 619

 Score = 31.1 bits (71), Expect = 5.3
 Identities = 17/82 (20%), Positives = 24/82 (29%), Gaps = 22/82 (26%)

Query: 1001 PKWWIVLCLLLAWI--IVFFIVMKGIQS------------------SGRVVYFTSMFPYI 1040
            P WW +  L+L+ +  I     +   Q                   S  +   T   P  
Sbjct: 350  PMWWYLAGLVLSLVLGIALVAALFPTQLPVWGLLLALLLAFVFAIPSAYIAGLTGSSPVS 409

Query: 1041 VLTIF--FIRGITLPGAGAGLV 1060
             L I    I G  LPG     +
Sbjct: 410  GLGILTELIAGYVLPGRPLANL 431


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
            family of proteins are found in large numbers in the
            Trichomonas vaginalis proteome. The function of this
            protein is unknown.
          Length = 422

 Score = 30.8 bits (69), Expect = 6.1
 Identities = 20/93 (21%), Positives = 33/93 (35%), Gaps = 13/93 (13%)

Query: 1409 PGVATQIVSNTLGRHSSTGKSAQEQSMNKSSLNPGCSQTLEKSKKPPRPPQRHSSGSIEY 1468
            PG+     S+   RH    K  Q+ ++     NP    T++   + P+            
Sbjct: 183  PGLPKTFTSSHGHRHRHAPKPTQQPTVQ----NPAQQPTVQNPAQQPQQQP--------- 229

Query: 1469 EQQAIIGQPQQGTQPSYVQEPQSTHSTFRGRNK 1501
            +QQ +    Q   Q    Q PQ+     R R +
Sbjct: 230  QQQPVQPAQQPTPQNPAQQPPQTEQGHKRSREQ 262


>gnl|CDD|227696 COG5409, COG5409, EXS domain-containing protein [Signal transduction
            mechanisms].
          Length = 384

 Score = 30.9 bits (70), Expect = 6.2
 Identities = 7/36 (19%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 1013 WIIVFFIVMKGIQSSGRVVYFTSMFPYI-VLTIFFI 1047
            +    FI +  ++    ++ F    P + V T+F+ 
Sbjct: 97   FTTSLFIFLNAVEGLKFILLFVYFLPLLQVGTVFWF 132


>gnl|CDD|216963 pfam02293, AmiS_UreI, AmiS/UreI family transporter.  This family
            includes UreI and proton gated urea channel as well as
            putative amide transporters.
          Length = 166

 Score = 29.9 bits (68), Expect = 6.9
 Identities = 18/69 (26%), Positives = 27/69 (39%), Gaps = 14/69 (20%)

Query: 1142 AEGTGLA----FIAAEGTGLAFIVFTQAIVELPGAPF------WSIIFFMMLLSLGLGSQ 1191
             +G GL     F+A      A + FT          F      W++++F+  L LGLG  
Sbjct: 82   LDGRGLGWFSLFVAITALPYAIVSFTSG----EDPWFGVIWLAWAVLWFLFFLLLGLGKS 137

Query: 1192 IGILEGVLC 1200
            +    G L 
Sbjct: 138  LTKFTGWLA 146


>gnl|CDD|129816 TIGR00733, TIGR00733, putative oligopeptide transporter, OPT family. 
            This protein represents a small family of integral
            membrane proteins from Gram-negative bacteria, a
            Gram-positive bacteria, and an archaeal species. Members
            of this family contain 15 to 18 GES predicted
            transmembrane regions, and this family has extensive
            homology to a family of yeast tetrapeptide transporters,
            including isp4 (Schizosaccharomyces pombe) and Opt1
            (Candida albicans). EspB, an apparent equivalog from
            Myxococcus xanthus, shares an operon with a two component
            system regulatory protein, and is required for the normal
            timing of sporulation after the aggregation of cells.
            This is consistent with a role in transporting
            oligopeptides as signals across the membrane [Transport
            and binding proteins, Amino acids, peptides and amines].
          Length = 591

 Score = 31.0 bits (70), Expect = 7.0
 Identities = 22/86 (25%), Positives = 38/86 (44%), Gaps = 13/86 (15%)

Query: 1121 TAIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFT-QAIVEL---PGAPFWS 1176
             +I A+V+  A+L      +  E   +   A+    L+ ++F   A++ L      P+W 
Sbjct: 34   ASIPAAVISMAVLMAFRDRSILENNMVQTAASAAGTLSSVIFVLPALLMLGYWTEFPYWI 93

Query: 1177 IIFFMMLLSLGLGSQIGILEGVLCTI 1202
                 ++ +LG     G L GVL TI
Sbjct: 94   T---TVICALG-----GSL-GVLFTI 110


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 29.7 bits (67), Expect = 7.0
 Identities = 17/110 (15%), Positives = 44/110 (40%), Gaps = 5/110 (4%)

Query: 514 SLYNRDLLMSNNNNSSTINNNNVNSNNKGETYEIENLIKSAGEVTGFCRSPVETHRTSPT 573
            + +R  +  NN      +  +++ +++ +  + E       +   +    +E    +P 
Sbjct: 32  DVLSRKRIKFNNPPRIDEDGGDIDDDDEDDEDD-EEADAEDDDENPYKLIRLE-EILAPL 89

Query: 574 GS---MANQSSLNPSRQLTDAEKLSKVIIELIETERTYVKNLNGLLENYL 620
                +    +++ + +     +L+   I LIE E+  +  LN LL+  L
Sbjct: 90  THPSDLPTHPAISRTFKSKTLSELALEAIALIEKEQNNLNKLNKLLQVLL 139


>gnl|CDD|233105 TIGR00728, OPT_sfam, oligopeptide transporter, OPT superfamily.  This
            superfamily has two main branches. One branch contains a
            tetrapeptide transporter demonstrated experimentally in
            three different species of yeast. The other family
            contains EspB of Myxococcus xanthus, a protein required
            for normal rather than delayed sporulation after cellular
            aggregation; its role is unknown but is compatible with
            transport of a signalling molecule. Homology between the
            two branches of the superfamily is seen most easily at
            the ends of the protein. The central regions are poorly
            conserved within each branch and may not be homologous
            between branches.
          Length = 606

 Score = 30.8 bits (70), Expect = 7.1
 Identities = 32/241 (13%), Positives = 62/241 (25%), Gaps = 34/241 (14%)

Query: 941  IITWCFYYLFNSFSSSLPWSSCPTDHNGTIIPECELSSETAYFWYRTTLDVSPN---IES 997
            I       L   FS S  WS                 +               +   I S
Sbjct: 184  IARNLLRVLLKYFSVSFLWSF--VSWFFPGFTA--FQTFGGLGSGSIGFGWQQSSAYIGS 239

Query: 998  PGSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVV-----------YFTSMFPYIVLTI-F 1045
                 + + L +LL +++ F IV+  +                    ++   Y  +    
Sbjct: 240  GLITPFTVGLNILLGFVLAFGIVLPILYYKNTWPADADPIDSSSTIDSTGVRYNSVGTAR 299

Query: 1046 FIRGITLPGAGAGLVHMYTPKVEKLLDPNVWLDAATQVFYSFGLAFGSLIAFGSYNQPNN 1105
                         L+        K           + +  SFGL    +  +    Q   
Sbjct: 300  RDHYKLGRDDHVRLLSRDEDLKVKNYKEVPAWWYLSLLLVSFGLGIVVVEYYFGITQLPW 359

Query: 1106 NCVRDVIMVSICNALTAIYA-------------SVVVFA-ILGFKAMSNAAEGTGLAFIA 1151
              V   +++++  A+                  + ++   IL  + ++N A      FI 
Sbjct: 360  WGVIVALIIALVLAIPIGILAGITNPVSGLNIITELIIGYILPGRPLANLAFKA-YGFIT 418

Query: 1152 A 1152
            A
Sbjct: 419  A 419


>gnl|CDD|225715 COG3174, COG3174, Predicted membrane protein [Function unknown].
          Length = 371

 Score = 30.8 bits (70), Expect = 7.3
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 772 GVVCLISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAV 817
               L +   GL+F+  A  Y   +  +  GT GL+L+ALI  LA 
Sbjct: 258 PFSLLPALAFGLLFS--AILYASAVLRTIFGTAGLLLLALISGLAD 301


>gnl|CDD|234099 TIGR03062, pip_yhgE_Cterm, YhgE/Pip C-terminal domain.  This family
            contains the C-terminal domain of a family of multiple
            membrane-spanning proteins of Gram-positive bacteria. One
            member was shown to be a host protein essential for phage
            infection, so many members of this family are called
            "phage infection protein". A separate model, TIGR03061,
            represents the conserved N-terminal domain. The domains
            are separated by regions highly variable in both length
            and sequence, often containing extended heptad repeats as
            described in model TIGR03057.
          Length = 208

 Score = 30.2 bits (69), Expect = 7.6
 Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 5/66 (7%)

Query: 1122 AIYASVVVFAILGFKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGAPFWSIIFFM 1181
            AI    V+   LG       A      F     T L F+   Q +V L G     +   +
Sbjct: 78   AIILYGVLILGLGLDPAHPPAT---FGFAIL--TSLTFMAIIQFLVALFGNVGRFLALVL 132

Query: 1182 MLLSLG 1187
            ++L LG
Sbjct: 133  LVLQLG 138


>gnl|CDD|132761 cd07076, NR_LBD_GR, Ligand binding domain of the glucocorticoid
            receptor, a member of the nuclear receptor superfamily.
            The ligand binding domain of the glucocorticoid receptor
            (GR): GR is a ligand-activated transcription factor
            belonging to the nuclear receptor superfamily. It binds
            with high affinity to cortisol and other glucocorticoids.
            GR is expressed in almost every cell in the body and
            regulates genes controlling a wide variety of processes
            including the development, metabolism, and immune
            response of the organism. In the absence of hormone, the
            glucocorticoid receptor (GR) is complexes with a variety
            of heat shock proteins in the cytosol. The binding of the
            glucocorticoids results in release of the heat shock
            proteins and transforms it to its active state. One
            mechanism of action of GR is by direct activation of gene
            transcription. The activated form of GR forms dimers,
            translocates into the nucleus, and binds to specific
            hormone responsive elements, activating gene
            transcription. GR can also function as a repressor of
            other gene transcription activators, such as NF-kappaB
            and AF-1 by directly binding to them, and bloc king the
            expression of their activated genes. Like other members
            of the nuclear receptor (NR) superfamily of
            ligand-activated transcription factors, GR has a central
            well conserved DNA binding domain (DBD), a variable
            N-terminal domain, a flexible hinge and a C-terminal
            ligand binding domain (LBD). The LBD also functions for
            dimerization and chaperone protein association.
          Length = 247

 Score = 30.3 bits (68), Expect = 7.6
 Identities = 18/62 (29%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 1180 FMMLLSLGLGSQIGILEGVLCTIFDIEIFK-RIRKPILTEALKGMEKVAEHINEMQRIHE 1238
            F+M  +LG  S       +LC   D+ I + R+  P + +  K M  V+  ++ +Q  +E
Sbjct: 72   FLMAFALGWRSYRQSNGNLLCFAPDLIINEQRMTLPCMYDQCKHMLYVSSELHRLQVSYE 131

Query: 1239 EY 1240
            EY
Sbjct: 132  EY 133


>gnl|CDD|162054 TIGR00815, sulP, high affinity sulphate transporter 1.  The SulP
            family is a large and ubiquitous family with over 30
            sequenced members derived from bacteria, fungi, plants
            and animals. Many organisms including Bacillus subtilis,
            Synechocystis sp, Saccharomyces cerevisiae, Arabidopsis
            thaliana and Caenorhabditis elegans possess multiple SulP
            family paralogues. Many of these proteins are
            functionally characterized, and all are sulfate uptake
            transporters. Some transport their substrate with high
            affinities, while others transport it with relatively low
            affinities. Most function by SO42- :H+symport, but SO42-
            :HCO3- antiport has been reported for the rat protein
            (spP45380). The bacterial proteins vary in size from 434
            residues to 566 residues with one exception, a
            Mycobacterium tuberculosis protein with 784 residues. The
            eukaryotic proteins vary in size from 611 residues to 893
            residues with one exception, a protein designated "early
            nodulin 70 protein" from Glycine max which is reported to
            be of 485 residues. Thus, the eukaryotic proteins are
            almost without exception larger than the prokaryotic
            proteins. These proteins exhibit 10-13 putative
            transmembrane a-helical spanners (TMSs) depending on the
            protein. The phylogenetic tree for the SulP family
            reveals five principal branches. Three of these are
            bacterial specific as follows: one bears a single protein
            from M. tuberculosis; a second bears two proteins, one
            from M. tuberculosis, the other from Synechocystis sp,
            and the third bears all remaining prokaryotic proteins.
            The remaining two clusters bear only eukaryotic proteins
            with the animal proteins all localized to one branch and
            the plant and fungal proteins localized to the other. The
            generalized transport reactions catalyzed by SulP family
            proteins are: (1) SO42- (out) + nH+ (out) --> SO42- (in)
            + nH+ (in). (2) SO42- (out) + nHCO3- (in) SO42- (in) +
            nHCO3- (out) [Transport and binding proteins, Anions].
          Length = 563

 Score = 30.8 bits (70), Expect = 7.8
 Identities = 72/415 (17%), Positives = 135/415 (32%), Gaps = 106/415 (25%)

Query: 850  GPAIMSCILVSSIICRFLKK--PSYSAWNRAFLIPFMVMLILEGIPLFLIELGIGQKMRL 907
            GP  +  +L+ S+I R   +      A   AF +  +      GI  F + LG+   +RL
Sbjct: 69   GPVAVMSLLLGSVIARVGLQYLFDCDAIRLAFTLTLLA-----GI--FQVILGL---LRL 118

Query: 908  GSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVIITWCFYYLFNSFSSSLPWSSCPTDHN 967
            G L  +           +S  +++ F+      I       L       +   +  TD  
Sbjct: 119  GFLIEF-----------LSHAVISGFMTGAAITIGLSQLKGLL-----GISIFNTRTDTL 162

Query: 968  GTIIPECELSSETAYFWYRTTLDVSPNIESPGSPKWWIVLCLLLAWIIVFFIVMKGIQSS 1027
            G +I               +T    PN     +  W  ++  L+  +++F +  K +   
Sbjct: 163  GVVI---------------STWAGLPNT---HNWNWCTLVIGLV--LLLFLLYTKKLGKR 202

Query: 1028 GRVVYF----TSMFPYIVLTIFFI------RGITLPG---AGAGLVHMYTPKVEKLLDPN 1074
             + + F      +   I+ T+         +G+++ G   +G       T   E L  P 
Sbjct: 203  NKKLLFAPAVAPLLVVILATLAVTIGLHKKQGVSILGHIPSGLSFFPPITLDWELL--PT 260

Query: 1075 VWLDAATQVFYSFGLAFGSLIA--FGSYN----QPNNNCVRDVIMVSICNALTAIYASVV 1128
            +  DA        GL     IA  F          N    ++++   I N + + ++   
Sbjct: 261  LAPDAIAIAI--VGLIESIAIARSFARMTGYKIDAN----QELVAQGIANIVGSFFSCYP 314

Query: 1129 VFAILGFKAMSNAAEG-TGL-AFIAAEGTGLAFIVFTQAIVELPGA-------------- 1172
                L   A++  A   T L   + A    L  +V T     +P A              
Sbjct: 315  ATGSLSRTAVNAKAGCRTQLSGVVTAIVVLLVLLVLTPLFYYIPQAALAAIIISAVRGLI 374

Query: 1173 ------------PFWSIIFFM-MLLSLGLGSQIGILEGVLCTIFDIEIFKRIRKP 1214
                            +++ +     +    +IG+L GV  +     +  RI +P
Sbjct: 375  DYKELYKLWKADKMDFVVWLVTFFGVVFTSIEIGLLVGVALSAAF--LLLRIARP 427


>gnl|CDD|234075 TIGR02968, succ_dehyd_anc, succinate dehydrogenase, hydrophobic
            membrane anchor protein.  In E. coli and many other
            bacteria, two small, hydrophobic, mutually homologous
            subunits of succinate dehydrogenase, a TCA cycle enzyme,
            are SdhC and SdhD. This family is the SdhD, the
            hydrophobic membrane anchor protein. SdhC is
            apocytochrome b558, which also plays a role in anchoring
            the complex [Energy metabolism, TCA cycle].
          Length = 105

 Score = 28.7 bits (65), Expect = 8.0
 Identities = 12/66 (18%), Positives = 21/66 (31%), Gaps = 15/66 (22%)

Query: 1003 WW------IVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGITLPGAG 1056
            W       +VL L   ++I F + + G+        F   +  I   +  +         
Sbjct: 8    WLLQRVTAVVLALYTIFLIGFLLALPGLTYEAWRALFAHPWMKIFTLLALL--------- 58

Query: 1057 AGLVHM 1062
            A L H 
Sbjct: 59   ALLYHA 64


>gnl|CDD|241422 cd13268, PH_Brdg1, BCR downstream signaling 1 Pleckstrin homology
          (PH) domain.  Brdg1 is thought to function as a docking
          protein acting downstream of Tec, a protein tyrosine
          kinases (PTK), in B-cell antigen receptor (BCR)
          signaling. BRDG1 contains a proline-rich (PR) motif
          which is thought to bind SH3 or WW domains, a PH
          domain, and multiple tyrosine residues which are
          potential target sites for SH2 domains. Since PH
          domains bind phospholipids it is thought to be involved
          in the tethering of Tec and BRDG1 to the cell
          membrane.Tec and Pyk2, but not Btk, Bmx, Lyn, Syk, or
          c-Abl, induces phosphorylation of BRDG1 on tyrosine
          residues. Efficient phosphorylation requires both the
          PH and SH2 domains of BRDG1 and the kinase domain of
          Tec. The overexpression of BRDG1 increases
          theBCR-mediated activation of cAMP-response element
          binding protein (CREB). Phosphorylated BRDG1 is
          hypothesized to recruit CREB either directly or through
          its recruitment of downstream effectors which then
          recruit CREB. PH domains have diverse functions, but in
          general are involved in targeting proteins to the
          appropriate cellular location or in the interaction
          with a binding partner. They share little sequence
          conservation, but all have a common fold, which is
          electrostatically polarized. Less than 10% of PH
          domains bind phosphoinositide phosphates (PIPs) with
          high affinity and specificity. PH domains are
          distinguished from other PIP-binding domains by their
          specific high-affinity binding to PIPs with two vicinal
          phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
          PtdIns(3,4,5)P3 which results in targeting some PH
          domain proteins to the plasma membrane. A few display
          strong specificity in lipid binding. Any specificity is
          usually determined by loop regions or insertions in the
          N-terminus of the domain, which are not conserved
          across all PH domains. PH domains are found in cellular
          signaling proteins such as serine/threonine kinase,
          tyrosine kinases, regulators of G-proteins, endocytotic
          GTPases, adaptors, as well as cytoskeletal associated
          molecules and in lipid associated enzymes.
          Length = 127

 Score = 29.3 bits (66), Expect = 8.1
 Identities = 17/76 (22%), Positives = 29/76 (38%), Gaps = 25/76 (32%)

Query: 7  WLLRKKHQIELARKRGWKGYWVCLKGTTLLFYPC--------------------DSREGR 46
          +L +++ +    RK      W  L+GTTL FY                      +    R
Sbjct: 17 FLEKRRSKDREFRK-----LWTELRGTTLFFYNDKKDTQYVEKLDLSALTSLTDEISRER 71

Query: 47 SVDAAPKHLIIVDGAI 62
          ++DAA   L++ D  +
Sbjct: 72 NLDAARFTLVLKDEEV 87


>gnl|CDD|119322 cd06580, TM_PBP1_transp_TpRbsC_like, Transmembrane subunit (TM) of
            Treponema pallidum (Tp) RbsC-1, RbsC-2 and related
            proteins. This is a functionally uncharacterized subgroup
            of TMs which belong to a larger group of TMs of
            Periplasmic Binding Protein (PBP)-dependent ATP-Binding
            Cassette (ABC) transporters, which are mainly involved in
            the uptake of branched-chain amino acids (AAs) or in the
            uptake of monosaccharides including ribose, galactose,
            and arabinose, and which generally bind type 1 PBPs.
            PBP-dependent ABC transporters consist of a PBP, two TMs,
            and two cytoplasmic ABCs, and are mainly involved in
            importing solutes from the environment. The solute is
            captured by the PBP, which delivers it to a gated
            translocation pathway formed by the two TMs. The two ABCs
            bind and hydrolyze ATP and drive the transport reaction.
          Length = 234

 Score = 30.1 bits (69), Expect = 8.1
 Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 5/79 (6%)

Query: 1130 FAILGFKAMSNA-AEGTGLAFIAAEGTGL--AFIVFTQAIVELPGAPFWSIIFFMMLLSL 1186
               L   A+  A +   G+  I  EG  L  AF      +    G P    +   +L + 
Sbjct: 1    ATPLILAALGVAISFRAGVFNIGLEGQMLLGAFAAA--LVALYLGLPATGSLPLGLLAAA 58

Query: 1187 GLGSQIGILEGVLCTIFDI 1205
              G+   +L  +L     +
Sbjct: 59   LAGALWALLPALLKAKLGV 77


>gnl|CDD|165120 PHA02753, PHA02753, hypothetical protein; Provisional.
          Length = 298

 Score = 30.3 bits (68), Expect = 8.2
 Identities = 39/157 (24%), Positives = 62/157 (39%), Gaps = 26/157 (16%)

Query: 796 LFDSFAGTIGLVL---IALIEMLAVMFIYGHEK-FTNDIYEMTGYKPG----WYWQITW- 846
           +  + AG +GL     IALI+++A++ I  H   F+       G   G     +      
Sbjct: 25  VHGAMAGALGLAGNGAIALIQIIALIAIIAHWANFSEADGFAAGAAFGAGTTAFIGAHDL 84

Query: 847 --RFLGPAIMSCI--LVSSIICRF-----------LKKPSYSAWNRAFLIPFMVMLILE- 890
               +  AI + I  +V +I   F           +  P+  A    FL     MLIL  
Sbjct: 85  AIDAILFAIGTLINSIVDTIEKAFLLGIALGFLGFILAPAIFAGIAGFLFGLAAMLILRL 144

Query: 891 -GIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGIS 926
            G+    +ELGI    +    G    I P++GG+ I+
Sbjct: 145 GGLLTEALELGIHALFKHLPEGYAALIAPFIGGIFIA 181


>gnl|CDD|129983 TIGR00905, 2A0302, transporter, basic amino acid/polyamine antiporter
            (APA) family.  This family includes several families of
            antiporters that, rather commonly, are encoded next to
            decarboxylases that convert one of the antiporter
            substrates into the other. This arrangement allows a
            cycle that can remove proteins from the cytoplasm and
            thereby protect against acidic conditions [Transport and
            binding proteins, Amino acids, peptides and amines].
          Length = 473

 Score = 30.4 bits (69), Expect = 8.2
 Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 999  GSPKWWIVLCLLLAWIIVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFFIRGI 1050
            G+P   I+   +L W+   F+V++G++ +  +   T++   I L +F I G 
Sbjct: 124  GNPVPSILGASVLLWV-FTFLVLRGVRQAAFINTITTIAKLIPLFLFIIIGW 174


>gnl|CDD|222060 pfam13347, MFS_2, MFS/sugar transport protein.  This family is part
            of the major facilitator superfamily of membrane
            transport proteins.
          Length = 425

 Score = 30.6 bits (70), Expect = 8.3
 Identities = 8/45 (17%), Positives = 22/45 (48%), Gaps = 4/45 (8%)

Query: 1004 WIVLCLLLAWI--IVFFIVMKGIQSSGRVVYFTSMFPYIVLTIFF 1046
            W+++  +   I  ++ F    G+  + +++Y      YI+L + +
Sbjct: 72   WLLIGAIPLAISLVLLFATPFGLSMTAKLIYAFIT--YILLGLAY 114


>gnl|CDD|227621 COG5305, COG5305, Predicted membrane protein [Function unknown].
          Length = 552

 Score = 30.4 bits (69), Expect = 8.5
 Identities = 39/194 (20%), Positives = 65/194 (33%), Gaps = 44/194 (22%)

Query: 776 LISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEMTG 835
           LI +L+       A   WL ++  FA +  + +  LI    +  I G     N IY +  
Sbjct: 242 LIWWLLATAAGLLAFLPWLLVY--FANSFRVGIPDLI---TLDGIVGQLI--NPIYPLI- 293

Query: 836 YKPGWYWQIT-----------WRFLGPAIMSCILVSS---IICRFLKKPSYSAWNRAFLI 881
                 W +              F    +   +       I+          A  R  L+
Sbjct: 294 ----AAWLLAFSLLFFDLLHGLDFPINGVARFLYPIWLAIILILGALYTLCRATERKILL 349

Query: 882 PFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPWLGGVGISSCLVTFFVALYYNVI 941
             + ++I   +PL L +L          + V+ T +P+L    I S L+      Y N++
Sbjct: 350 FILTLIIAPALPLILSDL----------VSVFITRNPYL----IPSYLILLLGVAYLNLL 395

Query: 942 ITWCFYYLFNSFSS 955
            TW    L     S
Sbjct: 396 GTW----LTIGDLS 405


>gnl|CDD|216852 pfam02028, BCCT, BCCT family transporter. 
          Length = 484

 Score = 30.6 bits (70), Expect = 8.5
 Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 12/73 (16%)

Query: 776 LISFLVGLIFTTGAGEYWLKLFDSFAGTIGLVLIALIEMLAVMFIYGHEKFTND--IYEM 833
           L   L+  +   G   +   + ++F  +IG  L   + M      +G   +     I+  
Sbjct: 252 LALALLLFVLIVGPTLF---ILNTFVQSIGDYLQNFVRMSLWTDPFGDGGWLQGWTIF-- 306

Query: 834 TGYKPGWYWQITW 846
             Y   W W I W
Sbjct: 307 --Y---WAWWIAW 314


>gnl|CDD|221721 pfam12698, ABC2_membrane_3, ABC-2 family transporter protein.  This
            family is related to the ABC-2 membrane transporter
            family pfam01061.
          Length = 278

 Score = 30.0 bits (68), Expect = 8.7
 Identities = 25/143 (17%), Positives = 47/143 (32%), Gaps = 28/143 (19%)

Query: 1075 VWLDAATQVFYSFGLAFGSLIAFGSYNQPNNNCVRDVIMVSICNALTAIYASVVVFAILG 1134
              L    Q+     L FG  I FG+             ++ +   L    A + + A+LG
Sbjct: 147  DLLVGLIQLLIILLLLFGLGIPFGNL------------LLLLLLFLLYGLAYIALGALLG 194

Query: 1135 FKAMSNAAEGTGLAFIAAEGTGLAFIVFTQAIVELPGA--------PFWSIIFFMMLLSL 1186
                 N+     +  I        F      +  +P          PF++ I  ++ +  
Sbjct: 195  S-LFKNSEAAILVISILI-LLLSGFFGGLFPLPNMPSFLQWIFSIIPFFAPIDGLLRIIY 252

Query: 1187 GLGSQIG------ILEGVLCTIF 1203
            G  ++I       +L GV+  + 
Sbjct: 253  GDLAEILLSLIILLLFGVVLLLL 275


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 30.1 bits (68), Expect = 8.9
 Identities = 12/47 (25%), Positives = 19/47 (40%)

Query: 433 QEELSLCMMMRSSRTEPPETAGSGMLSATDDMIESLVCPPPPSDPPM 479
           Q   +   ++ S  + P  +  S  +S   +     V PPPP  PP 
Sbjct: 147 QSNSTTSDLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPPPPPPPP 193


>gnl|CDD|218918 pfam06161, DUF975, Protein of unknown function (DUF975).  Family of
           uncharacterized bacterial proteins.
          Length = 251

 Score = 30.1 bits (68), Expect = 9.3
 Identities = 30/144 (20%), Positives = 53/144 (36%), Gaps = 12/144 (8%)

Query: 806 LVLIALIEMLAVMFIYGHEKFTNDIYEMTGYKPGWYWQITWRFLGPAIMSCILVSSIICR 865
           L+LI LIE      I          +         +  + +       +S I     + R
Sbjct: 25  LLLILLIEFSLSFLISNIGSIGESSFLSITLLAYIFPLLVFLIGTALTISAIFYYLKLVR 84

Query: 866 FLKKPSYS------AWNRAFLIPFMVMLILEGIPLFLIELGIGQKMRLGSLGVWNTIHPW 919
             +KPS+          R      +++LIL+GI LFL  L       +G L +  + H  
Sbjct: 85  RKEKPSFKESFASFKDKRF--GKLLLLLILKGIFLFLWSLIA----TIGLLIIIISSHLA 138

Query: 920 LGGVGISSCLVTFFVALYYNVIIT 943
           L    I   L+ F + +  ++++ 
Sbjct: 139 LSSSEIEDNLMVFGIGILISLLLI 162


>gnl|CDD|217024 pfam02417, Chromate_transp, Chromate transporter.  Members of this
           family probably act as chromate transporters. Members of
           this family are found in both bacteria and
           archaebacteria. The proteins are composed of one or two
           copies of this region. The alignment contains two
           conserved motifs, FGG and PGP.
          Length = 169

 Score = 29.4 bits (67), Expect = 9.6
 Identities = 18/104 (17%), Positives = 38/104 (36%), Gaps = 14/104 (13%)

Query: 799 SFAGTIGLVLIALIEMLAVMFIYGHEKFTNDIYEMTGYKPGWYWQITWRFLGPAIMSCIL 858
           +   T+G  L + + +L +  +Y              +K   + Q     + PA++  IL
Sbjct: 75  ALVATLGFFLPSFLLILLLAPLYKR------------FKDSPWVQAFLAGVKPAVVGLIL 122

Query: 859 VSSIICRFLKKPSYSAWNRAFLIPFMVMLILEGIPLFLIELGIG 902
            ++I     KK      + A      ++L    +   ++ L  G
Sbjct: 123 AAAI--SLAKKAVKDPLSLAIAALAFLLLAFFKLNPVIVILLAG 164


>gnl|CDD|217790 pfam03916, NrfD, Polysulphide reductase, NrfD.  NrfD is an integral
           transmembrane protein with loops in both the periplasm
           and the cytoplasm. NrfD is thought to participate in the
           transfer of electrons, from the quinone pool into the
           terminal components of the Nrf pathway.
          Length = 313

 Score = 30.1 bits (68), Expect = 9.8
 Identities = 24/114 (21%), Positives = 44/114 (38%), Gaps = 25/114 (21%)

Query: 839 GWYWQITWRFLGPAIMSCILVSSIICRFLKKPSYSAWNR-----AFLIPFMVMLILEGIP 893
           G    +    +G  I +  L+ +I+ +  KK     +N      A L P  V+       
Sbjct: 12  GLPIAVYLFLIG--ISAGALILAILYKRFKKNEGKPFNLIIRTGALLAPLAVL-----AG 64

Query: 894 LFLIELGIGQKMRLGSL---GVWNTIHPWLGGVGISSCLVTFFVALYYNVIITW 944
           L ++   +G+  R   L     +++I  W  GV          ++LY  V++ W
Sbjct: 65  LLILIFDLGRPWRFWKLLRYYNFSSIMSW--GV--------MLLSLYMVVLVLW 108


>gnl|CDD|239237 cd02911, arch_FMN, Archeal FMN-binding domain. This family of
           archaeal proteins are part of the NAD(P)H-dependent
           flavin oxidoreductase (oxidored) FMN-binding family that
           reduce a range of alternative electron acceptors. Most
           use FAD/FMN as a cofactor and NAD(P)H as electron donor.
           Some contain 4Fe-4S cluster to transfer electron from
           FAD to FMN. The specific function of this group is
           unknown.
          Length = 233

 Score = 30.0 bits (68), Expect = 10.0
 Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 15/98 (15%)

Query: 585 SRQLTD---AEKLSKVIIELIETERTYVKNLNGLLE-----NYLEPLKKETFLSGAEINA 636
           +R+L      E L    +E IE E   +K+ N L+      + LEPL        A + A
Sbjct: 41  ARKLVKRGRKEFLPDDPLEFIEGEIKALKDSNVLVGVNVRSSSLEPLLNA-----AALVA 95

Query: 637 LFGNILEIVAFQRQFLQNLVEALENEADFHHFDQSSQF 674
               ILEI A  RQ    +VEA   EA     ++ S+F
Sbjct: 96  KNAAILEINAHCRQ--PEMVEAGAGEALLKDPERLSEF 131


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.135    0.406 

Gapped
Lambda     K      H
   0.267   0.0746    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 81,784,525
Number of extensions: 8187918
Number of successful extensions: 9538
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9356
Number of HSP's successfully gapped: 274
Length of query: 1613
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1503
Effective length of database: 6,058,662
Effective search space: 9106168986
Effective search space used: 9106168986
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 66 (29.2 bits)