RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13206
         (478 letters)



>gnl|CDD|215702 pfam00083, Sugar_tr, Sugar (and other) transporter. 
          Length = 449

 Score =  192 bits (490), Expect = 4e-56
 Identities = 124/458 (27%), Positives = 222/458 (48%), Gaps = 33/458 (7%)

Query: 1   MAGTSMGWPSPVLRLFKSNITDMFRNETYI------EMTSAESSWVVSIIELGNLVTPIP 54
           + G   G+ + V+  F   +   F+    +        ++  S  +VSI  +G L+  + 
Sbjct: 7   LGGFLFGYDTGVIGAF-LTLIKFFKRFGALTSIGACAASTVLSGLIVSIFSVGCLIGSLF 65

Query: 55  IGFLVDYVGRKPCLLTTGPLYIISWLLVIFTK--HVYVLYVVRFMQGLAMGIVFTVAPMY 112
            G L D  GRK  LL    L++I  LL  F K    Y+L V R + GL +G +  + PMY
Sbjct: 66  AGKLGDRFGRKKSLLIGNVLFVIGALLQGFAKGKSFYMLIVGRVIVGLGVGGISVLVPMY 125

Query: 113 IGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYVDY-------DTLAYVSLVIPVVFLM 165
           I EI+  K RGAL + +   +  GIL+   +G  ++             +  V  ++ L+
Sbjct: 126 ISEIAPKKLRGALGSLYQLGITFGILVAAIIGLGLNKYSNSDGWRIPLGLQFVPAILLLI 185

Query: 166 TFIWMPESPYFLIMKGRDVDARKSLFWLRGGRESSKDKINLELSNIKQDVEREMKLSDDF 225
             +++PESP +L++KG+  +AR  L  LRG  +  ++    E  ++++ VE E     + 
Sbjct: 186 GLLFLPESPRWLVLKGKLEEARAVLAKLRGVSDVDQEIQE-EKDSLERSVEAEKASWLEL 244

Query: 226 MDIISTPANRRSLLIVQIVAVADVISGMSAVLPYASSTFARTEGSLITPDECTLLLGILV 285
                    R+ LL+  ++ +   ++G++A+  Y+ + F  T G L      T+++G++ 
Sbjct: 245 ---FRGKTVRQRLLMGVMLQIFQQLTGINAIFYYSPTIF-ETLG-LSDSLLVTIIVGVVN 299

Query: 286 FLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISIT 345
           F+ TF   FLVDR GRRPLLL+   G  I  L+ G   L        ++K     +++I 
Sbjct: 300 FVFTFIAIFLVDRFGRRPLLLLGAAGMAICFLVLGVALLG-------VAKSKGAGIVAIV 352

Query: 346 CFAV---IYSIGLGPLVPTLQGEFFPSNTRGLAGGVTTITLTVISFLVMKMYQVICDHYG 402
              +    +++G GP+   +  E FP   R  A  + T    + +FL+  ++ +I    G
Sbjct: 353 FILLFIAFFALGWGPVPWVIVSELFPLGVRPKAMAIATAANWLANFLIGFLFPIITGAIG 412

Query: 403 VYLNFYIYSLGCIICGVLVYFIIPESKGKTFAQIQEEL 440
            Y+ F +++   ++  + V+F +PE+KG+T  +I E  
Sbjct: 413 GYV-FLVFAGLLVLFILFVFFFVPETKGRTLEEIDELF 449


>gnl|CDD|233165 TIGR00879, SP, MFS transporter, sugar porter (SP) family.  This
           model represents the sugar porter subfamily of the major
           facilitator superfamily (pfam00083) [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 481

 Score =  182 bits (465), Expect = 3e-52
 Identities = 117/419 (27%), Positives = 197/419 (47%), Gaps = 21/419 (5%)

Query: 33  TSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTK---HVY 89
           +S+    VVSI  +G  +  +  G+L D  GRK  LL    L++I  +L+        V 
Sbjct: 69  SSSLWGLVVSIFLVGGFIGALFAGWLSDRFGRKKSLLIIALLFVIGAILMGLAAFALSVE 128

Query: 90  VLYVVRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPY--- 146
           +L V R + G+ +GI   + PMY+ EI+    RGAL++ +   +  GIL+ Y  G     
Sbjct: 129 MLIVGRVLLGIGVGIASALVPMYLSEIAPKALRGALTSLYQLAITFGILVAYGFGSGKVS 188

Query: 147 ----VDYDTLAYVSLVIPVVFLMTFIWMPESPYFLIMKGRDVDARKSLFWLRGGRESSKD 202
               + +     + L+   +  +   ++PESP +L+ KGR  +ARKSL  LRG   S +D
Sbjct: 189 LNNTLGWRIPLGLQLIPAGLLFLGLFFLPESPRWLVGKGRVEEARKSLARLRGT--SGED 246

Query: 203 KI---NLELSNIKQDVERE--MKLSDDFMDIISTPANRRSLLIVQIVAVADVISGMSAVL 257
           K     LEL +IK+ +E+              ST   RR L +  ++      +G++A++
Sbjct: 247 KELLDELELIDIKRSIEKRSVQPSWGSLFS--STRRIRRRLFLGVVLQWFQQFTGINAIM 304

Query: 258 PYASSTFARTEGSLITPDECTLLLGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQL 317
            Y+ + F     S       ++++G + F  TF   FLVDR GRRPLLL+   G  I   
Sbjct: 305 YYSPTIFENAGVSTDHAFLVSIIVGAVNFAFTFVAIFLVDRFGRRPLLLIGAAGMAICLF 364

Query: 318 IAGTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRGLAGG 377
           + G         T        + ++ I  F   +++G GP+   +  E FP + R     
Sbjct: 365 VLGILGASFV--TGSSKSSGNVAIVFILLFIAFFAMGWGPVPWVIVSEIFPLSLRPKGIS 422

Query: 378 VTTITLTVISFLVMKMYQVICDHYGVYLNFYIYSLGCIICGVLVYFIIPESKGKTFAQI 436
           +      + +F+V  ++  + +  GV   F  +    ++  + VYF +PE+KG+T  +I
Sbjct: 423 IAVAANWLANFIVGFLFPTMLESIGVGGVFIFFGGLNVLGLIFVYFFLPETKGRTLEEI 481


>gnl|CDD|182225 PRK10077, xylE, D-xylose transporter XylE; Provisional.
          Length = 479

 Score =  108 bits (272), Expect = 1e-25
 Identities = 97/423 (22%), Positives = 178/423 (42%), Gaps = 64/423 (15%)

Query: 56  GFLVDYVGRKPCLLTTGPLYIISWL------LVIFTK----HVYVLYVV--RFMQGLAMG 103
           G+  +  GR+  L     L+ IS L          +       YV   V  R + G+ +G
Sbjct: 76  GYCSNRFGRRDSLKIAAVLFFISALGSAWPEFGFTSIGPDNTGYVPEFVIYRIIGGIGVG 135

Query: 104 IVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVG---------PYVDYDTLAY 154
           +   ++PMYI EI+ A  RG L +F    +  G L+ Y V           +++ D   Y
Sbjct: 136 LASMLSPMYIAEIAPAHIRGKLVSFNQFAIIFGQLVVYFVNYFIARSGDASWLNTDGWRY 195

Query: 155 V--SLVIP-VVFLMTFIWMPESPYFLIMKGRDVDARKSLFWLRGGRESSKDKINLELSNI 211
           +  S  IP ++FLM   ++PE+P +L+ +G+   A   L  + G            L  I
Sbjct: 196 MFASEAIPALLFLMLLYFVPETPRYLMSRGKQEQAEGILRKIMG-----NTLATQALQEI 250

Query: 212 KQDVEREMKLSDDFMDIISTPANRRSLLIVQI-VAVADVISGMSAVLPYASSTF----AR 266
           K  ++   K     +           ++++ + ++V     G++ VL YA   F    A 
Sbjct: 251 KHSLDHGRKTGGKLLMFGVG------VIVIGVMLSVFQQFVGINVVLYYAPEIFKTLGAS 304

Query: 267 TEGSLITPDECTLLLGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGT-YYLL 325
           T+ +L+     T+++G++    T      VD+ GR+PL ++   G  I     GT +Y  
Sbjct: 305 TDIALLQ----TIIVGVINLTFTVLAIMTVDKFGRKPLQIIGALGMAIGMFSLGTAFYTQ 360

Query: 326 SENYTVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRGLAGGVTTITLTV 385
           +            + L+S+  +   +++  GP+   L  E FP+  RG A  +      +
Sbjct: 361 APGI---------VALLSMLFYVAAFAMSWGPVCWVLLSEIFPNAIRGKALAIAVAAQWI 411

Query: 386 ISFLV------MKMYQVICDHYGVYLNFYIYSLGCIICGVLVYFIIPESKGKTFAQIQEE 439
            ++ V      M     +  H+    +++IY    ++  + ++  +PE+KGKT     EE
Sbjct: 412 ANYFVSWTFPMMDKNSWLVAHFHNGFSYWIYGCMGVLAALFMWKFVPETKGKTL----EE 467

Query: 440 LNK 442
           +  
Sbjct: 468 MEA 470


>gnl|CDD|233176 TIGR00898, 2A0119, cation transport protein.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 505

 Score = 79.3 bits (196), Expect = 8e-16
 Identities = 92/407 (22%), Positives = 167/407 (41%), Gaps = 39/407 (9%)

Query: 36  ESSWVV----SIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVL 91
           E +W V    S   +G L+     G+L D  GRK  LL +  +  +S +L  F+ +  V 
Sbjct: 124 EDAWKVDLTQSCFFVGVLLGSFVFGYLSDRFGRKKVLLLSTLVTAVSGVLTAFSPNYTVF 183

Query: 92  YVVRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYV-DYD 150
            V R + G+ +G ++  A +   E    K R  + T      + G++L   V  ++ D+ 
Sbjct: 184 LVFRLLVGMGIGGIWVQAVVLNTEFLPKKQRAIVGTLIQVFFSLGLVLLPLVAYFIPDWR 243

Query: 151 TLAYVSLVIPVVFLMTFIWMPESPYFLIMKGRDVDARKSLFWLRGGRESSKDKINLELSN 210
            L     +   +F +   ++PESP +LI +GR  +A K L   R  + + K K+  E+ +
Sbjct: 244 WLQLAVSLPTFLFFLLSWFVPESPRWLISQGRIEEALKIL--QRIAKINGK-KLPAEVLS 300

Query: 211 IK-QDVEREMKLSDDFMDIISTPANRRSLLIVQIVAVADVISGMSAVLPYASSTFARTEG 269
           +  +      K    F+D+  TP  R++ L + ++         +A   Y         G
Sbjct: 301 LSLEKDLSSSKKQYSFLDLFRTPNLRKTTLCLMMLWFT------TAFSYYGLVLDLGNLG 354

Query: 270 SLITPDECTLLLGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENY 329
             I  D    + G++   +   T  L+DR GRR  +  S   +G++ L+    ++  + Y
Sbjct: 355 GNIYLDL--FISGLVELPAKLITLLLIDRLGRRYTMAASLLLAGVALLL--LLFVPVDLY 410

Query: 330 TVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRGLAGGVTTITLTVIS-- 387
            +           ++             +V     E +P+  R L  GV +    V S  
Sbjct: 411 FL---------RTALAVLGKFGITSAFQMVYLYTAELYPTVVRNLGVGVCSTMARVGSII 461

Query: 388 --FLVMKMYQVICDHYGVYLNFYIYSLGCIICGVLVYFIIPESKGKT 432
             FLV            ++L   ++    ++ G+L  F +PE+KG  
Sbjct: 462 SPFLVYLGE------KWLFLPLVLFGGLALLAGILTLF-LPETKGVP 501


>gnl|CDD|219516 pfam07690, MFS_1, Major Facilitator Superfamily. 
          Length = 346

 Score = 75.9 bits (187), Expect = 5e-15
 Identities = 65/361 (18%), Positives = 114/361 (31%), Gaps = 48/361 (13%)

Query: 32  MTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVL 91
           ++  E   +++   LG  +     G L D  GR+  LL    L+ +  LL++F   +++L
Sbjct: 29  ISPTEIGLLLTAFSLGYALAQPLAGRLSDRFGRRRVLLIGLLLFALGLLLLLFASSLWLL 88

Query: 92  YVVRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYVDYDT 151
            V+R +QGL  G +F  A   I +    + RG            G  L   +G  +    
Sbjct: 89  LVLRVLQGLGGGALFPAAAALIADWFPPEERGRALGLLSAGFGLGAALGPLLGGLLASLF 148

Query: 152 LAYVSLVIPVVFLMTFIWMPESPYFLIMKGRDVDARKSLFWLRGGRESSKDKINLELSNI 211
               + +I  +  +    +                      L      SK     E    
Sbjct: 149 GWRAAFLILAILALLAAVLA------------------ALLLPRPPPESKRPKPAE---- 186

Query: 212 KQDVEREMKLSDDFMDIISTPANRRSLLIVQIVAVADVISGMSAVLPYASSTFARTEGSL 271
               E    L   +  ++    +    L++ ++        +   LP        +    
Sbjct: 187 ----EAPAPLVPAWKLLLR---DPVLWLLLALLLFGFAFFALLTYLPLYQEVLGLSALLA 239

Query: 272 ITPDECTLLLGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTV 331
                   L G+L  +       L DR GRR  LL++        L A    LLS     
Sbjct: 240 GLL---LGLAGLLGAIGRLLLGRLSDRLGRRRRLLLALLLLI---LAALGLALLS----- 288

Query: 332 DLSKFNWIPLISITCFAVIYSIGLGPLVPTLQG---EFFPSNTRGLAGGVTTITLTVISF 388
                     + +    ++   G G + P L     +  P   RG A G+     ++   
Sbjct: 289 -----LTESSLWLLVALLLLGFGAGLVFPALNALVSDLAPKEERGTASGLYNTAGSLGGA 343

Query: 389 L 389
           L
Sbjct: 344 L 344



 Score = 41.6 bits (98), Expect = 6e-04
 Identities = 37/207 (17%), Positives = 73/207 (35%), Gaps = 26/207 (12%)

Query: 238 LLIVQIVAVADVISGMSAVLPYASSTF--ARTEGSLITPDECTLLLGILVFLSTFPTAFL 295
           L +   +A         A+  Y +     + TE  L+          +   L+      L
Sbjct: 1   LFLAAFLAGLGRSLLGPALPLYLAEDLGISPTEIGLLL-----TAFSLGYALAQPLAGRL 55

Query: 296 VDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIGL 355
            DR GRR +LL+      +  L+                  + + L+ +    V+  +G 
Sbjct: 56  SDRFGRRRVLLIGLLLFALGLLLLLF--------------ASSLWLLLV--LRVLQGLGG 99

Query: 356 GPLVPTLQ---GEFFPSNTRGLAGGVTTITLTVISFLVMKMYQVICDHYGVYLNFYIYSL 412
           G L P       ++FP   RG A G+ +    + + L   +  ++   +G    F I ++
Sbjct: 100 GALFPAAAALIADWFPPEERGRALGLLSAGFGLGAALGPLLGGLLASLFGWRAAFLILAI 159

Query: 413 GCIICGVLVYFIIPESKGKTFAQIQEE 439
             ++  VL   ++P    ++      E
Sbjct: 160 LALLAAVLAALLLPRPPPESKRPKPAE 186


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 75.8 bits (187), Expect = 6e-15
 Identities = 34/144 (23%), Positives = 66/144 (45%), Gaps = 4/144 (2%)

Query: 32  MTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVL 91
           ++++++  +VS   LG  +  +  G+L D  GR+  LL    L+ +  LL+ F   +++L
Sbjct: 31  LSASQAGLIVSAFSLGYALGSLLAGYLSDRFGRRRVLLLGLLLFALGSLLLAFASSLWLL 90

Query: 92  YVVRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYV---- 147
            V RF+ GL  G ++  A   I E    K RG     F      G LL   +G  +    
Sbjct: 91  LVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESL 150

Query: 148 DYDTLAYVSLVIPVVFLMTFIWMP 171
            +  L  +  ++ ++  +  +++ 
Sbjct: 151 GWRWLFLILAILGLLLALLLLFLL 174



 Score = 54.6 bits (132), Expect = 5e-08
 Identities = 30/145 (20%), Positives = 62/145 (42%), Gaps = 5/145 (3%)

Query: 32  MTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTG-PLYIISWLLVIFTKHVYV 90
           +++AE+  ++S+  LG ++  +  G L D +GR+  LL  G  L  +  LL+     + +
Sbjct: 208 LSAAEAGLLLSLFGLGGILGALLGGLLSDRLGRRRLLLLIGLLLAALGLLLLALAPSLAL 267

Query: 91  LYVVRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYV--- 147
           L V   + G  +G  F        E++  + RG  S  F    + G  L   +   +   
Sbjct: 268 LLVALLLLGFGLGFAFPALLTLASELAPPEARGTASGLFNTFGSLGGALGPLLAGLLLDT 327

Query: 148 -DYDTLAYVSLVIPVVFLMTFIWMP 171
             Y  +  +   + ++  +  + +P
Sbjct: 328 GGYGGVFLILAALALLAALLLLLLP 352



 Score = 49.6 bits (119), Expect = 2e-06
 Identities = 27/150 (18%), Positives = 57/150 (38%), Gaps = 19/150 (12%)

Query: 280 LLGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNWI 339
              +   L +    +L DR GRR +LL+      +  L+       + +  +        
Sbjct: 42  AFSLGYALGSLLAGYLSDRFGRRRVLLLGLLLFALGSLLLA----FASSLWL-------- 89

Query: 340 PLISITCFAVIYSIGLG---PLVPTLQGEFFPSNTRGLAGGVTTITLTVISFLVMKMYQV 396
               +     +  +G G   P    L  E+FP   RG A G+ +    + + L   +  +
Sbjct: 90  ----LLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGL 145

Query: 397 ICDHYGVYLNFYIYSLGCIICGVLVYFIIP 426
           + +  G    F I ++  ++  +L+ F++ 
Sbjct: 146 LAESLGWRWLFLILAILGLLLALLLLFLLR 175



 Score = 47.3 bits (113), Expect = 8e-06
 Identities = 34/179 (18%), Positives = 56/179 (31%), Gaps = 15/179 (8%)

Query: 249 VISGMSAVLPYASSTFARTEG-SLITPDECTLLLGILVFLSTFPTAFLVDRTGRRP-LLL 306
           +  G   +L Y         G S         L G+   L       L DR GRR  LLL
Sbjct: 187 LSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLSLFGLGGILGALLGGLLSDRLGRRRLLLL 246

Query: 307 VSCFGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEF 366
           +    + +  L+                  +   L+              P + TL  E 
Sbjct: 247 IGLLLAALGLLLLALA-------------PSLALLLVALLLLGFGLGFAFPALLTLASEL 293

Query: 367 FPSNTRGLAGGVTTITLTVISFLVMKMYQVICDHYGVYLNFYIYSLGCIICGVLVYFII 425
            P   RG A G+     ++   L   +  ++ D  G    F I +   ++  +L+  + 
Sbjct: 294 APPEARGTASGLFNTFGSLGGALGPLLAGLLLDTGGYGGVFLILAALALLAALLLLLLP 352



 Score = 36.9 bits (86), Expect = 0.018
 Identities = 30/147 (20%), Positives = 58/147 (39%), Gaps = 4/147 (2%)

Query: 37  SSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFT-KHVYVLYVVR 95
                +   LG L+ P+  G L + +G +   L    L ++  LL++F  + + +L +  
Sbjct: 125 LGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLLALLLLFLLRLLLLLALAF 184

Query: 96  FMQGLAMGIVFTVAPMYIGEISGAKCRGA---LSTFFIGMLNTGILLEYTVGPYVDYDTL 152
           F+       + T  P+Y+ E+ G     A   LS F +G +   +L             L
Sbjct: 185 FLLSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLSLFGLGGILGALLGGLLSDRLGRRRLL 244

Query: 153 AYVSLVIPVVFLMTFIWMPESPYFLIM 179
             + L++  + L+     P     L+ 
Sbjct: 245 LLIGLLLAALGLLLLALAPSLALLLVA 271


>gnl|CDD|233175 TIGR00895, 2A0115, benzoate transport.  [Transport and binding
           proteins, Carbohydrates, organic alcohols, and acids].
          Length = 398

 Score = 63.9 bits (156), Expect = 6e-11
 Identities = 69/362 (19%), Positives = 129/362 (35%), Gaps = 47/362 (12%)

Query: 32  MTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVL 91
           +   +  ++ S   +G     +  G L D +GR+  LL +  L+ +  LL     +V  L
Sbjct: 49  LDPVQLGFLFSAGLIGMAFGALFFGPLADRIGRRRVLLWSILLFSVFTLLCALATNVTQL 108

Query: 92  YVVRFMQGLAMGIVFTVAPMYIGEISGAKCRG-ALSTFFIGM---LNTGILLEYTVGPYV 147
            ++RF+ GL +G +       + E +  + RG A+   F G       G  L   + P  
Sbjct: 109 LILRFLAGLGLGGLMPNLNALVSEYAPKRFRGTAVGLMFCGYPIGAAVGGFLAGWLIPVF 168

Query: 148 DYDTLAYVSLVIPVVFL-MTFIWMPESPYFLIMKGRD-VDARKSLFWLRGGRESSKDKIN 205
            + +L YV  + P++ L +   ++PES  FL+ K  + V    +    +   E+      
Sbjct: 169 GWRSLFYVGGIAPLLLLLLLMRFLPESIDFLVSKRPETVRRIVNAIAPQMQAEAQSA--- 225

Query: 206 LELSNIKQDVEREMKLSDDFMDIISTPA-NRRSLLI-----VQIVAVADVISGMSAVLPY 259
           L      Q  +R           +      R ++L+     + +V V  + +     LP 
Sbjct: 226 LPEQKATQGTKR------SVFKALFQGKTARITVLLWLLYFMLLVGVYFLTNW----LPK 275

Query: 260 ASSTFARTEGSLITPDECTLLLGILVF---LSTFPTAFLVDRTGRRPLLLVSCFGSGISQ 316
                        +         +  F   + +    +L DR G R   L+   G+  + 
Sbjct: 276 LMVELGF------SLSLAATGGALFNFGGVIGSIIFGWLADRLGPRVTALLLLLGAVFAV 329

Query: 317 LIAGTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRGLAG 376
           L+  T              F+   L+ +   A  +  G    +  L   F+P+  R    
Sbjct: 330 LVGSTL-------------FSPTLLLLLGAIAGFFVNGGQSGLYALMALFYPTAIRATGV 376

Query: 377 GV 378
           G 
Sbjct: 377 GW 378


>gnl|CDD|233166 TIGR00880, 2_A_01_02, Multidrug resistance protein. 
          Length = 141

 Score = 58.8 bits (143), Expect = 1e-10
 Identities = 30/141 (21%), Positives = 58/141 (41%), Gaps = 5/141 (3%)

Query: 38  SWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFM 97
             +++   LG L+     G L D  GRKP LL    ++++S  +   + ++ VL + RF+
Sbjct: 1   GLLLAGYALGQLIYSPLSGLLTDRFGRKPVLLVGLFIFVLSTAMFALSSNITVLIIARFL 60

Query: 98  QGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYVD-----YDTL 152
           QG             I +I   + RG         +  G LL   +G  +          
Sbjct: 61  QGFGAAFALVAGAALIADIYPPEERGVALGLMSAGIALGPLLGPPLGGVLAQFLGWRAPF 120

Query: 153 AYVSLVIPVVFLMTFIWMPES 173
            +++++    F++    +PE+
Sbjct: 121 LFLAILALAAFILLAFLLPET 141



 Score = 39.6 bits (93), Expect = 6e-04
 Identities = 32/150 (21%), Positives = 60/150 (40%), Gaps = 13/150 (8%)

Query: 279 LLLGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNW 338
               +   + +  +  L DR GR+P+LLV  F   +S  +      LS N TV       
Sbjct: 5   AGYALGQLIYSPLSGLLTDRFGRKPVLLVGLFIFVLSTAMFA----LSSNITV------- 53

Query: 339 IPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRGLAGGVTTITLTVISFLVMKMYQVIC 398
             +I+        +  L      L  + +P   RG+A G+ +  + +   L   +  V+ 
Sbjct: 54  -LIIARFLQGFGAAFAL-VAGAALIADIYPPEERGVALGLMSAGIALGPLLGPPLGGVLA 111

Query: 399 DHYGVYLNFYIYSLGCIICGVLVYFIIPES 428
              G    F   ++  +   +L+ F++PE+
Sbjct: 112 QFLGWRAPFLFLAILALAAFILLAFLLPET 141


>gnl|CDD|233172 TIGR00891, 2A0112, putative sialic acid transporter.  [Transport
           and binding proteins, Carbohydrates, organic alcohols,
           and acids].
          Length = 405

 Score = 58.7 bits (142), Expect = 3e-09
 Identities = 64/385 (16%), Positives = 129/385 (33%), Gaps = 48/385 (12%)

Query: 33  TSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLY 92
           T+ +++ ++S   +      +  G   D  GR+  ++T+  L+    L   F      ++
Sbjct: 45  TTVDAASLISAALISRWFGALMFGLWGDRYGRRLPMVTSIVLFSAGTLACGFAPGYITMF 104

Query: 93  VVRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLE----YTVGPYVD 148
           + R + G+ MG  +  +  Y+ E      R   S   I     G ++       V P   
Sbjct: 105 IARLVIGIGMGGEYGSSAAYVIESWPKHLRNKASGLLISGYAVGAVVAAQVYSLVVPVWG 164

Query: 149 --YDTLAYVSLVIPVVFLMTFIWMPESPYFLIMKGRDVDARKSLFWLRGGRESSKDKINL 206
             +  L ++S++  +  L     +PE+  +          R  +  L GG     + +  
Sbjct: 165 DGWRALFFISILPIIFALWLRKNIPEAEDWKEKHAGKALVRTMVDILYGGEHRIANIVMT 224

Query: 207 ELSNIKQDVEREMKLSDDFMDIISTPANRRSLLIVQIVAVADVISGMSAVLPYASSTFAR 266
             + + Q   +                     L+V ++        +  +LP    T+ +
Sbjct: 225 LAAAMVQSAGKRWPTF--------------VYLVVLVLFANLYSHPIQDLLP----TYLK 266

Query: 267 TEGSLITPDECTLLLGILVFLSTFPTA-------FLVDRTGRRPLLLVSCFGSGISQLIA 319
            +  L +P         +V  S            FL D  GRR   + S     +  LI 
Sbjct: 267 ADLGL-SPHTVA----NIVVFSNIGAIVGGCVFGFLGDWLGRRKAYVCSLLAGQL--LII 319

Query: 320 GTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRGLAGGVT 379
             + + +    + L  F          F  +   G+  ++P   GE+FP++ R    G T
Sbjct: 320 PVFAIGANVAVLGLGLF----------FQQMLVQGIWGILPKHLGEYFPTDQRAAGLGFT 369

Query: 380 TITLTVISFLVMKMYQVICDHYGVY 404
                +   L   +  ++      Y
Sbjct: 370 YQLGNLGGALAPIIGALLAQRLDEY 394


>gnl|CDD|145103 pfam01770, Folate_carrier, Reduced folate carrier.  The reduced
           folate carrier (a transmembrane glycoprotein) transports
           reduced folate into mammalian cells via the carrier
           mediated mechanism (as opposed to the receptor mediated
           mechanism); it also transports cytotoxic folate analogues
           used in chemotherapy, such as methotrexate (MTX).
           Mammalian cells have an absolute requirement for
           exogenous folates which are needed for growth, and
           biosynthesis of macromolecules.
          Length = 410

 Score = 52.3 bits (126), Expect = 3e-07
 Identities = 42/141 (29%), Positives = 64/141 (45%), Gaps = 16/141 (11%)

Query: 48  NLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGLAMG---- 103
            L   +P+  L DY+  KP ++  G   I++WLL++FT  V  + V+ F  G+A      
Sbjct: 54  YLALLVPVFLLTDYLRYKPVIILQGLAGIVTWLLLLFTTSVIAMQVMEFFYGVATAAEVA 113

Query: 104 ----IVFTVAPMYIGEISGAKCRGA-LSTFFIGMLNTGILLEYTVGPYVDYDTLAYVSLV 158
               I   V P     ++    R A L   F+  +   +L+       V Y TL Y+SL 
Sbjct: 114 YYSYIYSKVDPERYQRVTS-YTRAATLVGKFLSGVLGQLLVSLGR---VSYFTLNYISLA 169

Query: 159 IPVV--FLMTFIWMPE-SPYF 176
             VV  FL  F+  P+ S +F
Sbjct: 170 SQVVALFLSLFLPRPKRSLFF 190


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 39.6 bits (92), Expect = 0.004
 Identities = 39/177 (22%), Positives = 65/177 (36%), Gaps = 17/177 (9%)

Query: 27  ETYIEMTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTK 86
           E  + +  +    +  I+ LG +V     G L D +GRK CLL    +         F +
Sbjct: 194 EKDLCIPDSGKGMLGLIVYLGMMVGAFFWGGLADKLGRKQCLLICLSVNGFFAFFSSFVQ 253

Query: 87  HVYVLYVVRFMQGLAMGIVFTVAPMYIGEISGAKCRGA----LSTFF-IGMLNTGILL-- 139
                   R + G  +G    +   Y  E    + RG     L  F+ IG +    +   
Sbjct: 254 GYGFFLFCRLLSGFGIGGAIPIVFSYFAEFLAQEKRGEHLSWLCMFWMIGGIYAAAMAWA 313

Query: 140 -------EYTVGPYVDYDTLAYVSLV--IPVVF-LMTFIWMPESPYFLIMKGRDVDA 186
                   + +G    + +     +V   P VF +    +MPESP F +  G+  +A
Sbjct: 314 IIPHYGWSFQMGSAYQFHSWRVFVIVCAFPCVFAIGALTFMPESPRFFLENGKHDEA 370



 Score = 29.9 bits (67), Expect = 3.2
 Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 11/57 (19%)

Query: 256 VLPYASSTFARTEGSLITPDECTLLLGILVFLSTFPTAF----LVDRTGRRPLLLVS 308
           VLP A       E  L  PD    +LG++V+L     AF    L D+ GR+  LL+ 
Sbjct: 189 VLPSA-------EKDLCIPDSGKGMLGLIVYLGMMVGAFFWGGLADKLGRKQCLLIC 238


>gnl|CDD|129888 TIGR00806, rfc, RFC reduced folate carrier.  The Reduced Folate
           Carrier (RFC) Family (TC 2.A.48) Members of the RFC
           family mediate the uptake of folate, reduced folate,
           derivatives of reduced folate and the drug,
           methotrexate. Proteins of the RFC family are so-far
           restricted to animals. RFC proteins possess 12 putative
           transmembrane a-helical spanners (TMSs) and evidence for
           a 12 TMS topology has been published for the human RFC.
           The RFC transporters appear to transport reduced folate
           by an energy-dependent, pH-dependent, Na+-independent
           mechanism. Folate:H+ symport, folate:OH- antiport and
           folate:anion antiport mechanisms have been proposed, but
           the energetic mechanism is not well defined [Transport
           and binding proteins, Carbohydrates, organic alcohols,
           and acids].
          Length = 511

 Score = 38.7 bits (90), Expect = 0.005
 Identities = 38/165 (23%), Positives = 75/165 (45%), Gaps = 18/165 (10%)

Query: 37  SSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRF 96
           ++ ++ ++   +L   +P+  L DY+  KP L+     ++  WLL++    V+ + ++  
Sbjct: 64  TNEIIPVLPYSHLAVLVPVFLLTDYLRYKPVLVLQALSFVCVWLLLLLGTSVWHMQLMEV 123

Query: 97  MQGLAMGI-------VFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVG-PYVD 148
              + M         +F++ P    + + A  R A+    +G+  + +L +  V   ++ 
Sbjct: 124 FYSVTMAARIAYSSYIFSLVPPSRYQRAAAYSRAAV---LLGVFLSSVLGQLLVTLGWIS 180

Query: 149 YDTLAYVSLV--IPVVFLMTFIWMP-ESPYFLIMKGRDVDARKSL 190
           Y TL  +SLV     VFL  F+  P  S +F     R  D R +L
Sbjct: 181 YSTLNIISLVFMTFSVFLALFLKRPKRSLFF----NRLEDVRGAL 221


>gnl|CDD|129965 TIGR00887, 2A0109, phosphate:H+ symporter.  This model represents
           the phosphate uptake symporter subfamily of the major
           facilitator superfamily (pfam00083) [Transport and
           binding proteins, Anions].
          Length = 502

 Score = 38.6 bits (90), Expect = 0.006
 Identities = 44/182 (24%), Positives = 69/182 (37%), Gaps = 46/182 (25%)

Query: 283 ILVFLSTFP----TAFLVDRTGRRPLLLVSCFGSGISQLI-AGTYYLLSENYTVDLSKFN 337
           I+    T P    T FLVD  GR+P+ L+  F   +   +    Y  LS +         
Sbjct: 342 IIALAGTVPGYWVTVFLVDIIGRKPIQLMGFFILTVLFFVLGFAYNHLSTH--------- 392

Query: 338 WIPLISITCFAVIYSI-----GLGP-----LVPTLQGEFFPSNTRGLAGGVTT----ITL 383
                    F  IY +       GP     +VP   GE FP+  R  A G++        
Sbjct: 393 --------GFLAIYVLAQFFANFGPNATTFIVP---GEVFPTRYRSTAHGISAASGKAGA 441

Query: 384 TVISFLVMKMYQVICDHYGVYLNFY------IYSLGCIICGVLVYFIIPESKGKTFAQIQ 437
            +  F  + + Q      G     +      I++L  +  G+L   +IPE+KGK+  ++ 
Sbjct: 442 IIGQFGFLYLAQHGDPTKGYPTGIWMGHVLEIFAL-FMFLGILFTLLIPETKGKSLEELS 500

Query: 438 EE 439
            E
Sbjct: 501 GE 502


>gnl|CDD|183259 PRK11652, emrD, multidrug resistance protein D; Provisional.
          Length = 394

 Score = 38.0 bits (89), Expect = 0.009
 Identities = 41/154 (26%), Positives = 57/154 (37%), Gaps = 43/154 (27%)

Query: 56  GFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGLAMGIVFTVAPMYIGE 115
           G L D VGR+P +L    ++I+  L+ +F   + VL     +QGL  G            
Sbjct: 64  GPLSDRVGRRPVILVGMSIFILGTLVALFAHSLTVLIAASAIQGLGTG------------ 111

Query: 116 ISGAKCR--------GALSTFFIGMLNTGI----LLEYTVGPYVD--------YDTLAYV 155
           + G   R        G        +LN GI    LL   +G  +         Y  L   
Sbjct: 112 VGGVMARTLPRDLYEGTQLRHANSLLNMGILVSPLLAPLIGGLLTTLFGWRACYLFLL-- 169

Query: 156 SLVIPVVFLMTFIWMPESPYFLIMKGRDVDARKS 189
            L   V F M   WMPE+        R  DAR++
Sbjct: 170 LLGAGVTFSM-ARWMPET--------RPADARRT 194


>gnl|CDD|223553 COG0477, ProP, Permeases of the major facilitator superfamily
           [Carbohydrate transport and metabolism / Amino acid
           transport and metabolism / Inorganic ion transport and
           metabolism / General function prediction only].
          Length = 338

 Score = 37.4 bits (85), Expect = 0.013
 Identities = 27/148 (18%), Positives = 54/148 (36%), Gaps = 4/148 (2%)

Query: 27  ETYIEMTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIF-- 84
              +         ++S   LG  +  +  G L D  GR+  L+    L+++  LL+    
Sbjct: 31  TLSLSSGRLLYGLLLSAFFLGYAIGSLLAGPLGDRYGRRKVLIIGLLLFLLGTLLLALAP 90

Query: 85  TKHVYVLYVVRFMQGLAMGIVFTVAPMYIGEISGAKCR--GALSTFFIGMLNTGILLEYT 142
              + +L ++R +QGL  G +  VA   + E          A+    +G    G+ L   
Sbjct: 91  NVGLALLLILRLLQGLGGGGLLPVASALLSEWFPEATERGLAVGLVTLGAGALGLALGPL 150

Query: 143 VGPYVDYDTLAYVSLVIPVVFLMTFIWM 170
           +   +    L        +  L+  + +
Sbjct: 151 LAGLLLGALLWGWRAAFLLAALLGLLLL 178


>gnl|CDD|237958 PRK15403, PRK15403, multidrug efflux system protein MdtM;
           Provisional.
          Length = 413

 Score = 37.5 bits (87), Expect = 0.015
 Identities = 24/105 (22%), Positives = 50/105 (47%), Gaps = 4/105 (3%)

Query: 41  VSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGL 100
           VS+   G +     +G L D +GR+P L+T   ++ ++    +FT  +    + RF+QG 
Sbjct: 57  VSLYLAGGMALQWLLGPLSDRIGRRPVLITGALIFTLACAATLFTTSMTQFLIARFIQGT 116

Query: 101 AMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGP 145
           ++  + TV  + + E  G           + ++ + +L+   +GP
Sbjct: 117 SICFIATVGYVTVQEAFGQT----KGIKLMAIITSIVLVAPIIGP 157


>gnl|CDD|129794 TIGR00711, efflux_EmrB, drug resistance transporter, EmrB/QacA
           subfamily.  This subfamily of drug efflux proteins, a
           part of the major facilitator family, is predicted to
           have 14 potential membrane-spanning regions. Members
           with known activities include EmrB (multiple drug
           resistance efflux pump) in E. coli, FarB (antibacterial
           fatty acid resistance) in Neisseria gonorrhoeae, TcmA
           (tetracenomycin C resistance) in Streptomyces
           glaucescens, etc. In most cases, the efflux pump is
           described as having a second component encoded in the
           same operon, such as EmrA of E. coli [Cellular
           processes, Toxin production and resistance, Transport
           and binding proteins, Other].
          Length = 485

 Score = 37.0 bits (86), Expect = 0.020
 Identities = 39/141 (27%), Positives = 57/141 (40%), Gaps = 19/141 (13%)

Query: 292 TAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISITCFAVIY 351
           T +L  R G R L L+S F   +  L+ G    ++ N            L  +  F VI 
Sbjct: 57  TGWLAKRFGTRRLFLISTFAFTLGSLLCG----VAPN------------LELMIIFRVIQ 100

Query: 352 SIGLGPLVPTLQGEF---FPSNTRGLAGGVTTITLTVISFLVMKMYQVICDHYGVYLNFY 408
             G GPL+P         +P   RG A  +  +T+ V   L   +   I ++Y     F 
Sbjct: 101 GFGGGPLIPLSFSTLLNIYPPEKRGRAMAIWGLTVLVAPALGPTLGGWIIENYHWRWIFL 160

Query: 409 IYSLGCIICGVLVYFIIPESK 429
           I     II  V+ +FI+P  K
Sbjct: 161 INVPIGIIVVVVAFFILPRDK 181


>gnl|CDD|236927 PRK11551, PRK11551, putative 3-hydroxyphenylpropionic transporter
           MhpT; Provisional.
          Length = 406

 Score = 36.9 bits (86), Expect = 0.021
 Identities = 42/151 (27%), Positives = 62/151 (41%), Gaps = 19/151 (12%)

Query: 39  WVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVY---VLYVVR 95
           W  S   LG L   +  G L D +GRK  L+ +  L+    L  + T   +    L V R
Sbjct: 54  WAFSAGILGLLPGALLGGRLADRIGRKRILIVSVALF---GLFSLATAQAWDFPSLLVAR 110

Query: 96  FMQGLAMGIVFTVAPMYIG---EISGAKCRG-ALSTFFIGMLNTGILLEYTVGPYVDYD- 150
            + G+ +G      P  I    E  G + RG A+S  + G+   G  L   +G     D 
Sbjct: 111 LLTGVGLG---GALPNLIALTSEAVGPRLRGTAVSLMYCGV-PFGGALASVIGVLAAGDA 166

Query: 151 ---TLAYVSLVIP-VVFLMTFIWMPESPYFL 177
               + YV  V P ++  +   W+PES  F 
Sbjct: 167 AWRHIFYVGGVGPLLLVPLLMRWLPESRAFA 197


>gnl|CDD|233174 TIGR00893, 2A0114, D-galactonate transporter.  [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 399

 Score = 36.5 bits (85), Expect = 0.029
 Identities = 70/417 (16%), Positives = 129/417 (30%), Gaps = 67/417 (16%)

Query: 32  MTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISW----LLVIFTKH 87
           +++A+  +V S    G +V   P G+L+D  G +     T  ++I+ W     L  F   
Sbjct: 26  LSAAQYGYVFSAFSWGYVVGQFPGGWLLDRFGAR----KTLAVFIVIWGVFTGLQAFAGA 81

Query: 88  VYVLYVVRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYV 147
              LY++R + G A    F    + +     A  R    + F      G ++    GP V
Sbjct: 82  YVSLYILRVLLGAAEAPFFPGIILIVASWFPASERATAVSIFNSAQGLGGII---GGPLV 138

Query: 148 DYDTLAYVSLVIPVV-----FLMTFIWMPESPYFLIMKGRDVDARKSLFWLRGGRESSKD 202
            +  + +      ++      +   +W        I      D  +   WL    +    
Sbjct: 139 GWILIHFSWQWAFIIEGVLGIIWGVLW-----LKFI-----PDPPQKAKWLTEEEKYIVV 188

Query: 203 KINLELSNIKQDVEREMKLSDDFMDIISTPANRRSLLIVQIVAVADVISGMSAVLPYASS 262
                L+  +       K       I     +RR   +     + ++  G          
Sbjct: 189 G--GLLAEQQGKGPSTPKK----YQIKELLKDRRVWGLALGQFLVNIGLGF---FLTWFP 239

Query: 263 TFARTEG--SLITPDECTLLLGILVFLSTFPTAFLVDRTGRRP------------LLLVS 308
           T+   E   S++       L GI+ F+       L D   RR               LV 
Sbjct: 240 TYLVQERGLSILEAGFMASLPGIVGFIGMILGGRLSDLLLRRGKSLVFARKTAIIAGLVL 299

Query: 309 CFGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEFFP 368
                 +  +   Y  L+              L+++  F +    G G +   L  +  P
Sbjct: 300 SLLMFATNYVNIPYAALA--------------LVALGFFGL----GAGAIGWALISDNAP 341

Query: 369 SNTRGLAGGVTTITLTVISFLVMKMYQVICDHYGVYLNFYIYSLGCIICGVLVYFII 425
            N  GL GG+      +   +   +   I    G +    +      + G L Y ++
Sbjct: 342 GNIAGLTGGLINSLGNLGGIVGPIVIGAIAATTGSFAGALMVVAALALIGALSYLLL 398


>gnl|CDD|225371 COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport
           and metabolism].
          Length = 394

 Score = 36.4 bits (85), Expect = 0.031
 Identities = 25/102 (24%), Positives = 46/102 (45%), Gaps = 1/102 (0%)

Query: 32  MTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVL 91
           ++   +  +++   LG  +    +  L   + R+  LL    L+I+S LL        VL
Sbjct: 45  VSEGAAGQLITAYALGVALGAPLLALLTGRLERRRLLLGLLALFIVSNLLSALAPSFAVL 104

Query: 92  YVVRFMQGLAMGIVFTVAPMYIGEISGAKCRG-ALSTFFIGM 132
            + R + GLA G+ +++A      +     RG AL+  F G+
Sbjct: 105 LLARALAGLAHGVFWSIAAALAARLVPPGKRGRALALVFTGL 146



 Score = 30.3 bits (69), Expect = 2.4
 Identities = 26/108 (24%), Positives = 42/108 (38%), Gaps = 17/108 (15%)

Query: 229 ISTPANRRSLLIVQIVAVADVISGMS-----AVLPYASSTFARTEGS---LITPDECTLL 280
               A +   L +  +A+A    G +      +LP  ++    +EG+   LIT       
Sbjct: 3   TPATARKPMWLALLALALAAFAIGTTEFVPVGLLPPIAADLGVSEGAAGQLIT------A 56

Query: 281 LGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLI---AGTYYLL 325
             + V L     A L  R  RR LLL       +S L+   A ++ +L
Sbjct: 57  YALGVALGAPLLALLTGRLERRRLLLGLLALFIVSNLLSALAPSFAVL 104


>gnl|CDD|233168 TIGR00883, 2A0106, metabolite-proton symporter.  This model
           represents the metabolite:H+ symport subfamily of the
           major facilitator superfamily (pfam00083), including
           citrate-H+ symporters, dicarboxylate:H+ symporters, the
           proline/glycine-betaine transporter ProP, etc [Transport
           and binding proteins, Unknown substrate].
          Length = 394

 Score = 35.7 bits (83), Expect = 0.044
 Identities = 26/102 (25%), Positives = 44/102 (43%), Gaps = 10/102 (9%)

Query: 279 LLLGILVFLSTFP-TAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFN 337
           L+L +++F  T P +  L DR GRRP+L+   F      ++A    +      +D   F 
Sbjct: 261 LMLSLILFFITIPLSGALSDRIGRRPVLI--IFT-----VLAALLAVPLLMALLDSGSF- 312

Query: 338 WIPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRGLAGGVT 379
            +    +   A+I  +  GP+   L  E FP+  R     + 
Sbjct: 313 TLFFFLVLGMALIGGMYTGPMGSFL-PELFPTEVRYTGASLA 353



 Score = 31.5 bits (72), Expect = 0.96
 Identities = 18/98 (18%), Positives = 40/98 (40%), Gaps = 6/98 (6%)

Query: 32  MTSAESSWVVSIIELGNLVTPIPI-GFLVDYVGRKPCLLTTGPLYIISWLLVIFT-KHVY 89
           +++  +  V+ +  +   +T IP+ G L D +GR+P L+    L  +  + ++       
Sbjct: 252 LSANSALLVLMLSLILFFIT-IPLSGALSDRIGRRPVLIIFTVLAALLAVPLLMALLDSG 310

Query: 90  VLYVVRFMQGLAMGIV-FTVAPM--YIGEISGAKCRGA 124
              +  F+      I      PM  ++ E+   + R  
Sbjct: 311 SFTLFFFLVLGMALIGGMYTGPMGSFLPELFPTEVRYT 348


>gnl|CDD|115279 pfam06609, TRI12, Fungal trichothecene efflux pump (TRI12).  This
           family consists of several fungal specific trichothecene
           efflux pump proteins. Many of the genes involved in
           trichothecene toxin biosynthesis in Fusarium
           sporotrichioides are present within a gene cluster. It
           has been suggested that TRI12 may play a role in F.
           sporotrichioides self-protection against trichothecenes.
          Length = 598

 Score = 34.6 bits (79), Expect = 0.13
 Identities = 33/167 (19%), Positives = 68/167 (40%), Gaps = 17/167 (10%)

Query: 34  SAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYV 93
           S       ++  +G  V+ + +G L D  GR+P ++ T  + ++  ++         L  
Sbjct: 77  SENQGLFSTLWTMGQAVSILMMGRLTDRFGRRPFVIATHIIGLVGAIVGCTANKFNTLLA 136

Query: 94  VRFMQGLAMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGI--LLEYTVGPYV---- 147
              + G+A G     +P++IGE+   K      T F+G+L      +     GPY     
Sbjct: 137 AMTLLGVAAGPAGA-SPLFIGELMSNK------TKFLGLLIVSAPTIAMNGAGPYFGQRL 189

Query: 148 ----DYDTLAYVSLVIPVVFLMTFIWMPESPYFLIMKGRDVDARKSL 190
               ++  + Y+ +++  + ++  I     P F  + G+    R  L
Sbjct: 190 AIQGNWRWIFYIYIIMSAIAVLLIIIWYHPPSFAQLHGKKARKRDEL 236


>gnl|CDD|185300 PRK15402, PRK15402, multidrug efflux system translocase MdfA;
           Provisional.
          Length = 406

 Score = 33.8 bits (78), Expect = 0.19
 Identities = 15/63 (23%), Positives = 33/63 (52%)

Query: 46  LGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGLAMGIV 105
            G +     +G L D +GR+P +L     +I++ L ++  + +    ++RF+QG+ +  +
Sbjct: 59  AGGMFLQWLLGPLSDRIGRRPVMLAGVAFFILTCLAILLAQSIEQFTLLRFLQGIGLCFI 118

Query: 106 FTV 108
             V
Sbjct: 119 GAV 121


>gnl|CDD|233099 TIGR00710, efflux_Bcr_CflA, drug resistance transporter, Bcr/CflA
           subfamily.  This subfamily of drug efflux proteins, a
           part of the major facilitator family, is predicted to
           have 12 membrane-spanning regions. Members with known
           activity include Bcr (bicyclomycin resistance protein)
           in E. coli, Flor (chloramphenicol and florfenicol
           resistance) in Salmonella typhimurium DT104, and CmlA
           (chloramphenicol resistance) in Pseudomonas sp. plasmid
           R1033.
          Length = 385

 Score = 33.1 bits (76), Expect = 0.32
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query: 56  GFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGLA 101
           G L D  GR+P LL    ++ +S L +  + ++  L V+RF+Q   
Sbjct: 61  GPLSDRYGRRPVLLLGLFIFALSSLGLALSNNIETLLVLRFVQAFG 106


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 32.7 bits (75), Expect = 0.35
 Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 421 VYFIIPESKGKTFAQIQEELNKHIAHKSKLKEQRKQN-KINRNN 463
            +FII ++K K   +I  +  K    K K  ++  +N KIN  N
Sbjct: 242 PHFIIEKNKYKDLNKIIGKFKKSFKKKKKNSKKNYENIKINIFN 285


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
           model represents a group of paralogous families in
           plasmodium species alternately annotated as reticulocyte
           binding protein, 235-kDa family protein and rhoptry
           protein. Rhoptry protein is localized on the cell
           surface and is extremely large (although apparently
           lacking in repeat structure) and is important for the
           process of invasion of the RBCs by the parasite. These
           proteins are found in P. falciparum, P. vivax and P.
           yoelii.
          Length = 2757

 Score = 33.1 bits (75), Expect = 0.37
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 424 IIPESKGKTFAQIQEELNKHIAHKSKLKEQRKQNKINRNNEKADNNNVTKCKI 476
           II E K     +I ++LNK I    K KE+   NKIN   ++ D  N  K KI
Sbjct: 737 IIVEIKKHIHGEINKDLNK-ILEDFKNKEKELSNKINDYAKEKDELNKYKSKI 788


>gnl|CDD|237051 PRK12307, PRK12307, putative sialic acid transporter; Provisional.
          Length = 426

 Score = 32.6 bits (74), Expect = 0.49
 Identities = 68/335 (20%), Positives = 121/335 (36%), Gaps = 59/335 (17%)

Query: 56  GFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGLAMGIVFTVAPMYIGE 115
           G L D  GRKP ++ +   Y +   L      V +L + RF+ G+ M   +  A  Y  E
Sbjct: 74  GLLADKFGRKPLMMWSIVAYSVGTGLSGLASGVIMLTLSRFIVGMGMAGEYACASTYAVE 133

Query: 116 ISGAKCRGALSTFFIGMLNTG-ILLEYTVGPYVD---YDTLAYVSLVIPVVFLMTFIWMP 171
                 +   S F +     G I+  Y +  + +   +    +V L +PV+ ++      
Sbjct: 134 SWPKHLKSKASAFLVSGFGIGNIIAAYFMPSFAEAYGWRAAFFVGL-LPVLLVI------ 186

Query: 172 ESPYFLIMKGRDVDARKSLFWLRGGRESSKDKINLELSNIKQDVEREMKLSDDFMDIIST 231
                               ++R     SK+    +LS   +  +    +    M  +  
Sbjct: 187 --------------------YIRARAPESKEWEEAKLSGKGKHSQSAWSVFSLSMKGLFN 226

Query: 232 PANRRSLLIVQIVAVADVISGMSAVLPYASSTFARTEG-------SLITPDECTLLLGIL 284
            A     L V IV  +  I G +  +     T+   EG       +L+T      +LG +
Sbjct: 227 RAQFPLTLCVFIVLFS--IFGANWPIFGLLPTYLAGEGFDTGVVSNLMTAAAFGTVLGNI 284

Query: 285 VFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISI 344
           V+          DR G +    +    S +   I   + +  +NY           L+  
Sbjct: 285 VW------GLCADRIGLKKTFSIGLLMSFL--FIFPLFRIPQDNYL----------LLGA 326

Query: 345 TCFAVIYS-IGLGPLVPTLQGEFFPSNTRGLAGGV 378
             F ++ + +G+G LVP    ++FP   RGL  G+
Sbjct: 327 CLFGLMATNVGVGGLVPKFLYDYFPLEVRGLGTGL 361


>gnl|CDD|129977 TIGR00899, 2A0120, sugar efflux transporter.  This family of
           proteins is an efflux system for lactose, glucose,
           aromatic glucosides and galactosides, cellobiose,
           maltose, a-methyl glucoside and other sugar compounds.
           They are found in both gram-negative and gram-positive
           bacteria [Transport and binding proteins, Carbohydrates,
           organic alcohols, and acids].
          Length = 375

 Score = 32.1 bits (73), Expect = 0.57
 Identities = 23/132 (17%), Positives = 46/132 (34%), Gaps = 14/132 (10%)

Query: 297 DRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIGLG 356
           D  G R  L++ C        +A   +  + NY +         L+ +      ++    
Sbjct: 58  DYQGDRKGLILFCCLLAA---LACLLFAWNRNYFL---------LLVLGVLLSSFASTAN 105

Query: 357 PLVPTLQGEFFPSNTRGLAGGVTTITLTVISFLVMKMYQV--ICDHYGVYLNFYIYSLGC 414
           P +  L  E      R      + +   +    V+       +   +G  + F   +L  
Sbjct: 106 PQLFALAREHADRTGREAVMFSSVMRAQISLAWVIGPPLAFWLALGFGFTVMFLTAALAF 165

Query: 415 IICGVLVYFIIP 426
           ++CGVLV+  +P
Sbjct: 166 VLCGVLVWLFLP 177


>gnl|CDD|204565 pfam10956, DUF2756, Protein of unknown function (DUF2756).  Some
           members in this family of proteins are annotated yhhA
           however currently no function is known. The family
           appears to be restricted to Enterobacteriaceae.
          Length = 104

 Score = 30.4 bits (68), Expect = 0.70
 Identities = 11/37 (29%), Positives = 24/37 (64%), Gaps = 2/37 (5%)

Query: 435 QIQEE--LNKHIAHKSKLKEQRKQNKINRNNEKADNN 469
           QIQ++  LN+ +  +++L++Q  QN++N N ++    
Sbjct: 47  QIQQKGMLNQQLQTQTRLQQQHLQNQLNNNQQRVQQG 83


>gnl|CDD|182127 PRK09874, PRK09874, drug efflux system protein MdtG; Provisional.
          Length = 408

 Score = 31.4 bits (71), Expect = 0.93
 Identities = 41/156 (26%), Positives = 61/156 (39%), Gaps = 23/156 (14%)

Query: 279 LLLGILVFLSTFPTAF---LVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSK 335
           L+  I    S   + F   L DR GR+ +LL S  G GI  ++ G    L++N       
Sbjct: 58  LVFSITFLFSAIASPFWGGLADRKGRKIMLLRSALGMGIVMVLMG----LAQNI------ 107

Query: 336 FNWIPLISITCFAVIYSIGLGPLVP---TLQGEFFPSNTRGLAGGVTTITLTVISFLVMK 392
             W  LI      +     LG  VP    L     P N  G A G  +      + L   
Sbjct: 108 --WQFLILRALLGL-----LGGFVPNANALIATQVPRNKSGWALGTLSTGGVSGALLGPL 160

Query: 393 MYQVICDHYGVYLNFYIYSLGCIICGVLVYFIIPES 428
              ++ D YG+   F+I +    +C ++  F I E+
Sbjct: 161 AGGLLADSYGLRPVFFITASVLFLCFLVTLFCIREN 196


>gnl|CDD|129972 TIGR00894, 2A0114euk, Na(+)-dependent inorganic phosphate
           cotransporter.  [Transport and binding proteins,
           Anions].
          Length = 465

 Score = 31.6 bits (72), Expect = 0.95
 Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 9/110 (8%)

Query: 349 VIYSIGLGPLVPTLQG---EFFPSNTRGLAGGVTTITLTVISFLVMKMYQVICDHYGVY- 404
           VI  +  G + P       ++ P   R    G++T    + +F+ + +   +C+ +G + 
Sbjct: 139 VIQGLAQGSVSPATHKIIVKWAPPKERSRLLGMSTSGFQLGTFIFLPISGWLCESWGGWP 198

Query: 405 LNFYIYSL-GCIICGVLVYFIIPESKGKTFAQIQEELNKHIAHKSKLKEQ 453
           + FY++ + GC     L++F+ P         I +   K+I   S L+ Q
Sbjct: 199 MIFYVFGIVGCAWS--LLWFVFPADDPSIHPCISKFEKKYIN--SSLQGQ 244


>gnl|CDD|225121 COG2211, MelB, Na+/melibiose symporter and related transporters
           [Carbohydrate transport and metabolism].
          Length = 467

 Score = 31.5 bits (72), Expect = 1.2
 Identities = 23/110 (20%), Positives = 41/110 (37%), Gaps = 12/110 (10%)

Query: 33  TSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKH--VYV 90
               +  ++     G L+  I    LV   G+K   L    L  + +LL+ FT    V +
Sbjct: 270 PELFAYLLLLASGAGLLIGLILWPRLVKKFGKKKLFLIGLLLLAVGYLLLYFTPAGSVVL 329

Query: 91  LYVVRFMQGLAMGIVFTVAPMYIGEI-------SGAKCRG---ALSTFFI 130
           + V   + G+  GI   +    + +        +G +  G   +  TFF 
Sbjct: 330 IVVALIIAGVGTGIANPLPWAMVADTVDYGEWKTGVRREGIVYSGMTFFR 379


>gnl|CDD|204107 pfam08970, Sda, Sporulation inhibitor A.  Members of this protein
           family contain two antiparallel alpha helices that are
           linked by a highly structured inter-helix loop to form a
           helical hairpin; the structure is stabilised by numerous
           hydrophobic and electrostatic interactions. These
           sporulation inhibitors are antikinases that bind to the
           histidine kinase KinA phosphotransfer domain and act as
           a molecular barricade that inhibit productive
           interaction between the ATP binding site and the
           phosphorylatable KinA His residue. This results in the
           inhibition of sporulation (by preventing phosphorylation
           of spo0A).
          Length = 46

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 217 REMKLSDDFMDIISTPANRRSL 238
           +E+ L  DF+++I     RRSL
Sbjct: 17  KELNLDPDFIELIKEEIIRRSL 38


>gnl|CDD|114359 pfam05631, DUF791, Protein of unknown function (DUF791).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 354

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 7/56 (12%)

Query: 55  IGFLVDYVGRKPCLLTTGPLYIISWLLVIFTKH---VYVLYVVRFMQGLAMGIVFT 107
           +G L D  GRK   LT    Y I ++L   TKH     VL + RF+ G+A  ++F+
Sbjct: 89  VGSLADKQGRKRACLT----YCILYILSCITKHSPNYKVLMIGRFLGGIATSLLFS 140


>gnl|CDD|184835 PRK14822, PRK14822, nucleoside-triphosphatase; Provisional.
          Length = 200

 Score = 30.6 bits (70), Expect = 1.4
 Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 7/36 (19%)

Query: 423 FIIPESKGKTFAQI-QEELNKHIAHK----SKLKEQ 453
           F +PE KGKT A++  EE N  I+H+     KL+ +
Sbjct: 159 FYVPE-KGKTMAELSSEEKNA-ISHRGKALKKLEAE 192


>gnl|CDD|213979 TIGR04366, cupin_WbuC, cupin fold metalloprotein, WbuC family.
           Members of this family show sequence similarity to cupin
           fold proteins (see pfam07883), including conserved His
           residues likely to serve as metal-binding ligands. Many
           members occur in bacterial O-antigen biosynthesis
           regions. Some members have acquired the gene symbol wbuC
           (e.g. Jarvis, et al, 2011), but publications using this
           term do not ascribe a function.
          Length = 132

 Score = 29.8 bits (68), Expect = 1.4
 Identities = 10/43 (23%), Positives = 20/43 (46%)

Query: 326 SENYTVDLSKFNWIPLISITCFAVIYSIGLGPLVPTLQGEFFP 368
              + V++    W  L++++   VI+ +  GP  P    +F P
Sbjct: 87  GGTFGVEIPPGTWHTLVALSPGTVIFEVKEGPYDPLADKDFAP 129


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kinds of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 31.1 bits (70), Expect = 1.6
 Identities = 8/48 (16%), Positives = 14/48 (29%)

Query: 418 GVLVYFIIPESKGKTFAQIQEELNKHIAHKSKLKEQRKQNKINRNNEK 465
                    + + K   +  E L + I    +LK Q  + K       
Sbjct: 164 AGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKAL 211


>gnl|CDD|182924 PRK11043, PRK11043, putative transporter; Provisional.
          Length = 401

 Score = 30.6 bits (70), Expect = 1.9
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 56  GFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGL 100
           G L D  GRKP LL    L+ +  L +++ +    L V+RF+Q +
Sbjct: 62  GPLSDRYGRKPVLLAGLSLFALGSLGMLWVESAAQLLVLRFVQAV 106


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 30.6 bits (69), Expect = 2.1
 Identities = 7/48 (14%), Positives = 13/48 (27%), Gaps = 6/48 (12%)

Query: 427 ESKGKTFAQIQEEL----NKHIAHKS--KLKEQRKQNKINRNNEKADN 468
               +     ++E     N+ I         E  K   +   N + D 
Sbjct: 67  SQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDK 114


>gnl|CDD|225180 COG2271, UhpC, Sugar phosphate permease [Carbohydrate transport and
           metabolism].
          Length = 448

 Score = 30.3 bits (69), Expect = 2.3
 Identities = 28/159 (17%), Positives = 51/159 (32%), Gaps = 22/159 (13%)

Query: 279 LLLGILVFLSTFPTAFLVDRTGRRPLLLVSCFGSGISQLIAGTYYLLSENYTVDLSKFNW 338
               I   +S F    L DR+  R  +      S I  ++ G                  
Sbjct: 71  SAFSITYGVSKFVMGVLSDRSNPRYFMAFGLILSAIVNILFGF--------------SPS 116

Query: 339 IPLISITCF--AVIYSIGLGPLVPTLQGEFFPSNTRGLAGGVTTITLTVISFLVMKM--Y 394
           + L ++          +G  P   T+   +F    RG    +   +  +   L   +   
Sbjct: 117 LFLFAVLWVLNGWFQGMGWPPCARTI-THWFSRKERGTWWSIWNTSHNIGGALAPLVALL 175

Query: 395 QVICDHYGVYLNFYIYSLGCIICGVLVYFII---PESKG 430
                H G    FY   +  II  +++ F++   P+S+G
Sbjct: 176 AFFAFHGGWRAAFYFPGIIAIIVALILLFLLRDRPQSEG 214


>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2.  This entry is the
           first approximately 250 residues of cortactin-binding
           protein 2. In addition to being a positional candidate
           for autism this protein is expressed at highest levels
           in the brain in humans. The human protein has six
           associated ankyrin repeat domains pfam00023 towards the
           C-terminus which act as protein-protein interaction
           domains.
          Length = 193

 Score = 29.5 bits (66), Expect = 2.6
 Identities = 16/45 (35%), Positives = 22/45 (48%)

Query: 198 ESSKDKINLELSNIKQDVEREMKLSDDFMDIISTPANRRSLLIVQ 242
           E  + K  LEL   K+   R MK SDDF +++     R   L+ Q
Sbjct: 109 EKRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQERERLKKLLEQ 153


>gnl|CDD|224878 COG1967, COG1967, Predicted membrane protein [Function unknown].
          Length = 271

 Score = 29.6 bits (67), Expect = 3.7
 Identities = 27/105 (25%), Positives = 42/105 (40%), Gaps = 11/105 (10%)

Query: 255 AVLPY---ASSTFARTEGSLITPDECTLLLGI--LVFLSTFPTAFLVDRTGRRPLLLVSC 309
           A++P+    SS  A  +  ++ P    +  GI  LVF    PT     R  +RP +++  
Sbjct: 62  ALIPFVLLGSSVRALVDAGILPPSYLIITPGIYFLVFAIALPTLLASVRFFKRPYVVLL- 120

Query: 310 FGSGISQLIAGTYYLLSENYTVDLSKFNWIPLISITCFAVIYSIG 354
            G G   L A    LL  N       FN   L+ +   A + +  
Sbjct: 121 AGWG-LVLAAVALLLLLHNAPT----FNLYVLVLLIGVATVLTAV 160


>gnl|CDD|233128 TIGR00792, gph, sugar (Glycoside-Pentoside-Hexuronide) transporter.
            The Glycoside-Pentoside-Hexuronide (GPH):Cation
           Symporter Family (TC 2.A.2) GPH:cation symporters
           catalyze uptake of sugars in symport with a monovalent
           cation (H+ or Na+). Members of this family include
           transporters for melibiose, lactose, raffinose,
           glucuronides, pentosides and isoprimeverose. Mutants of
           two groups of these symporters (the melibiose permeases
           of enteric bacteria, and the lactose permease of
           Streptococcus thermophilus) have been isolated in which
           altered cation specificity is observed or in which sugar
           transport is uncoupled from cation symport (i.e.,
           uniport is catalyzed). The various members of the family
           can use Na+, H+ or Li, Na+ or Li+, H+ or Li+, or only H+
           as the symported cation. All of these proteins possess
           twelve putative transmembrane a-helical spanners
           [Transport and binding proteins, Carbohydrates, organic
           alcohols, and acids].
          Length = 437

 Score = 29.5 bits (67), Expect = 4.0
 Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 1/82 (1%)

Query: 28  TYIEMTSAESSWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIFTK- 86
           TY+       S++ SI     L+  +    LV   GRK        L ++ +L+  F   
Sbjct: 250 TYVLGDPELFSYMGSIAIGAGLIGVLLFPRLVKKFGRKILFAGGILLMVLGYLIFFFAGS 309

Query: 87  HVYVLYVVRFMQGLAMGIVFTV 108
           ++ ++ V+  + G     V  +
Sbjct: 310 NLPLILVLIILAGFGQNFVTGL 331


>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
          Length = 1713

 Score = 29.5 bits (66), Expect = 4.7
 Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 2/66 (3%)

Query: 189 SLFWLRGGRESSKDKINLELSNIKQDVEREMKLSDDFMDIISTPANRRSLLIVQIVAVAD 248
           SLF L+  + S K K+ LE +  ++  E+   L +D   +  T A+ R  L      V  
Sbjct: 31  SLFKLKRPKNSKKVKVGLESTGKRE--EKLSALDEDSEGMDDTLASFRKRLKGPKKGVGS 88

Query: 249 VISGMS 254
           V + MS
Sbjct: 89  VSARMS 94


>gnl|CDD|237823 PRK14823, PRK14823, putative deoxyribonucleoside-triphosphatase;
           Provisional.
          Length = 191

 Score = 28.9 bits (65), Expect = 4.9
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 425 IPESKGKTFAQIQEELNKHIAHKSK 449
           +PE   KTFA++  E+   I+H++K
Sbjct: 155 VPEGYDKTFAELGLEIKNQISHRAK 179


>gnl|CDD|232795 TIGR00042, TIGR00042, non-canonical purine NTP pyrophosphatase,
           RdgB/HAM1 family.  Saccharomyces cerevisiae HAM1
           protects against the mutagenic effects of the base
           analog 6-N-hydroxylaminopurine, which can be a natural
           product of monooxygenase activity on adenine.
           Methanococcus jannaschii MJ0226 and E. coli RdgB are
           also characterized as pyrophosphatases active against
           non-standard purine NTPs. E. coli RdgB appears to act
           by intercepting non-canonical deoxyribonucleotide
           triphosphates from replication precursor pools. [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 184

 Score = 28.5 bits (64), Expect = 5.8
 Identities = 19/66 (28%), Positives = 26/66 (39%), Gaps = 17/66 (25%)

Query: 396 VICDHYGVYLNFYIYSLGCIICGVLVY------------FIIPESKGKTFAQIQEELNKH 443
             CD  G  L F     G I+ G +                IP  +GKTFA++  E    
Sbjct: 114 GYCDPNGEPLVF----EG-IVKGKITREPRGTYGFGYDPIFIPPEEGKTFAELTTEEKNK 168

Query: 444 IAHKSK 449
           I+H+ K
Sbjct: 169 ISHRGK 174


>gnl|CDD|185746 cd08915, V_Alix_like, Protein-interacting V-domain of mammalian
           Alix and related domains.  This superfamily contains the
           V-shaped (V) domain of mammalian Alix (apoptosis-linked
           gene-2 interacting protein X), His-Domain type N23
           protein tyrosine phosphatase (HD-PTP, also known as
           PTPN23), Bro1 and Rim20 (also known as PalA) from
           Saccharomyces cerevisiae, and related domains. Alix,
           HD-PTP, Bro1, and Rim20 all interact with the ESCRT
           (Endosomal Sorting Complexes Required for Transport)
           system. Alix, also known as apoptosis-linked gene-2
           interacting protein 1 (AIP1), participates in membrane
           remodeling processes during the budding of enveloped
           viruses, vesicle budding inside late endosomal
           multivesicular bodies (MVBs), and the abscission
           reactions of mammalian cell division. It also functions
           in apoptosis. HD-PTP functions in cell migration and
           endosomal trafficking, Bro1 in endosomal trafficking,
           and Rim20 in the response to the external pH via the
           Rim101 pathway. The Alix V-domain contains a binding
           site, partially conserved in this superfamily, for the
           retroviral late assembly (L) domain YPXnL motif. The
           Alix V-domain is also a dimerization domain. Members of
           this superfamily have an N-terminal Bro1-like domain,
           which binds components of the ESCRT-III complex. The
           Bro1-like domains of Alix and HD-PTP can also bind human
           immunodeficiency virus type 1 (HIV-1) nucleocapsid. Many
           members, including Alix, HD-PTP, and Bro1, also have a
           proline-rich region (PRR), which binds multiple partners
           in Alix, including Tsg101 (tumor susceptibility gene
           101, a component of ESCRT-1) and the apoptotic protein
           ALG-2. The C-terminal portion (V-domain and PRR) of Bro1
           interacts with Doa4, a ubiquitin thiolesterase needed to
           remove ubiquitin from MVB cargoes; it interacts with a
           YPxL motif in Doa4's catalytic domain to stimulate its
           deubiquitination activity. Rim20 may bind the ESCRT-III
           subunit Snf7, bringing the protease Rim13 (a
           YPxL-containing transcription factor) into proximity
           with Rim101, and promoting the proteolytic activation of
           Rim101. HD-PTP is encoded by the PTPN23 gene, a tumor
           suppressor gene candidate often absent in human kidney,
           breast, lung, and cervical tumors. HD-PTP has a
           C-terminal catalytically inactive tyrosine phosphatase
           domain.
          Length = 342

 Score = 28.8 bits (65), Expect = 7.1
 Identities = 8/39 (20%), Positives = 20/39 (51%), Gaps = 1/39 (2%)

Query: 435 QIQEELNKHIAHKSKLKEQRKQNKINRNNEKADNNNVTK 473
           ++   L   +   S+L+++R+   I+    K+ NN++  
Sbjct: 190 EVVSSLRPLLNEVSELEKERE-RFISELEIKSRNNDILP 227


>gnl|CDD|222060 pfam13347, MFS_2, MFS/sugar transport protein.  This family is part
           of the major facilitator superfamily of membrane
           transport proteins.
          Length = 425

 Score = 28.7 bits (65), Expect = 7.7
 Identities = 13/81 (16%), Positives = 34/81 (41%), Gaps = 2/81 (2%)

Query: 38  SWVVSIIELGNLVTPIPIGFLVDYVGRKPCLLTTGPLYIISWLLVIF--TKHVYVLYVVR 95
           S ++ I  +  ++      +L    G+K   L    L  I  +L+ F     +++  V+ 
Sbjct: 260 SVLLLIGTIAAILGAPLWPWLAKRFGKKRTFLLGMLLAAIGLVLLFFLPPGSLWLFLVLV 319

Query: 96  FMQGLAMGIVFTVAPMYIGEI 116
            + G+ +G+   +    + ++
Sbjct: 320 VLAGIGLGLATLLPWAMLADV 340


>gnl|CDD|237902 PRK15075, PRK15075, citrate-proton symporter; Provisional.
          Length = 434

 Score = 28.8 bits (65), Expect = 7.8
 Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 15/72 (20%)

Query: 249 VISGMSAVLPYASS---------TFARTEGSLITPDECTLLLGILVFLSTF---PTA-FL 295
           V++GM  V     S         TF +T   L   D  +LL+ + V +S F   P    L
Sbjct: 240 VLAGMLMVAMTTVSFYLITVYTPTFGKTVLHLSAAD--SLLVTLCVGVSNFIWLPIGGAL 297

Query: 296 VDRTGRRPLLLV 307
            DR GRRP+L+ 
Sbjct: 298 SDRIGRRPVLIA 309


>gnl|CDD|140234 PTZ00207, PTZ00207, hypothetical protein; Provisional.
          Length = 591

 Score = 28.6 bits (64), Expect = 7.8
 Identities = 31/97 (31%), Positives = 43/97 (44%), Gaps = 16/97 (16%)

Query: 281 LGILVFLSTFPTAFLVDRTGRRPL----LLVSCFGSGISQLIAGTYYLLSENYTVDLSKF 336
           +GI V     P +F+ D  G RP+    + V C G   + L A T+  + E   V LS +
Sbjct: 70  VGIAVGYFLLPYSFIYDYLGPRPIFVLSMTVFCLG---TLLFALTFQEVIEGSVVRLSVY 126

Query: 337 NWIPLISITCFAVIYSIGLGPLVPTLQGEFFPSNTRG 373
           N +  +    F       LG +V  L    FPSN RG
Sbjct: 127 NGLMTLGCMLF------DLGAVVTVLS--VFPSN-RG 154


>gnl|CDD|226614 COG4129, COG4129, Predicted membrane protein [Function unknown].
          Length = 332

 Score = 28.5 bits (64), Expect = 8.8
 Identities = 22/126 (17%), Positives = 50/126 (39%), Gaps = 9/126 (7%)

Query: 101 AMGIVFTVAPMYIGEISGAKCRGALSTFFIGMLNTGILLEYTVGPYVDYDTLAYVSLVIP 160
           A G+V  +    +        +       I +    IL+   +  ++ ++    V + + 
Sbjct: 84  AFGVVLLIIIPLL-----VLLKLENGVVPITVGVLHILVAAMIPLFLIFNRFLLVFVGVG 138

Query: 161 VVFLMTFIWMPESPYFLIMKGRDVDAR-KSLFW--LRGGRESSKDKINLELSNIKQDVER 217
           V FL+  + MP   Y L +    V+A   S+ W      R++   +++ +L  + + + +
Sbjct: 139 VAFLVNLV-MPPPDYELKLYRAKVEAILASILWEVASYLRDTESAELDKDLEALLRLLIK 197

Query: 218 EMKLSD 223
             KL  
Sbjct: 198 LAKLIA 203


>gnl|CDD|182486 PRK10473, PRK10473, multidrug efflux system protein MdtL;
           Provisional.
          Length = 392

 Score = 28.4 bits (64), Expect = 9.4
 Identities = 18/54 (33%), Positives = 27/54 (50%)

Query: 56  GFLVDYVGRKPCLLTTGPLYIISWLLVIFTKHVYVLYVVRFMQGLAMGIVFTVA 109
           G + D  GRKP  +    L+II+ LL    +   +    RF+QG+  G  + VA
Sbjct: 59  GKIADRSGRKPVAIPGAALFIIASLLCSLAETSSLFLAGRFLQGIGAGCCYVVA 112


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.325    0.141    0.422 

Gapped
Lambda     K      H
   0.267   0.0760    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,481,304
Number of extensions: 2471889
Number of successful extensions: 3621
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3547
Number of HSP's successfully gapped: 162
Length of query: 478
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 377
Effective length of database: 6,457,848
Effective search space: 2434608696
Effective search space used: 2434608696
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 61 (27.2 bits)