RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3469
         (154 letters)



>gnl|CDD|129972 TIGR00894, 2A0114euk, Na(+)-dependent inorganic phosphate
           cotransporter.  [Transport and binding proteins,
           Anions].
          Length = 465

 Score =  110 bits (278), Expect = 2e-29
 Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 1/123 (0%)

Query: 3   LGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFT 62
           +G  GP +   A  Y     YLT+ ILT+    +    +G  +N LD++PRF G +   T
Sbjct: 342 IGGLGPGIFAYALPYLSAAFYLTIIILTLANAVSSGPLAGVLINSLDLAPRFLGFIKGIT 401

Query: 63  NCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQPWDNP 122
                + GL+A  +AG  I  + ++  W +VF+  A V +    FY+   S +RQ W   
Sbjct: 402 GLPGFIGGLIASTLAG-NILSQDSKNVWLIVFLIMAFVNILCVIFYLIFGSAERQDWAKE 460

Query: 123 NND 125
             D
Sbjct: 461 EKD 463


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 40.4 bits (95), Expect = 1e-04
 Identities = 22/110 (20%), Positives = 43/110 (39%), Gaps = 4/110 (3%)

Query: 1   MILGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMS 60
           +++G    A+ L+  +       L V +L +G G   A  +   +      P   G    
Sbjct: 245 LLIGLLLAALGLLLLALAPSLALLLVALLLLGFGLGFAFPALLTLASELAPPEARGTASG 304

Query: 61  FTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIF 110
             N   +L G L P++AG ++        +  VF+  A + L ++   + 
Sbjct: 305 LFNTFGSLGGALGPLLAGLLLDTG----GYGGVFLILAALALLAALLLLL 350



 Score = 30.7 bits (70), Expect = 0.24
 Identities = 20/102 (19%), Positives = 40/102 (39%), Gaps = 4/102 (3%)

Query: 9   AVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANL 68
           A+  +  ++      L VG   +GLG      +   +      P+  G  +   +    L
Sbjct: 75  ALGSLLLAFASSLWLLLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSAGFGL 134

Query: 69  AGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIF 110
             LL P++ G + +       WR +F+  A++ L  +   +F
Sbjct: 135 GALLGPLLGGLLAE----SLGWRWLFLILAILGLLLALLLLF 172


>gnl|CDD|233174 TIGR00893, 2A0114, D-galactonate transporter.  [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 399

 Score = 36.9 bits (86), Expect = 0.002
 Identities = 23/99 (23%), Positives = 44/99 (44%), Gaps = 2/99 (2%)

Query: 2   ILGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSF 61
           I+     ++ + A +Y    PY  + ++ +G    GA   G+ +   +     AG+    
Sbjct: 293 IIAGLVLSLLMFATNYVN-IPYAALALVALGFFGLGAGAIGWALISDNAPGNIAGLTGGL 351

Query: 62  TNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVV 100
            N L NL G++ PI+ G +     +  A  ++ VAA  +
Sbjct: 352 INSLGNLGGIVGPIVIGAIAATTGS-FAGALMVVAALAL 389


>gnl|CDD|217379 pfam03125, Sre, C. elegans Sre G protein-coupled chemoreceptor.
           Caenorhabditis elegans Sre proteins are candidate
           chemosensory receptors. There are four main recognised
           groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre
           (this family), Sra pfam02117 and Srb pfam02175 comprise
           the Sra group. All of the above receptors are thought to
           be G protein-coupled seven transmembrane domain
           proteins. The existence of several different
           chemosensory receptors underlies the fact that in spite
           of having only 20-30 chemosensory neurones, C. elegans
           detects hundreds of different chemicals, with the
           ability to discern individual chemicals among
           combinations.
          Length = 363

 Score = 31.5 bits (72), Expect = 0.16
 Identities = 12/63 (19%), Positives = 16/63 (25%), Gaps = 6/63 (9%)

Query: 32  GLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWR 91
            L       SG      D+ P     L         L  LL      + +       AW+
Sbjct: 257 VLFIIVLCGSGILALSFDLIPLLRTFLNHIFENSLFLYPLLICPTIMFSVP------AWK 310

Query: 92  VVF 94
             F
Sbjct: 311 KEF 313


>gnl|CDD|178518 PLN02930, PLN02930, CDP-diacylglycerol-serine
           O-phosphatidyltransferase.
          Length = 353

 Score = 30.9 bits (70), Expect = 0.20
 Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 7/62 (11%)

Query: 62  TNCLANLAGLLAPIIAGYVIQG------KPTQAAWRVVFVAAAVVYLFSSTFYIFAASGD 115
           T+    +  ++A  +A  ++Q       +P  A WR+V    AVVYL + TF +F    D
Sbjct: 36  TSVKRGVWAMIAVFLAYCLLQAPSTVLIRPHPAIWRLVH-GMAVVYLVALTFLLFQKRDD 94

Query: 116 RQ 117
            +
Sbjct: 95  AR 96


>gnl|CDD|225646 COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport
           and metabolism].
          Length = 498

 Score = 30.7 bits (70), Expect = 0.25
 Identities = 31/127 (24%), Positives = 39/127 (30%), Gaps = 26/127 (20%)

Query: 1   MILGQYGPAVCLVAASYT------GCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRF 54
            +LG     + L A                  G L IGL         FK N   IS   
Sbjct: 87  RVLGTRR-TIVLGAILMAIGHLVLAISSVSGPGGLYIGLALIIVGTGLFKPN---ISS-L 141

Query: 55  AGILMS---------FTNCLA--NLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLF 103
            G L           FT      N+  L+APII G +         W V F  AAV  + 
Sbjct: 142 LGELYPKDDPRRDGGFTLFYMGINIGSLIAPIITGLLAINYG----WHVGFGLAAVGMII 197

Query: 104 SSTFYIF 110
               ++ 
Sbjct: 198 GLVIFLL 204


>gnl|CDD|145924 pfam03034, PSS, Phosphatidyl serine synthase.  Phosphatidyl serine
           synthase is also known as serine exchange enzyme. This
           family represents eukaryotic PSS I and II which are
           membrane bound proteins which catalyzes the replacement
           of the head group of a phospholipid (phosphotidylcholine
           or phosphotidylethanolamine) by L-serine.
          Length = 278

 Score = 29.6 bits (67), Expect = 0.55
 Identities = 10/26 (38%), Positives = 15/26 (57%), Gaps = 1/26 (3%)

Query: 85  PTQAAWRVVFVAAAVVYLFSSTFYIF 110
           P  A WR+V    +V+YL +  F +F
Sbjct: 1   PHPAFWRLVL-GLSVLYLLALVFLLF 25


>gnl|CDD|211363 cd09252, AP-3_Mu3_Cterm, C-terminal domain of medium Mu3 subunit in
           adaptor protein (AP) complex AP-3.  AP complexes
           participate in the formation of intracellular coated
           transport vesicles and select cargo molecules for
           incorporation into the coated vesicles in the late
           secretory and endocytic pathways. There are four AP
           complexes, AP-1, AP-2, AP-3, and AP-4, described in
           various eukaryotic organisms. Each AP complex consists
           of four subunits: two large chains (one each of
           gamma/alpha/delta/epsilon and beta1-4, respectively), a
           medium mu chain (mu1-4), and a small sigma chain
           (sigma1-4). Each of the four subunits from the different
           AP complexes exhibits similarity with each other. This
           family corresponds to the C-terminal domain of
           heterotetrameric adaptor protein complex 3 (AP-3) medium
           mu3 subunit, which includes two closely related
           homologs, mu3A (P47A, encoded by ap3m1) and mu1B (P47B,
           encoded by ap3m2). Mu3A is ubiquitously expressed, but
           mu3B is specifically expressed in neurons and
           neuroendocrine cells. AP-3 is particularly important for
           targeting integral membrane proteins to lysosomes and
           lysome-related organelles at trans-Golgi network (TGN)
           and/or endosomes, such as the yeast vacuole, fly pigment
           granules and mammalian melanosomes, platelet dense
           bodies and the secretory lysosomes of cytotoxic T
           lymphocytes. Unlike AP-1 and AP-2, which function in
           conjunction with clathrin which is a scaffolding protein
           participating in the formation of coated vesicles, the
           nature of the outer shell of AP-3 containing coats
           remains to be elucidated. Membrane-anchored cargo
           molecules interact with adaptors through short sorting
           signals in their cytosolic segments. Tyrosine-based
           endocytotic signals are one of the most important
           sorting signals. They are of the form Y-X-X-Phi, where Y
           is tyrosine, X is any amino acid and Phi is a bulky
           hydrophobic residue that can be Leu, Ile, Met, Phe, or
           Val. These kinds of sorting signals can be recognized by
           the C-terminal domain of AP-3 mu3 subunit, also known as
           Y-X-X-Phi signal-binding domain that contains two
           hydrophobic pockets, one for the tyrosine-binding and
           one for the bulky hydrophobic residue-binding.
          Length = 248

 Score = 29.5 bits (67), Expect = 0.62
 Identities = 10/23 (43%), Positives = 14/23 (60%), Gaps = 2/23 (8%)

Query: 30  TIGLGF--NGAVYSGFKVNHLDI 50
           +I + F   G + SG KV+ LDI
Sbjct: 206 SISVSFKIPGYLPSGLKVDSLDI 228


>gnl|CDD|148986 pfam07682, SOR, Sulphur oxygenase reductase.  The sulphur
          oxygenase/reductase (SOR) of the thermo-acidophilic
          archaeon Acidianus ambivalens is an unusual enzyme
          consisting of 24 identical subunits arranged in a
          perfectly symmetrical hollow sphere and containing a
          mononuclear non-heme iron centre (personal
          communication: A. Kletzin). At 85 degrees C in vitro,
          elemental sulphur is oxidized to sulphite, thiosulphate
          and hydrogen sulphide with no external cofactors
          needed. The proposed equation is: 4S + O2 + 4 H2O --->
          2 HSO3- + 2 H2S + 2 H+.
          Length = 303

 Score = 29.1 bits (65), Expect = 0.92
 Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 2  ILGQYGPAVCLVAA---SYTGCDPYLTVGILTIGLGFNGA 38
          +    GP VC+V A    + G   ++ +G+L +G  F GA
Sbjct: 21 LFSAVGPKVCMVTARHPGFVGFQNHVQIGVLPLGERFGGA 60


>gnl|CDD|219516 pfam07690, MFS_1, Major Facilitator Superfamily. 
          Length = 346

 Score = 28.9 bits (65), Expect = 0.94
 Identities = 19/110 (17%), Positives = 39/110 (35%), Gaps = 4/110 (3%)

Query: 1   MILGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMS 60
           +++G    A+ L+   +      L V  +  GLG      +   +      P   G  + 
Sbjct: 65  LLIGLLLFALGLLLLLFASSLWLLLVLRVLQGLGGGALFPAAAALIADWFPPEERGRALG 124

Query: 61  FTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIF 110
             +    L   L P++ G +     +   WR  F+  A++ L ++     
Sbjct: 125 LLSAGFGLGAALGPLLGGLLA----SLFGWRAAFLILAILALLAAVLAAL 170


>gnl|CDD|233175 TIGR00895, 2A0115, benzoate transport.  [Transport and binding
           proteins, Carbohydrates, organic alcohols, and acids].
          Length = 398

 Score = 28.5 bits (64), Expect = 1.6
 Identities = 17/95 (17%), Positives = 32/95 (33%), Gaps = 4/95 (4%)

Query: 9   AVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANL 68
           +V  +  +       L +     GLG  G + +   +       RF G  +    C   +
Sbjct: 93  SVFTLLCALATNVTQLLILRFLAGLGLGGLMPNLNALVSEYAPKRFRGTAVGLMFCGYPI 152

Query: 69  AGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLF 103
              +   +AG++I        WR +F    +  L 
Sbjct: 153 GAAVGGFLAGWLI----PVFGWRSLFYVGGIAPLL 183


>gnl|CDD|225371 COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport
           and metabolism].
          Length = 394

 Score = 28.3 bits (64), Expect = 1.8
 Identities = 22/109 (20%), Positives = 41/109 (37%), Gaps = 4/109 (3%)

Query: 9   AVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANL 68
           A+ L+A ++TG  P L + +L +          G +     ++P  A +  S      NL
Sbjct: 287 ALALLALTFTGASPALALALLFLWGFAFSPALQGLQTRLARLAPDAADLAGSLNVAAFNL 346

Query: 69  AGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQ 117
              L   + G V+        +       A + L +    + +A  DR+
Sbjct: 347 GIALGAALGGLVLD----ALGYAATGWVGAALLLLALLLALLSARKDRR 391


>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in eukaryotes, and is typically
           between 228 and 257 amino acids in length.
          Length = 242

 Score = 27.8 bits (62), Expect = 2.1
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 3/49 (6%)

Query: 103 FSSTFYIFAASGDRQPWDNPNNDAPTPRAATKRDASNNNRTSYASDVED 151
           F S F I       Q   +P   + TP   T +  S N ++S   DV+D
Sbjct: 176 FWSEFGIDTPG---QKSKSPQKASSTPAGNTNQGQSQNAQSSNLLDVDD 221


>gnl|CDD|239250 cd02952, TRP14_like, Human TRX-related protein 14 (TRP14)-like
           family; composed of proteins similar to TRP14, a 14kD
           cytosolic protein that shows disulfide reductase
           activity in vitro with a different substrate specificity
           compared with another human cytosolic protein, TRX1.
           TRP14 catalyzes the reduction of small
           disulfide-containing peptides but does not reduce
           disulfides of ribonucleotide reductase, peroxiredoxin
           and methionine sulfoxide reductase, which are TRX1
           substrates. TRP14 also plays a role in tumor necrosis
           factor (TNF)-alpha signaling pathways, distinct from
           that of TRX1. Its depletion promoted TNF-alpha induced
           activation of c-Jun N-terminal kinase and
           mitogen-activated protein kinases.
          Length = 119

 Score = 26.9 bits (60), Expect = 2.8
 Identities = 9/22 (40%), Positives = 12/22 (54%)

Query: 108 YIFAASGDRQPWDNPNNDAPTP 129
           +I+   GDR  W +PNN   T 
Sbjct: 63  FIYCDVGDRPYWRDPNNPFRTD 84


>gnl|CDD|220557 pfam10086, DUF2324, Putative membrane peptidase family (DUF2324).
           This domain, found in various hypothetical bacterial
           proteins, has no known function. This family appears to
           be related to the prenyl protease 2 family pfam02517,
           suggesting this family may be peptidases.
          Length = 222

 Score = 27.2 bits (61), Expect = 3.7
 Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 71  LLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFS 104
           +L P++  YV   K  + +WRV  + A V  LF+
Sbjct: 3   ILLPVLL-YVYLRKRKKLSWRVFGLGALVFALFA 35


>gnl|CDD|218922 pfam06168, DUF981, Protein of unknown function (DUF981).  Family of
           uncharacterized proteins found in bacteria and archaea.
          Length = 189

 Score = 26.9 bits (60), Expect = 4.3
 Identities = 25/123 (20%), Positives = 44/123 (35%), Gaps = 31/123 (25%)

Query: 2   ILGQYGPAVCLVAASYTGC--DPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGI-- 57
           I G +G     +  SY     DPYL +G++ +      ++Y G+ +  L I   F G+  
Sbjct: 53  ITGLWGELTWPLPGSYNILFGDPYLLLGLILLSAAI--SLYKGYDLRPLGILALFLGLYT 110

Query: 58  --------------------LMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAA 97
                               L  +   L  LAGLL P +A   +     +    ++    
Sbjct: 111 IYYGVAIYNYGLTREPLLALLGLYI--LTGLAGLLPPTLA---LDRLKGKKILLILGAVI 165

Query: 98  AVV 100
            ++
Sbjct: 166 LIL 168


>gnl|CDD|235348 PRK05122, PRK05122, major facilitator superfamily transporter;
           Provisional.
          Length = 399

 Score = 27.2 bits (61), Expect = 4.4
 Identities = 27/97 (27%), Positives = 37/97 (38%), Gaps = 10/97 (10%)

Query: 23  YLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANLAGLLAPIIAGYVIQ 82
           +  +  LTIGL    AV  G+  + L  S   AG+++S    L  LA LL+   AG    
Sbjct: 23  FTFISYLTIGLPL--AVLPGYVHDQLGFSAFLAGLVIS----LQYLATLLSRPHAGRYAD 76

Query: 83  GKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQPW 119
              T    + V        L S   Y+ A      P 
Sbjct: 77  ---TLGPKKAVVFGLCGCAL-SGLLYLLAGLLAAWPV 109


>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
          Length = 985

 Score = 27.1 bits (60), Expect = 4.4
 Identities = 8/16 (50%), Positives = 11/16 (68%)

Query: 108 YIFAASGDRQPWDNPN 123
           +IF+    R+ WDNPN
Sbjct: 486 FIFSKWALREGWDNPN 501


>gnl|CDD|178126 PLN02510, PLN02510, probable 1-acyl-sn-glycerol-3-phosphate
           acyltransferase.
          Length = 374

 Score = 27.0 bits (60), Expect = 4.4
 Identities = 12/50 (24%), Positives = 23/50 (46%), Gaps = 2/50 (4%)

Query: 59  MSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFY 108
           +S   CLAN   ++A  +           + W  ++V+ +  YL S+T +
Sbjct: 307 LSTLKCLANFLAVIA--LTTVFTYLTLFSSVWFKIYVSLSCAYLTSATHF 354


>gnl|CDD|235151 PRK03699, PRK03699, putative transporter; Provisional.
          Length = 394

 Score = 27.2 bits (61), Expect = 4.4
 Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 61  FTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYL 102
           FT+   ++AG++ PIIA Y++        W  V+    +VY+
Sbjct: 135 FTDSFFSMAGMIFPIIAAYLLARS---IEWYWVYACIGLVYV 173


>gnl|CDD|226812 COG4377, COG4377, Predicted membrane protein [Function unknown].
          Length = 258

 Score = 26.7 bits (59), Expect = 4.9
 Identities = 9/40 (22%), Positives = 15/40 (37%)

Query: 65  LANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFS 104
           +     LLA  I       +  Q    V+ + A   ++FS
Sbjct: 12  IITAIALLAFPIGSIWWAKRKYQINLAVLGLGAVAFFVFS 51


>gnl|CDD|182964 PRK11102, PRK11102, bicyclomycin/multidrug efflux system;
           Provisional.
          Length = 377

 Score = 26.8 bits (60), Expect = 4.9
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 58  LMSFTNCLANLAGLLAPIIAGYV 80
           +MSF   +  +A LLAPII G++
Sbjct: 116 MMSFVTLVMTIAPLLAPIIGGWL 138


>gnl|CDD|152471 pfam12036, DUF3522, Protein of unknown function (DUF3522).  This
           family of proteins is functionally uncharacterized. This
           protein is found in eukaryotes. Proteins in this family
           are typically between 220 to 787 amino acids in length.
          Length = 183

 Score = 26.6 bits (59), Expect = 5.0
 Identities = 16/58 (27%), Positives = 22/58 (37%), Gaps = 15/58 (25%)

Query: 54  FAGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVY---LFSSTFY 108
           F  +L      L+NLA L    +            A +  +V  AVVY   +F S  Y
Sbjct: 2   FEQLLQFLLLTLSNLAFLPTIYV------------ALKRRYVYEAVVYIFTMFFSFMY 47


>gnl|CDD|223856 COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational
           modification, protein turnover, chaperones].
          Length = 220

 Score = 26.5 bits (59), Expect = 5.1
 Identities = 14/59 (23%), Positives = 22/59 (37%), Gaps = 5/59 (8%)

Query: 55  AGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAAS 113
           AG+L   + C   L   L P    Y+  G    A   V+  +   V  F++ F +    
Sbjct: 19  AGLLSFLSPC--VLP--LLPAYLSYLAGGSLG-ARKSVLLASLLFVLGFATVFVLLGIG 72


>gnl|CDD|224191 COG1272, COG1272, Predicted membrane protein, hemolysin III homolog
           [General function prediction only].
          Length = 226

 Score = 26.8 bits (60), Expect = 5.2
 Identities = 12/72 (16%), Positives = 23/72 (31%), Gaps = 10/72 (13%)

Query: 58  LMSFTNCLAN----LAGLLAPIIAG------YVIQGKPTQAAWRVVFVAAAVVYLFSSTF 107
             S+   +AN    L G +  I+         +I G         ++  +  +    ST 
Sbjct: 15  SYSWHEEIANAITHLIGAILAIVGLVLLLVYALITGSALAVIVFSIYGLSLFLLFLVSTL 74

Query: 108 YIFAASGDRQPW 119
           Y    +G +   
Sbjct: 75  YHSIPNGQKAKA 86


>gnl|CDD|185739 cd08998, GH43_ABN_1, Glycosyl hydrolase family 43.  This glycosyl
           hydrolase family 43 (GH43) includes mostly enzymes with
           alpha-L-arabinofuranosidase (AFN; EC 3.2.1.55) and
           endo-alpha-L-arabinanase (ABN; EC 3.2.1.99) activities.
           These are inverting enzymes (i.e. they invert the
           stereochemistry of the anomeric carbon atom of the
           substrate) that have an aspartate as the catalytic
           general base, a glutamate as the catalytic general acid
           and another aspartate that is responsible for pKa
           modulation and orienting the catalytic acid. The GH43
           ABN enzymes hydrolyze alpha-1,5-L-arabinofuranoside
           linkages while the ABF enzymes cleave arabinose side
           chains so that the combined actions of these two enzymes
           reduce arabinan to L-arabinose and/or
           arabinooligosaccharides. These arabinan-degrading
           enzymes are important in the food industry for efficient
           production of L-arabinose from agricultural waste;
           L-arabinose is often used as a bioactive sweetener. A
           common structural feature of GH43 enzymes is a 5-bladed
           beta-propeller domain that contains the catalytic acid
           and catalytic base. A long V-shaped groove, partially
           enclosed at one end, forms a single extended
           substrate-binding surface across the face of the
           propeller.
          Length = 288

 Score = 26.8 bits (60), Expect = 5.3
 Identities = 14/50 (28%), Positives = 18/50 (36%), Gaps = 1/50 (2%)

Query: 101 YLFSSTFYIFAA-SGDRQPWDNPNNDAPTPRAATKRDASNNNRTSYASDV 149
           YLFS+   I  A S D   W    +  PT            +   +A DV
Sbjct: 14  YLFSTGNGIAIAKSKDLGNWTYVGSVFPTGPPWWSSIDPGGSGNLWAPDV 63


>gnl|CDD|220948 pfam11027, DUF2615, Protein of unknown function (DUF2615).  This
           small. approximately 100 residue, family is conserved
           from worms to humans. It is cysteine-rich with a
           characteristic FDxCEC sequence motif. The function is
           not known.
          Length = 103

 Score = 25.8 bits (57), Expect = 5.4
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 7/69 (10%)

Query: 62  TNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVV-YLFSSTFYIFAASGDRQPWD 120
           T CL +L G   P  +G    G  +     + ++  A+  YL      +   SGD +P D
Sbjct: 35  TECLQDLPG--RPQGSG---TGGISGFMLMMAWMLLAMALYLLRP-SSLRNPSGDNKPRD 88

Query: 121 NPNNDAPTP 129
           N +ND P P
Sbjct: 89  NNSNDGPPP 97


>gnl|CDD|225365 COG2807, CynX, Cyanate permease [Inorganic ion transport and
           metabolism].
          Length = 395

 Score = 26.1 bits (58), Expect = 8.2
 Identities = 12/57 (21%), Positives = 21/57 (36%)

Query: 50  ISPRFAGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSST 106
           +S   AG L+S          LL P++A      +P      ++ +   V  L +  
Sbjct: 240 LSAAEAGSLLSLMQLAQLPTALLIPLLARRSKNQRPLVVLALLLMLVGLVGLLLAPG 296


>gnl|CDD|233167 TIGR00881, 2A0104, phosphoglycerate transporter family protein.
           [Transport and binding proteins, Carbohydrates, organic
           alcohols, and acids].
          Length = 379

 Score = 26.2 bits (58), Expect = 8.3
 Identities = 25/138 (18%), Positives = 46/138 (33%), Gaps = 26/138 (18%)

Query: 24  LTVGILTIGLGFNGAVYS--------------GFKVNHLDI----SPRFAGILMSFTNCL 65
           +   I+ +  GF+ +++               G+      +    S    G  +SF NC 
Sbjct: 68  ILCAIVNLFFGFSTSLWVMAALWALNGIFQGMGWPPCGRTVTKWFSRSERGTWVSFWNCS 127

Query: 66  ANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQ-----PWD 120
            N+ G L P +  + I       +W  VF+   ++ +  S           Q     P +
Sbjct: 128 HNVGGGLLPPLVLFGIA---ELYSWHWVFIVPGIIAIIVSLICFLLLRDSPQSCGLPPAE 184

Query: 121 NPNNDAPTPRAATKRDAS 138
              N+ P   A  K +  
Sbjct: 185 EMPNEEPDGDAEKKEEEL 202


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.136    0.419 

Gapped
Lambda     K      H
   0.267   0.0748    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,876,747
Number of extensions: 725968
Number of successful extensions: 836
Number of sequences better than 10.0: 1
Number of HSP's gapped: 825
Number of HSP's successfully gapped: 93
Length of query: 154
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 65
Effective length of database: 6,990,096
Effective search space: 454356240
Effective search space used: 454356240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 55 (24.9 bits)