RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16665
         (71 letters)



>gnl|CDD|233165 TIGR00879, SP, MFS transporter, sugar porter (SP) family.  This
           model represent the sugar porter subfamily of the major
           facilitator superfamily (pfam00083) [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 481

 Score = 40.0 bits (94), Expect = 2e-05
 Identities = 23/65 (35%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 3   LSVVLVDKAGRRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISVIMFIVMF 62
           +++ LVD+ GRR LLL+    M +CL VLG       +GS   S   + ++ +++FI  F
Sbjct: 338 VAIFLVDRFGRRPLLLIGAAGMAICLFVLGILGASFVTGSS-KSSGNVAIVFILLFIAFF 396

Query: 63  SIPWG 67
           ++ WG
Sbjct: 397 AMGWG 401


>gnl|CDD|215702 pfam00083, Sugar_tr, Sugar (and other) transporter. 
          Length = 449

 Score = 35.0 bits (81), Expect = 0.001
 Identities = 20/64 (31%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 4   SVVLVDKAGRRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISVIMFIVMFS 63
           ++ LVD+ GRR LLLL    M +C  VLG                 + ++ +++FI  F+
Sbjct: 306 AIFLVDRFGRRPLLLLGAAGMAICFLVLGVALLGVAK---SKGAGIVAIVFILLFIAFFA 362

Query: 64  IPWG 67
           + WG
Sbjct: 363 LGWG 366


>gnl|CDD|182225 PRK10077, xylE, D-xylose transporter XylE; Provisional.
          Length = 479

 Score = 33.5 bits (77), Expect = 0.004
 Identities = 17/65 (26%), Positives = 36/65 (55%), Gaps = 7/65 (10%)

Query: 3   LSVVLVDKAGRRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISVIMFIVMF 62
           L+++ VDK GR+ L ++  + M + +  LG  F+ +  G        + L+S++ ++  F
Sbjct: 325 LAIMTVDKFGRKPLQIIGALGMAIGMFSLGTAFYTQAPGI-------VALLSMLFYVAAF 377

Query: 63  SIPWG 67
           ++ WG
Sbjct: 378 AMSWG 382


>gnl|CDD|235909 PRK07021, fliL, flagellar basal body-associated protein FliL;
          Reviewed.
          Length = 162

 Score = 26.8 bits (60), Expect = 0.81
 Identities = 7/49 (14%), Positives = 22/49 (44%)

Query: 13 RRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISVIMFIVM 61
          R+L L++ ++++       G+ ++L    +  ++ A        +F  +
Sbjct: 16 RKLWLIILILLLLAAAAGAGYSWWLSKEQATAAAKAAPAPPPAPVFFPL 64


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 26.9 bits (60), Expect = 0.85
 Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 7/52 (13%)

Query: 14  RLLLLLSVIIMGLCLGVLGFYFFLK------NSGSDVSSIAFLPLISVIMFI 59
           R LL+L+ ++ GL  G +G  F L        S  D+  +  LP++ VI  I
Sbjct: 413 RPLLMLAGLLGGLGAG-IGLAFLLSQLRPTVRSVRDLRELTGLPVLGVIPMI 463


>gnl|CDD|182695 PRK10747, PRK10747, putative protoheme IX biogenesis protein;
          Provisional.
          Length = 398

 Score = 26.6 bits (59), Expect = 0.86
 Identities = 15/58 (25%), Positives = 34/58 (58%), Gaps = 5/58 (8%)

Query: 14 RLLLLLSVIIMGLCLG--VLGF--YFFLKNSGSDV-SSIAFLPLISVIMFIVMFSIPW 66
          ++LLL  ++I G+ +G  + G   Y  ++    ++ +S+  L +I ++  +V+F+I W
Sbjct: 3  KVLLLFVLLIAGIVVGPMIAGHQGYVLIQTDNYNIETSVTGLAIILILAMVVLFAIEW 60


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
          [Coenzyme metabolism].
          Length = 400

 Score = 25.8 bits (57), Expect = 1.6
 Identities = 14/64 (21%), Positives = 29/64 (45%), Gaps = 13/64 (20%)

Query: 13 RRLLLLLSVIIMGLCLG---------VLGFYFFLKNSGSDVSSIAFLPLISVIMFIVMFS 63
          RR+LLL  +++ G+ +G         VL       N   ++S    +  + +I  +V++ 
Sbjct: 2  RRVLLLFVLLLAGIGVGLAIAGQQGYVLIQT---DNYNIEMSLTTLVIFL-IIALVVLYL 57

Query: 64 IPWG 67
          + W 
Sbjct: 58 LEWL 61


>gnl|CDD|217378 pfam03124, EXS, EXS family.  We have named this region the EXS
           family after (ERD1, XPR1, and SYG1). This family
           includes C-terminus portions from the SYG1 G-protein
           associated signal transduction protein from
           Saccharomyces cerevisiae, and sequences that are thought
           to be murine leukaemia virus (MLV) receptors (XPR1).
           N-terminus portions from these proteins are aligned in
           the SPX pfam03105 family. The previously noted
           similarity between SYG1 and MLV receptors over their
           whole sequences is thus borne out in pfam03105 and this
           family. While the N-termini aligned in pfam03105 are
           thought to be involved in signal transduction, the role
           of the C-terminus sequences aligned in this family is
           not known. This region of similarity contains several
           predicted transmembrane helices. This family also
           includes the ERD1 (ERD: ER retention defective) yeast
           proteins. ERD1 proteins are involved in the localisation
           of endogenous endoplasmic reticulum (ER) proteins. erd1
           null mutants secrete such proteins even though they
           possess the C-terminal HDEL ER lumen localisation label
           sequence. In addition, null mutants also exhibit defects
           in the Golgi-dependent processing of several
           glycoproteins, which led to the suggestion that the
           sorting of luminal ER proteins actually occurs in the
           Golgi, with subsequent return of these proteins to the
           ER via `salvage' vesicles.
          Length = 337

 Score = 25.7 bits (57), Expect = 1.9
 Identities = 11/54 (20%), Positives = 25/54 (46%)

Query: 13  RRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISVIMFIVMFSIPW 66
           R LL L S   +   L  L +        +  +S   +PL+ +++ +++  +P+
Sbjct: 48  RELLELASFFTVLWLLSFLLYLLLEHLELTKPASWVQVPLVLLLIVLLILFLPF 101


>gnl|CDD|236106 PRK07818, PRK07818, dihydrolipoamide dehydrogenase; Reviewed.
          Length = 466

 Score = 25.8 bits (57), Expect = 2.0
 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 4/34 (11%)

Query: 20  SVIIMGLCLGVLG--FYFFLKNSGSDVSSIAFLP 51
           S++I G   G +G  F + LKN G DV+ + FL 
Sbjct: 174 SIVIAGA--GAIGMEFAYVLKNYGVDVTIVEFLD 205


>gnl|CDD|236726 PRK10618, PRK10618, phosphotransfer intermediate protein in
           two-component regulatory system with RcsBC; Provisional.
          Length = 894

 Score = 25.7 bits (57), Expect = 2.3
 Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 1/39 (2%)

Query: 3   LSVVLVDKAGRRLLLLLSVIIMGLCLGVLGFYFFLKNSG 41
           L  +L+D     L  LL  + + L L + G+Y F    G
Sbjct: 291 LGTLLLDLLQNNLWPLLLNLGL-LALSLFGYYTFRHQYG 328


>gnl|CDD|222572 pfam14155, DUF4307, Domain of unknown function (DUF4307).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are typically between 132 and 153 amino
          acids in length. There is a single completely conserved
          residue C that may be functionally important.
          Length = 112

 Score = 24.9 bits (55), Expect = 3.0
 Identities = 11/39 (28%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 13 RRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSS--IAF 49
          RR LL +  +++ + L ++ +  + +  G DVS+  I F
Sbjct: 3  RRKLLAVGAVLVVVALVLVAWIGYQQVGGPDVSAEVIGF 41


>gnl|CDD|129965 TIGR00887, 2A0109, phosphate:H+ symporter.  This model represents
           the phosphate uptake symporter subfamily of the major
           facilitator superfamily (pfam00083) [Transport and
           binding proteins, Anions].
          Length = 502

 Score = 25.1 bits (55), Expect = 3.6
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 4   SVVLVDKAGRRLLLLLSVIIMGLCLGVLGFYF 35
           +V LVD  GR+ + L+   I+ +   VLGF +
Sbjct: 355 TVFLVDIIGRKPIQLMGFFILTVLFFVLGFAY 386


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score = 25.1 bits (55), Expect = 3.8
 Identities = 11/27 (40%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 9  DKAGRRLLLLLSVIIMGLCLGVLGFYF 35
           K  + LL++LSV  + L  G   FYF
Sbjct: 30 LKGKKSLLVVLSVSAICLVAGSA-FYF 55


>gnl|CDD|235476 PRK05454, PRK05454, glucosyltransferase MdoH; Provisional.
          Length = 605

 Score = 24.8 bits (55), Expect = 4.3
 Identities = 10/51 (19%), Positives = 18/51 (35%), Gaps = 1/51 (1%)

Query: 13 RRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISVIMFIVMFS 63
          RRL+LL   +     +        L   G  +   A L L +++   +   
Sbjct: 41 RRLILLGLTLAQTA-VATWEMKAVLPYGGWTLLEPALLVLFALLFAWISLG 90


>gnl|CDD|224475 COG1559, COG1559, Aminodeoxychorismate lyase [Coenzyme transport
          and metabolism].
          Length = 342

 Score = 24.5 bits (54), Expect = 5.2
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 10 KAGRRLLLLLSVIIMGLCLGVLGFYFFL 37
          K      LL+ V+++ L LG +G+   L
Sbjct: 5  KKKLLKTLLVLVLVLLLILGGVGYIGIL 32


>gnl|CDD|224496 COG1580, FliL, Flagellar basal body-associated protein [Cell
          motility and secretion].
          Length = 159

 Score = 24.6 bits (54), Expect = 5.2
 Identities = 8/35 (22%), Positives = 15/35 (42%)

Query: 8  VDKAGRRLLLLLSVIIMGLCLGVLGFYFFLKNSGS 42
            K     +LL+ +I++    G   F++F   S  
Sbjct: 12 KKKKKSLWILLIVLIVLLALAGAGYFFWFGSKSED 46


>gnl|CDD|166864 PRK00234, PRK00234, Maf-like protein; Reviewed.
          Length = 192

 Score = 24.3 bits (53), Expect = 5.5
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 31  LGFYFFLKNSGSDVSSIAFLPLISVI 56
           LG   F    G D +S+  LPLI ++
Sbjct: 156 LGVSLFRSTEGEDATSLIGLPLIRLV 181


>gnl|CDD|233176 TIGR00898, 2A0119, cation transport protein.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 505

 Score = 24.6 bits (54), Expect = 5.8
 Identities = 10/31 (32%), Positives = 22/31 (70%)

Query: 3   LSVVLVDKAGRRLLLLLSVIIMGLCLGVLGF 33
           ++++L+D+ GRR  +  S+++ G+ L +L F
Sbjct: 374 ITLLLIDRLGRRYTMAASLLLAGVALLLLLF 404


>gnl|CDD|222083 pfam13373, DUF2407_C, DUF2407 C-terminal domain.  This is a family
           of proteins found in fungi. The function is not known.
           There is a characteristic GFDRL sequence motif.
          Length = 130

 Score = 24.1 bits (53), Expect = 6.0
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 18  LLSVIIMGLCLGVLGFYFFLKNSG 41
           LL  ++MG  LGV    + L+  G
Sbjct: 82  LLIGMLMGFFLGVGSLVWLLREEG 105


>gnl|CDD|183743 PRK12785, fliL, flagellar basal body-associated protein FliL;
          Reviewed.
          Length = 166

 Score = 24.2 bits (53), Expect = 6.1
 Identities = 7/34 (20%), Positives = 16/34 (47%)

Query: 10 KAGRRLLLLLSVIIMGLCLGVLGFYFFLKNSGSD 43
             ++ L++++   + L  G  G +FF  + G  
Sbjct: 20 PKKKKKLIIIAAAAVLLLGGGGGGFFFFFSHGHG 53


>gnl|CDD|218809 pfam05915, DUF872, Eukaryotic protein of unknown function
          (DUF872).  This family consists of several
          uncharacterized eukaryotic proteins. The function of
          this family is unknown.
          Length = 108

 Score = 23.9 bits (52), Expect = 6.6
 Identities = 10/47 (21%), Positives = 23/47 (48%)

Query: 13 RRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISVIMFI 59
          + + L   ++++G  L V+G + F      D S      ++ +++FI
Sbjct: 35 KVIALAFLLLLLGALLLVIGVFIFTGPMEGDGSQAYVFFVLGILLFI 81


>gnl|CDD|233168 TIGR00883, 2A0106, metabolite-proton symporter.  This model
          represents the metabolite:H+ symport subfamily of the
          major facilitator superfamily (pfam00083), including
          citrate-H+ symporters, dicarboxylate:H+ symporters, the
          proline/glycine-betaine transporter ProP, etc
          [Transport and binding proteins, Unknown substrate].
          Length = 394

 Score = 24.2 bits (53), Expect = 8.0
 Identities = 7/24 (29%), Positives = 18/24 (75%)

Query: 9  DKAGRRLLLLLSVIIMGLCLGVLG 32
          D+ GR+  L++++++MG+   ++G
Sbjct: 60 DRIGRKKTLVITLLMMGIGTLLIG 83


>gnl|CDD|217379 pfam03125, Sre, C. elegans Sre G protein-coupled chemoreceptor.
           Caenorhabditis elegans Sre proteins are candidate
           chemosensory receptors. There are four main recognised
           groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre
           (this family), Sra pfam02117 and Srb pfam02175 comprise
           the Sra group. All of the above receptors are thought to
           be G protein-coupled seven transmembrane domain
           proteins. The existence of several different
           chemosensory receptors underlies the fact that in spite
           of having only 20-30 chemosensory neurones, C. elegans
           detects hundreds of different chemicals, with the
           ability to discern individual chemicals among
           combinations.
          Length = 363

 Score = 23.8 bits (52), Expect = 9.5
 Identities = 16/60 (26%), Positives = 27/60 (45%), Gaps = 4/60 (6%)

Query: 10  KAGRRLLLLLSVIIMGLCLGVLGFYFFLKNSGSDVSSIAFLPLISV----IMFIVMFSIP 65
           K G+RL+ ++  II+    G+L   F L        +  F   + +    I   +MFS+P
Sbjct: 248 KLGKRLVFVVLFIIVLCGSGILALSFDLIPLLRTFLNHIFENSLFLYPLLICPTIMFSVP 307


>gnl|CDD|147008 pfam04639, Baculo_E56, Baculoviral E56 protein, specific to ODV
           envelope.  This family represents the E56 protein, which
           is localises to the occlusion derived virus (ODV)
           envelope, but not to the budded virus (BV) envelope.
          Length = 308

 Score = 23.8 bits (52), Expect = 9.6
 Identities = 9/23 (39%), Positives = 13/23 (56%), Gaps = 1/23 (4%)

Query: 38  KNSGSDVSSIAFLPLISVIMFIV 60
            NS   VS  + LP+I VI  ++
Sbjct: 269 SNSSKSVSD-SLLPIILVIGGVL 290


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.334    0.151    0.450 

Gapped
Lambda     K      H
   0.267   0.0829    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,670,425
Number of extensions: 303948
Number of successful extensions: 1756
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1708
Number of HSP's successfully gapped: 308
Length of query: 71
Length of database: 10,937,602
Length adjustment: 41
Effective length of query: 30
Effective length of database: 9,119,088
Effective search space: 273572640
Effective search space used: 273572640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.5 bits)
S2: 53 (24.0 bits)