RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3380
         (290 letters)



>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
           mucin-type O-linked glycans.  UDP-GalNAc: polypeptide
           alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
           initiate the formation of mucin-type, O-linked glycans
           by catalyzing the transfer of
           alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
           hydroxyl groups of Ser or Thr residues of core proteins
           to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
           enzymes are type II membrane proteins with a GT-A type
           catalytic domain and a lectin domain located on the
           lumen side of the Golgi apparatus. In human, there are
           15 isozymes of pp-GalNAc-Ts, representing the largest of
           all glycosyltransferase families. Each isozyme has
           unique but partially redundant substrate specificity for
           glycosylation sites on acceptor proteins.
          Length = 299

 Score =  175 bits (445), Expect = 3e-53
 Identities = 91/234 (38%), Positives = 114/234 (48%), Gaps = 57/234 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL  +A N   VV P+I  I  DTFE R        S     GGFDW+L 
Sbjct: 94  CEVNVGWLEPLLARIAENRKTVVCPIIDVIDADTFEYR-------GSSGDARGGFDWSLH 146

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE ER R ++   P+ +PTMAGGLF+ID+ +F +LG YD G DIWGGENLELSF
Sbjct: 147 FKWLPLPEEER-RRESPTAPIRSPTMAGGLFAIDREWFLELGGYDEGMDIWGGENLELSF 205

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG------------LFSIDKAFFEKLGTYDS- 170
           K                    VW     GG            +F   +  +   G   + 
Sbjct: 206 K--------------------VW---QCGGSIEIVPCSRVGHIFRRKRKPYTFPGGSGTV 242

Query: 171 ------GFDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
                   ++W  E  E  +K        D+GD++ RK LR  L CKSFKWYLE
Sbjct: 243 LRNYKRVAEVWMDEYKEYFYKARPELRNIDYGDLSERKALRERLKCKSFKWYLE 296


>gnl|CDD|216044 pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain. 
          Length = 124

 Score = 74.1 bits (182), Expect = 1e-16
 Identities = 36/79 (45%), Positives = 45/79 (56%), Gaps = 9/79 (11%)

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR---DEACLDYAGGD--- 269
            SG C+D      D   PVGLYPCH  GGNQ W ++  G IR       CLD +GG    
Sbjct: 10  RSGKCLDVPGGSAD-GGPVGLYPCHG-GGNQLWTLTGDGTIRSNGNSNLCLDVSGGGNGS 67

Query: 270 -VILYPCHGSKGNQYFEYD 287
            V+L+PC+G  GNQ ++YD
Sbjct: 68  KVVLWPCNGGSGNQRWDYD 86



 Score = 49.5 bits (118), Expect = 1e-07
 Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 9/76 (11%)

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI--RRDEACLD----YA 266
           SN  S +C+D +         V L+PC+   GNQ W     G I  R+   CLD      
Sbjct: 50  SNGNSNLCLDVS--GGGNGSKVVLWPCNGGSGNQRWDYDGDGTIRNRKSGKCLDVKGASN 107

Query: 267 GGDVILYPCHGSKGNQ 282
           G  VIL+ C G+  NQ
Sbjct: 108 GTKVILWTCDGN-PNQ 122


>gnl|CDD|238092 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydrate-binding
           domain formed from presumed gene triplication. The
           domain is found in a variety of molecules serving
           diverse functions such as enzymatic activity, inhibitory
           toxicity and signal transduction. Highly specific ligand
           binding occurs on exposed surfaces of the compact domain
           sturcture.
          Length = 124

 Score = 69.1 bits (169), Expect = 8e-15
 Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 8/80 (10%)

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR--RDEACLD----YAGGDV 270
           +G+C+D          PV LYPCH  G NQ W ++  G IR      CLD      G  V
Sbjct: 9   TGLCLDV--NGGSDGGPVQLYPCHGNGNNQKWTLTSDGTIRIKSSNLCLDVGGDAPGSKV 66

Query: 271 ILYPCHGSKGNQYFEYDYKY 290
            LY C G   NQ + ++   
Sbjct: 67  RLYTCSGGSDNQRWTFNKDG 86



 Score = 45.6 bits (108), Expect = 2e-06
 Identities = 31/101 (30%), Positives = 41/101 (40%), Gaps = 25/101 (24%)

Query: 192 TSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS 251
           TS   +R     KS    L+V  D  G               V LY C     NQ W  +
Sbjct: 41  TSDGTIR----IKSSNLCLDVGGDAPGS-------------KVRLYTCSGGSDNQRWTFN 83

Query: 252 KHGEIRRDEA--CLDYAGGD-----VILYPCHGSKGNQYFE 285
           K G IR  ++  CLD  GG+     +IL+ C G   NQ ++
Sbjct: 84  KDGTIRNLKSGKCLDVKGGNTNGTNLILWTCDGGP-NQKWK 123


>gnl|CDD|214672 smart00458, RICIN, Ricin-type beta-trefoil.  Carbohydrate-binding
           domain formed from presumed gene triplication.
          Length = 118

 Score = 64.5 bits (157), Expect = 4e-13
 Identities = 29/83 (34%), Positives = 39/83 (46%), Gaps = 9/83 (10%)

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA--CLDY---AG 267
            +  +G C+D          PVGL+ CH  GGNQ W ++  G IR  +   CL      G
Sbjct: 2   ISGNTGKCLD----VNGNKNPVGLFDCHGTGGNQLWKLTSDGAIRIKDTDLCLTANGNTG 57

Query: 268 GDVILYPCHGSKGNQYFEYDYKY 290
             V LY C G+  NQY+E +   
Sbjct: 58  STVTLYSCDGTNDNQYWEVNKDG 80



 Score = 42.5 bits (100), Expect = 2e-05
 Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 13/86 (15%)

Query: 207 KWYL----EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA- 261
            W L     +    + +C+ +     +    V LY C     NQ+W ++K G IR  ++ 
Sbjct: 32  LWKLTSDGAIRIKDTDLCLTA---NGNTGSTVTLYSCDGTNDNQYWEVNKDGTIRNPDSG 88

Query: 262 -CLDYAGGD----VILYPCHGSKGNQ 282
            CLD   G+    VIL+ C G+   +
Sbjct: 89  KCLDVKDGNTGTKVILWTCSGNPNQK 114


>gnl|CDD|217196 pfam02709, Glyco_transf_7C, N-terminal domain of
           galactosyltransferase.  This is the N-terminal domain of
           a family of galactosyltransferases from a wide range of
           Metazoa with three related galactosyltransferases
           activities, all three of which are possessed by one
           sequence in some cases. EC:2.4.1.90, N-acetyllactosamine
           synthase; EC:2.4.1.38,
           Beta-N-acetylglucosaminyl-glycopeptide beta-1,4-
           galactosyltransferase; and EC:2.4.1.22 Lactose synthase.
           Note that N-acetyllactosamine synthase is a component of
           Lactose synthase along with alpha-lactalbumin, in the
           absence of alpha-lactalbumin EC:2.4.1.90 is the
           catalyzed reaction.
          Length = 78

 Score = 31.0 bits (71), Expect = 0.11
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 87  PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
               GG+ +  K  F K+  + + F  WGGE+ +L
Sbjct: 17  KGYFGGVLAFSKEDFLKVNGFSNNFWGWGGEDDDL 51



 Score = 31.0 bits (71), Expect = 0.11
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 148 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
               GG+ +  K  F K+  + + F  WGGE+ +L
Sbjct: 17  KGYFGGVLAFSKEDFLKVNGFSNNFWGWGGEDDDL 51


>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 278

 Score = 32.8 bits (75), Expect = 0.17
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 89  MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
           +A     I++ FF K+G +D  F   GGE+ EL ++
Sbjct: 165 LASSCILINRDFFLKIGGFDENFRGHGGEDFELLYR 200



 Score = 32.8 bits (75), Expect = 0.17
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 150 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
           +A     I++ FF K+G +D  F   GGE+ EL ++
Sbjct: 165 LASSCILINRDFFLKIGGFDENFRGHGGEDFELLYR 200


>gnl|CDD|223548 COG0472, Rfe, UDP-N-acetylmuramyl pentapeptide
           phosphotransferase/UDP-N- acetylglucosamine-1-phosphate
           transferase [Cell envelope biogenesis, outer membrane].
          Length = 319

 Score = 29.9 bits (68), Expect = 1.3
 Identities = 14/35 (40%), Positives = 14/35 (40%), Gaps = 6/35 (17%)

Query: 121 LSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLF 155
              K     IPE   K HK       TPTM GGL 
Sbjct: 26  FLRKLGLGDIPEDGPKSHKKG-----TPTM-GGLA 54



 Score = 28.0 bits (63), Expect = 6.5
 Identities = 13/32 (40%), Positives = 14/32 (43%), Gaps = 6/32 (18%)

Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLF 94
          +     IPE   K HK       TPTM GGL 
Sbjct: 29 KLGLGDIPEDGPKSHKKG-----TPTM-GGLA 54


>gnl|CDD|187551 cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup
           3, extended (e) SDRs.  Members of this bacterial
           subgroup are identified as possible sugar epimerases,
           such as UDP-glucose 4 epimerase. However, while the
           NAD(P)-binding motif is fairly well conserved, not all
           members retain the canonical active site tetrad of the
           extended SDRs. UDP-glucose 4 epimerase (aka
           UDP-galactose-4-epimerase), is a homodimeric extended
           SDR. It catalyzes the NAD-dependent conversion of
           UDP-galactose to UDP-glucose, the final step in Leloir
           galactose synthesis. Extended SDRs are distinct from
           classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 306

 Score = 29.3 bits (66), Expect = 2.2
 Identities = 15/59 (25%), Positives = 19/59 (32%), Gaps = 9/59 (15%)

Query: 9   RWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWH 67
           R   P L+V     + ++ P   N   D    R  P           GGFD   QF   
Sbjct: 153 RRRHPELNVTVLRPATILGPGTRNTTRDFLSPRRLPVP---------GGFDPPFQFLHE 202


>gnl|CDD|176494 cd08764, Cyt_b561_CG1275_like, Non-vertebrate eumetazoan
          cytochrome b(561).  Cytochrome b(561), as found in
          non-vertebrate eumetazoans, similar to the Drosophila
          melanogaster CG1275 gene product. This protein might
          act as a ferric-chelate reductase, catalyzing the
          reduction of Fe(3+) to Fe(2+), such as associated with
          the transport of iron from the endosome to the
          cytoplasm. It is assumed that this protein uses
          ascorbate as the electron donor. Belongs to the
          cytochrome b(561) family, which are secretory
          vesicle-specific electron transport proteins.
          Cytochromes b(561) are integral membrane proteins that
          bind two heme groups non-covalently, and may have six
          alpha-helical trans-membrane segments.
          Length = 214

 Score = 28.8 bits (65), Expect = 2.4
 Identities = 11/17 (64%), Positives = 11/17 (64%), Gaps = 3/17 (17%)

Query: 54 FIGGFDW---NLQFNWH 67
          F GGF W    LQFNWH
Sbjct: 11 FRGGFSWTGPGLQFNWH 27


>gnl|CDD|226577 COG4092, COG4092, Predicted glycosyltransferase involved in capsule
           biosynthesis [Cell envelope biogenesis, outer membrane].
          Length = 346

 Score = 28.0 bits (62), Expect = 6.6
 Identities = 24/120 (20%), Positives = 37/120 (30%), Gaps = 23/120 (19%)

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIWGGENLELS 122
           F         +K   N   P+  P     ++ ++KA  +     +  F D    E+    
Sbjct: 112 FAKMLSIATIKKMRTNIDAPLVLP-----VYHLNKADTQVFFDVEDMFLDAMIFESPLAE 166

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
           F           RK       P         +F I++  F   G YD  F   G E+ E 
Sbjct: 167 F-----------RKEDNFFIAPY------TNIFLINRRMFSLTGGYDERFRGHGSEDFEF 209


>gnl|CDD|215006 smart01065, CBM_2, Starch binding domain. 
          Length = 88

 Score = 26.2 bits (58), Expect = 7.5
 Identities = 9/36 (25%), Positives = 13/36 (36%), Gaps = 2/36 (5%)

Query: 37 TFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPER 72
             +  PP   T  YK+     D +    W + P R
Sbjct: 49 KGTVSLPPAGTTIEYKYVKV--DEDGSVTWESGPNR 82


>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
           biosynthesis of succinoglycan.  Succinoglycan
           Biosynthesis Protein ExoA catalyzes the formation of a
           beta-1,3 linkage of the second sugar (glucose) of the
           succinoglycan with the galactose on the lipid carrie.
           Succinoglycan is an acidic exopolysaccharide that is
           important for invasion of the nodules. Succinoglycan is
           a high-molecular-weight polymer composed of repeating
           octasaccharide units. These units are synthesized on
           membrane-bound isoprenoid lipid carriers, beginning with
           galactose followed by seven glucose molecules, and
           modified by the addition of acetate, succinate, and
           pyruvate. ExoA is a membrane protein with a
           transmembrance domain at c-terminus.
          Length = 249

 Score = 27.6 bits (62), Expect = 8.1
 Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 4/48 (8%)

Query: 74  RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
                 A +  +  T+  G +   +  FEK+G +D        E+ EL
Sbjct: 143 SAYRGGAVKIGYVDTVHHGAYR--REVFEKVGGFDESLVR--NEDAEL 186



 Score = 27.6 bits (62), Expect = 8.1
 Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 4/48 (8%)

Query: 135 RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
                 A +  +  T+  G +   +  FEK+G +D        E+ EL
Sbjct: 143 SAYRGGAVKIGYVDTVHHGAYR--REVFEKVGGFDESLVR--NEDAEL 186


>gnl|CDD|220523 pfam10020, DUF2262, Uncharacterized protein conserved in bacteria
           (DUF2262).  This domain, found in various hypothetical
           bacterial proteins, has no known function.
          Length = 144

 Score = 26.9 bits (60), Expect = 8.3
 Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 4/38 (10%)

Query: 153 GLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGD 190
           G F++DK +    G+++   D W GE + LS   D  D
Sbjct: 6   GDFTLDKDY----GSFEGKIDWWDGEEISLSLDEDDND 39


>gnl|CDD|240337 PTZ00261, PTZ00261, acyltransferase; Provisional.
          Length = 355

 Score = 27.5 bits (61), Expect = 9.9
 Identities = 11/46 (23%), Positives = 18/46 (39%), Gaps = 3/46 (6%)

Query: 227 PTDMHKPVGLYPC---HKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
           P DMH  +G YP         +    + +  +  RDE   + A  +
Sbjct: 273 PADMHIRIGAYPIDYDRDSSKDVAVGLQQRMQKVRDEIAAEVAAAE 318


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.140    0.474 

Gapped
Lambda     K      H
   0.267   0.0710    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,133,705
Number of extensions: 1428467
Number of successful extensions: 1003
Number of sequences better than 10.0: 1
Number of HSP's gapped: 991
Number of HSP's successfully gapped: 31
Length of query: 290
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 194
Effective length of database: 6,679,618
Effective search space: 1295845892
Effective search space used: 1295845892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.2 bits)