RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3380
(290 letters)
>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
mucin-type O-linked glycans. UDP-GalNAc: polypeptide
alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
initiate the formation of mucin-type, O-linked glycans
by catalyzing the transfer of
alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
hydroxyl groups of Ser or Thr residues of core proteins
to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
enzymes are type II membrane proteins with a GT-A type
catalytic domain and a lectin domain located on the
lumen side of the Golgi apparatus. In human, there are
15 isozymes of pp-GalNAc-Ts, representing the largest of
all glycosyltransferase families. Each isozyme has
unique but partially redundant substrate specificity for
glycosylation sites on acceptor proteins.
Length = 299
Score = 175 bits (445), Expect = 3e-53
Identities = 91/234 (38%), Positives = 114/234 (48%), Gaps = 57/234 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL +A N VV P+I I DTFE R S GGFDW+L
Sbjct: 94 CEVNVGWLEPLLARIAENRKTVVCPIIDVIDADTFEYR-------GSSGDARGGFDWSLH 146
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE ER R ++ P+ +PTMAGGLF+ID+ +F +LG YD G DIWGGENLELSF
Sbjct: 147 FKWLPLPEEER-RRESPTAPIRSPTMAGGLFAIDREWFLELGGYDEGMDIWGGENLELSF 205
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG------------LFSIDKAFFEKLGTYDS- 170
K VW GG +F + + G +
Sbjct: 206 K--------------------VW---QCGGSIEIVPCSRVGHIFRRKRKPYTFPGGSGTV 242
Query: 171 ------GFDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
++W E E +K D+GD++ RK LR L CKSFKWYLE
Sbjct: 243 LRNYKRVAEVWMDEYKEYFYKARPELRNIDYGDLSERKALRERLKCKSFKWYLE 296
>gnl|CDD|216044 pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain.
Length = 124
Score = 74.1 bits (182), Expect = 1e-16
Identities = 36/79 (45%), Positives = 45/79 (56%), Gaps = 9/79 (11%)
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR---DEACLDYAGGD--- 269
SG C+D D PVGLYPCH GGNQ W ++ G IR CLD +GG
Sbjct: 10 RSGKCLDVPGGSAD-GGPVGLYPCHG-GGNQLWTLTGDGTIRSNGNSNLCLDVSGGGNGS 67
Query: 270 -VILYPCHGSKGNQYFEYD 287
V+L+PC+G GNQ ++YD
Sbjct: 68 KVVLWPCNGGSGNQRWDYD 86
Score = 49.5 bits (118), Expect = 1e-07
Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 9/76 (11%)
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI--RRDEACLD----YA 266
SN S +C+D + V L+PC+ GNQ W G I R+ CLD
Sbjct: 50 SNGNSNLCLDVS--GGGNGSKVVLWPCNGGSGNQRWDYDGDGTIRNRKSGKCLDVKGASN 107
Query: 267 GGDVILYPCHGSKGNQ 282
G VIL+ C G+ NQ
Sbjct: 108 GTKVILWTCDGN-PNQ 122
>gnl|CDD|238092 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydrate-binding
domain formed from presumed gene triplication. The
domain is found in a variety of molecules serving
diverse functions such as enzymatic activity, inhibitory
toxicity and signal transduction. Highly specific ligand
binding occurs on exposed surfaces of the compact domain
sturcture.
Length = 124
Score = 69.1 bits (169), Expect = 8e-15
Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 8/80 (10%)
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR--RDEACLD----YAGGDV 270
+G+C+D PV LYPCH G NQ W ++ G IR CLD G V
Sbjct: 9 TGLCLDV--NGGSDGGPVQLYPCHGNGNNQKWTLTSDGTIRIKSSNLCLDVGGDAPGSKV 66
Query: 271 ILYPCHGSKGNQYFEYDYKY 290
LY C G NQ + ++
Sbjct: 67 RLYTCSGGSDNQRWTFNKDG 86
Score = 45.6 bits (108), Expect = 2e-06
Identities = 31/101 (30%), Positives = 41/101 (40%), Gaps = 25/101 (24%)
Query: 192 TSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS 251
TS +R KS L+V D G V LY C NQ W +
Sbjct: 41 TSDGTIR----IKSSNLCLDVGGDAPGS-------------KVRLYTCSGGSDNQRWTFN 83
Query: 252 KHGEIRRDEA--CLDYAGGD-----VILYPCHGSKGNQYFE 285
K G IR ++ CLD GG+ +IL+ C G NQ ++
Sbjct: 84 KDGTIRNLKSGKCLDVKGGNTNGTNLILWTCDGGP-NQKWK 123
>gnl|CDD|214672 smart00458, RICIN, Ricin-type beta-trefoil. Carbohydrate-binding
domain formed from presumed gene triplication.
Length = 118
Score = 64.5 bits (157), Expect = 4e-13
Identities = 29/83 (34%), Positives = 39/83 (46%), Gaps = 9/83 (10%)
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA--CLDY---AG 267
+ +G C+D PVGL+ CH GGNQ W ++ G IR + CL G
Sbjct: 2 ISGNTGKCLD----VNGNKNPVGLFDCHGTGGNQLWKLTSDGAIRIKDTDLCLTANGNTG 57
Query: 268 GDVILYPCHGSKGNQYFEYDYKY 290
V LY C G+ NQY+E +
Sbjct: 58 STVTLYSCDGTNDNQYWEVNKDG 80
Score = 42.5 bits (100), Expect = 2e-05
Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 13/86 (15%)
Query: 207 KWYL----EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA- 261
W L + + +C+ + + V LY C NQ+W ++K G IR ++
Sbjct: 32 LWKLTSDGAIRIKDTDLCLTA---NGNTGSTVTLYSCDGTNDNQYWEVNKDGTIRNPDSG 88
Query: 262 -CLDYAGGD----VILYPCHGSKGNQ 282
CLD G+ VIL+ C G+ +
Sbjct: 89 KCLDVKDGNTGTKVILWTCSGNPNQK 114
>gnl|CDD|217196 pfam02709, Glyco_transf_7C, N-terminal domain of
galactosyltransferase. This is the N-terminal domain of
a family of galactosyltransferases from a wide range of
Metazoa with three related galactosyltransferases
activities, all three of which are possessed by one
sequence in some cases. EC:2.4.1.90, N-acetyllactosamine
synthase; EC:2.4.1.38,
Beta-N-acetylglucosaminyl-glycopeptide beta-1,4-
galactosyltransferase; and EC:2.4.1.22 Lactose synthase.
Note that N-acetyllactosamine synthase is a component of
Lactose synthase along with alpha-lactalbumin, in the
absence of alpha-lactalbumin EC:2.4.1.90 is the
catalyzed reaction.
Length = 78
Score = 31.0 bits (71), Expect = 0.11
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 87 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
GG+ + K F K+ + + F WGGE+ +L
Sbjct: 17 KGYFGGVLAFSKEDFLKVNGFSNNFWGWGGEDDDL 51
Score = 31.0 bits (71), Expect = 0.11
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 148 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
GG+ + K F K+ + + F WGGE+ +L
Sbjct: 17 KGYFGGVLAFSKEDFLKVNGFSNNFWGWGGEDDDL 51
>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
Members of this family of prokaryotic proteins include
putative glucosyltransferase, which are involved in
bacterial capsule biosynthesis.
Length = 278
Score = 32.8 bits (75), Expect = 0.17
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+A I++ FF K+G +D F GGE+ EL ++
Sbjct: 165 LASSCILINRDFFLKIGGFDENFRGHGGEDFELLYR 200
Score = 32.8 bits (75), Expect = 0.17
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 150 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
+A I++ FF K+G +D F GGE+ EL ++
Sbjct: 165 LASSCILINRDFFLKIGGFDENFRGHGGEDFELLYR 200
>gnl|CDD|223548 COG0472, Rfe, UDP-N-acetylmuramyl pentapeptide
phosphotransferase/UDP-N- acetylglucosamine-1-phosphate
transferase [Cell envelope biogenesis, outer membrane].
Length = 319
Score = 29.9 bits (68), Expect = 1.3
Identities = 14/35 (40%), Positives = 14/35 (40%), Gaps = 6/35 (17%)
Query: 121 LSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLF 155
K IPE K HK TPTM GGL
Sbjct: 26 FLRKLGLGDIPEDGPKSHKKG-----TPTM-GGLA 54
Score = 28.0 bits (63), Expect = 6.5
Identities = 13/32 (40%), Positives = 14/32 (43%), Gaps = 6/32 (18%)
Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLF 94
+ IPE K HK TPTM GGL
Sbjct: 29 KLGLGDIPEDGPKSHKKG-----TPTM-GGLA 54
>gnl|CDD|187551 cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup
3, extended (e) SDRs. Members of this bacterial
subgroup are identified as possible sugar epimerases,
such as UDP-glucose 4 epimerase. However, while the
NAD(P)-binding motif is fairly well conserved, not all
members retain the canonical active site tetrad of the
extended SDRs. UDP-glucose 4 epimerase (aka
UDP-galactose-4-epimerase), is a homodimeric extended
SDR. It catalyzes the NAD-dependent conversion of
UDP-galactose to UDP-glucose, the final step in Leloir
galactose synthesis. Extended SDRs are distinct from
classical SDRs. In addition to the Rossmann fold
(alpha/beta folding pattern with a central beta-sheet)
core region typical of all SDRs, extended SDRs have a
less conserved C-terminal extension of approximately 100
amino acids. Extended SDRs are a diverse collection of
proteins, and include isomerases, epimerases,
oxidoreductases, and lyases; they typically have a
TGXXGXXG cofactor binding motif. SDRs are a functionally
diverse family of oxidoreductases that have a single
domain with a structurally conserved Rossmann fold, an
NAD(P)(H)-binding region, and a structurally diverse
C-terminal region. Sequence identity between different
SDR enzymes is typically in the 15-30% range; they
catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 306
Score = 29.3 bits (66), Expect = 2.2
Identities = 15/59 (25%), Positives = 19/59 (32%), Gaps = 9/59 (15%)
Query: 9 RWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWH 67
R P L+V + ++ P N D R P GGFD QF
Sbjct: 153 RRRHPELNVTVLRPATILGPGTRNTTRDFLSPRRLPVP---------GGFDPPFQFLHE 202
>gnl|CDD|176494 cd08764, Cyt_b561_CG1275_like, Non-vertebrate eumetazoan
cytochrome b(561). Cytochrome b(561), as found in
non-vertebrate eumetazoans, similar to the Drosophila
melanogaster CG1275 gene product. This protein might
act as a ferric-chelate reductase, catalyzing the
reduction of Fe(3+) to Fe(2+), such as associated with
the transport of iron from the endosome to the
cytoplasm. It is assumed that this protein uses
ascorbate as the electron donor. Belongs to the
cytochrome b(561) family, which are secretory
vesicle-specific electron transport proteins.
Cytochromes b(561) are integral membrane proteins that
bind two heme groups non-covalently, and may have six
alpha-helical trans-membrane segments.
Length = 214
Score = 28.8 bits (65), Expect = 2.4
Identities = 11/17 (64%), Positives = 11/17 (64%), Gaps = 3/17 (17%)
Query: 54 FIGGFDW---NLQFNWH 67
F GGF W LQFNWH
Sbjct: 11 FRGGFSWTGPGLQFNWH 27
>gnl|CDD|226577 COG4092, COG4092, Predicted glycosyltransferase involved in capsule
biosynthesis [Cell envelope biogenesis, outer membrane].
Length = 346
Score = 28.0 bits (62), Expect = 6.6
Identities = 24/120 (20%), Positives = 37/120 (30%), Gaps = 23/120 (19%)
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIWGGENLELS 122
F +K N P+ P ++ ++KA + + F D E+
Sbjct: 112 FAKMLSIATIKKMRTNIDAPLVLP-----VYHLNKADTQVFFDVEDMFLDAMIFESPLAE 166
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
F RK P +F I++ F G YD F G E+ E
Sbjct: 167 F-----------RKEDNFFIAPY------TNIFLINRRMFSLTGGYDERFRGHGSEDFEF 209
>gnl|CDD|215006 smart01065, CBM_2, Starch binding domain.
Length = 88
Score = 26.2 bits (58), Expect = 7.5
Identities = 9/36 (25%), Positives = 13/36 (36%), Gaps = 2/36 (5%)
Query: 37 TFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPER 72
+ PP T YK+ D + W + P R
Sbjct: 49 KGTVSLPPAGTTIEYKYVKV--DEDGSVTWESGPNR 82
>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
biosynthesis of succinoglycan. Succinoglycan
Biosynthesis Protein ExoA catalyzes the formation of a
beta-1,3 linkage of the second sugar (glucose) of the
succinoglycan with the galactose on the lipid carrie.
Succinoglycan is an acidic exopolysaccharide that is
important for invasion of the nodules. Succinoglycan is
a high-molecular-weight polymer composed of repeating
octasaccharide units. These units are synthesized on
membrane-bound isoprenoid lipid carriers, beginning with
galactose followed by seven glucose molecules, and
modified by the addition of acetate, succinate, and
pyruvate. ExoA is a membrane protein with a
transmembrance domain at c-terminus.
Length = 249
Score = 27.6 bits (62), Expect = 8.1
Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 4/48 (8%)
Query: 74 RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
A + + T+ G + + FEK+G +D E+ EL
Sbjct: 143 SAYRGGAVKIGYVDTVHHGAYR--REVFEKVGGFDESLVR--NEDAEL 186
Score = 27.6 bits (62), Expect = 8.1
Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 4/48 (8%)
Query: 135 RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
A + + T+ G + + FEK+G +D E+ EL
Sbjct: 143 SAYRGGAVKIGYVDTVHHGAYR--REVFEKVGGFDESLVR--NEDAEL 186
>gnl|CDD|220523 pfam10020, DUF2262, Uncharacterized protein conserved in bacteria
(DUF2262). This domain, found in various hypothetical
bacterial proteins, has no known function.
Length = 144
Score = 26.9 bits (60), Expect = 8.3
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
Query: 153 GLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGD 190
G F++DK + G+++ D W GE + LS D D
Sbjct: 6 GDFTLDKDY----GSFEGKIDWWDGEEISLSLDEDDND 39
>gnl|CDD|240337 PTZ00261, PTZ00261, acyltransferase; Provisional.
Length = 355
Score = 27.5 bits (61), Expect = 9.9
Identities = 11/46 (23%), Positives = 18/46 (39%), Gaps = 3/46 (6%)
Query: 227 PTDMHKPVGLYPC---HKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
P DMH +G YP + + + + RDE + A +
Sbjct: 273 PADMHIRIGAYPIDYDRDSSKDVAVGLQQRMQKVRDEIAAEVAAAE 318
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.140 0.474
Gapped
Lambda K H
0.267 0.0710 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,133,705
Number of extensions: 1428467
Number of successful extensions: 1003
Number of sequences better than 10.0: 1
Number of HSP's gapped: 991
Number of HSP's successfully gapped: 31
Length of query: 290
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 194
Effective length of database: 6,679,618
Effective search space: 1295845892
Effective search space used: 1295845892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.2 bits)