RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12509
(194 letters)
>gnl|CDD|217079 pfam02516, STT3, Oligosaccharyl transferase STT3 subunit. This
family consists of the oligosaccharyl transferase STT3
subunit and related proteins. The STT3 subunit is part
of the oligosaccharyl transferase (OTase) complex of
proteins and is required for its activity. In
eukaryotes, OTase transfers a lipid-linked
core-oligosaccharide to selected asparagine residues in
the ER. In the archaea STT3 occurs alone, rather than in
an OTase complex, and is required for N-glycosylation of
asparagines.
Length = 650
Score = 176 bits (447), Expect = 6e-52
Identities = 73/183 (39%), Positives = 106/183 (57%), Gaps = 7/183 (3%)
Query: 15 AFKAAFSTRLFSVLRFESVIHEFDPYFNYRTTKFLVEEGYYKFHNWFDDRAWYPLG-RII 73
A +RLF+ R HEFDPY+NYR T+ L+ EG+Y W D A+YP G R+
Sbjct: 10 IAGVAIFSRLFTDTRGLPYFHEFDPYYNYRLTENLLNEGFYPGGIWDDHSAYYPPGSRVD 69
Query: 74 GGTIYPGLMVTSAVLYHLMWLLNITIDIRNVCVFLAPFFSSLTTIVTYLLTKEVKDTGSG 133
+ P L + + + L + +R VC +L P L I TYLL +E+K+ +G
Sbjct: 70 YPPLLPYLTMA----LYGIIFLFLPFSLREVCFWLPPVIGGLLGIATYLLVRELKNDLAG 125
Query: 134 LVAAAMISIVPGYISRSVAGSYDNEAIAIFCMLLTYYSWIKAVKTGS--ILWSALTALAY 191
L+AA +++I PGY+SR+VAG YD + +AIF LL + W+ A+KTGS I+++ L L
Sbjct: 126 LIAAFLLAIAPGYVSRTVAGFYDTDMLAIFLPLLVLFFWLLAIKTGSNRIIYAVLAGLPL 185
Query: 192 FYM 194
F M
Sbjct: 186 FLM 188
>gnl|CDD|224206 COG1287, COG1287, Uncharacterized membrane protein, required for
N-linked glycosylation [General function prediction
only].
Length = 773
Score = 95.2 bits (237), Expect = 5e-23
Identities = 44/188 (23%), Positives = 80/188 (42%), Gaps = 16/188 (8%)
Query: 16 FKAAFSTRLFSVLRFES--VIHEFDPYFNYRTTKFLVEEGY-YKFHNWFDDRAWYPLGRI 72
S + EFDPY++YR + L++ G F + +D+ YP G
Sbjct: 28 LGFLVRAFTRSYAFDDPGVYFGEFDPYYHYRLIENLLKNGPPRDFFDPYDN---YPPGSP 84
Query: 73 IG-GTIYPGLMVTSAVLYHLMWLLNITIDIRNVCVFLAPFFSSLTTIVTYLLTKEVKDTG 131
I ++ L L+ + + + LT I YLL + +
Sbjct: 85 IDFPPLFLYLTAA----LGLILGSIFPVSLETAALLFPAILGVLTVIPVYLLGRRILGDK 140
Query: 132 SGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCMLLTYYSWIKAVKT-----GSILWSAL 186
+GL+AA ++++ PGY+SR+VAG YD + + L + ++ A+K ++++ L
Sbjct: 141 TGLLAALLLALAPGYLSRTVAGFYDTDMFELLLPLFALFFFLLALKAAKKLKKPVIYALL 200
Query: 187 TALAYFYM 194
LA +
Sbjct: 201 AGLALGLL 208
>gnl|CDD|234485 TIGR04154, archaeo_STT3, oligosaccharyl transferase, archaeosortase
A system-associated. Members of this protein family
occur, one to three members per genome, in the same
species of Euryarchaeota as contain the predicted
protein-sorting enzyme archaeosortase (TIGR04125) and
its cognate protein-sorting signal PGF-CTERM
(TIGR04126).
Length = 815
Score = 78.5 bits (194), Expect = 3e-17
Identities = 42/174 (24%), Positives = 75/174 (43%), Gaps = 21/174 (12%)
Query: 36 EFDPYFNYRTTKFLVEEGYYKFHNWFDDRAWYPLGRIIG-GTIYPGLMVTSAVLYHLMWL 94
DP+++ R ++ V + WFD YP G +G G ++ L T A++ L
Sbjct: 32 GNDPWYHLRRIEYTVHN--FPNTLWFDPYTNYPYGTEVGWGPLFDQLGATLALIVGL--G 87
Query: 95 LNITIDIRNVCVFLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPG-YISRSVAG 153
I V FL P +LT I Y + K + D +GL+AA +++++PG ++SRS+ G
Sbjct: 88 APSRETIETVAAFLPPLLGALTVIPVYFIGKRLGDRKTGLLAAFLLAVLPGQFLSRSLFG 147
Query: 154 SYDNEA-------IAIFCMLLTYYSWIK--------AVKTGSILWSALTALAYF 192
D+ +A+ +L + + ++ L +A
Sbjct: 148 FVDHHIAEVLFSTLAVLAFILALRVAREHKPSLRDLDTLKKPLAYAVLAGIALG 201
>gnl|CDD|221995 pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein
mannosyltransferase. This family contains members that
are not captured by pfam02366.
Length = 158
Score = 40.6 bits (96), Expect = 1e-04
Identities = 16/84 (19%), Positives = 37/84 (44%), Gaps = 2/84 (2%)
Query: 107 FLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCML 166
+ L+ ++ YLL + + + L+A ++++ P ++ SV + D A +
Sbjct: 26 LPSALAGLLSILLLYLLARRLFGRRAALLAVLLLAVTPLFVLGSVLFTPD--APLLLFWA 83
Query: 167 LTYYSWIKAVKTGSILWSALTALA 190
L Y +A++ + W L +
Sbjct: 84 LALYFLYRALEKNKLKWWLLAGVF 107
>gnl|CDD|216998 pfam02366, PMT, Dolichyl-phosphate-mannose-protein
mannosyltransferase. This is a family of
Dolichyl-phosphate-mannose-protein mannosyltransferase
proteins EC:2.4.1.109. These proteins are responsible
for O-linked glycosylation of proteins, they catalyze
the reaction:- Dolichyl phosphate D-mannose + protein
<=> dolichyl phosphate + O-D-mannosyl-protein. Also in
this family is the Drosophila rotated abdomen protein
which is a putative mannosyltransferase. This family
appears to be distantly related to pfam02516 (A Bateman
pers. obs.).
Length = 244
Score = 29.6 bits (67), Expect = 0.75
Identities = 21/89 (23%), Positives = 36/89 (40%), Gaps = 7/89 (7%)
Query: 107 FLAPFFSSLTTIVTYLLTKE-VKDTGSGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCM 165
+ SLT + YL E + L+AA ++ + +I+ S ++ +F
Sbjct: 87 LFSALLGSLTVPLVYLTALELGFSRLTALLAALLVILENSFITLS--RYILLDSPLLFFT 144
Query: 166 LLTYYSWIKAVKT----GSILWSALTALA 190
L Y ++K K LW LT +A
Sbjct: 145 TLAMYCFVKFEKAPFSRKWWLWLLLTGIA 173
>gnl|CDD|224720 COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related
glycosyltransferases of PMT family [Cell envelope
biogenesis, outer membrane].
Length = 535
Score = 29.7 bits (67), Expect = 1.1
Identities = 16/84 (19%), Positives = 31/84 (36%), Gaps = 1/84 (1%)
Query: 107 FLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCML 166
+ +LT ++ Y L K + + L+AA ++ + P + D +
Sbjct: 88 LPSALAGALTALLVYWLAKRLFGRLAALLAALILLLTPLFFLIGRLALLDAALAFFLTLA 147
Query: 167 LTYYSWIKAVKTGSILWSALTALA 190
L ++ G + W L LA
Sbjct: 148 LALL-YLALRARGKLKWLLLLGLA 170
>gnl|CDD|235718 PRK06154, PRK06154, hypothetical protein; Provisional.
Length = 565
Score = 29.0 bits (65), Expect = 1.8
Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 10/55 (18%)
Query: 89 YHLMWLLNITIDIRNVCV---------FLAPFFSSLTTIVTYLLTKEVKDTGSGL 134
Y ++W L +DI+ V + L+PF+ + + +YL + G GL
Sbjct: 384 YRVVWELQHAVDIKTVIITHDAGSPRDQLSPFYVA-SRPGSYLGWGKTTQLGYGL 437
>gnl|CDD|173745 cd07848, STKc_CDKL5, Catalytic domain of the Serine/Threonine
Kinase, Cyclin-Dependent protein Kinase Like 5.
Serine/Threonine Kinases (STKs), Cyclin-dependent
protein kinase like 5 (CDKL5) subfamily, catalytic (c)
domain. STKs catalyze the transfer of the
gamma-phosphoryl group from ATP to serine/threonine
residues on protein substrates. The CDKL5 subfamily is
part of a larger superfamily that includes the catalytic
domains of other protein STKs, protein tyrosine kinases,
RIO kinases, aminoglycoside phosphotransferase, choline
kinase, and phosphoinositide 3-kinase. CDKs belong to a
large family of STKs that are regulated by their cognate
cyclins. Together, they are involved in the control of
cell-cycle progression, transcription, and neuronal
function. Mutations in the gene encoding CDKL5,
previously called STK9, are associated with early onset
epilepsy and severe mental retardation [X-linked
infantile spasm syndrome (ISSX) or West syndrome]. In
addition, CDKL5 mutations also sometimes cause a
phenotype similar to Rett syndrome (RTT), a progressive
neurodevelopmental disorder. These pathogenic mutations
are located in the N-terminal portion of the protein
within the kinase domain.
Length = 287
Score = 27.3 bits (60), Expect = 5.7
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 1 MFRLKSVLGSHKPSAFKAAFSTRLFSVLRFESVIH 35
+F ++ VLG K +S F LRF +V H
Sbjct: 210 LFTIQKVLGPLPAEQMKLFYSNPRFHGLRFPAVNH 244
>gnl|CDD|224438 COG1521, COG1521, Pantothenate kinase type III (Bvg accessory
factor family protein) [Transcription].
Length = 251
Score = 26.9 bits (60), Expect = 7.2
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 70 GRIIGGTIYPGLMVTSAVLYH 90
GR +GG I PG+ ++ L+
Sbjct: 142 GRYLGGAILPGITLSFEALFA 162
>gnl|CDD|132676 TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype
I-F/YPEST. The CRISPR-associated protein Cas1 is
virtually universal to CRISPR systems. CRISPR, an
acronym for Clustered Regularly Interspaced Short
Palindromic Repeats, is prokaryotic immunity system for
foreign DNA, mostly from phage. CRISPR systems belong to
different subtypes, distinguished by both nature of the
repeats, the makeup of the cohort of associated Cas
proteins, and by molecular phylogeny within the more
universal Cas proteins such as this one. This model is
of type EXCEPTION and provides more specific information
than the EQUIVALOG model TIGR00287. It describes the
Cas1 protein particular to the YPEST subtype of
CRISPR/Cas system.
Length = 307
Score = 26.7 bits (59), Expect = 7.5
Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 4/42 (9%)
Query: 22 TRLFSVLRFESVIHEFDPYFNYRTTKFLVEEGYYKFHNWFDD 63
T LFS E + P YR T++L + + KF WFDD
Sbjct: 78 TPLFSANEVEVDVSWLSPQSEYRPTEYL--QAWMKF--WFDD 115
>gnl|CDD|237413 PRK13528, PRK13528, outer membrane receptor FepA; Provisional.
Length = 727
Score = 27.0 bits (60), Expect = 7.8
Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 6/44 (13%)
Query: 103 NVCVFLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPGY 146
N+ V L + T TY++T E KDTG+ L S++P Y
Sbjct: 586 NLLVPLMKDRLNWNTNATYMITSEQKDTGNPL------SVIPKY 623
>gnl|CDD|187849 cd09718, Cas1_I-F, CRISPR/Cas system-associated protein Cas1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Cas1 is the most universal CRISPR system protein thought
to be involved in spacer integration; Cas1 is
metal-dependent deoxyribonuclease, also binds RNA; Shown
to possess a unique fold consisting of a N-terminal
beta-strand domain and a C-terminal alpha-helical
domain.
Length = 306
Score = 26.7 bits (59), Expect = 8.7
Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 4/42 (9%)
Query: 22 TRLFSVLRFESVIHEFDPYFNYRTTKFLVEEGYYKFHNWFDD 63
T LFS E + P YR T++L + + KF WFDD
Sbjct: 78 TPLFSANEVEVDVEWLSPQSEYRPTEYL--QAWMKF--WFDD 115
>gnl|CDD|181973 PRK09589, celA, 6-phospho-beta-glucosidase; Reviewed.
Length = 476
Score = 26.7 bits (59), Expect = 9.0
Identities = 15/50 (30%), Positives = 18/50 (36%), Gaps = 15/50 (30%)
Query: 35 HEFDPYFNYRTTKFLVEEGYYKFH---------------NWFDDRAWYPL 69
HE +P +Y T+ LV Y K NWF D PL
Sbjct: 321 HEDNPQLDYVETRDLVSNPYVKASEWGWQIDPAGLRYSLNWFWDHYQLPL 370
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.328 0.140 0.440
Gapped
Lambda K H
0.267 0.0795 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,107,845
Number of extensions: 955892
Number of successful extensions: 1308
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1296
Number of HSP's successfully gapped: 47
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 56 (25.2 bits)