RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12509
         (194 letters)



>gnl|CDD|217079 pfam02516, STT3, Oligosaccharyl transferase STT3 subunit.  This
           family consists of the oligosaccharyl transferase STT3
           subunit and related proteins. The STT3 subunit is part
           of the oligosaccharyl transferase (OTase) complex of
           proteins and is required for its activity. In
           eukaryotes, OTase transfers a lipid-linked
           core-oligosaccharide to selected asparagine residues in
           the ER. In the archaea STT3 occurs alone, rather than in
           an OTase complex, and is required for N-glycosylation of
           asparagines.
          Length = 650

 Score =  176 bits (447), Expect = 6e-52
 Identities = 73/183 (39%), Positives = 106/183 (57%), Gaps = 7/183 (3%)

Query: 15  AFKAAFSTRLFSVLRFESVIHEFDPYFNYRTTKFLVEEGYYKFHNWFDDRAWYPLG-RII 73
               A  +RLF+  R     HEFDPY+NYR T+ L+ EG+Y    W D  A+YP G R+ 
Sbjct: 10  IAGVAIFSRLFTDTRGLPYFHEFDPYYNYRLTENLLNEGFYPGGIWDDHSAYYPPGSRVD 69

Query: 74  GGTIYPGLMVTSAVLYHLMWLLNITIDIRNVCVFLAPFFSSLTTIVTYLLTKEVKDTGSG 133
              + P L +      + +  L +   +R VC +L P    L  I TYLL +E+K+  +G
Sbjct: 70  YPPLLPYLTMA----LYGIIFLFLPFSLREVCFWLPPVIGGLLGIATYLLVRELKNDLAG 125

Query: 134 LVAAAMISIVPGYISRSVAGSYDNEAIAIFCMLLTYYSWIKAVKTGS--ILWSALTALAY 191
           L+AA +++I PGY+SR+VAG YD + +AIF  LL  + W+ A+KTGS  I+++ L  L  
Sbjct: 126 LIAAFLLAIAPGYVSRTVAGFYDTDMLAIFLPLLVLFFWLLAIKTGSNRIIYAVLAGLPL 185

Query: 192 FYM 194
           F M
Sbjct: 186 FLM 188


>gnl|CDD|224206 COG1287, COG1287, Uncharacterized membrane protein, required for
           N-linked glycosylation [General function prediction
           only].
          Length = 773

 Score = 95.2 bits (237), Expect = 5e-23
 Identities = 44/188 (23%), Positives = 80/188 (42%), Gaps = 16/188 (8%)

Query: 16  FKAAFSTRLFSVLRFES--VIHEFDPYFNYRTTKFLVEEGY-YKFHNWFDDRAWYPLGRI 72
                     S    +      EFDPY++YR  + L++ G    F + +D+   YP G  
Sbjct: 28  LGFLVRAFTRSYAFDDPGVYFGEFDPYYHYRLIENLLKNGPPRDFFDPYDN---YPPGSP 84

Query: 73  IG-GTIYPGLMVTSAVLYHLMWLLNITIDIRNVCVFLAPFFSSLTTIVTYLLTKEVKDTG 131
           I    ++  L         L+      + +    +        LT I  YLL + +    
Sbjct: 85  IDFPPLFLYLTAA----LGLILGSIFPVSLETAALLFPAILGVLTVIPVYLLGRRILGDK 140

Query: 132 SGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCMLLTYYSWIKAVKT-----GSILWSAL 186
           +GL+AA ++++ PGY+SR+VAG YD +   +   L   + ++ A+K        ++++ L
Sbjct: 141 TGLLAALLLALAPGYLSRTVAGFYDTDMFELLLPLFALFFFLLALKAAKKLKKPVIYALL 200

Query: 187 TALAYFYM 194
             LA   +
Sbjct: 201 AGLALGLL 208


>gnl|CDD|234485 TIGR04154, archaeo_STT3, oligosaccharyl transferase, archaeosortase
           A system-associated.  Members of this protein family
           occur, one to three members per genome, in the same
           species of Euryarchaeota as contain the predicted
           protein-sorting enzyme archaeosortase (TIGR04125) and
           its cognate protein-sorting signal PGF-CTERM
           (TIGR04126).
          Length = 815

 Score = 78.5 bits (194), Expect = 3e-17
 Identities = 42/174 (24%), Positives = 75/174 (43%), Gaps = 21/174 (12%)

Query: 36  EFDPYFNYRTTKFLVEEGYYKFHNWFDDRAWYPLGRIIG-GTIYPGLMVTSAVLYHLMWL 94
             DP+++ R  ++ V    +    WFD    YP G  +G G ++  L  T A++  L   
Sbjct: 32  GNDPWYHLRRIEYTVHN--FPNTLWFDPYTNYPYGTEVGWGPLFDQLGATLALIVGL--G 87

Query: 95  LNITIDIRNVCVFLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPG-YISRSVAG 153
                 I  V  FL P   +LT I  Y + K + D  +GL+AA +++++PG ++SRS+ G
Sbjct: 88  APSRETIETVAAFLPPLLGALTVIPVYFIGKRLGDRKTGLLAAFLLAVLPGQFLSRSLFG 147

Query: 154 SYDNEA-------IAIFCMLLTYYSWIK--------AVKTGSILWSALTALAYF 192
             D+         +A+   +L      +              + ++ L  +A  
Sbjct: 148 FVDHHIAEVLFSTLAVLAFILALRVAREHKPSLRDLDTLKKPLAYAVLAGIALG 201


>gnl|CDD|221995 pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein
           mannosyltransferase.  This family contains members that
           are not captured by pfam02366.
          Length = 158

 Score = 40.6 bits (96), Expect = 1e-04
 Identities = 16/84 (19%), Positives = 37/84 (44%), Gaps = 2/84 (2%)

Query: 107 FLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCML 166
             +     L+ ++ YLL + +    + L+A  ++++ P ++  SV  + D  A  +    
Sbjct: 26  LPSALAGLLSILLLYLLARRLFGRRAALLAVLLLAVTPLFVLGSVLFTPD--APLLLFWA 83

Query: 167 LTYYSWIKAVKTGSILWSALTALA 190
           L  Y   +A++   + W  L  + 
Sbjct: 84  LALYFLYRALEKNKLKWWLLAGVF 107


>gnl|CDD|216998 pfam02366, PMT, Dolichyl-phosphate-mannose-protein
           mannosyltransferase.  This is a family of
           Dolichyl-phosphate-mannose-protein mannosyltransferase
           proteins EC:2.4.1.109. These proteins are responsible
           for O-linked glycosylation of proteins, they catalyze
           the reaction:- Dolichyl phosphate D-mannose + protein
           <=> dolichyl phosphate + O-D-mannosyl-protein. Also in
           this family is the Drosophila rotated abdomen protein
           which is a putative mannosyltransferase. This family
           appears to be distantly related to pfam02516 (A Bateman
           pers. obs.).
          Length = 244

 Score = 29.6 bits (67), Expect = 0.75
 Identities = 21/89 (23%), Positives = 36/89 (40%), Gaps = 7/89 (7%)

Query: 107 FLAPFFSSLTTIVTYLLTKE-VKDTGSGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCM 165
             +    SLT  + YL   E      + L+AA ++ +   +I+ S       ++  +F  
Sbjct: 87  LFSALLGSLTVPLVYLTALELGFSRLTALLAALLVILENSFITLS--RYILLDSPLLFFT 144

Query: 166 LLTYYSWIKAVKT----GSILWSALTALA 190
            L  Y ++K  K        LW  LT +A
Sbjct: 145 TLAMYCFVKFEKAPFSRKWWLWLLLTGIA 173


>gnl|CDD|224720 COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related
           glycosyltransferases of PMT family [Cell envelope
           biogenesis, outer membrane].
          Length = 535

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 16/84 (19%), Positives = 31/84 (36%), Gaps = 1/84 (1%)

Query: 107 FLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPGYISRSVAGSYDNEAIAIFCML 166
             +    +LT ++ Y L K +    + L+AA ++ + P +         D        + 
Sbjct: 88  LPSALAGALTALLVYWLAKRLFGRLAALLAALILLLTPLFFLIGRLALLDAALAFFLTLA 147

Query: 167 LTYYSWIKAVKTGSILWSALTALA 190
           L    ++     G + W  L  LA
Sbjct: 148 LALL-YLALRARGKLKWLLLLGLA 170


>gnl|CDD|235718 PRK06154, PRK06154, hypothetical protein; Provisional.
          Length = 565

 Score = 29.0 bits (65), Expect = 1.8
 Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 10/55 (18%)

Query: 89  YHLMWLLNITIDIRNVCV---------FLAPFFSSLTTIVTYLLTKEVKDTGSGL 134
           Y ++W L   +DI+ V +          L+PF+ + +   +YL   +    G GL
Sbjct: 384 YRVVWELQHAVDIKTVIITHDAGSPRDQLSPFYVA-SRPGSYLGWGKTTQLGYGL 437


>gnl|CDD|173745 cd07848, STKc_CDKL5, Catalytic domain of the Serine/Threonine
           Kinase, Cyclin-Dependent protein Kinase Like 5.
           Serine/Threonine Kinases (STKs), Cyclin-dependent
           protein kinase like 5 (CDKL5) subfamily, catalytic (c)
           domain. STKs catalyze the transfer of the
           gamma-phosphoryl group from ATP to serine/threonine
           residues on protein substrates. The CDKL5 subfamily is
           part of a larger superfamily that includes the catalytic
           domains of other protein STKs, protein tyrosine kinases,
           RIO kinases, aminoglycoside phosphotransferase, choline
           kinase, and phosphoinositide 3-kinase. CDKs belong to a
           large family of STKs that are regulated by their cognate
           cyclins. Together, they are involved in the control of
           cell-cycle progression, transcription, and neuronal
           function. Mutations in the gene encoding CDKL5,
           previously called STK9, are associated with early onset
           epilepsy and severe mental retardation [X-linked
           infantile spasm syndrome (ISSX) or West syndrome]. In
           addition, CDKL5 mutations also sometimes cause a
           phenotype similar to Rett syndrome (RTT), a progressive
           neurodevelopmental disorder. These pathogenic mutations
           are located in the N-terminal portion of the protein
           within the kinase domain.
          Length = 287

 Score = 27.3 bits (60), Expect = 5.7
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 1   MFRLKSVLGSHKPSAFKAAFSTRLFSVLRFESVIH 35
           +F ++ VLG       K  +S   F  LRF +V H
Sbjct: 210 LFTIQKVLGPLPAEQMKLFYSNPRFHGLRFPAVNH 244


>gnl|CDD|224438 COG1521, COG1521, Pantothenate kinase type III (Bvg accessory
           factor family protein) [Transcription].
          Length = 251

 Score = 26.9 bits (60), Expect = 7.2
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 70  GRIIGGTIYPGLMVTSAVLYH 90
           GR +GG I PG+ ++   L+ 
Sbjct: 142 GRYLGGAILPGITLSFEALFA 162


>gnl|CDD|132676 TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype
           I-F/YPEST.  The CRISPR-associated protein Cas1 is
           virtually universal to CRISPR systems. CRISPR, an
           acronym for Clustered Regularly Interspaced Short
           Palindromic Repeats, is prokaryotic immunity system for
           foreign DNA, mostly from phage. CRISPR systems belong to
           different subtypes, distinguished by both nature of the
           repeats, the makeup of the cohort of associated Cas
           proteins, and by molecular phylogeny within the more
           universal Cas proteins such as this one. This model is
           of type EXCEPTION and provides more specific information
           than the EQUIVALOG model TIGR00287. It describes the
           Cas1 protein particular to the YPEST subtype of
           CRISPR/Cas system.
          Length = 307

 Score = 26.7 bits (59), Expect = 7.5
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 4/42 (9%)

Query: 22  TRLFSVLRFESVIHEFDPYFNYRTTKFLVEEGYYKFHNWFDD 63
           T LFS    E  +    P   YR T++L  + + KF  WFDD
Sbjct: 78  TPLFSANEVEVDVSWLSPQSEYRPTEYL--QAWMKF--WFDD 115


>gnl|CDD|237413 PRK13528, PRK13528, outer membrane receptor FepA; Provisional.
          Length = 727

 Score = 27.0 bits (60), Expect = 7.8
 Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 6/44 (13%)

Query: 103 NVCVFLAPFFSSLTTIVTYLLTKEVKDTGSGLVAAAMISIVPGY 146
           N+ V L     +  T  TY++T E KDTG+ L      S++P Y
Sbjct: 586 NLLVPLMKDRLNWNTNATYMITSEQKDTGNPL------SVIPKY 623


>gnl|CDD|187849 cd09718, Cas1_I-F, CRISPR/Cas system-associated protein Cas1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Cas1 is the most universal CRISPR system protein thought
           to be involved in spacer integration; Cas1 is
           metal-dependent deoxyribonuclease, also binds RNA; Shown
           to possess a unique fold consisting of a N-terminal
           beta-strand domain and a C-terminal alpha-helical
           domain.
          Length = 306

 Score = 26.7 bits (59), Expect = 8.7
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 4/42 (9%)

Query: 22  TRLFSVLRFESVIHEFDPYFNYRTTKFLVEEGYYKFHNWFDD 63
           T LFS    E  +    P   YR T++L  + + KF  WFDD
Sbjct: 78  TPLFSANEVEVDVEWLSPQSEYRPTEYL--QAWMKF--WFDD 115


>gnl|CDD|181973 PRK09589, celA, 6-phospho-beta-glucosidase; Reviewed.
          Length = 476

 Score = 26.7 bits (59), Expect = 9.0
 Identities = 15/50 (30%), Positives = 18/50 (36%), Gaps = 15/50 (30%)

Query: 35  HEFDPYFNYRTTKFLVEEGYYKFH---------------NWFDDRAWYPL 69
           HE +P  +Y  T+ LV   Y K                 NWF D    PL
Sbjct: 321 HEDNPQLDYVETRDLVSNPYVKASEWGWQIDPAGLRYSLNWFWDHYQLPL 370


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.328    0.140    0.440 

Gapped
Lambda     K      H
   0.267   0.0795    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,107,845
Number of extensions: 955892
Number of successful extensions: 1308
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1296
Number of HSP's successfully gapped: 47
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 56 (25.2 bits)