RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17695
         (222 letters)



>gnl|CDD|206015 pfam13844, Glyco_transf_41, Glycosyl transferase family 41.  This
           family of glycosyltransferases includes O-linked
           beta-N-acetylglucosamine (O-GlcNAc) transferase, an
           enzyme which catalyzes the addition of O-GlcNAc to
           serine and threonine residues. In addition to its
           function as an O-GlcNAc transferase, human OGT, also
           appears to proteolytically cleave the epigenetic
           cell-cycle regulator HCF-1.
          Length = 468

 Score =  133 bits (335), Expect = 1e-36
 Identities = 49/58 (84%), Positives = 56/58 (96%)

Query: 160 QAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMNGYTKGARNEIFALRPAPIQ 217
           ++K+  E+EHF+DLSQ+PCNGKAADRIH+DGIHILVNMNGYTKGARNEIFALRPAPIQ
Sbjct: 47  RSKVMAESEHFVDLSQIPCNGKAADRIHQDGIHILVNMNGYTKGARNEIFALRPAPIQ 104



 Score = 58.3 bits (141), Expect = 5e-10
 Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 97  EYRKAIAARHAN-LCLLKAMRAKVWQARTESPLFCVQRYAQRLESLYKVMW 146
           EY   IA +       L+ +RAKVW+ART S LF  ++Y Q LE LY  MW
Sbjct: 419 EYED-IAVKLGTDREYLRKIRAKVWKARTSSTLFNTKQYCQDLERLYLKMW 468


>gnl|CDD|226428 COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Posttranslational modification, protein
           turnover, chaperones].
          Length = 620

 Score = 61.7 bits (150), Expect = 3e-11
 Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 149 YSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMNGYTKGARNEI 208
           YS   P T   Q +I+   E +  + ++    + A+ I  DGI ILV+++G+T   R ++
Sbjct: 295 YSLGPPHTDALQERISAAVEKWYPIGRMDDA-EIANAIRTDGIDILVDLDGHTVDTRCQV 353

Query: 209 FALRPAPIQSSF 220
           FA RPAPIQ S+
Sbjct: 354 FAHRPAPIQVSW 365



 Score = 35.5 bits (82), Expect = 0.018
 Identities = 16/39 (41%), Positives = 27/39 (69%)

Query: 111 LLKAMRAKVWQARTESPLFCVQRYAQRLESLYKVMWDRY 149
           L + +RA++ ++R  SPLF  + +A++LE+LY  MW  Y
Sbjct: 580 LRQQVRAELKRSRQTSPLFDPKAFARKLETLYWGMWSEY 618


>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 42.8 bits (101), Expect = 1e-05
 Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 6/83 (7%)

Query: 22  GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNRL 81
           G+  EA++ Y  AL+L PD  DAY NLA     +  + +     +K + +          
Sbjct: 14  GDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL------DPDN 67

Query: 82  PSVHPHHSMLYPLTHEYRKAIAA 104
              + +  + Y    +Y +A+ A
Sbjct: 68  AKAYYNLGLAYYKLGKYEEALEA 90



 Score = 39.7 bits (93), Expect = 1e-04
 Identities = 14/50 (28%), Positives = 23/50 (46%)

Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71
          G   EA++ Y  AL+L PD   AY NL      +  + +     +K + +
Sbjct: 48 GKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 40.8 bits (96), Expect = 2e-05
 Identities = 15/32 (46%), Positives = 20/32 (62%)

Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQ 53
          G+  EAI++Y  AL+L PD  +AY NLA    
Sbjct: 17 GDYDEAIEAYEKALELDPDNAEAYYNLALAYL 48


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 31.4 bits (72), Expect = 0.027
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 31 YRTALKLKPDFPDAYCNLAHCLQ 53
          Y  AL+L P+  +AY NLA  L 
Sbjct: 2  YEKALELDPNNAEAYYNLALLLL 24


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 33.4 bits (76), Expect = 0.084
 Identities = 14/33 (42%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 26  EAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDW 58
           +AI+ Y  A++ KPD P  Y N A C   + DW
Sbjct: 145 KAIKLYSKAIECKPD-PVYYSNRAACHNALGDW 176


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 32.7 bits (75), Expect = 0.15
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 20  SPGNIPEAIQSYRTALKLKPDFPDAYCNLA 49
           S GNI  A+ +YR A+ L+P+       LA
Sbjct: 205 SLGNIELALAAYRKAIALRPNNIAVLLALA 234



 Score = 28.9 bits (65), Expect = 2.6
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 22  GNIPEAIQSYRTALKLKPDFPDAYCNLA 49
           G++ +A +++  AL ++PDF  A  NLA
Sbjct: 479 GDLAKAREAFEKALSIEPDFFPAAANLA 506



 Score = 28.5 bits (64), Expect = 3.3
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 26  EAIQSYRTALKLKPDFPDAYCNLA 49
           +AI+ Y+T +K  PD      NLA
Sbjct: 788 KAIKHYQTVVKKAPDNAVVLNNLA 811



 Score = 27.4 bits (61), Expect = 8.2
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 23  NIPEAIQSYRTALKLKPDFPDA 44
           N  +AI S + AL+LKPD  +A
Sbjct: 650 NYAKAITSLKRALELKPDNTEA 671


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 28.5 bits (65), Expect = 0.23
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query: 22 GNIPEAIQSYRTALKLKPDF 41
          G   EA++ Y  AL+L P+ 
Sbjct: 15 GKYDEALEYYEKALELNPNN 34


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
          or more copies in proteins. Contain a minimum of 34
          amino acids each and self-associate via a "knobs and
          holes" mechanism.
          Length = 34

 Score = 28.2 bits (64), Expect = 0.39
 Identities = 8/20 (40%), Positives = 13/20 (65%)

Query: 22 GNIPEAIQSYRTALKLKPDF 41
          G+  EA++ Y  AL+L P+ 
Sbjct: 15 GDYDEALEYYEKALELDPNN 34


>gnl|CDD|224524 COG1608, COG1608, Predicted archaeal kinase [General function
           prediction only].
          Length = 252

 Score = 29.6 bits (67), Expect = 0.98
 Identities = 12/45 (26%), Positives = 17/45 (37%), Gaps = 2/45 (4%)

Query: 65  MKKLVSIVAEQLDKNRLP--SVHPHHSMLYPLTHEYRKAIAARHA 107
           M +L SIV + L    +   SV P     +     Y    A + A
Sbjct: 79  MLELNSIVVDALLDAGVRAVSVVPISFSTFNGRILYTYLEAIKDA 123


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 26.7 bits (59), Expect = 1.4
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHC 51
          G++ EA+   R AL L PD P+A   LA  
Sbjct: 15 GDLDEALALLRRALALDPDDPEALLLLARL 44


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 26.3 bits (59), Expect = 1.4
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query: 22 GNIPEAIQSYRTALKLKPDF 41
          G+  EA + Y  AL+L P+ 
Sbjct: 15 GDYEEAKEYYEKALELDPNN 34


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 26.9 bits (60), Expect = 2.0
 Identities = 8/28 (28%), Positives = 12/28 (42%)

Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLA 49
          G+  EA+ +   AL   P   +A   L 
Sbjct: 11 GDYDEALAALEAALARYPLAAEALLLLG 38


>gnl|CDD|234256 TIGR03558, oxido_grp_1, luciferase family oxidoreductase, group 1. 
           The Pfam domain family pfam00296 is named for
           luciferase-like monooxygenases, but the family also
           contains several coenzyme F420-dependent enzymes. This
           protein family represents a well-resolved clade within
           family pfam00296 and shows no restriction to coenzyme
           F420-positive species, unlike some other clades within
           pfam00296 [Unknown function, Enzymes of unknown
           specificity].
          Length = 323

 Score = 28.6 bits (65), Expect = 2.2
 Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 2/55 (3%)

Query: 10  GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEAR 64
           G+ F+ +S  +P  + EA+  YR   +      + Y  +A  + +V   TD EA 
Sbjct: 178 GLPFAFASHFAPDALEEALDLYRERFRPSAQLDEPYVMVA--VNVVAADTDEEAE 230


>gnl|CDD|226080 COG3550, HipA, Toxin module HipA, protein kinase of
           phosphatidylinositol 3/4-kinase superfamily [General
           function prediction only].
          Length = 392

 Score = 28.6 bits (64), Expect = 2.5
 Identities = 11/57 (19%), Positives = 16/57 (28%), Gaps = 2/57 (3%)

Query: 134 YAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEH--FLDLSQVPCNGKAADRIHK 188
            A  L +L +          P  HI +A     A       +S        A R  +
Sbjct: 121 GADALTALLRKASGIPDGITPTVHIIKAPDGLYAVKSNSDRISTSIDKELDALRYPE 177


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
          includes outlying Tetratricopeptide-like repeats (TPR)
          that are not matched by pfam00515.
          Length = 34

 Score = 25.6 bits (57), Expect = 3.1
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 22 GNIPEAIQSYRTALKLKPDF 41
          G+  EA+++Y  AL+L P+ 
Sbjct: 15 GDYEEALEAYEKALELDPNN 34


>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
          Length = 296

 Score = 27.9 bits (63), Expect = 3.6
 Identities = 8/12 (66%), Positives = 9/12 (75%)

Query: 34  ALKLKPDFPDAY 45
           AL L+PD  DAY
Sbjct: 90  ALALRPDMADAY 101


>gnl|CDD|218897 pfam06100, Strep_67kDa_ant, Streptococcal 67 kDa
           myosin-cross-reactive antigen like family.  Members of
           this family are thought to have structural features in
           common with the beta chain of the class II antigens, as
           well as myosin, and may play an important role in the
           pathogenesis.
          Length = 500

 Score = 28.2 bits (63), Expect = 3.8
 Identities = 20/82 (24%), Positives = 36/82 (43%), Gaps = 5/82 (6%)

Query: 131 VQRYAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDG 190
           V R  + +E+ ++ +WD +  ++P   I  A +    + F  L++   N      IHK G
Sbjct: 56  VVRGGREMENHFECLWDLFR-SIPSLEIPNASV---LDEFYWLNKDDPNYSNCRAIHKRG 111

Query: 191 IHILVNMN-GYTKGARNEIFAL 211
             +  +     +K A  EI  L
Sbjct: 112 NRLDDDGKFTLSKKAAKEIIKL 133


>gnl|CDD|217334 pfam03041, Baculo_LEF-2, lef-2.  The lef-2 gene (for late
           expression factor 2) from baculovirus is required for
           expression of late genes. This gene has been shown to be
           specifically required for expression from the vp39 and
           polh promoters. LEF-1 is a DNA primase and there is some
           evidence to suggest that LEF-2 may bind to both DNA and
           LEF-1.
          Length = 164

 Score = 27.7 bits (62), Expect = 3.9
 Identities = 14/63 (22%), Positives = 22/63 (34%), Gaps = 3/63 (4%)

Query: 70  SIVAEQLDKNRLPSVHPHH---SMLYPLTHEYRKAIAARHANLCLLKAMRAKVWQARTES 126
           S+    +DKN    V P      +L P T      +  R + L L   ++       + S
Sbjct: 8   SLKRSDIDKNATYLVDPFDFDWLVLSPYTVFEPGGLYVRVSGLRLYNLLKNPPTFEESPS 67

Query: 127 PLF 129
             F
Sbjct: 68  VNF 70


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 27.3 bits (61), Expect = 5.5
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 22  GNIPEAIQSYRTALKLKPDFPDAYCN 47
           G + +A  S+R AL L P+  D   N
Sbjct: 79  GELEKAEDSFRRALTLNPNNGDVLNN 104



 Score = 27.3 bits (61), Expect = 6.4
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQ 53
          G++  A ++   AL+  PD   AY  LA   Q
Sbjct: 45 GDLEVAKENLDKALEHDPDDYLAYLALALYYQ 76


>gnl|CDD|185261 PRK15363, PRK15363, pathogenicity island 2 chaperone protein SscA;
           Provisional.
          Length = 157

 Score = 26.9 bits (59), Expect = 6.7
 Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 1/56 (1%)

Query: 26  EAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNRL 81
           EAI +Y  A ++K D P A    A C  + CD   Y  +  K V  +  ++ ++++
Sbjct: 87  EAIYAYGRAAQIKIDAPQAPWAAAECY-LACDNVCYAIKALKAVVRICGEVSEHQI 141


>gnl|CDD|131546 TIGR02493, PFLA, pyruvate formate-lyase 1-activating enzyme.  An
           iron-sulfur protein with a radical-SAM domain
           (pfam04055). A single glycine residue in EC 2.3.1.54,
           formate C-acetyltransferase (formate-pyruvate lyase), is
           oxidized to the corresponding radical by transfer of H
           from its CH2 to AdoMet with concomitant cleavage of the
           latter. The reaction requires Fe2+. The first stage is
           reduction of the AdoMet to give methionine and the
           5'-deoxyadenosin-5-yl radical, which then abstracts a
           hydrogen radical from the glycine residue [Energy
           metabolism, Anaerobic, Protein fate, Protein
           modification and repair].
          Length = 235

 Score = 26.9 bits (60), Expect = 6.7
 Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 2/46 (4%)

Query: 106 HANLCLL--KAMRAKVWQARTESPLFCVQRYAQRLESLYKVMWDRY 149
           + +L LL  K    + ++  T   L     +A+ L    K +W RY
Sbjct: 118 YTDLVLLDIKHFNPEKYKKLTGVSLQPTLDFAKYLAKRNKPIWIRY 163


>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
          chaperone LcrH/SycD.  Genes in this family are found in
          type III secretion operons. LcrH, from Yersinia is
          believed to have a regulatory function in the
          low-calcium response of the secretion system. The same
          protein is also known as SycD (SYC = Specific Yop
          Chaperone) for its chaperone role. In Pseudomonas,
          where the homolog is known as PcrH, the chaperone role
          has been demonstrated and the regulatory role appears
          to be absent. ScyD/LcrH contains three central
          tetratricopeptide-like repeats that are predicted to
          fold into an all-alpha-helical array.
          Length = 135

 Score = 26.5 bits (59), Expect = 7.4
 Identities = 13/27 (48%), Positives = 15/27 (55%)

Query: 26 EAIQSYRTALKLKPDFPDAYCNLAHCL 52
          EAI +Y  A  L PD P  Y + A CL
Sbjct: 69 EAIDAYALAAALDPDDPRPYFHAAECL 95


>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
           This group consists of Eukaryotic TruB proteins similar
           to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
           makes psi55 in the T loop of both cytoplasmic and
           mitochondrial tRNAs. Psi55 is almost universally
           conserved.  Pseudouridine synthases catalyze the
           isomerization of specific uridines in an RNA molecule to
           pseudouridines (5-ribosyluracil, psi).
          Length = 312

 Score = 27.0 bits (60), Expect = 9.1
 Identities = 10/25 (40%), Positives = 13/25 (52%), Gaps = 1/25 (4%)

Query: 154 PVTHITQAKIAREAEHFL-DLSQVP 177
           P +HIT+  I      F  D+ QVP
Sbjct: 117 PYSHITREDIEEVLAKFRGDIKQVP 141


>gnl|CDD|115693 pfam07056, DUF1335, Protein of unknown function (DUF1335).  This
           family represents a conserved region approximately 130
           residues long within a number of proteins of unknown
           function that seem to be specific to the white spot
           syndrome virus (WSSV).
          Length = 131

 Score = 26.3 bits (58), Expect = 9.3
 Identities = 8/20 (40%), Positives = 10/20 (50%)

Query: 5   PFFKVGILFSNSSFPSPGNI 24
           P F +G  FS +  PS   I
Sbjct: 98  PLFMLGEEFSRNVHPSAKEI 117


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.134    0.418 

Gapped
Lambda     K      H
   0.267   0.0867    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,304,192
Number of extensions: 1017313
Number of successful extensions: 1230
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1228
Number of HSP's successfully gapped: 46
Length of query: 222
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 129
Effective length of database: 6,812,680
Effective search space: 878835720
Effective search space used: 878835720
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 57 (25.5 bits)