RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17695
(222 letters)
>gnl|CDD|206015 pfam13844, Glyco_transf_41, Glycosyl transferase family 41. This
family of glycosyltransferases includes O-linked
beta-N-acetylglucosamine (O-GlcNAc) transferase, an
enzyme which catalyzes the addition of O-GlcNAc to
serine and threonine residues. In addition to its
function as an O-GlcNAc transferase, human OGT, also
appears to proteolytically cleave the epigenetic
cell-cycle regulator HCF-1.
Length = 468
Score = 133 bits (335), Expect = 1e-36
Identities = 49/58 (84%), Positives = 56/58 (96%)
Query: 160 QAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMNGYTKGARNEIFALRPAPIQ 217
++K+ E+EHF+DLSQ+PCNGKAADRIH+DGIHILVNMNGYTKGARNEIFALRPAPIQ
Sbjct: 47 RSKVMAESEHFVDLSQIPCNGKAADRIHQDGIHILVNMNGYTKGARNEIFALRPAPIQ 104
Score = 58.3 bits (141), Expect = 5e-10
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 97 EYRKAIAARHAN-LCLLKAMRAKVWQARTESPLFCVQRYAQRLESLYKVMW 146
EY IA + L+ +RAKVW+ART S LF ++Y Q LE LY MW
Sbjct: 419 EYED-IAVKLGTDREYLRKIRAKVWKARTSSTLFNTKQYCQDLERLYLKMW 468
>gnl|CDD|226428 COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Posttranslational modification, protein
turnover, chaperones].
Length = 620
Score = 61.7 bits (150), Expect = 3e-11
Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 149 YSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMNGYTKGARNEI 208
YS P T Q +I+ E + + ++ + A+ I DGI ILV+++G+T R ++
Sbjct: 295 YSLGPPHTDALQERISAAVEKWYPIGRMDDA-EIANAIRTDGIDILVDLDGHTVDTRCQV 353
Query: 209 FALRPAPIQSSF 220
FA RPAPIQ S+
Sbjct: 354 FAHRPAPIQVSW 365
Score = 35.5 bits (82), Expect = 0.018
Identities = 16/39 (41%), Positives = 27/39 (69%)
Query: 111 LLKAMRAKVWQARTESPLFCVQRYAQRLESLYKVMWDRY 149
L + +RA++ ++R SPLF + +A++LE+LY MW Y
Sbjct: 580 LRQQVRAELKRSRQTSPLFDPKAFARKLETLYWGMWSEY 618
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 42.8 bits (101), Expect = 1e-05
Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 6/83 (7%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNRL 81
G+ EA++ Y AL+L PD DAY NLA + + + +K + +
Sbjct: 14 GDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL------DPDN 67
Query: 82 PSVHPHHSMLYPLTHEYRKAIAA 104
+ + + Y +Y +A+ A
Sbjct: 68 AKAYYNLGLAYYKLGKYEEALEA 90
Score = 39.7 bits (93), Expect = 1e-04
Identities = 14/50 (28%), Positives = 23/50 (46%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71
G EA++ Y AL+L PD AY NL + + + +K + +
Sbjct: 48 GKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 40.8 bits (96), Expect = 2e-05
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQ 53
G+ EAI++Y AL+L PD +AY NLA
Sbjct: 17 GDYDEAIEAYEKALELDPDNAEAYYNLALAYL 48
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 31.4 bits (72), Expect = 0.027
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 31 YRTALKLKPDFPDAYCNLAHCLQ 53
Y AL+L P+ +AY NLA L
Sbjct: 2 YEKALELDPNNAEAYYNLALLLL 24
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 33.4 bits (76), Expect = 0.084
Identities = 14/33 (42%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 26 EAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDW 58
+AI+ Y A++ KPD P Y N A C + DW
Sbjct: 145 KAIKLYSKAIECKPD-PVYYSNRAACHNALGDW 176
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 32.7 bits (75), Expect = 0.15
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 20 SPGNIPEAIQSYRTALKLKPDFPDAYCNLA 49
S GNI A+ +YR A+ L+P+ LA
Sbjct: 205 SLGNIELALAAYRKAIALRPNNIAVLLALA 234
Score = 28.9 bits (65), Expect = 2.6
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLA 49
G++ +A +++ AL ++PDF A NLA
Sbjct: 479 GDLAKAREAFEKALSIEPDFFPAAANLA 506
Score = 28.5 bits (64), Expect = 3.3
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 26 EAIQSYRTALKLKPDFPDAYCNLA 49
+AI+ Y+T +K PD NLA
Sbjct: 788 KAIKHYQTVVKKAPDNAVVLNNLA 811
Score = 27.4 bits (61), Expect = 8.2
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 23 NIPEAIQSYRTALKLKPDFPDA 44
N +AI S + AL+LKPD +A
Sbjct: 650 NYAKAITSLKRALELKPDNTEA 671
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 28.5 bits (65), Expect = 0.23
Identities = 8/20 (40%), Positives = 12/20 (60%)
Query: 22 GNIPEAIQSYRTALKLKPDF 41
G EA++ Y AL+L P+
Sbjct: 15 GKYDEALEYYEKALELNPNN 34
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 28.2 bits (64), Expect = 0.39
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 22 GNIPEAIQSYRTALKLKPDF 41
G+ EA++ Y AL+L P+
Sbjct: 15 GDYDEALEYYEKALELDPNN 34
>gnl|CDD|224524 COG1608, COG1608, Predicted archaeal kinase [General function
prediction only].
Length = 252
Score = 29.6 bits (67), Expect = 0.98
Identities = 12/45 (26%), Positives = 17/45 (37%), Gaps = 2/45 (4%)
Query: 65 MKKLVSIVAEQLDKNRLP--SVHPHHSMLYPLTHEYRKAIAARHA 107
M +L SIV + L + SV P + Y A + A
Sbjct: 79 MLELNSIVVDALLDAGVRAVSVVPISFSTFNGRILYTYLEAIKDA 123
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 26.7 bits (59), Expect = 1.4
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHC 51
G++ EA+ R AL L PD P+A LA
Sbjct: 15 GDLDEALALLRRALALDPDDPEALLLLARL 44
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 26.3 bits (59), Expect = 1.4
Identities = 8/20 (40%), Positives = 12/20 (60%)
Query: 22 GNIPEAIQSYRTALKLKPDF 41
G+ EA + Y AL+L P+
Sbjct: 15 GDYEEAKEYYEKALELDPNN 34
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 26.9 bits (60), Expect = 2.0
Identities = 8/28 (28%), Positives = 12/28 (42%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLA 49
G+ EA+ + AL P +A L
Sbjct: 11 GDYDEALAALEAALARYPLAAEALLLLG 38
>gnl|CDD|234256 TIGR03558, oxido_grp_1, luciferase family oxidoreductase, group 1.
The Pfam domain family pfam00296 is named for
luciferase-like monooxygenases, but the family also
contains several coenzyme F420-dependent enzymes. This
protein family represents a well-resolved clade within
family pfam00296 and shows no restriction to coenzyme
F420-positive species, unlike some other clades within
pfam00296 [Unknown function, Enzymes of unknown
specificity].
Length = 323
Score = 28.6 bits (65), Expect = 2.2
Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 2/55 (3%)
Query: 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEAR 64
G+ F+ +S +P + EA+ YR + + Y +A + +V TD EA
Sbjct: 178 GLPFAFASHFAPDALEEALDLYRERFRPSAQLDEPYVMVA--VNVVAADTDEEAE 230
>gnl|CDD|226080 COG3550, HipA, Toxin module HipA, protein kinase of
phosphatidylinositol 3/4-kinase superfamily [General
function prediction only].
Length = 392
Score = 28.6 bits (64), Expect = 2.5
Identities = 11/57 (19%), Positives = 16/57 (28%), Gaps = 2/57 (3%)
Query: 134 YAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEH--FLDLSQVPCNGKAADRIHK 188
A L +L + P HI +A A +S A R +
Sbjct: 121 GADALTALLRKASGIPDGITPTVHIIKAPDGLYAVKSNSDRISTSIDKELDALRYPE 177
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 25.6 bits (57), Expect = 3.1
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 22 GNIPEAIQSYRTALKLKPDF 41
G+ EA+++Y AL+L P+
Sbjct: 15 GDYEEALEAYEKALELDPNN 34
>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
Length = 296
Score = 27.9 bits (63), Expect = 3.6
Identities = 8/12 (66%), Positives = 9/12 (75%)
Query: 34 ALKLKPDFPDAY 45
AL L+PD DAY
Sbjct: 90 ALALRPDMADAY 101
>gnl|CDD|218897 pfam06100, Strep_67kDa_ant, Streptococcal 67 kDa
myosin-cross-reactive antigen like family. Members of
this family are thought to have structural features in
common with the beta chain of the class II antigens, as
well as myosin, and may play an important role in the
pathogenesis.
Length = 500
Score = 28.2 bits (63), Expect = 3.8
Identities = 20/82 (24%), Positives = 36/82 (43%), Gaps = 5/82 (6%)
Query: 131 VQRYAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDG 190
V R + +E+ ++ +WD + ++P I A + + F L++ N IHK G
Sbjct: 56 VVRGGREMENHFECLWDLFR-SIPSLEIPNASV---LDEFYWLNKDDPNYSNCRAIHKRG 111
Query: 191 IHILVNMN-GYTKGARNEIFAL 211
+ + +K A EI L
Sbjct: 112 NRLDDDGKFTLSKKAAKEIIKL 133
>gnl|CDD|217334 pfam03041, Baculo_LEF-2, lef-2. The lef-2 gene (for late
expression factor 2) from baculovirus is required for
expression of late genes. This gene has been shown to be
specifically required for expression from the vp39 and
polh promoters. LEF-1 is a DNA primase and there is some
evidence to suggest that LEF-2 may bind to both DNA and
LEF-1.
Length = 164
Score = 27.7 bits (62), Expect = 3.9
Identities = 14/63 (22%), Positives = 22/63 (34%), Gaps = 3/63 (4%)
Query: 70 SIVAEQLDKNRLPSVHPHH---SMLYPLTHEYRKAIAARHANLCLLKAMRAKVWQARTES 126
S+ +DKN V P +L P T + R + L L ++ + S
Sbjct: 8 SLKRSDIDKNATYLVDPFDFDWLVLSPYTVFEPGGLYVRVSGLRLYNLLKNPPTFEESPS 67
Query: 127 PLF 129
F
Sbjct: 68 VNF 70
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of
the TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 27.3 bits (61), Expect = 5.5
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCN 47
G + +A S+R AL L P+ D N
Sbjct: 79 GELEKAEDSFRRALTLNPNNGDVLNN 104
Score = 27.3 bits (61), Expect = 6.4
Identities = 11/32 (34%), Positives = 16/32 (50%)
Query: 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQ 53
G++ A ++ AL+ PD AY LA Q
Sbjct: 45 GDLEVAKENLDKALEHDPDDYLAYLALALYYQ 76
>gnl|CDD|185261 PRK15363, PRK15363, pathogenicity island 2 chaperone protein SscA;
Provisional.
Length = 157
Score = 26.9 bits (59), Expect = 6.7
Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 1/56 (1%)
Query: 26 EAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNRL 81
EAI +Y A ++K D P A A C + CD Y + K V + ++ ++++
Sbjct: 87 EAIYAYGRAAQIKIDAPQAPWAAAECY-LACDNVCYAIKALKAVVRICGEVSEHQI 141
>gnl|CDD|131546 TIGR02493, PFLA, pyruvate formate-lyase 1-activating enzyme. An
iron-sulfur protein with a radical-SAM domain
(pfam04055). A single glycine residue in EC 2.3.1.54,
formate C-acetyltransferase (formate-pyruvate lyase), is
oxidized to the corresponding radical by transfer of H
from its CH2 to AdoMet with concomitant cleavage of the
latter. The reaction requires Fe2+. The first stage is
reduction of the AdoMet to give methionine and the
5'-deoxyadenosin-5-yl radical, which then abstracts a
hydrogen radical from the glycine residue [Energy
metabolism, Anaerobic, Protein fate, Protein
modification and repair].
Length = 235
Score = 26.9 bits (60), Expect = 6.7
Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 2/46 (4%)
Query: 106 HANLCLL--KAMRAKVWQARTESPLFCVQRYAQRLESLYKVMWDRY 149
+ +L LL K + ++ T L +A+ L K +W RY
Sbjct: 118 YTDLVLLDIKHFNPEKYKKLTGVSLQPTLDFAKYLAKRNKPIWIRY 163
>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
chaperone LcrH/SycD. Genes in this family are found in
type III secretion operons. LcrH, from Yersinia is
believed to have a regulatory function in the
low-calcium response of the secretion system. The same
protein is also known as SycD (SYC = Specific Yop
Chaperone) for its chaperone role. In Pseudomonas,
where the homolog is known as PcrH, the chaperone role
has been demonstrated and the regulatory role appears
to be absent. ScyD/LcrH contains three central
tetratricopeptide-like repeats that are predicted to
fold into an all-alpha-helical array.
Length = 135
Score = 26.5 bits (59), Expect = 7.4
Identities = 13/27 (48%), Positives = 15/27 (55%)
Query: 26 EAIQSYRTALKLKPDFPDAYCNLAHCL 52
EAI +Y A L PD P Y + A CL
Sbjct: 69 EAIDAYALAAALDPDDPRPYFHAAECL 95
>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
This group consists of Eukaryotic TruB proteins similar
to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
makes psi55 in the T loop of both cytoplasmic and
mitochondrial tRNAs. Psi55 is almost universally
conserved. Pseudouridine synthases catalyze the
isomerization of specific uridines in an RNA molecule to
pseudouridines (5-ribosyluracil, psi).
Length = 312
Score = 27.0 bits (60), Expect = 9.1
Identities = 10/25 (40%), Positives = 13/25 (52%), Gaps = 1/25 (4%)
Query: 154 PVTHITQAKIAREAEHFL-DLSQVP 177
P +HIT+ I F D+ QVP
Sbjct: 117 PYSHITREDIEEVLAKFRGDIKQVP 141
>gnl|CDD|115693 pfam07056, DUF1335, Protein of unknown function (DUF1335). This
family represents a conserved region approximately 130
residues long within a number of proteins of unknown
function that seem to be specific to the white spot
syndrome virus (WSSV).
Length = 131
Score = 26.3 bits (58), Expect = 9.3
Identities = 8/20 (40%), Positives = 10/20 (50%)
Query: 5 PFFKVGILFSNSSFPSPGNI 24
P F +G FS + PS I
Sbjct: 98 PLFMLGEEFSRNVHPSAKEI 117
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.134 0.418
Gapped
Lambda K H
0.267 0.0867 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,304,192
Number of extensions: 1017313
Number of successful extensions: 1230
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1228
Number of HSP's successfully gapped: 46
Length of query: 222
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 129
Effective length of database: 6,812,680
Effective search space: 878835720
Effective search space used: 878835720
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 57 (25.5 bits)