RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy838
(152 letters)
>gnl|CDD|240332 PTZ00255, PTZ00255, 60S ribosomal protein L37a; Provisional.
Length = 90
Score = 97.9 bits (244), Expect = 1e-27
Identities = 40/53 (75%), Positives = 44/53 (83%)
Query: 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLG 53
MAKRTKKVGITGKYGTRYGASLRK +KK+EI+QHAKY C FCGK K +G
Sbjct: 1 MAKRTKKVGITGKYGTRYGASLRKQIKKIEISQHAKYFCPFCGKHAVKRQAVG 53
>gnl|CDD|216695 pfam01780, Ribosomal_L37ae, Ribosomal L37ae protein family. This
ribosomal protein is found in archaebacteria and
eukaryotes. It contains four conserved cysteine
residues that may bind to zinc.
Length = 90
Score = 93.1 bits (232), Expect = 9e-26
Identities = 37/47 (78%), Positives = 42/47 (89%)
Query: 2 AKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKK 48
AKRTKKVGI G++GTRYGASLRK VKK+EI+QHAKYTC FCGK+ K
Sbjct: 1 AKRTKKVGIAGRFGTRYGASLRKRVKKIEISQHAKYTCPFCGKDAVK 47
>gnl|CDD|232904 TIGR00280, L37a, ribosomal protein L37a. This model finds
eukaryotic ribosomal protein L37a and its archaeal
orthologs. The nomeclature is tricky because eukaryotes
have proteins called both L37 and L37a [Protein
synthesis, Ribosomal proteins: synthesis and
modification].
Length = 92
Score = 69.5 bits (170), Expect = 2e-16
Identities = 31/44 (70%), Positives = 35/44 (79%)
Query: 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGK 44
M KRTKKVGITG++G RYG LR+ VKK+EI Q AKY C FCGK
Sbjct: 1 MMKRTKKVGITGRFGPRYGLKLRRQVKKIEIQQKAKYVCPFCGK 44
>gnl|CDD|235183 PRK03976, rpl37ae, 50S ribosomal protein L37Ae; Reviewed.
Length = 90
Score = 67.7 bits (166), Expect = 9e-16
Identities = 22/48 (45%), Positives = 28/48 (58%)
Query: 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKK 48
M RTKKVG G++G RYG +RK V +E AK+ C CG+ K
Sbjct: 1 MMSRTKKVGSAGRFGARYGRKIRKRVADIEEKMRAKHVCPVCGRPKVK 48
>gnl|CDD|224908 COG1997, RPL43A, Ribosomal protein L37AE/L43A [Translation,
ribosomal structure and biogenesis].
Length = 89
Score = 65.5 bits (160), Expect = 6e-15
Identities = 23/42 (54%), Positives = 32/42 (76%)
Query: 4 RTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45
+TKKVGI G++G RYG+ LR+ VK++E Q AK+ C FCG+
Sbjct: 3 KTKKVGIAGRFGPRYGSKLRRRVKEIEAQQRAKHVCPFCGRT 44
>gnl|CDD|173926 cd04301, NAT_SF, N-Acyltransferase superfamily: Various enzymes
that characteristically catalyze the transfer of an acyl
group to a substrate. NAT (N-Acyltransferase) is a
large superfamily of enzymes that mostly catalyze the
transfer of an acyl group to a substrate and are
implicated in a variety of functions, ranging from
bacterial antibiotic resistance to circadian rhythms in
mammals. Members include GCN5-related
N-Acetyltransferases (GNAT) such as Aminoglycoside
N-acetyltransferases, Histone N-acetyltransferase (HAT)
enzymes, and Serotonin N-acetyltransferase, which
catalyze the transfer of an acetyl group to a substrate.
The kinetic mechanism of most GNATs involves the ordered
formation of a ternary complex: the reaction begins with
Acetyl Coenzyme A (AcCoA) binding, followed by binding
of substrate, then direct transfer of the acetyl group
from AcCoA to the substrate, followed by product and
subsequent CoA release. Other family members include
Arginine/ornithine N-succinyltransferase, Myristoyl-CoA:
protein N-myristoyltransferase, and
Acyl-homoserinelactone synthase which have a similar
catalytic mechanism but differ in types of acyl groups
transferred. Leucyl/phenylalanyl-tRNA-protein
transferase and FemXAB nonribosomal peptidyltransferases
which catalyze similar peptidyltransferase reactions are
also included.
Length = 65
Score = 29.9 bits (68), Expect = 0.098
Identities = 13/52 (25%), Positives = 25/52 (48%), Gaps = 3/52 (5%)
Query: 95 YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQA 146
+LVA++ V F+ D G Y +L + + + KG+G +++A
Sbjct: 1 FLVAEDDG--EIVGFASLSPD-GSGGDTAYIGDLAVLPEYRGKGIGSALLEA 49
>gnl|CDD|181411 PRK08393, PRK08393, N-ethylammeline chlorohydrolase; Provisional.
Length = 424
Score = 30.5 bits (69), Expect = 0.29
Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 13/81 (16%)
Query: 34 HAKYTCSF-CGKEY-KKSTDLG-------AETKKWVWELEEKNMKHSYEVCD----IGWD 80
HA YTCS K +K+ + +ET + ++ EK K + D + D
Sbjct: 181 HAPYTCSLALLKWVREKAREWNKLITIHLSETMDEIKQIREKYGKSPVVLLDEIGFLNED 240
Query: 81 PQGKHSEMFDDRACYLVAKNG 101
H R ++A G
Sbjct: 241 VIAAHGVWLSSRDIRILASAG 261
>gnl|CDD|181193 PRK08006, PRK08006, replicative DNA helicase; Provisional.
Length = 471
Score = 29.2 bits (65), Expect = 0.76
Identities = 12/36 (33%), Positives = 22/36 (61%), Gaps = 3/36 (8%)
Query: 71 SYEVCDIGWDPQGKHSEMFDDRA---CYLVAKNGSS 103
+ E+ D G+DPQG+ SE D A + +A++ ++
Sbjct: 139 ANEIADAGYDPQGRTSEDLLDLAESRVFQIAESRAN 174
>gnl|CDD|221780 pfam12804, NTP_transf_3, MobA-like NTP transferase domain. This
family includes the MobA protein (Molybdopterin-guanine
dinucleotide biosynthesis protein A). The family also
includes a wide range of other NTP transferase domain.
Length = 178
Score = 28.7 bits (65), Expect = 1.0
Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 4/53 (7%)
Query: 95 YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYEL--QLEKQVQRKGLGKFMMQ 145
L+A +S +A +D G P+L+ L LE + +GL +
Sbjct: 105 RLLAALEASGADIAV--PVYDGGRGHPLLFPRSLLPALEALLGDRGLRSLLRA 155
>gnl|CDD|183178 PRK11525, dinD, DNA-damage-inducible protein D; Provisional.
Length = 279
Score = 28.5 bits (64), Expect = 1.4
Identities = 10/18 (55%), Positives = 12/18 (66%), Gaps = 1/18 (5%)
Query: 93 ACYLVAKNGSSSTPV-AF 109
ACYL+ +NG PV AF
Sbjct: 87 ACYLIVQNGDPRKPVIAF 104
>gnl|CDD|181562 PRK08840, PRK08840, replicative DNA helicase; Provisional.
Length = 464
Score = 28.0 bits (62), Expect = 2.0
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 73 EVCDIGWDPQGKHSEMFDDRA 93
E+ D G+DPQG+ SE D A
Sbjct: 134 EIADAGYDPQGRTSEDLLDMA 154
>gnl|CDD|205438 pfam13258, DUF4049, Domain of unknown function (DUF4049). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria, and is typically
between 310 and 324 amino acids in length.
Length = 318
Score = 28.0 bits (62), Expect = 2.3
Identities = 18/73 (24%), Positives = 30/73 (41%), Gaps = 5/73 (6%)
Query: 84 KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFG---EPVLYCYEL--QLEKQVQRKG 138
KH DD+ +L+ N STPV ++F DF E + C + ++ ++ G
Sbjct: 221 KHKTHMDDKLFHLIRSNSCRSTPVYNNYFSNTTDFRPKLEAIFACGQTLKKINQKYGHNG 280
Query: 139 LGKFMMQAFYTYI 151
G Q +
Sbjct: 281 NGVDECQGEDNTV 293
>gnl|CDD|198062 smart00994, zf-C4_ClpX, ClpX C4-type zinc finger. The ClpX heat
shock protein of Escherichia coli is a member of the
universally conserved Hsp100 family of proteins, and
possesses a putative zinc finger motif of the C4 type.
This presumed zinc binding domain is found at the
N-terminus of the ClpX protein. ClpX is an ATPase which
functions both as a substrate specificity component of
the ClpXP protease and as a molecular chaperone. The
molecular function of this domain is now known.
Length = 39
Score = 24.8 bits (55), Expect = 4.3
Identities = 6/9 (66%), Positives = 6/9 (66%)
Query: 37 YTCSFCGKE 45
CSFCGK
Sbjct: 1 LRCSFCGKS 9
>gnl|CDD|203502 pfam06689, zf-C4_ClpX, ClpX C4-type zinc finger. The ClpX heat
shock protein of Escherichia coli is a member of the
universally conserved Hsp100 family of proteins, and
possesses a putative zinc finger motif of the C4 type.
This presumed zinc binding domain is found at the
N-terminus of the ClpX protein. ClpX is an ATPase which
functions both as a substrate specificity component of
the ClpXP protease and as a molecular chaperone. The
molecular function of this domain is now known.
Length = 39
Score = 24.4 bits (54), Expect = 5.1
Identities = 6/9 (66%), Positives = 6/9 (66%)
Query: 37 YTCSFCGKE 45
CSFCGK
Sbjct: 1 LRCSFCGKS 9
>gnl|CDD|225156 COG2247, LytB, Putative cell wall-binding domain [Cell envelope
biogenesis, outer membrane].
Length = 337
Score = 26.7 bits (59), Expect = 5.5
Identities = 13/60 (21%), Positives = 24/60 (40%), Gaps = 4/60 (6%)
Query: 78 GWDPQGKHSEMFDDRACYLVAKNGSSSTP----VAFSHFRFDVDFGEPVLYCYELQLEKQ 133
GWD E+ + ++ KN S +A + + + G P+LY +L +
Sbjct: 140 GWDYADALMELMKEGIVPVILKNTSILVRWSRKLALAESPYAANIGCPILYNTKLTEKNV 199
>gnl|CDD|226768 COG4318, COG4318, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 221
Score = 26.3 bits (58), Expect = 6.6
Identities = 14/59 (23%), Positives = 20/59 (33%), Gaps = 12/59 (20%)
Query: 59 WVWELEEKNMKHSYE-----VCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHF 112
WV + + + YE + +G DP + A YL G FS F
Sbjct: 123 WVHPFDARGARRPYEAIPKALAGLGDDP-------YRSLAGYLRDAGGFDKDQTPFSEF 174
>gnl|CDD|117424 pfam08857, ParBc_2, Putative ParB-like nuclease. This domain is
probably distantly related to pfam02195. Suggesting
these uncharacterized proteins have a nuclease function.
Length = 162
Score = 26.1 bits (58), Expect = 6.9
Identities = 15/63 (23%), Positives = 23/63 (36%), Gaps = 10/63 (15%)
Query: 55 ETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDR----ACYLVAKNGSSSTPVAFS 110
+ WV +E+ + Y + P+ + DD A +L G P FS
Sbjct: 98 DNNGWVHPYDERGERRPYS--AL---PKSLG-GLEDDPYRSLAGFLRDAGGYKKDPTPFS 151
Query: 111 HFR 113
FR
Sbjct: 152 EFR 154
>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
Length = 389
Score = 26.3 bits (59), Expect = 8.2
Identities = 7/22 (31%), Positives = 12/22 (54%), Gaps = 1/22 (4%)
Query: 22 LRKMVKKMEITQHAKYTCSFCG 43
LR +V + + + +Y C CG
Sbjct: 341 LRDLVGEQ-LKRKPRYRCRNCG 361
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.134 0.419
Gapped
Lambda K H
0.267 0.0797 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,452,591
Number of extensions: 624747
Number of successful extensions: 500
Number of sequences better than 10.0: 1
Number of HSP's gapped: 500
Number of HSP's successfully gapped: 30
Length of query: 152
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 64
Effective length of database: 7,034,450
Effective search space: 450204800
Effective search space used: 450204800
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (24.8 bits)