RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14938
(96 letters)
>gnl|CDD|202401 pfam02799, NMT_C, Myristoyl-CoA:protein N-myristoyltransferase,
C-terminal domain. The N and C-terminal domains of NMT
are structurally similar, each adopting an acyl-CoA
N-acyltransferase-like fold.
Length = 190
Score = 146 bits (370), Expect = 3e-46
Identities = 62/139 (44%), Positives = 77/139 (55%), Gaps = 47/139 (33%)
Query: 4 RFDLAPIFSEEEFKHWFLPRDGIVYTFVVEN-DGEITE---------------------- 40
RFDLAP+F++EE HWFLPR+ +VY++VVEN DG+IT+
Sbjct: 52 RFDLAPVFTKEEVAHWFLPRENVVYSYVVENEDGKITDFFSFYSLPSTVIGNPKHKTLKA 111
Query: 41 --------------------IIL--KEGFDVFNALDLMENKEFLGPLKFGIGDGNLQYYL 78
+IL GFDVFNALDLM+NK FL LKFG GDG L YYL
Sbjct: 112 AYSFYYAATSTRLKQLMNDALILAKNLGFDVFNALDLMDNKLFLEDLKFGPGDGQLNYYL 171
Query: 79 YNWKCPSI--PPNKIGLVL 95
YN++CP I + +GLVL
Sbjct: 172 YNYRCPPIMGGISGVGLVL 190
>gnl|CDD|227423 COG5092, NMT1, N-myristoyl transferase [Lipid metabolism].
Length = 451
Score = 70.8 bits (173), Expect = 7e-16
Identities = 28/44 (63%), Positives = 31/44 (70%)
Query: 44 KEGFDVFNALDLMENKEFLGPLKFGIGDGNLQYYLYNWKCPSIP 87
EG DVFNAL +M+N FL LKFG GDG L YYLYN+K IP
Sbjct: 386 VEGCDVFNALTMMDNSLFLADLKFGCGDGFLNYYLYNYKSEEIP 429
>gnl|CDD|220751 pfam10433, MMS1_N, Mono-functional DNA-alkylating methyl
methanesulfonate N-term. MMS1 is a protein that
protects against replication-dependent DNA damage in
Saccharomyces cerevisiae. MMS1 belongs to the DDB1
family of cullin 4 adaptors and the two proteins are
homologous. MMS1 bridges the interaction of MMS22 and
Crt10 with Cul8/Rtt101. Cul8/Rtt101 is a cullin protein
involved in the regulation of DNA replication subsequent
to DNA damage. The N-terminal region of MMS1 and the
C-terminal of MMS22 are required for the the MMS1-MMS22
interaction. The human HIV-1 virion-associated protein
Vpr assembles with DDB1 through interaction with DCAF1
(chromatin assembly factor) to form an E3 ubiquitin
ligase that targets cellular substrates for
proteasome-mediated degradation and subsequent G2
arrest.
Length = 513
Score = 30.8 bits (70), Expect = 0.085
Identities = 13/55 (23%), Positives = 21/55 (38%), Gaps = 3/55 (5%)
Query: 7 LAPIFSEEEFKHWFLPR-DGIVYTFVVENDGEITEIILKEGF--DVFNALDLMEN 58
E+ L R DG +Y + NDGE+ I + V A +++
Sbjct: 220 DRNWSYNEKHDDILLAREDGRIYYLEISNDGEVESITILGIGKCSVAIAFAILDK 274
>gnl|CDD|216842 pfam02009, Rifin_STEVOR, Rifin/stevor family. Several multicopy
gene families have been described in Plasmodium
falciparum, including the stevor family of subtelomeric
open reading frames and the rif interspersed repetitive
elements. Both families contain three predicted
transmembrane segments. It has been proposed that stevor
and rif are members of a larger superfamily that code
for variant surface antigens.
Length = 290
Score = 29.2 bits (66), Expect = 0.24
Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 4/39 (10%)
Query: 35 DGEITEIILKEGFDVFNALDLMENKEFLGPLKFGIGDGN 73
D EI +IILK D L E K L+ G G G
Sbjct: 71 DKEIQKIILK---DKLEKKSLAE-KVEKECLRCGFGLGG 105
>gnl|CDD|220077 pfam08938, DUF1916, Domain of unknown function (DUF1916). This
domain is found in various eukaryotic HBS1-like
proteins.
Length = 148
Score = 28.2 bits (63), Expect = 0.45
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 35 DGEITEIILKEGFDVFNALDLMENKE 60
D + E +LK FDV ALD + N E
Sbjct: 106 DRTLVEAVLKHDFDVQKALDEILNSE 131
>gnl|CDD|220248 pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family. CRISPR
is a term for Clustered Regularly Interspaced Short
Palidromic Repeats. A number of protein families appear
only in association with these repeats and are
designated Cas (CRISPR associated) proteins. The family
describes Cas proteins of about 400 residues that
include the motif [VIL]-D-x-[ST]-H-[GS]. The CRISPR and
associated proteins are thought to be involved in the
evolution of host resistance. The exact molecular
function of this family is currently unknown.
Length = 370
Score = 27.4 bits (61), Expect = 0.96
Identities = 16/83 (19%), Positives = 27/83 (32%), Gaps = 1/83 (1%)
Query: 9 PIFSEEEFKHWFLPRDGIVYTFVVENDGEITEIILKEGFDVFNALDLMENKEFLGPLKFG 68
+ E+ W+L R V + + I+ + L+ G D E E
Sbjct: 283 SLEDLEKLIEWYLERGLYVQALTLAREWLISVLCLRYGLDNIKDRKEREKAEEA-LKLLE 341
Query: 69 IGDGNLQYYLYNWKCPSIPPNKI 91
+ N + W C S N +
Sbjct: 342 LRLKNEKELADKWSCLSDERNFL 364
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 26.8 bits (59), Expect = 1.7
Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 13/53 (24%)
Query: 31 VVENDGEITEIILKEGFDVFNALDLMENKEFLGPLKFG------IGDGNLQYY 77
V+E D E+ +I G D ++ EN+ L PL+FG + +GN +YY
Sbjct: 246 VLETDIEV-DI---VGPD---SVSNEENQHVLKPLEFGKSESGMVEEGNYRYY 291
>gnl|CDD|130592 TIGR01529, argR_whole, arginine repressor. This model includes
most members of the arginine-responsive transcriptional
regulator family ArgR. This hexameric protein binds DNA
at its amino end to repress arginine biosyntheis or
activate arginine catabolism. Some species have several
ArgR paralogs. In a neighbor-joining tree, some of
these paralogous sequences show long branches and
differ significantly in an otherwise well-conserved
C-terminal region motif GT[VIL][AC]GDDT. These paralogs
are excluded from the seed and score in the gray zone
of this model, between trusted and noise cutoffs [Amino
acid biosynthesis, Glutamate family, Regulatory
functions, DNA interactions].
Length = 146
Score = 26.1 bits (58), Expect = 2.2
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 7/50 (14%)
Query: 32 VENDGEITEIILKEGFDVFNAL---DLMENKEFLGPLKFGIGDGNLQYYL 78
+ E+ ++ EG +V A DL E LG +K DG+ Y L
Sbjct: 16 ISTQEELVALLKAEGIEVTQATVSRDLRE----LGAVKVRDEDGSYVYSL 61
>gnl|CDD|185583 PTZ00375, PTZ00375, dihydroxyacetone kinase-like protein;
Provisional.
Length = 584
Score = 26.7 bits (59), Expect = 2.4
Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 6/46 (13%)
Query: 29 TFVVENDGEITEIILKEGFDVFNALDLMENKEFLGPLKFGIGDGNL 74
+ + ++ + VF L +E++ +L L +GDG+L
Sbjct: 369 KAATPTNSLLRSVLER----VFETL--IESENYLNELDAEVGDGDL 408
>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
Length = 866
Score = 26.4 bits (58), Expect = 2.6
Identities = 10/37 (27%), Positives = 15/37 (40%), Gaps = 1/37 (2%)
Query: 2 FLRFDLAPIFSEEEFKHWFLPRDG-IVYTFVVENDGE 37
FL DL + +E F+ D F + + GE
Sbjct: 823 FLDKDLEFFVASKEEARAFVDFDPEHTVIFEIPDSGE 859
>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
other related proteins. ALDH subfamily which includes
NAD(P)+-dependent, aldehyde dehydrogenase, family 3
member A1 and B1 (ALDH3A1, ALDH3B1, EC=1.2.1.5) and
fatty aldehyde dehydrogenase, family 3 member A2
(ALDH3A2, EC=1.2.1.3), and also plant ALDH family
members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
(YMR110C) and the protozoan family 13 member (ALDH13),
as well as coniferyl aldehyde dehydrogenases (CALDH,
EC=1.2.1.68), and other similar sequences, such as the
Pseudomonas putida benzaldehyde dehydrogenase I that is
involved in the metabolism of mandelate.
Length = 426
Score = 26.3 bits (59), Expect = 2.9
Identities = 8/18 (44%), Positives = 10/18 (55%)
Query: 31 VVENDGEITEIILKEGFD 48
VVE E+ +L E FD
Sbjct: 161 VVEGGVEVATALLAEPFD 178
>gnl|CDD|235858 PRK06762, PRK06762, hypothetical protein; Provisional.
Length = 166
Score = 25.7 bits (57), Expect = 4.2
Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 5/40 (12%)
Query: 11 FSEEEFKHWFLPRDGIVY---TFVVENDG--EITEIILKE 45
F E++ + W+ P D + T +N +I + IL +
Sbjct: 122 FGEDDMRRWWNPHDTLGVIGETIFTDNLSLKDIFDAILTD 161
>gnl|CDD|213831 TIGR03578, EF_0831, conserved hypothetical protein
EF_0831/AHA_3912. Members of this family of small
(about 100 amino acid), relatively rare proteins are
found in both Gram-positive (e.g. Enterococcus
faecalis) and Gram-negative (e.g. Aeromonas hydrophila)
bacteria, as part of a cluster of conserved proteins.
The function is unknown [Hypothetical proteins,
Conserved].
Length = 96
Score = 25.1 bits (55), Expect = 4.8
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 14 EEFKHWFLPRDGIVYTFVVENDGEITEIILKE 45
E+F +F PR Y+ ++ ++T I ++E
Sbjct: 60 EKFLFFFFPRKRREYSVELDVTVKVTAIKVEE 91
>gnl|CDD|215664 pfam00027, cNMP_binding, Cyclic nucleotide-binding domain.
Length = 91
Score = 25.0 bits (55), Expect = 5.1
Identities = 13/52 (25%), Positives = 22/52 (42%), Gaps = 3/52 (5%)
Query: 10 IFSE-EEFKHWFLPRDGIVYTFVVENDGEITEI-ILKEGFDVFNALDLMENK 59
IF E + ++ G V + ++ DG + L G D F L L+ +
Sbjct: 10 IFREGDPADSLYIVLSGKVKVYKLDEDGREQILAFLGPG-DFFGELALLGGE 60
>gnl|CDD|222086 pfam13378, MR_MLE_C, Enolase C-terminal domain-like. This domain
appears at the C-terminus of many of the proteins that
carry the MR_MLE, pfam01188 and MR_MLE_N pfam02746
domains. EC:4.2.1.40.
Length = 111
Score = 24.8 bits (55), Expect = 6.5
Identities = 7/25 (28%), Positives = 11/25 (44%)
Query: 15 EFKHWFLPRDGIVYTFVVENDGEIT 39
E+ + LP D + V DG +
Sbjct: 77 EYVPYLLPDDLLTGPPPVVEDGRLA 101
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.326 0.150 0.479
Gapped
Lambda K H
0.267 0.0786 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,318,498
Number of extensions: 465637
Number of successful extensions: 461
Number of sequences better than 10.0: 1
Number of HSP's gapped: 461
Number of HSP's successfully gapped: 28
Length of query: 96
Length of database: 10,937,602
Length adjustment: 63
Effective length of query: 33
Effective length of database: 8,143,300
Effective search space: 268728900
Effective search space used: 268728900
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 53 (24.1 bits)