RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14938
         (96 letters)



>gnl|CDD|202401 pfam02799, NMT_C, Myristoyl-CoA:protein N-myristoyltransferase,
           C-terminal domain.  The N and C-terminal domains of NMT
           are structurally similar, each adopting an acyl-CoA
           N-acyltransferase-like fold.
          Length = 190

 Score =  146 bits (370), Expect = 3e-46
 Identities = 62/139 (44%), Positives = 77/139 (55%), Gaps = 47/139 (33%)

Query: 4   RFDLAPIFSEEEFKHWFLPRDGIVYTFVVEN-DGEITE---------------------- 40
           RFDLAP+F++EE  HWFLPR+ +VY++VVEN DG+IT+                      
Sbjct: 52  RFDLAPVFTKEEVAHWFLPRENVVYSYVVENEDGKITDFFSFYSLPSTVIGNPKHKTLKA 111

Query: 41  --------------------IIL--KEGFDVFNALDLMENKEFLGPLKFGIGDGNLQYYL 78
                               +IL    GFDVFNALDLM+NK FL  LKFG GDG L YYL
Sbjct: 112 AYSFYYAATSTRLKQLMNDALILAKNLGFDVFNALDLMDNKLFLEDLKFGPGDGQLNYYL 171

Query: 79  YNWKCPSI--PPNKIGLVL 95
           YN++CP I    + +GLVL
Sbjct: 172 YNYRCPPIMGGISGVGLVL 190


>gnl|CDD|227423 COG5092, NMT1, N-myristoyl transferase [Lipid metabolism].
          Length = 451

 Score = 70.8 bits (173), Expect = 7e-16
 Identities = 28/44 (63%), Positives = 31/44 (70%)

Query: 44  KEGFDVFNALDLMENKEFLGPLKFGIGDGNLQYYLYNWKCPSIP 87
            EG DVFNAL +M+N  FL  LKFG GDG L YYLYN+K   IP
Sbjct: 386 VEGCDVFNALTMMDNSLFLADLKFGCGDGFLNYYLYNYKSEEIP 429


>gnl|CDD|220751 pfam10433, MMS1_N, Mono-functional DNA-alkylating methyl
           methanesulfonate N-term.  MMS1 is a protein that
           protects against replication-dependent DNA damage in
           Saccharomyces cerevisiae. MMS1 belongs to the DDB1
           family of cullin 4 adaptors and the two proteins are
           homologous. MMS1 bridges the interaction of MMS22 and
           Crt10 with Cul8/Rtt101. Cul8/Rtt101 is a cullin protein
           involved in the regulation of DNA replication subsequent
           to DNA damage. The N-terminal region of MMS1 and the
           C-terminal of MMS22 are required for the the MMS1-MMS22
           interaction. The human HIV-1 virion-associated protein
           Vpr assembles with DDB1 through interaction with DCAF1
           (chromatin assembly factor) to form an E3 ubiquitin
           ligase that targets cellular substrates for
           proteasome-mediated degradation and subsequent G2
           arrest.
          Length = 513

 Score = 30.8 bits (70), Expect = 0.085
 Identities = 13/55 (23%), Positives = 21/55 (38%), Gaps = 3/55 (5%)

Query: 7   LAPIFSEEEFKHWFLPR-DGIVYTFVVENDGEITEIILKEGF--DVFNALDLMEN 58
                  E+     L R DG +Y   + NDGE+  I +       V  A  +++ 
Sbjct: 220 DRNWSYNEKHDDILLAREDGRIYYLEISNDGEVESITILGIGKCSVAIAFAILDK 274


>gnl|CDD|216842 pfam02009, Rifin_STEVOR, Rifin/stevor family.  Several multicopy
           gene families have been described in Plasmodium
           falciparum, including the stevor family of subtelomeric
           open reading frames and the rif interspersed repetitive
           elements. Both families contain three predicted
           transmembrane segments. It has been proposed that stevor
           and rif are members of a larger superfamily that code
           for variant surface antigens.
          Length = 290

 Score = 29.2 bits (66), Expect = 0.24
 Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 4/39 (10%)

Query: 35  DGEITEIILKEGFDVFNALDLMENKEFLGPLKFGIGDGN 73
           D EI +IILK   D      L E K     L+ G G G 
Sbjct: 71  DKEIQKIILK---DKLEKKSLAE-KVEKECLRCGFGLGG 105


>gnl|CDD|220077 pfam08938, DUF1916, Domain of unknown function (DUF1916).  This
           domain is found in various eukaryotic HBS1-like
           proteins.
          Length = 148

 Score = 28.2 bits (63), Expect = 0.45
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 35  DGEITEIILKEGFDVFNALDLMENKE 60
           D  + E +LK  FDV  ALD + N E
Sbjct: 106 DRTLVEAVLKHDFDVQKALDEILNSE 131


>gnl|CDD|220248 pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family.  CRISPR
           is a term for Clustered Regularly Interspaced Short
           Palidromic Repeats. A number of protein families appear
           only in association with these repeats and are
           designated Cas (CRISPR associated) proteins. The family
           describes Cas proteins of about 400 residues that
           include the motif [VIL]-D-x-[ST]-H-[GS]. The CRISPR and
           associated proteins are thought to be involved in the
           evolution of host resistance. The exact molecular
           function of this family is currently unknown.
          Length = 370

 Score = 27.4 bits (61), Expect = 0.96
 Identities = 16/83 (19%), Positives = 27/83 (32%), Gaps = 1/83 (1%)

Query: 9   PIFSEEEFKHWFLPRDGIVYTFVVENDGEITEIILKEGFDVFNALDLMENKEFLGPLKFG 68
            +   E+   W+L R   V    +  +  I+ + L+ G D        E  E        
Sbjct: 283 SLEDLEKLIEWYLERGLYVQALTLAREWLISVLCLRYGLDNIKDRKEREKAEEA-LKLLE 341

Query: 69  IGDGNLQYYLYNWKCPSIPPNKI 91
           +   N +     W C S   N +
Sbjct: 342 LRLKNEKELADKWSCLSDERNFL 364


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 26.8 bits (59), Expect = 1.7
 Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 13/53 (24%)

Query: 31  VVENDGEITEIILKEGFDVFNALDLMENKEFLGPLKFG------IGDGNLQYY 77
           V+E D E+ +I    G D   ++   EN+  L PL+FG      + +GN +YY
Sbjct: 246 VLETDIEV-DI---VGPD---SVSNEENQHVLKPLEFGKSESGMVEEGNYRYY 291


>gnl|CDD|130592 TIGR01529, argR_whole, arginine repressor.  This model includes
          most members of the arginine-responsive transcriptional
          regulator family ArgR. This hexameric protein binds DNA
          at its amino end to repress arginine biosyntheis or
          activate arginine catabolism. Some species have several
          ArgR paralogs. In a neighbor-joining tree, some of
          these paralogous sequences show long branches and
          differ significantly in an otherwise well-conserved
          C-terminal region motif GT[VIL][AC]GDDT. These paralogs
          are excluded from the seed and score in the gray zone
          of this model, between trusted and noise cutoffs [Amino
          acid biosynthesis, Glutamate family, Regulatory
          functions, DNA interactions].
          Length = 146

 Score = 26.1 bits (58), Expect = 2.2
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 7/50 (14%)

Query: 32 VENDGEITEIILKEGFDVFNAL---DLMENKEFLGPLKFGIGDGNLQYYL 78
          +    E+  ++  EG +V  A    DL E    LG +K    DG+  Y L
Sbjct: 16 ISTQEELVALLKAEGIEVTQATVSRDLRE----LGAVKVRDEDGSYVYSL 61


>gnl|CDD|185583 PTZ00375, PTZ00375, dihydroxyacetone kinase-like protein;
           Provisional.
          Length = 584

 Score = 26.7 bits (59), Expect = 2.4
 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 6/46 (13%)

Query: 29  TFVVENDGEITEIILKEGFDVFNALDLMENKEFLGPLKFGIGDGNL 74
                 +  +  ++ +    VF  L  +E++ +L  L   +GDG+L
Sbjct: 369 KAATPTNSLLRSVLER----VFETL--IESENYLNELDAEVGDGDL 408


>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
          Length = 866

 Score = 26.4 bits (58), Expect = 2.6
 Identities = 10/37 (27%), Positives = 15/37 (40%), Gaps = 1/37 (2%)

Query: 2   FLRFDLAPIFSEEEFKHWFLPRDG-IVYTFVVENDGE 37
           FL  DL    + +E    F+  D      F + + GE
Sbjct: 823 FLDKDLEFFVASKEEARAFVDFDPEHTVIFEIPDSGE 859


>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
           aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
           other related proteins.  ALDH subfamily which includes
           NAD(P)+-dependent, aldehyde dehydrogenase, family 3
           member A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and
           fatty aldehyde dehydrogenase, family 3 member A2
           (ALDH3A2, EC=1.2.1.3), and also plant ALDH family
           members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
           (YMR110C) and the protozoan family 13 member (ALDH13),
           as well as coniferyl aldehyde dehydrogenases (CALDH,
           EC=1.2.1.68), and other similar  sequences, such as the
           Pseudomonas putida benzaldehyde dehydrogenase I that is
           involved in the metabolism of mandelate.
          Length = 426

 Score = 26.3 bits (59), Expect = 2.9
 Identities = 8/18 (44%), Positives = 10/18 (55%)

Query: 31  VVENDGEITEIILKEGFD 48
           VVE   E+   +L E FD
Sbjct: 161 VVEGGVEVATALLAEPFD 178


>gnl|CDD|235858 PRK06762, PRK06762, hypothetical protein; Provisional.
          Length = 166

 Score = 25.7 bits (57), Expect = 4.2
 Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 5/40 (12%)

Query: 11  FSEEEFKHWFLPRDGIVY---TFVVENDG--EITEIILKE 45
           F E++ + W+ P D +     T   +N    +I + IL +
Sbjct: 122 FGEDDMRRWWNPHDTLGVIGETIFTDNLSLKDIFDAILTD 161


>gnl|CDD|213831 TIGR03578, EF_0831, conserved hypothetical protein
          EF_0831/AHA_3912.  Members of this family of small
          (about 100 amino acid), relatively rare proteins are
          found in both Gram-positive (e.g. Enterococcus
          faecalis) and Gram-negative (e.g. Aeromonas hydrophila)
          bacteria, as part of a cluster of conserved proteins.
          The function is unknown [Hypothetical proteins,
          Conserved].
          Length = 96

 Score = 25.1 bits (55), Expect = 4.8
 Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 14 EEFKHWFLPRDGIVYTFVVENDGEITEIILKE 45
          E+F  +F PR    Y+  ++   ++T I ++E
Sbjct: 60 EKFLFFFFPRKRREYSVELDVTVKVTAIKVEE 91


>gnl|CDD|215664 pfam00027, cNMP_binding, Cyclic nucleotide-binding domain. 
          Length = 91

 Score = 25.0 bits (55), Expect = 5.1
 Identities = 13/52 (25%), Positives = 22/52 (42%), Gaps = 3/52 (5%)

Query: 10 IFSE-EEFKHWFLPRDGIVYTFVVENDGEITEI-ILKEGFDVFNALDLMENK 59
          IF E +     ++   G V  + ++ DG    +  L  G D F  L L+  +
Sbjct: 10 IFREGDPADSLYIVLSGKVKVYKLDEDGREQILAFLGPG-DFFGELALLGGE 60


>gnl|CDD|222086 pfam13378, MR_MLE_C, Enolase C-terminal domain-like.  This domain
           appears at the C-terminus of many of the proteins that
           carry the MR_MLE, pfam01188 and MR_MLE_N pfam02746
           domains. EC:4.2.1.40.
          Length = 111

 Score = 24.8 bits (55), Expect = 6.5
 Identities = 7/25 (28%), Positives = 11/25 (44%)

Query: 15  EFKHWFLPRDGIVYTFVVENDGEIT 39
           E+  + LP D +     V  DG + 
Sbjct: 77  EYVPYLLPDDLLTGPPPVVEDGRLA 101


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.326    0.150    0.479 

Gapped
Lambda     K      H
   0.267   0.0786    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,318,498
Number of extensions: 465637
Number of successful extensions: 461
Number of sequences better than 10.0: 1
Number of HSP's gapped: 461
Number of HSP's successfully gapped: 28
Length of query: 96
Length of database: 10,937,602
Length adjustment: 63
Effective length of query: 33
Effective length of database: 8,143,300
Effective search space: 268728900
Effective search space used: 268728900
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 53 (24.1 bits)