RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy838
         (152 letters)



>gnl|CDD|240332 PTZ00255, PTZ00255, 60S ribosomal protein L37a; Provisional.
          Length = 90

 Score = 97.9 bits (244), Expect = 1e-27
 Identities = 40/53 (75%), Positives = 44/53 (83%)

Query: 1  MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLG 53
          MAKRTKKVGITGKYGTRYGASLRK +KK+EI+QHAKY C FCGK   K   +G
Sbjct: 1  MAKRTKKVGITGKYGTRYGASLRKQIKKIEISQHAKYFCPFCGKHAVKRQAVG 53


>gnl|CDD|216695 pfam01780, Ribosomal_L37ae, Ribosomal L37ae protein family.  This
          ribosomal protein is found in archaebacteria and
          eukaryotes. It contains four conserved cysteine
          residues that may bind to zinc.
          Length = 90

 Score = 93.1 bits (232), Expect = 9e-26
 Identities = 37/47 (78%), Positives = 42/47 (89%)

Query: 2  AKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKK 48
          AKRTKKVGI G++GTRYGASLRK VKK+EI+QHAKYTC FCGK+  K
Sbjct: 1  AKRTKKVGIAGRFGTRYGASLRKRVKKIEISQHAKYTCPFCGKDAVK 47


>gnl|CDD|232904 TIGR00280, L37a, ribosomal protein L37a.  This model finds
          eukaryotic ribosomal protein L37a and its archaeal
          orthologs. The nomeclature is tricky because eukaryotes
          have proteins called both L37 and L37a [Protein
          synthesis, Ribosomal proteins: synthesis and
          modification].
          Length = 92

 Score = 69.5 bits (170), Expect = 2e-16
 Identities = 31/44 (70%), Positives = 35/44 (79%)

Query: 1  MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGK 44
          M KRTKKVGITG++G RYG  LR+ VKK+EI Q AKY C FCGK
Sbjct: 1  MMKRTKKVGITGRFGPRYGLKLRRQVKKIEIQQKAKYVCPFCGK 44


>gnl|CDD|235183 PRK03976, rpl37ae, 50S ribosomal protein L37Ae; Reviewed.
          Length = 90

 Score = 67.7 bits (166), Expect = 9e-16
 Identities = 22/48 (45%), Positives = 28/48 (58%)

Query: 1  MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKK 48
          M  RTKKVG  G++G RYG  +RK V  +E    AK+ C  CG+   K
Sbjct: 1  MMSRTKKVGSAGRFGARYGRKIRKRVADIEEKMRAKHVCPVCGRPKVK 48


>gnl|CDD|224908 COG1997, RPL43A, Ribosomal protein L37AE/L43A [Translation,
          ribosomal structure and biogenesis].
          Length = 89

 Score = 65.5 bits (160), Expect = 6e-15
 Identities = 23/42 (54%), Positives = 32/42 (76%)

Query: 4  RTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45
          +TKKVGI G++G RYG+ LR+ VK++E  Q AK+ C FCG+ 
Sbjct: 3  KTKKVGIAGRFGPRYGSKLRRRVKEIEAQQRAKHVCPFCGRT 44


>gnl|CDD|173926 cd04301, NAT_SF, N-Acyltransferase superfamily: Various enzymes
           that characteristically catalyze the transfer of an acyl
           group to a substrate.  NAT (N-Acyltransferase) is a
           large superfamily of enzymes that mostly catalyze the
           transfer of an acyl group to a substrate and are
           implicated in a variety of functions, ranging from
           bacterial antibiotic resistance to circadian rhythms in
           mammals. Members include GCN5-related
           N-Acetyltransferases (GNAT) such as Aminoglycoside
           N-acetyltransferases, Histone N-acetyltransferase (HAT)
           enzymes, and Serotonin N-acetyltransferase, which
           catalyze the transfer of an acetyl group to a substrate.
           The kinetic mechanism of most GNATs involves the ordered
           formation of a ternary complex: the reaction begins with
           Acetyl Coenzyme A (AcCoA) binding, followed by binding
           of substrate, then direct transfer of the acetyl group
           from AcCoA to the substrate, followed by product and
           subsequent CoA release. Other family members include
           Arginine/ornithine N-succinyltransferase, Myristoyl-CoA:
           protein N-myristoyltransferase, and
           Acyl-homoserinelactone synthase which have a similar
           catalytic mechanism but differ in types of acyl groups
           transferred. Leucyl/phenylalanyl-tRNA-protein
           transferase and FemXAB nonribosomal peptidyltransferases
           which catalyze similar peptidyltransferase reactions are
           also included.
          Length = 65

 Score = 29.9 bits (68), Expect = 0.098
 Identities = 13/52 (25%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 95  YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQA 146
           +LVA++      V F+    D   G    Y  +L +  + + KG+G  +++A
Sbjct: 1   FLVAEDDG--EIVGFASLSPD-GSGGDTAYIGDLAVLPEYRGKGIGSALLEA 49


>gnl|CDD|181411 PRK08393, PRK08393, N-ethylammeline chlorohydrolase; Provisional.
          Length = 424

 Score = 30.5 bits (69), Expect = 0.29
 Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 13/81 (16%)

Query: 34  HAKYTCSF-CGKEY-KKSTDLG-------AETKKWVWELEEKNMKHSYEVCD----IGWD 80
           HA YTCS    K   +K+ +         +ET   + ++ EK  K    + D    +  D
Sbjct: 181 HAPYTCSLALLKWVREKAREWNKLITIHLSETMDEIKQIREKYGKSPVVLLDEIGFLNED 240

Query: 81  PQGKHSEMFDDRACYLVAKNG 101
               H      R   ++A  G
Sbjct: 241 VIAAHGVWLSSRDIRILASAG 261


>gnl|CDD|181193 PRK08006, PRK08006, replicative DNA helicase; Provisional.
          Length = 471

 Score = 29.2 bits (65), Expect = 0.76
 Identities = 12/36 (33%), Positives = 22/36 (61%), Gaps = 3/36 (8%)

Query: 71  SYEVCDIGWDPQGKHSEMFDDRA---CYLVAKNGSS 103
           + E+ D G+DPQG+ SE   D A    + +A++ ++
Sbjct: 139 ANEIADAGYDPQGRTSEDLLDLAESRVFQIAESRAN 174


>gnl|CDD|221780 pfam12804, NTP_transf_3, MobA-like NTP transferase domain.  This
           family includes the MobA protein (Molybdopterin-guanine
           dinucleotide biosynthesis protein A). The family also
           includes a wide range of other NTP transferase domain.
          Length = 178

 Score = 28.7 bits (65), Expect = 1.0
 Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 4/53 (7%)

Query: 95  YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYEL--QLEKQVQRKGLGKFMMQ 145
            L+A   +S   +A     +D   G P+L+   L   LE  +  +GL   +  
Sbjct: 105 RLLAALEASGADIAV--PVYDGGRGHPLLFPRSLLPALEALLGDRGLRSLLRA 155


>gnl|CDD|183178 PRK11525, dinD, DNA-damage-inducible protein D; Provisional.
          Length = 279

 Score = 28.5 bits (64), Expect = 1.4
 Identities = 10/18 (55%), Positives = 12/18 (66%), Gaps = 1/18 (5%)

Query: 93  ACYLVAKNGSSSTPV-AF 109
           ACYL+ +NG    PV AF
Sbjct: 87  ACYLIVQNGDPRKPVIAF 104


>gnl|CDD|181562 PRK08840, PRK08840, replicative DNA helicase; Provisional.
          Length = 464

 Score = 28.0 bits (62), Expect = 2.0
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 73  EVCDIGWDPQGKHSEMFDDRA 93
           E+ D G+DPQG+ SE   D A
Sbjct: 134 EIADAGYDPQGRTSEDLLDMA 154


>gnl|CDD|205438 pfam13258, DUF4049, Domain of unknown function (DUF4049).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria, and is typically
           between 310 and 324 amino acids in length.
          Length = 318

 Score = 28.0 bits (62), Expect = 2.3
 Identities = 18/73 (24%), Positives = 30/73 (41%), Gaps = 5/73 (6%)

Query: 84  KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFG---EPVLYCYEL--QLEKQVQRKG 138
           KH    DD+  +L+  N   STPV  ++F    DF    E +  C +   ++ ++    G
Sbjct: 221 KHKTHMDDKLFHLIRSNSCRSTPVYNNYFSNTTDFRPKLEAIFACGQTLKKINQKYGHNG 280

Query: 139 LGKFMMQAFYTYI 151
            G    Q     +
Sbjct: 281 NGVDECQGEDNTV 293


>gnl|CDD|198062 smart00994, zf-C4_ClpX, ClpX C4-type zinc finger.  The ClpX heat
          shock protein of Escherichia coli is a member of the
          universally conserved Hsp100 family of proteins, and
          possesses a putative zinc finger motif of the C4 type.
          This presumed zinc binding domain is found at the
          N-terminus of the ClpX protein. ClpX is an ATPase which
          functions both as a substrate specificity component of
          the ClpXP protease and as a molecular chaperone. The
          molecular function of this domain is now known.
          Length = 39

 Score = 24.8 bits (55), Expect = 4.3
 Identities = 6/9 (66%), Positives = 6/9 (66%)

Query: 37 YTCSFCGKE 45
            CSFCGK 
Sbjct: 1  LRCSFCGKS 9


>gnl|CDD|203502 pfam06689, zf-C4_ClpX, ClpX C4-type zinc finger.  The ClpX heat
          shock protein of Escherichia coli is a member of the
          universally conserved Hsp100 family of proteins, and
          possesses a putative zinc finger motif of the C4 type.
          This presumed zinc binding domain is found at the
          N-terminus of the ClpX protein. ClpX is an ATPase which
          functions both as a substrate specificity component of
          the ClpXP protease and as a molecular chaperone. The
          molecular function of this domain is now known.
          Length = 39

 Score = 24.4 bits (54), Expect = 5.1
 Identities = 6/9 (66%), Positives = 6/9 (66%)

Query: 37 YTCSFCGKE 45
            CSFCGK 
Sbjct: 1  LRCSFCGKS 9


>gnl|CDD|225156 COG2247, LytB, Putative cell wall-binding domain [Cell envelope
           biogenesis, outer membrane].
          Length = 337

 Score = 26.7 bits (59), Expect = 5.5
 Identities = 13/60 (21%), Positives = 24/60 (40%), Gaps = 4/60 (6%)

Query: 78  GWDPQGKHSEMFDDRACYLVAKNGSSSTP----VAFSHFRFDVDFGEPVLYCYELQLEKQ 133
           GWD      E+  +    ++ KN S        +A +   +  + G P+LY  +L  +  
Sbjct: 140 GWDYADALMELMKEGIVPVILKNTSILVRWSRKLALAESPYAANIGCPILYNTKLTEKNV 199


>gnl|CDD|226768 COG4318, COG4318, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 221

 Score = 26.3 bits (58), Expect = 6.6
 Identities = 14/59 (23%), Positives = 20/59 (33%), Gaps = 12/59 (20%)

Query: 59  WVWELEEKNMKHSYE-----VCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHF 112
           WV   + +  +  YE     +  +G DP       +   A YL    G       FS F
Sbjct: 123 WVHPFDARGARRPYEAIPKALAGLGDDP-------YRSLAGYLRDAGGFDKDQTPFSEF 174


>gnl|CDD|117424 pfam08857, ParBc_2, Putative ParB-like nuclease.  This domain is
           probably distantly related to pfam02195. Suggesting
           these uncharacterized proteins have a nuclease function.
          Length = 162

 Score = 26.1 bits (58), Expect = 6.9
 Identities = 15/63 (23%), Positives = 23/63 (36%), Gaps = 10/63 (15%)

Query: 55  ETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDR----ACYLVAKNGSSSTPVAFS 110
           +   WV   +E+  +  Y    +   P+     + DD     A +L    G    P  FS
Sbjct: 98  DNNGWVHPYDERGERRPYS--AL---PKSLG-GLEDDPYRSLAGFLRDAGGYKKDPTPFS 151

Query: 111 HFR 113
            FR
Sbjct: 152 EFR 154


>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
          Length = 389

 Score = 26.3 bits (59), Expect = 8.2
 Identities = 7/22 (31%), Positives = 12/22 (54%), Gaps = 1/22 (4%)

Query: 22  LRKMVKKMEITQHAKYTCSFCG 43
           LR +V +  + +  +Y C  CG
Sbjct: 341 LRDLVGEQ-LKRKPRYRCRNCG 361


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.134    0.419 

Gapped
Lambda     K      H
   0.267   0.0797    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,452,591
Number of extensions: 624747
Number of successful extensions: 500
Number of sequences better than 10.0: 1
Number of HSP's gapped: 500
Number of HSP's successfully gapped: 30
Length of query: 152
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 64
Effective length of database: 7,034,450
Effective search space: 450204800
Effective search space used: 450204800
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (24.8 bits)