RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1312
         (213 letters)



>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score =  192 bits (491), Expect = 4e-62
 Identities = 102/235 (43%), Positives = 133/235 (56%), Gaps = 51/235 (21%)

Query: 30  ESVLKKRKKRENSRIISLHKKLTALANRKKKTREIFIRAEKYVKAYRAKERDEIRLMREA 89
           E +LKKRK R+   +    +        KKK +  F RAE +VK YR  ER+ IRL R A
Sbjct: 1   ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLA 60

Query: 90  KKRGNYYIPGEAKLAFVIRIRGINRVAPKVRKTLQLLRLRQINNGIFVRLNKVSV----- 144
           KK G +Y+P E KL FVIRI+GIN + PK RK L+LLRL+Q+NNG+FV++ K ++     
Sbjct: 61  KKPGKFYVPAEHKLVFVIRIKGINGIPPKPRKVLRLLRLKQVNNGVFVKVTKATLEMLRI 120

Query: 145 --------------------------------------------GKQNIICVEDLIHEVF 160
                                                       GK  IIC+EDLIHE++
Sbjct: 121 VEPYVAYGYPNLKSVRELIYKRGFAKINGQRVPLTDNTIIEQHLGKYGIICIEDLIHEIY 180

Query: 161 TVGDNFKYASNALWPFKLNNPTGGW--RKKTIHYVEGGDFGNREDKINKLLRKMV 213
           TVG +FK  +N LWPFKL++P  GW  RKK  H++EGGDFGNRED IN+L++KM 
Sbjct: 181 TVGPHFKEVNNFLWPFKLSSPVKGWGKRKKVKHFIEGGDFGNREDFINELIKKMN 235


>gnl|CDD|100099 cd01657, Ribosomal_L7_archeal_euk, Ribosomal protein L7, which is
           found in archaea and eukaryotes but not in prokaryotes,
           binds domain II of the 23S rRNA as well as the 5S rRNA
           and is one of five ribosomal proteins that mediate the
           interactions 5S rRNA makes with the ribosome.  The
           eukaryotic L7 members have an N-terminal extension not
           found in the archeal L7 orthologs.  L7 is closely
           related to the ribosomal L30 protein found in eukaryotes
           and prokaryotes.
          Length = 159

 Score =  144 bits (365), Expect = 5e-44
 Identities = 62/159 (38%), Positives = 85/159 (53%), Gaps = 48/159 (30%)

Query: 103 LAFVIRIRGINRVAPKVRKTLQLLRLRQINNGIFVRLNKVSVG----------------- 145
           L  V+RIRGI  V PK+RKTLQLLRLR+INN +FV+L K ++G                 
Sbjct: 1   LYAVVRIRGIVGVPPKIRKTLQLLRLRRINNAVFVKLTKATIGMLKKVEPYVTWGEPNLE 60

Query: 146 -------------------------------KQNIICVEDLIHEVFTVGDNFKYASNALW 174
                                          K+ IIC+EDL HE++T G NFK  +N L 
Sbjct: 61  TLRELIYKRGRLKGNGQRIPLTDNYLVEKTLKKGIICIEDLAHEIYTGGPNFKEVNNFLP 120

Query: 175 PFKLNNPTGGWRKKTIHYVEGGDFGNREDKINKLLRKMV 213
           PF+L+ P GG++ K  H+ EGG+ G R + IN+L+R+M+
Sbjct: 121 PFRLHPPRGGFKGKKRHFSEGGELGYRGEDINELIRRMI 159


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 77.3 bits (191), Expect = 5e-19
 Identities = 43/71 (60%), Positives = 50/71 (70%)

Query: 31  SVLKKRKKRENSRIISLHKKLTALANRKKKTREIFIRAEKYVKAYRAKERDEIRLMREAK 90
           S+LKKRK+ E  R     K+    A RKKK + IF RAEKYVK YRA ERDEIRL REAK
Sbjct: 1   SLLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAK 60

Query: 91  KRGNYYIPGEA 101
           K+GN+Y+P E 
Sbjct: 61  KKGNFYVPAEP 71


>gnl|CDD|235683 PRK06049, rpl30p, 50S ribosomal protein L30P; Reviewed.
          Length = 154

 Score = 54.4 bits (132), Expect = 1e-09
 Identities = 42/150 (28%), Positives = 63/150 (42%), Gaps = 42/150 (28%)

Query: 106 VIRIRGINRVAPKVRKTLQLLRLRQINNGIFVR--------LNKVS----VGKQNIICVE 153
           VIRIRG   V   +  TL++LRL ++N+ + V         L KV      G+ +   + 
Sbjct: 5   VIRIRGRVNVPRDIADTLKMLRLHRVNHCVLVPETPSYKGMLQKVKDYVTWGEIDADTLA 64

Query: 154 DLI-HEVFTVGD----------NFKYAS-----NALW--------------PFKLNNPTG 183
           +L+       G+          N  Y S      AL                F+L+ P G
Sbjct: 65  ELLRKRGRLEGNKKLTDEYVKENTGYDSIEELAEALVEGEIKLKDLPGLKPVFRLHPPRG 124

Query: 184 GWRKKTIHYVEGGDFGNREDKINKLLRKMV 213
           G+      + EGG+ G R +KIN+LLR+M 
Sbjct: 125 GFGGIKRPFKEGGELGYRGEKINELLRRMR 154


>gnl|CDD|201157 pfam00327, Ribosomal_L30, Ribosomal protein L30p/L7e.  This family
           includes prokaryotic L30 and eukaryotic L7.
          Length = 52

 Score = 49.7 bits (120), Expect = 9e-09
 Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 103 LAFVIRIRGINRVAPKVRKTLQLLRLRQINNGIFVRLNKVSVGKQNIICVEDLI 156
           L  + RIR I    PK +KTL+LL LR+IN  +FV+      G      V+ L+
Sbjct: 1   LLKITRIRSIIGRPPKQKKTLKLLGLRKINQTVFVKDTPAIRGMLRK--VKHLV 52


>gnl|CDD|100098 cd00355, Ribosomal_L30_like, Ribosomal protein L30, which is found
           in eukaryotes and prokaryotes but not in archaea, is one
           of the smallest ribosomal proteins with a molecular mass
           of about 7kDa. L30 binds the 23SrRNA as well as the 5S
           rRNA and is one of five ribosomal proteins that mediate
           the interactions 5S rRNA makes with the ribosome.  The
           eukaryotic L30 members have N- and/or C-terminal
           extensions not found in their prokaryotic orthologs.
           L30 is closely related to the ribosomal L7 protein found
           in eukaryotes and archaea.
          Length = 53

 Score = 49.3 bits (119), Expect = 1e-08
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 105 FVIRIRGINRVAPKVRKTLQLLRLRQINNGIFVRLNKVSVGKQNIICVEDLI 156
            V R+R +    PK RKTL+ L LR+IN  +FV+      G      V+ L+
Sbjct: 1   KVTRVRSLIGRPPKQRKTLKALGLRKINQTVFVKDTPSIRGMLRK--VKHLV 50


>gnl|CDD|224754 COG1841, RpmD, Ribosomal protein L30/L7E [Translation, ribosomal
           structure and biogenesis].
          Length = 55

 Score = 45.6 bits (109), Expect = 3e-07
 Identities = 16/37 (43%), Positives = 21/37 (56%)

Query: 105 FVIRIRGINRVAPKVRKTLQLLRLRQINNGIFVRLNK 141
            V  IR      PK+RKTL+LL LR+IN+ + V    
Sbjct: 2   KVTLIRSPIGRPPKIRKTLRLLGLRKINHTVIVEDTP 38


>gnl|CDD|130376 TIGR01309, L30P_arch, 50S ribosomal protein L30P, archaeal.  This
           model represents the archaeal ribosomal protein similar
           to longer (~ 250 residue) eukaryotic 60S ribosomal
           protein L7 and to the much shorter (~ 60 residue)
           bacterial 50S ribosomal protein L30. Protein naming
           follows the SwissProt designation as L30P, while the
           gene symbol rpmD follows TIGR usage [Protein synthesis,
           Ribosomal proteins: synthesis and modification].
          Length = 152

 Score = 43.9 bits (104), Expect = 9e-06
 Identities = 39/152 (25%), Positives = 59/152 (38%), Gaps = 46/152 (30%)

Query: 106 VIRIRGINRVAPKVRKTLQLLRLRQINNGIFVR--------LNKVS-------------- 143
           VIRIRG   V   +  TL++LRL ++N+ +           L KV               
Sbjct: 3   VIRIRGTVDVPRDIADTLEMLRLHRVNHCVLYPETPTYLGMLQKVKDYVTWGEIDEDTLE 62

Query: 144 ---------VGKQNII-----------CVEDLIHEVFTVGDNFKYASNALWPFKLNNPTG 183
                    VG   +             V++L   +         A      F+L+ P  
Sbjct: 63  LLIRKRGRLVGGDKVTDEYVKEVTGYESVDELAKALVEGEIKLSEAGLKP-VFRLHPPRK 121

Query: 184 GWRK--KTIHYVEGGDFGNREDKINKLLRKMV 213
           G++   KT  Y +GG+ G R +KIN+LLR+M 
Sbjct: 122 GFKGGIKT-PYRDGGELGYRGEKINELLRRMR 152


>gnl|CDD|232945 TIGR00376, TIGR00376, DNA helicase, putative.  The gene product may
           represent a DNA helicase. Eukaryotic members of this
           family have been characterized as binding certain
           single-stranded G-rich DNA sequences (GGGGT and GGGCT).
           A number of related proteins are characterized as
           helicases [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 637

 Score = 32.9 bits (75), Expect = 0.13
 Identities = 21/88 (23%), Positives = 36/88 (40%), Gaps = 4/88 (4%)

Query: 29  PESVLKKRKKRENSRIISLHKKLTALANRKKKTREIFIRAEKYVKAYRAKER--DEIRLM 86
           P  +LK  K+     +I  H K   +A+ ++K  E+     K +K    K R   +I+++
Sbjct: 234 PARLLKSNKQHSLDYLIENHPKYQIVADIREKIDELIEERNKKLKPSPQKRRGLSDIKIL 293

Query: 87  REAKKRGNYYIPGEAKLAFVIRIRGINR 114
           R+A K+      G   L        I  
Sbjct: 294 RKALKKREA--RGIESLKIASMAEWIET 319


>gnl|CDD|234622 PRK00085, recO, DNA repair protein RecO; Reviewed.
          Length = 247

 Score = 29.5 bits (67), Expect = 1.1
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 6/37 (16%)

Query: 32 VLKKRKKRENSRIISL----HKKLTALA--NRKKKTR 62
          VL  R   E S I++L    H ++ A+A   R+ K+R
Sbjct: 10 VLHTRPYGETSLIVTLFTREHGRVRAVAKGARRPKSR 46


>gnl|CDD|232820 TIGR00089, TIGR00089, radical SAM methylthiotransferase, MiaB/RimO
           family.  This subfamily contains the tRNA-i(6)A37
           modification enzyme, MiaB (TIGR01574). The phylogenetic
           tree indicates 4 distinct clades, one of which
           corresponds to MiaB. The other three clades are modelled
           by hypothetical equivalogs (TIGR01125, TIGR01579 and
           TIGR01578). Together, the four models hit every sequence
           hit by the subfamily model without any overlap between
           them. This subfamily is aparrently a part of a larger
           superfamily of enzymes utilizing both a 4Fe4S cluster
           and S-adenosyl methionine (SAM) to initiate radical
           reactions. MiaB acts on a particular isoprenylated
           Adenine base of certain tRNAs causing thiolation at an
           aromatic carbon, and probably also transferring a methyl
           grouyp from SAM to the thiol. The particular substrate
           of the three other clades is unknown but may be very
           closely related.
          Length = 429

 Score = 29.1 bits (66), Expect = 1.6
 Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 6/57 (10%)

Query: 28  VPESVLKKRKKRENSRIISLHKKLTALANRKK--KTREIFIRAEKYVKAYRAKERDE 82
           VPE V K+R +    R+I+L K+++   N+K   KT E+ +  E+  K      R E
Sbjct: 343 VPEEVKKERLE----RLIALQKEISLEKNKKYVGKTLEVLVEGEEGKKEGELTGRTE 395


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 10/93 (10%)

Query: 5   EKKAATAGKPKEPKPKTKKGLPVVPESVLKKRKKRENSRIISLHKKLTALANRKKKTREI 64
           +  AA + K   P+PK K        ++L   +++ +      +K+ T   + + K REI
Sbjct: 106 DPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEAT---HERLKEREI 162

Query: 65  FIRAEKYVKAYRAKERDEIR------LMREAKK 91
             R +   KA + KE+ + +       + EAK+
Sbjct: 163 -RRKKIQAKARKRKEKKKEKELTQEERLAEAKE 194


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.3 bits (65), Expect = 1.7
 Identities = 23/91 (25%), Positives = 44/91 (48%)

Query: 2    ADVEKKAATAGKPKEPKPKTKKGLPVVPESVLKKRKKRENSRIISLHKKLTALANRKKKT 61
            A+  KKA    K +E K   ++      E   ++ +K E +R+    ++  A+   + + 
Sbjct: 1220 AEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK 1279

Query: 62   REIFIRAEKYVKAYRAKERDEIRLMREAKKR 92
             +   +AE+  KA  AK+ +E +   EAKK+
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 28.9 bits (65), Expect = 2.3
 Identities = 27/94 (28%), Positives = 45/94 (47%), Gaps = 7/94 (7%)

Query: 30  ESVLKKRKKRENSRIISLHK---KLTALANRKKKTREIFIRAEKYVKAYRAKERDEIRLM 86
           E    +RKK       SL K   +L+ L   + + R+   RAE+  KA   +E  E   +
Sbjct: 211 EQARNERKKTLTGLESSLQKDQQQLSELRANESRLRDSIARAEREAKARAEREAREAARV 270

Query: 87  R----EAKKRGNYYIPGEAKLAFVIRIRGINRVA 116
           R    +AK++G+ Y P E++ + + R  G+ R  
Sbjct: 271 RDKQKQAKRKGSTYKPTESERSLMSRTGGLGRPR 304


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 28.3 bits (63), Expect = 2.8
 Identities = 12/39 (30%), Positives = 15/39 (38%)

Query: 6   KKAATAGKPKEPKPKTKKGLPVVPESVLKKRKKRENSRI 44
           KK  +   PK PKPK  K    V    +K       S+ 
Sbjct: 151 KKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKP 189


>gnl|CDD|221346 pfam11967, RecO_N, Recombination protein O N terminal.
          Recombination protein O (RecO) is involved in DNA
          repair and pfam00470 pathway recombination. This domain
          forms a beta barrel structure.
          Length = 80

 Score = 26.4 bits (59), Expect = 3.5
 Identities = 10/35 (28%), Positives = 20/35 (57%), Gaps = 4/35 (11%)

Query: 32 VLKKRKKRENSRIISL----HKKLTALANRKKKTR 62
          VL+ R   E+ +I++L    H K++ +A   +K +
Sbjct: 10 VLRTRDYGESDKIVTLFTREHGKISGVARGARKPK 44


>gnl|CDD|224837 COG1926, COG1926, Predicted phosphoribosyltransferases [General
           function prediction only].
          Length = 220

 Score = 27.7 bits (62), Expect = 4.3
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 68  AEKYVKAYRAKERDEIRLMREAKKRGNYYIPGEAKLAFVI 107
            + Y++A  A+ER E+   REA + G      + +   ++
Sbjct: 91  DDAYIEAAAARERKELLRRREAYRGGRPVPSLKGRTVILV 130


>gnl|CDD|223085 COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and
           metabolism].
          Length = 384

 Score = 27.8 bits (62), Expect = 4.4
 Identities = 9/39 (23%), Positives = 13/39 (33%), Gaps = 1/39 (2%)

Query: 52  TALANRKKKTREIFIRAEKYVKAYRA-KERDEIRLMREA 89
                +    R   + A   V   R  K   EI  +R+A
Sbjct: 128 AFERLQAALPRAELVDASDLVDRLRLIKSPAEIAKIRKA 166


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 27.3 bits (61), Expect = 5.4
 Identities = 11/27 (40%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 13  KPKEPKPKTKKGLPVVPESVLKKRKKR 39
           KPK PKPK  K +   P+  +K  + R
Sbjct: 102 KPK-PKPKPVKKVEEQPKREVKPVEPR 127


>gnl|CDD|240028 cd04671, Nudix_Hydrolase_13, Members of the Nudix hydrolase
          superfamily catalyze the hydrolysis of NUcleoside
          DIphosphates linked to other moieties, X. Enzymes
          belonging to this superfamily require a divalent
          cation, such as Mg2+ or Mn2+, for their activity and
          contain a highly conserved 23-residue nudix motif
          (GX5EX7REUXEEXGU, where U = I, L or V), which functions
          as a metal binding and catalytic site. Substrates of
          nudix hydrolases include intact and oxidatively damaged
          nucleoside triphosphates, dinucleoside polyphosphates,
          nucleotide-sugars and dinucleotide enzymes. These
          substrates are metabolites or cell signaling molecules
          that require regulation during different stages of the
          cell cycle or during periods of stress. In general, the
          role of the nudix hydrolase is to sanitize the
          nucleotide pools and to maintain cell viability,
          thereby serving as surveillance & "house-cleaning"
          enzymes. Substrate specificity is used to define
          families within the superfamily. Differences in
          substrate specificity are determined by the N-terminal
          extension or by residues in variable loop regions.
          Mechanistically, substrate hydrolysis occurs by a
          nucleophilic substitution reaction, with variation in
          the numbers and roles of divalent cations required.
          Length = 123

 Score = 26.6 bits (59), Expect = 6.2
 Identities = 9/22 (40%), Positives = 16/22 (72%), Gaps = 2/22 (9%)

Query: 79 ERDEIRLMREAKK--RGNYYIP 98
           + E+ L++EAK+  RG +Y+P
Sbjct: 10 NQGEVLLIQEAKRSCRGKWYLP 31


>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily).  This Pfam
           entry includes some of the AAA proteins not detected by
           the pfam00004 model.
          Length = 135

 Score = 26.5 bits (59), Expect = 6.2
 Identities = 16/71 (22%), Positives = 28/71 (39%), Gaps = 7/71 (9%)

Query: 71  YVKAYRAKERDEIRLMREAKKRGNYYIPGE----AKLAFVIRIRGINRVAPKVRKTLQLL 126
           YV+  R    ++++  R     G  ++ G     A+   +  +  INR  P V  +L  L
Sbjct: 30  YVQLTRDTTEEDLKGRRNIDPGGASWVDGPLVRAAREGEIAVLDEINRANPDVLNSLLSL 89

Query: 127 ---RLRQINNG 134
              R   +  G
Sbjct: 90  LDERRLLLPEG 100


>gnl|CDD|180892 PRK07220, PRK07220, DNA topoisomerase I; Validated.
          Length = 740

 Score = 27.4 bits (61), Expect = 6.8
 Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 5   EKKAATAGKPKEPKPKTKKGLPVVPESVLKKRKKRENSRIISLHKKLTALANRKKKTREI 64
           ++K   A +PK+ KPK+ K +  + +   K  +K E + I ++     A      KT +I
Sbjct: 671 KQKEEKAQQPKKEKPKSIKDIKGIGK---KTAEKLEEAGITTVEALAEADPVELSKTTKI 727

Query: 65  FIRAEK 70
            ++  K
Sbjct: 728 SVKKIK 733


>gnl|CDD|237676 PRK14329, PRK14329, (dimethylallyl)adenosine tRNA
           methylthiotransferase; Provisional.
          Length = 467

 Score = 27.3 bits (61), Expect = 7.2
 Identities = 16/41 (39%), Positives = 24/41 (58%), Gaps = 6/41 (14%)

Query: 28  VPESVLKKRKKRENSRIISLHKKLTALANRKK--KTREIFI 66
           VPE V    KKR  + II+L ++L+   N++   KT E+ I
Sbjct: 378 VPEEV----KKRRLNEIIALQQELSLERNQRDIGKTFEVLI 414


>gnl|CDD|220086 pfam08982, DUF1857, Domain of unknown function (DUF1857).  This
           domain has no known function. It is found in various
           hypothetical bacterial and fungal proteins.
          Length = 147

 Score = 26.1 bits (58), Expect = 8.7
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 68  AEKYVKAYRAKERDEIRLMRE 88
            E   +AY+A + D +R +RE
Sbjct: 123 QEIVKQAYKAADIDTVRTIRE 143


>gnl|CDD|176480 cd08643, DNA_pol_A_pol_I_B, Polymerase I functions primarily to
           fill DNA gaps that arise during DNA repair,
           recombination and replication.  Family A polymerase
           functions primarily to fill DNA gaps that arise during
           DNA repair, recombination and replication. DNA-dependent
           DNA polymerases can be classified in six main groups
           based upon phylogenetic relationships with E. coli
           polymerase I (classA), E. coli polymerase II (class B),
           E.coli polymerase III (class C), euryarchaaeota
           polymerase II (class D), human polymerase  beta (class
           x), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma
           pigmentosum variant (class Y). Family A polymerase are
           found primarily in organisms related to prokaryotes and
           include prokaryotic DNA polymerase I ,mitochondrial
           polymerase delta, and several bacteriphage polymerases
           including those from odd-numbered phage (T3, T5, and
           T7). Prokaryotic Pol Is have two functional domains
           located on the same polypeptide; a 5'-3' polymerase and
           5'-3' exonuclease. Pol I uses its 5' nuclease activity
           to remove the ribonucleotide portion of newly
           synthesized Okazaki fragments and DNA polymerase
           activity to fill in the resulting gap. A combination of
           phylogenomic and signature sequence-based (or phonetic)
           approaches is used to understand the evolutionary
           relationships among bacteria. DNA polymerase I is one of
           the conserved proteins that is used to search for
           protein signatures. The structure of these polymerases
           resembles in overall morphology a cupped human right
           hand, with fingers (which bind an incoming nucleotide
           and interact with the single-stranded template), palm
           (which harbors the catalytic amino acid residues and
           also binds an incoming dNTP) and thumb (which binds
           double-stranded DNA) subdomains.
          Length = 429

 Score = 27.0 bits (60), Expect = 8.9
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 3/48 (6%)

Query: 16  EPKPKTKKGLPVVPESVLKKRKKREN---SRIISLHKKLTALANRKKK 60
           EP+  T+ G P V E VL K    E    +  + + K+L  LA+    
Sbjct: 84  EPQEFTESGEPKVDEDVLSKLDYPEAKLLAEYLLVQKRLGQLADGNNA 131


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.390 

Gapped
Lambda     K      H
   0.267   0.0666    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,008,062
Number of extensions: 1060165
Number of successful extensions: 1333
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1297
Number of HSP's successfully gapped: 91
Length of query: 213
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 120
Effective length of database: 6,812,680
Effective search space: 817521600
Effective search space used: 817521600
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.9 bits)