RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9595
         (232 letters)



>gnl|CDD|128818 smart00545, JmjN, Small domain found in the jumonji family of
           transcription factors.  To date, this domain always
           co-occurs with the JmjC domain (although the reverse is
           not true).
          Length = 42

 Score = 59.2 bits (144), Expect = 3e-12
 Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 72  EIMVFRPTYEEFKDFSKFITYMESQGAHKAGLAKVIPPPEWVP 114
           EI VF PT EEFKD   +I+ +  Q A K G+ KV+PP  W P
Sbjct: 1   EIPVFYPTMEEFKDPLAYISKIRPQ-AEKYGICKVVPPKSWKP 42


>gnl|CDD|190295 pfam02375, JmjN, jmjN domain. 
          Length = 34

 Score = 57.0 bits (139), Expect = 1e-11
 Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 74  MVFRPTYEEFKDFSKFITYMESQGAHKAGLAKVIP 108
            VFRPT EEFKD  K+I  +   G  K G+ KV+P
Sbjct: 1   PVFRPTMEEFKDPIKYIEKIRPLGE-KYGICKVVP 34


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 33.2 bits (76), Expect = 0.10
 Identities = 16/80 (20%), Positives = 20/80 (25%), Gaps = 11/80 (13%)

Query: 12  PPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERKH 71
           P  KK+T+     K  K   R       V  E   S   E             ++Q R  
Sbjct: 66  PKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNER-----------EIRQLRFG 114

Query: 72  EIMVFRPTYEEFKDFSKFIT 91
              V  P           I 
Sbjct: 115 GSKVQNPHEGARVKNINEIK 134


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 32.2 bits (74), Expect = 0.14
 Identities = 15/57 (26%), Positives = 22/57 (38%)

Query: 14  AKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERK 70
             KK +AA   + A ++ R P  K    K K  +   E      +  + L V Q  K
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVK 174


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 32.0 bits (73), Expect = 0.15
 Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 4/50 (8%)

Query: 5  SEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGE 54
           ED E   P + K K   T     +KP T  K  K  KEK     ++   
Sbjct: 40 PEDIEIPSPKQPKKKRPTT----PRKPATTKKSKKKDKEKLTEEEKKPES 85


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 31.2 bits (71), Expect = 0.34
 Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 3/37 (8%)

Query: 3   SSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTK 39
           SSSE     P AK K K A   +   K  +T  K  +
Sbjct: 239 SSSEA---KPKAKAKPKTAGIKRPVVKMEKTAEKGGR 272


>gnl|CDD|226382 COG3864, COG3864, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 396

 Score = 31.0 bits (70), Expect = 0.54
 Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 1/73 (1%)

Query: 154 QMTVREYGAMANKPRYATPKHFDYEDLERKYWKNITYVSPIYGADVSGSITDEDVNVWNI 213
           QM ++ +     K    T    D      +   +   +  +   D SGS+TD +++   +
Sbjct: 225 QMYLKRFIGTLPKDYKKTITRKDRRQPYLRGKLSEHLIKIVVAVDTSGSMTDAEIDAA-M 283

Query: 214 NHLGTILDYVNED 226
             +  IL   N +
Sbjct: 284 TEIFDILKNKNYE 296


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 30.6 bits (69), Expect = 0.59
 Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 4/55 (7%)

Query: 9   EPSPPAKK--KTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKT 61
           +    AKK  K KAA T   AKK  +T +K  K  K+ A+  ++     K+  K 
Sbjct: 218 KAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALK--KAAKAVKKA 270



 Score = 29.9 bits (67), Expect = 1.1
 Identities = 14/60 (23%), Positives = 19/60 (31%)

Query: 11  SPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERK 70
               K K  A    K A K  +  +KK      KA  G  +  + K+  K     K    
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAG 306


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 30.6 bits (69), Expect = 0.65
 Identities = 16/36 (44%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 3   SSSEDFEPSPPAKKKTK-AAATGKGAKKKPRTPSKK 37
           S SED    PP K K K +      A KKP+ P KK
Sbjct: 380 SFSEDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKK 415



 Score = 27.9 bits (62), Expect = 5.8
 Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 7/64 (10%)

Query: 10  PSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQER 69
            +P A    K  A GK    KP       +V  + A+         + T K     K E 
Sbjct: 146 VAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKK-------TQDTSKETTTEKTEG 198

Query: 70  KHEI 73
           K  +
Sbjct: 199 KTSV 202


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
          subunit.  This is a family of proteins which are
          subunits of the eukaryotic translation initiation
          factor 3 (eIF3). In yeast it is called Hcr1. The
          Saccharomyces cerevisiae protein eIF3j (HCR1) has been
          shown to be required for processing of 20S pre-rRNA and
          binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.0 bits (68), Expect = 0.75
 Identities = 16/53 (30%), Positives = 24/53 (45%)

Query: 4  SSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVK 56
          S ++ E     ++K K AA  K  K       +K K  +EK E G++E  E  
Sbjct: 36 SWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDT 88


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 30.5 bits (69), Expect = 0.93
 Identities = 11/40 (27%), Positives = 16/40 (40%), Gaps = 1/40 (2%)

Query: 1   GASSSEDFEP-SPPAKKKTKAAATGKGAKKKPRTPSKKTK 39
                 +FEP S    +K       K   +K + PS KT+
Sbjct: 756 QVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQ 795


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.1 bits (68), Expect = 1.0
 Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 9/89 (10%)

Query: 15  KKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERKHEIM 74
             K           KK     KK K  K++ +S  + +   K+  K+    K+  K +  
Sbjct: 53  STKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPK-- 110

Query: 75  VFRPTYEEFKDFSKFITYMESQ----GAH 99
              P  +    F+K     E      GAH
Sbjct: 111 ---PNEDVDNAFNKIAELAEKSNVYIGAH 136


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 29.5 bits (66), Expect = 1.8
 Identities = 26/107 (24%), Positives = 34/107 (31%), Gaps = 7/107 (6%)

Query: 17  KTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQE-RKHEIMV 75
           K       K   KK     K  K +K +A   V+ + E K     +    +   K     
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170

Query: 76  FRPTYEEF--KDFSKFITYMESQGAHKA----GLAKVIPPPEWVPRK 116
           F+   E F  K F + I          A    GL KV    E V R 
Sbjct: 171 FKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRH 217


>gnl|CDD|219549 pfam07744, SPOC, SPOC domain.  The SPOC (Spen paralogue and
           orthologue C-terminal) domain is involved in
           developmental signalling.
          Length = 109

 Score = 28.0 bits (63), Expect = 1.8
 Identities = 12/45 (26%), Positives = 21/45 (46%), Gaps = 5/45 (11%)

Query: 72  EIMVFRPTY---EEFKDFSKFITYMESQGAHKAGLAKVIPPPEWV 113
            ++V   +     +   F + I Y++S+    AG+AKV  P   V
Sbjct: 60  AVVVLALSPDSESDRAAFDELIDYLQSKQR--AGVAKVGDPGSQV 102


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 29.4 bits (67), Expect = 1.9
 Identities = 9/37 (24%), Positives = 12/37 (32%)

Query: 11  SPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAES 47
               +    A A     KK     S+KTK     +E 
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 28.8 bits (64), Expect = 2.3
 Identities = 19/73 (26%), Positives = 27/73 (36%), Gaps = 7/73 (9%)

Query: 11  SPPAKKKTKAAAT--GKGAKKKPRTPS-----KKTKVVKEKAESGVQEEGEVKSTIKTLL 63
           +   KK  K  +T   K A K  +T S     KK  V  EK ES  +E    K+  K  +
Sbjct: 29  TAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAV 88

Query: 64  PVKQERKHEIMVF 76
             +       +  
Sbjct: 89  SAEVFEASNKLFK 101


>gnl|CDD|235860 PRK06778, PRK06778, hypothetical protein; Validated.
          Length = 289

 Score = 28.7 bits (64), Expect = 2.6
 Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 11  SPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTL 62
           SP    K    A  +  +KK R  ++KT ++K+K+ +   E GE+ ++I T+
Sbjct: 76  SPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSAT---ELGELATSINTI 124


>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA.  FadA (Fusobacterium
           adhesin A) is an adhesin which forms two alpha helices.
          Length = 126

 Score = 27.6 bits (61), Expect = 2.9
 Identities = 15/68 (22%), Positives = 30/68 (44%), Gaps = 2/68 (2%)

Query: 36  KKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERKHEIMVFRPTYEEF-KDFSKFITYME 94
           ++ K  KE AE  VQE  E +   + L    +E   +    +  Y++  K + +++  +E
Sbjct: 48  EEEKQEKETAEKEVQELKERQLGREELYAKLKEES-KTRFHKDEYKKLLKRYDEYLNKLE 106

Query: 95  SQGAHKAG 102
            +   K  
Sbjct: 107 QKITDKEQ 114


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 28.8 bits (64), Expect = 3.0
 Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 16/97 (16%)

Query: 14  AKKKTKAAATGKGAKKKPRTPS-------KKTKVVKEKAESGVQEEGEVKSTIKTL--LP 64
           A+ KTK A   K  + K   PS       K  +V+KEK    V E+  +    K +  L 
Sbjct: 251 ARAKTKGAEGKKEEEPKQEEPSVNQPYLNKYVEVIKEKLGEDVLEDSYINKLSKDVPTLV 310

Query: 65  VKQERKHEIMVFRPTYEE--FKDFSK-----FITYME 94
           V+ E+ +E+      +E+  F   S+     F+T+ME
Sbjct: 311 VEPEKYYEVAELLRFHEQLAFDYLSELHGTDFVTHME 347


>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein.  A novel
           genetic system characterized by six major proteins,
           included a ParB homolog and a ThiF homolog, is
           designated PRTRC, or ParB-Related,ThiF-Related Cassette.
           It is often found on plasmids. This protein family the
           member related to ParB, and is designated PRTRC system
           ParB family protein.
          Length = 554

 Score = 28.5 bits (64), Expect = 3.2
 Identities = 10/46 (21%), Positives = 11/46 (23%)

Query: 12  PPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKS 57
            PA          K         S   K   +KA    Q    VK 
Sbjct: 334 KPAAPAAGPGTPAKEKSPAETATSGAAKPAAKKAVPSSQPSNRVKD 379


>gnl|CDD|239002 cd02047, HCII, Heparin cofactor II (HCII) inhibits thrombin, the
           final protease of the coagulation cascade. HCII is
           allosterically activated by binding to cell surface
           glycosaminoglycans (GAGs). The specificity of HCII for
           thrombin is conferred by a highly acidic hirudin-like
           N-terminal tail, which becomes available after GAG
           binding for interaction with the anion-binding exosite I
           of thrombin. This subgroup corresponds to clade D of the
           serpin superfamily.
          Length = 436

 Score = 28.6 bits (64), Expect = 3.2
 Identities = 22/108 (20%), Positives = 46/108 (42%), Gaps = 15/108 (13%)

Query: 125 IGDMSIPAPICQVVTGKQGLYQQINIQKRQMTVREYGAMANK------PRYATPKHFDYE 178
           +G++S+   +   ++G + L +QI     Q+  R   +M N+      P++   K+++  
Sbjct: 289 VGNISMLIVVPHKLSGMKTLEKQIT---PQVVERWQKSMTNRTREVVLPKFKLEKNYNLI 345

Query: 179 DLERKYWKNITYVSPIYGADVSGSITDEDVNVWNINHLGTILDYVNED 226
               +  K +              ++DE + +    H GTI   VNE+
Sbjct: 346 ----ESLKLMGITDLFTEKGNMAGVSDEKIAIDLFKHQGTIT--VNEE 387


>gnl|CDD|220305 pfam09598, Stm1_N, Stm1.  This region is found at the N terminal
          of the Stm1 protein. Stm1 is a G4 quadraplex and purine
          motif triplex nucleic acid-binding protein. It has been
          implicated in many biological processes including
          apoptosis and telomere biosynthesis. Stm1 is known to
          interact with CDC13, and is known to associate with
          ribosomes and nuclear telomere cap complexes.
          Length = 66

 Score = 26.3 bits (58), Expect = 3.7
 Identities = 7/34 (20%), Positives = 11/34 (32%), Gaps = 1/34 (2%)

Query: 7  DFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKV 40
             P+PP K+  K + +       P  P      
Sbjct: 14 SRVPAPPTKEVVKPSTS-SKKADVPGAPPSPAPP 46


>gnl|CDD|211420 cd11579, Glyco_tran_WbsX, Glycosyl hydrolase family 99-like domain
           of WbsX-like glycosyltransferases.  Members of this
           domain family are found in proteins within O-antigen
           biosynthesis clusters in Gram negative bacteria, where
           they may function as glycosyl hydrolases and typically
           co-occur with glycosyltransferase domains. They bear
           resemblance to GH71 and the GH99 family of
           alpha-1,2-mannosidases and may share a similar cataltyic
           site and mechanism. The O-antigens are essential
           lipopolysaccharides in gram-negative bacteria's outer
           membrane and have been linked to pathogenicity.
          Length = 347

 Score = 28.3 bits (64), Expect = 3.8
 Identities = 26/149 (17%), Positives = 47/149 (31%), Gaps = 37/149 (24%)

Query: 59  IKTLLP-VKQER--KHE----IMVFRPTYEEFKDFSKFITYMESQGAHKAG-----LAKV 106
            + LLP  K  R  K +     +++RP  E   D  + +     + A +AG     L  V
Sbjct: 142 FRYLLPYFKDPRYIKIDGKPVFLIYRP--ELIPDPKETLELWR-ELAREAGLPGLYLVAV 198

Query: 107 IPPPEWVPRKSGYSLDSSIGDMSIPAPICQVVTGKQGLYQQINIQKRQMTVREYGAMANK 166
                  PR  G+  D+++                   +    +   +++         K
Sbjct: 199 ASFGIEDPRDLGF--DAAV------------------EFPPHYVVASELSKSRKLLKLKK 238

Query: 167 PRYATPKHFDYEDLERKYWKNITYVSPIY 195
                   +DY+DL +          P Y
Sbjct: 239 GFG--GNVYDYDDLVKYILNRPDPDYPYY 265


>gnl|CDD|222666 pfam14307, Glyco_tran_WbsX, Glycosyltransferase WbsX.  Members of
           this family are found in within O-antigen biosynthesis
           clusters in Gram negative bacteria, where they are
           predicted to function as glycosyltransferases.
          Length = 345

 Score = 27.9 bits (63), Expect = 4.8
 Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 39/152 (25%)

Query: 59  IKTLLPV-KQER--KHE----IMVFRPTYEEFKDFSKFITYMESQGAHKAGLAKV----- 106
            + LLP  K  R  K +     +++RP  E+  D  +FI     + A +AGL  +     
Sbjct: 141 FRYLLPYFKDPRYIKVDGKPVFLIYRP--EDIPDPKEFIDLWR-ELAKEAGLPGLYLVAV 197

Query: 107 ---IPPPEWVPRKSGYSLDSSIGDMSIPAPICQVVTGKQGLYQQINIQKRQMTVREYGAM 163
                     PR  G+       D ++  P      GK+   + + +             
Sbjct: 198 INSFEEEYKDPRDLGF-------DAAVEFPPHYANKGKKLPNKYVRLLN----------- 239

Query: 164 ANKPRYATPKHFDYEDLERKYWKNITYVSPIY 195
               ++     +DY+ +            P Y
Sbjct: 240 ---KKFGKGDVYDYDKVVEYILNRPAPDYPYY 268


>gnl|CDD|136022 PRK06728, PRK06728, aspartate-semialdehyde dehydrogenase;
           Provisional.
          Length = 347

 Score = 27.7 bits (61), Expect = 5.5
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 6/47 (12%)

Query: 41  VKEKAESGVQEEGEVKSTIKTLLPVKQERKHEIMVFR--PTYEEFKD 85
           +KE+A+S +  E EV+STI   LP K+++KH  + F   P  + F D
Sbjct: 172 LKEQAKSILAGE-EVESTI---LPAKKDKKHYPIAFNVLPQVDIFTD 214


>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER).  This
           family consists of several eukaryotic immediate early
           response (IER) 2 and 5 proteins. The role of IER5 is
           unclear although it play an important role in mediating
           the cellular response to mitogenic signals. Again,
           little is known about the function of IER2 although it
           is thought to play a role in mediating the cellular
           responses to a variety of extracellular signals.
          Length = 272

 Score = 27.6 bits (61), Expect = 6.5
 Identities = 16/66 (24%), Positives = 28/66 (42%), Gaps = 10/66 (15%)

Query: 4   SSEDFEPSPPAKKKTKAAATGKG----------AKKKPRTPSKKTKVVKEKAESGVQEEG 53
            +ED +P+PPA  K K  ++G G            K+ R    + +   E  ++   E G
Sbjct: 160 QAEDGQPAPPAVCKRKRRSSGYGSDSGEASGFVPCKRARREDFEPESGGESEDAEEMETG 219

Query: 54  EVKSTI 59
            + + I
Sbjct: 220 NISNLI 225


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 27.4 bits (60), Expect = 6.9
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 2   ASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPS 35
              + D +PSPPA+ K K         +KP+ PS
Sbjct: 181 IPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPS 214


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 27.8 bits (61), Expect = 7.1
 Identities = 17/45 (37%), Positives = 23/45 (51%)

Query: 2    ASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAE 46
            A  +E+ + +  AKKK + A     AKKK     KK    K+KAE
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAE 1339


>gnl|CDD|234007 TIGR02776, NHEJ_ligase_prk, DNA ligase D.  Members of this protein
           family are DNA ligases involved in the repair of DNA
           double-stranded breaks by non-homologous end joining
           (NHEJ). The system of the bacterial Ku protein
           (TIGR02772) plus this DNA ligase is seen in about 20 %
           of bacterial genomes to date and at least one archaeon
           (Archeoglobus fulgidus). This model describes a central
           and a C-terminal domain. These two domains may be
           permuted, as in genus Mycobacterium, or divided into
           tandem ORFs, and therefore not be identified by This
           model. An additional N-terminal 3'-phosphoesterase (PE)
           domain present in some but not all examples of this
           ligase is not included in the seed alignment for this
           model; This model models only the central ATP-dependent
           ligase domain and the C-terminal polymerase domain. Most
           examples of genes for this ligase are adjacent to the
           gene for Ku [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 552

 Score = 27.3 bits (61), Expect = 8.2
 Identities = 8/21 (38%), Positives = 12/21 (57%), Gaps = 1/21 (4%)

Query: 77  RPTYEEFKDFSKFI-TYMESQ 96
             T++E K F+K I  Y+  Q
Sbjct: 447 TFTWDETKLFAKAIAEYLARQ 467


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 26.9 bits (60), Expect = 8.5
 Identities = 12/47 (25%), Positives = 19/47 (40%), Gaps = 3/47 (6%)

Query: 2   ASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESG 48
            +  +       A+KK K   + +  KKK    SK  K +K+    G
Sbjct: 236 ETEEDVEAAESRAEKKRK---SKEEIKKKKPKESKGVKALKKVVAKG 279


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 27.5 bits (61), Expect = 8.6
 Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 1/37 (2%)

Query: 10  PSPPAKKKTKAAATGKGAK-KKPRTPSKKTKVVKEKA 45
           P  P +KKTKA       K K  +T  KK+K   +K 
Sbjct: 743 PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKT 779


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 27.0 bits (60), Expect = 9.2
 Identities = 7/24 (29%), Positives = 9/24 (37%)

Query: 16  KKTKAAATGKGAKKKPRTPSKKTK 39
              K  A    A K  +T +KK  
Sbjct: 260 GPGKKPAKKATAAKAKKTTAKKAA 283


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 26.0 bits (57), Expect = 9.9
 Identities = 14/37 (37%), Positives = 17/37 (45%), Gaps = 2/37 (5%)

Query: 1   GASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKK 37
             S  +D  P P  K K K   + K   KKP+  SKK
Sbjct: 85  EPSHPDDHPPEPTEKPKKKKKKSKK--TKKPKKSSKK 119


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.130    0.378 

Gapped
Lambda     K      H
   0.267   0.0754    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,806,106
Number of extensions: 1090992
Number of successful extensions: 1263
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1169
Number of HSP's successfully gapped: 98
Length of query: 232
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 138
Effective length of database: 6,768,326
Effective search space: 934028988
Effective search space used: 934028988
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.7 bits)