RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8473
         (307 letters)



>gnl|CDD|213296 cd05930, A_NRPS, The adenylation domain of nonribosomal peptide
           synthetases (NRPS).  The adenylation (A) domain of NRPS
           recognizes a specific amino acid or hydroxy acid and
           activates it as an (amino) acyl adenylate by hydrolysis
           of ATP. The activated acyl moiety then forms a thioester
           bond to the enzyme-bound cofactor phosphopantetheine of
           a peptidyl carrier protein domain. NRPSs are large
           multifunctional enzymes which synthesize many
           therapeutically useful peptides in bacteria and fungi
           via a template-directed, nucleic acid independent
           nonribosomal mechanism. These natural products include
           antibiotics, immunosuppressants, plant and animal
           toxins, and enzyme inhibitors. NRPS has a distinct
           modular structure in which each module is responsible
           for the recognition, activation, and in some cases,
           modification of a single amino acid residue of the final
           peptide product. The modules can be subdivided into
           domains that catalyze specific biochemical reactions.
          Length = 445

 Score = 81.7 bits (203), Expect = 2e-17
 Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 11/84 (13%)

Query: 153 RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKPG 211
           R+Y+TGD  R + D  L + GR D QVKIRG R++L E+EAAL     V +  V+  + G
Sbjct: 327 RLYRTGDLARWLPDGNLEFLGRADDQVKIRGYRIELGEIEAALLAHPGVREAVVVAREDG 386

Query: 212 EVDQV----------SEVTSEEIR 225
             ++           +E+ + E+R
Sbjct: 387 AGEKRLVAYVVPAAGAELDAAELR 410



 Score = 59.8 bits (146), Expect = 4e-10
 Identities = 21/51 (41%), Positives = 30/51 (58%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQ 278
           D  L + GR D QVKIRG R++L E+EAAL     V +  V+  + G  ++
Sbjct: 340 DGNLEFLGRADDQVKIRGYRIELGEIEAALLAHPGVREAVVVAREDGAGEK 390



 Score = 41.3 bits (98), Expect = 4e-04
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
             +D+A V+YTSGSTG PK
Sbjct: 91  DPDDLAYVIYTSGSTGRPK 109


>gnl|CDD|233550 TIGR01733, AA-adenyl-dom, amino acid adenylation domain.  This
           model represents a domain responsible for the specific
           recognition of amino acids and activation as adenylyl
           amino acids. The reaction catalyzed is aa + ATP ->
           aa-AMP + PPi. These domains are usually found as
           components of multi-domain non-ribosomal peptide
           synthetases and are usually called "A-domains" in that
           context (for a review, see ). A-domains are almost
           invariably followed by "T-domains" (thiolation domains,
           pfam00550) to which the amino acid adenylate is
           transferred as a thiol-ester to a bound pantetheine
           cofactor with the release of AMP (these are also called
           peptide carrier proteins, or PCPs. When the A-domain
           does not represent the first module (corresponding to
           the first amino acid in the product molecule) it is
           usually preceded by a "C-domain" (condensation domain,
           pfam00668) which catalyzes the ligation of two amino
           acid thiol-esters from neighboring modules. This domain
           is a subset of the AMP-binding domain found in Pfam
           (pfam00501) which also hits substrate--CoA ligases and
           luciferases. Sequences scoring in between trusted and
           noise for this model may be ambiguous as to whether they
           activate amino acids or other molecules lacking an alpha
           amino group.
          Length = 409

 Score = 77.3 bits (191), Expect = 6e-16
 Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 1/51 (1%)

Query: 152 RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
            R+Y+TGD  R + D  L + GR D QVKIRG R++L E+EAAL +   V 
Sbjct: 355 ARLYRTGDLVRYLPDGNLEFLGRIDDQVKIRGYRIELGEIEAALLRHPGVR 405



 Score = 57.3 bits (139), Expect = 3e-09
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 231 LLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
           L + GR D QVKIRG R++L E+EAAL +   V 
Sbjct: 372 LEFLGRIDDQVKIRGYRIELGEIEAALLRHPGVR 405



 Score = 39.9 bits (94), Expect = 0.001
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +D+A V+YTSGSTG PK
Sbjct: 119 PDDLAYVIYTSGSTGRPK 136


>gnl|CDD|213325 cd12117, A_NRPS_Srf_like, The adenylation domain of nonribosomal
           peptide synthetases (NRPS), including Bacillus subtilis
           termination module Surfactin (SrfA-C).  The adenylation
           (A) domain of NRPS recognizes a specific amino acid or
           hydroxy acid and activates it as an (amino) acyl
           adenylate by hydrolysis of ATP. The activated acyl
           moiety then forms a thioester to the enzyme-bound
           cofactor phosphopantetheine of a peptidyl carrier
           protein domain. NRPSs are large multifunctional enzymes
           which synthesize many therapeutically useful peptides in
           bacteria and fungi via a template-directed, nucleic acid
           independent nonribosomal mechanism. These natural
           products include antibiotics, immunosuppressants, plant
           and animal toxins, and enzyme inhibitors. NRPS has a
           distinct modular structure in which each module is
           responsible for the recognition, activation, and, in
           some cases, modification of a single amino acid residue
           of the final peptide product. The modules can be
           subdivided into domains that catalyze specific
           biochemical reactions. This family includes the
           adenylation domain of the Bacillus subtilis termination
           module (Surfactin domain, SrfA-C) which recognizes a
           specific amino acid building block, which is then
           activated and transferred to the terminal thiol of the
           4'-phosphopantetheine (Ppan) arm of the downstream
           peptidyl carrier protein (PCP) domain.
          Length = 474

 Score = 74.1 bits (183), Expect = 7e-15
 Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 9/84 (10%)

Query: 152 RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKP 210
            R+Y+TGD  R   D  + + GR D QVKIRG R++L E+E AL +   V + +VL  + 
Sbjct: 357 ERLYRTGDLARWRPDGNIEFLGRIDRQVKIRGFRIELGEIEQALLQHPGVREAAVLVRED 416

Query: 211 GEVDQ--------VSEVTSEEIRG 226
              D+           + + E+R 
Sbjct: 417 AAGDKRLVAYVVARGALDAAELRA 440



 Score = 55.2 bits (134), Expect = 1e-08
 Identities = 20/51 (39%), Positives = 30/51 (58%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQ 278
           D  + + GR D QVKIRG R++L E+E AL +   V + +VL  +    D+
Sbjct: 371 DGNIEFLGRIDRQVKIRGFRIELGEIEQALLQHPGVREAAVLVREDAAGDK 421



 Score = 35.2 bits (82), Expect = 0.036
 Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 5/58 (8%)

Query: 68  RTVLSLPQLR-LLSASLSHNNIPGESMLHADNNNEQS----NDIAIVLYTSGSTGIPK 120
           R +L+   L  L  A      I  E +   D          +D+A V+YTSGSTG PK
Sbjct: 86  RVLLTDESLAPLARADQLPLVILEEELDAEDAGPPAPAVDADDLAYVMYTSGSTGRPK 143


>gnl|CDD|213285 cd05918, A_NRPS_SidN3_like, The adenylation (A) domain of
           siderophore-synthesizing nonribosomal peptide
           synthetases (NRPS).  The adenylation (A) domain of NRPS
           recognizes a specific amino acid or hydroxy acid and
           activates it as an (amino) acyl adenylate by hydrolysis
           of ATP. The activated acyl moiety then forms a thioester
           to the enzyme-bound cofactor phosphopantetheine of a
           peptidyl carrier protein domain. This family of
           siderophore-synthesizing NRPS includes the third
           adenylation domain of SidN from the endophytic fungus
           Neotyphodium lolii, ferrichrome siderophore synthetase,
           HC-toxin synthetase, and enniatin synthase. NRPSs are
           large multifunctional enzymes which synthesize many
           therapeutically useful peptides. These natural products
           include antibiotics, immunosuppressants, plant and
           animal toxins, and enzyme inhibitors. NRPS has a
           distinct modular structure in which each module is
           responsible for the recognition, activation, and in some
           cases, modification of a single amino acid residue of
           the final peptide product. The modules can be subdivided
           into domains that catalyze specific biochemical
           reactions.
          Length = 447

 Score = 73.4 bits (181), Expect = 1e-14
 Identities = 26/55 (47%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 152 RRMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
            R+Y+TGD  R   D  L + GR D QVKIRG R++L E+EA L  L  V   +V
Sbjct: 322 GRIYRTGDLVRYLEDGSLEFLGRKDDQVKIRGQRIELGEIEAVLRALPGVVVAAV 376



 Score = 54.1 bits (131), Expect = 3e-08
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSV 268
           D  L + GR D QVKIRG R++L E+EA L  +  V   +V
Sbjct: 336 DGSLEFLGRKDDQVKIRGQRIELGEIEAVLRALPGVVVAAV 376



 Score = 38.7 bits (91), Expect = 0.003
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
             +D A V+YTSGSTG PK
Sbjct: 97  SPDDPAYVIYTSGSTGKPK 115


>gnl|CDD|213322 cd12114, A_NRPS_TlmIV_like, The adenylation domain of nonribosomal
           peptide synthetases (NRPS), including Streptoalloteichus
           tallysomycin biosynthesis genes.  The adenylation (A)
           domain of NRPS recognizes a specific amino acid or
           hydroxy acid and activates it as an (amino) acyl
           adenylate by hydrolysis of ATP. The activated acyl
           moiety then forms a thioester to the enzyme-bound
           cofactor phosphopantetheine of a peptidyl carrier
           protein domain. NRPSs are large multifunctional enzymes
           which synthesize many therapeutically useful peptides in
           bacteria and fungi via a template-directed, nucleic acid
           independent nonribosomal mechanism. These natural
           products include antibiotics, immunosuppressants, plant
           and animal toxins, and enzyme inhibitors. NRPS has a
           distinct modular structure in which each module is
           responsible for the recognition, activation, and in some
           cases, modification of a single amino acid residue of
           the final peptide product. The modules can be subdivided
           into domains that catalyze specific biochemical
           reactions. This family includes the TLM biosynthetic
           gene cluster from Streptoalloteichus that consists of
           nine NRPS genes; the N-terminal module of TlmVI (NRPS-5)
           and the starter module of BlmVI (NRPS-5) are comprised
           of the acyl CoA ligase (AL) and acyl carrier protein
           (ACP)-like domains, which are thought to be involved in
           the biosynthesis of the beta-aminoalaninamide moiety.
          Length = 476

 Score = 72.2 bits (178), Expect = 3e-14
 Identities = 27/56 (48%), Positives = 37/56 (66%), Gaps = 1/56 (1%)

Query: 152 RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
            R+Y+TGD GR   D  + + GR D QVKIRG RV+L E+EAAL++   V +  V+
Sbjct: 358 ERLYRTGDLGRYRPDGTIEFLGRADHQVKIRGYRVELGEIEAALARHPGVQRAVVV 413



 Score = 51.8 bits (125), Expect = 2e-07
 Identities = 19/35 (54%), Positives = 25/35 (71%)

Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
           GR D QVKIRG RV+L E+EAAL++   V +  V+
Sbjct: 379 GRADHQVKIRGYRVELGEIEAALARHPGVQRAVVV 413



 Score = 38.0 bits (89), Expect = 0.006
 Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 4/57 (7%)

Query: 68  RTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSN----DIAIVLYTSGSTGIPK 120
           R VL+ P L     +     +  ++                D+A V++TSGSTG PK
Sbjct: 86  RAVLTDPGLAQPEEAPDLLVVADDAAAAESPAPPPPRVDPDDLAYVIFTSGSTGEPK 142


>gnl|CDD|213310 cd05945, DltA, D-alanine:D-alanyl carrier protein ligase (DltA).
           DltA belongs to the class I AMP-forming adenylation
           domain superfamily, which also includes acetyl-CoA
           synthetase, luciferase, and the adenylation domains of
           non-ribosomal synthetases. It catalyzes the two-step
           activation reaction of D-alanine: the formation of a
           substrate-AMP molecule as an intermediate, and then the
           transfer of the amino acid adenylate to teichoic acid in
           the biosynthesis of lipoteichoic acid (LTA) and wall
           teichoic acid (WTA) in gram-positive bacteria.
          Length = 447

 Score = 72.2 bits (178), Expect = 4e-14
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 152 RRMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLC-YK 209
           +R Y+TGD   + DD LL Y GR D Q+K+ G R++L E+EAAL  L  V++  VL   K
Sbjct: 329 QRWYRTGDLVYLEDDGLLVYLGRKDFQIKLHGYRIELEEIEAALRALPGVEEAVVLPVPK 388

Query: 210 PGEVDQ-VSEVTSEEIRGVDDILL 232
             +V + V+ V  +E       L 
Sbjct: 389 GEKVVRLVAFVVGKEGALDTKALK 412



 Score = 56.8 bits (138), Expect = 3e-09
 Identities = 21/53 (39%), Positives = 34/53 (64%), Gaps = 1/53 (1%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLC-YKPGEVDQI 279
           D +L+Y GR D Q+K+ G R++L E+EAAL  +  V++  VL   K  +V ++
Sbjct: 343 DGLLVYLGRKDFQIKLHGYRIELEEIEAALRALPGVEEAVVLPVPKGEKVVRL 395



 Score = 37.6 bits (88), Expect = 0.007
 Identities = 11/17 (64%), Positives = 15/17 (88%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D+A +L+TSGSTG PK
Sbjct: 97  DDLAYILFTSGSTGKPK 113


>gnl|CDD|213324 cd12116, A_NRPS_Ta1_like, The adenylation domain of nonribosomal
           peptide synthetases (NRPS), including salinosporamide A
           polyketide synthase.  The adenylation (A) domain of NRPS
           recognizes a specific amino acid or hydroxy acid and
           activates it as an (amino) acyl adenylate by hydrolysis
           of ATP. The activated acyl moiety then forms a thioester
           to the enzyme-bound cofactor phosphopantetheine of a
           peptidyl carrier protein domain. NRPSs are large
           multifunctional enzymes which synthesize many
           therapeutically useful peptides in bacteria and fungi
           via a template-directed, nucleic acid independent
           nonribosomal mechanism. These natural products include
           antibiotics, immunosuppressants, plant and animal
           toxins, and enzyme inhibitors. NRPS has a distinct
           modular structure in which each module is responsible
           for the recognition, activation, and in some cases,
           modification of a single amino acid residue of the final
           peptide product. The modules can be subdivided into
           domains that catalyze specific biochemical reactions.
           This family includes the myxovirescin (TA) antibiotic
           biosynthetic gene in Myxococcus xanthus; TA production
           plays a role in predation. It also includes the
           salinosporamide A polyketide synthase which is involved
           in the biosynthesis of salinosporamide A, a marine
           microbial metabolite whose chlorine atom is crucial for
           potent proteasome inhibition and anticancer activity.
          Length = 438

 Score = 71.1 bits (175), Expect = 8e-14
 Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 153 RMYKTGDYGR-IVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV-DKLSVLC-YK 209
           R+Y+TGD  R   D  L Y GR D QVKIRG R++L E+EAAL     V D + V+    
Sbjct: 320 RLYRTGDLVRWRPDGRLEYLGRADDQVKIRGFRIELGEIEAALRAHPGVADAVVVVREEG 379

Query: 210 PGEVDQVSEVTSEEIRGVD 228
           PG+   V+ V  +     D
Sbjct: 380 PGDQRLVAYVVPKAGAAPD 398



 Score = 55.7 bits (135), Expect = 1e-08
 Identities = 25/69 (36%), Positives = 34/69 (49%), Gaps = 5/69 (7%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQ-----IWEA 282
           D  L Y GR D QVKIRG R++L E+EAAL     V    V+  + G  DQ     +   
Sbjct: 333 DGRLEYLGRADDQVKIRGFRIELGEIEAALRAHPGVADAVVVVREEGPGDQRLVAYVVPK 392

Query: 283 PSSAGSTDS 291
             +A   ++
Sbjct: 393 AGAAPDAEA 401



 Score = 36.4 bits (85), Expect = 0.016
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +D+A V+YTSGSTG PK
Sbjct: 92  PDDLAYVIYTSGSTGKPK 109


>gnl|CDD|213323 cd12115, A_NRPS_Sfm_like, The adenylation domain of nonribosomal
           peptide synthetases (NRPS), including Saframycin A gene
           cluster from Streptomyces lavendulae.  The adenylation
           (A) domain of NRPS recognizes a specific amino acid or
           hydroxy acid and activates it as an (amino) acyl
           adenylate by hydrolysis of ATP. The activated acyl
           moiety then forms a thioester to the enzyme-bound
           cofactor phosphopantetheine of a peptidyl carrier
           protein domain. NRPSs are large multifunctional enzymes
           which synthesize many therapeutically useful peptides in
           bacteria and fungi via a template-directed, nucleic acid
           independent nonribosomal mechanism. These natural
           products include antibiotics, immunosuppressants, plant
           and animal toxins, and enzyme inhibitors. NRPS has a
           distinct modular structure in which each module is
           responsible for the recognition, activation, and in some
           cases, modification of a single amino acid residue of
           the final peptide product. The modules can be subdivided
           into domains that catalyze specific biochemical
           reactions. This family includes the saframycin A gene
           cluster from Streptomyces lavendulae which implicates
           the NRPS system for assembling the unusual tetrapeptidyl
           skeleton in an iterative manner. It also includes
           saframycin Mx1 produced by Myxococcus xanthus NRPS.
          Length = 449

 Score = 68.1 bits (167), Expect = 7e-13
 Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 152 RRMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDK 202
            R+Y+TGD  R   D  L Y GR D QVK+RG R++L E+EAAL  +  V +
Sbjct: 330 ERLYRTGDLVRRRADGQLEYLGRIDDQVKVRGFRIELGEIEAALRSIPGVAE 381



 Score = 50.7 bits (122), Expect = 3e-07
 Identities = 20/38 (52%), Positives = 25/38 (65%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDK 265
           D  L Y GR D QVK+RG R++L E+EAAL  I  V +
Sbjct: 344 DGQLEYLGRIDDQVKVRGFRIELGEIEAALRSIPGVAE 381



 Score = 36.9 bits (86), Expect = 0.010
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
             +D+A V+YTSGSTG PK
Sbjct: 103 DPDDLAYVIYTSGSTGRPK 121


>gnl|CDD|223951 COG1020, EntF, Non-ribosomal peptide synthetase modules and related
           proteins [Secondary metabolites biosynthesis, transport,
           and catabolism].
          Length = 642

 Score = 65.4 bits (159), Expect = 9e-12
 Identities = 26/47 (55%), Positives = 34/47 (72%), Gaps = 1/47 (2%)

Query: 152 RRMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKL 197
            R+Y+TGD  R + D  L Y GR DSQVKIRG R++L E+EAAL++ 
Sbjct: 595 LRLYRTGDLARPLADGALEYLGRKDSQVKIRGFRIELGEIEAALAEQ 641



 Score = 47.6 bits (113), Expect = 4e-06
 Identities = 20/33 (60%), Positives = 25/33 (75%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKI 260
           D  L Y GR DSQVKIRG R++L E+EAAL++ 
Sbjct: 609 DGALEYLGRKDSQVKIRGFRIELGEIEAALAEQ 641



 Score = 36.9 bits (85), Expect = 0.013
 Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 6/51 (11%)

Query: 70  VLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
            + LP L L  A      IP    +         + +A ++YTSGSTG PK
Sbjct: 338 DVGLPGLALDDALS---EIPDTDPIPQA---LLGDALAYIIYTSGSTGQPK 382


>gnl|CDD|237108 PRK12467, PRK12467, peptide synthase; Provisional.
          Length = 3956

 Score = 65.2 bits (159), Expect = 1e-11
 Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 153 RMYKTGDYGR-IVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
           R+Y+TGD  R   D ++ Y GR D QVKIRG R++L E+EA L     V +  VL
Sbjct: 890 RLYRTGDLARYRADGVIEYLGRMDHQVKIRGFRIELGEIEARLLAQPGVREAVVL 944



 Score = 56.7 bits (137), Expect = 7e-09
 Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 153  RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
            R+Y+TGD  R   D ++ Y GR D QVKIRG R++L E+EA L +   V +  VL
Sbjct: 3471 RLYRTGDLARYRADGVIEYLGRIDHQVKIRGFRIELGEIEARLLQHPSVREAVVL 3525



 Score = 54.4 bits (131), Expect = 4e-08
 Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 153  RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
            R+Y+TGD  R   D ++ Y GR D QVKIRG R++L E+EA L +   V +  V+
Sbjct: 1954 RLYRTGDLARYRADGVIEYLGRIDHQVKIRGFRIELGEIEARLREQGGVREAVVI 2008



 Score = 50.2 bits (120), Expect = 8e-07
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 227 VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQI 279
            D ++ Y GR D QVKIRG R++L E+EA L     V + +V+  +PG+    
Sbjct: 902 ADGVIEYLGRMDHQVKIRGFRIELGEIEARLLAQPGVRE-AVVLAQPGDAGLQ 953



 Score = 43.6 bits (103), Expect = 1e-04
 Identities = 19/42 (45%), Positives = 27/42 (64%)

Query: 228  DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
            D ++ Y GR D QVKIRG R++L E+EA L +   V +  V+
Sbjct: 1967 DGVIEYLGRIDHQVKIRGFRIELGEIEARLREQGGVREAVVI 2008



 Score = 43.6 bits (103), Expect = 1e-04
 Identities = 20/42 (47%), Positives = 27/42 (64%)

Query: 228  DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
            D ++ Y GR D QVKIRG R++L E+EA L +   V +  VL
Sbjct: 3484 DGVIEYLGRIDHQVKIRGFRIELGEIEARLLQHPSVREAVVL 3525



 Score = 32.4 bits (74), Expect = 0.32
 Identities = 15/31 (48%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 90   GESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
            G S  +   N    N +A V+YTSGSTG PK
Sbjct: 1705 GYSDSNPAVNLAPQN-LAYVIYTSGSTGRPK 1734



 Score = 32.4 bits (74), Expect = 0.38
 Identities = 23/58 (39%), Positives = 30/58 (51%), Gaps = 11/58 (18%)

Query: 63   PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
            PA  G T L+L +L L     S NN    + +  +N       +A V+YTSGSTG PK
Sbjct: 3207 PAPAGDTALTLDRLDL--NGYSENNP--STRVMGEN-------LAYVIYTSGSTGKPK 3253



 Score = 31.3 bits (71), Expect = 0.86
 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 2/55 (3%)

Query: 67  GRTVLSLPQ-LRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
               L +P  LR L      + + G S  + +   +  N +A V+YTSGSTG PK
Sbjct: 619 LLAQLPVPAGLRSLCLDEPADLLCGYSGHNPEVALDPDN-LAYVIYTSGSTGQPK 672


>gnl|CDD|236668 PRK10252, entF, enterobactin synthase subunit F; Provisional.
          Length = 1296

 Score = 63.5 bits (155), Expect = 4e-11
 Identities = 24/50 (48%), Positives = 36/50 (72%), Gaps = 1/50 (2%)

Query: 153 RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
           RMY+TGD  R +DD  + Y GR+D Q+KIRG R++L E++ A+  L +V+
Sbjct: 837 RMYRTGDVARWLDDGAVEYLGRSDDQLKIRGQRIELGEIDRAMQALPDVE 886



 Score = 47.0 bits (112), Expect = 1e-05
 Identities = 14/32 (43%), Positives = 24/32 (75%)

Query: 233 YEGRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
           Y GR+D Q+KIRG R++L E++ A+  + +V+
Sbjct: 855 YLGRSDDQLKIRGQRIELGEIDRAMQALPDVE 886



 Score = 32.7 bits (75), Expect = 0.26
 Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 20/65 (30%)

Query: 105 DIAIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWSSGDRAIKYRRMYKTGDYGRIV 164
             A +++TSGSTG PK           G +V         G  AI  R ++    Y    
Sbjct: 599 HTAYIIFTSGSTGRPK-----------GVMV---------GQTAIVNRLLWMQNHYPLTA 638

Query: 165 DDILL 169
           DD++L
Sbjct: 639 DDVVL 643


>gnl|CDD|215954 pfam00501, AMP-binding, AMP-binding enzyme. 
          Length = 412

 Score = 62.7 bits (153), Expect = 5e-11
 Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
           Y+TGD GR  +D  L   GR D QVKIRG R++  E+EA L +   V
Sbjct: 361 YRTGDLGRWDEDGYLEILGRKDDQVKIRGERIEPGEIEAVLLEHPGV 407



 Score = 49.3 bits (118), Expect = 1e-06
 Identities = 17/36 (47%), Positives = 21/36 (58%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEV 263
           D  L   GR D QVKIRG R++  E+EA L +   V
Sbjct: 372 DGYLEILGRKDDQVKIRGERIEPGEIEAVLLEHPGV 407



 Score = 40.8 bits (96), Expect = 6e-04
 Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
             +D+A ++YTSG+TG PK
Sbjct: 129 DPDDLAYIIYTSGTTGKPK 147


>gnl|CDD|235313 PRK04813, PRK04813, D-alanine--poly(phosphoribitol) ligase subunit
           1; Provisional.
          Length = 503

 Score = 59.5 bits (145), Expect = 5e-10
 Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 153 RMYKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKP-G 211
             Y TGD G + D +L Y+GR D Q+K+ G R++L E+E  L + S V+   V+ Y    
Sbjct: 376 PAYHTGDAGYLEDGLLFYQGRIDFQIKLNGYRIELEEIEQNLRQSSYVESAVVVPYNKDH 435

Query: 212 EVDQ 215
           +V  
Sbjct: 436 KVQY 439



 Score = 45.3 bits (108), Expect = 2e-05
 Identities = 19/53 (35%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKP-GEVDQI 279
           D +L Y+GR D Q+K+ G R++L E+E  L + S V+   V+ Y    +V  +
Sbjct: 388 DGLLFYQGRIDFQIKLNGYRIELEEIEQNLRQSSYVESAVVVPYNKDHKVQYL 440



 Score = 29.1 bits (66), Expect = 3.4
 Identities = 8/19 (42%), Positives = 14/19 (73%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
           + +D   +++TSG+TG PK
Sbjct: 141 KGDDNYYIIFTSGTTGKPK 159


>gnl|CDD|233551 TIGR01734, D-ala-DACP-lig, D-alanine--poly(phosphoribitol) ligase,
           subunit 1.  This model represents the enzyme (also
           called D-alanine-D-alanyl carrier protein ligase) which
           activates D-alanine as an adenylate via the reaction
           D-ala + ATP -> D-ala-AMP + PPi, and further catalyzes
           the condensation of the amino acid adenylate with the
           D-alanyl carrier protein (D-ala-ACP). The D-alanine is
           then further transferred to teichoic acid in the
           biosynthesis of lipoteichoic acid (LTA) and wall
           teichoic acid (WTA) in gram positive bacteria, both
           polysacchatides [Cell envelope, Biosynthesis and
           degradation of murein sacculus and peptidoglycan].
          Length = 502

 Score = 59.0 bits (143), Expect = 8e-10
 Identities = 21/54 (38%), Positives = 33/54 (61%)

Query: 153 RMYKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
             Y+TGD G I D  L Y+GR D Q+K+ G R++L ++E  L + S ++   V+
Sbjct: 374 PAYRTGDAGTITDGQLFYQGRLDFQIKLHGYRIELEDIEFNLRQSSYIESAVVV 427



 Score = 45.5 bits (108), Expect = 2e-05
 Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 13/80 (16%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL--CYKPGEVDQIWEAPSS 285
           D  L Y+GR D Q+K+ G R++L ++E  L + S ++   V+    K  +V+ +      
Sbjct: 386 DGQLFYQGRLDFQIKLHGYRIELEDIEFNLRQSSYIESAVVVPKYNKDHKVEYL------ 439

Query: 286 AGSTDSSFIVKSREDRAEEE 305
                 + IV   ED  +E 
Sbjct: 440 -----IAAIVPETEDFEKEF 454



 Score = 29.0 bits (65), Expect = 3.7
 Identities = 14/51 (27%), Positives = 23/51 (45%), Gaps = 9/51 (17%)

Query: 70  VLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           +++L  L     S     +  +  +  D+N         ++YTSGSTG PK
Sbjct: 116 IITLSALEQAETS--GGPVSFDHAVKGDDNY-------YIIYTSGSTGNPK 157


>gnl|CDD|235564 PRK05691, PRK05691, peptide synthase; Validated.
          Length = 4334

 Score = 59.0 bits (143), Expect = 1e-09
 Identities = 27/55 (49%), Positives = 37/55 (67%), Gaps = 1/55 (1%)

Query: 153  RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
            R+Y+TGD  R   D +L Y GR D QVKIRG R++L E+EA L + +EV + +V 
Sbjct: 4102 RLYRTGDLARRRSDGVLEYVGRIDHQVKIRGYRIELGEIEARLHEQAEVREAAVA 4156



 Score = 56.7 bits (137), Expect = 8e-09
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 153  RMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
            R+Y+TGD  R+  D L+ Y GR D QVKIRG R++L E+E+ L +   V +  VL
Sbjct: 2568 RLYRTGDLVRLRADGLVEYVGRIDHQVKIRGFRIELGEIESRLLEHPAVREAVVL 2622



 Score = 52.1 bits (125), Expect = 2e-07
 Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 1/58 (1%)

Query: 153  RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYK 209
            R+Y+TGD  R   D  L Y GR D QVK+RG RV+  E++A L     V + +VL  +
Sbjct: 1505 RLYRTGDRARWNADGALEYLGRLDQQVKLRGFRVEPEEIQARLLAQPGVAQAAVLVRE 1562



 Score = 49.4 bits (118), Expect = 2e-06
 Identities = 22/45 (48%), Positives = 30/45 (66%)

Query: 225  RGVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
            R  D +L Y GR D QVKIRG R++L E+EA L + +EV + +V 
Sbjct: 4112 RRSDGVLEYVGRIDHQVKIRGYRIELGEIEARLHEQAEVREAAVA 4156



 Score = 43.6 bits (103), Expect = 1e-04
 Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 220  TSEEIRGVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
            T + +R   D L+ Y GR D QVKIRG R++L E+E+ L +   V +  VL
Sbjct: 2572 TGDLVRLRADGLVEYVGRIDHQVKIRGFRIELGEIESRLLEHPAVREAVVL 2622



 Score = 41.3 bits (97), Expect = 6e-04
 Identities = 19/46 (41%), Positives = 26/46 (56%)

Query: 227  VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYK 272
             D  L Y GR D QVK+RG RV+  E++A L     V + +VL  +
Sbjct: 1517 ADGALEYLGRLDQQVKLRGFRVEPEEIQARLLAQPGVAQAAVLVRE 1562



 Score = 36.3 bits (84), Expect = 0.020
 Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 11/58 (18%)

Query: 63   PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
            P  +G + ++L  L L S        PG   LH DN       +A V+YTSGSTG PK
Sbjct: 1243 PQAEGVSAIALDSLHLDSWPSQ---APGLH-LHGDN-------LAYVIYTSGSTGQPK 1289



 Score = 34.4 bits (79), Expect = 0.082
 Identities = 22/61 (36%), Positives = 31/61 (50%), Gaps = 19/61 (31%)

Query: 74   PQLRLL-----SASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK----EYRR 124
            P+L +          SHN  PG   +++  +N     +A V+YTSGSTG+PK    E R 
Sbjct: 3844 PRLLVWEEVQAGEVASHN--PG---IYSGPDN-----LAYVIYTSGSTGLPKGVMVEQRG 3893

Query: 125  M 125
            M
Sbjct: 3894 M 3894



 Score = 32.1 bits (73), Expect = 0.43
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
           Q +DIA + YTSGST +PK
Sbjct: 164 QPDDIAFLQYTSGSTALPK 182



 Score = 28.2 bits (63), Expect = 7.6
 Identities = 10/14 (71%), Positives = 12/14 (85%)

Query: 107  AIVLYTSGSTGIPK 120
            A ++YTSGSTG PK
Sbjct: 2336 AYLIYTSGSTGKPK 2349


>gnl|CDD|237054 PRK12316, PRK12316, peptide synthase; Provisional.
          Length = 5163

 Score = 57.7 bits (139), Expect = 3e-09
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 153 RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
           RMY+TGD  R   D ++ Y GR D QVK+RG R++L E+EA L +   V + +VL
Sbjct: 886 RMYRTGDLARYRADGVIEYAGRIDHQVKLRGLRIELGEIEARLLEHPWVREAAVL 940



 Score = 56.1 bits (135), Expect = 1e-08
 Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 153  RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
            R+Y+TGD  R   D ++ Y GR D QVKIRG R++L E+EA L     V +  V+
Sbjct: 2381 RLYRTGDLARYRADGVVEYLGRIDHQVKIRGFRIELGEIEARLQAHPAVREAVVV 2435



 Score = 53.0 bits (127), Expect = 9e-08
 Identities = 23/43 (53%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 153  RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
            R+Y+TGD  R   D ++ Y GR D QVKIRG R++L E+EA L
Sbjct: 4929 RLYRTGDLARYRADGVIDYLGRVDHQVKIRGFRIELGEIEARL 4971



 Score = 51.9 bits (124), Expect = 3e-07
 Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 153  RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
            R+Y+TGD  R   D ++ Y GR D QVKIRG R++L E+EA L +   V +  VL
Sbjct: 3424 RLYRTGDLARYRADGVIEYIGRVDHQVKIRGFRIELGEIEARLLEHPWVREAVVL 3478



 Score = 44.6 bits (105), Expect = 6e-05
 Identities = 19/43 (44%), Positives = 26/43 (60%)

Query: 227  VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
             D ++ Y GR D QVKIRG R++L E+EA L     V +  V+
Sbjct: 2393 ADGVVEYLGRIDHQVKIRGFRIELGEIEARLQAHPAVREAVVV 2435



 Score = 44.6 bits (105), Expect = 6e-05
 Identities = 19/44 (43%), Positives = 28/44 (63%)

Query: 226 GVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
             D ++ Y GR D QVK+RG R++L E+EA L +   V + +VL
Sbjct: 897 RADGVIEYAGRIDHQVKLRGLRIELGEIEARLLEHPWVREAAVL 940



 Score = 44.2 bits (104), Expect = 7e-05
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 225  RGVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 257
               D ++ Y GR D QVKIRG R++L E+EA L
Sbjct: 4939 YRADGVIDYLGRVDHQVKIRGFRIELGEIEARL 4971



 Score = 40.3 bits (94), Expect = 0.001
 Identities = 20/42 (47%), Positives = 27/42 (64%)

Query: 228  DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
            D ++ Y GR D QVKIRG R++L E+EA L +   V +  VL
Sbjct: 3437 DGVIEYIGRVDHQVKIRGFRIELGEIEARLLEHPWVREAVVL 3478



 Score = 34.9 bits (80), Expect = 0.066
 Identities = 21/58 (36%), Positives = 27/58 (46%), Gaps = 10/58 (17%)

Query: 63   PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
            P   G   L+L +        +H+  P    LH DN       +A V+YTSGSTG PK
Sbjct: 4663 PIPDGLASLALDRDEDWEGFPAHD--P-AVRLHPDN-------LAYVIYTSGSTGRPK 4710



 Score = 33.8 bits (77), Expect = 0.13
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 3/29 (10%)

Query: 95  HADNNNEQS---NDIAIVLYTSGSTGIPK 120
           +++ N        ++A V+YTSGSTG PK
Sbjct: 643 YSEENPGTELNPENLAYVIYTSGSTGKPK 671



 Score = 33.4 bits (76), Expect = 0.16
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 95   HADNNNEQSNDIAIVLYTSGSTGIPK 120
             A        ++A V+YTSGSTG+PK
Sbjct: 2137 TAPAVQLAGENLAYVIYTSGSTGLPK 2162



 Score = 30.3 bits (68), Expect = 1.6
 Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 4/57 (7%)

Query: 68   RTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSN----DIAIVLYTSGSTGIPK 120
            + +LS   LRL  A             +    N        ++A V+YTSGSTG PK
Sbjct: 3156 QLLLSQSHLRLPLAQGVQVLDLDRGDENYAEANPAIRTMPENLAYVIYTSGSTGKPK 3212


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 47.6 bits (114), Expect = 4e-06
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           AP E++  K  K    K+ +KK+K+K+K K KK+ R
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417



 Score = 47.2 bits (113), Expect = 5e-06
 Identities = 12/35 (34%), Positives = 22/35 (62%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           P ++  AK+ + K+K+K+K K KK+ +  K   +R
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 47.2 bits (113), Expect = 5e-06
 Identities = 15/44 (34%), Positives = 23/44 (52%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
           P+     K  +KK K+K+K K KK+ +  K   K+R+    S E
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 42.2 bits (100), Expect = 2e-04
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           P+  +  K  KK   K+ +KK+K+K+K K KK+ +
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417



 Score = 41.9 bits (99), Expect = 3e-04
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           P  +  ++KK  K  KK   K+ +KK+K+K+K +
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411



 Score = 40.7 bits (96), Expect = 6e-04
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           KV+     ++EK K+K   KK+ +  K   K++K    
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 39.5 bits (93), Expect = 0.002
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
              +KK  K  KK   K+ +KK+K+K+K ++  +
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415



 Score = 36.1 bits (84), Expect = 0.017
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           R K K    KK  K  KK   K+ +KK+K++ 
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKE 408



 Score = 32.6 bits (75), Expect = 0.27
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           K  ++KK  K  KK   K+ +KK+K +   K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411


>gnl|CDD|234212 TIGR03443, alpha_am_amid, L-aminoadipate-semialdehyde
           dehydrogenase.  Members of this protein family are
           L-aminoadipate-semialdehyde dehydrogenase (EC 1.2.1.31),
           product of the LYS2 gene. It is also called
           alpha-aminoadipate reductase. In fungi, lysine is
           synthesized via aminoadipate. Currently, all members of
           this family are fungal.
          Length = 1389

 Score = 47.4 bits (113), Expect = 6e-06
 Identities = 22/45 (48%), Positives = 31/45 (68%), Gaps = 1/45 (2%)

Query: 153 RMYKTGDYGRIV-DDILLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
           R+Y+TGD GR + D  +   GR D QVKIRG R++L E++  LS+
Sbjct: 678 RLYRTGDLGRYLPDGNVECCGRADDQVKIRGFRIELGEIDTHLSQ 722



 Score = 36.6 bits (85), Expect = 0.017
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 235 GRTDSQVKIRGNRVDLSEVEAALSK 259
           GR D QVKIRG R++L E++  LS+
Sbjct: 698 GRADDQVKIRGFRIELGEIDTHLSQ 722


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 44.6 bits (106), Expect = 3e-05
 Identities = 20/39 (51%), Positives = 29/39 (74%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           K   P P ++++ +KK  K+KKKKK+KKK KK+KKK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 43.8 bits (104), Expect = 7e-05
 Identities = 18/30 (60%), Positives = 25/30 (83%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           K K+++ K +K+KKKKK+KKK KK+KKK R
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 42.3 bits (100), Expect = 2e-04
 Identities = 17/30 (56%), Positives = 25/30 (83%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           K  KKK ++KK +K+KKKKK+KKK KK+++
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 40.7 bits (96), Expect = 7e-04
 Identities = 18/33 (54%), Positives = 26/33 (78%), Gaps = 1/33 (3%)

Query: 23  REK-AKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           +EK  K  K K+++KK +K+KKKKK+KKK K+R
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 35.7 bits (83), Expect = 0.027
 Identities = 15/31 (48%), Positives = 25/31 (80%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           E+ K+K  K  KKK+++KK +K+KKKKK+++
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKK 403



 Score = 31.5 bits (72), Expect = 0.52
 Identities = 14/36 (38%), Positives = 24/36 (66%), Gaps = 4/36 (11%)

Query: 23  REKAKKK----KNKKKKKKKKKKKKKKKKKKKKKRR 54
           +E+  K+    K K  K  KKK+++KK +K+KKK++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKK 400


>gnl|CDD|213270 cd04433, AFD_class_I, Adenylate forming domain, Class I.  This
           family includes acyl- and aryl-CoA ligases, as well as
           the adenylation domain of nonribosomal peptide
           synthetases and firefly luciferases. The
           adenylate-forming enzymes catalyze an ATP-dependent
           two-step reaction to first activate a carboxylate
           substrate as an adenylate and then transfer the
           carboxylate to the pantetheine group of either coenzyme
           A or an acyl-carrier protein. The active site of the
           domain is located at the interface of a large N-terminal
           subdomain and a smaller C-terminal subdomain.
          Length = 338

 Score = 43.0 bits (102), Expect = 1e-04
 Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
           Y+TGD G + ++  LY  GR+   +K+ G  V  +EVE+ L +   V 
Sbjct: 226 YRTGDLGYLDEEGYLYITGRSKDLIKVGGENVYPAEVESVLLQHPAVA 273



 Score = 36.5 bits (85), Expect = 0.012
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A +LYTSG+TG PK
Sbjct: 1   DPAFILYTSGTTGKPK 16



 Score = 31.1 bits (71), Expect = 0.60
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
           GR+   +K+ G  V  +EVE+ L +   V 
Sbjct: 244 GRSKDLIKVGGENVYPAEVESVLLQHPAVA 273


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 23/29 (79%), Positives = 24/29 (82%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           EKAK+KK KKKKKKKKKKK  KK  KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 39.7 bits (93), Expect = 4e-04
 Identities = 19/29 (65%), Positives = 22/29 (75%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           K K K+ K+KKKKKKKKKKK  KK  KK+
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 38.5 bits (90), Expect = 8e-04
 Identities = 18/28 (64%), Positives = 22/28 (78%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +  K+K K+KK+KKKKKKKKKKK  KK 
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 38.1 bits (89), Expect = 0.001
 Identities = 20/29 (68%), Positives = 22/29 (75%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
           E+ K KK+K KKKKKKKKK  KK  KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 37.7 bits (88), Expect = 0.002
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           +   +K K KK+KKKKKKKKKKK  KK  K++
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 37.7 bits (88), Expect = 0.002
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           ++  K+K  +KK+KKKKKKKKKKK  KK  
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131



 Score = 36.2 bits (84), Expect = 0.005
 Identities = 19/32 (59%), Positives = 24/32 (75%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            E +K  K+  K+K K+KK+KKKKKKKKKKK 
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKT 126



 Score = 36.2 bits (84), Expect = 0.006
 Identities = 19/34 (55%), Positives = 22/34 (64%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           K   K+  K+KK+KKKKKKKKKKK  KK     K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 33.5 bits (77), Expect = 0.045
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 7   ISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKK 47
           + E    K ++    +++K KKKK KKKKKK  KK  KKKK
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 32.4 bits (74), Expect = 0.12
 Identities = 19/35 (54%), Positives = 24/35 (68%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           F  E+ K  K   K+K K+KK+KKKKKKKKKK+  
Sbjct: 93  FLVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127



 Score = 31.2 bits (71), Expect = 0.28
 Identities = 15/28 (53%), Positives = 22/28 (78%)

Query: 32  KKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           K+  K+K K+KK+KKKKKKKK++  +K 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKK 129


>gnl|CDD|213289 cd05922, FACL_like_6, Uncharacterized subfamily of fatty acid CoA
           ligase (FACL).  Fatty acyl-CoA ligases catalyze the
           ATP-dependent activation of fatty acids in a two-step
           reaction. The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. This is a
           required step before free fatty acids can participate in
           most catabolic and anabolic reactions.
          Length = 350

 Score = 42.6 bits (101), Expect = 1e-04
 Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
             TGD  R+ ++  LY  GR D  +K RG RV  +E+E  +     V + +V+
Sbjct: 233 LYTGDLVRMDEEGYLYFVGRKDDMIKTRGYRVSPTEIEEVICAHPLVAEAAVI 285



 Score = 34.9 bits (81), Expect = 0.043
 Identities = 11/18 (61%), Positives = 15/18 (83%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +D+A ++YTSGSTG PK
Sbjct: 1   DDDLAAIIYTSGSTGEPK 18



 Score = 33.0 bits (76), Expect = 0.17
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 231 LLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
           L + GR D  +K RG RV  +E+E  +     V + +V+
Sbjct: 247 LYFVGRKDDMIKTRGYRVSPTEIEEVICAHPLVAEAAVI 285


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 40.6 bits (95), Expect = 3e-04
 Identities = 18/27 (66%), Positives = 22/27 (81%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           KK K KK +  K++KKKKK+KKKKKKR
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168



 Score = 39.4 bits (92), Expect = 7e-04
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
            E++  KKK    K++KKKKK+KKKKKK+ 
Sbjct: 140 HEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169



 Score = 39.4 bits (92), Expect = 7e-04
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           EK  KKK  +  K++KKKKK+KKKKKK+  
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170



 Score = 39.4 bits (92), Expect = 7e-04
 Identities = 17/27 (62%), Positives = 22/27 (81%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           KK  KKK +  K++KKKKK+KKKKK+R
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168



 Score = 39.0 bits (91), Expect = 0.001
 Identities = 16/29 (55%), Positives = 21/29 (72%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           K KK K +  K++KKKKK+KKKKKK+   
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKRHSP 171



 Score = 38.7 bits (90), Expect = 0.001
 Identities = 19/37 (51%), Positives = 28/37 (75%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           P+  E  K  +KK+KKKK +  K++KKKKK+KKKK++
Sbjct: 131 PSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKK 167



 Score = 37.9 bits (88), Expect = 0.002
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
             P + E  K  + K KKKK +  K++KKKKK+KK++    S E+P
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174



 Score = 34.0 bits (78), Expect = 0.042
 Identities = 16/42 (38%), Positives = 25/42 (59%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
           ++  K  +KK KKKK +  K++KKKKK +   K   +P + G
Sbjct: 134 SEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPG 175


>gnl|CDD|213274 cd05906, A_NRPS_TubE_like, The adenylation domain (A domain) of a
           family of nonribosomal peptide synthetases (NRPSs)
           synthesizing toxins and antitumor agents.  The
           adenylation (A) domain of NRPS recognizes a specific
           amino acid or hydroxy acid and activates it as an
           (amino)-acyl adenylate by hydrolysis of ATP. The
           activated acyl moiety then forms a thioester to the
           enzyme-bound cofactor phosphopantetheine of a peptidyl
           carrier protein domain. This family includes NRPSs that
           synthesize toxins and antitumor agents; for example,
           TubE for Tubulysine, CrpA for cryptophycin, TdiA for
           terrequinone A, KtzG for kutzneride, and Vlm1/Vlm2 for
           Valinomycin. Nonribosomal peptide synthetases are large
           multifunctional enzymes which synthesize many
           therapeutically useful peptides. NRPS has a distinct
           modular structure in which each module is responsible
           for the recognition, activation, and, in some cases,
           modification of a single amino acid residue of the final
           peptide product. The modules can be subdivided into
           domains that catalyze specific biochemical reactions.
          Length = 560

 Score = 41.5 bits (98), Expect = 4e-04
 Identities = 20/53 (37%), Positives = 30/53 (56%), Gaps = 5/53 (9%)

Query: 68  RTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           RT   L  LR+L+     +  P ++ LH      + +D A++L TSGSTG+PK
Sbjct: 136 RTRAGLEALRVLAIEELRSAPP-DAPLHP----ARPDDPALLLLTSGSTGVPK 183



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 21/92 (22%)

Query: 155 YKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV-------------- 200
           ++TGD G + D  L   GR    + I G      E+EAA+ ++  V              
Sbjct: 431 FRTGDLGFLHDGRLTITGREKDMIIINGVNYYNHEIEAAVEQVPGVEPSFTAACAVRDAG 490

Query: 201 ---DKLSVLCYKP--GEVDQVSEVTSEEIRGV 227
              D+L  + + P     D+++      IR  
Sbjct: 491 SGTDEL-AVFFVPRYDLDDRLAA-LLRAIRTH 520


>gnl|CDD|223442 COG0365, Acs, Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
           [Lipid metabolism].
          Length = 528

 Score = 41.1 bits (97), Expect = 5e-04
 Identities = 10/18 (55%), Positives = 14/18 (77%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
           ++D   +LYTSG+TG PK
Sbjct: 170 ADDPLFLLYTSGTTGKPK 187



 Score = 40.7 bits (96), Expect = 7e-04
 Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 3/62 (4%)

Query: 142 WSSGDRAIKY--RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLS 198
           W+  +R  +    R Y+TGD+    +D      GR+D  +K+ G R+   E+E+ L    
Sbjct: 383 WNDPERYKEAYFGRWYRTGDWAERDEDGYFWLHGRSDDVIKVSGKRIGPLEIESVLLAHP 442

Query: 199 EV 200
            V
Sbjct: 443 AV 444



 Score = 29.2 bits (66), Expect = 3.2
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 224 IRGVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEV 263
            R  D      GR+D  +K+ G R+   E+E+ L     V
Sbjct: 405 ERDEDGYFWLHGRSDDVIKVSGKRIGPLEIESVLLAHPAV 444


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 39.0 bits (92), Expect = 7e-04
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          KA +K  KK +K K K +KKK KK++ K R+
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 37.9 bits (89), Expect = 0.002
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          +K  K   K +KK +K K K +KKK KK+  
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95



 Score = 37.5 bits (88), Expect = 0.002
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKK 50
           EKA+KK  K K K +KKK KK++ K +
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 37.1 bits (87), Expect = 0.003
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          E +  +K + K +K K K +KKK KK++ K R
Sbjct: 67 ELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 34.4 bits (80), Expect = 0.023
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          ++E    +K +KK +K K K +KKK KK++ +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 34.0 bits (79), Expect = 0.032
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
          KK+    +K +KK +K K K +KKK ++   K  
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 32.1 bits (74), Expect = 0.15
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKK 48
          +E+ + K +K K K +KKK KK++ K +
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 31.3 bits (72), Expect = 0.32
 Identities = 13/42 (30%), Positives = 22/42 (52%)

Query: 6  TISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKK 47
          ++     DK  + A  + EK  +K   K +KKK KK++ K +
Sbjct: 57 SLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98


>gnl|CDD|223395 COG0318, CaiC, Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases
           II [Lipid metabolism / Secondary metabolites
           biosynthesis, transport, and catabolism].
          Length = 534

 Score = 39.8 bits (93), Expect = 0.001
 Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 146 DRAIKYRRMYKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
             A       +TGD G + +D  LY  GR    +   G  +   E+EA L++   V 
Sbjct: 394 AEAFDEDGWLRTGDLGYVDEDGYLYIVGRLKDLIISGGENIYPEEIEAVLAEHPAVA 450



 Score = 37.8 bits (88), Expect = 0.006
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
             +D+A +LYTSG+TG+PK
Sbjct: 169 DPDDLAFLLYTSGTTGLPK 187



 Score = 27.8 bits (62), Expect = 9.5
 Identities = 8/30 (26%), Positives = 13/30 (43%)

Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
           GR    +   G  +   E+EA L++   V 
Sbjct: 421 GRLKDLIISGGENIYPEEIEAVLAEHPAVA 450


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 39.0 bits (92), Expect = 0.002
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          +K  K  +K +KKK+K++ K  K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 37.9 bits (89), Expect = 0.004
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          K + K   K +KKK+K++ K  K K K R
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 36.3 bits (85), Expect = 0.013
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
               K+ K   K +KKK+K++ K  K K K R+
Sbjct: 59 ALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92



 Score = 35.6 bits (83), Expect = 0.025
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          +     KK+ K   K +KKK+K++ K  K K +
Sbjct: 57 KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 32.5 bits (75), Expect = 0.26
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
          KK+ K   K +KKK+K++ K  +  +K  
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 29.0 bits (66), Expect = 3.0
 Identities = 7/23 (30%), Positives = 13/23 (56%)

Query: 21 FEREKAKKKKNKKKKKKKKKKKK 43
          + + + KK+K + K  K K K +
Sbjct: 69 WHKAQKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 37.7 bits (88), Expect = 0.002
 Identities = 19/31 (61%), Positives = 24/31 (77%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           + EK  KK+ K++KKK+KKKKKKK KK  KK
Sbjct: 79  DEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 36.6 bits (85), Expect = 0.005
 Identities = 17/30 (56%), Positives = 24/30 (80%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           + K ++K  KK+ K++KKK+KKKKKKK KK
Sbjct: 76  KRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 34.6 bits (80), Expect = 0.020
 Identities = 16/31 (51%), Positives = 25/31 (80%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E+++  ++K  KK+ K++KKK+KKKKKKK K
Sbjct: 74  EKKRKDEEKTAKKRAKRQKKKQKKKKKKKAK 104



 Score = 34.3 bits (79), Expect = 0.028
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
             +++ K  K + K++KKK+KKKKKKK KK  +
Sbjct: 76  KRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNK 108



 Score = 33.9 bits (78), Expect = 0.034
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           R K +KKK KKKKKKK KK  KK++K+  K 
Sbjct: 87  RAKRQKKKQKKKKKKKAKKGNKKEEKEGSKS 117



 Score = 33.5 bits (77), Expect = 0.044
 Identities = 17/31 (54%), Positives = 25/31 (80%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +RE+ K+K  +K  KK+ K++KKK+KKKKKK
Sbjct: 71  KREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101



 Score = 32.7 bits (75), Expect = 0.080
 Identities = 15/38 (39%), Positives = 29/38 (76%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           F++++ +KK+  ++K  KK+ K++KKK+KKKK++   K
Sbjct: 68  FQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 29.2 bits (66), Expect = 1.3
 Identities = 16/29 (55%), Positives = 24/29 (82%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
            +++AK++K K+KKKKKKK KK  KK++K
Sbjct: 84  AKKRAKRQKKKQKKKKKKKAKKGNKKEEK 112



 Score = 28.9 bits (65), Expect = 2.1
 Identities = 15/29 (51%), Positives = 20/29 (68%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           R+K K+KK KKKK KK  KK++K+  K  
Sbjct: 90  RQKKKQKKKKKKKAKKGNKKEEKEGSKSS 118



 Score = 28.1 bits (63), Expect = 3.6
 Identities = 14/38 (36%), Positives = 27/38 (71%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           E E+ ++K+ +KK+K ++K  KK+ K++KKK++   K 
Sbjct: 64  EDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101



 Score = 26.5 bits (59), Expect = 9.8
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKK 49
           +++K KKKK KK KK  KK++K+  K  
Sbjct: 91  QKKKQKKKKKKKAKKGNKKEEKEGSKSS 118


>gnl|CDD|215217 PLN02387, PLN02387, long-chain-fatty-acid-CoA ligase family
           protein.
          Length = 696

 Score = 39.3 bits (92), Expect = 0.002
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 95  HADNNNEQSNDIAIVLYTSGSTGIPK 120
             D +    NDIA+++YTSGSTG+PK
Sbjct: 241 PVDPDLPSPNDIAVIMYTSGSTGLPK 266


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 38.9 bits (91), Expect = 0.003
 Identities = 16/46 (34%), Positives = 30/46 (65%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           K+      +REK +K+  K++K+ +K KK+++KKKK+ ++   AK 
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591


>gnl|CDD|213297 cd05931, FAAL, Fatty acyl-AMP ligase (FAAL).  FAAL belongs to the
           class I adenylate forming enzyme family and is
           homologous to fatty acyl-coenzyme A (CoA) ligases
           (FACLs). However, FAALs produce only the acyl adenylate
           and are unable to perform the thioester-forming
           reaction, while FACLs perform a two-step catalytic
           reaction; AMP ligation followed by CoA ligation using
           ATP and CoA as cofactors. FAALs have insertion motifs
           between the N-terminal and C-terminal subdomains that
           distinguish them from the FACLs. This insertion motif
           precludes the binding of CoA, thus preventing CoA
           ligation. It has been suggested that the acyl adenylates
           serve as substrates for multifunctional polyketide
           synthases to permit synthesis of complex lipids such as
           phthiocerol dimycocerosate, sulfolipids, mycolic acids,
           and mycobactin.
          Length = 547

 Score = 38.7 bits (91), Expect = 0.003
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +DIA + YTSGSTG PK
Sbjct: 147 PDDIAFLQYTSGSTGAPK 164


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 37.6 bits (88), Expect = 0.004
 Identities = 8/57 (14%), Positives = 26/57 (45%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRL 78
            + +A++ + + KK++   +K+K  ++++K +    +    P  +       P   +
Sbjct: 108 AKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDI 164



 Score = 32.2 bits (74), Expect = 0.22
 Identities = 12/36 (33%), Positives = 22/36 (61%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            A  E+EKA +++ K K K  +KK+K + +K + + 
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158



 Score = 31.8 bits (73), Expect = 0.27
 Identities = 5/32 (15%), Positives = 21/32 (65%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           + K + ++ +++ KK++   +K+K  +++++ 
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKP 138



 Score = 31.8 bits (73), Expect = 0.34
 Identities = 7/38 (18%), Positives = 18/38 (47%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
                 ++   ++ +K K K  +KK+K + +K + +  
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159



 Score = 29.1 bits (66), Expect = 2.1
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           E+AK K   ++ +++ KK++   +K+K  RR
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRR 134



 Score = 28.4 bits (64), Expect = 3.7
 Identities = 8/35 (22%), Positives = 18/35 (51%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
              E+   +++K K K  +KK+K + +K + +   
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 36.5 bits (85), Expect = 0.004
 Identities = 15/34 (44%), Positives = 27/34 (79%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            ER K +KKK  K+KK+++K++K++ K++KKK +
Sbjct: 95  HERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAK 128



 Score = 29.9 bits (68), Expect = 0.64
 Identities = 11/29 (37%), Positives = 24/29 (82%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           K +  + K++KKK+ K+KK+++K++KR++
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQL 120



 Score = 29.9 bits (68), Expect = 0.80
 Identities = 12/34 (35%), Positives = 28/34 (82%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           +A K ++++ K++KKK+ K+KK+++K+++R L +
Sbjct: 89  QALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQ 122


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 38.2 bits (89), Expect = 0.005
 Identities = 14/42 (33%), Positives = 21/42 (50%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
              FE + A + + K K K  KK  +K KK   K ++I A +
Sbjct: 760 KVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAAT 801



 Score = 32.8 bits (75), Expect = 0.28
 Identities = 15/37 (40%), Positives = 17/37 (45%), Gaps = 4/37 (10%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKK----KKKR 53
           F  EK  K K  KK  +K KK   K +K     K KR
Sbjct: 769 FRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKR 805



 Score = 31.3 bits (71), Expect = 0.73
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
             KK K K  KK  +K KK   K +K      AK
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAK 804



 Score = 30.9 bits (70), Expect = 0.86
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +K   P   +++  K KK   K +K     K K+  KKK 
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811



 Score = 30.5 bits (69), Expect = 1.4
 Identities = 11/38 (28%), Positives = 16/38 (42%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           E++   K   K  +K KK   K +K     K +  AK 
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKK 809


>gnl|CDD|213271 cd05903, CHC_CoA_lg, Cyclohexanecarboxylate-CoA ligase (also called
           cyclohex-1-ene-1-carboxylate:CoA ligase).
           Cyclohexanecarboxylate-CoA ligase activates the
           aliphatic ring compound, cyclohexanecarboxylate, for
           degradation. It catalyzes the synthesis of
           cyclohexanecarboxylate-CoA thioesters in a two-step
           reaction involving the formation of
           cyclohexanecarboxylate-AMP anhydride, followed by the
           nucleophilic substitution of AMP by CoA.
          Length = 437

 Score = 38.0 bits (89), Expect = 0.005
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 101 EQSNDIAIVLYTSGSTGIPK 120
              +D+A++LYTSG+TG PK
Sbjct: 90  AMPDDVALLLYTSGTTGEPK 109



 Score = 29.5 bits (67), Expect = 2.1
 Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 5/54 (9%)

Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRG--NRVDLSEVEAALSKLSEVDKLSV 205
           ++TGD GR+  D  L   GR    + IRG  N +   E+E  L +   V +++V
Sbjct: 318 FRTGDLGRLDADGYLRITGRK-KDIIIRGGEN-ISAREIEDLLLRHPAVAEVAV 369


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 37.0 bits (86), Expect = 0.005
 Identities = 19/52 (36%), Positives = 29/52 (55%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
               E E  +++K +KKKKK+ KK+KK+KK KK+K      S +    K + 
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 36.2 bits (84), Expect = 0.010
 Identities = 20/46 (43%), Positives = 28/46 (60%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           KV   A  E E+ K+KK KK+ KK+KK+KK KK+K  + +    K 
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 35.6 bits (82), Expect = 0.006
 Identities = 16/27 (59%), Positives = 18/27 (66%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKK 44
           P P E+ K KKKK+KK KK KK  KK 
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 35.6 bits (82), Expect = 0.007
 Identities = 15/22 (68%), Positives = 15/22 (68%)

Query: 30  KNKKKKKKKKKKKKKKKKKKKK 51
           K KKKKKK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 34.5 bits (79), Expect = 0.016
 Identities = 15/22 (68%), Positives = 15/22 (68%)

Query: 31  NKKKKKKKKKKKKKKKKKKKKK 52
            K KKKKKK KK KK KK  KK
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKK 119



 Score = 34.5 bits (79), Expect = 0.018
 Identities = 16/32 (50%), Positives = 17/32 (53%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
           P     E  +K K KKKK KK KK KK  KK 
Sbjct: 89  PDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 33.3 bits (76), Expect = 0.043
 Identities = 14/20 (70%), Positives = 14/20 (70%)

Query: 33  KKKKKKKKKKKKKKKKKKKK 52
           KKKKKK KK KK KK  KK 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 32.2 bits (73), Expect = 0.12
 Identities = 14/21 (66%), Positives = 15/21 (71%)

Query: 33  KKKKKKKKKKKKKKKKKKKKR 53
           K KKKKKK KK KK KK  K+
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 30.6 bits (69), Expect = 0.35
 Identities = 13/21 (61%), Positives = 15/21 (71%)

Query: 34  KKKKKKKKKKKKKKKKKKKRR 54
           K KKKKKK KK KK KK  ++
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria and eukaryotes.
          Proteins in this family are typically between 137 to
          196 amino acids in length.
          Length = 149

 Score = 35.8 bits (83), Expect = 0.008
 Identities = 15/37 (40%), Positives = 21/37 (56%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            PFE ++++KK  K  +K    K K K+KK K KR 
Sbjct: 7  RLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRA 43


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 37.2 bits (86), Expect = 0.008
 Identities = 17/52 (32%), Positives = 24/52 (46%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
           P      +AK+ K  KK +  K  K  K KK KKK  +  +S + P    R+
Sbjct: 136 PISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRS 187



 Score = 35.2 bits (81), Expect = 0.032
 Identities = 16/50 (32%), Positives = 20/50 (40%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
             P  +E  K KK +  K  K  K KK KKK     R +     +P  K 
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKP 189



 Score = 32.2 bits (73), Expect = 0.29
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
             P      + K+ KK KK +  K  K  K KK K++
Sbjct: 134 VPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170



 Score = 32.2 bits (73), Expect = 0.34
 Identities = 16/46 (34%), Positives = 19/46 (41%)

Query: 9   EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            H     I  AP E ++ KK K  +  K  K  K KK KKK     
Sbjct: 130 HHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSN 175



 Score = 30.6 bits (69), Expect = 0.82
 Identities = 15/42 (35%), Positives = 17/42 (40%)

Query: 9   EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
            HH +   I       K  KK  K +  K  K  K KK KKK
Sbjct: 129 MHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170



 Score = 29.9 bits (67), Expect = 1.8
 Identities = 12/47 (25%), Positives = 19/47 (40%)

Query: 16  IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
            +  P       + K  KK KK +  K  K  K KK ++  + S  +
Sbjct: 131 HLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177



 Score = 27.9 bits (62), Expect = 7.3
 Identities = 9/38 (23%), Positives = 12/38 (31%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
             K  K KK KKK     +  K      R       ++
Sbjct: 158 VPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQD 195


>gnl|CDD|233770 TIGR02188, Ac_CoA_lig_AcsA, acetate--CoA ligase.  This model
           describes acetate-CoA ligase (EC 6.2.1.1), also called
           acetyl-CoA synthetase and acetyl-activating enzyme. It
           catalyzes the reaction ATP + acetate + CoA = AMP +
           diphosphate + acetyl-CoA and belongs to the family of
           AMP-binding enzymes described by pfam00501.
          Length = 625

 Score = 36.8 bits (86), Expect = 0.011
 Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 7/49 (14%)

Query: 150 KYRRMYKTGDYGRIVDD----ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
            +   Y TGD  R   D    I    GR D  + + G+R+  +E+E+AL
Sbjct: 472 PFPGYYFTGDGARRDKDGYIWIT---GRVDDVINVSGHRLGTAEIESAL 517



 Score = 30.7 bits (70), Expect = 1.1
 Identities = 20/54 (37%), Positives = 25/54 (46%), Gaps = 12/54 (22%)

Query: 103 SNDIAIVLYTSGSTGIPKEYRRMYKTGDY-------GRIVDDIH---RCWSSGD 146
           S D   +LYTSGSTG PK    ++ TG Y        + V DI      W + D
Sbjct: 235 SEDPLFILYTSGSTGKPKGV--LHTTGGYLLYAAMTMKYVFDIKDGDIFWCTAD 286


>gnl|CDD|213293 cd05927, LC-FACS_euk, Eukaryotic long-chain fatty acid CoA
           synthetase (LC-FACS).  The members of this family are
           eukaryotic fatty acid CoA synthetases that activate
           fatty acids with chain lengths of 12 to 20. LC-FACS
           catalyzes the formation of fatty acyl-CoA in a two-step
           reaction: the formation of a fatty acyl-AMP molecule as
           an intermediate, and the formation of a fatty acyl-CoA.
           This is a required step before free fatty acids can
           participate in most catabolic and anabolic reactions.
           Organisms tend to have multiple isoforms of LC-FACS
           genes with multiple splice variants. For example, nine
           genes are found in Arabidopsis and six genes are
           expressed in mammalian cells.
          Length = 539

 Score = 36.8 bits (86), Expect = 0.011
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
            D+A ++YTSG+TG PK
Sbjct: 111 EDLATIMYTSGTTGNPK 127


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 36.8 bits (86), Expect = 0.012
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           ER+K +KK+ K +KK +K++ +K   KKK +
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAAAKKKAE 440



 Score = 35.3 bits (82), Expect = 0.041
 Identities = 11/36 (30%), Positives = 21/36 (58%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
            + KK + K++K +KK +K++ +K   KK+   A  
Sbjct: 409 AERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444



 Score = 34.1 bits (79), Expect = 0.092
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 3/49 (6%)

Query: 25  KAKKKKNKKKKKKKKKKKKK---KKKKKKKKRRILAKSCENPAYKGRTV 70
            A++KK +KK++K +KK +K   +K   KKK    AK  + P  + + V
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 32.2 bits (74), Expect = 0.32
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           R+K +K + K +K++ +K   KKK +   K+ 
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 31.4 bits (72), Expect = 0.61
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E+E+A+K   KKK +   KK K    + KK 
Sbjct: 426 EKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 31.1 bits (71), Expect = 0.82
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           + + K +K + +K   KKK +   KK K  
Sbjct: 420 KAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
             +KA+K++ +K   KKK +   KK K   
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450



 Score = 30.3 bits (69), Expect = 1.4
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
                 + KK +KK++K +KK +K++ +K  
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAA 434



 Score = 30.3 bits (69), Expect = 1.4
 Identities = 11/34 (32%), Positives = 16/34 (47%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           A  + EK + +K   KKK +   KK K    + K
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454



 Score = 28.0 bits (63), Expect = 8.4
 Identities = 13/47 (27%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 12  FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            DK   P   E E+ + +       ++KK +KK++K +KK  +  A+
Sbjct: 388 HDK---PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAE 431


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 35.8 bits (83), Expect = 0.013
 Identities = 18/31 (58%), Positives = 20/31 (64%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E E+ +K K KKKK KKKK K K KK  KK 
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKD 107



 Score = 34.7 bits (80), Expect = 0.028
 Identities = 15/32 (46%), Positives = 23/32 (71%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           +++K+KKKK+K K KK  KK  K +KK +K+ 
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 34.3 bits (79), Expect = 0.035
 Identities = 14/38 (36%), Positives = 18/38 (47%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
            ++K  KKK  K K KK  KK  K +KK +K       
Sbjct: 86  WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 33.9 bits (78), Expect = 0.050
 Identities = 16/32 (50%), Positives = 19/32 (59%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +E ++  K K KK KKKK K K KK  KK  K
Sbjct: 78  YEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 33.9 bits (78), Expect = 0.055
 Identities = 17/31 (54%), Positives = 19/31 (61%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E++K K KK K KKKK K K KK  KK  K 
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKS 110



 Score = 33.9 bits (78), Expect = 0.058
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
            ++ K KK K+K KK  KK  K +KK +K+ + ++
Sbjct: 88  KKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122



 Score = 33.5 bits (77), Expect = 0.075
 Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            ++ +K K KK K K K KK  KK  K +KK +
Sbjct: 83  KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 32.4 bits (74), Expect = 0.18
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +++   KKK  KKKK K K KK  KK  K +
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111



 Score = 32.0 bits (73), Expect = 0.23
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           ++ K KKKK+KKKK K K KK  KK  K +K
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112



 Score = 31.6 bits (72), Expect = 0.31
 Identities = 15/31 (48%), Positives = 17/31 (54%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           + K KK K KK K K KK  KK  K +KK  
Sbjct: 85  KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 31.2 bits (71), Expect = 0.38
 Identities = 18/31 (58%), Positives = 21/31 (67%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E EK KK+  +K+K K KKKK KKKK K K 
Sbjct: 70  EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKD 100



 Score = 31.2 bits (71), Expect = 0.38
 Identities = 18/35 (51%), Positives = 19/35 (54%), Gaps = 4/35 (11%)

Query: 22  EREKAKKKKN----KKKKKKKKKKKKKKKKKKKKK 52
             E  K KK     +K K KKKK KKKK K K KK
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102



 Score = 30.8 bits (70), Expect = 0.53
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +++K K KK+ KK  K +KK +K+ + K + 
Sbjct: 94  KKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 30.4 bits (69), Expect = 0.77
 Identities = 15/38 (39%), Positives = 22/38 (57%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
            ++K  K K+KK  KK  K +KK +K+ + K   L KS
Sbjct: 91  SKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKS 128



 Score = 29.7 bits (67), Expect = 1.5
 Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 7/49 (14%)

Query: 11 HFDKVIIPAPFEREKAKKKKNKKKKKK-------KKKKKKKKKKKKKKK 52
          HF   I  A +   K KKK+  ++ +K       K+K K KKKK KKKK
Sbjct: 47 HFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKK 95



 Score = 29.3 bits (66), Expect = 1.6
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
           +K  KK +K +KK +K+ + K +   K     L+   E
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSE 138


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 35.8 bits (83), Expect = 0.016
 Identities = 19/43 (44%), Positives = 27/43 (62%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           E+E  ++++ KKKK+ K K  K+  KKKKKK    AKS +  A
Sbjct: 75  EKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAA 117



 Score = 34.3 bits (79), Expect = 0.045
 Identities = 18/55 (32%), Positives = 25/55 (45%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQL 76
           +RE+  KKK + K K  K+  KKKKKK     +    +   P  K   +   P L
Sbjct: 79  QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTL 133



 Score = 31.6 bits (72), Expect = 0.41
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            E EK  +++ + KKKK+ K K  K+  KKKK++
Sbjct: 72  EEGEKELQREERLKKKKRVKTKAYKEPTKKKKKK 105


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 35.8 bits (83), Expect = 0.017
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
              E   +KK K K++ KKKK K+ K  K  K
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273



 Score = 35.8 bits (83), Expect = 0.019
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            +A+KK+  K++ KKKK K+ K  K  KK
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKALKK 274



 Score = 35.4 bits (82), Expect = 0.028
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           + A   R + K+K  ++ KKKK K+ K  K  KK 
Sbjct: 241 VEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKV 275



 Score = 34.2 bits (79), Expect = 0.069
 Identities = 13/29 (44%), Positives = 16/29 (55%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
            +E+ KKKK K+ K  K  KK   K  KK
Sbjct: 254 SKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 33.5 bits (77), Expect = 0.11
 Identities = 11/33 (33%), Positives = 15/33 (45%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
              + ++  KKK  K+ K  K  KK   K  KK
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 33.1 bits (76), Expect = 0.15
 Identities = 12/35 (34%), Positives = 15/35 (42%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
                + K + KK K K+ K  K  KK   K  KK
Sbjct: 248 AEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 32.3 bits (74), Expect = 0.23
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           E+++  K++ KKKK K+ K  K  KK   K  
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALKKVVAKGM 280



 Score = 31.5 bits (72), Expect = 0.45
 Identities = 12/32 (37%), Positives = 22/32 (68%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           + +K +K K++ KKKK K+ K  K  ++++AK
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278



 Score = 30.8 bits (70), Expect = 0.86
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           +A + + +KK+K K++ KKKK K+ K  +
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVK 270



 Score = 29.2 bits (66), Expect = 2.7
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            +   +KK+K K++ KKKK K+ K  + L K
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGVKALKK 274


>gnl|CDD|213286 cd05919, BCL_like, Benzoate CoA ligase (BCL) and similar adenylate
           forming enzymes.  This family contains benzoate CoA
           ligase (BCL) and related ligases that catalyze the
           acylation of benzoate derivatives, 2-aminobenzoate and
           4-hydroxybenzoate. Aromatic compounds represent the
           second most abundant class of organic carbon compounds
           after carbohydrates. Xenobiotic aromatic compounds are
           also a major class of man-made pollutants. Some bacteria
           use benzoate as the sole source of carbon and energy
           through benzoate degradation. Benzoate degradation
           starts with its activation to benzoyl-CoA by benzoate
           CoA ligase. The reaction catalyzed by benzoate CoA
           ligase proceeds via a two-step process; the first
           ATP-dependent step forms an acyl-AMP intermediate, and
           the second step forms the acyl-CoA ester with release of
           the AMP.
          Length = 436

 Score = 36.2 bits (84), Expect = 0.017
 Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 155 YKTGD-YGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
            +TGD + R  D    Y+GR D  +K+ G  V   EVEAAL +   V + +V
Sbjct: 316 LRTGDRFSRDADGWYRYQGRADDMIKVSGQWVSPLEVEAALGEHPAVAEAAV 367



 Score = 35.4 bits (82), Expect = 0.034
 Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 9/56 (16%)

Query: 105 DIAIVLYTSGSTGIPK----EYRRMYKTGD-YGRIVDDIHRCWSSGDRAIKYRRMY 155
           DIA  LYTSG+TG PK     +R      + + R +  +      GDR     +++
Sbjct: 92  DIAYWLYTSGTTGKPKGVMHRHRDPLTFAEAFARELLGL----QPGDRIFSSSKLF 143



 Score = 31.5 bits (72), Expect = 0.48
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 233 YEGRTDSQVKIRGNRVDLSEVEAALS 258
           Y+GR D  +K+ G  V   EVEAAL 
Sbjct: 332 YQGRADDMIKVSGQWVSPLEVEAALG 357


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 34.6 bits (80), Expect = 0.018
 Identities = 9/32 (28%), Positives = 23/32 (71%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           R + ++ K + +K+K+ +K ++KK K+++K +
Sbjct: 97  RLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128



 Score = 34.2 bits (79), Expect = 0.022
 Identities = 10/31 (32%), Positives = 23/31 (74%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            R +  K++ +K+K+ +K ++KK K+++K+K
Sbjct: 98  LRRERTKERAEKEKRTRKNREKKFKRRQKEK 128



 Score = 33.8 bits (78), Expect = 0.031
 Identities = 10/32 (31%), Positives = 22/32 (68%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           R   + ++ + K++ +K+K+ +K ++KK KRR
Sbjct: 93  RIALRLRRERTKERAEKEKRTRKNREKKFKRR 124



 Score = 33.0 bits (76), Expect = 0.072
 Identities = 12/30 (40%), Positives = 23/30 (76%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           ER K + +K K+ +K ++KK K+++K+K+K
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 32.2 bits (74), Expect = 0.11
 Identities = 11/31 (35%), Positives = 25/31 (80%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           RE+ K++  K+K+ +K ++KK K+++K+K++
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 31.1 bits (71), Expect = 0.32
 Identities = 10/37 (27%), Positives = 24/37 (64%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            A  E+E  ++K+ KKK+K+  KK++   + ++++ +
Sbjct: 68  NAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTK 104



 Score = 30.3 bits (69), Expect = 0.52
 Identities = 7/31 (22%), Positives = 22/31 (70%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
              + ++++ K++ +K+K+ +K ++KK K++
Sbjct: 94  IALRLRRERTKERAEKEKRTRKNREKKFKRR 124



 Score = 29.5 bits (67), Expect = 0.95
 Identities = 11/45 (24%), Positives = 22/45 (48%)

Query: 10  HHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
               K       E  + K+KK K+K+  KK++   + ++++ K R
Sbjct: 62  IDLKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKER 106



 Score = 29.2 bits (66), Expect = 1.2
 Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 2/36 (5%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKK--KKRRILAKSCE 61
           KK N K +K+  ++K+KKKK+K+  KKRRI  +   
Sbjct: 65  KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRR 100



 Score = 27.2 bits (61), Expect = 5.7
 Identities = 9/42 (21%), Positives = 26/42 (61%), Gaps = 8/42 (19%)

Query: 22  EREKAKKKKNKKK--------KKKKKKKKKKKKKKKKKKRRI 55
           E++K +K+  KK+        ++ K++ +K+K+ +K ++++ 
Sbjct: 80  EKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKF 121


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 33.5 bits (77), Expect = 0.019
 Identities = 13/29 (44%), Positives = 15/29 (51%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
          E +  KKKK KKKKK +      K KK  
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKAT 47



 Score = 30.8 bits (70), Expect = 0.19
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKK 52
          K + +  KKKKKKKKKK +      K
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAK 42



 Score = 30.4 bits (69), Expect = 0.25
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 32 KKKKKKKKKKKKKKKKKKKKK 52
          KKKKKKKKKK +      K K
Sbjct: 24 KKKKKKKKKKAEDTAATAKAK 44



 Score = 29.6 bits (67), Expect = 0.38
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query: 32 KKKKKKKKKKKKKKKKKKKKK 52
           KKKKKKKKKK +      K 
Sbjct: 23 VKKKKKKKKKKAEDTAATAKA 43



 Score = 26.6 bits (59), Expect = 5.3
 Identities = 10/36 (27%), Positives = 15/36 (41%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           A   ++K KKKK K +      K KK   +   + 
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEG 54



 Score = 26.2 bits (58), Expect = 6.6
 Identities = 10/32 (31%), Positives = 13/32 (40%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          +K KKKK KK +      K KK   +      
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKKATAEDVSEGA 55


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 35.8 bits (83), Expect = 0.019
 Identities = 19/36 (52%), Positives = 22/36 (61%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
             K KKKK KKKK   KK K KK   K++K R+ AK
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAK 287



 Score = 31.2 bits (71), Expect = 0.62
 Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 4/41 (9%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKK----KKKKKKKKKKKKRR 54
           P+  +++K KKKK   KK K KK    ++K + K KK +RR
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR 292



 Score = 29.3 bits (66), Expect = 2.8
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           P + +K KKKK K   KK K KK   K++K + +   A+
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQ 290



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 20/52 (38%), Positives = 24/52 (46%), Gaps = 5/52 (9%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLR 77
           A  + N  K KKKKKKKKK   KK K +++  K       K R      Q R
Sbjct: 246 AAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQ-----RKARVKAKKAQRR 292



 Score = 27.3 bits (61), Expect = 9.0
 Identities = 10/38 (26%), Positives = 22/38 (57%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
                 +K K KK   K++K + K KK +++++ +++I
Sbjct: 262 KKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKI 299


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 34.7 bits (80), Expect = 0.021
 Identities = 10/40 (25%), Positives = 22/40 (55%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
             +     E+ +A+K+  K  ++++ +K +K+ KK KK  
Sbjct: 86  HTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 34.7 bits (80), Expect = 0.024
 Identities = 12/37 (32%), Positives = 26/37 (70%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           E  + ++K+  K+ ++K+  K KK+KK+KK++++  K
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129



 Score = 32.8 bits (75), Expect = 0.11
 Identities = 13/38 (34%), Positives = 24/38 (63%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           + +K   K+ ++K+  K KK+KK+KK+KK   ++  K 
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKK 134



 Score = 32.0 bits (73), Expect = 0.21
 Identities = 17/43 (39%), Positives = 27/43 (62%)

Query: 16  IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           + P   E+E+ +  K  ++K+  K KK+KK+KK+KK    LAK
Sbjct: 90  LFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAK 132



 Score = 30.9 bits (70), Expect = 0.46
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
               E E+ +  K KK+KK+KK+KK  +K  KKK 
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKS 135



 Score = 29.7 bits (67), Expect = 0.98
 Identities = 14/45 (31%), Positives = 22/45 (48%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
                +E  +K+  K KK+KK+KK+KK  +K  KK+         
Sbjct: 99  QKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTT 143



 Score = 27.8 bits (62), Expect = 5.0
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 29  KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
           +K +K+  K+ ++K+  K KK+KK +   K  E  A K  T
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKST 136



 Score = 27.4 bits (61), Expect = 6.6
 Identities = 14/46 (30%), Positives = 21/46 (45%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
            + K +KK+ K+KK  +K  KKK  K  K   +   K       +G
Sbjct: 112 IKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 33.0 bits (76), Expect = 0.026
 Identities = 14/32 (43%), Positives = 24/32 (75%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          K+K+N+K + K+ KK+  KK  +KKKR+++ K
Sbjct: 5  KRKRNEKLRAKRAKKRAAKKAARKKKRKLIFK 36



 Score = 32.6 bits (75), Expect = 0.029
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            KK+K  +K + K+ KK+  KK  +KK+R L  
Sbjct: 2  LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIF 35



 Score = 30.7 bits (70), Expect = 0.13
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            K +  + KK+  KK  +KKK+K   KR
Sbjct: 9  NEKLRAKRAKKRAAKKAARKKKRKLIFKR 37



 Score = 28.7 bits (65), Expect = 0.64
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           EK + K+ KK+  KK  +KKK+K   K+
Sbjct: 9  NEKLRAKRAKKRAAKKAARKKKRKLIFKR 37



 Score = 28.3 bits (64), Expect = 0.96
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          +R +  + K  KK+  KK  +KKK+K   K+
Sbjct: 7  KRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37



 Score = 28.3 bits (64), Expect = 1.1
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          R++ +K + K+ KK+  KK  +KKK+K   +R
Sbjct: 6  RKRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37



 Score = 27.6 bits (62), Expect = 1.8
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E+ +AK+ K +  KK  +KKK+K   K+ +K
Sbjct: 9  NEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40


>gnl|CDD|236235 PRK08314, PRK08314, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 546

 Score = 35.7 bits (83), Expect = 0.026
 Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 7/50 (14%)

Query: 71  LSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           L+   +     +L+    P       D       D+A++ YTSG+TG+PK
Sbjct: 164 LAPGGVVAWKEALAAGLAPPPHTAGPD-------DLAVLPYTSGTTGVPK 206


>gnl|CDD|213277 cd05909, AAS_C, C-terminal domain of the acyl-acyl carrier protein
           synthetase (also called 2-acylglycerophosphoethanolamine
           acyltransferase, Aas).  Acyl-acyl carrier protein
           synthase (Aas) is a membrane protein responsible for a
           minor pathway of incorporating exogenous fatty acids
           into membrane phospholipids. Its in vitro activity is
           characterized by the ligation of free fatty acids
           between 8 and 18 carbons in length to the acyl carrier
           protein sulfydryl group (ACP-SH) in the presence of ATP
           and Mg2+. However, its in vivo function is as a
           2-acylglycerophosphoethanolamine (2-acyl-GPE)
           acyltransferase. The reaction occurs in two steps: the
           acyl chain is first esterified to acyl carrier protein
           (ACP) via a thioester bond, followed by a second step
           where the acyl chain is transferred to a
           2-acyllysophospholipid, thus completing the
           transacylation reaction. This model represents the
           C-terminal domain of the enzyme, which belongs to the
           class I adenylate-forming enzyme family, including
           acyl-CoA synthetases.
          Length = 489

 Score = 35.7 bits (83), Expect = 0.026
 Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 7/82 (8%)

Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKL----SVLCYK 209
           Y TGD G+I +D  L   GR     KI G  V L+ VE  LS++   D L     V   K
Sbjct: 370 YDTGDIGKIDEDGFLTIVGRLKRFAKIGGEMVSLTAVEELLSEILPEDSLHAAVEVPDEK 429

Query: 210 PGEVDQVSEVTSEEIRGVDDIL 231
            GE  ++  +T+ E    +++ 
Sbjct: 430 KGE--KIVLLTTTEDLDPEELN 449



 Score = 33.7 bits (78), Expect = 0.12
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 92  SMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           ++L       + +D A++L+TSGS G+PK
Sbjct: 133 ALLRLFLAGAKPDDPAVILFTSGSEGLPK 161


>gnl|CDD|213302 cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA
           synthetases similar to Escherichia coli FadD.  This
           subfamily of the AMP-forming adenylation family contains
           Escherichia coli FadD and similar prokaryotic fatty acid
           CoA synthetases. FadD was characterized as a long-chain
           fatty acid CoA synthetase. The gene fadD is regulated by
           the fatty acid regulatory protein FadR. Fatty acid CoA
           synthetase catalyzes the formation of fatty acyl-CoA in
           a two-step reaction: the formation of a fatty acyl-AMP
           molecule as an intermediate, followed by the formation
           of a fatty acyl-CoA. This is a required step before free
           fatty acids can participate in most catabolic and
           anabolic reactions.
          Length = 468

 Score = 35.6 bits (83), Expect = 0.027
 Identities = 9/17 (52%), Positives = 15/17 (88%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D+A++ YT G+TG+PK
Sbjct: 125 DDLAVLQYTGGTTGVPK 141


>gnl|CDD|235730 PRK06187, PRK06187, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 521

 Score = 35.5 bits (83), Expect = 0.029
 Identities = 12/17 (70%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           ND A +LYTSG+TG PK
Sbjct: 167 NDAAAMLYTSGTTGHPK 183


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 35.6 bits (82), Expect = 0.030
 Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 3/48 (6%)

Query: 20  PFEREKAKKKK---NKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           P+E++   KKK    K K  K  KKK K K KK  K+R       +PA
Sbjct: 746 PYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPA 793



 Score = 29.4 bits (66), Expect = 2.9
 Identities = 16/56 (28%), Positives = 20/56 (35%), Gaps = 1/56 (1%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLL 79
            K   +K  K KKK    K K  K  KKK +  +K        G      P L  +
Sbjct: 743 PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR-AGPLYTPSPALAAM 797


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 33.2 bits (76), Expect = 0.032
 Identities = 16/28 (57%), Positives = 18/28 (64%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          K K KK   KKKKKKKKKK K K++   
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVT 38



 Score = 31.6 bits (72), Expect = 0.11
 Identities = 17/36 (47%), Positives = 20/36 (55%)

Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
          K K KK   KKKKKKKKKK K K++     +  E  
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKS 46



 Score = 30.9 bits (70), Expect = 0.20
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          K KK   KKKKKKKKKK K K++   +K
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEK 40



 Score = 29.7 bits (67), Expect = 0.50
 Identities = 15/33 (45%), Positives = 18/33 (54%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
          K K  K   KKKKKKKKKK K K++      + 
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEE 43



 Score = 28.9 bits (65), Expect = 0.91
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            K    KKKKKKKKK K K++   +K+
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKE 41



 Score = 28.5 bits (64), Expect = 1.2
 Identities = 15/35 (42%), Positives = 23/35 (65%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
          K   KK KKKKKKK K K++   +K+++ +  A+S
Sbjct: 16 KIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50



 Score = 28.5 bits (64), Expect = 1.4
 Identities = 16/25 (64%), Positives = 16/25 (64%)

Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKK 52
            K K K KK   KKKKKKKKKK K
Sbjct: 7  GGKLKLKGKKIDVKKKKKKKKKKNK 31



 Score = 27.8 bits (62), Expect = 2.5
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          K    K KKKKKKKK K K++   +K++ 
Sbjct: 15 KKIDVKKKKKKKKKKNKSKEEVVTEKEEE 43



 Score = 26.6 bits (59), Expect = 6.3
 Identities = 16/61 (26%), Positives = 26/61 (42%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLS 84
          K KKKK KKK K K++   +K++++K       K  E      +       + L  A  +
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNLTEAERA 79

Query: 85 H 85
           
Sbjct: 80 F 80


>gnl|CDD|213275 cd05907, VL_LC_FACS_like, Long-chain fatty acid CoA synthetases and
           Bubblegum-like very long-chain fatty acid CoA
           synthetases.  This family includes long-chain fatty acid
           (C12-C20) CoA synthetases and Bubblegum-like very
           long-chain (>C20) fatty acid CoA synthetases. FACS
           catalyzes the formation of fatty acyl-CoA in a two-step
           reaction: the formation of a fatty acyl-AMP molecule as
           an intermediate, and the formation of a fatty acyl-CoA.
           Eukaryotes generally have multiple isoforms of LC-FACS
           genes with multiple splice variants. For example, nine
           genes are found in Arabidopsis and six genes are
           expressed in mammalian cells. Drosophila melanogaster
           mutant bubblegum (BGM) have elevated levels of
           very-long-chain fatty acids (VLCFA) caused by a
           defective gene later named bubblegum. The human homolog
           (hsBG) of bubblegum has been characterized as a very
           long chain fatty acid CoA synthetase that functions
           specifically in the brain; hsBG may play a central role
           in brain VLCFA metabolism and myelinogenesis. Free fatty
           acids must be "activated" to their CoA thioesters before
           participating in most catabolic and anabolic reactions.
          Length = 456

 Score = 35.2 bits (82), Expect = 0.033
 Identities = 10/19 (52%), Positives = 16/19 (84%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
           + +D+A ++YTSG+TG PK
Sbjct: 86  KPDDLATLIYTSGTTGNPK 104


>gnl|CDD|236108 PRK07824, PRK07824, O-succinylbenzoic acid--CoA ligase;
           Provisional.
          Length = 358

 Score = 35.0 bits (81), Expect = 0.034
 Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 18/60 (30%)

Query: 66  KGRTVLSLP-----QLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           +   +L +P     +  LL  +L      GE +          +D+A+V+ TSG+TG PK
Sbjct: 5   RAPALLPVPAQDERRAALLRDALR----VGEPI---------DDDVALVVATSGTTGTPK 51


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 35.5 bits (83), Expect = 0.036
 Identities = 8/36 (22%), Positives = 16/36 (44%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           EK +   +   K    KK   K  +K K++ + ++ 
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854



 Score = 35.2 bits (82), Expect = 0.045
 Identities = 8/37 (21%), Positives = 15/37 (40%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           E+ +  +       K    KK   K  +K K++ +A 
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVAS 852



 Score = 35.2 bits (82), Expect = 0.047
 Identities = 7/29 (24%), Positives = 12/29 (41%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +    KK   K  +K K++    +K  K 
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 35.2 bits (82), Expect = 0.049
 Identities = 7/30 (23%), Positives = 12/30 (40%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            +   K    KK   K  +K K++    +K
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854



 Score = 34.0 bits (79), Expect = 0.089
 Identities = 5/29 (17%), Positives = 10/29 (34%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
                  K   K  +K K++    +K  +
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 34.0 bits (79), Expect = 0.11
 Identities = 8/29 (27%), Positives = 14/29 (48%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           ++ A KK   K  +K K++    +K  K 
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 34.0 bits (79), Expect = 0.11
 Identities = 8/28 (28%), Positives = 12/28 (42%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            AK    KK   K  +K K++    +K 
Sbjct: 828 PAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 33.6 bits (78), Expect = 0.13
 Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
           AP +    KK   K  +K K++    +K  K 
Sbjct: 827 APAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 33.2 bits (77), Expect = 0.17
 Identities = 7/38 (18%), Positives = 12/38 (31%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
             +  K +       K    KK   K  +K ++    S
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVAS 852



 Score = 32.1 bits (74), Expect = 0.37
 Identities = 8/36 (22%), Positives = 11/36 (30%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           R  A         KK   K  +K K++        K
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 30.9 bits (71), Expect = 0.86
 Identities = 7/36 (19%), Positives = 11/36 (30%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            P           K+   KK   K  +K K++    
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVAS 852



 Score = 30.5 bits (70), Expect = 1.3
 Identities = 7/30 (23%), Positives = 10/30 (33%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            A+K +  +       K    KK   K  R
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSR 843



 Score = 28.2 bits (64), Expect = 7.5
 Identities = 8/31 (25%), Positives = 12/31 (38%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
           PA     K    K  +K K++    +K  K 
Sbjct: 828 PAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858


>gnl|CDD|211788 TIGR03098, ligase_PEP_1, acyl-CoA ligase (AMP-forming), exosortase
           A-associated.  This group of proteins contains an
           AMP-binding domain (pfam00501) associated with acyl
           CoA-ligases. These proteins are generally found in
           genomes containing the exosortase/PEP-CTERM protein
           expoert system , specifically the type 1 variant of this
           system described by the Genome Property GenProp0652.
           When found in this context they are invariably present
           next to a decarboxylase enzyme. A number of sequences
           from Burkholderia species also hit this model, but the
           genomic context is obviously different. The hypothesis
           of a constant substrate for this family is only strong
           where the exosortase context is present.
          Length = 517

 Score = 35.1 bits (81), Expect = 0.036
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +D+A +LYTSGSTG PK
Sbjct: 162 DSDMAAILYTSGSTGRPK 179



 Score = 34.8 bits (80), Expect = 0.049
 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)

Query: 157 TGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAAL 194
           +GD  R  ++  LY  GR D  +K  G RV  +EVE   
Sbjct: 400 SGDTVRRDEEGFLYFVGRRDEMIKTSGYRVSPTEVEEVA 438



 Score = 30.9 bits (70), Expect = 0.89
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 3/39 (7%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEA---ALSKISEV 263
           +  L + GR D  +K  G RV  +EVE    A   ++E 
Sbjct: 409 EGFLYFVGRRDEMIKTSGYRVSPTEVEEVAYATGLVAEA 447


>gnl|CDD|233803 TIGR02262, benz_CoA_lig, benzoate-CoA ligase family.  Characterized
           members of this protein family include benzoate-CoA
           ligase, 4-hydroxybenzoate-CoA ligase,
           2-aminobenzoate-CoA ligase, etc. Members are related to
           fatty acid and acetate CoA ligases.
          Length = 508

 Score = 35.2 bits (81), Expect = 0.038
 Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 18/98 (18%)

Query: 114 GSTGIPKEYRRMYKTGDYGRIVDD-------IHRCWSS----GDRAIKYRRMY-----KT 157
           G++G P    R+   GD G+ V         I    S+     +RA K R  +     ++
Sbjct: 330 GTSGKPVPGYRLRLVGDGGQDVAAGEPGELLISGPSSATMYWNNRA-KTRDTFQGEWTRS 388

Query: 158 GD-YGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
           GD Y R  D    Y GRTD  +K+ G  V   E+E+AL
Sbjct: 389 GDKYVRNDDGSYTYAGRTDDMLKVSGIYVSPFEIESAL 426



 Score = 30.6 bits (69), Expect = 1.1
 Identities = 12/19 (63%), Positives = 16/19 (84%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
           Q++D A  LY+SGSTG+PK
Sbjct: 159 QADDPAFWLYSSGSTGMPK 177



 Score = 28.3 bits (63), Expect = 5.8
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 224 IRGVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 257
           +R  D    Y GRTD  +K+ G  V   E+E+AL
Sbjct: 393 VRNDDGSYTYAGRTDDMLKVSGIYVSPFEIESAL 426


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
          subunit.  This is a family of proteins which are
          subunits of the eukaryotic translation initiation
          factor 3 (eIF3). In yeast it is called Hcr1. The
          Saccharomyces cerevisiae protein eIF3j (HCR1) has been
          shown to be required for processing of 20S pre-rRNA and
          binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 34.6 bits (80), Expect = 0.040
 Identities = 14/35 (40%), Positives = 22/35 (62%), Gaps = 2/35 (5%)

Query: 22 EREKAKK--KKNKKKKKKKKKKKKKKKKKKKKKRR 54
          E EKAK   K   KK  K K ++K+K K++K+++ 
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80



 Score = 33.5 bits (77), Expect = 0.084
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
          E E+ +++K K   K K KK  K K ++K+K +   + 
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78



 Score = 33.5 bits (77), Expect = 0.11
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
               +  AK K  K  K K ++K+K K++K++K  R
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR 82



 Score = 32.7 bits (75), Expect = 0.16
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          KAK KK  K K ++K+K K++K++K  +  
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 32.3 bits (74), Expect = 0.25
 Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
          +     K K   K K ++K+K K++K++K  R +   + E+  
Sbjct: 51 KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDEL 93



 Score = 31.6 bits (72), Expect = 0.45
 Identities = 12/32 (37%), Positives = 13/32 (40%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           EK K +K  K  KKKK K K K          
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224



 Score = 31.2 bits (71), Expect = 0.47
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          E E  +K++ K K   K K KK  K K ++K +
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEK 71



 Score = 30.8 bits (70), Expect = 0.70
 Identities = 11/31 (35%), Positives = 13/31 (41%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E+ KA+K     KKKK K K K         
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 29.2 bits (66), Expect = 2.1
 Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          + E+ ++K+ +K K   K K KK  K K +++    +
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 29.2 bits (66), Expect = 2.5
 Identities = 11/23 (47%), Positives = 12/23 (52%)

Query: 30  KNKKKKKKKKKKKKKKKKKKKKK 52
             K K +K  K  KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 28.5 bits (64), Expect = 3.6
 Identities = 8/33 (24%), Positives = 18/33 (54%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
           A   + K   K   ++K+K K++K++K  ++ 
Sbjct: 52 VAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 28.5 bits (64), Expect = 3.8
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 31  NKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           N+K K +K  K  KKKK K K +  +  +
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGA 220



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKK 50
            +K K +K  K  KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 27.3 bits (61), Expect = 9.8
 Identities = 7/27 (25%), Positives = 8/27 (29%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKK 48
           E+     KK K K K K          
Sbjct: 198 EKAAKGGKKKKGKAKAKLNVGGANDDD 224


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 34.4 bits (79), Expect = 0.042
 Identities = 16/31 (51%), Positives = 19/31 (61%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
           PA    EK KK K+KK K K KK + KKK +
Sbjct: 84  PATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 31.0 bits (70), Expect = 0.52
 Identities = 14/24 (58%), Positives = 15/24 (62%)

Query: 30  KNKKKKKKKKKKKKKKKKKKKKKR 53
           K KK K KK K K KK + KKK R
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 31.0 bits (70), Expect = 0.60
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 29  KKNKKKKKKKKKKKKKKKKKKKKK 52
           KK K K KK K K KK + KKK +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 29.0 bits (65), Expect = 2.4
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           R K      +KKKK K KK K K KK + K++ 
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKA 113


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 34.7 bits (80), Expect = 0.047
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           +     +   K K  + KK++  + ++KKK+KK K  R
Sbjct: 353 LKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSER 390



 Score = 32.4 bits (74), Expect = 0.27
 Identities = 10/37 (27%), Positives = 23/37 (62%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           P P  + K ++ K ++  + ++KKK+KK K +++  +
Sbjct: 358 PKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394



 Score = 32.4 bits (74), Expect = 0.30
 Identities = 9/36 (25%), Positives = 21/36 (58%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            P + +  + KK +  + ++KKK+KK K +++  + 
Sbjct: 360 PPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 10/37 (27%), Positives = 23/37 (62%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           +EK  K   K K ++ KK++  + ++KKK+++  ++ 
Sbjct: 354 KEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSER 390



 Score = 29.3 bits (66), Expect = 2.9
 Identities = 9/37 (24%), Positives = 20/37 (54%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            P    + +  KK+   + ++KKK+KK K +++  + 
Sbjct: 359 KPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
            E+ K K  K   K K ++ KK++  + ++KK+   AKS
Sbjct: 350 IEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKS 388


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 34.7 bits (79), Expect = 0.054
 Identities = 17/45 (37%), Positives = 30/45 (66%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           P   +K+KK K K+KK+K+K++ K KKK+ +  + +L    ++PA
Sbjct: 193 PAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPA 237



 Score = 34.3 bits (78), Expect = 0.083
 Identities = 18/37 (48%), Positives = 24/37 (64%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           P   E E+ KK    KKKK++K+K++KKKKKK    R
Sbjct: 266 PKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 33.5 bits (76), Expect = 0.15
 Identities = 15/43 (34%), Positives = 26/43 (60%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           E++  K KK +KK+K+K++ K KKK+ +  K  +LA      +
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPAS 238



 Score = 31.6 bits (71), Expect = 0.50
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           +K  +PA  ++ K  KKK KK+K+K++ K KKK+ +  K
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 31.6 bits (71), Expect = 0.53
 Identities = 9/32 (28%), Positives = 23/32 (71%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            E E+  +++ +K K++KKK++K+K+ +++  
Sbjct: 84  LEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 31.6 bits (71), Expect = 0.59
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           P   +    +KK  K KKK+KK+K+K++ K KKK+ 
Sbjct: 187 PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222



 Score = 31.2 bits (70), Expect = 0.82
 Identities = 9/33 (27%), Positives = 24/33 (72%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
           E+ ++ + + +K K++KKK++K+K+ +++   L
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117



 Score = 30.8 bits (69), Expect = 0.86
 Identities = 11/41 (26%), Positives = 28/41 (68%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           E+ + ++  +K K++KKK++K+K+ +++   +  +S E+ A
Sbjct: 86  EERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIA 126



 Score = 30.4 bits (68), Expect = 1.3
 Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           K KKKK +K+K++KKKKKK    +         +  +N A
Sbjct: 279 KHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGA 318



 Score = 29.6 bits (66), Expect = 2.5
 Identities = 10/30 (33%), Positives = 24/30 (80%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           K ++++  +++ +K K++KKK++K+K+ RR
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRR 112



 Score = 29.6 bits (66), Expect = 2.6
 Identities = 17/41 (41%), Positives = 28/41 (68%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
           P E + A+ ++ KK  K KKKK++K+K++KKKK++     C
Sbjct: 263 PDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRC 303



 Score = 29.6 bits (66), Expect = 2.6
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           +P + +    +K  KK KKK+KK+K+K++ K KK+ +
Sbjct: 186 SPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222



 Score = 29.6 bits (66), Expect = 2.6
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
                EK      +KK KK KKK+KK+K+K++ K +
Sbjct: 183 TSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDK 218



 Score = 29.3 bits (65), Expect = 2.9
 Identities = 16/46 (34%), Positives = 26/46 (56%)

Query: 6   TISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           T  +   D+       E +K+ K K KK++K+K++KKKKKK    +
Sbjct: 257 TAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 29.3 bits (65), Expect = 3.2
 Identities = 10/34 (29%), Positives = 24/34 (70%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           P   +  K ++ ++ +++ +K K++KKK++K+KR
Sbjct: 76  PMSDQYVKLEEERRHRQRLEKDKREKKKREKEKR 109



 Score = 28.9 bits (64), Expect = 3.6
 Identities = 13/36 (36%), Positives = 24/36 (66%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           +  +  K  + +  KK  K KKKK++K+K++KKK++
Sbjct: 261 SEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKK 296



 Score = 28.9 bits (64), Expect = 3.7
 Identities = 18/45 (40%), Positives = 26/45 (57%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
           E  K+ +K +    +KK KK KKK+KK+K+K R   K  E   +K
Sbjct: 182 ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 28.5 bits (63), Expect = 6.0
 Identities = 13/78 (16%), Positives = 39/78 (50%), Gaps = 6/78 (7%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSHNN 87
           K + +++ +++ +K K++KKK++K++R   +           +     + +++  +  N 
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENA 142

Query: 88  IPGESMLHADNNNEQSND 105
           +P      +D +++  ND
Sbjct: 143 LP------SDEDDKDPND 154


>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
          Length = 102

 Score = 32.7 bits (74), Expect = 0.058
 Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 3/58 (5%)

Query: 10  HHFDKVIIPAPFEREKAKKKKNKKKKKKK---KKKKKKKKKKKKKKRRILAKSCENPA 64
           HH  K    AP ++ +A KK +K KK++K   +K +  KK  KK   +  AK    PA
Sbjct: 44  HHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAKPAAQPA 101



 Score = 26.5 bits (58), Expect = 7.7
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 9  EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKK 41
          +HH +K    AP ++ +A KK  KK   K   K
Sbjct: 63 KHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAK 95


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 32.3 bits (74), Expect = 0.062
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          K  +  ++ ++KKKKKKK KKK K  IL K
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILRK 60



 Score = 30.0 bits (68), Expect = 0.39
 Identities = 14/32 (43%), Positives = 21/32 (65%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          R   + ++ KKKKKK KKK K+   +KK++ R
Sbjct: 34 RISQRNRRKKKKKKKLKKKSKEWILRKKEQMR 65



 Score = 29.6 bits (67), Expect = 0.49
 Identities = 17/42 (40%), Positives = 25/42 (59%)

Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          D   I A   R  +++ + KKKKKKK KKK K+   +KK++ 
Sbjct: 23 DDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQM 64



 Score = 29.2 bits (66), Expect = 0.57
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
             ++ ++KK KKKK KKK K+   +KK++ +RR
Sbjct: 34 RISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 34.2 bits (79), Expect = 0.064
 Identities = 12/37 (32%), Positives = 27/37 (72%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           E E+ K  K  +++++++ ++KK++KKK+++   LAK
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302



 Score = 33.8 bits (78), Expect = 0.082
 Identities = 12/43 (27%), Positives = 28/43 (65%), Gaps = 10/43 (23%)

Query: 22  EREKAKKKKNKKKKKKKKK----------KKKKKKKKKKKKRR 54
            +E+A++KK +KKK++++           +K ++K++KK+ R+
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 34.2 bits (79), Expect = 0.065
 Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           D V +    E  +  K + ++++ KKKKKKKKKK KK   
Sbjct: 158 DVVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNA 197



 Score = 31.5 bits (72), Expect = 0.45
 Identities = 14/31 (45%), Positives = 16/31 (51%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E E+AKKKK KKKKK KK        +    
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVS 207



 Score = 30.8 bits (70), Expect = 0.69
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           + + ++++ KKKKKKKKKK KK         
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAE 203



 Score = 30.8 bits (70), Expect = 0.80
 Identities = 13/38 (34%), Positives = 14/38 (36%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           ER K KKKK KKK KK        +            S
Sbjct: 179 ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 33.6 bits (77), Expect = 0.086
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           P P + ++  K + K KK K K K K K K K K + 
Sbjct: 79  PTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQP 115



 Score = 33.2 bits (76), Expect = 0.13
 Identities = 15/36 (41%), Positives = 17/36 (47%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
              P   +K KK K K K K K K K K + K KK 
Sbjct: 85  KEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120



 Score = 32.5 bits (74), Expect = 0.20
 Identities = 14/46 (30%), Positives = 18/46 (39%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           K       + +K K K   K K K K K + K KK   K    A +
Sbjct: 85  KEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPA 130



 Score = 32.1 bits (73), Expect = 0.26
 Identities = 15/35 (42%), Positives = 19/35 (54%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           P  +EK K +K  KK K K K K K K K K + +
Sbjct: 82  PKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPK 116



 Score = 32.1 bits (73), Expect = 0.31
 Identities = 15/36 (41%), Positives = 17/36 (47%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           P      K K+K   +KK KK K K K K K K K 
Sbjct: 76  PPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111



 Score = 31.3 bits (71), Expect = 0.44
 Identities = 14/41 (34%), Positives = 17/41 (41%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
                 P       K+K K +KK KK K K K K K K + 
Sbjct: 71  TEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111



 Score = 30.9 bits (70), Expect = 0.75
 Identities = 16/40 (40%), Positives = 20/40 (50%)

Query: 16  IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
             P P E+ K +KK  K K K K K K K K K + K + 
Sbjct: 80  TPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKK 119



 Score = 29.8 bits (67), Expect = 1.7
 Identities = 14/39 (35%), Positives = 18/39 (46%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +     P E E   +    K K+K K +KK KK K K K
Sbjct: 64  EEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPK 102



 Score = 29.0 bits (65), Expect = 2.6
 Identities = 15/38 (39%), Positives = 19/38 (50%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
             P E    K K+  K +KK KK K K K K K K ++
Sbjct: 74  ETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111


>gnl|CDD|235531 PRK05605, PRK05605, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 573

 Score = 34.2 bits (79), Expect = 0.087
 Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 12/62 (19%)

Query: 71  LSLPQLRLLSASLS---HNNIPGESML---------HADNNNEQSNDIAIVLYTSGSTGI 118
           L +P LR   A+L+      +P E+++            +     +D+A++LYTSG+TG 
Sbjct: 174 LPIPALRKARAALTGPAPGTVPWETLVDAAIGGDGSDVSHPRPTPDDVALILYTSGTTGK 233

Query: 119 PK 120
           PK
Sbjct: 234 PK 235


>gnl|CDD|213284 cd05917, FACL_like_2, Uncharacterized subfamily of fatty acid CoA
           ligase (FACL).  Fatty acyl-CoA ligases catalyze the
           ATP-dependent activation of fatty acids in a two-step
           reaction. The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. This is a
           required step before free fatty acids can participate in
           most catabolic and anabolic reactions.
          Length = 347

 Score = 34.0 bits (79), Expect = 0.088
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A++ YTSG+TG PK
Sbjct: 3   DPALIQYTSGTTGRPK 18


>gnl|CDD|213306 cd05940, FATP_FACS, Fatty acid transport proteins (FATP) play dual
           roles as fatty acid transporters and its activation
           enzymes.  Fatty acid transport protein (FATP) transports
           long-chain or very-long-chain fatty acids across the
           plasma membrane. FATPs also have fatty acid CoA
           synthetase activity, thus playing dual roles as fatty
           acid transporters and its activation enzymes. At least
           five copies of FATPs are identified in mammalian cells.
           This family also includes prokaryotic FATPs. FATPs are
           the key players in the trafficking of exogenous fatty
           acids into the cell and in intracellular fatty acid
           homeostasis.
          Length = 444

 Score = 34.1 bits (79), Expect = 0.088
 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 9/46 (19%)

Query: 105 DIAIVLYTSGSTGIPK----EYRRMYKTGD-----YGRIVDDIHRC 141
           D A  +YTSG+TG+PK     +RR  + G           DD+   
Sbjct: 82  DPAFYIYTSGTTGLPKAAIMSHRRWLRAGAVFGGLGLLKPDDVLYL 127


>gnl|CDD|213318 cd05971, MACS_like_3, Uncharacterized subfamily of medium-chain
           acyl-CoA synthetase (MACS).  MACS catalyzes the two-step
           activation of medium chain fatty acids (containing 4-12
           carbons). The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. MACS
           enzymes are localized to mitochondria.
          Length = 439

 Score = 33.9 bits (78), Expect = 0.089
 Identities = 12/20 (60%), Positives = 17/20 (85%)

Query: 101 EQSNDIAIVLYTSGSTGIPK 120
           + S+D AI++YTSG+TG PK
Sbjct: 85  DGSDDPAILIYTSGTTGPPK 104



 Score = 30.8 bits (70), Expect = 0.83
 Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 157 TGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
           TGD GR   D  L ++GR D  +K  G R+  +E+E  L K
Sbjct: 320 TGDLGRRDADGYLWFKGRADDVIKSSGYRIGPAEIEECLLK 360


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 33.6 bits (77), Expect = 0.090
 Identities = 13/54 (24%), Positives = 19/54 (35%)

Query: 1   MNINLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            N +   + H       P+   ++ AK         +K  K K   KK  KK R
Sbjct: 160 PNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMR 213



 Score = 28.6 bits (64), Expect = 3.7
 Identities = 8/31 (25%), Positives = 12/31 (38%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
             +  +      +K  K K   KK  KK +K
Sbjct: 184 AAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214


>gnl|CDD|236443 PRK09274, PRK09274, peptide synthase; Provisional.
          Length = 552

 Score = 34.1 bits (79), Expect = 0.091
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
             +D+A +L+TSGSTG PK
Sbjct: 172 APDDMAAILFTSGSTGTPK 190


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 33.8 bits (78), Expect = 0.093
 Identities = 17/100 (17%), Positives = 33/100 (33%)

Query: 6   TISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAY 65
            +       V            KKK K   K    K   KKK K +         +N   
Sbjct: 54  QVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALD 113

Query: 66  KGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSND 105
           K   +  +  + +L+ +   ++   +  L  D+ ++  +D
Sbjct: 114 KDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDD 153



 Score = 29.6 bits (67), Expect = 2.1
 Identities = 11/51 (21%), Positives = 16/51 (31%)

Query: 2   NINLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            + + +S    D          +K  K   K    K   KKK K +    K
Sbjct: 54  QVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104



 Score = 28.4 bits (64), Expect = 4.7
 Identities = 11/51 (21%), Positives = 16/51 (31%)

Query: 3   INLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           I L+      D        +++     K    K   KKK K +    KK  
Sbjct: 57  IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE 107


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 34.1 bits (78), Expect = 0.094
 Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 16/92 (17%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKK----------------KKKKKKKRRILA 57
           K++     ER+K K++K K+K+ KK K  +K+                 KK +KK R   
Sbjct: 11  KILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRD 70

Query: 58  KSCENPAYKGRTVLSLPQLRLLSASLSHNNIP 89
              ENP           Q + LS+ ++    P
Sbjct: 71  VEDENPEDFIDPDTPFGQKKRLSSQMAKQYSP 102



 Score = 31.8 bits (72), Expect = 0.47
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
              ++   +++  +KKKK++K K+K+ KK K  ++   AK
Sbjct: 6  SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46



 Score = 28.7 bits (64), Expect = 4.2
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
          E E  KK   +++ ++KKKK++K K+K+ KK +   K  +
Sbjct: 5  ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 31.8 bits (73), Expect = 0.10
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
          E+ ++K+ K+K  +     KKKKK+K KK+R  A      A
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAA 70



 Score = 31.5 bits (72), Expect = 0.11
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           E+ +K+  +K  +     KKKKK+K KKKR
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKR 60



 Score = 29.9 bits (68), Expect = 0.42
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
            K K  +     KKKKK+K KKK+ +  
Sbjct: 36 ELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 29.5 bits (67), Expect = 0.68
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          E ++ K+ K K  +     KKKKK+K KKK+
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKR 60



 Score = 29.1 bits (66), Expect = 0.80
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
            + EK  K+K  +     KKKKK+K KKK+ 
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRD 61



 Score = 28.8 bits (65), Expect = 1.2
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          E+E  +K        KKKKK+K KKK+ +  
Sbjct: 34 EKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 28.0 bits (63), Expect = 2.0
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          E ++ K+K ++     KKKKK+K KKK+ + 
Sbjct: 33 EEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63



 Score = 26.1 bits (58), Expect = 8.7
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKK 48
           +++  +   + KKKKK+K KKK+ +  
Sbjct: 37 LKQKADEGNNSGKKKKKRKAKKKRDEAG 64


>gnl|CDD|213320 cd05973, MACS_like_2, Uncharacterized subfamily of medium-chain
           acyl-CoA synthetase (MACS).  MACS catalyzes the two-step
           activation of medium chain fatty acids (containing 4-12
           carbons). The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. MACS
           enzymes are localized to mitochondria.
          Length = 440

 Score = 34.0 bits (78), Expect = 0.10
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 95  HADNNNEQSNDIAIVLYTSGSTGIPK 120
           +A N  +  +D+   +YTSG+TG+PK
Sbjct: 79  NAANRGKLDDDLFAQMYTSGTTGLPK 104


>gnl|CDD|213280 cd05912, OSB_CoA_lg, O-succinylbenzoate-CoA ligase (also known as
           O-succinylbenzoate-CoA synthase, OSB-CoA synthetase, or
           MenE).  O-succinylbenzoic acid-CoA synthase catalyzes
           the coenzyme A (CoA)- and ATP-dependent conversion of
           o-succinylbenzoic acid to o-succinylbenzoyl-CoA. The
           reaction is the fourth step of the biosynthesis pathway
           of menaquinone (vitamin K2). In certain bacteria,
           menaquinone is used during fumarate reduction in
           anaerobic respiration. In cyanobacteria, the product of
           the menaquinone pathway is phylloquinone
           (2-methyl-3-phytyl-1,4-naphthoquinone), a molecule used
           exclusively as an electron transfer cofactor in
           Photosystem 1. In green sulfur bacteria and
           heliobacteria, menaquinones are used as loosely bound
           secondary electron acceptors in the photosynthetic
           reaction center.
          Length = 407

 Score = 33.7 bits (78), Expect = 0.10
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 105 DIAIVLYTSGSTGIPK 120
             A +++TSGSTG PK
Sbjct: 78  RPATIIFTSGSTGKPK 93



 Score = 28.3 bits (64), Expect = 5.8
 Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
           + TGD G +  +  LY  GR D  +   G  +   E+EA L +   V++ +V+
Sbjct: 290 FHTGDLGYLDAEGYLYVLGRRDDLIISGGENIYPEEIEAVLLQHPAVEEAAVV 342


>gnl|CDD|213315 cd05968, AACS_like, Uncharacterized acyl-CoA synthetase subfamily
           similar to Acetoacetyl-CoA synthetase.  This
           uncharacterized acyl-CoA synthetase family is highly
           homologous to acetoacetyl-CoA synthetase. However, the
           proteins in this family exist in only bacteria and
           archaea. AACS is a cytosolic ligase that specifically
           activates acetoacetate to its coenzyme A ester by a
           two-step reaction. Acetoacetate first reacts with ATP to
           form an acyl-adenylate intermediate, which then reacts
           with CoA to produce an acyl-CoA ester. This is the first
           step of the mevalonate pathway of isoprenoid
           biosynthesis via isopentenyl diphosphate. Isoprenoids
           are a large class of compounds found in all living
           organisms.
          Length = 474

 Score = 33.8 bits (78), Expect = 0.10
 Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 158 GDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
           GD+  + +D   Y  GR+D  +K+ G RV  +E+E+ L+    V
Sbjct: 340 GDWALVDEDGYWYILGRSDDTIKVAGKRVGPAEIESVLNSHPAV 383



 Score = 31.5 bits (72), Expect = 0.57
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEV 263
           GR+D  +K+ G RV  +E+E+ L+    V
Sbjct: 355 GRSDDTIKVAGKRVGPAEIESVLNSHPAV 383



 Score = 30.3 bits (69), Expect = 1.2
 Identities = 10/17 (58%), Positives = 15/17 (88%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D A+++YTSG+TG PK
Sbjct: 100 DDPAMIIYTSGTTGKPK 116


>gnl|CDD|182517 PRK10524, prpE, propionyl-CoA synthetase; Provisional.
          Length = 629

 Score = 33.8 bits (78), Expect = 0.11
 Identities = 15/29 (51%), Positives = 20/29 (68%), Gaps = 2/29 (6%)

Query: 103 SNDIAIVLYTSGSTGIPKEYRRMYKTGDY 131
           SN+ + +LYTSG+TG PK  +R   TG Y
Sbjct: 232 SNEPSYILYTSGTTGKPKGVQR--DTGGY 258



 Score = 33.0 bits (76), Expect = 0.20
 Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 14/69 (20%)

Query: 142 WSSGDRAIKYRRMYKTGDYGRIVDD-----ILLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
           WS   R +     Y T D+G I D      IL   GRTD  + + G+R+   E+E ++S 
Sbjct: 467 WSLFGRQV-----YSTFDWG-IRDADGYYFIL---GRTDDVINVAGHRLGTREIEESISS 517

Query: 197 LSEVDKLSV 205
              V +++V
Sbjct: 518 HPAVAEVAV 526


>gnl|CDD|213282 cd05914, FACL_like_3, Uncharacterized subfamily of fatty acid CoA
           ligase (FACL).  Fatty acyl-CoA ligases catalyze the
           ATP-dependent activation of fatty acids in a two-step
           reaction. The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. This is a
           required step before free fatty acids can participate in
           most catabolic and anabolic reactions.
          Length = 448

 Score = 33.8 bits (78), Expect = 0.11
 Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 77  RLLSASLSHNNIPGE----SMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
              SA  +     GE    ++  + N  E     A + YTSGSTG PK
Sbjct: 81  DAASALHTPFATLGEDLYIALRPSANPVELPAGTAKITYTSGSTGQPK 128


>gnl|CDD|213292 cd05926, FACL_fum10p_like, Subfamily of fatty acid CoA ligase
           (FACL) similar to Fum10p of Gibberella moniliformis.
           FACL catalyzes the formation of fatty acyl-CoA in a
           two-step reaction: the formation of a fatty acyl-AMP
           molecule as an intermediate, followed by the formation
           of a fatty acyl-CoA. This is a required step before free
           fatty acids can participate in most catabolic and
           anabolic reactions. Fum10p is a fatty acid CoA ligase
           involved in the synthesis of fumonisin, a polyketide
           mycotoxin, in Gibberella moniliformis.
          Length = 345

 Score = 33.4 bits (77), Expect = 0.12
 Identities = 10/18 (55%), Positives = 15/18 (83%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +D A++L+TSG+TG PK
Sbjct: 1   PDDPALILHTSGTTGRPK 18


>gnl|CDD|236315 PRK08633, PRK08633, 2-acyl-glycerophospho-ethanolamine
            acyltransferase; Validated.
          Length = 1146

 Score = 33.7 bits (78), Expect = 0.12
 Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 1/60 (1%)

Query: 147  RAIKYRRMYKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
            + I     Y TGD G + +D  L    R     KI G  V L  VE  L+K    +++  
Sbjct: 1013 KDIDGIGWYVTGDKGHLDEDGFLTITDRYSRFAKIGGEMVPLGAVEEELAKALGGEEVVF 1072



 Score = 31.4 bits (72), Expect = 0.74
 Identities = 8/16 (50%), Positives = 12/16 (75%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A ++++SGS G PK
Sbjct: 783 DTATIIFSSGSEGEPK 798


>gnl|CDD|213313 cd05966, ACS, Acetyl-CoA synthetase (also known as acetate-CoA
           ligase and acetyl-activating enzyme).  Acetyl-CoA
           synthetase (ACS) catalyzes the formation of acetyl-CoA
           from acetate, CoA, and ATP. Synthesis of acetyl-CoA is
           carried out in a two-step reaction. In the first step,
           the enzyme catalyzes the synthesis of acetyl-AMP
           intermediate from acetate and ATP. In the second step,
           acetyl-AMP reacts with CoA to produce acetyl-CoA. This
           enzyme is widely present in all living organisms. The
           activity of this enzyme is crucial for maintaining the
           required levels of acetyl-CoA, a key intermediate in
           many important biosynthetic and catabolic processes.
           Acetyl-CoA is used in the biosynthesis of glucose, fatty
           acids, and cholesterol. It can also be used in the
           production of energy in the citric acid cycle.
           Eukaryotes typically have two isoforms of acetyl-CoA
           synthetase, a cytosolic form involved in biosynthetic
           processes and a mitochondrial form primarily involved in
           energy generation.
          Length = 602

 Score = 33.7 bits (78), Expect = 0.12
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 7/49 (14%)

Query: 150 KYRRMYKTGDYGRIVDD----ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
           K+   Y TGD  R  +D    IL   GR D  + + G+R+  +EVE+AL
Sbjct: 460 KFPGYYFTGDGARRDEDGYYWIL---GRVDDVINVSGHRLGTAEVESAL 505



 Score = 32.5 bits (75), Expect = 0.25
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
           + D   +LYTSGSTG PK
Sbjct: 224 AEDPLFILYTSGSTGKPK 241


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 33.9 bits (78), Expect = 0.12
 Identities = 14/24 (58%), Positives = 21/24 (87%)

Query: 32  KKKKKKKKKKKKKKKKKKKKKRRI 55
           K KKKKKK+KKK+++ K+++K RI
Sbjct: 744 KTKKKKKKEKKKEEEYKREEKARI 767



 Score = 29.3 bits (66), Expect = 3.1
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKK 45
             I  +  E+ K KKKK KKK+++ K+++K +
Sbjct: 735 FKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 27.8 bits (62), Expect = 8.7
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKK 47
               EK KKKK K+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766


>gnl|CDD|213307 cd05941, MCS, Malonyl-CoA synthetase (MCS).  MCS catalyzes the
           formation of malonyl-CoA in a two-step reaction
           consisting of the adenylation of malonate with ATP,
           followed by malonyl transfer from malonyl-AMP to CoA.
           Malonic acid and its derivatives are the building blocks
           of polyketides and malonyl-CoA serves as the substrate
           of polyketide synthases. Malonyl-CoA synthetase has
           broad substrate tolerance and can activate a variety of
           malonyl acid derivatives. MCS may play an important role
           in biosynthesis of polyketides, the important secondary
           metabolites with therapeutic and agrochemical utility.
          Length = 430

 Score = 33.4 bits (77), Expect = 0.13
 Identities = 10/16 (62%), Positives = 14/16 (87%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A+++YTSG+TG PK
Sbjct: 89  DPALIIYTSGTTGRPK 104


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 33.4 bits (77), Expect = 0.13
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            +KA KK  K  +K +KK++++KK+KKKK  
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAF 435



 Score = 33.4 bits (77), Expect = 0.15
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
             EKA+KK+ ++KK+KKKK    KKK+++++
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445



 Score = 33.0 bits (76), Expect = 0.18
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           ++  KK K   +K +KK++++KK+KKKK       +  E   
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447



 Score = 32.6 bits (75), Expect = 0.29
 Identities = 13/34 (38%), Positives = 25/34 (73%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           +RE+ KK+K KK    KKK+++++++K+KK+   
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 32.2 bits (74), Expect = 0.36
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           +  K  KK  +K +KK++++KK+KKKK    ++   + 
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444



 Score = 31.8 bits (73), Expect = 0.46
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
             +  KK++ +KK+KKKK    KKK++++++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446



 Score = 31.8 bits (73), Expect = 0.48
 Identities = 11/38 (28%), Positives = 25/38 (65%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
             E+ K+KK K    KKK+++++++K+KK++ +   + 
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460



 Score = 31.4 bits (72), Expect = 0.55
 Identities = 11/33 (33%), Positives = 24/33 (72%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           ++ + +KK+ KKK    KKK+++++++K+KK  
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453



 Score = 31.4 bits (72), Expect = 0.57
 Identities = 11/32 (34%), Positives = 24/32 (75%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           + EK ++++ K+KKKK    KKK+++++++K 
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 30.7 bits (70), Expect = 0.91
 Identities = 11/32 (34%), Positives = 22/32 (68%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            ++   K +K ++++KK+KKKK    KKK+++
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
              KK  KK KK  +K +KK++++KK+++  A +
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 12/31 (38%), Positives = 25/31 (80%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E+++ ++KK KKKK    KKK+++++++K+K
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450



 Score = 30.3 bits (69), Expect = 1.4
 Identities = 12/40 (30%), Positives = 25/40 (62%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
               I    E+ + K+++ KK+KKKK    KKK+++++++
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447



 Score = 30.3 bits (69), Expect = 1.6
 Identities = 12/38 (31%), Positives = 28/38 (73%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           ++EK KK    KKK+++++++K+KK+++K++    A+ 
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464



 Score = 29.5 bits (67), Expect = 2.2
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
               K  K  KK  +K +KK++++KK+KKK+
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKK 433


>gnl|CDD|213314 cd05967, PrpE, Propionyl-CoA synthetase (PrpE).  PrpE catalyzes the
           first step of the 2-methylcitric acid cycle for
           propionate catabolism. It activates propionate to
           propionyl-CoA in a two-step reaction, which proceeds
           through a propionyl-AMP intermediate and requires ATP
           and Mg2+. In Salmonella enterica, the PrpE protein is
           required for growth of S. enterica on propionate and can
           substitute for the acetyl-CoA synthetase (Acs) enzyme
           during growth on acetate. PrpE can also activate
           acetate, 3HP, and butyrate to their corresponding
           CoA-thioesters, although with less efficiency.
          Length = 607

 Score = 33.4 bits (77), Expect = 0.15
 Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 150 KYRRMYKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSK 196
           K+   Y TGD G   +D  L+  GRTD  + + G+R+   E+E ++ K
Sbjct: 458 KFPGYYDTGDSGYKDEDGYLFVMGRTDDVINVAGHRLSTGEMEESVLK 505



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
           S D   +LYTSG+TG PK
Sbjct: 219 STDPLYILYTSGTTGKPK 236


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 32.0 bits (73), Expect = 0.15
 Identities = 14/31 (45%), Positives = 16/31 (51%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          +   KK  KK KK  K  KK  K KK+K R 
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31



 Score = 31.2 bits (71), Expect = 0.30
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          AKK K K KK  K  KK  K KK+K +  +
Sbjct: 4  AKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 12/29 (41%), Positives = 14/29 (48%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            K KK  KK  K  KK  K KK+K +  
Sbjct: 4  AKKAKKKAKKAAKAAKKGVKVKKRKVRTS 32



 Score = 29.7 bits (67), Expect = 1.2
 Identities = 14/29 (48%), Positives = 16/29 (55%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          K    K KKK KK  K  KK  K KKR++
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKKRKV 29


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 32.7 bits (75), Expect = 0.15
 Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 3/51 (5%)

Query: 14  KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
            V+IP P   E   K K K K K  KK +++ K++ K      A   EN A
Sbjct: 89  PVVIPKP---EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTA 136



 Score = 32.0 bits (73), Expect = 0.30
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            AP    K + K   K K K K  KK +++ K++ + 
Sbjct: 87  EAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123



 Score = 31.6 bits (72), Expect = 0.41
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 1/49 (2%)

Query: 17  IPAPFEREK-AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           IP P +       K   K K K K K K  KK +++ +R +      PA
Sbjct: 81  IPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPA 129


>gnl|CDD|236217 PRK08279, PRK08279, long-chain-acyl-CoA synthetase; Validated.
          Length = 600

 Score = 33.3 bits (77), Expect = 0.15
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
            + D A  +YTSG+TG+PK
Sbjct: 197 TAKDTAFYIYTSGTTGLPK 215


>gnl|CDD|213301 cd05935, LC_FACS_like, Putative long-chain fatty acid CoA ligase.
           The members of this family are putative long-chain fatty
           acyl-CoA synthetases, which catalyze the ATP-dependent
           activation of fatty acids in a two-step reaction. The
           carboxylate substrate first reacts with ATP to form an
           acyl-adenylate intermediate, which then reacts with CoA
           to produce an acyl-CoA ester. Fatty acyl-CoA synthetases
           are responsible for fatty acid degradation as well as
           physiological regulation of cellular functions via the
           production of fatty acyl-CoA esters.
          Length = 430

 Score = 33.1 bits (76), Expect = 0.16
 Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 15/94 (15%)

Query: 147 RAIKYRRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
             +  +R ++TGD G I ++    +  R    + + G +V  +EVEA L +   V ++ V
Sbjct: 304 IELDGKRFFRTGDLGYIDEEGYFFFLDRVKRMINVSGYKVWPAEVEALLYQHPAVLEVCV 363

Query: 206 LCYKP----GEV---------DQVSEVTSEEIRG 226
           +  +P    GEV         D   +VT E+I  
Sbjct: 364 IG-RPDPRRGEVVKAFVVLKPDYRGKVTEEDIIE 396



 Score = 32.7 bits (75), Expect = 0.22
 Identities = 10/20 (50%), Positives = 17/20 (85%)

Query: 101 EQSNDIAIVLYTSGSTGIPK 120
            + +D+A++ YTSG+TG+PK
Sbjct: 81  SELDDVAVIPYTSGTTGLPK 100


>gnl|CDD|224458 COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism].
          Length = 438

 Score = 33.1 bits (76), Expect = 0.16
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 10/57 (17%)

Query: 155 YKTGDYGRIVDD----------ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
           Y+TGD   I+ D          I   EGR+D  + +RG  V  S++E  L ++ EV 
Sbjct: 304 YRTGDITVILSDPCGCGRTHRRIERIEGRSDDMLIVRGVNVFPSQIERVLLQIPEVT 360


>gnl|CDD|223953 COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid
           metabolism].
          Length = 613

 Score = 33.4 bits (77), Expect = 0.17
 Identities = 10/19 (52%), Positives = 16/19 (84%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
           + +D+A ++YTSG+TG PK
Sbjct: 189 KPDDLATIIYTSGTTGTPK 207


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 32.7 bits (75), Expect = 0.17
 Identities = 16/32 (50%), Positives = 19/32 (59%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
                 KK  N +++KKKKKKK  KKKK KK 
Sbjct: 167 EAARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198



 Score = 32.3 bits (74), Expect = 0.23
 Identities = 18/37 (48%), Positives = 22/37 (59%)

Query: 16  IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           I  A   R+K K    +++KKKKKKK  KKKK KK  
Sbjct: 163 IKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVA 199



 Score = 31.5 bits (72), Expect = 0.40
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKK-KKKKKKKKRRI 55
           A  +++    ++ KKKKKKK  KKKK KK      + I
Sbjct: 169 ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKAI 206



 Score = 27.7 bits (62), Expect = 6.5
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
               KKK+   +++KKKKKKK  KKKK K+
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
             K  +    KKK    +++KKKKKKK  K++ L K
Sbjct: 162 DIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 31.0 bits (71), Expect = 0.17
 Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 2/38 (5%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKK--KKKKKRRILAK 58
          R    +KK +KK+ K + K+   KK   +++K R+  K
Sbjct: 56 RADPSRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93



 Score = 26.7 bits (60), Expect = 6.2
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          +KA         +KKK+KK+ K + K+   +
Sbjct: 49 KKAHAAIRADPSRKKKEKKEVKAESKRYNAK 79


>gnl|CDD|236175 PRK08180, PRK08180, feruloyl-CoA synthase; Reviewed.
          Length = 614

 Score = 33.3 bits (77), Expect = 0.17
 Identities = 11/17 (64%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           + IA  L+TSGSTG+PK
Sbjct: 209 DTIAKFLFTSGSTGLPK 225


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 32.3 bits (74), Expect = 0.18
 Identities = 21/37 (56%), Positives = 25/37 (67%), Gaps = 4/37 (10%)

Query: 24  EKAKKKKNKKK----KKKKKKKKKKKKKKKKKKRRIL 56
           EK K ++ KK+    KK+K  KK KKKKK KKK RIL
Sbjct: 170 EKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 29.6 bits (67), Expect = 1.8
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           ER+K +++  KK+K  KK KKKKK KKK +
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGR 204



 Score = 28.0 bits (63), Expect = 4.7
 Identities = 11/33 (33%), Positives = 24/33 (72%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           RE   +++ +K++++K +KK+K+K+ KKK+  
Sbjct: 3  SREALLEQRRRKREQRKARKKQKRKEAKKKEDA 35



 Score = 28.0 bits (63), Expect = 5.5
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKK 48
            E  KK+K+ KK KKKKK KKK +  
Sbjct: 181 EENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 27.7 bits (62), Expect = 6.6
 Identities = 10/37 (27%), Positives = 27/37 (72%)

Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
           ++++ ++++K +KK+K+K+ KKK+  ++  A+  +N
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKN 45



 Score = 27.3 bits (61), Expect = 8.7
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKK 49
           RE+  KK+   KK KKKKK KKK +  
Sbjct: 180 REENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 27.3 bits (61), Expect = 8.8
 Identities = 9/29 (31%), Positives = 23/29 (79%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          + +++K +++K +KK+K+K+ KKK+  ++
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQK 37


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 32.8 bits (75), Expect = 0.19
 Identities = 12/34 (35%), Positives = 26/34 (76%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           A  + EK K+++N+KK+K++  + K+K++K ++K
Sbjct: 400 AKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 29.8 bits (67), Expect = 1.9
 Identities = 10/34 (29%), Positives = 24/34 (70%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
                KAK++K K+++ +KK+K++  + K+K+++
Sbjct: 396 EEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429



 Score = 29.4 bits (66), Expect = 2.6
 Identities = 9/30 (30%), Positives = 22/30 (73%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           E+ K ++ + K+K++  + K+K++K ++KK
Sbjct: 405 EKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 29.0 bits (65), Expect = 3.2
 Identities = 12/37 (32%), Positives = 24/37 (64%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
               E  +AK K+ K K+++ +KK+K++  + K+KR+
Sbjct: 392 TDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428


>gnl|CDD|213295 cd05929, BACL_like, Bacterial Bile acid CoA ligases and similar
           proteins.  Bile acid-Coenzyme A ligase catalyzes the
           formation of bile acid-CoA conjugates in a two-step
           reaction: the formation of a bile acid-AMP molecule as
           an intermediate, followed by the formation of a bile
           acid-CoA. This ligase requires a bile acid with a free
           carboxyl group, ATP, Mg2+, and CoA for synthesis of the
           final bile acid-CoA conjugate. The bile acid-CoA
           ligation is believed to be the initial step in the bile
           acid 7alpha-dehydroxylation pathway in the intestinal
           bacterium Eubacterium sp.
          Length = 342

 Score = 32.8 bits (76), Expect = 0.21
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A++LYTSG+TG PK
Sbjct: 2   DPALILYTSGTTGRPK 17


>gnl|CDD|236072 PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 513

 Score = 33.0 bits (76), Expect = 0.22
 Identities = 10/16 (62%), Positives = 14/16 (87%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D+A +L+TSG+TG PK
Sbjct: 167 DVADILFTSGTTGRPK 182


>gnl|CDD|131369 TIGR02316, propion_prpE, propionate--CoA ligase.  This family
           contains one of three readily separable clades of
           proteins in the group of acetate and propionate--CoA
           ligases. Characterized members of this family act on
           propionate. From propionyl-CoA, there is a cyclic
           degradation pathway: it is ligated by PrpC to the TCA
           cycle intermediate oxaloacetate, acted upon further by
           PrpD and an aconitase, then cleaved by PrpB to pyruvate
           and the TCA cycle intermediate succinate.
          Length = 628

 Score = 33.0 bits (75), Expect = 0.22
 Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 20/91 (21%)

Query: 53  RRILAKSCENPAYKGRTVL----SLPQLRLLS------ASLSHNNIPGESMLHADNNNE- 101
           + +L  +     +    VL     L  +RL+       A+L   +        A    E 
Sbjct: 175 KPLLDAAIAEAQHPPPHVLLVDRGLAPMRLIPGRDVDYAALRTQHE------DAQVPVEW 228

Query: 102 -QSNDIAIVLYTSGSTGIPKEYRRMYKTGDY 131
            +SN+ + +LYTSG+TG PK  +R    G Y
Sbjct: 229 LESNEPSYILYTSGTTGKPKGVQR--DVGGY 257



 Score = 29.9 bits (67), Expect = 1.7
 Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 152 RRMYKTGDYG-RIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
           R +Y + D+G R  D      GRTD  + + G+R+   E+E ++S    V +++V+
Sbjct: 471 RPLYSSFDWGIRDEDGYTFILGRTDDVINVAGHRLGTREIEESVSSHPSVAEVAVV 526


>gnl|CDD|213312 cd05959, BCL_4HBCL, Benzoate CoA ligase (BCL) and
           4-Hydroxybenzoate-Coenzyme A Ligase (4-HBA-CoA ligase). 
           Benzoate CoA ligase and 4-hydroxybenzoate-coenzyme A
           ligase catalyze the first activating step for benzoate
           and 4-hydroxybenzoate catabolic pathways, respectively.
           Although these two enzymes share very high sequence
           homology, they have their own substrate preference. The
           reaction proceeds via a two-step process; the first
           ATP-dependent step forms the substrate-AMP intermediate,
           while the second step forms the acyl-CoA ester,
           releasing the AMP. Aromatic compounds represent the
           second most abundant class of organic carbon compounds
           after carbohydrates. Some bacteria can use benzoic acid
           or benzenoid compounds as the sole source of carbon and
           energy through degradation. Benzoate CoA ligase and
           4-hydroxybenzoate-Coenzyme A ligase are key enzymes of
           this process.
          Length = 506

 Score = 32.7 bits (75), Expect = 0.22
 Identities = 11/17 (64%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D+A  LY+SGSTG PK
Sbjct: 161 DDMAFWLYSSGSTGRPK 177



 Score = 32.3 bits (74), Expect = 0.30
 Identities = 18/41 (43%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 155 YKTGD-YGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
            +TGD Y R  D    Y GR+D  +K+ G  V   EVE AL
Sbjct: 386 TRTGDKYYRDEDGYYWYCGRSDDMLKVSGIWVSPFEVEDAL 426


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 31.8 bits (73), Expect = 0.23
 Identities = 11/22 (50%), Positives = 18/22 (81%)

Query: 31 NKKKKKKKKKKKKKKKKKKKKK 52
          +KKK KK KK+K+K++K+ +K 
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKG 34



 Score = 31.0 bits (71), Expect = 0.46
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 26 AKKKKNKKKKKKKKKKKKKKKKKK 49
            KKK KK KK+K+K++K+ +K  
Sbjct: 12 VDKKKAKKAKKEKRKQRKQARKGA 35



 Score = 29.5 bits (67), Expect = 1.6
 Identities = 11/20 (55%), Positives = 17/20 (85%)

Query: 35 KKKKKKKKKKKKKKKKKKRR 54
          KKK KK KK+K+K++K+ R+
Sbjct: 14 KKKAKKAKKEKRKQRKQARK 33



 Score = 29.1 bits (66), Expect = 1.9
 Identities = 10/20 (50%), Positives = 17/20 (85%)

Query: 34 KKKKKKKKKKKKKKKKKKKR 53
          KKK KK KK+K+K++K+ ++
Sbjct: 14 KKKAKKAKKEKRKQRKQARK 33



 Score = 28.0 bits (63), Expect = 5.4
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKK 46
          K K KK KK+K+K++K+ +K  
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35



 Score = 27.6 bits (62), Expect = 6.5
 Identities = 9/21 (42%), Positives = 16/21 (76%)

Query: 24 EKAKKKKNKKKKKKKKKKKKK 44
          +KAKK K +K+K++K+ +K  
Sbjct: 15 KKAKKAKKEKRKQRKQARKGA 35



 Score = 27.6 bits (62), Expect = 6.6
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 30 KNKKKKKKKKKKKKKKKKKKKK 51
          K K KK KK+K+K++K+ +K  
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35



 Score = 27.6 bits (62), Expect = 7.3
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 29 KKNKKKKKKKKKKKKKKKKKKK 50
          KK  KK KK+K+K++K+ +K  
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35


>gnl|CDD|213288 cd05921, FCS, Feruloyl-CoA synthetase (FCS).  Feruloyl-CoA
           synthetase is an essential enzyme in the feruloyl acid
           degradation pathway and enables some proteobacteria to
           grow on media containing feruloyl acid as the sole
           carbon source. It catalyzes the transfer of CoA to the
           carboxyl group of ferulic acid, which then forms
           feruloyl-CoA in the presence of ATP and Mg2. The
           resulting feruloyl-CoA is further degraded to vanillin
           and acetyl-CoA. Feruloyl-CoA synthetase (FCS) is a
           subfamily of the adenylate-forming enzymes superfamily.
          Length = 559

 Score = 32.6 bits (75), Expect = 0.23
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
            +A  L+TSGSTG+PK
Sbjct: 165 TVAKYLFTSGSTGLPK 180


>gnl|CDD|234677 PRK00174, PRK00174, acetyl-CoA synthetase; Provisional.
          Length = 637

 Score = 32.8 bits (76), Expect = 0.24
 Identities = 17/49 (34%), Positives = 27/49 (55%), Gaps = 7/49 (14%)

Query: 150 KYRRMYKTGDYGRIVDD----ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
            ++ MY TGD  R  +D    I    GR D  + + G+R+  +E+E+AL
Sbjct: 480 TFKGMYFTGDGARRDEDGYYWIT---GRVDDVLNVSGHRLGTAEIESAL 525



 Score = 31.6 bits (73), Expect = 0.54
 Identities = 10/12 (83%), Positives = 11/12 (91%)

Query: 109 VLYTSGSTGIPK 120
           +LYTSGSTG PK
Sbjct: 250 ILYTSGSTGKPK 261


>gnl|CDD|236091 PRK07768, PRK07768, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 545

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D+A++  TSGSTG PK
Sbjct: 152 DDLALMQLTSGSTGSPK 168


>gnl|CDD|213311 cd05958, ABCL, 2-aminobenzoate-CoA ligase (ABCL).  ABCL catalyzes
           the initial step in the 2-aminobenzoate aerobic
           degradation pathway by activating 2-aminobenzoate to
           2-aminobenzoyl-CoA. The reaction is carried out via a
           two-step process; the first step is ATP-dependent and
           forms a 2-aminobenzoyl-AMP intermediate, and the second
           step forms the 2-aminobenzoyl-CoA ester and releases the
           AMP. 2-Aminobenzoyl-CoA is further converted to
           2-amino-5-oxo-cyclohex-1-ene-1-carbonyl-CoA catalyzed by
           2-aminobenzoyl-CoA monooxygenase/reductase. ABCL has
           been purified from cells aerobically grown with
           2-aminobenzoate as sole carbon, energy, and nitrogen
           source, and has been characterized as a monomer.
          Length = 487

 Score = 32.8 bits (75), Expect = 0.24
 Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 21/102 (20%)

Query: 54  RILAKSCENPAY----------KGRTVLSLPQLRLLSASLSHNNIPGESMLHA--DNNNE 101
           R+L +S   P                V ++P LR  +  L+        + HA  D    
Sbjct: 88  RVLLRSANTPMLVACWLAVLKAGAIVVTTMPLLR--AKELTTIVDKAR-ITHALCDKRLT 144

Query: 102 QSNDIAIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWS 143
            ++D+A++ +TSG+TG+PK       T  + R +  I   + 
Sbjct: 145 AADDVALIGFTSGTTGLPK------GTMHFHRDILAICDAFP 180


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 32.5 bits (74), Expect = 0.25
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           +E+AKK+  ++ K K   + KKK  + KKK    AK
Sbjct: 145 KEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAK 180



 Score = 30.2 bits (68), Expect = 1.3
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
             E   K + + +KK K++ KK+ +++ K K    AK
Sbjct: 129 AAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAK 165



 Score = 29.8 bits (67), Expect = 1.6
 Identities = 14/45 (31%), Positives = 19/45 (42%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
              +AKKK  + KKK + + K K + K K K        E    K
Sbjct: 160 AAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204



 Score = 29.4 bits (66), Expect = 2.2
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E E   K   + KKK  + KKK + + K K 
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183



 Score = 29.0 bits (65), Expect = 3.1
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           E+AK K   + KKK  + KKK + + K K    AK
Sbjct: 154 EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 29.0 bits (65), Expect = 3.1
 Identities = 8/37 (21%), Positives = 14/37 (37%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           + E+AK K    K K   +   K + +        A+
Sbjct: 190 KAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAE 226



 Score = 28.6 bits (64), Expect = 4.3
 Identities = 15/43 (34%), Positives = 21/43 (48%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           E E  KK K + KK+ +++ K K   + KKK     K  E  A
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEA 179



 Score = 28.6 bits (64), Expect = 4.6
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           E  KAK+    K K + + +KK K++ KK+
Sbjct: 122 EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ 151



 Score = 28.3 bits (63), Expect = 4.8
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
           A    EK K+ +  K K+  + K K + + +KK +    K  E  A  
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158



 Score = 28.3 bits (63), Expect = 4.9
 Identities = 13/40 (32%), Positives = 19/40 (47%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           A  +     KKK  + KKK + + K K + K K +   AK
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAK 195



 Score = 27.9 bits (62), Expect = 8.1
 Identities = 6/33 (18%), Positives = 20/33 (60%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           E+ + K+ + +   +K  K+ ++  K+ ++K++
Sbjct: 87  EQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK 119



 Score = 27.5 bits (61), Expect = 8.8
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E+A K+  +K+K+ ++ K K+  + K K 
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.4 bits (72), Expect = 0.26
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           F+R+K KKKK K+ +  ++KKKK    +++K+
Sbjct: 81  FKRKKKKKKKKKEGEFFEEKKKKYVVSEERKE 112



 Score = 27.5 bits (62), Expect = 5.5
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           K KKKK KKKK+ +  ++KKKK    ++R+
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERK 111


>gnl|CDD|235865 PRK06814, PRK06814, acylglycerophosphoethanolamine acyltransferase;
           Provisional.
          Length = 1140

 Score = 32.6 bits (75), Expect = 0.26
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 99  NNEQSNDIAIVLYTSGSTGIPK 120
            N   +D A++L+TSGS G PK
Sbjct: 788 CNRDPDDPAVILFTSGSEGTPK 809


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.9 bits (73), Expect = 0.27
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
           FE EK KKK   K++KK  K++K+K ++ 
Sbjct: 94  FEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 27.6 bits (62), Expect = 6.6
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           F       ++ K+KKK   K++KK  K++K+K 
Sbjct: 87  FTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKL 119



 Score = 27.2 bits (61), Expect = 9.5
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 32  KKKKKKKKKKKKKKKKKKKKKRRILAK 58
           ++K+KKK   K++KK  K++K ++   
Sbjct: 96  EEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|235724 PRK06178, PRK06178, acyl-CoA synthetase; Validated.
          Length = 567

 Score = 32.7 bits (75), Expect = 0.27
 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 155 YKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
             TGD G+I +   L Y GR    +K+ G  V  SEVEA L +   V   +V+
Sbjct: 444 LHTGDIGKIDEQGFLHYLGRRKEMLKVNGMSVFPSEVEALLGQHPAVLGSAVV 496


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 30.5 bits (69), Expect = 0.28
 Identities = 17/43 (39%), Positives = 22/43 (51%)

Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          DK  I   F   K +KKK K+  K  + K  KK +KK KK+  
Sbjct: 10 DKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52



 Score = 28.6 bits (64), Expect = 1.3
 Identities = 15/42 (35%), Positives = 20/42 (47%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
            +K KKKK +  K  + K  KK +KK KKK        E+ 
Sbjct: 20 NIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61



 Score = 27.0 bits (60), Expect = 4.1
 Identities = 16/48 (33%), Positives = 24/48 (50%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
          K+KK KK+  K  + K  KK +KK K++         + + RT   LP
Sbjct: 23 KRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGLP 70


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
          describes the eukaryotic 60S (cytosolic) ribosomal
          protein L7 and paralogs that may or may not also be L7.
          Human, Drosophila, and Arabidopsis all have both a
          typical L7 and an L7-related paralog. This family is
          designated subfamily rather than equivalog to reflect
          these uncharacterized paralogs. Members of this family
          average ~ 250 residues in length, somewhat longer than
          the archaeal L30P/L7E homolog (~ 155 residues) and much
          longer than the related bacterial/organellar form (~ 60
          residues).
          Length = 235

 Score = 31.9 bits (73), Expect = 0.30
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
          K+K  ++   +  K+ K KKK  KKKR+I  K  E+
Sbjct: 6  KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAES 41



 Score = 31.2 bits (71), Expect = 0.55
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
          R+  ++   +  K+ K KKK  KKK+K   +R  +
Sbjct: 7  RKARQELAVQVAKQAKAKKKANKKKRKIYFKRAES 41


>gnl|CDD|213279 cd05911, Firefly_Luc_like, Firefly luciferase of light emitting
           insects and 4-Coumarate-CoA Ligase (4CL).  This family
           contains two functionally unique groups of proteins; one
           group is insect firefly luciferases and the other is
           plant 4-coumarate:coenzyme A ligases. However, they
           share significant sequence similarity in spite of their
           functional diversity. Luciferase catalyzes the
           production of light in the presence of MgATP, molecular
           oxygen, and luciferin. In the first step, luciferin is
           activated by acylation of its carboxylate group with
           ATP, resulting in an enzyme-bound luciferyl adenylate.
           In the second step, luciferyl adenylate reacts with
           molecular oxygen, producing an enzyme-bound excited
           state product (Luc=O*) and releasing AMP. This
           excited-state product then decays to the ground state
           (Luc=O), emitting a quantum of visible light.
           4-coumarate:coenzyme A ligase is a key enzyme in the
           phenylpropanoid metabolic pathway for monolignol and
           flavonoid biosynthesis. It catalyzes the synthesis of
           hydroxycinnamate-CoA thioesters in a two-step reaction,
           involving the formation of hydroxycinnamate-AMP
           anhydride and then the nucleophilic substitution of AMP
           by CoA. The phenylpropanoid pathway is one of the most
           important secondary metabolism pathways in plants and
           hydroxycinnamate-CoA thioesters are the precursors of
           lignin and other important phenylpropanoids.
          Length = 487

 Score = 32.2 bits (74), Expect = 0.30
 Identities = 10/17 (58%), Positives = 15/17 (88%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D A +LY+SG+TG+PK
Sbjct: 146 DDTAALLYSSGTTGLPK 162


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 32.4 bits (74), Expect = 0.31
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
          ++    KKK++KKKKKKK+KK+ K + + K      K 
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK 98



 Score = 30.8 bits (70), Expect = 0.90
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
           +K  + K K +KKKKKKK+KK+ K + + +       ++   K
Sbjct: 61  DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103



 Score = 30.8 bits (70), Expect = 0.97
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +++K K+KK  K + + K   K  KK KK K
Sbjct: 73  KKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
           +++K KK+   + + K   K  KK KK KKK    
Sbjct: 75  KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKP 109



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 19/88 (21%), Positives = 31/88 (35%), Gaps = 7/88 (7%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASL 83
           E+  K      KK KK+ K  + KKK +K++   K  + P  +G T L     +    + 
Sbjct: 44  EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103

Query: 84  SHNNIPGESMLHADNNNEQSNDIAIVLY 111
                P         N +  N    +  
Sbjct: 104 KKPPKP-------KPNEDVDNAFNKIAE 124



 Score = 30.1 bits (68), Expect = 1.4
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           +++K +KK+ K + + K   K  KK KK KK
Sbjct: 74  KKKKKEKKEPKSEGETKLGFKTPKKSKKTKK 104



 Score = 29.7 bits (67), Expect = 1.8
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E++K KKK+ K+ K + + K   K  KK KK
Sbjct: 71  EKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           ++E   + + K   K  KK KK KKK  K K
Sbjct: 80  KKEPKSEGETKLGFKTPKKSKKTKKKPPKPK 110



 Score = 28.1 bits (63), Expect = 6.6
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           +++K KK+K + K + + K   K  KK KK ++   K
Sbjct: 72  KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPK 108


>gnl|CDD|213303 cd05937, FATP_chFAT1_like, Uncharacterized subfamily of
           bifunctional fatty acid transporter/very-long-chain
           acyl-CoA synthetase in fungi.  Fatty acid transport
           protein (FATP) transports long-chain or very-long-chain
           fatty acids across the plasma membrane. FATPs also have
           fatty acid CoA synthetase activity, thus playing dual
           roles as fatty acid transporters and its activation
           enzymes. FATPs are the key players in the trafficking of
           exogenous fatty acids into the cell and in intracellular
           fatty acid homeostasis. Members of this family are
           fungal FATPs, including FAT1 from Cochliobolus
           heterostrophus.
          Length = 468

 Score = 32.4 bits (74), Expect = 0.32
 Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 11/95 (11%)

Query: 45  KKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNE--- 101
              +  +    +A    N A      L+L  +  + A +++N + G+ ++H    +    
Sbjct: 23  HGDRNVQSGDFVAIDTTNSAEFVFLWLALWSIGAVPAFINYN-LSGDPLIHCLKISGAKF 81

Query: 102 ---QSNDIAIVLYTSGSTGIPK----EYRRMYKTG 129
                +D A ++YTSG+TG+PK     +RR   T 
Sbjct: 82  VIVDPDDPAALIYTSGTTGLPKGCAISWRRTLVTS 116


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 32.3 bits (74), Expect = 0.32
 Identities = 17/48 (35%), Positives = 27/48 (56%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
            E  K ++ K+K K K  K +K K KKK+KKK++  A   +  +  G +
Sbjct: 1151 EIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNS 1198



 Score = 32.3 bits (74), Expect = 0.36
 Identities = 13/49 (26%), Positives = 25/49 (51%)

Query: 21   FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
             + E  +K+  K+++ K K K K  K +K K ++   K  ++ A K + 
Sbjct: 1143 EQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKK 1191



 Score = 31.2 bits (71), Expect = 0.79
 Identities = 15/41 (36%), Positives = 19/41 (46%)

Query: 20   PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
               + KA K +  K KKK+KKKKK    K KK   +     
Sbjct: 1160 SKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 15/42 (35%), Positives = 19/42 (45%)

Query: 24   EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAY 65
            +   K K  K +K K KKK+KKKKK    +   A    N   
Sbjct: 1159 KSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 18/77 (23%), Positives = 31/77 (40%), Gaps = 1/77 (1%)

Query: 29   KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSHNNI 88
            KK  KK+ +      KKKKK +KK     KS      +     S   LR      S ++ 
Sbjct: 1307 KKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVK-QASASQSSRLLRRPRKKKSDSSS 1365

Query: 89   PGESMLHADNNNEQSND 105
              +     D++ ++ ++
Sbjct: 1366 EDDDDSEVDDSEDEDDE 1382


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 32.3 bits (74), Expect = 0.34
 Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 8/58 (13%)

Query: 2   NINLTIS-------EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            I  TI        E   DK     PF +       +++K   K  +K KK     K+
Sbjct: 435 RIKGTIELPFKEVCEKLLDK-DKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKE 491



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 1/47 (2%)

Query: 12  FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           F K+I+ +   +EKA  K   +K KK     K+ +    K + IL K
Sbjct: 460 FIKLILLSNNSQEKAVLK-LLQKIKKNNDFIKEFETDALKLQEILEK 505


>gnl|CDD|236096 PRK07787, PRK07787, acyl-CoA synthetase; Validated.
          Length = 471

 Score = 32.3 bits (74), Expect = 0.34
 Identities = 9/14 (64%), Positives = 13/14 (92%)

Query: 107 AIVLYTSGSTGIPK 120
           A+++YTSG+TG PK
Sbjct: 131 ALIVYTSGTTGPPK 144


>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17. 
          Length = 93

 Score = 30.2 bits (68), Expect = 0.35
 Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 2/52 (3%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
          PAP + E   KK   KKKK   K KK  K KK+ K+       ENPA  G T
Sbjct: 29 PAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQE--EAKEENPAENGET 78


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 31.1 bits (71), Expect = 0.36
 Identities = 9/37 (24%), Positives = 22/37 (59%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           E E+  +   K+  K K++K+++ ++K+K+  +   K
Sbjct: 116 EEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 29.2 bits (66), Expect = 1.7
 Identities = 6/35 (17%), Positives = 21/35 (60%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
            +    K+    K++K+++ ++K+K+  K++ + +
Sbjct: 120 IDELLEKELAKLKREKRRENERKQKEILKEQMKML 154



 Score = 28.4 bits (64), Expect = 3.1
 Identities = 8/29 (27%), Positives = 22/29 (75%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
           K+  K K++K+++ ++K+K+  K++ ++L
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQMKML 154


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
          (ETRAMP).  These sequences represent a family of
          proteins from the malaria parasite Plasmodium
          falciparum, several of which have been shown to be
          expressed specifically in the ring stage as well as the
          rodent parasite Plasmodium yoelii. A homologue from
          Plasmodium chabaudi was localized to the
          parasitophorous vacuole membrane. Members have an
          initial hydrophobic, Phe/Tyr-rich, stretch long enough
          to span the membrane, a highly charged region rich in
          Lys, a second putative transmembrane region and a
          second highly charged, low complexity sequence region.
          Some members have up to 100 residues of additional
          C-terminal sequence. These genes have been shown to be
          found in the sub-telomeric regions of both Plasmodium
          falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 29.9 bits (68), Expect = 0.37
 Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 1/38 (2%)

Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           ++ P         KKK  KK KK     +KKKK KKK
Sbjct: 17 NLLAPG-LNNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 28.4 bits (64), Expect = 1.1
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
           K KK KK KK     +KKKK KK+ I++
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKKIIIS 57



 Score = 27.6 bits (62), Expect = 1.9
 Identities = 14/25 (56%), Positives = 15/25 (60%)

Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKK 52
           KK K KK KK     +KKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 32.1 bits (73), Expect = 0.37
 Identities = 14/34 (41%), Positives = 16/34 (47%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           P      +K K K +K KKK    K   KKKKK 
Sbjct: 109 PLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142



 Score = 29.8 bits (67), Expect = 1.9
 Identities = 13/29 (44%), Positives = 14/29 (48%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           EK KKK    K   KKKKK    K +  K
Sbjct: 123 EKLKKKITVNKSTNKKKKKVLSSKDELIK 151



 Score = 29.0 bits (65), Expect = 3.7
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
              + K K +K KKK    K   KKKKK    K 
Sbjct: 114 LMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKD 147



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 12/47 (25%), Positives = 16/47 (34%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
              I      + K +K K K    K   KKKKK    K +      +
Sbjct: 109 PLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNN 155



 Score = 28.3 bits (63), Expect = 6.0
 Identities = 14/43 (32%), Positives = 17/43 (39%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
             R+   K +  KKK    K   KKKKK    +  L K   N 
Sbjct: 114 LMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNK 156


>gnl|CDD|180393 PRK06087, PRK06087, short chain acyl-CoA synthetase; Reviewed.
          Length = 547

 Score = 32.0 bits (73), Expect = 0.38
 Identities = 9/18 (50%), Positives = 15/18 (83%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +++A VL+TSG+ G+PK
Sbjct: 186 GDELAAVLFTSGTEGLPK 203


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 32.2 bits (73), Expect = 0.39
 Identities = 12/32 (37%), Positives = 24/32 (75%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
             K  K ++ K+++K+K++ K++KKKKK+K +
Sbjct: 94  PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125



 Score = 31.4 bits (71), Expect = 0.61
 Identities = 13/37 (35%), Positives = 24/37 (64%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
             P      +++K K++ K++KKKKK+K K++ K R+
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 31.0 bits (70), Expect = 0.87
 Identities = 11/36 (30%), Positives = 23/36 (63%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
             P  + K  K+   +  K+++K+K++ K++KKKK+
Sbjct: 86  KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKK 121



 Score = 31.0 bits (70), Expect = 0.90
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           P E  K +K K + K+K+  K+K+K+K+KK ++ R
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158



 Score = 30.6 bits (69), Expect = 1.0
 Identities = 11/33 (33%), Positives = 22/33 (66%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            +E   +   +++K+K++ K++KKKKK+K K  
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 15/43 (34%), Positives = 26/43 (60%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
            P +  K +  K ++K+K++ K++KKKKK+K K+     K  E
Sbjct: 93  KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135



 Score = 29.5 bits (66), Expect = 2.2
 Identities = 10/36 (27%), Positives = 22/36 (61%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           +     K K  K  K +  K+++K+K++ K++KK++
Sbjct: 85  SKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK 120



 Score = 29.5 bits (66), Expect = 2.6
 Identities = 13/36 (36%), Positives = 27/36 (75%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           PA   + ++ K++ K+K++ K++KKKKK+K K++ +
Sbjct: 94  PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129



 Score = 29.5 bits (66), Expect = 2.9
 Identities = 12/33 (36%), Positives = 23/33 (69%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           + +  ++KK KK+K K++ K +K K++ K+KR 
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142



 Score = 29.5 bits (66), Expect = 2.9
 Identities = 14/37 (37%), Positives = 26/37 (70%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
               E+E+ K++K KKK+K K++ K +K K++ K++R
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141



 Score = 29.1 bits (65), Expect = 3.1
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
             P +RE+ KK++  + K + KK  KKK   KKK+ 
Sbjct: 155 EEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190



 Score = 29.1 bits (65), Expect = 3.6
 Identities = 14/43 (32%), Positives = 26/43 (60%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           E++K K+K  ++ K +K K++ K+K+  K+K +   K  E P 
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158



 Score = 28.3 bits (63), Expect = 5.3
 Identities = 13/38 (34%), Positives = 22/38 (57%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
            R K++ KK  KKK   KKK+  +++K+++  R   K 
Sbjct: 169 VRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKG 206



 Score = 28.3 bits (63), Expect = 6.4
 Identities = 15/53 (28%), Positives = 29/53 (54%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTV 70
           P    +++  K++ K+K+  K+K+K+K+KK ++ + R   K  E    K R  
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPK 176



 Score = 28.3 bits (63), Expect = 6.5
 Identities = 13/44 (29%), Positives = 27/44 (61%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
            + E  K+++ +K++ K++KKKKK+K K++ K R   +  +   
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141



 Score = 27.9 bits (62), Expect = 7.1
 Identities = 16/44 (36%), Positives = 28/44 (63%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
           + +K+K + K++KKKKK+K K++ K +K +  AK    P  K +
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEK 148



 Score = 27.9 bits (62), Expect = 7.3
 Identities = 14/37 (37%), Positives = 24/37 (64%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
               E+EK + K+ KKKKK+K K++ K +K K++ + 
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 30.7 bits (70), Expect = 0.40
 Identities = 9/36 (25%), Positives = 28/36 (77%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          R+K+ +  + ++K+++K+++K++K++K++K +   K
Sbjct: 5  RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGK 40


>gnl|CDD|235722 PRK06164, PRK06164, acyl-CoA synthetase; Validated.
          Length = 540

 Score = 32.0 bits (73), Expect = 0.41
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 147 RAIKYRRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
           RA+     ++TGD G    D   +Y+ R    +++ G  V+ +E+E AL  L  V
Sbjct: 400 RALTDDGYFRTGDLGYTRGDGQFVYQTRMGDSLRLGGFLVNPAEIEHALEALPGV 454


>gnl|CDD|181207 PRK08043, PRK08043, bifunctional acyl-[acyl carrier protein]
           synthetase/2-acylglycerophosphoethanolamine
           acyltransferase; Validated.
          Length = 718

 Score = 32.0 bits (73), Expect = 0.43
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 101 EQSNDIAIVLYTSGSTGIPK 120
           +Q  D A++L+TSGS G PK
Sbjct: 362 QQPEDAALILFTSGSEGHPK 381


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 32.1 bits (73), Expect = 0.46
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 32  KKKKKKKKKKKKKKKKKKKKKRRILA 57
           +KKKKKKK K+KKKKK ++  ++I A
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHA 709



 Score = 31.0 bits (70), Expect = 1.1
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKK 50
           +KK KKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 30.6 bits (69), Expect = 1.4
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 29  KKNKKKKKKKKKKKKKKKKKKKK 51
           +K KKKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 29.5 bits (66), Expect = 2.9
 Identities = 15/22 (68%), Positives = 18/22 (81%)

Query: 37  KKKKKKKKKKKKKKKKRRILAK 58
           +KKKKKKK K+KKKKK R  +K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705



 Score = 28.3 bits (63), Expect = 6.5
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKK 49
           ++KKK KKK K+KKKKK ++  K+
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQ 706



 Score = 28.3 bits (63), Expect = 7.5
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 30  KNKKKKKKKKKKKKKKKKKKKKK 52
           + KKKKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 31.6 bits (71), Expect = 0.54
 Identities = 7/36 (19%), Positives = 15/36 (41%)

Query: 24   EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
            E+AKK     K + +    + +  ++K +     K 
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374



 Score = 30.9 bits (69), Expect = 1.0
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 23   REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
              +A   + +  ++K +  +KKK++ KKK      K+ E
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389



 Score = 30.9 bits (69), Expect = 1.2
 Identities = 6/31 (19%), Positives = 16/31 (51%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            +  +A K + +    + +  ++K +  +KKK
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 30.1 bits (67), Expect = 1.6
 Identities = 9/36 (25%), Positives = 22/36 (61%)

Query: 23   REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            + +A+   ++ +  ++K +  +KKK++ KK+   AK
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384



 Score = 29.7 bits (66), Expect = 2.5
 Identities = 7/30 (23%), Positives = 14/30 (46%)

Query: 24   EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            + AKKK  + KK  +  K + +    + + 
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361



 Score = 29.7 bits (66), Expect = 2.6
 Identities = 6/40 (15%), Positives = 16/40 (40%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
            ++ +  KK  +  K + +    + +  ++K      K  E
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375



 Score = 29.7 bits (66), Expect = 2.7
 Identities = 5/36 (13%), Positives = 18/36 (50%)

Query: 23   REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            ++KA++ K   +  K + +    + +  +++   A+
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370



 Score = 29.3 bits (65), Expect = 3.2
 Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 3/43 (6%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKK---KKKRRILAKSCE 61
            E  KA+ +    + +  ++K +  +KKK   KKK     K  E
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAE 1388



 Score = 29.0 bits (64), Expect = 4.2
 Identities = 6/36 (16%), Positives = 19/36 (52%)

Query: 23   REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            ++ A+  K + +    + +  ++K +  +K++  AK
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377



 Score = 29.0 bits (64), Expect = 4.3
 Identities = 10/40 (25%), Positives = 16/40 (40%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
            E  K K    KKK ++ KK  +  K + +        + E
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364



 Score = 28.6 bits (63), Expect = 5.0
 Identities = 17/48 (35%), Positives = 24/48 (50%)

Query: 23   REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTV 70
             EKA+  + KK++ KKK    KKK ++KKK     K  E    K   +
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410



 Score = 28.2 bits (62), Expect = 6.9
 Identities = 16/45 (35%), Positives = 23/45 (51%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
            E +KA + K K ++ KKK    KKK ++ KK    AK+    A  
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357



 Score = 28.2 bits (62), Expect = 7.1
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            E  + K +  +KKK++ KKK    KKK ++K++
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 31.6 bits (72), Expect = 0.55
 Identities = 13/42 (30%), Positives = 20/42 (47%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           PA  +  K      +K   K  +K K ++ +KK K+  LAK 
Sbjct: 330 PAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKV 371



 Score = 30.8 bits (70), Expect = 0.91
 Identities = 10/43 (23%), Positives = 19/43 (44%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
               ++K   K  +K K ++ +KK KK +   +AK+       
Sbjct: 339 AVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           E+  AK  +  K ++ +KK KK +  K  K
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAK 373



 Score = 28.5 bits (64), Expect = 4.5
 Identities = 9/31 (29%), Positives = 14/31 (45%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            +   K   K K ++ +KK KK +  K  K 
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAKA 374


>gnl|CDD|240316 PTZ00216, PTZ00216, acyl-CoA synthetase; Provisional.
          Length = 700

 Score = 31.5 bits (72), Expect = 0.57
 Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 2/33 (6%)

Query: 90  GESML--HADNNNEQSNDIAIVLYTSGSTGIPK 120
           G S    H  N  E ++D+A+++YTSG+TG PK
Sbjct: 248 GHSAGSHHPLNIPENNDDLALIMYTSGTTGDPK 280


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 31.8 bits (72), Expect = 0.57
 Identities = 11/42 (26%), Positives = 18/42 (42%)

Query: 12   FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
             DK    A   REK ++   + +    KK  ++  KK   K+
Sbjct: 1168 LDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKK 1209



 Score = 29.8 bits (67), Expect = 2.4
 Identities = 7/41 (17%), Positives = 15/41 (36%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
            E  + K ++   + +    KK  ++  KK   +   K    
Sbjct: 1176 EEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASE 1216


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
          protein involved in ribosome biogenesis.
          Length = 173

 Score = 30.9 bits (70), Expect = 0.58
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          KKKKN+    K  +K+ K+K +KKK   I
Sbjct: 3  KKKKNRSSNYKVNRKRLKRKDRKKKINII 31



 Score = 30.1 bits (68), Expect = 1.0
 Identities = 9/33 (27%), Positives = 17/33 (51%)

Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
          +K KK +    K  +K+ K+K +K++I      
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34



 Score = 29.3 bits (66), Expect = 1.8
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKR 53
          +KK K +    K  +K+ K+K +KK+
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKK 27



 Score = 28.2 bits (63), Expect = 3.8
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          K KK ++   K  +K+ K+K +KKK    
Sbjct: 3  KKKKNRSSNYKVNRKRLKRKDRKKKINII 31



 Score = 28.2 bits (63), Expect = 3.9
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          ++K  +  N K  +K+ K+K +KKK       I+AK
Sbjct: 3  KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAK 38



 Score = 27.0 bits (60), Expect = 8.4
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
          KKKK +    K  +K+ K+K R+       NP
Sbjct: 3  KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
          of this family (Kri1p) is found to be required for 40S
          ribosome biogenesis in the nucleolus. This is the
          C-terminal domain of the family.
          Length = 93

 Score = 29.5 bits (67), Expect = 0.58
 Identities = 13/31 (41%), Positives = 22/31 (70%), Gaps = 1/31 (3%)

Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
          AP+ R+K +++K+KKK  KK + ++ KKK  
Sbjct: 64 APY-RDKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 26.4 bits (59), Expect = 6.3
 Identities = 8/25 (32%), Positives = 18/25 (72%)

Query: 32 KKKKKKKKKKKKKKKKKKKKKRRIL 56
          K++++K KKK  KK + ++ K+++ 
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|213300 cd05934, FACL_DitJ_like, Uncharacterized subfamily of fatty acid
           CoA ligase (FACL).  Fatty acyl-CoA ligases catalyze the
           ATP-dependent activation of fatty acids in a two-step
           reaction. The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. This is a
           required step before free fatty acids can participate in
           most catabolic and anabolic reactions. Members of this
           family include DitJ from Pseudomonas and similar
           proteins.
          Length = 421

 Score = 31.4 bits (72), Expect = 0.58
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A +LYTSG+TG PK
Sbjct: 82  DTAAILYTSGTTGPPK 97


>gnl|CDD|213298 cd05932, LC_FACS_bac, Bacterial long-chain fatty acid CoA
           synthetase (LC-FACS), including Marinobacter
           hydrocarbonoclasticus isoprenoid Coenzyme A synthetase. 
           The members of this family are bacterial long-chain
           fatty acid CoA synthetase. Marinobacter
           hydrocarbonoclasticus isoprenoid Coenzyme A synthetase
           in this family is involved in the synthesis of
           isoprenoid wax ester storage compounds when grown on
           phytol as the sole carbon source. LC-FACS catalyzes the
           formation of fatty acyl-CoA in a two-step reaction: the
           formation of a fatty acyl-AMP molecule as an
           intermediate, and the formation of a fatty acyl-CoA.
           Free fatty acids must be "activated" to their CoA
           thioesters before participating in most catabolic and
           anabolic reactions.
          Length = 504

 Score = 31.5 bits (72), Expect = 0.61
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
            D+A ++YTSG+TG PK
Sbjct: 137 EDLATIVYTSGTTGQPK 153


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
          region.  This highly conserved region is found towards
          the C-terminus of the transmembrane domain. The
          function is unclear.
          Length = 151

 Score = 30.3 bits (68), Expect = 0.63
 Identities = 15/31 (48%), Positives = 22/31 (70%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          E  K + +K+ KKKK+K  +KK K KKK++K
Sbjct: 16 ESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46



 Score = 28.0 bits (62), Expect = 3.7
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
          +E  K +  K  KKKK+K  +KK K KKK
Sbjct: 15 KESGKTQSQKSDKKKKEKVSEKKGKSKKK 43



 Score = 28.0 bits (62), Expect = 3.9
 Identities = 14/31 (45%), Positives = 23/31 (74%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          +R+++ K +++K  KKKK+K  +KK K KKK
Sbjct: 13 QRKESGKTQSQKSDKKKKEKVSEKKGKSKKK 43


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 31.6 bits (72), Expect = 0.64
 Identities = 17/99 (17%), Positives = 38/99 (38%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSA 81
           E+       N   K  K+++K      + +   +     +    +    L     +L  +
Sbjct: 222 EKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGIS 281

Query: 82  SLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
            +  +++      +    NE  + I  ++YTSG++G PK
Sbjct: 282 IILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGTSGKPK 320


>gnl|CDD|168170 PRK05677, PRK05677, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 562

 Score = 31.3 bits (71), Expect = 0.64
 Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 11/58 (18%)

Query: 63  PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           PAY       LPQ    + +L+     G+ +  A   N Q++D+A++ YT G+TG+ K
Sbjct: 177 PAY------HLPQAVKFNDALAKGA--GQPVTEA---NPQADDVAVLQYTGGTTGVAK 223


>gnl|CDD|240325 PTZ00237, PTZ00237, acetyl-CoA synthetase; Provisional.
          Length = 647

 Score = 31.3 bits (71), Expect = 0.64
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 3/49 (6%)

Query: 150 KYRRMYKTGDYGRIVDDILLYE--GRTDSQVKIRGNRVDLSEVEAALSK 196
           K+   Y +GD G   D+   Y    R+D Q+KI GN+V L+ +E ++ K
Sbjct: 489 KFPGYYNSGDLG-FKDENGYYTIVSRSDDQIKISGNKVQLNTIETSILK 536



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 236 RTDSQVKIRGNRVDLSEVEAALSK 259
           R+D Q+KI GN+V L+ +E ++ K
Sbjct: 513 RSDDQIKISGNKVQLNTIETSILK 536


>gnl|CDD|235809 PRK06464, PRK06464, phosphoenolpyruvate synthase; Validated.
          Length = 795

 Score = 31.3 bits (72), Expect = 0.65
 Identities = 8/21 (38%), Positives = 14/21 (66%), Gaps = 2/21 (9%)

Query: 134 IVDDIHRCWSS--GDRAIKYR 152
           +++ +  C++S   DRAI YR
Sbjct: 153 VLEAVKECFASLFTDRAISYR 173


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 29.9 bits (68), Expect = 0.65
 Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 5/42 (11%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKK-----KKKKKKKRRI 55
           A   R KA +++ K K+KKKKKKK+ +     + ++KKK  +
Sbjct: 64  AGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEEL 105



 Score = 26.8 bits (60), Expect = 9.4
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          K K   ++ K  ++++K K+KKKKKKK+ 
Sbjct: 61 KRKAGASRNKAAEERRKLKEKKKKKKKEL 89


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 30.6 bits (70), Expect = 0.67
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
           FE EK KKK   K++KK  K++K K ++ 
Sbjct: 95  FEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
          family of proteins has no known function. However, one
          of the members is annotated as an EF-hand family
          protein.
          Length = 349

 Score = 31.3 bits (71), Expect = 0.67
 Identities = 14/38 (36%), Positives = 25/38 (65%)

Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
          K+ +P PF+  K +KKK +KK  ++ K +++  K +KK
Sbjct: 2  KITVPQPFQMTKREKKKREKKSIRQSKLEEELNKLEKK 39


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 29.8 bits (67), Expect = 0.69
 Identities = 13/25 (52%), Positives = 20/25 (80%)

Query: 31  NKKKKKKKKKKKKKKKKKKKKKRRI 55
           N+++ KK KKK++KKKKKKK  + +
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKNL 123



 Score = 27.1 bits (60), Expect = 6.8
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKK 52
            +++ KK KKK++KKKKKKK  K   
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKNLA 124



 Score = 27.1 bits (60), Expect = 7.1
 Identities = 15/30 (50%), Positives = 19/30 (63%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKK 47
           PA   + +AKK K K++KKKKKKK  K   
Sbjct: 95  PAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124



 Score = 26.7 bits (59), Expect = 8.5
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKR 53
            ++  KK KKK++KKKKKKK  K   
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKNLA 124


>gnl|CDD|213278 cd05910, FACL_like_1, Uncharacterized subfamily of fatty acid CoA
           ligase (FACL).  Fatty acyl-CoA ligases catalyze the
           ATP-dependent activation of fatty acids in a two-step
           reaction. The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. This is a
           required step before free fatty acids can participate in
           most catabolic and anabolic reactions.
          Length = 455

 Score = 31.2 bits (71), Expect = 0.71
 Identities = 12/21 (57%), Positives = 14/21 (66%), Gaps = 2/21 (9%)

Query: 105 DIAIVLYTSGSTGIPK--EYR 123
           D A +L+TSGSTG PK   Y 
Sbjct: 86  DPAAILFTSGSTGPPKGVVYT 106


>gnl|CDD|102207 PRK06145, PRK06145, acyl-CoA synthetase; Validated.
          Length = 497

 Score = 31.0 bits (70), Expect = 0.73
 Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
           +++GD G + ++  LY   R    +   G  +  SEVE  + +L EV + +V+
Sbjct: 375 FRSGDVGYLDEEGFLYLTDRKKDMIISGGENIASSEVERVIYELPEVAEAAVI 427


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 30.7 bits (70), Expect = 0.75
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
             E  +  +   + ++  K+KKKKKKKKKKK  +    CE 
Sbjct: 69  NDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109



 Score = 28.4 bits (64), Expect = 4.2
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 29  KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
               ++  K+KKKKKKKKKKK++ R  + +  
Sbjct: 79  ADRAEELLKEKKKKKKKKKKKEELREWMCEKF 110



 Score = 28.0 bits (63), Expect = 6.6
 Identities = 14/46 (30%), Positives = 25/46 (54%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           +K  +         +  + ++  K+KKKKKKKKKKK++ +  +  K
Sbjct: 64  EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 31.3 bits (71), Expect = 0.78
 Identities = 20/58 (34%), Positives = 25/58 (43%), Gaps = 23/58 (39%)

Query: 22  EREKAKKKKNKKKKK-----------------------KKKKKKKKKKKKKKKKRRIL 56
           E EK +++  KK K                         K KK KKKKKKKKK+R+ L
Sbjct: 235 EAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 32  KKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           K KK KKKKKKKKK++K   +  +  ++    +
Sbjct: 274 KFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGS 306



 Score = 29.0 bits (65), Expect = 3.6
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKK 43
             E  K KK K KKKKKKK++K  
Sbjct: 269 VSEMVKFKKPKKKKKKKKKRRKDL 292


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 31.0 bits (70), Expect = 0.79
 Identities = 16/58 (27%), Positives = 18/58 (31%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSAS 82
           K  KK  KK  KK  K  KK  KK  K     AK                  +  + S
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGS 307



 Score = 30.6 bits (69), Expect = 1.0
 Identities = 14/42 (33%), Positives = 19/42 (45%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
              + KK  K   K  KK KK  KK  KK  + + K+ +  A
Sbjct: 234 TAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAA 275



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 14/46 (30%), Positives = 19/46 (41%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
            +   KK + K K   KK  KKK  K     +  AK+    A K +
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAK 253



 Score = 30.2 bits (68), Expect = 1.2
 Identities = 17/40 (42%), Positives = 20/40 (50%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           KA KK  K  KK  KK  K  KK  KK  +  AK+ +  A
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAA 286



 Score = 30.2 bits (68), Expect = 1.4
 Identities = 15/43 (34%), Positives = 17/43 (39%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
           KA K   K  K   K  K K K KKK  ++  A S      K 
Sbjct: 273 KAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315



 Score = 29.9 bits (67), Expect = 1.6
 Identities = 16/42 (38%), Positives = 17/42 (40%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
            A K   KK   K K   KK  KKK  K  + AK     A K
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAK 247



 Score = 29.9 bits (67), Expect = 1.9
 Identities = 16/42 (38%), Positives = 17/42 (40%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
           K   K   K  KK KK  KK  KK  K  +  AK     A K
Sbjct: 239 KKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAK 280



 Score = 29.1 bits (65), Expect = 3.1
 Identities = 14/50 (28%), Positives = 16/50 (32%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
               +     K   KK  KKK  K     KK  K     AK  +  A K 
Sbjct: 210 SGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKA 259



 Score = 28.3 bits (63), Expect = 4.6
 Identities = 16/44 (36%), Positives = 16/44 (36%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
                KK KK  KK  KK  K  KK  KK    A      A K 
Sbjct: 245 AAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKA 288



 Score = 27.9 bits (62), Expect = 6.2
 Identities = 12/34 (35%), Positives = 14/34 (41%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
              ++ K   KK  KK  K  KK  KK  K   K
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAK 280



 Score = 27.9 bits (62), Expect = 7.6
 Identities = 14/50 (28%), Positives = 18/50 (36%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
           + A   ++ AK    K   KK   K K   KK  KK+          A K
Sbjct: 194 VGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAK 243



 Score = 27.5 bits (61), Expect = 9.1
 Identities = 14/46 (30%), Positives = 19/46 (41%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           A    +K  KKK  K     KK  K   K  KK ++   K+ +  A
Sbjct: 219 AKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAA 264



 Score = 27.5 bits (61), Expect = 9.6
 Identities = 16/49 (32%), Positives = 16/49 (32%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
           PA     K    K K   KK  KKK  K     KK    A      A K
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKK 254


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
          structure and biogenesis].
          Length = 107

 Score = 29.5 bits (66), Expect = 0.83
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           K KK +  K    K KKK  KKKKK++ RR 
Sbjct: 8  SKEKKAEKAKAGTAKDKKKWSKKKKKEEARRA 39



 Score = 26.4 bits (58), Expect = 8.1
 Identities = 16/37 (43%), Positives = 20/37 (54%)

Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          AP  +   +KK  K K    K KKK  KKKKK++ R 
Sbjct: 2  APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARR 38


>gnl|CDD|171961 PRK13295, PRK13295, cyclohexanecarboxylate-CoA ligase; Reviewed.
          Length = 547

 Score = 30.8 bits (70), Expect = 0.86
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D+  ++YTSG+TG PK
Sbjct: 197 DDVTQLIYTSGTTGEPK 213


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 30.1 bits (68), Expect = 0.86
 Identities = 14/44 (31%), Positives = 20/44 (45%)

Query: 8   SEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           S    ++   P      K KKK+   K K +K+K K + KK K 
Sbjct: 105 STDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148



 Score = 29.7 bits (67), Expect = 0.97
 Identities = 9/37 (24%), Positives = 19/37 (51%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           E     ++++  + K + K+KKK++  K K  +   K
Sbjct: 104 ESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 29.3 bits (66), Expect = 1.4
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           E+E   + K + K+KKK++  K K +K+K K  
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 28.6 bits (64), Expect = 2.4
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            P   + E  +KKK +  K K +K+K K + KK K 
Sbjct: 113 DPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148



 Score = 28.2 bits (63), Expect = 3.9
 Identities = 8/33 (24%), Positives = 16/33 (48%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           E    + ++    + K + K+KKK++  K K  
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTE 135



 Score = 27.8 bits (62), Expect = 5.5
 Identities = 12/31 (38%), Positives = 15/31 (48%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
               +RE  K K  K+K K + KK K  K K
Sbjct: 122 KEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 27.0 bits (60), Expect = 7.7
 Identities = 7/30 (23%), Positives = 18/30 (60%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           E E  ++  ++ +++   + K + K+KKK+
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKR 127


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.8 bits (70), Expect = 0.90
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           A    E   +  NK  + K+K K ++ K+K++K+    AK
Sbjct: 258 AWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAK 297



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           K K ++NK+K++K+ +++ K++K+ KKK   LA+
Sbjct: 278 KTKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR 311



 Score = 28.1 bits (63), Expect = 6.4
 Identities = 10/38 (26%), Positives = 24/38 (63%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
                R K K K  + K+K++K+ +++ K++K+ K+++
Sbjct: 269 INKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKL 306



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 9/37 (24%), Positives = 21/37 (56%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           E E   K    K+K K ++ K+K++K+ +++ +   +
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301


>gnl|CDD|213276 cd05908, A_NRPS_MycA_like, The adenylation domain of nonribosomal
           peptide synthetases (NRPS) similar to mycosubtilin
           synthase subunit A (MycA).  The adenylation (A) domain
           of NRPS recognizes a specific amino acid or hydroxy acid
           and activates it as (amino)-acyl adenylate by hydrolysis
           of ATP. The activated acyl moiety then forms thioester
           to the enzyme-bound cofactor phosphopantetheine of a
           peptidyl carrier protein domain. This family includes
           NRPS similar to mycosubtilin synthase subunit A (MycA).
           Mycosubtilin, which is characterized by a beta-amino
           fatty acid moiety linked to the circular heptapeptide
           Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin
           family of lipopeptide antibiotics. The mycosubtilin
           synthase subunit A (MycA) combines functional domains
           derived from peptide synthetases, amino transferases,
           and fatty acid synthases. Nonribosomal peptide
           synthetases are large multifunction enzymes that
           synthesize many therapeutically useful peptides. NRPS
           has a distinct modular structure in which each module is
           responsible for the recognition, activation, and, in
           some cases, modification of a single amino acid residue
           of the final peptide product. The modules can be
           subdivided into domains that catalyze specific
           biochemical reactions.
          Length = 499

 Score = 30.7 bits (70), Expect = 0.92
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 100 NEQSNDIAIVLYTSGSTGIPK 120
               +DIA + ++SGSTG PK
Sbjct: 102 LSLPDDIAFIQFSSGSTGEPK 122


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation
          of 60S ribosomal subunits.
          Length = 132

 Score = 29.6 bits (67), Expect = 0.98
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           K  K KKK K +K +KK K++ + +KR+ L K
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           R K   K  KK K +K +KK K++ + +K++ +
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQAL 58



 Score = 28.1 bits (63), Expect = 2.8
 Identities = 10/33 (30%), Positives = 22/33 (66%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           +  KK K++K +KK K++ + +K++  +K R+
Sbjct: 31 LKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRV 63



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 17/46 (36%), Positives = 22/46 (47%)

Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLR 77
          + KK  K KKK K +K +KK +R L         KGR    LP+  
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDL 71



 Score = 26.6 bits (59), Expect = 9.4
 Identities = 10/41 (24%), Positives = 25/41 (60%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
           ++K K +K +KK K++ + +K++  +K + + +L +  E 
Sbjct: 33 AKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEK 73


>gnl|CDD|178337 PLN02736, PLN02736, long-chain acyl-CoA synthetase.
          Length = 651

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D+A + YTSG+TG PK
Sbjct: 222 DVATICYTSGTTGTPK 237


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 4/45 (8%)

Query: 9   EHHFDKV---IIPAPFEREKAKK-KKNKKKKKKKKKKKKKKKKKK 49
           E  +  +   +I       KA+K  KN++KK + K+K K   KKK
Sbjct: 88  EPKYRLIRLGLIKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 27.7 bits (62), Expect = 4.7
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            KKKK  +K +K+ K ++KK + K+K +    K
Sbjct: 99  IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKK 131


>gnl|CDD|213299 cd05933, ACSBG_like, Bubblegum-like very long-chain fatty acid CoA
           synthetase (VL-FACS).  This family of very long-chain
           fatty acid CoA synthetase is named bubblegum because
           Drosophila melanogaster mutant bubblegum (BGM) has
           elevated levels of very-long-chain fatty acids (VLCFA)
           caused by a defective gene of this family. The human
           homolog (hsBG) has been characterized as a very long
           chain fatty acid CoA synthetase that functions
           specifically in the brain; hsBG may play a central role
           in brain VLCFA metabolism and myelinogenesis. VL-FACS is
           involved in the first reaction step of very long chain
           fatty acid degradation. It catalyzes the formation of
           fatty acyl-CoA in a two-step reaction: the formation of
           a fatty acyl-AMP molecule as an intermediate, and the
           formation of a fatty acyl-CoA. Free fatty acids must be
           "activated" to their CoA thioesters before participating
           in most catabolic and anabolic reactions.
          Length = 594

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 91  ESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           +  L A   +++ N    ++YTSG+TG+PK
Sbjct: 137 DEQLDAIIESQKPNQCCTLIYTSGTTGMPK 166


>gnl|CDD|235279 PRK04319, PRK04319, acetyl-CoA synthetase; Provisional.
          Length = 570

 Score = 30.6 bits (70), Expect = 1.0
 Identities = 12/16 (75%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D AI+ YTSGSTG PK
Sbjct: 206 DGAILHYTSGSTGKPK 221


>gnl|CDD|213316 cd05969, MACS_like_4, Uncharacterized subfamily of Acetyl-CoA
           synthetase like family (ACS).  This family is most
           similar to acetyl-CoA synthetase. Acetyl-CoA synthetase
           (ACS) catalyzes the formation of acetyl-CoA from
           acetate, CoA, and ATP. Synthesis of acetyl-CoA is
           carried out in a two-step reaction. In the first step,
           the enzyme catalyzes the synthesis of acetyl-AMP
           intermediate from acetate and ATP. In the second step,
           acetyl-AMP reacts with CoA to produce acetyl-CoA. This
           enzyme is only present in bacteria.
          Length = 443

 Score = 30.5 bits (69), Expect = 1.0
 Identities = 31/118 (26%), Positives = 43/118 (36%), Gaps = 19/118 (16%)

Query: 107 AIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWSSGDRAI-----KYRR-----MYK 156
           A V+   G    P       + G+       +   W S  R       +Y        Y 
Sbjct: 271 AAVIERDGDGLTP--VTGPGQVGELA-----LKPGWPSMFRGYLGNEERYASSFVDGWYL 323

Query: 157 TGDYG-RIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKPGEV 213
           TGD   R  D    + GR D  +K  G+ V   EVE+AL +   V +  V+  KP  V
Sbjct: 324 TGDLAYRDEDGYFWFVGRADDVIKTAGHLVGPFEVESALMEHPAVAEAGVIG-KPDPV 380



 Score = 27.8 bits (62), Expect = 7.6
 Identities = 9/16 (56%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A++ +TSG+TG PK
Sbjct: 90  DPALLHFTSGTTGKPK 105


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
          short proteins that are rich in aspartate, glutamate,
          lysine and arginine. Although the function of these
          proteins is unknown, they are found to be ubiquitously
          expressed.
          Length = 38

 Score = 27.2 bits (61), Expect = 1.1
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKK 49
          RE+  KK  KKKKK  K + K   K  
Sbjct: 11 RERNAKKAAKKKKKGAKSQLKAAAKAL 37



 Score = 27.2 bits (61), Expect = 1.1
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          RE+A+++  KK  KKKKK  K + K   K 
Sbjct: 7  REQARERNAKKAAKKKKKGAKSQLKAAAKA 36


>gnl|CDD|218991 pfam06326, Vesiculo_matrix, Vesiculovirus matrix protein.  This
          family consists of several Vesiculovirus matrix
          proteins. The matrix (M) protein of vesicular
          stomatitis virus (VSV) expressed in the absence of
          other viral components causes many of the cytopathic
          effects of VSV, including an inhibition of host gene
          expression and the induction of cell rounding. It has
          been shown that M protein also induces apoptosis in the
          absence of other viral components. It is thought that
          the activation of apoptotic pathways causes the
          inhibition of host gene expression and cell rounding by
          M protein.
          Length = 229

 Score = 30.1 bits (68), Expect = 1.1
 Identities = 12/43 (27%), Positives = 17/43 (39%)

Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
           K  K K K+   KK K  KK  +       P+Y+     + P
Sbjct: 4  FKLLKLKGKEVASKKSKGSKKLNMSMDWDAPPSYEETRPGAAP 46


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap
          is involved in recombination. Rap (recombination adept
          with plasmid) increases lambda-by-plasmid recombination
          catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 10/36 (27%), Positives = 22/36 (61%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
          E A   K +K ++KK+K + + ++++ K R+   K+
Sbjct: 29 ECALALKREKAQEKKRKAEAQAERRELKARKEKLKT 64



 Score = 28.8 bits (65), Expect = 2.7
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          REKA++KK K + + ++++ K +K+K K + 
Sbjct: 36 REKAQEKKRKAEAQAERRELKARKEKLKTRS 66


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 15/28 (53%), Positives = 21/28 (75%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKK 48
           FER+  K     K+KKK+KKK++KKKK+
Sbjct: 317 FERDNEKLDAEVKEKKKEKKKEEKKKKQ 344



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           FE     K+K K K ++  +K   + K+KKK+++   K 
Sbjct: 303 FEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341



 Score = 30.0 bits (68), Expect = 1.4
 Identities = 13/35 (37%), Positives = 25/35 (71%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
            + +   ++ N+K   + K+KKK+KKK++KKK++I
Sbjct: 311 RKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQI 345



 Score = 28.1 bits (63), Expect = 5.6
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
             + +  +  +    + K+KKK+KKK++KKKK+
Sbjct: 312 KLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQ 344



 Score = 28.1 bits (63), Expect = 6.6
 Identities = 13/32 (40%), Positives = 24/32 (75%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           + EK   +  +KKK+KKK++KKKK+ ++ ++R
Sbjct: 320 DNEKLDAEVKEKKKEKKKEEKKKKQIERLEER 351



 Score = 28.1 bits (63), Expect = 6.8
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
            E+  +K + + K+KKK+KKK++KKKK+ +R
Sbjct: 317 FERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 30.7 bits (69), Expect = 1.2
 Identities = 16/49 (32%), Positives = 21/49 (42%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTV 70
           ER K K+K+   KK   K KK++K  K+K        S    A    T 
Sbjct: 562 ERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSAFFAKLQETK 610



 Score = 28.0 bits (62), Expect = 7.3
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           E ++  K + ++KKK+K++  KK   K KK+R+I
Sbjct: 553 ELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKI 586


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 17/45 (37%), Positives = 23/45 (51%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
           + ++ +NK+  KKKKKKKKKKKKK   +   LA           
Sbjct: 45 SENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAA 89



 Score = 30.1 bits (68), Expect = 1.8
 Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 3/50 (6%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSL 73
           +   +  + + K+  KKKKKKKKKKKKK         + AY    V S 
Sbjct: 41 AELISENQEAENKQNNKKKKKKKKKKKKKN---LGEAYDLAYDLPVVWSS 87



 Score = 28.9 bits (65), Expect = 4.2
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
          + K++K + K+K  KKK KK KK        LA+    
Sbjct: 9  EVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISE 46



 Score = 28.5 bits (64), Expect = 4.6
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            E  + +  ++K++ K+K  KKK KK KK 
Sbjct: 2  MNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32



 Score = 27.8 bits (62), Expect = 7.8
 Identities = 15/38 (39%), Positives = 18/38 (47%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
          E E  +  K KKKKKKKKKKK   +         +  S
Sbjct: 49 EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWS 86


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 28.7 bits (65), Expect = 1.2
 Identities = 15/43 (34%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKK--KKKKKKKKKKKKRR 54
           V +P     E  K++K KKKKK + +   +KK+KK+  ++R+
Sbjct: 25 GVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRK 67



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 15/42 (35%), Positives = 25/42 (59%), Gaps = 3/42 (7%)

Query: 14 KVIIPAPFEREKAKKKKNKKK---KKKKKKKKKKKKKKKKKK 52
          + I    F+REK KKKK  +     +KK+KK+  +++K  +K
Sbjct: 30 EKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQK 71


>gnl|CDD|213317 cd05970, MACS_AAE_MA_like, Medium-chain acyl-CoA synthetase (MACS)
           of AAE_MA like.  MACS catalyzes the two-step activation
           of medium chain fatty acids (containing 4-12 carbons).
           The carboxylate substrate first reacts with ATP to form
           an acyl-adenylate intermediate, which then reacts with
           CoA to produce an acyl-CoA ester. This family of MACS
           enzymes is found in archaea and bacteria. It is
           represented by the acyl-adenylating enzyme from
           Methanosarcina acetivorans (AAE_MA). AAE_MA is most
           active with propionate, butyrate, and the branched
           analogs: 2-methyl-propionate, butyrate, and pentanoate.
           The specific activity is weaker for smaller or larger
           acids.
          Length = 537

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 17/41 (41%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 155 YKTGDYG-RIVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
           Y TGD      D  L + GR D  +K  G R+   EVE+AL
Sbjct: 414 YHTGDTAWMDEDGYLWFVGRADDLIKSSGYRIGPFEVESAL 454



 Score = 28.2 bits (63), Expect = 6.4
 Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 10/80 (12%)

Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQIWEAPSSAG 287
           D  L + GR D  +K  G R+   EVE+AL +   V    + C   G  D I       G
Sbjct: 425 DGYLWFVGRADDLIKSSGYRIGPFEVESALIQHPAV----LECAVTGVPDPI------RG 474

Query: 288 STDSSFIVKSREDRAEEELV 307
               + IV ++     EEL 
Sbjct: 475 QVVKATIVLTKGYEPSEELK 494



 Score = 28.2 bits (63), Expect = 6.6
 Identities = 10/16 (62%), Positives = 14/16 (87%)

Query: 105 DIAIVLYTSGSTGIPK 120
           DI +V +TSG+TG+PK
Sbjct: 186 DILLVYFTSGTTGMPK 201


>gnl|CDD|132249 TIGR03205, pimA, dicarboxylate--CoA ligase PimA.  PimA, a member of
           a large family of acyl-CoA ligases, is found in a
           characteristic operon pimFABCDE for the metabolism of
           pimelate and related compounds. It is found, so far, in
           Bradyrhizobium japonicum and several strains of
           Rhodopseudomonas palustris. PimA from R. palustris was
           shown to be active as a CoA ligase for C(7) to C(14)
           dicarboxylates and fatty acids.
          Length = 541

 Score = 30.3 bits (68), Expect = 1.2
 Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 3/45 (6%)

Query: 104 NDIAIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWSSGDRA 148
           +D+A++ YT G+TG+PK    M   G+    V  I+  W    RA
Sbjct: 192 DDVALLQYTGGTTGLPKG--AMLTHGNLTSAV-SIYDVWGKPSRA 233


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
           PA FER  A   K K++  KK+KKKK KK K
Sbjct: 344 PAAFERPLALSPKRKREGDKKQKKKKSKKLK 374



 Score = 28.1 bits (63), Expect = 6.9
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 32  KKKKKKKKKKKKKKKKKKK 50
           K+K++  KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374



 Score = 28.1 bits (63), Expect = 6.9
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 33  KKKKKKKKKKKKKKKKKKK 51
           K+K++  KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374



 Score = 28.1 bits (63), Expect = 6.9
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 34  KKKKKKKKKKKKKKKKKKK 52
           K+K++  KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374


>gnl|CDD|213287 cd05920, 23DHB-AMP_lg, 2,3-dihydroxybenzoate-AMP ligase.
           2,3-dihydroxybenzoate-AMP ligase activates
           2,3-dihydroxybenzoate (DHB) by ligation of AMP from ATP
           with the release of pyrophosphate. However, it can also
           catalyze the ATP-PPi exchange for 2,3-DHB analogs, such
           as salicyclic acid (o-hydrobenzoate), as well as 2,4-DHB
           and 2,5-DHB, but with less efficiency. Proteins in this
           family are the stand-alone adenylation components of
           non-ribosomal peptide synthases (NRPSs) involved in the
           biosynthesis of siderophores, which are low molecular
           weight iron-chelating compounds synthesized by many
           bacteria to aid in the acquisition of this vital trace
           elements. In Escherichia coli, the
           2,3-dihydroxybenzoate-AMP ligase is called EntE, the
           adenylation component of the enterobactin NRPS system.
          Length = 483

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 7/16 (43%), Positives = 12/16 (75%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D+A+   + G+TG+PK
Sbjct: 140 DVALFQLSGGTTGLPK 155


>gnl|CDD|213319 cd05972, MACS_like, Medium-chain acyl-CoA synthetase (MACS or
           ACSM).  MACS catalyzes the two-step activation of medium
           chain fatty acids (containing 4-12 carbons). The
           carboxylate substrate first reacts with ATP to form an
           acyl-adenylate intermediate, which then reacts with CoA
           to produce an acyl-CoA ester. The acyl-CoA is a key
           intermediate in many important biosynthetic and
           catabolic processes.
          Length = 430

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 1/47 (2%)

Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
           Y TGD     +D    + GR D  +K  G R+   EVE+AL +   V
Sbjct: 309 YLTGDRAIKDEDGYFWFVGRADDVIKSSGYRIGPFEVESALIEHPAV 355



 Score = 29.6 bits (67), Expect = 2.1
 Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D A++ +TSG+TG+PK
Sbjct: 82  DPALLYFTSGTTGLPK 97


>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
           family).  Protein phosphatase 2A (PP2A) is a major
           intracellular protein phosphatase that regulates
           multiple aspects of cell growth and metabolism. The
           ability of this widely distributed heterotrimeric enzyme
           to act on a diverse array of substrates is largely
           controlled by the nature of its regulatory B subunit.
           There are multiple families of B subunits (See also
           pfam01240), this family is called the B56 family.
          Length = 405

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
                +   K K+K+ K+KKK+KK+KK   RI
Sbjct: 369 PDLFDECGNKYKEKEAKEKKKEKKRKKTWARI 400


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.2 bits (68), Expect = 1.4
 Identities = 13/35 (37%), Positives = 15/35 (42%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
              E   KK   KKK KK KKK+ K+   K     
Sbjct: 140 LGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEE 174


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 29.9 bits (67), Expect = 1.4
 Identities = 13/23 (56%), Positives = 14/23 (60%)

Query: 31  NKKKKKKKKKKKKKKKKKKKKKR 53
            K K   KKK K KK KKK+ KR
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229



 Score = 29.1 bits (65), Expect = 2.3
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 28  KKKNKKKKKKKKKKKKKKKK 47
           K   KKK K KK KKK+ K+
Sbjct: 210 KYSYKKKLKSKKLKKKQAKR 229



 Score = 28.3 bits (63), Expect = 4.5
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 30  KNKKKKKKKKKKKKKKKKKKKK 51
           K+K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 28.3 bits (63), Expect = 4.8
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKK 49
           K K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 27.2 bits (60), Expect = 9.6
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 33  KKKKKKKKKKKKKKKKKKKKRR 54
           K K   KKK K KK KKK+ +R
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
           P P     ++K+K  KK  KK    KK+K+K+ 
Sbjct: 216 PTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248



 Score = 29.7 bits (67), Expect = 2.0
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
             +  +K K  KK  KK    KK+K+K+  
Sbjct: 220 SSSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249



 Score = 28.9 bits (65), Expect = 3.4
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
              ++K+K  KK  KK    KK+K++R 
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRS 248


>gnl|CDD|178452 PLN02861, PLN02861, long-chain-fatty-acid-CoA ligase.
          Length = 660

 Score = 30.2 bits (68), Expect = 1.5
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 100 NEQSNDIAIVLYTSGSTGIPK 120
            +Q  DI  ++YTSG+TG PK
Sbjct: 216 PKQKTDICTIMYTSGTTGEPK 236


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 29.7 bits (67), Expect = 1.6
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
           F  E+ K+      KK    +K + KKKK
Sbjct: 168 FAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast)
          and GNL3L (human) are putative GTPases which are
          required for growth and play a role in processing of
          nucleolar pre-rRNA. This family contains a potential
          nuclear localisation signal.
          Length = 80

 Score = 28.0 bits (63), Expect = 1.7
 Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 23 REKAKKK-KNKKKKKKKKKKKKKKKKKKKKK 52
          R K +KK     +K +K+ KK    K KKKK
Sbjct: 3  RYKIEKKVAEHHRKLRKEAKKNPTWKSKKKK 33


>gnl|CDD|215464 PLN02860, PLN02860, o-succinylbenzoate-CoA ligase.
          Length = 563

 Score = 30.2 bits (68), Expect = 1.7
 Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 157 TGDYGRIVDDI--LLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
           TGD G I D    L   GR++ ++K  G  V   EVEA LS+
Sbjct: 418 TGDIGWI-DKAGNLWLIGRSNDRIKTGGENVYPEEVEAVLSQ 458



 Score = 28.6 bits (64), Expect = 4.4
 Identities = 8/18 (44%), Positives = 13/18 (72%)

Query: 103 SNDIAIVLYTSGSTGIPK 120
            +D  ++ +TSG+TG PK
Sbjct: 171 PDDAVLICFTSGTTGRPK 188


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 9/32 (28%), Positives = 16/32 (50%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          R KAK+ + +K  K++ K        K + R+
Sbjct: 16 RLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47


>gnl|CDD|237144 PRK12582, PRK12582, acyl-CoA synthetase; Provisional.
          Length = 624

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query: 106 IAIVLYTSGSTGIPK 120
           +A  L+TSGSTG+PK
Sbjct: 222 VAKYLFTSGSTGMPK 236


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 29.6 bits (67), Expect = 1.8
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            +RE  KK K KKKK  K K  K   K KK+++R
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 15/33 (45%), Positives = 17/33 (51%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           E ++   KK K KKKK  K K  K   K KK R
Sbjct: 205 ELQRELMKKGKGKKKKIVKDKDGKVVYKWKKER 237


>gnl|CDD|235719 PRK06155, PRK06155, crotonobetaine/carnitine-CoA ligase;
           Provisional.
          Length = 542

 Score = 30.1 bits (68), Expect = 1.8
 Identities = 21/58 (36%), Positives = 25/58 (43%), Gaps = 7/58 (12%)

Query: 70  VLSLPQLRLLSASLSHNNIPGESMLH-------ADNNNEQSNDIAIVLYTSGSTGIPK 120
            L LP + LL A  S +   G S          A     Q  D A +LYTSG+TG  K
Sbjct: 139 DLPLPAVWLLDAPASVSVPAGWSTAPLPPLDAPAPAAAVQPGDTAAILYTSGTTGPSK 196


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 28.5 bits (64), Expect = 1.8
 Identities = 13/50 (26%), Positives = 23/50 (46%), Gaps = 5/50 (10%)

Query: 10  HHFDKVIIPAPFEREKAKKKKNKKKKKKKKK-----KKKKKKKKKKKKRR 54
           H   ++  P    +E+ K  +  KKK+KK +       KK   KK+++  
Sbjct: 56  HLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSS 105



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKK---KKKKKKKKKKRRILAKS 59
           +  P  + K ++K ++  KKK+KK +    +  KK   K+R  + S
Sbjct: 61  LQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSS 106


>gnl|CDD|181011 PRK07514, PRK07514, malonyl-CoA synthase; Validated.
          Length = 504

 Score = 29.8 bits (68), Expect = 1.9
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
           D+A +LYTSG+TG  K
Sbjct: 157 DLAAILYTSGTTGRSK 172


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 29.9 bits (68), Expect = 2.0
 Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
           EREKA  ++ + KK  + +  K K        R+ AK             + P
Sbjct: 463 EREKAA-REARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARP 514


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 28.0 bits (63), Expect = 2.0
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          E EKA++++  K+ K  K+++ ++K +K KK
Sbjct: 5  EEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 26.8 bits (60), Expect = 6.2
 Identities = 8/32 (25%), Positives = 21/32 (65%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          K++K ++K +++++ K+ K  K+++    L K
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32



 Score = 26.5 bits (59), Expect = 6.6
 Identities = 6/25 (24%), Positives = 18/25 (72%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKK 49
          K +K++ K +++++ K+ K  K+++
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREE 25


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 15/39 (38%), Positives = 28/39 (71%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           P  R+KAK+K  + K K KK++ +K+K+K+++  + LA+
Sbjct: 62  PEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAE 100



 Score = 28.1 bits (63), Expect = 3.7
 Identities = 11/37 (29%), Positives = 24/37 (64%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           + K K +++K K KK++ +K+K+K+++  K    A+ 
Sbjct: 67  KAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEK 103



 Score = 27.7 bits (62), Expect = 4.7
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
           KK K K ++ K K KK++ +K+K+K+ R +    E
Sbjct: 66  KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAE 100


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 29.7 bits (67), Expect = 2.1
 Identities = 13/42 (30%), Positives = 17/42 (40%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
           E  +AKK   + KKK      KKK+ KK         + E  
Sbjct: 596 EAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAERE 637



 Score = 28.2 bits (63), Expect = 6.9
 Identities = 11/34 (32%), Positives = 16/34 (47%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           ++ K K   +  KKK+ KK     K K   +R I
Sbjct: 605 KQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREI 638


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 30.0 bits (68), Expect = 2.1
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
            +  +K + K     ++K KK++KK+K K+++K RI
Sbjct: 645 NWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680



 Score = 29.6 bits (67), Expect = 2.8
 Identities = 17/56 (30%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN--PAYKGRTVLSLPQ 75
           E++K  K  +  ++K KK++KK+K K+++K+R  +A++ +    A   R  L + Q
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAETWDTIPLAQAIRGFLLVIQ 703


>gnl|CDD|162605 TIGR01923, menE, O-succinylbenzoate-CoA ligase.  This model
           represents an enzyme, O-succinylbenzoate-CoA ligase,
           which is involved in the fourth step of the menaquinone
           biosynthesis pathway. O-succinylbenzoate-CoA ligase,
           together with menB - naphtoate synthase, take
           2-succinylbenzoate and convert it into 1,4-di-hydroxy-2-
           naphtoate [Biosynthesis of cofactors, prosthetic groups,
           and carriers, Menaquinone and ubiquinone].
          Length = 436

 Score = 29.7 bits (67), Expect = 2.2
 Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 11/55 (20%)

Query: 66  KGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           K     SL ++              E+ L A  N +Q   IA +++TSG+TG PK
Sbjct: 84  KDFQADSLDRIEAAGRY--------ETSLSASFNMDQ---IATLMFTSGTTGKPK 127


>gnl|CDD|213290 cd05923, CBAL, 4-Chlorobenzoate-CoA ligase (CBAL).  CBAL catalyzes
           the conversion of 4-chlorobenzoate (4-CB) to
           4-chlorobenzoyl-coenzyme A (4-CB-CoA) by the two-step
           adenylation and thioester-forming reactions.
           4-Chlorobenzoate (4-CBA) is an environmental pollutant
           derived from microbial breakdown of aromatic pollutants,
           such as polychlorinated biphenyls (PCBs), DDT, and
           certain herbicides. The 4-CBA degrading pathway converts
           4-CBA to the metabolite 4-hydroxybezoate (4-HBA),
           allowing some soil-dwelling microbes to utilize 4-CBA as
           an alternate carbon source. This pathway consists of
           three chemical steps catalyzed by 4-CBA-CoA ligase,
           4-CBA-CoA dehalogenase, and 4HBA-CoA thioesterase in
           sequential reactions.
          Length = 495

 Score = 29.4 bits (66), Expect = 2.3
 Identities = 10/16 (62%), Positives = 12/16 (75%)

Query: 105 DIAIVLYTSGSTGIPK 120
             A V YTSG+TG+PK
Sbjct: 153 QPAFVFYTSGTTGLPK 168


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 28.8 bits (65), Expect = 2.3
 Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 7/57 (12%)

Query: 8   SEHHFDKVIIPAPFE-------REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
            E      ++P   E          A K+K K+   ++KK      +  +K R+ LA
Sbjct: 75  DESEAAPDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLA 131


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 29.6 bits (67), Expect = 2.3
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 29  KKNKKKKKKKKKKKKKKKKKKKKKRR 54
           KK KKK+K+K++   K +  KK+KR+
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRK 652



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 12/35 (34%), Positives = 22/35 (62%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
           ++ K K+K+ ++   K +  KK+K+K KK K  I+
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKNVII 661



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 16/43 (37%), Positives = 23/43 (53%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           E  KAK+ K  K KK  +  KK+K K++ K+   L K+    A
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAA 258



 Score = 28.5 bits (64), Expect = 5.8
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            KK+K K+K+K++   K +  KK+K+K
Sbjct: 626 IKKRKKKRKRKRRFLTKIEGVKKEKRK 652


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 28.8 bits (65), Expect = 2.3
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 31  NKKKKKKKKKKKKKKKKKKKK 51
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.4 bits (64), Expect = 3.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 32  KKKKKKKKKKKKKKKKKKKKK 52
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 4.0
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKK 45
           E +KK K K+ +K + KKKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKK 105



 Score = 27.3 bits (61), Expect = 7.0
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKK 47
              KK K K+ +K + KKKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKK 105



 Score = 27.3 bits (61), Expect = 8.4
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 29  KKNKKKKKKKKKKKKKKKKKK 49
              K K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 26.9 bits (60), Expect = 9.2
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 28  KKKNKKKKKKKKKKKKKKKKK 48
             K  K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105


>gnl|CDD|235134 PRK03584, PRK03584, acetoacetyl-CoA synthetase; Provisional.
          Length = 655

 Score = 29.4 bits (67), Expect = 2.4
 Identities = 8/12 (66%), Positives = 12/12 (100%)

Query: 109 VLYTSGSTGIPK 120
           +LY+SG+TG+PK
Sbjct: 268 ILYSSGTTGLPK 279


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
              F+ +   KK    +K+K  +K    K KK  K+ 
Sbjct: 342 DGAFDVDAYPKKAEAARKRKGDRKGVSHKAKKGGKKN 378



 Score = 28.3 bits (63), Expect = 5.1
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
             A  KK +  +K+K  +K    K KK  ++  A+
Sbjct: 347 VDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
          7 homolog A (Rrp7A) and similar proteins.  The family
          corresponds to the RRP7 domain of Rrp7A, also termed
          gastric cancer antigen Zg14, and similar proteins which
          are yeast ribosomal RNA-processing protein 7 (Rrp7p)
          homologs mainly found in Metazoans. The cellular
          function of Rrp7A remains unclear currently. Rrp7A
          harbors an N-terminal RNA recognition motif (RRM), also
          termed RBD (RNA binding domain) or RNP
          (ribonucleoprotein domain), and a C-terminal RRP7
          domain.
          Length = 129

 Score = 28.4 bits (64), Expect = 2.5
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          K ++ K  +K+    K  +K+KKKKKKK
Sbjct: 58 KGRRPKTARKESVAAKAAEKEKKKKKKK 85



 Score = 28.0 bits (63), Expect = 3.0
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          ++  + K  +K+    K  +K+KKKKKKK+
Sbjct: 57 KKGRRPKTARKESVAAKAAEKEKKKKKKKE 86



 Score = 27.6 bits (62), Expect = 4.8
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          KK +  K  +K+    K  +K+KKKK++
Sbjct: 57 KKGRRPKTARKESVAAKAAEKEKKKKKK 84


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 8/42 (19%), Positives = 15/42 (35%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
           ++KA  +      K   + K   +K    K    A + +  A
Sbjct: 216 KKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAA 257



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 8/36 (22%), Positives = 12/36 (33%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
           AK    K   + K   +K    K  +K      + E
Sbjct: 223 AKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258



 Score = 28.2 bits (63), Expect = 6.1
 Identities = 7/37 (18%), Positives = 13/37 (35%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
           ++  A+ K    K   + K   +K    K   +  A 
Sbjct: 217 KKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAA 253



 Score = 27.8 bits (62), Expect = 7.2
 Identities = 7/38 (18%), Positives = 11/38 (28%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
              K K     K    K   + K   +K    +   K+
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKA 250


>gnl|CDD|218396 pfam05039, Agouti, Agouti protein.  The agouti protein regulates
          pigmentation in the mouse hair follicle producing a
          black hair with a subapical yellow band. A highly
          homologous protein agouti signal protein (ASIP)is
          present in humans and is expressed at highest levels in
          adipose tissue where it may play a role in energy
          homeostasis and possibly human pigmentation.
          Length = 90

 Score = 27.5 bits (61), Expect = 2.5
 Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 6/56 (10%)

Query: 30 KNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSH 85
           NK  KK  K + ++KKK  KK+   L ++      +GR       +RL  + L  
Sbjct: 16 LNKGLKKFNKNEAEEKKKSSKKEAEALTEA------QGRPPPPRRCVRLHESCLGP 65


>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
          Length = 746

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            REKA K  NK+       + +++ ++K+K+  
Sbjct: 705 AREKALKDANKRAPIVAATQAREEVREKRKQTS 737


>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462).  This
          protein is highly conserved, but its function is
          unknown. It can be isolated from HeLa cell nucleoli and
          is found to be homologous with Leydig cell tumour
          protein whose function is unknown.
          Length = 82

 Score = 27.4 bits (61), Expect = 2.7
 Identities = 14/35 (40%), Positives = 16/35 (45%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          + A K K K     K  KKK+K  KK   R I  K
Sbjct: 3  QGALKVKAKGPAAVKAGKKKQKGPKKAGPRVIKPK 37


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 23/39 (58%), Positives = 24/39 (61%), Gaps = 6/39 (15%)

Query: 22  EREKAK--KKKNKKKKKKKKKKKKK----KKKKKKKKRR 54
           EREKAK  KK  KK KKKKKK+KK     KK  K K  R
Sbjct: 146 EREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGR 184


>gnl|CDD|217356 pfam03078, ATHILA, ATHILA ORF-1 family.  ATHILA is a group of
           Arabidopsis thaliana retrotransposons belonging to the
           Ty3/gypsy family of the long terminal repeat (LTR) class
           of eukaryotic retrotransposons. The central region of
           ATHILA retrotransposons contains two or three open
           reading frames (ORFs). This family represents the ORF1
           product. The function of ORF1 is unknown.
          Length = 456

 Score = 29.2 bits (65), Expect = 2.7
 Identities = 16/52 (30%), Positives = 27/52 (51%)

Query: 8   SEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           S +HF++ + PA   +  ++  KN  K +K  KK+ K   K  KK + + K 
Sbjct: 397 SMYHFEEHVPPARESKSLSEAHKNNSKLQKWCKKQDKLIAKCLKKIKFMKKK 448


>gnl|CDD|236363 PRK09029, PRK09029, O-succinylbenzoic acid--CoA ligase;
           Provisional.
          Length = 458

 Score = 29.5 bits (67), Expect = 2.7
 Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 21/75 (28%)

Query: 67  GRTVLSL-PQL--RLLSASLSHNNI------------PGESMLHADNNNEQSND------ 105
           G  VL L PQL   LL   L    +               + LH                
Sbjct: 77  GARVLPLNPQLPQPLLEELLPSLTLDFALVLEGENTFSALTSLHLQLVEGAHAVAWQPQR 136

Query: 106 IAIVLYTSGSTGIPK 120
           +A +  TSGSTG+PK
Sbjct: 137 LATMTLTSGSTGLPK 151


>gnl|CDD|213308 cd05943, AACS, Acetoacetyl-CoA synthetase (acetoacetate-CoA ligase,
           AACS).  AACS is a cytosolic ligase that specifically
           activates acetoacetate to its coenzyme A ester by a
           two-step reaction. Acetoacetate first reacts with ATP to
           form an acyl-adenylate intermediate, which then reacts
           with CoA to produce an acyl-CoA ester. This is the first
           step of the mevalonate pathway of isoprenoid
           biosynthesis via isopentenyl diphosphate. Isoprenoids
           are a large class of compounds found in all living
           organisms. AACS is widely distributed in bacteria,
           archaea and eukaryotes. In bacteria, AACS is known to
           exhibit an important role in the metabolism of
           poly-b-hydroxybutyrate, an intracellular reserve of
           organic carbon and chemical energy by some
           microorganisms. In mammals, AACS influences the rate of
           ketone body utilization for the formation of
           physiologically important fatty acids and cholesterol.
          Length = 616

 Score = 29.5 bits (67), Expect = 2.7
 Identities = 8/12 (66%), Positives = 12/12 (100%)

Query: 109 VLYTSGSTGIPK 120
           +LY+SG+TG+PK
Sbjct: 241 ILYSSGTTGLPK 252


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 29.5 bits (66), Expect = 2.7
 Identities = 16/30 (53%), Positives = 19/30 (63%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           E   ++K+KKKKKK  K KKK    KK KR
Sbjct: 234 EDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263



 Score = 27.6 bits (61), Expect = 8.9
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           E E+ K ++     KK KK KK K KK    + 
Sbjct: 322 ESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 11/36 (30%), Positives = 22/36 (61%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           E+ +++  ++   K KKKK+KKK+  +K R  ++  
Sbjct: 413 EEIREELIEEGLLKSKKKKRKKKEWFEKFRWFVSSD 448


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 29.4 bits (66), Expect = 2.8
 Identities = 10/39 (25%), Positives = 21/39 (53%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
                    E +K   K+++   + K++K++K+KK KK+
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQ 158


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 29.4 bits (67), Expect = 3.0
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 32  KKKKKKKKKKKKKKKKKKKKK 52
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 29.0 bits (66), Expect = 3.7
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 27  KKKKNKKKKKKKKKKKKKKKK 47
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 29.0 bits (66), Expect = 4.1
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 30  KNKKKKKKKKKKKKKKKKKKK 50
           + ++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.7 bits (65), Expect = 4.3
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 34  KKKKKKKKKKKKKKKKKKKRRI 55
           +K++ KKKKK+KK KK +  +I
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754



 Score = 28.3 bits (64), Expect = 5.6
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 29  KKNKKKKKKKKKKKKKKKKKK 49
           +K + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 27.9 bits (63), Expect = 7.5
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 28  KKKNKKKKKKKKKKKKKKKKK 48
           +K+  KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 27.9 bits (63), Expect = 8.8
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 33  KKKKKKKKKKKKKKKKKKKKR 53
           +K++ KKKKK+KK KK + ++
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|234298 TIGR03659, IsdE, heme ABC transporter, heme-binding protein isdE.
           This family of ABC substrate-binding proteins is
           observed primarily in close proximity with proteins
           localized to the cell wall and bearing the NEAT (NEAr
           Transporter, pfam05031) heme-binding domain. IsdE has
           been shown to bind heme and is involved in the process
           of scavenging heme for the purpose of obtaining iron.
          Length = 289

 Score = 29.2 bits (66), Expect = 3.0
 Identities = 15/36 (41%), Positives = 22/36 (61%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
           + RE+  +K  K+  +K+ + KKK K KKK K  IL
Sbjct: 138 YGREEQAEKLVKEINEKEAEVKKKVKGKKKPKVLIL 173


>gnl|CDD|235146 PRK03640, PRK03640, O-succinylbenzoic acid--CoA ligase;
           Provisional.
          Length = 483

 Score = 29.2 bits (66), Expect = 3.0
 Identities = 9/17 (52%), Positives = 15/17 (88%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +++A ++YTSG+TG PK
Sbjct: 141 DEVATIMYTSGTTGKPK 157


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 28.7 bits (64), Expect = 3.1
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 26  AKKKKNKKKKKKKKKKKKKKK 46
           +  ++NK+KKKKK KKKK  K
Sbjct: 84  SPNQENKQKKKKKDKKKKSPK 104


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 29.4 bits (67), Expect = 3.1
 Identities = 13/49 (26%), Positives = 22/49 (44%)

Query: 4   NLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            L I     D       FE +K ++ +   K+  +  KK + +KKK+ K
Sbjct: 158 ILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSK 206



 Score = 28.3 bits (64), Expect = 7.2
 Identities = 10/42 (23%), Positives = 19/42 (45%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
            + + AK+     KK + +KKK+ K    K+    + K  + 
Sbjct: 182 EDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDK 223



 Score = 28.3 bits (64), Expect = 7.4
 Identities = 14/58 (24%), Positives = 27/58 (46%)

Query: 11  HFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
            F+        E+ +A+K++  +K  K+  +  KK + +KKK+       E P   G+
Sbjct: 162 DFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGK 219


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.9 bits (65), Expect = 3.1
 Identities = 8/32 (25%), Positives = 29/32 (90%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           E +K K+++ K++++++K++KK+++++++K++
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 9/31 (29%), Positives = 27/31 (87%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
           KK K +++K++++++K++KK++++++R+  A
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEERKQKA 220



 Score = 28.2 bits (63), Expect = 5.9
 Identities = 8/34 (23%), Positives = 28/34 (82%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           E E  K K+ ++K++++++K++KK+++++++++ 
Sbjct: 186 EWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219



 Score = 27.4 bits (61), Expect = 8.4
 Identities = 10/38 (26%), Positives = 28/38 (73%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
           ++  ++KK +K  ++K+K++++K++++ + R+ LAK  
Sbjct: 96  KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 28.5 bits (64), Expect = 3.2
 Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 5/42 (11%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKK-----KKKKRRILAK 58
           E+ K +     KKK K +K  K+ +KK      K    +L K
Sbjct: 157 EKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198



 Score = 27.7 bits (62), Expect = 5.6
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            K ++K+ ++     KKK K +K  K+ +++ LAK
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAK 187


>gnl|CDD|132252 TIGR03208, cyc_hxne_CoA_lg, cyclohexanecarboxylate-CoA ligase.
           Members of this protein family are
           cyclohexanecarboxylate-CoA ligase. This enzyme prepares
           the aliphatic ring compound, cyclohexanecarboxylate, for
           dehydrogenation and then degradation by a pathway also
           used in benzoyl-CoA degradation in Rhodopseudomonas
           palustris.
          Length = 538

 Score = 29.1 bits (65), Expect = 3.2
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           +D+  ++YTSG+TG PK
Sbjct: 195 DDVTQLIYTSGTTGEPK 211


>gnl|CDD|240407 PTZ00412, PTZ00412, leucyl aminopeptidase; Provisional.
          Length = 569

 Score = 29.2 bits (65), Expect = 3.2
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
           F   K  K+K  KK +  KK  K  +KK+  +RR
Sbjct: 534 FRHNKPPKRKGIKKAQNAKKDLKVVQKKRHGRRR 567



 Score = 28.0 bits (62), Expect = 8.3
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
           P +R+  KK +N KK  K  +KK+  +++K+
Sbjct: 539 PPKRKGIKKAQNAKKDLKVVQKKRHGRRRKQ 569


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
          7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
          (Rrp7A), and similar proteins.  This CD corresponds to
          the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
          YCL031C gene from Saccharomyces cerevisiae. It is an
          essential yeast protein involved in pre-rRNA processing
          and ribosome assembly, and is speculated to be required
          for correct assembly of rpS27 into the pre-ribosomal
          particle. Rrp7A, also termed gastric cancer antigen
          Zg14, is the Rrp7p homolog mainly found in Metazoans.
          The cellular function of Rrp7A remains unclear
          currently. Both Rrp7p and Rrp7A harbor an N-terminal
          RNA recognition motif (RRM), also termed RBD (RNA
          binding domain) or RNP (ribonucleoprotein domain), and
          a C-terminal RRP7 domain.
          Length = 118

 Score = 28.0 bits (63), Expect = 3.2
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            +K K  +++  + K K+K+KKKKKKK
Sbjct: 59 GGRKGKTAREEAVEAKAKEKEKKKKKKK 86



 Score = 27.2 bits (61), Expect = 4.8
 Identities = 16/41 (39%), Positives = 26/41 (63%), Gaps = 7/41 (17%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKK-------KKKKKKK 52
               RE+A + K K+K+KKKKKKK+ +       ++KKK++
Sbjct: 63  GKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103



 Score = 26.8 bits (60), Expect = 7.1
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          R   K K  +++  + K K+K+KKKKKKK+
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKE 87



 Score = 26.8 bits (60), Expect = 7.5
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
          +  K   ++  + K K+K+KKKKKKK+ 
Sbjct: 61 RKGKTAREEAVEAKAKEKEKKKKKKKEL 88



 Score = 26.8 bits (60), Expect = 8.2
 Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 6/39 (15%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKK------KKKKKKKK 50
            A  E  +AK K+ +KKKKKKK+ +       ++KKK++
Sbjct: 65  TAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 29.0 bits (66), Expect = 3.3
 Identities = 15/31 (48%), Positives = 25/31 (80%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           E  +A+K+ NK  +KK+KKKKK+K+K+++ K
Sbjct: 608 ELIEARKRLNKANEKKEKKKKKQKEKQEELK 638



 Score = 28.6 bits (65), Expect = 4.5
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           KA +    +K+  K  +KK+KKKKK+K+
Sbjct: 605 KAHELIEARKRLNKANEKKEKKKKKQKE 632



 Score = 28.3 bits (64), Expect = 7.0
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 28  KKKNKKKKKKKKKKKKKKKKKKKKKRR 54
              + +K +K KKKKKKK K  K K R
Sbjct: 675 PLSDLEKIQKPKKKKKKKPKTVKPKPR 701


>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14
          and HMG 17. 
          Length = 88

 Score = 27.2 bits (60), Expect = 3.3
 Identities = 21/52 (40%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
          PAP + E  K KK   KKK +    K KK  K K+     K   NPA  G T
Sbjct: 27 PAPPKPE-PKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEENNPAENGET 77


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 29.0 bits (65), Expect = 3.3
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            R KAKK+K  KK+K+  K++    K   K 
Sbjct: 350 ARVKAKKEKLAKKRKQVSKEEAAAIKAAGKA 380



 Score = 27.8 bits (62), Expect = 9.0
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
           E  + K KK K  KK+K+  K++    K 
Sbjct: 348 EAARVKAKKEKLAKKRKQVSKEEAAAIKA 376


>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
          Length = 620

 Score = 29.1 bits (66), Expect = 3.4
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
             K  KKK+ KK           KK+KKK+++
Sbjct: 589 PPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620



 Score = 28.4 bits (64), Expect = 6.5
 Identities = 12/39 (30%), Positives = 19/39 (48%)

Query: 10  HHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
           +    +I P   +++  KK  N        KK+KKK+KK
Sbjct: 582 NGPKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620



 Score = 28.0 bits (63), Expect = 8.3
 Identities = 11/32 (34%), Positives = 15/32 (46%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           P ++ K K  K            KK+KKK+KK
Sbjct: 589 PPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 28.0 bits (63), Expect = 3.5
 Identities = 6/26 (23%), Positives = 11/26 (42%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKK 50
             +   N   K + KKK +    K++
Sbjct: 106 LKRVAANDAIKAEAKKKGELPSTKRQ 131


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 27.7 bits (62), Expect = 3.5
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
          K ++KK  K        K KKKK  K K +
Sbjct: 4  KKQQKKAAKAAAASAGGKAKKKKWSKGKVK 33



 Score = 26.6 bits (59), Expect = 7.2
 Identities = 12/34 (35%), Positives = 15/34 (44%)

Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           P +++K   K        K KKKK  K K K K
Sbjct: 2  PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 29.3 bits (65), Expect = 3.6
 Identities = 11/47 (23%), Positives = 28/47 (59%)

Query: 22   EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
            ER K +++K + +  ++ K+++  KK+K++++RI     +N     +
Sbjct: 1024 ERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAK 1070


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 29.0 bits (65), Expect = 3.6
 Identities = 13/49 (26%), Positives = 31/49 (63%), Gaps = 4/49 (8%)

Query: 17 IPAPFEREKAKKKKNKKKKKKKKKK----KKKKKKKKKKKRRILAKSCE 61
          +P   E E+ ++KK +++K   K++    +++K++KKKK +++   + E
Sbjct: 35 VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83


>gnl|CDD|213294 cd05928, MACS_euk, Eukaryotic Medium-chain acyl-CoA synthetase
           (MACS or ACSM).  MACS catalyzes the two-step activation
           of medium chain fatty acids (containing 4-12 carbons).
           The carboxylate substrate first reacts with ATP to form
           an acyl-adenylate intermediate, which then reacts with
           CoA to produce an acyl-CoA ester. The acyl-CoA is a key
           intermediate in many important biosynthetic and
           catabolic processes. MACS enzymes are localized to
           mitochondria. Two murine MACS family proteins are found
           in liver and kidney. In rodents, a MACS member is
           detected particularly in the olfactory epithelium and is
           called O-MACS. O-MACS demonstrates substrate preference
           for the fatty acid lengths of C6-C12.
          Length = 530

 Score = 29.0 bits (65), Expect = 3.7
 Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
           Y TGD G + +D    + GR D  +   G R+   EVE+AL
Sbjct: 404 YITGDRGIMDEDGYFWFVGRADDVINSSGYRIGPFEVESAL 444


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 28.5 bits (64), Expect = 3.7
 Identities = 8/30 (26%), Positives = 19/30 (63%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           + + ++KK   K++KK  K +K+K +++  
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124



 Score = 27.3 bits (61), Expect = 8.6
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           F+ +K +KK   K++KK  K +K+K +++  
Sbjct: 94  FKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|241245 cd01209, PTB_Shc, Shc-like phosphotyrosine-binding (PTB) domain. 
          Shc is a substrate for receptor tyrosine kinases, which
          can interact with phosphoproteins at NPXY motifs. Shc
          contains an PTB domain followed by an SH2 domain. PTB
          domains have a common PH-like fold and are found in
          various eukaryotic signaling molecules. This domain was
          initially shown to binds peptides with a NPXY motif
          with differing requirements for phosphorylation of the
          tyrosine, although more recent studies have found that
          some types of PTB domains can bind to peptides lack
          tyrosine residues altogether. In contrast to SH2
          domains, which recognize phosphotyrosine and adjacent
          carboxy-terminal residues, PTB-domain binding
          specificity is conferred by residues amino-terminal to
          the phosphotyrosine.  PTB domains are classified into
          three groups: phosphotyrosine-dependent Shc-like,
          phosphotyrosine-dependent IRS-like, and
          phosphotyrosine-independent Dab-like PTB domains. This
          cd is part of the Shc-like subgroup.
          Length = 170

 Score = 28.3 bits (64), Expect = 3.7
 Identities = 11/25 (44%), Positives = 12/25 (48%)

Query: 40 KKKKKKKKKKKKKRRILAKSCENPA 64
           K  KKK+K K    IL KS    A
Sbjct: 57 AKGAKKKRKSKALSSILGKSNLQFA 81


>gnl|CDD|213304 cd05938, hsFATP2a_ACSVL_like, Fatty acid transport proteins (FATP)
           including hsFATP2, hsFATP5, and hsFATP6, and similar
           proteins.  Fatty acid transport proteins (FATP) of this
           family transport long-chain or very-long-chain fatty
           acids across the plasma membrane. At least five copies
           of FATPs are identified in mammalian cells. This family
           includes hsFATP2, hsFATP5, and hsFATP6, and similar
           proteins. Each FATP has unique patterns of tissue
           distribution. These FATPs also have fatty acid CoA
           synthetase activity, thus playing dual roles as fatty
           acid transporters and its activation enzymes. The hsFATP
           proteins exist in two splice variants; the b variant,
           lacking exon 3, has no acyl-CoA synthetase activity.
           FATPs are key players in the trafficking of exogenous
           fatty acids into the cell and in intracellular fatty
           acid homeostasis.
          Length = 535

 Score = 28.9 bits (65), Expect = 3.7
 Identities = 9/16 (56%), Positives = 13/16 (81%)

Query: 105 DIAIVLYTSGSTGIPK 120
             A+ +YTSG+TG+PK
Sbjct: 143 STALYIYTSGTTGLPK 158


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 28.5 bits (64), Expect = 3.8
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
           AK    KK  KK    K KK   KK   +  AK+
Sbjct: 257 AKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKT 290



 Score = 28.5 bits (64), Expect = 4.3
 Identities = 12/31 (38%), Positives = 13/31 (41%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
           K  KK    K KK   KK   KK  K K+  
Sbjct: 264 KPAKKATAAKAKKTTAKKAAAKKAAKTKKAA 294



 Score = 27.7 bits (62), Expect = 8.6
 Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           ++ AKK    K KK   KK   KK  K KK
Sbjct: 263 KKPAKKATAAKAKKTTAKKAAAKKAAKTKK 292


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 28.8 bits (65), Expect = 3.9
 Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 14/76 (18%)

Query: 7   ISEHHFDKVIIPAPFER-EKAKKKK-------------NKKKKKKKKKKKKKKKKKKKKK 52
           + E +F++  +    ER +KAK KK                KK   K K KKKKKK    
Sbjct: 313 VKEKNFNEERVEKYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGAS 372

Query: 53  RRILAKSCENPAYKGR 68
               +++ +     G+
Sbjct: 373 AVPKSETSQEAKSSGK 388


>gnl|CDD|236097 PRK07788, PRK07788, acyl-CoA synthetase; Validated.
          Length = 549

 Score = 28.7 bits (65), Expect = 4.1
 Identities = 9/13 (69%), Positives = 11/13 (84%)

Query: 108 IVLYTSGSTGIPK 120
           IV+ TSG+TG PK
Sbjct: 211 IVILTSGTTGTPK 223


>gnl|CDD|213327 cd12119, ttLC_FACS_AlkK_like, Fatty acyl-CoA synthetases similar to
           LC-FACS from Thermus thermophiles.  This family includes
           fatty acyl-CoA synthetases that can activate
           medium-chain to long-chain fatty acids. They catalyze
           the ATP-dependent acylation of fatty acids in a two-step
           reaction. The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. The fatty
           acyl-CoA synthetases are responsible for fatty acid
           degradation as well as physiological regulation of
           cellular functions via the production of fatty acyl-CoA
           esters. The fatty acyl-CoA synthetase from Thermus
           thermophiles in this family was shown catalyzing the
           long-chain fatty acid, myristoyl acid, while another
           member in this family, the AlkK protein identified from
           Pseudomonas oleovorans, targets medium chain fatty
           acids. This family also includes uncharacterized FACS
           proteins.
          Length = 517

 Score = 28.7 bits (65), Expect = 4.2
 Identities = 10/17 (58%), Positives = 12/17 (70%)

Query: 104 NDIAIVLYTSGSTGIPK 120
           N  A + YTSG+TG PK
Sbjct: 162 NTAAGLCYTSGTTGNPK 178


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 28.2 bits (63), Expect = 4.3
 Identities = 13/48 (27%), Positives = 19/48 (39%)

Query: 16 IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
          I P   E    K+ K K+    +K    KK KKK K++    +     
Sbjct: 38 IFPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPES 85


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 27.5 bits (61), Expect = 4.3
 Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 10/49 (20%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRI----------LAKSCENPA 64
            K KK   ++ +KK  +KK+KK      R           L  SC+ P 
Sbjct: 52  KKPKKISAEEAEKKLLQKKEKKALTNVLRPEPRSPRRCVRLRDSCKGPQ 100



 Score = 27.5 bits (61), Expect = 5.0
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          K  KK + ++ +KK  +KK+KK      R
Sbjct: 52 KKPKKISAEEAEKKLLQKKEKKALTNVLR 80


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is
          a small protein module with a triple-stranded
          beta-sheet fold. This is a family of WW domain binding
          proteins.
          Length = 78

 Score = 26.9 bits (60), Expect = 4.5
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          K +KKK  KK K +++ +++ K  KK   R+  +
Sbjct: 12 KEQKKKELKKNKAERQARREAKLAKKNPDRLQRE 45


>gnl|CDD|223952 COG1021, EntE, Peptide arylation enzymes [Secondary metabolites
           biosynthesis, transport, and catabolism].
          Length = 542

 Score = 28.6 bits (64), Expect = 4.5
 Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 1/47 (2%)

Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
           Y++GD  R   D  L+ EGR   Q+   G ++   EVE  L +   V
Sbjct: 416 YRSGDLVRRDPDGYLVVEGRVKDQINRGGEKIAAEEVENLLLRHPAV 462


>gnl|CDD|183506 PRK12406, PRK12406, long-chain-fatty-acid--CoA ligase; Provisional.
          Length = 509

 Score = 28.5 bits (64), Expect = 4.5
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 109 VLYTSGSTGIPKEYRR 124
           ++YTSG+TG PK  RR
Sbjct: 157 MIYTSGTTGHPKGVRR 172


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 28.0 bits (63), Expect = 4.6
 Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 19  APFEREKAKKKKNKKKKKKK--KKKKKKKKKKKKKKRRILAK 58
            P+  +K++ KK   KKK    KK K+  K  +KK+++   K
Sbjct: 124 KPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGK 165


>gnl|CDD|213321 cd05974, MACS_like_1, Uncharacterized subfamily of medium-chain
           acyl-CoA synthetase (MACS).  MACS catalyzes the two-step
           activation of medium chain fatty acids (containing 4-12
           carbons). The carboxylate substrate first reacts with
           ATP to form an acyl-adenylate intermediate, which then
           reacts with CoA to produce an acyl-CoA ester. MACS
           enzymes are localized to mitochondria.
          Length = 433

 Score = 28.5 bits (64), Expect = 4.7
 Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
           Y+TGD     +D  L + GR D   K    R+   EVE+AL
Sbjct: 310 YRTGDKAYRDEDGYLWFVGRADDVFKSSDYRISPFEVESAL 350


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361). 
          This is a region of 120 amino acids that is conserved
          in a family of proteins found from plants to fungi. The
          function is not known.
          Length = 114

 Score = 27.2 bits (61), Expect = 4.8
 Identities = 10/36 (27%), Positives = 21/36 (58%)

Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
          FER+KA ++  + +KK ++   +K+ K  K++    
Sbjct: 59 FERKKATRRLKQLEKKLEETSDEKEIKDLKEQLSKA 94


>gnl|CDD|237791 PRK14701, PRK14701, reverse gyrase; Provisional.
          Length = 1638

 Score = 28.7 bits (64), Expect = 4.8
 Identities = 16/69 (23%), Positives = 30/69 (43%)

Query: 2   NINLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
           NI+ ++    F + II   ++    KK+ N +   +K++   K+ +K   K   L  +  
Sbjct: 218 NIDRSLQLLGFYEEIIEKAWKIIYLKKQGNIEDAMEKREILNKEIEKIGNKIGCLIVASA 277

Query: 62  NPAYKGRTV 70
               KG  V
Sbjct: 278 TGKAKGDRV 286


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
          identified as the binding site of the SRP72 protein to
          SRP RNA.
          Length = 57

 Score = 25.9 bits (57), Expect = 4.9
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          +AKK K   KKK   ++KKKK  KKK+K ++
Sbjct: 1  EAKKPKKIAKKKPGTEQKKKKSAKKKRKPKL 31



 Score = 25.9 bits (57), Expect = 5.5
 Identities = 15/30 (50%), Positives = 19/30 (63%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
          E +K KK   KK   ++KKKK  KKK+K K
Sbjct: 1  EAKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 28.8 bits (65), Expect = 4.9
 Identities = 11/43 (25%), Positives = 17/43 (39%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
             EK KK+K  K + +  K    K  +K K   +   +    A
Sbjct: 764 YTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAA 806


>gnl|CDD|227314 COG4981, COG4981, Enoyl reductase domain of yeast-type FAS1 [Lipid
           metabolism].
          Length = 717

 Score = 28.7 bits (64), Expect = 5.1
 Identities = 9/35 (25%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 189 EVEAALSKLSEV--DKLSVLCYKPGEVDQVSEVTS 221
            +E A+  + E+  D    + +KPG ++Q+  V  
Sbjct: 135 SLEEAVELIEELGDDGFPYVAFKPGTIEQIRSVIR 169



 Score = 28.7 bits (64), Expect = 5.1
 Identities = 10/30 (33%), Positives = 17/30 (56%), Gaps = 2/30 (6%)

Query: 252 EVEAALSKISEV--DKLSVLCYKPGEVDQI 279
            +E A+  I E+  D    + +KPG ++QI
Sbjct: 135 SLEEAVELIEELGDDGFPYVAFKPGTIEQI 164


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 28.2 bits (63), Expect = 5.1
 Identities = 10/34 (29%), Positives = 24/34 (70%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
           RE  K KK + ++K++++++K  +++K ++  IL
Sbjct: 149 RELEKIKKERAEEKEREEEEKAAEEEKAREEEIL 182


>gnl|CDD|189016 cd09609, M3B_PepF_4, Peptidase family M3B Oligopeptidase F (PepF). 
           Peptidase family M3B Oligopeptidase F (PepF;
           Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
           includes oligoendopeptidase F from Lactococcus lactis.
           This enzyme hydrolyzes peptides containing between 7 and
           17 amino acids with fairly broad specificity. The PepF
           gene is duplicated in L. lactis on the plasmid that
           bears it, while a shortened second copy is found in
           Bacillus subtilis. Most bacterial PepFs are cytoplasmic
           endopeptidases; however, the PepF Bacillus
           amyloliquefaciens oligopeptidase is a secreted protein
           and may facilitate the process of sporulation.
           Specifically, the yjbG gene encoding the homolog of the
           PepF1 and PepF2 oligoendopeptidases of Lactococcus
           lactis has been identified in Bacillus subtilis as an
           inhibitor of sporulation initiation when over expressed
           from a multicopy plasmid.
          Length = 586

 Score = 28.7 bits (65), Expect = 5.1
 Identities = 26/117 (22%), Positives = 44/117 (37%), Gaps = 25/117 (21%)

Query: 151 YRRMYKTGDYGRIVDDILLYEG-RTDSQVKIRGNRVD--LSEVEAALSKL-SEVDKLSVL 206
              + +   Y       L      TD + + R  + D   +++ A LS   SE+  L   
Sbjct: 61  LELISRLSHYAS-----LPQSTDGTDPENQARAAKFDSLAAKINAKLSFFESELLALDE- 114

Query: 207 CYKPGEVDQVSEVTSEEIRGVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEV 263
                 +++V E   E    ++D+L  +  T S            EVE AL+ +S V
Sbjct: 115 ----KILEEVIEELPEYAAYLEDLLRKKPHTLS-----------PEVEEALAALSPV 156


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 27.6 bits (62), Expect = 5.3
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKK 44
           PF +++ +KK+   K +K++ K KK
Sbjct: 140 PFAKKRREKKERVAKNEKRELKNKK 164


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 28.2 bits (63), Expect = 5.4
 Identities = 10/39 (25%), Positives = 23/39 (58%)

Query: 19  APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
                E  + + +++ K++ +KK+K KKK +K+K   ++
Sbjct: 335 MGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVS 373



 Score = 27.8 bits (62), Expect = 7.2
 Identities = 11/40 (27%), Positives = 22/40 (55%)

Query: 12  FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
           F   +     + E    +++K++ +KK+K KKK +K+K  
Sbjct: 331 FLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 23  REKAKKKKNKKKKKKKKKKKKKKKKKK 49
           RE+      K+  K++ KK + K K+ 
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEF 550


>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897).  This
          is a bacterial family of uncharacterized proteins.
          Length = 55

 Score = 25.7 bits (57), Expect = 5.5
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
          AK K  +  K+KKKK K+  ++  K   
Sbjct: 27 AKMKFPQFDKQKKKKAKRNAEQDAKNSA 54


>gnl|CDD|223747 COG0675, COG0675, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 364

 Score = 28.1 bits (62), Expect = 5.6
 Identities = 9/38 (23%), Positives = 22/38 (57%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            ++  ++KK   K++KK K K  + +++ + +R+   K
Sbjct: 212 AQKRLSRKKSRSKRRKKAKLKLARLRERIRNRRKDFHK 249


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 27.8 bits (62), Expect = 5.7
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 31  NKKKKKKKKKKKKKKKKKKKKKR 53
           + K  K   KK KK KKKKKKKR
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKR 216



 Score = 27.8 bits (62), Expect = 5.9
 Identities = 13/23 (56%), Positives = 14/23 (60%)

Query: 24  EKAKKKKNKKKKKKKKKKKKKKK 46
            K  K   KK KK KKKKKKK+ 
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217



 Score = 27.8 bits (62), Expect = 6.0
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
          E  K K NK K   +   K   + K  +K ++L K+  
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQM 54



 Score = 27.5 bits (61), Expect = 7.6
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 29  KKNKKKKKKKKKKKKKKKKKKK 50
           K +K   KK KK KKKKKKK+ 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217



 Score = 27.5 bits (61), Expect = 8.7
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 30  KNKKKKKKKKKKKKKKKKKKKK 51
           K  K   KK KK KKKKKKK+ 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217



 Score = 27.1 bits (60), Expect = 9.9
 Identities = 12/36 (33%), Positives = 16/36 (44%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
          E EK K  K+K   +   K   + K  +K K  I A
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKA 52


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 28.3 bits (63), Expect = 5.7
 Identities = 15/49 (30%), Positives = 23/49 (46%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
            K  + A   +    KK N      KKK K+KK+KK+  +  +  +S E
Sbjct: 198 GKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEE 246


>gnl|CDD|169098 PRK07786, PRK07786, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 542

 Score = 28.2 bits (63), Expect = 5.7
 Identities = 9/14 (64%), Positives = 13/14 (92%)

Query: 107 AIVLYTSGSTGIPK 120
           A+++YTSG+TG PK
Sbjct: 177 ALIMYTSGTTGRPK 190


>gnl|CDD|193580 cd09891, NGN_Bact_1, Bacterial N-Utilization Substance G (NusG)
          N-terminal (NGN) domain, subgroup 1.  The N-Utilization
          Substance G (NusG) protein is involved in transcription
          elongation and termination in bacteria. NusG is
          essential in Escherichia coli and associates with RNA
          polymerase elongation and Rho-termination. Homologs of
          the NusG gene exist in all bacteria. The NusG
          N-terminal domain (NGN) is similar in all NusG
          homologs, but its C-terminal domain and the linker that
          separates these two domains are different. The domain
          organization of NusG suggests that the common
          properties of NusG and its homologs are due to their
          similar NGN domains.
          Length = 107

 Score = 27.1 bits (61), Expect = 5.7
 Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 4/32 (12%)

Query: 9  EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKK 40
          E +  +V++P     E+  + KN KKK K++K
Sbjct: 29 EDYIGEVLVPT----EEVVEVKNGKKKVKERK 56


>gnl|CDD|236215 PRK08276, PRK08276, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 502

 Score = 28.3 bits (64), Expect = 5.8
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query: 110 LYTSGSTGIPKEYRR 124
           LY+SG+TG PK  +R
Sbjct: 146 LYSSGTTGRPKGIKR 160


>gnl|CDD|180289 PRK05851, PRK05851, long-chain-fatty-acid--[acyl-carrier-protein]
           ligase; Validated.
          Length = 525

 Score = 28.2 bits (63), Expect = 6.0
 Identities = 14/47 (29%), Positives = 22/47 (46%)

Query: 145 GDRAIKYRRMYKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVE 191
           G   I     + TGD G +VD  L+  GR    + + G  +  +E+E
Sbjct: 388 GQAPIDPDDWFPTGDLGYLVDGGLVVCGRAKELITVAGRNIFPTEIE 434


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
          structure and biogenesis].
          Length = 67

 Score = 26.1 bits (58), Expect = 6.2
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          ERE  +K   K+K+KK   +K+K K+ +K+++R 
Sbjct: 34 EREFYEKPSEKRKRKKAAARKRKFKRLRKEQQRE 67


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 28.4 bits (63), Expect = 6.2
 Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 20  PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG-RTVLSLPQLRL 78
           P  + + +  K+KK KKKK  KKK+  K +    + + +     AY+    V+ +P+ +L
Sbjct: 112 PIWKNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSRVIPIPRNKL 171


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 27.8 bits (62), Expect = 6.3
 Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 6/47 (12%)

Query: 35  KKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSA 81
             +KK+K+KK++ K +KK+ +L K      Y G   L  P   LL  
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEK------YGGEEHLDKPPKELLLG 200


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 27.7 bits (62), Expect = 6.3
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
            ER +   +K K + KKK  K  KK +  
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRID 174



 Score = 27.3 bits (61), Expect = 8.5
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 35  KKKKKKKKKKKKKKKKKKRRI 55
            +KKK + KKK  K  KK RI
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRI 173



 Score = 27.3 bits (61), Expect = 9.3
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 32  KKKKKKKKKKKKKKKKKKKKKRRI 55
            +KKK + KKK  K  KK +   +
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDLV 176


>gnl|CDD|131210 TIGR02155, PA_CoA_ligase, phenylacetate-CoA ligase.
           Phenylacetate-CoA ligase (PA-CoA ligase) catalyzes the
           first step in aromatic catabolism of phenylacetic acid
           (PA) into phenylacetyl-CoA (PA-CoA). Often located in a
           conserved gene cluster with enzymes involved in
           phenylacetic acid activation (paaG/H/I/J),
           phenylacetate-CoA ligase has been found among the
           proteobacteria as well as in gram positive prokaryotes.
           In the B-subclass proteobacterium Azoarcus evansii,
           phenylacetate-CoA ligase has been shown to be induced
           under aerobic and anaerobic growth conditions. It
           remains unclear however, whether this induction is due
           to the same enzyme or to another isoenzyme restricted to
           specific anaerobic growth conditions [Energy metabolism,
           Other].
          Length = 422

 Score = 28.2 bits (63), Expect = 6.3
 Identities = 21/79 (26%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 186 DLSEVEAALSKLSEVDKLSVLCYKPGEVDQVSEVTSEEIRGVDDILLYEGRTDSQVKIRG 245
           D  E E   + L++ + L V+ Y+  ++ ++   T+  +R +D I    GR+D  + IRG
Sbjct: 270 DGEEGELVFTTLTK-EALPVIRYRTRDLTRLLPGTARTMRRMDRI---TGRSDDMLIIRG 325

Query: 246 NRVDLSEVEAALSKISEVD 264
             V  +++E  + K+ E+ 
Sbjct: 326 VNVFPTQLEEVILKMDELS 344


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
          family of proteins is found in bacteria and viruses.
          Proteins in this family are typically between 180 and
          214 amino acids in length.
          Length = 125

 Score = 27.2 bits (61), Expect = 6.5
 Identities = 11/37 (29%), Positives = 21/37 (56%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
          E+ K +KK+ +KK + +K  K   ++K + +   L K
Sbjct: 20 EKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEK 56



 Score = 26.8 bits (60), Expect = 7.9
 Identities = 8/37 (21%), Positives = 21/37 (56%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            +K ++KK++ +K  K   ++K + + +K  + L +
Sbjct: 24 WEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
           initially from a Japanese patient with hepatitis of
           unknown aetiology, has since been found to infect both
           healthy and diseased individuals and numerous prevalence
           studies have raised questions about its role in
           unexplained hepatitis. ORF1 is a large 750 residue
           protein. The N-terminal half of this protein corresponds
           to the capsid protein.
          Length = 525

 Score = 28.0 bits (63), Expect = 6.5
 Identities = 8/23 (34%), Positives = 10/23 (43%)

Query: 33  KKKKKKKKKKKKKKKKKKKKRRI 55
           K K      K + K KK  K +I
Sbjct: 189 KHKILIPSLKTRPKGKKYVKIKI 211



 Score = 28.0 bits (63), Expect = 6.8
 Identities = 7/24 (29%), Positives = 9/24 (37%)

Query: 30  KNKKKKKKKKKKKKKKKKKKKKKR 53
             K K      K + K KK  K +
Sbjct: 187 LAKHKILIPSLKTRPKGKKYVKIK 210


>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
           protein. 
          Length = 342

 Score = 27.9 bits (62), Expect = 6.6
 Identities = 8/41 (19%), Positives = 20/41 (48%)

Query: 17  IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
           +P P  ++ AK    K    ++K      +++++ + R+ A
Sbjct: 272 VPPPVPKKPAKGPIVKPAISREKSLDSSDRQRQEARSRLAA 312


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 27.6 bits (62), Expect = 6.7
 Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 15  VIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
            I  APF ++   K       KKKK KK+++ KK KKK   L  
Sbjct: 103 QIDAAPF-KQWYAKHYGIDLGKKKKYKKEEENKKAKKKSESLVA 145


>gnl|CDD|236100 PRK07798, PRK07798, acyl-CoA synthetase; Validated.
          Length = 533

 Score = 27.9 bits (63), Expect = 6.9
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 101 EQSNDIAIVLYTSGSTGIPK 120
           E+S D   +LYT G+TG+PK
Sbjct: 160 ERSPDDLYLLYTGGTTGMPK 179


>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
          Length = 1121

 Score = 28.3 bits (64), Expect = 7.0
 Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 9/48 (18%)

Query: 27  KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
           ++  + KK+K+++++KKK K K K  R I+A         GR V S P
Sbjct: 272 EELISGKKEKEEEEEKKKIKPKDKYLRDIIA---------GRPVFSHP 310


>gnl|CDD|178049 PLN02430, PLN02430, long-chain-fatty-acid-CoA ligase.
          Length = 660

 Score = 28.2 bits (63), Expect = 7.1
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 96  ADNNNEQSNDIAIVLYTSGSTGIPK 120
           ++ N  +  DI  ++YTSG++G PK
Sbjct: 212 SETNPPKPLDICTIMYTSGTSGDPK 236


>gnl|CDD|214377 CHL00152, rpl32, ribosomal protein L32; Validated.
          Length = 53

 Score = 25.4 bits (56), Expect = 7.3
 Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
          KK+ +K KK+ +K   K+K  K   K   LAKS
Sbjct: 5  KKRTSKSKKRIRKNIWKRKAYKAALKAFSLAKS 37


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
          proteins is associated with U3 snoRNA. U3 snoRNA is
          required for nucleolar processing of pre-18S ribosomal
          RNA.
          Length = 87

 Score = 26.2 bits (58), Expect = 7.4
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
            P E  + + KK KKK K +  KK +K+ K  KK
Sbjct: 18 TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 25.8 bits (57), Expect = 9.6
 Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
          E  K +KKKNK +  KK +K+ K  KK     R+L
Sbjct: 26 EELKKQKKKNKLRFLKKLEKRPKDVKKGPVTVRVL 60



 Score = 25.8 bits (57), Expect = 9.7
 Identities = 13/38 (34%), Positives = 18/38 (47%)

Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
               P     E+ KK+K K K +  KK +K+ K  KK
Sbjct: 15 RPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 27.2 bits (61), Expect = 7.4
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 9/37 (24%)

Query: 27 KKKKNKKKKKKKKKKKKK---------KKKKKKKKRR 54
          KKK + K+ K   K KKK         +++K+KKK +
Sbjct: 2  KKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHK 38


>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
           I [DNA replication, recombination, and repair].
          Length = 140

 Score = 27.1 bits (60), Expect = 7.6
 Identities = 14/39 (35%), Positives = 18/39 (46%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
             EK K K+ K  K   +K  +KK+KKK        K C
Sbjct: 93  FTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPC 131


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
          contains a C-terminal presumed domain in Sas10 which
          hash been identified as a regulator of chromatin
          silencing.
          Length = 76

 Score = 26.1 bits (58), Expect = 7.6
 Identities = 15/45 (33%), Positives = 27/45 (60%), Gaps = 2/45 (4%)

Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
          + +KK  +  + KK+KK +K  K+ K +RR++ +  E+  Y G T
Sbjct: 21 RKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVRE--EDGGYGGET 63


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 26.8 bits (60), Expect = 7.6
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           E+   +KK  K K+++  KKK++K K K KK+
Sbjct: 106 EKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137



 Score = 26.8 bits (60), Expect = 8.8
 Identities = 13/44 (29%), Positives = 25/44 (56%)

Query: 11  HFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
            F    +    + + + +KK KK K+++  KKK++K K K K++
Sbjct: 94  EFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|222965 PHA03057, PHA03057, Hypothetical protein; Provisional.
          Length = 146

 Score = 27.0 bits (59), Expect = 7.6
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 12  FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKK 44
           FD + IP PF+R     KKN    +   +  KK
Sbjct: 103 FDFMKIPNPFKRYYEYCKKNNNNPQPVSEPPKK 135


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 26.9 bits (60), Expect = 7.7
 Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 7/41 (17%)

Query: 22  EREKAKKKKNKKKKKKKKKKKKKKKK-------KKKKKRRI 55
             E  K  + +KK+K+KKKKKKK+ +       ++KKK   
Sbjct: 67  GEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQ 107


>gnl|CDD|219721 pfam08118, MDM31_MDM32, Yeast mitochondrial distribution and
           morphology (MDM) proteins.  Proteins in this family are
           yeast mitochondrial inner membrane proteins MDM31 and
           MDM32. These proteins are required for the maintenance
           of mitochondrial morphology, and the stability of
           mitochondrial DNA.
          Length = 503

 Score = 27.9 bits (62), Expect = 8.0
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 134 IVDDIHRCWSSGDRAIKYRRMYKTGDY 160
           +VD  H  W + D A  Y+ +++ GD+
Sbjct: 192 VVDRTHVHWKADDDARNYKNVHQPGDF 218


>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986).  This
          family of proteins has no known function.
          Length = 44

 Score = 24.9 bits (55), Expect = 8.1
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKK 52
          KKK N+K KKK KK   K     K K
Sbjct: 4  KKKINQKLKKKAKKANAKLHPSNKPK 29



 Score = 24.9 bits (55), Expect = 8.6
 Identities = 12/29 (41%), Positives = 15/29 (51%)

Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
          +KKK  +K KKK KK   K     K + I
Sbjct: 3  RKKKINQKLKKKAKKANAKLHPSNKPKYI 31


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 25.6 bits (57), Expect = 8.1
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 32 KKKKKKKKKKKKKKKKKKKKKRR 54
          K  +K+K+KK   +K+++KK  R
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLAR 62



 Score = 25.6 bits (57), Expect = 8.5
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 34 KKKKKKKKKKKKKKKKKKKRR 54
          K+K+KK   +K+++KK  +  
Sbjct: 44 KRKRKKAAARKRRRKKLAREE 64


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 27.5 bits (62), Expect = 8.2
 Identities = 6/35 (17%), Positives = 21/35 (60%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
           +KK   +++++++++  +K  +  ++  R  A+ C
Sbjct: 84  SKKGTEEERRERREENLEKALELLREGNRSEAREC 118


>gnl|CDD|215353 PLN02654, PLN02654, acetate-CoA ligase.
          Length = 666

 Score = 27.9 bits (62), Expect = 8.3
 Identities = 14/29 (48%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 103 SNDIAIVLYTSGSTGIPKEYRRMYKTGDY 131
           + D   +LYTSGSTG PK    ++ TG Y
Sbjct: 274 AEDPLFLLYTSGSTGKPKGV--LHTTGGY 300


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 27.7 bits (62), Expect = 8.3
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
           F  + A K  +KKK +K    + K   K+K K 
Sbjct: 188 FLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKD 220


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
          Provisional.
          Length = 191

 Score = 27.5 bits (61), Expect = 8.3
 Identities = 20/48 (41%), Positives = 25/48 (52%), Gaps = 5/48 (10%)

Query: 15 VIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
          V IP+     K  KKK K   +KKK  KK KKKKK+K+      + E 
Sbjct: 43 VEIPSK----KTSKKK-KTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85


>gnl|CDD|181546 PRK08751, PRK08751, putative long-chain fatty acyl CoA ligase;
           Provisional.
          Length = 560

 Score = 27.9 bits (62), Expect = 8.4
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 102 QSNDIAIVLYTSGSTGIPK 120
           + +DIA + YT G+TG+ K
Sbjct: 206 EPDDIAFLQYTGGTTGVAK 224


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 27.9 bits (62), Expect = 8.5
 Identities = 21/96 (21%), Positives = 37/96 (38%), Gaps = 11/96 (11%)

Query: 18  PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN-------PAYKGRTV 70
             P  R  A   + +++    ++KKK+ K    K+R   +K   N            RTV
Sbjct: 24  YLPSRRRLAVSCRARRRGFSSQQKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRTV 83

Query: 71  LSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDI 106
           + LPQ     ++ S ++    SM   +      N+ 
Sbjct: 84  MELPQ----KSTSSDDDHNRASMQRDEAIAAIDNEQ 115


>gnl|CDD|239314 cd03016, PRX_1cys, Peroxiredoxin (PRX) family, 1-cys PRX subfamily;
           composed of PRXs containing only one conserved cysteine,
           which serves as the peroxidatic cysteine. They are
           homodimeric thiol-specific antioxidant (TSA) proteins
           that confer a protective role in cells by reducing and
           detoxifying hydrogen peroxide, peroxynitrite, and
           organic hydroperoxides. As with all other PRXs, a
           cysteine sulfenic acid intermediate is formed upon
           reaction of 1-cys PRX with its substrates. Having no
           resolving cysteine, the oxidized enzyme is resolved by
           an external small-molecule or protein reductant such as
           thioredoxin or glutaredoxin. Similar to typical 2-cys
           PRX, 1-cys PRX forms a functional dimeric unit with a
           B-type interface, as well as a decameric structure which
           is stabilized in the reduced form of the enzyme. Other
           oligomeric forms, tetramers and hexamers, have also been
           reported. Mammalian 1-cys PRX is localized cellularly in
           the cytosol and is expressed at high levels in brain,
           eye, testes and lung. The seed-specific plant 1-cys PRXs
           protect tissues from reactive oxygen species during
           desiccation and are also called rehydrins.
          Length = 203

 Score = 27.1 bits (61), Expect = 8.6
 Identities = 10/26 (38%), Positives = 13/26 (50%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKK 38
           D VI+P     E+AKKK  K  +   
Sbjct: 171 DDVIVPPSVSDEEAKKKFPKGYETVD 196


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 27.7 bits (61), Expect = 9.2
 Identities = 14/54 (25%), Positives = 29/54 (53%), Gaps = 8/54 (14%)

Query: 22  EREKAKKKKNKKKKKKKKKK--------KKKKKKKKKKKRRILAKSCENPAYKG 67
             + A  K++KK  ++ ++K        +++K +KK +KRR + K  E+   +G
Sbjct: 101 NEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQG 154


>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
           subunit.
          Length = 1044

 Score = 28.0 bits (62), Expect = 9.2
 Identities = 13/43 (30%), Positives = 20/43 (46%)

Query: 25  KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
           + K KK+KK KK ++      KK+KKK     +   +    K 
Sbjct: 608 RGKSKKSKKGKKGRRSLLGGLKKRKKKGGGGGSMMGKKYGRKK 650


>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1.  This family consists
           of several bacterial histone H1-like Hc1 proteins. In
           Chlamydia, Hc1 is expressed in the late stages of the
           life cycle, concomitant with the reorganisation of
           chlamydial reticulate bodies into elementary bodies.
           This suggests that Hc1 protein plays a role in the
           condensation of chromatin during intracellular
           differentiation.
          Length = 123

 Score = 26.6 bits (58), Expect = 9.2
 Identities = 17/56 (30%), Positives = 24/56 (42%)

Query: 26  AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSA 81
             +K    KKK  KK  K  K  KK  +    K+   PA K + V + P  +  +A
Sbjct: 54  KAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAA 109


>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962).  This
          eukaryotic family of proteins has no known function.
          Length = 155

 Score = 26.8 bits (60), Expect = 9.3
 Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 11/47 (23%)

Query: 23 REKAKKKKNK-----------KKKKKKKKKKKKKKKKKKKKRRILAK 58
          +  AKKKK             +   +  K +KKK ++ +K+   L +
Sbjct: 4  KNIAKKKKVLHPKSRKAKQLNRATLRDDKLEKKKAERAEKRSLELDR 50


>gnl|CDD|222331 pfam13707, RloB, RloB-like protein.  This family includes the RloB
           protein that is found within a bacterial restriction
           modification operon. This family includes the AbiLii
           protein that is found as part of a plasmid encoded phage
           abortive infection mechanism. Deletion within abiLii
           abolished the phage resistance. The family includes some
           proteins annotated as CRISPR Csm2 proteins.
          Length = 176

 Score = 26.9 bits (60), Expect = 9.3
 Identities = 12/42 (28%), Positives = 17/42 (40%), Gaps = 2/42 (4%)

Query: 11  HFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
           HF+     A   R+   KK   KK     KK K   K+  ++
Sbjct: 102 HFEDT--TAALSRDAYAKKLLLKKGLPDYKKGKNLFKELLER 141


>gnl|CDD|180161 PRK05609, nusG, transcription antitermination protein NusG;
          Validated.
          Length = 181

 Score = 27.1 bits (61), Expect = 9.4
 Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 4/32 (12%)

Query: 9  EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKK 40
          E    +V++P     E+  + KN KKKK ++K
Sbjct: 35 EDLIGEVLVP----TEEVVEVKNGKKKKVERK 62


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 27.3 bits (61), Expect = 9.5
 Identities = 10/39 (25%), Positives = 30/39 (76%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
              E+A++++ ++K+++KK+ K++K+++K+   +I A++
Sbjct: 176 QRLEEAERRRREEKERRKKQDKERKQREKETAEKIAARA 214


>gnl|CDD|180666 PRK06710, PRK06710, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 563

 Score = 27.7 bits (61), Expect = 9.6
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 86  NNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
           N++  E     +   +  ND+A++ YT G+TG PK
Sbjct: 188 NSVEKEVNTGVEVPCDPENDLALLQYTGGTTGFPK 222


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 27.7 bits (62), Expect = 9.6
 Identities = 9/48 (18%), Positives = 22/48 (45%)

Query: 21  FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
            E  K + KK ++K ++ + + +KK+++ +K +            K  
Sbjct: 97  IEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY 144


>gnl|CDD|146789 pfam04337, DUF480, Protein of unknown function, DUF480.  This
           family consists of several proteins of uncharacterized
           function.
          Length = 148

 Score = 26.7 bits (60), Expect = 9.7
 Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 9/66 (13%)

Query: 143 SSGDRAIKYR-RMYKTGDYGR----IVDDILLYEGRTDSQVKIRGNRV----DLSEVEAA 193
             G R  KY  R   T         ++  +LL   +T  +++ R  R+    D++EVEA 
Sbjct: 68  GFGSRVPKYEHRFCNTLQLSPQELALLCLLLLRGPQTPGELRTRSERLHRFADVAEVEAV 127

Query: 194 LSKLSE 199
           L +L+E
Sbjct: 128 LEELAE 133


>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
          Length = 631

 Score = 27.8 bits (63), Expect = 9.9
 Identities = 9/30 (30%), Positives = 15/30 (50%), Gaps = 5/30 (16%)

Query: 13  DKVIIPAPFEREKAKKKKNKKKKKKKKKKK 42
           +K I     +  +A+ +  KK K+KKK   
Sbjct: 374 EKAI-----KAAQARLRAAKKVKRKKKTSG 398


>gnl|CDD|181109 PRK07769, PRK07769, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 631

 Score = 27.8 bits (62), Expect = 9.9
 Identities = 10/16 (62%), Positives = 12/16 (75%)

Query: 104 NDIAIVLYTSGSTGIP 119
           + IA + YTSGST IP
Sbjct: 180 DTIAYLQYTSGSTRIP 195


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.131    0.360 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,566,187
Number of extensions: 1520540
Number of successful extensions: 16024
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10795
Number of HSP's successfully gapped: 1700
Length of query: 307
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 210
Effective length of database: 6,635,264
Effective search space: 1393405440
Effective search space used: 1393405440
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (26.6 bits)