RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8473
(307 letters)
>gnl|CDD|213296 cd05930, A_NRPS, The adenylation domain of nonribosomal peptide
synthetases (NRPS). The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
bond to the enzyme-bound cofactor phosphopantetheine of
a peptidyl carrier protein domain. NRPSs are large
multifunctional enzymes which synthesize many
therapeutically useful peptides in bacteria and fungi
via a template-directed, nucleic acid independent
nonribosomal mechanism. These natural products include
antibiotics, immunosuppressants, plant and animal
toxins, and enzyme inhibitors. NRPS has a distinct
modular structure in which each module is responsible
for the recognition, activation, and in some cases,
modification of a single amino acid residue of the final
peptide product. The modules can be subdivided into
domains that catalyze specific biochemical reactions.
Length = 445
Score = 81.7 bits (203), Expect = 2e-17
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 11/84 (13%)
Query: 153 RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKPG 211
R+Y+TGD R + D L + GR D QVKIRG R++L E+EAAL V + V+ + G
Sbjct: 327 RLYRTGDLARWLPDGNLEFLGRADDQVKIRGYRIELGEIEAALLAHPGVREAVVVAREDG 386
Query: 212 EVDQV----------SEVTSEEIR 225
++ +E+ + E+R
Sbjct: 387 AGEKRLVAYVVPAAGAELDAAELR 410
Score = 59.8 bits (146), Expect = 4e-10
Identities = 21/51 (41%), Positives = 30/51 (58%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQ 278
D L + GR D QVKIRG R++L E+EAAL V + V+ + G ++
Sbjct: 340 DGNLEFLGRADDQVKIRGYRIELGEIEAALLAHPGVREAVVVAREDGAGEK 390
Score = 41.3 bits (98), Expect = 4e-04
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+D+A V+YTSGSTG PK
Sbjct: 91 DPDDLAYVIYTSGSTGRPK 109
>gnl|CDD|233550 TIGR01733, AA-adenyl-dom, amino acid adenylation domain. This
model represents a domain responsible for the specific
recognition of amino acids and activation as adenylyl
amino acids. The reaction catalyzed is aa + ATP ->
aa-AMP + PPi. These domains are usually found as
components of multi-domain non-ribosomal peptide
synthetases and are usually called "A-domains" in that
context (for a review, see ). A-domains are almost
invariably followed by "T-domains" (thiolation domains,
pfam00550) to which the amino acid adenylate is
transferred as a thiol-ester to a bound pantetheine
cofactor with the release of AMP (these are also called
peptide carrier proteins, or PCPs. When the A-domain
does not represent the first module (corresponding to
the first amino acid in the product molecule) it is
usually preceded by a "C-domain" (condensation domain,
pfam00668) which catalyzes the ligation of two amino
acid thiol-esters from neighboring modules. This domain
is a subset of the AMP-binding domain found in Pfam
(pfam00501) which also hits substrate--CoA ligases and
luciferases. Sequences scoring in between trusted and
noise for this model may be ambiguous as to whether they
activate amino acids or other molecules lacking an alpha
amino group.
Length = 409
Score = 77.3 bits (191), Expect = 6e-16
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 1/51 (1%)
Query: 152 RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
R+Y+TGD R + D L + GR D QVKIRG R++L E+EAAL + V
Sbjct: 355 ARLYRTGDLVRYLPDGNLEFLGRIDDQVKIRGYRIELGEIEAALLRHPGVR 405
Score = 57.3 bits (139), Expect = 3e-09
Identities = 18/34 (52%), Positives = 23/34 (67%)
Query: 231 LLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
L + GR D QVKIRG R++L E+EAAL + V
Sbjct: 372 LEFLGRIDDQVKIRGYRIELGEIEAALLRHPGVR 405
Score = 39.9 bits (94), Expect = 0.001
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+D+A V+YTSGSTG PK
Sbjct: 119 PDDLAYVIYTSGSTGRPK 136
>gnl|CDD|213325 cd12117, A_NRPS_Srf_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including Bacillus subtilis
termination module Surfactin (SrfA-C). The adenylation
(A) domain of NRPS recognizes a specific amino acid or
hydroxy acid and activates it as an (amino) acyl
adenylate by hydrolysis of ATP. The activated acyl
moiety then forms a thioester to the enzyme-bound
cofactor phosphopantetheine of a peptidyl carrier
protein domain. NRPSs are large multifunctional enzymes
which synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic acid
independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and, in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions. This family includes the
adenylation domain of the Bacillus subtilis termination
module (Surfactin domain, SrfA-C) which recognizes a
specific amino acid building block, which is then
activated and transferred to the terminal thiol of the
4'-phosphopantetheine (Ppan) arm of the downstream
peptidyl carrier protein (PCP) domain.
Length = 474
Score = 74.1 bits (183), Expect = 7e-15
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 9/84 (10%)
Query: 152 RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKP 210
R+Y+TGD R D + + GR D QVKIRG R++L E+E AL + V + +VL +
Sbjct: 357 ERLYRTGDLARWRPDGNIEFLGRIDRQVKIRGFRIELGEIEQALLQHPGVREAAVLVRED 416
Query: 211 GEVDQ--------VSEVTSEEIRG 226
D+ + + E+R
Sbjct: 417 AAGDKRLVAYVVARGALDAAELRA 440
Score = 55.2 bits (134), Expect = 1e-08
Identities = 20/51 (39%), Positives = 30/51 (58%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQ 278
D + + GR D QVKIRG R++L E+E AL + V + +VL + D+
Sbjct: 371 DGNIEFLGRIDRQVKIRGFRIELGEIEQALLQHPGVREAAVLVREDAAGDK 421
Score = 35.2 bits (82), Expect = 0.036
Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 5/58 (8%)
Query: 68 RTVLSLPQLR-LLSASLSHNNIPGESMLHADNNNEQS----NDIAIVLYTSGSTGIPK 120
R +L+ L L A I E + D +D+A V+YTSGSTG PK
Sbjct: 86 RVLLTDESLAPLARADQLPLVILEEELDAEDAGPPAPAVDADDLAYVMYTSGSTGRPK 143
>gnl|CDD|213285 cd05918, A_NRPS_SidN3_like, The adenylation (A) domain of
siderophore-synthesizing nonribosomal peptide
synthetases (NRPS). The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. This family of
siderophore-synthesizing NRPS includes the third
adenylation domain of SidN from the endophytic fungus
Neotyphodium lolii, ferrichrome siderophore synthetase,
HC-toxin synthetase, and enniatin synthase. NRPSs are
large multifunctional enzymes which synthesize many
therapeutically useful peptides. These natural products
include antibiotics, immunosuppressants, plant and
animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in some
cases, modification of a single amino acid residue of
the final peptide product. The modules can be subdivided
into domains that catalyze specific biochemical
reactions.
Length = 447
Score = 73.4 bits (181), Expect = 1e-14
Identities = 26/55 (47%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 152 RRMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
R+Y+TGD R D L + GR D QVKIRG R++L E+EA L L V +V
Sbjct: 322 GRIYRTGDLVRYLEDGSLEFLGRKDDQVKIRGQRIELGEIEAVLRALPGVVVAAV 376
Score = 54.1 bits (131), Expect = 3e-08
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSV 268
D L + GR D QVKIRG R++L E+EA L + V +V
Sbjct: 336 DGSLEFLGRKDDQVKIRGQRIELGEIEAVLRALPGVVVAAV 376
Score = 38.7 bits (91), Expect = 0.003
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+D A V+YTSGSTG PK
Sbjct: 97 SPDDPAYVIYTSGSTGKPK 115
>gnl|CDD|213322 cd12114, A_NRPS_TlmIV_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including Streptoalloteichus
tallysomycin biosynthesis genes. The adenylation (A)
domain of NRPS recognizes a specific amino acid or
hydroxy acid and activates it as an (amino) acyl
adenylate by hydrolysis of ATP. The activated acyl
moiety then forms a thioester to the enzyme-bound
cofactor phosphopantetheine of a peptidyl carrier
protein domain. NRPSs are large multifunctional enzymes
which synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic acid
independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in some
cases, modification of a single amino acid residue of
the final peptide product. The modules can be subdivided
into domains that catalyze specific biochemical
reactions. This family includes the TLM biosynthetic
gene cluster from Streptoalloteichus that consists of
nine NRPS genes; the N-terminal module of TlmVI (NRPS-5)
and the starter module of BlmVI (NRPS-5) are comprised
of the acyl CoA ligase (AL) and acyl carrier protein
(ACP)-like domains, which are thought to be involved in
the biosynthesis of the beta-aminoalaninamide moiety.
Length = 476
Score = 72.2 bits (178), Expect = 3e-14
Identities = 27/56 (48%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 152 RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD GR D + + GR D QVKIRG RV+L E+EAAL++ V + V+
Sbjct: 358 ERLYRTGDLGRYRPDGTIEFLGRADHQVKIRGYRVELGEIEAALARHPGVQRAVVV 413
Score = 51.8 bits (125), Expect = 2e-07
Identities = 19/35 (54%), Positives = 25/35 (71%)
Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
GR D QVKIRG RV+L E+EAAL++ V + V+
Sbjct: 379 GRADHQVKIRGYRVELGEIEAALARHPGVQRAVVV 413
Score = 38.0 bits (89), Expect = 0.006
Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 4/57 (7%)
Query: 68 RTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSN----DIAIVLYTSGSTGIPK 120
R VL+ P L + + ++ D+A V++TSGSTG PK
Sbjct: 86 RAVLTDPGLAQPEEAPDLLVVADDAAAAESPAPPPPRVDPDDLAYVIFTSGSTGEPK 142
>gnl|CDD|213310 cd05945, DltA, D-alanine:D-alanyl carrier protein ligase (DltA).
DltA belongs to the class I AMP-forming adenylation
domain superfamily, which also includes acetyl-CoA
synthetase, luciferase, and the adenylation domains of
non-ribosomal synthetases. It catalyzes the two-step
activation reaction of D-alanine: the formation of a
substrate-AMP molecule as an intermediate, and then the
transfer of the amino acid adenylate to teichoic acid in
the biosynthesis of lipoteichoic acid (LTA) and wall
teichoic acid (WTA) in gram-positive bacteria.
Length = 447
Score = 72.2 bits (178), Expect = 4e-14
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 152 RRMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLC-YK 209
+R Y+TGD + DD LL Y GR D Q+K+ G R++L E+EAAL L V++ VL K
Sbjct: 329 QRWYRTGDLVYLEDDGLLVYLGRKDFQIKLHGYRIELEEIEAALRALPGVEEAVVLPVPK 388
Query: 210 PGEVDQ-VSEVTSEEIRGVDDILL 232
+V + V+ V +E L
Sbjct: 389 GEKVVRLVAFVVGKEGALDTKALK 412
Score = 56.8 bits (138), Expect = 3e-09
Identities = 21/53 (39%), Positives = 34/53 (64%), Gaps = 1/53 (1%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLC-YKPGEVDQI 279
D +L+Y GR D Q+K+ G R++L E+EAAL + V++ VL K +V ++
Sbjct: 343 DGLLVYLGRKDFQIKLHGYRIELEEIEAALRALPGVEEAVVLPVPKGEKVVRL 395
Score = 37.6 bits (88), Expect = 0.007
Identities = 11/17 (64%), Positives = 15/17 (88%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D+A +L+TSGSTG PK
Sbjct: 97 DDLAYILFTSGSTGKPK 113
>gnl|CDD|213324 cd12116, A_NRPS_Ta1_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including salinosporamide A
polyketide synthase. The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. NRPSs are large
multifunctional enzymes which synthesize many
therapeutically useful peptides in bacteria and fungi
via a template-directed, nucleic acid independent
nonribosomal mechanism. These natural products include
antibiotics, immunosuppressants, plant and animal
toxins, and enzyme inhibitors. NRPS has a distinct
modular structure in which each module is responsible
for the recognition, activation, and in some cases,
modification of a single amino acid residue of the final
peptide product. The modules can be subdivided into
domains that catalyze specific biochemical reactions.
This family includes the myxovirescin (TA) antibiotic
biosynthetic gene in Myxococcus xanthus; TA production
plays a role in predation. It also includes the
salinosporamide A polyketide synthase which is involved
in the biosynthesis of salinosporamide A, a marine
microbial metabolite whose chlorine atom is crucial for
potent proteasome inhibition and anticancer activity.
Length = 438
Score = 71.1 bits (175), Expect = 8e-14
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 153 RMYKTGDYGR-IVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV-DKLSVLC-YK 209
R+Y+TGD R D L Y GR D QVKIRG R++L E+EAAL V D + V+
Sbjct: 320 RLYRTGDLVRWRPDGRLEYLGRADDQVKIRGFRIELGEIEAALRAHPGVADAVVVVREEG 379
Query: 210 PGEVDQVSEVTSEEIRGVD 228
PG+ V+ V + D
Sbjct: 380 PGDQRLVAYVVPKAGAAPD 398
Score = 55.7 bits (135), Expect = 1e-08
Identities = 25/69 (36%), Positives = 34/69 (49%), Gaps = 5/69 (7%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQ-----IWEA 282
D L Y GR D QVKIRG R++L E+EAAL V V+ + G DQ +
Sbjct: 333 DGRLEYLGRADDQVKIRGFRIELGEIEAALRAHPGVADAVVVVREEGPGDQRLVAYVVPK 392
Query: 283 PSSAGSTDS 291
+A ++
Sbjct: 393 AGAAPDAEA 401
Score = 36.4 bits (85), Expect = 0.016
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+D+A V+YTSGSTG PK
Sbjct: 92 PDDLAYVIYTSGSTGKPK 109
>gnl|CDD|213323 cd12115, A_NRPS_Sfm_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including Saframycin A gene
cluster from Streptomyces lavendulae. The adenylation
(A) domain of NRPS recognizes a specific amino acid or
hydroxy acid and activates it as an (amino) acyl
adenylate by hydrolysis of ATP. The activated acyl
moiety then forms a thioester to the enzyme-bound
cofactor phosphopantetheine of a peptidyl carrier
protein domain. NRPSs are large multifunctional enzymes
which synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic acid
independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in some
cases, modification of a single amino acid residue of
the final peptide product. The modules can be subdivided
into domains that catalyze specific biochemical
reactions. This family includes the saframycin A gene
cluster from Streptomyces lavendulae which implicates
the NRPS system for assembling the unusual tetrapeptidyl
skeleton in an iterative manner. It also includes
saframycin Mx1 produced by Myxococcus xanthus NRPS.
Length = 449
Score = 68.1 bits (167), Expect = 7e-13
Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 152 RRMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDK 202
R+Y+TGD R D L Y GR D QVK+RG R++L E+EAAL + V +
Sbjct: 330 ERLYRTGDLVRRRADGQLEYLGRIDDQVKVRGFRIELGEIEAALRSIPGVAE 381
Score = 50.7 bits (122), Expect = 3e-07
Identities = 20/38 (52%), Positives = 25/38 (65%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDK 265
D L Y GR D QVK+RG R++L E+EAAL I V +
Sbjct: 344 DGQLEYLGRIDDQVKVRGFRIELGEIEAALRSIPGVAE 381
Score = 36.9 bits (86), Expect = 0.010
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+D+A V+YTSGSTG PK
Sbjct: 103 DPDDLAYVIYTSGSTGRPK 121
>gnl|CDD|223951 COG1020, EntF, Non-ribosomal peptide synthetase modules and related
proteins [Secondary metabolites biosynthesis, transport,
and catabolism].
Length = 642
Score = 65.4 bits (159), Expect = 9e-12
Identities = 26/47 (55%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 152 RRMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKL 197
R+Y+TGD R + D L Y GR DSQVKIRG R++L E+EAAL++
Sbjct: 595 LRLYRTGDLARPLADGALEYLGRKDSQVKIRGFRIELGEIEAALAEQ 641
Score = 47.6 bits (113), Expect = 4e-06
Identities = 20/33 (60%), Positives = 25/33 (75%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKI 260
D L Y GR DSQVKIRG R++L E+EAAL++
Sbjct: 609 DGALEYLGRKDSQVKIRGFRIELGEIEAALAEQ 641
Score = 36.9 bits (85), Expect = 0.013
Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 6/51 (11%)
Query: 70 VLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
+ LP L L A IP + + +A ++YTSGSTG PK
Sbjct: 338 DVGLPGLALDDALS---EIPDTDPIPQA---LLGDALAYIIYTSGSTGQPK 382
>gnl|CDD|237108 PRK12467, PRK12467, peptide synthase; Provisional.
Length = 3956
Score = 65.2 bits (159), Expect = 1e-11
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGR-IVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD R D ++ Y GR D QVKIRG R++L E+EA L V + VL
Sbjct: 890 RLYRTGDLARYRADGVIEYLGRMDHQVKIRGFRIELGEIEARLLAQPGVREAVVL 944
Score = 56.7 bits (137), Expect = 7e-09
Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD R D ++ Y GR D QVKIRG R++L E+EA L + V + VL
Sbjct: 3471 RLYRTGDLARYRADGVIEYLGRIDHQVKIRGFRIELGEIEARLLQHPSVREAVVL 3525
Score = 54.4 bits (131), Expect = 4e-08
Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD R D ++ Y GR D QVKIRG R++L E+EA L + V + V+
Sbjct: 1954 RLYRTGDLARYRADGVIEYLGRIDHQVKIRGFRIELGEIEARLREQGGVREAVVI 2008
Score = 50.2 bits (120), Expect = 8e-07
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 227 VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQI 279
D ++ Y GR D QVKIRG R++L E+EA L V + +V+ +PG+
Sbjct: 902 ADGVIEYLGRMDHQVKIRGFRIELGEIEARLLAQPGVRE-AVVLAQPGDAGLQ 953
Score = 43.6 bits (103), Expect = 1e-04
Identities = 19/42 (45%), Positives = 27/42 (64%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
D ++ Y GR D QVKIRG R++L E+EA L + V + V+
Sbjct: 1967 DGVIEYLGRIDHQVKIRGFRIELGEIEARLREQGGVREAVVI 2008
Score = 43.6 bits (103), Expect = 1e-04
Identities = 20/42 (47%), Positives = 27/42 (64%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
D ++ Y GR D QVKIRG R++L E+EA L + V + VL
Sbjct: 3484 DGVIEYLGRIDHQVKIRGFRIELGEIEARLLQHPSVREAVVL 3525
Score = 32.4 bits (74), Expect = 0.32
Identities = 15/31 (48%), Positives = 18/31 (58%), Gaps = 1/31 (3%)
Query: 90 GESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
G S + N N +A V+YTSGSTG PK
Sbjct: 1705 GYSDSNPAVNLAPQN-LAYVIYTSGSTGRPK 1734
Score = 32.4 bits (74), Expect = 0.38
Identities = 23/58 (39%), Positives = 30/58 (51%), Gaps = 11/58 (18%)
Query: 63 PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
PA G T L+L +L L S NN + + +N +A V+YTSGSTG PK
Sbjct: 3207 PAPAGDTALTLDRLDL--NGYSENNP--STRVMGEN-------LAYVIYTSGSTGKPK 3253
Score = 31.3 bits (71), Expect = 0.86
Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 2/55 (3%)
Query: 67 GRTVLSLPQ-LRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
L +P LR L + + G S + + + N +A V+YTSGSTG PK
Sbjct: 619 LLAQLPVPAGLRSLCLDEPADLLCGYSGHNPEVALDPDN-LAYVIYTSGSTGQPK 672
>gnl|CDD|236668 PRK10252, entF, enterobactin synthase subunit F; Provisional.
Length = 1296
Score = 63.5 bits (155), Expect = 4e-11
Identities = 24/50 (48%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 153 RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
RMY+TGD R +DD + Y GR+D Q+KIRG R++L E++ A+ L +V+
Sbjct: 837 RMYRTGDVARWLDDGAVEYLGRSDDQLKIRGQRIELGEIDRAMQALPDVE 886
Score = 47.0 bits (112), Expect = 1e-05
Identities = 14/32 (43%), Positives = 24/32 (75%)
Query: 233 YEGRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
Y GR+D Q+KIRG R++L E++ A+ + +V+
Sbjct: 855 YLGRSDDQLKIRGQRIELGEIDRAMQALPDVE 886
Score = 32.7 bits (75), Expect = 0.26
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 20/65 (30%)
Query: 105 DIAIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWSSGDRAIKYRRMYKTGDYGRIV 164
A +++TSGSTG PK G +V G AI R ++ Y
Sbjct: 599 HTAYIIFTSGSTGRPK-----------GVMV---------GQTAIVNRLLWMQNHYPLTA 638
Query: 165 DDILL 169
DD++L
Sbjct: 639 DDVVL 643
>gnl|CDD|215954 pfam00501, AMP-binding, AMP-binding enzyme.
Length = 412
Score = 62.7 bits (153), Expect = 5e-11
Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
Y+TGD GR +D L GR D QVKIRG R++ E+EA L + V
Sbjct: 361 YRTGDLGRWDEDGYLEILGRKDDQVKIRGERIEPGEIEAVLLEHPGV 407
Score = 49.3 bits (118), Expect = 1e-06
Identities = 17/36 (47%), Positives = 21/36 (58%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEV 263
D L GR D QVKIRG R++ E+EA L + V
Sbjct: 372 DGYLEILGRKDDQVKIRGERIEPGEIEAVLLEHPGV 407
Score = 40.8 bits (96), Expect = 6e-04
Identities = 10/19 (52%), Positives = 15/19 (78%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+D+A ++YTSG+TG PK
Sbjct: 129 DPDDLAYIIYTSGTTGKPK 147
>gnl|CDD|235313 PRK04813, PRK04813, D-alanine--poly(phosphoribitol) ligase subunit
1; Provisional.
Length = 503
Score = 59.5 bits (145), Expect = 5e-10
Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 153 RMYKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKP-G 211
Y TGD G + D +L Y+GR D Q+K+ G R++L E+E L + S V+ V+ Y
Sbjct: 376 PAYHTGDAGYLEDGLLFYQGRIDFQIKLNGYRIELEEIEQNLRQSSYVESAVVVPYNKDH 435
Query: 212 EVDQ 215
+V
Sbjct: 436 KVQY 439
Score = 45.3 bits (108), Expect = 2e-05
Identities = 19/53 (35%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKP-GEVDQI 279
D +L Y+GR D Q+K+ G R++L E+E L + S V+ V+ Y +V +
Sbjct: 388 DGLLFYQGRIDFQIKLNGYRIELEEIEQNLRQSSYVESAVVVPYNKDHKVQYL 440
Score = 29.1 bits (66), Expect = 3.4
Identities = 8/19 (42%), Positives = 14/19 (73%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+ +D +++TSG+TG PK
Sbjct: 141 KGDDNYYIIFTSGTTGKPK 159
>gnl|CDD|233551 TIGR01734, D-ala-DACP-lig, D-alanine--poly(phosphoribitol) ligase,
subunit 1. This model represents the enzyme (also
called D-alanine-D-alanyl carrier protein ligase) which
activates D-alanine as an adenylate via the reaction
D-ala + ATP -> D-ala-AMP + PPi, and further catalyzes
the condensation of the amino acid adenylate with the
D-alanyl carrier protein (D-ala-ACP). The D-alanine is
then further transferred to teichoic acid in the
biosynthesis of lipoteichoic acid (LTA) and wall
teichoic acid (WTA) in gram positive bacteria, both
polysacchatides [Cell envelope, Biosynthesis and
degradation of murein sacculus and peptidoglycan].
Length = 502
Score = 59.0 bits (143), Expect = 8e-10
Identities = 21/54 (38%), Positives = 33/54 (61%)
Query: 153 RMYKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
Y+TGD G I D L Y+GR D Q+K+ G R++L ++E L + S ++ V+
Sbjct: 374 PAYRTGDAGTITDGQLFYQGRLDFQIKLHGYRIELEDIEFNLRQSSYIESAVVV 427
Score = 45.5 bits (108), Expect = 2e-05
Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 13/80 (16%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL--CYKPGEVDQIWEAPSS 285
D L Y+GR D Q+K+ G R++L ++E L + S ++ V+ K +V+ +
Sbjct: 386 DGQLFYQGRLDFQIKLHGYRIELEDIEFNLRQSSYIESAVVVPKYNKDHKVEYL------ 439
Query: 286 AGSTDSSFIVKSREDRAEEE 305
+ IV ED +E
Sbjct: 440 -----IAAIVPETEDFEKEF 454
Score = 29.0 bits (65), Expect = 3.7
Identities = 14/51 (27%), Positives = 23/51 (45%), Gaps = 9/51 (17%)
Query: 70 VLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
+++L L S + + + D+N ++YTSGSTG PK
Sbjct: 116 IITLSALEQAETS--GGPVSFDHAVKGDDNY-------YIIYTSGSTGNPK 157
>gnl|CDD|235564 PRK05691, PRK05691, peptide synthase; Validated.
Length = 4334
Score = 59.0 bits (143), Expect = 1e-09
Identities = 27/55 (49%), Positives = 37/55 (67%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD R D +L Y GR D QVKIRG R++L E+EA L + +EV + +V
Sbjct: 4102 RLYRTGDLARRRSDGVLEYVGRIDHQVKIRGYRIELGEIEARLHEQAEVREAAVA 4156
Score = 56.7 bits (137), Expect = 8e-09
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD R+ D L+ Y GR D QVKIRG R++L E+E+ L + V + VL
Sbjct: 2568 RLYRTGDLVRLRADGLVEYVGRIDHQVKIRGFRIELGEIESRLLEHPAVREAVVL 2622
Score = 52.1 bits (125), Expect = 2e-07
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 1/58 (1%)
Query: 153 RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYK 209
R+Y+TGD R D L Y GR D QVK+RG RV+ E++A L V + +VL +
Sbjct: 1505 RLYRTGDRARWNADGALEYLGRLDQQVKLRGFRVEPEEIQARLLAQPGVAQAAVLVRE 1562
Score = 49.4 bits (118), Expect = 2e-06
Identities = 22/45 (48%), Positives = 30/45 (66%)
Query: 225 RGVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
R D +L Y GR D QVKIRG R++L E+EA L + +EV + +V
Sbjct: 4112 RRSDGVLEYVGRIDHQVKIRGYRIELGEIEARLHEQAEVREAAVA 4156
Score = 43.6 bits (103), Expect = 1e-04
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 220 TSEEIRGVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
T + +R D L+ Y GR D QVKIRG R++L E+E+ L + V + VL
Sbjct: 2572 TGDLVRLRADGLVEYVGRIDHQVKIRGFRIELGEIESRLLEHPAVREAVVL 2622
Score = 41.3 bits (97), Expect = 6e-04
Identities = 19/46 (41%), Positives = 26/46 (56%)
Query: 227 VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYK 272
D L Y GR D QVK+RG RV+ E++A L V + +VL +
Sbjct: 1517 ADGALEYLGRLDQQVKLRGFRVEPEEIQARLLAQPGVAQAAVLVRE 1562
Score = 36.3 bits (84), Expect = 0.020
Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 11/58 (18%)
Query: 63 PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
P +G + ++L L L S PG LH DN +A V+YTSGSTG PK
Sbjct: 1243 PQAEGVSAIALDSLHLDSWPSQ---APGLH-LHGDN-------LAYVIYTSGSTGQPK 1289
Score = 34.4 bits (79), Expect = 0.082
Identities = 22/61 (36%), Positives = 31/61 (50%), Gaps = 19/61 (31%)
Query: 74 PQLRLL-----SASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK----EYRR 124
P+L + SHN PG +++ +N +A V+YTSGSTG+PK E R
Sbjct: 3844 PRLLVWEEVQAGEVASHN--PG---IYSGPDN-----LAYVIYTSGSTGLPKGVMVEQRG 3893
Query: 125 M 125
M
Sbjct: 3894 M 3894
Score = 32.1 bits (73), Expect = 0.43
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
Q +DIA + YTSGST +PK
Sbjct: 164 QPDDIAFLQYTSGSTALPK 182
Score = 28.2 bits (63), Expect = 7.6
Identities = 10/14 (71%), Positives = 12/14 (85%)
Query: 107 AIVLYTSGSTGIPK 120
A ++YTSGSTG PK
Sbjct: 2336 AYLIYTSGSTGKPK 2349
>gnl|CDD|237054 PRK12316, PRK12316, peptide synthase; Provisional.
Length = 5163
Score = 57.7 bits (139), Expect = 3e-09
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
RMY+TGD R D ++ Y GR D QVK+RG R++L E+EA L + V + +VL
Sbjct: 886 RMYRTGDLARYRADGVIEYAGRIDHQVKLRGLRIELGEIEARLLEHPWVREAAVL 940
Score = 56.1 bits (135), Expect = 1e-08
Identities = 25/55 (45%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD R D ++ Y GR D QVKIRG R++L E+EA L V + V+
Sbjct: 2381 RLYRTGDLARYRADGVVEYLGRIDHQVKIRGFRIELGEIEARLQAHPAVREAVVV 2435
Score = 53.0 bits (127), Expect = 9e-08
Identities = 23/43 (53%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 153 RMYKTGDYGRI-VDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
R+Y+TGD R D ++ Y GR D QVKIRG R++L E+EA L
Sbjct: 4929 RLYRTGDLARYRADGVIDYLGRVDHQVKIRGFRIELGEIEARL 4971
Score = 51.9 bits (124), Expect = 3e-07
Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 153 RMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R+Y+TGD R D ++ Y GR D QVKIRG R++L E+EA L + V + VL
Sbjct: 3424 RLYRTGDLARYRADGVIEYIGRVDHQVKIRGFRIELGEIEARLLEHPWVREAVVL 3478
Score = 44.6 bits (105), Expect = 6e-05
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 227 VDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
D ++ Y GR D QVKIRG R++L E+EA L V + V+
Sbjct: 2393 ADGVVEYLGRIDHQVKIRGFRIELGEIEARLQAHPAVREAVVV 2435
Score = 44.6 bits (105), Expect = 6e-05
Identities = 19/44 (43%), Positives = 28/44 (63%)
Query: 226 GVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
D ++ Y GR D QVK+RG R++L E+EA L + V + +VL
Sbjct: 897 RADGVIEYAGRIDHQVKLRGLRIELGEIEARLLEHPWVREAAVL 940
Score = 44.2 bits (104), Expect = 7e-05
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 225 RGVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 257
D ++ Y GR D QVKIRG R++L E+EA L
Sbjct: 4939 YRADGVIDYLGRVDHQVKIRGFRIELGEIEARL 4971
Score = 40.3 bits (94), Expect = 0.001
Identities = 20/42 (47%), Positives = 27/42 (64%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
D ++ Y GR D QVKIRG R++L E+EA L + V + VL
Sbjct: 3437 DGVIEYIGRVDHQVKIRGFRIELGEIEARLLEHPWVREAVVL 3478
Score = 34.9 bits (80), Expect = 0.066
Identities = 21/58 (36%), Positives = 27/58 (46%), Gaps = 10/58 (17%)
Query: 63 PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
P G L+L + +H+ P LH DN +A V+YTSGSTG PK
Sbjct: 4663 PIPDGLASLALDRDEDWEGFPAHD--P-AVRLHPDN-------LAYVIYTSGSTGRPK 4710
Score = 33.8 bits (77), Expect = 0.13
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 3/29 (10%)
Query: 95 HADNNNEQS---NDIAIVLYTSGSTGIPK 120
+++ N ++A V+YTSGSTG PK
Sbjct: 643 YSEENPGTELNPENLAYVIYTSGSTGKPK 671
Score = 33.4 bits (76), Expect = 0.16
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 95 HADNNNEQSNDIAIVLYTSGSTGIPK 120
A ++A V+YTSGSTG+PK
Sbjct: 2137 TAPAVQLAGENLAYVIYTSGSTGLPK 2162
Score = 30.3 bits (68), Expect = 1.6
Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 4/57 (7%)
Query: 68 RTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSN----DIAIVLYTSGSTGIPK 120
+ +LS LRL A + N ++A V+YTSGSTG PK
Sbjct: 3156 QLLLSQSHLRLPLAQGVQVLDLDRGDENYAEANPAIRTMPENLAYVIYTSGSTGKPK 3212
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 47.6 bits (114), Expect = 4e-06
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
AP E++ K K K+ +KK+K+K+K K KK+ R
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417
Score = 47.2 bits (113), Expect = 5e-06
Identities = 12/35 (34%), Positives = 22/35 (62%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P ++ AK+ + K+K+K+K K KK+ + K +R
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 47.2 bits (113), Expect = 5e-06
Identities = 15/44 (34%), Positives = 23/44 (52%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
P+ K +KK K+K+K K KK+ + K K+R+ S E
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 42.2 bits (100), Expect = 2e-04
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
P+ + K KK K+ +KK+K+K+K K KK+ +
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417
Score = 41.9 bits (99), Expect = 3e-04
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P + ++KK K KK K+ +KK+K+K+K +
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411
Score = 40.7 bits (96), Expect = 6e-04
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
KV+ ++EK K+K KK+ + K K++K
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 39.5 bits (93), Expect = 0.002
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+KK K KK K+ +KK+K+K+K ++ +
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415
Score = 36.1 bits (84), Expect = 0.017
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
R K K KK K KK K+ +KK+K++
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKE 408
Score = 32.6 bits (75), Expect = 0.27
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K ++KK K KK K+ +KK+K + K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411
>gnl|CDD|234212 TIGR03443, alpha_am_amid, L-aminoadipate-semialdehyde
dehydrogenase. Members of this protein family are
L-aminoadipate-semialdehyde dehydrogenase (EC 1.2.1.31),
product of the LYS2 gene. It is also called
alpha-aminoadipate reductase. In fungi, lysine is
synthesized via aminoadipate. Currently, all members of
this family are fungal.
Length = 1389
Score = 47.4 bits (113), Expect = 6e-06
Identities = 22/45 (48%), Positives = 31/45 (68%), Gaps = 1/45 (2%)
Query: 153 RMYKTGDYGRIV-DDILLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
R+Y+TGD GR + D + GR D QVKIRG R++L E++ LS+
Sbjct: 678 RLYRTGDLGRYLPDGNVECCGRADDQVKIRGFRIELGEIDTHLSQ 722
Score = 36.6 bits (85), Expect = 0.017
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 235 GRTDSQVKIRGNRVDLSEVEAALSK 259
GR D QVKIRG R++L E++ LS+
Sbjct: 698 GRADDQVKIRGFRIELGEIDTHLSQ 722
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 44.6 bits (106), Expect = 3e-05
Identities = 20/39 (51%), Positives = 29/39 (74%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
K P P ++++ +KK K+KKKKK+KKK KK+KKK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 43.8 bits (104), Expect = 7e-05
Identities = 18/30 (60%), Positives = 25/30 (83%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K K+++ K +K+KKKKK+KKK KK+KKK R
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 42.3 bits (100), Expect = 2e-04
Identities = 17/30 (56%), Positives = 25/30 (83%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K KKK ++KK +K+KKKKK+KKK KK+++
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKGKKRKK 410
Score = 40.7 bits (96), Expect = 7e-04
Identities = 18/33 (54%), Positives = 26/33 (78%), Gaps = 1/33 (3%)
Query: 23 REK-AKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+EK K K K+++KK +K+KKKKK+KKK K+R
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408
Score = 35.7 bits (83), Expect = 0.027
Identities = 15/31 (48%), Positives = 25/31 (80%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E+ K+K K KKK+++KK +K+KKKKK+++
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKK 403
Score = 31.5 bits (72), Expect = 0.52
Identities = 14/36 (38%), Positives = 24/36 (66%), Gaps = 4/36 (11%)
Query: 23 REKAKKK----KNKKKKKKKKKKKKKKKKKKKKKRR 54
+E+ K+ K K K KKK+++KK +K+KKK++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKK 400
>gnl|CDD|213270 cd04433, AFD_class_I, Adenylate forming domain, Class I. This
family includes acyl- and aryl-CoA ligases, as well as
the adenylation domain of nonribosomal peptide
synthetases and firefly luciferases. The
adenylate-forming enzymes catalyze an ATP-dependent
two-step reaction to first activate a carboxylate
substrate as an adenylate and then transfer the
carboxylate to the pantetheine group of either coenzyme
A or an acyl-carrier protein. The active site of the
domain is located at the interface of a large N-terminal
subdomain and a smaller C-terminal subdomain.
Length = 338
Score = 43.0 bits (102), Expect = 1e-04
Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
Y+TGD G + ++ LY GR+ +K+ G V +EVE+ L + V
Sbjct: 226 YRTGDLGYLDEEGYLYITGRSKDLIKVGGENVYPAEVESVLLQHPAVA 273
Score = 36.5 bits (85), Expect = 0.012
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A +LYTSG+TG PK
Sbjct: 1 DPAFILYTSGTTGKPK 16
Score = 31.1 bits (71), Expect = 0.60
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
GR+ +K+ G V +EVE+ L + V
Sbjct: 244 GRSKDLIKVGGENVYPAEVESVLLQHPAVA 273
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 41.2 bits (97), Expect = 1e-04
Identities = 23/29 (79%), Positives = 24/29 (82%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
EKAK+KK KKKKKKKKKKK KK KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 39.7 bits (93), Expect = 4e-04
Identities = 19/29 (65%), Positives = 22/29 (75%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K K K+ K+KKKKKKKKKKK KK KK+
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 38.5 bits (90), Expect = 8e-04
Identities = 18/28 (64%), Positives = 22/28 (78%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ K+K K+KK+KKKKKKKKKKK KK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 38.1 bits (89), Expect = 0.001
Identities = 20/29 (68%), Positives = 22/29 (75%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
E+ K KK+K KKKKKKKKK KK KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 37.7 bits (88), Expect = 0.002
Identities = 18/32 (56%), Positives = 23/32 (71%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ +K K KK+KKKKKKKKKKK KK K++
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 37.7 bits (88), Expect = 0.002
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++ K+K +KK+KKKKKKKKKKK KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131
Score = 36.2 bits (84), Expect = 0.005
Identities = 19/32 (59%), Positives = 24/32 (75%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E +K K+ K+K K+KK+KKKKKKKKKKK
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKT 126
Score = 36.2 bits (84), Expect = 0.006
Identities = 19/34 (55%), Positives = 22/34 (64%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K K+ K+KK+KKKKKKKKKKK KK K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 33.5 bits (77), Expect = 0.045
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 7 ISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKK 47
+ E K ++ +++K KKKK KKKKKK KK KKKK
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 32.4 bits (74), Expect = 0.12
Identities = 19/35 (54%), Positives = 24/35 (68%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
F E+ K K K+K K+KK+KKKKKKKKKK+
Sbjct: 93 FLVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127
Score = 31.2 bits (71), Expect = 0.28
Identities = 15/28 (53%), Positives = 22/28 (78%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKS 59
K+ K+K K+KK+KKKKKKKK++ +K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKK 129
>gnl|CDD|213289 cd05922, FACL_like_6, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 350
Score = 42.6 bits (101), Expect = 1e-04
Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
TGD R+ ++ LY GR D +K RG RV +E+E + V + +V+
Sbjct: 233 LYTGDLVRMDEEGYLYFVGRKDDMIKTRGYRVSPTEIEEVICAHPLVAEAAVI 285
Score = 34.9 bits (81), Expect = 0.043
Identities = 11/18 (61%), Positives = 15/18 (83%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+D+A ++YTSGSTG PK
Sbjct: 1 DDDLAAIIYTSGSTGEPK 18
Score = 33.0 bits (76), Expect = 0.17
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 231 LLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVL 269
L + GR D +K RG RV +E+E + V + +V+
Sbjct: 247 LYFVGRKDDMIKTRGYRVSPTEIEEVICAHPLVAEAAVI 285
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 40.6 bits (95), Expect = 3e-04
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKR 53
KK K KK + K++KKKKK+KKKKKKR
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168
Score = 39.4 bits (92), Expect = 7e-04
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
E++ KKK K++KKKKK+KKKKKK+
Sbjct: 140 HEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Score = 39.4 bits (92), Expect = 7e-04
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
EK KKK + K++KKKKK+KKKKKK+
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170
Score = 39.4 bits (92), Expect = 7e-04
Identities = 17/27 (62%), Positives = 22/27 (81%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRR 54
KK KKK + K++KKKKK+KKKKK+R
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168
Score = 39.0 bits (91), Expect = 0.001
Identities = 16/29 (55%), Positives = 21/29 (72%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
K KK K + K++KKKKK+KKKKKK+
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKRHSP 171
Score = 38.7 bits (90), Expect = 0.001
Identities = 19/37 (51%), Positives = 28/37 (75%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P+ E K +KK+KKKK + K++KKKKK+KKKK++
Sbjct: 131 PSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKK 167
Score = 37.9 bits (88), Expect = 0.002
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
P + E K + K KKKK + K++KKKKK+KK++ S E+P
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174
Score = 34.0 bits (78), Expect = 0.042
Identities = 16/42 (38%), Positives = 25/42 (59%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
++ K +KK KKKK + K++KKKKK + K +P + G
Sbjct: 134 SEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPG 175
>gnl|CDD|213274 cd05906, A_NRPS_TubE_like, The adenylation domain (A domain) of a
family of nonribosomal peptide synthetases (NRPSs)
synthesizing toxins and antitumor agents. The
adenylation (A) domain of NRPS recognizes a specific
amino acid or hydroxy acid and activates it as an
(amino)-acyl adenylate by hydrolysis of ATP. The
activated acyl moiety then forms a thioester to the
enzyme-bound cofactor phosphopantetheine of a peptidyl
carrier protein domain. This family includes NRPSs that
synthesize toxins and antitumor agents; for example,
TubE for Tubulysine, CrpA for cryptophycin, TdiA for
terrequinone A, KtzG for kutzneride, and Vlm1/Vlm2 for
Valinomycin. Nonribosomal peptide synthetases are large
multifunctional enzymes which synthesize many
therapeutically useful peptides. NRPS has a distinct
modular structure in which each module is responsible
for the recognition, activation, and, in some cases,
modification of a single amino acid residue of the final
peptide product. The modules can be subdivided into
domains that catalyze specific biochemical reactions.
Length = 560
Score = 41.5 bits (98), Expect = 4e-04
Identities = 20/53 (37%), Positives = 30/53 (56%), Gaps = 5/53 (9%)
Query: 68 RTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
RT L LR+L+ + P ++ LH + +D A++L TSGSTG+PK
Sbjct: 136 RTRAGLEALRVLAIEELRSAPP-DAPLHP----ARPDDPALLLLTSGSTGVPK 183
Score = 30.3 bits (69), Expect = 1.3
Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 21/92 (22%)
Query: 155 YKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV-------------- 200
++TGD G + D L GR + I G E+EAA+ ++ V
Sbjct: 431 FRTGDLGFLHDGRLTITGREKDMIIINGVNYYNHEIEAAVEQVPGVEPSFTAACAVRDAG 490
Query: 201 ---DKLSVLCYKP--GEVDQVSEVTSEEIRGV 227
D+L + + P D+++ IR
Sbjct: 491 SGTDEL-AVFFVPRYDLDDRLAA-LLRAIRTH 520
>gnl|CDD|223442 COG0365, Acs, Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
[Lipid metabolism].
Length = 528
Score = 41.1 bits (97), Expect = 5e-04
Identities = 10/18 (55%), Positives = 14/18 (77%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
++D +LYTSG+TG PK
Sbjct: 170 ADDPLFLLYTSGTTGKPK 187
Score = 40.7 bits (96), Expect = 7e-04
Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 3/62 (4%)
Query: 142 WSSGDRAIKY--RRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLS 198
W+ +R + R Y+TGD+ +D GR+D +K+ G R+ E+E+ L
Sbjct: 383 WNDPERYKEAYFGRWYRTGDWAERDEDGYFWLHGRSDDVIKVSGKRIGPLEIESVLLAHP 442
Query: 199 EV 200
V
Sbjct: 443 AV 444
Score = 29.2 bits (66), Expect = 3.2
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 224 IRGVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEV 263
R D GR+D +K+ G R+ E+E+ L V
Sbjct: 405 ERDEDGYFWLHGRSDDVIKVSGKRIGPLEIESVLLAHPAV 444
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 39.0 bits (92), Expect = 7e-04
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
KA +K KK +K K K +KKK KK++ K R+
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKKEEPKPRL 99
Score = 37.9 bits (89), Expect = 0.002
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+K K K +KK +K K K +KKK KK+
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
Score = 37.5 bits (88), Expect = 0.002
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKK 50
EKA+KK K K K +KKK KK++ K +
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 37.1 bits (87), Expect = 0.003
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E + +K + K +K K K +KKK KK++ K R
Sbjct: 67 ELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 34.4 bits (80), Expect = 0.023
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
++E +K +KK +K K K +KKK KK++ +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 34.0 bits (79), Expect = 0.032
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
KK+ +K +KK +K K K +KKK ++ K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 32.1 bits (74), Expect = 0.15
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKK 48
+E+ + K +K K K +KKK KK++ K +
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 31.3 bits (72), Expect = 0.32
Identities = 13/42 (30%), Positives = 22/42 (52%)
Query: 6 TISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKK 47
++ DK + A + EK +K K +KKK KK++ K +
Sbjct: 57 SLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
>gnl|CDD|223395 COG0318, CaiC, Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases
II [Lipid metabolism / Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 534
Score = 39.8 bits (93), Expect = 0.001
Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
Query: 146 DRAIKYRRMYKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
A +TGD G + +D LY GR + G + E+EA L++ V
Sbjct: 394 AEAFDEDGWLRTGDLGYVDEDGYLYIVGRLKDLIISGGENIYPEEIEAVLAEHPAVA 450
Score = 37.8 bits (88), Expect = 0.006
Identities = 11/19 (57%), Positives = 16/19 (84%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+D+A +LYTSG+TG+PK
Sbjct: 169 DPDDLAFLLYTSGTTGLPK 187
Score = 27.8 bits (62), Expect = 9.5
Identities = 8/30 (26%), Positives = 13/30 (43%)
Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEVD 264
GR + G + E+EA L++ V
Sbjct: 421 GRLKDLIISGGENIYPEEIEAVLAEHPAVA 450
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 39.0 bits (92), Expect = 0.002
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+K K +K +KKK+K++ K K K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 37.9 bits (89), Expect = 0.004
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K + K K +KKK+K++ K K K K R
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 36.3 bits (85), Expect = 0.013
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
K+ K K +KKK+K++ K K K K R+
Sbjct: 59 ALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92
Score = 35.6 bits (83), Expect = 0.025
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ KK+ K K +KKK+K++ K K K +
Sbjct: 57 KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 32.5 bits (75), Expect = 0.26
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
KK+ K K +KKK+K++ K + +K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 29.0 bits (66), Expect = 3.0
Identities = 7/23 (30%), Positives = 13/23 (56%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKK 43
+ + + KK+K + K K K K +
Sbjct: 69 WHKAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 37.7 bits (88), Expect = 0.002
Identities = 19/31 (61%), Positives = 24/31 (77%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ EK KK+ K++KKK+KKKKKKK KK KK
Sbjct: 79 DEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 36.6 bits (85), Expect = 0.005
Identities = 17/30 (56%), Positives = 24/30 (80%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ K ++K KK+ K++KKK+KKKKKKK KK
Sbjct: 76 KRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105
Score = 34.6 bits (80), Expect = 0.020
Identities = 16/31 (51%), Positives = 25/31 (80%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E+++ ++K KK+ K++KKK+KKKKKKK K
Sbjct: 74 EKKRKDEEKTAKKRAKRQKKKQKKKKKKKAK 104
Score = 34.3 bits (79), Expect = 0.028
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+++ K K + K++KKK+KKKKKKK KK +
Sbjct: 76 KRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNK 108
Score = 33.9 bits (78), Expect = 0.034
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
R K +KKK KKKKKKK KK KK++K+ K
Sbjct: 87 RAKRQKKKQKKKKKKKAKKGNKKEEKEGSKS 117
Score = 33.5 bits (77), Expect = 0.044
Identities = 17/31 (54%), Positives = 25/31 (80%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+RE+ K+K +K KK+ K++KKK+KKKKKK
Sbjct: 71 KREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101
Score = 32.7 bits (75), Expect = 0.080
Identities = 15/38 (39%), Positives = 29/38 (76%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
F++++ +KK+ ++K KK+ K++KKK+KKKK++ K
Sbjct: 68 FQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105
Score = 29.2 bits (66), Expect = 1.3
Identities = 16/29 (55%), Positives = 24/29 (82%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
+++AK++K K+KKKKKKK KK KK++K
Sbjct: 84 AKKRAKRQKKKQKKKKKKKAKKGNKKEEK 112
Score = 28.9 bits (65), Expect = 2.1
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
R+K K+KK KKKK KK KK++K+ K
Sbjct: 90 RQKKKQKKKKKKKAKKGNKKEEKEGSKSS 118
Score = 28.1 bits (63), Expect = 3.6
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E E+ ++K+ +KK+K ++K KK+ K++KKK++ K
Sbjct: 64 EDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101
Score = 26.5 bits (59), Expect = 9.8
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKK 49
+++K KKKK KK KK KK++K+ K
Sbjct: 91 QKKKQKKKKKKKAKKGNKKEEKEGSKSS 118
>gnl|CDD|215217 PLN02387, PLN02387, long-chain-fatty-acid-CoA ligase family
protein.
Length = 696
Score = 39.3 bits (92), Expect = 0.002
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 95 HADNNNEQSNDIAIVLYTSGSTGIPK 120
D + NDIA+++YTSGSTG+PK
Sbjct: 241 PVDPDLPSPNDIAVIMYTSGSTGLPK 266
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 38.9 bits (91), Expect = 0.003
Identities = 16/46 (34%), Positives = 30/46 (65%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
K+ +REK +K+ K++K+ +K KK+++KKKK+ ++ AK
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591
>gnl|CDD|213297 cd05931, FAAL, Fatty acyl-AMP ligase (FAAL). FAAL belongs to the
class I adenylate forming enzyme family and is
homologous to fatty acyl-coenzyme A (CoA) ligases
(FACLs). However, FAALs produce only the acyl adenylate
and are unable to perform the thioester-forming
reaction, while FACLs perform a two-step catalytic
reaction; AMP ligation followed by CoA ligation using
ATP and CoA as cofactors. FAALs have insertion motifs
between the N-terminal and C-terminal subdomains that
distinguish them from the FACLs. This insertion motif
precludes the binding of CoA, thus preventing CoA
ligation. It has been suggested that the acyl adenylates
serve as substrates for multifunctional polyketide
synthases to permit synthesis of complex lipids such as
phthiocerol dimycocerosate, sulfolipids, mycolic acids,
and mycobactin.
Length = 547
Score = 38.7 bits (91), Expect = 0.003
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+DIA + YTSGSTG PK
Sbjct: 147 PDDIAFLQYTSGSTGAPK 164
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 37.6 bits (88), Expect = 0.004
Identities = 8/57 (14%), Positives = 26/57 (45%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRL 78
+ +A++ + + KK++ +K+K ++++K + + P + P +
Sbjct: 108 AKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDI 164
Score = 32.2 bits (74), Expect = 0.22
Identities = 12/36 (33%), Positives = 22/36 (61%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
A E+EKA +++ K K K +KK+K + +K + +
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158
Score = 31.8 bits (73), Expect = 0.27
Identities = 5/32 (15%), Positives = 21/32 (65%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ K + ++ +++ KK++ +K+K +++++
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKP 138
Score = 31.8 bits (73), Expect = 0.34
Identities = 7/38 (18%), Positives = 18/38 (47%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
++ ++ +K K K +KK+K + +K + +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
Score = 29.1 bits (66), Expect = 2.1
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E+AK K ++ +++ KK++ +K+K RR
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRR 134
Score = 28.4 bits (64), Expect = 3.7
Identities = 8/35 (22%), Positives = 18/35 (51%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E+ +++K K K +KK+K + +K + +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 36.5 bits (85), Expect = 0.004
Identities = 15/34 (44%), Positives = 27/34 (79%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
ER K +KKK K+KK+++K++K++ K++KKK +
Sbjct: 95 HERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAK 128
Score = 29.9 bits (68), Expect = 0.64
Identities = 11/29 (37%), Positives = 24/29 (82%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
K + + K++KKK+ K+KK+++K++KR++
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQL 120
Score = 29.9 bits (68), Expect = 0.80
Identities = 12/34 (35%), Positives = 28/34 (82%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+A K ++++ K++KKK+ K+KK+++K+++R L +
Sbjct: 89 QALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQ 122
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 38.2 bits (89), Expect = 0.005
Identities = 14/42 (33%), Positives = 21/42 (50%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
FE + A + + K K K KK +K KK K ++I A +
Sbjct: 760 KVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAAT 801
Score = 32.8 bits (75), Expect = 0.28
Identities = 15/37 (40%), Positives = 17/37 (45%), Gaps = 4/37 (10%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKK----KKKR 53
F EK K K KK +K KK K +K K KR
Sbjct: 769 FRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKR 805
Score = 31.3 bits (71), Expect = 0.73
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
KK K K KK +K KK K +K AK
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAK 804
Score = 30.9 bits (70), Expect = 0.86
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+K P +++ K KK K +K K K+ KKK
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811
Score = 30.5 bits (69), Expect = 1.4
Identities = 11/38 (28%), Positives = 16/38 (42%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E++ K K +K KK K +K K + AK
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKK 809
>gnl|CDD|213271 cd05903, CHC_CoA_lg, Cyclohexanecarboxylate-CoA ligase (also called
cyclohex-1-ene-1-carboxylate:CoA ligase).
Cyclohexanecarboxylate-CoA ligase activates the
aliphatic ring compound, cyclohexanecarboxylate, for
degradation. It catalyzes the synthesis of
cyclohexanecarboxylate-CoA thioesters in a two-step
reaction involving the formation of
cyclohexanecarboxylate-AMP anhydride, followed by the
nucleophilic substitution of AMP by CoA.
Length = 437
Score = 38.0 bits (89), Expect = 0.005
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 101 EQSNDIAIVLYTSGSTGIPK 120
+D+A++LYTSG+TG PK
Sbjct: 90 AMPDDVALLLYTSGTTGEPK 109
Score = 29.5 bits (67), Expect = 2.1
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 5/54 (9%)
Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRG--NRVDLSEVEAALSKLSEVDKLSV 205
++TGD GR+ D L GR + IRG N + E+E L + V +++V
Sbjct: 318 FRTGDLGRLDADGYLRITGRK-KDIIIRGGEN-ISAREIEDLLLRHPAVAEVAV 369
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 37.0 bits (86), Expect = 0.005
Identities = 19/52 (36%), Positives = 29/52 (55%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
E E +++K +KKKKK+ KK+KK+KK KK+K S + K +
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 36.2 bits (84), Expect = 0.010
Identities = 20/46 (43%), Positives = 28/46 (60%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
KV A E E+ K+KK KK+ KK+KK+KK KK+K + + K
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 35.6 bits (82), Expect = 0.006
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKK 44
P P E+ K KKKK+KK KK KK KK
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 35.6 bits (82), Expect = 0.007
Identities = 15/22 (68%), Positives = 15/22 (68%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKK 51
K KKKKKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 34.5 bits (79), Expect = 0.016
Identities = 15/22 (68%), Positives = 15/22 (68%)
Query: 31 NKKKKKKKKKKKKKKKKKKKKK 52
K KKKKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKK 119
Score = 34.5 bits (79), Expect = 0.018
Identities = 16/32 (50%), Positives = 17/32 (53%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
P E +K K KKKK KK KK KK KK
Sbjct: 89 PDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 33.3 bits (76), Expect = 0.043
Identities = 14/20 (70%), Positives = 14/20 (70%)
Query: 33 KKKKKKKKKKKKKKKKKKKK 52
KKKKKK KK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 32.2 bits (73), Expect = 0.12
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 33 KKKKKKKKKKKKKKKKKKKKR 53
K KKKKKK KK KK KK K+
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 30.6 bits (69), Expect = 0.35
Identities = 13/21 (61%), Positives = 15/21 (71%)
Query: 34 KKKKKKKKKKKKKKKKKKKRR 54
K KKKKKK KK KK KK ++
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to
196 amino acids in length.
Length = 149
Score = 35.8 bits (83), Expect = 0.008
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
PFE ++++KK K +K K K K+KK K KR
Sbjct: 7 RLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRA 43
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 37.2 bits (86), Expect = 0.008
Identities = 17/52 (32%), Positives = 24/52 (46%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
P +AK+ K KK + K K K KK KKK + +S + P R+
Sbjct: 136 PISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRS 187
Score = 35.2 bits (81), Expect = 0.032
Identities = 16/50 (32%), Positives = 20/50 (40%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
P +E K KK + K K K KK KKK R + +P K
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKP 189
Score = 32.2 bits (73), Expect = 0.29
Identities = 12/37 (32%), Positives = 17/37 (45%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P + K+ KK KK + K K K KK K++
Sbjct: 134 VPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170
Score = 32.2 bits (73), Expect = 0.34
Identities = 16/46 (34%), Positives = 19/46 (41%)
Query: 9 EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
H I AP E ++ KK K + K K K KK KKK
Sbjct: 130 HHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSN 175
Score = 30.6 bits (69), Expect = 0.82
Identities = 15/42 (35%), Positives = 17/42 (40%)
Query: 9 EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
HH + I K KK K + K K K KK KKK
Sbjct: 129 MHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170
Score = 29.9 bits (67), Expect = 1.8
Identities = 12/47 (25%), Positives = 19/47 (40%)
Query: 16 IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
+ P + K KK KK + K K K KK ++ + S +
Sbjct: 131 HLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177
Score = 27.9 bits (62), Expect = 7.3
Identities = 9/38 (23%), Positives = 12/38 (31%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
K K KK KKK + K R ++
Sbjct: 158 VPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQD 195
>gnl|CDD|233770 TIGR02188, Ac_CoA_lig_AcsA, acetate--CoA ligase. This model
describes acetate-CoA ligase (EC 6.2.1.1), also called
acetyl-CoA synthetase and acetyl-activating enzyme. It
catalyzes the reaction ATP + acetate + CoA = AMP +
diphosphate + acetyl-CoA and belongs to the family of
AMP-binding enzymes described by pfam00501.
Length = 625
Score = 36.8 bits (86), Expect = 0.011
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 7/49 (14%)
Query: 150 KYRRMYKTGDYGRIVDD----ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
+ Y TGD R D I GR D + + G+R+ +E+E+AL
Sbjct: 472 PFPGYYFTGDGARRDKDGYIWIT---GRVDDVINVSGHRLGTAEIESAL 517
Score = 30.7 bits (70), Expect = 1.1
Identities = 20/54 (37%), Positives = 25/54 (46%), Gaps = 12/54 (22%)
Query: 103 SNDIAIVLYTSGSTGIPKEYRRMYKTGDY-------GRIVDDIH---RCWSSGD 146
S D +LYTSGSTG PK ++ TG Y + V DI W + D
Sbjct: 235 SEDPLFILYTSGSTGKPKGV--LHTTGGYLLYAAMTMKYVFDIKDGDIFWCTAD 286
>gnl|CDD|213293 cd05927, LC-FACS_euk, Eukaryotic long-chain fatty acid CoA
synthetase (LC-FACS). The members of this family are
eukaryotic fatty acid CoA synthetases that activate
fatty acids with chain lengths of 12 to 20. LC-FACS
catalyzes the formation of fatty acyl-CoA in a two-step
reaction: the formation of a fatty acyl-AMP molecule as
an intermediate, and the formation of a fatty acyl-CoA.
This is a required step before free fatty acids can
participate in most catabolic and anabolic reactions.
Organisms tend to have multiple isoforms of LC-FACS
genes with multiple splice variants. For example, nine
genes are found in Arabidopsis and six genes are
expressed in mammalian cells.
Length = 539
Score = 36.8 bits (86), Expect = 0.011
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
D+A ++YTSG+TG PK
Sbjct: 111 EDLATIMYTSGTTGNPK 127
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 36.8 bits (86), Expect = 0.012
Identities = 13/31 (41%), Positives = 22/31 (70%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
ER+K +KK+ K +KK +K++ +K KKK +
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAAAKKKAE 440
Score = 35.3 bits (82), Expect = 0.041
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
+ KK + K++K +KK +K++ +K KK+ A
Sbjct: 409 AERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444
Score = 34.1 bits (79), Expect = 0.092
Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 25 KAKKKKNKKKKKKKKKKKKK---KKKKKKKKRRILAKSCENPAYKGRTV 70
A++KK +KK++K +KK +K +K KKK AK + P + + V
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 32.2 bits (74), Expect = 0.32
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
R+K +K + K +K++ +K KKK + K+
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 31.4 bits (72), Expect = 0.61
Identities = 12/31 (38%), Positives = 17/31 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E+E+A+K KKK + KK K + KK
Sbjct: 426 EKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 31.1 bits (71), Expect = 0.82
Identities = 9/30 (30%), Positives = 15/30 (50%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ + K +K + +K KKK + KK K
Sbjct: 420 KAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
Score = 30.3 bits (69), Expect = 1.3
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
+KA+K++ +K KKK + KK K
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450
Score = 30.3 bits (69), Expect = 1.4
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ KK +KK++K +KK +K++ +K
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAA 434
Score = 30.3 bits (69), Expect = 1.4
Identities = 11/34 (32%), Positives = 16/34 (47%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
A + EK + +K KKK + KK K + K
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
Score = 28.0 bits (63), Expect = 8.4
Identities = 13/47 (27%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
Query: 12 FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
DK P E E+ + + ++KK +KK++K +KK + A+
Sbjct: 388 HDK---PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAE 431
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 35.8 bits (83), Expect = 0.013
Identities = 18/31 (58%), Positives = 20/31 (64%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E E+ +K K KKKK KKKK K K KK KK
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKD 107
Score = 34.7 bits (80), Expect = 0.028
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+++K+KKKK+K K KK KK K +KK +K+
Sbjct: 87 KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 34.3 bits (79), Expect = 0.035
Identities = 14/38 (36%), Positives = 18/38 (47%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
++K KKK K K KK KK K +KK +K
Sbjct: 86 WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 33.9 bits (78), Expect = 0.050
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+E ++ K K KK KKKK K K KK KK K
Sbjct: 78 YEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
Score = 33.9 bits (78), Expect = 0.055
Identities = 17/31 (54%), Positives = 19/31 (61%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E++K K KK K KKKK K K KK KK K
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKS 110
Score = 33.9 bits (78), Expect = 0.058
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
++ K KK K+K KK KK K +KK +K+ + ++
Sbjct: 88 KKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 33.5 bits (77), Expect = 0.075
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++ +K K KK K K K KK KK K +KK +
Sbjct: 83 KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 32.4 bits (74), Expect = 0.18
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+++ KKK KKKK K K KK KK K +
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111
Score = 32.0 bits (73), Expect = 0.23
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++ K KKKK+KKKK K K KK KK K +K
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112
Score = 31.6 bits (72), Expect = 0.31
Identities = 15/31 (48%), Positives = 17/31 (54%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ K KK K KK K K KK KK K +KK
Sbjct: 85 KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 31.2 bits (71), Expect = 0.38
Identities = 18/31 (58%), Positives = 21/31 (67%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E EK KK+ +K+K K KKKK KKKK K K
Sbjct: 70 EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKD 100
Score = 31.2 bits (71), Expect = 0.38
Identities = 18/35 (51%), Positives = 19/35 (54%), Gaps = 4/35 (11%)
Query: 22 EREKAKKKKN----KKKKKKKKKKKKKKKKKKKKK 52
E K KK +K K KKKK KKKK K K KK
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102
Score = 30.8 bits (70), Expect = 0.53
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+++K K KK+ KK K +KK +K+ + K +
Sbjct: 94 KKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 30.4 bits (69), Expect = 0.77
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
++K K K+KK KK K +KK +K+ + K L KS
Sbjct: 91 SKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKS 128
Score = 29.7 bits (67), Expect = 1.5
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 7/49 (14%)
Query: 11 HFDKVIIPAPFEREKAKKKKNKKKKKK-------KKKKKKKKKKKKKKK 52
HF I A + K KKK+ ++ +K K+K K KKKK KKKK
Sbjct: 47 HFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKK 95
Score = 29.3 bits (66), Expect = 1.6
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
+K KK +K +KK +K+ + K + K L+ E
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSE 138
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 35.8 bits (83), Expect = 0.016
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
E+E ++++ KKKK+ K K K+ KKKKKK AKS + A
Sbjct: 75 EKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAA 117
Score = 34.3 bits (79), Expect = 0.045
Identities = 18/55 (32%), Positives = 25/55 (45%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQL 76
+RE+ KKK + K K K+ KKKKKK + + P K + P L
Sbjct: 79 QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTL 133
Score = 31.6 bits (72), Expect = 0.41
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E EK +++ + KKKK+ K K K+ KKKK++
Sbjct: 72 EEGEKELQREERLKKKKRVKTKAYKEPTKKKKKK 105
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 35.8 bits (83), Expect = 0.017
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E +KK K K++ KKKK K+ K K K
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273
Score = 35.8 bits (83), Expect = 0.019
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+A+KK+ K++ KKKK K+ K K KK
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKALKK 274
Score = 35.4 bits (82), Expect = 0.028
Identities = 13/35 (37%), Positives = 19/35 (54%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
+ A R + K+K ++ KKKK K+ K K KK
Sbjct: 241 VEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKV 275
Score = 34.2 bits (79), Expect = 0.069
Identities = 13/29 (44%), Positives = 16/29 (55%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
+E+ KKKK K+ K K KK K KK
Sbjct: 254 SKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 33.5 bits (77), Expect = 0.11
Identities = 11/33 (33%), Positives = 15/33 (45%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
+ ++ KKK K+ K K KK K KK
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 33.1 bits (76), Expect = 0.15
Identities = 12/35 (34%), Positives = 15/35 (42%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ K + KK K K+ K K KK K KK
Sbjct: 248 AEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 32.3 bits (74), Expect = 0.23
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E+++ K++ KKKK K+ K K KK K
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALKKVVAKGM 280
Score = 31.5 bits (72), Expect = 0.45
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+ +K +K K++ KKKK K+ K K ++++AK
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278
Score = 30.8 bits (70), Expect = 0.86
Identities = 11/29 (37%), Positives = 20/29 (68%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+A + + +KK+K K++ KKKK K+ K +
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVK 270
Score = 29.2 bits (66), Expect = 2.7
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+ +KK+K K++ KKKK K+ K + L K
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGVKALKK 274
>gnl|CDD|213286 cd05919, BCL_like, Benzoate CoA ligase (BCL) and similar adenylate
forming enzymes. This family contains benzoate CoA
ligase (BCL) and related ligases that catalyze the
acylation of benzoate derivatives, 2-aminobenzoate and
4-hydroxybenzoate. Aromatic compounds represent the
second most abundant class of organic carbon compounds
after carbohydrates. Xenobiotic aromatic compounds are
also a major class of man-made pollutants. Some bacteria
use benzoate as the sole source of carbon and energy
through benzoate degradation. Benzoate degradation
starts with its activation to benzoyl-CoA by benzoate
CoA ligase. The reaction catalyzed by benzoate CoA
ligase proceeds via a two-step process; the first
ATP-dependent step forms an acyl-AMP intermediate, and
the second step forms the acyl-CoA ester with release of
the AMP.
Length = 436
Score = 36.2 bits (84), Expect = 0.017
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 155 YKTGD-YGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
+TGD + R D Y+GR D +K+ G V EVEAAL + V + +V
Sbjct: 316 LRTGDRFSRDADGWYRYQGRADDMIKVSGQWVSPLEVEAALGEHPAVAEAAV 367
Score = 35.4 bits (82), Expect = 0.034
Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 9/56 (16%)
Query: 105 DIAIVLYTSGSTGIPK----EYRRMYKTGD-YGRIVDDIHRCWSSGDRAIKYRRMY 155
DIA LYTSG+TG PK +R + + R + + GDR +++
Sbjct: 92 DIAYWLYTSGTTGKPKGVMHRHRDPLTFAEAFARELLGL----QPGDRIFSSSKLF 143
Score = 31.5 bits (72), Expect = 0.48
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 233 YEGRTDSQVKIRGNRVDLSEVEAALS 258
Y+GR D +K+ G V EVEAAL
Sbjct: 332 YQGRADDMIKVSGQWVSPLEVEAALG 357
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 34.6 bits (80), Expect = 0.018
Identities = 9/32 (28%), Positives = 23/32 (71%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
R + ++ K + +K+K+ +K ++KK K+++K +
Sbjct: 97 RLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128
Score = 34.2 bits (79), Expect = 0.022
Identities = 10/31 (32%), Positives = 23/31 (74%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
R + K++ +K+K+ +K ++KK K+++K+K
Sbjct: 98 LRRERTKERAEKEKRTRKNREKKFKRRQKEK 128
Score = 33.8 bits (78), Expect = 0.031
Identities = 10/32 (31%), Positives = 22/32 (68%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
R + ++ + K++ +K+K+ +K ++KK KRR
Sbjct: 93 RIALRLRRERTKERAEKEKRTRKNREKKFKRR 124
Score = 33.0 bits (76), Expect = 0.072
Identities = 12/30 (40%), Positives = 23/30 (76%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
ER K + +K K+ +K ++KK K+++K+K+K
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 32.2 bits (74), Expect = 0.11
Identities = 11/31 (35%), Positives = 25/31 (80%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
RE+ K++ K+K+ +K ++KK K+++K+K++
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 31.1 bits (71), Expect = 0.32
Identities = 10/37 (27%), Positives = 24/37 (64%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
A E+E ++K+ KKK+K+ KK++ + ++++ +
Sbjct: 68 NAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTK 104
Score = 30.3 bits (69), Expect = 0.52
Identities = 7/31 (22%), Positives = 22/31 (70%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ ++++ K++ +K+K+ +K ++KK K++
Sbjct: 94 IALRLRRERTKERAEKEKRTRKNREKKFKRR 124
Score = 29.5 bits (67), Expect = 0.95
Identities = 11/45 (24%), Positives = 22/45 (48%)
Query: 10 HHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K E + K+KK K+K+ KK++ + ++++ K R
Sbjct: 62 IDLKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKER 106
Score = 29.2 bits (66), Expect = 1.2
Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKK--KKRRILAKSCE 61
KK N K +K+ ++K+KKKK+K+ KKRRI +
Sbjct: 65 KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRR 100
Score = 27.2 bits (61), Expect = 5.7
Identities = 9/42 (21%), Positives = 26/42 (61%), Gaps = 8/42 (19%)
Query: 22 EREKAKKKKNKKK--------KKKKKKKKKKKKKKKKKKRRI 55
E++K +K+ KK+ ++ K++ +K+K+ +K ++++
Sbjct: 80 EKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKF 121
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 33.5 bits (77), Expect = 0.019
Identities = 13/29 (44%), Positives = 15/29 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
E + KKKK KKKKK + K KK
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKAT 47
Score = 30.8 bits (70), Expect = 0.19
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKK 52
K + + KKKKKKKKKK + K
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAK 42
Score = 30.4 bits (69), Expect = 0.25
Identities = 12/21 (57%), Positives = 13/21 (61%)
Query: 32 KKKKKKKKKKKKKKKKKKKKK 52
KKKKKKKKKK + K K
Sbjct: 24 KKKKKKKKKKAEDTAATAKAK 44
Score = 29.6 bits (67), Expect = 0.38
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 32 KKKKKKKKKKKKKKKKKKKKK 52
KKKKKKKKKK + K
Sbjct: 23 VKKKKKKKKKKAEDTAATAKA 43
Score = 26.6 bits (59), Expect = 5.3
Identities = 10/36 (27%), Positives = 15/36 (41%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
A ++K KKKK K + K KK + +
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEG 54
Score = 26.2 bits (58), Expect = 6.6
Identities = 10/32 (31%), Positives = 13/32 (40%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+K KKKK KK + K KK +
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKKATAEDVSEGA 55
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 35.8 bits (83), Expect = 0.019
Identities = 19/36 (52%), Positives = 22/36 (61%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K KKKK KKKK KK K KK K++K R+ AK
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAK 287
Score = 31.2 bits (71), Expect = 0.62
Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 4/41 (9%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKK----KKKKKKKKKKKKRR 54
P+ +++K KKKK KK K KK ++K + K KK +RR
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR 292
Score = 29.3 bits (66), Expect = 2.8
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
P + +K KKKK K KK K KK K++K + + A+
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQ 290
Score = 28.9 bits (65), Expect = 3.3
Identities = 20/52 (38%), Positives = 24/52 (46%), Gaps = 5/52 (9%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLR 77
A + N K KKKKKKKKK KK K +++ K K R Q R
Sbjct: 246 AAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQ-----RKARVKAKKAQRR 292
Score = 27.3 bits (61), Expect = 9.0
Identities = 10/38 (26%), Positives = 22/38 (57%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+K K KK K++K + K KK +++++ +++I
Sbjct: 262 KKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKI 299
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 34.7 bits (80), Expect = 0.021
Identities = 10/40 (25%), Positives = 22/40 (55%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ E+ +A+K+ K ++++ +K +K+ KK KK
Sbjct: 86 HTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 34.7 bits (80), Expect = 0.024
Identities = 12/37 (32%), Positives = 26/37 (70%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E + ++K+ K+ ++K+ K KK+KK+KK++++ K
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129
Score = 32.8 bits (75), Expect = 0.11
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
+ +K K+ ++K+ K KK+KK+KK+KK ++ K
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKK 134
Score = 32.0 bits (73), Expect = 0.21
Identities = 17/43 (39%), Positives = 27/43 (62%)
Query: 16 IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+ P E+E+ + K ++K+ K KK+KK+KK+KK LAK
Sbjct: 90 LFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAK 132
Score = 30.9 bits (70), Expect = 0.46
Identities = 15/35 (42%), Positives = 21/35 (60%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E E+ + K KK+KK+KK+KK +K KKK
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKS 135
Score = 29.7 bits (67), Expect = 0.98
Identities = 14/45 (31%), Positives = 22/45 (48%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
+E +K+ K KK+KK+KK+KK +K KK+
Sbjct: 99 QKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTT 143
Score = 27.8 bits (62), Expect = 5.0
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
+K +K+ K+ ++K+ K KK+KK + K E A K T
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKST 136
Score = 27.4 bits (61), Expect = 6.6
Identities = 14/46 (30%), Positives = 21/46 (45%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
+ K +KK+ K+KK +K KKK K K + K +G
Sbjct: 112 IKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 33.0 bits (76), Expect = 0.026
Identities = 14/32 (43%), Positives = 24/32 (75%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K+K+N+K + K+ KK+ KK +KKKR+++ K
Sbjct: 5 KRKRNEKLRAKRAKKRAAKKAARKKKRKLIFK 36
Score = 32.6 bits (75), Expect = 0.029
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
KK+K +K + K+ KK+ KK +KK+R L
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIF 35
Score = 30.7 bits (70), Expect = 0.13
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K + + KK+ KK +KKK+K KR
Sbjct: 9 NEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
Score = 28.7 bits (65), Expect = 0.64
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
EK + K+ KK+ KK +KKK+K K+
Sbjct: 9 NEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
Score = 28.3 bits (64), Expect = 0.96
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+R + + K KK+ KK +KKK+K K+
Sbjct: 7 KRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
Score = 28.3 bits (64), Expect = 1.1
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
R++ +K + K+ KK+ KK +KKK+K +R
Sbjct: 6 RKRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
Score = 27.6 bits (62), Expect = 1.8
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E+ +AK+ K + KK +KKK+K K+ +K
Sbjct: 9 NEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40
>gnl|CDD|236235 PRK08314, PRK08314, long-chain-fatty-acid--CoA ligase; Validated.
Length = 546
Score = 35.7 bits (83), Expect = 0.026
Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 7/50 (14%)
Query: 71 LSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
L+ + +L+ P D D+A++ YTSG+TG+PK
Sbjct: 164 LAPGGVVAWKEALAAGLAPPPHTAGPD-------DLAVLPYTSGTTGVPK 206
>gnl|CDD|213277 cd05909, AAS_C, C-terminal domain of the acyl-acyl carrier protein
synthetase (also called 2-acylglycerophosphoethanolamine
acyltransferase, Aas). Acyl-acyl carrier protein
synthase (Aas) is a membrane protein responsible for a
minor pathway of incorporating exogenous fatty acids
into membrane phospholipids. Its in vitro activity is
characterized by the ligation of free fatty acids
between 8 and 18 carbons in length to the acyl carrier
protein sulfydryl group (ACP-SH) in the presence of ATP
and Mg2+. However, its in vivo function is as a
2-acylglycerophosphoethanolamine (2-acyl-GPE)
acyltransferase. The reaction occurs in two steps: the
acyl chain is first esterified to acyl carrier protein
(ACP) via a thioester bond, followed by a second step
where the acyl chain is transferred to a
2-acyllysophospholipid, thus completing the
transacylation reaction. This model represents the
C-terminal domain of the enzyme, which belongs to the
class I adenylate-forming enzyme family, including
acyl-CoA synthetases.
Length = 489
Score = 35.7 bits (83), Expect = 0.026
Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 7/82 (8%)
Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKL----SVLCYK 209
Y TGD G+I +D L GR KI G V L+ VE LS++ D L V K
Sbjct: 370 YDTGDIGKIDEDGFLTIVGRLKRFAKIGGEMVSLTAVEELLSEILPEDSLHAAVEVPDEK 429
Query: 210 PGEVDQVSEVTSEEIRGVDDIL 231
GE ++ +T+ E +++
Sbjct: 430 KGE--KIVLLTTTEDLDPEELN 449
Score = 33.7 bits (78), Expect = 0.12
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 92 SMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
++L + +D A++L+TSGS G+PK
Sbjct: 133 ALLRLFLAGAKPDDPAVILFTSGSEGLPK 161
>gnl|CDD|213302 cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA
synthetases similar to Escherichia coli FadD. This
subfamily of the AMP-forming adenylation family contains
Escherichia coli FadD and similar prokaryotic fatty acid
CoA synthetases. FadD was characterized as a long-chain
fatty acid CoA synthetase. The gene fadD is regulated by
the fatty acid regulatory protein FadR. Fatty acid CoA
synthetase catalyzes the formation of fatty acyl-CoA in
a two-step reaction: the formation of a fatty acyl-AMP
molecule as an intermediate, followed by the formation
of a fatty acyl-CoA. This is a required step before free
fatty acids can participate in most catabolic and
anabolic reactions.
Length = 468
Score = 35.6 bits (83), Expect = 0.027
Identities = 9/17 (52%), Positives = 15/17 (88%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D+A++ YT G+TG+PK
Sbjct: 125 DDLAVLQYTGGTTGVPK 141
>gnl|CDD|235730 PRK06187, PRK06187, long-chain-fatty-acid--CoA ligase; Validated.
Length = 521
Score = 35.5 bits (83), Expect = 0.029
Identities = 12/17 (70%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
ND A +LYTSG+TG PK
Sbjct: 167 NDAAAMLYTSGTTGHPK 183
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 35.6 bits (82), Expect = 0.030
Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 3/48 (6%)
Query: 20 PFEREKAKKKK---NKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
P+E++ KKK K K K KKK K K KK K+R +PA
Sbjct: 746 PYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPA 793
Score = 29.4 bits (66), Expect = 2.9
Identities = 16/56 (28%), Positives = 20/56 (35%), Gaps = 1/56 (1%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLL 79
K +K K KKK K K K KKK + +K G P L +
Sbjct: 743 PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR-AGPLYTPSPALAAM 797
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 33.2 bits (76), Expect = 0.032
Identities = 16/28 (57%), Positives = 18/28 (64%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
K K KK KKKKKKKKKK K K++
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVT 38
Score = 31.6 bits (72), Expect = 0.11
Identities = 17/36 (47%), Positives = 20/36 (55%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
K K KK KKKKKKKKKK K K++ + E
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKS 46
Score = 30.9 bits (70), Expect = 0.20
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
K KK KKKKKKKKKK K K++ +K
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEK 40
Score = 29.7 bits (67), Expect = 0.50
Identities = 15/33 (45%), Positives = 18/33 (54%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
K K K KKKKKKKKKK K K++ +
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEE 43
Score = 28.9 bits (65), Expect = 0.91
Identities = 13/28 (46%), Positives = 17/28 (60%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
K KKKKKKKKK K K++ +K+
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKE 41
Score = 28.5 bits (64), Expect = 1.2
Identities = 15/35 (42%), Positives = 23/35 (65%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
K KK KKKKKKK K K++ +K+++ + A+S
Sbjct: 16 KIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50
Score = 28.5 bits (64), Expect = 1.4
Identities = 16/25 (64%), Positives = 16/25 (64%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKK 52
K K K KK KKKKKKKKKK K
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNK 31
Score = 27.8 bits (62), Expect = 2.5
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K K KKKKKKKK K K++ +K++
Sbjct: 15 KKIDVKKKKKKKKKKNKSKEEVVTEKEEE 43
Score = 26.6 bits (59), Expect = 6.3
Identities = 16/61 (26%), Positives = 26/61 (42%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLS 84
K KKKK KKK K K++ +K++++K K E + + L A +
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNLTEAERA 79
Query: 85 H 85
Sbjct: 80 F 80
>gnl|CDD|213275 cd05907, VL_LC_FACS_like, Long-chain fatty acid CoA synthetases and
Bubblegum-like very long-chain fatty acid CoA
synthetases. This family includes long-chain fatty acid
(C12-C20) CoA synthetases and Bubblegum-like very
long-chain (>C20) fatty acid CoA synthetases. FACS
catalyzes the formation of fatty acyl-CoA in a two-step
reaction: the formation of a fatty acyl-AMP molecule as
an intermediate, and the formation of a fatty acyl-CoA.
Eukaryotes generally have multiple isoforms of LC-FACS
genes with multiple splice variants. For example, nine
genes are found in Arabidopsis and six genes are
expressed in mammalian cells. Drosophila melanogaster
mutant bubblegum (BGM) have elevated levels of
very-long-chain fatty acids (VLCFA) caused by a
defective gene later named bubblegum. The human homolog
(hsBG) of bubblegum has been characterized as a very
long chain fatty acid CoA synthetase that functions
specifically in the brain; hsBG may play a central role
in brain VLCFA metabolism and myelinogenesis. Free fatty
acids must be "activated" to their CoA thioesters before
participating in most catabolic and anabolic reactions.
Length = 456
Score = 35.2 bits (82), Expect = 0.033
Identities = 10/19 (52%), Positives = 16/19 (84%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+ +D+A ++YTSG+TG PK
Sbjct: 86 KPDDLATLIYTSGTTGNPK 104
>gnl|CDD|236108 PRK07824, PRK07824, O-succinylbenzoic acid--CoA ligase;
Provisional.
Length = 358
Score = 35.0 bits (81), Expect = 0.034
Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 18/60 (30%)
Query: 66 KGRTVLSLP-----QLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
+ +L +P + LL +L GE + +D+A+V+ TSG+TG PK
Sbjct: 5 RAPALLPVPAQDERRAALLRDALR----VGEPI---------DDDVALVVATSGTTGTPK 51
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 35.5 bits (83), Expect = 0.036
Identities = 8/36 (22%), Positives = 16/36 (44%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
EK + + K KK K +K K++ + ++
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
Score = 35.2 bits (82), Expect = 0.045
Identities = 8/37 (21%), Positives = 15/37 (40%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E+ + + K KK K +K K++ +A
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVAS 852
Score = 35.2 bits (82), Expect = 0.047
Identities = 7/29 (24%), Positives = 12/29 (41%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ KK K +K K++ +K K
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 35.2 bits (82), Expect = 0.049
Identities = 7/30 (23%), Positives = 12/30 (40%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ K KK K +K K++ +K
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
Score = 34.0 bits (79), Expect = 0.089
Identities = 5/29 (17%), Positives = 10/29 (34%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K K +K K++ +K +
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 34.0 bits (79), Expect = 0.11
Identities = 8/29 (27%), Positives = 14/29 (48%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
++ A KK K +K K++ +K K
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 34.0 bits (79), Expect = 0.11
Identities = 8/28 (28%), Positives = 12/28 (42%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
AK KK K +K K++ +K
Sbjct: 828 PAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 33.6 bits (78), Expect = 0.13
Identities = 9/32 (28%), Positives = 14/32 (43%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
AP + KK K +K K++ +K K
Sbjct: 827 APAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 33.2 bits (77), Expect = 0.17
Identities = 7/38 (18%), Positives = 12/38 (31%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
+ K + K KK K +K ++ S
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVAS 852
Score = 32.1 bits (74), Expect = 0.37
Identities = 8/36 (22%), Positives = 11/36 (30%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
R A KK K +K K++ K
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 30.9 bits (71), Expect = 0.86
Identities = 7/36 (19%), Positives = 11/36 (30%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
P K+ KK K +K K++
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVAS 852
Score = 30.5 bits (70), Expect = 1.3
Identities = 7/30 (23%), Positives = 10/30 (33%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
A+K + + K KK K R
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSR 843
Score = 28.2 bits (64), Expect = 7.5
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
PA K K +K K++ +K K
Sbjct: 828 PAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|211788 TIGR03098, ligase_PEP_1, acyl-CoA ligase (AMP-forming), exosortase
A-associated. This group of proteins contains an
AMP-binding domain (pfam00501) associated with acyl
CoA-ligases. These proteins are generally found in
genomes containing the exosortase/PEP-CTERM protein
expoert system , specifically the type 1 variant of this
system described by the Genome Property GenProp0652.
When found in this context they are invariably present
next to a decarboxylase enzyme. A number of sequences
from Burkholderia species also hit this model, but the
genomic context is obviously different. The hypothesis
of a constant substrate for this family is only strong
where the exosortase context is present.
Length = 517
Score = 35.1 bits (81), Expect = 0.036
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+D+A +LYTSGSTG PK
Sbjct: 162 DSDMAAILYTSGSTGRPK 179
Score = 34.8 bits (80), Expect = 0.049
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)
Query: 157 TGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAAL 194
+GD R ++ LY GR D +K G RV +EVE
Sbjct: 400 SGDTVRRDEEGFLYFVGRRDEMIKTSGYRVSPTEVEEVA 438
Score = 30.9 bits (70), Expect = 0.89
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 3/39 (7%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEA---ALSKISEV 263
+ L + GR D +K G RV +EVE A ++E
Sbjct: 409 EGFLYFVGRRDEMIKTSGYRVSPTEVEEVAYATGLVAEA 447
>gnl|CDD|233803 TIGR02262, benz_CoA_lig, benzoate-CoA ligase family. Characterized
members of this protein family include benzoate-CoA
ligase, 4-hydroxybenzoate-CoA ligase,
2-aminobenzoate-CoA ligase, etc. Members are related to
fatty acid and acetate CoA ligases.
Length = 508
Score = 35.2 bits (81), Expect = 0.038
Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 18/98 (18%)
Query: 114 GSTGIPKEYRRMYKTGDYGRIVDD-------IHRCWSS----GDRAIKYRRMY-----KT 157
G++G P R+ GD G+ V I S+ +RA K R + ++
Sbjct: 330 GTSGKPVPGYRLRLVGDGGQDVAAGEPGELLISGPSSATMYWNNRA-KTRDTFQGEWTRS 388
Query: 158 GD-YGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
GD Y R D Y GRTD +K+ G V E+E+AL
Sbjct: 389 GDKYVRNDDGSYTYAGRTDDMLKVSGIYVSPFEIESAL 426
Score = 30.6 bits (69), Expect = 1.1
Identities = 12/19 (63%), Positives = 16/19 (84%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
Q++D A LY+SGSTG+PK
Sbjct: 159 QADDPAFWLYSSGSTGMPK 177
Score = 28.3 bits (63), Expect = 5.8
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 224 IRGVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 257
+R D Y GRTD +K+ G V E+E+AL
Sbjct: 393 VRNDDGSYTYAGRTDDMLKVSGIYVSPFEIESAL 426
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation
factor 3 (eIF3). In yeast it is called Hcr1. The
Saccharomyces cerevisiae protein eIF3j (HCR1) has been
shown to be required for processing of 20S pre-rRNA and
binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 34.6 bits (80), Expect = 0.040
Identities = 14/35 (40%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 22 EREKAKK--KKNKKKKKKKKKKKKKKKKKKKKKRR 54
E EKAK K KK K K ++K+K K++K+++
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80
Score = 33.5 bits (77), Expect = 0.084
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E E+ +++K K K K KK K K ++K+K + +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78
Score = 33.5 bits (77), Expect = 0.11
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ AK K K K K ++K+K K++K++K R
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR 82
Score = 32.7 bits (75), Expect = 0.16
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
KAK KK K K ++K+K K++K++K +
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 32.3 bits (74), Expect = 0.25
Identities = 11/43 (25%), Positives = 22/43 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
+ K K K K ++K+K K++K++K R + + E+
Sbjct: 51 KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDEL 93
Score = 31.6 bits (72), Expect = 0.45
Identities = 12/32 (37%), Positives = 13/32 (40%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
EK K +K K KKKK K K K
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 31.2 bits (71), Expect = 0.47
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E E +K++ K K K K KK K K ++K +
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEK 71
Score = 30.8 bits (70), Expect = 0.70
Identities = 11/31 (35%), Positives = 13/31 (41%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E+ KA+K KKKK K K K
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 29.2 bits (66), Expect = 2.1
Identities = 10/37 (27%), Positives = 20/37 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+ E+ ++K+ +K K K K KK K K +++ +
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 29.2 bits (66), Expect = 2.5
Identities = 11/23 (47%), Positives = 12/23 (52%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKKK 52
K K +K K KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214
Score = 28.5 bits (64), Expect = 3.6
Identities = 8/33 (24%), Positives = 18/33 (54%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
A + K K ++K+K K++K++K ++
Sbjct: 52 VAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 28.5 bits (64), Expect = 3.8
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 31 NKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
N+K K +K K KKKK K K + + +
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGA 220
Score = 28.1 bits (63), Expect = 6.0
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKK 50
+K K +K K KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214
Score = 27.3 bits (61), Expect = 9.8
Identities = 7/27 (25%), Positives = 8/27 (29%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKK 48
E+ KK K K K K
Sbjct: 198 EKAAKGGKKKKGKAKAKLNVGGANDDD 224
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 34.4 bits (79), Expect = 0.042
Identities = 16/31 (51%), Positives = 19/31 (61%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
PA EK KK K+KK K K KK + KKK +
Sbjct: 84 PATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 31.0 bits (70), Expect = 0.52
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKKKR 53
K KK K KK K K KK + KKK R
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 31.0 bits (70), Expect = 0.60
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKK 52
KK K K KK K K KK + KKK +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 29.0 bits (65), Expect = 2.4
Identities = 14/33 (42%), Positives = 18/33 (54%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
R K +KKKK K KK K K KK + K++
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKA 113
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 34.7 bits (80), Expect = 0.047
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ + K K + KK++ + ++KKK+KK K R
Sbjct: 353 LKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSER 390
Score = 32.4 bits (74), Expect = 0.27
Identities = 10/37 (27%), Positives = 23/37 (62%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P P + K ++ K ++ + ++KKK+KK K +++ +
Sbjct: 358 PKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Score = 32.4 bits (74), Expect = 0.30
Identities = 9/36 (25%), Positives = 21/36 (58%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P + + + KK + + ++KKK+KK K +++ +
Sbjct: 360 PPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 30.4 bits (69), Expect = 1.1
Identities = 10/37 (27%), Positives = 23/37 (62%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
+EK K K K ++ KK++ + ++KKK+++ ++
Sbjct: 354 KEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSER 390
Score = 29.3 bits (66), Expect = 2.9
Identities = 9/37 (24%), Positives = 20/37 (54%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P + + KK+ + ++KKK+KK K +++ +
Sbjct: 359 KPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 27.7 bits (62), Expect = 7.8
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E+ K K K K K ++ KK++ + ++KK+ AKS
Sbjct: 350 IEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKS 388
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 34.7 bits (79), Expect = 0.054
Identities = 17/45 (37%), Positives = 30/45 (66%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
P +K+KK K K+KK+K+K++ K KKK+ + + +L ++PA
Sbjct: 193 PAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPA 237
Score = 34.3 bits (78), Expect = 0.083
Identities = 18/37 (48%), Positives = 24/37 (64%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P E E+ KK KKKK++K+K++KKKKKK R
Sbjct: 266 PKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 33.5 bits (76), Expect = 0.15
Identities = 15/43 (34%), Positives = 26/43 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
E++ K KK +KK+K+K++ K KKK+ + K +LA +
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPAS 238
Score = 31.6 bits (71), Expect = 0.50
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
+K +PA ++ K KKK KK+K+K++ K KKK+ + K
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 31.6 bits (71), Expect = 0.53
Identities = 9/32 (28%), Positives = 23/32 (71%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E E+ +++ +K K++KKK++K+K+ +++
Sbjct: 84 LEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 31.6 bits (71), Expect = 0.59
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P + +KK K KKK+KK+K+K++ K KKK+
Sbjct: 187 PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222
Score = 31.2 bits (70), Expect = 0.82
Identities = 9/33 (27%), Positives = 24/33 (72%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
E+ ++ + + +K K++KKK++K+K+ +++ L
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117
Score = 30.8 bits (69), Expect = 0.86
Identities = 11/41 (26%), Positives = 28/41 (68%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
E+ + ++ +K K++KKK++K+K+ +++ + +S E+ A
Sbjct: 86 EERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIA 126
Score = 30.4 bits (68), Expect = 1.3
Identities = 15/40 (37%), Positives = 22/40 (55%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
K KKKK +K+K++KKKKKK + + +N A
Sbjct: 279 KHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGA 318
Score = 29.6 bits (66), Expect = 2.5
Identities = 10/30 (33%), Positives = 24/30 (80%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K ++++ +++ +K K++KKK++K+K+ RR
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRR 112
Score = 29.6 bits (66), Expect = 2.6
Identities = 17/41 (41%), Positives = 28/41 (68%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
P E + A+ ++ KK K KKKK++K+K++KKKK++ C
Sbjct: 263 PDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRC 303
Score = 29.6 bits (66), Expect = 2.6
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+P + + +K KK KKK+KK+K+K++ K KK+ +
Sbjct: 186 SPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222
Score = 29.6 bits (66), Expect = 2.6
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
EK +KK KK KKK+KK+K+K++ K +
Sbjct: 183 TSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDK 218
Score = 29.3 bits (65), Expect = 2.9
Identities = 16/46 (34%), Positives = 26/46 (56%)
Query: 6 TISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
T + D+ E +K+ K K KK++K+K++KKKKKK +
Sbjct: 257 TAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 29.3 bits (65), Expect = 3.2
Identities = 10/34 (29%), Positives = 24/34 (70%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P + K ++ ++ +++ +K K++KKK++K+KR
Sbjct: 76 PMSDQYVKLEEERRHRQRLEKDKREKKKREKEKR 109
Score = 28.9 bits (64), Expect = 3.6
Identities = 13/36 (36%), Positives = 24/36 (66%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ + K + + KK K KKKK++K+K++KKK++
Sbjct: 261 SEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKK 296
Score = 28.9 bits (64), Expect = 3.7
Identities = 18/45 (40%), Positives = 26/45 (57%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
E K+ +K + +KK KK KKK+KK+K+K R K E +K
Sbjct: 182 ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 28.5 bits (63), Expect = 6.0
Identities = 13/78 (16%), Positives = 39/78 (50%), Gaps = 6/78 (7%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSHNN 87
K + +++ +++ +K K++KKK++K++R + + + +++ + N
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENA 142
Query: 88 IPGESMLHADNNNEQSND 105
+P +D +++ ND
Sbjct: 143 LP------SDEDDKDPND 154
>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
Length = 102
Score = 32.7 bits (74), Expect = 0.058
Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 3/58 (5%)
Query: 10 HHFDKVIIPAPFEREKAKKKKNKKKKKKK---KKKKKKKKKKKKKKRRILAKSCENPA 64
HH K AP ++ +A KK +K KK++K +K + KK KK + AK PA
Sbjct: 44 HHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAKPAAQPA 101
Score = 26.5 bits (58), Expect = 7.7
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 9 EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKK 41
+HH +K AP ++ +A KK KK K K
Sbjct: 63 KHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAK 95
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 32.3 bits (74), Expect = 0.062
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K + ++ ++KKKKKKK KKK K IL K
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILRK 60
Score = 30.0 bits (68), Expect = 0.39
Identities = 14/32 (43%), Positives = 21/32 (65%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
R + ++ KKKKKK KKK K+ +KK++ R
Sbjct: 34 RISQRNRRKKKKKKKLKKKSKEWILRKKEQMR 65
Score = 29.6 bits (67), Expect = 0.49
Identities = 17/42 (40%), Positives = 25/42 (59%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
D I A R +++ + KKKKKKK KKK K+ +KK++
Sbjct: 23 DDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQM 64
Score = 29.2 bits (66), Expect = 0.57
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
++ ++KK KKKK KKK K+ +KK++ +RR
Sbjct: 34 RISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 34.2 bits (79), Expect = 0.064
Identities = 12/37 (32%), Positives = 27/37 (72%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E E+ K K +++++++ ++KK++KKK+++ LAK
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
Score = 33.8 bits (78), Expect = 0.082
Identities = 12/43 (27%), Positives = 28/43 (65%), Gaps = 10/43 (23%)
Query: 22 EREKAKKKKNKKKKKKKKK----------KKKKKKKKKKKKRR 54
+E+A++KK +KKK++++ +K ++K++KK+ R+
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 34.2 bits (79), Expect = 0.065
Identities = 16/40 (40%), Positives = 23/40 (57%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
D V + E + K + ++++ KKKKKKKKKK KK
Sbjct: 158 DVVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNA 197
Score = 31.5 bits (72), Expect = 0.45
Identities = 14/31 (45%), Positives = 16/31 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E E+AKKKK KKKKK KK +
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVS 207
Score = 30.8 bits (70), Expect = 0.69
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ + ++++ KKKKKKKKKK KK
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAE 203
Score = 30.8 bits (70), Expect = 0.80
Identities = 13/38 (34%), Positives = 14/38 (36%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
ER K KKKK KKK KK + S
Sbjct: 179 ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 33.6 bits (77), Expect = 0.086
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P P + ++ K + K KK K K K K K K K K +
Sbjct: 79 PTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQP 115
Score = 33.2 bits (76), Expect = 0.13
Identities = 15/36 (41%), Positives = 17/36 (47%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
P +K KK K K K K K K K K + K KK
Sbjct: 85 KEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120
Score = 32.5 bits (74), Expect = 0.20
Identities = 14/46 (30%), Positives = 18/46 (39%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
K + +K K K K K K K K + K KK K A +
Sbjct: 85 KEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPA 130
Score = 32.1 bits (73), Expect = 0.26
Identities = 15/35 (42%), Positives = 19/35 (54%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P +EK K +K KK K K K K K K K K + +
Sbjct: 82 PKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPK 116
Score = 32.1 bits (73), Expect = 0.31
Identities = 15/36 (41%), Positives = 17/36 (47%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P K K+K +KK KK K K K K K K K
Sbjct: 76 PPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111
Score = 31.3 bits (71), Expect = 0.44
Identities = 14/41 (34%), Positives = 17/41 (41%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P K+K K +KK KK K K K K K K +
Sbjct: 71 TEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111
Score = 30.9 bits (70), Expect = 0.75
Identities = 16/40 (40%), Positives = 20/40 (50%)
Query: 16 IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
P P E+ K +KK K K K K K K K K K + K +
Sbjct: 80 TPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKK 119
Score = 29.8 bits (67), Expect = 1.7
Identities = 14/39 (35%), Positives = 18/39 (46%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ P E E + K K+K K +KK KK K K K
Sbjct: 64 EEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPK 102
Score = 29.0 bits (65), Expect = 2.6
Identities = 15/38 (39%), Positives = 19/38 (50%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
P E K K+ K +KK KK K K K K K K ++
Sbjct: 74 ETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111
>gnl|CDD|235531 PRK05605, PRK05605, long-chain-fatty-acid--CoA ligase; Validated.
Length = 573
Score = 34.2 bits (79), Expect = 0.087
Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 12/62 (19%)
Query: 71 LSLPQLRLLSASLS---HNNIPGESML---------HADNNNEQSNDIAIVLYTSGSTGI 118
L +P LR A+L+ +P E+++ + +D+A++LYTSG+TG
Sbjct: 174 LPIPALRKARAALTGPAPGTVPWETLVDAAIGGDGSDVSHPRPTPDDVALILYTSGTTGK 233
Query: 119 PK 120
PK
Sbjct: 234 PK 235
>gnl|CDD|213284 cd05917, FACL_like_2, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 347
Score = 34.0 bits (79), Expect = 0.088
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A++ YTSG+TG PK
Sbjct: 3 DPALIQYTSGTTGRPK 18
>gnl|CDD|213306 cd05940, FATP_FACS, Fatty acid transport proteins (FATP) play dual
roles as fatty acid transporters and its activation
enzymes. Fatty acid transport protein (FATP) transports
long-chain or very-long-chain fatty acids across the
plasma membrane. FATPs also have fatty acid CoA
synthetase activity, thus playing dual roles as fatty
acid transporters and its activation enzymes. At least
five copies of FATPs are identified in mammalian cells.
This family also includes prokaryotic FATPs. FATPs are
the key players in the trafficking of exogenous fatty
acids into the cell and in intracellular fatty acid
homeostasis.
Length = 444
Score = 34.1 bits (79), Expect = 0.088
Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 9/46 (19%)
Query: 105 DIAIVLYTSGSTGIPK----EYRRMYKTGD-----YGRIVDDIHRC 141
D A +YTSG+TG+PK +RR + G DD+
Sbjct: 82 DPAFYIYTSGTTGLPKAAIMSHRRWLRAGAVFGGLGLLKPDDVLYL 127
>gnl|CDD|213318 cd05971, MACS_like_3, Uncharacterized subfamily of medium-chain
acyl-CoA synthetase (MACS). MACS catalyzes the two-step
activation of medium chain fatty acids (containing 4-12
carbons). The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. MACS
enzymes are localized to mitochondria.
Length = 439
Score = 33.9 bits (78), Expect = 0.089
Identities = 12/20 (60%), Positives = 17/20 (85%)
Query: 101 EQSNDIAIVLYTSGSTGIPK 120
+ S+D AI++YTSG+TG PK
Sbjct: 85 DGSDDPAILIYTSGTTGPPK 104
Score = 30.8 bits (70), Expect = 0.83
Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 157 TGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
TGD GR D L ++GR D +K G R+ +E+E L K
Sbjct: 320 TGDLGRRDADGYLWFKGRADDVIKSSGYRIGPAEIEECLLK 360
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 33.6 bits (77), Expect = 0.090
Identities = 13/54 (24%), Positives = 19/54 (35%)
Query: 1 MNINLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
N + + H P+ ++ AK +K K K KK KK R
Sbjct: 160 PNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMR 213
Score = 28.6 bits (64), Expect = 3.7
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
+ + +K K K KK KK +K
Sbjct: 184 AAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
>gnl|CDD|236443 PRK09274, PRK09274, peptide synthase; Provisional.
Length = 552
Score = 34.1 bits (79), Expect = 0.091
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+D+A +L+TSGSTG PK
Sbjct: 172 APDDMAAILFTSGSTGTPK 190
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 33.8 bits (78), Expect = 0.093
Identities = 17/100 (17%), Positives = 33/100 (33%)
Query: 6 TISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAY 65
+ V KKK K K K KKK K + +N
Sbjct: 54 QVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALD 113
Query: 66 KGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSND 105
K + + + +L+ + ++ + L D+ ++ +D
Sbjct: 114 KDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDD 153
Score = 29.6 bits (67), Expect = 2.1
Identities = 11/51 (21%), Positives = 16/51 (31%)
Query: 2 NINLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ + +S D +K K K K KKK K + K
Sbjct: 54 QVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104
Score = 28.4 bits (64), Expect = 4.7
Identities = 11/51 (21%), Positives = 16/51 (31%)
Query: 3 INLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
I L+ D +++ K K KKK K + KK
Sbjct: 57 IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE 107
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 34.1 bits (78), Expect = 0.094
Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 16/92 (17%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKK----------------KKKKKKKRRILA 57
K++ ER+K K++K K+K+ KK K +K+ KK +KK R
Sbjct: 11 KILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRD 70
Query: 58 KSCENPAYKGRTVLSLPQLRLLSASLSHNNIP 89
ENP Q + LS+ ++ P
Sbjct: 71 VEDENPEDFIDPDTPFGQKKRLSSQMAKQYSP 102
Score = 31.8 bits (72), Expect = 0.47
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
++ +++ +KKKK++K K+K+ KK K ++ AK
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
Score = 28.7 bits (64), Expect = 4.2
Identities = 14/40 (35%), Positives = 25/40 (62%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
E E KK +++ ++KKKK++K K+K+ KK + K +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 31.8 bits (73), Expect = 0.10
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
E+ ++K+ K+K + KKKKK+K KK+R A A
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAA 70
Score = 31.5 bits (72), Expect = 0.11
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E+ +K+ +K + KKKKK+K KKKR
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKR 60
Score = 29.9 bits (68), Expect = 0.42
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
K K + KKKKK+K KKK+ +
Sbjct: 36 ELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 29.5 bits (67), Expect = 0.68
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E ++ K+ K K + KKKKK+K KKK+
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKR 60
Score = 29.1 bits (66), Expect = 0.80
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
+ EK K+K + KKKKK+K KKK+
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRD 61
Score = 28.8 bits (65), Expect = 1.2
Identities = 12/31 (38%), Positives = 17/31 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E+E +K KKKKK+K KKK+ +
Sbjct: 34 EKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 28.0 bits (63), Expect = 2.0
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E ++ K+K ++ KKKKK+K KKK+ +
Sbjct: 33 EEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63
Score = 26.1 bits (58), Expect = 8.7
Identities = 9/28 (32%), Positives = 17/28 (60%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKK 48
+++ + + KKKKK+K KKK+ +
Sbjct: 37 LKQKADEGNNSGKKKKKRKAKKKRDEAG 64
>gnl|CDD|213320 cd05973, MACS_like_2, Uncharacterized subfamily of medium-chain
acyl-CoA synthetase (MACS). MACS catalyzes the two-step
activation of medium chain fatty acids (containing 4-12
carbons). The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. MACS
enzymes are localized to mitochondria.
Length = 440
Score = 34.0 bits (78), Expect = 0.10
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 95 HADNNNEQSNDIAIVLYTSGSTGIPK 120
+A N + +D+ +YTSG+TG+PK
Sbjct: 79 NAANRGKLDDDLFAQMYTSGTTGLPK 104
>gnl|CDD|213280 cd05912, OSB_CoA_lg, O-succinylbenzoate-CoA ligase (also known as
O-succinylbenzoate-CoA synthase, OSB-CoA synthetase, or
MenE). O-succinylbenzoic acid-CoA synthase catalyzes
the coenzyme A (CoA)- and ATP-dependent conversion of
o-succinylbenzoic acid to o-succinylbenzoyl-CoA. The
reaction is the fourth step of the biosynthesis pathway
of menaquinone (vitamin K2). In certain bacteria,
menaquinone is used during fumarate reduction in
anaerobic respiration. In cyanobacteria, the product of
the menaquinone pathway is phylloquinone
(2-methyl-3-phytyl-1,4-naphthoquinone), a molecule used
exclusively as an electron transfer cofactor in
Photosystem 1. In green sulfur bacteria and
heliobacteria, menaquinones are used as loosely bound
secondary electron acceptors in the photosynthetic
reaction center.
Length = 407
Score = 33.7 bits (78), Expect = 0.10
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 105 DIAIVLYTSGSTGIPK 120
A +++TSGSTG PK
Sbjct: 78 RPATIIFTSGSTGKPK 93
Score = 28.3 bits (64), Expect = 5.8
Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
+ TGD G + + LY GR D + G + E+EA L + V++ +V+
Sbjct: 290 FHTGDLGYLDAEGYLYVLGRRDDLIISGGENIYPEEIEAVLLQHPAVEEAAVV 342
>gnl|CDD|213315 cd05968, AACS_like, Uncharacterized acyl-CoA synthetase subfamily
similar to Acetoacetyl-CoA synthetase. This
uncharacterized acyl-CoA synthetase family is highly
homologous to acetoacetyl-CoA synthetase. However, the
proteins in this family exist in only bacteria and
archaea. AACS is a cytosolic ligase that specifically
activates acetoacetate to its coenzyme A ester by a
two-step reaction. Acetoacetate first reacts with ATP to
form an acyl-adenylate intermediate, which then reacts
with CoA to produce an acyl-CoA ester. This is the first
step of the mevalonate pathway of isoprenoid
biosynthesis via isopentenyl diphosphate. Isoprenoids
are a large class of compounds found in all living
organisms.
Length = 474
Score = 33.8 bits (78), Expect = 0.10
Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 158 GDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
GD+ + +D Y GR+D +K+ G RV +E+E+ L+ V
Sbjct: 340 GDWALVDEDGYWYILGRSDDTIKVAGKRVGPAEIESVLNSHPAV 383
Score = 31.5 bits (72), Expect = 0.57
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 235 GRTDSQVKIRGNRVDLSEVEAALSKISEV 263
GR+D +K+ G RV +E+E+ L+ V
Sbjct: 355 GRSDDTIKVAGKRVGPAEIESVLNSHPAV 383
Score = 30.3 bits (69), Expect = 1.2
Identities = 10/17 (58%), Positives = 15/17 (88%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D A+++YTSG+TG PK
Sbjct: 100 DDPAMIIYTSGTTGKPK 116
>gnl|CDD|182517 PRK10524, prpE, propionyl-CoA synthetase; Provisional.
Length = 629
Score = 33.8 bits (78), Expect = 0.11
Identities = 15/29 (51%), Positives = 20/29 (68%), Gaps = 2/29 (6%)
Query: 103 SNDIAIVLYTSGSTGIPKEYRRMYKTGDY 131
SN+ + +LYTSG+TG PK +R TG Y
Sbjct: 232 SNEPSYILYTSGTTGKPKGVQR--DTGGY 258
Score = 33.0 bits (76), Expect = 0.20
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 14/69 (20%)
Query: 142 WSSGDRAIKYRRMYKTGDYGRIVDD-----ILLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
WS R + Y T D+G I D IL GRTD + + G+R+ E+E ++S
Sbjct: 467 WSLFGRQV-----YSTFDWG-IRDADGYYFIL---GRTDDVINVAGHRLGTREIEESISS 517
Query: 197 LSEVDKLSV 205
V +++V
Sbjct: 518 HPAVAEVAV 526
>gnl|CDD|213282 cd05914, FACL_like_3, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 448
Score = 33.8 bits (78), Expect = 0.11
Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 4/48 (8%)
Query: 77 RLLSASLSHNNIPGE----SMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
SA + GE ++ + N E A + YTSGSTG PK
Sbjct: 81 DAASALHTPFATLGEDLYIALRPSANPVELPAGTAKITYTSGSTGQPK 128
>gnl|CDD|213292 cd05926, FACL_fum10p_like, Subfamily of fatty acid CoA ligase
(FACL) similar to Fum10p of Gibberella moniliformis.
FACL catalyzes the formation of fatty acyl-CoA in a
two-step reaction: the formation of a fatty acyl-AMP
molecule as an intermediate, followed by the formation
of a fatty acyl-CoA. This is a required step before free
fatty acids can participate in most catabolic and
anabolic reactions. Fum10p is a fatty acid CoA ligase
involved in the synthesis of fumonisin, a polyketide
mycotoxin, in Gibberella moniliformis.
Length = 345
Score = 33.4 bits (77), Expect = 0.12
Identities = 10/18 (55%), Positives = 15/18 (83%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+D A++L+TSG+TG PK
Sbjct: 1 PDDPALILHTSGTTGRPK 18
>gnl|CDD|236315 PRK08633, PRK08633, 2-acyl-glycerophospho-ethanolamine
acyltransferase; Validated.
Length = 1146
Score = 33.7 bits (78), Expect = 0.12
Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 1/60 (1%)
Query: 147 RAIKYRRMYKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
+ I Y TGD G + +D L R KI G V L VE L+K +++
Sbjct: 1013 KDIDGIGWYVTGDKGHLDEDGFLTITDRYSRFAKIGGEMVPLGAVEEELAKALGGEEVVF 1072
Score = 31.4 bits (72), Expect = 0.74
Identities = 8/16 (50%), Positives = 12/16 (75%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A ++++SGS G PK
Sbjct: 783 DTATIIFSSGSEGEPK 798
>gnl|CDD|213313 cd05966, ACS, Acetyl-CoA synthetase (also known as acetate-CoA
ligase and acetyl-activating enzyme). Acetyl-CoA
synthetase (ACS) catalyzes the formation of acetyl-CoA
from acetate, CoA, and ATP. Synthesis of acetyl-CoA is
carried out in a two-step reaction. In the first step,
the enzyme catalyzes the synthesis of acetyl-AMP
intermediate from acetate and ATP. In the second step,
acetyl-AMP reacts with CoA to produce acetyl-CoA. This
enzyme is widely present in all living organisms. The
activity of this enzyme is crucial for maintaining the
required levels of acetyl-CoA, a key intermediate in
many important biosynthetic and catabolic processes.
Acetyl-CoA is used in the biosynthesis of glucose, fatty
acids, and cholesterol. It can also be used in the
production of energy in the citric acid cycle.
Eukaryotes typically have two isoforms of acetyl-CoA
synthetase, a cytosolic form involved in biosynthetic
processes and a mitochondrial form primarily involved in
energy generation.
Length = 602
Score = 33.7 bits (78), Expect = 0.12
Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 7/49 (14%)
Query: 150 KYRRMYKTGDYGRIVDD----ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
K+ Y TGD R +D IL GR D + + G+R+ +EVE+AL
Sbjct: 460 KFPGYYFTGDGARRDEDGYYWIL---GRVDDVINVSGHRLGTAEVESAL 505
Score = 32.5 bits (75), Expect = 0.25
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+ D +LYTSGSTG PK
Sbjct: 224 AEDPLFILYTSGSTGKPK 241
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 33.9 bits (78), Expect = 0.12
Identities = 14/24 (58%), Positives = 21/24 (87%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRI 55
K KKKKKK+KKK+++ K+++K RI
Sbjct: 744 KTKKKKKKEKKKEEEYKREEKARI 767
Score = 29.3 bits (66), Expect = 3.1
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKK 45
I + E+ K KKKK KKK+++ K+++K +
Sbjct: 735 FKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 27.8 bits (62), Expect = 8.7
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKK 47
EK KKKK K+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766
>gnl|CDD|213307 cd05941, MCS, Malonyl-CoA synthetase (MCS). MCS catalyzes the
formation of malonyl-CoA in a two-step reaction
consisting of the adenylation of malonate with ATP,
followed by malonyl transfer from malonyl-AMP to CoA.
Malonic acid and its derivatives are the building blocks
of polyketides and malonyl-CoA serves as the substrate
of polyketide synthases. Malonyl-CoA synthetase has
broad substrate tolerance and can activate a variety of
malonyl acid derivatives. MCS may play an important role
in biosynthesis of polyketides, the important secondary
metabolites with therapeutic and agrochemical utility.
Length = 430
Score = 33.4 bits (77), Expect = 0.13
Identities = 10/16 (62%), Positives = 14/16 (87%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A+++YTSG+TG PK
Sbjct: 89 DPALIIYTSGTTGRPK 104
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 33.4 bits (77), Expect = 0.13
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+KA KK K +K +KK++++KK+KKKK
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAF 435
Score = 33.4 bits (77), Expect = 0.15
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
EKA+KK+ ++KK+KKKK KKK+++++
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445
Score = 33.0 bits (76), Expect = 0.18
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
++ KK K +K +KK++++KK+KKKK + E
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447
Score = 32.6 bits (75), Expect = 0.29
Identities = 13/34 (38%), Positives = 25/34 (73%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+RE+ KK+K KK KKK+++++++K+KK+
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 32.2 bits (74), Expect = 0.36
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
+ K KK +K +KK++++KK+KKKK ++ +
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444
Score = 31.8 bits (73), Expect = 0.46
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ KK++ +KK+KKKK KKK++++++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446
Score = 31.8 bits (73), Expect = 0.48
Identities = 11/38 (28%), Positives = 25/38 (65%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E+ K+KK K KKK+++++++K+KK++ + +
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Score = 31.4 bits (72), Expect = 0.55
Identities = 11/33 (33%), Positives = 24/33 (72%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
++ + +KK+ KKK KKK+++++++K+KK
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Score = 31.4 bits (72), Expect = 0.57
Identities = 11/32 (34%), Positives = 24/32 (75%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ EK ++++ K+KKKK KKK+++++++K
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 30.7 bits (70), Expect = 0.91
Identities = 11/32 (34%), Positives = 22/32 (68%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++ K +K ++++KK+KKKK KKK+++
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443
Score = 30.3 bits (69), Expect = 1.3
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
KK KK KK +K +KK++++KK+++ A +
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436
Score = 30.3 bits (69), Expect = 1.3
Identities = 12/31 (38%), Positives = 25/31 (80%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E+++ ++KK KKKK KKK+++++++K+K
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 30.3 bits (69), Expect = 1.4
Identities = 12/40 (30%), Positives = 25/40 (62%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
I E+ + K+++ KK+KKKK KKK+++++++
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447
Score = 30.3 bits (69), Expect = 1.6
Identities = 12/38 (31%), Positives = 28/38 (73%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
++EK KK KKK+++++++K+KK+++K++ A+
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Score = 29.5 bits (67), Expect = 2.2
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K K KK +K +KK++++KK+KKK+
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKK 433
>gnl|CDD|213314 cd05967, PrpE, Propionyl-CoA synthetase (PrpE). PrpE catalyzes the
first step of the 2-methylcitric acid cycle for
propionate catabolism. It activates propionate to
propionyl-CoA in a two-step reaction, which proceeds
through a propionyl-AMP intermediate and requires ATP
and Mg2+. In Salmonella enterica, the PrpE protein is
required for growth of S. enterica on propionate and can
substitute for the acetyl-CoA synthetase (Acs) enzyme
during growth on acetate. PrpE can also activate
acetate, 3HP, and butyrate to their corresponding
CoA-thioesters, although with less efficiency.
Length = 607
Score = 33.4 bits (77), Expect = 0.15
Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 150 KYRRMYKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSK 196
K+ Y TGD G +D L+ GRTD + + G+R+ E+E ++ K
Sbjct: 458 KFPGYYDTGDSGYKDEDGYLFVMGRTDDVINVAGHRLSTGEMEESVLK 505
Score = 30.0 bits (68), Expect = 1.6
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
S D +LYTSG+TG PK
Sbjct: 219 STDPLYILYTSGTTGKPK 236
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 32.0 bits (73), Expect = 0.15
Identities = 14/31 (45%), Positives = 16/31 (51%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ KK KK KK K KK K KK+K R
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31
Score = 31.2 bits (71), Expect = 0.30
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
AKK K K KK K KK K KK+K + +
Sbjct: 4 AKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33
Score = 29.7 bits (67), Expect = 1.1
Identities = 12/29 (41%), Positives = 14/29 (48%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K KK KK K KK K KK+K +
Sbjct: 4 AKKAKKKAKKAAKAAKKGVKVKKRKVRTS 32
Score = 29.7 bits (67), Expect = 1.2
Identities = 14/29 (48%), Positives = 16/29 (55%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
K K KKK KK K KK K KKR++
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKV 29
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 32.7 bits (75), Expect = 0.15
Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
V+IP P E K K K K K KK +++ K++ K A EN A
Sbjct: 89 PVVIPKP---EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTA 136
Score = 32.0 bits (73), Expect = 0.30
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
AP K + K K K K K KK +++ K++ +
Sbjct: 87 EAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123
Score = 31.6 bits (72), Expect = 0.41
Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 1/49 (2%)
Query: 17 IPAPFEREK-AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
IP P + K K K K K K K KK +++ +R + PA
Sbjct: 81 IPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPA 129
>gnl|CDD|236217 PRK08279, PRK08279, long-chain-acyl-CoA synthetase; Validated.
Length = 600
Score = 33.3 bits (77), Expect = 0.15
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+ D A +YTSG+TG+PK
Sbjct: 197 TAKDTAFYIYTSGTTGLPK 215
>gnl|CDD|213301 cd05935, LC_FACS_like, Putative long-chain fatty acid CoA ligase.
The members of this family are putative long-chain fatty
acyl-CoA synthetases, which catalyze the ATP-dependent
activation of fatty acids in a two-step reaction. The
carboxylate substrate first reacts with ATP to form an
acyl-adenylate intermediate, which then reacts with CoA
to produce an acyl-CoA ester. Fatty acyl-CoA synthetases
are responsible for fatty acid degradation as well as
physiological regulation of cellular functions via the
production of fatty acyl-CoA esters.
Length = 430
Score = 33.1 bits (76), Expect = 0.16
Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 15/94 (15%)
Query: 147 RAIKYRRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSV 205
+ +R ++TGD G I ++ + R + + G +V +EVEA L + V ++ V
Sbjct: 304 IELDGKRFFRTGDLGYIDEEGYFFFLDRVKRMINVSGYKVWPAEVEALLYQHPAVLEVCV 363
Query: 206 LCYKP----GEV---------DQVSEVTSEEIRG 226
+ +P GEV D +VT E+I
Sbjct: 364 IG-RPDPRRGEVVKAFVVLKPDYRGKVTEEDIIE 396
Score = 32.7 bits (75), Expect = 0.22
Identities = 10/20 (50%), Positives = 17/20 (85%)
Query: 101 EQSNDIAIVLYTSGSTGIPK 120
+ +D+A++ YTSG+TG+PK
Sbjct: 81 SELDDVAVIPYTSGTTGLPK 100
>gnl|CDD|224458 COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism].
Length = 438
Score = 33.1 bits (76), Expect = 0.16
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 10/57 (17%)
Query: 155 YKTGDYGRIVDD----------ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVD 201
Y+TGD I+ D I EGR+D + +RG V S++E L ++ EV
Sbjct: 304 YRTGDITVILSDPCGCGRTHRRIERIEGRSDDMLIVRGVNVFPSQIERVLLQIPEVT 360
>gnl|CDD|223953 COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid
metabolism].
Length = 613
Score = 33.4 bits (77), Expect = 0.17
Identities = 10/19 (52%), Positives = 16/19 (84%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+ +D+A ++YTSG+TG PK
Sbjct: 189 KPDDLATIIYTSGTTGTPK 207
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 32.7 bits (75), Expect = 0.17
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
KK N +++KKKKKKK KKKK KK
Sbjct: 167 EAARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198
Score = 32.3 bits (74), Expect = 0.23
Identities = 18/37 (48%), Positives = 22/37 (59%)
Query: 16 IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
I A R+K K +++KKKKKKK KKKK KK
Sbjct: 163 IKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVA 199
Score = 31.5 bits (72), Expect = 0.40
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKK-KKKKKKKKRRI 55
A +++ ++ KKKKKKK KKKK KK + I
Sbjct: 169 ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKAI 206
Score = 27.7 bits (62), Expect = 6.5
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
KKK+ +++KKKKKKK KKKK K+
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197
Score = 27.7 bits (62), Expect = 6.8
Identities = 14/36 (38%), Positives = 20/36 (55%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K + KKK +++KKKKKKK K++ L K
Sbjct: 162 DIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 31.0 bits (71), Expect = 0.17
Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 2/38 (5%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKK--KKKKKRRILAK 58
R +KK +KK+ K + K+ KK +++K R+ K
Sbjct: 56 RADPSRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
Score = 26.7 bits (60), Expect = 6.2
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+KA +KKK+KK+ K + K+ +
Sbjct: 49 KKAHAAIRADPSRKKKEKKEVKAESKRYNAK 79
>gnl|CDD|236175 PRK08180, PRK08180, feruloyl-CoA synthase; Reviewed.
Length = 614
Score = 33.3 bits (77), Expect = 0.17
Identities = 11/17 (64%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+ IA L+TSGSTG+PK
Sbjct: 209 DTIAKFLFTSGSTGLPK 225
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 32.3 bits (74), Expect = 0.18
Identities = 21/37 (56%), Positives = 25/37 (67%), Gaps = 4/37 (10%)
Query: 24 EKAKKKKNKKK----KKKKKKKKKKKKKKKKKKRRIL 56
EK K ++ KK+ KK+K KK KKKKK KKK RIL
Sbjct: 170 EKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 29.6 bits (67), Expect = 1.8
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
ER+K +++ KK+K KK KKKKK KKK +
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGR 204
Score = 28.0 bits (63), Expect = 4.7
Identities = 11/33 (33%), Positives = 24/33 (72%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
RE +++ +K++++K +KK+K+K+ KKK+
Sbjct: 3 SREALLEQRRRKREQRKARKKQKRKEAKKKEDA 35
Score = 28.0 bits (63), Expect = 5.5
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKK 48
E KK+K+ KK KKKKK KKK +
Sbjct: 181 EENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 27.7 bits (62), Expect = 6.6
Identities = 10/37 (27%), Positives = 27/37 (72%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
++++ ++++K +KK+K+K+ KKK+ ++ A+ +N
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKN 45
Score = 27.3 bits (61), Expect = 8.7
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKK 49
RE+ KK+ KK KKKKK KKK +
Sbjct: 180 REENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 27.3 bits (61), Expect = 8.8
Identities = 9/29 (31%), Positives = 23/29 (79%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ +++K +++K +KK+K+K+ KKK+ ++
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQK 37
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 32.8 bits (75), Expect = 0.19
Identities = 12/34 (35%), Positives = 26/34 (76%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
A + EK K+++N+KK+K++ + K+K++K ++K
Sbjct: 400 AKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 29.8 bits (67), Expect = 1.9
Identities = 10/34 (29%), Positives = 24/34 (70%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
KAK++K K+++ +KK+K++ + K+K+++
Sbjct: 396 EEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429
Score = 29.4 bits (66), Expect = 2.6
Identities = 9/30 (30%), Positives = 22/30 (73%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
E+ K ++ + K+K++ + K+K++K ++KK
Sbjct: 405 EKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 29.0 bits (65), Expect = 3.2
Identities = 12/37 (32%), Positives = 24/37 (64%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E +AK K+ K K+++ +KK+K++ + K+KR+
Sbjct: 392 TDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428
>gnl|CDD|213295 cd05929, BACL_like, Bacterial Bile acid CoA ligases and similar
proteins. Bile acid-Coenzyme A ligase catalyzes the
formation of bile acid-CoA conjugates in a two-step
reaction: the formation of a bile acid-AMP molecule as
an intermediate, followed by the formation of a bile
acid-CoA. This ligase requires a bile acid with a free
carboxyl group, ATP, Mg2+, and CoA for synthesis of the
final bile acid-CoA conjugate. The bile acid-CoA
ligation is believed to be the initial step in the bile
acid 7alpha-dehydroxylation pathway in the intestinal
bacterium Eubacterium sp.
Length = 342
Score = 32.8 bits (76), Expect = 0.21
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A++LYTSG+TG PK
Sbjct: 2 DPALILYTSGTTGRPK 17
>gnl|CDD|236072 PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated.
Length = 513
Score = 33.0 bits (76), Expect = 0.22
Identities = 10/16 (62%), Positives = 14/16 (87%)
Query: 105 DIAIVLYTSGSTGIPK 120
D+A +L+TSG+TG PK
Sbjct: 167 DVADILFTSGTTGRPK 182
>gnl|CDD|131369 TIGR02316, propion_prpE, propionate--CoA ligase. This family
contains one of three readily separable clades of
proteins in the group of acetate and propionate--CoA
ligases. Characterized members of this family act on
propionate. From propionyl-CoA, there is a cyclic
degradation pathway: it is ligated by PrpC to the TCA
cycle intermediate oxaloacetate, acted upon further by
PrpD and an aconitase, then cleaved by PrpB to pyruvate
and the TCA cycle intermediate succinate.
Length = 628
Score = 33.0 bits (75), Expect = 0.22
Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 20/91 (21%)
Query: 53 RRILAKSCENPAYKGRTVL----SLPQLRLLS------ASLSHNNIPGESMLHADNNNE- 101
+ +L + + VL L +RL+ A+L + A E
Sbjct: 175 KPLLDAAIAEAQHPPPHVLLVDRGLAPMRLIPGRDVDYAALRTQHE------DAQVPVEW 228
Query: 102 -QSNDIAIVLYTSGSTGIPKEYRRMYKTGDY 131
+SN+ + +LYTSG+TG PK +R G Y
Sbjct: 229 LESNEPSYILYTSGTTGKPKGVQR--DVGGY 257
Score = 29.9 bits (67), Expect = 1.7
Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 152 RRMYKTGDYG-RIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
R +Y + D+G R D GRTD + + G+R+ E+E ++S V +++V+
Sbjct: 471 RPLYSSFDWGIRDEDGYTFILGRTDDVINVAGHRLGTREIEESVSSHPSVAEVAVV 526
>gnl|CDD|213312 cd05959, BCL_4HBCL, Benzoate CoA ligase (BCL) and
4-Hydroxybenzoate-Coenzyme A Ligase (4-HBA-CoA ligase).
Benzoate CoA ligase and 4-hydroxybenzoate-coenzyme A
ligase catalyze the first activating step for benzoate
and 4-hydroxybenzoate catabolic pathways, respectively.
Although these two enzymes share very high sequence
homology, they have their own substrate preference. The
reaction proceeds via a two-step process; the first
ATP-dependent step forms the substrate-AMP intermediate,
while the second step forms the acyl-CoA ester,
releasing the AMP. Aromatic compounds represent the
second most abundant class of organic carbon compounds
after carbohydrates. Some bacteria can use benzoic acid
or benzenoid compounds as the sole source of carbon and
energy through degradation. Benzoate CoA ligase and
4-hydroxybenzoate-Coenzyme A ligase are key enzymes of
this process.
Length = 506
Score = 32.7 bits (75), Expect = 0.22
Identities = 11/17 (64%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D+A LY+SGSTG PK
Sbjct: 161 DDMAFWLYSSGSTGRPK 177
Score = 32.3 bits (74), Expect = 0.30
Identities = 18/41 (43%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 155 YKTGD-YGRIVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
+TGD Y R D Y GR+D +K+ G V EVE AL
Sbjct: 386 TRTGDKYYRDEDGYYWYCGRSDDMLKVSGIWVSPFEVEDAL 426
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 31.8 bits (73), Expect = 0.23
Identities = 11/22 (50%), Positives = 18/22 (81%)
Query: 31 NKKKKKKKKKKKKKKKKKKKKK 52
+KKK KK KK+K+K++K+ +K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKG 34
Score = 31.0 bits (71), Expect = 0.46
Identities = 11/24 (45%), Positives = 17/24 (70%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKK 49
KKK KK KK+K+K++K+ +K
Sbjct: 12 VDKKKAKKAKKEKRKQRKQARKGA 35
Score = 29.5 bits (67), Expect = 1.6
Identities = 11/20 (55%), Positives = 17/20 (85%)
Query: 35 KKKKKKKKKKKKKKKKKKRR 54
KKK KK KK+K+K++K+ R+
Sbjct: 14 KKKAKKAKKEKRKQRKQARK 33
Score = 29.1 bits (66), Expect = 1.9
Identities = 10/20 (50%), Positives = 17/20 (85%)
Query: 34 KKKKKKKKKKKKKKKKKKKR 53
KKK KK KK+K+K++K+ ++
Sbjct: 14 KKKAKKAKKEKRKQRKQARK 33
Score = 28.0 bits (63), Expect = 5.4
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKK 46
K K KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 27.6 bits (62), Expect = 6.5
Identities = 9/21 (42%), Positives = 16/21 (76%)
Query: 24 EKAKKKKNKKKKKKKKKKKKK 44
+KAKK K +K+K++K+ +K
Sbjct: 15 KKAKKAKKEKRKQRKQARKGA 35
Score = 27.6 bits (62), Expect = 6.6
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKK 51
K K KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 27.6 bits (62), Expect = 7.3
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKK 50
KK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
>gnl|CDD|213288 cd05921, FCS, Feruloyl-CoA synthetase (FCS). Feruloyl-CoA
synthetase is an essential enzyme in the feruloyl acid
degradation pathway and enables some proteobacteria to
grow on media containing feruloyl acid as the sole
carbon source. It catalyzes the transfer of CoA to the
carboxyl group of ferulic acid, which then forms
feruloyl-CoA in the presence of ATP and Mg2. The
resulting feruloyl-CoA is further degraded to vanillin
and acetyl-CoA. Feruloyl-CoA synthetase (FCS) is a
subfamily of the adenylate-forming enzymes superfamily.
Length = 559
Score = 32.6 bits (75), Expect = 0.23
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
+A L+TSGSTG+PK
Sbjct: 165 TVAKYLFTSGSTGLPK 180
>gnl|CDD|234677 PRK00174, PRK00174, acetyl-CoA synthetase; Provisional.
Length = 637
Score = 32.8 bits (76), Expect = 0.24
Identities = 17/49 (34%), Positives = 27/49 (55%), Gaps = 7/49 (14%)
Query: 150 KYRRMYKTGDYGRIVDD----ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
++ MY TGD R +D I GR D + + G+R+ +E+E+AL
Sbjct: 480 TFKGMYFTGDGARRDEDGYYWIT---GRVDDVLNVSGHRLGTAEIESAL 525
Score = 31.6 bits (73), Expect = 0.54
Identities = 10/12 (83%), Positives = 11/12 (91%)
Query: 109 VLYTSGSTGIPK 120
+LYTSGSTG PK
Sbjct: 250 ILYTSGSTGKPK 261
>gnl|CDD|236091 PRK07768, PRK07768, long-chain-fatty-acid--CoA ligase; Validated.
Length = 545
Score = 32.7 bits (75), Expect = 0.24
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D+A++ TSGSTG PK
Sbjct: 152 DDLALMQLTSGSTGSPK 168
>gnl|CDD|213311 cd05958, ABCL, 2-aminobenzoate-CoA ligase (ABCL). ABCL catalyzes
the initial step in the 2-aminobenzoate aerobic
degradation pathway by activating 2-aminobenzoate to
2-aminobenzoyl-CoA. The reaction is carried out via a
two-step process; the first step is ATP-dependent and
forms a 2-aminobenzoyl-AMP intermediate, and the second
step forms the 2-aminobenzoyl-CoA ester and releases the
AMP. 2-Aminobenzoyl-CoA is further converted to
2-amino-5-oxo-cyclohex-1-ene-1-carbonyl-CoA catalyzed by
2-aminobenzoyl-CoA monooxygenase/reductase. ABCL has
been purified from cells aerobically grown with
2-aminobenzoate as sole carbon, energy, and nitrogen
source, and has been characterized as a monomer.
Length = 487
Score = 32.8 bits (75), Expect = 0.24
Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 21/102 (20%)
Query: 54 RILAKSCENPAY----------KGRTVLSLPQLRLLSASLSHNNIPGESMLHA--DNNNE 101
R+L +S P V ++P LR + L+ + HA D
Sbjct: 88 RVLLRSANTPMLVACWLAVLKAGAIVVTTMPLLR--AKELTTIVDKAR-ITHALCDKRLT 144
Query: 102 QSNDIAIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWS 143
++D+A++ +TSG+TG+PK T + R + I +
Sbjct: 145 AADDVALIGFTSGTTGLPK------GTMHFHRDILAICDAFP 180
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 32.5 bits (74), Expect = 0.25
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+E+AKK+ ++ K K + KKK + KKK AK
Sbjct: 145 KEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAK 180
Score = 30.2 bits (68), Expect = 1.3
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E K + + +KK K++ KK+ +++ K K AK
Sbjct: 129 AAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAK 165
Score = 29.8 bits (67), Expect = 1.6
Identities = 14/45 (31%), Positives = 19/45 (42%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
+AKKK + KKK + + K K + K K K E K
Sbjct: 160 AAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204
Score = 29.4 bits (66), Expect = 2.2
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E E K + KKK + KKK + + K K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Score = 29.0 bits (65), Expect = 3.1
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E+AK K + KKK + KKK + + K K AK
Sbjct: 154 EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 29.0 bits (65), Expect = 3.1
Identities = 8/37 (21%), Positives = 14/37 (37%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+ E+AK K K K + K + + A+
Sbjct: 190 KAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAE 226
Score = 28.6 bits (64), Expect = 4.3
Identities = 15/43 (34%), Positives = 21/43 (48%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
E E KK K + KK+ +++ K K + KKK K E A
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEA 179
Score = 28.6 bits (64), Expect = 4.6
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
E KAK+ K K + + +KK K++ KK+
Sbjct: 122 EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ 151
Score = 28.3 bits (63), Expect = 4.8
Identities = 13/48 (27%), Positives = 21/48 (43%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
A EK K+ + K K+ + K K + + +KK + K E A
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158
Score = 28.3 bits (63), Expect = 4.9
Identities = 13/40 (32%), Positives = 19/40 (47%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
A + KKK + KKK + + K K + K K + AK
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAK 195
Score = 27.9 bits (62), Expect = 8.1
Identities = 6/33 (18%), Positives = 20/33 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E+ + K+ + + +K K+ ++ K+ ++K++
Sbjct: 87 EQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK 119
Score = 27.5 bits (61), Expect = 8.8
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E+A K+ +K+K+ ++ K K+ + K K
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.4 bits (72), Expect = 0.26
Identities = 13/32 (40%), Positives = 23/32 (71%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
F+R+K KKKK K+ + ++KKKK +++K+
Sbjct: 81 FKRKKKKKKKKKEGEFFEEKKKKYVVSEERKE 112
Score = 27.5 bits (62), Expect = 5.5
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K KKKK KKKK+ + ++KKKK ++R+
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERK 111
>gnl|CDD|235865 PRK06814, PRK06814, acylglycerophosphoethanolamine acyltransferase;
Provisional.
Length = 1140
Score = 32.6 bits (75), Expect = 0.26
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 99 NNEQSNDIAIVLYTSGSTGIPK 120
N +D A++L+TSGS G PK
Sbjct: 788 CNRDPDDPAVILFTSGSEGTPK 809
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 31.9 bits (73), Expect = 0.27
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
FE EK KKK K++KK K++K+K ++
Sbjct: 94 FEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 27.6 bits (62), Expect = 6.6
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
F ++ K+KKK K++KK K++K+K
Sbjct: 87 FTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKL 119
Score = 27.2 bits (61), Expect = 9.5
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAK 58
++K+KKK K++KK K++K ++
Sbjct: 96 EEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|235724 PRK06178, PRK06178, acyl-CoA synthetase; Validated.
Length = 567
Score = 32.7 bits (75), Expect = 0.27
Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 155 YKTGDYGRIVDDILL-YEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
TGD G+I + L Y GR +K+ G V SEVEA L + V +V+
Sbjct: 444 LHTGDIGKIDEQGFLHYLGRRKEMLKVNGMSVFPSEVEALLGQHPAVLGSAVV 496
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 30.5 bits (69), Expect = 0.28
Identities = 17/43 (39%), Positives = 22/43 (51%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
DK I F K +KKK K+ K + K KK +KK KK+
Sbjct: 10 DKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52
Score = 28.6 bits (64), Expect = 1.3
Identities = 15/42 (35%), Positives = 20/42 (47%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
+K KKKK + K + K KK +KK KKK E+
Sbjct: 20 NIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61
Score = 27.0 bits (60), Expect = 4.1
Identities = 16/48 (33%), Positives = 24/48 (50%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
K+KK KK+ K + K KK +KK K++ + + RT LP
Sbjct: 23 KRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGLP 70
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 31.9 bits (73), Expect = 0.30
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
K+K ++ + K+ K KKK KKKR+I K E+
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAES 41
Score = 31.2 bits (71), Expect = 0.55
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
R+ ++ + K+ K KKK KKK+K +R +
Sbjct: 7 RKARQELAVQVAKQAKAKKKANKKKRKIYFKRAES 41
>gnl|CDD|213279 cd05911, Firefly_Luc_like, Firefly luciferase of light emitting
insects and 4-Coumarate-CoA Ligase (4CL). This family
contains two functionally unique groups of proteins; one
group is insect firefly luciferases and the other is
plant 4-coumarate:coenzyme A ligases. However, they
share significant sequence similarity in spite of their
functional diversity. Luciferase catalyzes the
production of light in the presence of MgATP, molecular
oxygen, and luciferin. In the first step, luciferin is
activated by acylation of its carboxylate group with
ATP, resulting in an enzyme-bound luciferyl adenylate.
In the second step, luciferyl adenylate reacts with
molecular oxygen, producing an enzyme-bound excited
state product (Luc=O*) and releasing AMP. This
excited-state product then decays to the ground state
(Luc=O), emitting a quantum of visible light.
4-coumarate:coenzyme A ligase is a key enzyme in the
phenylpropanoid metabolic pathway for monolignol and
flavonoid biosynthesis. It catalyzes the synthesis of
hydroxycinnamate-CoA thioesters in a two-step reaction,
involving the formation of hydroxycinnamate-AMP
anhydride and then the nucleophilic substitution of AMP
by CoA. The phenylpropanoid pathway is one of the most
important secondary metabolism pathways in plants and
hydroxycinnamate-CoA thioesters are the precursors of
lignin and other important phenylpropanoids.
Length = 487
Score = 32.2 bits (74), Expect = 0.30
Identities = 10/17 (58%), Positives = 15/17 (88%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D A +LY+SG+TG+PK
Sbjct: 146 DDTAALLYSSGTTGLPK 162
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 32.4 bits (74), Expect = 0.31
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
++ KKK++KKKKKKK+KK+ K + + K K
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK 98
Score = 30.8 bits (70), Expect = 0.90
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
+K + K K +KKKKKKK+KK+ K + + + ++ K
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103
Score = 30.8 bits (70), Expect = 0.97
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+++K K+KK K + + K K KK KK K
Sbjct: 73 KKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103
Score = 30.5 bits (69), Expect = 1.2
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
+++K KK+ + + K K KK KK KKK
Sbjct: 75 KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKP 109
Score = 30.5 bits (69), Expect = 1.2
Identities = 19/88 (21%), Positives = 31/88 (35%), Gaps = 7/88 (7%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASL 83
E+ K KK KK+ K + KKK +K++ K + P +G T L + +
Sbjct: 44 EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103
Query: 84 SHNNIPGESMLHADNNNEQSNDIAIVLY 111
P N + N +
Sbjct: 104 KKPPKP-------KPNEDVDNAFNKIAE 124
Score = 30.1 bits (68), Expect = 1.4
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+++K +KK+ K + + K K KK KK KK
Sbjct: 74 KKKKKEKKEPKSEGETKLGFKTPKKSKKTKK 104
Score = 29.7 bits (67), Expect = 1.8
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E++K KKK+ K+ K + + K K KK KK
Sbjct: 71 EKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101
Score = 29.3 bits (66), Expect = 2.5
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++E + + K K KK KK KKK K K
Sbjct: 80 KKEPKSEGETKLGFKTPKKSKKTKKKPPKPK 110
Score = 28.1 bits (63), Expect = 6.6
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+++K KK+K + K + + K K KK KK ++ K
Sbjct: 72 KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPK 108
>gnl|CDD|213303 cd05937, FATP_chFAT1_like, Uncharacterized subfamily of
bifunctional fatty acid transporter/very-long-chain
acyl-CoA synthetase in fungi. Fatty acid transport
protein (FATP) transports long-chain or very-long-chain
fatty acids across the plasma membrane. FATPs also have
fatty acid CoA synthetase activity, thus playing dual
roles as fatty acid transporters and its activation
enzymes. FATPs are the key players in the trafficking of
exogenous fatty acids into the cell and in intracellular
fatty acid homeostasis. Members of this family are
fungal FATPs, including FAT1 from Cochliobolus
heterostrophus.
Length = 468
Score = 32.4 bits (74), Expect = 0.32
Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 11/95 (11%)
Query: 45 KKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNE--- 101
+ + +A N A L+L + + A +++N + G+ ++H +
Sbjct: 23 HGDRNVQSGDFVAIDTTNSAEFVFLWLALWSIGAVPAFINYN-LSGDPLIHCLKISGAKF 81
Query: 102 ---QSNDIAIVLYTSGSTGIPK----EYRRMYKTG 129
+D A ++YTSG+TG+PK +RR T
Sbjct: 82 VIVDPDDPAALIYTSGTTGLPKGCAISWRRTLVTS 116
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 32.3 bits (74), Expect = 0.32
Identities = 17/48 (35%), Positives = 27/48 (56%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
E K ++ K+K K K K +K K KKK+KKK++ A + + G +
Sbjct: 1151 EIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNS 1198
Score = 32.3 bits (74), Expect = 0.36
Identities = 13/49 (26%), Positives = 25/49 (51%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
+ E +K+ K+++ K K K K K +K K ++ K ++ A K +
Sbjct: 1143 EQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKK 1191
Score = 31.2 bits (71), Expect = 0.79
Identities = 15/41 (36%), Positives = 19/41 (46%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
+ KA K + K KKK+KKKKK K KK +
Sbjct: 1160 SKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200
Score = 30.0 bits (68), Expect = 1.7
Identities = 15/42 (35%), Positives = 19/42 (45%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAY 65
+ K K K +K K KKK+KKKKK + A N
Sbjct: 1159 KSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200
Score = 29.6 bits (67), Expect = 2.3
Identities = 18/77 (23%), Positives = 31/77 (40%), Gaps = 1/77 (1%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSHNNI 88
KK KK+ + KKKKK +KK KS + S LR S ++
Sbjct: 1307 KKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVK-QASASQSSRLLRRPRKKKSDSSS 1365
Query: 89 PGESMLHADNNNEQSND 105
+ D++ ++ ++
Sbjct: 1366 EDDDDSEVDDSEDEDDE 1382
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 32.3 bits (74), Expect = 0.34
Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 8/58 (13%)
Query: 2 NINLTIS-------EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
I TI E DK PF + +++K K +K KK K+
Sbjct: 435 RIKGTIELPFKEVCEKLLDK-DKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKE 491
Score = 29.3 bits (66), Expect = 3.2
Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 1/47 (2%)
Query: 12 FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
F K+I+ + +EKA K +K KK K+ + K + IL K
Sbjct: 460 FIKLILLSNNSQEKAVLK-LLQKIKKNNDFIKEFETDALKLQEILEK 505
>gnl|CDD|236096 PRK07787, PRK07787, acyl-CoA synthetase; Validated.
Length = 471
Score = 32.3 bits (74), Expect = 0.34
Identities = 9/14 (64%), Positives = 13/14 (92%)
Query: 107 AIVLYTSGSTGIPK 120
A+++YTSG+TG PK
Sbjct: 131 ALIVYTSGTTGPPK 144
>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17.
Length = 93
Score = 30.2 bits (68), Expect = 0.35
Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
PAP + E KK KKKK K KK K KK+ K+ ENPA G T
Sbjct: 29 PAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQE--EAKEENPAENGET 78
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 31.1 bits (71), Expect = 0.36
Identities = 9/37 (24%), Positives = 22/37 (59%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E E+ + K+ K K++K+++ ++K+K+ + K
Sbjct: 116 EEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 29.2 bits (66), Expect = 1.7
Identities = 6/35 (17%), Positives = 21/35 (60%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+ K+ K++K+++ ++K+K+ K++ + +
Sbjct: 120 IDELLEKELAKLKREKRRENERKQKEILKEQMKML 154
Score = 28.4 bits (64), Expect = 3.1
Identities = 8/29 (27%), Positives = 22/29 (75%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
K+ K K++K+++ ++K+K+ K++ ++L
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQMKML 154
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich, stretch long enough
to span the membrane, a highly charged region rich in
Lys, a second putative transmembrane region and a
second highly charged, low complexity sequence region.
Some members have up to 100 residues of additional
C-terminal sequence. These genes have been shown to be
found in the sub-telomeric regions of both Plasmodium
falciparum and P. yoelii chromosomes.
Length = 82
Score = 29.9 bits (68), Expect = 0.37
Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 1/38 (2%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
++ P KKK KK KK +KKKK KKK
Sbjct: 17 NLLAPG-LNNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 28.4 bits (64), Expect = 1.1
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
K KK KK KK +KKKK KK+ I++
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKKIIIS 57
Score = 27.6 bits (62), Expect = 1.9
Identities = 14/25 (56%), Positives = 15/25 (60%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKK 52
KK K KK KK +KKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 32.1 bits (73), Expect = 0.37
Identities = 14/34 (41%), Positives = 16/34 (47%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P +K K K +K KKK K KKKKK
Sbjct: 109 PLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142
Score = 29.8 bits (67), Expect = 1.9
Identities = 13/29 (44%), Positives = 14/29 (48%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
EK KKK K KKKKK K + K
Sbjct: 123 EKLKKKITVNKSTNKKKKKVLSSKDELIK 151
Score = 29.0 bits (65), Expect = 3.7
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ K K +K KKK K KKKKK K
Sbjct: 114 LMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKD 147
Score = 28.6 bits (64), Expect = 5.2
Identities = 12/47 (25%), Positives = 16/47 (34%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
I + K +K K K K KKKKK K + +
Sbjct: 109 PLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNN 155
Score = 28.3 bits (63), Expect = 6.0
Identities = 14/43 (32%), Positives = 17/43 (39%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
R+ K + KKK K KKKKK + L K N
Sbjct: 114 LMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNK 156
>gnl|CDD|180393 PRK06087, PRK06087, short chain acyl-CoA synthetase; Reviewed.
Length = 547
Score = 32.0 bits (73), Expect = 0.38
Identities = 9/18 (50%), Positives = 15/18 (83%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+++A VL+TSG+ G+PK
Sbjct: 186 GDELAAVLFTSGTEGLPK 203
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 32.2 bits (73), Expect = 0.39
Identities = 12/32 (37%), Positives = 24/32 (75%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K K ++ K+++K+K++ K++KKKKK+K +
Sbjct: 94 PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 31.4 bits (71), Expect = 0.61
Identities = 13/37 (35%), Positives = 24/37 (64%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P +++K K++ K++KKKKK+K K++ K R+
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 31.0 bits (70), Expect = 0.87
Identities = 11/36 (30%), Positives = 23/36 (63%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P + K K+ + K+++K+K++ K++KKKK+
Sbjct: 86 KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKK 121
Score = 31.0 bits (70), Expect = 0.90
Identities = 14/35 (40%), Positives = 24/35 (68%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
P E K +K K + K+K+ K+K+K+K+KK ++ R
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158
Score = 30.6 bits (69), Expect = 1.0
Identities = 11/33 (33%), Positives = 22/33 (66%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+E + +++K+K++ K++KKKKK+K K
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127
Score = 30.6 bits (69), Expect = 1.2
Identities = 15/43 (34%), Positives = 26/43 (60%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
P + K + K ++K+K++ K++KKKKK+K K+ K E
Sbjct: 93 KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135
Score = 29.5 bits (66), Expect = 2.2
Identities = 10/36 (27%), Positives = 22/36 (61%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ K K K K + K+++K+K++ K++KK++
Sbjct: 85 SKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK 120
Score = 29.5 bits (66), Expect = 2.6
Identities = 13/36 (36%), Positives = 27/36 (75%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
PA + ++ K++ K+K++ K++KKKKK+K K++ +
Sbjct: 94 PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129
Score = 29.5 bits (66), Expect = 2.9
Identities = 12/33 (36%), Positives = 23/33 (69%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ + ++KK KK+K K++ K +K K++ K+KR
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142
Score = 29.5 bits (66), Expect = 2.9
Identities = 14/37 (37%), Positives = 26/37 (70%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E+E+ K++K KKK+K K++ K +K K++ K++R
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141
Score = 29.1 bits (65), Expect = 3.1
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
P +RE+ KK++ + K + KK KKK KKK+
Sbjct: 155 EEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190
Score = 29.1 bits (65), Expect = 3.6
Identities = 14/43 (32%), Positives = 26/43 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
E++K K+K ++ K +K K++ K+K+ K+K + K E P
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158
Score = 28.3 bits (63), Expect = 5.3
Identities = 13/38 (34%), Positives = 22/38 (57%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
R K++ KK KKK KKK+ +++K+++ R K
Sbjct: 169 VRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKG 206
Score = 28.3 bits (63), Expect = 6.4
Identities = 15/53 (28%), Positives = 29/53 (54%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTV 70
P +++ K++ K+K+ K+K+K+K+KK ++ + R K E K R
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPK 176
Score = 28.3 bits (63), Expect = 6.5
Identities = 13/44 (29%), Positives = 27/44 (61%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
+ E K+++ +K++ K++KKKKK+K K++ K R + +
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141
Score = 27.9 bits (62), Expect = 7.1
Identities = 16/44 (36%), Positives = 28/44 (63%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
+ +K+K + K++KKKKK+K K++ K +K + AK P K +
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEK 148
Score = 27.9 bits (62), Expect = 7.3
Identities = 14/37 (37%), Positives = 24/37 (64%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E+EK + K+ KKKKK+K K++ K +K K++ +
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 30.7 bits (70), Expect = 0.40
Identities = 9/36 (25%), Positives = 28/36 (77%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
R+K+ + + ++K+++K+++K++K++K++K + K
Sbjct: 5 RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGK 40
>gnl|CDD|235722 PRK06164, PRK06164, acyl-CoA synthetase; Validated.
Length = 540
Score = 32.0 bits (73), Expect = 0.41
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 1/55 (1%)
Query: 147 RAIKYRRMYKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
RA+ ++TGD G D +Y+ R +++ G V+ +E+E AL L V
Sbjct: 400 RALTDDGYFRTGDLGYTRGDGQFVYQTRMGDSLRLGGFLVNPAEIEHALEALPGV 454
>gnl|CDD|181207 PRK08043, PRK08043, bifunctional acyl-[acyl carrier protein]
synthetase/2-acylglycerophosphoethanolamine
acyltransferase; Validated.
Length = 718
Score = 32.0 bits (73), Expect = 0.43
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 101 EQSNDIAIVLYTSGSTGIPK 120
+Q D A++L+TSGS G PK
Sbjct: 362 QQPEDAALILFTSGSEGHPK 381
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 32.1 bits (73), Expect = 0.46
Identities = 15/26 (57%), Positives = 21/26 (80%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILA 57
+KKKKKKK K+KKKKK ++ ++I A
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHA 709
Score = 31.0 bits (70), Expect = 1.1
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKK 50
+KK KKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 30.6 bits (69), Expect = 1.4
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKK 51
+K KKKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 29.5 bits (66), Expect = 2.9
Identities = 15/22 (68%), Positives = 18/22 (81%)
Query: 37 KKKKKKKKKKKKKKKKRRILAK 58
+KKKKKKK K+KKKKK R +K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705
Score = 28.3 bits (63), Expect = 6.5
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKK 49
++KKK KKK K+KKKKK ++ K+
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQ 706
Score = 28.3 bits (63), Expect = 7.5
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKKK 52
+ KKKKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.6 bits (71), Expect = 0.54
Identities = 7/36 (19%), Positives = 15/36 (41%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E+AKK K + + + + ++K + K
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374
Score = 30.9 bits (69), Expect = 1.0
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
+A + + ++K + +KKK++ KKK K+ E
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
Score = 30.9 bits (69), Expect = 1.2
Identities = 6/31 (19%), Positives = 16/31 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ +A K + + + + ++K + +KKK
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 30.1 bits (67), Expect = 1.6
Identities = 9/36 (25%), Positives = 22/36 (61%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+ +A+ ++ + ++K + +KKK++ KK+ AK
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
Score = 29.7 bits (66), Expect = 2.5
Identities = 7/30 (23%), Positives = 14/30 (46%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ AKKK + KK + K + + + +
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
Score = 29.7 bits (66), Expect = 2.6
Identities = 6/40 (15%), Positives = 16/40 (40%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
++ + KK + K + + + + ++K K E
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Score = 29.7 bits (66), Expect = 2.7
Identities = 5/36 (13%), Positives = 18/36 (50%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
++KA++ K + K + + + + +++ A+
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370
Score = 29.3 bits (65), Expect = 3.2
Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 3/43 (6%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKK---KKKRRILAKSCE 61
E KA+ + + + ++K + +KKK KKK K E
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAE 1388
Score = 29.0 bits (64), Expect = 4.2
Identities = 6/36 (16%), Positives = 19/36 (52%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
++ A+ K + + + + ++K + +K++ AK
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377
Score = 29.0 bits (64), Expect = 4.3
Identities = 10/40 (25%), Positives = 16/40 (40%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
E K K KKK ++ KK + K + + + E
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364
Score = 28.6 bits (63), Expect = 5.0
Identities = 17/48 (35%), Positives = 24/48 (50%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTV 70
EKA+ + KK++ KKK KKK ++KKK K E K +
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410
Score = 28.2 bits (62), Expect = 6.9
Identities = 16/45 (35%), Positives = 23/45 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
E +KA + K K ++ KKK KKK ++ KK AK+ A
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Score = 28.2 bits (62), Expect = 7.1
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E + K + +KKK++ KKK KKK ++K++
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 31.6 bits (72), Expect = 0.55
Identities = 13/42 (30%), Positives = 20/42 (47%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
PA + K +K K +K K ++ +KK K+ LAK
Sbjct: 330 PAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKV 371
Score = 30.8 bits (70), Expect = 0.91
Identities = 10/43 (23%), Positives = 19/43 (44%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
++K K +K K ++ +KK KK + +AK+
Sbjct: 339 AVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381
Score = 29.7 bits (67), Expect = 2.1
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
E+ AK + K ++ +KK KK + K K
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAK 373
Score = 28.5 bits (64), Expect = 4.5
Identities = 9/31 (29%), Positives = 14/31 (45%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ K K K ++ +KK KK + K K
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAKA 374
>gnl|CDD|240316 PTZ00216, PTZ00216, acyl-CoA synthetase; Provisional.
Length = 700
Score = 31.5 bits (72), Expect = 0.57
Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 2/33 (6%)
Query: 90 GESML--HADNNNEQSNDIAIVLYTSGSTGIPK 120
G S H N E ++D+A+++YTSG+TG PK
Sbjct: 248 GHSAGSHHPLNIPENNDDLALIMYTSGTTGDPK 280
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 31.8 bits (72), Expect = 0.57
Identities = 11/42 (26%), Positives = 18/42 (42%)
Query: 12 FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
DK A REK ++ + + KK ++ KK K+
Sbjct: 1168 LDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKK 1209
Score = 29.8 bits (67), Expect = 2.4
Identities = 7/41 (17%), Positives = 15/41 (36%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
E + K ++ + + KK ++ KK + K
Sbjct: 1176 EEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASE 1216
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 30.9 bits (70), Expect = 0.58
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
KKKKN+ K +K+ K+K +KKK I
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINII 31
Score = 30.1 bits (68), Expect = 1.0
Identities = 9/33 (27%), Positives = 17/33 (51%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
+K KK + K +K+ K+K +K++I
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34
Score = 29.3 bits (66), Expect = 1.8
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKR 53
+KK K + K +K+ K+K +KK+
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKK 27
Score = 28.2 bits (63), Expect = 3.8
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K KK ++ K +K+ K+K +KKK
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINII 31
Score = 28.2 bits (63), Expect = 3.9
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
++K + N K +K+ K+K +KKK I+AK
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAK 38
Score = 27.0 bits (60), Expect = 8.4
Identities = 11/32 (34%), Positives = 16/32 (50%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
KKKK + K +K+ K+K R+ NP
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 29.5 bits (67), Expect = 0.58
Identities = 13/31 (41%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
AP+ R+K +++K+KKK KK + ++ KKK
Sbjct: 64 APY-RDKEERRKDKKKYGKKARLREWKKKVF 93
Score = 26.4 bits (59), Expect = 6.3
Identities = 8/25 (32%), Positives = 18/25 (72%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRIL 56
K++++K KKK KK + ++ K+++
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|213300 cd05934, FACL_DitJ_like, Uncharacterized subfamily of fatty acid
CoA ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions. Members of this
family include DitJ from Pseudomonas and similar
proteins.
Length = 421
Score = 31.4 bits (72), Expect = 0.58
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A +LYTSG+TG PK
Sbjct: 82 DTAAILYTSGTTGPPK 97
>gnl|CDD|213298 cd05932, LC_FACS_bac, Bacterial long-chain fatty acid CoA
synthetase (LC-FACS), including Marinobacter
hydrocarbonoclasticus isoprenoid Coenzyme A synthetase.
The members of this family are bacterial long-chain
fatty acid CoA synthetase. Marinobacter
hydrocarbonoclasticus isoprenoid Coenzyme A synthetase
in this family is involved in the synthesis of
isoprenoid wax ester storage compounds when grown on
phytol as the sole carbon source. LC-FACS catalyzes the
formation of fatty acyl-CoA in a two-step reaction: the
formation of a fatty acyl-AMP molecule as an
intermediate, and the formation of a fatty acyl-CoA.
Free fatty acids must be "activated" to their CoA
thioesters before participating in most catabolic and
anabolic reactions.
Length = 504
Score = 31.5 bits (72), Expect = 0.61
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
D+A ++YTSG+TG PK
Sbjct: 137 EDLATIVYTSGTTGQPK 153
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The
function is unclear.
Length = 151
Score = 30.3 bits (68), Expect = 0.63
Identities = 15/31 (48%), Positives = 22/31 (70%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E K + +K+ KKKK+K +KK K KKK++K
Sbjct: 16 ESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46
Score = 28.0 bits (62), Expect = 3.7
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
+E K + K KKKK+K +KK K KKK
Sbjct: 15 KESGKTQSQKSDKKKKEKVSEKKGKSKKK 43
Score = 28.0 bits (62), Expect = 3.9
Identities = 14/31 (45%), Positives = 23/31 (74%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+R+++ K +++K KKKK+K +KK K KKK
Sbjct: 13 QRKESGKTQSQKSDKKKKEKVSEKKGKSKKK 43
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 31.6 bits (72), Expect = 0.64
Identities = 17/99 (17%), Positives = 38/99 (38%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSA 81
E+ N K K+++K + + + + + L +L +
Sbjct: 222 EKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGIS 281
Query: 82 SLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
+ +++ + NE + I ++YTSG++G PK
Sbjct: 282 IILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGTSGKPK 320
>gnl|CDD|168170 PRK05677, PRK05677, long-chain-fatty-acid--CoA ligase; Validated.
Length = 562
Score = 31.3 bits (71), Expect = 0.64
Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 11/58 (18%)
Query: 63 PAYKGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
PAY LPQ + +L+ G+ + A N Q++D+A++ YT G+TG+ K
Sbjct: 177 PAY------HLPQAVKFNDALAKGA--GQPVTEA---NPQADDVAVLQYTGGTTGVAK 223
>gnl|CDD|240325 PTZ00237, PTZ00237, acetyl-CoA synthetase; Provisional.
Length = 647
Score = 31.3 bits (71), Expect = 0.64
Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 150 KYRRMYKTGDYGRIVDDILLYE--GRTDSQVKIRGNRVDLSEVEAALSK 196
K+ Y +GD G D+ Y R+D Q+KI GN+V L+ +E ++ K
Sbjct: 489 KFPGYYNSGDLG-FKDENGYYTIVSRSDDQIKISGNKVQLNTIETSILK 536
Score = 29.3 bits (66), Expect = 3.2
Identities = 11/24 (45%), Positives = 18/24 (75%)
Query: 236 RTDSQVKIRGNRVDLSEVEAALSK 259
R+D Q+KI GN+V L+ +E ++ K
Sbjct: 513 RSDDQIKISGNKVQLNTIETSILK 536
>gnl|CDD|235809 PRK06464, PRK06464, phosphoenolpyruvate synthase; Validated.
Length = 795
Score = 31.3 bits (72), Expect = 0.65
Identities = 8/21 (38%), Positives = 14/21 (66%), Gaps = 2/21 (9%)
Query: 134 IVDDIHRCWSS--GDRAIKYR 152
+++ + C++S DRAI YR
Sbjct: 153 VLEAVKECFASLFTDRAISYR 173
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 29.9 bits (68), Expect = 0.65
Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 5/42 (11%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKK-----KKKKKKKRRI 55
A R KA +++ K K+KKKKKKK+ + + ++KKK +
Sbjct: 64 AGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEEL 105
Score = 26.8 bits (60), Expect = 9.4
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K K ++ K ++++K K+KKKKKKK+
Sbjct: 61 KRKAGASRNKAAEERRKLKEKKKKKKKEL 89
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 30.6 bits (70), Expect = 0.67
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
FE EK KKK K++KK K++K K ++
Sbjct: 95 FEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 31.3 bits (71), Expect = 0.67
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
K+ +P PF+ K +KKK +KK ++ K +++ K +KK
Sbjct: 2 KITVPQPFQMTKREKKKREKKSIRQSKLEEELNKLEKK 39
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 29.8 bits (67), Expect = 0.69
Identities = 13/25 (52%), Positives = 20/25 (80%)
Query: 31 NKKKKKKKKKKKKKKKKKKKKKRRI 55
N+++ KK KKK++KKKKKKK + +
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKNL 123
Score = 27.1 bits (60), Expect = 6.8
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKK 52
+++ KK KKK++KKKKKKK K
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKNLA 124
Score = 27.1 bits (60), Expect = 7.1
Identities = 15/30 (50%), Positives = 19/30 (63%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKK 47
PA + +AKK K K++KKKKKKK K
Sbjct: 95 PAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124
Score = 26.7 bits (59), Expect = 8.5
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKR 53
++ KK KKK++KKKKKKK K
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKNLA 124
>gnl|CDD|213278 cd05910, FACL_like_1, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 455
Score = 31.2 bits (71), Expect = 0.71
Identities = 12/21 (57%), Positives = 14/21 (66%), Gaps = 2/21 (9%)
Query: 105 DIAIVLYTSGSTGIPK--EYR 123
D A +L+TSGSTG PK Y
Sbjct: 86 DPAAILFTSGSTGPPKGVVYT 106
>gnl|CDD|102207 PRK06145, PRK06145, acyl-CoA synthetase; Validated.
Length = 497
Score = 31.0 bits (70), Expect = 0.73
Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 155 YKTGDYGRIVDDILLY-EGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVL 206
+++GD G + ++ LY R + G + SEVE + +L EV + +V+
Sbjct: 375 FRSGDVGYLDEEGFLYLTDRKKDMIISGGENIASSEVERVIYELPEVAEAAVI 427
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 30.7 bits (70), Expect = 0.75
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
E + + + ++ K+KKKKKKKKKKK + CE
Sbjct: 69 NDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109
Score = 28.4 bits (64), Expect = 4.2
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
++ K+KKKKKKKKKKK++ R + +
Sbjct: 79 ADRAEELLKEKKKKKKKKKKKEELREWMCEKF 110
Score = 28.0 bits (63), Expect = 6.6
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+K + + + ++ K+KKKKKKKKKKK++ + + K
Sbjct: 64 EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 31.3 bits (71), Expect = 0.78
Identities = 20/58 (34%), Positives = 25/58 (43%), Gaps = 23/58 (39%)
Query: 22 EREKAKKKKNKKKKK-----------------------KKKKKKKKKKKKKKKKRRIL 56
E EK +++ KK K K KK KKKKKKKKK+R+ L
Sbjct: 235 EAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292
Score = 30.5 bits (69), Expect = 1.2
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
K KK KKKKKKKKK++K + + ++ +
Sbjct: 274 KFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGS 306
Score = 29.0 bits (65), Expect = 3.6
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKK 43
E K KK K KKKKKKK++K
Sbjct: 269 VSEMVKFKKPKKKKKKKKKRRKDL 292
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 31.0 bits (70), Expect = 0.79
Identities = 16/58 (27%), Positives = 18/58 (31%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSAS 82
K KK KK KK K KK KK K AK + + S
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGS 307
Score = 30.6 bits (69), Expect = 1.0
Identities = 14/42 (33%), Positives = 19/42 (45%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
+ KK K K KK KK KK KK + + K+ + A
Sbjct: 234 TAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAA 275
Score = 30.2 bits (68), Expect = 1.1
Identities = 14/46 (30%), Positives = 19/46 (41%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
+ KK + K K KK KKK K + AK+ A K +
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAK 253
Score = 30.2 bits (68), Expect = 1.2
Identities = 17/40 (42%), Positives = 20/40 (50%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
KA KK K KK KK K KK KK + AK+ + A
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAA 286
Score = 30.2 bits (68), Expect = 1.4
Identities = 15/43 (34%), Positives = 17/43 (39%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
KA K K K K K K K KKK ++ A S K
Sbjct: 273 KAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315
Score = 29.9 bits (67), Expect = 1.6
Identities = 16/42 (38%), Positives = 17/42 (40%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
A K KK K K KK KKK K + AK A K
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAK 247
Score = 29.9 bits (67), Expect = 1.9
Identities = 16/42 (38%), Positives = 17/42 (40%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
K K K KK KK KK KK K + AK A K
Sbjct: 239 KKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAK 280
Score = 29.1 bits (65), Expect = 3.1
Identities = 14/50 (28%), Positives = 16/50 (32%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
+ K KK KKK K KK K AK + A K
Sbjct: 210 SGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKA 259
Score = 28.3 bits (63), Expect = 4.6
Identities = 16/44 (36%), Positives = 16/44 (36%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
KK KK KK KK K KK KK A A K
Sbjct: 245 AAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKA 288
Score = 27.9 bits (62), Expect = 6.2
Identities = 12/34 (35%), Positives = 14/34 (41%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++ K KK KK K KK KK K K
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAK 280
Score = 27.9 bits (62), Expect = 7.6
Identities = 14/50 (28%), Positives = 18/50 (36%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
+ A ++ AK K KK K K KK KK+ A K
Sbjct: 194 VGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAK 243
Score = 27.5 bits (61), Expect = 9.1
Identities = 14/46 (30%), Positives = 19/46 (41%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
A +K KKK K KK K K KK ++ K+ + A
Sbjct: 219 AKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAA 264
Score = 27.5 bits (61), Expect = 9.6
Identities = 16/49 (32%), Positives = 16/49 (32%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYK 66
PA K K K KK KKK K KK A A K
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKK 254
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 29.5 bits (66), Expect = 0.83
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
K KK + K K KKK KKKKK++ RR
Sbjct: 8 SKEKKAEKAKAGTAKDKKKWSKKKKKEEARRA 39
Score = 26.4 bits (58), Expect = 8.1
Identities = 16/37 (43%), Positives = 20/37 (54%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
AP + +KK K K K KKK KKKKK++ R
Sbjct: 2 APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARR 38
>gnl|CDD|171961 PRK13295, PRK13295, cyclohexanecarboxylate-CoA ligase; Reviewed.
Length = 547
Score = 30.8 bits (70), Expect = 0.86
Identities = 9/17 (52%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D+ ++YTSG+TG PK
Sbjct: 197 DDVTQLIYTSGTTGEPK 213
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 30.1 bits (68), Expect = 0.86
Identities = 14/44 (31%), Positives = 20/44 (45%)
Query: 8 SEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
S ++ P K KKK+ K K +K+K K + KK K
Sbjct: 105 STDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Score = 29.7 bits (67), Expect = 0.97
Identities = 9/37 (24%), Positives = 19/37 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E ++++ + K + K+KKK++ K K + K
Sbjct: 104 ESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140
Score = 29.3 bits (66), Expect = 1.4
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E+E + K + K+KKK++ K K +K+K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 28.6 bits (64), Expect = 2.4
Identities = 13/36 (36%), Positives = 19/36 (52%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
P + E +KKK + K K +K+K K + KK K
Sbjct: 113 DPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Score = 28.2 bits (63), Expect = 3.9
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E + ++ + K + K+KKK++ K K
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTE 135
Score = 27.8 bits (62), Expect = 5.5
Identities = 12/31 (38%), Positives = 15/31 (48%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
+RE K K K+K K + KK K K K
Sbjct: 122 KEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 27.0 bits (60), Expect = 7.7
Identities = 7/30 (23%), Positives = 18/30 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
E E ++ ++ +++ + K + K+KKK+
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKR 127
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.8 bits (70), Expect = 0.90
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
A E + NK + K+K K ++ K+K++K+ AK
Sbjct: 258 AWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAK 297
Score = 30.4 bits (69), Expect = 1.0
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K K ++NK+K++K+ +++ K++K+ KKK LA+
Sbjct: 278 KTKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR 311
Score = 28.1 bits (63), Expect = 6.4
Identities = 10/38 (26%), Positives = 24/38 (63%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
R K K K + K+K++K+ +++ K++K+ K+++
Sbjct: 269 INKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKL 306
Score = 27.7 bits (62), Expect = 7.8
Identities = 9/37 (24%), Positives = 21/37 (56%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E E K K+K K ++ K+K++K+ +++ + +
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301
>gnl|CDD|213276 cd05908, A_NRPS_MycA_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS) similar to mycosubtilin
synthase subunit A (MycA). The adenylation (A) domain
of NRPS recognizes a specific amino acid or hydroxy acid
and activates it as (amino)-acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. This family includes
NRPS similar to mycosubtilin synthase subunit A (MycA).
Mycosubtilin, which is characterized by a beta-amino
fatty acid moiety linked to the circular heptapeptide
Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin
family of lipopeptide antibiotics. The mycosubtilin
synthase subunit A (MycA) combines functional domains
derived from peptide synthetases, amino transferases,
and fatty acid synthases. Nonribosomal peptide
synthetases are large multifunction enzymes that
synthesize many therapeutically useful peptides. NRPS
has a distinct modular structure in which each module is
responsible for the recognition, activation, and, in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions.
Length = 499
Score = 30.7 bits (70), Expect = 0.92
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 100 NEQSNDIAIVLYTSGSTGIPK 120
+DIA + ++SGSTG PK
Sbjct: 102 LSLPDDIAFIQFSSGSTGEPK 122
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation
of 60S ribosomal subunits.
Length = 132
Score = 29.6 bits (67), Expect = 0.98
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K K KKK K +K +KK K++ + +KR+ L K
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60
Score = 28.9 bits (65), Expect = 1.8
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
R K K KK K +K +KK K++ + +K++ +
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQAL 58
Score = 28.1 bits (63), Expect = 2.8
Identities = 10/33 (30%), Positives = 22/33 (66%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+ KK K++K +KK K++ + +K++ +K R+
Sbjct: 31 LKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRV 63
Score = 27.7 bits (62), Expect = 4.5
Identities = 17/46 (36%), Positives = 22/46 (47%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLR 77
+ KK K KKK K +K +KK +R L KGR LP+
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDL 71
Score = 26.6 bits (59), Expect = 9.4
Identities = 10/41 (24%), Positives = 25/41 (60%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
++K K +K +KK K++ + +K++ +K + + +L + E
Sbjct: 33 AKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEK 73
>gnl|CDD|178337 PLN02736, PLN02736, long-chain acyl-CoA synthetase.
Length = 651
Score = 30.8 bits (70), Expect = 1.0
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
D+A + YTSG+TG PK
Sbjct: 222 DVATICYTSGTTGTPK 237
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 29.6 bits (67), Expect = 1.0
Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 4/45 (8%)
Query: 9 EHHFDKV---IIPAPFEREKAKK-KKNKKKKKKKKKKKKKKKKKK 49
E + + +I KA+K KN++KK + K+K K KKK
Sbjct: 88 EPKYRLIRLGLIKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 27.7 bits (62), Expect = 4.7
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
KKKK +K +K+ K ++KK + K+K + K
Sbjct: 99 IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKK 131
>gnl|CDD|213299 cd05933, ACSBG_like, Bubblegum-like very long-chain fatty acid CoA
synthetase (VL-FACS). This family of very long-chain
fatty acid CoA synthetase is named bubblegum because
Drosophila melanogaster mutant bubblegum (BGM) has
elevated levels of very-long-chain fatty acids (VLCFA)
caused by a defective gene of this family. The human
homolog (hsBG) has been characterized as a very long
chain fatty acid CoA synthetase that functions
specifically in the brain; hsBG may play a central role
in brain VLCFA metabolism and myelinogenesis. VL-FACS is
involved in the first reaction step of very long chain
fatty acid degradation. It catalyzes the formation of
fatty acyl-CoA in a two-step reaction: the formation of
a fatty acyl-AMP molecule as an intermediate, and the
formation of a fatty acyl-CoA. Free fatty acids must be
"activated" to their CoA thioesters before participating
in most catabolic and anabolic reactions.
Length = 594
Score = 30.8 bits (70), Expect = 1.0
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 91 ESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
+ L A +++ N ++YTSG+TG+PK
Sbjct: 137 DEQLDAIIESQKPNQCCTLIYTSGTTGMPK 166
>gnl|CDD|235279 PRK04319, PRK04319, acetyl-CoA synthetase; Provisional.
Length = 570
Score = 30.6 bits (70), Expect = 1.0
Identities = 12/16 (75%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
D AI+ YTSGSTG PK
Sbjct: 206 DGAILHYTSGSTGKPK 221
>gnl|CDD|213316 cd05969, MACS_like_4, Uncharacterized subfamily of Acetyl-CoA
synthetase like family (ACS). This family is most
similar to acetyl-CoA synthetase. Acetyl-CoA synthetase
(ACS) catalyzes the formation of acetyl-CoA from
acetate, CoA, and ATP. Synthesis of acetyl-CoA is
carried out in a two-step reaction. In the first step,
the enzyme catalyzes the synthesis of acetyl-AMP
intermediate from acetate and ATP. In the second step,
acetyl-AMP reacts with CoA to produce acetyl-CoA. This
enzyme is only present in bacteria.
Length = 443
Score = 30.5 bits (69), Expect = 1.0
Identities = 31/118 (26%), Positives = 43/118 (36%), Gaps = 19/118 (16%)
Query: 107 AIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWSSGDRAI-----KYRR-----MYK 156
A V+ G P + G+ + W S R +Y Y
Sbjct: 271 AAVIERDGDGLTP--VTGPGQVGELA-----LKPGWPSMFRGYLGNEERYASSFVDGWYL 323
Query: 157 TGDYG-RIVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEVDKLSVLCYKPGEV 213
TGD R D + GR D +K G+ V EVE+AL + V + V+ KP V
Sbjct: 324 TGDLAYRDEDGYFWFVGRADDVIKTAGHLVGPFEVESALMEHPAVAEAGVIG-KPDPV 380
Score = 27.8 bits (62), Expect = 7.6
Identities = 9/16 (56%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A++ +TSG+TG PK
Sbjct: 90 DPALLHFTSGTTGKPK 105
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 27.2 bits (61), Expect = 1.1
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKK 49
RE+ KK KKKKK K + K K
Sbjct: 11 RERNAKKAAKKKKKGAKSQLKAAAKAL 37
Score = 27.2 bits (61), Expect = 1.1
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
RE+A+++ KK KKKKK K + K K
Sbjct: 7 REQARERNAKKAAKKKKKGAKSQLKAAAKA 36
>gnl|CDD|218991 pfam06326, Vesiculo_matrix, Vesiculovirus matrix protein. This
family consists of several Vesiculovirus matrix
proteins. The matrix (M) protein of vesicular
stomatitis virus (VSV) expressed in the absence of
other viral components causes many of the cytopathic
effects of VSV, including an inhibition of host gene
expression and the induction of cell rounding. It has
been shown that M protein also induces apoptosis in the
absence of other viral components. It is thought that
the activation of apoptotic pathways causes the
inhibition of host gene expression and cell rounding by
M protein.
Length = 229
Score = 30.1 bits (68), Expect = 1.1
Identities = 12/43 (27%), Positives = 17/43 (39%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
K K K K+ KK K KK + P+Y+ + P
Sbjct: 4 FKLLKLKGKEVASKKSKGSKKLNMSMDWDAPPSYEETRPGAAP 46
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap
is involved in recombination. Rap (recombination adept
with plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 30.0 bits (68), Expect = 1.1
Identities = 10/36 (27%), Positives = 22/36 (61%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E A K +K ++KK+K + + ++++ K R+ K+
Sbjct: 29 ECALALKREKAQEKKRKAEAQAERRELKARKEKLKT 64
Score = 28.8 bits (65), Expect = 2.7
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
REKA++KK K + + ++++ K +K+K K +
Sbjct: 36 REKAQEKKRKAEAQAERRELKARKEKLKTRS 66
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.4 bits (69), Expect = 1.1
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKK 48
FER+ K K+KKK+KKK++KKKK+
Sbjct: 317 FERDNEKLDAEVKEKKKEKKKEEKKKKQ 344
Score = 30.4 bits (69), Expect = 1.3
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
FE K+K K K ++ +K + K+KKK+++ K
Sbjct: 303 FEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341
Score = 30.0 bits (68), Expect = 1.4
Identities = 13/35 (37%), Positives = 25/35 (71%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+ + ++ N+K + K+KKK+KKK++KKK++I
Sbjct: 311 RKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQI 345
Score = 28.1 bits (63), Expect = 5.6
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ + + + + K+KKK+KKK++KKKK+
Sbjct: 312 KLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQ 344
Score = 28.1 bits (63), Expect = 6.6
Identities = 13/32 (40%), Positives = 24/32 (75%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ EK + +KKK+KKK++KKKK+ ++ ++R
Sbjct: 320 DNEKLDAEVKEKKKEKKKEEKKKKQIERLEER 351
Score = 28.1 bits (63), Expect = 6.8
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E+ +K + + K+KKK+KKK++KKKK+ +R
Sbjct: 317 FERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 30.7 bits (69), Expect = 1.2
Identities = 16/49 (32%), Positives = 21/49 (42%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTV 70
ER K K+K+ KK K KK++K K+K S A T
Sbjct: 562 ERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSAFFAKLQETK 610
Score = 28.0 bits (62), Expect = 7.3
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
E ++ K + ++KKK+K++ KK K KK+R+I
Sbjct: 553 ELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKI 586
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 30.5 bits (69), Expect = 1.2
Identities = 17/45 (37%), Positives = 23/45 (51%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
+ ++ +NK+ KKKKKKKKKKKKK + LA
Sbjct: 45 SENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAA 89
Score = 30.1 bits (68), Expect = 1.8
Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 3/50 (6%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSL 73
+ + + + K+ KKKKKKKKKKKKK + AY V S
Sbjct: 41 AELISENQEAENKQNNKKKKKKKKKKKKKN---LGEAYDLAYDLPVVWSS 87
Score = 28.9 bits (65), Expect = 4.2
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
+ K++K + K+K KKK KK KK LA+
Sbjct: 9 EVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISE 46
Score = 28.5 bits (64), Expect = 4.6
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E + + ++K++ K+K KKK KK KK
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32
Score = 27.8 bits (62), Expect = 7.8
Identities = 15/38 (39%), Positives = 18/38 (47%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E E + K KKKKKKKKKKK + + S
Sbjct: 49 EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWS 86
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 28.7 bits (65), Expect = 1.2
Identities = 15/43 (34%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 14 KVIIPAPFEREKAKKKKNKKKKKKKKK--KKKKKKKKKKKKRR 54
V +P E K++K KKKKK + + +KK+KK+ ++R+
Sbjct: 25 GVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRK 67
Score = 28.0 bits (63), Expect = 2.6
Identities = 15/42 (35%), Positives = 25/42 (59%), Gaps = 3/42 (7%)
Query: 14 KVIIPAPFEREKAKKKKNKKK---KKKKKKKKKKKKKKKKKK 52
+ I F+REK KKKK + +KK+KK+ +++K +K
Sbjct: 30 EKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQK 71
>gnl|CDD|213317 cd05970, MACS_AAE_MA_like, Medium-chain acyl-CoA synthetase (MACS)
of AAE_MA like. MACS catalyzes the two-step activation
of medium chain fatty acids (containing 4-12 carbons).
The carboxylate substrate first reacts with ATP to form
an acyl-adenylate intermediate, which then reacts with
CoA to produce an acyl-CoA ester. This family of MACS
enzymes is found in archaea and bacteria. It is
represented by the acyl-adenylating enzyme from
Methanosarcina acetivorans (AAE_MA). AAE_MA is most
active with propionate, butyrate, and the branched
analogs: 2-methyl-propionate, butyrate, and pentanoate.
The specific activity is weaker for smaller or larger
acids.
Length = 537
Score = 30.5 bits (69), Expect = 1.2
Identities = 17/41 (41%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 155 YKTGDYG-RIVDDILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
Y TGD D L + GR D +K G R+ EVE+AL
Sbjct: 414 YHTGDTAWMDEDGYLWFVGRADDLIKSSGYRIGPFEVESAL 454
Score = 28.2 bits (63), Expect = 6.4
Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 10/80 (12%)
Query: 228 DDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEVDKLSVLCYKPGEVDQIWEAPSSAG 287
D L + GR D +K G R+ EVE+AL + V + C G D I G
Sbjct: 425 DGYLWFVGRADDLIKSSGYRIGPFEVESALIQHPAV----LECAVTGVPDPI------RG 474
Query: 288 STDSSFIVKSREDRAEEELV 307
+ IV ++ EEL
Sbjct: 475 QVVKATIVLTKGYEPSEELK 494
Score = 28.2 bits (63), Expect = 6.6
Identities = 10/16 (62%), Positives = 14/16 (87%)
Query: 105 DIAIVLYTSGSTGIPK 120
DI +V +TSG+TG+PK
Sbjct: 186 DILLVYFTSGTTGMPK 201
>gnl|CDD|132249 TIGR03205, pimA, dicarboxylate--CoA ligase PimA. PimA, a member of
a large family of acyl-CoA ligases, is found in a
characteristic operon pimFABCDE for the metabolism of
pimelate and related compounds. It is found, so far, in
Bradyrhizobium japonicum and several strains of
Rhodopseudomonas palustris. PimA from R. palustris was
shown to be active as a CoA ligase for C(7) to C(14)
dicarboxylates and fatty acids.
Length = 541
Score = 30.3 bits (68), Expect = 1.2
Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 3/45 (6%)
Query: 104 NDIAIVLYTSGSTGIPKEYRRMYKTGDYGRIVDDIHRCWSSGDRA 148
+D+A++ YT G+TG+PK M G+ V I+ W RA
Sbjct: 192 DDVALLQYTGGTTGLPKG--AMLTHGNLTSAV-SIYDVWGKPSRA 233
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 30.4 bits (69), Expect = 1.3
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
PA FER A K K++ KK+KKKK KK K
Sbjct: 344 PAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
Score = 28.1 bits (63), Expect = 6.9
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 32 KKKKKKKKKKKKKKKKKKK 50
K+K++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374
Score = 28.1 bits (63), Expect = 6.9
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 33 KKKKKKKKKKKKKKKKKKK 51
K+K++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374
Score = 28.1 bits (63), Expect = 6.9
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 34 KKKKKKKKKKKKKKKKKKK 52
K+K++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374
>gnl|CDD|213287 cd05920, 23DHB-AMP_lg, 2,3-dihydroxybenzoate-AMP ligase.
2,3-dihydroxybenzoate-AMP ligase activates
2,3-dihydroxybenzoate (DHB) by ligation of AMP from ATP
with the release of pyrophosphate. However, it can also
catalyze the ATP-PPi exchange for 2,3-DHB analogs, such
as salicyclic acid (o-hydrobenzoate), as well as 2,4-DHB
and 2,5-DHB, but with less efficiency. Proteins in this
family are the stand-alone adenylation components of
non-ribosomal peptide synthases (NRPSs) involved in the
biosynthesis of siderophores, which are low molecular
weight iron-chelating compounds synthesized by many
bacteria to aid in the acquisition of this vital trace
elements. In Escherichia coli, the
2,3-dihydroxybenzoate-AMP ligase is called EntE, the
adenylation component of the enterobactin NRPS system.
Length = 483
Score = 30.3 bits (69), Expect = 1.3
Identities = 7/16 (43%), Positives = 12/16 (75%)
Query: 105 DIAIVLYTSGSTGIPK 120
D+A+ + G+TG+PK
Sbjct: 140 DVALFQLSGGTTGLPK 155
>gnl|CDD|213319 cd05972, MACS_like, Medium-chain acyl-CoA synthetase (MACS or
ACSM). MACS catalyzes the two-step activation of medium
chain fatty acids (containing 4-12 carbons). The
carboxylate substrate first reacts with ATP to form an
acyl-adenylate intermediate, which then reacts with CoA
to produce an acyl-CoA ester. The acyl-CoA is a key
intermediate in many important biosynthetic and
catabolic processes.
Length = 430
Score = 30.4 bits (69), Expect = 1.3
Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 1/47 (2%)
Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
Y TGD +D + GR D +K G R+ EVE+AL + V
Sbjct: 309 YLTGDRAIKDEDGYFWFVGRADDVIKSSGYRIGPFEVESALIEHPAV 355
Score = 29.6 bits (67), Expect = 2.1
Identities = 9/16 (56%), Positives = 14/16 (87%)
Query: 105 DIAIVLYTSGSTGIPK 120
D A++ +TSG+TG+PK
Sbjct: 82 DPALLYFTSGTTGLPK 97
>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
family). Protein phosphatase 2A (PP2A) is a major
intracellular protein phosphatase that regulates
multiple aspects of cell growth and metabolism. The
ability of this widely distributed heterotrimeric enzyme
to act on a diverse array of substrates is largely
controlled by the nature of its regulatory B subunit.
There are multiple families of B subunits (See also
pfam01240), this family is called the B56 family.
Length = 405
Score = 30.3 bits (69), Expect = 1.3
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+ K K+K+ K+KKK+KK+KK RI
Sbjct: 369 PDLFDECGNKYKEKEAKEKKKEKKRKKTWARI 400
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.2 bits (68), Expect = 1.4
Identities = 13/35 (37%), Positives = 15/35 (42%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
E KK KKK KK KKK+ K+ K
Sbjct: 140 LGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEE 174
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 29.9 bits (67), Expect = 1.4
Identities = 13/23 (56%), Positives = 14/23 (60%)
Query: 31 NKKKKKKKKKKKKKKKKKKKKKR 53
K K KKK K KK KKK+ KR
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
Score = 29.1 bits (65), Expect = 2.3
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 28 KKKNKKKKKKKKKKKKKKKK 47
K KKK K KK KKK+ K+
Sbjct: 210 KYSYKKKLKSKKLKKKQAKR 229
Score = 28.3 bits (63), Expect = 4.5
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKK 51
K+K KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229
Score = 28.3 bits (63), Expect = 4.8
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKK 49
K K KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229
Score = 27.2 bits (60), Expect = 9.6
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 33 KKKKKKKKKKKKKKKKKKKKRR 54
K K KKK K KK KKK+ +R
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 30.0 bits (68), Expect = 1.4
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
P P ++K+K KK KK KK+K+K+
Sbjct: 216 PTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
Score = 29.7 bits (67), Expect = 2.0
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
+ +K K KK KK KK+K+K+
Sbjct: 220 SSSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249
Score = 28.9 bits (65), Expect = 3.4
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
++K+K KK KK KK+K++R
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
>gnl|CDD|178452 PLN02861, PLN02861, long-chain-fatty-acid-CoA ligase.
Length = 660
Score = 30.2 bits (68), Expect = 1.5
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 100 NEQSNDIAIVLYTSGSTGIPK 120
+Q DI ++YTSG+TG PK
Sbjct: 216 PKQKTDICTIMYTSGTTGEPK 236
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 29.7 bits (67), Expect = 1.6
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
F E+ K+ KK +K + KKKK
Sbjct: 168 FAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast)
and GNL3L (human) are putative GTPases which are
required for growth and play a role in processing of
nucleolar pre-rRNA. This family contains a potential
nuclear localisation signal.
Length = 80
Score = 28.0 bits (63), Expect = 1.7
Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 1/31 (3%)
Query: 23 REKAKKK-KNKKKKKKKKKKKKKKKKKKKKK 52
R K +KK +K +K+ KK K KKKK
Sbjct: 3 RYKIEKKVAEHHRKLRKEAKKNPTWKSKKKK 33
>gnl|CDD|215464 PLN02860, PLN02860, o-succinylbenzoate-CoA ligase.
Length = 563
Score = 30.2 bits (68), Expect = 1.7
Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 3/42 (7%)
Query: 157 TGDYGRIVDDI--LLYEGRTDSQVKIRGNRVDLSEVEAALSK 196
TGD G I D L GR++ ++K G V EVEA LS+
Sbjct: 418 TGDIGWI-DKAGNLWLIGRSNDRIKTGGENVYPEEVEAVLSQ 458
Score = 28.6 bits (64), Expect = 4.4
Identities = 8/18 (44%), Positives = 13/18 (72%)
Query: 103 SNDIAIVLYTSGSTGIPK 120
+D ++ +TSG+TG PK
Sbjct: 171 PDDAVLICFTSGTTGRPK 188
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 30.0 bits (68), Expect = 1.7
Identities = 9/32 (28%), Positives = 16/32 (50%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
R KAK+ + +K K++ K K + R+
Sbjct: 16 RLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47
>gnl|CDD|237144 PRK12582, PRK12582, acyl-CoA synthetase; Provisional.
Length = 624
Score = 30.0 bits (68), Expect = 1.8
Identities = 10/15 (66%), Positives = 13/15 (86%)
Query: 106 IAIVLYTSGSTGIPK 120
+A L+TSGSTG+PK
Sbjct: 222 VAKYLFTSGSTGMPK 236
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 29.6 bits (67), Expect = 1.8
Identities = 16/34 (47%), Positives = 20/34 (58%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+RE KK K KKKK K K K K KK+++R
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 29.3 bits (66), Expect = 2.3
Identities = 15/33 (45%), Positives = 17/33 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E ++ KK K KKKK K K K K KK R
Sbjct: 205 ELQRELMKKGKGKKKKIVKDKDGKVVYKWKKER 237
>gnl|CDD|235719 PRK06155, PRK06155, crotonobetaine/carnitine-CoA ligase;
Provisional.
Length = 542
Score = 30.1 bits (68), Expect = 1.8
Identities = 21/58 (36%), Positives = 25/58 (43%), Gaps = 7/58 (12%)
Query: 70 VLSLPQLRLLSASLSHNNIPGESMLH-------ADNNNEQSNDIAIVLYTSGSTGIPK 120
L LP + LL A S + G S A Q D A +LYTSG+TG K
Sbjct: 139 DLPLPAVWLLDAPASVSVPAGWSTAPLPPLDAPAPAAAVQPGDTAAILYTSGTTGPSK 196
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 28.5 bits (64), Expect = 1.8
Identities = 13/50 (26%), Positives = 23/50 (46%), Gaps = 5/50 (10%)
Query: 10 HHFDKVIIPAPFEREKAKKKKNKKKKKKKKK-----KKKKKKKKKKKKRR 54
H ++ P +E+ K + KKK+KK + KK KK+++
Sbjct: 56 HLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSS 105
Score = 28.5 bits (64), Expect = 1.9
Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKK---KKKKKKKKKKRRILAKS 59
+ P + K ++K ++ KKK+KK + + KK K+R + S
Sbjct: 61 LQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSS 106
>gnl|CDD|181011 PRK07514, PRK07514, malonyl-CoA synthase; Validated.
Length = 504
Score = 29.8 bits (68), Expect = 1.9
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
D+A +LYTSG+TG K
Sbjct: 157 DLAAILYTSGTTGRSK 172
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 29.9 bits (68), Expect = 2.0
Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 1/53 (1%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
EREKA ++ + KK + + K K R+ AK + P
Sbjct: 463 EREKAA-REARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARP 514
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 28.0 bits (63), Expect = 2.0
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E EKA++++ K+ K K+++ ++K +K KK
Sbjct: 5 EEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 26.8 bits (60), Expect = 6.2
Identities = 8/32 (25%), Positives = 21/32 (65%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K++K ++K +++++ K+ K K+++ L K
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32
Score = 26.5 bits (59), Expect = 6.6
Identities = 6/25 (24%), Positives = 18/25 (72%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKK 49
K +K++ K +++++ K+ K K+++
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREE 25
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 28.9 bits (65), Expect = 2.0
Identities = 15/39 (38%), Positives = 28/39 (71%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
P R+KAK+K + K K KK++ +K+K+K+++ + LA+
Sbjct: 62 PEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAE 100
Score = 28.1 bits (63), Expect = 3.7
Identities = 11/37 (29%), Positives = 24/37 (64%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
+ K K +++K K KK++ +K+K+K+++ K A+
Sbjct: 67 KAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEK 103
Score = 27.7 bits (62), Expect = 4.7
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
KK K K ++ K K KK++ +K+K+K+ R + E
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAE 100
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 29.7 bits (67), Expect = 2.1
Identities = 13/42 (30%), Positives = 17/42 (40%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
E +AKK + KKK KKK+ KK + E
Sbjct: 596 EAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAERE 637
Score = 28.2 bits (63), Expect = 6.9
Identities = 11/34 (32%), Positives = 16/34 (47%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
++ K K + KKK+ KK K K +R I
Sbjct: 605 KQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREI 638
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 30.0 bits (68), Expect = 2.1
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+ +K + K ++K KK++KK+K K+++K RI
Sbjct: 645 NWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680
Score = 29.6 bits (67), Expect = 2.8
Identities = 17/56 (30%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN--PAYKGRTVLSLPQ 75
E++K K + ++K KK++KK+K K+++K+R +A++ + A R L + Q
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAETWDTIPLAQAIRGFLLVIQ 703
>gnl|CDD|162605 TIGR01923, menE, O-succinylbenzoate-CoA ligase. This model
represents an enzyme, O-succinylbenzoate-CoA ligase,
which is involved in the fourth step of the menaquinone
biosynthesis pathway. O-succinylbenzoate-CoA ligase,
together with menB - naphtoate synthase, take
2-succinylbenzoate and convert it into 1,4-di-hydroxy-2-
naphtoate [Biosynthesis of cofactors, prosthetic groups,
and carriers, Menaquinone and ubiquinone].
Length = 436
Score = 29.7 bits (67), Expect = 2.2
Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 11/55 (20%)
Query: 66 KGRTVLSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
K SL ++ E+ L A N +Q IA +++TSG+TG PK
Sbjct: 84 KDFQADSLDRIEAAGRY--------ETSLSASFNMDQ---IATLMFTSGTTGKPK 127
>gnl|CDD|213290 cd05923, CBAL, 4-Chlorobenzoate-CoA ligase (CBAL). CBAL catalyzes
the conversion of 4-chlorobenzoate (4-CB) to
4-chlorobenzoyl-coenzyme A (4-CB-CoA) by the two-step
adenylation and thioester-forming reactions.
4-Chlorobenzoate (4-CBA) is an environmental pollutant
derived from microbial breakdown of aromatic pollutants,
such as polychlorinated biphenyls (PCBs), DDT, and
certain herbicides. The 4-CBA degrading pathway converts
4-CBA to the metabolite 4-hydroxybezoate (4-HBA),
allowing some soil-dwelling microbes to utilize 4-CBA as
an alternate carbon source. This pathway consists of
three chemical steps catalyzed by 4-CBA-CoA ligase,
4-CBA-CoA dehalogenase, and 4HBA-CoA thioesterase in
sequential reactions.
Length = 495
Score = 29.4 bits (66), Expect = 2.3
Identities = 10/16 (62%), Positives = 12/16 (75%)
Query: 105 DIAIVLYTSGSTGIPK 120
A V YTSG+TG+PK
Sbjct: 153 QPAFVFYTSGTTGLPK 168
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 28.8 bits (65), Expect = 2.3
Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 7/57 (12%)
Query: 8 SEHHFDKVIIPAPFE-------REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
E ++P E A K+K K+ ++KK + +K R+ LA
Sbjct: 75 DESEAAPDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLA 131
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 29.6 bits (67), Expect = 2.3
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKKKKRR 54
KK KKK+K+K++ K + KK+KR+
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRK 652
Score = 29.6 bits (67), Expect = 2.7
Identities = 12/35 (34%), Positives = 22/35 (62%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
++ K K+K+ ++ K + KK+K+K KK K I+
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKNVII 661
Score = 28.5 bits (64), Expect = 4.9
Identities = 16/43 (37%), Positives = 23/43 (53%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
E KAK+ K K KK + KK+K K++ K+ L K+ A
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAA 258
Score = 28.5 bits (64), Expect = 5.8
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKK 52
KK+K K+K+K++ K + KK+K+K
Sbjct: 626 IKKRKKKRKRKRRFLTKIEGVKKEKRK 652
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 28.8 bits (65), Expect = 2.3
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 31 NKKKKKKKKKKKKKKKKKKKK 51
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.4 bits (64), Expect = 3.4
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 32 KKKKKKKKKKKKKKKKKKKKK 52
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.0 bits (63), Expect = 4.0
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKK 45
E +KK K K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKK 105
Score = 27.3 bits (61), Expect = 7.0
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKK 47
KK K K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKK 105
Score = 27.3 bits (61), Expect = 8.4
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 29 KKNKKKKKKKKKKKKKKKKKK 49
K K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 26.9 bits (60), Expect = 9.2
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 28 KKKNKKKKKKKKKKKKKKKKK 48
K K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|235134 PRK03584, PRK03584, acetoacetyl-CoA synthetase; Provisional.
Length = 655
Score = 29.4 bits (67), Expect = 2.4
Identities = 8/12 (66%), Positives = 12/12 (100%)
Query: 109 VLYTSGSTGIPK 120
+LY+SG+TG+PK
Sbjct: 268 ILYSSGTTGLPK 279
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 29.4 bits (66), Expect = 2.5
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
F+ + KK +K+K +K K KK K+
Sbjct: 342 DGAFDVDAYPKKAEAARKRKGDRKGVSHKAKKGGKKN 378
Score = 28.3 bits (63), Expect = 5.1
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
A KK + +K+K +K K KK ++ A+
Sbjct: 347 VDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
7 homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 28.4 bits (64), Expect = 2.5
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
K ++ K +K+ K +K+KKKKKKK
Sbjct: 58 KGRRPKTARKESVAAKAAEKEKKKKKKK 85
Score = 28.0 bits (63), Expect = 3.0
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++ + K +K+ K +K+KKKKKKK+
Sbjct: 57 KKGRRPKTARKESVAAKAAEKEKKKKKKKE 86
Score = 27.6 bits (62), Expect = 4.8
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
KK + K +K+ K +K+KKKK++
Sbjct: 57 KKGRRPKTARKESVAAKAAEKEKKKKKK 84
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 29.4 bits (66), Expect = 2.5
Identities = 8/42 (19%), Positives = 15/42 (35%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
++KA + K + K +K K A + + A
Sbjct: 216 KKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAA 257
Score = 28.6 bits (64), Expect = 4.5
Identities = 8/36 (22%), Positives = 12/36 (33%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
AK K + K +K K +K + E
Sbjct: 223 AKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
Score = 28.2 bits (63), Expect = 6.1
Identities = 7/37 (18%), Positives = 13/37 (35%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
++ A+ K K + K +K K + A
Sbjct: 217 KKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAA 253
Score = 27.8 bits (62), Expect = 7.2
Identities = 7/38 (18%), Positives = 11/38 (28%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
K K K K + K +K + K+
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKA 250
>gnl|CDD|218396 pfam05039, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP)is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 90
Score = 27.5 bits (61), Expect = 2.5
Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 6/56 (10%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSASLSH 85
NK KK K + ++KKK KK+ L ++ +GR +RL + L
Sbjct: 16 LNKGLKKFNKNEAEEKKKSSKKEAEALTEA------QGRPPPPRRCVRLHESCLGP 65
>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 29.4 bits (66), Expect = 2.5
Identities = 9/33 (27%), Positives = 18/33 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
REKA K NK+ + +++ ++K+K+
Sbjct: 705 AREKALKDANKRAPIVAATQAREEVREKRKQTS 737
>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462). This
protein is highly conserved, but its function is
unknown. It can be isolated from HeLa cell nucleoli and
is found to be homologous with Leydig cell tumour
protein whose function is unknown.
Length = 82
Score = 27.4 bits (61), Expect = 2.7
Identities = 14/35 (40%), Positives = 16/35 (45%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+ A K K K K KKK+K KK R I K
Sbjct: 3 QGALKVKAKGPAAVKAGKKKQKGPKKAGPRVIKPK 37
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 28.8 bits (65), Expect = 2.7
Identities = 23/39 (58%), Positives = 24/39 (61%), Gaps = 6/39 (15%)
Query: 22 EREKAK--KKKNKKKKKKKKKKKKK----KKKKKKKKRR 54
EREKAK KK KK KKKKKK+KK KK K K R
Sbjct: 146 EREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGR 184
>gnl|CDD|217356 pfam03078, ATHILA, ATHILA ORF-1 family. ATHILA is a group of
Arabidopsis thaliana retrotransposons belonging to the
Ty3/gypsy family of the long terminal repeat (LTR) class
of eukaryotic retrotransposons. The central region of
ATHILA retrotransposons contains two or three open
reading frames (ORFs). This family represents the ORF1
product. The function of ORF1 is unknown.
Length = 456
Score = 29.2 bits (65), Expect = 2.7
Identities = 16/52 (30%), Positives = 27/52 (51%)
Query: 8 SEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
S +HF++ + PA + ++ KN K +K KK+ K K KK + + K
Sbjct: 397 SMYHFEEHVPPARESKSLSEAHKNNSKLQKWCKKQDKLIAKCLKKIKFMKKK 448
>gnl|CDD|236363 PRK09029, PRK09029, O-succinylbenzoic acid--CoA ligase;
Provisional.
Length = 458
Score = 29.5 bits (67), Expect = 2.7
Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 21/75 (28%)
Query: 67 GRTVLSL-PQL--RLLSASLSHNNI------------PGESMLHADNNNEQSND------ 105
G VL L PQL LL L + + LH
Sbjct: 77 GARVLPLNPQLPQPLLEELLPSLTLDFALVLEGENTFSALTSLHLQLVEGAHAVAWQPQR 136
Query: 106 IAIVLYTSGSTGIPK 120
+A + TSGSTG+PK
Sbjct: 137 LATMTLTSGSTGLPK 151
>gnl|CDD|213308 cd05943, AACS, Acetoacetyl-CoA synthetase (acetoacetate-CoA ligase,
AACS). AACS is a cytosolic ligase that specifically
activates acetoacetate to its coenzyme A ester by a
two-step reaction. Acetoacetate first reacts with ATP to
form an acyl-adenylate intermediate, which then reacts
with CoA to produce an acyl-CoA ester. This is the first
step of the mevalonate pathway of isoprenoid
biosynthesis via isopentenyl diphosphate. Isoprenoids
are a large class of compounds found in all living
organisms. AACS is widely distributed in bacteria,
archaea and eukaryotes. In bacteria, AACS is known to
exhibit an important role in the metabolism of
poly-b-hydroxybutyrate, an intracellular reserve of
organic carbon and chemical energy by some
microorganisms. In mammals, AACS influences the rate of
ketone body utilization for the formation of
physiologically important fatty acids and cholesterol.
Length = 616
Score = 29.5 bits (67), Expect = 2.7
Identities = 8/12 (66%), Positives = 12/12 (100%)
Query: 109 VLYTSGSTGIPK 120
+LY+SG+TG+PK
Sbjct: 241 ILYSSGTTGLPK 252
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.5 bits (66), Expect = 2.7
Identities = 16/30 (53%), Positives = 19/30 (63%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E ++K+KKKKKK K KKK KK KR
Sbjct: 234 EDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263
Score = 27.6 bits (61), Expect = 8.9
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
E E+ K ++ KK KK KK K KK +
Sbjct: 322 ESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 29.3 bits (66), Expect = 2.8
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E+ +++ ++ K KKKK+KKK+ +K R ++
Sbjct: 413 EEIREELIEEGLLKSKKKKRKKKEWFEKFRWFVSSD 448
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 29.4 bits (66), Expect = 2.8
Identities = 10/39 (25%), Positives = 21/39 (53%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
E +K K+++ + K++K++K+KK KK+
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQ 158
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 29.4 bits (67), Expect = 3.0
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 32 KKKKKKKKKKKKKKKKKKKKK 52
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 29.0 bits (66), Expect = 3.7
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 27 KKKKNKKKKKKKKKKKKKKKK 47
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 29.0 bits (66), Expect = 4.1
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 30 KNKKKKKKKKKKKKKKKKKKK 50
+ ++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 28.7 bits (65), Expect = 4.3
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 34 KKKKKKKKKKKKKKKKKKKRRI 55
+K++ KKKKK+KK KK + +I
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754
Score = 28.3 bits (64), Expect = 5.6
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 29 KKNKKKKKKKKKKKKKKKKKK 49
+K + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 27.9 bits (63), Expect = 7.5
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 28 KKKNKKKKKKKKKKKKKKKKK 48
+K+ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 27.9 bits (63), Expect = 8.8
Identities = 10/21 (47%), Positives = 17/21 (80%)
Query: 33 KKKKKKKKKKKKKKKKKKKKR 53
+K++ KKKKK+KK KK + ++
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
>gnl|CDD|234298 TIGR03659, IsdE, heme ABC transporter, heme-binding protein isdE.
This family of ABC substrate-binding proteins is
observed primarily in close proximity with proteins
localized to the cell wall and bearing the NEAT (NEAr
Transporter, pfam05031) heme-binding domain. IsdE has
been shown to bind heme and is involved in the process
of scavenging heme for the purpose of obtaining iron.
Length = 289
Score = 29.2 bits (66), Expect = 3.0
Identities = 15/36 (41%), Positives = 22/36 (61%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
+ RE+ +K K+ +K+ + KKK K KKK K IL
Sbjct: 138 YGREEQAEKLVKEINEKEAEVKKKVKGKKKPKVLIL 173
>gnl|CDD|235146 PRK03640, PRK03640, O-succinylbenzoic acid--CoA ligase;
Provisional.
Length = 483
Score = 29.2 bits (66), Expect = 3.0
Identities = 9/17 (52%), Positives = 15/17 (88%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+++A ++YTSG+TG PK
Sbjct: 141 DEVATIMYTSGTTGKPK 157
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 28.7 bits (64), Expect = 3.1
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 26 AKKKKNKKKKKKKKKKKKKKK 46
+ ++NK+KKKKK KKKK K
Sbjct: 84 SPNQENKQKKKKKDKKKKSPK 104
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 29.4 bits (67), Expect = 3.1
Identities = 13/49 (26%), Positives = 22/49 (44%)
Query: 4 NLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
L I D FE +K ++ + K+ + KK + +KKK+ K
Sbjct: 158 ILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSK 206
Score = 28.3 bits (64), Expect = 7.2
Identities = 10/42 (23%), Positives = 19/42 (45%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
+ + AK+ KK + +KKK+ K K+ + K +
Sbjct: 182 EDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDK 223
Score = 28.3 bits (64), Expect = 7.4
Identities = 14/58 (24%), Positives = 27/58 (46%)
Query: 11 HFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
F+ E+ +A+K++ +K K+ + KK + +KKK+ E P G+
Sbjct: 162 DFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGK 219
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.9 bits (65), Expect = 3.1
Identities = 8/32 (25%), Positives = 29/32 (90%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E +K K+++ K++++++K++KK+++++++K++
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219
Score = 28.9 bits (65), Expect = 3.3
Identities = 9/31 (29%), Positives = 27/31 (87%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
KK K +++K++++++K++KK++++++R+ A
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEERKQKA 220
Score = 28.2 bits (63), Expect = 5.9
Identities = 8/34 (23%), Positives = 28/34 (82%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
E E K K+ ++K++++++K++KK+++++++++
Sbjct: 186 EWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219
Score = 27.4 bits (61), Expect = 8.4
Identities = 10/38 (26%), Positives = 28/38 (73%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
++ ++KK +K ++K+K++++K++++ + R+ LAK
Sbjct: 96 KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 28.5 bits (64), Expect = 3.2
Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 5/42 (11%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKK-----KKKKRRILAK 58
E+ K + KKK K +K K+ +KK K +L K
Sbjct: 157 EKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
Score = 27.7 bits (62), Expect = 5.6
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K ++K+ ++ KKK K +K K+ +++ LAK
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAK 187
>gnl|CDD|132252 TIGR03208, cyc_hxne_CoA_lg, cyclohexanecarboxylate-CoA ligase.
Members of this protein family are
cyclohexanecarboxylate-CoA ligase. This enzyme prepares
the aliphatic ring compound, cyclohexanecarboxylate, for
dehydrogenation and then degradation by a pathway also
used in benzoyl-CoA degradation in Rhodopseudomonas
palustris.
Length = 538
Score = 29.1 bits (65), Expect = 3.2
Identities = 9/17 (52%), Positives = 14/17 (82%)
Query: 104 NDIAIVLYTSGSTGIPK 120
+D+ ++YTSG+TG PK
Sbjct: 195 DDVTQLIYTSGTTGEPK 211
>gnl|CDD|240407 PTZ00412, PTZ00412, leucyl aminopeptidase; Provisional.
Length = 569
Score = 29.2 bits (65), Expect = 3.2
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
F K K+K KK + KK K +KK+ +RR
Sbjct: 534 FRHNKPPKRKGIKKAQNAKKDLKVVQKKRHGRRR 567
Score = 28.0 bits (62), Expect = 8.3
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
P +R+ KK +N KK K +KK+ +++K+
Sbjct: 539 PPKRKGIKKAQNAKKDLKVVQKKRHGRRRKQ 569
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal
RNA recognition motif (RRM), also termed RBD (RNA
binding domain) or RNP (ribonucleoprotein domain), and
a C-terminal RRP7 domain.
Length = 118
Score = 28.0 bits (63), Expect = 3.2
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+K K +++ + K K+K+KKKKKKK
Sbjct: 59 GGRKGKTAREEAVEAKAKEKEKKKKKKK 86
Score = 27.2 bits (61), Expect = 4.8
Identities = 16/41 (39%), Positives = 26/41 (63%), Gaps = 7/41 (17%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKK-------KKKKKKK 52
RE+A + K K+K+KKKKKKK+ + ++KKK++
Sbjct: 63 GKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103
Score = 26.8 bits (60), Expect = 7.1
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
R K K +++ + K K+K+KKKKKKK+
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKE 87
Score = 26.8 bits (60), Expect = 7.5
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ K ++ + K K+K+KKKKKKK+
Sbjct: 61 RKGKTAREEAVEAKAKEKEKKKKKKKEL 88
Score = 26.8 bits (60), Expect = 8.2
Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 6/39 (15%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKK------KKKKKKKK 50
A E +AK K+ +KKKKKKK+ + ++KKK++
Sbjct: 65 TAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.0 bits (66), Expect = 3.3
Identities = 15/31 (48%), Positives = 25/31 (80%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
E +A+K+ NK +KK+KKKKK+K+K+++ K
Sbjct: 608 ELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 28.6 bits (65), Expect = 4.5
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
KA + +K+ K +KK+KKKKK+K+
Sbjct: 605 KAHELIEARKRLNKANEKKEKKKKKQKE 632
Score = 28.3 bits (64), Expect = 7.0
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 28 KKKNKKKKKKKKKKKKKKKKKKKKKRR 54
+ +K +K KKKKKKK K K K R
Sbjct: 675 PLSDLEKIQKPKKKKKKKPKTVKPKPR 701
>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14
and HMG 17.
Length = 88
Score = 27.2 bits (60), Expect = 3.3
Identities = 21/52 (40%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
PAP + E K KK KKK + K KK K K+ K NPA G T
Sbjct: 27 PAPPKPE-PKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEENNPAENGET 77
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 29.0 bits (65), Expect = 3.3
Identities = 12/31 (38%), Positives = 17/31 (54%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
R KAKK+K KK+K+ K++ K K
Sbjct: 350 ARVKAKKEKLAKKRKQVSKEEAAAIKAAGKA 380
Score = 27.8 bits (62), Expect = 9.0
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
E + K KK K KK+K+ K++ K
Sbjct: 348 EAARVKAKKEKLAKKRKQVSKEEAAAIKA 376
>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
Length = 620
Score = 29.1 bits (66), Expect = 3.4
Identities = 11/32 (34%), Positives = 16/32 (50%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K KKK+ KK KK+KKK+++
Sbjct: 589 PPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
Score = 28.4 bits (64), Expect = 6.5
Identities = 12/39 (30%), Positives = 19/39 (48%)
Query: 10 HHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKK 48
+ +I P +++ KK N KK+KKK+KK
Sbjct: 582 NGPKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
Score = 28.0 bits (63), Expect = 8.3
Identities = 11/32 (34%), Positives = 15/32 (46%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
P ++ K K K KK+KKK+KK
Sbjct: 589 PPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
Length = 160
Score = 28.0 bits (63), Expect = 3.5
Identities = 6/26 (23%), Positives = 11/26 (42%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKK 50
+ N K + KKK + K++
Sbjct: 106 LKRVAANDAIKAEAKKKGELPSTKRQ 131
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 27.7 bits (62), Expect = 3.5
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
K ++KK K K KKKK K K +
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVK 33
Score = 26.6 bits (59), Expect = 7.2
Identities = 12/34 (35%), Positives = 15/34 (44%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
P +++K K K KKKK K K K K
Sbjct: 2 PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 29.3 bits (65), Expect = 3.6
Identities = 11/47 (23%), Positives = 28/47 (59%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
ER K +++K + + ++ K+++ KK+K++++RI +N +
Sbjct: 1024 ERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAK 1070
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 29.0 bits (65), Expect = 3.6
Identities = 13/49 (26%), Positives = 31/49 (63%), Gaps = 4/49 (8%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKK----KKKKKKKKKKKRRILAKSCE 61
+P E E+ ++KK +++K K++ +++K++KKKK +++ + E
Sbjct: 35 VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83
>gnl|CDD|213294 cd05928, MACS_euk, Eukaryotic Medium-chain acyl-CoA synthetase
(MACS or ACSM). MACS catalyzes the two-step activation
of medium chain fatty acids (containing 4-12 carbons).
The carboxylate substrate first reacts with ATP to form
an acyl-adenylate intermediate, which then reacts with
CoA to produce an acyl-CoA ester. The acyl-CoA is a key
intermediate in many important biosynthetic and
catabolic processes. MACS enzymes are localized to
mitochondria. Two murine MACS family proteins are found
in liver and kidney. In rodents, a MACS member is
detected particularly in the olfactory epithelium and is
called O-MACS. O-MACS demonstrates substrate preference
for the fatty acid lengths of C6-C12.
Length = 530
Score = 29.0 bits (65), Expect = 3.7
Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
Y TGD G + +D + GR D + G R+ EVE+AL
Sbjct: 404 YITGDRGIMDEDGYFWFVGRADDVINSSGYRIGPFEVESAL 444
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 28.5 bits (64), Expect = 3.7
Identities = 8/30 (26%), Positives = 19/30 (63%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
+ + ++KK K++KK K +K+K +++
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
Score = 27.3 bits (61), Expect = 8.6
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
F+ +K +KK K++KK K +K+K +++
Sbjct: 94 FKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
>gnl|CDD|241245 cd01209, PTB_Shc, Shc-like phosphotyrosine-binding (PTB) domain.
Shc is a substrate for receptor tyrosine kinases, which
can interact with phosphoproteins at NPXY motifs. Shc
contains an PTB domain followed by an SH2 domain. PTB
domains have a common PH-like fold and are found in
various eukaryotic signaling molecules. This domain was
initially shown to binds peptides with a NPXY motif
with differing requirements for phosphorylation of the
tyrosine, although more recent studies have found that
some types of PTB domains can bind to peptides lack
tyrosine residues altogether. In contrast to SH2
domains, which recognize phosphotyrosine and adjacent
carboxy-terminal residues, PTB-domain binding
specificity is conferred by residues amino-terminal to
the phosphotyrosine. PTB domains are classified into
three groups: phosphotyrosine-dependent Shc-like,
phosphotyrosine-dependent IRS-like, and
phosphotyrosine-independent Dab-like PTB domains. This
cd is part of the Shc-like subgroup.
Length = 170
Score = 28.3 bits (64), Expect = 3.7
Identities = 11/25 (44%), Positives = 12/25 (48%)
Query: 40 KKKKKKKKKKKKKRRILAKSCENPA 64
K KKK+K K IL KS A
Sbjct: 57 AKGAKKKRKSKALSSILGKSNLQFA 81
>gnl|CDD|213304 cd05938, hsFATP2a_ACSVL_like, Fatty acid transport proteins (FATP)
including hsFATP2, hsFATP5, and hsFATP6, and similar
proteins. Fatty acid transport proteins (FATP) of this
family transport long-chain or very-long-chain fatty
acids across the plasma membrane. At least five copies
of FATPs are identified in mammalian cells. This family
includes hsFATP2, hsFATP5, and hsFATP6, and similar
proteins. Each FATP has unique patterns of tissue
distribution. These FATPs also have fatty acid CoA
synthetase activity, thus playing dual roles as fatty
acid transporters and its activation enzymes. The hsFATP
proteins exist in two splice variants; the b variant,
lacking exon 3, has no acyl-CoA synthetase activity.
FATPs are key players in the trafficking of exogenous
fatty acids into the cell and in intracellular fatty
acid homeostasis.
Length = 535
Score = 28.9 bits (65), Expect = 3.7
Identities = 9/16 (56%), Positives = 13/16 (81%)
Query: 105 DIAIVLYTSGSTGIPK 120
A+ +YTSG+TG+PK
Sbjct: 143 STALYIYTSGTTGLPK 158
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 28.5 bits (64), Expect = 3.8
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
AK KK KK K KK KK + AK+
Sbjct: 257 AKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKT 290
Score = 28.5 bits (64), Expect = 4.3
Identities = 12/31 (38%), Positives = 13/31 (41%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
K KK K KK KK KK K K+
Sbjct: 264 KPAKKATAAKAKKTTAKKAAAKKAAKTKKAA 294
Score = 27.7 bits (62), Expect = 8.6
Identities = 13/30 (43%), Positives = 15/30 (50%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
++ AKK K KK KK KK K KK
Sbjct: 263 KKPAKKATAAKAKKTTAKKAAAKKAAKTKK 292
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 28.8 bits (65), Expect = 3.9
Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 14/76 (18%)
Query: 7 ISEHHFDKVIIPAPFER-EKAKKKK-------------NKKKKKKKKKKKKKKKKKKKKK 52
+ E +F++ + ER +KAK KK KK K K KKKKKK
Sbjct: 313 VKEKNFNEERVEKYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGAS 372
Query: 53 RRILAKSCENPAYKGR 68
+++ + G+
Sbjct: 373 AVPKSETSQEAKSSGK 388
>gnl|CDD|236097 PRK07788, PRK07788, acyl-CoA synthetase; Validated.
Length = 549
Score = 28.7 bits (65), Expect = 4.1
Identities = 9/13 (69%), Positives = 11/13 (84%)
Query: 108 IVLYTSGSTGIPK 120
IV+ TSG+TG PK
Sbjct: 211 IVILTSGTTGTPK 223
>gnl|CDD|213327 cd12119, ttLC_FACS_AlkK_like, Fatty acyl-CoA synthetases similar to
LC-FACS from Thermus thermophiles. This family includes
fatty acyl-CoA synthetases that can activate
medium-chain to long-chain fatty acids. They catalyze
the ATP-dependent acylation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. The fatty
acyl-CoA synthetases are responsible for fatty acid
degradation as well as physiological regulation of
cellular functions via the production of fatty acyl-CoA
esters. The fatty acyl-CoA synthetase from Thermus
thermophiles in this family was shown catalyzing the
long-chain fatty acid, myristoyl acid, while another
member in this family, the AlkK protein identified from
Pseudomonas oleovorans, targets medium chain fatty
acids. This family also includes uncharacterized FACS
proteins.
Length = 517
Score = 28.7 bits (65), Expect = 4.2
Identities = 10/17 (58%), Positives = 12/17 (70%)
Query: 104 NDIAIVLYTSGSTGIPK 120
N A + YTSG+TG PK
Sbjct: 162 NTAAGLCYTSGTTGNPK 178
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 28.2 bits (63), Expect = 4.3
Identities = 13/48 (27%), Positives = 19/48 (39%)
Query: 16 IIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENP 63
I P E K+ K K+ +K KK KKK K++ +
Sbjct: 38 IFPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPES 85
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 27.5 bits (61), Expect = 4.3
Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 10/49 (20%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRI----------LAKSCENPA 64
K KK ++ +KK +KK+KK R L SC+ P
Sbjct: 52 KKPKKISAEEAEKKLLQKKEKKALTNVLRPEPRSPRRCVRLRDSCKGPQ 100
Score = 27.5 bits (61), Expect = 5.0
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
K KK + ++ +KK +KK+KK R
Sbjct: 52 KKPKKISAEEAEKKLLQKKEKKALTNVLR 80
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is
a small protein module with a triple-stranded
beta-sheet fold. This is a family of WW domain binding
proteins.
Length = 78
Score = 26.9 bits (60), Expect = 4.5
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
K +KKK KK K +++ +++ K KK R+ +
Sbjct: 12 KEQKKKELKKNKAERQARREAKLAKKNPDRLQRE 45
>gnl|CDD|223952 COG1021, EntE, Peptide arylation enzymes [Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 542
Score = 28.6 bits (64), Expect = 4.5
Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 1/47 (2%)
Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAALSKLSEV 200
Y++GD R D L+ EGR Q+ G ++ EVE L + V
Sbjct: 416 YRSGDLVRRDPDGYLVVEGRVKDQINRGGEKIAAEEVENLLLRHPAV 462
>gnl|CDD|183506 PRK12406, PRK12406, long-chain-fatty-acid--CoA ligase; Provisional.
Length = 509
Score = 28.5 bits (64), Expect = 4.5
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 109 VLYTSGSTGIPKEYRR 124
++YTSG+TG PK RR
Sbjct: 157 MIYTSGTTGHPKGVRR 172
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 28.0 bits (63), Expect = 4.6
Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 19 APFEREKAKKKKNKKKKKKK--KKKKKKKKKKKKKKRRILAK 58
P+ +K++ KK KKK KK K+ K +KK+++ K
Sbjct: 124 KPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGK 165
>gnl|CDD|213321 cd05974, MACS_like_1, Uncharacterized subfamily of medium-chain
acyl-CoA synthetase (MACS). MACS catalyzes the two-step
activation of medium chain fatty acids (containing 4-12
carbons). The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. MACS
enzymes are localized to mitochondria.
Length = 433
Score = 28.5 bits (64), Expect = 4.7
Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 155 YKTGDYGRIVDD-ILLYEGRTDSQVKIRGNRVDLSEVEAAL 194
Y+TGD +D L + GR D K R+ EVE+AL
Sbjct: 310 YRTGDKAYRDEDGYLWFVGRADDVFKSSDYRISPFEVESAL 350
>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
This is a region of 120 amino acids that is conserved
in a family of proteins found from plants to fungi. The
function is not known.
Length = 114
Score = 27.2 bits (61), Expect = 4.8
Identities = 10/36 (27%), Positives = 21/36 (58%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
FER+KA ++ + +KK ++ +K+ K K++
Sbjct: 59 FERKKATRRLKQLEKKLEETSDEKEIKDLKEQLSKA 94
>gnl|CDD|237791 PRK14701, PRK14701, reverse gyrase; Provisional.
Length = 1638
Score = 28.7 bits (64), Expect = 4.8
Identities = 16/69 (23%), Positives = 30/69 (43%)
Query: 2 NINLTISEHHFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
NI+ ++ F + II ++ KK+ N + +K++ K+ +K K L +
Sbjct: 218 NIDRSLQLLGFYEEIIEKAWKIIYLKKQGNIEDAMEKREILNKEIEKIGNKIGCLIVASA 277
Query: 62 NPAYKGRTV 70
KG V
Sbjct: 278 TGKAKGDRV 286
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 25.9 bits (57), Expect = 4.9
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+AKK K KKK ++KKKK KKK+K ++
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKPKL 31
Score = 25.9 bits (57), Expect = 5.5
Identities = 15/30 (50%), Positives = 19/30 (63%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
E +K KK KK ++KKKK KKK+K K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 28.8 bits (65), Expect = 4.9
Identities = 11/43 (25%), Positives = 17/43 (39%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPA 64
EK KK+K K + + K K +K K + + A
Sbjct: 764 YTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAA 806
>gnl|CDD|227314 COG4981, COG4981, Enoyl reductase domain of yeast-type FAS1 [Lipid
metabolism].
Length = 717
Score = 28.7 bits (64), Expect = 5.1
Identities = 9/35 (25%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 189 EVEAALSKLSEV--DKLSVLCYKPGEVDQVSEVTS 221
+E A+ + E+ D + +KPG ++Q+ V
Sbjct: 135 SLEEAVELIEELGDDGFPYVAFKPGTIEQIRSVIR 169
Score = 28.7 bits (64), Expect = 5.1
Identities = 10/30 (33%), Positives = 17/30 (56%), Gaps = 2/30 (6%)
Query: 252 EVEAALSKISEV--DKLSVLCYKPGEVDQI 279
+E A+ I E+ D + +KPG ++QI
Sbjct: 135 SLEEAVELIEELGDDGFPYVAFKPGTIEQI 164
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.2 bits (63), Expect = 5.1
Identities = 10/34 (29%), Positives = 24/34 (70%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
RE K KK + ++K++++++K +++K ++ IL
Sbjct: 149 RELEKIKKERAEEKEREEEEKAAEEEKAREEEIL 182
>gnl|CDD|189016 cd09609, M3B_PepF_4, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF;
Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
includes oligoendopeptidase F from Lactococcus lactis.
This enzyme hydrolyzes peptides containing between 7 and
17 amino acids with fairly broad specificity. The PepF
gene is duplicated in L. lactis on the plasmid that
bears it, while a shortened second copy is found in
Bacillus subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus
lactis has been identified in Bacillus subtilis as an
inhibitor of sporulation initiation when over expressed
from a multicopy plasmid.
Length = 586
Score = 28.7 bits (65), Expect = 5.1
Identities = 26/117 (22%), Positives = 44/117 (37%), Gaps = 25/117 (21%)
Query: 151 YRRMYKTGDYGRIVDDILLYEG-RTDSQVKIRGNRVD--LSEVEAALSKL-SEVDKLSVL 206
+ + Y L TD + + R + D +++ A LS SE+ L
Sbjct: 61 LELISRLSHYAS-----LPQSTDGTDPENQARAAKFDSLAAKINAKLSFFESELLALDE- 114
Query: 207 CYKPGEVDQVSEVTSEEIRGVDDILLYEGRTDSQVKIRGNRVDLSEVEAALSKISEV 263
+++V E E ++D+L + T S EVE AL+ +S V
Sbjct: 115 ----KILEEVIEELPEYAAYLEDLLRKKPHTLS-----------PEVEEALAALSPV 156
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 27.6 bits (62), Expect = 5.3
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKK 44
PF +++ +KK+ K +K++ K KK
Sbjct: 140 PFAKKRREKKERVAKNEKRELKNKK 164
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 28.2 bits (63), Expect = 5.4
Identities = 10/39 (25%), Positives = 23/39 (58%)
Query: 19 APFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
E + + +++ K++ +KK+K KKK +K+K ++
Sbjct: 335 MGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVS 373
Score = 27.8 bits (62), Expect = 7.2
Identities = 11/40 (27%), Positives = 22/40 (55%)
Query: 12 FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKK 51
F + + E +++K++ +KK+K KKK +K+K
Sbjct: 331 FLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 28.5 bits (64), Expect = 5.5
Identities = 8/27 (29%), Positives = 14/27 (51%)
Query: 23 REKAKKKKNKKKKKKKKKKKKKKKKKK 49
RE+ K+ K++ KK + K K+
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEF 550
>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897). This
is a bacterial family of uncharacterized proteins.
Length = 55
Score = 25.7 bits (57), Expect = 5.5
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
AK K + K+KKKK K+ ++ K
Sbjct: 27 AKMKFPQFDKQKKKKAKRNAEQDAKNSA 54
>gnl|CDD|223747 COG0675, COG0675, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 364
Score = 28.1 bits (62), Expect = 5.6
Identities = 9/38 (23%), Positives = 22/38 (57%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
++ ++KK K++KK K K + +++ + +R+ K
Sbjct: 212 AQKRLSRKKSRSKRRKKAKLKLARLRERIRNRRKDFHK 249
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 27.8 bits (62), Expect = 5.7
Identities = 14/23 (60%), Positives = 15/23 (65%)
Query: 31 NKKKKKKKKKKKKKKKKKKKKKR 53
+ K K KK KK KKKKKKKR
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKR 216
Score = 27.8 bits (62), Expect = 5.9
Identities = 13/23 (56%), Positives = 14/23 (60%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKK 46
K K KK KK KKKKKKK+
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217
Score = 27.8 bits (62), Expect = 6.0
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 24 EKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
E K K NK K + K + K +K ++L K+
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQM 54
Score = 27.5 bits (61), Expect = 7.6
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 29 KKNKKKKKKKKKKKKKKKKKKK 50
K +K KK KK KKKKKKK+
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217
Score = 27.5 bits (61), Expect = 8.7
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKK 51
K K KK KK KKKKKKK+
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217
Score = 27.1 bits (60), Expect = 9.9
Identities = 12/36 (33%), Positives = 16/36 (44%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
E EK K K+K + K + K +K K I A
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKA 52
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 28.3 bits (63), Expect = 5.7
Identities = 15/49 (30%), Positives = 23/49 (46%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCE 61
K + A + KK N KKK K+KK+KK+ + + +S E
Sbjct: 198 GKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEE 246
>gnl|CDD|169098 PRK07786, PRK07786, long-chain-fatty-acid--CoA ligase; Validated.
Length = 542
Score = 28.2 bits (63), Expect = 5.7
Identities = 9/14 (64%), Positives = 13/14 (92%)
Query: 107 AIVLYTSGSTGIPK 120
A+++YTSG+TG PK
Sbjct: 177 ALIMYTSGTTGRPK 190
>gnl|CDD|193580 cd09891, NGN_Bact_1, Bacterial N-Utilization Substance G (NusG)
N-terminal (NGN) domain, subgroup 1. The N-Utilization
Substance G (NusG) protein is involved in transcription
elongation and termination in bacteria. NusG is
essential in Escherichia coli and associates with RNA
polymerase elongation and Rho-termination. Homologs of
the NusG gene exist in all bacteria. The NusG
N-terminal domain (NGN) is similar in all NusG
homologs, but its C-terminal domain and the linker that
separates these two domains are different. The domain
organization of NusG suggests that the common
properties of NusG and its homologs are due to their
similar NGN domains.
Length = 107
Score = 27.1 bits (61), Expect = 5.7
Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 4/32 (12%)
Query: 9 EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKK 40
E + +V++P E+ + KN KKK K++K
Sbjct: 29 EDYIGEVLVPT----EEVVEVKNGKKKVKERK 56
>gnl|CDD|236215 PRK08276, PRK08276, long-chain-fatty-acid--CoA ligase; Validated.
Length = 502
Score = 28.3 bits (64), Expect = 5.8
Identities = 9/15 (60%), Positives = 12/15 (80%)
Query: 110 LYTSGSTGIPKEYRR 124
LY+SG+TG PK +R
Sbjct: 146 LYSSGTTGRPKGIKR 160
>gnl|CDD|180289 PRK05851, PRK05851, long-chain-fatty-acid--[acyl-carrier-protein]
ligase; Validated.
Length = 525
Score = 28.2 bits (63), Expect = 6.0
Identities = 14/47 (29%), Positives = 22/47 (46%)
Query: 145 GDRAIKYRRMYKTGDYGRIVDDILLYEGRTDSQVKIRGNRVDLSEVE 191
G I + TGD G +VD L+ GR + + G + +E+E
Sbjct: 388 GQAPIDPDDWFPTGDLGYLVDGGLVVCGRAKELITVAGRNIFPTEIE 434
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 26.1 bits (58), Expect = 6.2
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
ERE +K K+K+KK +K+K K+ +K+++R
Sbjct: 34 EREFYEKPSEKRKRKKAAARKRKFKRLRKEQQRE 67
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 28.4 bits (63), Expect = 6.2
Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 20 PFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG-RTVLSLPQLRL 78
P + + + K+KK KKKK KKK+ K + + + + AY+ V+ +P+ +L
Sbjct: 112 PIWKNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSRVIPIPRNKL 171
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 27.8 bits (62), Expect = 6.3
Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 6/47 (12%)
Query: 35 KKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSA 81
+KK+K+KK++ K +KK+ +L K Y G L P LL
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEK------YGGEEHLDKPPKELLLG 200
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 27.7 bits (62), Expect = 6.3
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKK 49
ER + +K K + KKK K KK +
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRID 174
Score = 27.3 bits (61), Expect = 8.5
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 35 KKKKKKKKKKKKKKKKKKRRI 55
+KKK + KKK K KK RI
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRI 173
Score = 27.3 bits (61), Expect = 9.3
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRRI 55
+KKK + KKK K KK + +
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDLV 176
>gnl|CDD|131210 TIGR02155, PA_CoA_ligase, phenylacetate-CoA ligase.
Phenylacetate-CoA ligase (PA-CoA ligase) catalyzes the
first step in aromatic catabolism of phenylacetic acid
(PA) into phenylacetyl-CoA (PA-CoA). Often located in a
conserved gene cluster with enzymes involved in
phenylacetic acid activation (paaG/H/I/J),
phenylacetate-CoA ligase has been found among the
proteobacteria as well as in gram positive prokaryotes.
In the B-subclass proteobacterium Azoarcus evansii,
phenylacetate-CoA ligase has been shown to be induced
under aerobic and anaerobic growth conditions. It
remains unclear however, whether this induction is due
to the same enzyme or to another isoenzyme restricted to
specific anaerobic growth conditions [Energy metabolism,
Other].
Length = 422
Score = 28.2 bits (63), Expect = 6.3
Identities = 21/79 (26%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 186 DLSEVEAALSKLSEVDKLSVLCYKPGEVDQVSEVTSEEIRGVDDILLYEGRTDSQVKIRG 245
D E E + L++ + L V+ Y+ ++ ++ T+ +R +D I GR+D + IRG
Sbjct: 270 DGEEGELVFTTLTK-EALPVIRYRTRDLTRLLPGTARTMRRMDRI---TGRSDDMLIIRG 325
Query: 246 NRVDLSEVEAALSKISEVD 264
V +++E + K+ E+
Sbjct: 326 VNVFPTQLEEVILKMDELS 344
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 27.2 bits (61), Expect = 6.5
Identities = 11/37 (29%), Positives = 21/37 (56%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
E+ K +KK+ +KK + +K K ++K + + L K
Sbjct: 20 EKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEK 56
Score = 26.8 bits (60), Expect = 7.9
Identities = 8/37 (21%), Positives = 21/37 (56%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
+K ++KK++ +K K ++K + + +K + L +
Sbjct: 24 WEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 28.0 bits (63), Expect = 6.5
Identities = 8/23 (34%), Positives = 10/23 (43%)
Query: 33 KKKKKKKKKKKKKKKKKKKKRRI 55
K K K + K KK K +I
Sbjct: 189 KHKILIPSLKTRPKGKKYVKIKI 211
Score = 28.0 bits (63), Expect = 6.8
Identities = 7/24 (29%), Positives = 9/24 (37%)
Query: 30 KNKKKKKKKKKKKKKKKKKKKKKR 53
K K K + K KK K +
Sbjct: 187 LAKHKILIPSLKTRPKGKKYVKIK 210
>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
protein.
Length = 342
Score = 27.9 bits (62), Expect = 6.6
Identities = 8/41 (19%), Positives = 20/41 (48%)
Query: 17 IPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILA 57
+P P ++ AK K ++K +++++ + R+ A
Sbjct: 272 VPPPVPKKPAKGPIVKPAISREKSLDSSDRQRQEARSRLAA 312
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 27.6 bits (62), Expect = 6.7
Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 1/44 (2%)
Query: 15 VIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAK 58
I APF ++ K KKKK KK+++ KK KKK L
Sbjct: 103 QIDAAPF-KQWYAKHYGIDLGKKKKYKKEEENKKAKKKSESLVA 145
>gnl|CDD|236100 PRK07798, PRK07798, acyl-CoA synthetase; Validated.
Length = 533
Score = 27.9 bits (63), Expect = 6.9
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 101 EQSNDIAIVLYTSGSTGIPK 120
E+S D +LYT G+TG+PK
Sbjct: 160 ERSPDDLYLLYTGGTTGMPK 179
>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
Length = 1121
Score = 28.3 bits (64), Expect = 7.0
Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 9/48 (18%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLP 74
++ + KK+K+++++KKK K K K R I+A GR V S P
Sbjct: 272 EELISGKKEKEEEEEKKKIKPKDKYLRDIIA---------GRPVFSHP 310
>gnl|CDD|178049 PLN02430, PLN02430, long-chain-fatty-acid-CoA ligase.
Length = 660
Score = 28.2 bits (63), Expect = 7.1
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 96 ADNNNEQSNDIAIVLYTSGSTGIPK 120
++ N + DI ++YTSG++G PK
Sbjct: 212 SETNPPKPLDICTIMYTSGTSGDPK 236
>gnl|CDD|214377 CHL00152, rpl32, ribosomal protein L32; Validated.
Length = 53
Score = 25.4 bits (56), Expect = 7.3
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
KK+ +K KK+ +K K+K K K LAKS
Sbjct: 5 KKRTSKSKKRIRKNIWKRKAYKAALKAFSLAKS 37
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 26.2 bits (58), Expect = 7.4
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
P E + + KK KKK K + KK +K+ K KK
Sbjct: 18 TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 25.8 bits (57), Expect = 9.6
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRIL 56
E K +KKKNK + KK +K+ K KK R+L
Sbjct: 26 EELKKQKKKNKLRFLKKLEKRPKDVKKGPVTVRVL 60
Score = 25.8 bits (57), Expect = 9.7
Identities = 13/38 (34%), Positives = 18/38 (47%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKK 50
P E+ KK+K K K + KK +K+ K KK
Sbjct: 15 RPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 27.2 bits (61), Expect = 7.4
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 9/37 (24%)
Query: 27 KKKKNKKKKKKKKKKKKK---------KKKKKKKKRR 54
KKK + K+ K K KKK +++K+KKK +
Sbjct: 2 KKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHK 38
>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
I [DNA replication, recombination, and repair].
Length = 140
Score = 27.1 bits (60), Expect = 7.6
Identities = 14/39 (35%), Positives = 18/39 (46%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
EK K K+ K K +K +KK+KKK K C
Sbjct: 93 FTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPC 131
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 26.1 bits (58), Expect = 7.6
Identities = 15/45 (33%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRT 69
+ +KK + + KK+KK +K K+ K +RR++ + E+ Y G T
Sbjct: 21 RKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVRE--EDGGYGGET 63
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 26.8 bits (60), Expect = 7.6
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
E+ +KK K K+++ KKK++K K K KK+
Sbjct: 106 EKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 26.8 bits (60), Expect = 8.8
Identities = 13/44 (29%), Positives = 25/44 (56%)
Query: 11 HFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRR 54
F + + + + +KK KK K+++ KKK++K K K K++
Sbjct: 94 EFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|222965 PHA03057, PHA03057, Hypothetical protein; Provisional.
Length = 146
Score = 27.0 bits (59), Expect = 7.6
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 12 FDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKK 44
FD + IP PF+R KKN + + KK
Sbjct: 103 FDFMKIPNPFKRYYEYCKKNNNNPQPVSEPPKK 135
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 26.9 bits (60), Expect = 7.7
Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 7/41 (17%)
Query: 22 EREKAKKKKNKKKKKKKKKKKKKKKK-------KKKKKRRI 55
E K + +KK+K+KKKKKKK+ + ++KKK
Sbjct: 67 GEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQ 107
>gnl|CDD|219721 pfam08118, MDM31_MDM32, Yeast mitochondrial distribution and
morphology (MDM) proteins. Proteins in this family are
yeast mitochondrial inner membrane proteins MDM31 and
MDM32. These proteins are required for the maintenance
of mitochondrial morphology, and the stability of
mitochondrial DNA.
Length = 503
Score = 27.9 bits (62), Expect = 8.0
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 134 IVDDIHRCWSSGDRAIKYRRMYKTGDY 160
+VD H W + D A Y+ +++ GD+
Sbjct: 192 VVDRTHVHWKADDDARNYKNVHQPGDF 218
>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986). This
family of proteins has no known function.
Length = 44
Score = 24.9 bits (55), Expect = 8.1
Identities = 13/26 (50%), Positives = 14/26 (53%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKK 52
KKK N+K KKK KK K K K
Sbjct: 4 KKKINQKLKKKAKKANAKLHPSNKPK 29
Score = 24.9 bits (55), Expect = 8.6
Identities = 12/29 (41%), Positives = 15/29 (51%)
Query: 27 KKKKNKKKKKKKKKKKKKKKKKKKKKRRI 55
+KKK +K KKK KK K K + I
Sbjct: 3 RKKKINQKLKKKAKKANAKLHPSNKPKYI 31
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 25.6 bits (57), Expect = 8.1
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 32 KKKKKKKKKKKKKKKKKKKKKRR 54
K +K+K+KK +K+++KK R
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLAR 62
Score = 25.6 bits (57), Expect = 8.5
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 34 KKKKKKKKKKKKKKKKKKKRR 54
K+K+KK +K+++KK +
Sbjct: 44 KRKRKKAAARKRRRKKLAREE 64
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 27.5 bits (62), Expect = 8.2
Identities = 6/35 (17%), Positives = 21/35 (60%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSC 60
+KK +++++++++ +K + ++ R A+ C
Sbjct: 84 SKKGTEEERRERREENLEKALELLREGNRSEAREC 118
>gnl|CDD|215353 PLN02654, PLN02654, acetate-CoA ligase.
Length = 666
Score = 27.9 bits (62), Expect = 8.3
Identities = 14/29 (48%), Positives = 18/29 (62%), Gaps = 2/29 (6%)
Query: 103 SNDIAIVLYTSGSTGIPKEYRRMYKTGDY 131
+ D +LYTSGSTG PK ++ TG Y
Sbjct: 274 AEDPLFLLYTSGSTGKPKGV--LHTTGGY 300
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 27.7 bits (62), Expect = 8.3
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKR 53
F + A K +KKK +K + K K+K K
Sbjct: 188 FLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKD 220
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
Provisional.
Length = 191
Score = 27.5 bits (61), Expect = 8.3
Identities = 20/48 (41%), Positives = 25/48 (52%), Gaps = 5/48 (10%)
Query: 15 VIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN 62
V IP+ K KKK K +KKK KK KKKKK+K+ + E
Sbjct: 43 VEIPSK----KTSKKK-KTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85
>gnl|CDD|181546 PRK08751, PRK08751, putative long-chain fatty acyl CoA ligase;
Provisional.
Length = 560
Score = 27.9 bits (62), Expect = 8.4
Identities = 9/19 (47%), Positives = 14/19 (73%)
Query: 102 QSNDIAIVLYTSGSTGIPK 120
+ +DIA + YT G+TG+ K
Sbjct: 206 EPDDIAFLQYTGGTTGVAK 224
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 27.9 bits (62), Expect = 8.5
Identities = 21/96 (21%), Positives = 37/96 (38%), Gaps = 11/96 (11%)
Query: 18 PAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCEN-------PAYKGRTV 70
P R A + +++ ++KKK+ K K+R +K N RTV
Sbjct: 24 YLPSRRRLAVSCRARRRGFSSQQKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRTV 83
Query: 71 LSLPQLRLLSASLSHNNIPGESMLHADNNNEQSNDI 106
+ LPQ ++ S ++ SM + N+
Sbjct: 84 MELPQ----KSTSSDDDHNRASMQRDEAIAAIDNEQ 115
>gnl|CDD|239314 cd03016, PRX_1cys, Peroxiredoxin (PRX) family, 1-cys PRX subfamily;
composed of PRXs containing only one conserved cysteine,
which serves as the peroxidatic cysteine. They are
homodimeric thiol-specific antioxidant (TSA) proteins
that confer a protective role in cells by reducing and
detoxifying hydrogen peroxide, peroxynitrite, and
organic hydroperoxides. As with all other PRXs, a
cysteine sulfenic acid intermediate is formed upon
reaction of 1-cys PRX with its substrates. Having no
resolving cysteine, the oxidized enzyme is resolved by
an external small-molecule or protein reductant such as
thioredoxin or glutaredoxin. Similar to typical 2-cys
PRX, 1-cys PRX forms a functional dimeric unit with a
B-type interface, as well as a decameric structure which
is stabilized in the reduced form of the enzyme. Other
oligomeric forms, tetramers and hexamers, have also been
reported. Mammalian 1-cys PRX is localized cellularly in
the cytosol and is expressed at high levels in brain,
eye, testes and lung. The seed-specific plant 1-cys PRXs
protect tissues from reactive oxygen species during
desiccation and are also called rehydrins.
Length = 203
Score = 27.1 bits (61), Expect = 8.6
Identities = 10/26 (38%), Positives = 13/26 (50%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKK 38
D VI+P E+AKKK K +
Sbjct: 171 DDVIVPPSVSDEEAKKKFPKGYETVD 196
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 27.7 bits (61), Expect = 9.2
Identities = 14/54 (25%), Positives = 29/54 (53%), Gaps = 8/54 (14%)
Query: 22 EREKAKKKKNKKKKKKKKKK--------KKKKKKKKKKKRRILAKSCENPAYKG 67
+ A K++KK ++ ++K +++K +KK +KRR + K E+ +G
Sbjct: 101 NEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQG 154
>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
subunit.
Length = 1044
Score = 28.0 bits (62), Expect = 9.2
Identities = 13/43 (30%), Positives = 20/43 (46%)
Query: 25 KAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKG 67
+ K KK+KK KK ++ KK+KKK + + K
Sbjct: 608 RGKSKKSKKGKKGRRSLLGGLKKRKKKGGGGGSMMGKKYGRKK 650
>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1. This family consists
of several bacterial histone H1-like Hc1 proteins. In
Chlamydia, Hc1 is expressed in the late stages of the
life cycle, concomitant with the reorganisation of
chlamydial reticulate bodies into elementary bodies.
This suggests that Hc1 protein plays a role in the
condensation of chromatin during intracellular
differentiation.
Length = 123
Score = 26.6 bits (58), Expect = 9.2
Identities = 17/56 (30%), Positives = 24/56 (42%)
Query: 26 AKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGRTVLSLPQLRLLSA 81
+K KKK KK K K KK + K+ PA K + V + P + +A
Sbjct: 54 KAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAA 109
>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962). This
eukaryotic family of proteins has no known function.
Length = 155
Score = 26.8 bits (60), Expect = 9.3
Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 11/47 (23%)
Query: 23 REKAKKKKNK-----------KKKKKKKKKKKKKKKKKKKKRRILAK 58
+ AKKKK + + K +KKK ++ +K+ L +
Sbjct: 4 KNIAKKKKVLHPKSRKAKQLNRATLRDDKLEKKKAERAEKRSLELDR 50
>gnl|CDD|222331 pfam13707, RloB, RloB-like protein. This family includes the RloB
protein that is found within a bacterial restriction
modification operon. This family includes the AbiLii
protein that is found as part of a plasmid encoded phage
abortive infection mechanism. Deletion within abiLii
abolished the phage resistance. The family includes some
proteins annotated as CRISPR Csm2 proteins.
Length = 176
Score = 26.9 bits (60), Expect = 9.3
Identities = 12/42 (28%), Positives = 17/42 (40%), Gaps = 2/42 (4%)
Query: 11 HFDKVIIPAPFEREKAKKKKNKKKKKKKKKKKKKKKKKKKKK 52
HF+ A R+ KK KK KK K K+ ++
Sbjct: 102 HFEDT--TAALSRDAYAKKLLLKKGLPDYKKGKNLFKELLER 141
>gnl|CDD|180161 PRK05609, nusG, transcription antitermination protein NusG;
Validated.
Length = 181
Score = 27.1 bits (61), Expect = 9.4
Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 4/32 (12%)
Query: 9 EHHFDKVIIPAPFEREKAKKKKNKKKKKKKKK 40
E +V++P E+ + KN KKKK ++K
Sbjct: 35 EDLIGEVLVP----TEEVVEVKNGKKKKVERK 62
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 27.3 bits (61), Expect = 9.5
Identities = 10/39 (25%), Positives = 30/39 (76%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKS 59
E+A++++ ++K+++KK+ K++K+++K+ +I A++
Sbjct: 176 QRLEEAERRRREEKERRKKQDKERKQREKETAEKIAARA 214
>gnl|CDD|180666 PRK06710, PRK06710, long-chain-fatty-acid--CoA ligase; Validated.
Length = 563
Score = 27.7 bits (61), Expect = 9.6
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 86 NNIPGESMLHADNNNEQSNDIAIVLYTSGSTGIPK 120
N++ E + + ND+A++ YT G+TG PK
Sbjct: 188 NSVEKEVNTGVEVPCDPENDLALLQYTGGTTGFPK 222
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 27.7 bits (62), Expect = 9.6
Identities = 9/48 (18%), Positives = 22/48 (45%)
Query: 21 FEREKAKKKKNKKKKKKKKKKKKKKKKKKKKKRRILAKSCENPAYKGR 68
E K + KK ++K ++ + + +KK+++ +K + K
Sbjct: 97 IEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY 144
>gnl|CDD|146789 pfam04337, DUF480, Protein of unknown function, DUF480. This
family consists of several proteins of uncharacterized
function.
Length = 148
Score = 26.7 bits (60), Expect = 9.7
Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 9/66 (13%)
Query: 143 SSGDRAIKYR-RMYKTGDYGR----IVDDILLYEGRTDSQVKIRGNRV----DLSEVEAA 193
G R KY R T ++ +LL +T +++ R R+ D++EVEA
Sbjct: 68 GFGSRVPKYEHRFCNTLQLSPQELALLCLLLLRGPQTPGELRTRSERLHRFADVAEVEAV 127
Query: 194 LSKLSE 199
L +L+E
Sbjct: 128 LEELAE 133
>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
Length = 631
Score = 27.8 bits (63), Expect = 9.9
Identities = 9/30 (30%), Positives = 15/30 (50%), Gaps = 5/30 (16%)
Query: 13 DKVIIPAPFEREKAKKKKNKKKKKKKKKKK 42
+K I + +A+ + KK K+KKK
Sbjct: 374 EKAI-----KAAQARLRAAKKVKRKKKTSG 398
>gnl|CDD|181109 PRK07769, PRK07769, long-chain-fatty-acid--CoA ligase; Validated.
Length = 631
Score = 27.8 bits (62), Expect = 9.9
Identities = 10/16 (62%), Positives = 12/16 (75%)
Query: 104 NDIAIVLYTSGSTGIP 119
+ IA + YTSGST IP
Sbjct: 180 DTIAYLQYTSGSTRIP 195
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.131 0.360
Gapped
Lambda K H
0.267 0.0685 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,566,187
Number of extensions: 1520540
Number of successful extensions: 16024
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10795
Number of HSP's successfully gapped: 1700
Length of query: 307
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 210
Effective length of database: 6,635,264
Effective search space: 1393405440
Effective search space used: 1393405440
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (26.6 bits)