RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13461
(194 letters)
>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
This is a family of eukaryotic translation initiation
factors.
Length = 194
Score = 80.0 bits (198), Expect = 6e-19
Identities = 24/83 (28%), Positives = 43/83 (51%), Gaps = 1/83 (1%)
Query: 6 GTHIVLAELRDT-GSLEFVDTGDFSVMNAAEHHQATDVEWDPTGRYVMSGVSLWKTKADT 64
G ++LA + G +EF D + + AE ATD EW P GRY ++ + + + D
Sbjct: 112 GRLVLLAGFGNLAGQIEFWDVKNKKKIATAEASNATDCEWSPDGRYFLTATTSPRLRVDN 171
Query: 65 GYWQWSFQGKIIKRFNSPTFCQL 87
G+ W + GK++ +++ Q+
Sbjct: 172 GFKIWHYSGKLVYKYDFDELYQV 194
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 73.0 bits (179), Expect = 3e-15
Identities = 38/165 (23%), Positives = 68/165 (41%), Gaps = 5/165 (3%)
Query: 34 AEHHQATDVEWDPTGRYVMSGVSLWKTKADTGYWQWSFQGKIIKRFNSPTFCQLRWRPRP 93
A+ + T++ WDP+G+YV + S K K + GY + G + F WRP P
Sbjct: 397 AKVFELTNITWDPSGQYVTTSSSCPKHKVEHGYKIFKIAGALYPEEARGGFKNFAWRPEP 456
Query: 94 ASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETW 153
S L+ E K K + FE + A++ ++++L +Q+ E ++ +
Sbjct: 457 PSKLTIESAKKYVKPSRHRFVPFEKAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIR- 515
Query: 154 EAEKEERKY----LRNLVDTDELDSENVEEEVVEFVIKEEITICE 194
K+ R R + + + E E V+ E + E
Sbjct: 516 SLLKKLRAIEALKERMRSGEELEVIQVNKIETEEEVLSELKELVE 560
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 34.5 bits (80), Expect = 0.025
Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 4/67 (5%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
+E+++KIKK LKK + +E +EK + ++ E E+L E + KE
Sbjct: 347 EEELEKIKKLLKKLPKKARGQLPPEK----REQLEKLLETKEKLSEELEELEEELKELKE 402
Query: 159 ERKYLRN 165
E + L +
Sbjct: 403 ELESLYS 409
>gnl|CDD|200413 TIGR04162, exo_VPEID, exosortase E/protease, VPEID-CTERM system.
Members of this protein family are fusion proteins of
exosortase (N-terminal) and a CAAX prenyl protease
domain (C-terminal). Members are restricted to the alpha
Proteobacteria. The variant C-terminal protein sequence
VPEID-CTERM occurs only in these species, often
adjacent.
Length = 519
Score = 33.6 bits (77), Expect = 0.059
Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 6/53 (11%)
Query: 50 YVMSGV---SLWKTKADTGY-WQWSFQGKIIKRFNSPTFCQLRWRPRPASLLS 98
+++SGV SLW A GY + +F + F P L WRP PA LL+
Sbjct: 337 FMLSGVLVQSLWANPAQ-GYPLRAAFMAAGLWLF-WPALRALDWRPGPADLLT 387
>gnl|CDD|147573 pfam05461, ApoL, Apolipoprotein L. Apo L belongs to the high
density lipoprotein family that plays a central role in
cholesterol transport. The cholesterol content of
membranes is important in cellular processes such as
modulating gene transcription and signal transduction
both in the adult brain and during neurodevelopment.
There are six apo L genes located in close proximity to
each other on chromosome 22q12 in humans. 22q12 is a
confirmed high-susceptibility locus for schizophrenia
and close to the region associated with velocardiofacial
syndrome that includes symptoms of schizophrenia.
Length = 313
Score = 32.7 bits (75), Expect = 0.10
Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 94 ASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
+ LS+E+ D ++++LKK T +D+ R+ K ++ R + K+F L+ +L E
Sbjct: 23 EAELSREEADALREALKKLTTDLAIEDKDRLQKDQQD----RERFLKEFPRLKAELEE 76
>gnl|CDD|236686 PRK10416, PRK10416, signal recognition particle-docking protein
FtsY; Provisional.
Length = 318
Score = 32.4 bits (75), Expect = 0.11
Identities = 25/106 (23%), Positives = 41/106 (38%), Gaps = 30/106 (28%)
Query: 103 DKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQ-------FEELREKL------ 149
+KK K+ + + ++ ++K + E LF + EEL E L
Sbjct: 3 SWLKKKKKEKKEGWFERLKKGLSKTRENFGEGINGLFAKKKIDEDLLEELEELLIEADVG 62
Query: 150 RETWEAEK-----EERKYLRNLVDTDELDSENVEEEVVEFVIKEEI 190
ET E+ ER +NL D +EL ++KEE+
Sbjct: 63 VET--TEEIIEELRERVKRKNLKDPEELKE----------LLKEEL 96
>gnl|CDD|188955 cd09556, SAM_VTS1_fungal, SAM domain of VTS1 RNA-binding proteins.
SAM (sterile alpha motif) domain of VTS1 subfamily
proteins is RNA binding domain located in the C-terminal
region. SAM interacts with stem-loop structures of mRNA.
Proteins of this subfamily participate in regulation of
transcript stability and degradation, and also may be
involved in vacuolar protein transport regulation. VTS1
protein of S.cerevisiae induces mRNA degradation via the
major deadenylation-dependent mRNA decay pathway; VTS1
recruits CCR4/POP2/NOT deadenylase complex to target
mRNA. The recruitment is the initial step resulting in
poly(A) tail removal transcripts. Potentially SAM domain
may be responsible not only for RNA binding but also for
deadenylase binding.
Length = 69
Score = 30.3 bits (69), Expect = 0.12
Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 8/49 (16%)
Query: 109 LKKYTPAFEAKDRQRMNKASKELIEK--------RRKLFKQFEELREKL 149
L KYT + + + + E +E RRKL K FE +RE
Sbjct: 18 LHKYTDNLKDLSWKELIELDDEDLEDKGVSALGARRKLLKAFEIVREYK 66
>gnl|CDD|221069 pfam11304, DUF3106, Protein of unknown function (DUF3106). Some
members in this family of proteins are annotated as
transmembrane proteins however this cannot be confirmed.
Currently no function is known.
Length = 107
Score = 30.8 bits (70), Expect = 0.17
Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 11/72 (15%)
Query: 104 KIKKSLKKYTPAFEAKDRQRMNKASKELIEKR---RKLFKQFEEL----REKLRETWEAE 156
+I + K +P + + ++RM + +K E+R R +++F +L +E LR WEA
Sbjct: 33 RIAERYPKMSPEEQKRLQERMTRWAKLSPEQREQARLNYQRFRQLPPEQKEALRARWEAY 92
Query: 157 K----EERKYLR 164
+ EER+ L
Sbjct: 93 QALSPEERQALA 104
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 30.7 bits (70), Expect = 0.21
Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 121 RQRMNKASKELIEKRRKLFKQFEELREKLRETWEAE--KEERKYL 163
+Q + + L+E RR+L K E+L+EK ++ + AE K E+K +
Sbjct: 88 QQEVEAKRERLLEARREL-KALEKLKEKKQKEYRAEEAKREQKEM 131
>gnl|CDD|223771 COG0699, COG0699, Predicted GTPases (dynamin-related) [General
function prediction only].
Length = 546
Score = 31.0 bits (70), Expect = 0.38
Identities = 13/65 (20%), Positives = 25/65 (38%), Gaps = 3/65 (4%)
Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEE 159
E + + + ++ + + K L +K+R E+L ++L T E E
Sbjct: 470 ESLLILAQKIRDS---VLKAIFELLKNKRKRLAQKQRLKRLYLEQLEDELLRTAEEILEL 526
Query: 160 RKYLR 164
R L
Sbjct: 527 RLLLE 531
>gnl|CDD|221326 pfam11935, DUF3453, Domain of unknown function (DUF3453). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 239 to 261 amino acids in length.
Length = 233
Score = 30.3 bits (69), Expect = 0.42
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 9/72 (12%)
Query: 87 LRWRPRPASLLSKEQVDKIKKSLK-------KYTPAFEAKDRQRMNKASKELIEKRRKLF 139
L + P LSK QV ++KSLK K+ + + R+ +A ++L + R ++
Sbjct: 159 LNFNPNLPPTLSKLQVRSVEKSLKNFLLHLLKHQLSPPF--QGRIQQALEDLGQSRAEIA 216
Query: 140 KQFEELREKLRE 151
+ + + L +
Sbjct: 217 DEAPKEKRILPK 228
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 30.8 bits (70), Expect = 0.42
Identities = 12/61 (19%), Positives = 32/61 (52%), Gaps = 1/61 (1%)
Query: 97 LSKEQVDKIKKSLK-KYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEA 155
L E+ ++++++LK K + + + + + ++L +FE +E+LR+ +E
Sbjct: 182 LKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEE 241
Query: 156 E 156
+
Sbjct: 242 K 242
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 30.4 bits (69), Expect = 0.44
Identities = 13/56 (23%), Positives = 21/56 (37%), Gaps = 1/56 (1%)
Query: 86 QLRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQ 141
L R S S + + L+K + R ++N L EKR L ++
Sbjct: 100 ALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRT-RSKLNALHSLLAEKRSFLCRE 154
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 30.1 bits (68), Expect = 0.47
Identities = 18/74 (24%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 121 RQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL--DSENVE 178
R++ +A +E E+R+KL ++ E R++ E E +++++ ++ + E E
Sbjct: 17 RRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYE 76
Query: 179 EEVVEFVIKEEITI 192
+ FV++EE T
Sbjct: 77 KLKSSFVVEEEGTD 90
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 30.3 bits (69), Expect = 0.59
Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 92 RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKE---LIEKRRKLFKQFEELREK 148
+ A L KE + + K+ + K E + ++R N+ + L+++ L ++ E L +K
Sbjct: 43 KEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKK 102
Query: 149 LRETWEAEKEERKYLRNLVDTDELDSENVEEEVVE 183
+ EKE +NL DE + E +EE + E
Sbjct: 103 EENLEKKEKELSNKEKNL---DEKEEE-LEELIAE 133
>gnl|CDD|232986 TIGR00464, gltX_bact, glutamyl-tRNA synthetase, bacterial family.
The glutamyl-tRNA synthetases of the eukaryotic cytosol
and of the Archaea are more similar to glutaminyl-tRNA
synthetases than to bacterial glutamyl-tRNA synthetases.
This model models just the bacterial and mitochondrial
forms of the enzyme. In many species, the charging of
tRNA(gln) proceeds first through misacylation with Glu
and then transamidation. For this reason, glutamyl-tRNA
synthetases may act on both tRNA(gln) and tRNA(glu).
This model is highly specific. Proteins with positive
scores below the trusted cutoff may be fragments rather
than full-length sequences [Protein synthesis, tRNA
aminoacylation].
Length = 470
Score = 30.4 bits (69), Expect = 0.63
Identities = 20/78 (25%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 113 TPAFEAKDRQRMNKASKELIEKRRKLFKQF---EELREKLRETWEAEKEERKYLRNLVDT 169
P ++++ K +KEL+E+ ++ + E L E+LRE +A KE +Y R+ +
Sbjct: 70 GPYYQSQRLDIYKKYAKELLEEG-LAYRCYCSKERL-ERLREEQKANKETPRYDRHCRNL 127
Query: 170 DELDSENVEEEVVEFVIK 187
+ EN + + V++
Sbjct: 128 HNEEIENALAKGIPPVVR 145
>gnl|CDD|188234 TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype
I-B/HMARI. This domain is found in the C-terminal 2/3
of a family of CRISPR associated proteins of the Hmari
subtype. Except for the two sequences from halophilic
archaea this domain contains a pair of CXXC motifs
[Mobile and extrachromosomal element functions, Other].
Length = 393
Score = 30.1 bits (68), Expect = 0.67
Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 6/79 (7%)
Query: 86 QLRWRPRPASLLSKEQVDKI-KKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEE 144
Q R + L++K + KI KK+++ K E K + E
Sbjct: 313 QKSKRNKSKPLVNKLKGLKITKKTIENLLNEVREK-----LTQYGRQSEGGSKRYTSTSE 367
Query: 145 LREKLRETWEAEKEERKYL 163
L ++L E+W+ ++E +
Sbjct: 368 LLDRLIESWKLSEDENVFY 386
>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
[Transcription].
Length = 217
Score = 29.7 bits (67), Expect = 0.68
Identities = 23/112 (20%), Positives = 42/112 (37%)
Query: 81 SPTFCQLRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFK 140
SP F L E+ + K + +N+ +EL + +L K
Sbjct: 87 SPNFFGAERFDLEEDDLESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQTELQK 146
Query: 141 QFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIKEEITI 192
Q EL ++++E E + + L + + +VEE V+K E +
Sbjct: 147 QLNELMDRIKEEIEDKDGDMTERIVLEYLLKNGAADVEETSRRTVLKIEEVL 198
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 30.1 bits (68), Expect = 0.78
Identities = 18/72 (25%), Positives = 28/72 (38%)
Query: 120 DRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEE 179
D++ + ELI +R + E E+L EE LR L E + E+ +E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436
Query: 180 EVVEFVIKEEIT 191
+EE
Sbjct: 437 LEEAQPEEEEEA 448
>gnl|CDD|216996 pfam02364, Glucan_synthase, 1,3-beta-glucan synthase component.
This family consists of various 1,3-beta-glucan synthase
components including Gls1, Gls2 and Gls3 from yeast.
1,3-beta-glucan synthase EC:2.4.1.34 also known as
callose synthase catalyzes the formation of a
beta-1,3-glucan polymer that is a major component of the
fungal cell wall. The reaction catalyzed is:-
UDP-glucose + {(1,3)-beta-D-glucosyl}(N) <=> UDP +
{(1,3)-beta-D-glucosyl}(N+1).
Length = 818
Score = 30.1 bits (68), Expect = 0.84
Identities = 22/68 (32%), Positives = 28/68 (41%), Gaps = 3/68 (4%)
Query: 122 QRMNKASKELIEKRRKLFKQFEELREK--LRETWEAEKEERKYLRNLVDTDELDSENVEE 179
QR K KE E L + + +L+ E EAE E Y L+D EN
Sbjct: 199 QRYAKFKKEEAENAEFLLRAYPDLQIAYLDEEPPEAEGGEPDYYSALIDGHCELDEN-GR 257
Query: 180 EVVEFVIK 187
V +F IK
Sbjct: 258 RVPKFRIK 265
>gnl|CDD|223137 COG0059, IlvC, Ketol-acid reductoisomerase [Amino acid transport
and metabolism / Coenzyme metabolism].
Length = 338
Score = 29.9 bits (68), Expect = 0.86
Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 16/71 (22%)
Query: 116 FEAKDRQRMNKA---------SKELIEKRRKLFKQFEELREKLRET--WEAEKEERK--- 161
+A+ ++ M K +KE I + + + E LRE+ E + KE RK
Sbjct: 268 IDAETKEEMKKVLKDIQSGEFAKEWILENQAGRPKLEALREETAEHEIEKVGKELRKMMP 327
Query: 162 --YLRNLVDTD 170
VD
Sbjct: 328 WEKKDKGVDMA 338
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 29.9 bits (68), Expect = 0.90
Identities = 18/65 (27%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 99 KEQVDKIKKSLKKYTPAFE--AKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAE 156
K+Q++ + + E ++ + + EL EK+++L Q E+L+EK ET +
Sbjct: 155 KQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQER 214
Query: 157 KEERK 161
K++RK
Sbjct: 215 KQKRK 219
>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435). This
family of proteins are functionally uncharacterized.
This protein is found in eukaryotes. Proteins in this
family are typically between 435 to 791 amino acids in
length. This family is related to pfam00589 suggesting
it may be an integrase enzyme.
Length = 418
Score = 29.6 bits (67), Expect = 0.96
Identities = 16/64 (25%), Positives = 34/64 (53%), Gaps = 7/64 (10%)
Query: 122 QRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE----ERKYLRNLVDTDELDSENV 177
+R+ K +E+ +R++L + EL++K+RE ++ E+ ER+ V + +
Sbjct: 321 ERLEKRRREVRNERQRLRR---ELKKKIREEFDEEQPVIDIERQLSGLAVSEEVKEDLPD 377
Query: 178 EEEV 181
E+ V
Sbjct: 378 EQLV 381
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 30.0 bits (68), Expect = 0.99
Identities = 11/71 (15%), Positives = 33/71 (46%), Gaps = 5/71 (7%)
Query: 122 QRMNKASKELIEKRRKLFKQ--FEELREKLRETWEA-EKEERKYLRNLVDTDELDSENVE 178
+++ K +EL++KR E++ + ++ E ++ + + + T L +V
Sbjct: 463 EKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPT--LKLGDVP 520
Query: 179 EEVVEFVIKEE 189
+ + + ++ E
Sbjct: 521 DPIEKTSLETE 531
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 29.7 bits (68), Expect = 1.0
Identities = 16/59 (27%), Positives = 31/59 (52%), Gaps = 13/59 (22%)
Query: 102 VDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEER 160
V+++KK + AFE S+E ++ ++ ++F+E RE+ E E E +E+
Sbjct: 97 VEELKKEIPA---AFE----------SEEYEARKEEIEEEFQEKREEAFEELEEEAKEK 142
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.7 bits (66), Expect = 1.1
Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 6/103 (5%)
Query: 92 RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
+ A K + D+ K + A EAK + + K E +K +L K EE + K E
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Query: 152 TWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIKEEITICE 194
+ +E++K +E + E++ + + KEE E
Sbjct: 1735 AKKEAEEDKK------KAEEAKKDEEEKKKIAHLKKEEEKKAE 1771
Score = 29.7 bits (66), Expect = 1.3
Identities = 22/68 (32%), Positives = 35/68 (51%)
Query: 94 ASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETW 153
A L K + +K K K A E K + + KA +E K + K+ EE ++K E
Sbjct: 1622 AEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAK 1681
Query: 154 EAEKEERK 161
+AE++E+K
Sbjct: 1682 KAEEDEKK 1689
Score = 28.2 bits (62), Expect = 3.8
Identities = 16/64 (25%), Positives = 35/64 (54%)
Query: 98 SKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEK 157
+K++ D KK ++ A EAK + +K + ++K K+ +E ++K E +A++
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435
Query: 158 EERK 161
++K
Sbjct: 1436 AKKK 1439
>gnl|CDD|181215 PRK08059, PRK08059, general stress protein 13; Validated.
Length = 123
Score = 28.5 bits (64), Expect = 1.1
Identities = 15/60 (25%), Positives = 23/60 (38%), Gaps = 7/60 (11%)
Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEE 159
E+ KI S++ A EAK ++ + F LR+KL E E +
Sbjct: 67 EEKGKISLSIRATEEAPEAKRKKGKILIPNPSEQG-------FNTLRDKLEEWIEQSNKT 119
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 29.5 bits (66), Expect = 1.1
Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 10/73 (13%)
Query: 114 PAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKL--RETWEAE----KEERKYLRNLV 167
P F+ + +M K SKE +RK + + + R KL ++ W E K +K+ + +
Sbjct: 411 PPFKPLTKAQMAKLSKE----QRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIG 466
Query: 168 DTDELDSENVEEE 180
+ + SE V+EE
Sbjct: 467 ELPDGYSEEVDEE 479
>gnl|CDD|234904 PRK01117, PRK01117, adenylosuccinate synthetase; Provisional.
Length = 430
Score = 29.2 bits (67), Expect = 1.4
Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 3/42 (7%)
Query: 143 EELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEF 184
E EKL E E K L + +D E + +E +E+
Sbjct: 155 ETFAEKLEENLE-YKNF--LLTKYYGAEAVDFEEILDEYLEY 193
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 29.2 bits (66), Expect = 1.5
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 127 ASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVE 183
ASKE +E K + EE L E W+AEK + ++ + +E++ +E E E
Sbjct: 431 ASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQI--KEEIEQVRLELEQAE 485
>gnl|CDD|183017 PRK11177, PRK11177, phosphoenolpyruvate-protein phosphotransferase;
Provisional.
Length = 575
Score = 29.2 bits (66), Expect = 1.5
Identities = 25/98 (25%), Positives = 44/98 (44%), Gaps = 17/98 (17%)
Query: 96 LLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEA 155
LL KE D+I + KK + ++ +R + R K Q E ++ K ET+
Sbjct: 17 LLLKE--DEIVINRKKISADQVDQEVER-------FLSGRAKASAQLEAIKTKAGETFGE 67
Query: 156 EKEE--RKYLRNLVDTDELDSENVEEEVVEFVIKEEIT 191
EKE ++ L D E +E+E++ + + +T
Sbjct: 68 EKEAIFEGHIMLLED------EELEQEIIALIKDKHMT 99
>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
Length = 2316
Score = 29.3 bits (66), Expect = 1.6
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 18/91 (19%)
Query: 107 KSLKKYTPAFEAKDRQRMNKASKELIEKR------------RKLFKQFEELREKLRE--- 151
K K TPAF+ + R + +A++ L +R + Q + LRE + E
Sbjct: 1571 KPPKTATPAFDEQGRYQAGRAAEILSAQRGQELVGRASTLQFRFLAQPDALREAMAERAM 1630
Query: 152 --TWEA-EKEERKYLRNLVDTDELDSENVEE 179
T+E E+E R+ LR L++ + + ++E
Sbjct: 1631 FGTYEGTERELRRGLRRLLEGEAEAPQALQE 1661
>gnl|CDD|224062 COG1139, COG1139, Uncharacterized conserved protein containing a
ferredoxin-like domain [Energy production and
conversion].
Length = 459
Score = 28.8 bits (65), Expect = 1.6
Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 9/62 (14%)
Query: 121 RQRMNKASKELIEKRRKL---FKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENV 177
R+ + KA L R K+ +EELR+ RE + L NL + E ENV
Sbjct: 21 REALRKAQDTLRANREKVLDELPDWEELRDLAREI------KLHVLENLDEYLEQLEENV 74
Query: 178 EE 179
Sbjct: 75 TR 76
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.0 bits (66), Expect = 1.7
Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 12/104 (11%)
Query: 98 SKEQVDKIKKSLKKYTPAFEAKDR--QRMNKASKELIEKRRKLFKQFEELREK---LRET 152
+K++ + IKK EAK+ + N+ KEL E+R +L K + L +K L
Sbjct: 47 AKKEAEAIKKEAL-----LEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRK 101
Query: 153 WEAEKEERKYLRNLVDTDELDSENVE--EEVVEFVIKEEITICE 194
E ++ + L E + +E EE +E +I+E++ E
Sbjct: 102 LELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELE 145
>gnl|CDD|179730 PRK04051, rps4p, 30S ribosomal protein S4P; Validated.
Length = 177
Score = 28.3 bits (64), Expect = 1.8
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 2/57 (3%)
Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFV 185
KE I + R+L ++ LR K RE W+A+ RKY R L E +E + +
Sbjct: 20 KERIAEERELLGKYG-LRNK-RELWKAQSILRKYRRQARSLLALPPEERAKEEEQLL 74
>gnl|CDD|215355 PLN02658, PLN02658, homogentisate 1,2-dioxygenase.
Length = 435
Score = 28.9 bits (65), Expect = 1.9
Identities = 10/26 (38%), Positives = 13/26 (50%), Gaps = 5/26 (19%)
Query: 73 GKIIKRFNSPTFC-----QLRWRPRP 93
K++ F+ C QLRWRP P
Sbjct: 74 EKLVGEFDPSNSCETTPTQLRWRPFP 99
>gnl|CDD|99866 cd06113, citrate_synt_like_1_2, Citrate synthase (CS) catalyzes the
condensation of acetyl coenzyme A (AcCoA) and
oxalacetate (OAA) to form citrate and coenzyme A (CoA),
the first step in the oxidative citric acid cycle (TCA
or Krebs cycle). Peroxisomal CS is involved in the
glyoxylate cycle. This group also includes CS proteins
which functions as a 2-methylcitrate synthase (2MCS).
2MCS catalyzes the condensation of propionyl-CoA (PrCoA)
and OAA to form 2-methylcitrate and CoA during
propionate metabolism. This group contains proteins
which functions exclusively as either a CS or a 2MCS, as
well as those with relaxed specificity which have dual
functions as both a CS and a 2MCS. The overall CS
reaction is thought to proceed through three partial
reactions and involves both closed and open
conformational forms of the enzyme: a) a carbanion or
equivalent is generated from AcCoA by base abstraction
of a proton, b) nucleophilic attack of this carbanion on
OAA to generate citryl-CoA, and c) hydrolysis of
citryl-CoA to produce citrate and CoA. CSs are found in
two structural types: type I (homodimeric) and type II
CSs (homohexameric). Type II CSs are unique to
gram-negative bacteria. Type I CSs are found in eukarya,
gram-positive bacteria, archaea, and in some
gram-negative bacteria. Type I CS is active as a
homodimer, both subunits participating in the active
site. Type II CS is a hexamer of identical subunits
(approximated as a trimer of dimers). Some type II CSs
are strongly and specifically inhibited by NADH through
an allosteric mechanism. This subgroup includes both
gram-positive and gram-negative bacteria.
Length = 406
Score = 28.8 bits (65), Expect = 2.1
Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 137 KLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELD 173
K+ + E+++E +++ W E E R YLR +++ + D
Sbjct: 250 KVMEMLEDIKENVKD-WTDEDEVRAYLRKILNKEAFD 285
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 28.7 bits (65), Expect = 2.1
Identities = 14/70 (20%), Positives = 25/70 (35%), Gaps = 10/70 (14%)
Query: 126 KASKELIEKRRKLFKQFEELREKLRETWEAEKE----------ERKYLRNLVDTDELDSE 175
+ +E +K + K E+ KL E E E+ +K V+ +E +
Sbjct: 463 RKEEEAKKKAKTFEKYIPEIARKLAEILEKEEPDIEEVLAKLIGKKLEEEEVEEEEEEEA 522
Query: 176 NVEEEVVEFV 185
V E +
Sbjct: 523 VVVESAKNYT 532
>gnl|CDD|227000 COG4653, COG4653, Predicted phage phi-C31 gp36 major capsid-like
protein [General function prediction only].
Length = 422
Score = 28.6 bits (64), Expect = 2.3
Identities = 8/68 (11%), Positives = 16/68 (23%)
Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEE 159
+++D I +L K T E ++ + A
Sbjct: 57 DKLDAIPGNLDKDTAFLALIACVAARLKLGGAAETADPACREGPPAFNSYIQRGLAATGA 116
Query: 160 RKYLRNLV 167
L+
Sbjct: 117 LLELKAFS 124
Score = 27.9 bits (62), Expect = 4.0
Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 3/37 (8%)
Query: 143 EELREKLRETWE-AEKEERKYLRNL--VDTDELDSEN 176
+R + R E AE+E R+ L ++ V D+LD+
Sbjct: 27 TAIRAEQRRRGEEAEEENRRLLADIERVGGDKLDAIP 63
>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element.
Length = 370
Score = 28.5 bits (63), Expect = 2.3
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 8/78 (10%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
+E ++ K ++ F + + + K + IE K K+ EL++K RE KE
Sbjct: 61 REDIETKGKEVQN----FLKELEECITKQVEAHIENTEKCLKELMELKKKAREL----KE 112
Query: 159 ERKYLRNLVDTDELDSEN 176
E + ++ LD EN
Sbjct: 113 ECRSIKKEHMETTLDIEN 130
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 28.5 bits (64), Expect = 2.4
Identities = 17/81 (20%), Positives = 40/81 (49%), Gaps = 7/81 (8%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
+++++++ +++ E K R ++ + EL E+ L + EE+ ++ ET + K+
Sbjct: 335 LAEIEELEREIEE-----ERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKD 389
Query: 159 ERKYLRNLVDTDELDSENVEE 179
R+ L L E++ E
Sbjct: 390 YREKLEKL--KREINELKREL 408
Score = 27.3 bits (61), Expect = 6.7
Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 5/88 (5%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQ--RMNKASKELIEKRRKLFKQFEELREKLRET---W 153
+ ++ +I+ L + + R+ + K ++L ++ KL ++ EEL E L
Sbjct: 694 QSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEI 753
Query: 154 EAEKEERKYLRNLVDTDELDSENVEEEV 181
E K E K L ++ E D +EE +
Sbjct: 754 ENVKSELKELEARIEELEEDLHKLEEAL 781
>gnl|CDD|184430 PRK13975, PRK13975, thymidylate kinase; Provisional.
Length = 196
Score = 28.2 bits (63), Expect = 2.5
Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 4/71 (5%)
Query: 122 QRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEV 181
+RM KE+ EK+ L ++++EK E EK KY ++DT E V E+
Sbjct: 129 KRMETRDKEIFEKKEFL----KKVQEKYLELANNEKFMPKYGFIVIDTTNKSIEEVFNEI 184
Query: 182 VEFVIKEEITI 192
+ + + I
Sbjct: 185 LNKIKDKIPDI 195
>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein. This
is the P3 protein section of the Potyviridae
polyproteins. The function is not known except that the
protein is essential to viral survival.
Length = 445
Score = 28.3 bits (64), Expect = 2.6
Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 4/56 (7%)
Query: 124 MNKASKELIEK--RRKLFKQFEEL--REKLRETWEAEKEERKYLRNLVDTDELDSE 175
+N EL+EK L + + EL EKL W + K ++ + L D +
Sbjct: 135 LNDELYELMEKIYLDLLEESWRELSLLEKLSAIWRSRKFRKRSEKCLKPKAAEDLK 190
>gnl|CDD|227899 COG5612, COG5612, Predicted integral membrane protein [Function
unknown].
Length = 148
Score = 27.6 bits (61), Expect = 2.6
Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 5/70 (7%)
Query: 109 LKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQ-FEELREKLRETWEAEKEERKYLRNLV 167
++ TPA A + L + R+ F+Q R+K RE + ++ R+ L+
Sbjct: 29 VRLSTPA----PAMTTPAAGEALPPENRRGFRQALRAARQKNREITQRARQARREAAALL 84
Query: 168 DTDELDSENV 177
+ LD+ +
Sbjct: 85 ASPGLDTAAL 94
>gnl|CDD|206685 cd01898, Obg, Obg GTPase. The Obg nucleotide binding protein
subfamily has been implicated in stress response,
chromosome partitioning, replication initiation,
mycelium development, and sporulation. Obg proteins are
among a large group of GTP binding proteins conserved
from bacteria to humans. The E. coli homolog, ObgE is
believed to function in ribosomal biogenesis. Members of
the subfamily contain two equally and highly conserved
domains, a C-terminal GTP binding domain and an
N-terminal glycine-rich domain.
Length = 170
Score = 27.8 bits (63), Expect = 2.7
Identities = 11/54 (20%), Positives = 23/54 (42%), Gaps = 5/54 (9%)
Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRM---NKASKELIEKRRKLFKQFEELREKLR 150
E + I+ L+ Y P ++ R+ NK E+R + K+ + + +
Sbjct: 96 EDYETIRNELEAYNPGLA--EKPRIVVLNKIDLLDAEERFEKLKELLKELKGKK 147
>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
Length = 536
Score = 28.3 bits (63), Expect = 2.9
Identities = 20/60 (33%), Positives = 31/60 (51%), Gaps = 2/60 (3%)
Query: 107 KSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWE-AEKEERKYLRN 165
KSL++ AF KD +R K + LIE K +E + ++ + E A+KE +YL
Sbjct: 407 KSLEEII-AFNKKDSKRRIKYGQTLIEASEKSAITKDEFEKVVQTSQENAKKELDRYLVE 465
>gnl|CDD|232831 TIGR00115, tig, trigger factor. Trigger factor is a
ribosome-associated molecular chaperone and is the first
chaperone to interact with nascent polypeptide. Trigger
factor can bind at the same time as the signal
recognition particle (SRP), but is excluded by the SRP
receptor (FtsY). The central domain of trigger factor
has peptidyl-prolyl cis/trans isomerase activity. This
protein is found in a single copy in virtually every
bacterial genome [Protein fate, Protein folding and
stabilization].
Length = 408
Score = 28.3 bits (64), Expect = 3.0
Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 11/73 (15%)
Query: 129 KELIEKRRKLFKQF-------EELREKLRETWEAEKEER--KYLRNLVDTDELDSENVEE 179
KEL E + K EEL+ +R+ E EK+ER L+ + D+L EN E
Sbjct: 232 KELPELDDEFAKSLGEEFETLEELKADIRKNLEREKKERAKNKLKEQL-LDKLV-ENNEF 289
Query: 180 EVVEFVIKEEITI 192
E+ E ++++EI
Sbjct: 290 ELPESLVEQEIDR 302
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 28.5 bits (64), Expect = 3.0
Identities = 13/50 (26%), Positives = 27/50 (54%)
Query: 126 KASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSE 175
K S E ++ +++ F +L +LR+ E++E + + + TD L+ E
Sbjct: 61 KGSLERAKQYQQVIDNFPKLSAELRQQLNNERDEPRSVPPNMSTDALEQE 110
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 28.1 bits (63), Expect = 3.0
Identities = 19/95 (20%), Positives = 36/95 (37%), Gaps = 15/95 (15%)
Query: 98 SKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEK 157
KE DK + L+K ++ I+K++K K+ + L +KL E +
Sbjct: 25 PKELYDKEIERLEKQIKELDSS---------SSGIDKKKKEKKRLKSLIKKLEEELKKHI 75
Query: 158 EE----RKYLRNLVDT--DELDSENVEEEVVEFVI 186
E +K L + D + + ++ I
Sbjct: 76 EHNEKTKKRLSEEKSSWFPSKDPDEINSAFLQLCI 110
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 28.1 bits (63), Expect = 3.1
Identities = 24/105 (22%), Positives = 45/105 (42%), Gaps = 13/105 (12%)
Query: 86 QLRWRPRPASLLSKEQVDKIKKSLKKY------TPAFEAKDRQRMNK------ASKELIE 133
Q +W+ R L +K+ +++ KK L+ P K ++ +K A K+ I+
Sbjct: 177 QQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIK 236
Query: 134 K-RRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENV 177
+ L ++L+ L E +A + + K L N D +N
Sbjct: 237 SAKNDLQNDKKQLKADLAELKKAPQNDLKRLENKYAIKSGDLKNF 281
Score = 26.6 bits (59), Expect = 8.9
Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 13/88 (14%)
Query: 95 SLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRK------------LFKQF 142
L + E ++I+KSLK ++ + + NK E +KR + L K
Sbjct: 159 DLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIK 218
Query: 143 EELREKLRETWEAEKEERKYLRNLVDTD 170
EE +KL++ +A+K++ K +N + D
Sbjct: 219 EEF-DKLKKEGKADKQKIKSAKNDLQND 245
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 28.1 bits (63), Expect = 3.1
Identities = 16/54 (29%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 117 EAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWE---AEKEERKYLRNLV 167
E + QR+ +A + E++ + KQ +E +++ +ET E A + YL +LV
Sbjct: 171 ELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEKIAARAFAQGYLSDLV 224
>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated. The HNOBA domain is
found associated with the HNOB domain and pfam00211 in
soluble cyclases and signalling proteins. The HNOB
domain is predicted to function as a heme-dependent
sensor for gaseous ligands, and transduce diverse
downstream signals, in both bacteria and animals.
Length = 209
Score = 27.6 bits (62), Expect = 3.2
Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 12/52 (23%)
Query: 127 ASKELI---EKRR---KLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL 172
A+++LI E+ R KL K+ E+L++KL E +EE+K TD+L
Sbjct: 151 ATRDLILLGEQARAQLKLKKRLEKLKDKLEEAHRELEEEKK------KTDDL 196
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 28.2 bits (63), Expect = 3.2
Identities = 16/74 (21%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 120 DRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE-ERKYLRNLVDTDELDSENVE 178
++ +N ++L ++L ++ E L + L+E + +KE E++ E +E
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLE-LE 567
Query: 179 EEVVEFV--IKEEI 190
+E E + +K+E+
Sbjct: 568 KEAQEALKALKKEV 581
>gnl|CDD|237841 PRK14867, PRK14867, DNA topoisomerase VI subunit B; Provisional.
Length = 659
Score = 28.2 bits (63), Expect = 3.3
Identities = 18/73 (24%), Positives = 35/73 (47%), Gaps = 2/73 (2%)
Query: 97 LSKEQVDKIKKSLKKYTPAF-EAKDRQRMNKAS-KELIEKRRKLFKQFEELREKLRETWE 154
L +E++ +IK KK F E ++ +N + E +K + + K EE ++L++
Sbjct: 354 LDEEEIKEIKNHFKKKPENFMEWAEKNYLNASLLDEFRKKVKNITKSPEEFVKELKDKAM 413
Query: 155 AEKEERKYLRNLV 167
EE + R +
Sbjct: 414 VSDEELEKYRKEI 426
>gnl|CDD|220764 pfam10454, DUF2458, Protein of unknown function (DUF2458). This a
is family of uncharacterized proteins.
Length = 155
Score = 27.5 bits (61), Expect = 3.3
Identities = 14/48 (29%), Positives = 25/48 (52%)
Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSEN 176
+ LI ++ + +Q+ + RE L + EA KE++K L +L S
Sbjct: 30 RRLIIEQHRHERQWWKEREALIQKQEARKEDKKKLLPNGTDSKLSSVG 77
>gnl|CDD|216925 pfam02194, PXA, PXA domain. This domain is associated with PX
domains pfam00787.
Length = 182
Score = 27.6 bits (62), Expect = 3.5
Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%)
Query: 92 RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
A LL+ + + K L+ + A E R + +++ + +L E KL
Sbjct: 56 DLADLLTDRLLPLLTKHLESFREA-EQAVRNLSSSSTES---EELELALALEPESAKLHP 111
Query: 152 TWEAEKEERKYLRNLVD 168
+E+EE +YLR LV+
Sbjct: 112 ALSSEEEELEYLRKLVE 128
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.1 bits (63), Expect = 3.6
Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 9/93 (9%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKAS--KELIEKRRKLFKQFEELREKLRETWEAE 156
+E++++++ L++ E + + K KE +E+R L ++ E+L +L E A+
Sbjct: 308 RERLEELENELEEL----EERLEELKEKIEALKEELEERETLLEELEQLLAELEE---AK 360
Query: 157 KEERKYLRNLVDTDELDSENVEEEVVEFVIKEE 189
+E + L L++ E E + EE+ E +
Sbjct: 361 EELEEKLSALLEELEELFEALREELAELEAELA 393
>gnl|CDD|165679 PLN00107, PLN00107, FAD-dependent oxidoreductase; Provisional.
Length = 257
Score = 27.6 bits (61), Expect = 3.7
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 70 SFQGKIIKRFNSPTFCQLRWRPRPASLLSKEQVDKI 105
+F G I K + F +++ R P L S E DKI
Sbjct: 167 AFDGAIAKYKKAGEFLKVKERLDPEGLFSSEWSDKI 202
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 27.9 bits (63), Expect = 4.0
Identities = 25/141 (17%), Positives = 53/141 (37%), Gaps = 27/141 (19%)
Query: 80 NSPTFCQLRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKE--------- 130
+P + + + R L +++ ++ + +K E +R N+ SKE
Sbjct: 9 ENPDAVREKLKKRGGDALDVDKLLELDEERRKLLRELEELQAER-NELSKEIGRALKRGE 67
Query: 131 -----LIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTD----ELDSENVE--- 178
LI + ++L ++ +EL L E + N+ + + +NVE
Sbjct: 68 DDAEELIAEVKELKEKLKELEAALDELEAELDTLLLTIPNIPHESVPVGKDEDDNVEVRR 127
Query: 179 -----EEVVEFVIKEEITICE 194
V +F K+ + + E
Sbjct: 128 WGEPPVFVFDFEPKDHVELGE 148
Score = 27.2 bits (61), Expect = 6.0
Identities = 18/90 (20%), Positives = 40/90 (44%), Gaps = 8/90 (8%)
Query: 99 KEQVDKIKKSLKK---YTPAFEA-----KDRQRMNKASKELIEKRRKLFKQFEELREKLR 150
+E D +++ LKK + ++R+++ + +EL +R +L K+ ++
Sbjct: 8 RENPDAVREKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGE 67
Query: 151 ETWEAEKEERKYLRNLVDTDELDSENVEEE 180
+ E E K L+ + E + +E E
Sbjct: 68 DDAEELIAEVKELKEKLKELEAALDELEAE 97
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 27.9 bits (63), Expect = 4.1
Identities = 11/51 (21%), Positives = 21/51 (41%)
Query: 130 ELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEE 180
IE+ + + E EK+ W+ E+ D D+ D + E++
Sbjct: 330 TFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDD 380
>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH, subunit TFB3 [Cell division and chromosome
partitioning / Transcription / DNA replication,
recombination, and repair].
Length = 314
Score = 27.7 bits (61), Expect = 4.3
Identities = 25/110 (22%), Positives = 44/110 (40%), Gaps = 22/110 (20%)
Query: 96 LLSKEQVDKIKKSLKKY-------------TPAFEAKDRQRMNKASKELIEKRRKLFKQF 142
LL V ++ +KKY P E ++ N K + K+R+L +Q
Sbjct: 120 LLELIDVSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQI 179
Query: 143 EELREKLRETWEAE---------KEERKYLRNLVDTDELDSENVEEEVVE 183
EE +E + E ++ K ++ + +D+ D E + VVE
Sbjct: 180 EEEERMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVE 229
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 27.7 bits (63), Expect = 4.5
Identities = 16/79 (20%), Positives = 29/79 (36%), Gaps = 17/79 (21%)
Query: 121 RQRMNKASKE-------------LIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLV 167
+ N SKE LI + ++L ++ + L +L E +E + NL
Sbjct: 48 QAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEELLLRIPNLP 107
Query: 168 DTD----ELDSENVEEEVV 182
+ + +NVE
Sbjct: 108 HDSVPVGKDEDDNVEVRRW 126
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 27.7 bits (62), Expect = 4.5
Identities = 12/73 (16%), Positives = 29/73 (39%), Gaps = 9/73 (12%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
++ +++ + EA+ ++ + + + L ++ +L EE L E E
Sbjct: 66 SKEFLDLEEEILDL----EAEIKE-VEENLESLEKEINEL----EEWLNVLDEEKSFLDE 116
Query: 159 ERKYLRNLVDTDE 171
+ L L + D
Sbjct: 117 NLEELSELSNLDI 129
>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
family.
Length = 308
Score = 27.3 bits (61), Expect = 4.7
Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 16/96 (16%)
Query: 98 SKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEK 157
S E + +I++ L+ K ++E L + E L E L + E
Sbjct: 87 SLEGLKEIRQLLESLDGPLLGLLL-------KVILE---PLLELLELLLELLNDDDPLEV 136
Query: 158 EERKYLRNLVDTDELDS-----ENVEEEVVEFVIKE 188
+ +++ D ELD E +EEE+ E + KE
Sbjct: 137 NDGGLIKDGFD-PELDELREKLEELEEELEELLKKE 171
>gnl|CDD|213577 TIGR01018, rpsD_arch, ribosomal protein S4(archaeal
type)/S9(eukaryote cytosolic type). This model finds
eukaryotic ribosomal protein S9 as well as archaeal
ribosomal protein S4 [Protein synthesis, Ribosomal
proteins: synthesis and modification].
Length = 162
Score = 27.0 bits (60), Expect = 4.8
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 2/52 (3%)
Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEE 180
KE +++ KL ++ LR K +E W+AE RKY R + LD E E
Sbjct: 20 KERLDRELKLVGKYG-LRNK-KEVWKAETTLRKYRRAARELLGLDDEQRLFE 69
>gnl|CDD|227488 COG5159, RPN6, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 421
Score = 27.6 bits (61), Expect = 4.8
Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 1/70 (1%)
Query: 96 LLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWE- 154
+SK + ++ A E + ++ K + LIEK E+ + L E
Sbjct: 55 YVSKGDYCSLGDTITSSREAMEDFTKPKITKIIRTLIEKFPYSSDSLEDQIKVLTALIEW 114
Query: 155 AEKEERKYLR 164
A++E+RK+LR
Sbjct: 115 ADREKRKFLR 124
>gnl|CDD|226650 COG4187, RocB, Arginine degradation protein (predicted deacylase)
[Amino acid transport and metabolism].
Length = 553
Score = 27.4 bits (61), Expect = 4.8
Identities = 21/102 (20%), Positives = 39/102 (38%), Gaps = 13/102 (12%)
Query: 95 SLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFK---------QFEEL 145
S +KE D++K+ + T A EA + R +E + + F+EL
Sbjct: 308 SRTAKELFDRLKEEAE--TAAEEAIETLR--DRYEEYGKLVNRPAGPLPAKPRVLTFQEL 363
Query: 146 REKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIK 187
E+ R + E + ELD ++ E+++
Sbjct: 364 LERARVRGHIDAEYAEKEYEFAQNGELDLRLRSTQLTEWLVG 405
>gnl|CDD|216112 pfam00773, RNB, RNB domain. This domain is the catalytic domain of
ribonuclease II.
Length = 322
Score = 27.3 bits (61), Expect = 4.9
Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 4/78 (5%)
Query: 92 RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRK----LFKQFEELRE 147
R + L+ E+VD++ + A A D + + + +K L +KR K + E +
Sbjct: 117 RSKARLTYEEVDELLEGKPSEREAPVADDLELLYELAKALRKKRLKRGALELEDLPEAKI 176
Query: 148 KLRETWEAEKEERKYLRN 165
L E E R R+
Sbjct: 177 DLDEDGEPVDIIRIEDRS 194
>gnl|CDD|226133 COG3605, PtsP, Signal transduction protein containing GAF and PtsI
domains [Signal transduction mechanisms].
Length = 756
Score = 27.8 bits (62), Expect = 4.9
Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 5/56 (8%)
Query: 126 KASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRN---LVDTDELDSENVE 178
+A +EL+ + +L + +LR T + E+ + L N VD +L +
Sbjct: 411 RAYQELVREEAELSRLARDLRNLPAITKDGERIK--LLMNAGLSVDLPQLLESGAD 464
>gnl|CDD|219289 pfam07083, DUF1351, Protein of unknown function (DUF1351). This
family consists of several bacterial and phage proteins
of around 230 residues in length. The function of this
family is unknown.
Length = 220
Score = 27.3 bits (61), Expect = 5.0
Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 32/122 (26%)
Query: 99 KEQVDKIKKSLKKYTPAFEA-----KDRQRMNKASKELIEKRRKLFKQF----------- 142
++ +D++ K Y E K+R +NK K L +KR+++ KQ+
Sbjct: 21 EKAIDEVVAKYKGYVVTEENYKDDKKERAELNKLIKALNDKRKEIKKQYSKPADEFEAYM 80
Query: 143 -----------EELREKLRETWEAEKEERK-----YLRNLVDTDELDSENVEEEVVEFVI 186
+ + ++E E +KE R+ YL + LD + EE+ ++V
Sbjct: 81 KKLIAPIKAVIDLIDSGIKEFEEKQKELRRDTVKEYLSEKAEEYGLDKRDFEEKYDDYVK 140
Query: 187 KE 188
K
Sbjct: 141 KS 142
>gnl|CDD|197366 cd09916, CpxP_like, CpxP component of the bacterial
Cpx-two-component system and related proteins. This
family summarizes bacterial proteins related to CpxP, a
periplasmic protein that forms part of a two-component
system which acts as a global modulator of cell-envelope
stress in gram-negative bacteria. CpxP aids in combating
extracytoplasmic protein-mediated toxicity, and may also
be involved in the response to alkaline pH. Functioning
as a dimer, it inhibits activation of the kinase CpxA,
but also plays a vital role in the quality control
system of P pili. It has been suggested that CpxP
directly interacts with CpxA via its concave polar
surface. Another member of this family, Spy, is also a
periplasmic protein that may be involved in the response
to stress. The homology between CpxP and Spy suggests
similar functions. A characteristic 5-residue sequence
motif LTXXQ is found repeated twice in many members of
this family.
Length = 96
Score = 26.0 bits (58), Expect = 5.1
Identities = 10/47 (21%), Positives = 21/47 (44%)
Query: 133 EKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEE 179
E++ ++ + R +++ E + R+ LR L+ D D V
Sbjct: 7 EQKAQIKAIRQAARAQMKALREQMRAAREELRALLTADTFDEAAVRA 53
>gnl|CDD|235623 PRK05849, PRK05849, hypothetical protein; Provisional.
Length = 783
Score = 27.6 bits (62), Expect = 5.1
Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 11/81 (13%)
Query: 81 SPTFCQLRWRPRPASLLSKEQVDKIKKSLKKYT-------PAFEAKDRQRMNK---ASKE 130
T + P S+ ++ +I+ SL K T KD +++ K KE
Sbjct: 367 DFTLDKKLEELLPG-GFSENEIKRIEFSLLKLTNRIISPKSGLFRKDLEKIEKLEERYKE 425
Query: 131 LIEKRRKLFKQFEELREKLRE 151
L+ + L E +
Sbjct: 426 LLNSNLDPLDKIYWLLEDCKR 446
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 27.3 bits (61), Expect = 5.2
Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 9/67 (13%)
Query: 106 KKSLKKYTPA----FEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
KK + K F D +M K E+++ + K+ EK E EK E +
Sbjct: 68 KKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKE-----EKKAIKAEKEKLEEE 122
Query: 162 YLRNLVD 168
Y ++D
Sbjct: 123 YGFCILD 129
>gnl|CDD|224477 COG1561, COG1561, Uncharacterized stress-induced protein [Function
unknown].
Length = 290
Score = 27.2 bits (61), Expect = 5.7
Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 18/86 (20%)
Query: 124 MNKASKELIEKRRK-------LFKQ----FEELREKLRETWEAEKEER--KYLRNLVD-T 169
+ +A +LIE R + Q EEL EK+ E + + L +
Sbjct: 137 LEEALDDLIEMREREGAALKADLLQRLDAIEELVEKVESLMPEILEWYRERLVARLNEAQ 196
Query: 170 DELDSENVEEEVVEFVIK----EEIT 191
D+LD + +E+EV K EE+
Sbjct: 197 DQLDEDRLEQEVALLAQKADIAEELD 222
>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein. Mei5
is one of a pair of meiosis-specific proteins which
facilitate the loading of Dmc1 on to Rad51 on DNA at
double-strand breaks during recombination. Recombination
is carried out by a large protein complex based around
the two RecA homologues, Rad51 and Dmc1. This complex
may play both a catalytic and a structural role in the
interaction between homologous chromosomes during
meiosis. Mei5 is seen to contain a coiled-coli region.
Length = 212
Score = 27.1 bits (60), Expect = 5.7
Identities = 14/33 (42%), Positives = 18/33 (54%)
Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
KEL ++RKL Q +E + LR EK E K
Sbjct: 125 KELEAEKRKLEYQVDEESDDLRRLKLVEKYEIK 157
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 27.4 bits (62), Expect = 5.8
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 13/60 (21%)
Query: 97 LSKEQVDKIKKSLKKYT-----PAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
L KE K++K +++ F + KA +E++EK R+ ++EE KL+E
Sbjct: 816 LEKEL-AKLEKEIERVEKKLSNEGF-------VAKAPEEVVEKEREKLAEYEEKLAKLKE 867
>gnl|CDD|222389 pfam13801, Metal_resist, Heavy-metal resistance. This is a
metal-binding protein which is involved in resistance to
heavy-metal ions. The protein forms a four-helix hooked
hairpin, consisting of two long alpha helices each
flanked by a shorter alpha helix. It binds a metal ion
in a type-2 like centre. It contains two copies of an
LTXXQ motif.
Length = 125
Score = 26.4 bits (59), Expect = 5.8
Identities = 13/72 (18%), Positives = 26/72 (36%)
Query: 115 AFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDS 174
A + R+ + A + + R L + R +L AE + LR +
Sbjct: 41 ALPPEQRRALRAALRAHRREVRPLRRALRAARRELLALLRAEPFDPAALRAALARQREAR 100
Query: 175 ENVEEEVVEFVI 186
++ V E ++
Sbjct: 101 AALQARVQEALL 112
>gnl|CDD|223962 COG1031, COG1031, Uncharacterized Fe-S oxidoreductase [Energy
production and conversion].
Length = 560
Score = 27.3 bits (61), Expect = 6.1
Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 3/30 (10%)
Query: 123 RMNKASKELIEKRRKLFKQFEELREKLRET 152
M + K+ EK +KLF F + K+RE
Sbjct: 410 PMWERGKKKAEKHKKLFAAF---KRKVREE 436
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 27.5 bits (62), Expect = 6.2
Identities = 20/89 (22%), Positives = 42/89 (47%), Gaps = 10/89 (11%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
K++ D+I K L+ + + + + ELIE R++L K E+ +K ++ E ++E
Sbjct: 583 KKEADEIIKELR------QLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636
Query: 159 ----ERKYLRNLVDTDELDSENVEEEVVE 183
+ +L E+ S ++E +
Sbjct: 637 LKVGDEVKYLSLGQKGEVLSIPDDKEAIV 665
>gnl|CDD|217372 pfam03105, SPX, SPX domain. We have named this region the SPX
domain after (SYG1, Pho81 and XPR1). This 180 residue
length domain is found at the amino terminus of a
variety of proteins. In the yeast protein SYG1, the
N-terminus directly binds to the G- protein beta subunit
and inhibits transduction of the mating pheromone
signal. This finding suggests that all the members of
this family are involved in G-protein associated signal
transduction. The N-termini of several proteins involved
in the regulation of phosphate transport, including the
putative phosphate level sensors PHO81 from
Saccharomyces cerevisiae and NUC-2 from Neurospora
crassa, are also members of this family. The SPX domain
of S. cerevisiae low-affinity phosphate transporters
Pho87 and Pho90 auto-regulates uptake and prevents
efflux. This SPX dependent inhibition is mediated by the
physical interaction with Spl2 NUC-2 contains several
ankyrin repeats pfam00023. Several members of this
family are annotated as XPR1 proteins: the xenotropic
and polytropic retrovirus receptor confers
susceptibility to infection with murine leukaemia
viruses (MLV). The similarity between SYG1, phosphate
regulators and XPR1 sequences has been previously noted,
as has the additional similarity to several predicted
proteins, of unknown function, from Drosophila
melanogaster, Arabidopsis thaliana, Caenorhabditis
elegans, Schizosaccharomyces pombe, and Saccharomyces
cerevisiae. In addition, given the similarities between
XPR1 and SYG1 and phosphate regulatory proteins, it has
been proposed that XPR1 might be involved in G-protein
associated signal transduction and may itself function
as a phosphate sensor.
Length = 176
Score = 26.6 bits (59), Expect = 6.3
Identities = 10/52 (19%), Positives = 27/52 (51%)
Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
E++++++K L++ + + + + K L++ +L K EL +L+
Sbjct: 89 ERLEELEKQLEELEERRDETSSAKSSPSDKNLLKAFEELKKALLELYRELQL 140
>gnl|CDD|214682 smart00475, 53EXOc, 5'-3' exonuclease.
Length = 259
Score = 26.8 bits (60), Expect = 6.3
Identities = 15/50 (30%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 126 KASKELIEKRR---KLFKQFEELREKLRETWEAEKEERKYLRNLV--DTD 170
K + +L+++ + + ++L++KLRE A KE+ K R L +TD
Sbjct: 197 KTAAKLLKEFGSLENILENLDKLKKKLREKLLAHKEDAKLSRKLATIETD 246
>gnl|CDD|233842 TIGR02389, RNA_pol_rpoA2, DNA-directed RNA polymerase, subunit A''.
This family consists of the archaeal A'' subunit of the
DNA-directed RNA polymerase. The example from
Methanocaldococcus jannaschii contains an intein
[Transcription, DNA-dependent RNA polymerase].
Length = 367
Score = 26.9 bits (60), Expect = 6.4
Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 6/42 (14%)
Query: 136 RKLFKQFEELREKLRETWEAEKEE------RKYLRNLVDTDE 171
KL K+ EE +K + + E +E +YLR+L+D E
Sbjct: 4 EKLLKELEETVKKREISDKEELDEIIKRVEEEYLRSLIDPGE 45
>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
Willebrand factor type A (vWA) domain [General function
prediction only].
Length = 437
Score = 27.0 bits (60), Expect = 6.6
Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 9/78 (11%)
Query: 86 QLRWRPRPASLLSKEQVDKIKKSLK---KYTPAFEAKDRQRMNKASKELIEKRRKLFKQF 142
L P LLS + ++K++ + K++ + +R + ++L +
Sbjct: 71 FLVQLPFILKLLSSDVYRRVKEATRLNEKHSLSAPESFLERWEELLQDLQREGS------ 124
Query: 143 EELREKLRETWEAEKEER 160
E+ E RE +EK+E
Sbjct: 125 EDFLEGEREGLLSEKQEE 142
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 26.8 bits (60), Expect = 6.6
Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 16/67 (23%)
Query: 104 KIKKSLKK--YTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
I K L+K +TP +E + ++ EK++ + K+ EK E +K E
Sbjct: 78 AIIKDLEKCDFTPIYEYFEAEK---------EKKKAMSKE-----EKKAIKEEKDKLEEP 123
Query: 162 YLRNLVD 168
Y LVD
Sbjct: 124 YGYCLVD 130
>gnl|CDD|221794 pfam12825, DUF3818, Domain of unknown function in PX-proteins
(DUF3818). This domain is found on proteins carrying a
PX domain. Its function is unknown.
Length = 340
Score = 26.9 bits (60), Expect = 6.8
Identities = 11/54 (20%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 140 KQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIKEEITIC 193
K+FE+ ++L + + L+N V+ E++++ + E +KE+ +
Sbjct: 110 KEFEKEIKELEAKIDDYPVICQKLKNFVE----ADEDIKDAIREEAVKEDEDLL 159
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 26.7 bits (59), Expect = 6.9
Identities = 12/52 (23%), Positives = 25/52 (48%), Gaps = 2/52 (3%)
Query: 117 EAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEA--EKEERKYLRNL 166
E + M +A + E+R+ L + E R+ W E+E++++ + L
Sbjct: 70 EQQRASFMAQAQEAADEQRQHLLNEAREDVATARDEWLEQLEREKQEFFKAL 121
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 26.8 bits (60), Expect = 7.4
Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 11/73 (15%)
Query: 97 LSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEE-LREKL------ 149
LS E + K+ K+ ++ E + + +E EK+ + K+ E KL
Sbjct: 253 LSPEVLRKVDKTREE---EEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQR 309
Query: 150 -RETWEAEKEERK 161
E E +K+ RK
Sbjct: 310 KLEEKERKKQARK 322
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 26.7 bits (60), Expect = 7.5
Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 6/48 (12%)
Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSEN 176
+ L E+ +L +Q +L KLR + E E K R L + L SE
Sbjct: 4 EALEERNSELEEQIRQLELKLR---DLEAENEKLEREL---ERLKSEL 45
>gnl|CDD|234956 PRK01490, tig, trigger factor; Provisional.
Length = 435
Score = 27.0 bits (61), Expect = 7.7
Identities = 13/72 (18%), Positives = 33/72 (45%), Gaps = 6/72 (8%)
Query: 121 RQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL-DSENVE- 178
+ + +A ++ ++ + + E+ RE + + E R ++ + DE+ +E +E
Sbjct: 311 DRLLRQALQQGLDLEGQFLEDTGTTEEEPREEFREQAERR--VKLGLLLDEIAKAEEIEV 368
Query: 179 --EEVVEFVIKE 188
EEV + +
Sbjct: 369 SDEEVKAEIEEM 380
Score = 26.7 bits (60), Expect = 8.4
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 143 EELREKLRETWEAEKEERKYLRNLVDTDELD--SENVEEEVVEFVIKEEI 190
EEL+ +R+ E EK+E R V LD EN E ++ E ++++EI
Sbjct: 263 EELKADIRKNLEREKKEA--QRAKVKEAVLDALVENAEIDLPEALVEQEI 310
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 26.6 bits (59), Expect = 7.8
Identities = 17/98 (17%), Positives = 40/98 (40%), Gaps = 7/98 (7%)
Query: 87 LRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELR 146
++ P L KE + + K +K ++++ ++E + KL ++L+
Sbjct: 102 IKEHPFDLELFPKELYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLK 161
Query: 147 EKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEF 184
E E + E E+ + +E + E +E+ +
Sbjct: 162 ELEAEDVDEEDEK-------DEEEEEEEEEEDEDFDDD 192
>gnl|CDD|148363 pfam06717, DUF1202, Protein of unknown function (DUF1202). This
family consists of several hypothetical bacterial
proteins of around 335 residues in length. Members of
this family are found exclusively in Escherichia coli
and Salmonella species and are often referred to as YggM
proteins. The function of this family is unknown.
Length = 308
Score = 26.8 bits (59), Expect = 8.0
Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 7/48 (14%)
Query: 106 KKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETW 153
K+S + Y + K +N E+RR+ F + +EL KL++ W
Sbjct: 231 KQSEECYEVPIQQKRDFDIN-------EQRRQTFLKSQELSRKLQDDW 271
>gnl|CDD|234829 PRK00750, lysK, lysyl-tRNA synthetase; Reviewed.
Length = 510
Score = 26.7 bits (60), Expect = 8.0
Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 5/86 (5%)
Query: 94 ASLLSKEQVDKIKKSLKKYTPAFEAKD----RQRMNKASKELIEKRRKLFKQFEELREKL 149
A + + E + + LK+Y P + + + A K++ EK
Sbjct: 372 AQIANAEDKEVLWGFLKRYAPGATPETHPRLDRLVEYAIN-WYRDFVAPEKKYRAPTEKE 430
Query: 150 RETWEAEKEERKYLRNLVDTDELDSE 175
R E + + L + D +E+ +E
Sbjct: 431 RAALEDLDDALRELADGADAEEIQNE 456
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 26.4 bits (59), Expect = 8.2
Identities = 17/80 (21%), Positives = 38/80 (47%), Gaps = 10/80 (12%)
Query: 97 LSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLF-------KQFEELREKL 149
++E+ ++++ +K A E R +K +++ E + F KQF+ L + L
Sbjct: 79 EARERFAELREQYEKTRKAIEKHGRG--SKKAQKAREALAEEFMQFRLVPKQFDRLVDNL 136
Query: 150 RETWEA-EKEERKYLRNLVD 168
R + +ER+ ++ V+
Sbjct: 137 RGMVDRVRAQERQIMKLCVE 156
>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353).
Members of this family of uncharacterized proteins have
no known function.
Length = 319
Score = 26.8 bits (59), Expect = 8.3
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKA 127
+E+ + IK +L KY EAK+++ M KA
Sbjct: 209 EEESELIKMTLAKYKQMLEAKNKKTMVKA 237
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 26.1 bits (58), Expect = 8.4
Identities = 16/58 (27%), Positives = 26/58 (44%), Gaps = 3/58 (5%)
Query: 105 IKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKY 162
KY F+AKD +N A KE K K+ ++ + +R+ + KEE+
Sbjct: 69 KNNGPPKYLVFFKAKDADVINAAFKEF---TAKKLKKEQDEKPSVRKKLKKFKEEQAA 123
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 26.6 bits (59), Expect = 8.4
Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 21/93 (22%)
Query: 99 KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLF----KQFEELREKLRETWE 154
K + ++ ++ ++ + + + KA +E +R++L ++ +E+REK +E
Sbjct: 52 KREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLLDEAREEADEIREKWQEALR 111
Query: 155 AEKEE-----------------RKYLRNLVDTD 170
E+ RK L +L DTD
Sbjct: 112 REQAALSDELRRRTGAEVFAIARKVLTDLADTD 144
>gnl|CDD|180485 PRK06245, cofG, FO synthase subunit 1; Reviewed.
Length = 336
Score = 26.8 bits (60), Expect = 8.5
Identities = 11/34 (32%), Positives = 15/34 (44%)
Query: 79 FNSPTFCQLRWRPRPASLLSKEQVDKIKKSLKKY 112
N +C R P SLLS E+V +I +
Sbjct: 23 RNRCGYCTFRRDPGQPSLLSPEEVKEILRRGADA 56
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown
to be a U1 snRNA associated protein with a role in
splice site recognition. The family also contains human
and mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 26.6 bits (59), Expect = 8.6
Identities = 13/62 (20%), Positives = 29/62 (46%), Gaps = 7/62 (11%)
Query: 90 RPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKL 149
+ + A+ E++ ++ + + K EA + +++ KL K+ EEL+ K
Sbjct: 113 QTKIAADSKAEELAELDEEIGKLLAEAEALGEEGK-------VDEAMKLMKEVEELKAKK 165
Query: 150 RE 151
+E
Sbjct: 166 KE 167
>gnl|CDD|151490 pfam11043, DUF2856, Protein of unknown function (DUF2856). Some
members in this viral family of proteins with unknown
function are annotated as Abc2 however this cannot be
confirmed.
Length = 97
Score = 25.7 bits (56), Expect = 8.7
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 8/59 (13%)
Query: 114 PAFEAKDRQRMNKASK-ELIEKRRKLFKQFEEL-------REKLRETWEAEKEERKYLR 164
P + A D +R + S E+++K RK + L R+ R+ W AEK+E++ +
Sbjct: 4 PLYGADDPRRCSGNSVSEVLDKFRKNYDAIMSLPQETKAERKFRRDIWLAEKQEKERIN 62
>gnl|CDD|226967 COG4608, AppF, ABC-type oligopeptide transport system, ATPase
component [Amino acid transport and metabolism].
Length = 268
Score = 26.4 bits (59), Expect = 8.7
Identities = 11/43 (25%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 70 SFQGKIIKRFNSPTFCQLRWRPRPASLLSK-EQVDKIKKSLKK 111
S G++I PT ++ + + + LSK E+ +++ + L+K
Sbjct: 53 STLGRLILGLEEPTSGEILFEGKDITKLSKEERRERVLELLEK 95
>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
Length = 871
Score = 26.8 bits (60), Expect = 8.8
Identities = 16/50 (32%), Positives = 24/50 (48%)
Query: 123 RMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL 172
R NK ELI KR++ ++ +E+L T + E KY R + L
Sbjct: 719 RTNKLDYELIAKRKEEYELYEKLTPPRVMTSDGEIITGKYKRENLPAGAL 768
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 26.4 bits (59), Expect = 9.0
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 119 KDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEER 160
+ + + + K+L E++ +L K+ EL KL + E+EER
Sbjct: 120 QGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEER 161
>gnl|CDD|225468 COG2916, Hns, DNA-binding protein H-NS [General function prediction
only].
Length = 128
Score = 25.9 bits (57), Expect = 9.0
Identities = 10/35 (28%), Positives = 18/35 (51%)
Query: 127 ASKELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
EL+E+ + +Q + R++ AE EER+
Sbjct: 15 TYLELLEEMLEKEEQVVQERQEEEAAAIAEIEERQ 49
>gnl|CDD|197875 smart00788, Adenylsucc_synt, Adenylosuccinate synthetase.
Adenylosuccinate synthetase plays an important role in
purine biosynthesis, by catalyzing the GTP-dependent
conversion of IMP and aspartic acid to AMP.
Adenylosuccinate synthetase has been characterized from
various sources ranging from Escherichia coli (gene
purA) to vertebrate tissues. In vertebrates, two
isozymes are present - one involved in purine
biosynthesis and the other in the purine nucleotide
cycle. The crystal structure of adenylosuccinate
synthetase from E. coli reveals that the dominant
structural element of each monomer of the homodimer is a
central beta-sheet of 10 strands. The first nine strands
of the sheet are mutually parallel with right-handed
crossover connections between the strands. The 10th
strand is antiparallel with respect to the first nine
strands. In addition, the enzyme has two antiparallel
beta-sheets, comprised of two strands and three strands
each, 11 alpha-helices and two short 3/10-helices.
Further, it has been suggested that the similarities in
the GTP-binding domains of the synthetase and the p21ras
protein are an example of convergent evolution of two
distinct families of GTP-binding proteins. Structures of
adenylosuccinate synthetase from Triticum aestivum and
Arabidopsis thaliana when compared with the known
structures from E. coli reveals that the overall fold is
very similar to that of the E. coli protein.
Length = 417
Score = 26.7 bits (60), Expect = 9.3
Identities = 14/43 (32%), Positives = 23/43 (53%), Gaps = 3/43 (6%)
Query: 142 FEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEF 184
E LREKL E + + L+ L + +D E + EE++E+
Sbjct: 148 EEVLREKLEELLD---YKNFLLKKLYGAEPVDVEEILEELLEY 187
>gnl|CDD|237215 PRK12815, carB, carbamoyl phosphate synthase large subunit;
Reviewed.
Length = 1068
Score = 26.9 bits (60), Expect = 9.8
Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 127 ASKELIEK--RRKLFKQF-EELREKLRETWEAEKEE 159
+ E I+K R+ F+ +EL E + E+ E
Sbjct: 118 TNIEAIQKGEDRERFRALMKELGEPVPESEIVTSVE 153
>gnl|CDD|223610 COG0536, Obg, Predicted GTPase [General function prediction only].
Length = 369
Score = 26.4 bits (59), Expect = 9.9
Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 3/87 (3%)
Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRM-NKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
E I+ L+KY+P K R + NK L E+ + K+ +
Sbjct: 257 EDYQTIRNELEKYSPKLAEKPRIVVLNKIDLPLDEEELEELKKALAEALGWEVFYLISAL 316
Query: 159 ERKYLRNLVD--TDELDSENVEEEVVE 183
R+ L L+ + L+ E E E
Sbjct: 317 TREGLDELLRALAELLEETKAEAEAAE 343
>gnl|CDD|181574 PRK08881, rpsN, 30S ribosomal protein S14; Reviewed.
Length = 101
Score = 25.5 bits (57), Expect = 10.0
Identities = 10/38 (26%), Positives = 17/38 (44%), Gaps = 3/38 (7%)
Query: 126 KASKELIEKRRKLFKQFEELREKLRETW---EAEKEER 160
K+ +KR+KL ++ R +L+ EER
Sbjct: 4 KSMIAREKKRKKLVAKYAAKRAELKAIISDPNLSDEER 41
>gnl|CDD|216510 pfam01450, IlvC, Acetohydroxy acid isomeroreductase, catalytic
domain. Acetohydroxy acid isomeroreductase catalyzes
the conversion of acetohydroxy acids into dihydroxy
valerates. This reaction is the second in the synthetic
pathway of the essential branched side chain amino acids
valine and isoleucine.
Length = 146
Score = 26.1 bits (58), Expect = 10.0
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 15/76 (19%)
Query: 89 WRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREK 148
W PR + +K + +I K ++ T A KE + + + + R++
Sbjct: 79 WGPRIITEATKILMKEILKDIQNGTFA-------------KEWLLENSAGNPRLNKERKE 125
Query: 149 LRET--WEAEKEERKY 162
+RE + K+ RK
Sbjct: 126 IREHPIEKVGKKLRKL 141
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.131 0.389
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,163,757
Number of extensions: 958213
Number of successful extensions: 3190
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2933
Number of HSP's successfully gapped: 517
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (25.5 bits)