RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17635
(103 letters)
>gnl|CDD|222432 pfam13882, Bravo_FIGEY, C-terminal domain of Fibronectin type III.
This is the very C-terminal region of neural adhesion
molecule L1 proteins that are also known as Bravo or
NrCAM. It lies upstream of the IG and Fn3 domains and
has the highly conserved motif FIGEY. The function is
not known.
Length = 118
Score = 129 bits (326), Expect = 2e-40
Identities = 52/99 (52%), Positives = 68/99 (68%), Gaps = 4/99 (4%)
Query: 6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDY--PEEPGFNEY 63
AT GWFIG++ AIA L+L+L++VC IKRN+GGKY+V E+E A G + ++ GF EY
Sbjct: 2 FATQGWFIGLMCAIALLLLILLIVCFIKRNKGGKYSVKEKEDAPGDPEDQPMKDEGFGEY 61
Query: 64 SQPLDGKPRTRSSMSSEKHPV--ESDTDSMAEYGDGDTG 100
S+P D KP R S S + +S TDS+ EYGDGDTG
Sbjct: 62 SRPEDNKPLPRGSQQSLDGEIGDDSSTDSLDEYGDGDTG 100
>gnl|CDD|222017 pfam13273, DUF4064, Protein of unknown function (DUF4064).
Length = 96
Score = 34.5 bits (80), Expect = 0.002
Identities = 11/33 (33%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 4 DTVATAGWFIGMLLAIAFLILVLILVCLIKRNR 36
D ++T G +G+ AI IL +I L+K+
Sbjct: 48 DALSTLGTGLGIG-AIILSILGIIGSILVKKKP 79
>gnl|CDD|220342 pfam09685, Tic20, Tic20-like protein. Chloroplast function
requires the import of nuclear encoded proteins from
the cytoplasm across the chloroplast double membrane.
This is accomplished by two protein complexes, the Toc
complex located at the outer membrane and the Tic
complex located at the inner membrane. The Toc complex
recognises specific proteins by a cleavable N-terminal
sequence and is primarily responsible for translocation
through the outer membrane, while the Tic complex
translocates the protein through the inner membrane.
This entry represents Tic20, a core member of the Tic
complex. This protein is deeply embedded in the inner
envelope membrane and is thought to function as a
protein- conducting component of the Tic complex. This
family also includes many proteins of unknown function
from non-synthetic organisms.
Length = 104
Score = 33.3 bits (77), Expect = 0.006
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)
Query: 10 GWFIGMLLAIAFLI--LVLILVCLIKRNRGGKY 40
FIG LL I I LV ++ IK N G Y
Sbjct: 63 LGFIGFLLLILLWILSLVFTIIAAIKANNGEPY 95
>gnl|CDD|177566 PHA03242, PHA03242, envelope glycoprotein M; Provisional.
Length = 428
Score = 32.9 bits (75), Expect = 0.017
Identities = 21/86 (24%), Positives = 32/86 (37%), Gaps = 16/86 (18%)
Query: 16 LLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDYPEEPGFNEYSQPLDGKPRTRS 75
L A+A +L L+ L R ++ H R+A H Y R RS
Sbjct: 334 LFALAMAVLRLVRAYLYHRRHRSRFYGHVRDARHRARSYIR---------------RVRS 378
Query: 76 SM-SSEKHPVESDTDSMAEYGDGDTG 100
SM +S + P ++D + T
Sbjct: 379 SMRNSREAPYAGESDVGRDDDYASTD 404
>gnl|CDD|227155 COG4818, COG4818, Predicted membrane protein [Function unknown].
Length = 105
Score = 30.4 bits (69), Expect = 0.050
Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 4/34 (11%)
Query: 8 TAGWFI-GMLLAIAFLILVLILVCLIKRNRGGKY 40
GW + G++ AF+ L LVC+ K RG ++
Sbjct: 57 YIGWLLSGLVGLAAFI---LWLVCMYKAYRGERF 87
>gnl|CDD|147975 pfam06103, DUF948, Bacterial protein of unknown function
(DUF948). This family consists of bacterial sequences
several of which are thought to be general stress
proteins.
Length = 90
Score = 30.2 bits (69), Expect = 0.052
Identities = 9/21 (42%), Positives = 16/21 (76%)
Query: 14 GMLLAIAFLILVLILVCLIKR 34
+++AIAFL+LV+ L+ +K
Sbjct: 3 VLIIAIAFLVLVIFLIKTLKS 23
>gnl|CDD|225833 COG3296, COG3296, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 143
Score = 29.8 bits (67), Expect = 0.14
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 2/37 (5%)
Query: 6 VATAGWFIGMLLAIAFLI--LVLILVCLIKRNRGGKY 40
+ IG+L A I LVL ++ IK G +Y
Sbjct: 97 IGFFLTLIGILSIFALSILSLVLTIIAAIKAYEGQEY 133
>gnl|CDD|203869 pfam08173, YbgT_YccB, Membrane bound YbgT-like protein. This
family contains a set of membrane proteins, typically
33 amino acids long. The family has no known function,
but the protein is found in the operon CydAB in E.
coli. Members have a consensus motif (MWYFXW) which is
rich in aromatic residues. The protein forms a single
membrane-spanning helix. This family seems to be
restricted to Proteobacteria.
Length = 28
Score = 27.6 bits (62), Expect = 0.18
Identities = 7/17 (41%), Positives = 10/17 (58%)
Query: 11 WFIGMLLAIAFLILVLI 27
W +G+ LA F IL +
Sbjct: 6 WILGVGLACLFGILNAM 22
>gnl|CDD|222949 PHA02947, PHA02947, S-S bond formation pathway protein;
Provisional.
Length = 215
Score = 29.3 bits (66), Expect = 0.22
Identities = 12/30 (40%), Positives = 17/30 (56%), Gaps = 2/30 (6%)
Query: 11 WFIGMLLAIAFLILVLILVCLIKRNRGGKY 40
WFI + + LI+ +I +C IKR KY
Sbjct: 180 WFI--VGVVIILIIFVIAICSIKRKINLKY 207
>gnl|CDD|192825 pfam11778, SID, Septation initiation. This family is required for
activation of the spg1 GTPase signalling cascade which
leads to the initiation of septation and the subsequent
termination of mitosis. It may act as a scaffold at the
spindle pole body to which other components of the spg1
signalling cascade attach in pombe. In S.cerevisiae it
is both required for the proper formation of the spindle
pole body outer plaque and may also connect the outer
plaque to the central plaque embedded in the nuclear
envelope.
Length = 138
Score = 28.9 bits (65), Expect = 0.26
Identities = 12/41 (29%), Positives = 17/41 (41%), Gaps = 9/41 (21%)
Query: 63 YSQPLDGKPRTRSSMS---------SEKHPVESDTDSMAEY 94
YS+ LD K T + S KHP+E D + +
Sbjct: 94 YSKELDFKKFTDKAYSDYLKSHNIDEIKHPLEQCLDDLYKT 134
>gnl|CDD|187765 cd09325, TDT_C4-dicarb_trans, C4-dicarboxylate transporters of the
Tellurite-resistance/Dicarboxylate Transporter (TDT)
family. This subfamily contains bacterial
C4-dicarboxylate transporters, which is part of the
Tellurite-resistance/Dicarboxylate Transporter (TDT)
family. It includes Tellurite resistance protein tehA;
the tehA gene encodes an integral membrane protein that
has been shown to have efflux activity of quaternary
ammonium compounds. TehA protein of Escherichia coli
functions as a tellurite-resistance uptake permease.
Length = 293
Score = 29.1 bits (66), Expect = 0.31
Identities = 5/23 (21%), Positives = 12/23 (52%)
Query: 11 WFIGMLLAIAFLILVLILVCLIK 33
+ LL ++ L+ +L+ L +
Sbjct: 207 VLVLFLLILSQLLYFFVLLQLPR 229
>gnl|CDD|224216 COG1297, COG1297, Predicted membrane protein [Function unknown].
Length = 624
Score = 28.9 bits (65), Expect = 0.39
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 14 GMLLAIAFLILVLILVCLIKRNRGGKYAV 42
M+LA A + +VLI + + RGGK +
Sbjct: 516 SMILAGAGIGIVLIGIDERLKGRGGKLRL 544
>gnl|CDD|234507 TIGR04222, near_uncomplex, TIGR04222 domain. The majority of the
proteins with a domain as described by this model have
an extreme C-terminal sequence that is consists of
extremely low-complexity sequence, rich in Ser or in Gly
interspersed with Cys. That C-terminal region resembles
ribosomal natural product precursors, although there is
no evidence that C-terminal regions of these proteins
undergo any modification or have any such function.
Length = 229
Score = 28.1 bits (63), Expect = 0.66
Identities = 13/54 (24%), Positives = 25/54 (46%), Gaps = 2/54 (3%)
Query: 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAV--HEREAAHGRHDYPEEPGF 60
G+ + +LLA A L+ +++ +R R G+ A+ A R P++
Sbjct: 165 VGFLVLLLLATAVLLGLVLARRPNRRTRAGRRALARLRARHAGLRPYGPDDLAL 218
>gnl|CDD|205007 pfam12670, DUF3792, Protein of unknown function (DUF3792). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 130 amino acids in
length. These proteins are integral membrane proteins.
Length = 117
Score = 27.5 bits (62), Expect = 0.85
Identities = 8/23 (34%), Positives = 16/23 (69%), Gaps = 1/23 (4%)
Query: 10 GWFIGMLLAIAFLILVLILVCLI 32
GW G+L+ + + L+L+L+ L+
Sbjct: 67 GWLWGLLVGLLY-FLILLLISLL 88
>gnl|CDD|218122 pfam04515, Choline_transpo, Plasma-membrane choline transporter.
This family represents a high-affinity plasma-membrane
choline transporter in C.elegans which is thought to be
rate-limiting for ACh synthesis in cholinergic nerve
terminals.
Length = 327
Score = 27.9 bits (63), Expect = 0.88
Identities = 7/23 (30%), Positives = 15/23 (65%)
Query: 12 FIGMLLAIAFLILVLILVCLIKR 34
+ ++ + LIL+L+++ L KR
Sbjct: 1 IVAIIFTVLTLILLLLIIFLRKR 23
>gnl|CDD|238344 cd00625, ArsB_NhaD_permease, Anion permease ArsB/NhaD. These
permeases have been shown to translocate sodium,
arsenate, antimonite, sulfate and organic anions across
biological membranes in all three kingdoms of life. A
typical anion permease contains 8-13 transmembrane
helices and can function either independently as a
chemiosmotic transporter or as a channel-forming subunit
of an ATP-driven anion pump.
Length = 396
Score = 28.0 bits (63), Expect = 0.90
Identities = 10/52 (19%), Positives = 16/52 (30%)
Query: 10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDYPEEPGFN 61
F+ + A +L+L+L L R E + P P
Sbjct: 155 LDFLAFMAPPALGLLLLLLGLLYLLFRKKLLLPDEDKLTVLAEPLPARPLLK 206
>gnl|CDD|133477 cd06971, PgpA, Phosphatidylglycerophosphatase A; a bacterial
membrane-associated enzyme involved in lipid
metabolism. Phosphatidylglycerophosphatase A domain
represents a family of bacterial membrane-associated
enzymes involved in lipid metabolism. The prototype of
this CD is a putative Phosphatidylglycerophosphatase A
(PGPase A) from Listeria monocytogenes. PGPase A (EC:
3.1.3.27), encoded by the gene pgpA, specifically
catalyzes the formation of phosphatidylglycerol from
phosphatidyl glycerophosphate (PGP). It requires Mg2+
for activity and is inhibited by sulfhydryl agents and
freezing/thawing. PGPase B encoded from pgpB is not
included in this family, which also acts on
phosphatidic acid (PA) and lysophosphatidic acid (LPA).
Aside from PGPase A and B, evidence shows that there is
another PGPase existing in E. coli. Thus, PGPase A is
not essential for PGPase activity in E. coli.
Length = 143
Score = 27.5 bits (62), Expect = 0.95
Identities = 11/48 (22%), Positives = 18/48 (37%), Gaps = 5/48 (10%)
Query: 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDYPE 56
+ L + +L L+L L+ G +A + G HD E
Sbjct: 28 LELLLSELGKVLLELLALLLAFLL-----GIWAAEKTLKDLGVHDPGE 70
>gnl|CDD|217168 pfam02659, DUF204, Domain of unknown function DUF. This family
consists of hypothetical transmembrane proteins non of
which have any known function, the aligned region is
180 amino acids long.
Length = 66
Score = 26.6 bits (60), Expect = 0.95
Identities = 7/30 (23%), Positives = 10/30 (33%), Gaps = 9/30 (30%)
Query: 7 ATAGWFIGMLLA---------IAFLILVLI 27
G +G L I L+L+ I
Sbjct: 36 PLIGLLLGRLFGKFIGRYAEWIGGLLLIGI 65
>gnl|CDD|218173 pfam04610, TrbL, TrbL/VirB6 plasmid conjugal transfer protein.
Length = 223
Score = 27.5 bits (62), Expect = 0.95
Identities = 11/29 (37%), Positives = 17/29 (58%), Gaps = 3/29 (10%)
Query: 7 ATAGWF---IGMLLAIAFLILVLILVCLI 32
AT F +G +L+ A L+L+L L+ I
Sbjct: 148 ATRDMFENWLGQVLSYALLLLLLALIVGI 176
>gnl|CDD|234462 TIGR04086, TIGR04086_membr, putative membrane protein, TIGR04086
family. Members of this family of strongly hydrophobic
putative transmembrane protein average about 125 amino
acids in length and occur mostly, but not exclusively,
in the Firmicutes. Members are quite diverse in
sequence. The function is unknown.
Length = 115
Score = 27.1 bits (61), Expect = 1.1
Identities = 6/26 (23%), Positives = 15/26 (57%)
Query: 10 GWFIGMLLAIAFLILVLILVCLIKRN 35
GW G+L+ + + +++L++ L
Sbjct: 62 GWLWGLLVGLLYFLILLLISLLFGPG 87
>gnl|CDD|100078 cd05709, S2P-M50, Site-2 protease (S2P) class of zinc
metalloproteases (MEROPS family M50) cleaves
transmembrane domains of substrate proteins, regulating
intramembrane proteolysis (RIP) of diverse signal
transduction mechanisms. Members of this family use
proteolytic activity within the membrane to transfer
information across membranes to integrate gene
expression with physiologic stresses occurring in
another cellular compartment. The domain core structure
appears to contain at least three transmembrane helices
with a catalytic zinc atom coordinated by three
conserved residues contained within the consensus
sequence HExxH, together with a conserved aspartate
residue. The S2P/M50 family of RIP proteases is widely
distributed; in eukaryotic cells, they regulate such
processes as sterol and lipid metabolism, and
endoplasmic reticulum (ER) stress responses. In
sterol-depleted mammalian cells, a two-step proteolytic
process releases the N-terminal domains of sterol
regulatory element-binding proteins (SREBPs) from
membranes of the ER. These domains translocate into the
nucleus, where they activate genes of cholesterol and
fatty acid biosynthesis. It is the second proteolytic
step that is carried out by the SREBP Site-2 protease
(S2P) which is present in this CD superfamily.
Prokaryotic S2P/M50 homologs have been shown to regulate
stress responses, sporulation, cell division, and cell
differentiation. In Escherichia coli, the S2P homolog
RseP is involved in the sigmaE pathway of
extracytoplasmic stress responses, and in Bacillus
subtilis, the S2P homolog SpoIVFB is involved in the
pro-sigmaK pathway of spore formation. Some of the
subfamilies within this hierarchy contain one or two PDZ
domain insertions, with putative regulatory roles, such
as the inhibition of substrate cleavage as seen by the
RseP PDZ domain.
Length = 180
Score = 27.2 bits (61), Expect = 1.1
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 6 VATAGWFIGMLLAIAFLILVLILV 29
VA AG +LLA+ L+L+L+L
Sbjct: 78 VALAGPLANLLLALLLLLLLLLLG 101
>gnl|CDD|224882 COG1971, COG1971, Predicted membrane protein [Function unknown].
Length = 190
Score = 27.2 bits (61), Expect = 1.4
Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 9/29 (31%)
Query: 8 TAGWFIGMLLA---------IAFLILVLI 27
GWFIG L+ I F++L+++
Sbjct: 52 LIGWFIGKFLSTFIAEWAHWIGFVLLIIL 80
>gnl|CDD|222342 pfam13722, DUF4161, C-terminal domain on CstA (DUF4161). This
domain is found at the C=terminal of most known CstA
domain-containing proteins. The function is not known.
Length = 127
Score = 26.7 bits (60), Expect = 1.5
Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 3/28 (10%)
Query: 15 MLLAIAFLILVLILVCLIKRNRGGKYAV 42
+L A+A L+ + V L K + KYA
Sbjct: 87 LLAALALLL---VTVWLAKMGKPTKYAW 111
>gnl|CDD|218842 pfam05987, DUF898, Bacterial protein of unknown function
(DUF898). This family consists of several bacterial
proteins of unknown function. Some of the family
members are described as putative membrane proteins.
Length = 339
Score = 27.2 bits (61), Expect = 1.6
Identities = 6/18 (33%), Positives = 9/18 (50%)
Query: 12 FIGMLLAIAFLILVLILV 29
G L+A+ L+ I V
Sbjct: 66 LKGFLIALVLLVAYYIAV 83
>gnl|CDD|205390 pfam13209, DUF4017, Protein of unknown function (DUF4017). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 60 amino acids in
length.
Length = 60
Score = 25.9 bits (57), Expect = 1.6
Identities = 9/36 (25%), Positives = 16/36 (44%), Gaps = 2/36 (5%)
Query: 3 SDTVATAGW--FIGMLLAIAFLILVLILVCLIKRNR 36
S+ GW F+G AI ++ I+ I + +
Sbjct: 24 SEGYNHVGWKLFVGQAYAIPIFLITAIITFYINKKK 59
>gnl|CDD|233951 TIGR02614, ftsW, cell division protein FtsW. This family
consists of FtsW, an integral membrane protein with ten
transmembrane segments. In general, it is one of two
paralogs involved in peptidoglycan biosynthesis, the
other being RodA, and is essential for cell division.
All members of the seed alignment for this model are
encoded in operons for the biosynthesis of
UDP-N-acetylmuramoyl-pentapeptide, a precursor of
murein (peptidoglycan). The FtsW designation is not
used in endospore-forming bacterial (e.g. Bacillus
subtilis), where the member of this family is
designated SpoVE and three or more RodA/FtsW/SpoVE
family paralogs are present. SpoVE acts in spore cortex
formation and is dispensible for growth. Biological
rolls for FtsW in cell division include recruitment of
penicillin-binding protein 3 to the division site [Cell
envelope, Biosynthesis and degradation of murein
sacculus and peptidoglycan, Cellular processes, Cell
division].
Length = 356
Score = 27.1 bits (61), Expect = 1.7
Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 3/38 (7%)
Query: 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIKRNRGG 38
+P I L IA ++LVL+L+ I + G
Sbjct: 58 LPLRFWRKLSVPI---LLIAIVLLVLVLIPGIGKEVNG 92
>gnl|CDD|222674 pfam14316, DUF4381, Domain of unknown function (DUF4381). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 158 and
180 amino acids in length.
Length = 147
Score = 26.9 bits (60), Expect = 1.7
Identities = 10/29 (34%), Positives = 16/29 (55%), Gaps = 3/29 (10%)
Query: 8 TAGWFIGMLLAIAFLILVLILVCLIKRNR 36
GW++ L L+L+L+LV + R R
Sbjct: 20 APGWWL---LLALLLLLLLLLVYRLWRRR 45
>gnl|CDD|187762 cd09322, TDT_TehA_like, The Tellurite-resistance/Dicarboxylate
Transporter (TDT) family includes TehA proteins. The
Tellurite-resistance/Dicarboxylate Transporter (TDT)
family includes members from all three kingdoms, but
only three members of the family have been functionally
characterized: the TehA protein of E. coli functioning
as a tellurite-resistance uptake permease, the Mae1
protein of S. pombe functioning in the uptake of malate
and other dicarboxylates, and the sulfite efflux pump
(SSU1) of Saccharomyces cerevisiae. In plants, the
plasma membrane protein SLAC1 (Slow Anion
Channel-Associated 1), which is preferentially expressed
in guard cells, encodes a distant homolog of fungal and
bacterial dicarboxylate/malic acid transport proteins.
SLAC1 is essential in mediating stomatal responses to
physiological and stress stimuli. Members of the TDT
family exhibit 10 putative transmembrane a-helical
spanners (TMSs).
Length = 289
Score = 27.1 bits (61), Expect = 1.8
Identities = 7/28 (25%), Positives = 14/28 (50%)
Query: 11 WFIGMLLAIAFLILVLILVCLIKRNRGG 38
F +LL +A + +L+L L + +
Sbjct: 212 LFALILLGLALFLFLLLLRLLPRLLKLP 239
>gnl|CDD|112753 pfam03954, Lectin_N, Hepatic lectin, N-terminal domain.
Length = 138
Score = 26.7 bits (59), Expect = 1.8
Identities = 8/22 (36%), Positives = 15/22 (68%)
Query: 11 WFIGMLLAIAFLILVLILVCLI 32
+LL++ IL+L++VC+I
Sbjct: 32 GTRLLLLSLGLSILLLVVVCVI 53
>gnl|CDD|100079 cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2
protease (S2P), zinc metalloproteases (MEROPS family
M50) which cleave transmembrane domains of substrate
proteins, regulating intramembrane proteolysis (RIP) of
diverse signal transduction mechanisms. Members of the
S2P/M50 family of RIP proteases use proteolytic activity
within the membrane to transfer information across
membranes to integrate gene expression with physiologic
stresses occurring in another cellular compartment. In
eukaryotic cells they regulate such processes as sterol
and lipid metabolism, and endoplasmic reticulum stress
responses. In prokaryotes they regulate such processes
as sporulation, cell division, stress response, and cell
differentiation. This group includes bacterial,
eukaryotic, and Archaeal S2P/M50s homologs with a
minimal core protein and no PDZ domains.
Length = 181
Score = 26.7 bits (60), Expect = 1.8
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 6 VATAGWFIGMLLAIAFLILVLILV 29
V+ AG +LLA+ F +L+ +L
Sbjct: 84 VSLAGPLSNLLLALLFALLLRLLP 107
>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
Length = 542
Score = 27.2 bits (61), Expect = 1.9
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 11 WFIGMLLAIAFLILVLILVCLIKR 34
WF+ ++ A +L+LIL+ + R
Sbjct: 436 WFMDLIKLAAGALLILILIFFVLR 459
>gnl|CDD|225632 COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small
permease component [Carbohydrate transport and
metabolism].
Length = 177
Score = 26.5 bits (59), Expect = 1.9
Identities = 8/45 (17%), Positives = 18/45 (40%), Gaps = 4/45 (8%)
Query: 11 WFIGMLLAIAFLILVLI----LVCLIKRNRGGKYAVHEREAAHGR 51
++ + L I +++ L L+ L++ + E E A
Sbjct: 130 GWVYLALPIGGVLMALRFLERLIRLLRGKKVVLELADEAEEAVIL 174
>gnl|CDD|177101 MTH00009, COX3, cytochrome c oxidase subunit III; Validated.
Length = 259
Score = 26.7 bits (59), Expect = 2.0
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 5 TVATAGWFIGMLLAIAFLILVLILVCLIK 33
TV A WF G L L++I++ +I+
Sbjct: 26 TVGLASWFHGYGTLCLILGLIIIILTMIQ 54
>gnl|CDD|216682 pfam01757, Acyl_transf_3, Acyltransferase family. This family
includes a range of acyltransferase enzymes. This domain
is found in many as yet uncharacterized C. elegans
proteins and it is approximately 300 amino acids long.
Length = 326
Score = 26.7 bits (59), Expect = 2.0
Identities = 7/38 (18%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 5 TVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAV 42
WF+ L F +L+ +L+ L+++ + +
Sbjct: 120 GPNGHLWFLPAL--FVFYLLLPLLLRLLRKLKKLLLLL 155
>gnl|CDD|237262 PRK12997, PRK12997, PTS system ascorbate-specific transporter
subunit IIC; Reviewed.
Length = 466
Score = 26.8 bits (60), Expect = 2.1
Identities = 8/22 (36%), Positives = 12/22 (54%)
Query: 7 ATAGWFIGMLLAIAFLILVLIL 28
T G IGM + I F ++I+
Sbjct: 344 GTIGGLIGMGILIVFKSPIVII 365
>gnl|CDD|191585 pfam06695, Sm_multidrug_ex, Putative small multi-drug export
protein. This family contains a small number of
putative small multi-drug export proteins.
Length = 121
Score = 26.3 bits (59), Expect = 2.2
Identities = 13/51 (25%), Positives = 20/51 (39%), Gaps = 2/51 (3%)
Query: 12 FIGMLLAIAFLILVL--ILVCLIKRNRGGKYAVHEREAAHGRHDYPEEPGF 60
IG LL + F++L L + L + K A + E+ GF
Sbjct: 24 VIGNLLPVPFILLFLRKVFKWLKRTKLFKKIYDWLERRARKASEKVEKYGF 74
>gnl|CDD|165538 PHA03281, PHA03281, envelope glycoprotein E; Provisional.
Length = 642
Score = 27.0 bits (59), Expect = 2.2
Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 3/46 (6%)
Query: 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDY 54
A G A+A L L + L+C K+ Y +AA+G+ Y
Sbjct: 557 AAI-TGGFAALALLCLAIALICTAKKFGHKAYRSD--KAAYGQSMY 599
>gnl|CDD|224535 COG1620, LldP, L-lactate permease [Energy production and
conversion].
Length = 522
Score = 26.8 bits (60), Expect = 2.3
Identities = 7/29 (24%), Positives = 13/29 (44%)
Query: 13 IGMLLAIAFLILVLILVCLIKRNRGGKYA 41
+ + +A L +VL + L+ G A
Sbjct: 2 LWLSALVALLPIVLFFLLLVVLKLKGYVA 30
>gnl|CDD|224214 COG1295, Rbn, Ribonuclease BN family enzyme [Replication,
recombination, and repair].
Length = 303
Score = 26.5 bits (59), Expect = 2.4
Identities = 6/19 (31%), Positives = 12/19 (63%)
Query: 11 WFIGMLLAIAFLILVLILV 29
W +G LL + F +L+ ++
Sbjct: 147 WTLGTLLPLLFALLLSVVG 165
>gnl|CDD|215966 pfam00517, GP41, Retroviral envelope protein. This family includes
envelope protein from a variety of retroviruses. It
includes the GP41 subunit of the envelope protein
complex from human and simian immunodeficiency viruses
(HIV and SIV) which mediate membrane fusion during viral
entry. The family also includes bovine immunodeficiency
virus, feline immunodeficiency virus and Equine
infectious anaemia (EIAV). The family also includes the
Gp36 protein from mouse mammary tumour virus (MMTV) and
human endogenous retroviruses (HERVs).
Length = 204
Score = 26.2 bits (58), Expect = 2.6
Identities = 7/26 (26%), Positives = 15/26 (57%)
Query: 11 WFIGMLLAIAFLILVLILVCLIKRNR 36
+I + + I +I+VL ++ + RN
Sbjct: 155 SYIKIGILILLIIIVLRILPGVLRNC 180
>gnl|CDD|223445 COG0368, CobS, Cobalamin-5-phosphate synthase [Coenzyme
metabolism].
Length = 246
Score = 26.5 bits (59), Expect = 2.8
Identities = 5/20 (25%), Positives = 10/20 (50%)
Query: 9 AGWFIGMLLAIAFLILVLIL 28
G +G++ A+ IL +
Sbjct: 35 VGLLLGLIAALVAYILAPLF 54
>gnl|CDD|224412 COG1495, DsbB, Disulfide bond formation protein DsbB
[Posttranslational modification, protein turnover,
chaperones].
Length = 170
Score = 26.3 bits (58), Expect = 3.0
Identities = 8/30 (26%), Positives = 15/30 (50%), Gaps = 5/30 (16%)
Query: 5 TVATAGWFIGMLLAIAFLILVLILVCLIKR 34
++ W L + F L+L+L+ L+ R
Sbjct: 141 GLSMPPW-----LLLIFFALLLLLLLLLFR 165
>gnl|CDD|222695 pfam14342, DUF4396, Domain of unknown function (DUF4396). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria, archaea
and eukaryotes. Proteins in this family are typically
between 167 and 310 amino acids in length.
Length = 139
Score = 26.0 bits (58), Expect = 3.1
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 4/51 (7%)
Query: 1 MPSDTVATAGWFIG--MLLAI--AFLILVLILVCLIKRNRGGKYAVHEREA 47
M T GW + M+LA+ AFL + + L++R G Y + A
Sbjct: 25 MWILTAYLLGWSMAGIMILAVALAFLFGIFLQTFLLRRMAGLSYGAALKTA 75
>gnl|CDD|216251 pfam01027, Bax1-I, Inhibitor of apoptosis-promoting Bax1.
Programmed cell-death involves a set of Bcl-2 family
proteins, some of which inhibit apoptosis (Bcl-2 and
Bcl-XL) and some of which promote it (Bax and Bak).
Human Bax inhibitor, BI-1, is an evolutionarily
conserved integral membrane protein containing multiple
membrane-spanning segments predominantly localised to
intracellular membranes. It has 6-7 membrane-spanning
domains. The C termini of the mammalian BI-1 proteins
are comprised of basic amino acids resembling some
nuclear targeting sequences, but otherwise the
predicted proteins lack motifs that suggest a function.
As plant BI-1 appears to localise predominantly to the
ER, we hypothesized that plant BI-1 could also regulate
cell death triggered by ER stress. BI-1 appears to
exert its effect through an interaction with
calmodulin.
Length = 205
Score = 26.0 bits (58), Expect = 3.1
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 8 TAGWFIGMLLAIAFLILVLILVCLIKRN 35
+++G+LL +A LI V +L IK+
Sbjct: 36 FPLFWVGLLLPLALLIFVFLLFSRIKKK 63
>gnl|CDD|239486 cd03392, PAP2_like_2, PAP2_like_2 proteins. PAP2 is a super-family
of phosphatases and haloperoxidases. This subgroup,
which is specific to bacteria, lacks functional
characterization and may act as a membrane-associated
lipid phosphatase.
Length = 182
Score = 26.0 bits (58), Expect = 3.2
Identities = 11/28 (39%), Positives = 17/28 (60%), Gaps = 6/28 (21%)
Query: 2 PSDTVATAGWFIGMLLAIAFLILVLILV 29
PSD +A G LL +A+L L+++L
Sbjct: 158 PSDVLA------GWLLGLAWLALLILLY 179
>gnl|CDD|219559 pfam07760, DUF1616, Protein of unknown function (DUF1616). This is
a family of sequences from hypothetical archaeal
proteins. The region in question is approximately 330
amino acid residues long.
Length = 290
Score = 26.1 bits (58), Expect = 3.2
Identities = 13/79 (16%), Positives = 26/79 (32%), Gaps = 23/79 (29%)
Query: 11 WFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDYPEEPGFNEYSQPLDGK 70
I +LL+++ L+L L+ ++R R P + F
Sbjct: 85 RLIPILLSLSLFTLILCLIAYLRRRR-----------------LPNDERFIVL------F 121
Query: 71 PRTRSSMSSEKHPVESDTD 89
S++ + S +D
Sbjct: 122 RALDSALKIGLYLPNSYSD 140
>gnl|CDD|217505 pfam03348, Serinc, Serine incorporator (Serinc). This is a family
of eukaryotic membrane proteins which incorporate serine
into membranes and facilitate the synthesis of the
serine-derived lipids phosphatidylserine and
sphingolipid. Members of this family contain 11
transmembrane domains and form intracellular complexes
with key enzymes involved in serine and sphingolipid
biosynthesis.
Length = 428
Score = 26.1 bits (58), Expect = 3.3
Identities = 7/28 (25%), Positives = 16/28 (57%)
Query: 1 MPSDTVATAGWFIGMLLAIAFLILVLIL 28
+P+ +IG++ + F+++ LIL
Sbjct: 128 IPNGFFFFVWMYIGVIGSFLFILIQLIL 155
>gnl|CDD|227680 COG5391, COG5391, Phox homology (PX) domain protein
[Intracellular trafficking and secretion / General
function prediction only].
Length = 524
Score = 26.3 bits (58), Expect = 3.4
Identities = 14/44 (31%), Positives = 16/44 (36%)
Query: 56 EEPGFNEYSQPLDGKPRTRSSMSSEKHPVESDTDSMAEYGDGDT 99
+ S PLD + R SS K P SD S G T
Sbjct: 39 SPVNSSIKSTPLDIQKRYSGFESSAKLPRISDAPSFVPPPGGHT 82
>gnl|CDD|227109 COG4768, COG4768, Uncharacterized protein containing a divergent
version of the methyl-accepting chemotaxis-like domain
[General function prediction only].
Length = 139
Score = 25.9 bits (57), Expect = 3.5
Identities = 9/21 (42%), Positives = 16/21 (76%)
Query: 14 GMLLAIAFLILVLILVCLIKR 34
++AIAFL+LV+ L+ +K+
Sbjct: 8 LAIIAIAFLVLVIYLIITLKK 28
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 26.1 bits (57), Expect = 3.7
Identities = 15/46 (32%), Positives = 22/46 (47%)
Query: 56 EEPGFNEYSQPLDGKPRTRSSMSSEKHPVESDTDSMAEYGDGDTGT 101
E E S P + + R S S E+ V+SD D + + D D+ T
Sbjct: 231 GEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSET 276
>gnl|CDD|221116 pfam11457, DUF3021, Protein of unknown function (DUF3021). This is
a bacterial family of uncharacterized proteins.
Length = 134
Score = 25.7 bits (57), Expect = 3.7
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 9 AGWFIGMLLAIAFLILVLILVCLI 32
GWF +A+ ++ IL+ LI
Sbjct: 91 NGWFPLSGIALLLFTIIFILIYLI 114
>gnl|CDD|223701 COG0628, yhhT, Predicted permease, member of the PurR regulon
[General function prediction only].
Length = 355
Score = 26.1 bits (58), Expect = 3.9
Identities = 6/29 (20%), Positives = 15/29 (51%), Gaps = 5/29 (17%)
Query: 6 VATAGW-----FIGMLLAIAFLILVLILV 29
++ G F+G++LA ++ +L+
Sbjct: 309 LSLLGGGSLFGFVGLILAPPLAAVLKVLL 337
>gnl|CDD|234691 PRK00220, PRK00220, putative glycerol-3-phosphate acyltransferase
PlsY; Provisional.
Length = 198
Score = 25.9 bits (58), Expect = 4.0
Identities = 9/24 (37%), Positives = 14/24 (58%), Gaps = 3/24 (12%)
Query: 6 VATAGWFIGMLLAIAFLILVLILV 29
VATA G+LL I L+ +++
Sbjct: 108 VATA---AGVLLGIGPLLALILAA 128
>gnl|CDD|215647 pfam00002, 7tm_2, 7 transmembrane receptor (Secretin family).
This family is known as Family B, the secretin-receptor
family or family 2 of the G-protein-coupled receptors
(GCPRs).They have been described in many animal
species, but not in plants, fungi or prokaryotes. Three
distinct sub-families are recognised. Subfamily B1
contains classical hormone receptors, such as receptors
for secretin and glucagon, that are all involved in
cAMP-mediated signalling pathways. Subfamily B2
contains receptors with long extracellular N-termini,
such as the leukocyte cell-surface antigen CD97 ;
calcium-independent receptors for latrotoxin, and
brain-specific angiogenesis inhibitors amongst others.
Subfamily B3 includes Methuselah and other Drosophila
proteins. Other than the typical seven-transmembrane
region, characteristic structural features include an
amino-terminal extracellular domain involved in ligand
binding, and an intracellular loop (IC3) required for
specific G-protein coupling.
Length = 239
Score = 26.0 bits (58), Expect = 4.1
Identities = 6/33 (18%), Positives = 16/33 (48%)
Query: 4 DTVATAGWFIGMLLAIAFLILVLILVCLIKRNR 36
+ + +G L++ L+L + + L ++ R
Sbjct: 1 ALLLKVIYTVGYSLSLVALLLAIAIFLLFRKLR 33
>gnl|CDD|217759 pfam03845, Spore_permease, Spore germination protein.
Length = 320
Score = 26.0 bits (58), Expect = 4.2
Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 2/30 (6%)
Query: 7 ATAGWFIGMLLA-IAFLILVLILVCLIKRN 35
A GW I +LL + L+LVL+ L+K+
Sbjct: 31 AGDGW-IAVLLGGLIGLLLVLLYYKLLKKF 59
>gnl|CDD|224945 COG2034, COG2034, Predicted membrane protein [Function unknown].
Length = 85
Score = 24.9 bits (55), Expect = 4.3
Identities = 9/23 (39%), Positives = 15/23 (65%), Gaps = 1/23 (4%)
Query: 12 FIGMLLAIAFLILVLILVCLIKR 34
I ++LA+ +IL L+L L+ R
Sbjct: 64 IISVVLALILMILWLLLF-LLGR 85
>gnl|CDD|184968 PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit;
Provisional.
Length = 261
Score = 25.9 bits (57), Expect = 4.3
Identities = 11/37 (29%), Positives = 16/37 (43%)
Query: 10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHERE 46
W I + + + LVL L L++R K A E
Sbjct: 28 VWLIIAGVLLVGMWLVLGLHALLRRRLAPKSATGHGE 64
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 26.0 bits (58), Expect = 4.4
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 11 WFIGMLLAIAFLILVLILVCLIKRNR 36
I +++AI +I++ L LI R R
Sbjct: 1 GMIYLIIAIVVIIIIAYLAGLILRKR 26
>gnl|CDD|115263 pfam06593, RBDV_coat, Raspberry bushy dwarf virus coat protein.
This family consists of several Raspberry bushy dwarf
virus coat proteins.
Length = 274
Score = 25.9 bits (56), Expect = 4.5
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 59 GFNEYSQPLDGKPRTRSSMSSE 80
GF+E S P GKP SMS++
Sbjct: 40 GFSEGSNPATGKPHADMSMSAK 61
>gnl|CDD|214448 MTH00173, ATP6, ATP synthase F0 subunit 6; Provisional.
Length = 231
Score = 25.6 bits (57), Expect = 4.5
Identities = 7/24 (29%), Positives = 10/24 (41%)
Query: 9 AGWFIGMLLAIAFLILVLILVCLI 32
G L ++ L L LI + L
Sbjct: 65 GLNLGGFSLLLSSLFLFLISLNLS 88
>gnl|CDD|224196 COG1277, NosY, ABC-type transport system involved in multi-copper
enzyme maturation, permease component [General function
prediction only].
Length = 278
Score = 25.9 bits (57), Expect = 4.7
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 12 FIGMLLAIAFLILVLILVCLI 32
F+G LL I +IL+ + L
Sbjct: 110 FLGALLVILIIILISFISLLT 130
>gnl|CDD|216390 pfam01252, Peptidase_A8, Signal peptidase (SPase) II.
Length = 143
Score = 25.5 bits (57), Expect = 4.8
Identities = 9/35 (25%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA 41
+ WF L IA L ++ +L L++ + + A
Sbjct: 50 SGQRWF---LSLIALLAILALLYYLLRLKKRSRLA 81
>gnl|CDD|188151 TIGR01517, ATPase-IIB_Ca, plasma-membrane calcium-translocating
P-type ATPase. This model describes the P-type ATPase
responsible for translocating calcium ions across the
plasma membrane of eukaryotes , out of the cell. In
some organisms, this type of pump may also be found in
vacuolar membranes. In humans and mice, at least, there
are multiple isoforms of the PMCA pump with overlapping
but not redundant functions. Accordingly, there are no
human diseases linked to PMCA defects, although
alterations of PMCA function do elicit physiological
effects. The calcium P-type ATPases have been
characterized as Type IIB based on a phylogenetic
analysis which distinguishes this group from the Type
IIA SERCA calcium pump. A separate analysis divides
Type IIA into sub-types (SERCA and PMR1) which are
modelled by TIGR01116 and TIGR01522. This model is well
separated from those.
Length = 944
Score = 25.9 bits (57), Expect = 4.8
Identities = 7/23 (30%), Positives = 13/23 (56%)
Query: 36 RGGKYAVHEREAAHGRHDYPEEP 58
R + RE +G+++ PE+P
Sbjct: 61 RLSSSTLERREKVYGKNELPEKP 83
>gnl|CDD|217749 pfam03824, NicO, High-affinity nickel-transport protein. High
affinity nickel transporters involved in the
incorporation of nickel into H2-uptake hydrogenase and
urease enzymes. Essential for the expression of
catalytically active hydrogenase and urease. Ion uptake
is dependent on proton motive force. HoxN in Alcaligenes
eutrophus is thought to be an integral membrane protein
with seven transmembrane helices. The family also
includes a cobalt transporter.
Length = 278
Score = 25.8 bits (57), Expect = 4.9
Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 8/48 (16%)
Query: 13 IGMLLAIAFLILVLILVCLIKRN--------RGGKYAVHEREAAHGRH 52
IG L++ +FL+++ + L+ R R G+ + EAA +
Sbjct: 79 IGTLVSGSFLLIIGLGNLLLLRELRHLFRKLRRGRAPEPDLEAALQKR 126
>gnl|CDD|224211 COG1292, BetT, Choline-glycine betaine transporter [Cell envelope
biogenesis, outer membrane].
Length = 537
Score = 25.7 bits (57), Expect = 5.3
Identities = 8/25 (32%), Positives = 11/25 (44%), Gaps = 1/25 (4%)
Query: 6 VATAGWFIGMLLAIAFLILVLILVC 30
GW+ +L + FL VL L
Sbjct: 48 TNNFGWYY-ILTVLLFLGFVLYLAF 71
>gnl|CDD|165201 PHA02867, PHA02867, C-type lectin protein; Provisional.
Length = 167
Score = 25.4 bits (55), Expect = 5.4
Identities = 4/23 (17%), Positives = 12/23 (52%)
Query: 11 WFIGMLLAIAFLILVLILVCLIK 33
+ I +L + ++ +++V K
Sbjct: 15 YAITVLGILCLILFTILVVVTCK 37
>gnl|CDD|220507 pfam09991, DUF2232, Predicted membrane protein (DUF2232). This
domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 290
Score = 25.6 bits (57), Expect = 5.4
Identities = 8/34 (23%), Positives = 16/34 (47%)
Query: 5 TVATAGWFIGMLLAIAFLILVLILVCLIKRNRGG 38
G + + A+ F +L L+L L+K+ +
Sbjct: 51 LGLFGGPILALPYALQFGLLGLVLGYLLKKKKSP 84
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 25.8 bits (57), Expect = 5.6
Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 12/44 (27%)
Query: 10 GWFIGMLLAIAFLILV-LILVCL-----IKRN---RGGKYAVHE 44
GW IG +L F+ V L+LV + I+ N R KY V E
Sbjct: 169 GWLIGHIL---FMKWVGLVLVWIQQNNSIRSNVLIRSNKYLVSE 209
>gnl|CDD|223629 COG0555, CysU, ABC-type sulfate transport system, permease
component [Posttranslational modification, protein
turnover, chaperones].
Length = 274
Score = 25.7 bits (57), Expect = 5.7
Identities = 10/34 (29%), Positives = 16/34 (47%)
Query: 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIKR 34
S + G +G+ L LIL+L L L+ +
Sbjct: 4 KRSRVIPGFGLLLGLTLLFLLLILLLPLSALVTK 37
>gnl|CDD|239477 cd03382, PAP2_dolichyldiphosphatase, PAP2_like proteins,
dolichyldiphosphatase subfamily. Dolichyldiphosphatase
is a membrane-associated protein located in the
endoplasmic reticulum and hydrolyzes dolichyl
pyrophosphate, as well as dolichylmonophosphate at a
low rate. The enzyme is necessary for maintaining
proper levels of dolichol-linked oligosaccharides and
protein N-glycosylation, and might play a role in
re-utilization of the glycosyl carrier lipid for
additional rounds of lipid intermediate biosynthesis
after its release during protein N-glycosylation
reactions.
Length = 159
Score = 25.3 bits (56), Expect = 5.8
Identities = 9/24 (37%), Positives = 11/24 (45%)
Query: 9 AGWFIGMLLAIAFLILVLILVCLI 32
G + LLA L+ V ILV
Sbjct: 10 PGDLLSFLLAYLSLLPVAILVGYA 33
>gnl|CDD|220520 pfam10011, DUF2254, Predicted membrane protein (DUF2254).
Members of this family of bacterial proteins comprises
various hypothetical and putative membrane proteins.
Their exact function, has not, as yet, been defined.
Length = 369
Score = 25.6 bits (57), Expect = 6.0
Identities = 6/21 (28%), Positives = 10/21 (47%)
Query: 11 WFIGMLLAIAFLILVLILVCL 31
WF L A+ ++L + L
Sbjct: 6 WFRPTLYALLAVLLAFVAPLL 26
>gnl|CDD|182684 PRK10734, PRK10734, putative calcium/sodium:proton antiporter;
Provisional.
Length = 325
Score = 25.4 bits (56), Expect = 6.0
Identities = 9/15 (60%), Positives = 12/15 (80%)
Query: 15 MLLAIAFLILVLILV 29
MLLA A LI+ L+L+
Sbjct: 1 MLLATALLIIGLLLL 15
>gnl|CDD|131664 TIGR02615, spoVE, stage V sporulation protein E. This model
represents an exception within the members of the FtsW
model TIGR02614. This exception occurs only in
endospore-forming genera such as Bacillus, Geobacillus,
and Oceanobacillus. Like FtsW, members are found in a
peptidoglycan operon context, but in these genera they
part of a larger set of paralogs (not just the pair
FtsW and RodA) and are required specifically for
sporulation, not for viability [Cell envelope,
Biosynthesis and degradation of murein sacculus and
peptidoglycan, Cellular processes, Sporulation and
germination].
Length = 354
Score = 25.5 bits (56), Expect = 6.2
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 14 GMLLAIAFLILVLILVCLIKRNRGG 38
ML+ I F++L+L+L+ + R G
Sbjct: 68 KMLMVICFVLLLLVLIPGVGMERNG 92
>gnl|CDD|131337 TIGR02284, TIGR02284, conserved hypothetical protein. Members of
this protein family are found mostly in the
Proteobacteria, although one member is found in the the
marine planctomycete Pirellula sp. strain 1. The
function is unknown.
Length = 139
Score = 25.2 bits (55), Expect = 6.6
Identities = 16/72 (22%), Positives = 21/72 (29%), Gaps = 16/72 (22%)
Query: 25 VLILVCLIKRNRGGKYAVHE--------------REAAHGRHDYPEEPGFNEYSQPLDGK 70
+ L LI+ + GK E R A + E + L GK
Sbjct: 2 IHSLNDLIEISIDGKDGFEESAEEVKDPELATLFRRIAGEKSAIVSE--LQQVVASLGGK 59
Query: 71 PRTRSSMSSEKH 82
P SM H
Sbjct: 60 PEDHGSMVGSLH 71
>gnl|CDD|226657 COG4194, COG4194, Predicted membrane protein [General function
prediction only].
Length = 350
Score = 25.5 bits (56), Expect = 6.8
Identities = 6/20 (30%), Positives = 11/20 (55%)
Query: 10 GWFIGMLLAIAFLILVLILV 29
W + + LI+VLI++
Sbjct: 330 AWLFIAVFVVPALIIVLIVL 349
>gnl|CDD|226645 COG4177, LivM, ABC-type branched-chain amino acid transport system,
permease component [Amino acid transport and
metabolism].
Length = 314
Score = 25.3 bits (56), Expect = 7.0
Identities = 11/36 (30%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 15 MLLAIAFLILVLILVCLIKRNRGGKY--AVHEREAA 48
LA+ L+L L+++ + R+ G+ A+ E E A
Sbjct: 160 YYLALLLLLLTLLVLERLVRSPFGRALRAIREDEIA 195
>gnl|CDD|233832 TIGR02353, NRPS_term_dom, non-ribosomal peptide synthetase terminal
domain of unknown function. This domain is found
exclusively in non-ribosomal peptide synthetases and
always as the final domain in the polypeptide. This
domain is roughly 700 amino acids in size and is found
in polypeptides roughly twice that size.
Length = 695
Score = 25.5 bits (56), Expect = 7.2
Identities = 10/52 (19%), Positives = 19/52 (36%), Gaps = 9/52 (17%)
Query: 3 SDTVATAGW---FIGMLLAIAFLILVLILVCLIKRN------RGGKYAVHER 45
+ G+ + +A+A I +L+ R + G+Y VH
Sbjct: 266 TLGPDMVGFILALVLTFVALAGFIAYTVLLLAAVRLLLNLVLKPGRYYVHSG 317
>gnl|CDD|225088 COG2177, FtsX, Cell division protein [Cell division and chromosome
partitioning].
Length = 297
Score = 25.3 bits (56), Expect = 7.2
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 3 SDTVATAGWFIGMLLAIAFLILV 25
V T G + +LLA+A ++L+
Sbjct: 163 LRLVRTVGIGLSILLALAAVLLI 185
>gnl|CDD|223076 PHA03399, pif3, per os infectivity factor 3; Provisional.
Length = 200
Score = 25.2 bits (56), Expect = 7.3
Identities = 4/18 (22%), Positives = 11/18 (61%)
Query: 15 MLLAIAFLILVLILVCLI 32
M+ + L+++L++V
Sbjct: 1 MIWGLLVLLIILLIVYYY 18
>gnl|CDD|223633 COG0559, LivH, Branched-chain amino acid ABC-type transport system,
permease components [Amino acid transport and
metabolism].
Length = 297
Score = 25.2 bits (56), Expect = 7.3
Identities = 13/46 (28%), Positives = 26/46 (56%), Gaps = 5/46 (10%)
Query: 3 SDTVATAGWF-IGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREA 47
++++ +A +F +AFL+L+L+L L++ G + ERE
Sbjct: 256 AESLVSAKYFGSEYKDVVAFLLLILVL--LVR--PQGLFGRKEREK 297
>gnl|CDD|216852 pfam02028, BCCT, BCCT family transporter.
Length = 484
Score = 25.2 bits (56), Expect = 7.3
Identities = 8/16 (50%), Positives = 12/16 (75%)
Query: 13 IGMLLAIAFLILVLIL 28
+ M LA+A L+ VLI+
Sbjct: 248 LNMWLALALLLFVLIV 263
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 25.0 bits (55), Expect = 7.5
Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 1/27 (3%)
Query: 11 WFI-GMLLAIAFLILVLILVCLIKRNR 36
WF+ G + L+L LIL L R R
Sbjct: 166 WFLYGGGVLFLGLLLGLILPRLPPRRR 192
>gnl|CDD|238233 cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel. The
ClC chloride channels catalyse the selective flow of Cl-
ions across cell membranes, thereby regulating
electrical excitation in skeletal muscle and the flow of
salt and water across epithelial barriers. This domain
is found in the halogen ions (Cl-, Br- and I-) transport
proteins of the ClC family. The ClC channels are found
in all three kingdoms of life and perform a variety of
functions including cellular excitability regulation,
cell volume regulation, membrane potential
stabilization, acidification of intracellular
organelles, signal transduction, transepithelial
transport in animals, and the extreme acid resistance
response in eubacteria. They lack any structural or
sequence similarity to other known ion channels and
exhibit unique properties of ion permeation and gating.
Unlike cation-selective ion channels, which form
oligomers containing a single pore along the axis of
symmetry, the ClC channels form two-pore homodimers with
one pore per subunit without axial symmetry. Although
lacking the typical voltage-sensor found in cation
channels, all studied ClC channels are gated (opened and
closed) by transmembrane voltage. The gating is
conferred by the permeating ion itself, acting as the
gating charge. In addition, eukaryotic and some
prokaryotic ClC channels have two additional C-terminal
CBS (cystathionine beta synthase) domains of putative
regulatory function.
Length = 383
Score = 25.2 bits (56), Expect = 7.5
Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)
Query: 7 ATAGWF-IGMLLAIAFLILVLILVCLIKRNRGGKYA 41
A AG + +LL + L L+ + L GG +A
Sbjct: 273 ALAGELSLLLLLLLLLLKLLATALTLGSGFPGGVFA 308
>gnl|CDD|223187 COG0109, CyoE, Polyprenyltransferase (cytochrome oxidase assembly
factor) [Posttranslational modification, protein
turnover, chaperones].
Length = 304
Score = 25.3 bits (56), Expect = 7.7
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 12 FIGMLLAIAFLILVLILVCLI 32
F GMLLA I L+L+ +
Sbjct: 38 FAGMLLAPRGSINPLLLLLTL 58
>gnl|CDD|200505 cd11244, Sema_plexin_A, The Sema domain, a protein interacting
module, of Plexin A. Plexins serve as receptors of
semaphorins and may be the ancestor of semaphorins.
Members of the Plexin A subfamily are receptors for
Sema1s, Sema3s, and Sema6s, and they mediate diverse
biological functions including axon guidance,
cardiovascular development, and immune function.
Guanylyl cyclase Gyc76C and Off-track kinase (OTK), a
putative receptor tyrosine kinase, modulate
Sema1a-Plexin A mediated axon repulsion. Sema3s do not
interact directly with plexin A receptors, but instead
bind Neuropilin-1 or Neuropilin-2 toactivate
neuropilin-plexin A holoreceptor complexes. In contrast
to Sema3s, Sema6s do not require neuropilins for plexin
A binding. In the complex, plexin As serve as
signal-transducing subunits. An increasing number of
molecules that interact with the intracellular region of
Plexin A have been identified; among them are IgCAMs (in
axon guidance events) and Trem2-DAP12 (in immune
responses). The Sema domain is located at the N-terminus
and contains four disulfide bonds formed by eight
conserved cysteine residues. It serves as a
ligand-recognition and -binding module.
Length = 470
Score = 25.2 bits (55), Expect = 7.7
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 66 PLDGKPRTRSSMSSEKHPVESDTDSMAEY 94
+DGK ++SS K + ++D M Y
Sbjct: 150 AVDGKSEYFPTLSSRKLTADEESDGMFAY 178
>gnl|CDD|225154 COG2245, COG2245, Predicted membrane protein [Function unknown].
Length = 182
Score = 25.0 bits (55), Expect = 7.9
Identities = 12/27 (44%), Positives = 18/27 (66%), Gaps = 2/27 (7%)
Query: 8 TAG--WFIGMLLAIAFLILVLILVCLI 32
TAG +FIG +L I + L++IL+ I
Sbjct: 138 TAGLLYFIGAILLIVAVGLIIILIAAI 164
>gnl|CDD|218462 pfam05140, ResB, ResB-like family. This family includes both
ResB and cytochrome c biogenesis proteins. Mutations in
ResB indicate that they are essential for growth. ResB
is predicted to be a transmembrane protein.
Length = 437
Score = 25.3 bits (56), Expect = 8.0
Identities = 8/31 (25%), Positives = 17/31 (54%), Gaps = 4/31 (12%)
Query: 4 DTVATAGWFIGMLLAIAFLILVLILVCLIKR 34
V ++ WF+ +L L+ V ++ C ++R
Sbjct: 48 FDVYSSWWFLALLA----LLFVSLIGCSLRR 74
>gnl|CDD|234739 PRK00376, lspA, lipoprotein signal peptidase; Reviewed.
Length = 160
Score = 24.8 bits (55), Expect = 8.7
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGK 39
GW IA I V+++ L + K
Sbjct: 64 GGWQRWFFTGIAIGISVVLVYLLYRSKALQK 94
>gnl|CDD|222372 pfam13777, DUF4173, Domain of unknown function (DUF4173). This
domain of unknown function contains multiple predicted
transmembrane domains.
Length = 188
Score = 24.8 bits (55), Expect = 8.7
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 12 FIGMLLAIAFLILVLILVCLIKRNR 36
MLL L+LVL + K NR
Sbjct: 105 LAFMLLLAIGLLLVLAKIFRGKSNR 129
>gnl|CDD|217414 pfam03189, Otopetrin, Protein of unknown function, DUF270.
Length = 413
Score = 25.0 bits (55), Expect = 8.7
Identities = 8/20 (40%), Positives = 11/20 (55%), Gaps = 2/20 (10%)
Query: 13 IGMLLAIAFLILVLILVCLI 32
G+ L I L+L +I LI
Sbjct: 212 KGLFLGILILVLSII--SLI 229
>gnl|CDD|214459 MTH00211, ND5, NADH dehydrogenase subunit 5; Provisional.
Length = 597
Score = 24.9 bits (55), Expect = 8.7
Identities = 8/29 (27%), Positives = 13/29 (44%)
Query: 1 MPSDTVATAGWFIGMLLAIAFLILVLILV 29
+ + I LA+ L L+LIL+
Sbjct: 564 LIKLVTSPQQGLIKSYLALLLLSLLLILL 592
>gnl|CDD|213363 cd12829, Alr1p-like, Saccharomyces cerevisiae Alr1p-like subfamily.
This eukaryotic subfamily belongs to the Thermotoga
maritima CorA (TmCorA)-family of the MIT superfamily of
essential membrane proteins involved in transporting
divalent cations (uptake or efflux) across membranes.
This subfamily includes three Saccharomyces cerevisiae
members: two plasma membrane proteins, the Mg2+
transporter Alr1p/Swc3p and the putative Mg2+
transporter, Alr2p, and the vacuole membrane protein
Mnr2p, a putative Mg2+ transporter. Thermotoga maritima
CorA forms funnel-shaped homopentamers, the tip of the
funnel is formed from two C-terminal transmembrane (TM)
helices from each monomer, and the large opening of the
funnel from the N-terminal cytoplasmic domains. The GMN
signature motif of the MIT superfamily occurs just after
TM1, mutation within this motif is known to abolish Mg2+
transport by Alr1p. Natural variants in this signature
sequence may be associated with the transport of
different divalent cations. The functional diversity of
the MIT superfamily may also be due to minor structural
differences regulating gating, substrate selection, and
transport.
Length = 305
Score = 24.9 bits (55), Expect = 8.8
Identities = 7/34 (20%), Positives = 21/34 (61%)
Query: 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIKR 34
+P + + WF G+++ +++ +++V L++R
Sbjct: 271 VPGQDIESLYWFFGIVIVAIMVLVAIVVVFLVRR 304
>gnl|CDD|240529 cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known
as Stage III sporulation protein F, and related
proteins. The integral membrane protein SpoVB has been
implicated in the biosynthesis of the peptidoglycan
component of the spore cortex in Bacillus subtilis. This
model represents a subfamily of the multidrug and toxic
compound extrusion (MATE)-like proteins. Proteins from
the MATE family are involved in exporting metabolites
across the cell membrane and are often responsible for
multidrug resistance (MDR).
Length = 434
Score = 25.1 bits (56), Expect = 8.9
Identities = 7/23 (30%), Positives = 9/23 (39%)
Query: 5 TVATAGWFIGMLLAIAFLILVLI 27
AT G IG L + L+
Sbjct: 177 AGATFGAVIGALAGLLVLLYYYR 199
>gnl|CDD|119323 cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of
Escherichia coli LivM and related proteins. LivM is one
of two TMs of the E. coli LIV-1/LS transporter, a
Periplasmic Binding Protein (PBP)-dependent ATP-Binding
Cassette (ABC) transporter involved in the uptake of
branched-chain amino acids (AAs). These types of
transporters generally bind type 1 PBPs. PBP-dependent
ABC transporters consist of a PBP, two TMs, and two
cytoplasmic ABCs, and are mainly involved in importing
solutes from the environment. The solute is captured by
the PBP, which delivers it to a gated translocation
pathway formed by the two TMs. The two ABCs bind and
hydrolyze ATP and drive the transport reaction. E. coli
LivM forms a heterodimer with another TM, LivH, to
generate the transmembrane pore. LivH is not included in
this subgroup. The LIV-1/LS transporter is comprised of
two TMs (LivM and LivH), two ABCs (LivG and LivF), and
one of two alternative PBPs, LivJ (LIV-BP) or LivK
(LS-BP). In addition to transporting branched-chain AAs
including leucine, isoleucine and valine, the E. coli
LIV-1/LS transporter is involved in the uptake of the
aromatic AA, phenylalanine.
Length = 268
Score = 24.7 bits (55), Expect = 9.0
Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 16 LLAIAFLILVLILVCLIKRNRGGKY--AVHEREAA 48
L +A L+LVL+L+ + R+ G+ A+ E E A
Sbjct: 126 YLVLAVLLLVLLLLRRLVRSPFGRALRAIRENEVA 160
>gnl|CDD|216705 pfam01794, Ferric_reduct, Ferric reductase like transmembrane
component. This family includes a common region in the
transmembrane proteins mammalian cytochrome B-245 heavy
chain (gp91-phox), ferric reductase transmembrane
component in yeast and respiratory burst oxidase from
mouse-ear cress. This may be a family of
flavocytochromes capable of moving electrons across the
plasma membrane. The Frp1 protein from S. pombe is a
ferric reductase component and is required for cell
surface ferric reductase activity, mutants in frp1 are
deficient in ferric iron uptake. Cytochrome B-245 heavy
chain is a FAD-dependent dehydrogenase it is also has
electron transferase activity which reduces molecular
oxygen to superoxide anion, a precursor in the
production of microbicidal oxidants. Mutations in the
sequence of cytochrome B-245 heavy chain (gp91-phox)
lead to the X-linked chronic granulomatous disease. The
bacteriocidal ability of phagocytic cells is reduced and
is characterized by the absence of a functional plasma
membrane associated NADPH oxidase. The chronic
granulomatous disease gene codes for the beta chain of
cytochrome B-245 and cytochrome B-245 is missing from
patients with the disease.
Length = 122
Score = 24.6 bits (54), Expect = 9.3
Identities = 6/43 (13%), Positives = 19/43 (44%), Gaps = 2/43 (4%)
Query: 3 SDTVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGK--YAVH 43
+ + +G++ + L+L + + ++R G + +H
Sbjct: 69 LEKLKRPYILLGVIALVLLLLLAITSLPPLRRRLGYELFLYLH 111
>gnl|CDD|223884 COG0814, SdaC, Amino acid permeases [Amino acid transport and
metabolism].
Length = 415
Score = 24.9 bits (55), Expect = 9.4
Identities = 6/25 (24%), Positives = 10/25 (40%)
Query: 8 TAGWFIGMLLAIAFLILVLILVCLI 32
G IG+ A L++ + I
Sbjct: 85 KGGILIGLSYFFALYGLLVAYIVGI 109
>gnl|CDD|221362 pfam11991, Trp_DMAT, Tryptophan dimethylallyltransferase. This
family of proteins represents tryptophan
dimethylallyltransferase (EC:2.5.1.34), which catalyzes
the first step of ergot alkaloid biosynthesis. Ergot
alkaloids, which are produced by endophyte fungi, can
enhance plant host fitness, but also cause livestock
toxicosis to host plants. This protein is found in
bacteria and eukaryotes. Proteins in this family are
typically between 390 to 465 amino acids in length.
Length = 358
Score = 25.0 bits (55), Expect = 9.5
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 65 QPLDGKPRTRSSMSSEKHPVE 85
P G+PR RS ++ + P+E
Sbjct: 46 YPSAGQPRWRSFLTDDGTPLE 66
>gnl|CDD|234697 PRK00235, cobS, cobalamin synthase; Reviewed.
Length = 249
Score = 24.8 bits (55), Expect = 9.7
Identities = 7/20 (35%), Positives = 11/20 (55%)
Query: 9 AGWFIGMLLAIAFLILVLIL 28
G +G L A+ + +L L L
Sbjct: 40 VGLLLGALAALVYWLLQLGL 59
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.134 0.403
Gapped
Lambda K H
0.267 0.0771 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,257,449
Number of extensions: 440763
Number of successful extensions: 1334
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1321
Number of HSP's successfully gapped: 120
Length of query: 103
Length of database: 10,937,602
Length adjustment: 69
Effective length of query: 34
Effective length of database: 7,877,176
Effective search space: 267823984
Effective search space used: 267823984
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 53 (24.1 bits)