RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3469
(154 letters)
>gnl|CDD|129972 TIGR00894, 2A0114euk, Na(+)-dependent inorganic phosphate
cotransporter. [Transport and binding proteins,
Anions].
Length = 465
Score = 110 bits (278), Expect = 2e-29
Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 1/123 (0%)
Query: 3 LGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFT 62
+G GP + A Y YLT+ ILT+ + +G +N LD++PRF G + T
Sbjct: 342 IGGLGPGIFAYALPYLSAAFYLTIIILTLANAVSSGPLAGVLINSLDLAPRFLGFIKGIT 401
Query: 63 NCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQPWDNP 122
+ GL+A +AG I + ++ W +VF+ A V + FY+ S +RQ W
Sbjct: 402 GLPGFIGGLIASTLAG-NILSQDSKNVWLIVFLIMAFVNILCVIFYLIFGSAERQDWAKE 460
Query: 123 NND 125
D
Sbjct: 461 EKD 463
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 40.4 bits (95), Expect = 1e-04
Identities = 22/110 (20%), Positives = 43/110 (39%), Gaps = 4/110 (3%)
Query: 1 MILGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMS 60
+++G A+ L+ + L V +L +G G A + + P G
Sbjct: 245 LLIGLLLAALGLLLLALAPSLALLLVALLLLGFGLGFAFPALLTLASELAPPEARGTASG 304
Query: 61 FTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIF 110
N +L G L P++AG ++ + VF+ A + L ++ +
Sbjct: 305 LFNTFGSLGGALGPLLAGLLLDTG----GYGGVFLILAALALLAALLLLL 350
Score = 30.7 bits (70), Expect = 0.24
Identities = 20/102 (19%), Positives = 40/102 (39%), Gaps = 4/102 (3%)
Query: 9 AVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANL 68
A+ + ++ L VG +GLG + + P+ G + + L
Sbjct: 75 ALGSLLLAFASSLWLLLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSAGFGL 134
Query: 69 AGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIF 110
LL P++ G + + WR +F+ A++ L + +F
Sbjct: 135 GALLGPLLGGLLAE----SLGWRWLFLILAILGLLLALLLLF 172
>gnl|CDD|233174 TIGR00893, 2A0114, D-galactonate transporter. [Transport and
binding proteins, Carbohydrates, organic alcohols, and
acids].
Length = 399
Score = 36.9 bits (86), Expect = 0.002
Identities = 23/99 (23%), Positives = 44/99 (44%), Gaps = 2/99 (2%)
Query: 2 ILGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSF 61
I+ ++ + A +Y PY + ++ +G GA G+ + + AG+
Sbjct: 293 IIAGLVLSLLMFATNYVN-IPYAALALVALGFFGLGAGAIGWALISDNAPGNIAGLTGGL 351
Query: 62 TNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVV 100
N L NL G++ PI+ G + + A ++ VAA +
Sbjct: 352 INSLGNLGGIVGPIVIGAIAATTGS-FAGALMVVAALAL 389
>gnl|CDD|217379 pfam03125, Sre, C. elegans Sre G protein-coupled chemoreceptor.
Caenorhabditis elegans Sre proteins are candidate
chemosensory receptors. There are four main recognised
groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre
(this family), Sra pfam02117 and Srb pfam02175 comprise
the Sra group. All of the above receptors are thought to
be G protein-coupled seven transmembrane domain
proteins. The existence of several different
chemosensory receptors underlies the fact that in spite
of having only 20-30 chemosensory neurones, C. elegans
detects hundreds of different chemicals, with the
ability to discern individual chemicals among
combinations.
Length = 363
Score = 31.5 bits (72), Expect = 0.16
Identities = 12/63 (19%), Positives = 16/63 (25%), Gaps = 6/63 (9%)
Query: 32 GLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWR 91
L SG D+ P L L LL + + AW+
Sbjct: 257 VLFIIVLCGSGILALSFDLIPLLRTFLNHIFENSLFLYPLLICPTIMFSVP------AWK 310
Query: 92 VVF 94
F
Sbjct: 311 KEF 313
>gnl|CDD|178518 PLN02930, PLN02930, CDP-diacylglycerol-serine
O-phosphatidyltransferase.
Length = 353
Score = 30.9 bits (70), Expect = 0.20
Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 7/62 (11%)
Query: 62 TNCLANLAGLLAPIIAGYVIQG------KPTQAAWRVVFVAAAVVYLFSSTFYIFAASGD 115
T+ + ++A +A ++Q +P A WR+V AVVYL + TF +F D
Sbjct: 36 TSVKRGVWAMIAVFLAYCLLQAPSTVLIRPHPAIWRLVH-GMAVVYLVALTFLLFQKRDD 94
Query: 116 RQ 117
+
Sbjct: 95 AR 96
>gnl|CDD|225646 COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport
and metabolism].
Length = 498
Score = 30.7 bits (70), Expect = 0.25
Identities = 31/127 (24%), Positives = 39/127 (30%), Gaps = 26/127 (20%)
Query: 1 MILGQYGPAVCLVAASYT------GCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRF 54
+LG + L A G L IGL FK N IS
Sbjct: 87 RVLGTRR-TIVLGAILMAIGHLVLAISSVSGPGGLYIGLALIIVGTGLFKPN---ISS-L 141
Query: 55 AGILMS---------FTNCLA--NLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLF 103
G L FT N+ L+APII G + W V F AAV +
Sbjct: 142 LGELYPKDDPRRDGGFTLFYMGINIGSLIAPIITGLLAINYG----WHVGFGLAAVGMII 197
Query: 104 SSTFYIF 110
++
Sbjct: 198 GLVIFLL 204
>gnl|CDD|145924 pfam03034, PSS, Phosphatidyl serine synthase. Phosphatidyl serine
synthase is also known as serine exchange enzyme. This
family represents eukaryotic PSS I and II which are
membrane bound proteins which catalyzes the replacement
of the head group of a phospholipid (phosphotidylcholine
or phosphotidylethanolamine) by L-serine.
Length = 278
Score = 29.6 bits (67), Expect = 0.55
Identities = 10/26 (38%), Positives = 15/26 (57%), Gaps = 1/26 (3%)
Query: 85 PTQAAWRVVFVAAAVVYLFSSTFYIF 110
P A WR+V +V+YL + F +F
Sbjct: 1 PHPAFWRLVL-GLSVLYLLALVFLLF 25
>gnl|CDD|211363 cd09252, AP-3_Mu3_Cterm, C-terminal domain of medium Mu3 subunit in
adaptor protein (AP) complex AP-3. AP complexes
participate in the formation of intracellular coated
transport vesicles and select cargo molecules for
incorporation into the coated vesicles in the late
secretory and endocytic pathways. There are four AP
complexes, AP-1, AP-2, AP-3, and AP-4, described in
various eukaryotic organisms. Each AP complex consists
of four subunits: two large chains (one each of
gamma/alpha/delta/epsilon and beta1-4, respectively), a
medium mu chain (mu1-4), and a small sigma chain
(sigma1-4). Each of the four subunits from the different
AP complexes exhibits similarity with each other. This
family corresponds to the C-terminal domain of
heterotetrameric adaptor protein complex 3 (AP-3) medium
mu3 subunit, which includes two closely related
homologs, mu3A (P47A, encoded by ap3m1) and mu1B (P47B,
encoded by ap3m2). Mu3A is ubiquitously expressed, but
mu3B is specifically expressed in neurons and
neuroendocrine cells. AP-3 is particularly important for
targeting integral membrane proteins to lysosomes and
lysome-related organelles at trans-Golgi network (TGN)
and/or endosomes, such as the yeast vacuole, fly pigment
granules and mammalian melanosomes, platelet dense
bodies and the secretory lysosomes of cytotoxic T
lymphocytes. Unlike AP-1 and AP-2, which function in
conjunction with clathrin which is a scaffolding protein
participating in the formation of coated vesicles, the
nature of the outer shell of AP-3 containing coats
remains to be elucidated. Membrane-anchored cargo
molecules interact with adaptors through short sorting
signals in their cytosolic segments. Tyrosine-based
endocytotic signals are one of the most important
sorting signals. They are of the form Y-X-X-Phi, where Y
is tyrosine, X is any amino acid and Phi is a bulky
hydrophobic residue that can be Leu, Ile, Met, Phe, or
Val. These kinds of sorting signals can be recognized by
the C-terminal domain of AP-3 mu3 subunit, also known as
Y-X-X-Phi signal-binding domain that contains two
hydrophobic pockets, one for the tyrosine-binding and
one for the bulky hydrophobic residue-binding.
Length = 248
Score = 29.5 bits (67), Expect = 0.62
Identities = 10/23 (43%), Positives = 14/23 (60%), Gaps = 2/23 (8%)
Query: 30 TIGLGF--NGAVYSGFKVNHLDI 50
+I + F G + SG KV+ LDI
Sbjct: 206 SISVSFKIPGYLPSGLKVDSLDI 228
>gnl|CDD|148986 pfam07682, SOR, Sulphur oxygenase reductase. The sulphur
oxygenase/reductase (SOR) of the thermo-acidophilic
archaeon Acidianus ambivalens is an unusual enzyme
consisting of 24 identical subunits arranged in a
perfectly symmetrical hollow sphere and containing a
mononuclear non-heme iron centre (personal
communication: A. Kletzin). At 85 degrees C in vitro,
elemental sulphur is oxidized to sulphite, thiosulphate
and hydrogen sulphide with no external cofactors
needed. The proposed equation is: 4S + O2 + 4 H2O --->
2 HSO3- + 2 H2S + 2 H+.
Length = 303
Score = 29.1 bits (65), Expect = 0.92
Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 2 ILGQYGPAVCLVAA---SYTGCDPYLTVGILTIGLGFNGA 38
+ GP VC+V A + G ++ +G+L +G F GA
Sbjct: 21 LFSAVGPKVCMVTARHPGFVGFQNHVQIGVLPLGERFGGA 60
>gnl|CDD|219516 pfam07690, MFS_1, Major Facilitator Superfamily.
Length = 346
Score = 28.9 bits (65), Expect = 0.94
Identities = 19/110 (17%), Positives = 39/110 (35%), Gaps = 4/110 (3%)
Query: 1 MILGQYGPAVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMS 60
+++G A+ L+ + L V + GLG + + P G +
Sbjct: 65 LLIGLLLFALGLLLLLFASSLWLLLVLRVLQGLGGGALFPAAAALIADWFPPEERGRALG 124
Query: 61 FTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIF 110
+ L L P++ G + + WR F+ A++ L ++
Sbjct: 125 LLSAGFGLGAALGPLLGGLLA----SLFGWRAAFLILAILALLAAVLAAL 170
>gnl|CDD|233175 TIGR00895, 2A0115, benzoate transport. [Transport and binding
proteins, Carbohydrates, organic alcohols, and acids].
Length = 398
Score = 28.5 bits (64), Expect = 1.6
Identities = 17/95 (17%), Positives = 32/95 (33%), Gaps = 4/95 (4%)
Query: 9 AVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANL 68
+V + + L + GLG G + + + RF G + C +
Sbjct: 93 SVFTLLCALATNVTQLLILRFLAGLGLGGLMPNLNALVSEYAPKRFRGTAVGLMFCGYPI 152
Query: 69 AGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLF 103
+ +AG++I WR +F + L
Sbjct: 153 GAAVGGFLAGWLI----PVFGWRSLFYVGGIAPLL 183
>gnl|CDD|225371 COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport
and metabolism].
Length = 394
Score = 28.3 bits (64), Expect = 1.8
Identities = 22/109 (20%), Positives = 41/109 (37%), Gaps = 4/109 (3%)
Query: 9 AVCLVAASYTGCDPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANL 68
A+ L+A ++TG P L + +L + G + ++P A + S NL
Sbjct: 287 ALALLALTFTGASPALALALLFLWGFAFSPALQGLQTRLARLAPDAADLAGSLNVAAFNL 346
Query: 69 AGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQ 117
L + G V+ + A + L + + +A DR+
Sbjct: 347 GIALGAALGGLVLD----ALGYAATGWVGAALLLLALLLALLSARKDRR 391
>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048). This
presumed domain is functionally uncharacterized. This
domain family is found in eukaryotes, and is typically
between 228 and 257 amino acids in length.
Length = 242
Score = 27.8 bits (62), Expect = 2.1
Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 3/49 (6%)
Query: 103 FSSTFYIFAASGDRQPWDNPNNDAPTPRAATKRDASNNNRTSYASDVED 151
F S F I Q +P + TP T + S N ++S DV+D
Sbjct: 176 FWSEFGIDTPG---QKSKSPQKASSTPAGNTNQGQSQNAQSSNLLDVDD 221
>gnl|CDD|239250 cd02952, TRP14_like, Human TRX-related protein 14 (TRP14)-like
family; composed of proteins similar to TRP14, a 14kD
cytosolic protein that shows disulfide reductase
activity in vitro with a different substrate specificity
compared with another human cytosolic protein, TRX1.
TRP14 catalyzes the reduction of small
disulfide-containing peptides but does not reduce
disulfides of ribonucleotide reductase, peroxiredoxin
and methionine sulfoxide reductase, which are TRX1
substrates. TRP14 also plays a role in tumor necrosis
factor (TNF)-alpha signaling pathways, distinct from
that of TRX1. Its depletion promoted TNF-alpha induced
activation of c-Jun N-terminal kinase and
mitogen-activated protein kinases.
Length = 119
Score = 26.9 bits (60), Expect = 2.8
Identities = 9/22 (40%), Positives = 12/22 (54%)
Query: 108 YIFAASGDRQPWDNPNNDAPTP 129
+I+ GDR W +PNN T
Sbjct: 63 FIYCDVGDRPYWRDPNNPFRTD 84
>gnl|CDD|220557 pfam10086, DUF2324, Putative membrane peptidase family (DUF2324).
This domain, found in various hypothetical bacterial
proteins, has no known function. This family appears to
be related to the prenyl protease 2 family pfam02517,
suggesting this family may be peptidases.
Length = 222
Score = 27.2 bits (61), Expect = 3.7
Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 71 LLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFS 104
+L P++ YV K + +WRV + A V LF+
Sbjct: 3 ILLPVLL-YVYLRKRKKLSWRVFGLGALVFALFA 35
>gnl|CDD|218922 pfam06168, DUF981, Protein of unknown function (DUF981). Family of
uncharacterized proteins found in bacteria and archaea.
Length = 189
Score = 26.9 bits (60), Expect = 4.3
Identities = 25/123 (20%), Positives = 44/123 (35%), Gaps = 31/123 (25%)
Query: 2 ILGQYGPAVCLVAASYTGC--DPYLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGI-- 57
I G +G + SY DPYL +G++ + ++Y G+ + L I F G+
Sbjct: 53 ITGLWGELTWPLPGSYNILFGDPYLLLGLILLSAAI--SLYKGYDLRPLGILALFLGLYT 110
Query: 58 --------------------LMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAA 97
L + L LAGLL P +A + + ++
Sbjct: 111 IYYGVAIYNYGLTREPLLALLGLYI--LTGLAGLLPPTLA---LDRLKGKKILLILGAVI 165
Query: 98 AVV 100
++
Sbjct: 166 LIL 168
>gnl|CDD|235348 PRK05122, PRK05122, major facilitator superfamily transporter;
Provisional.
Length = 399
Score = 27.2 bits (61), Expect = 4.4
Identities = 27/97 (27%), Positives = 37/97 (38%), Gaps = 10/97 (10%)
Query: 23 YLTVGILTIGLGFNGAVYSGFKVNHLDISPRFAGILMSFTNCLANLAGLLAPIIAGYVIQ 82
+ + LTIGL AV G+ + L S AG+++S L LA LL+ AG
Sbjct: 23 FTFISYLTIGLPL--AVLPGYVHDQLGFSAFLAGLVIS----LQYLATLLSRPHAGRYAD 76
Query: 83 GKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQPW 119
T + V L S Y+ A P
Sbjct: 77 ---TLGPKKAVVFGLCGCAL-SGLLYLLAGLLAAWPV 109
>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
Length = 985
Score = 27.1 bits (60), Expect = 4.4
Identities = 8/16 (50%), Positives = 11/16 (68%)
Query: 108 YIFAASGDRQPWDNPN 123
+IF+ R+ WDNPN
Sbjct: 486 FIFSKWALREGWDNPN 501
>gnl|CDD|178126 PLN02510, PLN02510, probable 1-acyl-sn-glycerol-3-phosphate
acyltransferase.
Length = 374
Score = 27.0 bits (60), Expect = 4.4
Identities = 12/50 (24%), Positives = 23/50 (46%), Gaps = 2/50 (4%)
Query: 59 MSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFY 108
+S CLAN ++A + + W ++V+ + YL S+T +
Sbjct: 307 LSTLKCLANFLAVIA--LTTVFTYLTLFSSVWFKIYVSLSCAYLTSATHF 354
>gnl|CDD|235151 PRK03699, PRK03699, putative transporter; Provisional.
Length = 394
Score = 27.2 bits (61), Expect = 4.4
Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 3/42 (7%)
Query: 61 FTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYL 102
FT+ ++AG++ PIIA Y++ W V+ +VY+
Sbjct: 135 FTDSFFSMAGMIFPIIAAYLLARS---IEWYWVYACIGLVYV 173
>gnl|CDD|226812 COG4377, COG4377, Predicted membrane protein [Function unknown].
Length = 258
Score = 26.7 bits (59), Expect = 4.9
Identities = 9/40 (22%), Positives = 15/40 (37%)
Query: 65 LANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFS 104
+ LLA I + Q V+ + A ++FS
Sbjct: 12 IITAIALLAFPIGSIWWAKRKYQINLAVLGLGAVAFFVFS 51
>gnl|CDD|182964 PRK11102, PRK11102, bicyclomycin/multidrug efflux system;
Provisional.
Length = 377
Score = 26.8 bits (60), Expect = 4.9
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 58 LMSFTNCLANLAGLLAPIIAGYV 80
+MSF + +A LLAPII G++
Sbjct: 116 MMSFVTLVMTIAPLLAPIIGGWL 138
>gnl|CDD|152471 pfam12036, DUF3522, Protein of unknown function (DUF3522). This
family of proteins is functionally uncharacterized. This
protein is found in eukaryotes. Proteins in this family
are typically between 220 to 787 amino acids in length.
Length = 183
Score = 26.6 bits (59), Expect = 5.0
Identities = 16/58 (27%), Positives = 22/58 (37%), Gaps = 15/58 (25%)
Query: 54 FAGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVY---LFSSTFY 108
F +L L+NLA L + A + +V AVVY +F S Y
Sbjct: 2 FEQLLQFLLLTLSNLAFLPTIYV------------ALKRRYVYEAVVYIFTMFFSFMY 47
>gnl|CDD|223856 COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational
modification, protein turnover, chaperones].
Length = 220
Score = 26.5 bits (59), Expect = 5.1
Identities = 14/59 (23%), Positives = 22/59 (37%), Gaps = 5/59 (8%)
Query: 55 AGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAAS 113
AG+L + C L L P Y+ G A V+ + V F++ F +
Sbjct: 19 AGLLSFLSPC--VLP--LLPAYLSYLAGGSLG-ARKSVLLASLLFVLGFATVFVLLGIG 72
>gnl|CDD|224191 COG1272, COG1272, Predicted membrane protein, hemolysin III homolog
[General function prediction only].
Length = 226
Score = 26.8 bits (60), Expect = 5.2
Identities = 12/72 (16%), Positives = 23/72 (31%), Gaps = 10/72 (13%)
Query: 58 LMSFTNCLAN----LAGLLAPIIAG------YVIQGKPTQAAWRVVFVAAAVVYLFSSTF 107
S+ +AN L G + I+ +I G ++ + + ST
Sbjct: 15 SYSWHEEIANAITHLIGAILAIVGLVLLLVYALITGSALAVIVFSIYGLSLFLLFLVSTL 74
Query: 108 YIFAASGDRQPW 119
Y +G +
Sbjct: 75 YHSIPNGQKAKA 86
>gnl|CDD|185739 cd08998, GH43_ABN_1, Glycosyl hydrolase family 43. This glycosyl
hydrolase family 43 (GH43) includes mostly enzymes with
alpha-L-arabinofuranosidase (AFN; EC 3.2.1.55) and
endo-alpha-L-arabinanase (ABN; EC 3.2.1.99) activities.
These are inverting enzymes (i.e. they invert the
stereochemistry of the anomeric carbon atom of the
substrate) that have an aspartate as the catalytic
general base, a glutamate as the catalytic general acid
and another aspartate that is responsible for pKa
modulation and orienting the catalytic acid. The GH43
ABN enzymes hydrolyze alpha-1,5-L-arabinofuranoside
linkages while the ABF enzymes cleave arabinose side
chains so that the combined actions of these two enzymes
reduce arabinan to L-arabinose and/or
arabinooligosaccharides. These arabinan-degrading
enzymes are important in the food industry for efficient
production of L-arabinose from agricultural waste;
L-arabinose is often used as a bioactive sweetener. A
common structural feature of GH43 enzymes is a 5-bladed
beta-propeller domain that contains the catalytic acid
and catalytic base. A long V-shaped groove, partially
enclosed at one end, forms a single extended
substrate-binding surface across the face of the
propeller.
Length = 288
Score = 26.8 bits (60), Expect = 5.3
Identities = 14/50 (28%), Positives = 18/50 (36%), Gaps = 1/50 (2%)
Query: 101 YLFSSTFYIFAA-SGDRQPWDNPNNDAPTPRAATKRDASNNNRTSYASDV 149
YLFS+ I A S D W + PT + +A DV
Sbjct: 14 YLFSTGNGIAIAKSKDLGNWTYVGSVFPTGPPWWSSIDPGGSGNLWAPDV 63
>gnl|CDD|220948 pfam11027, DUF2615, Protein of unknown function (DUF2615). This
small. approximately 100 residue, family is conserved
from worms to humans. It is cysteine-rich with a
characteristic FDxCEC sequence motif. The function is
not known.
Length = 103
Score = 25.8 bits (57), Expect = 5.4
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 7/69 (10%)
Query: 62 TNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVV-YLFSSTFYIFAASGDRQPWD 120
T CL +L G P +G G + + ++ A+ YL + SGD +P D
Sbjct: 35 TECLQDLPG--RPQGSG---TGGISGFMLMMAWMLLAMALYLLRP-SSLRNPSGDNKPRD 88
Query: 121 NPNNDAPTP 129
N +ND P P
Sbjct: 89 NNSNDGPPP 97
>gnl|CDD|225365 COG2807, CynX, Cyanate permease [Inorganic ion transport and
metabolism].
Length = 395
Score = 26.1 bits (58), Expect = 8.2
Identities = 12/57 (21%), Positives = 21/57 (36%)
Query: 50 ISPRFAGILMSFTNCLANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSST 106
+S AG L+S LL P++A +P ++ + V L +
Sbjct: 240 LSAAEAGSLLSLMQLAQLPTALLIPLLARRSKNQRPLVVLALLLMLVGLVGLLLAPG 296
>gnl|CDD|233167 TIGR00881, 2A0104, phosphoglycerate transporter family protein.
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 379
Score = 26.2 bits (58), Expect = 8.3
Identities = 25/138 (18%), Positives = 46/138 (33%), Gaps = 26/138 (18%)
Query: 24 LTVGILTIGLGFNGAVYS--------------GFKVNHLDI----SPRFAGILMSFTNCL 65
+ I+ + GF+ +++ G+ + S G +SF NC
Sbjct: 68 ILCAIVNLFFGFSTSLWVMAALWALNGIFQGMGWPPCGRTVTKWFSRSERGTWVSFWNCS 127
Query: 66 ANLAGLLAPIIAGYVIQGKPTQAAWRVVFVAAAVVYLFSSTFYIFAASGDRQ-----PWD 120
N+ G L P + + I +W VF+ ++ + S Q P +
Sbjct: 128 HNVGGGLLPPLVLFGIA---ELYSWHWVFIVPGIIAIIVSLICFLLLRDSPQSCGLPPAE 184
Query: 121 NPNNDAPTPRAATKRDAS 138
N+ P A K +
Sbjct: 185 EMPNEEPDGDAEKKEEEL 202
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.136 0.419
Gapped
Lambda K H
0.267 0.0748 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,876,747
Number of extensions: 725968
Number of successful extensions: 836
Number of sequences better than 10.0: 1
Number of HSP's gapped: 825
Number of HSP's successfully gapped: 93
Length of query: 154
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 65
Effective length of database: 6,990,096
Effective search space: 454356240
Effective search space used: 454356240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 55 (24.9 bits)