RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16729
(238 letters)
>gnl|CDD|216528 pfam01490, Aa_trans, Transmembrane amino acid transporter protein.
This transmembrane region is found in many amino acid
transporters including UNC-47 and MTR. UNC-47 encodes a
vesicular amino butyric acid (GABA) transporter, (VGAT).
UNC-47 is predicted to have 10 transmembrane domains.
MTR is a N system amino acid transporter system protein
involved in methyltryptophan resistance. Other members
of this family include proline transporters and amino
acid permeases.
Length = 406
Score = 111 bits (281), Expect = 5e-29
Identities = 52/194 (26%), Positives = 96/194 (49%), Gaps = 7/194 (3%)
Query: 11 VMPLENNMQTPNHFVGLCGVLNQGMGGVTMIYILLGFLGYLKYGELTEDNITYNLPQHEI 70
++P++N M++P+ F + VL + VT++YIL+G +GYL +G + NI NLP+ +
Sbjct: 212 LLPIQNTMKSPSKFKAMTKVLLTAIIIVTVLYILVGLVGYLAFGNNVKGNILLNLPKSDW 271
Query: 71 APQIANIAIGIAVFCTFGLQFFVCLEIVWNGVK---DNFVKKPKFYE--YIVRTVIVTAA 125
IAN+ + + + ++ LQ F +IV N + + PK ++R+ +V
Sbjct: 272 LIDIANLLLVLHLLLSYPLQAFPIRQIVENLLFRKGASGKHNPKSKLLRVVIRSGLVVIT 331
Query: 126 VGLAVAVPTIGPFLGLIGAFCFSLLGLIFPVLIEFVTYWDVGFGPGNWRVW--KNILVLI 183
+A++VP +G FL L+GA + L I P L +++ ++
Sbjct: 332 YLIAISVPFLGDFLSLVGATSGAPLTFILPPLFHLKLKKTKKKSLEKLWKPDILDVICIV 391
Query: 184 FGVLALVFGTQTSI 197
G+L + +G I
Sbjct: 392 IGLLLMAYGVAGLI 405
>gnl|CDD|240313 PTZ00206, PTZ00206, amino acid transporter; Provisional.
Length = 467
Score = 49.9 bits (119), Expect = 4e-07
Identities = 51/195 (26%), Positives = 87/195 (44%), Gaps = 10/195 (5%)
Query: 9 SQVMPLENNMQTPNHFVG-LCGVLNQGMGGVTMIYILLGFLGYLKYG-ELTEDNITYNLP 66
Q+ E M N VG MG +Y+L F GY+ +G +T + P
Sbjct: 271 FQITAYEVYMDMTNRSVGKFVLASTIAMGMCFTMYVLTAFFGYMDFGRNVTGSVLLMYDP 330
Query: 67 QHEIAPQIANIAIGIAVFCTFGLQFFVCLEIVWNGVKDNFVKKPKFYEYIVRTVIVTAAV 126
+E A + + + + +F ++ L C +++ + + +K F+++ + V ++ +
Sbjct: 331 VNEPAIMVGFVGVLVKLFVSYALLGMACRNALYDVIGWDA-RKVAFWKHCIAVVTLSVVM 389
Query: 127 GL-AVAVPTIGPFLGLIGAFCFSLLGLIFPVLIEFVTYWDVGF---GPGNWRVWKNILVL 182
L + +P I LG G+ LLG I P L Y GF G + +VL
Sbjct: 390 LLCGLFIPKINTVLGFAGSISGGLLGFILPAL--LFMY-SGGFTWQKVGPFYYISTYVVL 446
Query: 183 IFGVLALVFGTQTSI 197
I GV+A+VFGT +I
Sbjct: 447 ITGVIAIVFGTGATI 461
>gnl|CDD|223884 COG0814, SdaC, Amino acid permeases [Amino acid transport and
metabolism].
Length = 415
Score = 40.3 bits (95), Expect = 5e-04
Identities = 41/177 (23%), Positives = 63/177 (35%), Gaps = 23/177 (12%)
Query: 34 GMGGVTMIYILLGFLGYLKYGELTEDNITYNLPQHEIAPQ-----IANIAIGIAV----- 83
G ++YIL+GF + +G L NI Q+ I + + IA+
Sbjct: 234 GSLIALVLYILVGFFVFGCFGSLVFGNILAAKEQNISLLSALAGVINSPILSIALNIFAL 293
Query: 84 ------FCTFGLQFFVCLE--IVWNGVKDNFVKKPKFYEYIVRTVIVTAAVGLAVAVPTI 135
F L F L + K K + G A+A+
Sbjct: 294 FAIATSFLGVYLGLFEGLADLFKKSNSKPGRKKTGLLTFLPPLIFALLYPWGFAIALGYA 353
Query: 136 GPFLGLIGAFCFSLLGLIFPV-LIEFVTYWDVGFGPGNWRVWKNILVLIFGVLALVF 191
G + IGA L I P LI + V GN+ + +LVL+FG+L ++
Sbjct: 354 GGLIATIGAPIIPALLFIKPRKLIYKLPALKV--YGGNFLL--LLLVLLFGILVILS 406
>gnl|CDD|217505 pfam03348, Serinc, Serine incorporator (Serinc). This is a family
of eukaryotic membrane proteins which incorporate serine
into membranes and facilitate the synthesis of the
serine-derived lipids phosphatidylserine and
sphingolipid. Members of this family contain 11
transmembrane domains and form intracellular complexes
with key enzymes involved in serine and sphingolipid
biosynthesis.
Length = 428
Score = 31.4 bits (72), Expect = 0.36
Identities = 18/65 (27%), Positives = 33/65 (50%), Gaps = 9/65 (13%)
Query: 134 TIGPFLGLIGAFCFSLLGLIFPVLIEFV----TYWDVGFGPGNWRVWKNILV---LIFGV 186
+ ++G+IG+F F L+ LI L++F W + GN R+W L+ L+ +
Sbjct: 135 FVWMYIGVIGSFLFILIQLIL--LVDFAHSWAESWVEKYEDGNSRLWYAALLGVTLLMYI 192
Query: 187 LALVF 191
L++
Sbjct: 193 LSIAL 197
>gnl|CDD|215559 PLN03074, PLN03074, auxin influx permease; Provisional.
Length = 473
Score = 31.2 bits (71), Expect = 0.50
Identities = 26/88 (29%), Positives = 34/88 (38%), Gaps = 9/88 (10%)
Query: 81 IAVFCTFGLQ----FFVCLEIVWNGVKDNFVKKPKFYEYIVRTVIVTAAVGLAVAVPTIG 136
I F TFG +FV + + GV D K + R +V LA+ P G
Sbjct: 320 IHQFITFGFACTPLYFVWEKAI--GVHDT---KSICLRALARLPVVVPIWFLAIIFPFFG 374
Query: 137 PFLGLIGAFCFSLLGLIFPVLIEFVTYW 164
P +GA S I P L +TY
Sbjct: 375 PINSAVGALLVSFTVYIIPSLAHMLTYR 402
>gnl|CDD|212086 cd11517, SLC6sbd_B0AT3, glycine transporter, B0AT3; solute-binding
domain. B0AT3 (also called Xtrp2, XT2) transports
glycine. Human B0AT3 is encoded by the SLC6A18 gene.
B0AT3 is expressed in the kidney. Mutations in the
SLC6A18 gene may contribute to the autosomal recessive
disorder iminoglycinuria and its related disorder
hyperglycinuria. SLC6A18 or its neighboring genes are
associated with increased susceptibility to myocardial
infarction. This subgroup belongs to the solute carrier
6 (SLC6) transporter family.
Length = 576
Score = 30.6 bits (69), Expect = 0.77
Identities = 43/167 (25%), Positives = 63/167 (37%), Gaps = 35/167 (20%)
Query: 73 QIANIAIGIAVFCTFGLQFFVCLEIVWNGVKDNFVKKPKFYEYIVRTVIVTAAVGLAVAV 132
Q NI I + +CL W V Y ++R + T G +
Sbjct: 149 QTLNITPDINDSGSIQWWLLLCLAACWAIV----------YLCVIRGIETT---GKVIYF 195
Query: 133 PTIGPFLGLIGAFCFSLLGLIFPVLIEFVTYWDVGFGPG-----NWRVWKNILVLIFGVL 187
+ P+L L F + GL P E + Y F P N RVW + IF L
Sbjct: 196 TALFPYLVLT---IFLIRGLTLPGATEGLIYL---FTPNMHILQNPRVWLDAATQIFFSL 249
Query: 188 ALVFGTQTSIGDIMRAYMPPDSD-----LPVAIINGTLGLAGNASLP 229
+L FG + +Y PP +D + +A++N L AS+P
Sbjct: 250 SLAFGGHIAFA----SYNPPRNDCEKDAVTIALVNSMTSLY--ASIP 290
>gnl|CDD|185126 PRK15204, PRK15204, undecaprenyl-phosphate galactose
phosphotransferase; Provisional.
Length = 476
Score = 29.6 bits (66), Expect = 1.7
Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 2/67 (2%)
Query: 93 VCLEIVWNGVKDNFVKKPKFYEY--IVRTVIVTAAVGLAVAVPTIGPFLGLIGAFCFSLL 150
VC+ W ++ +KP +YE I RT+++ A LA+ T F + FC++
Sbjct: 65 VCVGWFWIRLRHYTYRKPFWYELKEIFRTIVIFAIFDLALIAFTKWQFSRYVWVFCWTFA 124
Query: 151 GLIFPVL 157
++ P
Sbjct: 125 LILVPFF 131
>gnl|CDD|232841 TIGR00131, gal_kin, galactokinase. Galactokinase is a member of
the GHMP kinases (Galactokinase, Homoserine kinase,
Mevalonate kinase, Phosphomevalonate kinase) and shares
with them an amino-terminal domain probably related to
ATP binding.The galactokinases found by This model are
divided into two sets. Prokaryotic forms are generally
shorter. The eukaryotic forms are longer because of
additional central regions and in some cases are known
to be bifunctional, with regulatory activities that are
independent of galactokinase activity [Energy
metabolism, Sugars].
Length = 386
Score = 29.0 bits (65), Expect = 2.1
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 9/75 (12%)
Query: 4 GHLPDSQVMPLENNMQTPNHFVGL-CGVLNQGMGGVTMIYILLGFLGYLKYGELTEDNIT 62
GHLP L NHFVG+ CG+++Q LG + E T
Sbjct: 143 GHLPLDSKQILLRIQVAENHFVGVNCGIMDQAASV-------LGKEDHALLVECRSLKAT 195
Query: 63 Y-NLPQHEIAPQIAN 76
PQ IA IAN
Sbjct: 196 PFKFPQLGIAFVIAN 210
>gnl|CDD|234062 TIGR02924, ICDH_alpha, isocitrate dehydrogenase. This family of
mainly alphaproteobacterial enzymes is a member of the
isocitrate/isopropylmalate dehydrogenase superfamily
described by pfam00180. Every member of the seed of this
model appears to have a TCA cycle lacking only a
determined isocitrate dehydrogenase. The precise
identity of the cofactor (NADH -- 1.1.1.41 vs. NADPH --
1.1.1.42) is unclear [Energy metabolism, TCA cycle].
Length = 473
Score = 29.0 bits (65), Expect = 2.1
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 8/40 (20%)
Query: 198 GDIMRAYMPPDSDLPVAIINGTLGLAGNASLPSNGTAFTA 237
GDI+ SD+ A I+G++GLAG+A++ F A
Sbjct: 227 GDIL-------SDV-AAEISGSVGLAGSANIGEEYAMFEA 258
>gnl|CDD|215373 PLN02691, PLN02691, porphobilinogen deaminase.
Length = 351
Score = 28.6 bits (64), Expect = 2.9
Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 12/76 (15%)
Query: 7 PDSQVMPLENNMQTPNHFVGLCGVLNQGMGGVTMIYILLGFLGYLKYGELTEDNITYNLP 66
P +V+ N+QT L +G+ T LL G LK ++TE + T L
Sbjct: 183 PHLKVVNFRGNVQTR------LRKLQEGVVDAT----LLALAG-LKRLDMTE-HATSILS 230
Query: 67 QHEIAPQIANIAIGIA 82
E+ P +A AIGIA
Sbjct: 231 TDEMLPAVAQGAIGIA 246
>gnl|CDD|183106 PRK11375, PRK11375, allantoin permease; Provisional.
Length = 484
Score = 28.6 bits (64), Expect = 3.2
Identities = 22/66 (33%), Positives = 31/66 (46%), Gaps = 11/66 (16%)
Query: 138 FLGLIGAFC---------FSLLGLIFPVLIEFVTYW--DVGFGPGNWRVWKNILVLIFGV 186
L LIG F+LLGL P LI F+ +W +VG G G +V ++
Sbjct: 131 LLILIGKIWPGFLTLGGDFTLLGLSLPGLITFLIFWLVNVGIGFGGGKVLNKFTAILNPC 190
Query: 187 LALVFG 192
+ +VFG
Sbjct: 191 IYIVFG 196
>gnl|CDD|217540 pfam03408, Foamy_virus_ENV, Foamy virus envelope protein.
Expression of the envelope (Env) glycoprotein is
essential for viral particle egress. This feature is
unique to the Spumavirinae, a subclass of the
Retroviridae.
Length = 985
Score = 28.2 bits (63), Expect = 5.3
Identities = 17/38 (44%), Positives = 18/38 (47%), Gaps = 4/38 (10%)
Query: 128 LAVAVPTIGPFLGLI--GAF--CFSLLGLIFPVLIEFV 161
A AV IG FL G F FSLLG P+LI
Sbjct: 926 AASAVSGIGNFLSKTAQGIFGTAFSLLGYAKPILIGIG 963
>gnl|CDD|223844 COG0773, MurC, UDP-N-acetylmuramate-alanine ligase [Cell envelope
biogenesis, outer membrane].
Length = 459
Score = 27.9 bits (63), Expect = 5.6
Identities = 12/32 (37%), Positives = 19/32 (59%), Gaps = 5/32 (15%)
Query: 18 MQTPN--HFVGLCGVLNQGMGGVTMIYILLGF 47
M+T HF+G+ G+ GM G+ I + LG+
Sbjct: 4 MKTLPKIHFIGIGGI---GMSGLAEILLNLGY 32
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 27.9 bits (63), Expect = 5.8
Identities = 8/23 (34%), Positives = 15/23 (65%), Gaps = 1/23 (4%)
Query: 102 VKDNFVK-KPKFYEYIVRTVIVT 123
++ ++ + K + YEY+ RT IV
Sbjct: 181 IEGDWYRAKQRIYEYLKRTAIVN 203
>gnl|CDD|181922 PRK09512, rbsC, ribose ABC transporter permease protein; Reviewed.
Length = 320
Score = 27.4 bits (61), Expect = 6.2
Identities = 13/39 (33%), Positives = 21/39 (53%), Gaps = 5/39 (12%)
Query: 120 VIVTAAVGLAVAVPTIGPFLGLIGAFCFSLLGLIFPVLI 158
VI+T+ + L+V G L L GA S++G+ L+
Sbjct: 65 VILTSGIDLSV-----GSLLALTGAVAASIVGIEVNALV 98
>gnl|CDD|234210 TIGR03439, methyl_EasF, probable methyltransferase domain, EasF
family. This model represents an uncharacterized domain
of about 300 amino acids with homology to
S-adenosylmethionine-dependent methyltransferases.
Proteins with this domain are exclusively fungal. A few,
such as EasF from Neotyphodium lolii, are associated
with the biosynthesis of ergot alkaloids, a class of
fungal secondary metabolites. EasF may, in fact, be the
AdoMet:dimethylallyltryptophan N-methyltransferase, the
enzyme that follows tryptophan dimethylallyltransferase
(DMATS) in ergot alkaloid biosynthesis. Several other
members of this family, including mug158 (meiotically
up-regulated gene 158 protein) from Schizosaccharomyces
pombe, contain an additional uncharacterized domain
DUF323 (pfam03781).
Length = 319
Score = 27.6 bits (62), Expect = 6.2
Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 2/34 (5%)
Query: 197 IGDIMRAYMPPDSDLPVAIINGTLGLAGNASLPS 230
I DI + + L II G +LP+
Sbjct: 1 IIDIRSNTV--EDSLVTEIIQGLRPNGQPRTLPT 32
>gnl|CDD|143566 cd07457, CRD_FZ9_like, Cysteine-rich Wnt-binding domain (CRD) of
receptors similar to frizzled 9. The cysteine-rich
domain (CRD) is an essential extracellular portion of
the frizzled 9 (Fz9) and frizzled 10 (Fz10) receptors,
and similar proteins. This domain is required for
binding Wnt proteins, which play fundamental roles in
many aspects of early development, such as cell and
tissue polarity, neural synapse formation, and the
regulation of proliferation. Fz proteins serve as Wnt
receptors for multiple signal transduction pathways,
including both beta-catenin dependent and -independent
cellular signaling, as well as the planar cell polarity
pathway and Ca(2+) modulating signaling pathway. The
CRD domain is well conserved in metazoans - 10 frizzled
proteins have been identified in mammals, 4 in
Drosophila and 3 in Caenorhabditis elegans. Very little
is known about the mechanism by which CRD domains
interact with their ligands. The domain contains 10
conserved cysteines.
Length = 121
Score = 26.7 bits (59), Expect = 6.3
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 67 QHEIAPQIANIAIGIAVFCTFGLQFFVC 94
Q E A I A + C L+FF+C
Sbjct: 30 QSEAAISIHEFAPLVQYGCAEHLRFFLC 57
>gnl|CDD|206380 pfam14212, DUF4324, Domain of unknown function (DUF4324). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria, archaea and
eukaryotes. Proteins in this family are typically
between 88 and 217 amino acids in length.
Length = 95
Score = 26.2 bits (58), Expect = 6.5
Identities = 9/9 (100%), Positives = 9/9 (100%)
Query: 118 RTVIVTAAV 126
RTVIVTAAV
Sbjct: 79 RTVIVTAAV 87
>gnl|CDD|216420 pfam01298, Lipoprotein_5, Transferrin binding protein-like solute
binding protein. This family of proteins are distantly
related to other families of solute binding proteins.
Length = 554
Score = 27.5 bits (61), Expect = 6.7
Identities = 14/59 (23%), Positives = 20/59 (33%), Gaps = 5/59 (8%)
Query: 2 ANGHLPDSQVMPLENNMQTPNHFVGLCGVLNQGMGGVTMIYILLGFLGYLKYGELTEDN 60
A L D + +PL + F G G L YLK+G+L +
Sbjct: 337 ATKLLIDGETIPL---TPANSDFAGSKHHELDGKTYKVEAC--CSNLEYLKFGQLKLKD 390
>gnl|CDD|217079 pfam02516, STT3, Oligosaccharyl transferase STT3 subunit. This
family consists of the oligosaccharyl transferase STT3
subunit and related proteins. The STT3 subunit is part
of the oligosaccharyl transferase (OTase) complex of
proteins and is required for its activity. In
eukaryotes, OTase transfers a lipid-linked
core-oligosaccharide to selected asparagine residues in
the ER. In the archaea STT3 occurs alone, rather than in
an OTase complex, and is required for N-glycosylation of
asparagines.
Length = 650
Score = 27.4 bits (61), Expect = 7.4
Identities = 23/142 (16%), Positives = 40/142 (28%), Gaps = 24/142 (16%)
Query: 24 FVGLCGVLNQGMGGVTMIY---ILLGFLGYLKYGEL-TEDNITYNLPQHEIAPQIANIAI 79
FVG V + G +++ LL F +K L T P + + +
Sbjct: 242 FVGFAKVRSSEHMGAFLVFGLAQLLAFGSVVKGVYLSFYQFETLTYPGLFLVLGLIGPVL 301
Query: 80 GIAVFCTFGLQFFVCLEIVWNGVKDNFVKKPKFYEYIVRTVIVTAAVGLAVAVPTIGPFL 139
+ F +W G + K + + GL
Sbjct: 302 FLLGLLGFI--------ALWTGRFYSLFDYAKIDIPTIILISEWQPTGLYA--------- 344
Query: 140 GLIGAFCFSLLGLIFPVLIEFV 161
L+ F + FP+ + F
Sbjct: 345 -LLAGLRFLI--FTFPLGLIFC 363
>gnl|CDD|234697 PRK00235, cobS, cobalamin synthase; Reviewed.
Length = 249
Score = 27.1 bits (61), Expect = 8.5
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 129 AVAVPTIGPFLGLIGAFCFSLLGLIFPVLI 158
A A P +G LG + A + LL L P L+
Sbjct: 34 ARAFPLVGLLLGALAALVYWLLQLGLPPLL 63
>gnl|CDD|185731 cd08990, GH43_AXH_like, Glycosyl hydrolase family 43, includes
arabinoxylan arabinofuranohydrolase, beta-xylosidase,
endo-1,4-beta-xylanase, alpha-L-arabinofuranosidase.
This glycosyl hydrolase family 43 (GH43) includes
enzymes that have been characterized with
beta-xylosidase (EC 3.2.1.37),
alpha-L-arabinofuranosidase (EC 3.2.1.55),
endo-alpha-L-arabinanase as well as arabinoxylan
arabinofuranohydrolase (AXH) activities. These are all
inverting enzymes (i.e. they invert the stereochemistry
of the anomeric carbon atom of the substrate) that have
an aspartate as the catalytic general base, a glutamate
as the catalytic general acid and another aspartate that
is responsible for pKa modulation and orienting the
catalytic acid. Many of the enzymes in this family
display both alpha-L-arabinofuranosidase and
beta-D-xylosidase activity using aryl-glycosides as
substrates. AXHs specifically hydrolyze the glycosidic
bond between arabinofuranosyl substituents and
xylopyranosyl backbone residues of arabinoxylan. Several
of these enzymes also contain carbohydrate binding
modules (CBMs) that bind cellulose or xylan. A common
structural feature of GH43 enzymes is a 5-bladed
beta-propeller domain that contains the catalytic acid
and catalytic base. A long V-shaped groove, partially
enclosed at one end, forms a single extended
substrate-binding surface across the face of the
propeller.
Length = 274
Score = 26.8 bits (60), Expect = 9.3
Identities = 7/32 (21%), Positives = 12/32 (37%)
Query: 41 IYILLGFLGYLKYGELTEDNITYNLPQHEIAP 72
Y+ G L+ +L D ++ EI
Sbjct: 134 AYLYWGGGLGLRVAKLKPDMLSLKGEPVEIVI 165
>gnl|CDD|185317 PRK15419, PRK15419, proline:sodium symporter PutP; Provisional.
Length = 502
Score = 27.3 bits (60), Expect = 9.7
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 8/107 (7%)
Query: 65 LPQHEIAPQIANI---AIGIAVFCTFGLQFFVCLEIVWNGVKDNFVKKPKFYEYIV---- 117
L Q P IA I AI AV T Q VC + + F++K + +V
Sbjct: 317 LAQILFNPWIAGILLSAILAAVMSTLSCQLLVCSSAITEDLYKAFLRKHASQKELVWVGR 376
Query: 118 RTVIVTAAVGLAVAVPTIGPFLGLIGAFCFSLLGLIFPVLIEFVTYW 164
V+V A V +A+A LGL+ ++ ++ G F ++ F W
Sbjct: 377 VMVLVVALVAIALAANPENRVLGLV-SYAWAGFGAAFGPVVLFSVMW 422
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.327 0.146 0.460
Gapped
Lambda K H
0.267 0.0761 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,745,737
Number of extensions: 1272156
Number of successful extensions: 2462
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2428
Number of HSP's successfully gapped: 210
Length of query: 238
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 144
Effective length of database: 6,768,326
Effective search space: 974638944
Effective search space used: 974638944
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 57 (25.7 bits)