RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4337
(174 letters)
>gnl|CDD|217869 pfam04065, Not3, Not1 N-terminal domain, CCR4-Not complex
component.
Length = 233
Score = 103 bits (260), Expect = 5e-28
Identities = 47/96 (48%), Positives = 72/96 (75%), Gaps = 1/96 (1%)
Query: 48 LDQFESEIESLLVGKKKR-LDKDKQDRMDELKAKVERHRYHIRKLETLLRMLDNMSVEVD 106
++ E+E+ESL V +KKR D KQ+R+ EL+ +ERH++HI KLE +LR+L+N ++ +
Sbjct: 138 IELLEAEVESLSVQRKKRKADAAKQERIAELEELLERHKFHISKLELILRLLENGELDPE 197
Query: 107 QIRRIKDDVEYYIESSQDIDFEENEYIYDDIEDDVE 142
Q+ IK+D+EYY+ES+Q+ DF E+E IYDD+ D E
Sbjct: 198 QVDDIKEDIEYYVESNQEPDFVEDETIYDDLNLDEE 233
>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
subunit [Transcription].
Length = 548
Score = 62.0 bits (150), Expect = 8e-12
Identities = 23/68 (33%), Positives = 44/68 (64%)
Query: 75 DELKAKVERHRYHIRKLETLLRMLDNMSVEVDQIRRIKDDVEYYIESSQDIDFEENEYIY 134
E + + ERH +HI LE +L+ L N ++ + + +DD++YY+E++ D DF E + IY
Sbjct: 134 QENEEQTERHEFHIANLENILKKLQNNEMDPEPVEEFQDDIKYYVENNDDPDFIEYDTIY 193
Query: 135 DDIEDDVE 142
+D+ +++
Sbjct: 194 EDMGCEIQ 201
>gnl|CDD|221059 pfam11285, DUF3086, Protein of unknown function (DUF3086). This
family of proteins with unknown function appears to be
restricted to Cyanobacteria.
Length = 283
Score = 29.2 bits (66), Expect = 0.97
Identities = 10/27 (37%), Positives = 14/27 (51%), Gaps = 3/27 (11%)
Query: 21 PEILAQFGDRTDALEGRKFVPTIVIDD 47
+ L + DR LE R +P I+ID
Sbjct: 230 ADALIERADR---LEERGELPLILIDA 253
>gnl|CDD|131210 TIGR02155, PA_CoA_ligase, phenylacetate-CoA ligase.
Phenylacetate-CoA ligase (PA-CoA ligase) catalyzes the
first step in aromatic catabolism of phenylacetic acid
(PA) into phenylacetyl-CoA (PA-CoA). Often located in a
conserved gene cluster with enzymes involved in
phenylacetic acid activation (paaG/H/I/J),
phenylacetate-CoA ligase has been found among the
proteobacteria as well as in gram positive prokaryotes.
In the B-subclass proteobacterium Azoarcus evansii,
phenylacetate-CoA ligase has been shown to be induced
under aerobic and anaerobic growth conditions. It
remains unclear however, whether this induction is due
to the same enzyme or to another isoenzyme restricted to
specific anaerobic growth conditions [Energy metabolism,
Other].
Length = 422
Score = 28.6 bits (64), Expect = 1.5
Identities = 22/92 (23%), Positives = 45/92 (48%), Gaps = 12/92 (13%)
Query: 30 RTDALEGRKFVPTIVIDDLDQFESEIESLLVGKKK-----RLDKDKQDRMDELKAKVERH 84
R D + GR ++I ++ F +++E +++ + +L+ + MDEL KVE
Sbjct: 309 RMDRITGRS-DDMLIIRGVNVFPTQLEEVILKMDELSPHYQLELTRNGHMDELTLKVE-- 365
Query: 85 RYHIRKLETLLRMLDNMSVEVDQIRR-IKDDV 115
++ LR+ + S+ +I+ IK +V
Sbjct: 366 ---LKPESYTLRLHEQASLLAGEIQHTIKQEV 394
>gnl|CDD|223732 COG0659, SUL1, Sulfate permease and related transporters (MFS
superfamily) [Inorganic ion transport and metabolism].
Length = 554
Score = 28.8 bits (65), Expect = 1.7
Identities = 10/54 (18%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 91 LETLLRMLDNMSVEVDQIRRIKDDVEYYIESSQDIDFEENEYIYDDIEDDVEYY 144
LE L++ L+ +++ I + V + + + E+I+D ++ +E
Sbjct: 495 LEDLIKELERRGIQL-LIVGLSAQVLRLLRRAGLLYLVGAEHIFDSVDSALEKA 547
>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
Length = 653
Score = 28.2 bits (63), Expect = 2.5
Identities = 23/124 (18%), Positives = 51/124 (41%), Gaps = 19/124 (15%)
Query: 54 EIESLLVGKKKRLDKDKQDRMDELKAKVERHRYHIRKLETLLRMLDNMSVEVDQIRRIKD 113
+ +SLL + L ++ + +ELK + E+ K+E L L + ++ ++++++ +
Sbjct: 533 QADSLLYSYESTLKENGELISEELKQRAEQ------KVEQLEAALTDPNISLEELKQQLE 586
Query: 114 DVE----------YYIESSQDIDFEENEYIY---DDIEDDVEYYIESSQDIDFEENEYIY 160
+ + Y SQ D E + E + + DF++ E +
Sbjct: 587 EFQQALLAIGAEVYQQGGSQTTDTVEPTSDTLITATMNSSNETTLIDEFNFDFDDEETVA 646
Query: 161 DDIE 164
D E
Sbjct: 647 SDYE 650
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
play an important role in the degradation of both lipids
and cellular proteins. In order to perform this
degradative function, vacuoles/lysosomes contain
numerous hydrolases which have been transported in the
form of inactive precursors via the biosynthetic pathway
and are proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or Golgi
complex, or remain in the limiting membrane of the MVB
and are thereby transported to the limiting membrane of
the vacuole/lysosome as a consequence of fusion.
Therefore, the MVB sorting pathway plays a critical role
in the decision between recycling and degradation of
membrane proteins. A few archaeal sequences are also
present within this family.
Length = 169
Score = 27.2 bits (61), Expect = 3.1
Identities = 26/117 (22%), Positives = 53/117 (45%), Gaps = 25/117 (21%)
Query: 62 KKKRLDKDKQDRMDELKAKVERHRYHIRKLETLLRMLDNMS------------VEVDQIR 109
+KKR +K D++D A +E+ R I +T +L+ M +++D+I
Sbjct: 49 QKKRYEKQL-DQLDGQLANLEQVRMAIENAKTNQEVLNAMKGGAKAMKAMNKNMDIDKID 107
Query: 110 RIKDDVEYYIESSQDIDFEENEYIYDDIEDDVEYYIESSQDIDFEENEYIYDDIEDE 166
+ D++E ++ E D+I + + ++ + + D EE + D + DE
Sbjct: 108 DLMDEIE-----------DQMEK-ADEISEMLSDTLDDADEEDEEELDAELDALLDE 152
>gnl|CDD|227584 COG5259, RSC8, RSC chromatin remodeling complex subunit RSC8
[Chromatin structure and dynamics / Transcription].
Length = 531
Score = 27.9 bits (62), Expect = 3.2
Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 7/75 (9%)
Query: 20 GPEILAQFGDRTDALEGRKFV-PTIVIDDLDQFESEIESLLVGKK------KRLDKDKQD 72
GP + + F D D G P I+ EIE ++ K K+ Q
Sbjct: 152 GPPLTSHFQDLHDTPRGLSPFLPWGPINQRVLGAKEIEYETHKEENYSPSLKSPKKESQG 211
Query: 73 RMDELKAKVERHRYH 87
++DELK E+H
Sbjct: 212 KVDELKDHSEKHPSS 226
>gnl|CDD|224172 COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy
production and conversion].
Length = 405
Score = 27.6 bits (62), Expect = 3.5
Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 10/70 (14%)
Query: 81 VERHRYHIRKLETLLRML--DNMSVEVDQIR-----RIKDDVEYYIESSQDIDFEENEYI 133
V+R YH+ LL + +S I R +V++ DID + +
Sbjct: 34 VDRRDYHL--FTPLLYEVATGTLSESEIAIPLRALLRKSGNVQFVQGEVTDIDRDAKKVT 91
Query: 134 YDDIEDDVEY 143
D ++ Y
Sbjct: 92 LAD-LGEISY 100
>gnl|CDD|119335 cd06565, GH20_GcnA-like, Glycosyl hydrolase family 20 (GH20)
catalytic domain of N-acetyl-beta-D-glucosaminidase
(GcnA, also known as BhsA) and related proteins. GcnA
is an exoglucosidase which cleaves
N-acetyl-beta-D-galactosamine (NAG) and
N-acetyl-beta-D-galactosamine residues from
4-methylumbelliferylated (4MU) substrates, as well as
cleaving NAG from chito-oligosaccharides (i.e. NAG
polymers). In contrast, sulfated forms of the substrate
are unable to be cleaved and act instead as mild
competitive inhibitors. Additionally, the enzyme is
known to be poisoned by several first-row transition
metals as well as by mercury. GcnA forms a homodimer
with subunits comprised of three domains, an N-terminal
zincin-like domain, this central catalytic GH20 domain,
and a C-terminal alpha helical domain. The GH20
hexosaminidases are thought to act via a catalytic
mechanism in which the catalytic nucleophile is not
provided by solvent or the enzyme, but by the substrate
itself.
Length = 301
Score = 27.6 bits (62), Expect = 3.8
Identities = 13/74 (17%), Positives = 29/74 (39%), Gaps = 17/74 (22%)
Query: 56 ESLLVGKKKRLDKDKQDRMDELKAKVERHRYHIRKLETLLR-----------MLDNMSVE 104
E+ +G+ + L K EL + H++K+ +++ ML +S+E
Sbjct: 141 EAYDLGRGRSLRKHGNLGRGELYLE------HLKKVLKIIKKRGPKPMMWDDMLRKLSIE 194
Query: 105 VDQIRRIKDDVEYY 118
+ + + V
Sbjct: 195 PEALSGLPKLVTPV 208
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 26.8 bits (60), Expect = 4.3
Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 3/47 (6%)
Query: 120 ESSQDIDFEENEYIYDDIEDDVEYYIESSQDIDFE---ENEYIYDDI 163
E+ Q+ ++ +DD E ++ DI + ENE +YD I
Sbjct: 10 ETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENETLYDRI 56
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 27.4 bits (60), Expect = 5.6
Identities = 19/58 (32%), Positives = 36/58 (62%), Gaps = 1/58 (1%)
Query: 106 DQIRRIKDDVEYYIESSQDIDFEEN-EYIYDDIEDDVEYYIESSQDIDFEENEYIYDD 162
+ + I++++E IE + + + EEN E I +++E++VE E + + + EEN YDD
Sbjct: 1067 ENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDD 1124
Score = 26.7 bits (58), Expect = 8.8
Identities = 18/67 (26%), Positives = 40/67 (59%), Gaps = 2/67 (2%)
Query: 102 SVEVDQIRRIKDDVEYYIESSQDIDFEEN--EYIYDDIEDDVEYYIESSQDIDFEENEYI 159
+VE + ++++VE +E + + + EEN E + +++E+++E +E + + + EEN
Sbjct: 962 NVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEE 1021
Query: 160 YDDIEDE 166
YD+ E
Sbjct: 1022 YDEENVE 1028
>gnl|CDD|238935 cd01976, Nitrogenase_MoFe_alpha, Nitrogenase_MoFe_alpha_II:
Nitrogenase MoFe protein, beta subunit. A group of
proteins similar to the alpha subunit of the MoFe
protein of the molybdenum (Mo-) nitrogenase. The
nitrogenase enzyme catalyzes the ATP-dependent reduction
of dinitrogen to ammonia. The Mo-nitrogenase is the most
widespread and best characterized of these systems.
Mo-nitrogenase consists of the MoFe protein (component
1) and the Fe protein (component 2). MoFe is an
alpha2beta2 tetramer. Each alphabeta pair of MoFe
contains one P-cluster (at the alphabeta interface) and,
one molecule of iron molybdenum cofactor (FeMoco)
contained within the alpha subunit. The Fe protein
contains a single [4Fe-4S] cluster. Electrons are
transferred from the [4Fe-4S] cluster of the Fe protein
to the P-cluster of the MoFe and in turn to FeMoCo, the
site of substrate reduction.
Length = 421
Score = 26.9 bits (60), Expect = 5.9
Identities = 15/60 (25%), Positives = 26/60 (43%), Gaps = 7/60 (11%)
Query: 1 CAPVAGVNFRKLQTCVKSKGPEILA---QFGDRTDALEGRKFVP---TIVIDDLDQFESE 54
V G+ R + G E++ +F R D E + +P T++ DD+ +E E
Sbjct: 304 MLYVGGLRPRHYIGAYEDLGMEVVGTGYEFAHRDD-YERTEVIPKEGTLLYDDVTHYELE 362
>gnl|CDD|233222 TIGR00986, 3a0801s05tom22, mitochondrial import receptor subunit
Tom22. The mitochondrial protein translocase (MPT)
family, which brings nuclearly encoded preproteins into
mitochondria, is very complex with 19 currently
identified protein constituents.These proteins include
several chaperone proteins, four proteins of the outer
membrane translocase (Tom) import receptor, five
proteins of the Tom channel complex, five proteins of
the inner membrane translocase (Tim) and three "motor"
proteins. This family is specific for the Tom22 proteins
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 145
Score = 26.5 bits (58), Expect = 6.0
Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 8/59 (13%)
Query: 107 QIRRIKDDVEYYIESSQDIDFEENEYIYDDIEDDVEYYIESSQDIDFE--ENEYIYDDI 163
Q+ ++D+ E ++ EE++ + D++ E S D DFE E E D I
Sbjct: 3 QLTEVEDEHFQQPEQQTVVEPEEDDEDFTDVDS------EDSVDSDFESLEEETFTDRI 55
>gnl|CDD|219836 pfam08429, PLU-1, PLU-1-like protein. Sequences in this family
bear similarity to the central region of PLU-1. This is
a nuclear protein that may have a role in DNA-binding
and transcription, and is closely associated with the
malignant phenotype of breast cancer. This region is
found in various other Jumonji/ARID domain-containing
proteins (see pfam02373, pfam01388).
Length = 335
Score = 26.9 bits (60), Expect = 6.5
Identities = 20/115 (17%), Positives = 51/115 (44%), Gaps = 13/115 (11%)
Query: 44 VIDDLDQFESEIESLLVGKKKRLDKDKQDRMDELKAKVERHRYHIRKLETLLRMLDNMSV 103
+ + + + + + LL K++ K ++ + R + + +L LL+ D++
Sbjct: 46 FVQEAESWVEKAQQLLSRKQQTRRKTEKA-----DDEKSREKRSLEELYALLKEADSLPF 100
Query: 104 ---EVDQIRRIKDDVEYYIESSQD-IDFEENEYIYDDIEDDVEYYIESSQDIDFE 154
E+DQ+R + + +E + E +Q + E+ ++E+ +E + E
Sbjct: 101 DCPEIDQLRELLEAIEEFRERAQSALSTPEDSLSIQELEE----LLEEGESFGVE 151
>gnl|CDD|236017 PRK07428, PRK07428, nicotinate-nucleotide pyrophosphorylase;
Provisional.
Length = 288
Score = 26.5 bits (59), Expect = 6.6
Identities = 24/64 (37%), Positives = 29/64 (45%), Gaps = 19/64 (29%)
Query: 97 MLDNMSVE-----VDQIR----RIKDDVEYYIESSQDIDFEENEYIYDDIEDDVEYYIES 147
MLDNM V+ V IR R+K IE+S +I E I E V+ YI S
Sbjct: 221 MLDNMPVDLMQQAVQLIRQQNPRVK------IEASGNITLET---IRAVAETGVD-YISS 270
Query: 148 SQDI 151
S I
Sbjct: 271 SAPI 274
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 26.9 bits (59), Expect = 7.9
Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 8/78 (10%)
Query: 64 KRLDKDKQDRMDELKAKVER--HRYHIRKLETLLRMLDNMSVEVDQIRRIKDDVEYYIES 121
RL +D + E A+ + +R + LL LD + I++ +E +S
Sbjct: 829 ARLQTLTEDALPEFLARFQELLNRSSDDGVTQLLSHLD------HERALIEERIEAINDS 882
Query: 122 SQDIDFEENEYIYDDIED 139
+ +DF Y++ DI
Sbjct: 883 LRRVDFNSGRYLHIDIAK 900
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 26.6 bits (59), Expect = 8.1
Identities = 19/87 (21%), Positives = 38/87 (43%), Gaps = 3/87 (3%)
Query: 30 RTDALEGRKFVPTIVIDDLDQFESEIESLLVGKKKRLDKDKQDRMDELKAK---VERHRY 86
RT+ L + V + + L Q + IE+ K RL + K+ EL + +
Sbjct: 716 RTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKE 775
Query: 87 HIRKLETLLRMLDNMSVEVDQIRRIKD 113
R++E L ++ ++V ++R +
Sbjct: 776 LKRQIEELETTIERIAVRRPEVREYRA 802
>gnl|CDD|237418 PRK13534, PRK13534, 7-cyano-7-deazaguanine tRNA-ribosyltransferase;
Provisional.
Length = 639
Score = 26.5 bits (59), Expect = 8.7
Identities = 8/41 (19%), Positives = 15/41 (36%)
Query: 120 ESSQDIDFEENEYIYDDIEDDVEYYIESSQDIDFEENEYIY 160
ES + D+E E+ + + + E DI +
Sbjct: 439 ESPELFDYEGKEFNNEFVSEFKVIIHEKILDILTYNYYINH 479
>gnl|CDD|232885 TIGR00230, sfsA, sugar fermentation stimulation protein. probable
regulatory factor involved in maltose metabolism
contains a putative DNA binding domain. Isolated as a
gene which enabled E.coli strain MK2001 to use maltose
[Energy metabolism, Sugars, Regulatory functions,
Other].
Length = 232
Score = 26.3 bits (58), Expect = 9.0
Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 18/74 (24%)
Query: 40 VPTIVIDDLDQFESEIESLLVGKKKRLD----KDKQDRMDELK--------------AKV 81
+P+ I +LD +E + G + R+D + + E+K A
Sbjct: 93 IPSSKISELDGYEVRKREVKYGNRSRIDFLLGESNERMYVEVKSATLKKEILALFPDAPT 152
Query: 82 ERHRYHIRKLETLL 95
ER R H+R+LE +L
Sbjct: 153 ERGRKHLRELEEIL 166
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided
into two structural domains; the N-terminal two-thirds
of the protein has no obvious structural motifs but is
the region for binding to Nup98, one of the components
of the nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 26.4 bits (58), Expect = 9.0
Identities = 17/78 (21%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 49 DQFE--SEIESLLVGKKKRLDKDKQDRMDELKAKVERHRYHIRKLETLLRMLDNMSVEVD 106
++FE + LLV + KRL Q +L + R ++L+ + + L +++ +
Sbjct: 600 EKFEEAKYNQELLVNRCKRL---LQSANSQLPVLSDSERDMSKELQRINKQLQHLANGIK 656
Query: 107 QIRRIKDDVEYYIESSQD 124
Q+++ K+ Y++ S +
Sbjct: 657 QVKKKKNYQRYHMASQES 674
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 26.6 bits (59), Expect = 9.8
Identities = 13/124 (10%), Positives = 42/124 (33%), Gaps = 1/124 (0%)
Query: 46 DDLDQFESEIESLLVGKKKRLDKDKQDRMDELKAKVERHRYHIRKLETLLRMLDNMSVEV 105
D L++F+ + E + + E+K + + L+ + +
Sbjct: 688 DVLNEFKEQSEDIGYLEDAGELLRDFAPELEIKVERGNKIL-AKLFGRFLKAFNALKKLY 746
Query: 106 DQIRRIKDDVEYYIESSQDIDFEENEYIYDDIEDDVEYYIESSQDIDFEENEYIYDDIED 165
+ ++++++Y + + ++ + Y D+ D++ E + D +
Sbjct: 747 EFAYTLEEEIQYLEDLLKARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELE 806
Query: 166 ECFI 169
Sbjct: 807 LIGK 810
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.139 0.388
Gapped
Lambda K H
0.267 0.0847 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,409,535
Number of extensions: 938385
Number of successful extensions: 1886
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1837
Number of HSP's successfully gapped: 205
Length of query: 174
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 84
Effective length of database: 6,945,742
Effective search space: 583442328
Effective search space used: 583442328
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.1 bits)