RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13744
(296 letters)
>gnl|CDD|240510 cd06086, KOW_Spt5_6, KOW domain of Spt5, repeat 6. Spt5, an
eukaryotic ortholog of NusG, contains multiple KOW
motifs at its C-terminus. Spt5 is involved in
transcription elongation and termination. KOW domain is
known as an RNA-binding motif that is shared so far
among some families of ribosomal proteins, the essential
bacterial transcriptional elongation factor NusG, the
eukaryotic chromatin elongation factor Spt5, the higher
eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains
play critical roles in recruitment of multiple other
eukaryotic transcription elongation and RNA biogenesis
factors and additionally are involved in the binding of
the eukaryotic Spt5 proteins to RNA polymerases.
Length = 58
Score = 91.4 bits (228), Expect = 6e-24
Identities = 25/57 (43%), Positives = 38/57 (66%)
Query: 238 ELEPVMPQPNDKVKVIVGEHKECTGVLLSVDNGEGVVKLTEEDDVKMIDVKFLCKYK 294
LEPV P+ D+VKVI GE + TG L+S+D +G+VK+ + D+K++ + FL K
Sbjct: 2 HLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKLV 58
>gnl|CDD|240509 cd06085, KOW_Spt5_5, KOW domain of Spt5, repeat 5. Spt5, an
eukaryotic ortholog of NusG, contains multiple KOW
motifs at its C-terminus. Spt5 is involved in
transcription elongation and termination. KOW domain is
known as an RNA-binding motif that is shared so far
among some families of ribosomal proteins, the
essential bacterial transcriptional elongation factor
NusG, the eukaryotic chromatin elongation factor Spt5,
the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5
domains play critical roles in recruitment of multiple
other eukaryotic transcription elongation and RNA
biogenesis factors and additionally are involved in the
binding of the eukaryotic Spt5 proteins to RNA
polymerases.
Length = 52
Score = 35.9 bits (84), Expect = 0.001
Identities = 10/18 (55%), Positives = 12/18 (66%)
Query: 37 RDRELIGKTIKITGGPYK 54
LIGKT++I GPYK
Sbjct: 1 GRDPLIGKTVRIRKGPYK 18
>gnl|CDD|240506 cd06082, KOW_Spt5_2, KOW domain of Spt5, repeat 2. Spt5, an
eukaryotic ortholog of NusG, contains multiple KOW
motifs at its C-terminus. Spt5 is involved in
transcription elongation and termination. KOW domain is
known as an RNA-binding motif that is shared so far
among some families of ribosomal proteins, the essential
bacterial transcriptional elongation factor NusG, the
eukaryotic chromatin elongation factor Spt5, the higher
eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains
play critical roles in recruitment of multiple other
eukaryotic transcription elongation and RNA biogenesis
factors and additionally are involved in the binding of
the eukaryotic Spt5 proteins to RNA polymerases.
Length = 51
Score = 35.6 bits (83), Expect = 0.002
Identities = 15/44 (34%), Positives = 21/44 (47%)
Query: 245 QPNDKVKVIVGEHKECTGVLLSVDNGEGVVKLTEEDDVKMIDVK 288
QP D V+VI GE K G + SVD + ED + ++
Sbjct: 2 QPGDNVEVIEGELKGLQGKVESVDGDIVTIMPKHEDLKEPLEFP 45
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 36.0 bits (83), Expect = 0.022
Identities = 23/112 (20%), Positives = 31/112 (27%), Gaps = 1/112 (0%)
Query: 59 TPGGGATPLHDGSRTPGPWDPSSTPA-RSIHNDAYPMDDSFNNSYPHTPGTMYGSSSEHT 117
P P R P P P + + S + S
Sbjct: 307 LPASLPPPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQPATSTSSKGSSSAQNKDSGSTG 366
Query: 118 SYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSR 169
S L A+ S ++ L PSPS +AP PL+Y S
Sbjct: 367 PGSSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLSD 418
Score = 29.1 bits (65), Expect = 3.0
Identities = 22/127 (17%), Positives = 32/127 (25%), Gaps = 6/127 (4%)
Query: 69 DGSRTPGPWDPSSTPARSIHNDAYPMDDSFNNSYPHTPGTMYGSSSEHTSYSPLQA---- 124
S P + P +P + + P Q
Sbjct: 290 TADLADDDGSEHSDPEPLPASLPPPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQPATST 349
Query: 125 -SPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQASPFTPGTGLDQY 183
S S Q+ S+G S L A S + P L+ S R P T +
Sbjct: 350 SSKGSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGSPPLDLTTSLR-HMPSPSVTSAPEP 408
Query: 184 PTPEWHT 190
P+
Sbjct: 409 PSIPLTY 415
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 34.7 bits (79), Expect = 0.065
Identities = 14/86 (16%), Positives = 24/86 (27%), Gaps = 4/86 (4%)
Query: 110 YGSSSEHTSYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSY--- 166
+ + P G Q SS + S S A+ +G +
Sbjct: 117 KDDMEFKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESAT 176
Query: 167 -SSRQASPFTPGTGLDQYPTPEWHTT 191
S + ++ G D+ P T
Sbjct: 177 DSGKASASVAGIVGADEEAPPAPKNT 202
>gnl|CDD|128978 smart00739, KOW, KOW (Kyprides, Ouzounis, Woese) motif. Motif in
ribosomal proteins, NusG, Spt5p, KIN17 and T54.
Length = 28
Score = 30.0 bits (69), Expect = 0.097
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 244 PQPNDKVKVIVGEHKECTGVLLSVD 268
+ D V+VI G K G +L VD
Sbjct: 2 FEVGDTVRVIAGPFKGKVGKVLEVD 26
>gnl|CDD|240517 cd13153, KOW_GPKOW_B, KOW motif of the "G-patch domain and KOW
motifs-containing protein" (GPKOW) repeat B. GPKOW
contains one G-patch domain and two KOW motifs. GPKOW is
a nuclear protein that regulated by catalytic (C)
subunit of Protein Kinase A (PKA) and bind RNA in vivo.
PKA may be involved in regulating multiple steps in
post-transcriptional processing of pre-mRNAs. KOW domain
is known as an RNA-binding motif that is shared so far
among some families of ribosomal proteins, the essential
bacterial transcriptional elongation factor NusG, the
eukaryotic chromatin elongation factor Spt5, the higher
eukaryotic KIN17 proteins and Mtr4. GPKOW is also known
as the T54 protein or MOS2 homolog.
Length = 51
Score = 30.6 bits (70), Expect = 0.11
Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 2/49 (4%)
Query: 247 NDKVKVIVGEHKECTGVLLSVD--NGEGVVKLTEEDDVKMIDVKFLCKY 293
V V+ G+H+ G LL D VV+L ++ V + +C+Y
Sbjct: 1 GAYVMVVRGKHRGQVGKLLERDKKKETAVVQLLDDFSVVTLSYDDICEY 49
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 33.1 bits (76), Expect = 0.14
Identities = 21/116 (18%), Positives = 29/116 (25%), Gaps = 3/116 (2%)
Query: 72 RTPGPWDPSSTPARSIHNDAYPMDDSFNNSYPHTPGTMYGSSSEHTSYSPLQASPSPSGY 131
P P P D+SF S S+ S SP S
Sbjct: 146 EDPNPGPPLDEEDEDADVATTNSDNSFP---GEDADPASASPSDPPSSSPGVPSFPSPPE 202
Query: 132 QSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQASPFTPGTGLDQYPTPE 187
S S + S + +P +P A P P + T +
Sbjct: 203 DPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAK 258
Score = 28.9 bits (65), Expect = 3.3
Identities = 20/99 (20%), Positives = 30/99 (30%), Gaps = 5/99 (5%)
Query: 94 MDDSFNNSYPHTPGTMYGSSSEHTSYSPLQASPSPSGYQSHSG--VSSGSSSYLGATPSP 151
D+ + + ++ + G ++ S SP S G S S S P+P
Sbjct: 157 EDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAP 216
Query: 152 SSEAAPYGTPSPLSYSSRQASPFTPGTGLDQYPTPEWHT 190
SS P S S PG P +
Sbjct: 217 SS---FQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVP 252
Score = 27.3 bits (61), Expect = 9.9
Identities = 27/127 (21%), Positives = 33/127 (25%), Gaps = 20/127 (15%)
Query: 69 DGSRTPGPWDPSSTPARSIHNDAYPMDDSFNNSYPHTPGTMYGSSSEHTSYSPLQASPSP 128
D S DP+S S + P P SS S P P+P
Sbjct: 169 DNSFPGEDADPASASP--------SDPPSSSPGVPSFPSPPEDPSSPSDSSLP----PAP 216
Query: 129 SGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQAS---PFTPGTGLDQYPT 185
S +QS + +PS P P P S P P P
Sbjct: 217 SSFQSD-----TPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPI 271
Query: 186 PEWHTTD 192
D
Sbjct: 272 GGITLDD 278
>gnl|CDD|173871 cd08506, PBP2_clavulanate_OppA2, The substrate-binding domain of an
oligopeptide binding protein (OppA2) from the
biosynthesis pathway of the beta-lactamase inhibitor
clavulanic acid contains the type 2 periplasmic binding
fold. Clavulanic acid (CA), a clinically important
beta-lactamase inhibitor, is one of a family of clavams
produced as secondary metabolites by fermentation of
Streptomyces clavuligeru. The biosynthesis of CA
proceeds via multiple steps from the precursors,
glyceraldehyde-3-phosphate and arginine. CA possesses a
characteristic (3R,5R) stereochemistry essential for
reaction with penicillin-binding proteins and
beta-lactamases. Two genes (oppA1 and oppA2) in the
clavulanic acid gene cluster encode oligopeptide-binding
proteins that are required for CA biosynthesis. OppA1/2
is involved in the binding and transport of peptides
across the cell membrane of Streptomyces clavuligerus.
Most of other periplasmic binding proteins are comprised
of only two globular subdomains corresponding to domains
I and III of the dipeptide/oligopeptide binding
proteins. The structural topology of these domains is
most similar to that of the type 2 periplasmic binding
proteins (PBP2), which are responsible for the uptake of
a variety of substrates such as phosphate, sulfate,
polysaccharides, lysine/arginine/ornithine, and
histidine. The PBP2 bind their ligand in the cleft
between these domains in a manner resembling a Venus
flytrap. After binding their specific ligand with high
affinity, they can interact with a cognate membrane
transport complex comprised of two integral membrane
domains and two cytoplasmically located ATPase domains.
This interaction triggers the ligand translocation
across the cytoplasmic membrane energized by ATP
hydrolysis. Besides transport proteins, the PBP2
superfamily includes the ligand-binding domains from
ionotropic glutamate receptors, LysR-type
transcriptional regulators, and unorthodox sensor
proteins involved in signal transduction.
Length = 466
Score = 33.0 bits (76), Expect = 0.18
Identities = 19/43 (44%), Positives = 21/43 (48%), Gaps = 9/43 (20%)
Query: 51 GPYKVGSRTPGGGATPLHDGSRTPGPWDPSSTPARSIHNDAYP 93
GPYK+ S PG G L R P WD + P R DAYP
Sbjct: 146 GPYKIESYDPGKGLV-L---VRNPH-WDAETDPIR----DAYP 179
>gnl|CDD|211858 TIGR03665, arCOG04150, arCOG04150 universal archaeal KH domain
protein. This family of proteins is universal among the
41 archaeal genomes analyzed in and is not observed
outside of the archaea. The proteins contain a single KH
domain (pfam00013) which is likely to confer the ability
to bind RNA.
Length = 172
Score = 32.1 bits (74), Expect = 0.20
Identities = 16/43 (37%), Positives = 25/43 (58%), Gaps = 7/43 (16%)
Query: 246 PNDKVKVIVGEHKEC-------TGVLLSVDNGEGVVKLTEEDD 281
P D++ V++G+ E TGV L +D+ G VK+ EED+
Sbjct: 5 PKDRIGVLIGKGGETKKEIEERTGVKLDIDSETGEVKIEEEDE 47
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 31.8 bits (72), Expect = 0.23
Identities = 20/71 (28%), Positives = 27/71 (38%), Gaps = 6/71 (8%)
Query: 108 TMYGSSSEHTSYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYS 167
+ H+S+ S S + SH + GS S S S P + SP S
Sbjct: 48 NLLAGHRSHSSHR------SHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGS 101
Query: 168 SRQASPFTPGT 178
S Q+ P T G
Sbjct: 102 SIQSLPSTTGV 112
Score = 28.3 bits (63), Expect = 3.5
Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 1/59 (1%)
Query: 129 SGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQASPFTPGTGLDQ-YPTP 186
+G++SHS S SS + + S + P G S SY S Q P+
Sbjct: 51 AGHRSHSSHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQSLPST 109
>gnl|CDD|240291 PTZ00146, PTZ00146, fibrillarin; Provisional.
Length = 293
Score = 32.0 bits (73), Expect = 0.35
Identities = 21/59 (35%), Positives = 22/59 (37%), Gaps = 5/59 (8%)
Query: 18 GGRGGRGGGRGGHHGGNVRRDRELIGKTIKITGGPYKVGSRTPGGGATPLHDGSRTPGP 76
G GG GGGRGG GG R G+ GG G GG G GP
Sbjct: 1 GMGGGFGGGRGGGRGGGGGGGRGGGGR-----GGGRGGGRGRGRGGGGGGRGGGGGGGP 54
Score = 29.7 bits (67), Expect = 1.9
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 16 PSGGRGGRGGGRGGHHGGNV 35
GG GGRGGG GG G +
Sbjct: 39 RGGGGGGRGGGGGGGPGKVI 58
Score = 28.9 bits (65), Expect = 2.7
Identities = 12/17 (70%), Positives = 12/17 (70%)
Query: 17 SGGRGGRGGGRGGHHGG 33
GRGG GGGRGG GG
Sbjct: 36 GRGRGGGGGGRGGGGGG 52
Score = 27.8 bits (62), Expect = 6.3
Identities = 11/17 (64%), Positives = 11/17 (64%)
Query: 17 SGGRGGRGGGRGGHHGG 33
G GGRGGGRGG G
Sbjct: 21 GRGGGGRGGGRGGGRGR 37
>gnl|CDD|240507 cd06083, KOW_Spt5_3, KOW domain of Spt5, repeat 3. Spt5, an
eukaryotic ortholog of NusG, contains multiple KOW
motifs at its C-terminus. Spt5 is involved in
transcription elongation and termination. KOW domain is
known as an RNA-binding motif that is shared so far
among some families of ribosomal proteins, the essential
bacterial transcriptional elongation factor NusG, the
eukaryotic chromatin elongation factor Spt5, the higher
eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains
play critical roles in recruitment of multiple other
eukaryotic transcription elongation and RNA biogenesis
factors and additionally are involved in the binding of
the eukaryotic Spt5 proteins to RNA polymerases.
Length = 51
Score = 29.0 bits (66), Expect = 0.39
Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 4/44 (9%)
Query: 245 QPNDKVKVIVGEHKECTGVLLSVDNGEGVVKLTEEDDVKMIDVK 288
+ D VKVI G H+ TG+++ V++ V + D+ M ++K
Sbjct: 3 KVGDHVKVISGRHEGETGLVVKVEDDV-VTVFS---DLTMRELK 42
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 32.0 bits (73), Expect = 0.40
Identities = 25/114 (21%), Positives = 40/114 (35%), Gaps = 12/114 (10%)
Query: 67 LHDGSRTPGPWDPSSTPARSIHNDAYPMDDSF--------NNSYPHTPGTMYGSSSEHTS 118
LH ++P PSST ++ + N + + ++ + + TS
Sbjct: 17 LHHSKKSP----PSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPINNSVNHKITS 72
Query: 119 YSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQAS 172
S + PS S + S +SGS S L PS S + S S
Sbjct: 73 SSSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQ 126
>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
Length = 456
Score = 31.7 bits (72), Expect = 0.46
Identities = 19/66 (28%), Positives = 26/66 (39%), Gaps = 1/66 (1%)
Query: 18 GGRGGRGGGRGGHHGGNVRRDRELIGKTIKITGGPYKVGSRTPGGGATPLHDGSRTPGPW 77
RGG G G+GG G + R G + P + SR G A P + R P
Sbjct: 392 QQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKPSR-RLGDAKPAGEQQRRRRPR 450
Query: 78 DPSSTP 83
P++
Sbjct: 451 KPAAAQ 456
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 31.7 bits (72), Expect = 0.49
Identities = 24/125 (19%), Positives = 41/125 (32%), Gaps = 15/125 (12%)
Query: 64 ATPLHDGSRTPGPWDPSSTPARSIH---------NDAYPMDDSFNNSYPHTPGTM----- 109
A+P +GS TP P + D M + P +
Sbjct: 98 ASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
Query: 110 -YGSSSEHTSYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSS 168
+ + + S A P S ++ SS + +TP ++ P SP+S S+
Sbjct: 158 SPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA 217
Query: 169 RQASP 173
+P
Sbjct: 218 SSPAP 222
Score = 29.0 bits (65), Expect = 4.3
Identities = 27/122 (22%), Positives = 36/122 (29%), Gaps = 1/122 (0%)
Query: 57 SRTPGGGATPLHDGSRTPGPWDPSSTPARSIHNDAYPMDDSFNNSYPHTPGTMYGSSSEH 116
S + G G P ++ T P S +P S S
Sbjct: 242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPS 301
Query: 117 TSYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYG-TPSPLSYSSRQASPFT 175
+ S S + S S S SSS ++ S A G +PS SR P
Sbjct: 302 SPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPAD 361
Query: 176 PG 177
P
Sbjct: 362 PS 363
>gnl|CDD|240513 cd06089, KOW_RPL26, KOW motif of Ribosomal Protein L26. RPL26 and
its bacterial paralogs RPL24 have a KOW motif at their N
terminal. KOW domain is known as an RNA-binding motif
that is shared so far among some families of ribosomal
proteins, the essential bacterial transcriptional
elongation factor NusG, the eukaryotic chromatin
elongation factor Spt5, the higher eukaryotic KIN17
proteins and Mtr4. RPL26 makes a very minor
contributions to the biogenesis, structure, and function
of 60s ribosomal subunits. However, RPL24 is essential
to generate the first intermediate during 50s ribosomal
subunits assembly. RPL26 have an extra-ribosomal
function to enhances p53 translation after DNA damage.
Length = 65
Score = 29.0 bits (66), Expect = 0.50
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 247 NDKVKVIVGEHKECTGVLLSVDNGEGVV 274
D+V+VI G+ K G +L VD + V
Sbjct: 1 GDEVQVIRGKDKGKQGKVLKVDRKKNRV 28
>gnl|CDD|240515 cd06091, KOW_NusG, NusG contains an NGN domain at its N-terminus
and KOW motif at its C-terminus. KOW_NusG motif is one
of the two domains of N-Utilization Substance G (NusG) a
transcription elongation and Rho-termination factor in
bacteria and archaea. KOW domain is known as an
RNA-binding motif that is shared so far among some
families of ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. The eukaryotic ortholog of NusG
is Spt5 with multiple KOW motifs at its C-terminus.
Length = 56
Score = 28.6 bits (65), Expect = 0.55
Identities = 9/33 (27%), Positives = 16/33 (48%)
Query: 245 QPNDKVKVIVGEHKECTGVLLSVDNGEGVVKLT 277
+ D V++I G G + +D +G VK+
Sbjct: 5 EVGDTVRIISGPFAGFEGKVEEIDEEKGKVKVL 37
>gnl|CDD|237494 PRK13763, PRK13763, putative RNA-processing protein; Provisional.
Length = 180
Score = 30.6 bits (70), Expect = 0.61
Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 7/44 (15%)
Query: 246 PNDKVKVIVGEHKEC-------TGVLLSVDNGEGVVKLTEEDDV 282
P D++ V++G+ E TGV L +D+ G V + D
Sbjct: 10 PKDRIGVLIGKKGETKKEIEERTGVKLEIDSETGEVIIEPTDGE 53
>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
Length = 572
Score = 30.7 bits (69), Expect = 1.0
Identities = 19/74 (25%), Positives = 22/74 (29%), Gaps = 3/74 (4%)
Query: 3 GGFMSPRISSPMHPSGGRGGRGGGRGGHHGGNVRRDRELIGKTIKITGGPYK--VGSRTP 60
GG P S G GGR G G RR + G+ G V +
Sbjct: 434 GGRSGPG-GGSRSGSVGGGGRRDGAGADGKPRPRRKPRVEGEADAAAAGAETPVVAAAAA 492
Query: 61 GGGATPLHDGSRTP 74
DG R P
Sbjct: 493 QAPGVVAADGERAP 506
>gnl|CDD|99995 cd04299, GT1_Glycogen_Phosphorylase_like, This family is most
closely related to the oligosaccharide phosphorylase
domain family and other unidentified sequences.
Oligosaccharide phosphorylase catalyzes the breakdown of
oligosaccharides into glucose-1-phosphate units. They
are important allosteric enzymes in carbohydrate
metabolism. The members of this family are found in
bacteria and Archaea.
Length = 778
Score = 30.6 bits (70), Expect = 1.0
Identities = 14/33 (42%), Positives = 17/33 (51%), Gaps = 8/33 (24%)
Query: 196 RINENARDPEFRGQI--------GVIRHLGSGV 220
I E +R PEFRG+I + RHL GV
Sbjct: 535 EIVEFSRRPEFRGRIVFLEDYDMALARHLVQGV 567
>gnl|CDD|240504 cd00380, KOW, KOW: an acronym for the authors' surnames (Kyrpides,
Ouzounis and Woese). KOW domain is known as an
RNA-binding motif that is shared so far among some
families of ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. The KOW motif contains an
invariants glycine residue and comprises alternating
blocks of hydrophilic and hydrophobic residues.
Length = 49
Score = 27.6 bits (62), Expect = 1.2
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 247 NDKVKVIVGEHKECTGVLLSVD 268
D V+V+ G +K GV++ +D
Sbjct: 1 GDVVRVLRGPYKGREGVVVDID 22
>gnl|CDD|180777 PRK06958, PRK06958, single-stranded DNA-binding protein;
Provisional.
Length = 182
Score = 29.8 bits (67), Expect = 1.3
Identities = 27/83 (32%), Positives = 30/83 (36%), Gaps = 9/83 (10%)
Query: 14 MHPSGGRGGRGGGRGGHHGGNVRRDRELIGKTIKITGGPYKVGSRTPGGGATPLHDGSRT 73
M GGRGG GGG GG G GG Y SR+ GGG G
Sbjct: 105 MQMLGGRGGSGGGGGGGDEGGYGGGGGG-------GGGGYGGESRSGGGGGRASGGGGGG 157
Query: 74 PGPWDPSSTPARSIHNDAYPMDD 96
G +S P+ MDD
Sbjct: 158 AGG--GASRPSAPAGGGFDEMDD 178
>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
protein.
Length = 226
Score = 29.6 bits (67), Expect = 1.4
Identities = 27/83 (32%), Positives = 34/83 (40%), Gaps = 17/83 (20%)
Query: 107 GTMYG--SSSEHTSYS----------PLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSE 154
GT Y +S TSYS P +S S S S S S+G SS ++ S SS
Sbjct: 102 GTRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASSR 161
Query: 155 AAPYGTPSPLSYSSRQASPFTPG 177
T S + + PF PG
Sbjct: 162 -----TSSWVRSQNSNLEPFMPG 179
>gnl|CDD|217769 pfam03867, FTZ, Fushi tarazu (FTZ), N-terminal region. This region
contains the important motif (LXXLL) necessary for the
interaction of FTZ with the nuclear receptor FTZ-F1. FTZ
is thought to represents a category of LXXLL
motif-dependent co-activators for nuclear receptors.
Length = 264
Score = 29.4 bits (65), Expect = 2.2
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 92 YPMDDSFNNSYPHT-PGTMYGSSSEHTSYSPLQASPSPSGYQSHSGVSSGSSSY 144
Y D+ +N + H+ P T Y +SS +SY P+ YQS+ SY
Sbjct: 10 YYADNMYNMYHAHSLPPTYYDNSSSSSSYQQTSQGWQPASYQSNYYAHYSQESY 63
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 29.7 bits (67), Expect = 2.3
Identities = 30/123 (24%), Positives = 39/123 (31%), Gaps = 11/123 (8%)
Query: 47 KITGGPYKVGSRTPGGGATPLHDGSRTPGPWDPSSTPARSIHNDAYPMDDSFNNSYPHTP 106
+ G K T A S+T P + S P H+ SF++S +
Sbjct: 99 GLLGVKAKDSQFTVVSQAKKSPPASKTSTPMNTSE-PLVPGHS-------SFSDSPSRSA 150
Query: 107 GTM-YGSSSEHTSYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLS 165
S S SP + S S S SS LG P SS SP +
Sbjct: 151 SPSRKFSPSSTIQQSPQLTPSNKPASPSSSYQSPSYSSSLG--PVNSSGNRSNLRSSPWA 208
Query: 166 YSS 168
S
Sbjct: 209 LRS 211
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 29.5 bits (66), Expect = 2.5
Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 1/60 (1%)
Query: 117 TSYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQASPFTP 176
TS Q SP+PS ++S SS A P + P S S++ SP P
Sbjct: 15 TSSGEEQ-SPNPSSHKSKPSSRKLKSSKENAPPPDLNSLTSDLKPDHRSASAKLKSPLPP 73
>gnl|CDD|234566 PRK00004, rplX, 50S ribosomal protein L24; Reviewed.
Length = 105
Score = 27.8 bits (63), Expect = 2.6
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 247 NDKVKVIVGEHKECTGVLLSVDNGEGVV 274
D V VI G+ K G +L V + V
Sbjct: 8 GDTVIVIAGKDKGKRGKVLKVLPKKNKV 35
>gnl|CDD|214374 CHL00141, rpl24, ribosomal protein L24; Validated.
Length = 83
Score = 27.3 bits (61), Expect = 3.0
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 6/42 (14%)
Query: 248 DKVKVIVGEHKECTGVLLSVDNGEGVVKLTEEDDVKMIDVKF 289
D VK+I G K G +L + V VK I++KF
Sbjct: 13 DTVKIISGSDKGKIGEVLKIIKKSNKVI------VKGINIKF 48
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
N-terminal. This is the eukaryotic DNA polymerase alpha
subunit B N-terminal domain which is involved in complex
formation. Also see pfam04058.
Length = 239
Score = 28.9 bits (65), Expect = 3.1
Identities = 25/107 (23%), Positives = 38/107 (35%), Gaps = 17/107 (15%)
Query: 104 HTPG---TMYGSSSEHTSYSPLQASPS-PSGYQS--HSGVSSGSSSYLGATPSPSSEAAP 157
P ++ S + + + S P + SS S L +P+ S +A
Sbjct: 80 EVPEAEESLLDSYTTPSDKGGMLRILSTPELPKRKRSFSASSLESPSLFFSPASFSPSAA 139
Query: 158 YGTPSPLS--YSSRQASPFTPGT-----GLDQYPTPEWHTTDIEVRI 197
TPSP S +SSR PG TPE D + ++
Sbjct: 140 PSTPSPNSAKFSSRSN----PGEVVETLNPHLGQTPEGGGPDSDPKV 182
>gnl|CDD|240519 cd13155, KOW_KIN17, KOW_Kin17 is a RNA-binding motif. KOW domain
of the KIN17protein contributes to the RNA-binding
properties of the whole protein. KOW domain is known as
an RNA-binding motif that is shared so far among some
families of ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. KIN17 is conserved from yeast
to human that ubiquitously expressed at low levels in
mammals tissue and have functions in DNA replication,
DNA repair and cell cycle control.
Length = 54
Score = 26.3 bits (59), Expect = 3.2
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 249 KVKVIVGEHKECTGVLLSVD 268
+V V+ G ++ C L S+D
Sbjct: 4 RVLVVNGAYRGCEATLESID 23
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 29.0 bits (65), Expect = 3.3
Identities = 16/64 (25%), Positives = 22/64 (34%)
Query: 123 QASPSPSGYQSHSGVSSGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQASPFTPGTGLDQ 182
A+ + + S SS +P S P GTP +S A P P + Q
Sbjct: 386 PAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445
Query: 183 YPTP 186
P
Sbjct: 446 AVRP 449
>gnl|CDD|183399 PRK12281, rplX, 50S ribosomal protein L24; Reviewed.
Length = 76
Score = 26.9 bits (60), Expect = 3.3
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 247 NDKVKVIVGEHKECTGVLLSV 267
D VKVI G+ K TG +L+V
Sbjct: 10 GDMVKVIAGDDKGKTGKVLAV 30
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 28.5 bits (64), Expect = 3.6
Identities = 27/132 (20%), Positives = 36/132 (27%), Gaps = 14/132 (10%)
Query: 57 SRTPGGGATPLHDGSRTPGPWDPSSTPARSIHNDAYPMDDSFNNSYPHTPGTMYGSSSEH 116
GG P G G P + P+ PG S
Sbjct: 180 CGGVRGGPRPERAGYGGGGGGGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAGPSSGYGS 239
Query: 117 TSYSPLQASPSP-----------SGY---QSHSGVSSGSSSYLGATPSPSSEAAPYGTPS 162
Q P GY + SS SY +PS S ++ + S
Sbjct: 240 GGTRSGQGGWGPGFWTGLGAGGALGYLFGSRRNNNSSYGRSYGSGSPSYSPSSSSNSSSS 299
Query: 163 PLSYSSRQASPF 174
S S+R +S F
Sbjct: 300 SSSSSTRTSSGF 311
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 28.7 bits (64), Expect = 4.1
Identities = 12/51 (23%), Positives = 22/51 (43%)
Query: 141 SSSYLGATPSPSSEAAPYGTPSPLSYSSRQASPFTPGTGLDQYPTPEWHTT 191
+ + A P+ + +AP TPSP + + S ++ P+ E T
Sbjct: 79 TGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATA 129
>gnl|CDD|143398 cd07079, ALDH_F18-19_ProA-GPR, Gamma-glutamyl phosphate reductase
(GPR), aldehyde dehydrogenase families 18 and 19.
Gamma-glutamyl phosphate reductase (GPR), a L-proline
biosynthetic pathway (PBP) enzyme that catalyzes the
NADPH dependent reduction of L-gamma-glutamyl
5-phosphate into L-glutamate 5-semialdehyde and
phosphate. The glutamate route of the PBP involves two
enzymatic steps catalyzed by gamma-glutamyl kinase (GK,
EC 2.7.2.11) and GPR (EC 1.2.1.41). These enzymes are
fused into the bifunctional enzyme, ProA or
delta(1)-pyrroline-5-carboxylate synthetase (P5CS) in
plants and animals, whereas they are separate enzymes in
bacteria and yeast. In humans, the P5CS (ALDH18A1), an
inner mitochondrial membrane enzyme, is essential to the
de novo synthesis of the amino acids proline and
arginine. Tomato (Lycopersicon esculentum) has both the
prokaryotic-like polycistronic operons encoding GK and
GPR (PRO1, ALDH19) and the full-length, bifunctional
P5CS (PRO2, ALDH18B1).
Length = 406
Score = 28.6 bits (65), Expect = 4.2
Identities = 13/33 (39%), Positives = 16/33 (48%), Gaps = 8/33 (24%)
Query: 196 RINENARDPEFRGQIGVIRHLGSGVCSVYLAEE 228
+ ENA P VI+H G G C VY+ E
Sbjct: 206 FVVENATIP-------VIKH-GDGNCHVYVDES 230
>gnl|CDD|177556 PHA03195, PHA03195, tegument protein VP11/12; Provisional.
Length = 746
Score = 28.8 bits (64), Expect = 4.4
Identities = 21/73 (28%), Positives = 30/73 (41%), Gaps = 5/73 (6%)
Query: 99 NNSYPHTPGTMYGS-----SSEHTSYSPLQASPSPSGYQSHSGVSSGSSSYLGATPSPSS 153
N +YPH S SS T Y L ++PSP+G ++ G + A+P P
Sbjct: 519 NGAYPHQRSDSSSSDNSTCSSTETQYITLPSTPSPTGEVVYTNPLLGPDEEVDASPQPVD 578
Query: 154 EAAPYGTPSPLSY 166
+ Y P Y
Sbjct: 579 PVSDYSAPKNPDY 591
>gnl|CDD|233723 TIGR02094, more_P_ylases, alpha-glucan phosphorylases. This family
consists of known phosphorylases, and homologs believed
to share the function of using inorganic phosphate to
cleave an alpha 1,4 linkage between the terminal glucose
residue and the rest of the polymer (maltodextrin,
glycogen, etc.). The name of the glucose storage polymer
substrate, and therefore the name of this enzyme,
depends on the chain lengths and branching patterns. A
number of the members of this family have been shown to
operate on small maltodextrins, as may be obtained by
utilization of exogenous sources. This family represents
a distinct clade from the related family modeled by
TIGR02093/pfam00343.
Length = 601
Score = 28.5 bits (64), Expect = 5.2
Identities = 14/33 (42%), Positives = 19/33 (57%), Gaps = 8/33 (24%)
Query: 196 RINENARDPEFRGQI--------GVIRHLGSGV 220
RI E ++ PEFRG+I + R+L SGV
Sbjct: 446 RIVEFSKRPEFRGRIVFLENYDINLARYLVSGV 478
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 28.5 bits (63), Expect = 5.2
Identities = 29/101 (28%), Positives = 43/101 (42%), Gaps = 27/101 (26%)
Query: 82 TPARSIHNDAY-PMDDSFNNSYPHTPGTMYGSSSEHTSYSPLQASPSPSGYQSHSGVSSG 140
TPA+ I DA+ P P+ PG G+ S +P+G S+
Sbjct: 539 TPAQYIQGDAWIP-----GKGVPYIPGLFAGNPGSTNS--------TPTG-------SAA 578
Query: 141 SSSYLGATPSPSSEAAPYGTP------SPLSYSSRQASPFT 175
SS+ ++ SPS+ AP +P SP + S+ SP T
Sbjct: 579 SSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPST 619
>gnl|CDD|183063 PRK11264, PRK11264, putative amino-acid ABC transporter ATP-binding
protein YecC; Provisional.
Length = 250
Score = 28.2 bits (63), Expect = 5.4
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 10/40 (25%)
Query: 201 ARDPEFRGQI-GVIRHLGSGVCSVYLAEEERTLSIEAHEL 239
A DPE G++ IR L A+E+RT+ I HE+
Sbjct: 174 ALDPELVGEVLNTIRQL---------AQEKRTMVIVTHEM 204
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 28.1 bits (62), Expect = 6.3
Identities = 28/142 (19%), Positives = 37/142 (26%), Gaps = 36/142 (25%)
Query: 16 PSGGRGGRGGGRGGHHGGNVRRDRELIGKTIKITGGPYKVGSRTPGGGATPLHDGSRTPG 75
G G GG TI T G P G TP S P
Sbjct: 380 APSGSAAEGPASGGA-------------ATIP-TPGTQGPQGTAPAAGMTP---SSAAPA 422
Query: 76 PWDPSSTPARSIHNDAYPMDDSFNNSYPHTPGTMYGSSSEHTSYSPLQASPSPSGYQSHS 135
PS+ P+ + P DD+ P P P + +P
Sbjct: 423 TPAPSAAPSPRV-----PWDDA-----PPAPPRSGI---------PPRPAPRMPEASPVP 463
Query: 136 GVSSGSSSYLGATPSPSSEAAP 157
G +S A P+ +
Sbjct: 464 GAPDSVASASDAPPTLGDPSDT 485
>gnl|CDD|178259 PLN02653, PLN02653, GDP-mannose 4,6-dehydratase.
Length = 340
Score = 27.8 bits (62), Expect = 6.5
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 139 SGSSSYLGATPSPSSEAAPYGTPSPLSYSSRQA 171
+GSS G+TP P SE P+ SP + + A
Sbjct: 137 AGSSEMYGSTPPPQSETTPFHPRSPYAVAKVAA 169
>gnl|CDD|226961 COG4598, HisP, ABC-type histidine transport system, ATPase
component [Amino acid transport and metabolism].
Length = 256
Score = 27.8 bits (62), Expect = 7.4
Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 10/41 (24%)
Query: 201 ARDPEFRGQI-GVIRHLGSGVCSVYLAEEERTLSIEAHELE 240
A DPE G++ V++ L AEE RT+ + HE+
Sbjct: 182 ALDPELVGEVLKVMQDL---------AEEGRTMVVVTHEMG 213
>gnl|CDD|227493 COG5164, SPT5, Transcription elongation factor [Transcription].
Length = 607
Score = 28.1 bits (62), Expect = 7.6
Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 2/67 (2%)
Query: 205 EFRGQIGVIRHLGSGVCSVYLAEEERTLSIEAHELEPVMPQPNDKVKVIVGEHKECTGVL 264
E RG++GV+ + ++ E+R L L D VKVI G G +
Sbjct: 103 EQRGRVGVVPRQKEMIETMTYHFEQR-LCGPWGRLRKGF-YKGDLVKVIEGGEMVDIGTV 160
Query: 265 LSVDNGE 271
+D +
Sbjct: 161 PRIDGEK 167
>gnl|CDD|224070 COG1148, HdrA, Heterodisulfide reductase, subunit A and related
polyferredoxins [Energy production and conversion].
Length = 622
Score = 27.8 bits (62), Expect = 7.7
Identities = 7/32 (21%), Positives = 18/32 (56%)
Query: 260 CTGVLLSVDNGEGVVKLTEEDDVKMIDVKFLC 291
C G+ ++V + E + + + + V + D ++C
Sbjct: 8 CCGLNIAVVDIEALKEFAKLEGVVVADYPYMC 39
>gnl|CDD|72144 pfam08720, Hema_stalk, Influenza C hemagglutinin stalk. This
domain corresponds to the stalk segment of hemagglutinin
in influenza C virus. It forms a coiled coil structure.
Length = 175
Score = 27.3 bits (60), Expect = 8.1
Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 181 DQYPTPEWHTTDIEVRINENARDPEFRGQIGVIRHLGSGVCSVYLAEEERTLSIE 235
D+ E D+ + I ENAR G++G+IR L G S+ L E L+ E
Sbjct: 68 DRISHDEQAIRDLTLEI-ENARSEALLGELGIIRALLVGNISIGLQESLWELASE 121
>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
Length = 346
Score = 27.4 bits (61), Expect = 8.8
Identities = 15/55 (27%), Positives = 18/55 (32%), Gaps = 2/55 (3%)
Query: 18 GGRGGRGGGRGGHHGGNVRRDRELIGKTIKITGGPYKVGSRTPGGGA--TPLHDG 70
GG G R R DR + K P++ SR PG H G
Sbjct: 35 GGGRPSGKSRADGGRRPARDDRNSQSRDRKWEDSPWRTVSRAPGDETPEKADHGG 89
>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
Length = 214
Score = 27.3 bits (60), Expect = 9.4
Identities = 16/42 (38%), Positives = 17/42 (40%), Gaps = 10/42 (23%)
Query: 49 TGGPYKVGSRT---------PGGGATPLHDGSRTPGPWDPSS 81
G GSR PGGGA R PGP DP+S
Sbjct: 162 PGDLSGPGSRPTSGPVAGGLPGGGAPAPPTPPR-PGPTDPAS 202
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.132 0.402
Gapped
Lambda K H
0.267 0.0630 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,476,133
Number of extensions: 1487971
Number of successful extensions: 1771
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1564
Number of HSP's successfully gapped: 150
Length of query: 296
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 200
Effective length of database: 6,679,618
Effective search space: 1335923600
Effective search space used: 1335923600
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 59 (26.7 bits)