RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16107
(256 letters)
>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper.
Length = 54
Score = 64.6 bits (158), Expect = 5e-14
Identities = 28/54 (51%), Positives = 38/54 (70%)
Query: 170 NTDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEE 223
DEY+ RR RNN A R+SREK K R E EE+VK L +EN +L++++E L +E
Sbjct: 1 KDDEYRDRRRRNNEAARRSREKKKQREEELEERVKELEEENAQLRQKVEQLEKE 54
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 62.2 bits (152), Expect = 5e-13
Identities = 30/61 (49%), Positives = 38/61 (62%)
Query: 172 DEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRSLF 231
++ +RRRERN A R+SRE+ K E E KV+ L ENERL+K IE L EL L+S
Sbjct: 4 EKRRRRRERNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSEL 63
Query: 232 T 232
Sbjct: 64 E 64
>gnl|CDD|201054 pfam00170, bZIP_1, bZIP transcription factor. The Pfam entry
includes the basic region and the leucine zipper region.
Length = 64
Score = 47.7 bits (114), Expect = 8e-08
Identities = 24/55 (43%), Positives = 33/55 (60%)
Query: 176 RRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRSL 230
RRR++N A R+SR + K E EEKVK L EN+ L+ +E L +E L+S
Sbjct: 8 RRRQKNREAARRSRLRKKAYIEELEEKVKELEAENKTLRSELERLKKECAKLKSE 62
>gnl|CDD|213400 cd12193, B_zip1, basic leucine zipper DNA-binding and
multimerization region of GCN4 and related proteins.
Basic leucine zipper (bZIP) transcription factors act in
networks of homo- and hetero-dimers in the regulation in
a diverse set of cellular pathways. Classical leucine
zippers have alpha helices with leucine residues 7 amino
acids apart, which stabilize dimerization with a
parallel leucine zipper domain. Dimerization creates a
pair of basic regions that bind DNA and undergo
conformational change. GCN4 was identified in
Saccharomyces cerevisiae from mutations in a deficiency
in activation with the general amino acid control
pathway. GCN4 encodes a trans-activator of amino acid
biosynthetic genes containing 2 acidic activation
domains and a C-terminal bZIP domain, comprised of a
basic alpha-helical DNA-binding region and a coiled-coil
dimerization region.
Length = 54
Score = 43.1 bits (102), Expect = 3e-06
Identities = 18/49 (36%), Positives = 29/49 (59%)
Query: 176 RRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEEL 224
R+R RN A R+SR + R ++ E+KV+ L EN L+ ++ L + L
Sbjct: 5 RKRARNREAARRSRARKMERMKQLEDKVEELTSENSDLEDEVDRLKKAL 53
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 34.8 bits (81), Expect = 0.035
Identities = 15/48 (31%), Positives = 28/48 (58%)
Query: 182 NIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRS 229
A+ + + + + R+ E K++ L ENE+L++ +E L EL L+S
Sbjct: 3 LEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKS 50
Score = 29.8 bits (68), Expect = 1.3
Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 172 DEYKRRRERNNIAVRKSRE-KAKIRSRETEEKVKLLVKENERLQKRIELL 220
+ + ERN+ + R+ + K+R E E KL +E ERL+ +E L
Sbjct: 1 ERLEALEERNSELEEQIRQLELKLRDLE-AENEKLE-RELERLKSELEKL 48
>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family. Many
proteins may score above the trusted cutoff because an
internal.
Length = 364
Score = 34.0 bits (78), Expect = 0.059
Identities = 18/49 (36%), Positives = 28/49 (57%)
Query: 193 KIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGVLPEHL 241
+R R+ E++ + L KE RL++ +E L E+ LRS VG + E L
Sbjct: 5 DVRIRKLEDEKRSLEKEKIRLERELERLRSEIERLRSPPLIVGTVLEVL 53
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 33.0 bits (76), Expect = 0.12
Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 3/54 (5%)
Query: 189 REKAKIRSRETE---EKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGVLPE 239
K K+ +E + + L E RL++ ++ L EE+ L+ VG + E
Sbjct: 27 DTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERLKEPPLIVGTVLE 80
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 33.1 bits (76), Expect = 0.13
Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 185 VRKSREKAKIRSR--ETEEKVKLLVKENERLQKRIELLSEELNVLRS 229
V S+E + + E ++K + + E L+K I L E LNVL
Sbjct: 63 VPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDE 109
>gnl|CDD|236243 PRK08327, PRK08327, acetolactate synthase catalytic subunit;
Validated.
Length = 569
Score = 31.9 bits (73), Expect = 0.33
Identities = 16/88 (18%), Positives = 24/88 (27%), Gaps = 18/88 (20%)
Query: 171 TDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRI--ELLSEELNVLR 228
+ E +R R R R + E K E + + I LS L +
Sbjct: 348 SAERRRARRR--------RAAVRELRIRQEAA-KRAEIERLKDRGPITPAYLSYCLGEVA 398
Query: 229 S----LFTNVGVLPEHLHRELNKHFDSY 252
+ T +P SY
Sbjct: 399 DEYDAIVTEYPFVPRQARLN---KPGSY 423
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 31.6 bits (72), Expect = 0.44
Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 4/87 (4%)
Query: 165 KSVDKNTDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEEL 224
KS+ + E +R+ E+ + + E + ++ L +E E+LQ R+E L EEL
Sbjct: 691 KSLKNELRSLEDLLE----ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEEL 746
Query: 225 NVLRSLFTNVGVLPEHLHRELNKHFDS 251
L + E L EL ++
Sbjct: 747 EELEEELEELQERLEELEEELESLEEA 773
Score = 28.1 bits (63), Expect = 5.2
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 177 RRERNNIAVRKSREKAKIRS-----RETEEKVKLLVKENERLQKRIELLSEELNVLRSLF 231
RE ++ R+ R + +I E EEK+ L +E E L+K +E L EEL L +
Sbjct: 813 ERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEK 872
Query: 232 TNV 234
+
Sbjct: 873 EEL 875
Score = 27.8 bits (62), Expect = 8.1
Identities = 14/66 (21%), Positives = 29/66 (43%), Gaps = 2/66 (3%)
Query: 169 KNTDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLR 228
+ R E + + + ++ E EE++ L +E E LQ+ +E +E+ L+
Sbjct: 216 QELKAELRELELALLLAKLKELRKELE--ELEEELSRLEEELEELQEELEEAEKEIEELK 273
Query: 229 SLFTNV 234
S +
Sbjct: 274 SELEEL 279
Score = 27.4 bits (61), Expect = 8.9
Identities = 16/76 (21%), Positives = 27/76 (35%)
Query: 173 EYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRSLFT 232
E ++ A + EK E EE + L +E L+ + + EL L+
Sbjct: 348 ELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE 407
Query: 233 NVGVLPEHLHRELNKH 248
++ E L L
Sbjct: 408 SLEERLERLSERLEDL 423
Score = 27.4 bits (61), Expect = 9.9
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 3/62 (4%)
Query: 171 TDEYKRRRERNNIAVRKSREK---AKIRSRETEEKVKLLVKENERLQKRIELLSEELNVL 227
+E + +R+ + + E+ A+ R E +++ L + ERL++ IE L EE+ L
Sbjct: 781 IEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEEL 840
Query: 228 RS 229
Sbjct: 841 EE 842
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 31.0 bits (70), Expect = 0.48
Identities = 13/52 (25%), Positives = 27/52 (51%)
Query: 177 RRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLR 228
+R + +++ E+ K + E +++ + L+KE E L+ E + E L L
Sbjct: 125 QRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLE 176
Score = 29.1 bits (65), Expect = 2.4
Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 2/70 (2%)
Query: 185 VRKSREKAKIRSR--ETEEKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGVLPEHLH 242
+++ E ++ E +EK++ L KE E L K +E L E ++ + V L
Sbjct: 124 LQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLE 183
Query: 243 RELNKHFDSY 252
L K
Sbjct: 184 EMLKKLPGEV 193
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 31.1 bits (71), Expect = 0.51
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 206 LVKENERLQKRIELLSEELNVLRSL 230
L +ENE L+K + L L L L
Sbjct: 74 LREENEELKKELLELESRLQELEQL 98
>gnl|CDD|236268 PRK08452, PRK08452, flagellar protein FlaG; Provisional.
Length = 124
Score = 29.6 bits (67), Expect = 0.71
Identities = 17/57 (29%), Positives = 34/57 (59%), Gaps = 3/57 (5%)
Query: 181 NNIAVRKSR--EKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNV-LRSLFTNV 234
+I S+ E++ I++ + K + NE+L+K++E L+E+LN ++ L TN+
Sbjct: 17 TSINQTTSKEIERSNIQTTIDQSKKTIDGGNNEQLKKKLEELTEKLNEEMKRLDTNI 73
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.8 bits (70), Expect = 0.75
Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 5/65 (7%)
Query: 168 DKNTDEYKRRRERNNIAVRKSREKAKIRS--RETEEKVKLLVKENERLQKRIELLSEELN 225
+ ++ + ER R+ R+K + R + +++ L KE E +KR+E L +L
Sbjct: 449 KREIEKLESELER---FRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLA 505
Query: 226 VLRSL 230
LR +
Sbjct: 506 ELRKM 510
Score = 30.4 bits (69), Expect = 0.87
Identities = 20/59 (33%), Positives = 26/59 (44%), Gaps = 3/59 (5%)
Query: 172 DEYKRRRERNNIAVRKSREKAKIRSR--ETEEKVKLLVKENERLQKRIELLSEELNVLR 228
E ER I V + R K K+ EE+ L +E E L++ IE L EL R
Sbjct: 406 KEGTEEEERREITVYEKRIK-KLEETVERLEEENSELKRELEELKREIEKLESELERFR 463
>gnl|CDD|153072 cd03602, CLECT_1, C-type lectin (CTL)/C-type lectin-like (CTLD)
domain subgroup 1; a subgroup of protein domains
homologous to the carbohydrate-recognition domains
(CRDs) of the C-type lectins. CLECT_1: C-type lectin
(CTL)/C-type lectin-like (CTLD) domain subgroup 1; a
subgroup of protein domains homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. Many CTLDs are calcium-dependent carbohydrate
binding modules; other CTLDs bind protein ligands,
lipids, and inorganic surfaces including CaCO3 and ice.
Animal C-type lectins are involved in such functions
as extracellular matrix organization, endocytosis,
complement activation, pathogen recognition, and
cell-cell interactions. CTLDs may bind a variety of
carbohydrate ligands including mannose,
N-acetylglucosamine, galactose, N-acetylgalactosamine,
and fucose. CTLDs associate with each other through
several different surfaces to form dimers, trimers, or
tetramers from which ligand-binding sites project in
different orientations. In some CTLDs a loop extends
to the adjoining domain to form a loop-swapped dimer.
Length = 108
Score = 29.3 bits (66), Expect = 0.83
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 37 KRYTDLADLNTPEISLDLQNLIDDS 61
+ YTDLA + E + L NL S
Sbjct: 21 ENYTDLATVQNQEDNALLSNLSRVS 45
>gnl|CDD|217536 pfam03403, PAF-AH_p_II, Platelet-activating factor acetylhydrolase,
isoform II. Platelet-activating factor acetylhydrolase
(PAF-AH) is a subfamily of phospholipases A2,
responsible for inactivation of platelet-activating
factor through cleavage of an acetyl group. Three known
PAF-AHs are the brain heterotrimeric PAF-AH Ib, whose
catalytic beta and gamma subunits are aligned in
pfam02266, the extracellular, plasma PAF-AH (pPAF-AH),
and the intracellular PAF-AH isoform II (PAF-AH II).
This family aligns pPAF-AH and PAF-AH II, whose
similarity was previously noted.
Length = 372
Score = 30.1 bits (68), Expect = 1.2
Identities = 13/62 (20%), Positives = 30/62 (48%), Gaps = 8/62 (12%)
Query: 189 REKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGVLPEHLHRELNKH 248
+R EE+ + NE++ +R + S+ L+++ + ++G E++ L+
Sbjct: 157 PSWIYLRDLNAEEEFHI---RNEQVGQRAQECSKALSIILQI--DLGTPVENV---LDSD 208
Query: 249 FD 250
FD
Sbjct: 209 FD 210
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 30.0 bits (68), Expect = 1.4
Identities = 14/63 (22%), Positives = 26/63 (41%)
Query: 171 TDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRSL 230
E ER + E+ + + E E++ L ++ E L++ +E L EL L +
Sbjct: 307 LRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAE 366
Query: 231 FTN 233
Sbjct: 367 LEE 369
Score = 27.7 bits (62), Expect = 6.8
Identities = 20/88 (22%), Positives = 35/88 (39%), Gaps = 4/88 (4%)
Query: 165 KSVDKNTDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEEL 224
K++ + DE + N RE+ + R + E L+++IE LSE++
Sbjct: 799 KALREALDELRAELTLLNEEAANLRERLESLERRIAATER----RLEDLEEQIEELSEDI 854
Query: 225 NVLRSLFTNVGVLPEHLHRELNKHFDSY 252
L + + L E L EL +
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNER 882
>gnl|CDD|201951 pfam01749, IBB, Importin beta binding domain. This family consists
of the importin alpha (karyopherin alpha), importin beta
(karyopherin beta) binding domain. The domain mediates
formation of the importin alpha beta complex; required
for classical NLS import of proteins into the nucleus,
through the nuclear pore complex and across the nuclear
envelope. Also in the alignment is the NLS of importin
alpha which overlaps with the IBB domain.
Length = 97
Score = 28.1 bits (63), Expect = 1.4
Identities = 10/33 (30%), Positives = 21/33 (63%)
Query: 157 HHHHRKNGKSVDKNTDEYKRRRERNNIAVRKSR 189
+ RK+ K+ ++ +E +RRRE + +RK++
Sbjct: 6 PEYRRKSYKNKGRDAEEMRRRREEVGVELRKNK 38
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 29.1 bits (65), Expect = 1.6
Identities = 12/35 (34%), Positives = 15/35 (42%), Gaps = 3/35 (8%)
Query: 135 APTNFNTPL---AHSSKSSNQHHSHHHHHRKNGKS 166
AP N + P H S SS++ HS H H
Sbjct: 40 APLNVDIPNLLAGHRSHSSHRSHSSHSSHYSGAGG 74
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 29.7 bits (67), Expect = 1.7
Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 8/81 (9%)
Query: 165 KSVDKNTDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEEL 224
K DK T+EY +E + E K ET +++K ++++E L E+
Sbjct: 350 KRRDKLTEEYAELKEELEDLRAELEEVDK-EFAETRDELK-------DYREKLEKLKREI 401
Query: 225 NVLRSLFTNVGVLPEHLHREL 245
N L+ + + L EL
Sbjct: 402 NELKRELDRLQEELQRLSEEL 422
Score = 27.7 bits (62), Expect = 7.6
Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 5/63 (7%)
Query: 172 DEYKRRRERNNIAVRKSREK-----AKIRSRETEEKVKLLVKENERLQKRIELLSEELNV 226
+RE +++ R + + K+ + KE E+L++ E L E L
Sbjct: 682 RLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741
Query: 227 LRS 229
L
Sbjct: 742 LEE 744
>gnl|CDD|239561 cd03479, Rieske_RO_Alpha_PhDO_like, Rieske non-heme iron oxygenase
(RO) family, Phthalate 4,5-dioxygenase (PhDO)-like
subfamily, N-terminal Rieske domain of the oxygenase
alpha subunit; composed of the oxygenase alpha subunits
of PhDO and similar proteins including 3-chlorobenzoate
3,4-dioxygenase (CBDO), phenoxybenzoate dioxygenase
(POB-dioxygenase) and 3-nitrobenzoate oxygenase (MnbA).
ROs comprise a large class of aromatic
ring-hydroxylating dioxygenases that enable
microorganisms to tolerate and utilize aromatic
compounds for growth. The oxygenase alpha subunit
contains an N-terminal Rieske domain with an [2Fe-2S]
cluster and a C-terminal catalytic domain with a
mononuclear Fe(II) binding site. The Rieske [2Fe-2S]
cluster accepts electrons from a reductase or ferredoxin
component and transfers them to the mononuclear iron for
catalysis. PhDO and CBDO are two-component RO systems,
containing oxygenase and reductase components. PhDO
catalyzes the dihydroxylation of phthalate to form the
4,5-dihydro-cis-dihydrodiol of phthalate (DHD). CBDO,
together with CbaC dehydrogenase, converts the
environmental pollutant 3CBA to protocatechuate (PCA)
and 5-Cl-PCA, which are then metabolized by the
chromosomal PCA meta (extradiol) ring fission pathway.
POB-dioxygenase catalyzes the initial catabolic step in
the angular dioxygenation of phenoxybenzoate, converting
mono- and dichlorinated phenoxybenzoates to
protocatechuate and chlorophenols. These
phenoxybenzoates are metabolic products formed during
the degradation of pyrethroid insecticides.
Length = 144
Score = 28.4 bits (64), Expect = 2.0
Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 215 KRIELLSEELNVLRSLFTNVGVLPEH-LHR 243
R+ LL E+L R VG+L EH HR
Sbjct: 38 VRVRLLGEDLVAFRDTSGRVGLLDEHCPHR 67
>gnl|CDD|176524 cd08582, GDPD_like_2, Glycerophosphodiester phosphodiesterase
domain of uncharacterized bacterial
glycerophosphodiester phosphodiesterases. This
subfamily corresponds to the glycerophosphodiester
phosphodiesterase domain (GDPD) present in a group of
uncharacterized bacterial glycerophosphodiester
phosphodiesterase and similar proteins. They show high
sequence similarity to Escherichia coli
glycerophosphodiester phosphodiesterase, which catalyzes
the degradation of glycerophosphodiesters to produce
sn-glycerol-3-phosphate (G3P) and the corresponding
alcohols.
Length = 233
Score = 28.8 bits (65), Expect = 2.1
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 191 KAKIRSRETEEKVKLLVKENERLQKRIELLSEELNVLRSL 230
K R E EE++ L+KE+ L ++I ++S + L+ +
Sbjct: 108 KHPRRGPEAEEELLKLLKESGLLPEQIVIISFDAEALKRV 147
>gnl|CDD|149640 pfam08653, DASH_Dam1, DASH complex subunit Dam1. The DASH complex
is a ~10 subunit microtubule-binding complex that is
transferred to the kinetochore prior to mitosis. In
Saccharomyces cerevisiae DASH forms both rings and
spiral structures on microtubules in vitro. Components
of the DASH complex, including Dam1, Duo1, Spc34, Dad1
and Ask1, are essential and connect the centromere to
the plus end of spindle microtubules.
Length = 58
Score = 26.9 bits (60), Expect = 2.2
Identities = 10/40 (25%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
Query: 213 LQKRIELLSEELNVLRSLFTNVGVLPEHLHRELNKHFDSY 252
LQ + L++ + L + T++ + E L N+ F S+
Sbjct: 3 LQPQFAELADSMATLDANMTHLQFIHESLSD-FNESFASF 41
>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
Length = 329
Score = 29.1 bits (66), Expect = 2.5
Identities = 12/67 (17%), Positives = 33/67 (49%), Gaps = 11/67 (16%)
Query: 173 EYKRRRERNNIAVRKSREKAKIRSRETEE-----------KVKLLVKENERLQKRIELLS 221
EY++RR++ + ++ +E+ + E E + ++K ++ ++ ++ L
Sbjct: 12 EYEKRRDKEELELKNRKEEVYKKIPEIAEIDDEIAKLGIKLSRAILKNPDKKEETLKELK 71
Query: 222 EELNVLR 228
E++ LR
Sbjct: 72 EKITDLR 78
>gnl|CDD|221583 pfam12449, DUF3684, Protein of unknown function (DUF3684). This
domain family is found in eukaryotes, and is typically
between 1072 and 1090 amino acids in length.
Length = 1084
Score = 29.2 bits (66), Expect = 2.6
Identities = 13/42 (30%), Positives = 21/42 (50%)
Query: 199 TEEKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGVLPEH 240
E ++LVKE R ++ + L +LRSL N+ L +
Sbjct: 917 KHEIAQMLVKEPARFLSLLQSPEKYLKLLRSLAENMRTLRKD 958
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 27.2 bits (61), Expect = 2.8
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 197 RETEEKVKLLVKENERLQKRIELLSEELNVLRS 229
+ +++ L E +L+ E L E+ L+S
Sbjct: 20 YQLNQEIAALQAELAKLKAENEELEAEVKDLKS 52
>gnl|CDD|235239 PRK04172, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Provisional.
Length = 489
Score = 29.0 bits (66), Expect = 3.0
Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 14/90 (15%)
Query: 162 KNGKSVDKNTDEYKRRRERNNIAVRKSREKAKIRSRETEEKV------KLLVKENERLQK 215
+ GK V D Y+ E+ A++ E K E + KV + LV+E ER ++
Sbjct: 120 EKGK-VILKPDAYEDPEEK---ALKALAEGDKEELSEEDLKVLKELKKRKLVEEKERTER 175
Query: 216 RIELLSEELNVLRSLFTN---VGVL-PEHL 241
+EL L +L+ + L PE L
Sbjct: 176 SVELTDAGLELLKEGIELKEEITQLTPELL 205
>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
Length = 364
Score = 28.9 bits (66), Expect = 3.0
Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 12/61 (19%)
Query: 176 RRRERNNIAVRKSREKAKIRSRETEEKVKLLVK-----ENERLQKRIELLSEELNVLRSL 230
+R ER + K KAK +E + +++LL K E + + +EL EE +L+SL
Sbjct: 142 KRLER----LEK---KAKGGDKEAKAELELLEKLLEHLEEGKPARTLELTDEEEKLLKSL 194
Query: 231 F 231
Sbjct: 195 Q 195
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 28.4 bits (64), Expect = 3.1
Identities = 10/67 (14%), Positives = 26/67 (38%), Gaps = 2/67 (2%)
Query: 165 KSVDKNTDEYKRRRERNNIAVRKSREKAK--IRSRETEEKVKLLVKENERLQKRIELLSE 222
+ + + E + ++ + + K R ++ L +EN L++ + L +
Sbjct: 87 EQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQ 146
Query: 223 ELNVLRS 229
E L +
Sbjct: 147 ENEALEA 153
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 28.8 bits (65), Expect = 3.1
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 205 LLVKENERLQKRIELLSEELNVLRSLFTNVGV 236
+ E E+++K I + +E++ + LF+ GV
Sbjct: 868 FTLDEEEKIEKLIRRIKKEIDRRKKLFSEYGV 899
>gnl|CDD|214974 smart01015, Arfaptin, Arfaptin-like domain. Arfaptin interacts
with ARF1, a small GTPase involved in vesicle budding at
the Golgi complex and immature secretory granules. The
structure of arfaptin shows that upon binding to a small
GTPase, arfaptin forms an elongated, crescent-shaped
dimer of three-helix coiled-coils. The N-terminal region
of ICA69 is similar to arfaptin.
Length = 217
Score = 28.4 bits (64), Expect = 3.3
Identities = 13/46 (28%), Positives = 22/46 (47%)
Query: 197 RETEEKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGVLPEHLH 242
R T+ + L+K E+ Q+R+ LS+ N L F ++ L
Sbjct: 37 RSTQRTYEDLLKLIEKYQQRLCNLSQTENELGDFFRDLSEKDPTLK 82
>gnl|CDD|148751 pfam07323, DUF1465, Protein of unknown function (DUF1465). This
family consists of several hypothetical bacterial
proteins of around 180 residues in length. The function
of this family is unknown.
Length = 156
Score = 28.0 bits (63), Expect = 3.5
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 11/49 (22%)
Query: 189 REKAKIRSR-----------ETEEKVKLLVKENERLQKRIELLSEELNV 226
REK ++R E E ++ L+ +ERL R+ L EEL
Sbjct: 86 REKYRVRLDTPSPPDPAGWAELPEALRDLIARSERLYARVARLDEELYG 134
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 27.6 bits (62), Expect = 3.5
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 12/55 (21%)
Query: 165 KSVDKNTDEYKRRRERN----NIAVRKSREKAKIRSRETEEKVKLLVKENERLQK 215
+ ++++ K R+ N A RK+ RE +EK K L +ENER +
Sbjct: 55 QYLNEDIAWLKEERDDKTVSLNEAERKAE-------REEDEK-KRLKRENERRKA 101
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.5 bits (64), Expect = 3.8
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 177 RRERNNIAVR-KSREKAKIRSRETEEKVKLLVKENERLQKRIELLSE------ELNVLRS 229
E N I R K E+ + R E ++K+K L K E L++R EL E EL L+
Sbjct: 320 EEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKK 379
Query: 230 LFTNVGVLPEHLHREL 245
T G+ PE L +EL
Sbjct: 380 RLT--GLTPEKLEKEL 393
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 28.4 bits (64), Expect = 4.4
Identities = 11/42 (26%), Positives = 21/42 (50%), Gaps = 1/42 (2%)
Query: 200 EEKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGVLPEHL 241
E+++K L +E L+ I+ L +E+ L + N + L
Sbjct: 99 EKEIKELEEEISELENEIKELEQEIERL-EPWGNFDLDLSLL 139
>gnl|CDD|226795 COG4345, COG4345, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 181
Score = 27.9 bits (62), Expect = 4.5
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 189 REKAKIRSRETEEKVKLLVKENERLQKRIELLSEEL 224
++ +E EEK+ ++E ER++K IE L EL
Sbjct: 113 TIGIEVYPKELEEKLADAMEEVERIEKTIEELVSEL 148
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 28.4 bits (63), Expect = 4.9
Identities = 23/108 (21%), Positives = 45/108 (41%), Gaps = 16/108 (14%)
Query: 101 SSSSNSTGGPNIKEEPVESEFRHQTTPYNVISNGAPTNFNTPLAHSSKSSNQHHSHHHHH 160
S+S +S P + +E Q+T N+ S+ +P N K +N H+ +
Sbjct: 151 SNSLSSKHSPKVIKEN-------QSTHVNISSDNSPRNKEISNKQLKKQTNVTHTTCYDK 203
Query: 161 RK-------NGKSVDKNTDEYKRRRERNNIAVRKSREKAKIRSRETEE 201
+ K+ + ++ K + + NI +K R+ K R +E+
Sbjct: 204 MRRSPRNTSTIKNNTNDKNKEKNKEKDKNI--KKDRDGDKQTKRNSEK 249
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 28.4 bits (64), Expect = 5.1
Identities = 12/38 (31%), Positives = 16/38 (42%)
Query: 219 LLSEELNVLRSLFTNVGVLPEHLHRELNKHFDSYPHVQ 256
LL +E + R L GV L + L K + P V
Sbjct: 33 LLDQEGGLARRLLQKAGVNVGALRQALEKELEKLPKVS 70
>gnl|CDD|181894 PRK09480, slmA, division inhibitor protein; Provisional.
Length = 194
Score = 27.5 bits (62), Expect = 5.5
Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 5/28 (17%)
Query: 206 LVKENERLQKRIELLSEEL-----NVLR 228
L+ E+ERLQ RI L + + +LR
Sbjct: 113 LMFEDERLQGRINQLFDRIEAQLRQILR 140
>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
Length = 279
Score = 27.8 bits (62), Expect = 5.9
Identities = 9/32 (28%), Positives = 17/32 (53%)
Query: 145 HSSKSSNQHHSHHHHHRKNGKSVDKNTDEYKR 176
H + H HHHH ++G + ++ D ++R
Sbjct: 129 HDHDHEHHHDHGHHHHHEHGATAEEYQDAHER 160
Score = 27.1 bits (60), Expect = 9.1
Identities = 9/36 (25%), Positives = 13/36 (36%)
Query: 145 HSSKSSNQHHSHHHHHRKNGKSVDKNTDEYKRRRER 180
H + H HH H + +EY+ ER
Sbjct: 125 HDHDHDHDHEHHHDHGHHHHHEHGATAEEYQDAHER 160
>gnl|CDD|179382 PRK02195, PRK02195, V-type ATP synthase subunit D; Provisional.
Length = 201
Score = 27.6 bits (62), Expect = 6.0
Identities = 13/47 (27%), Positives = 24/47 (51%), Gaps = 6/47 (12%)
Query: 191 KAKIRS--RETEEKVKLLVKENERLQKRIE----LLSEELNVLRSLF 231
KA++++ R + + L +E ++L++ IE L SE L L
Sbjct: 33 KAQLQAEVRRAKAEAAELEQEYQKLRQAIEAWISLFSEPLYFDEDLI 79
>gnl|CDD|203455 pfam06456, Arfaptin, Arfaptin-like domain. Arfaptin interacts with
ARF1, a small GTPase involved in vesicle budding at the
Golgi complex and immature secretory granules. The
structure of arfaptin shows that upon binding to a small
GTPase, arfaptin forms an elongated, crescent-shaped
dimer of three-helix coiled-coils. The N-terminal region
of ICA69 is similar to arfaptin.
Length = 229
Score = 27.7 bits (62), Expect = 6.1
Identities = 11/40 (27%), Positives = 16/40 (40%)
Query: 197 RETEEKVKLLVKENERLQKRIELLSEELNVLRSLFTNVGV 236
R + LVK KR+ LS+ L F ++G
Sbjct: 43 RSIQRTYLGLVKLARNYSKRLYDLSQTQKELGDFFKDLGK 82
>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter. The ZIP family consists of
zinc transport proteins and many putative metal
transporters. The main contribution to this family is
from the Arabidopsis thaliana ZIP protein family these
proteins are responsible for zinc uptake in the plant.
Also found within this family are C. elegans proteins of
unknown function which are annotated as being similar to
human growth arrest inducible gene product, although
this protein in not found within this family.
Length = 314
Score = 27.7 bits (62), Expect = 6.3
Identities = 6/33 (18%), Positives = 12/33 (36%)
Query: 145 HSSKSSNQHHSHHHHHRKNGKSVDKNTDEYKRR 177
+ H HHH + G+S ++ +
Sbjct: 135 VHHEHGEGPHHHHHEGHEKGESDKESGLRVRDI 167
>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen. The
hepatitis delta virus (HDV) encodes a single protein,
the hepatitis delta antigen (HDAg). The central region
of this protein has been shown to bind RNA. Several
interactions are also mediated by a coiled-coil region
at the N terminus of the protein.
Length = 194
Score = 27.5 bits (61), Expect = 6.4
Identities = 15/50 (30%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 168 DKNTDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRI 217
D+ +++RR+ N + S K SRE EE+++ L +E+E ++R+
Sbjct: 95 DQERRDHRRRKALENKKKQLS-SGGKHLSREEEEELRRLTEEDEERERRV 143
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 27.7 bits (62), Expect = 6.6
Identities = 13/82 (15%), Positives = 33/82 (40%), Gaps = 7/82 (8%)
Query: 165 KSVDKNTDEYKRRR-ERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKRIELLSEE 223
+ + ++ R+ + ++ + + +L + L++++E++SEE
Sbjct: 126 ELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQL-----QALRQQLEVISEE 180
Query: 224 LNVLRSLFTNVGVLPEHLHREL 245
L R L G++ EL
Sbjct: 181 LEARRKLK-EKGLVSRLELLEL 201
>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
This family represents homologues of COX16 which has
been shown to be involved in assembly of cytochrome
oxidase. Protein in this family are typically between
106 and 134 amino acids in length.
Length = 79
Score = 25.6 bits (57), Expect = 7.7
Identities = 17/57 (29%), Positives = 22/57 (38%), Gaps = 17/57 (29%)
Query: 184 AVRKSREKAKIRSRETEEKVKLLVK--------ENERLQ---------KRIELLSEE 223
+R R K++ E EE +KLL K E RLQ R+ EE
Sbjct: 23 QLRYERRDRKVQQLEEEEALKLLKKRRKVDLEEEYYRLQGLDIDDWENVRVPRPKEE 79
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 27.7 bits (62), Expect = 8.1
Identities = 13/52 (25%), Positives = 26/52 (50%), Gaps = 8/52 (15%)
Query: 165 KSVDKNTDEYKRRRERNNIAVRKSREKAKIRSRETEEKVKLLVKENERLQKR 216
K DK E +R +E +++ ++ K + +++ K KE E+L+K
Sbjct: 546 KLDDKE--ELQREKEE-----KEALKEQKRLRKLKKQEEKK-KKELEKLEKA 589
>gnl|CDD|131391 TIGR02338, gimC_beta, prefoldin, beta subunit, archaeal.
Chaperonins are cytosolic, ATP-dependent molecular
chaperones, with a conserved toroidal architecture, that
assist in the folding of nascent and/or denatured
polypeptide chains. The group I chaperonin system
consists of GroEL and GroES, and is found (usually) in
bacteria and organelles of bacterial origin. The group
II chaperonin system, called the thermosome in Archaea
and TRiC or CCT in the Eukaryota, is structurally
similar but only distantly related. Prefoldin, also
called GimC, is a complex in Archaea and Eukaryota, that
works with group II chaperonins. Members of this protein
family are the archaeal clade of the beta class of
prefoldin subunit. Closely related, but outside the
scope of this family are the eukaryotic beta-class
prefoldin subunits, Gim-1,3,4 and 6. The alpha class
prefoldin subunits are more distantly related.
Length = 110
Score = 26.2 bits (58), Expect = 8.7
Identities = 15/65 (23%), Positives = 37/65 (56%), Gaps = 4/65 (6%)
Query: 165 KSVDKNTDEYKRRRERNNIAVRKSREKAKIRSRETEE----KVKLLVKENERLQKRIELL 220
+ +++ D+ + N+ V+ +E+A +E +E +VK L ++ ERL+++++ L
Sbjct: 41 EELERLPDDTPVYKSVGNLLVKTDKEEAIQELKEKKETLELRVKTLQRQEERLREQLKEL 100
Query: 221 SEELN 225
E++
Sbjct: 101 QEKIQ 105
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 26.8 bits (60), Expect = 9.3
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 6/72 (8%)
Query: 168 DKNTDEYKRRRERNNIAVRKSREKAKIRSRET-EEKVKL-----LVKENERLQKRIELLS 221
K + + R + + ++ + K+ +E E + L L EN+ L ++IE +
Sbjct: 3 AKRNEISELRLKLITLKHTLAKIQEKLEQKEELGEGLHLIDFEQLQIENQALNEKIEERN 62
Query: 222 EELNVLRSLFTN 233
+EL L+ +TN
Sbjct: 63 KELKRLKLKYTN 74
>gnl|CDD|219951 pfam08644, SPT16, FACT complex subunit (SPT16/CDC68). Proteins in
this family are subunits the FACT complex. The FACT
complex plays a role in transcription initiation and
promotes binding of TATA-binding protein (TBP) to a TATA
box in chromatin.
Length = 152
Score = 26.4 bits (59), Expect = 9.3
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 191 KAKIRSRETEEKVKLLVKENERLQ 214
+ +++ RETE K K V E E+L
Sbjct: 91 QKRVKKRETERKEKADVVEQEKLI 114
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 27.4 bits (61), Expect = 9.4
Identities = 20/74 (27%), Positives = 32/74 (43%), Gaps = 13/74 (17%)
Query: 165 KSVDKNTDE-YKRRRERNNIAVRKSREK---AKIRSRETEEKVKL-----LVKENERLQK 215
++K DE +K R + I V K A+IR E ++K LVK + ++
Sbjct: 133 DELEKEADELWKPRGRKPEINVALKELKELEAEIR----EVQLKTRTWKDLVKALDEAEE 188
Query: 216 RIELLSEELNVLRS 229
+ L +EL L
Sbjct: 189 ELANLRKELRQLEK 202
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.126 0.354
Gapped
Lambda K H
0.267 0.0749 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,629,978
Number of extensions: 1161581
Number of successful extensions: 2095
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2002
Number of HSP's successfully gapped: 220
Length of query: 256
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 161
Effective length of database: 6,723,972
Effective search space: 1082559492
Effective search space used: 1082559492
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.1 bits)