RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2248
(102 letters)
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown
to be a U1 snRNA associated protein with a role in
splice site recognition. The family also contains human
and mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 70.1 bits (172), Expect = 5e-16
Identities = 31/92 (33%), Positives = 53/92 (57%), Gaps = 2/92 (2%)
Query: 10 QQNENAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQNDNNH 69
+ AE++ L E+I L++EAE +G EG V++A LMK ++LK +++ L ++
Sbjct: 117 AADSKAEELAELDEEIGKLLAEAEALGEEGKVDEAMKLMKEVEELKAKKKELEDSDEVR- 175
Query: 70 WSQTVELAAAQEKQMEVCTVCGAFLIVGDAQS 101
+ A AQ++++ VC VCGA+L D
Sbjct: 176 -NAAPSSAQAQQQKLRVCEVCGAYLSRLDNDR 206
>gnl|CDD|227527 COG5200, LUC7, U1 snRNP component, mediates U1 snRNP association
with cap-binding complex [RNA processing and
modification].
Length = 258
Score = 37.3 bits (86), Expect = 4e-04
Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 4/86 (4%)
Query: 13 ENAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQNDNNHWSQ 72
E E + E I L E + + VE+A ++L+EER+ +++ +S
Sbjct: 118 EKEELFNRVDESIGELGMEGDALAKRKLVERACSAFNELERLREERKEIKEA----VYSM 173
Query: 73 TVELAAAQEKQMEVCTVCGAFLIVGD 98
++++VC +CGA+L D
Sbjct: 174 VENNGQGTHQKLQVCGICGAYLSRLD 199
>gnl|CDD|225800 COG3261, HycE, Ni,Fe-hydrogenase III large subunit [Energy
production and conversion].
Length = 382
Score = 28.8 bits (65), Expect = 0.40
Identities = 11/49 (22%), Positives = 22/49 (44%), Gaps = 5/49 (10%)
Query: 14 NAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLR 62
+ L+E++ + S A V +VE A G+ ++ E + +R
Sbjct: 57 PYNKALFLAERVCGICSFAHAVAFVLAVEDALGI-----EVPERAQYIR 100
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.5 bits (64), Expect = 0.52
Identities = 21/100 (21%), Positives = 38/100 (38%), Gaps = 19/100 (19%)
Query: 6 LSQSQQNENAEQITLLSEKINNLVSE---AERVGMEGSV--EKAQGLMKLCDQLKEER-- 58
L QN +E L E + E A + M+ +K + L++LCD+L++E
Sbjct: 470 LGSQGQNSTSENFKPLLEALLRFRDEVRDAAKAEMKLISLDKKKKQLLQLCDKLRDEWLP 529
Query: 59 ------------EMLRKQNDNNHWSQTVELAAAQEKQMEV 86
+ K +D + E A ++Q +
Sbjct: 530 NLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRL 569
>gnl|CDD|218368 pfam04992, RNA_pol_Rpb1_6, RNA polymerase Rpb1, domain 6. RNA
polymerases catalyze the DNA dependent polymerisation
of RNA. Prokaryotes contain a single RNA polymerase
compared to three in eukaryotes (not including
mitochondrial. and chloroplast polymerases). This
domain, domain 6, represents a mobile module of the RNA
polymerase. Domain 6 forms part of the shelf module.
This family appears to be specific to the largest
subunit of RNA polymerase II.
Length = 187
Score = 26.7 bits (60), Expect = 1.7
Identities = 11/47 (23%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
Query: 27 NLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQ---NDNNHW 70
++ E + G E + L + +QL E+R +LR++ + ++ W
Sbjct: 39 GVLDEDVVKELLGDAEVQKLLDEEYEQLLEDRRLLREEIFPDGDSKW 85
>gnl|CDD|204807 pfam12010, DUF3502, Domain of unknown function (DUF3502). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria. This domain is about 140
amino acids in length. This domain is found associated
with pfam01547.
Length = 129
Score = 26.5 bits (59), Expect = 2.0
Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 5/50 (10%)
Query: 23 EKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEE-----REMLRKQNDN 67
I+N++SE E GSV+ + L KL +LK ++ ++KQ D
Sbjct: 74 AAISNVMSEFEPALYTGSVDPEEYLPKLNQKLKAAGIDKVQKEMQKQLDE 123
>gnl|CDD|237077 PRK12364, PRK12364, ribonucleotide-diphosphate reductase subunit
alpha; Provisional.
Length = 842
Score = 26.3 bits (58), Expect = 3.1
Identities = 10/36 (27%), Positives = 16/36 (44%)
Query: 16 EQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLC 51
E I L ++ + E G SVE+ Q L++
Sbjct: 36 EIIYSLVAEVERFIKEKYPNGHNVSVEEIQDLVEKT 71
>gnl|CDD|217229 pfam02782, FGGY_C, FGGY family of carbohydrate kinases,
C-terminal domain. This domain adopts a ribonuclease
H-like fold and is structurally related to the
N-terminal domain.
Length = 193
Score = 26.1 bits (58), Expect = 3.2
Identities = 16/67 (23%), Positives = 21/67 (31%), Gaps = 8/67 (11%)
Query: 32 AERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQNDNNHWSQTVELAAAQEKQMEVCTVCG 91
G+EG A L+ Q RE LR + + LAA G
Sbjct: 32 EGGWGLEGGQSAAGSLLGWLLQTLRAREELRDAGLVESLALLLALAAD--------APAG 83
Query: 92 AFLIVGD 98
L + D
Sbjct: 84 GLLFLPD 90
>gnl|CDD|225821 COG3283, TyrR, Transcriptional regulator of aromatic amino acids
metabolism [Transcription / Amino acid transport and
metabolism].
Length = 511
Score = 26.2 bits (58), Expect = 3.9
Identities = 9/35 (25%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 5 RLSQSQQNENA--EQITLLSEKINNLVSEAERVGM 37
+L + + EQI +S K+ ++V +A+++ M
Sbjct: 191 QLQNVAAQDVSGFEQIVAVSPKMKHVVEQAQKLAM 225
>gnl|CDD|223262 COG0184, RpsO, Ribosomal protein S15P/S13E [Translation,
ribosomal structure and biogenesis].
Length = 89
Score = 25.2 bits (56), Expect = 4.1
Identities = 8/13 (61%), Positives = 11/13 (84%)
Query: 16 EQITLLSEKINNL 28
Q+ LL+E+INNL
Sbjct: 27 VQLALLTERINNL 39
>gnl|CDD|217864 pfam04053, Coatomer_WDAD, Coatomer WD associated region. This
region is composed of WD40 repeats.
Length = 429
Score = 25.6 bits (57), Expect = 4.7
Identities = 6/30 (20%), Positives = 14/30 (46%)
Query: 51 CDQLKEEREMLRKQNDNNHWSQTVELAAAQ 80
L E+ ++ +D + W + + A +Q
Sbjct: 317 LGNLDVALEIAKELDDEHKWKRLGDAALSQ 346
>gnl|CDD|219308 pfam07146, DUF1389, Protein of unknown function (DUF1389). This
family consists of several hypothetical bacterial
proteins which seem to be specific to Chlamydia
pneumoniae. Members of this family are typically around
400 residues in length. The function of this family is
unknown.
Length = 311
Score = 25.4 bits (56), Expect = 5.6
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 55 KEEREMLRKQNDNNHWSQTVELAAAQEKQMEVC 87
+EE +ML NN W Q +L A + ++ +
Sbjct: 180 EEEYQMLLNHARNNTWDQNQDLVEALKDRLLIY 212
>gnl|CDD|180170 PRK05626, rpsO, 30S ribosomal protein S15; Reviewed.
Length = 89
Score = 24.7 bits (55), Expect = 6.8
Identities = 7/12 (58%), Positives = 11/12 (91%)
Query: 17 QITLLSEKINNL 28
Q+ LL+E+IN+L
Sbjct: 28 QVALLTERINHL 39
>gnl|CDD|236403 PRK09192, PRK09192, acyl-CoA synthetase; Validated.
Length = 579
Score = 25.4 bits (56), Expect = 6.8
Identities = 11/40 (27%), Positives = 15/40 (37%), Gaps = 7/40 (17%)
Query: 65 NDNNHWSQTVELAAAQE---KQMEVCTVCGAFLIVGDAQS 101
N N W Q +E A QE + + AF I +
Sbjct: 467 NGRNIWPQDIEWIAEQEPELRSGDAA----AFSIAQENGE 502
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 25.0 bits (55), Expect = 7.2
Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 6/44 (13%)
Query: 42 EKAQGLMKLCDQLKEEREMLRKQNDNNHWSQTVELAAAQEKQME 85
EK + L K+ + ++E ++ D +L QEKQME
Sbjct: 72 EKMKELQKMMKEFQKEFREAQESGD------MKKLKKLQEKQME 109
>gnl|CDD|139605 PRK13484, PRK13484, putative iron-regulated outer membrane
virulence protein; Provisional.
Length = 682
Score = 25.4 bits (55), Expect = 7.3
Identities = 10/42 (23%), Positives = 14/42 (33%), Gaps = 9/42 (21%)
Query: 54 LKEEREMLRKQNDNNHWSQTV---------ELAAAQEKQMEV 86
LK+ N +HW E+A QEK +
Sbjct: 271 LKQPNSKRETHNSRSHWVAAWNAQGEILHPEIAVYQEKVIRE 312
>gnl|CDD|238213 cd00353, Ribosomal_S15p_S13e, Ribosomal protein S15
(prokaryotic)_S13 (eukaryotic) binds the central domain
of 16S rRNA and is required for assembly of the small
ribosomal subunit and for intersubunit association,
thus representing a key element in the assembly of the
whole ribosome. S15 also plays an important
autoregulatory role by binding and preventing its own
mRNA from being translated. S15 has a predominantly
alpha-helical fold that is highly structured except for
the N-terminal alpha helix.
Length = 80
Score = 24.4 bits (54), Expect = 7.9
Identities = 7/13 (53%), Positives = 10/13 (76%)
Query: 16 EQITLLSEKINNL 28
Q+ LL+E+I NL
Sbjct: 21 VQLALLTERIVNL 33
>gnl|CDD|107204 cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine
beta-synthase (CBS) and Cysteine synthase. CBS is a
unique heme-containing enzyme that catalyzes a pyridoxal
5'-phosphate (PLP)-dependent condensation of serine and
homocysteine to give cystathionine. Deficiency of CBS
leads to homocystinuria, an inherited disease of sulfur
metabolism characterized by increased levels of the
toxic metabolite homocysteine. Cysteine synthase on the
other hand catalyzes the last step of cysteine
biosynthesis. This subgroup also includes an
O-Phosphoserine sulfhydrylase found in hyperthermophilic
archaea which produces L-cysteine from sulfide and the
more thermostable O-phospho-L-serine.
Length = 291
Score = 25.2 bits (56), Expect = 7.9
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 28 LVSEAERVGMEGSVEKAQGLMK 49
L EAE GM+G++ KA+ L
Sbjct: 104 LTPEAEADGMKGAIAKARELAA 125
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 25.0 bits (55), Expect = 8.0
Identities = 13/59 (22%), Positives = 29/59 (49%), Gaps = 2/59 (3%)
Query: 8 QSQQNENAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQND 66
Q + E ++++ L ++ L E R ++ A L + +L+EE L+++N+
Sbjct: 93 QQENQELKQELSTLEAELERLQKELAR--IKQLSANAIELDEENRELREELAELKQENE 149
>gnl|CDD|182249 PRK10117, PRK10117, trehalose-6-phosphate synthase; Provisional.
Length = 474
Score = 25.1 bits (55), Expect = 8.9
Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 2/28 (7%)
Query: 48 MKLCDQLKEEREMLR--KQNDNNHWSQT 73
M L +++ EML +ND NHW +
Sbjct: 419 MPLAERISRHAEMLDVIVKNDINHWQEC 446
>gnl|CDD|213942 TIGR04329, cas1_PREFRAN, CRISPR-associated endonuclease Cas1,
subtype PREFRAN. Members of this family are the Cas1
endonuclease of a novel CRISPR subtype, PREFRAN, found
in Prevotella bryantii B14, Prevotella disiens
FB035-09AN, Francisella tularensis subsp. novicida,
Francisella philomiragia, Butyrivibrio proteoclasticus
B316, Helcococcus kunzii ATCC 51366, etc.
Length = 317
Score = 25.1 bits (55), Expect = 9.1
Identities = 9/40 (22%), Positives = 20/40 (50%)
Query: 4 SRLSQSQQNENAEQITLLSEKINNLVSEAERVGMEGSVEK 43
+R+ E ++T + I+ + + +G+EG+V K
Sbjct: 126 TRMKDELLTEAKFKLTRYIDAIDAIEDDKRLMGIEGNVSK 165
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.124 0.332
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,673,148
Number of extensions: 365361
Number of successful extensions: 463
Number of sequences better than 10.0: 1
Number of HSP's gapped: 460
Number of HSP's successfully gapped: 63
Length of query: 102
Length of database: 10,937,602
Length adjustment: 68
Effective length of query: 34
Effective length of database: 7,921,530
Effective search space: 269332020
Effective search space used: 269332020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 53 (24.2 bits)