RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2248
         (102 letters)



>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus.  This family contains the N
           terminal region of several LUC7 protein homologues and
           only contains eukaryotic proteins. LUC7 has been shown
           to be a U1 snRNA associated protein with a role in
           splice site recognition. The family also contains human
           and mouse LUC7 like (LUC7L) proteins and human cisplatin
           resistance-associated overexpressed protein (CROP).
          Length = 252

 Score = 70.1 bits (172), Expect = 5e-16
 Identities = 31/92 (33%), Positives = 53/92 (57%), Gaps = 2/92 (2%)

Query: 10  QQNENAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQNDNNH 69
             +  AE++  L E+I  L++EAE +G EG V++A  LMK  ++LK +++ L   ++   
Sbjct: 117 AADSKAEELAELDEEIGKLLAEAEALGEEGKVDEAMKLMKEVEELKAKKKELEDSDEVR- 175

Query: 70  WSQTVELAAAQEKQMEVCTVCGAFLIVGDAQS 101
            +     A AQ++++ VC VCGA+L   D   
Sbjct: 176 -NAAPSSAQAQQQKLRVCEVCGAYLSRLDNDR 206


>gnl|CDD|227527 COG5200, LUC7, U1 snRNP component, mediates U1 snRNP association
           with cap-binding complex [RNA processing and
           modification].
          Length = 258

 Score = 37.3 bits (86), Expect = 4e-04
 Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 4/86 (4%)

Query: 13  ENAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQNDNNHWSQ 72
           E  E    + E I  L  E + +     VE+A       ++L+EER+ +++      +S 
Sbjct: 118 EKEELFNRVDESIGELGMEGDALAKRKLVERACSAFNELERLREERKEIKEA----VYSM 173

Query: 73  TVELAAAQEKQMEVCTVCGAFLIVGD 98
                    ++++VC +CGA+L   D
Sbjct: 174 VENNGQGTHQKLQVCGICGAYLSRLD 199


>gnl|CDD|225800 COG3261, HycE, Ni,Fe-hydrogenase III large subunit [Energy
           production and conversion].
          Length = 382

 Score = 28.8 bits (65), Expect = 0.40
 Identities = 11/49 (22%), Positives = 22/49 (44%), Gaps = 5/49 (10%)

Query: 14  NAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLR 62
              +   L+E++  + S A  V    +VE A G+     ++ E  + +R
Sbjct: 57  PYNKALFLAERVCGICSFAHAVAFVLAVEDALGI-----EVPERAQYIR 100


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.5 bits (64), Expect = 0.52
 Identities = 21/100 (21%), Positives = 38/100 (38%), Gaps = 19/100 (19%)

Query: 6   LSQSQQNENAEQITLLSEKINNLVSE---AERVGMEGSV--EKAQGLMKLCDQLKEER-- 58
           L    QN  +E    L E +     E   A +  M+     +K + L++LCD+L++E   
Sbjct: 470 LGSQGQNSTSENFKPLLEALLRFRDEVRDAAKAEMKLISLDKKKKQLLQLCDKLRDEWLP 529

Query: 59  ------------EMLRKQNDNNHWSQTVELAAAQEKQMEV 86
                         + K +D     +  E   A ++Q  +
Sbjct: 530 NLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRL 569


>gnl|CDD|218368 pfam04992, RNA_pol_Rpb1_6, RNA polymerase Rpb1, domain 6.  RNA
          polymerases catalyze the DNA dependent polymerisation
          of RNA. Prokaryotes contain a single RNA polymerase
          compared to three in eukaryotes (not including
          mitochondrial. and chloroplast polymerases). This
          domain, domain 6, represents a mobile module of the RNA
          polymerase. Domain 6 forms part of the shelf module.
          This family appears to be specific to the largest
          subunit of RNA polymerase II.
          Length = 187

 Score = 26.7 bits (60), Expect = 1.7
 Identities = 11/47 (23%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 27 NLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQ---NDNNHW 70
           ++ E     + G  E  + L +  +QL E+R +LR++   + ++ W
Sbjct: 39 GVLDEDVVKELLGDAEVQKLLDEEYEQLLEDRRLLREEIFPDGDSKW 85


>gnl|CDD|204807 pfam12010, DUF3502, Domain of unknown function (DUF3502).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria. This domain is about 140
           amino acids in length. This domain is found associated
           with pfam01547.
          Length = 129

 Score = 26.5 bits (59), Expect = 2.0
 Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 5/50 (10%)

Query: 23  EKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEE-----REMLRKQNDN 67
             I+N++SE E     GSV+  + L KL  +LK       ++ ++KQ D 
Sbjct: 74  AAISNVMSEFEPALYTGSVDPEEYLPKLNQKLKAAGIDKVQKEMQKQLDE 123


>gnl|CDD|237077 PRK12364, PRK12364, ribonucleotide-diphosphate reductase subunit
          alpha; Provisional.
          Length = 842

 Score = 26.3 bits (58), Expect = 3.1
 Identities = 10/36 (27%), Positives = 16/36 (44%)

Query: 16 EQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLC 51
          E I  L  ++   + E    G   SVE+ Q L++  
Sbjct: 36 EIIYSLVAEVERFIKEKYPNGHNVSVEEIQDLVEKT 71


>gnl|CDD|217229 pfam02782, FGGY_C, FGGY family of carbohydrate kinases,
          C-terminal domain.  This domain adopts a ribonuclease
          H-like fold and is structurally related to the
          N-terminal domain.
          Length = 193

 Score = 26.1 bits (58), Expect = 3.2
 Identities = 16/67 (23%), Positives = 21/67 (31%), Gaps = 8/67 (11%)

Query: 32 AERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQNDNNHWSQTVELAAAQEKQMEVCTVCG 91
              G+EG    A  L+    Q    RE LR        +  + LAA            G
Sbjct: 32 EGGWGLEGGQSAAGSLLGWLLQTLRAREELRDAGLVESLALLLALAAD--------APAG 83

Query: 92 AFLIVGD 98
            L + D
Sbjct: 84 GLLFLPD 90


>gnl|CDD|225821 COG3283, TyrR, Transcriptional regulator of aromatic amino acids
           metabolism [Transcription / Amino acid transport and
           metabolism].
          Length = 511

 Score = 26.2 bits (58), Expect = 3.9
 Identities = 9/35 (25%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 5   RLSQSQQNENA--EQITLLSEKINNLVSEAERVGM 37
           +L      + +  EQI  +S K+ ++V +A+++ M
Sbjct: 191 QLQNVAAQDVSGFEQIVAVSPKMKHVVEQAQKLAM 225


>gnl|CDD|223262 COG0184, RpsO, Ribosomal protein S15P/S13E [Translation,
          ribosomal structure and biogenesis].
          Length = 89

 Score = 25.2 bits (56), Expect = 4.1
 Identities = 8/13 (61%), Positives = 11/13 (84%)

Query: 16 EQITLLSEKINNL 28
           Q+ LL+E+INNL
Sbjct: 27 VQLALLTERINNL 39


>gnl|CDD|217864 pfam04053, Coatomer_WDAD, Coatomer WD associated region.  This
           region is composed of WD40 repeats.
          Length = 429

 Score = 25.6 bits (57), Expect = 4.7
 Identities = 6/30 (20%), Positives = 14/30 (46%)

Query: 51  CDQLKEEREMLRKQNDNNHWSQTVELAAAQ 80
              L    E+ ++ +D + W +  + A +Q
Sbjct: 317 LGNLDVALEIAKELDDEHKWKRLGDAALSQ 346


>gnl|CDD|219308 pfam07146, DUF1389, Protein of unknown function (DUF1389).  This
           family consists of several hypothetical bacterial
           proteins which seem to be specific to Chlamydia
           pneumoniae. Members of this family are typically around
           400 residues in length. The function of this family is
           unknown.
          Length = 311

 Score = 25.4 bits (56), Expect = 5.6
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 55  KEEREMLRKQNDNNHWSQTVELAAAQEKQMEVC 87
           +EE +ML     NN W Q  +L  A + ++ + 
Sbjct: 180 EEEYQMLLNHARNNTWDQNQDLVEALKDRLLIY 212


>gnl|CDD|180170 PRK05626, rpsO, 30S ribosomal protein S15; Reviewed.
          Length = 89

 Score = 24.7 bits (55), Expect = 6.8
 Identities = 7/12 (58%), Positives = 11/12 (91%)

Query: 17 QITLLSEKINNL 28
          Q+ LL+E+IN+L
Sbjct: 28 QVALLTERINHL 39


>gnl|CDD|236403 PRK09192, PRK09192, acyl-CoA synthetase; Validated.
          Length = 579

 Score = 25.4 bits (56), Expect = 6.8
 Identities = 11/40 (27%), Positives = 15/40 (37%), Gaps = 7/40 (17%)

Query: 65  NDNNHWSQTVELAAAQE---KQMEVCTVCGAFLIVGDAQS 101
           N  N W Q +E  A QE   +  +      AF I  +   
Sbjct: 467 NGRNIWPQDIEWIAEQEPELRSGDAA----AFSIAQENGE 502


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 25.0 bits (55), Expect = 7.2
 Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 6/44 (13%)

Query: 42  EKAQGLMKLCDQLKEEREMLRKQNDNNHWSQTVELAAAQEKQME 85
           EK + L K+  + ++E    ++  D        +L   QEKQME
Sbjct: 72  EKMKELQKMMKEFQKEFREAQESGD------MKKLKKLQEKQME 109


>gnl|CDD|139605 PRK13484, PRK13484, putative iron-regulated outer membrane
           virulence protein; Provisional.
          Length = 682

 Score = 25.4 bits (55), Expect = 7.3
 Identities = 10/42 (23%), Positives = 14/42 (33%), Gaps = 9/42 (21%)

Query: 54  LKEEREMLRKQNDNNHWSQTV---------ELAAAQEKQMEV 86
           LK+        N  +HW             E+A  QEK +  
Sbjct: 271 LKQPNSKRETHNSRSHWVAAWNAQGEILHPEIAVYQEKVIRE 312


>gnl|CDD|238213 cd00353, Ribosomal_S15p_S13e, Ribosomal protein S15
          (prokaryotic)_S13 (eukaryotic) binds the central domain
          of 16S rRNA and is required for assembly of the small
          ribosomal subunit and for intersubunit association,
          thus representing a key element in the assembly of the
          whole ribosome. S15 also plays an important
          autoregulatory role by binding and preventing its own
          mRNA from being translated. S15 has a predominantly
          alpha-helical fold that is highly structured except for
          the N-terminal alpha helix.
          Length = 80

 Score = 24.4 bits (54), Expect = 7.9
 Identities = 7/13 (53%), Positives = 10/13 (76%)

Query: 16 EQITLLSEKINNL 28
           Q+ LL+E+I NL
Sbjct: 21 VQLALLTERIVNL 33


>gnl|CDD|107204 cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine
           beta-synthase (CBS) and Cysteine synthase. CBS is a
           unique heme-containing enzyme that catalyzes a pyridoxal
           5'-phosphate (PLP)-dependent condensation of serine and
           homocysteine to give cystathionine. Deficiency of CBS
           leads to homocystinuria, an inherited disease of sulfur
           metabolism characterized by increased levels of the
           toxic metabolite homocysteine. Cysteine synthase on the
           other hand catalyzes the last step of cysteine
           biosynthesis.  This subgroup also includes an
           O-Phosphoserine sulfhydrylase found in hyperthermophilic
           archaea which produces L-cysteine from sulfide and the
           more thermostable O-phospho-L-serine.
          Length = 291

 Score = 25.2 bits (56), Expect = 7.9
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 28  LVSEAERVGMEGSVEKAQGLMK 49
           L  EAE  GM+G++ KA+ L  
Sbjct: 104 LTPEAEADGMKGAIAKARELAA 125


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 25.0 bits (55), Expect = 8.0
 Identities = 13/59 (22%), Positives = 29/59 (49%), Gaps = 2/59 (3%)

Query: 8   QSQQNENAEQITLLSEKINNLVSEAERVGMEGSVEKAQGLMKLCDQLKEEREMLRKQND 66
           Q +  E  ++++ L  ++  L  E  R  ++     A  L +   +L+EE   L+++N+
Sbjct: 93  QQENQELKQELSTLEAELERLQKELAR--IKQLSANAIELDEENRELREELAELKQENE 149


>gnl|CDD|182249 PRK10117, PRK10117, trehalose-6-phosphate synthase; Provisional.
          Length = 474

 Score = 25.1 bits (55), Expect = 8.9
 Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 2/28 (7%)

Query: 48  MKLCDQLKEEREMLR--KQNDNNHWSQT 73
           M L +++    EML    +ND NHW + 
Sbjct: 419 MPLAERISRHAEMLDVIVKNDINHWQEC 446


>gnl|CDD|213942 TIGR04329, cas1_PREFRAN, CRISPR-associated endonuclease Cas1,
           subtype PREFRAN.  Members of this family are the Cas1
           endonuclease of a novel CRISPR subtype, PREFRAN, found
           in Prevotella bryantii B14, Prevotella disiens
           FB035-09AN, Francisella tularensis subsp. novicida,
           Francisella philomiragia, Butyrivibrio proteoclasticus
           B316, Helcococcus kunzii ATCC 51366, etc.
          Length = 317

 Score = 25.1 bits (55), Expect = 9.1
 Identities = 9/40 (22%), Positives = 20/40 (50%)

Query: 4   SRLSQSQQNENAEQITLLSEKINNLVSEAERVGMEGSVEK 43
           +R+      E   ++T   + I+ +  +   +G+EG+V K
Sbjct: 126 TRMKDELLTEAKFKLTRYIDAIDAIEDDKRLMGIEGNVSK 165


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.124    0.332 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,673,148
Number of extensions: 365361
Number of successful extensions: 463
Number of sequences better than 10.0: 1
Number of HSP's gapped: 460
Number of HSP's successfully gapped: 63
Length of query: 102
Length of database: 10,937,602
Length adjustment: 68
Effective length of query: 34
Effective length of database: 7,921,530
Effective search space: 269332020
Effective search space used: 269332020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 53 (24.2 bits)