RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 012818
         (456 letters)



>gnl|CDD|128777 smart00501, BRIGHT, BRIGHT, ARID (A/T-rich interaction domain)
           domain.  DNA-binding domain containing a
           helix-turn-helix structure.
          Length = 93

 Score =  108 bits (271), Expect = 1e-28
 Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 1/92 (1%)

Query: 148 EEQAEFMKEIESFYRENALE-FKPPKFYGEPLNCLKLWRAVVRLGGYEVVTASKLWRQVG 206
            E+  F+  +  F  E      K P   G+PL+  +L+R V   GGY+ VT  K W+++ 
Sbjct: 1   RERVLFLDRLYKFMEERGSPLKKIPVIGGKPLDLYRLYRLVQERGGYDQVTKDKKWKEIA 60

Query: 207 ESFHPPKTCTTVSWTFRIFYEKALLEYEKHKR 238
                P T T+ + + R  YE+ LL YE+  R
Sbjct: 61  RELGIPDTSTSAASSLRKHYERYLLPYERFLR 92


>gnl|CDD|216473 pfam01388, ARID, ARID/BRIGHT DNA binding domain.  This domain is
           know as ARID for AT-Rich Interaction Domain, and also
           known as the BRIGHT domain.
          Length = 90

 Score =  106 bits (268), Expect = 2e-28
 Identities = 35/89 (39%), Positives = 46/89 (51%), Gaps = 1/89 (1%)

Query: 147 PEEQAEFMKEIESFYRENALEF-KPPKFYGEPLNCLKLWRAVVRLGGYEVVTASKLWRQV 205
             E+  F+  +  F         K P   G+PL+  KL+R V +LGG+E VT SK WR+V
Sbjct: 2   TPEKENFLDRLAKFMERRGTPLDKIPVIGGKPLDLYKLYRLVQKLGGFEKVTKSKKWREV 61

Query: 206 GESFHPPKTCTTVSWTFRIFYEKALLEYE 234
            +    P T T+ S   R  YEK LL YE
Sbjct: 62  AQKLGIPPTITSASQELRSIYEKYLLPYE 90


>gnl|CDD|198082 smart01014, ARID, ARID/BRIGHT DNA binding domain.  Members of the
           recently discovered ARID (AT-rich interaction domain)
           family of DNA-binding proteins are found in fungi and
           invertebrate and vertebrate metazoans. ARID-encoding
           genes are involved in a variety of biological processes
           including embryonic development, cell lineage gene
           regulation and cell cycle control. Although the specific
           roles of this domain and of ARID-containing proteins in
           transcriptional regulation are yet to be elucidated,
           they include both positive and negative transcriptional
           regulation and a likely involvement in the modification
           of chromatin structure. The basic structure of the ARID
           domain domain appears to be a series of six
           alpha-helices separated by beta-strands, loops, or
           turns, but the structured region may extend to an
           additional helix at either or both ends of the basic
           six. Based on primary sequence homology, they can be
           partitioned into three structural classes: Minimal ARID
           proteins that consist of a core domain formed by six
           alpha helices; ARID proteins that supplement the core
           domain with an N-terminal alpha-helix; and Extended-ARID
           proteins, which contain the core domain and additional
           alpha-helices at their N- and C-termini.
          Length = 88

 Score = 95.8 bits (239), Expect = 3e-24
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 148 EEQAEFMKEIESFYRENALEF-KPPKFYGEPLNCLKLWRAVVRLGGYEVVTASKLWRQVG 206
            E+  F+  +  F  +      K P   G+PL+  +L+RAV + GG++ VT  K W+QV 
Sbjct: 1   RERELFLDRLRKFMEKRGTPLDKIPVIGGKPLDLYRLYRAVQKRGGFDKVTKKKKWKQVA 60

Query: 207 ESFHPPKTCTTVSWTFRIFYEKALLEYE 234
                P + T+   + R  YEK LL YE
Sbjct: 61  RELGIPPSATSAGTSLRKHYEKYLLPYE 88


>gnl|CDD|107221 cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of
           alpha-crystallin-type small(s) heat shock proteins
           (Hsps). sHsps are small stress induced proteins with
           monomeric masses between 12 -43 kDa, whose common
           feature is the Alpha-crystallin domain  (ACD). sHsps are
           generally active as large oligomers consisting of
           multiple subunits, and are believed to be
           ATP-independent chaperones that prevent aggregation and
           are important in refolding in combination with other
           Hsps.
          Length = 88

 Score = 42.2 bits (100), Expect = 3e-05
 Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 9/76 (11%)

Query: 357 NVREAKDCYEVYALVPGLLREEVRVQSDPAGRLVITGE-PEQVDNPWGIT-------PFK 408
           +V E  D Y V A +PG  +E+++V+ +  G L I+GE  E+ +              F 
Sbjct: 1   DVYETDDAYVVEADLPGFKKEDIKVEVED-GVLTISGEREEEEEEEENYLRRERSYGSFS 59

Query: 409 KVVILPSRIDPLQTSA 424
           +   LP  +DP +  A
Sbjct: 60  RSFRLPEDVDPDKIKA 75


>gnl|CDD|107219 cd00298, ACD_sHsps_p23-like, This domain family includes the
           alpha-crystallin domain (ACD) of alpha-crystallin-type
           small heat shock proteins (sHsps) and a similar domain
           found in p23-like proteins.  sHsps are small stress
           induced proteins with monomeric masses between 12 -43
           kDa, whose common feature is this ACD. sHsps are
           generally active as large oligomers consisting of
           multiple subunits, and are believed to be
           ATP-independent chaperones that prevent aggregation and
           are important in refolding in combination with other
           Hsps. p23 is a cochaperone of the Hsp90 chaperoning
           pathway. It binds Hsp90 and participates in the folding
           of a number of Hsp90 clients including the progesterone
           receptor. p23 also has a passive chaperoning activity.
           p23 in addition may act as the cytosolic prostaglandin
           E2 synthase. Included in this family is the p23-like
           C-terminal CHORD-SGT1 (CS) domain of suppressor of G2
           allele of Skp1 (Sgt1) and  the p23-like domains of human
           butyrate-induced transcript 1 (hB-ind1), NUD (nuclear
           distribution) C, Melusin, and NAD(P)H cytochrome b5
           (NCB5) oxidoreductase (OR).
          Length = 80

 Score = 41.8 bits (99), Expect = 3e-05
 Identities = 18/76 (23%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 358 VREAKDCYEVYALVPGLLREEVRVQSDPAGRLVITGE-PEQVDNPWGITPFKKVVILPSR 416
             +  D   V   +PG+ +E+++V+ +    L I+G+  E+ +       F++   LP  
Sbjct: 1   WYQTDDEVVVTVDLPGVKKEDIKVEVED-NVLTISGKREEEEERERSYGEFERSFELPED 59

Query: 417 IDPLQTSAVVSLHGRL 432
           +DP ++ A +  +G L
Sbjct: 60  VDPEKSKASLE-NGVL 74


>gnl|CDD|215655 pfam00011, HSP20, Hsp20/alpha crystallin family. 
          Length = 101

 Score = 35.3 bits (82), Expect = 0.010
 Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 8/77 (10%)

Query: 358 VREAKDCYEVYALVPGLLREEVRVQSDPAGRLVITGEPEQV--DNPWGIT-----PFKKV 410
           ++E KD + V   VPG   EE++V+ +   R+++ G+ E+   D+    +      F + 
Sbjct: 2   IKEDKDAFVVKLDVPGFKPEELKVKVE-DNRVLVKGKHEKEEEDDHGLRSERSYRSFSRK 60

Query: 411 VILPSRIDPLQTSAVVS 427
            +LP   DP +  A + 
Sbjct: 61  FVLPENADPDKVKASLK 77


>gnl|CDD|223149 COG0071, IbpA, Molecular chaperone (small heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 146

 Score = 34.3 bits (79), Expect = 0.046
 Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 10/84 (11%)

Query: 350 PADWVKINVREAKDCYEVYALVPGLLREEVRVQSDPAGRLVITGEPEQVDNPWGIT---- 405
           P     +++ E  D Y + A +PG+ +E++ +  +    L I GE E+ +          
Sbjct: 37  PTGTPPVDIEETDDEYRITAELPGVDKEDIEITVEG-NTLTIRGEREEEEEEEEEGYLRR 95

Query: 406 -----PFKKVVILPSRIDPLQTSA 424
                 F++   LP ++DP    A
Sbjct: 96  ERAYGEFERTFRLPEKVDPEVIKA 119


>gnl|CDD|107229 cd06472, ACD_ScHsp26_like, Alpha crystallin domain (ACD) found in
           Saccharomyces cerevisiae (Sc) small heat shock protein
           (Hsp)26 and similar proteins. sHsps are molecular
           chaperones that suppress protein aggregation and protect
           against cell stress, and are generally active as large
           oligomers consisting of multiple subunits. ScHsp26 is
           temperature-regulated, it switches from an inactive to a
           chaperone-active form upon elevation in temperature. It
           associates into large 24-mers storage forms which upon
           heat shock disassociate into dimers. These dimers
           initiate the interaction with non-native substrate
           proteins and re-assemble into large globular assemblies
           having one monomer of substrate bound per dimer. This
           group also contains Arabidopsis thaliana (Ath) Hsp15.7,
           a peroxisomal matrix protein which can complement the
           morphological phenotype of S. cerevisiae mutants
           deficient in Hsps26. AthHsp15.7 is minimally expressed
           under normal conditions and is strongly induced by heat
           and oxidative stress. Also belonging to this group is
           wheat HSP16.9 which differs in quaternary structure from
           the shell-type particles of ScHsp26, it assembles as a
           dodecameric double disc, with each disc organized as a
           trimer of dimers.
          Length = 92

 Score = 31.5 bits (72), Expect = 0.20
 Identities = 14/43 (32%), Positives = 24/43 (55%), Gaps = 5/43 (11%)

Query: 352 DWVKINVREAKDCYEVYALVPGLLREEVRVQSDPAGRLVITGE 394
           DW     +E  + +   A VPG+ +E+V+V+ +    L I+GE
Sbjct: 3   DW-----KETPEAHVFKADVPGVKKEDVKVEVEDGRVLRISGE 40


>gnl|CDD|151062 pfam10502, Peptidase_S26, Signal peptidase, peptidase S26.  This is
           a family of membrane signal serine endopeptidases which
           function in the processing of newly-synthesised secreted
           proteins. Peptidase S26 removes the hydrophobic,
           N-terminal, signal peptides as proteins are translocated
           across membranes. The active site residues take the form
           of a catalytic dyad that is Ser, Lys in subfamily S26A;
           the Ser is the nucleophile in catalysis, and the Lys is
           the general base.
          Length = 138

 Score = 29.5 bits (67), Expect = 1.6
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 266 PGSSRARRDAAARAMQGWHAQRLLGHGEV 294
           P S+   RD   R +  W   R+L  GE+
Sbjct: 77  PVSAALERDRKGRPLPPWQGCRVLPEGEL 105


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 30.3 bits (69), Expect = 1.8
 Identities = 12/43 (27%), Positives = 18/43 (41%), Gaps = 8/43 (18%)

Query: 304 LPSPARREKQLKNIGLPKNRTLDSAEKAALTEADKQIITEIVD 346
           LP      K++     PK       E   LT ADK++  E ++
Sbjct: 3   LPDSTEVNKRI-----PKKAFY---ENGELTAADKKLFVEDIE 37


>gnl|CDD|224120 COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA
           replication, recombination, and repair].
          Length = 654

 Score = 30.2 bits (68), Expect = 2.5
 Identities = 16/90 (17%), Positives = 30/90 (33%), Gaps = 4/90 (4%)

Query: 340 IITEIVDVGPPADWVKINVREAKDCYEVYALVPGLLREEVRVQSDPA----GRLVITGEP 395
               + D    + W++I  RE      +  LVP  L EE+  +          L      
Sbjct: 360 RELLLSDDPDYSYWLEIEEREGVLLLVLPLLVPSKLLEELFSKVASVVLTSATLSPLDSF 419

Query: 396 EQVDNPWGITPFKKVVILPSRIDPLQTSAV 425
             +    G+    + + LPS  +  +   +
Sbjct: 420 SSLLGLLGLEEKLRFLSLPSPFNYEEQGQL 449


>gnl|CDD|201361 pfam00634, BRCA2, BRCA2 repeat.  The alignment covers only the most
           conserved region of the repeat.
          Length = 34

 Score = 26.7 bits (60), Expect = 2.5
 Identities = 7/15 (46%), Positives = 12/15 (80%)

Query: 124 DESVRKSKNWLNDIE 138
           +ES++K+K  L +IE
Sbjct: 20  EESLKKAKKLLREIE 34


>gnl|CDD|234698 PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed.
          Length = 297

 Score = 29.7 bits (68), Expect = 2.8
 Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 14/73 (19%)

Query: 267 GSSRARRDAAARAMQGWHAQRLLGHGEVAEPIIKDKSLPSPARREKQLKNIGLPKNRTLD 326
             S ARR +A R+   W    L+  G +          P   +R        LPK   +D
Sbjct: 72  ARSLARRLSALRSFYRW----LVRRGLLKANPAAGLRAPKIPKR--------LPKPLDVD 119

Query: 327 SAEKAALTEADKQ 339
            A+   L +A  +
Sbjct: 120 QAK--RLLDAIDE 130


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 29.8 bits (67), Expect = 3.4
 Identities = 23/118 (19%), Positives = 39/118 (33%), Gaps = 8/118 (6%)

Query: 64  DAEATAGVNTKSGGGGAGTDELPESTSKTNGDDTHVENEPKTNTSVTPIPHGESSTSKAE 123
           + +     N  SG      D L E   +    DT   N  +TN         + S++   
Sbjct: 99  EDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQ-ANNDQTNDFDQ-----DDSSNSQT 152

Query: 124 DESVRKSKNWLNDIEMGEADEGTPEEQAEFMKEIESFYRENALEFKPPKFYGEPLNCL 181
           D+ +++S N  +  ++ E  +G  E   +F         E A      K        L
Sbjct: 153 DQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGND--GEEAAAKDGGKSKSSDPGPL 208


>gnl|CDD|133475 cd06583, PGRP, Peptidoglycan recognition proteins (PGRPs) are
           pattern recognition receptors that bind, and in certain
           cases, hydrolyze peptidoglycans (PGNs) of bacterial cell
           walls. PGRPs have been divided into three classes: short
           PGRPs (PGRP-S), that are small (20 kDa) extracellular
           proteins; intermediate PGRPs (PGRP-I) that are 40-45 kDa
           and are predicted to be transmembrane proteins; and long
           PGRPs (PGRP-L), up to 90 kDa, which may be either
           intracellular or transmembrane. Several structures of
           PGRPs are known in insects and mammals, some bound with
           substrates like Muramyl Tripeptide (MTP) or Tracheal
           Cytotoxin (TCT). The substrate binding site is conserved
           in PGRP-LCx, PGRP-LE, and PGRP-Ialpha proteins. This
           family includes Zn-dependent N-Acetylmuramoyl-L-alanine
           Amidase, EC:3.5.1.28. This enzyme cleaves the amide bond
           between N-acetylmuramoyl and L-amino acids,
           preferentially D-lactyl-L-Ala, in bacterial cell walls.
           The structure for the bacteriophage T7 lysozyme shows
           that two of the conserved histidines and a cysteine are
           zinc binding residues. Site-directed mutagenesis of T7
           lysozyme indicates that two conserved residues, a Tyr
           and a Lys, are important for amidase activity.
          Length = 126

 Score = 28.4 bits (64), Expect = 3.6
 Identities = 8/12 (66%), Positives = 10/12 (83%)

Query: 441 QIYEGRDWNYVA 452
           +IY+GR WNYV 
Sbjct: 47  RIYQGRGWNYVG 58


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 29.6 bits (67), Expect = 4.1
 Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 128 RKSKNWLNDIEMGEADEGTPEEQAEFMKEIESFYREN 164
           R  ++ L  +E     E   EEQ +F+ +I   YREN
Sbjct: 455 RILRDLLATVERAGRGELDEEEQEQFLSDIA--YREN 489


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.9 bits (65), Expect = 5.4
 Identities = 16/85 (18%), Positives = 32/85 (37%), Gaps = 8/85 (9%)

Query: 88  STSKTNGDDTHVENEPKTNTSVTPIPHGESSTSKAEDESVRKSKNWLNDIEMGEADEGTP 147
                    T  E E K  T+ T     E   ++++ +S +K K         + ++  P
Sbjct: 32  LVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK--------KKEKKEP 83

Query: 148 EEQAEFMKEIESFYRENALEFKPPK 172
           + + E     ++  +    + KPPK
Sbjct: 84  KSEGETKLGFKTPKKSKKTKKKPPK 108


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 28.8 bits (64), Expect = 6.0
 Identities = 15/69 (21%), Positives = 22/69 (31%), Gaps = 4/69 (5%)

Query: 15  SEANLVEASAQQQQASQQPSENGQSSCRETAEDKPETLPSVDEVFPEKSDAEATAGVNTK 74
                 +A A  Q  ++  S   +      +E   ET P V    PEK+          +
Sbjct: 376 PGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEV----PEKAAPIPDPAKPDE 431

Query: 75  SGGGGAGTD 83
               G G D
Sbjct: 432 LAVAGPGDD 440


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
           This family consists of several MGC-24 (or Cd164
           antigen) proteins from eukaryotic organisms.
           MGC-24/CD164 is a sialomucin expressed in many normal
           and cancerous tissues. In humans, soluble and
           transmembrane forms of MGC-24 are produced by
           alternative splicing.
          Length = 187

 Score = 27.7 bits (61), Expect = 9.5
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 86  PESTSKTNGDDTHVENEPKTNTSVTPIPHGESSTS 120
           P  T        HV +EPKTNT+VTP    +  ++
Sbjct: 124 PTITPSPTTSHHHVTSEPKTNTTVTPTSQPDRKST 158


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.129    0.375 

Gapped
Lambda     K      H
   0.267   0.0771    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 23,128,605
Number of extensions: 2238305
Number of successful extensions: 1665
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1656
Number of HSP's successfully gapped: 40
Length of query: 456
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 356
Effective length of database: 6,502,202
Effective search space: 2314783912
Effective search space used: 2314783912
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.2 bits)