RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3929
         (120 letters)



>gnl|CDD|100121 cd06224, REM, Guanine nucleotide exchange factor for Ras-like
          GTPases; N-terminal domain (RasGef_N), also called REM
          domain (Ras exchanger motif). This domain is common in
          nucleotide exchange factors for Ras-like small GTPases
          and is typically found immediately N-terminal to the
          RasGef (Cdc25-like) domain. REM contacts the GTPase and
          is assumed to participate in the catalytic activity of
          the exchange factor. Proteins with the REM domain
          include Sos1 and Sos2, which relay signals from
          tyrosine-kinase mediated signalling to Ras, RasGRP1-4,
          RasGRF1,2, CNrasGEF, and RAP-specific nucleotide
          exchange factors, to name a few.
          Length = 122

 Score = 46.6 bits (111), Expect = 1e-07
 Identities = 20/50 (40%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 44 TLDALIQHIVPTIDYYPDRAYLFAFLLSSRLFIKPHELLGRVIALCDVQQ 93
          TL+ALI+H+  T D  PD +++  FLL+ R F  P ELL ++I   ++  
Sbjct: 1  TLEALIEHLTSTFDM-PDPSFVSTFLLTYRSFTTPTELLEKLIERYEIAP 49


>gnl|CDD|216026 pfam00618, RasGEF_N, RasGEF N-terminal motif.  A subset of
          guanine nucleotide exchange factor for Ras-like small
          GTPases appear to possess this motif/domain N-terminal
          to the RasGef (Cdc25-like) domain.
          Length = 94

 Score = 40.4 bits (95), Expect = 2e-05
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 40 LVSGTLDALIQHIVPTIDYYPDRAYLFAFLLSSRLFIKPHELLGRVIALCDVQQK 94
          +  GTL+AL++H+  T D   D ++L  FLL+ R F  P ELL  ++        
Sbjct: 2  VKGGTLEALVEHLTST-DDMLDDSFLETFLLTYRSFTTPGELLELLLERFLSPPL 55


>gnl|CDD|214571 smart00229, RasGEFN, Guanine nucleotide exchange factor for
          Ras-like GTPases; N-terminal motif.  A subset of
          guanine nucleotide exchange factor for Ras-like small
          GTPases appear to possess this domain N-terminal to the
          RasGef (Cdc25-like) domain. The recent crystal
          structureof Sos shows that this domain is alpha-helical
          and plays a "purely structural role" (Nature 394,
          337-343).
          Length = 127

 Score = 36.9 bits (86), Expect = 5e-04
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 37 EGHLV-SGTLDALIQHIVPTIDYYPDRAYLFAFLLSSRLFIKPHELLGRVIA 87
          +G L+  GTL+ALI+H+        D +++  FLL+ R FI   ELL  ++ 
Sbjct: 1  DGGLIKGGTLEALIEHLTEA-FDKADPSFVETFLLTYRSFITTQELLQLLLY 51


>gnl|CDD|215247 PLN02448, PLN02448, UDP-glycosyltransferase family protein.
          Length = 459

 Score = 30.0 bits (68), Expect = 0.23
 Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 4/55 (7%)

Query: 6   IIRLVKEEEDEEENERKRRRRRRRSWALVYREGHLVSGT----LDALIQHIVPTI 56
           I  LVK   D E  E K  RRR +    + R      G+    LDA I+ I    
Sbjct: 405 IAELVKRFMDLESEEGKEMRRRAKELQEICRGAIAKGGSSDTNLDAFIRDISQGR 459


>gnl|CDD|218693 pfam05687, DUF822, Plant protein of unknown function (DUF822).
          This family consists of the N terminal regions of
          several plant proteins of unknown function.
          Length = 151

 Score = 28.6 bits (64), Expect = 0.62
 Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 11 KEEEDEEENERKRRRRRR 28
          K    E EN ++R RRRR
Sbjct: 6  KPTWKERENNKRRERRRR 23



 Score = 25.5 bits (56), Expect = 7.1
 Identities = 7/14 (50%), Positives = 9/14 (64%)

Query: 16 EEENERKRRRRRRR 29
           +E E  +RR RRR
Sbjct: 9  WKERENNKRRERRR 22


>gnl|CDD|218898 pfam06101, DUF946, Plant protein of unknown function (DUF946).
           This family consists of several hypothetical proteins
           from Arabidopsis thaliana and Oryza sativa. The function
           of this family is unknown.
          Length = 537

 Score = 28.8 bits (65), Expect = 0.64
 Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 46  DALIQHIVPTIDYYPDRAYL 65
            ALIQH  PTI ++PD  YL
Sbjct: 234 HALIQHYGPTIYFHPDEEYL 253


>gnl|CDD|224138 COG1217, TypA, Predicted membrane GTPase involved in stress
           response [Signal transduction mechanisms].
          Length = 603

 Score = 28.7 bits (65), Expect = 0.75
 Identities = 13/27 (48%), Positives = 18/27 (66%), Gaps = 2/27 (7%)

Query: 2   ITPYIIRLVKEEEDEEENERKRRRRRR 28
           +TP  IRL K+  +  ENERKR  +R+
Sbjct: 579 VTPESIRLRKKILN--ENERKRAEKRK 603


>gnl|CDD|132187 TIGR03143, AhpF_homolog, putative alkyl hydroperoxide reductase F
           subunit.  This family of thioredoxin reductase homologs
           is found adjacent to alkylhydroperoxide reductase C
           subunit predominantly in cases where there is only one C
           subunit in the genome and that genome is lacking the F
           subunit partner (also a thioredcxin reductase homolog)
           that is usually found (TIGR03140).
          Length = 555

 Score = 28.2 bits (63), Expect = 0.94
 Identities = 23/103 (22%), Positives = 38/103 (36%), Gaps = 7/103 (6%)

Query: 11  KEEEDEEENERKRRRRRRRSWALVYREGHLVSGTLDALIQHIVPTIDYYPDRAYLFAFLL 70
           +E E+EE  E         + A   ++G L+    D+L Q +V       +   L  FL 
Sbjct: 319 EEYEEEEAKEASEASAAETTPAATTKKGSLLD---DSLRQQLVGIFGRLENPVTLLLFLD 375

Query: 71  SSRLFIKPHELLGRVIALCDVQQKLTDKTPH--GKKELIRKYK 111
            S    K  EL   +     + +KL  +  +   + E     K
Sbjct: 376 GSNE--KSAELQSFLGEFASLSEKLNSEAVNRGEEPESETLPK 416


>gnl|CDD|220514 pfam10002, DUF2243, Predicted membrane protein (DUF2243).  This
           domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 141

 Score = 27.6 bits (62), Expect = 1.2
 Identities = 14/64 (21%), Positives = 25/64 (39%), Gaps = 9/64 (14%)

Query: 22  KRRRRRRRSWALVYREGHLVSGT-----LDALIQHIV---PTIDYYPDR-AYLFAFLLSS 72
           +   RRRR W+     G L+ G      +D +I H +     + Y  +   +   +L   
Sbjct: 66  RADLRRRRPWSGRTLWGGLLLGWGLFNLVDGIIDHKILGIHHVRYGDNPLLWDLGWLAFG 125

Query: 73  RLFI 76
            L +
Sbjct: 126 ALLL 129


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 27.7 bits (61), Expect = 1.5
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 14 EDEEENERKRRRRRR 28
          +DEEE  R+RRRR R
Sbjct: 2  DDEEEAARERRRRER 16


>gnl|CDD|115789 pfam07158, MatC_N, Dicarboxylate carrier protein MatC N-terminus.
           This family represents the N-terminal region of the
           bacterial dicarboxylate carrier protein MatC. The MatC
           protein is an integral membrane protein that could
           function as a malonate carrier.
          Length = 149

 Score = 27.4 bits (61), Expect = 1.6
 Identities = 10/34 (29%), Positives = 16/34 (47%)

Query: 41  VSGTLDALIQHIVPTIDYYPDRAYLFAFLLSSRL 74
            +GT+D L+ H V  +           FLL++ L
Sbjct: 72  TNGTIDWLVHHAVRAVRGRVGAIPWVMFLLAALL 105


>gnl|CDD|226859 COG4452, CreD, Inner membrane protein involved in colicin E2
          resistance [Defense mechanisms].
          Length = 443

 Score = 27.4 bits (61), Expect = 2.0
 Identities = 9/46 (19%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 1  MITPYIIRLVKEEEDEEENERKRRRRRRRSWALVYREGHLVSGTLD 46
          ++ P I     E    E   +   R+R      +  E  +V+G + 
Sbjct: 55 LVGPLIAIPYTELYTVEVEGKTVVRKRSF-IHFILPESLMVTGNMK 99


>gnl|CDD|220550 pfam10070, DUF2309, Uncharacterized protein conserved in bacteria
           (DUF2309).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 783

 Score = 27.6 bits (62), Expect = 2.0
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 9   LVKEEEDEEENERKRRRRRRRSWA 32
            VKE  DE+E     +RRRRR  A
Sbjct: 381 RVKEGGDEDEAAAYDQRRRRRLRA 404


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
          domain family is found in eukaryotes, and is typically
          between 99 and 114 amino acids in length. The family is
          found in association with pfam08697, pfam01585. There
          are two completely conserved residues (G and F) that
          may be functionally important. TIP is involved in
          enamel assembly by interacting with one of the major
          proteins responsible for biomineralisation of enamel -
          tuftelin.
          Length = 106

 Score = 26.6 bits (59), Expect = 2.3
 Identities = 6/19 (31%), Positives = 12/19 (63%)

Query: 13 EEDEEENERKRRRRRRRSW 31
          ++D+++   +RR R RR  
Sbjct: 41 DDDDDDEGERRRGRGRRRS 59



 Score = 25.8 bits (57), Expect = 4.5
 Identities = 10/19 (52%), Positives = 16/19 (84%)

Query: 12 EEEDEEENERKRRRRRRRS 30
          +++D++E ER+R R RRRS
Sbjct: 41 DDDDDDEGERRRGRGRRRS 59


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
          initially from a Japanese patient with hepatitis of
          unknown aetiology, has since been found to infect both
          healthy and diseased individuals and numerous
          prevalence studies have raised questions about its role
          in unexplained hepatitis. ORF1 is a large 750 residue
          protein. The N-terminal half of this protein
          corresponds to the capsid protein.
          Length = 525

 Score = 27.2 bits (61), Expect = 2.4
 Identities = 8/16 (50%), Positives = 9/16 (56%)

Query: 21 RKRRRRRRRSWALVYR 36
          R+R RRRRR      R
Sbjct: 31 RRRVRRRRRGRRRRRR 46


>gnl|CDD|235390 PRK05289, PRK05289, UDP-N-acetylglucosamine acyltransferase;
           Provisional.
          Length = 262

 Score = 26.2 bits (59), Expect = 4.0
 Identities = 16/60 (26%), Positives = 28/60 (46%), Gaps = 5/60 (8%)

Query: 20  ERKRRRRRRRSWALVYREGHLVSGTLDALIQHIVPTIDYYPDRAYLFAFLLSS-RLFIKP 78
            R+     RR++ L+YR G  +   L+ L +    +    P+   +  F+ SS R  I+P
Sbjct: 205 SREEIHALRRAYKLLYRSGLTLEEALEELAEEYPDS----PEVKEILDFIESSKRGIIRP 260


>gnl|CDD|204616 pfam11240, DUF3042, Protein of unknown function (DUF3042).  This
          family of proteins with unknown function appears to be
          restricted to Firmicutes.
          Length = 54

 Score = 25.0 bits (55), Expect = 4.2
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 6  IIRLVKEEEDEEENERKRRRRRRRS 30
          +I  ++E+E+  E  RK+  R+R S
Sbjct: 30 VIEPIEEKEEFIEENRKKAARKRVS 54


>gnl|CDD|176217 cd08255, 2-desacetyl-2-hydroxyethyl_bacteriochlorophyllide_like,
           2-desacetyl-2-hydroxyethyl bacteriochlorophyllide and
           other MDR family members.  This subgroup of the medium
           chain dehydrogenases/reductase (MDR)/zinc-dependent
           alcohol dehydrogenase-like family has members identified
           as 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A
           dehydrogenase and alcohol dehydrogenases. The medium
           chain dehydrogenases/reductase (MDR)/zinc-dependent
           alcohol dehydrogenase-like family, which contains the
           zinc-dependent alcohol dehydrogenase (ADH-Zn) and
           related proteins, is a diverse group of proteins related
           to the first identified member, class I mammalian ADH.
           MDRs display a broad range of activities and are
           distinguished from the smaller short chain
           dehydrogenases (~ 250 amino acids vs. the ~ 350 amino
           acids of the MDR).  The MDR proteins have 2 domains: a
           C-terminal NAD(P) binding-Rossmann fold domain of a
           beta-alpha form and an N-terminal catalytic domain with
           distant homology to GroES.  The MDR group contains a
           host of activities, including the founding alcohol
           dehydrogenase (ADH), quinone reductase, sorbitol
           dehydrogenase, formaldehyde dehydrogenase, butanediol
           DH, ketose reductase, cinnamyl reductase, and numerous
           others. The zinc-dependent alcohol dehydrogenases (ADHs)
           catalyze the  NAD(P)(H)-dependent interconversion of
           alcohols to aldehydes or ketones.  Active site zinc has
           a catalytic role, while structural zinc aids in
           stability.
          Length = 277

 Score = 26.1 bits (58), Expect = 4.5
 Identities = 11/32 (34%), Positives = 13/32 (40%), Gaps = 6/32 (18%)

Query: 23  RRRRRRRSWALVYREGHLVSGTLDALIQHIVP 54
             R    +  L      L  G L+ALI H VP
Sbjct: 229 EARNLEEALDL------LAEGRLEALITHRVP 254


>gnl|CDD|182045 PRK09718, PRK09718, hypothetical protein; Validated.
          Length = 512

 Score = 26.2 bits (57), Expect = 5.1
 Identities = 18/66 (27%), Positives = 31/66 (46%)

Query: 45  LDALIQHIVPTIDYYPDRAYLFAFLLSSRLFIKPHELLGRVIALCDVQQKLTDKTPHGKK 104
           LD  I +I+  ID  PD A +   +L+ +L ++   L        +  +K+    P+ K 
Sbjct: 258 LDGYINNIIKMIDTLPDNAMILKSVLAVKLVMQLKILNIVNKNFIENMKKIFSHCPYIKD 317

Query: 105 ELIRKY 110
            +IR Y
Sbjct: 318 PIIRSY 323


>gnl|CDD|238009 cd00052, EH, Eps15 homology domain; found in proteins implicated
          in endocytosis, vesicle transport, and signal
          transduction. The alignment contains a pair of EF-hand
          motifs, typically one of them is canonical and binds to
          Ca2+, while the other may not bind to Ca2+. A
          hydrophobic binding pocket is formed by residues from
          both EF-hand motifs. The EH domain binds to proteins
          containing NPF (class I), [WF]W or SWG (class II), or
          H[TS]F (class III) sequence motifs.
          Length = 67

 Score = 24.9 bits (55), Expect = 5.3
 Identities = 8/28 (28%), Positives = 12/28 (42%), Gaps = 3/28 (10%)

Query: 67 AFLLSSRLFIKPHELLGRVIALCDVQQK 94
           FL  S L   P  +L ++  L D  + 
Sbjct: 23 PFLGKSGL---PRSVLAQIWDLADTDKD 47


>gnl|CDD|197477 smart00027, EH, Eps15 homology domain.  Pair of EF hand motifs
          that recognise proteins containing Asn-Pro-Phe (NPF)
          sequences.
          Length = 96

 Score = 25.3 bits (56), Expect = 5.6
 Identities = 9/28 (32%), Positives = 12/28 (42%), Gaps = 3/28 (10%)

Query: 67 AFLLSSRLFIKPHELLGRVIALCDVQQK 94
            LL S L   P  LL ++  L D+   
Sbjct: 34 PILLKSGL---PQTLLAKIWNLADIDND 58


>gnl|CDD|221115 pfam11456, DUF3019, Protein of unknown function (DUF3019).  This is
           a bacterial family of uncharacterized proteins.
          Length = 102

 Score = 25.3 bits (56), Expect = 5.7
 Identities = 8/14 (57%), Positives = 10/14 (71%)

Query: 20  ERKRRRRRRRSWAL 33
             K+RRRRR  W+L
Sbjct: 89  SPKKRRRRRNPWSL 102


>gnl|CDD|215003 smart01060, Catalase, Catalases are antioxidant enzymes that
           catalyse the conversion of hydrogen peroxide to water
           and molecular oxygen, serving to protect cells from its
           toxic effects.  Hydrogen peroxide is produced as a
           consequence of oxidative cellular metabolism and can be
           converted to the highly reactive hydroxyl radical via
           transition metals, this radical being able to damage a
           wide variety of molecules within a cell, leading to
           oxidative stress and cell death. Catalases act to
           neutralise hydrogen peroxide toxicity, and are produced
           by all aerobic organisms ranging from bacteria to man.
           Most catalases are mono-functional, haem-containing
           enzymes, although there are also bifunctional
           haem-containing peroxidase/catalases that are closely
           related to plant peroxidases, and non-haem,
           manganese-containing catalases that are found in
           bacteria.
          Length = 373

 Score = 25.9 bits (58), Expect = 6.0
 Identities = 9/26 (34%), Positives = 15/26 (57%), Gaps = 6/26 (23%)

Query: 50  QHIVPTIDYYPDRAYLFAFLLSSRLF 75
            ++VP I++ PD+      +L  RLF
Sbjct: 307 SNLVPGIEFSPDK------MLQGRLF 326


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
          terminus.  The C terminus of the plasma membrane Nha1
          antiporter plays an important role in the immediate
          cell response to hypo-osmotic shock which prevents an
          execessive loss of ions and water. This domain is found
          with pfam00999.
          Length = 430

 Score = 25.9 bits (57), Expect = 6.7
 Identities = 8/27 (29%), Positives = 15/27 (55%), Gaps = 5/27 (18%)

Query: 8  RLVKEEE-----DEEENERKRRRRRRR 29
          R V++++         +  +RRRR+RR
Sbjct: 56 RRVRKKKKGSRAGRRASSLRRRRRQRR 82


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 26.0 bits (57), Expect = 6.9
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 11  KEEEDEEENERKRRRRRRR 29
           K+++ EEE  RKR+ R R+
Sbjct: 394 KQDDTEEEERRKRQERERQ 412


>gnl|CDD|236485 PRK09368, PRK09368, gas vesicle synthesis-like protein; Reviewed.
          Length = 140

 Score = 25.4 bits (56), Expect = 7.2
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query: 11  KEEEDEEENERKRRRRRRRS 30
           +  ++EEE  R+R R R+  
Sbjct: 115 RGSDEEEERRRERPRPRKAP 134


>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain.  This putative domain is found
           to be the most conserved region in mediator of
           replication checkpoint protein 1.
          Length = 145

 Score = 25.4 bits (56), Expect = 7.3
 Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 4/40 (10%)

Query: 12  EEEDEEENERKRRRRRRRSWALVYREGHLVSGTLDALIQH 51
           + EDE    R+ +RR         R+  L  G LD L ++
Sbjct: 101 DSEDELLQRRRLKRRELALM----RQRLLEDGDLDKLAKN 136


>gnl|CDD|237747 PRK14530, PRK14530, adenylate kinase; Provisional.
          Length = 215

 Score = 25.5 bits (56), Expect = 7.7
 Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 21/58 (36%)

Query: 8   RLVKEEEDEEENERKRRRRRRRSWALVYREGHLVSGTLDALIQHIVPTIDYYPDRAYL 65
            L++ ++D EE  R+R          V+ E             +  P I++Y D+  L
Sbjct: 155 ELIQRDDDTEETVRERLD--------VFEE-------------NTEPVIEHYRDQGVL 191


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 25.6 bits (56), Expect = 7.9
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 9   LVKEEEDEEENERKRRRRRRR 29
               EE+ EE E+++R+R ++
Sbjct: 280 RSDSEEETEEKEKEKRKRLKK 300


>gnl|CDD|218749 pfam05788, Orbi_VP1, Orbivirus RNA-dependent RNA polymerase (VP1). 
           This family consists of the RNA-dependent RNA polymerase
           protein VP1 from the Orbiviruses. VP1 may have both
           enzymatic and structural roles in the virus life cycle.
          Length = 1301

 Score = 25.5 bits (56), Expect = 8.3
 Identities = 14/68 (20%), Positives = 27/68 (39%), Gaps = 10/68 (14%)

Query: 8   RLVKEEEDEEENERKRRRRRRR----------SWALVYREGHLVSGTLDALIQHIVPTID 57
             VK + D EE ER+ + R+++          SW  ++        TL   +  ++    
Sbjct: 43  HGVKYKRDAEEIEREIKMRKKQLYGLPVLRESSWEELFDIEERSEDTLKIYVNSVLDLEK 102

Query: 58  YYPDRAYL 65
             P+  +L
Sbjct: 103 LDPEEEFL 110


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 25.6 bits (56), Expect = 8.4
 Identities = 7/16 (43%), Positives = 9/16 (56%)

Query: 16  EEENERKRRRRRRRSW 31
           E E E+K R  R+  W
Sbjct: 333 EREKEQKEREGRQSEW 348


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 25.1 bits (56), Expect = 9.6
 Identities = 7/22 (31%), Positives = 8/22 (36%)

Query: 16  EEENERKRRRRRRRSWALVYRE 37
             E ER+ RR      AL    
Sbjct: 87  GTEEERRERREENLEKALELLR 108


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.327    0.144    0.428 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,759,361
Number of extensions: 632952
Number of successful extensions: 2026
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1931
Number of HSP's successfully gapped: 110
Length of query: 120
Length of database: 10,937,602
Length adjustment: 83
Effective length of query: 37
Effective length of database: 7,256,220
Effective search space: 268480140
Effective search space used: 268480140
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 53 (24.0 bits)