RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11688
         (546 letters)



>gnl|CDD|163660 cd07417, MPP_PP5_C, PP5, C-terminal metallophosphatase domain.
           Serine/threonine protein phosphatase-5 (PP5) is a member
           of the PPP gene family of protein phosphatases that is
           highly conserved among eukaryotes and widely expressed
           in mammalian tissues. PP5 has a C-terminal phosphatase
           domain and an extended N-terminal TPR (tetratricopeptide
           repeat) domain containing three TPR motifs.  The PPP
           (phosphoprotein phosphatase) family, to which PP5
           belongs, is one of two known protein phosphatase
           families specific for serine and threonine.  The PPP
           family also includes: PP1, PP2A, PP2B (calcineurin),
           PP4, PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4
           hydrolase. The PPP catalytic domain is defined by three
           conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-).  The
           PPP enzyme family is ancient with members found in all
           eukaryotes, and in most bacterial and archeal genomes.
           Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 316

 Score =  567 bits (1463), Expect = 0.0
 Identities = 204/303 (67%), Positives = 237/303 (78%), Gaps = 2/303 (0%)

Query: 131 ARPNDKDAKAKFTECNKIVKRMQIKGKLHRKYAYRILMDIKALFMTQDSLVHITVEDEAK 190
            R  D+    +F +  ++++  + + KLH+KYAY+IL+ +K L     SLV IT+ +  K
Sbjct: 4   PRLEDEKVTLEFVK--EMIEWFKDQKKLHKKYAYQILLQVKELLKKLPSLVEITIPEGEK 61

Query: 191 FTVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNH 250
            TVCGD HGQFYDL+NIFELNGLPS  NPYLFNGDFVDRGSFSVE I TLF FKLLYPNH
Sbjct: 62  ITVCGDTHGQFYDLLNIFELNGLPSETNPYLFNGDFVDRGSFSVEVILTLFAFKLLYPNH 121

Query: 251 FFMARGNHESAYMNQIYGFEGEVKSKYTDLMAQFFTEVYNWLPLCHCINNKVLVMHGGLF 310
           F + RGNHE+  MN++YGFEGEVK+KY + M   F+EV+NWLPL H IN KVLV+HGGLF
Sbjct: 122 FHLNRGNHETDNMNKMYGFEGEVKAKYNEQMFDLFSEVFNWLPLAHLINGKVLVVHGGLF 181

Query: 311 SSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDPQIPNGRAPSKRGVGVHFGPDVTKAFLE 370
           S D VTL+DIR IDR RQPPD GLMCELLWSDPQ   GR+PSKRGVG  FGPDVTK FLE
Sbjct: 182 SDDGVTLDDIRKIDRFRQPPDSGLMCELLWSDPQPQPGRSPSKRGVGCQFGPDVTKRFLE 241

Query: 371 RNKLEYIIRSHEVKQDGYEVAHDGRCITVFSAPNYCDSVGNKGAFITMKGKDMVPHFTTY 430
            N LEYIIRSHEVK +GYEV HDG+CITVFSAPNYCD +GNKGAFI + G D+ P FT +
Sbjct: 242 ENNLEYIIRSHEVKDEGYEVEHDGKCITVFSAPNYCDQMGNKGAFIRITGSDLKPKFTQF 301

Query: 431 EAV 433
           EAV
Sbjct: 302 EAV 304



 Score =  122 bits (309), Expect = 3e-31
 Identities = 53/123 (43%), Positives = 70/123 (56%), Gaps = 38/123 (30%)

Query: 440 PALENDKVTLKFMTDLMQTYKAQGKLHRKYAYRILMDIKK----------------KKI- 482
           P LE++KVTL+F+ ++++ +K Q KLH+KYAY+IL+ +K+                +KI 
Sbjct: 4   PRLEDEKVTLEFVKEMIEWFKDQKKLHKKYAYQILLQVKELLKKLPSLVEITIPEGEKIT 63

Query: 483 ------------LNL---------DFTQLFNGDFVDRGSFSVECIFTLFGFKLLYPNHFF 521
                       LN+             LFNGDFVDRGSFSVE I TLF FKLLYPNHF 
Sbjct: 64  VCGDTHGQFYDLLNIFELNGLPSETNPYLFNGDFVDRGSFSVEVILTLFAFKLLYPNHFH 123

Query: 522 MAR 524
           + R
Sbjct: 124 LNR 126


>gnl|CDD|197547 smart00156, PP2Ac, Protein phosphatase 2A homologues, catalytic
           domain.  Large family of serine/threonine phosphatases,
           that includes PP1, PP2A and PP2B (calcineurin) family
           members.
          Length = 271

 Score =  366 bits (942), Expect = e-125
 Identities = 135/277 (48%), Positives = 181/277 (65%), Gaps = 8/277 (2%)

Query: 158 LHRKYAYRILMDIKALFMTQDSLVHITVEDEAKFTVCGDIHGQFYDLMNIFELNGLPSPD 217
           L+++    +L ++K +F  + +LV +     A  TVCGDIHGQF DL+ +F+ NG P P+
Sbjct: 1   LYKEEILELLREVKEIFRQEPNLVEV----SAPVTVCGDIHGQFDDLLRLFDKNGQP-PE 55

Query: 218 NPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKSKY 277
             Y+F GD+VDRG FS+E I  LF  K+LYPN   + RGNHES  MN+IYGF  E K KY
Sbjct: 56  TNYVFLGDYVDRGPFSIEVILLLFALKILYPNRIVLLRGNHESRSMNEIYGFYDECKRKY 115

Query: 278 TDLMAQFFTEVYNWLPLCHCINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLMCE 337
            + + + F E ++WLPL   IN K+L MHGGL S D  TL+DIR + R ++PPD+GL+ +
Sbjct: 116 GERIYEKFNEAFSWLPLAALINGKILCMHGGL-SPDLTTLDDIRKLKRPQEPPDDGLLID 174

Query: 338 LLWSDP-QIPNGRAPSKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQDGYEVAHDGRC 396
           LLWSDP Q  NG  PS RG    FGPD    FL++N L+ IIR+H+V  DGYE   DG+ 
Sbjct: 175 LLWSDPDQPVNGFGPSIRGASYIFGPDAVDEFLKKNNLKLIIRAHQVVDDGYEFFADGKL 234

Query: 397 ITVFSAPNYCDSVGNKGAFITMKGKDMVPHFTTYEAV 433
           +T+FSAPNYCD  GNK A + +  KD+   F  ++  
Sbjct: 235 VTIFSAPNYCDRFGNKAAVLKVD-KDLKLTFEQFKPG 270



 Score = 65.7 bits (161), Expect = 7e-12
 Identities = 19/35 (54%), Positives = 24/35 (68%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR 524
           +F GD+VDRG FS+E I  LF  K+LYPN   + R
Sbjct: 59  VFLGDYVDRGPFSIEVILLLFALKILYPNRIVLLR 93


>gnl|CDD|163663 cd07420, MPP_RdgC, Drosophila melanogaster RdgC and related
           proteins, metallophosphatase domain.  RdgC (retinal
           degeneration C) is a vertebrate serine-threonine protein
           phosphatase that is required to prevent light-induced
           retinal degeneration.  In addition to its catalytic
           domain, RdgC has two C-terminal EF hands.  Homologs of
           RdgC include the human phosphatases protein phosphatase
           with EF hands 1 and -2 (PPEF-1 and -2).  PPEF-1
           transcripts are present at low levels in the retina,
           PPEF-2 transcripts and PPEF-2 protein are present at
           high levels in photoreceptors.  The PPP (phosphoprotein
           phosphatase) family, to which RdgC belongs, is one of
           two known protein phosphatase families specific for
           serine and threonine.  The PPP family also includes:
           PP1, PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1,
           PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic
           domain is defined by three conserved motifs (-GDXHG-,
           -GDXVDRG- and -GNHE-).  The PPP enzyme family is ancient
           with members found in all eukaryotes, and in most
           bacterial and archeal genomes.  Dephosphorylation of
           phosphoserines and phosphothreonines on target proteins
           plays a central role in the regulation of many cellular
           processes.  PPPs belong to the metallophosphatase (MPP)
           superfamily.  MPPs are functionally diverse, but all
           share a conserved domain with an active site consisting
           of two metal ions (usually manganese, iron, or zinc)
           coordinated with octahedral geometry by a cage of
           histidine, aspartate, and asparagine residues. The MPP
           superfamily includes: Mre11/SbcD-like exonucleases,
           Dbr1-like RNA lariat debranching enzymes, YfcE-like
           phosphodiesterases, purple acid phosphatases (PAPs),
           YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 321

 Score =  293 bits (753), Expect = 1e-95
 Identities = 126/321 (39%), Positives = 182/321 (56%), Gaps = 34/321 (10%)

Query: 142 FTECNKIVKRMQIKGKLHRKYAYRILMDIKALFMTQDSLVHITVEDEAKFTVCGDIHGQF 201
               + +++  + K  LH KY   IL + + +     ++  ++     + T+CGD+HG+ 
Sbjct: 4   KDHIDALIEAFKEKQLLHAKYVLLILREARKVLKQLPNISRVSTSISKQVTICGDLHGKL 63

Query: 202 YDLMNIFELNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMARGNHESA 261
            DL  IF  NGLPSP+NPY+FNGDFVDRG  S+E +  LF F L+YPN   + RGNHE  
Sbjct: 64  DDLFLIFYKNGLPSPENPYVFNGDFVDRGKRSIEILIILFAFFLVYPNEVHLNRGNHEDH 123

Query: 262 YMNQIYGFEGEVKSKYTDLMA---QFFTEVYNWLPLCHCINNKVLVMHGGLFSSDNVTLE 318
            MN  YGF  EV SKY        +   +V++WLPL   I+NK+LV+HGG+  SD+  L+
Sbjct: 124 IMNLRYGFTKEVMSKYKLHGKKILRLLEDVFSWLPLATIIDNKILVVHGGI--SDSTDLD 181

Query: 319 DIRTIDRNR-----QPP-------------DEGL---------MCELLWSDPQIPNGRAP 351
            +  IDR++     +PP             D            + ++LWSDP+   G  P
Sbjct: 182 LLDKIDRHKYVSVLRPPLRKGMEELTGEEEDPSEPLDKTEWRQILDILWSDPKAQKGCKP 241

Query: 352 -SKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQDGYEVAHDGRCITVFSAPNYCDSVG 410
            + RG G +FGPDVT   L+++ L  +IRSHE K +GYE  H+ + IT+FSA NY +   
Sbjct: 242 NTFRGGGCYFGPDVTSKVLQKHGLSLLIRSHECKPEGYEFCHNNKVITIFSASNYYEEGS 301

Query: 411 NKGAFITMKGKDMVPHFTTYE 431
           N+GA+I + G D+ PHF  Y+
Sbjct: 302 NRGAYIKL-GPDLTPHFVQYQ 321



 Score = 52.0 bits (125), Expect = 3e-07
 Identities = 19/35 (54%), Positives = 24/35 (68%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR 524
           +FNGDFVDRG  S+E +  LF F L+YPN   + R
Sbjct: 83  VFNGDFVDRGKRSIEILIILFAFFLVYPNEVHLNR 117


>gnl|CDD|163658 cd07415, MPP_PP2A_PP4_PP6, PP2A, PP4, and PP6 phosphoprotein
           phosphatases, metallophosphatase domain.  PP2A-like
           family of phosphoprotein phosphatases (PPP's) including
           PP4 and PP6.  PP2A (Protein phosphatase 2A) is a
           critical regulator of many cellular activities.  PP2A
           comprises about 1% of total cellular proteins.  PP2A,
           together with protein phosphatase 1 (PP1), accounts for
           more than 90% of all serine/threonine phosphatase
           activities in most cells and tissues. The PP2A subunit
           in addition to having a catalytic domain homologous to
           PP1, has a unique C-terminal tail, containing a motif
           that is conserved in the catalytic subunits of all
           PP2A-like phosphatases including PP4 and PP6, and has an
           important role in PP2A regulation.  The PP2A-like family
           of phosphatases all share a similar heterotrimeric
           architecture, that includes: a 65kDa scaffolding subunit
           (A), a 36kDa catalytic subunit (C), and one of 18
           regulatory subunits (B).  The PPP (phosphoprotein
           phosphatase) family, to which PP2A belongs, is one of
           two known protein phosphatase families specific for
           serine and threonine.  The PPP family also includes:
           PP1, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1, RdgC,
           PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic
           domain is defined by three conserved motifs (-GDXHG-,
           -GDXVDRG- and -GNHE-).  The PPP enzyme family is ancient
           with members found in all eukaryotes, and in most
           bacterial and archeal genomes.  Dephosphorylation of
           phosphoserines and phosphothreonines on target proteins
           plays a central role in the regulation of many cellular
           processes.  PPPs belong to the metallophosphatase (MPP)
           superfamily.  MPPs are functionally diverse, but all
           share a conserved domain with an active site consisting
           of two metal ions (usually manganese, iron, or zinc)
           coordinated with octahedral geometry by a cage of
           histidine, aspartate, and asparagine residues. The MPP
           superfamily includes: Mre11/SbcD-like exonucleases,
           Dbr1-like RNA lariat debranching enzymes, YfcE-like
           phosphodiesterases, purple acid phosphatases (PAPs),
           YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 285

 Score =  286 bits (735), Expect = 2e-93
 Identities = 107/243 (44%), Positives = 148/243 (60%), Gaps = 4/243 (1%)

Query: 192 TVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNHF 251
           TVCGDIHGQFYDL+ +F + G P PD  YLF GD+VDRG +SVE    L   K+ YP+  
Sbjct: 45  TVCGDIHGQFYDLLELFRVGGDP-PDTNYLFLGDYVDRGYYSVETFLLLLALKVRYPDRI 103

Query: 252 FMARGNHESAYMNQIYGFEGEVKSKYTDLMA-QFFTEVYNWLPLCHCINNKVLVMHGGLF 310
            + RGNHES  + Q+YGF  E   KY +    ++ T+++++LPL   I+N++  +HGGL 
Sbjct: 104 TLLRGNHESRQITQVYGFYDECLRKYGNANVWKYCTDLFDYLPLAALIDNQIFCVHGGL- 162

Query: 311 SSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDPQIPNGRAPSKRGVGVHFGPDVTKAFLE 370
           S    TL+ IR IDR ++ P EG MC+LLWSDP    G   S RG G  FG DV + F  
Sbjct: 163 SPSIDTLDQIRAIDRFQEVPHEGPMCDLLWSDPDDIEGWGISPRGAGYLFGQDVVEEFNH 222

Query: 371 RNKLEYIIRSHEVKQDGYEVAHDGRCITVFSAPNYCDSVGNKGAFITMKGKDMVPHFTTY 430
            N L  I R+H++  +GY+   D + +TV+SAPNYC   GN  + + +  + +   F  +
Sbjct: 223 NNGLTLICRAHQLVMEGYQWMFDDKLVTVWSAPNYCYRCGNVASIMEL-DEHLKRSFKVF 281

Query: 431 EAV 433
           EA 
Sbjct: 282 EAA 284



 Score = 51.5 bits (124), Expect = 4e-07
 Identities = 16/35 (45%), Positives = 21/35 (60%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR 524
           LF GD+VDRG +SVE    L   K+ YP+   + R
Sbjct: 73  LFLGDYVDRGYYSVETFLLLLALKVRYPDRITLLR 107


>gnl|CDD|163659 cd07416, MPP_PP2B, PP2B, metallophosphatase domain.  PP2B
           (calcineurin) is a unique serine/threonine protein
           phosphatase in its regulation by a second messenger
           (calcium and calmodulin).  PP2B is involved in many
           biological processes including immune responses, the
           second messenger cAMP pathway, sodium/potassium ion
           transport in the nephron, cell cycle progression in
           lower eukaryotes, cardiac hypertrophy, and memory
           formation.  PP2B is highly conserved from yeast to
           humans, but is absent from plants.  PP2B is a
           heterodimer consisting of a catalytic subunit (CnA) and
           a regulatory subunit (CnB); CnB  contains four Ca2+
           binding motifs referred to as EF hands.  The PPP
           (phosphoprotein phosphatase) family, to which PP2B
           belongs, is one of two known protein phosphatase
           families specific for serine and threonine.  The PPP
           family also includes: PP1, PP2A, PP4, PP5, PP6, PP7,
           Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP
           catalytic domain is defined by three conserved motifs
           (-GDXHG-, -GDXVDRG- and -GNHE-).  The PPP enzyme family
           is ancient with members found in all eukaryotes, and in
           most bacterial and archeal genomes.  Dephosphorylation
           of phosphoserines and phosphothreonines on target
           proteins plays a central role in the regulation of many
           cellular processes.  PPPs belong to the
           metallophosphatase (MPP) superfamily.  MPPs are
           functionally diverse, but all share a conserved domain
           with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 305

 Score =  261 bits (670), Expect = 2e-83
 Identities = 112/282 (39%), Positives = 160/282 (56%), Gaps = 20/282 (7%)

Query: 156 GKLHRKYAYRILMDIKALFMTQDSLVHITVEDEAKFTVCGDIHGQFYDLMNIFELNGLPS 215
           G+L  + A RI+ +   +   + +L+ I    EA  TVCGDIHGQFYDL+ +FE+ G P+
Sbjct: 14  GRLSEEDALRIITEGAEILRQEPNLLRI----EAPVTVCGDIHGQFYDLLKLFEVGGSPA 69

Query: 216 PDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKS 275
            +  YLF GD+VDRG FS+EC+  L+  K+LYP   F+ RGNHE  ++ + + F+ E K 
Sbjct: 70  -NTRYLFLGDYVDRGYFSIECVLYLWALKILYPKTLFLLRGNHECRHLTEYFTFKQECKI 128

Query: 276 KYTDLMAQFFTEVYNWLPLCHCINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLM 335
           KY++ +     E ++ LPL   +N + L +HGGL S +  TL+DIR +DR R+PP  G M
Sbjct: 129 KYSERVYDACMEAFDCLPLAALMNQQFLCVHGGL-SPELKTLDDIRKLDRFREPPAFGPM 187

Query: 336 CELLWSDPQIPNGRAP--------SKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQDG 387
           C+LLWSDP    G           + RG    +       FL++N L  IIR+HE +  G
Sbjct: 188 CDLLWSDPLEDFGNEKTQEHFVHNTVRGCSYFYSYRAVCEFLQKNNLLSIIRAHEAQDAG 247

Query: 388 YEVAHDGR------CITVFSAPNYCDSVGNKGAFITMKGKDM 423
           Y +    +       IT+FSAPNY D   NK A +  +   M
Sbjct: 248 YRMYRKSQTTGFPSLITIFSAPNYLDVYNNKAAVLKYENNVM 289



 Score = 61.2 bits (149), Expect = 3e-10
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 4/51 (7%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR--VEC--IFTLFGFKL 536
           LF GD+VDRG FS+EC+  L+  K+LYP   F+ R   EC  +   F FK 
Sbjct: 74  LFLGDYVDRGYFSIECVLYLWALKILYPKTLFLLRGNHECRHLTEYFTFKQ 124


>gnl|CDD|163657 cd07414, MPP_PP1_PPKL, PP1, PPKL (PP1 and kelch-like) enzymes,  and
           related proteins, metallophosphatase domain.  PP1
           (protein phosphatase type 1) is a serine/threonine
           phosphatase that regulates many cellular processes
           including: cell-cycle progression, protein synthesis,
           muscle contraction, carbohydrate metabolism,
           transcription and neuronal signaling, through its
           interaction with at least 180 known targeting proteins. 
           PP1 occurs in all tissues and regulates many pathways,
           ranging from cell-cycle progression to carbohydrate
           metabolism.  Also included here are the PPKL (PP1 and
           kelch-like) enzymes including the PPQ, PPZ1, and PPZ2
           fungal phosphatases.  These PPKLs have a large
           N-terminal kelch repeat in addition to a C-terminal
           phosphoesterase domain.  The PPP (phosphoprotein
           phosphatase) family, to which PP1 belongs, is one of two
           known protein phosphatase families specific for serine
           and threonine.  The PPP family also includes: PP2A, PP2B
           (calcineurin), PP4, PP5, PP6,  PP7, Bsu1, RdgC, PrpE,
           PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic domain
           is defined by three conserved motifs (-GDXHG-, -GDXVDRG-
           and -GNHE-).  The PPP enzyme family is ancient with
           members found in all eukaryotes, and in most bacterial
           and archeal genomes.  Dephosphorylation of
           phosphoserines and phosphothreonines on target proteins
           plays a central role in the regulation of many cellular
           processes.  PPPs belong to the metallophosphatase (MPP)
           superfamily.  MPPs are functionally diverse, but all
           share a conserved domain with an active site consisting
           of two metal ions (usually manganese, iron, or zinc)
           coordinated with octahedral geometry by a cage of
           histidine, aspartate, and asparagine residues. The MPP
           superfamily includes: Mre11/SbcD-like exonucleases,
           Dbr1-like RNA lariat debranching enzymes, YfcE-like
           phosphodiesterases, purple acid phosphatases (PAPs),
           YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 293

 Score =  253 bits (649), Expect = 1e-80
 Identities = 113/250 (45%), Positives = 159/250 (63%), Gaps = 7/250 (2%)

Query: 171 KALFMTQDSLVHITVEDEAKFTVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGDFVDRG 230
           + +F++Q  L+    E EA   +CGDIHGQ+YDL+ +FE  G P P++ YLF GD+VDRG
Sbjct: 36  REIFLSQPILL----ELEAPLKICGDIHGQYYDLLRLFEYGGFP-PESNYLFLGDYVDRG 90

Query: 231 SFSVECIFTLFGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKSKYTDLMAQFFTEVYN 290
             S+E I  L  +K+ YP +FF+ RGNHE A +N+IYGF  E K +Y   + + FT+ +N
Sbjct: 91  KQSLETICLLLAYKIKYPENFFLLRGNHECASINRIYGFYDECKRRYNIKLWKTFTDCFN 150

Query: 291 WLPLCHCINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDP-QIPNGR 349
            LP+   I+ K+  MHGGL S D  ++E IR I R    PD+GL+C+LLWSDP +   G 
Sbjct: 151 CLPVAAIIDEKIFCMHGGL-SPDLQSMEQIRRIMRPTDVPDQGLLCDLLWSDPDKDVQGW 209

Query: 350 APSKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQDGYEVAHDGRCITVFSAPNYCDSV 409
             + RGV   FG DV   FL ++ L+ I R+H+V +DGYE     + +T+FSAPNYC   
Sbjct: 210 GENDRGVSFTFGKDVVAKFLNKHDLDLICRAHQVVEDGYEFFAKRQLVTLFSAPNYCGEF 269

Query: 410 GNKGAFITMK 419
            N GA +++ 
Sbjct: 270 DNAGAMMSVD 279



 Score = 48.5 bits (116), Expect = 3e-06
 Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 4/49 (8%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR--VEC--IFTLFGF 534
           LF GD+VDRG  S+E I  L  +K+ YP +FF+ R   EC  I  ++GF
Sbjct: 81  LFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNHECASINRIYGF 129


>gnl|CDD|140271 PTZ00244, PTZ00244, serine/threonine-protein phosphatase PP1;
           Provisional.
          Length = 294

 Score =  225 bits (576), Expect = 8e-70
 Identities = 109/257 (42%), Positives = 159/257 (61%), Gaps = 7/257 (2%)

Query: 166 ILMDIKALFMTQDSLVHITVEDEAKFTVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGD 225
           +L +++ +FM+Q  L+ I         VCGD HGQ+YDL+ IFE  G P P + YLF GD
Sbjct: 33  VLTEVREIFMSQPMLLEIR----PPVRVCGDTHGQYYDLLRIFEKCGFP-PYSNYLFLGD 87

Query: 226 FVDRGSFSVECIFTLFGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKSKYTDLMAQFF 285
           +VDRG  SVE I   F +K++YP +FF+ RGNHE A +N++YGF  +VK +Y   + + F
Sbjct: 88  YVDRGKHSVETITLQFCYKIVYPENFFLLRGNHECASINKMYGFFDDVKRRYNIKLFKAF 147

Query: 286 TEVYNWLPLCHCINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDPQI 345
           T+V+N +P+C  I+ K++ MHGGL S D  +L  +  I+R    PD G++C+LLW+DP+ 
Sbjct: 148 TDVFNTMPVCCVISEKIICMHGGL-SPDLTSLASVNEIERPCDVPDRGILCDLLWADPED 206

Query: 346 P-NGRAPSKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQDGYEVAHDGRCITVFSAPN 404
              G   S RGV   FG D+   FL+   ++ I+R+H+V + GY      + +TVFSAPN
Sbjct: 207 EVRGFLESDRGVSYLFGEDIVNDFLDMVDMDLIVRAHQVMERGYGFFASRQLVTVFSAPN 266

Query: 405 YCDSVGNKGAFITMKGK 421
           YC    N  A + +  K
Sbjct: 267 YCGEFDNDAAVMNIDDK 283



 Score = 48.4 bits (115), Expect = 5e-06
 Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR--VEC--IFTLFGF 534
           LF GD+VDRG  SVE I   F +K++YP +FF+ R   EC  I  ++GF
Sbjct: 83  LFLGDYVDRGKHSVETITLQFCYKIVYPENFFLLRGNHECASINKMYGF 131


>gnl|CDD|163661 cd07418, MPP_PP7, PP7, metallophosphatase domain.  PP7 is a plant
           phosphoprotein phosphatase that is highly expressed in a
           subset of stomata and thought to play an important role
           in sensory signaling.  PP7 acts as a positive regulator
           of signaling downstream of cryptochrome blue light
           photoreceptors.  PP7 also controls amplification of
           phytochrome signaling, and interacts with
           nucleotidediphosphate kinase 2 (NDPK2), a positive
           regulator of phytochrome signalling.  In addition, PP7
           interacts with heat shock transcription factor HSF and
           up-regulates protective heat shock proteins.  PP7 may
           also play a role in salicylic acid-dependent defense
           signaling.  The PPP (phosphoprotein phosphatase) family,
           to which PP7 belongs, is one of two known protein
           phosphatase families specific for serine and threonine. 
           The PPP family also includes: PP2A, PP2B (calcineurin),
           PP4, PP5, PP6, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4
           hydrolase. The PPP catalytic domain is defined by three
           conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-).  The
           PPP enzyme family is ancient with members found in all
           eukaryotes, and in most bacterial and archeal genomes.
           Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 377

 Score =  226 bits (579), Expect = 3e-69
 Identities = 113/316 (35%), Positives = 159/316 (50%), Gaps = 56/316 (17%)

Query: 179 SLVHITVEDEAKFTVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGDFVDRGSFSVECIF 238
           + V I VED  +  V GD+HGQ +D++ + E  G P  +  Y+FNGD+VDRG++ +E   
Sbjct: 56  NCVRIDVEDVCEVVVVGDVHGQLHDVLFLLEDAGFPDQNRFYVFNGDYVDRGAWGLETFL 115

Query: 239 TLFGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKSKYTDLMAQF---FTEVYNWLPLC 295
            L  +K+L P+  ++ RGNHES +   +YGFE EV +KY D            +  LPL 
Sbjct: 116 LLLSWKVLLPDRVYLLRGNHESKFCTSMYGFEQEVLTKYGDKGKHVYRKCLGCFEGLPLA 175

Query: 296 HCINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQ--------------------------- 328
             I  +V   HGGLF S ++     +     R                            
Sbjct: 176 SIIAGRVYTAHGGLFRSPSLPKRKKQKGKNRRVLLLEPESESLKLGTLDDLMKARRSVLD 235

Query: 329 PPDEGLMC---ELLWSDPQIPNGRAPSK-RGVGVHFGPDVTKAFLERNKLEYIIRSHE-- 382
           PP EG      ++LWSDP +  G +P+K RG+G+ +GPD T+ FLE+N L+ IIRSHE  
Sbjct: 236 PPGEGSNLIPGDVLWSDPSLTPGLSPNKQRGIGLLWGPDCTEEFLEKNNLKLIIRSHEGP 295

Query: 383 ---VKQD-------GYEVAHD---GRCITVFSAPNYC------DSVGNKGAFITMKGKDM 423
               K+        GY V HD   G+ IT+FSAP+Y       +   NKGA+I ++  D 
Sbjct: 296 DAREKRPGLAGMNKGYTVDHDVESGKLITLFSAPDYPQFQATEERYNNKGAYIILQPPDF 355

Query: 424 V-PHFTTYEAVGVSPP 438
             P F T+EAV   P 
Sbjct: 356 SDPQFHTFEAVKPRPK 371



 Score = 44.0 bits (104), Expect = 1e-04
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR 524
           +FNGD+VDRG++ +E    L  +K+L P+  ++ R
Sbjct: 98  VFNGDYVDRGAWGLETFLLLLSWKVLLPDRVYLLR 132


>gnl|CDD|185658 PTZ00480, PTZ00480, serine/threonine-protein phosphatase;
           Provisional.
          Length = 320

 Score =  224 bits (572), Expect = 5e-69
 Identities = 111/249 (44%), Positives = 163/249 (65%), Gaps = 7/249 (2%)

Query: 171 KALFMTQDSLVHITVEDEAKFTVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGDFVDRG 230
           + +F++Q     I +E EA   +CGD+HGQ++DL+ +FE  G P P++ YLF GD+VDRG
Sbjct: 45  RDIFISQP----ILLELEAPLKICGDVHGQYFDLLRLFEYGGYP-PESNYLFLGDYVDRG 99

Query: 231 SFSVECIFTLFGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKSKYTDLMAQFFTEVYN 290
             S+E I  L  +K+ YP +FF+ RGNHE A +N+IYGF  E K +YT  + + FT+ +N
Sbjct: 100 KQSLETICLLLAYKIKYPENFFLLRGNHECASINRIYGFYDECKRRYTIKLWKTFTDCFN 159

Query: 291 WLPLCHCINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDP-QIPNGR 349
            LP+   I+ K+L MHGGL S +   LE IR I R    PD GL+C+LLWSDP +   G 
Sbjct: 160 CLPVAALIDEKILCMHGGL-SPELSNLEQIRRIMRPTDVPDTGLLCDLLWSDPDKDVQGW 218

Query: 350 APSKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQDGYEVAHDGRCITVFSAPNYCDSV 409
           A ++RGV   F  ++ + FL++++L+ I R+H+V +DGYE     + +T+FSAPNYC   
Sbjct: 219 ADNERGVSYVFSQEIVQVFLKKHELDLICRAHQVVEDGYEFFSKRQLVTLFSAPNYCGEF 278

Query: 410 GNKGAFITM 418
            N G+ +T+
Sbjct: 279 DNAGSMMTI 287



 Score = 42.7 bits (100), Expect = 3e-04
 Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 4/49 (8%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR--VEC--IFTLFGF 534
           LF GD+VDRG  S+E I  L  +K+ YP +FF+ R   EC  I  ++GF
Sbjct: 90  LFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNHECASINRIYGF 138


>gnl|CDD|163613 cd00144, MPP_PPP_family, phosphoprotein phosphatases of the
           metallophosphatase superfamily, metallophosphatase
           domain.  The PPP (phosphoprotein phosphatase) family is
           one of two known protein phosphatase families specific
           for serine and threonine.  This family includes: PP1,
           PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1,
           RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP
           catalytic domain is defined by three conserved motifs
           (-GDXHG-, -GDXVDRG- and -GNHE-).  The PPP enzyme family
           is ancient with members found in all eukaryotes, and in
           most bacterial and archeal genomes.  Dephosphorylation
           of phosphoserines and phosphothreonines on target
           proteins plays a central role in the regulation of many
           cellular processes.  PPPs belong to the
           metallophosphatase (MPP) superfamily.  MPPs are
           functionally diverse, but all share a conserved domain
           with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 225

 Score =  219 bits (559), Expect = 3e-68
 Identities = 89/235 (37%), Positives = 126/235 (53%), Gaps = 22/235 (9%)

Query: 192 TVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNHF 251
            V GDIHG   DL+ + E  G P P++  +F GD+VDRG  SVE I  L   K+L P++ 
Sbjct: 1   YVIGDIHGCLDDLLRLLEKIGFP-PNDKLIFLGDYVDRGPDSVEVIDLLLALKIL-PDNV 58

Query: 252 FMARGNHESAYMNQIYGFEGEVK---------SKYTDLMAQFFTEVYNWLPLCHCINN-K 301
            + RGNHE   +N +YGF  E +          K  + + + F +V+ +LPL   I   K
Sbjct: 59  ILLRGNHEDMLLNFLYGFYDEDEWIGGTLRLLKKLGEDLWEEFNDVFFYLPLAALIETKK 118

Query: 302 VLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDPQIPNGRAPSKRGVGVHFG 361
           VL +HGGL     +        ++ ++ P++ L  +LLWSDP    G   S R  G   G
Sbjct: 119 VLCVHGGLSPGLPL-------EEQIKEEPEDQLPEDLLWSDPLELPGGFGSSRRGG---G 168

Query: 362 PDVTKAFLERNKLEYIIRSHEVKQDGYEVAHDGRCITVFSAPNYCDSVGNKGAFI 416
           PD  + FL++N L+ I+R H   ++GYE  HDG  IT+ S  NYC   GNK A +
Sbjct: 169 PDAVEWFLKKNGLKLIVRGHTPVEEGYEFGHDGNLITIDSGCNYCGGGGNKLAAL 223



 Score = 45.5 bits (108), Expect = 2e-05
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFF 521
           +F GD+VDRG  SVE I  L   K+L  N   
Sbjct: 29  IFLGDYVDRGPDSVEVIDLLLALKILPDNVIL 60


>gnl|CDD|173488 PTZ00239, PTZ00239, serine/threonine protein phosphatase 2A;
           Provisional.
          Length = 303

 Score =  212 bits (542), Expect = 1e-64
 Identities = 104/247 (42%), Positives = 145/247 (58%), Gaps = 5/247 (2%)

Query: 189 AKFTVCGDIHGQFYDLMNIFELNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYP 248
           A   VCGDIHGQFYDL  +F+  G   P+  Y+F GDFVDRG  SVE +  L   K+ YP
Sbjct: 43  APVNVCGDIHGQFYDLQALFKEGG-DIPNANYIFIGDFVDRGYNSVETMEYLLCLKVKYP 101

Query: 249 NHFFMARGNHESAYMNQIYGFEGEVKSKYTDLMA-QFFTEVYNWLPLCHCINNKVLVMHG 307
            +  + RGNHES    Q+YGF  E+  KY +    + F +V++ LPL   I  ++L +HG
Sbjct: 102 GNITLLRGNHESRQCTQVYGFYEEILRKYGNSNPWRLFMDVFDCLPLAALIEGQILCVHG 161

Query: 308 GLFSSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDPQIPNGRAPSKRGVGVHFGPDVTKA 367
           GL S D  T++ IRTIDR  + P EG  C+L+WSDP+     A + RG G  FG  VTK 
Sbjct: 162 GL-SPDMRTIDQIRTIDRKIEIPHEGPFCDLMWSDPEEVEYWAVNSRGAGYLFGAKVTKE 220

Query: 368 FLERNKLEYIIRSHEVKQDGYEV-AHDGRCITVFSAPNYCDSVGNKGAFITMKGKDMVPH 426
           F   N L  I R+H++  +GY+    D   +TV+SAPNYC   GN  + + +  +++   
Sbjct: 221 FCRLNDLTLICRAHQLVMEGYKYWFPDQNLVTVWSAPNYCYRCGNIASILCLD-ENLQQT 279

Query: 427 FTTYEAV 433
           + T++ V
Sbjct: 280 WKTFKEV 286



 Score = 37.5 bits (87), Expect = 0.012
 Identities = 16/35 (45%), Positives = 21/35 (60%)

Query: 490 LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR 524
           +F GDFVDRG  SVE +  L   K+ YP +  + R
Sbjct: 74  IFIGDFVDRGYNSVETMEYLLCLKVKYPGNITLLR 108


>gnl|CDD|163662 cd07419, MPP_Bsu1_C, Arabidopsis thaliana Bsu1 phosphatase and
           related proteins, C-terminal metallophosphatase domain. 
           Bsu1 encodes a nuclear serine-threonine protein
           phosphatase found in plants and protozoans.  Bsu1 has a
           C-terminal phosphatase domain and an N-terminal
           Kelch-repeat domain.  Bsu1 is preferentially expressed
           in elongating plant cells. It modulates the
           phosphorylation state of Bes1, a transcriptional
           regulator phosphorylated by the glycogen synthase kinase
           Bin2, as part of a steroid hormone signal transduction
           pathway.  The PPP (phosphoprotein phosphatase) family,
           to which Bsu1 belongs, is one of two known protein
           phosphatase families specific for serine and threonine. 
           The PPP family also includes: PP1, PP2A, PP2B
           (calcineurin), PP4, PP5, PP6, PP7, Bsu1, RdgC, PrpE,
           PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic domain
           is defined by three conserved motifs (-GDXHG-, -GDXVDRG-
           and -GNHE-).  The PPP enzyme family is ancient with
           members found in all eukaryotes, and in most bacterial
           and archeal genomes.  Dephosphorylation of
           phosphoserines and phosphothreonines on target proteins
           plays a central role in the regulation of many cellular
           processes.  PPPs belong to the metallophosphatase (MPP)
           superfamily.  MPPs are functionally diverse, but all
           share a conserved domain with an active site consisting
           of two metal ions (usually manganese, iron, or zinc)
           coordinated with octahedral geometry by a cage of
           histidine, aspartate, and asparagine residues. The MPP
           superfamily includes: Mre11/SbcD-like exonucleases,
           Dbr1-like RNA lariat debranching enzymes, YfcE-like
           phosphodiesterases, purple acid phosphatases (PAPs),
           YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 311

 Score =  208 bits (530), Expect = 7e-63
 Identities = 102/274 (37%), Positives = 143/274 (52%), Gaps = 28/274 (10%)

Query: 173 LFMTQDSLVHITVEDEAKFTVCGDIHGQFYDLMNIFELNGLPSPD-------NPYLFNGD 225
           +F  +  ++ +     A   + GDIHGQF DLM +F+  G P  +         YLF GD
Sbjct: 36  IFKQEPMVLRL----RAPIKIFGDIHGQFGDLMRLFDEYGSPVTEAAGDIEYIDYLFLGD 91

Query: 226 FVDRGSFSVECIFTLFGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKSKYTDLMA--- 282
           +VDRGS S+E I  L   K+ YPN   + RGNHE   +N ++GF  E K +  +      
Sbjct: 92  YVDRGSNSLETICLLLALKVKYPNQIHLIRGNHEDRDINALFGFREECKERLGEDPNDGD 151

Query: 283 ---QFFTEVYNWLPLCHCINNKVLVMHGGLFSSDNVTLEDIRTIDRN-RQPPDEGLMCEL 338
              +    ++ WLPL   I +K+L MHGG+  S N  + +I  + R       E ++ +L
Sbjct: 152 SVWRRINRLFEWLPLAAIIEDKILCMHGGIGRSIN-HVSEIEDLKRPLTMEFGEQVVMDL 210

Query: 339 LWSDPQ--------IPNGRAPSKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQDGYEV 390
           LWSDP          PN   P   G+ V FGPD    FLE N L+ IIR+HE   DG+E 
Sbjct: 211 LWSDPTENDSVLGLRPNAIDPRGPGLIVKFGPDRVHRFLEENDLQMIIRAHECVMDGFER 270

Query: 391 AHDGRCITVFSAPNYCDSVGNKGAFITMKGKDMV 424
              G+ IT+FSA NYC + GN GA + + G+D+ 
Sbjct: 271 FAQGKLITLFSATNYCGTAGNAGAILVL-GRDLT 303



 Score = 45.5 bits (108), Expect = 3e-05
 Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 4/54 (7%)

Query: 487 FTQLFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMAR----VECIFTLFGFKL 536
              LF GD+VDRGS S+E I  L   K+ YPN   + R       I  LFGF+ 
Sbjct: 84  IDYLFLGDYVDRGSNSLETICLLLALKVKYPNQIHLIRGNHEDRDINALFGFRE 137


>gnl|CDD|215750 pfam00149, Metallophos, Calcineurin-like phosphoesterase.  This
           family includes a diverse range of phosphoesterases,
           including protein phosphoserine phosphatases,
           nucleotidases, sphingomyelin phosphodiesterases and
           2'-3' cAMP phosphodiesterases as well as nucleases such
           as bacterial SbcD or yeast MRE11. The most conserved
           regions in this superfamily centre around the metal
           chelating residues.
          Length = 185

 Score =  104 bits (259), Expect = 9e-26
 Identities = 54/224 (24%), Positives = 81/224 (36%), Gaps = 67/224 (29%)

Query: 190 KFTVCGDIHGQFYDL---MNIFELNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLL 246
           +  V GD+HG   DL   + + EL G P PD   LF GD VDRG  S+E +  LF  KL 
Sbjct: 1   RILVIGDLHGGLDDLDLLLLLLELLGEPKPDL-VLFLGDLVDRGPPSLEVLALLFALKLK 59

Query: 247 YPNHFFMARGNHESAYMNQIYGFEGEVKS-------------------------KYTDLM 281
            P   ++ RGNH+    N   GF  E                            K   L+
Sbjct: 60  APGPVYLVRGNHDFDSGNSELGFYLECAGLPYVLGNGDVSNGTVEIIGLSSLYGKGGGLV 119

Query: 282 AQFFTEVYNWLPLCHCINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLMCELLWS 341
            + F E+ + L L   ++ K+L++HG L  S +   +                       
Sbjct: 120 WEEFLELLDLLLLAALVDGKILLVHGPLSPSLDSGDDI---------------------- 157

Query: 342 DPQIPNGRAPSKRGVGVHFGPDVTKAFLERNKLEYIIRSHEVKQ 385
                             FG +  +  L+ N ++ ++R H    
Sbjct: 158 ----------------YLFGEEALEDLLKDNGVDLVLRGHTHVP 185



 Score = 43.6 bits (102), Expect = 8e-05
 Identities = 19/61 (31%), Positives = 27/61 (44%), Gaps = 1/61 (1%)

Query: 465 LHRKYAYRILMDIKKKKILNLDFTQ-LFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFMA 523
           LH       L+ +  + +        LF GD VDRG  S+E +  LF  KL  P   ++ 
Sbjct: 8   LHGGLDDLDLLLLLLELLGEPKPDLVLFLGDLVDRGPPSLEVLALLFALKLKAPGPVYLV 67

Query: 524 R 524
           R
Sbjct: 68  R 68


>gnl|CDD|223712 COG0639, ApaH, Diadenosine tetraphosphatase and related
           serine/threonine protein phosphatases [Signal
           transduction mechanisms].
          Length = 155

 Score = 89.3 bits (221), Expect = 6e-21
 Identities = 51/153 (33%), Positives = 67/153 (43%), Gaps = 10/153 (6%)

Query: 263 MNQIYGFEGEVKSKYTD----LMAQFFTEVYNWLPLCHCINNKV-LVMHGGLFSSDNVTL 317
           +  +YGF  E   KY +    L A    E ++ LPL         L  HGGL    +  L
Sbjct: 3   LTALYGFYDEKLRKYGEELEWLRAAGGLETFDSLPLAAVAEGGKLLCHHGGLSPGLDRLL 62

Query: 318 EDIRTIDRNR--QPPDEGLMCELLWSDPQIPNGR--APSKRGVGVHFGPDVTKAFLERNK 373
           + I  +DR R  + P  G   +LLWSDP   + R   P  RGV      DVT  F   + 
Sbjct: 63  DIIEVLDRLRACEVPHAGHTHDLLWSDPDGGDRRIWNPGPRGVPRDG-GDVTAVFGIVHT 121

Query: 374 LEYIIRSHEVKQDGYEVAHDGRCITVFSAPNYC 406
            + I R+H +          G  +T FSAPNYC
Sbjct: 122 PKLIERAHVLYDIDTGAVFGGGLLTAFSAPNYC 154


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 87.7 bits (217), Expect = 3e-18
 Identities = 47/114 (41%), Positives = 64/114 (56%), Gaps = 6/114 (5%)

Query: 20  SNPAPKEVSV------EDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPTAIYYAN 73
             P   E SV      E K+ A KLK + N+ ++N+ +NKAI+LYS AIE  P  +YY+N
Sbjct: 106 ELPEIDESSVANLSEEERKKYAAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPVYYSN 165

Query: 74  RSFAYLKTEAIGYALNDASKAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEA 127
           R+  +         + D + A+ELD  Y+KA  RRA AY  LGK+  AL D  A
Sbjct: 166 RAACHNALGDWEKVVEDTTAALELDPDYSKALNRRANAYDGLGKYADALLDLTA 219



 Score = 43.4 bits (102), Expect = 2e-04
 Identities = 37/125 (29%), Positives = 58/125 (46%), Gaps = 10/125 (8%)

Query: 22  PAPKEVSVE-DKERAEKLKAEANEHFKNQA---YNKAIELYSAAIEVNP----TAIYYAN 73
           PA  E S E D+E          +  +++A   Y +A   +  A+++       AI    
Sbjct: 277 PAGLEDSNELDEETGNGQLQLGLKSPESKADESYEEAARAFEKALDLGKLGEKEAIALNL 336

Query: 74  R-SFAYLKTEAIGYALNDASKAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKAR 132
           R +F  LK + +  AL D SK+IELD   T++Y +RA+  + LG    A  D++   K  
Sbjct: 337 RGTFKCLKGKHLE-ALADLSKSIELDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKLN 395

Query: 133 PNDKD 137
             D D
Sbjct: 396 SEDPD 400



 Score = 34.2 bits (78), Expect = 0.18
 Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 1/83 (1%)

Query: 45  HFKNQAYNKAIELYSAAIEVNPTAIY-YANRSFAYLKTEAIGYALNDASKAIELDQTYTK 103
                 + +A+   S +IE++P     Y  R+   L+      A  D  KA++L+     
Sbjct: 341 KCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKLNSEDPD 400

Query: 104 AYYRRAAAYMSLGKFKLALRDYE 126
            YY RA  +   G+F  A +DY+
Sbjct: 401 IYYHRAQLHFIKGEFAQAGKDYQ 423


>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 79.0 bits (195), Expect = 5e-18
 Identities = 40/100 (40%), Positives = 53/100 (53%), Gaps = 1/100 (1%)

Query: 36  EKLKAEANEHFKNQAYNKAIELYSAAIEVNPT-AIYYANRSFAYLKTEAIGYALNDASKA 94
           E L    N ++K   Y++A+E Y  A+E++P  A  Y N + AY K      AL D  KA
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60

Query: 95  IELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPN 134
           +ELD    KAYY    AY  LGK++ AL  YE   +  PN
Sbjct: 61  LELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 32.4 bits (74), Expect = 0.15
 Identities = 11/39 (28%), Positives = 16/39 (41%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDAKAK 141
           +A       Y  LG +  AL  YE   +  P++ DA   
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYN 39


>gnl|CDD|215568 PLN03088, PLN03088, SGT1,  suppressor of G2 allele of SKP1;
           Provisional.
          Length = 356

 Score = 78.7 bits (194), Expect = 8e-16
 Identities = 39/113 (34%), Positives = 61/113 (53%), Gaps = 1/113 (0%)

Query: 35  AEKLKAEANEHFKNQAYNKAIELYSAAIEVNP-TAIYYANRSFAYLKTEAIGYALNDASK 93
           A+ L+ +A E F +  +  A++LY+ AI+++P  A  YA+R+ A +K      A+ DA+K
Sbjct: 2   AKDLEDKAKEAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANK 61

Query: 94  AIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDAKAKFTECN 146
           AIELD +  KAY R+  A M L +++ A    E      P D        EC+
Sbjct: 62  AIELDPSLAKAYLRKGTACMKLEEYQTAKAALEKGASLAPGDSRFTKLIKECD 114


>gnl|CDD|149395 pfam08321, PPP5, PPP5 TPR repeat region.  This region is specific
           to the PPP5 subfamily of serine/threonine phosphatases
           and contains TPR repeats.
          Length = 95

 Score = 59.3 bits (144), Expect = 4e-11
 Identities = 21/40 (52%), Positives = 32/40 (80%)

Query: 440 PALENDKVTLKFMTDLMQTYKAQGKLHRKYAYRILMDIKK 479
           P LE+ KVTL+F+ D+M+ +K Q KLH+K AY+IL+ +K+
Sbjct: 45  PRLEDGKVTLEFVKDMMERFKDQKKLHKKCAYQILLQVKE 84



 Score = 35.8 bits (83), Expect = 0.008
 Identities = 14/27 (51%), Positives = 18/27 (66%)

Query: 157 KLHRKYAYRILMDIKALFMTQDSLVHI 183
           KLH+K AY+IL+ +K L     SLV I
Sbjct: 69  KLHKKCAYQILLQVKELLKKLPSLVEI 95


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 55.0 bits (133), Expect = 6e-10
 Identities = 30/68 (44%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 33 ERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPT-AIYYANRSFAYLKT-EAIGYALND 90
          + AE LK   N  FK   Y++AIE Y  A+E++P  A  Y N + AYLK  +    AL D
Sbjct: 1  DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60

Query: 91 ASKAIELD 98
            KA+ELD
Sbjct: 61 LEKALELD 68



 Score = 32.3 bits (74), Expect = 0.067
 Identities = 9/36 (25%), Positives = 16/36 (44%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDA 138
           +A      A   LG +  A+  YE   +  P++ +A
Sbjct: 4   EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEA 39


>gnl|CDD|163667 cd07424, MPP_PrpA_PrpB, PrpA and PrpB, metallophosphatase domain.
           PrpA and PrpB are bacterial type I serine/threonine and
           tyrosine phosphatases thought to modulate the expression
           of proteins that protect the cell upon accumulation of
           misfolded proteins in the periplasm.  The PPP
           (phosphoprotein phosphatase) family, to which PrpA and
           PrpB belong, is one of two known protein phosphatase
           families specific for serine and threonine.  This family
           also includes: PP1, PP2A, PP2B (calcineurin), PP4, PP5,
           PP6, PP7, Bsu1, RdgC, PrpE, and ApA4 hydrolase. The PPP
           catalytic domain is defined by three conserved motifs
           (-GDXHG-, -GDXVDRG- and -GNHE-).  The PPP enzyme family
           is ancient with members found in all eukaryotes, and in
           most bacterial and archeal genomes.  Dephosphorylation
           of phosphoserines and phosphothreonines on target
           proteins plays a central role in the regulation of many
           cellular processes.  PPPs belong to the
           metallophosphatase (MPP) superfamily.  MPPs are
           functionally diverse, but all share a conserved domain
           with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 207

 Score = 56.1 bits (136), Expect = 5e-09
 Identities = 45/183 (24%), Positives = 62/183 (33%), Gaps = 37/183 (20%)

Query: 193 VCGDIHGQFYDLMNIFELNGLPSPDNPYLFN-GDFVDRGSFSVECIFTLFGFKLLYPNHF 251
           V GDIHG +  L    +  G   P    L + GD +DRG  S+ C+  L    LL P  F
Sbjct: 5   VVGDIHGHYSLLQKALDAVGF-DPARDRLISVGDLIDRGPESLACL-EL----LLEPW-F 57

Query: 252 FMARGNHESAYMNQIYGFEGEVKSKYTDLMA---QFFTEVY------NWLPLCH------ 296
              RGNHE   M  I     E        +A   ++F ++                    
Sbjct: 58  HAVRGNHE--QM-AIDALRAEPLD-AVRWLANGGEWFLDLPDEELRRWLALKLEQLPLAI 113

Query: 297 ---CINNKVLVMHGGLFSSDNVTLEDIRTIDRNRQPPDEGLMCELLWSDPQIPNGRAPSK 353
                  KV ++H    S D                     + ELLWS  +I   +    
Sbjct: 114 EVETEGGKVGIVHADYPSDDWSDGVG-------AVTLRPEDIEELLWSRTRIQKAQTQPI 166

Query: 354 RGV 356
           +GV
Sbjct: 167 KGV 169


>gnl|CDD|163614 cd00838, MPP_superfamily, metallophosphatase superfamily,
           metallophosphatase domain.  Metallophosphatases (MPPs),
           also known as metallophosphoesterases,
           phosphodiesterases (PDEs), binuclear
           metallophosphoesterases, and dimetal-containing
           phosphoesterases (DMPs), represent a diverse superfamily
           of enzymes with a conserved domain containing an active
           site consisting of two metal ions (usually manganese,
           iron, or zinc) coordinated with octahedral geometry by a
           cage of histidine, aspartate, and asparagine residues.
           This superfamily includes: the phosphoprotein
           phosphatases (PPPs), Mre11/SbcD-like exonucleases,
           Dbr1-like RNA lariat debranching enzymes, YfcE-like
           phosphodiesterases, purple acid phosphatases (PAPs),
           YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 131

 Score = 48.1 bits (114), Expect = 7e-07
 Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 3/69 (4%)

Query: 193 VCGDIHGQFYDLMNIFELNGL--PSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNH 250
           V  DIHG    L  + E        PD   L  GD V  G    E +       LL    
Sbjct: 2   VISDIHGNLEALEAVLEAALAAAEKPDF-VLVLGDLVGDGPDPEEVLAAALALLLLLGIP 60

Query: 251 FFMARGNHE 259
            ++  GNH+
Sbjct: 61  VYVVPGNHD 69



 Score = 30.0 bits (67), Expect = 1.6
 Identities = 15/74 (20%), Positives = 22/74 (29%), Gaps = 10/74 (13%)

Query: 451 FMTDLMQTYKAQGKLHRKYAYRILMDIKKKKILNLDFTQLFNGDFVDRGSFSVECIFTLF 510
            ++D+       G L    A         +K    DF  L  GD V  G    E +    
Sbjct: 2   VISDI------HGNLEALEAVLEAALAAAEKP---DFV-LVLGDLVGDGPDPEEVLAAAL 51

Query: 511 GFKLLYPNHFFMAR 524
              LL     ++  
Sbjct: 52  ALLLLLGIPVYVVP 65


>gnl|CDD|163668 cd07425, MPP_Shelphs, Shewanella-like phosphatases,
           metallophosphatase domain.  This family includes
           bacterial, eukaryotic, and archeal proteins orthologous
           to the Shewanella cold-active protein-tyrosine
           phosphatase, CAPTPase.  CAPTPase is an uncharacterized
           protein that belongs to the Shelph (Shewanella-like
           phosphatase) family of PPP (phosphoprotein
           phosphatases).  The PPP family is one of two known
           protein phosphatase families specific for serine and
           threonine.  In addition to Shelps, the PPP family also
           includes: PP1, PP2A, PP2B (calcineurin), PP4, PP5, PP6,
           PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase.
           The PPP catalytic domain is defined by three conserved
           motifs (-GDXHG-, -GDXVDRG- and -GNHE-).  The PPP enzyme
           family is ancient with members found in all eukaryotes,
           and in most bacterial and archeal genomes.
           Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 208

 Score = 48.8 bits (117), Expect = 1e-06
 Identities = 38/146 (26%), Positives = 56/146 (38%), Gaps = 43/146 (29%)

Query: 195 GDIHGQFYDLMNIFELNGLPSPDNPYLFN-------GDFVDRGSFSVECIFTLFGFKLLY 247
           GD+HG       I +  G+   ++ ++         GD  DRG   +E +       LLY
Sbjct: 4   GDLHGDLDAFREILKGAGVIDSNDHWIGGSTHLVQLGDIFDRGPDVIEIL------WLLY 57

Query: 248 PN-----------HFFMARGNHESAYMNQIYGFEGEVKSKYTDL-----MAQFF-----T 286
                        HF +  GNHE   MN + G    V  KY +      M +        
Sbjct: 58  KLEQEAAKAGGKVHFLL--GNHE--LMN-LCGDFRYVHPKYFNEFGGLAMRRRELFSPGG 112

Query: 287 EVYNWL---PLCHCINNKVLVMHGGL 309
           E+  WL   P+   +N+  L +HGGL
Sbjct: 113 ELGRWLRSKPVIVKVND-TLFVHGGL 137


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 50.1 bits (120), Expect = 3e-06
 Identities = 29/136 (21%), Positives = 48/136 (35%), Gaps = 3/136 (2%)

Query: 2   ARVYLSLEYLSMSSTTNASNPAPKEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAA 61
               L+  YL         +  P +  ++D+  AE L      +        A + Y  A
Sbjct: 92  VLPLLARAYLLQGKFQQVLDELPGKTLLDDEGAAELLALRGLAYLGLGQLELAQKSYEQA 151

Query: 62  IEVNPTAIYYANRSFAYLKTEA--IGYALNDASKAIELDQTYTKAYYRRAAAYMSLGKFK 119
           + ++P ++Y A    A L         A     + +  D     A   +    +SLG  +
Sbjct: 152 LAIDPRSLY-AKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIE 210

Query: 120 LALRDYEAVHKARPND 135
           LAL  Y      RPN+
Sbjct: 211 LALAAYRKAIALRPNN 226



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 27/92 (29%), Positives = 34/92 (36%), Gaps = 9/92 (9%)

Query: 51  YNKAIELYSAAIEVNP--TAIYYANRSFAYLKTEAIGYALNDASKAIELDQTYTKAYYRR 108
             KA+ L     +V P     Y     F  +  E  G  L D   A EL      A   R
Sbjct: 79  LRKALSLGYPKNQVLPLLARAYLLQGKFQQVLDELPGKTLLDDEGAAEL-----LAL--R 131

Query: 109 AAAYMSLGKFKLALRDYEAVHKARPNDKDAKA 140
             AY+ LG+ +LA + YE      P    AK 
Sbjct: 132 GLAYLGLGQLELAQKSYEQALAIDPRSLYAKL 163



 Score = 30.8 bits (70), Expect = 2.6
 Identities = 30/142 (21%), Positives = 45/142 (31%), Gaps = 37/142 (26%)

Query: 35  AEKLKAEANEHFKNQAYNKAIELYSAAIEVNP---TAIYYANRSFAYLK--TEAIGY--- 86
           A  LKA     F+ + Y  A E    A++  P    A+  A  S   L    +A  Y   
Sbjct: 263 AHYLKALV--DFQKKNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQ 320

Query: 87  ---------------------------ALNDASKAIELDQTYTKAYYRRAAAYMSLGKFK 119
                                      A+   S A+ LD     A      AY++LG F+
Sbjct: 321 ILKYAPNSHQARRLLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFE 380

Query: 120 LALRDYEAVHKARPNDKDAKAK 141
            A        +  P +  A+ +
Sbjct: 381 KAAEYLAKATELDPENAAARTQ 402



 Score = 30.4 bits (69), Expect = 3.0
 Identities = 22/126 (17%), Positives = 52/126 (41%), Gaps = 14/126 (11%)

Query: 17  TNASNPAPKEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPTAIYYANRSF 76
           T ++    K +  +  + A   + E + + + + Y  AI+ Y  A++  P++        
Sbjct: 685 TESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLHR 744

Query: 77  AYLKTEAIGYALNDASKAIELDQTYTK-------AYYRRAAAYMSLGKFKLALRDYEAVH 129
           A L       A  + ++A++  + + K            A  Y++   +  A++ Y+ V 
Sbjct: 745 ALL-------ASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVV 797

Query: 130 KARPND 135
           K  P++
Sbjct: 798 KKAPDN 803



 Score = 30.1 bits (68), Expect = 4.1
 Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 1/93 (1%)

Query: 44  EHFKNQAYNKAIELYSAAIEVNP-TAIYYANRSFAYLKTEAIGYALNDASKAIELDQTYT 102
            +     + KA E  + A E++P  A        + L       A+ D   A +LD    
Sbjct: 372 AYLALGDFEKAAEYLAKATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELG 431

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
           +A      +Y+  G+F  AL   + + K +P++
Sbjct: 432 RADLLLILSYLRSGQFDKALAAAKKLEKKQPDN 464


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 38.9 bits (92), Expect = 1e-04
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
           KA Y    AY+ LGK+  AL  YE   +  PN+
Sbjct: 2   KALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 31.6 bits (73), Expect = 0.048
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 42 ANEHFKNQAYNKAIELYSAAIEVNPT 67
           N + K   Y++A+E Y  A+E+NP 
Sbjct: 8  GNAYLKLGKYDEALEYYEKALELNPN 33


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
           or more copies in proteins. Contain a minimum of 34
           amino acids each and self-associate via a "knobs and
           holes" mechanism.
          Length = 34

 Score = 37.8 bits (89), Expect = 4e-04
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
           +A Y    AY+ LG +  AL  YE   +  PN+
Sbjct: 2   EALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 30.5 bits (70), Expect = 0.15
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 42 ANEHFKNQAYNKAIELYSAAIEVNPT 67
           N + K   Y++A+E Y  A+E++P 
Sbjct: 8  GNAYLKLGDYDEALEYYEKALELDPN 33


>gnl|CDD|234164 TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO.
           Members of this protein family include YfiO, a
           near-essential protein of the outer membrane, part of a
           complex involved in protein insertion into the bacterial
           outer membrane. Many proteins in this family are
           annotated as ComL, based on the involvement of this
           protein in natural transformation with exogenous DNA in
           Neisseria gonorrhoeae. This protein family shows
           sequence similarity to, but is distinct from, the
           tol-pal system protein YbgF (TIGR02795) [Protein fate,
           Protein and peptide secretion and trafficking].
          Length = 235

 Score = 41.8 bits (99), Expect = 4e-04
 Identities = 35/156 (22%), Positives = 56/156 (35%), Gaps = 39/156 (25%)

Query: 19  ASNPAPKEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPTAIYYANRS--- 75
           +S    +   VE+   AE+L  EA E   +  Y +AI+ + A     P +  YA ++   
Sbjct: 18  SSKKKKEADPVEEWP-AEELYEEAKEALDSGDYTEAIKYFEALESRYPFS-PYAEQAQLD 75

Query: 76  --FAYLKTEAIGYALNDASKAIELDQTYTK----------AYYRRAAAYMSLGKFKLALR 123
             +AY       Y   D ++AI     + +          AYY R  +  +        R
Sbjct: 76  LAYAY-------YKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYNQ--IDRVDR 126

Query: 124 DYEAVHKA----------RPNDK---DAKAKFTECN 146
           D  A  +A           PN +   DAK +     
Sbjct: 127 DQTAAREAFEAFQELIRRYPNSEYAPDAKKRMDYLR 162



 Score = 28.7 bits (65), Expect = 6.4
 Identities = 18/54 (33%), Positives = 23/54 (42%), Gaps = 4/54 (7%)

Query: 78  YLKTEAIGYALNDASKAIELDQTYT---KAYYRRAAAYMSLGKFKLALRDYEAV 128
           YLK  A   A+N     +E         +A  R   AY+ LG   LA +D  AV
Sbjct: 176 YLKRGAYVAAINRFQTVVENYPDTPATEEALARLVEAYLKLGLTDLA-QDAAAV 228


>gnl|CDD|184187 PRK13625, PRK13625, bis(5'-nucleosyl)-tetraphosphatase PrpE;
           Provisional.
          Length = 245

 Score = 41.2 bits (97), Expect = 7e-04
 Identities = 40/148 (27%), Positives = 61/148 (41%), Gaps = 35/148 (23%)

Query: 190 KFTVCGDIHGQFYDLMNIFEL------NGLP-SPDNPYL-FNGDFVDRGSFSVECIFTLF 241
           K+ + GDIHG + +   + E       +GLP  PD   L F GD  DRG  S+  I    
Sbjct: 2   KYDIIGDIHGCYQEFQALTEKLGYNWSSGLPVHPDQRKLAFVGDLTDRGPHSLRMI--EI 59

Query: 242 GFKLLYPNHFFMARGNHESAYMNQIY------------GFEGEV-------KSKYTDLMA 282
            ++L+     +   GNH     N++Y            G E  V         K   +  
Sbjct: 60  VWELVEKKAAYYVPGNH----CNKLYRFFLGRNVTIAHGLETTVAEYEALPSHKQNMIKE 115

Query: 283 QFFTEVYNWLPLCHCI-NNKVLVMHGGL 309
           +F T +Y   PL H +   +++V H G+
Sbjct: 116 KFIT-LYEQAPLYHILDEGRLVVAHAGI 142


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 41.0 bits (94), Expect = 0.001
 Identities = 26/126 (20%), Positives = 49/126 (38%), Gaps = 5/126 (3%)

Query: 25  KEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPT----AIYYANRSFAYLK 80
           K ++++      +        ++   Y +A+ELY  A+E++P     A            
Sbjct: 120 KALALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEA 179

Query: 81  TEAIGYALNDASKAIELD-QTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDAK 139
                 AL    KA++L+     +A       Y+ LGK++ AL  YE   +  P++ +A 
Sbjct: 180 LGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEAL 239

Query: 140 AKFTEC 145
                 
Sbjct: 240 YNLALL 245



 Score = 40.6 bits (93), Expect = 0.001
 Identities = 27/112 (24%), Positives = 40/112 (35%), Gaps = 5/112 (4%)

Query: 30  EDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPT--AIYYANRSFAYLKTEAIGYA 87
                AE L            Y +A+EL   A+ ++P            A  +      A
Sbjct: 90  LLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEA 149

Query: 88  LNDASKAIELDQTYTK---AYYRRAAAYMSLGKFKLALRDYEAVHKARPNDK 136
           L    KA+ELD    +   A     A   +LG+++ AL   E   K  P+D 
Sbjct: 150 LELYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDD 201


>gnl|CDD|236911 PRK11439, pphA, serine/threonine protein phosphatase 1;
           Provisional.
          Length = 218

 Score = 40.1 bits (94), Expect = 0.001
 Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 10/69 (14%)

Query: 193 VCGDIHGQFYDLMN-IFELNGLPSPDNPYLFN-GDFVDRGSFSVECIFTLFGFKLLYPNH 250
           + GDIHG F  LM  +      P  D   L + GD +DRG  S+ C+      +LL  + 
Sbjct: 21  LVGDIHGCFEQLMRKLRHCRFDPWRD--LLISVGDLIDRGPQSLRCL------QLLEEHW 72

Query: 251 FFMARGNHE 259
               RGNHE
Sbjct: 73  VRAVRGNHE 81


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 35.6 bits (83), Expect = 0.002
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
           +A Y    AY  LG ++ AL  YE   +  PN+
Sbjct: 2   EALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34



 Score = 30.2 bits (69), Expect = 0.15
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 35 AEKLKAEANEHFKNQAYNKAIELYSAAIEVNPTA 68
          AE L      ++K   Y +A+E Y  A+E++P  
Sbjct: 1  AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34


>gnl|CDD|107154 PHA02239, PHA02239, putative protein phosphatase.
          Length = 235

 Score = 37.7 bits (87), Expect = 0.009
 Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 2/75 (2%)

Query: 193 VCGDIHGQFYDLMNIF-ELNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNHF 251
           V  DIHG++  L+ I  ++N    P+   +F GD+VDRG  S + +  +F   +   ++ 
Sbjct: 5   VVPDIHGEYQKLLTIMDKINNERKPEETIVFLGDYVDRGKRSKDVVNYIFDL-MSNDDNV 63

Query: 252 FMARGNHESAYMNQI 266
               GNH+  + N +
Sbjct: 64  VTLLGNHDDEFYNIM 78


>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
           repeats [Intracellular trafficking and secretion].
          Length = 257

 Score = 37.4 bits (87), Expect = 0.012
 Identities = 28/118 (23%), Positives = 44/118 (37%), Gaps = 16/118 (13%)

Query: 23  APKEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPTAIYYANRSFAYLKTE 82
            P+++S+     A  L+ +A+             L  +AI   P          A  K +
Sbjct: 63  NPEDLSIAKLATALYLRGDADS--------SLAVLQKSAI-AYPKDRELLA---AQGKNQ 110

Query: 83  A----IGYALNDASKAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDK 136
                 G A++   KA  L  T  +A+    AA   LG+F  A R Y    +  PN+ 
Sbjct: 111 IRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNEP 168


>gnl|CDD|163666 cd07423, MPP_PrpE, Bacillus subtilis PrpE and related proteins,
           metallophosphatase domain.  PrpE (protein phosphatase E)
           is a bacterial member of the PPP (phosphoprotein
           phosphatase) family of serine/threonine phosphatases and
           a key signal transduction pathway component controlling
           the expression of spore germination receptors GerA and
           GerK in Bacillus subtilis. PrpE is closely related to
           ApaH (also known symmetrical Ap(4)A hydrolase and
           bis(5'nucleosyl)-tetraphosphatase).  PrpE has
           specificity for phosphotyrosine only, unlike the
           serine/threonine phosphatases to which it is related.
           The Bacilli members of this family are single domain
           proteins while the other members have N- and C-terminal
           domains in addition to this phosphatase domain.  The PPP
           (phosphoprotein phosphatase) family, to which PrpE
           belongs, is one of two known protein phosphatase
           families specific for serine and threonine.  The PPP
           family also includes: PP1, PP2A, PP2B (calcineurin),
           PP4, PP5, PP6, PP7, Bsu1, RdgC, PrpA/PrpB, and ApA4
           hydrolase. The PPP catalytic domain is defined by three
           conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-).  The
           PPP enzyme family is ancient with members found in all
           eukaryotes, and in most bacterial and archeal genomes.
           Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 234

 Score = 37.3 bits (87), Expect = 0.012
 Identities = 32/140 (22%), Positives = 58/140 (41%), Gaps = 22/140 (15%)

Query: 190 KFTVCGDIHGQFYDLMNIFELNG-----LPSPDNP----YLFNGDFVDRGSFSVECIFTL 240
            F + GD+HG + +L  + E  G     + +  +P     +F GD VDRG  S E +  +
Sbjct: 2   PFDIIGDVHGCYDELEELLEKLGYRIKRVGTVTHPEGRRAVFVGDLVDRGPDSPEVLRLV 61

Query: 241 FGFKLLYPNHFFMARGNHESAYMNQIYGFEGEVKSKYTDLMAQF----------FTEVYN 290
               ++         GNH++    ++ G   ++     + +AQ             E Y 
Sbjct: 62  MS--MVAAGAALCVPGNHDNKLYRKLQGRNVKITHGLEETVAQLEAESEEFKEEVIEFYE 119

Query: 291 WLPLCHCINN-KVLVMHGGL 309
            LP    ++  K++V H G+
Sbjct: 120 SLPSHLVLDEGKLVVAHAGI 139


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 33.4 bits (77), Expect = 0.021
 Identities = 16/96 (16%), Positives = 24/96 (25%), Gaps = 33/96 (34%)

Query: 40  AEANEHFKNQAYNKAIELYSAAIEVNPTAIYYANRSFAYLKTEAIGYALNDASKAIELDQ 99
           A A    +   Y++A+    AA+   P                        A        
Sbjct: 2   ALARAALRAGDYDEALAALEAALARYP----------------------LAAE------- 32

Query: 100 TYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
               A      A +  G+   A     A   A P+D
Sbjct: 33  ----ALLLLGEALLRQGRLAEAAALLRAALAADPDD 64



 Score = 28.4 bits (64), Expect = 1.7
 Identities = 9/33 (27%), Positives = 12/33 (36%)

Query: 106 YRRAAAYMSLGKFKLALRDYEAVHKARPNDKDA 138
              A A +  G +  AL   EA     P   +A
Sbjct: 1   LALARAALRAGDYDEALAALEAALARYPLAAEA 33


>gnl|CDD|163665 cd07422, MPP_ApaH, Escherichia coli ApaH and related proteins,
           metallophosphatase domain.  ApaH (also known as
           symmetrically cleaving Ap4A hydrolase and
           bis(5'nucleosyl)-tetraphosphatase) is a bacterial member
           of the PPP (phosphoprotein phosphatase) family of
           serine/threonine phosphatases that hydrolyzes the
           nucleotide-signaling molecule diadenosine tetraphosphate
           (Ap(4)A) into two ADP and also hydrolyzes Ap(5)A,
           Gp(4)G, and other extending compounds.  Null mutations
           in apaH result in high intracellular levels of Ap(4)A
           which correlate with multiple phenotypes, including a
           decreased expression of catabolite-repressible genes, a
           reduction in the expression of flagellar operons, and an
           increased sensitivity to UV  and heat.  Ap4A hydrolase
           is important in responding to heat shock and oxidative
           stress via regulating the concentration of Ap4A in
           bacteria.  Ap4A hydrolase is also thought to play a role
           in siderophore production, but the mechanism by which
           ApaH interacts with siderophore pathways in unknown.
           The PPP (phosphoprotein phosphatase) family, to which
           ApaH belongs, is one of two known protein phosphatase
           families specific for serine and threonine.  The PPP
           family also includes: PP1, PP2A, PP2B (calcineurin),
           PP4, PP5, PP6, PP7, Bsu1, RdgC, PrpE, and PrpA/PrpB. The
           PPP catalytic domain is defined by three conserved
           motifs (-GDXHG-, -GDXVDRG- and -GNHE-).  The PPP enzyme
           family is ancient with members found in all eukaryotes,
           and in most bacterial and archeal genomes.
           Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 257

 Score = 36.3 bits (85), Expect = 0.023
 Identities = 31/129 (24%), Positives = 47/129 (36%), Gaps = 26/129 (20%)

Query: 195 GDIHGQFYDLMNIFELNGLPSPDNPYL-FNGDFVDRGSFSVECIFTLFGFKLLYPNHFFM 253
           GDI G + +L  + E      P    L   GD V+RG  S+E +  +        +    
Sbjct: 5   GDIQGCYDELQRLLEKIN-FDPAKDRLWLVGDLVNRGPDSLETLRFVKSLG----DSAKT 59

Query: 254 ARGNHE----SAYMNQIYGFEGEVKSKYTDLMAQFFT-----EVYNWL---PLCHCINNK 301
             GNH+    +          G  K K  D +          E+ +WL   PL H     
Sbjct: 60  VLGNHDLHLLAVA-------AGIKKPKKKDTLDDILNAPDRDELLDWLRHQPLLHRDPEL 112

Query: 302 VLVM-HGGL 309
            ++M H G+
Sbjct: 113 GILMVHAGI 121


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 32.6 bits (75), Expect = 0.027
 Identities = 13/31 (41%), Positives = 20/31 (64%)

Query: 93  KAIELDQTYTKAYYRRAAAYMSLGKFKLALR 123
           KA+ELD    +AYY  A   ++LG++  AL+
Sbjct: 4   KALELDPNNAEAYYNLALLLLNLGQYDEALQ 34


>gnl|CDD|227124 COG4785, NlpI, Lipoprotein NlpI, contains TPR repeats [General
           function prediction only].
          Length = 297

 Score = 35.9 bits (83), Expect = 0.037
 Identities = 17/41 (41%), Positives = 23/41 (56%)

Query: 95  IELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
           +ELD TY  A+  R  A    G++KLA  D  A ++  PND
Sbjct: 126 LELDPTYNYAHLNRGIALYYGGRYKLAQDDLLAFYQDDPND 166


>gnl|CDD|163664 cd07421, MPP_Rhilphs, Rhilph phosphatases, metallophosphatase
           domain.  Rhilphs (Rhizobiales/ Rhodobacterales/
           Rhodospirillaceae-like phosphatases) are a
           phylogenetically distinct group of PPP (phosphoprotein
           phosphatases), found only in land plants. They are named
           for their close relationship to to PPP phosphatases from
           alpha-Proteobacteria, including Rhizobiales,
           Rhodobacterales and Rhodospirillaceae.  The PPP
           (phosphoprotein phosphatase) family, to which the
           Rhilphs belong, is one of two known protein phosphatase
           families specific for serine and threonine.  The PPP
           family also includes: PP1, PP2A, PP2B (calcineurin),
           PP4, PP5, PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and
           ApA4 hydrolase. The PPP catalytic domain is defined by
           three conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-). 
           The PPP enzyme family is ancient with members found in
           all eukaryotes, and in most bacterial and archeal
           genomes.  Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 304

 Score = 35.9 bits (83), Expect = 0.046
 Identities = 28/83 (33%), Positives = 40/83 (48%), Gaps = 13/83 (15%)

Query: 190 KFTVC-GDIHGQFYDLMNIFELNGLPSPDNP-------YLFNGDFVDRGSFSVECIFTLF 241
           +  +C GDIHG    L N++ LN L S   P        +F GD+ DRG  + + I  L 
Sbjct: 2   RVVICVGDIHGYISKLNNLW-LN-LQSALGPSDFASALVIFLGDYCDRGPETRKVIDFLI 59

Query: 242 GFKLLYPN--HFFMARGNHESAY 262
                +P   H F+  GNH+ A+
Sbjct: 60  SLPEKHPKQRHVFLC-GNHDFAF 81


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 32.1 bits (73), Expect = 0.051
 Identities = 10/36 (27%), Positives = 14/36 (38%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDA 138
            A    A A ++LG    AL          P+D +A
Sbjct: 2   AALLALARALLALGDLDEALALLRRALALDPDDPEA 37


>gnl|CDD|115680 pfam07042, TrfA, TrfA protein.  This family consists of several
           bacterial TrfA proteins. The trfA operon of
           broad-host-range IncP plasmids is essential to activate
           the origin of vegetative replication in diverse species.
           The trfA operon encodes two ORFs. The first ORF is
           highly conserved and encodes a putative single-stranded
           DNA binding protein (Ssb). The second, trfA, contains
           two translational starts as in the IncP alpha plasmids,
           generating related polypeptides of 406 (TrfA1) and 282
           (TrfA2) amino acids. TrfA2 is very similar to the IncP
           alpha product, whereas the N-terminal region of TrfA1
           shows very little similarity to the equivalent region of
           IncP alpha TrfA1. This region has been implicated in the
           ability of IncP alpha plasmids to replicate efficiently
           in Pseudomonas aeruginosa.
          Length = 282

 Score = 35.0 bits (80), Expect = 0.071
 Identities = 34/121 (28%), Positives = 50/121 (41%), Gaps = 13/121 (10%)

Query: 91  ASKAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDAKAKFTECNKIVK 150
           A K  E  Q     + ++    M L  F +A   +   H A PND    A FT  NK + 
Sbjct: 10  AEKVSEAKQAALLKHTKQQIKDMQLSLFDIA--PWPDSHAALPNDFARSAIFTTRNKKIP 67

Query: 151 RMQIKGKLHRKYAYRILMDIKALFM------TQDSLVHITVEDEAKFTVCGD-IHGQFYD 203
           R  ++GK      + +  D++  +         D LV   V + AK T  G+ I   FY+
Sbjct: 68  REALQGKS----IFHVNKDVEITYTGIELRADDDELVFAQVLEYAKGTAIGEPISFTFYE 123

Query: 204 L 204
           L
Sbjct: 124 L 124


>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF.  Members of
           this protein family are the product of one of seven
           genes regularly clustered in operons to encode the
           proteins of the tol-pal system, which is critical for
           maintaining the integrity of the bacterial outer
           membrane. The gene for this periplasmic protein has been
           designated orf2 and ybgF. All members of the seed
           alignment were from unique tol-pal gene regions from
           completed bacterial genomes. The architecture of this
           protein is a signal sequence, a low-complexity region
           usually rich in Asn and Gln, a well-conserved region
           with tandem repeats that resemble the tetratricopeptide
           (TPR) repeat, involved in protein-protein interaction.
          Length = 117

 Score = 33.4 bits (77), Expect = 0.083
 Identities = 22/99 (22%), Positives = 32/99 (32%), Gaps = 32/99 (32%)

Query: 42  ANEHFKNQAYNKAIELYSAAIEVNPTAIYYANRSFAYLKTEAIGYALNDASKAIELDQTY 101
           A    K   Y  AI+ + A ++  P + Y  N                            
Sbjct: 7   ALLVLKAGDYADAIQAFQAFLKKYPKSTYAPN---------------------------- 38

Query: 102 TKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDAKA 140
             A+Y    AY + G +  A + + AV K  P  K  KA
Sbjct: 39  --AHYWLGEAYYAQGDYADAAKAFLAVVKKYP--KSPKA 73


>gnl|CDD|205550 pfam13371, TPR_9, Tetratricopeptide repeat. 
          Length = 73

 Score = 31.8 bits (73), Expect = 0.12
 Identities = 18/62 (29%), Positives = 24/62 (38%)

Query: 77  AYLKTEAIGYALNDASKAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDK 136
            YL+ E    AL    + + L          R   Y  LG F+ AL D E   +  P+  
Sbjct: 4   IYLREEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLELCPDAP 63

Query: 137 DA 138
           DA
Sbjct: 64  DA 65


>gnl|CDD|147786 pfam05823, Gp-FAR-1, Nematode fatty acid retinoid binding protein
           (Gp-FAR-1).  Parasitic nematodes produce at least two
           structurally novel classes of small helix-rich retinol-
           and fatty-acid-binding proteins that have no
           counterparts in their plant or animal hosts and thus
           represent potential targets for new nematicides.
           Gp-FAR-1 is a member of the nematode-specific
           fatty-acid- and retinol-binding (FAR) family of proteins
           but localises to the surface of the organism, placing it
           in a strategic position for interaction with the host.
           Gp-FAR-1 functions as a broad-spectrum retinol- and
           fatty-acid-binding protein, and it is thought that it is
           involved in the evasion of primary host plant defence
           systems.
          Length = 154

 Score = 32.7 bits (75), Expect = 0.19
 Identities = 34/138 (24%), Positives = 56/138 (40%), Gaps = 8/138 (5%)

Query: 24  PKEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAAI-EVNPTAIYYANRSFAYLKTE 82
           P EV    K    + KA   E  KN    K  +   AA+ E +P+    A +  A LK +
Sbjct: 11  PAEVVEAYKSLTPEEKAALKEVAKNHKKFKNEDEALAALKEKSPSLYAKAEKLRALLKKK 70

Query: 83  AIGYALNDASKAI--ELDQTYTKAYYRRAAAY-MSLGKFKLALRDYEAVHKARPND--KD 137
               AL+  +KA   ++     K + +  A    SL + K   + Y A +KA   +  ++
Sbjct: 71  I--DALSPEAKAFVKKVIAKGRKLHAQSLAGEKPSLEELKELAKGYIAKYKALSEEAKEE 128

Query: 138 AKAKFTECNKIVKRMQIK 155
            K  F    K  +  + +
Sbjct: 129 LKKTFPILTKFFQNEKFQ 146


>gnl|CDD|234673 PRK00166, apaH, diadenosine tetraphosphatase; Reviewed.
          Length = 275

 Score = 33.2 bits (77), Expect = 0.24
 Identities = 32/125 (25%), Positives = 49/125 (39%), Gaps = 18/125 (14%)

Query: 195 GDIHGQFYDLMNIFELNGLPSPDNPYL-FNGDFVDRGSFSVECIFTLFGFKLLYPNHFFM 253
           GDI G + +L  + E      P    L   GD V+RG  S+E +  +        +    
Sbjct: 7   GDIQGCYDELQRLLEKIDF-DPAKDTLWLVGDLVNRGPDSLEVLRFVKSLG----DSAVT 61

Query: 254 ARGNHESAYMNQIYGFEGEVKSKYTDLMAQFFT-----EVYNWL---PLCHCINNKVLVM 305
             GNH+   +    G +   ++K  D +          E+ +WL   PL H      LVM
Sbjct: 62  VLGNHDLHLLAVAAGIK---RNKKKDTLDPILEAPDRDELLDWLRHQPLLHVDEELGLVM 118

Query: 306 -HGGL 309
            H G+
Sbjct: 119 VHAGI 123


>gnl|CDD|233082 TIGR00668, apaH, bis(5'-nucleosyl)-tetraphosphatase (symmetrical). 
           Diadenosine 5',5"'-P1,P4-tetraphosphate (Ap4A) is a
           regulatory metabolite of stress conditions. It is
           hydrolyzed to two ADP by this enzyme. Alternate names
           include diadenosine-tetraphosphatase and Ap4A hydrolase
           [Cellular processes, Adaptations to atypical
           conditions].
          Length = 279

 Score = 33.3 bits (76), Expect = 0.24
 Identities = 40/125 (32%), Positives = 57/125 (45%), Gaps = 18/125 (14%)

Query: 195 GDIHGQFYDLMNIFE-LNGLPSPDNPYLFNGDFVDRGSFSVECIFTLFGFKLLYPNHFFM 253
           GD+HG + +L  + E +   P  D  +L  GD V RG  S+E    L   K L  +   +
Sbjct: 7   GDLHGCYDELQALLERVEFDPGQDTLWL-TGDLVARGPGSLEV---LRYVKSL-GDAVRL 61

Query: 254 ARGNHESAYMNQIYGFEGEVKSKYTD-----LMAQFFTEVYNWL---PLCHCINNKVLVM 305
             GNH+   ++ +  F G  ++K  D     L A    E+ NWL   PL      K LVM
Sbjct: 62  VLGNHD---LHLLAVFAGISRNKPKDRLDPLLEAPDADELLNWLRRQPLLQHDEEKKLVM 118

Query: 306 -HGGL 309
            H G+
Sbjct: 119 AHAGI 123


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 33.1 bits (76), Expect = 0.26
 Identities = 17/62 (27%), Positives = 25/62 (40%)

Query: 77  AYLKTEAIGYALNDASKAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDK 136
            YL+   +  A  +  KA+E D     AY   A  Y  LG+ + A   +       PN+ 
Sbjct: 40  GYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNG 99

Query: 137 DA 138
           D 
Sbjct: 100 DV 101


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 32.4 bits (74), Expect = 0.52
 Identities = 17/45 (37%), Positives = 22/45 (48%)

Query: 93  KAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKD 137
           KA+E D +Y  A+  RA  Y  LG+  LA   Y       PN+ D
Sbjct: 60  KALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGD 104


>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
          Length = 296

 Score = 32.2 bits (74), Expect = 0.58
 Identities = 17/42 (40%), Positives = 23/42 (54%)

Query: 94  AIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
            +ELD TY  AY  R  A    G+++LA  D  A ++  PND
Sbjct: 124 VLELDPTYNYAYLNRGIALYYGGRYELAQDDLLAFYQDDPND 165


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 30.0 bits (68), Expect = 0.60
 Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 7/75 (9%)

Query: 67  TAIYYANRSFAYLK----TEAIGY---ALNDASKAIELDQTYTKAYYRRAAAYMSLGKFK 119
            A    N +    +     EA+     AL  A +  E      +A    A  Y++LG + 
Sbjct: 4   LAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYD 63

Query: 120 LALRDYEAVHKARPN 134
            AL   E     R  
Sbjct: 64  EALEYLEKALALREA 78


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 27.8 bits (63), Expect = 1.2
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAV 128
           +AYY     Y+ LG ++ A   YE  
Sbjct: 2   RAYYLLGQIYLQLGDYEEAKEYYEKA 27


>gnl|CDD|205403 pfam13222, DUF4030, Protein of unknown function (DUF4030).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 164 and 197 amino
           acids in length.
          Length = 142

 Score = 30.1 bits (68), Expect = 1.5
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 7/54 (12%)

Query: 11  LSMSSTTNASNPAPKEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEV 64
           L++ +  N+S+P  KE         +K++ E N   K +   K IE  S  IE+
Sbjct: 89  LTIDTKINSSDPGAKEF-------GKKIEKEVNNLLKTEEVKKWIENDSYTIEI 135


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 30.5 bits (70), Expect = 1.5
 Identities = 12/45 (26%), Positives = 23/45 (51%)

Query: 22  PAPKEVSVEDKERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNP 66
           P+ K    E +ER E+   +A E  +    ++A E +  A+++ P
Sbjct: 83  PSKKGTEEERRERREENLEKALELLREGNRSEARECFQRAVDITP 127


>gnl|CDD|222122 pfam13429, TPR_15, Tetratricopeptide repeat. 
          Length = 279

 Score = 30.7 bits (70), Expect = 2.0
 Identities = 31/125 (24%), Positives = 48/125 (38%), Gaps = 29/125 (23%)

Query: 28  SVEDKERAEK----LKAEANEHFKNQAYNKAIELYSAAIEVNPTAIYYANRSFAYLKTEA 83
           S+  +E AE+    L A A  + +    + A+  Y  A+ + P     A+         A
Sbjct: 134 SLTPEEAAEQSAGFLSARAEYYRQTGQPDAALRDYRRALALPP-----ASSDLR----AA 184

Query: 84  IGYALNDASKAIEL-------------DQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHK 130
           + + L D  +  EL             D      +   AAAY+ LG+ K AL       K
Sbjct: 185 LLWLLIDLGRRAELKRLLAAWRGRAESDSALWGPF---AAAYLLLGRPKKALPYLRRQAK 241

Query: 131 ARPND 135
           + PND
Sbjct: 242 SNPND 246



 Score = 30.4 bits (69), Expect = 2.4
 Identities = 19/75 (25%), Positives = 29/75 (38%), Gaps = 8/75 (10%)

Query: 68  AIYYANRSFAYLKTEAIGYALNDASKAIELDQTYTKAYY--RRAAAYMSLGKFKLALRDY 125
           A+Y   R+  Y +   +   L   +     +Q+   A +   RA  Y   G+   ALRDY
Sbjct: 115 ALYLYTRAGDYKRIVDL---LASLTPEEAAEQS---AGFLSARAEYYRQTGQPDAALRDY 168

Query: 126 EAVHKARPNDKDAKA 140
                  P   D +A
Sbjct: 169 RRALALPPASSDLRA 183


>gnl|CDD|221956 pfam13174, TPR_6, Tetratricopeptide repeat. 
          Length = 33

 Score = 27.1 bits (61), Expect = 2.3
 Identities = 11/33 (33%), Positives = 15/33 (45%)

Query: 103 KAYYRRAAAYMSLGKFKLALRDYEAVHKARPND 135
            A Y+ A AY+ LG    A    E + K  P+ 
Sbjct: 1   DALYKLALAYLKLGDTDEAKEALERLLKRYPDS 33


>gnl|CDD|163656 cd07413, MPP_PA3087, Pseudomonas aeruginosa PA3087 and related
           proteins, metallophosphatase domain.  PA3087 is an
           uncharacterized protein from Pseudomonas aeruginosa with
           a metallophosphatase domain that belongs to the
           phosphoprotein phosphatase (PPP) family.  The PPP family
           also includes: PP1, PP2A, PP2B (calcineurin), PP4, PP5,
           PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4
           hydrolase. The PPP catalytic domain is defined by three
           conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-).  The
           PPP enzyme family is ancient with members found in all
           eukaryotes, and in most bacterial and archeal genomes.
           Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 222

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 14/43 (32%), Positives = 19/43 (44%), Gaps = 7/43 (16%)

Query: 195 GDIHGQFYDLMNIFELNGL-------PSPDNPYLFNGDFVDRG 230
           GDIHG    L+ +    G          P+   +F GD +DRG
Sbjct: 5   GDIHGHAEKLVVLLHKLGYQELSGVYRHPERQVVFLGDLIDRG 47


>gnl|CDD|234973 PRK01712, PRK01712, carbon storage regulator; Provisional.
          Length = 64

 Score = 27.6 bits (62), Expect = 2.9
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 23 APKEVSVEDKERAEKLKAEANEHFKNQ 49
          APKEVSV  +E  ++++AE +   K  
Sbjct: 36 APKEVSVHREEIYQRIQAENSTALKQS 62


>gnl|CDD|237480 PRK13721, PRK13721, conjugal transfer ATP-binding protein TraC;
           Provisional.
          Length = 844

 Score = 30.1 bits (68), Expect = 3.5
 Identities = 24/83 (28%), Positives = 37/83 (44%), Gaps = 16/83 (19%)

Query: 82  EAIGYALNDASKAIELDQTY---TKAYYRRAAAYMSLGKFKLA--------LRDYEA-VH 129
           + I Y L + S + E  + +   T+AYY RAA    L +F L         LR Y   + 
Sbjct: 78  DRIEYGLREFSWSGEQAERFNAITRAYYMRAA----LTQFPLPEGMNLPLTLRHYRVFIS 133

Query: 130 KARPNDKDAKAKFTECNKIVKRM 152
              P+ K ++A   E   +VK +
Sbjct: 134 YCSPSKKKSRADILEMENLVKII 156


>gnl|CDD|173250 PRK14789, PRK14789, lipoprotein signal peptidase; Provisional.
          Length = 191

 Score = 29.2 bits (65), Expect = 3.8
 Identities = 9/27 (33%), Positives = 12/27 (44%)

Query: 328 QPPDEGLMCELLWSDPQIPNGRAPSKR 354
           +P D+ L+ ELL  DP  P        
Sbjct: 159 EPIDDELLSELLSRDPSPPAATDARTS 185


>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
           Found in the DNRI/REDD/AFSR family of regulators. This
           region of AFSR along with the C terminal region is
           capable of independently directing actinorhodin
           production. This family contains TPR repeats.
          Length = 145

 Score = 28.8 bits (65), Expect = 4.5
 Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 96  ELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARP 133
            L +   +A    A A ++LG+ + AL   E +    P
Sbjct: 55  RLRELRLEALEALAEALLALGRHEEALALLERLLALDP 92


>gnl|CDD|153354 cd07670, BAR_SNX18, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexin 18.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. SNX18 is localized to peripheral
           endosomal structures, and acts in a trafficking pathway
           that is clathrin-independent but relies on AP-1 and
           PACS1. BAR domains form dimers that bind to membranes,
           induce membrane bending and curvature, and may also be
           involved in protein-protein interactions.
          Length = 207

 Score = 29.1 bits (65), Expect = 5.3
 Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 2/55 (3%)

Query: 60  AAIEVNPTAIYYANRSFAYLKTE--AIGYALNDASKAIELDQTYTKAYYRRAAAY 112
           + +++N TA  +A +     K E   +G +    S+A ELDQ    A   +A A+
Sbjct: 27  SVLQLNHTANEFARKQVTGFKKEYQKVGQSFKGLSQAFELDQQAFSAGLNQAIAF 81


>gnl|CDD|237917 PRK15134, PRK15134, microcin C ABC transporter ATP-binding protein
           YejF; Provisional.
          Length = 529

 Score = 29.3 bits (66), Expect = 6.1
 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 13/50 (26%)

Query: 366 KAFLERNKLEYIIRSHE------------VKQDGYEVAHDGRCITVFSAP 403
           K+  ++++L Y+  SH+            V + G EV   G C  VF+AP
Sbjct: 469 KSLQQKHQLAYLFISHDLHVVRALCHQVIVLRQG-EVVEQGDCERVFAAP 517


>gnl|CDD|225464 COG2912, COG2912, Uncharacterized conserved protein [Function
           unknown].
          Length = 269

 Score = 28.8 bits (65), Expect = 6.3
 Identities = 20/109 (18%), Positives = 35/109 (32%), Gaps = 7/109 (6%)

Query: 33  ERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPTAIY---YANRSFAYLKTEAIGYALN 89
               +L+          A  + +       + +   I      N   A L+      AL 
Sbjct: 147 LSQHELQEWLKGTIGPSA--ELLPEDLK--QASNREILSRLLRNLKAALLRELQWELALR 202

Query: 90  DASKAIELDQTYTKAYYRRAAAYMSLGKFKLALRDYEAVHKARPNDKDA 138
            A + ++L+         R   Y  LG + +AL D     +  P+D  A
Sbjct: 203 VAERLLDLNPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHCPDDPIA 251


>gnl|CDD|225318 COG2520, COG2520, Predicted methyltransferase [General function
           prediction only].
          Length = 341

 Score = 28.8 bits (65), Expect = 6.7
 Identities = 26/137 (18%), Positives = 47/137 (34%), Gaps = 34/137 (24%)

Query: 61  AIEVNPTAIYYANRSFAYLKTEAIGYALNDAS---KAIELDQTYTKAYYRRA-------- 109
           AI++NP A+ Y   +            LN      + I  D          A        
Sbjct: 216 AIDINPDAVEYLKENI----------RLNKVEGRVEPILGDAREVAPELGVADRIIMGLP 265

Query: 110 -AAYMSLGKFKLALRD------YEAVHKARPNDKDAKAKFTECNKIVKRMQIKGKLHR-- 160
            +A+  L      L+D      YE V +    ++  K   +   K   ++    K+ R  
Sbjct: 266 KSAHEFLPLALELLKDGGIIHYYEFVPEDDIEERPEKRIKSAARKGGYKV-EVLKVRRVK 324

Query: 161 KYA---YRILMDIKALF 174
            Y+   Y +++D++ L 
Sbjct: 325 SYSPGVYHVVVDLRVLK 341


>gnl|CDD|129630 TIGR00539, hemN_rel, putative oxygen-independent coproporphyrinogen
           III oxidase.  Experimentally determined examples of
           oxygen-independent coproporphyrinogen III oxidase, an
           enzyme that replaces HemF function under anaerobic
           conditions, belong to a family of proteins described by
           the model hemN. This model, hemN_rel, models a closely
           related protein, shorter at the amino end and lacking
           the region containing the motif PYRT[SC]YP found in
           members of the hemN family. Several species, including
           E. coli, Helicobacter pylori, Aquifex aeolicus, and
           Chlamydia trachomatis, have members of both this family
           and the E. coli hemN family. The member of this family
           from Bacillus subtilis was shown to complement an
           hemF/hemN double mutant of Salmonella typimurium and to
           prevent accumulation of coproporphyrinogen III under
           anaerobic conditions, but the exact role of this protein
           is still uncertain. It is found in a number of species
           that do not synthesize heme de novo [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 360

 Score = 29.1 bits (65), Expect = 7.0
 Identities = 33/170 (19%), Positives = 58/170 (34%), Gaps = 21/170 (12%)

Query: 5   YLSLEYLSMSSTTNASNPAPKEVSVE-----DKERAEKLKAEANEHFKNQAYNKAIELYS 59
           +LS   LS+   TN    A K    +     D+   E L+    + ++   Y KA     
Sbjct: 183 HLSAYALSVEPNTNFEKNAKKLPDDDSCAHFDEVVREILEGFGFKQYEVSNYAKA----- 237

Query: 60  AAIEVNPTAIYYANRSFAYLKTEAIGYALNDASKAIELDQTYTKAYYRRAAAYMS----- 114
              +V     Y+  + +      A G   N+   A +L + Y K   +R    ++     
Sbjct: 238 -GYQVKHNLAYWGAKDYLGCGAGAHGCVANERFFAKKLIKNYIKDPLQRGVETLNEKNVP 296

Query: 115 -----LGKFKLALRDYEAVHKARPNDKDAKAKFTECNKIVKRMQIKGKLH 159
                L K  L LR    V K+  ++    ++     +  K      +L 
Sbjct: 297 KQDKRLEKLFLGLRCVLGVEKSFFDENKGLSQVKFLIEENKAFIKNNRLI 346


>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
           Provisional.
          Length = 172

 Score = 28.1 bits (63), Expect = 7.5
 Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 30  EDK-ERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNP 66
           ED  +R+  L      +  N  ++KA+E Y  A+E+NP
Sbjct: 66  EDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNP 103


>gnl|CDD|219523 pfam07697, 7TMR-HDED, 7TM-HD extracellular.  This entry
          represents the extracellular domain of the 7TM-HD (7TM
          Receptors with HD hydrolase).
          Length = 218

 Score = 28.4 bits (64), Expect = 7.7
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 23 APKEVSVEDKERAEKLKAEA 42
          AP+++ V D+E  E+ + EA
Sbjct: 13 APRDIEVVDEEATEEKRKEA 32


>gnl|CDD|234457 TIGR04075, bacter_Pnkp, polynucleotide kinase-phosphatase.  Members
           of this protein family are the bacterial polynucleotide
           kinase-phosphatase (Pnkp) whose genes occur paired with
           genes for the 3' terminal RNA ribose
           2'-O-methyltransferase Hen1. All members of the seed
           alignment belong to a cassette with the Hen1. The pair
           acts in bacterial RNA repair. This enzyme performs
           end-healing reactions on broken RNA, preparing from the
           RNA ligase to close the break. The working hypothesis is
           that the combination of Pnkp (RNA repair) and Hen1 (RNA
           modification) serves to first repair RNA damage from
           ribotoxins and then perform a modification that prevents
           the damage from recurring [Transcription, RNA
           processing].
          Length = 851

 Score = 28.8 bits (65), Expect = 8.4
 Identities = 18/83 (21%), Positives = 31/83 (37%), Gaps = 16/83 (19%)

Query: 191 FTVCGDIHGQFYDLMNI-----FELNGLPSPDNPY---------LFNGDFVDRGSFSVEC 236
           F + GD+HG   +L  +     +++                   +F GD VDRG  S   
Sbjct: 182 FDIIGDVHGCRDELETLLEELGYQIERDEGGRPVDVTHPEGRKAVFVGDLVDRGPDSPGV 241

Query: 237 IFTLFGFKLLYPNHFFMARGNHE 259
           +  + G  ++         GNH+
Sbjct: 242 LRLVMG--MVAAGTALCVPGNHD 262


>gnl|CDD|239141 cd02678, MIT_VPS4, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This sub-family
          of MIT domains is found in intracellular protein
          transport proteins of the AAA-ATPase family. The
          molecular function of the MIT domain is unclear.
          Length = 75

 Score = 26.5 bits (59), Expect = 10.0
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 32 KERAEKLKAEANEHFKNQAYNKAIELYSAAIEVNPTAIYY 71
           ++A +L  +A E      Y +A+ LY  A+E    A+ Y
Sbjct: 3  LQKAIELVKKAIEEDNAGNYEEALRLYQHALEYFMHALKY 42


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.416 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,102,519
Number of extensions: 2775596
Number of successful extensions: 2466
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2385
Number of HSP's successfully gapped: 118
Length of query: 546
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 444
Effective length of database: 6,413,494
Effective search space: 2847591336
Effective search space used: 2847591336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.3 bits)