RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy545
         (531 letters)



>gnl|CDD|238363 cd00707, Pancreat_lipase_like, Pancreatic lipase-like enzymes.
           Lipases are esterases that can hydrolyze long-chain
           acyl-triglycerides into di- and monoglycerides,
           glycerol, and free fatty acids at a water/lipid
           interface.  A typical feature of lipases is "interfacial
           activation," the process of becoming active at the
           lipid/water interface, although several examples of
           lipases have been identified that do not undergo
           interfacial activation .  The active site of a lipase
           contains a catalytic triad consisting of Ser - His -
           Asp/Glu, but unlike most serine proteases, the active
           site is buried inside the structure.  A "lid" or "flap"
           covers the active site, making it inaccessible to
           solvent and substrates. The lid opens during the process
           of interfacial activation, allowing the lipid substrate
           access to the active site.
          Length = 275

 Score =  378 bits (973), Expect = e-129
 Identities = 141/299 (47%), Positives = 182/299 (60%), Gaps = 25/299 (8%)

Query: 75  IDTRFILYTRKNPTEGHMLKVQNERTIEKSNFDPKKKTKFIIHGFIDTPLSSWVKEMRRE 134
           ID RF+LYTR+NP    +L   +  +++ SNF+P + T+FIIHG+  +   SW+ ++R+ 
Sbjct: 1   IDVRFLLYTRENPNCPQLLFADDPSSLKNSNFNPSRPTRFIIHGWTSSGEESWISDLRKA 60

Query: 135 LLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGLNPADVHMIGHS 194
            L   D+NVIVVDW  G+ P Y QA  NTR+VG ELA F+++L DN GL+  +VH+IGHS
Sbjct: 61  YLSRGDYNVIVVDWGRGANPNYPQAVNNTRVVGAELAKFLDFLVDNTGLSLENVHLIGHS 120

Query: 195 LGAHTAGYAGQAIEG-LGRITGLDPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSIFLL 253
           LGAH AG+AG+ + G LGRITGLDPA P F G  P  RLDP+DA  VDVIHTDG      
Sbjct: 121 LGAHVAGFAGKRLNGKLGRITGLDPAGPLFSGADPEDRLDPSDAQFVDVIHTDGG----- 175

Query: 254 GYGMSETCGHLDFYPNNGKEQPGCDLTETPLPLTLIKEGIEEAGRVLVACNHVRAIKLFT 313
             G S+  GH DFYPN G++QPGC                +      VAC+H RA+  F 
Sbjct: 176 LLGFSQPIGHADFYPNGGRDQPGC--------------PKDILSSDFVACSHQRAVHYFA 221

Query: 314 ESINSKCPYVAHRCPSYQHFLQGRCFSCGENGTGCALMGLEAQKTNHPPGSKYYITTGK 372
           ESI S C +VA+ C SY  FL G+CF CG    GC  MG  A +       K+Y+ T  
Sbjct: 222 ESILSPCGFVAYPCSSYDEFLAGKCFPCGS---GCVRMGYHADRF--RREGKFYLKTNA 275


>gnl|CDD|215752 pfam00151, Lipase, Lipase. 
          Length = 329

 Score =  331 bits (850), Expect = e-110
 Identities = 144/341 (42%), Positives = 186/341 (54%), Gaps = 23/341 (6%)

Query: 44  CYDELGCLNVTRDWYHL-IYRPFNIFPLPRAVIDTRFILYTRKNPTEGHMLKVQNERTIE 102
           CY  LGC    + W    + RP    P     I+ RF+LYT +NP    ++      TIE
Sbjct: 4   CYGLLGCFGDKKPWAGNTLARPAKSLPWQPKFIEVRFLLYTNENPNNCQLITTGLPSTIE 63

Query: 103 KSNFDPKKKTKFIIHGFIDTPLS-SWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATA 161
            SNF+  KKT+ IIHGF D     SW+ +M + L +    NVIVVDW  GS   Y QAT 
Sbjct: 64  NSNFNTSKKTRAIIHGFTDKGQEESWLSDMCKNLFQVEGVNVIVVDWGSGSTTFYRQATL 123

Query: 162 NTRLVGLELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEG---LGRITGLDP 218
           N R+VG E+A  +  L++   ++P +VH+IGHSLGAH AG AG+  +G   LGRITGLDP
Sbjct: 124 NVRVVGAEVAKLLVELEEELNVSPENVHLIGHSLGAHVAGEAGRRTKGKTKLGRITGLDP 183

Query: 219 AEPYFQGMPPFARLDPTDADLVDVIHTDGSSIFLLGYGMSETCGHLDFYPNNGKEQPGCD 278
           A PYF+G P   RLDP DAD VD IHTD   I  LG G S+  GH+DF+PN G EQPGC 
Sbjct: 184 AGPYFKGTPELTRLDPGDADFVDAIHTDTRPIPGLGMGTSQRVGHVDFFPNGGSEQPGCQ 243

Query: 279 LTETPLPLTLIKEGIEEAGRVLVACNHVRAIKLFTESINSKCPYVAHRCPSYQHFLQGRC 338
                             G   VAC H+R+++ F ES+ +   + A+ C SY  F   +C
Sbjct: 244 -------------NNVLEGTQFVACAHMRSVRYFAESLLNPRNFPAYPCSSYDEFSNNKC 290

Query: 339 FSCGENGTGCALMGLEAQKTNHPPGS---KYYITTGKEVPF 376
           F C +    C  MG+ A K +         YY+ T  + PF
Sbjct: 291 FGCPKG--RCPYMGIYADKFDGKTSKLEGDYYLNTNSKSPF 329


>gnl|CDD|132274 TIGR03230, lipo_lipase, lipoprotein lipase.  Members of this
           protein family are lipoprotein lipase (EC 3.1.1.34), a
           eukaryotic triacylglycerol lipase active in plasma and
           similar to pancreatic and hepatic triacylglycerol
           lipases (EC 3.1.1.3). It is also called clearing factor.
           It cleaves chylomicron and VLDL triacylglycerols; it
           also has phospholipase A-1 activity.
          Length = 442

 Score =  210 bits (536), Expect = 3e-62
 Identities = 136/392 (34%), Positives = 213/392 (54%), Gaps = 28/392 (7%)

Query: 75  IDTRFILYTRKNPTEG--HMLKVQNERTIEKSNFDPKKKTKFIIHGFIDTPL-SSWVKEM 131
           I+++F L T + P +   +++  Q + +I   NF+ + KT  +IHG+  T +  SWV ++
Sbjct: 5   IESKFSLRTPEEPDDDTCYIVPGQPD-SIADCNFNHETKTFIVIHGWTVTGMFESWVPKL 63

Query: 132 RRELLKHS-DWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGLNPADVHM 190
              L +     NVIVVDW   +   Y  + A T+LVG ++A FVN++++ +     +VH+
Sbjct: 64  VAALYEREPSANVIVVDWLSRAQQHYPTSAAYTKLVGKDVAKFVNWMQEEFNYPWDNVHL 123

Query: 191 IGHSLGAHTAGYAGQAIE-GLGRITGLDPAEPYFQGMPPFARLDPTDADLVDVIHTD--- 246
           +G+SLGAH AG AG   +  + RITGLDPA P F+     + L P DAD VDV+HT+   
Sbjct: 124 LGYSLGAHVAGIAGSLTKHKVNRITGLDPAGPTFEYADAPSTLSPDDADFVDVLHTNTRG 183

Query: 247 --GSSIFLLGYGMSETCGHLDFYPNNGKEQPGCDLTETPLPLTLIKE-GIEEAGRVLVAC 303
               SI     G+    GH+D YPN G  QPGCD+ ET   L +I E G+    + LV C
Sbjct: 184 SPDRSI-----GIQRPVGHIDIYPNGGTFQPGCDIQET---LLVIAEKGLGNMDQ-LVKC 234

Query: 304 NHVRAIKLFTES-INSKCPYVAHRCPSYQHFLQGRCFSCGENGTGCALMGLEAQKTNHPP 362
           +H R+I LF +S +N + P +A+RC S + F +G C SC +N   C  +G E  K     
Sbjct: 235 SHERSIHLFIDSLLNEENPSMAYRCSSKEAFNKGLCLSCRKN--RCNKLGYEINKVRTKR 292

Query: 363 GSKYYITTGKEVPFCRHHYRITLDLARPPRAETWVQGFMRVSLHSDNGVIRNLDLTPNGY 422
            SK Y+ T + +P+   HY++ +       + +     M++SL+  +G   N+   P   
Sbjct: 293 SSKMYLKTREMMPYKVFHYQVKVHFF-GKTSLSHTDQPMKISLYGTHGEKENI---PFTL 348

Query: 423 EKLEHGTSRSFVVTHPEDVGTINKVEFYWEYD 454
            ++    + SF++T   D+G +  V+  WE D
Sbjct: 349 PEVSTNKTYSFLITTDVDIGELLMVKLKWEKD 380


>gnl|CDD|238382 cd00741, Lipase, Lipase.  Lipases are esterases that can hydrolyze
           long-chain acyl-triglycerides into di- and
           monoglycerides, glycerol, and free fatty acids at a
           water/lipid interface.  A typical feature of lipases is
           "interfacial activation", the process of becoming active
           at the lipid/water interface, although several examples
           of lipases have been identified that do not undergo
           interfacial activation . The active site of a lipase
           contains a catalytic triad consisting of Ser - His -
           Asp/Glu, but unlike most serine proteases, the active
           site is buried inside the structure.  A "lid" or "flap"
           covers the active site, making it inaccessible to
           solvent and substrates. The lid opens during the process
           of interfacial activation, allowing the lipid substrate
           access to the active site.
          Length = 153

 Score =  100 bits (251), Expect = 5e-25
 Identities = 42/156 (26%), Positives = 54/156 (34%), Gaps = 12/156 (7%)

Query: 161 ANTRLVGLELAYFVNYLKDNYGLN--PADVHMIGHSLGAHTAGYAG-----QAIEGLGRI 213
                    LA  V  L  +         +H+ GHSLG   AG AG     + +  L R+
Sbjct: 1   KGFYKAARSLANLVLPLLKSALAQYPDYKIHVTGHSLGGALAGLAGLDLRGRGLGRLVRV 60

Query: 214 TGLDPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSI-FLLGYGMSETCGHLDFYPNNGK 272
               P            RLDP+DA  VD I  D   +  L   G     G  +FY N GK
Sbjct: 61  YTFGPPRVGNAAFAED-RLDPSDALFVDRIVNDNDIVPRLPPGGEGYPHGGAEFYINGGK 119

Query: 273 EQPGCDLTETPLPLTLIKEGIEEAGRVLVACNHVRA 308
            QPGC             +           C+H+R 
Sbjct: 120 SQPGCCKNVLEAVD---IDFGNIGLSGNGLCDHLRY 152


>gnl|CDD|221720 pfam12697, Abhydrolase_6, Alpha/beta hydrolase family.  This family
           contains alpha/beta hydrolase enzymes of diverse
           specificity.
          Length = 187

 Score = 42.4 bits (100), Expect = 2e-04
 Identities = 22/119 (18%), Positives = 43/119 (36%), Gaps = 12/119 (10%)

Query: 115 IIHGFIDTPLSSWVKEMRRELLKHSDWNVIVVDWAG-GSLPLYTQATANTRLVGLELAYF 173
           ++HG   +   SW + +   L   + + V+  D  G G      +   +      +LA  
Sbjct: 3   LLHGAGGSA-ESW-RPLAEAL--AAGYRVLAPDLPGHGDSDGPPRTPYSLEDDAADLAAL 58

Query: 174 VNYLKDNYGLNPADVHMIGHSLGAHTA-GYAGQAIEGLGRITGLDPAEPYFQGMPPFAR 231
           ++ L          V ++GHSLG   A   A +  E +  +  + P     + +     
Sbjct: 59  LDAL------GLGPVVLVGHSLGGAVALAAAARRPERVAGLVLISPPLRDLEELLAADA 111


>gnl|CDD|238853 cd01755, PLAT_lipase, PLAT/ LH2 domain present in connection with a
           lipase domain. This family contains two major subgroups,
           the  lipoprotein lipase (LPL) and the pancreatic
           triglyceride lipase.  LPL is a key enzyme in catabolism
           of plasma lipoprotein triglycerides (TGs). The central
           role of triglyceride lipases is in energy production. In
           general, PLAT/LH2 domain's proposed function is to
           mediate interaction with lipids or membrane bound
           proteins.
          Length = 120

 Score = 37.3 bits (87), Expect = 0.003
 Identities = 26/106 (24%), Positives = 45/106 (42%), Gaps = 10/106 (9%)

Query: 379 HHYRITLDLARPPRAETWVQGFMRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHP 438
            HY++ + L+   +    V G   VSL+   G    L +      +L+   + SF++   
Sbjct: 1   WHYQVKVHLSG--KKNLEVDGTFTVSLYGTKGETEQLPIV---LGELKPNKTYSFLIDTE 55

Query: 439 EDVGTINKVEFYWEYDMDVLQPRSLCFLWCNDHLYVSSIKVTETKD 484
            D+G + KV+F WE   +V+   S   L     L    I+V   + 
Sbjct: 56  VDIGDLLKVKFKWEN--NVINSNSGETL---PKLGARKIRVKSGET 96


>gnl|CDD|223669 COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta
           hydrolase superfamily) [General function prediction
           only].
          Length = 282

 Score = 36.9 bits (84), Expect = 0.018
 Identities = 24/111 (21%), Positives = 37/111 (33%), Gaps = 11/111 (9%)

Query: 106 FDPKKKTKFIIHGFIDTPLSSWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRL 165
                    ++HGF  +  S W    +      + + VI  D  G         +     
Sbjct: 17  AGGGGPPLVLLHGFPGSS-SVWRPVFKVLPALAARYRVIAPDLRGHGR------SDPAGY 69

Query: 166 VGLELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEGLGRITGL 216
                A  +  L D  GL    V ++GHS+G   A     A+    R+ GL
Sbjct: 70  SLSAYADDLAALLDALGLEK--VVLVGHSMGGAVALAL--ALRHPDRVRGL 116


>gnl|CDD|221718 pfam12695, Abhydrolase_5, Alpha/beta hydrolase family.  This family
           contains a diverse range of alpha/beta hydrolase
           enzymes.
          Length = 145

 Score = 34.7 bits (80), Expect = 0.036
 Identities = 19/111 (17%), Positives = 34/111 (30%), Gaps = 21/111 (18%)

Query: 115 IIHGFIDTPLSSWVKEMRR--ELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAY 172
           ++HG          +        L    +NV+ VD+ G        A     ++      
Sbjct: 4   LLHGA-----GGDPEAYAPLARALASRGYNVVAVDYPGHGA--SLGAPDAEAVL------ 50

Query: 173 FVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEGLGRITGLDPAEPYF 223
                  +  L+P  + ++GHSLG   A         +     L   +P  
Sbjct: 51  ------ADAPLDPERIVLVGHSLGGGVALLLAARDPRVKAAVVLAAGDPPD 95


>gnl|CDD|201306 pfam00561, Abhydrolase_1, alpha/beta hydrolase fold.  This
           catalytic domain is found in a very wide range of
           enzymes.
          Length = 226

 Score = 33.3 bits (76), Expect = 0.22
 Identities = 20/86 (23%), Positives = 36/86 (41%), Gaps = 9/86 (10%)

Query: 141 WNVIVVDWAG-GSLPLYTQATANTRLVGLE-LAYFVNYLKDNYGLNPADVHMIGHSLGAH 198
           ++VI  D  G G     +    +      + LA  +  L D  GL+   V+++GHS+G  
Sbjct: 1   FDVIAFDLRGFG----RSSPPKDFADYRFDDLAEDLEALLDALGLDK--VNLVGHSMGGL 54

Query: 199 TAG-YAGQAIEGLGRITGLDPAEPYF 223
            A  YA +  + +  +  +    P  
Sbjct: 55  IALAYAAKYPDRVKALVLVGTVHPAG 80


>gnl|CDD|237934 PRK15290, lfpB, fimbrial chaperone protein; Provisional.
          Length = 243

 Score = 33.4 bits (76), Expect = 0.23
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 217 DPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSI 250
           D ++  F  +PP +RL+P+   ++ +IHT G S+
Sbjct: 81  DKSKAPFTVIPPVSRLEPSQEKVLRIIHTKGVSL 114


>gnl|CDD|216521 pfam01477, PLAT, PLAT/LH2 domain.  This domain is found in a
           variety of membrane or lipid associated proteins. It is
           called the PLAT (Polycystin-1, Lipoxygenase,
           Alpha-Toxin) domain or LH2 (Lipoxygenase homology)
           domain. The known structure of pancreatic lipase shows
           this domain binds to procolipase pfam01114, which
           mediates membrane association. So it appears possible
           that this domain mediates membrane attachment via other
           protein binding partners. The structure of this domain
           is known for many members of the family and is composed
           of a beta sandwich.
          Length = 115

 Score = 31.6 bits (72), Expect = 0.34
 Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 12/80 (15%)

Query: 401 MRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHPEDVGTINKVEFYWEYDMDVLQP 460
           + +SL+  +G      L     +  + G++ SF     ED+G +  ++  WE        
Sbjct: 19  VYISLYGTHGKSGEYPLLEPELKPDDRGSTDSFTFDTDEDLGDLGAIKIRWEI------- 71

Query: 461 RSLCFLWCNDHLYVSSIKVT 480
                   +D  ++ SI V 
Sbjct: 72  -----NGHSDEWFLKSILVK 86


>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 1061

 Score = 32.8 bits (75), Expect = 0.60
 Identities = 20/81 (24%), Positives = 32/81 (39%), Gaps = 4/81 (4%)

Query: 126 SWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGLNP 185
           S    +   L +   +++  V +A     L +   A   L  + +A     L  ++G+ P
Sbjct: 554 SAFDALEALLHRLLGFSLPEVIFAPDYPFLESIDFAQPALFAVSVALAA--LWRSWGVIP 611

Query: 186 ADVHMIGHSLGAHTAGYAGQA 206
             V  IGHSLG   A  A   
Sbjct: 612 GAV--IGHSLGELAAAVAAGV 630


>gnl|CDD|238061 cd00113, PLAT, PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin)
           domain or LH2 (Lipoxygenase homology 2)  domain.  It
           consists of an eight stranded beta-barrel. The domain
           can be found in various domain architectures, in case of
           lipoxygenases, alpha toxin, lipases and polycystin, but
           also as a single domain or as repeats.The putative
           function of this domain is to facilitate access to
           sequestered membrane or micelle bound substrates.
          Length = 116

 Score = 30.8 bits (70), Expect = 0.64
 Identities = 24/111 (21%), Positives = 38/111 (34%), Gaps = 19/111 (17%)

Query: 379 HHYRITLDLARPPRAETWVQGFMRVSLHSDNGVI-RNLDLTPNGYEKLEHGTSRSFVVTH 437
             Y +T+      +        + ++L+ +NG       L   G    E G++ +F +  
Sbjct: 1   CRYTVTIKTG--DKKGAGTDSNISLALYGENGNSSDIPILDGPG--SFERGSTDTFQIDL 56

Query: 438 PEDVGTINKVEFYWEYDMDVLQPRSLCFLWCNDHLYVSSIKV--TETKDIM 486
             D+G I KV      D   L        W     Y  SI V    TK + 
Sbjct: 57  KLDIGDITKVYLRR--DGSGLSDG-----W-----YCESITVQALGTKKVY 95


>gnl|CDD|238857 cd01759, PLAT_PL, PLAT/LH2 domain of pancreatic triglyceride
           lipase.  Lipases hydrolyze phospholipids and
           triglycerides to generate fatty acids for energy
           production or for storage and to release inositol
           phosphates that act as second messengers. The central
           role of triglyceride lipases is in energy production.
           The proposed function of PLAT/LH2 domains is to mediate
           interaction with lipids or membrane bound proteins.
          Length = 113

 Score = 30.0 bits (68), Expect = 0.88
 Identities = 20/76 (26%), Positives = 33/76 (43%), Gaps = 15/76 (19%)

Query: 381 YRITLDLARPPRAETWVQGFMRVSLHSDNGVIRNLDLTPNGYE----KLEHGTSRSFVVT 436
           Y++++ L      +  V G + VSL+ + G           YE     L+ G + S  + 
Sbjct: 3   YKVSVTL----SGKKKVTGTILVSLYGNKG-------NTRQYEIFKGTLKPGNTYSAFID 51

Query: 437 HPEDVGTINKVEFYWE 452
              DVG + KV+F W 
Sbjct: 52  VDVDVGPLTKVKFIWN 67


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.0 bits (72), Expect = 1.0
 Identities = 11/27 (40%), Positives = 23/27 (85%), Gaps = 1/27 (3%)

Query: 3   KKDKEREKERDREEEKGEREKEMRKSS 29
           ++++EREKE+++E E+ ERE+E  +++
Sbjct: 597 REEREREKEKEKERER-EREREAERAA 622



 Score = 30.4 bits (68), Expect = 2.7
 Identities = 12/23 (52%), Positives = 20/23 (86%), Gaps = 1/23 (4%)

Query: 2   EKKDKEREKERDREEEKGEREKE 24
           E++++E+EKE++RE E+ ERE E
Sbjct: 598 EEREREKEKEKERERER-EREAE 619



 Score = 28.9 bits (64), Expect = 7.6
 Identities = 11/23 (47%), Positives = 20/23 (86%), Gaps = 1/23 (4%)

Query: 2   EKKDKEREKERDREEEKGEREKE 24
           +K  +ERE+E+++E+E+ ERE+E
Sbjct: 594 QKAREEREREKEKEKER-ERERE 615


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 31.1 bits (71), Expect = 1.1
 Identities = 8/30 (26%), Positives = 16/30 (53%)

Query: 1   MEKKDKEREKERDREEEKGEREKEMRKSSD 30
           +E+ ++E E++RD EE +        +S  
Sbjct: 119 LEQLEREEEEKRDEEERERLLRAAKSRSEQ 148


>gnl|CDD|129739 TIGR00654, PhzF_family, phenazine biosynthesis protein PhzF family.
            Members of this family show a distant global similarity
           to diaminopimelate epimerases, which can be taken as the
           outgroup. One member of this family has been shown to
           act as an enzyme in the biosynthesis of the antibiotic
           phenazine in Pseudomonas aureofaciens. The function in
           other species is unclear [Cellular processes, Toxin
           production and resistance].
          Length = 297

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 24/99 (24%)

Query: 351 MGLEAQKTNH-------PPGSK------YYITTGKEVPFCRH-----HYRITLDLARPPR 392
           M   A +TN+        P S+         T   E+PF  H      Y   L+  +   
Sbjct: 35  MQAIANETNYSETTFLLKPSSEKYDYRLRIFTPRSELPFAGHPTIGSCY-ALLEFTKLTT 93

Query: 393 AETWVQ----GFMRVSLHSDNGVIRNLDLTPN-GYEKLE 426
           A T VQ    G + V+++  NG +R     P   +E + 
Sbjct: 94  ATTLVQECKAGAVPVTINEKNGDLRISLEQPMPDFEPIS 132


>gnl|CDD|214838 smart00827, PKS_AT, Acyl transferase domain in polyketide synthase
           (PKS) enzymes. 
          Length = 298

 Score = 31.2 bits (72), Expect = 1.3
 Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 17/84 (20%)

Query: 127 WVKEMRRELLKHSDWNVIVVDWAGG-----SLPLYTQATANTRLVGLELAYFVNYLKDNY 181
            + E    L     W+++ V                Q      L  +++A     L+  +
Sbjct: 26  ALDECDAALQPLLGWSLLDVLLGEDGAASLLDTEVAQPA----LFAVQVA-LARLLRS-W 79

Query: 182 GLNPADVHMIGHSLG----AHTAG 201
           G+ P    ++GHS G    A+ AG
Sbjct: 80  GVRPD--AVVGHSSGEIAAAYVAG 101


>gnl|CDD|214608 smart00308, LH2, Lipoxygenase homology 2 (beta barrel) domain. 
          Length = 105

 Score = 29.1 bits (66), Expect = 1.8
 Identities = 11/54 (20%), Positives = 19/54 (35%)

Query: 401 MRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHPEDVGTINKVEFYWEYD 454
           + +SL    G  +   L          G++  F     ED G +  V+   E+ 
Sbjct: 21  VSLSLVGAEGDGKESKLDYLFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR 74


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 28.1 bits (63), Expect = 1.9
 Identities = 13/23 (56%), Positives = 16/23 (69%), Gaps = 1/23 (4%)

Query: 2  EKKDKEREKERDREEEKGEREKE 24
          E+K +E E+ R REE K EREK 
Sbjct: 33 EEKREEEEEARKREERK-EREKN 54



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 7/24 (29%), Positives = 14/24 (58%)

Query: 2  EKKDKEREKERDREEEKGEREKEM 25
          EK+++E E  +  E ++ E+ K  
Sbjct: 34 EKREEEEEARKREERKEREKNKSF 57



 Score = 27.7 bits (62), Expect = 3.5
 Identities = 8/25 (32%), Positives = 17/25 (68%)

Query: 3  KKDKEREKERDREEEKGEREKEMRK 27
          KK+ + E+E+  EEE+  + +E ++
Sbjct: 26 KKELKAEEEKREEEEEARKREERKE 50


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 30.6 bits (70), Expect = 2.0
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 6   KEREKERDREEEKGEREKEMRK 27
           +EREK R  E E+  +E+E   
Sbjct: 81  EEREKRRQEEYEERLQEREQMD 102



 Score = 29.5 bits (67), Expect = 4.5
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 1   MEKKDKEREKERDREEEKGEREKEMRK 27
           +E+K++ R  ER+ E E+GER +E   
Sbjct: 297 IEEKEERRAAEREEELEEGERLREEEA 323


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 30.6 bits (69), Expect = 2.4
 Identities = 14/28 (50%), Positives = 21/28 (75%), Gaps = 1/28 (3%)

Query: 2   EKKDKEREKERDREEEKGEREKEMRKSS 29
           ++K+K+ E+ RDREEEK +RE+   KS 
Sbjct: 148 KEKEKKVEEPRDREEEK-KRERVRAKSR 174


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
          represents a subfamily of RNA splicing factors
          including the Pad-1 protein (N. crassa), CAPER (M.
          musculus) and CC1.3 (H.sapiens). These proteins are
          characterized by an N-terminal arginine-rich, low
          complexity domain followed by three (or in the case of
          4 H. sapiens paralogs, two) RNA recognition domains
          (rrm: pfam00706). These splicing factors are closely
          related to the U2AF splicing factor family (TIGR01642).
          A homologous gene from Plasmodium falciparum was
          identified in the course of the analysis of that genome
          at TIGR and was included in the seed.
          Length = 457

 Score = 30.2 bits (68), Expect = 2.7
 Identities = 8/29 (27%), Positives = 15/29 (51%), Gaps = 1/29 (3%)

Query: 3  KKDKEREKERDREEEKGE-REKEMRKSSD 30
           +D+ER + R+      + RE+  R+S  
Sbjct: 2  YRDRERGRLRNDTRRSDKGRERSRRRSRS 30


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 14/26 (53%), Positives = 17/26 (65%), Gaps = 1/26 (3%)

Query: 2   EKKDKEREKERDREEEKGEREKEMRK 27
           EKK  E E +R  EE K ERE+E +K
Sbjct: 102 EKKLDEEEAKRQHEEAK-EREREKKK 126


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 13/38 (34%)

Query: 2   EKKDKEREKERDRE------------EEKGEREKEMRK 27
           EKK++++++ER+ +            EEK ER+K+ RK
Sbjct: 286 EKKEEKKKEEREAKLAKLSPEEQRKLEEK-ERKKQARK 322



 Score = 28.4 bits (64), Expect = 9.3
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 1   MEKKDKEREKERDREEEKGEREKEMRKSS 29
            E++ +E  +E+  E++K ERE ++ K S
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLS 304


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 30.3 bits (68), Expect = 2.9
 Identities = 9/45 (20%), Positives = 17/45 (37%)

Query: 1   MEKKDKEREKERDREEEKGEREKEMRKSSDVLYIFYIVENETRCY 45
           +E+ +   E   D EE K +  K   ++   L  + + E      
Sbjct: 180 IEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224


>gnl|CDD|223730 COG0657, Aes, Esterase/lipase [Lipid metabolism].
          Length = 312

 Score = 29.9 bits (67), Expect = 3.1
 Identities = 22/99 (22%), Positives = 37/99 (37%), Gaps = 15/99 (15%)

Query: 114 FIIHG--FIDTPLSSWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELA 171
             +HG  ++   L +    + R L   +   V+ VD        Y  A  +     LE A
Sbjct: 83  LYLHGGGWVLGSLRTHDALVAR-LAAAAGAVVVSVD--------YRLAPEHPFPAALEDA 133

Query: 172 Y----FVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQA 206
           Y    ++       G++P+ + + G S G H A     A
Sbjct: 134 YAAYRWLRANAAELGIDPSRIAVAGDSAGGHLALALALA 172


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
          (DUF2040).  This entry is a conserved domain of
          approximately 130 residues of proteins conserved from
          fungi to humans. The proteins do contain a coiled-coil
          domain, but the function is unknown.
          Length = 128

 Score = 28.5 bits (64), Expect = 3.7
 Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 4/27 (14%)

Query: 1  MEKKDKEREKERDREEEK---GEREKE 24
          +E  ++ R++ER+  EE+    EREKE
Sbjct: 55 LEAAER-RKREREIAEERKLQKEREKE 80


>gnl|CDD|163133 TIGR03090, SASP_tlp, small, acid-soluble spore protein tlp.  This
          protein family is restricted to a subset of
          endospore-forming bacteria such as Bacillus subtilis,
          all of which are in the Firmicutes (low-GC
          Gram-positive) lineage. Although previously designated
          tlp (thioredoxin-like protein), the B. subtilis protein
          was shown to be a minor small acid-soluble spore
          protein SASP, unique to spores. The motif E[VIL]XDE
          near the C-terminus probably represents at a
          germination protease cleavage site [Cellular processes,
          Sporulation and germination].
          Length = 70

 Score = 27.5 bits (61), Expect = 3.7
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 1  MEKKDKEREKERDREEEKGEREKE 24
          +E   +  E+E+ R EEK ER ++
Sbjct: 31 IEAHAELSEEEKQRIEEKNERREQ 54


>gnl|CDD|184995 PRK15035, PRK15035, cytochrome bd-II oxidase subunit 1;
           Provisional.
          Length = 514

 Score = 30.0 bits (67), Expect = 3.8
 Identities = 32/116 (27%), Positives = 50/116 (43%), Gaps = 22/116 (18%)

Query: 124 LSSWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGL 183
           L + V+ +R ++ +H  W + +  W   SLPL          + +E  +F+      +G 
Sbjct: 405 LIALVQTLRGKIDQHR-WVLKMALW---SLPL--------PWIAIEAGWFMT----EFGR 448

Query: 184 NPADVHMIGHSLGAHTAGYAGQAIEGLGRITGLDP----AEPYFQGMPPFARLDPT 235
            P  +  I  +  AH+A   GQ    L  I GL      AE Y   M  +ARL P+
Sbjct: 449 QPWAIQDILPTYSAHSALTTGQLAFSLIMIVGLYTLFLIAEVYL--MQKYARLGPS 502


>gnl|CDD|233111 TIGR00745, apbA_panE, 2-dehydropantoate 2-reductase.  This model
           describes enzymes that perform as 2-dehydropantoate
           2-reductase, one of four enzymes required for the de
           novo biosynthesis of pantothenate (vitamin B5) from Asp
           and 2-oxoisovalerate. Although few members of the seed
           alignment are characterized experimentally, nearly all
           from complete genomes are found in a genome-wide (but
           not local) context of all three other
           pantothenate-biosynthetic enzymes (TIGR00222, TIGR00018,
           TIGR00223). The gene encoding this enzyme is designated
           apbA in Salmonella typhimurium and panE in Escherichia
           coli; this protein functions as a monomer and functions
           in the alternative pyrimidine biosynthetic, or APB,
           pathway, used to synthesize the pyrimidine moiety of
           thiamine. Note, synthesis of the pyrimidine moiety of
           thiamine occurs either via the first five steps in de
           novo purine biosynthesis, which uses the pur gene
           products, or through the APB pathway. Note that this
           family includes both NADH and NADPH-dependent enzymes,
           and enzymes with broad specificity, such as a
           D-mandelate dehydrogease that is also a
           2-dehydropantoate 2-reductase [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Pantothenate
           and coenzyme A].
          Length = 293

 Score = 29.6 bits (67), Expect = 4.1
 Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 1/60 (1%)

Query: 279 LTETPLPLTLIKEGIEEAGRVLVACNHVRAIKLFTESINSKCPYVAHRCPS-YQHFLQGR 337
           L E P    L++  ++E  RV  A           E + +     A    S  Q  L+GR
Sbjct: 196 LLENPEARELLRRLMDEVVRVARAEGVDLPDDEVEELVRAVIRMTAENTSSMLQDLLRGR 255


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 29.3 bits (66), Expect = 4.5
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 2   EKKDKEREKERDREEEKGEREKEMRK 27
           +K  K  E+++ +E EK   E E+R+
Sbjct: 102 KKLQKLLEEKQKQEREKEREEAELRQ 127


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 28.9 bits (65), Expect = 4.6
 Identities = 12/45 (26%), Positives = 25/45 (55%), Gaps = 4/45 (8%)

Query: 1  MEKKDKEREKERDREEE--KGEREKEMRKSSDVLYI--FYIVENE 41
           E+++ E E+E+ +EEE  K   E+  ++  +   +   ++VE E
Sbjct: 43 KEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEE 87


>gnl|CDD|179997 PRK05293, glgC, glucose-1-phosphate adenylyltransferase;
           Provisional.
          Length = 380

 Score = 29.5 bits (67), Expect = 4.6
 Identities = 11/21 (52%), Positives = 15/21 (71%), Gaps = 3/21 (14%)

Query: 440 DVGTINKVEFYWEYDMDVLQP 460
           DVGTI   E  WE +M++L+P
Sbjct: 243 DVGTI---ESLWEANMELLRP 260


>gnl|CDD|233733 TIGR02117, chp_urease_rgn, conserved hypothetical protein.  This
           conserved hypothetical protein of unknown function is
           found in several Proteobacteria. Its function is unknown
           and its genome context is not well-conserved. It is
           found amid urease genes in at least one species
           [Hypothetical proteins, Conserved].
          Length = 208

 Score = 28.7 bits (64), Expect = 6.2
 Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 4/47 (8%)

Query: 392 RAETWVQGFMRVSL----HSDNGVIRNLDLTPNGYEKLEHGTSRSFV 434
           R+ T     + V      +     ++ L ++ N Y +L    S SFV
Sbjct: 99  RSLTADMSVLHVDGQPERYEAQSEVKRLLVSENQYNRLMDFISASFV 145


>gnl|CDD|223635 COG0561, Cof, Predicted hydrolases of the HAD superfamily [General
           function prediction only].
          Length = 264

 Score = 28.9 bits (65), Expect = 6.2
 Identities = 19/132 (14%), Positives = 32/132 (24%), Gaps = 25/132 (18%)

Query: 70  LPRAVIDTRFILYTRKNPTEGHMLKVQNERTIEKSNFDPKKKTKFIIHGFIDTPLSSWVK 129
           +   +     I  T+K  T           +         +  K I        L     
Sbjct: 101 IALVLYTDDGIYLTKKRGTFAEARIGFANLSPVGREAAELEDNKIIALDKDHEILE---- 156

Query: 130 EMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLEL-------AYFVNYLKDNYG 182
           E+   L K      + V  +G                 L++        Y +  L    G
Sbjct: 157 ELVEALRKRFPDLGLTVSSSGPIS--------------LDITPKGVSKGYALQRLAKLLG 202

Query: 183 LNPADVHMIGHS 194
           +   +V   G S
Sbjct: 203 IKLEEVIAFGDS 214


>gnl|CDD|213298 cd05932, LC_FACS_bac, Bacterial long-chain fatty acid CoA
           synthetase (LC-FACS), including Marinobacter
           hydrocarbonoclasticus isoprenoid Coenzyme A synthetase. 
           The members of this family are bacterial long-chain
           fatty acid CoA synthetase. Marinobacter
           hydrocarbonoclasticus isoprenoid Coenzyme A synthetase
           in this family is involved in the synthesis of
           isoprenoid wax ester storage compounds when grown on
           phytol as the sole carbon source. LC-FACS catalyzes the
           formation of fatty acyl-CoA in a two-step reaction: the
           formation of a fatty acyl-AMP molecule as an
           intermediate, and the formation of a fatty acyl-CoA.
           Free fatty acids must be "activated" to their CoA
           thioesters before participating in most catabolic and
           anabolic reactions.
          Length = 504

 Score = 29.2 bits (66), Expect = 6.2
 Identities = 16/57 (28%), Positives = 22/57 (38%), Gaps = 19/57 (33%)

Query: 254 GYGMSETCGHLDF-YPNN------GKEQPGCDLTETPLPLTLIKEGIEEAGRVLVAC 303
           GYGM+E   +    YP +      GK  PG +L             I E G +L+  
Sbjct: 305 GYGMTENFAYSTLNYPGDRRIGTVGKPIPGVELK------------ISEDGEILIRS 349


>gnl|CDD|205067 pfam12771, SusD-like_2, Starch-binding associating with outer
           membrane.  SusD is a secreted starch-binding protein
           with an N-terminal lipid tail that allows it to
           associate with the outer membrane.
          Length = 488

 Score = 29.2 bits (65), Expect = 6.5
 Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 2/61 (3%)

Query: 200 AGYAGQAIEGLGRITGLDPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSIFLLGYGMSE 259
               G+ I       G DPA PY   MP + + D +  +  D   TD     +L +G +E
Sbjct: 281 LKGDGKYIGDFSGYEGGDPAAPYGYSMPKYEQGDISPIN--DRFRTDPIVEPMLLFGYAE 338

Query: 260 T 260
           T
Sbjct: 339 T 339


>gnl|CDD|223477 COG0400, COG0400, Predicted esterase [General function prediction
           only].
          Length = 207

 Score = 28.4 bits (64), Expect = 6.9
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 8/80 (10%)

Query: 169 ELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEG-LGRITGLDPAEPYFQGMP 227
           +LA F+  L + YG++ + + +IG S GA+ A   G  + G              F GM 
Sbjct: 82  KLAEFLEELAEEYGIDSSRIILIGFSQGANIALSLGLTLPGLFAGAIL-------FSGML 134

Query: 228 PFARLDPTDADLVDVIHTDG 247
           P       D     ++ + G
Sbjct: 135 PLEPELLPDLAGTPILLSHG 154


>gnl|CDD|181184 PRK07967, PRK07967, 3-oxoacyl-(acyl carrier protein) synthase I;
           Reviewed.
          Length = 406

 Score = 28.9 bits (65), Expect = 7.7
 Identities = 16/46 (34%), Positives = 18/46 (39%), Gaps = 11/46 (23%)

Query: 190 MIGHSLGA---HTAGYA-----GQAIEGLGRITGLDPAEPYFQGMP 227
           + GHSLGA     A Y+        I     I  LDP      GMP
Sbjct: 329 LTGHSLGAAGVQEAIYSLLMMEHGFIAPSANIEELDPQ---AAGMP 371


>gnl|CDD|238856 cd01758, PLAT_LPL, PLAT/ LH2 domain present in lipoprotein lipase
           (LPL).  LPL is a key enzyme in catabolism of plasma
           lipoprotein triglycerides (TGs) and has therefeore has a
           profound influence on triglyceride and high-density
           lipoprotein (HDL) cholesterol levels in the blood. In
           general, PLAT/LH2 domain's proposed function is to
           mediate interaction with lipids or membrane bound
           proteins.
          Length = 137

 Score = 27.7 bits (62), Expect = 7.8
 Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 2/52 (3%)

Query: 401 MRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHPEDVGTINKVEFYWE 452
             +SL+   G   NL LT    E +    + SF++T  +D+G +  ++  WE
Sbjct: 22  FTISLYGTLGESENLPLTLP--EGITGNKTNSFLITTEKDIGDLLMLKLKWE 71


>gnl|CDD|227388 COG5055, RAD52, Recombination DNA repair protein (RAD52 pathway)
           [DNA replication, recombination, and repair].
          Length = 375

 Score = 28.7 bits (64), Expect = 8.2
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 95  VQNERTIEKSNFDPKKKTKFIIH 117
           VQNER I +S FDPK +++ I  
Sbjct: 324 VQNERYIGESIFDPKYQSQSIRK 346


>gnl|CDD|234109 TIGR03105, gln_synth_III, glutamine synthetase, type III.  This
           family consists of the type III isozyme of glutamine
           synthetase, originally described in Rhizobium meliloti,
           where types I and II also occur.
          Length = 435

 Score = 28.9 bits (65), Expect = 8.4
 Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 5/48 (10%)

Query: 188 VHMIGHSLGAHTAGYAGQAIEGLGRITGLDPAEPYFQGMPPFARLDPT 235
              I H +    AG+AG A  GL    G  PA+P    +P    L   
Sbjct: 31  AEAIDH-MATGGAGFAGFAAWGL----GQSPADPDLMAIPDLDSLTQL 73


>gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit
           dihydrolipoyllysine-residue acetyltransferase;
           Provisional.
          Length = 371

 Score = 28.8 bits (65), Expect = 8.5
 Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 169 ELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEGLGRITGL 216
           ELA  V    D  G+  A  H++GHS+G   A     A     R+  L
Sbjct: 182 ELAAAVLAFLDALGIERA--HLVGHSMGGAVA--LRLAARAPQRVASL 225


>gnl|CDD|182945 PRK11071, PRK11071, esterase YqiA; Provisional.
          Length = 190

 Score = 28.0 bits (63), Expect = 8.8
 Identities = 24/91 (26%), Positives = 34/91 (37%), Gaps = 25/91 (27%)

Query: 116 IHGFIDTPLSSWVKEMRRELLKH-SDWNVIVVDWAGGSLPLYTQATA------------- 161
           +HGF  +P S+    ++  L +H  D  +IV       LP Y    A             
Sbjct: 7   LHGFNSSPRSAKATLLKNWLAQHHPDIEMIV-----PQLPPYPADAAELLESLVLEHGGD 61

Query: 162 NTRLVGLEL-AYFVNYLKDNYGL-----NPA 186
              LVG  L  Y+  +L   + L     NPA
Sbjct: 62  PLGLVGSSLGGYYATWLSQCFMLPAVVVNPA 92


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 28.2 bits (63), Expect = 8.9
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 2   EKKDKEREKERDREEEKGEREKEMRKSSDVL 32
           EK  KER +E++REEE+   E+E  +  ++L
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKAREEEIL 182


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 28.9 bits (64), Expect = 9.2
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 2   EKKDKEREKERDREEEKGERE 22
           EKK+KE+E+++D+++E    +
Sbjct: 206 EKKEKEKERDKDKKKEVEGFK 226


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 27.7 bits (62), Expect = 9.6
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 2  EKKDKEREKERDREEEKGEREKEMRK 27
          E+K +E+E+E++ +E K E+EKE  K
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGK 40


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.138    0.430 

Gapped
Lambda     K      H
   0.267   0.0893    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,511,698
Number of extensions: 2699545
Number of successful extensions: 3546
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3477
Number of HSP's successfully gapped: 77
Length of query: 531
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 430
Effective length of database: 6,457,848
Effective search space: 2776874640
Effective search space used: 2776874640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.0 bits)