RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 021865
         (306 letters)



>gnl|CDD|215522 PLN02968, PLN02968, Probable N-acetyl-gamma-glutamyl-phosphate
           reductase.
          Length = 381

 Score =  450 bits (1158), Expect = e-159
 Identities = 202/262 (77%), Positives = 225/262 (85%)

Query: 37  SSIRGSATLPTKSLQVEDGKTQKSEKQVRIGLLGASGYTGAEIVRLLANHPYFGIKLMTA 96
            + R S T   +S+      + KSE++ RI +LGASGYTGAE+ RLLANHP F I +MTA
Sbjct: 12  LASRASVTSSPQSVVSSASSSVKSEEKKRIFVLGASGYTGAEVRRLLANHPDFEITVMTA 71

Query: 97  DRKAGQSIGSVFPHLISQDLPTMVAVKDADFSNVDAVFCCLPHGTTQEIIKGLPKSLKIV 156
           DRKAGQS GSVFPHLI+QDLP +VAVKDADFS+VDAVFCCLPHGTTQEIIK LPK LKIV
Sbjct: 72  DRKAGQSFGSVFPHLITQDLPNLVAVKDADFSDVDAVFCCLPHGTTQEIIKALPKDLKIV 131

Query: 157 DLSADFRLRDVSEYEEWYGQPHIAPDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQ 216
           DLSADFRLRD++EYEEWYG PH AP+LQKEAVYGLTE+ RE+IK+ARLVANPGCYPT IQ
Sbjct: 132 DLSADFRLRDIAEYEEWYGHPHRAPELQKEAVYGLTELQREEIKSARLVANPGCYPTGIQ 191

Query: 217 LPLVPLIQANLIQYRNIIIDAKSGVSGAGRGAKEANLYSEIAEGIYSYGVTRHRHVPEIE 276
           LPLVPL++A LI+  NIIIDAKSGVSGAGRGAKEANLY+EIAEGI +YGVTRHRHVPEIE
Sbjct: 192 LPLVPLVKAGLIEPDNIIIDAKSGVSGAGRGAKEANLYTEIAEGIGAYGVTRHRHVPEIE 251

Query: 277 QGLTGFASSKVTVSFTPHLMPM 298
           QGL   A SKVT SFTPHLMPM
Sbjct: 252 QGLADAAGSKVTPSFTPHLMPM 273


>gnl|CDD|234761 PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase;
           Validated.
          Length = 343

 Score =  356 bits (917), Expect = e-123
 Identities = 121/239 (50%), Positives = 159/239 (66%), Gaps = 4/239 (1%)

Query: 62  KQVRIGLLGASGYTGAEIVRLLANHPYFGIKLMTADRKAGQSIGSVFPHLISQDLPTMVA 121
             +++G++GASGYTG E++RLL NHP   I  +T+   AG+ +  V PHL       +  
Sbjct: 1   MMIKVGIVGASGYTGGELLRLLLNHPEVEIVAVTSRSSAGKPLSDVHPHLRGLVDLVLEP 60

Query: 122 VKDADFSNVDAVFCCLPHGTTQEIIKGLPKS-LKIVDLSADFRLRDVSEYEEWYGQPHIA 180
           +     +  D VF  LPHG + ++   L ++ +K++DLSADFRL+D   YE+WYG  H A
Sbjct: 61  LDPEILAGADVVFLALPHGVSMDLAPQLLEAGVKVIDLSADFRLKDPEVYEKWYGFEHAA 120

Query: 181 PDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLVPLIQANLIQYRNIIIDAKSG 240
           P+L KEAVYGL E++RE+IK ARL+ANPGCYPT+  L L PL++A LI   +IIIDAKSG
Sbjct: 121 PELLKEAVYGLPELNREEIKGARLIANPGCYPTASLLALAPLLKAGLIDPDSIIIDAKSG 180

Query: 241 VSGAGRGAKEANLYSEIAEGIYSYGVTRHRHVPEIEQGLTGFASSKVTVSFTPHLMPMV 299
           VSGAGR A E  L+SE+ E +  Y V  HRH PEIEQ L+  A     VSFTPHL+PM 
Sbjct: 181 VSGAGRKASEGTLFSEVNENLRPYKVGGHRHTPEIEQELSALAG---EVSFTPHLVPMT 236


>gnl|CDD|233597 TIGR01850, argC, N-acetyl-gamma-glutamyl-phosphate reductase,
           common form.  This model represents the more common of
           two related families of
           N-acetyl-gamma-glutamyl-phosphate reductase, an enzyme
           catalyzing the third step or Arg biosynthesis from Glu.
           The two families differ by phylogeny, similarity
           clustering, and the gap architecture in a multiple
           sequence alignment. Bacterial members of this family
           tend to be found within Arg biosynthesis operons [Amino
           acid biosynthesis, Glutamate family].
          Length = 346

 Score =  351 bits (902), Expect = e-121
 Identities = 119/239 (49%), Positives = 157/239 (65%), Gaps = 3/239 (1%)

Query: 64  VRIGLLGASGYTGAEIVRLLANHPYFGIKLMTADRK-AGQSIGSVFPHLISQDLPTMVAV 122
           +++ ++GASGYTG E++RLL NHP   I  + + R+ AG+ +  V PHL       +  +
Sbjct: 1   IKVAIVGASGYTGGELLRLLLNHPEVEITYLVSSRESAGKPVSEVHPHLRGLVDLNLEPI 60

Query: 123 KDADFS-NVDAVFCCLPHGTTQEIIKG-LPKSLKIVDLSADFRLRDVSEYEEWYGQPHIA 180
              +   + D VF  LPHG + E+    L   +K++DLSADFRL+D   YE+WYG  H  
Sbjct: 61  DVEEILEDADVVFLALPHGVSAELAPELLAAGVKVIDLSADFRLKDPELYEKWYGFEHAG 120

Query: 181 PDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLVPLIQANLIQYRNIIIDAKSG 240
           P+L ++AVYGL E+ RE+IK ARL+ANPGCYPT+  L L PL++  LI   +II+DAKSG
Sbjct: 121 PELLQKAVYGLPELHREEIKGARLIANPGCYPTATLLALAPLLKEGLIDPTSIIVDAKSG 180

Query: 241 VSGAGRGAKEANLYSEIAEGIYSYGVTRHRHVPEIEQGLTGFASSKVTVSFTPHLMPMV 299
           VSGAGR A EAN + E+ E +  Y VT HRH PEIEQ L   A  KV VSFTPHL+PM 
Sbjct: 181 VSGAGRKASEANHFPEVNENLRPYKVTGHRHTPEIEQELGRLAGGKVKVSFTPHLVPMT 239


>gnl|CDD|223081 COG0002, ArgC, Acetylglutamate semialdehyde dehydrogenase [Amino
           acid transport and metabolism].
          Length = 349

 Score =  298 bits (765), Expect = e-100
 Identities = 114/240 (47%), Positives = 157/240 (65%), Gaps = 4/240 (1%)

Query: 64  VRIGLLGASGYTGAEIVRLLANHPYFGIKLMTADRKAGQSIGSVFPHLIS-QDLP-TMVA 121
           +++G++GASGYTG E++RLLA HP   + L+++  +AG+ +  V P+L    DLP   + 
Sbjct: 3   IKVGIVGASGYTGLELLRLLAGHPDVELILISSRERAGKPVSDVHPNLRGLVDLPFQTID 62

Query: 122 VKDADFSNVDAVFCCLPHGTTQEII-KGLPKSLKIVDLSADFRLRDVSEYEEWYGQPHIA 180
            +  +    D VF  LPHG + E++ + L    K++DLSADFRL+D   YE+WYG  H  
Sbjct: 63  PEKIELDECDVVFLALPHGVSAELVPELLEAGCKVIDLSADFRLKDPEVYEKWYGFTHAG 122

Query: 181 PDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLVPLIQANLIQY-RNIIIDAKS 239
           P+L ++AVYGL E+ RE I+ A+L+ANPGCYPT+  L L PL++A L+      I+DAKS
Sbjct: 123 PELLEDAVYGLPELHREKIRGAKLIANPGCYPTAAILALAPLVKAGLLDPDSPPIVDAKS 182

Query: 240 GVSGAGRGAKEANLYSEIAEGIYSYGVTRHRHVPEIEQGLTGFASSKVTVSFTPHLMPMV 299
           GVSGAGR A   N + E+ + +  YG+T HRH PEIEQ L   A  KV V FTPHL P V
Sbjct: 183 GVSGAGRKASVKNHFPEVNDSLRPYGLTGHRHTPEIEQHLGRLAGRKVPVIFTPHLGPFV 242


>gnl|CDD|214863 smart00859, Semialdhyde_dh, Semialdehyde dehydrogenase, NAD binding
           domain.  The semialdehyde dehydrogenase family is found
           in N-acetyl-glutamine semialdehyde dehydrogenase (AgrC),
           which is involved in arginine biosynthesis, and
           aspartate-semialdehyde dehydrogenase, an enzyme involved
           in the biosynthesis of various amino acids from
           aspartate. This family is also found in yeast and fungal
           Arg5,6 protein, which is cleaved into the enzymes
           N-acety-gamma-glutamyl-phosphate reductase and
           acetylglutamate kinase. These are also involved in
           arginine biosynthesis. All proteins in this entry
           contain a NAD binding region of semialdehyde
           dehydrogenase.
          Length = 123

 Score =  124 bits (314), Expect = 1e-35
 Identities = 46/143 (32%), Positives = 72/143 (50%), Gaps = 25/143 (17%)

Query: 65  RIGLLGASGYTGAEIVRLLANHPYFGIKLMTA-DRKAGQSIGSVFPHLISQDLPTMVAVK 123
           ++ ++GA+GY G E++RLLA HP F +  + A  R AG+ +    PHL   ++   +   
Sbjct: 1   KVAIVGATGYVGQELLRLLAEHPDFELTALAASSRSAGKKVSEAGPHL-KGEVVLELDPP 59

Query: 124 DADFSNVDAVFCCLPHGTTQEIIKGLPKSL----KIVDLSADFRLRDVSEYEEWYGQPHI 179
           D +   VD VF  LPHG ++E    LP++      ++DLS+ FR+ D             
Sbjct: 60  DFEELAVDIVFLALPHGVSKESAPLLPRAAAAGAVVIDLSSAFRMDD------------- 106

Query: 180 APDLQKEAVYGLTEISREDIKNA 202
                 +  YGL E++ E IK A
Sbjct: 107 ------DVPYGLPEVNPEAIKKA 123


>gnl|CDD|201603 pfam01118, Semialdhyde_dh, Semialdehyde dehydrogenase, NAD binding
           domain.  This Pfam entry contains the following members:
           N-acetyl-glutamine semialdehyde dehydrogenase (AgrC)
           Aspartate-semialdehyde dehydrogenase.
          Length = 121

 Score =  115 bits (291), Expect = 3e-32
 Identities = 43/140 (30%), Positives = 69/140 (49%), Gaps = 21/140 (15%)

Query: 65  RIGLLGASGYTGAEIVRLLANHPYFGIKLMTA-DRKAGQSIGSVFPHLISQDLPTMVAVK 123
           ++ ++GA+GY G E++RLLA HP   +  + A  R AG+ +   +P L       +  V 
Sbjct: 1   KVAIVGATGYVGQELLRLLAEHPPLELVALVASSRSAGKKVAFAYPILEGGKDLLLEDVD 60

Query: 124 DADFSNVDAVFCCLPHGTTQEIIKGLPKS-LKIVDLSADFRLRDVSEYEEWYGQPHIAPD 182
             D  +VD VF  LP G ++E+   L ++   ++DLS+ FR+ D                
Sbjct: 61  PEDLKDVDIVFLALPAGVSKELAPKLLEAGAVVIDLSSAFRMDD---------------- 104

Query: 183 LQKEAVYGLTEISREDIKNA 202
              +  Y L E++RE IK A
Sbjct: 105 ---DVPYVLPEVNREAIKKA 121


>gnl|CDD|237004 PRK11863, PRK11863, N-acetyl-gamma-glutamyl-phosphate reductase;
           Provisional.
          Length = 313

 Score = 85.2 bits (212), Expect = 5e-19
 Identities = 65/225 (28%), Positives = 93/225 (41%), Gaps = 55/225 (24%)

Query: 70  GASGYTGAEIVRLLANHPYFGIKLMT---ADRKAGQSIGSVFPHLISQDLPTMVAVKDAD 126
           G +G TG +I   LA      I+L++   A RK               D     A +   
Sbjct: 9   GEAGTTGLQIRERLAGRS--DIELLSIPEAKRK---------------D----AAARREL 47

Query: 127 FSNVDAVFCCLPHGTTQEIIKGLP-KSLKIVDLSADFRLRDVSEYEEWYGQPHIAPDLQK 185
            +  D    CLP    +E +  +   + +++D S   R                AP    
Sbjct: 48  LNAADVAILCLPDDAAREAVALIDNPATRVIDASTAHR---------------TAPGW-- 90

Query: 186 EAVYGLTEIS---REDIKNARLVANPGCYPT-SIQLPLVPLIQANLIQ-YRNIIIDAKSG 240
             VYG  E++   RE I  A+ VANPGCYPT +I L L PL+ A L+     + I+A SG
Sbjct: 91  --VYGFPELAPGQRERIAAAKRVANPGCYPTGAIAL-LRPLVDAGLLPADYPVSINAVSG 147

Query: 241 VSGAGRGAKEA--NLYSEIAEGIYSYGVT-RHRHVPEIEQ--GLT 280
            SG G+    A        A     YG+   H+H+PE++   GL 
Sbjct: 148 YSGGGKAMIAAYEAAPDGKAPAFRLYGLGLAHKHLPEMQAHAGLA 192


>gnl|CDD|233220 TIGR00978, asd_EA, aspartate-semialdehyde dehydrogenase
           (non-peptidoglycan organisms).  Two closely related
           families of aspartate-semialdehyde dehydrogenase are
           found. They differ by a deep split in phylogenetic and
           percent identity trees and in gap patterns. Separate
           models are built for the two types in order to exclude
           the USG-1 protein, found in several species, which is
           specifically related to the Bacillus subtilis type of
           aspartate-semialdehyde dehydrogenase. Members of this
           type are found primarily in organisms that lack
           peptidoglycan [Amino acid biosynthesis, Aspartate
           family].
          Length = 341

 Score = 81.0 bits (200), Expect = 2e-17
 Identities = 57/192 (29%), Positives = 92/192 (47%), Gaps = 21/192 (10%)

Query: 64  VRIGLLGASGYTGAEIVRLLANHPYFGI-KLMTADRKAGQSIGSVFPHLISQDLP----- 117
           +R+ +LGA+G  G + V+LLA HPYF + K++ + R AG+  G     +   D+P     
Sbjct: 1   MRVAVLGATGLVGQKFVKLLAKHPYFELAKVVASPRSAGKRYGEAVKWIEPGDMPEYVRD 60

Query: 118 -TMVAVKDADFSNVDAVFCCLPHGTTQEIIKGLPKSLKIVDLSA-DFRLR-DVSEYEEWY 174
             +V  +     +VD VF  LP    +E+   L ++ K V  +A + R+  DV       
Sbjct: 61  LPIVEPEPIAEDDVDIVFSALPSEVAEEVEPKLAEAGKPVFSNASNHRMDPDV------- 113

Query: 175 GQPHIAPDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLVPLIQANLIQYRNII 234
             P I P++  + +  L ++ +E      +V NP C    + L L PLI A  I  + + 
Sbjct: 114 --PLIIPEVNSDHL-ELLKVQKERGWKGFIVTNPNCTTAGLTLALKPLIDAFGI--KKVH 168

Query: 235 IDAKSGVSGAGR 246
           +     VSGAG 
Sbjct: 169 VTTMQAVSGAGY 180


>gnl|CDD|236329 PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed.
          Length = 349

 Score = 76.8 bits (190), Expect = 7e-16
 Identities = 52/192 (27%), Positives = 89/192 (46%), Gaps = 23/192 (11%)

Query: 64  VRIGLLGASGYTGAEIVRLLANHPYFGIKLMTA-DRKAGQSIGSVFPHLISQDLP----- 117
           +++G+LGA+G  G   V+LLANHP+F +  + A +R AG++ G      +   +P     
Sbjct: 4   LKVGILGATGMVGQRFVQLLANHPWFEVTALAASERSAGKTYGEAVRWQLDGPIPEEVAD 63

Query: 118 -TMVAVKDADFSNVDAVFCCLPHGTTQEIIKGLPKSLKIV-DLSADFRLR-DVSEYEEWY 174
             +V+       +VD VF  LP     E+ +   K+ K V   ++  R+  DV       
Sbjct: 64  MEVVSTDPEAVDDVDIVFSALPSDVAGEVEEEFAKAGKPVFSNASAHRMDPDV------- 116

Query: 175 GQPHIAPDLQKEAVYGLTEISRE-DIKNARLVANPGCYPTSIQLPLVPLIQANLIQYRNI 233
             P + P++  E +  L E+ R+    +  +V NP C    + L L PL+   +     +
Sbjct: 117 --PLVIPEVNPEHL-ELIEVQRKRRGWDGFIVTNPNCSTIGLVLALKPLMDFGI---ERV 170

Query: 234 IIDAKSGVSGAG 245
            +     +SGAG
Sbjct: 171 HVTTMQAISGAG 182


>gnl|CDD|233598 TIGR01851, argC_other, N-acetyl-gamma-glutamyl-phosphate reductase,
           uncommon form.  This model represents the less common of
           two related families of
           N-acetyl-gamma-glutamyl-phosphate reductase, an enzyme
           catalyzing the third step or Arg biosynthesis from Glu.
           The two families differ by phylogeny, similarity
           clustering, and gap architecture in a multiple sequence
           alignment [Amino acid biosynthesis, Glutamate family].
          Length = 310

 Score = 72.2 bits (177), Expect = 2e-14
 Identities = 56/185 (30%), Positives = 82/185 (44%), Gaps = 36/185 (19%)

Query: 123 KDAD-----FSNVDAVFCCLPHGTTQEIIKGLPKS-LKIVDLSADFRLRDVSEYEEW-YG 175
           KDA       +  D    CLP    +E +  +      I+D S  +R  D     +W YG
Sbjct: 38  KDAAERAKLLNAADVAILCLPDDAAREAVSLVDNPNTCIIDASTAYRTAD-----DWAYG 92

Query: 176 QPHIAPDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLVPLIQANLIQYR-NII 234
            P +AP              RE I+N++ +ANPGCYPT     + PL++A ++     I 
Sbjct: 93  FPELAPG------------QREKIRNSKRIANPGCYPTGFIALMRPLVEAGILPADFPIT 140

Query: 235 IDAKSGVSGAGRG------AKEANLYSEIAEGIYSYGVTRHRHVPEIEQGLTGFASSKVT 288
           I+A SG SG G+          A+  S     IY   +T H+H+PE+    +G A   + 
Sbjct: 141 INAVSGYSGGGKAMIADYEQGSADNPSLQPFRIYGLALT-HKHLPEMRV-HSGLALPPI- 197

Query: 289 VSFTP 293
             FTP
Sbjct: 198 --FTP 200


>gnl|CDD|223214 COG0136, Asd, Aspartate-semialdehyde dehydrogenase [Amino acid
           transport and metabolism].
          Length = 334

 Score = 66.1 bits (162), Expect = 3e-12
 Identities = 52/205 (25%), Positives = 86/205 (41%), Gaps = 38/205 (18%)

Query: 65  RIGLLGASGYTGAEIVRLLAN-HPYFGIKLMTA-DRKAGQSIGSVFPHLISQDLPTMVAV 122
            + +LGA+G  G  ++ LL   H  F   ++ A  R AG+          S  +P   A 
Sbjct: 3   NVAVLGATGAVGQVLLELLEERHFPFEELVLLASARSAGKKYIEF--GGKSIGVPEDAA- 59

Query: 123 KDADFSNVDAVFCCLPHGTTQEIIKGLPKSLKIV-DLSADFRL-RDVSEYEEWYGQPHIA 180
            +  FS+VD VF       ++E+     ++  +V D S+ FR+  DV         P + 
Sbjct: 60  DEFVFSDVDIVFFAAGGSVSKEVEPKAAEAGCVVIDNSSAFRMDPDV---------PLVV 110

Query: 181 PDLQKEAVYGLTEISREDIKNAR----LVANPGCYPTSIQL--PLVPLIQANLIQYRNII 234
           P           E++ E + + +    ++ANP C  ++IQL   L PL  A  I  + ++
Sbjct: 111 P-----------EVNPEHLIDYQKRGFIIANPNC--STIQLVLALKPLHDAFGI--KRVV 155

Query: 235 IDAKSGVSGAGRGAKEANLYSEIAE 259
           +     VSGAG       L  +   
Sbjct: 156 VSTYQAVSGAGAEGGVE-LAGQTDA 179


>gnl|CDD|217222 pfam02774, Semialdhyde_dhC, Semialdehyde dehydrogenase,
           dimerisation domain.  This Pfam entry contains the
           following members: N-acetyl-glutamine semialdehyde
           dehydrogenase (AgrC) Aspartate-semialdehyde
           dehydrogenase.
          Length = 167

 Score = 58.9 bits (143), Expect = 1e-10
 Identities = 25/77 (32%), Positives = 34/77 (44%), Gaps = 2/77 (2%)

Query: 219 LVPLIQANLIQYRNIIIDAKSGVSGAGRGAKEANLYSEIAEGIYSYGVTR-HRHVPEIEQ 277
           L PL  A L     +I+D    VSGAG+ AK     + IA+ +  Y     H   PE  +
Sbjct: 1   LKPLRDA-LGGLERVIVDTYQAVSGAGKKAKPGVFGAPIADNLIPYIDGEEHNGTPETRE 59

Query: 278 GLTGFASSKVTVSFTPH 294
            L     +K  + FTP 
Sbjct: 60  ELKMVNETKKILGFTPK 76


>gnl|CDD|233347 TIGR01296, asd_B, aspartate-semialdehyde dehydrogenase
           (peptidoglycan organisms).  Two closely related families
           of aspartate-semialdehyde dehydrogenase are found. They
           differ by a deep split in phylogenetic and percent
           identity trees and in gap patterns. This model
           represents a branch more closely related to the USG-1
           protein than to the other aspartate-semialdehyde
           dehydrogenases represented in model TIGR00978 [Amino
           acid biosynthesis, Aspartate family].
          Length = 338

 Score = 50.6 bits (121), Expect = 4e-07
 Identities = 46/202 (22%), Positives = 89/202 (44%), Gaps = 42/202 (20%)

Query: 65  RIGLLGASGYTGAEIVRLLA--NHPYFGIKLMTADRKAGQSIGSVFPHLISQDLPTMVAV 122
            + ++GA+G  G E+++LL   N P   + L+ + R AG+ +         ++L  +   
Sbjct: 1   NVAIVGATGAVGQEMLKLLEERNFPIDKLVLLASARSAGRKL-----TFKGKEL-EVEEA 54

Query: 123 KDADFSNVDAVFCCLPHGTTQEIIKGLPKSLK----IVDLSADFRL-RDVSEYEEWYGQP 177
           +   F  +D          ++E     PK+ K    ++D ++ FR+  DV         P
Sbjct: 55  ETESFEGIDIALFSAGGSVSKEFA---PKAAKAGVIVIDNTSAFRMDPDV---------P 102

Query: 178 HIAPDLQKEAVYGLTEISREDIKNAR---LVANPGCYPTSIQLPLVPLIQANLIQYRNII 234
            + P           E++ ED+K      ++ANP C  ++IQ+ +V     +  + + ++
Sbjct: 103 LVVP-----------EVNFEDLKEFNPKGIIANPNC--STIQMVVVLKPLHDEAKIKRVV 149

Query: 235 IDAKSGVSGAGRGAKEANLYSE 256
           +     VSGAG    E  LY++
Sbjct: 150 VSTYQAVSGAGNAGVE-ELYNQ 170


>gnl|CDD|178009 PLN02383, PLN02383, aspartate semialdehyde dehydrogenase.
          Length = 344

 Score = 47.5 bits (113), Expect = 3e-06
 Identities = 45/201 (22%), Positives = 79/201 (39%), Gaps = 35/201 (17%)

Query: 59  KSEKQVRIGLLGASGYTGAEIVRLLA--NHPYFGIKLMTADRKAGQSIGSVFPHLISQDL 116
            +E    + ++G +G  G E + +L   + PY  +K++ + R AG+ +         ++L
Sbjct: 3   LTENGPSVAIVGVTGAVGQEFLSVLTDRDFPYSSLKMLASARSAGKKVTFEGRDYTVEEL 62

Query: 117 PTMVAVKDADFSNVDAVF------CCLPHGTTQEIIKGLPKSLKIVDLSADFRLRDVSEY 170
                  +  F  VD              G        + K   +VD S+ FR+ +    
Sbjct: 63  ------TEDSFDGVDIALFSAGGSISKKFGPIA-----VDKGAVVVDNSSAFRMEE---- 107

Query: 171 EEWYGQPHIAPDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLV-PLIQANLIQ 229
               G P + P++  EA+       +       L+ANP C  T I L  V PL +    +
Sbjct: 108 ----GVPLVIPEVNPEAMKH----IKLGKGKGALIANPNC-STIICLMAVTPLHR--HAK 156

Query: 230 YRNIIIDAKSGVSGAGRGAKE 250
            + +++      SGAG  A E
Sbjct: 157 VKRMVVSTYQAASGAGAAAME 177


>gnl|CDD|136022 PRK06728, PRK06728, aspartate-semialdehyde dehydrogenase;
           Provisional.
          Length = 347

 Score = 42.7 bits (100), Expect = 1e-04
 Identities = 42/189 (22%), Positives = 76/189 (40%), Gaps = 28/189 (14%)

Query: 60  SEKQVRIGLLGASGYTGAEIVRLLANHPYFGIK---LMTADRKAGQSIGSVFPHLISQDL 116
           SEK   + ++GA+G  G +I+ LL     F I    L+++ R AG+++      +I Q+ 
Sbjct: 2   SEKGYHVAVVGATGAVGQKIIELLEKETKFNIAEVTLLSSKRSAGKTVQFKGREIIIQE- 60

Query: 117 PTMVAVKDADFSNVDAVFCCLPHGTTQEIIKGLPKSLKIVDLSADFRLRDVSEYEEWYGQ 176
                 K   F  VD  F       +++ +     S  IV       + + SEY   +  
Sbjct: 61  -----AKINSFEGVDIAFFSAGGEVSRQFVNQAVSSGAIV-------IDNTSEYRMAHDV 108

Query: 177 PHIAPDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLVPLIQANLIQYRNIIID 236
           P + P++    +           ++  ++A P C    +   L P+ +        II+ 
Sbjct: 109 PLVVPEVNAHTLK----------EHKGIIAVPNCSALQMVTALQPIRKV--FGLERIIVS 156

Query: 237 AKSGVSGAG 245
               VSG+G
Sbjct: 157 TYQAVSGSG 165


>gnl|CDD|187569 cd05259, PCBER_SDR_a, phenylcoumaran benzylic ether reductase
           (PCBER) like, atypical (a) SDRs.  PCBER and
           pinoresinol-lariciresinol reductases are NADPH-dependent
           aromatic alcohol reductases, and are atypical members of
           the SDR family. Other proteins in this subgroup are
           identified as eugenol synthase. These proteins contain
           an N-terminus characteristic of NAD(P)-binding proteins
           and a small C-terminal domain presumed to be involved in
           substrate binding, but they do not have the conserved
           active site Tyr residue typically found in SDRs.
           Numerous other members have unknown functions. The
           glycine rich NADP-binding motif in this subgroup is of 2
           forms: GXGXXG and G[GA]XGXXG; it tends to be atypical
           compared with the forms generally seen in classical or
           extended SDRs. The usual SDR active site tetrad is not
           present, but a critical active site Lys at the usual SDR
           position has been identified in various members, though
           other charged and polar residues are found at this
           position in this subgroup. Atypical SDR-related proteins
           retain the Rossmann fold of the SDRs, but have limited
           sequence identity and generally lack the catalytic
           properties of the archetypical members. Atypical SDRs
           include biliverdin IX beta reductase (BVR-B,aka flavin
           reductase), NMRa (a negative transcriptional regulator
           of various fungi), progesterone 5-beta-reductase like
           proteins, phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 282

 Score = 41.9 bits (99), Expect = 2e-04
 Identities = 22/86 (25%), Positives = 36/86 (41%), Gaps = 6/86 (6%)

Query: 65  RIGLLGASGYTGAEIVRLLANHPYFGIKLMTADRKAGQSIGSVFPH---LISQDLPTMVA 121
           +I + GA+G  G  IV  L   P F + ++T  R +  S     P    ++  D  +  +
Sbjct: 1   KIAIAGATGTLGGPIVSALLASPGFTVTVLT--RPSSTSSNEFQPSGVKVVPVDYASHES 58

Query: 122 VKDADFSNVDAVFCCLPHGTTQEIIK 147
           +  A    VDAV   L      + +K
Sbjct: 59  LVAA-LKGVDAVISALGGAAIGDQLK 83


>gnl|CDD|187560 cd05250, CC3_like_SDR_a, CC3(TIP30)-like, atypical (a) SDRs.
           Atypical SDRs in this subgroup include CC3 (also known
           as TIP30) which is implicated in tumor suppression.
           Atypical SDRs are distinct from classical SDRs. Members
           of this subgroup have a glycine rich NAD(P)-binding
           motif that resembles the extended SDRs, and have an
           active site triad of the SDRs (YXXXK and upstream Ser),
           although the upstream Asn of the usual SDR active site
           is substituted with Asp. For CC3, the Tyr of the triad
           is displaced compared to the usual SDRs and the protein
           is monomeric, both these observations suggest that the
           usual SDR catalytic activity is not present. NADP
           appears to serve an important role as a ligand, and may
           be important in the interaction with other
           macromolecules. Atypical SDRs generally lack the
           catalytic residues characteristic of the SDRs, and their
           glycine-rich NAD(P)-binding motif is often different
           from the forms normally seen in classical or extended
           SDRs. Atypical SDRs include biliverdin IX beta reductase
           (BVR-B,aka flavin reductase), NMRa (a negative
           transcriptional regulator of various fungi),
           progesterone 5-beta-reductase like proteins,
           phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 214

 Score = 40.7 bits (96), Expect = 4e-04
 Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 5/76 (6%)

Query: 68  LLGASGYTGAEIVRLLANHPYFGIKLMTADRKAGQ-SIGSVFPHLISQDLPTMVAVKDAD 126
           +LGA+G  G  ++R L   PY+  K+    R+             I  D   +    +A 
Sbjct: 5   VLGATGLVGKHLLRELLKSPYYS-KVTAIVRRKLTFPEAKEKLVQIVVDFERLDEYLEA- 62

Query: 127 FSNVDAVFCCLPHGTT 142
           F N D  FCCL  GTT
Sbjct: 63  FQNPDVGFCCL--GTT 76


>gnl|CDD|237845 PRK14874, PRK14874, aspartate-semialdehyde dehydrogenase;
           Provisional.
          Length = 334

 Score = 41.3 bits (98), Expect = 4e-04
 Identities = 48/202 (23%), Positives = 88/202 (43%), Gaps = 51/202 (25%)

Query: 64  VRIGLLGASGYTGAEIVRLLA--NHPYFGIKLMTADRKAGQSI---GSVFPHLISQDLPT 118
             + ++GA+G  G E++ +L   N P   ++L+ + R AG+ +   G        ++L  
Sbjct: 2   YNVAVVGATGAVGREMLNILEERNFPVDKLRLLASARSAGKELSFKG--------KEL-K 52

Query: 119 MVAVKDADFSNVDAVFCCLPHGTTQEIIKGLPKSLK----IVDLSADFRL-RDVSEYEEW 173
           +  +   DFS VD          +++     PK+      ++D S+ FR+  DV      
Sbjct: 53  VEDLTTFDFSGVDIALFSAGGSVSKKYA---PKAAAAGAVVIDNSSAFRMDPDV------ 103

Query: 174 YGQPHIAPDLQKEAVYGLTEISREDIKNAR---LVANPGCYPTSIQLPLV--PLIQANLI 228
              P + P           E++ E +   R   ++ANP C  ++IQ+ +   PL  A  I
Sbjct: 104 ---PLVVP-----------EVNPEALAEHRKKGIIANPNC--STIQMVVALKPLHDAAGI 147

Query: 229 QYRNIIIDAKSGVSGAGRGAKE 250
           +   +++     VSGAG+   E
Sbjct: 148 K--RVVVSTYQAVSGAGKAGME 167


>gnl|CDD|234595 PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional.
          Length = 257

 Score = 37.4 bits (88), Expect = 0.005
 Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 6/75 (8%)

Query: 64  VRIGLLGASGYTGAEIVRLLANHPYFGIKLMTA-DRKAGQSIGSVFPH-LISQDLPTMVA 121
           +++ + GASG  G E++  +       ++L+ A DR     +G       I+ DL  ++A
Sbjct: 2   IKVAVAGASGRMGRELIEAVEAAE--DLELVAAVDRPGSPLVGQGALGVAITDDLEAVLA 59

Query: 122 VKDA--DFSNVDAVF 134
             D   DF+  +A  
Sbjct: 60  DADVLIDFTTPEATL 74


>gnl|CDD|216304 pfam01113, DapB_N, Dihydrodipicolinate reductase, N-terminus.
           Dihydrodipicolinate reductase (DapB) reduces the
           alpha,beta-unsaturated cyclic imine,
           dihydro-dipicolinate. This reaction is the second
           committed step in the biosynthesis of L-lysine and its
           precursor meso-diaminopimelate, which are critical for
           both protein and cell wall biosynthesis. The N-terminal
           domain of DapB binds the dinucleotide NADPH.
          Length = 122

 Score = 34.1 bits (79), Expect = 0.020
 Identities = 15/71 (21%), Positives = 32/71 (45%), Gaps = 3/71 (4%)

Query: 64  VRIGLLGASGYTGAEIVRLLANHPYFGIKLMTA-DRKAGQSIGSVFPHLISQDLPTMVAV 122
           +++ ++GASG  G E+++ +   P    +L+ A DR     +GS    L       +   
Sbjct: 1   IKVAVVGASGRMGRELIKAILEAP--DFELVAAVDRPGSSLLGSDAGELAGPLGVPVTDD 58

Query: 123 KDADFSNVDAV 133
            +   ++ D +
Sbjct: 59  LEEVLADADVL 69


>gnl|CDD|222146 pfam13460, NAD_binding_10, NADH(P)-binding. 
          Length = 182

 Score = 34.6 bits (80), Expect = 0.030
 Identities = 25/142 (17%), Positives = 50/142 (35%), Gaps = 13/142 (9%)

Query: 66  IGLLGASGYTGAEIVRLLANHPYFGIKLMTADRKAGQSIGSVFPHLISQDLPTMVAVKDA 125
           I ++GA+G TG  +V+ L      G ++    R   ++        + +DL  +  +   
Sbjct: 1   IAVIGATGKTGRRLVKELLAR---GHQVTALSRNPSKAPAPGVTP-VQKDLFDLADL-AE 55

Query: 126 DFSNVDAVFCCLPHGT-----TQEIIKGLPKS--LKIVDLSADFRLRDVSEYEEWYGQPH 178
             + VDAV              + ++    ++   +IV +SA    RD          P 
Sbjct: 56  ALAGVDAVVDAFGARPDDSDGVKHLLDAAARAGVRRIVVVSAAGLYRDEPGTFRLDDAPL 115

Query: 179 IAPDLQKEAVYGLTEISREDIK 200
             P  + +A      +    + 
Sbjct: 116 FPPYARAKAA-AEELLRASGLD 136


>gnl|CDD|234294 TIGR03649, ergot_EASG, ergot alkaloid biosynthesis protein,
           AFUA_2G17970 family.  This family consists of fungal
           proteins of unknown function associated with secondary
           metabolite biosynthesis, such as of the ergot alkaloids
           such as ergovaline. Nomenclature differs because gene
           order differs - this is EasG in Neotyphodium lolii but
           is designated ergot alkaloid biosynthetic protein A in
           several other fungi.
          Length = 285

 Score = 33.1 bits (76), Expect = 0.14
 Identities = 21/81 (25%), Positives = 30/81 (37%), Gaps = 12/81 (14%)

Query: 66  IGLLGASGYTGAEIVRLL--ANHPYFGIKLMTADRKAGQSIGSVFPH-----LISQDLPT 118
           I L G +G T + I RLL  A+ P+     + A R +  S G    H     L       
Sbjct: 2   ILLTGGTGKTASRIARLLQAASVPF-----LVASRSSSSSAGPNEKHVKFDWLDEDTWDN 56

Query: 119 MVAVKDADFSNVDAVFCCLPH 139
             +  D     + AV+   P 
Sbjct: 57  PFSSDDGMEPEISAVYLVAPP 77


>gnl|CDD|223745 COG0673, MviM, Predicted dehydrogenases and related proteins
           [General function prediction only].
          Length = 342

 Score = 32.9 bits (75), Expect = 0.19
 Identities = 21/89 (23%), Positives = 36/89 (40%), Gaps = 8/89 (8%)

Query: 62  KQVRIGLLGASGYTGAEIVRLLANHPYFGIKLMTADRKA--GQSIGSVFPHLIS-QDLPT 118
           K +R+G++GA G  G   +  LA        +   DR     ++    F    +  DL  
Sbjct: 2   KMIRVGIIGAGGIAGKAHLPALAALGGGLELVAVVDRDPERAEAFAEEFGIAKAYTDLEE 61

Query: 119 MVAVKDADFSNVDAVFCCLPHGTTQEIIK 147
           ++A  D     +DAV+   P+    E+  
Sbjct: 62  LLADPD-----IDAVYIATPNALHAELAL 85


>gnl|CDD|234093 TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl
           glycosylphosphotransferase.  Members of this family are
           generally found near other genes involved in the
           biosynthesis of a variety of exopolysaccharides. These
           proteins consist of two fused domains, an N-terminal
           hydrophobic domain of generally low conservation and a
           highly conserved C-terminal sugar transferase domain
           (pfam02397). Characterized and partially characterized
           members of this subfamily include Salmonella WbaP
           (originally RfbP) , E. coli WcaJ , Methylobacillus EpsB,
           Xanthomonas GumD, Vibrio CpsA, Erwinia AmsG, Group B
           Streptococcus CpsE (originally CpsD), and Streptococcus
           suis Cps2E. Each of these is believed to act in
           transferring the sugar from, for instance, UDP-glucose
           or UDP-galactose, to a lipid carrier such as
           undecaprenyl phosphate as the first (priming) step in
           the synthesis of an oligosaccharide "block". This
           function is encoded in the C-terminal domain. The
           liposaccharide is believed to be subsequently
           transferred through a "flippase" function from the
           cytoplasmic to the periplasmic face of the inner
           membrane by the N-terminal domain. Certain closely
           related transferase enzymes such as Sinorhizobium ExoY
           and Lactococcus EpsD lack the N-terminal domain and are
           not found by this model.
          Length = 445

 Score = 32.2 bits (74), Expect = 0.31
 Identities = 25/121 (20%), Positives = 43/121 (35%), Gaps = 25/121 (20%)

Query: 65  RIGLLGASGYTGAEIVRLLANHPYFGIKLM--TADRKAGQSIGSVFPHLISQDLPTMVAV 122
           R+ ++G  G    E+ + L+ +P  G +++    DR + +   +  P L   D      V
Sbjct: 127 RVLIVGT-GEAAEELAKALSRNPALGYRVVGFVDDRPSDRVEVAGLPVLGKLDD----LV 181

Query: 123 KDADFSNVDAVFCCLPHGTTQEIIKGLPKSLKIVDLSADFRLRDVSEYEEWYGQPHIAPD 182
           +      VD V   LP          L +  +I+ L        V           + PD
Sbjct: 182 ELVRAHRVDEVIIALP----------LSEEARILRLLLQLEDLGVDVR--------LVPD 223

Query: 183 L 183
           L
Sbjct: 224 L 224


>gnl|CDD|223366 COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport
           and metabolism].
          Length = 266

 Score = 31.9 bits (73), Expect = 0.34
 Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 3/70 (4%)

Query: 64  VRIGLLGASGYTGAEIVRLLANHPYFGIKLMTA-DRKAGQSIGSVFPHLISQDLPTMVAV 122
           +++ + GASG  G  ++R +   P   ++L+ A DR    S+GS    L    L  +   
Sbjct: 3   IKVAVAGASGRMGRTLIRAVLEAP--DLELVAAFDRPGSLSLGSDAGELAGLGLLGVPVT 60

Query: 123 KDADFSNVDA 132
            D      DA
Sbjct: 61  DDLLLVKADA 70


>gnl|CDD|129992 TIGR00914, 2A0601, heavy metal efflux pump, CzcA family.  This
           model represents a family of H+/heavy metal cation
           antiporters. This family is one of several subfamilies
           within the scope of Pfam model pfam00873 [Cellular
           processes, Detoxification, Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 1051

 Score = 32.0 bits (73), Expect = 0.44
 Identities = 25/94 (26%), Positives = 36/94 (38%), Gaps = 9/94 (9%)

Query: 176 QPHIAPDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQ------LPLVPLIQANLIQ 229
           Q  +APD +K A YG   +S  D+ NA    N       I+      L   P    ++  
Sbjct: 193 QFLVAPDPEKLAAYG---LSLADVVNALERNNQNVGAGYIERRGEQYLVRAPGQVQSMDD 249

Query: 230 YRNIIIDAKSGVSGAGRGAKEANLYSEIAEGIYS 263
            RNI+I    GV    R      +  E+  G  +
Sbjct: 250 IRNIVIATGEGVPIRIRDVARVQIGKELRTGAAT 283


>gnl|CDD|221804 pfam12847, Methyltransf_18, Methyltransferase domain.  Protein in
           this family function as methyltransferases.
          Length = 104

 Score = 30.0 bits (68), Expect = 0.52
 Identities = 21/92 (22%), Positives = 33/92 (35%), Gaps = 11/92 (11%)

Query: 69  LGA-SGYTGAEIVRLLANHPYFGI-----KLMTADRKAGQSIGSVFPHLISQDLPTMVAV 122
           +G  +G    E+ RL       G+      L  A   A  ++G      +  D P  + +
Sbjct: 8   IGCGTGSLAIELARLFPGARVTGVDLSPEMLELARENAKLALGPRI-TFVQGDAPDALDL 66

Query: 123 KDADFSNVDAVFCCLPHGTTQEIIKGLPKSLK 154
            +      DAVF     G   E++  L   LK
Sbjct: 67  LE----GFDAVFIGGGGGDLLELLDALASLLK 94


>gnl|CDD|240645 cd12168, Mand_dh_like, D-Mandelate Dehydrogenase-like
           dehydrogenases.  D-Mandelate dehydrogenase (D-ManDH),
           identified as an enzyme that interconverts
           benzoylformate and D-mandelate, is a D-2-hydroxyacid
           dehydrogenase family member that catalyzes the
           conversion of c3-branched 2-ketoacids. D-ManDH exhibits
           broad substrate specificities for 2-ketoacids with large
           hydrophobic side chains, particularly those with
           C3-branched side chains. 2-hydroxyacid dehydrogenases
           catalyze the conversion of a wide variety of D-2-hydroxy
           acids to their corresponding keto acids. The general
           mechanism is (R)-lactate + acceptor to pyruvate +
           reduced acceptor. Glycerate dehydrogenase catalyzes the
           reaction (R)-glycerate + NAD+ to hydroxypyruvate + NADH
           + H+. Formate/glycerate and related dehydrogenases of
           the D-specific 2-hydroxyacid dehydrogenase superfamily
           include groups such as formate dehydrogenase, glycerate
           dehydrogenase, L-alanine dehydrogenase, and
           S-adenosylhomocysteine hydrolase. Despite often low
           sequence identity, these proteins typically have a
           characteristic arrangement of 2 similar subdomains of
           the alpha/beta Rossmann fold NAD+ binding form. The NAD+
           binding domain is inserted within the linear sequence of
           the mostly N-terminal catalytic domain, which has a
           similar domain structure to the internal NAD binding
           domain. Structurally, these domains are connected by
           extended alpha helices and create a cleft in which NAD
           is bound, primarily to the C-terminal portion of the 2nd
           (internal) domain.
          Length = 321

 Score = 31.0 bits (71), Expect = 0.74
 Identities = 12/43 (27%), Positives = 21/43 (48%), Gaps = 9/43 (20%)

Query: 121 AVKDADFSNVDAVFCCLPHGTTQ-------EIIKGLPKSLKIV 156
           A+K+  + +  A++     G+         E+I  LP SLKI+
Sbjct: 40  ALKEGKYGDFVAIYR--TFGSAGETGPFDEELISPLPPSLKII 80


>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
           recombination, and repair].
          Length = 1041

 Score = 30.8 bits (70), Expect = 0.95
 Identities = 12/36 (33%), Positives = 20/36 (55%), Gaps = 2/36 (5%)

Query: 143 QEIIKGLPKSLKIVDLSADFRLRDVSEYEEWYGQPH 178
           +E+I  LP  ++ V LSA   + +  E+ EW  + H
Sbjct: 251 EEVIILLPDHVRFVFLSAT--VPNAEEFAEWIQRVH 284


>gnl|CDD|216461 pfam01370, Epimerase, NAD dependent epimerase/dehydratase family.
           This family of proteins utilise NAD as a cofactor. The
           proteins in this family use nucleotide-sugar substrates
           for a variety of chemical reactions.
          Length = 233

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 17/68 (25%), Positives = 26/68 (38%), Gaps = 3/68 (4%)

Query: 68  LLGASGYTGAEIVRLLANHPYFGIKLMTADRKAGQSIGSVFPHLISQDLPTMVAVKDA-D 126
           + G +G+ G+ +VR L    Y  I L    R    S+ +        DL    A++    
Sbjct: 3   VTGGTGFIGSHLVRRLLQEGYEVIVLGRRRRSE--SLNTGRIRFHEGDLTDPDALERLLA 60

Query: 127 FSNVDAVF 134
               DAV 
Sbjct: 61  EVQPDAVI 68


>gnl|CDD|185068 PRK15113, PRK15113, glutathione S-transferase; Provisional.
          Length = 214

 Score = 29.9 bits (68), Expect = 1.2
 Identities = 31/97 (31%), Positives = 35/97 (36%), Gaps = 40/97 (41%)

Query: 124 DADFSN--VDAVFCCLPHGTTQEIIKGLPKSLKIVDLSA--------------------- 160
           DA F +  V + F  L     QE  KGLP  LK VDL A                     
Sbjct: 11  DAHFFSPYVMSAFVAL-----QE--KGLPFELKTVDLDAGEHLQPTYQGYSLTRRVPTLQ 63

Query: 161 --DFRLRDVS---EY-EEWYGQPHIAP----DLQKEA 187
             DF L + S   EY EE +  P        DLQ  A
Sbjct: 64  HDDFELSESSAIAEYLEERFAPPAWERIYPADLQARA 100


>gnl|CDD|187555 cd05244, BVR-B_like_SDR_a, biliverdin IX beta reductase (BVR-B,
          aka flavin reductase)-like proteins; atypical (a) SDRs.
           Human BVR-B catalyzes pyridine nucleotide-dependent
          production of bilirubin-IX beta during fetal
          development; in the adult BVR-B has flavin and ferric
          reductase activities. Human BVR-B catalyzes the
          reduction of FMN, FAD, and riboflavin. Recognition of
          flavin occurs mostly by hydrophobic interactions,
          accounting for the broad substrate specificity.
          Atypical SDRs are distinct from classical SDRs. BVR-B
          does not share the key catalytic triad, or conserved
          tyrosine typical of SDRs. The glycine-rich NADP-binding
          motif of BVR-B is GXXGXXG, which is similar but not
          identical to the pattern seen in extended SDRs.
          Atypical SDRs generally lack the catalytic residues
          characteristic of the SDRs, and their glycine-rich
          NAD(P)-binding motif is often different from the forms
          normally seen in classical or extended SDRs. Atypical
          SDRs include biliverdin IX beta reductase (BVR-B,aka
          flavin reductase), NMRa (a negative transcriptional
          regulator of various fungi), progesterone
          5-beta-reductase like proteins, phenylcoumaran benzylic
          ether and pinoresinol-lariciresinol reductases,
          phenylpropene synthases, eugenol synthase,
          triphenylmethane reductase, isoflavone reductases, and
          others. SDRs are a functionally diverse family of
          oxidoreductases that have a single domain with a
          structurally conserved Rossmann fold, an
          NAD(P)(H)-binding region, and a structurally diverse
          C-terminal region. Sequence identity between different
          SDR enzymes is typically in the 15-30% range; they
          catalyze a wide range of activities including the
          metabolism of steroids, cofactors, carbohydrates,
          lipids, aromatic compounds, and amino acids, and act in
          redox sensing. Classical SDRs have an TGXXX[AG]XG
          cofactor binding motif and a YXXXK active site motif,
          with the Tyr residue of the active site motif serving
          as a critical catalytic residue (Tyr-151, human
          15-hydroxyprostaglandin dehydrogenase numbering). In
          addition to the Tyr and Lys, there is often an upstream
          Ser and/or an Asn, contributing to the active site;
          while substrate binding is in the C-terminal region,
          which determines specificity. The standard reaction
          mechanism is a 4-pro-S hydride transfer and proton
          relay involving the conserved Tyr and Lys, a water
          molecule stabilized by Asn, and nicotinamide. In
          addition to the Rossmann fold core region typical of
          all SDRs, extended SDRs have a less conserved
          C-terminal extension of approximately 100 amino acids,
          and typically have a TGXXGXXG cofactor binding motif.
          Complex (multidomain) SDRs such as ketoreductase
          domains of fatty acid synthase have a GGXGXXG
          NAD(P)-binding motif and an altered active site motif
          (YXXXN). Fungal type ketoacyl reductases have a
          TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 207

 Score = 29.5 bits (67), Expect = 1.5
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 65 RIGLLGASGYTGAEIVR 81
          +I ++GA+G TG+ IVR
Sbjct: 1  KIAIIGATGRTGSAIVR 17


>gnl|CDD|187537 cd05226, SDR_e_a, Extended (e) and atypical (a) SDRs.  Extended or
           atypical short-chain dehydrogenases/reductases (SDRs,
           aka tyrosine-dependent oxidoreductases) are distinct
           from classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. Atypical SDRs generally
           lack the catalytic residues characteristic of the SDRs,
           and their glycine-rich NAD(P)-binding motif is often
           different from the forms normally seen in classical or
           extended SDRs. Atypical SDRs include biliverdin IX beta
           reductase (BVR-B,aka flavin reductase), NMRa (a negative
           transcriptional regulator of various fungi),
           progesterone 5-beta-reductase like proteins,
           phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Complex
           (multidomain) SDRs such as ketoreductase domains of
           fatty acid synthase have a GGXGXXG NAD(P)-binding motif
           and an altered active site motif (YXXXN). Fungal type
           ketoacyl reductases have a TGXXXGX(1-2)G NAD(P)-binding
           motif.
          Length = 176

 Score = 29.3 bits (66), Expect = 1.6
 Identities = 18/101 (17%), Positives = 37/101 (36%), Gaps = 10/101 (9%)

Query: 66  IGLLGASGYTGAEIVR--LLANHPYFGI-KLMTADRKAGQSIGSVFPHLISQDLPTMVAV 122
           I +LGA+G+ G  + R  L   H    + +      K  Q   +V    +        AV
Sbjct: 1   ILILGATGFIGRALARELLEQGHEVTLLVRNTKRLSKEDQEPVAVVEGDLRDLDSLSDAV 60

Query: 123 KDADFSNVDAVFCCL-PHGTTQEIIKGLPKSLK-IVDLSAD 161
           +      VD V         T++  +   +  + +++ + +
Sbjct: 61  QG-----VDVVIHLAGAPRDTRDFCEVDVEGTRNVLEAAKE 96


>gnl|CDD|187578 cd05269, TMR_SDR_a, triphenylmethane reductase (TMR)-like proteins,
           NMRa-like, atypical (a) SDRs.  TMR is an atypical
           NADP-binding protein of the SDR family. It lacks the
           active site residues of the SDRs but has a glycine rich
           NAD(P)-binding motif that matches the extended SDRs.
           Proteins in this subgroup however, are more similar in
           length to the classical SDRs. TMR was identified as a
           reducer of triphenylmethane dyes, important
           environmental pollutants. This subgroup also includes
           Escherichia coli NADPH-dependent quinine oxidoreductase
           (QOR2), which catalyzes two-electron reduction of
           quinone; but is unlikely to play a major role in
           protecting against quinone cytotoxicity. Atypical SDRs
           are distinct from classical SDRs. Atypical SDRs include
           biliverdin IX beta reductase (BVR-B,aka flavin
           reductase), NMRa (a negative transcriptional regulator
           of various fungi), progesterone 5-beta-reductase like
           proteins, phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 272

 Score = 29.5 bits (67), Expect = 1.8
 Identities = 16/72 (22%), Positives = 25/72 (34%), Gaps = 8/72 (11%)

Query: 66  IGLLGASGYTGAEIVRLLANHPYFGIKLMTAD---RKAGQSIGSVFPHLISQDLPTMVAV 122
           I + GA+G  G  +V LL       +  +  +    KA  + G         D  T+   
Sbjct: 1   ILVTGATGKLGTAVVELLLAKVA-SVVALVRNPEKAKAFAADGVEVRQGDYDDPETLERA 59

Query: 123 KDADFSNVDAVF 134
               F  VD + 
Sbjct: 60  ----FEGVDRLL 67


>gnl|CDD|178263 PLN02657, PLN02657, 3,8-divinyl protochlorophyllide a 8-vinyl
           reductase.
          Length = 390

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 26/151 (17%), Positives = 48/151 (31%), Gaps = 29/151 (19%)

Query: 1   MSAATFSSLCFSRGCFWKDEAKNSPRVMKGSGKVKISSIRGSATLPTKSLQVEDGKTQKS 60
           + AA  +S   S               + G+   + ++    +     +        +  
Sbjct: 9   LRAAAAASSSPSNRLA---------ASLGGALVRRAAAASRGSRATAAAAAQSFRSKEPK 59

Query: 61  EKQVRIGLLGASGYTGAEIVRLLANHPYFGIKLMTADRKAGQSIGSVFPHLISQDLPTMV 120
           +  V +  +GA+GY G  +VR L    Y  + +    R+     G        ++LP   
Sbjct: 60  DVTVLV--VGATGYIGKFVVRELVRRGYNVVAV---AREKSGIRGKNGKEDTKKELP-GA 113

Query: 121 AVKDADFSN--------------VDAVFCCL 137
            V   D ++              VD V  CL
Sbjct: 114 EVVFGDVTDADSLRKVLFSEGDPVDVVVSCL 144


>gnl|CDD|217277 pfam02901, PFL, Pyruvate formate lyase. 
          Length = 646

 Score = 29.5 bits (67), Expect = 2.6
 Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 16/93 (17%)

Query: 167 VSEYEEWYGQPHIAPDLQKEAVYGLTEISREDIKNARLVANPGCYPTSIQLPLVPLIQAN 226
            +   + YG+ HI  D ++  +YGL  +  E  K  +L A     P           +A 
Sbjct: 147 FTGLPDAYGRGHIIGDYRRVLLYGLDGLIAE--KEEKLAALDLTDPEDADKIY----KAM 200

Query: 227 LIQYRNIIIDAKSGVSGAGRGAKEANLYSEIAE 259
           +I    +I  A+       R A+ A    E+AE
Sbjct: 201 IISCDAVINYAE-------RYARLA---EELAE 223


>gnl|CDD|187561 cd05251, NmrA_like_SDR_a, NmrA (a transcriptional regulator) and
           HSCARG (an NADPH sensor) like proteins, atypical (a)
           SDRs.  NmrA and HSCARG like proteins. NmrA is a negative
           transcriptional regulator of various fungi, involved in
           the post-translational modulation of the GATA-type
           transcription factor AreA. NmrA lacks the canonical
           GXXGXXG NAD-binding motif and has altered residues at
           the catalytic triad, including a Met instead of the
           critical Tyr residue. NmrA may bind nucleotides but
           appears to lack any dehydrogenase activity. HSCARG has
           been identified as a putative NADP-sensing molecule, and
           redistributes and restructures in response to NADPH/NADP
           ratios. Like NmrA, it lacks most of the active site
           residues of the SDR family, but has an NAD(P)-binding
           motif similar to the extended SDR family, GXXGXXG. SDRs
           are a functionally diverse family of oxidoreductases
           that have a single domain with a structurally conserved
           Rossmann fold, an NAD(P)(H)-binding region, and a
           structurally diverse C-terminal region. Sequence
           identity between different SDR enzymes is typically in
           the 15-30% range; they catalyze a wide range of
           activities including the metabolism of steroids,
           cofactors, carbohydrates, lipids, aromatic compounds,
           and amino acids, and act in redox sensing. Atypical SDRs
           are distinct from classical SDRs. Classical SDRs have an
           TGXXX[AG]XG cofactor binding motif and a YXXXK active
           site motif, with the Tyr residue of the active site
           motif serving as a critical catalytic residue (Tyr-151,
           human 15-hydroxyprostaglandin dehydrogenase numbering).
           In addition to the Tyr and Lys, there is often an
           upstream Ser and/or an Asn, contributing to the active
           site; while substrate binding is in the C-terminal
           region, which determines specificity. The standard
           reaction mechanism is a 4-pro-S hydride transfer and
           proton relay involving the conserved Tyr and Lys, a
           water molecule stabilized by Asn, and nicotinamide. In
           addition to the Rossmann fold core region typical of all
           SDRs, extended SDRs have a less conserved C-terminal
           extension of approximately 100 amino acids, and
           typically have a TGXXGXXG cofactor binding motif.
           Complex (multidomain) SDRs such as ketoreductase domains
           of fatty acid synthase have a GGXGXXG NAD(P)-binding
           motif and an altered active site motif (YXXXN). Fungal
           type ketoacyl reductases have a TGXXXGX(1-2)G
           NAD(P)-binding motif.
          Length = 242

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 66  IGLLGASGYTGAEIVRLLANHPYFGIKLMTAD--RKAGQSIGSVFPHLISQDLPTMVAVK 123
           I + GA+G  G  +VR L   P F ++ +T D    A +++ +    ++  DL    +++
Sbjct: 1   ILVFGATGKQGGSVVRALLKDPGFKVRALTRDPSSPAAKALAAPGVEVVQGDLDDPESLE 60

Query: 124 DADFSNVDAVFC 135
            A    V  VF 
Sbjct: 61  AA-LKGVYGVFL 71


>gnl|CDD|223774 COG0702, COG0702, Predicted nucleoside-diphosphate-sugar
          epimerases [Cell envelope biogenesis, outer membrane /
          Carbohydrate transport and metabolism].
          Length = 275

 Score = 29.1 bits (65), Expect = 2.9
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 65 RIGLLGASGYTGAEIVRLLANHPY 88
          +I + GA+G+ G  +VR L    +
Sbjct: 2  KILVTGATGFVGGAVVRELLARGH 25


>gnl|CDD|219193 pfam06824, DUF1237, Protein of unknown function (DUF1237).  This
           family contains a number of hypothetical proteins of
           about 450 residues in length. Their function is unknown,
           and most are bacterial. However, structurally this
           family is part of the 6 hairpin glycosidase superfamily,
           suggesting a glycosyl hydrolase function.
          Length = 403

 Score = 29.1 bits (66), Expect = 3.5
 Identities = 12/24 (50%), Positives = 14/24 (58%), Gaps = 1/24 (4%)

Query: 248 AKEAN-LYSEIAEGIYSYGVTRHR 270
           A+EA  L  EI EGI  YG+  H 
Sbjct: 238 AEEAEKLADEIREGIEKYGIVEHP 261


>gnl|CDD|131072 TIGR02017, hutG_amidohyd, N-formylglutamate amidohydrolase.  In
           some species, histidine is converted to via urocanate
           and then formimino-L-glutamate to glutamate in four
           steps, where the fourth step is conversion of
           N-formimino-L-glutamate to L-glutamate and formamide. In
           others, that pathway from formimino-L-glutamate may
           differ, with the next enzyme being formiminoglutamate
           hydrolase (HutF) yielding N-formyl-L-glutamate. This
           model represents the enzyme N-formylglutamate
           deformylase, also called N-formylglutamate
           amidohydrolase, which then produces glutamate [Energy
           metabolism, Amino acids and amines].
          Length = 263

 Score = 28.6 bits (64), Expect = 4.0
 Identities = 18/41 (43%), Positives = 18/41 (43%), Gaps = 7/41 (17%)

Query: 77  AEIVRLLANHPYFGIKLMTADRKAGQSIGSVFPHLISQDLP 117
           AEI RL A H Y    L  A      SI SV P L    LP
Sbjct: 129 AEIERLRAQHGY--AVLYDA-----HSIRSVIPRLFEGKLP 162


>gnl|CDD|176803 cd08969, MeNeil3_N, N-terminal domain of metazoan Nei-like
          glycosylase 3 (NEIL3).  This family contains the
          N-terminal domain of the Metazoan Neil3. It belongs to
          the FpgNei_N, [N-terminal domain of Fpg
          (formamidopyrimidine-DNA glycosylase, MutM)_Nei
          (endonuclease VIII)] domain superfamily. DNA
          glycosylases maintain genome integrity by recognizing
          base lesions created by ionizing radiation, alkylating
          or oxidizing agents, and endogenous reactive oxygen
          species. They initiate the base-excision repair
          process, which is completed with the help of enzymes
          such as phosphodiesterases, AP endonucleases, DNA
          polymerases and DNA ligases. DNA glycosylases cleave
          the N-glycosyl bond between the sugar and the damaged
          base, creating an AP (apurinic/apyrimidinic) site. Most
          FpgNei DNA glycosylases use their N-terminal proline
          residue as the key catalytic nucleophile, and the
          reaction proceeds via a Schiff base intermediate. In
          contrast, mouse NEIL3 (MmuNEIL3) forms a Schiff base
          intermediate via its N-terminal valine. The latter is a
          functional DNA glycosylase in vitro and in vivo.
          MmuNEIL3 prefers lesions in single-stranded DNA and in
          bubble structures.  In duplex DNA, it recognizes the
          oxidized purines spiroiminodihydantoin (Sp),
          guanidinohydantoin (Gh),
          2,6-diamino-4-hydroxy-5-formamidopyrimidine (FapyG) and
          4,6-diamino-5-formamidopyrimidine (FapyA), but not
          8-oxo-7,8-dihydroguanine (8-oxoG). Since the expression
          of the MmuNeil3 glycosylase domain (MmuNeil3delta324)
          reduces both the high spontaneous mutation frequency
          and the FapyG level in a Escherichia coli mutant
          lacking Fpg, Nei and MutY glycosylase activites, NEIL3
          may play a role in repairing FapyG in vivo.  In
          addition to this MeNeil3_N domain, enzymes belonging to
          this family contain a helix-two turn-helix (H2TH)
          domain and a zinc finger motif, plus a characteristic
          C-terminal extension that contains additional zinc
          fingers. Neil3 is one of three homologs found in
          eukaryotes.
          Length = 140

 Score = 27.8 bits (62), Expect = 4.4
 Identities = 21/68 (30%), Positives = 29/68 (42%), Gaps = 6/68 (8%)

Query: 26 RVMKGSGKVKISSIRGSATLPTKSLQVEDGKTQKSEKQVRIGLLGASGYTGAEIV-RLLA 84
          RV KG    ++  +RGSA          +G   K E+   +  L    YTG E + + L 
Sbjct: 17 RVEKGQ---RVVHVRGSAPSSPSGAASRNGAGSKDERSHVLDSLTGQVYTGVETLGKELF 73

Query: 85 NHPYFGIK 92
             YFG K
Sbjct: 74 M--YFGDK 79


>gnl|CDD|225722 COG3181, COG3181, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 319

 Score = 28.0 bits (63), Expect = 7.1
 Identities = 17/50 (34%), Positives = 23/50 (46%), Gaps = 3/50 (6%)

Query: 115 DLPTMV-AVKDADFSNVDAVFCCLPHGTTQEIIKGLPKSLKIVDLSADFR 163
           D+PT+     D   S    VF   P GT  EII  L  +LK    S +++
Sbjct: 232 DVPTLKEQGYDVVMSIWRGVF--APAGTPDEIIAKLSAALKKALASPEWQ 279


>gnl|CDD|187554 cd05243, SDR_a5, atypical (a) SDRs, subgroup 5.  This subgroup
          contains atypical SDRs, some of which are identified as
          putative NAD(P)-dependent epimerases, one as a putative
          NAD-dependent epimerase/dehydratase. Atypical SDRs are
          distinct from classical SDRs. Members of this subgroup
          have a glycine-rich NAD(P)-binding motif that is very
          similar to the extended SDRs, GXXGXXG, and binds NADP.
          Generally, this subgroup has poor conservation of the
          active site tetrad; however, individual sequences do
          contain matches to the YXXXK active site motif, the
          upstream Ser, and there is a highly conserved Asp in
          place of the usual active site Asn throughout the
          subgroup. Atypical SDRs generally lack the catalytic
          residues characteristic of the SDRs, and their
          glycine-rich NAD(P)-binding motif is often different
          from the forms normally seen in classical or extended
          SDRs. Atypical SDRs include biliverdin IX beta
          reductase (BVR-B,aka flavin reductase), NMRa (a
          negative transcriptional regulator of various fungi),
          progesterone 5-beta-reductase like proteins,
          phenylcoumaran benzylic ether and
          pinoresinol-lariciresinol reductases, phenylpropene
          synthases, eugenol synthase, triphenylmethane
          reductase, isoflavone reductases, and others. SDRs are
          a functionally diverse family of oxidoreductases that
          have a single domain with a structurally conserved
          Rossmann fold, an NAD(P)(H)-binding region, and a
          structurally diverse C-terminal region. Sequence
          identity between different SDR enzymes is typically in
          the 15-30% range; they catalyze a wide range of
          activities including the metabolism of steroids,
          cofactors, carbohydrates, lipids, aromatic compounds,
          and amino acids, and act in redox sensing. Classical
          SDRs have an TGXXX[AG]XG cofactor binding motif and a
          YXXXK active site motif, with the Tyr residue of the
          active site motif serving as a critical catalytic
          residue (Tyr-151, human 15-hydroxyprostaglandin
          dehydrogenase numbering). In addition to the Tyr and
          Lys, there is often an upstream Ser and/or an Asn,
          contributing to the active site; while substrate
          binding is in the C-terminal region, which determines
          specificity. The standard reaction mechanism is a
          4-pro-S hydride transfer and proton relay involving the
          conserved Tyr and Lys, a water molecule stabilized by
          Asn, and nicotinamide. In addition to the Rossmann fold
          core region typical of all SDRs, extended SDRs have a
          less conserved C-terminal extension of approximately
          100 amino acids, and typically have a TGXXGXXG cofactor
          binding motif. Complex (multidomain) SDRs such as
          ketoreductase domains of fatty acid synthase have a
          GGXGXXG NAD(P)-binding motif and an altered active site
          motif (YXXXN). Fungal type ketoacyl reductases have a
          TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 203

 Score = 27.6 bits (62), Expect = 7.5
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 65 RIGLLGASGYTGAEIVRLLANHPY 88
          ++ ++GA+G  G  +VR L +  Y
Sbjct: 1  KVLVVGATGKVGRHVVRELLDRGY 24


>gnl|CDD|227623 COG5307, COG5307, SEC7 domain proteins [General function prediction
           only].
          Length = 1024

 Score = 27.8 bits (62), Expect = 8.9
 Identities = 23/112 (20%), Positives = 37/112 (33%), Gaps = 22/112 (19%)

Query: 106 SVFPHLISQDLPTMVAVKDADFS--NVD----AVFCC--LPHGTTQ---EIIKGLPKSLK 154
           SVF  ++S DLP+    K+ D S  NVD           L   +     ++ K L +S  
Sbjct: 674 SVFKAILSSDLPSPPLQKEVDISKKNVDIIKSESKISNVLFKNSEGLSPDLNKTLLESAL 733

Query: 155 IVDLSADFRLRDVSE---------YEEWYGQPHIAPDL--QKEAVYGLTEIS 195
                   R  ++ E             +        +   KE V G ++  
Sbjct: 734 DSKSQLSSRFLEIEELSDFGFQIALLLPFEYSVEVSLVVAVKELVIGCSDNL 785


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.134    0.393 

Gapped
Lambda     K      H
   0.267   0.0807    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,297,576
Number of extensions: 1459604
Number of successful extensions: 1283
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1253
Number of HSP's successfully gapped: 56
Length of query: 306
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 209
Effective length of database: 6,635,264
Effective search space: 1386770176
Effective search space used: 1386770176
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.4 bits)