RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 017153
         (376 letters)



>gnl|CDD|178009 PLN02383, PLN02383, aspartate semialdehyde dehydrogenase.
          Length = 344

 Score =  689 bits (1779), Expect = 0.0
 Identities = 291/344 (84%), Positives = 319/344 (92%)

Query: 33  MSYQESAPSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVE 92
           M+  E+ PSVA+VGVTGAVGQEFLSVL+DRDFPY S+KMLAS RSAGK+++F+ + YTVE
Sbjct: 1   MALTENGPSVAIVGVTGAVGQEFLSVLTDRDFPYSSLKMLASARSAGKKVTFEGRDYTVE 60

Query: 93  ELTEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEA 152
           ELTEDSFDGVDIALFSAGGSISKKFGPIAV+KG++VVDNSSAFRM E VPLVIPEVNPEA
Sbjct: 61  ELTEDSFDGVDIALFSAGGSISKKFGPIAVDKGAVVVDNSSAFRMEEGVPLVIPEVNPEA 120

Query: 153 MSGIKVGMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELE 212
           M  IK+G GKGALIANPNCSTIICLMA TPLHR AKV RMVVSTYQAASGAGAAAMEELE
Sbjct: 121 MKHIKLGKGKGALIANPNCSTIICLMAVTPLHRHAKVKRMVVSTYQAASGAGAAAMEELE 180

Query: 213 LQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVT 272
            QTREVLEGKPPTC IF+QQYAFNLFSHNAP+ ENGYNEEEMK+VKETRKIWND DV+VT
Sbjct: 181 QQTREVLEGKPPTCNIFAQQYAFNLFSHNAPMQENGYNEEEMKLVKETRKIWNDDDVKVT 240

Query: 273 ATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEVSNKD 332
           ATCIRVPVMRAHAES+NLQFEKPLDE TAR+IL +APGV +IDDRA+N FPTPL+ SNKD
Sbjct: 241 ATCIRVPVMRAHAESINLQFEKPLDEATAREILASAPGVKIIDDRANNRFPTPLDASNKD 300

Query: 333 DVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
           DVAVGRIR+D+SQDGN GLDIFVCGDQ+RKGAALNAVQIAE+LL
Sbjct: 301 DVAVGRIRQDISQDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 344


>gnl|CDD|237845 PRK14874, PRK14874, aspartate-semialdehyde dehydrogenase;
           Provisional.
          Length = 334

 Score =  499 bits (1288), Expect = e-179
 Identities = 188/337 (55%), Positives = 246/337 (72%), Gaps = 8/337 (2%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFDG 101
           VAVVG TGAVG+E L++L +R+FP   +++LAS RSAGK+LSF+ K   VE+LT   F G
Sbjct: 4   VAVVGATGAVGREMLNILEERNFPVDKLRLLASARSAGKELSFKGKELKVEDLTTFDFSG 63

Query: 102 VDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGMG 161
           VDIALFSAGGS+SKK+ P A   G++V+DNSSAFRM  +VPLV+PEVNPEA++  +    
Sbjct: 64  VDIALFSAGGSVSKKYAPKAAAAGAVVIDNSSAFRMDPDVPLVVPEVNPEALAEHR---K 120

Query: 162 KGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLEG 221
           KG +IANPNCSTI  ++A  PLH  A + R+VVSTYQA SGAG A MEEL  QTR VL  
Sbjct: 121 KG-IIANPNCSTIQMVVALKPLHDAAGIKRVVVSTYQAVSGAGKAGMEELFEQTRAVLNA 179

Query: 222 K--PPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTATCIRVP 279
              P   K F +  AFN+  H    +++GY +EEMKMV ET+KI  D D++V+ATC+RVP
Sbjct: 180 AVDPVEPKKFPKPIAFNVIPHIDVFMDDGYTKEEMKMVNETKKILGDPDLKVSATCVRVP 239

Query: 280 VMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEVSNKDDVAVGRI 339
           V   H+ESVN++FE+P+  + AR+IL  APGVV++DD  +  +PTPLE   KD   VGRI
Sbjct: 240 VFTGHSESVNIEFEEPISVEEAREILAEAPGVVLVDDPENGGYPTPLEAVGKDATFVGRI 299

Query: 340 RRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
           R+D++ +  +GL ++V  D +RKGAALNAVQIAE+L+
Sbjct: 300 RKDLTVE--NGLHLWVVSDNLRKGAALNAVQIAELLI 334


>gnl|CDD|223214 COG0136, Asd, Aspartate-semialdehyde dehydrogenase [Amino acid
           transport and metabolism].
          Length = 334

 Score =  396 bits (1020), Expect = e-138
 Identities = 169/340 (49%), Positives = 224/340 (65%), Gaps = 14/340 (4%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQ-LSFQDKAYTVEELTEDSFD 100
           VAV+G TGAVGQ  L +L +R FP+  + +LAS RSAGK+ + F  K+  V E   D F 
Sbjct: 4   VAVLGATGAVGQVLLELLEERHFPFEELVLLASARSAGKKYIEFGGKSIGVPEDAADEFV 63

Query: 101 --GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKV 158
              VDI  F+AGGS+SK+  P A E G +V+DNSSAFRM  +VPLV+PEVNPE +   + 
Sbjct: 64  FSDVDIVFFAAGGSVSKEVEPKAAEAGCVVIDNSSAFRMDPDVPLVVPEVNPEHLIDYQ- 122

Query: 159 GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREV 218
              +G +IANPNCSTI  ++A  PLH    + R+VVSTYQA SGAGA    EL  QT  +
Sbjct: 123 --KRGFIIANPNCSTIQLVLALKPLHDAFGIKRVVVSTYQAVSGAGAEGGVELAGQTDAL 180

Query: 219 LEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTATCIRV 278
           L G P          AFN+  H    L+NGY +EE K+  ETRKI  D D++V+ATC+RV
Sbjct: 181 LNGIPILPI--GYPLAFNVIPHIDGFLDNGYTKEEWKIEAETRKILGDPDIKVSATCVRV 238

Query: 279 PVMRAHAESVNLQFEKPLDEDTAR-DILKNAPGVVVIDDRASNHFP-TPLEVSNKDDVAV 336
           PV   H+E+V ++F+K +D +  R ++L +APGVVV+D+      P TPL+ +  D+V+V
Sbjct: 239 PVFYGHSEAVTVEFKKDVDPEEIREELLPSAPGVVVVDNP--EDRPQTPLDATGGDEVSV 296

Query: 337 GRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
           GRIR+D+S     GL ++V GD +RKGAALNAV IAE+LL
Sbjct: 297 GRIRKDLSGP--EGLKLWVVGDNLRKGAALNAVLIAELLL 334


>gnl|CDD|233347 TIGR01296, asd_B, aspartate-semialdehyde dehydrogenase
           (peptidoglycan organisms).  Two closely related families
           of aspartate-semialdehyde dehydrogenase are found. They
           differ by a deep split in phylogenetic and percent
           identity trees and in gap patterns. This model
           represents a branch more closely related to the USG-1
           protein than to the other aspartate-semialdehyde
           dehydrogenases represented in model TIGR00978 [Amino
           acid biosynthesis, Aspartate family].
          Length = 338

 Score =  388 bits (999), Expect = e-135
 Identities = 178/341 (52%), Positives = 236/341 (69%), Gaps = 12/341 (3%)

Query: 41  SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFD 100
           +VA+VG TGAVGQE L +L +R+FP   + +LAS RSAG++L+F+ K   VEE   +SF+
Sbjct: 1   NVAIVGATGAVGQEMLKLLEERNFPIDKLVLLASARSAGRKLTFKGKELEVEEAETESFE 60

Query: 101 GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGM 160
           G+DIALFSAGGS+SK+F P A + G IV+DN+SAFRM  +VPLV+PEVN E +       
Sbjct: 61  GIDIALFSAGGSVSKEFAPKAAKAGVIVIDNTSAFRMDPDVPLVVPEVNFEDLKEFN--- 117

Query: 161 GKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLE 220
               +IANPNCSTI  ++   PLH  AK+ R+VVSTYQA SGAG A +EEL  QT+ VLE
Sbjct: 118 -PKGIIANPNCSTIQMVVVLKPLHDEAKIKRVVVSTYQAVSGAGNAGVEELYNQTKAVLE 176

Query: 221 GK------PPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTAT 274
           G        P    F  Q AFN   H    +++GY +EE KM+ ETRKI    D++V+AT
Sbjct: 177 GAEQLPYIQPKANKFPYQIAFNAIPHIDSFVDDGYTKEEQKMLFETRKIMGIPDLKVSAT 236

Query: 275 CIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEVSNKDDV 334
           C+RVPV   H+ES+N++FEK +  + AR++LKNAPGV +IDD + N +PTPL     D+V
Sbjct: 237 CVRVPVFTGHSESINIEFEKEISPEDARELLKNAPGVQLIDDPSGNLYPTPLAAVGVDEV 296

Query: 335 AVGRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEML 375
            VGRIR+D+     +GL ++V  D +RKGAALN+VQIAE+L
Sbjct: 297 FVGRIRKDLPDG--NGLHLWVVADNLRKGAALNSVQIAELL 335


>gnl|CDD|136022 PRK06728, PRK06728, aspartate-semialdehyde dehydrogenase;
           Provisional.
          Length = 347

 Score =  255 bits (652), Expect = 2e-82
 Identities = 139/348 (39%), Positives = 213/348 (61%), Gaps = 16/348 (4%)

Query: 37  ESAPSVAVVGVTGAVGQEFLSVL-SDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELT 95
           E    VAVVG TGAVGQ+ + +L  +  F    + +L+SKRSAGK + F+ +   ++E  
Sbjct: 3   EKGYHVAVVGATGAVGQKIIELLEKETKFNIAEVTLLSSKRSAGKTVQFKGREIIIQEAK 62

Query: 96  EDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSG 155
            +SF+GVDIA FSAGG +S++F   AV  G+IV+DN+S +RM  +VPLV+PEVN   +  
Sbjct: 63  INSFEGVDIAFFSAGGEVSRQFVNQAVSSGAIVIDNTSEYRMAHDVPLVVPEVNAHTLKE 122

Query: 156 IKVGMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQT 215
            K       +IA PNCS +  + A  P+ +   + R++VSTYQA SG+G  A++EL+ Q 
Sbjct: 123 HK------GIIAVPNCSALQMVTALQPIRKVFGLERIIVSTYQAVSGSGIHAIQELKEQA 176

Query: 216 REVLEGKPPTCKIF-----SQQY--AFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKD 268
           + +L G+     I       + Y  AFN+        +N +  EE+KM++ET+KI  D +
Sbjct: 177 KSILAGEEVESTILPAKKDKKHYPIAFNVLPQVDIFTDNDFTFEEVKMIQETKKILEDPN 236

Query: 269 VRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEV 328
           +++ ATC+RVPV+  H+ESV ++ EK       +++L +APGV++ D+ +   +P PL  
Sbjct: 237 LKMAATCVRVPVISGHSESVYIELEKEATVAEIKEVLFDAPGVILQDNPSEQLYPMPLYA 296

Query: 329 SNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
             K D  VGRIR+D   D  +G  +++  D + KGAA N+VQIAE ++
Sbjct: 297 EGKIDTFVGRIRKD--PDTPNGFHLWIVSDNLLKGAAWNSVQIAETMV 342


>gnl|CDD|168165 PRK05671, PRK05671, aspartate-semialdehyde dehydrogenase; Reviewed.
          Length = 336

 Score =  200 bits (511), Expect = 1e-61
 Identities = 115/336 (34%), Positives = 179/336 (53%), Gaps = 11/336 (3%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFDG 101
           +AVVG TG VG+  + +L +RDFP  ++ +LAS  SAG  + F  K   V E+    F  
Sbjct: 7   IAVVGATGTVGEALVQILEERDFPVGTLHLLASSESAGHSVPFAGKNLRVREVDSFDFSQ 66

Query: 102 VDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGMG 161
           V +A F+AG ++S+ F   A   G  V+D S A       P V+PEVN E ++ +     
Sbjct: 67  VQLAFFAAGAAVSRSFAEKARAAGCSVIDLSGALP-SAQAPNVVPEVNAERLASLA---- 121

Query: 162 KGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLEG 221
              L+++P+ S +   +A  PL     + R+ V+   A S  G   + EL  QT E+L  
Sbjct: 122 APFLVSSPSASAVALAVALAPLKGLLDIQRVQVTACLAVSSLGREGVSELARQTAELLNA 181

Query: 222 KPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTATCIRVPVM 281
           +P   + F +Q AFNL +        G+   E ++V E R++    +++++ TCI+VPV 
Sbjct: 182 RPLEPRFFDRQVAFNLLAQVGAPDAQGHTALERRLVAELRQLLGLPELKISVTCIQVPVF 241

Query: 282 RAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLE-VSNKDDVAVGRIR 340
              + SV LQ   P+D       L+ APG+ +++      +PTP+     +D V VGR+R
Sbjct: 242 FGDSLSVALQSAAPVDLAAVNAALEAAPGIELVEA---GDYPTPVGDAVGQDVVYVGRVR 298

Query: 341 RDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
             V       L++++  D VRKGAALNAVQ+AE+L+
Sbjct: 299 AGVDDPCQ--LNLWLTSDNVRKGAALNAVQVAELLI 332


>gnl|CDD|181205 PRK08040, PRK08040, putative semialdehyde dehydrogenase;
           Provisional.
          Length = 336

 Score =  196 bits (499), Expect = 8e-60
 Identities = 112/343 (32%), Positives = 185/343 (53%), Gaps = 24/343 (6%)

Query: 41  SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFD 100
           ++A++G TGAVG+  L +L++R FP   +  LAS+ SAG+ L F  K+ TV++  E  + 
Sbjct: 6   NIALLGATGAVGEALLELLAERQFPVGELYALASEESAGETLRFGGKSVTVQDAAEFDWS 65

Query: 101 GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGM 160
              +A F AG   S  +   A   G +V+D+S  F +  +VPLV+PEVNP  ++  +   
Sbjct: 66  QAQLAFFVAGREASAAYAEEATNAGCLVIDSSGLFALEPDVPLVVPEVNPFVLADYR--- 122

Query: 161 GKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLE 220
               +IA  +  T   L A  PL  +A ++R+ V+   +AS  G AA++ L  Q+ ++L 
Sbjct: 123 -NRNIIAVADSLTSQLLTAIKPLIDQAGLSRLHVTNLLSASAHGKAAVDALAGQSAKLLN 181

Query: 221 GKPPTCKIFSQQYAFNLFSHNAPVL--ENGYNEEEMKMVKETRKIWNDKDVRVTATCIRV 278
           G P     F +Q AFN+     P+L    G   EE ++V + RKI  D+ + ++ +C++ 
Sbjct: 182 GIPIEEGFFGRQLAFNML----PLLPDSEGSVREERRLVDQVRKILQDEGLPISVSCVQS 237

Query: 279 PVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPL-EVSNKDDVAVG 337
           PV   HA+ V+ +  +PL  + ARD L+    +V+ ++   N +PT + + S    +++G
Sbjct: 238 PVFYGHAQMVHFEALRPLAAEEARDALEQGEDIVLSEE---NDYPTQVGDASGNPHLSIG 294

Query: 338 RIRRDVSQDGNHG----LDIFVCGDQVRKGAALNAVQIAEMLL 376
            +R D      +G    L  +   D VR G AL AV+ AE L+
Sbjct: 295 CVRND------YGMPEQLQFWSVADNVRFGGALMAVKTAEKLV 331


>gnl|CDD|236329 PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed.
          Length = 349

 Score =  188 bits (480), Expect = 1e-56
 Identities = 116/389 (29%), Positives = 175/389 (44%), Gaps = 106/389 (27%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSIKML-ASKRSAGKQL--------------SFQD 86
           V ++G TG VGQ F+ +L++   P+  +  L AS+RSAGK                   D
Sbjct: 6   VGILGATGMVGQRFVQLLAN--HPWFEVTALAASERSAGKTYGEAVRWQLDGPIPEEVAD 63

Query: 87  KAYTVEELTEDSFDGVDIALFSA-----GGSISKKFGPIAVEKGSIVVDNSSAFRMVENV 141
               V     ++ D VDI +FSA      G + ++F     + G  V  N+SA RM  +V
Sbjct: 64  --MEVVSTDPEAVDDVDI-VFSALPSDVAGEVEEEF----AKAGKPVFSNASAHRMDPDV 116

Query: 142 PLVIPEVNPEAMSGIKV----GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTY 197
           PLVIPEVNPE +  I+V        G ++ NPNCSTI  ++A  PL     + R+ V+T 
Sbjct: 117 PLVIPEVNPEHLELIEVQRKRRGWDGFIVTNPNCSTIGLVLALKPL-MDFGIERVHVTTM 175

Query: 198 QAASGAG---AAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEM 254
           QA SGAG     +M+ ++                       N+  +          EEE 
Sbjct: 176 QAISGAGYPGVPSMDIVD-----------------------NVIPY-------IGGEEE- 204

Query: 255 KMVKETRKIW--------NDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILK 306
           K+ KET KI            D  ++ATC RVPV+  H E+V ++F++ +D +  R+ L+
Sbjct: 205 KIEKETLKILGKFEGGKIVPADFPISATCHRVPVIDGHTEAVFVKFKEDVDPEEIREALE 264

Query: 307 NAPGVVVIDDRASNHFPT----PLEVSNKDD---------------VAVGRIRRDVSQDG 347
           +  G+           P+    P+ +  + D               V+VGR+R    +DG
Sbjct: 265 SFKGLPQELG-----LPSAPKKPIILFEEPDRPQPRLDRDAGDGMAVSVGRLR----EDG 315

Query: 348 NHGLDIFVC-GDQVRKGAALNAVQIAEML 375
              +  FV  G    +GAA  +V  AE+L
Sbjct: 316 IFDIK-FVVLGHNTVRGAAGASVLNAELL 343


>gnl|CDD|233220 TIGR00978, asd_EA, aspartate-semialdehyde dehydrogenase
           (non-peptidoglycan organisms).  Two closely related
           families of aspartate-semialdehyde dehydrogenase are
           found. They differ by a deep split in phylogenetic and
           percent identity trees and in gap patterns. Separate
           models are built for the two types in order to exclude
           the USG-1 protein, found in several species, which is
           specifically related to the Bacillus subtilis type of
           aspartate-semialdehyde dehydrogenase. Members of this
           type are found primarily in organisms that lack
           peptidoglycan [Amino acid biosynthesis, Aspartate
           family].
          Length = 341

 Score =  156 bits (396), Expect = 2e-44
 Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 87/380 (22%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSI-KMLASKRSAGKQL--------------SFQD 86
           VAV+G TG VGQ+F+ +L+    PY  + K++AS RSAGK+                 +D
Sbjct: 3   VAVLGATGLVGQKFVKLLAKH--PYFELAKVVASPRSAGKRYGEAVKWIEPGDMPEYVRD 60

Query: 87  KAYTVEELTEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIP 146
               + E    + D VDI   +    ++++  P   E G  V  N+S  RM  +VPL+IP
Sbjct: 61  --LPIVEPEPIAEDDVDIVFSALPSEVAEEVEPKLAEAGKPVFSNASNHRMDPDVPLIIP 118

Query: 147 EVNPEAMSGIKV---GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGA 203
           EVN + +  +KV      KG ++ NPNC+T    +A  PL     + ++ V+T QA SGA
Sbjct: 119 EVNSDHLELLKVQKERGWKGFIVTNPNCTTAGLTLALKPLIDAFGIKKVHVTTMQAVSGA 178

Query: 204 GAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKI 263
           G   +  +++                      N+  H           EE K+ +ETRKI
Sbjct: 179 GYPGVPSMDILD--------------------NIIPH--------IGGEEEKIERETRKI 210

Query: 264 W----NDK----DVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKN-------- 307
                N K       V+AT  RVPV+  H ESV+++F+K  D +  R+ LK+        
Sbjct: 211 LGKLENGKIEPAPFSVSATTTRVPVLDGHTESVHVEFDKKFDIEEIREALKSFRGLPQKL 270

Query: 308 ----APGVVVI----DDRASNHFPTP-LEVSNKDD--VAVGRIRRDVSQDGNHGLDIFVC 356
               AP   +I    +DR     P P L+        V VGR+R +        L   V 
Sbjct: 271 GLPSAPEKPIIVRDEEDR-----PQPRLDRDAGGGMAVTVGRLREE-----GGSLKYVVL 320

Query: 357 GDQVRKGAALNAVQIAEMLL 376
           G  + +GAA   +  AE+  
Sbjct: 321 GHNLVRGAAGATLLNAELAY 340


>gnl|CDD|201603 pfam01118, Semialdhyde_dh, Semialdehyde dehydrogenase, NAD binding
           domain.  This Pfam entry contains the following members:
           N-acetyl-glutamine semialdehyde dehydrogenase (AgrC)
           Aspartate-semialdehyde dehydrogenase.
          Length = 121

 Score =  132 bits (334), Expect = 6e-38
 Identities = 47/121 (38%), Positives = 70/121 (57%), Gaps = 7/121 (5%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKA------YTVEELT 95
           VA+VG TG VGQE L +L     P   + ++AS RSAGK+++F            +E++ 
Sbjct: 2   VAIVGATGYVGQELLRLL-AEHPPLELVALVASSRSAGKKVAFAYPILEGGKDLLLEDVD 60

Query: 96  EDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSG 155
            +    VDI   +    +SK+  P  +E G++V+D SSAFRM ++VP V+PEVN EA+  
Sbjct: 61  PEDLKDVDIVFLALPAGVSKELAPKLLEAGAVVIDLSSAFRMDDDVPYVLPEVNREAIKK 120

Query: 156 I 156
            
Sbjct: 121 A 121


>gnl|CDD|217222 pfam02774, Semialdhyde_dhC, Semialdehyde dehydrogenase,
           dimerisation domain.  This Pfam entry contains the
           following members: N-acetyl-glutamine semialdehyde
           dehydrogenase (AgrC) Aspartate-semialdehyde
           dehydrogenase.
          Length = 167

 Score =  126 bits (319), Expect = 3e-35
 Identities = 67/186 (36%), Positives = 92/186 (49%), Gaps = 26/186 (13%)

Query: 182 PLH-RRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSH 240
           PL      + R++V TYQA SGAG  A                    +F    A NL  +
Sbjct: 3   PLRDALGGLERVIVDTYQAVSGAGKKAKPG-----------------VFGAPIADNLIPY 45

Query: 241 NAPVLENGYNE--EEMKMVKETRKIWNDKDVRVTATCIRVPVMRAHAESVNLQFE-KPLD 297
                 NG  E  EE+KMV ET+KI       V+ATC+RVPV R H+E+V ++ + KP+D
Sbjct: 46  IDGEEHNGTPETREELKMVNETKKILGFTPK-VSATCVRVPVFRGHSETVTVKLKLKPID 104

Query: 298 EDTARDILKNAPGVVVIDDRASNHFPTPLEVSNK-DDVAVGRIRRDVSQDGNHGLDIFVC 356
            +   +    APGV V+  R    +PTP  V    + V VGR+R+D   DG+ GL +   
Sbjct: 105 VEEVYEAFYAAPGVFVV-VRPEEDYPTPRAVRGGTNFVYVGRVRKDP--DGDRGLKLVSV 161

Query: 357 GDQVRK 362
            D +RK
Sbjct: 162 IDNLRK 167


>gnl|CDD|214863 smart00859, Semialdhyde_dh, Semialdehyde dehydrogenase, NAD binding
           domain.  The semialdehyde dehydrogenase family is found
           in N-acetyl-glutamine semialdehyde dehydrogenase (AgrC),
           which is involved in arginine biosynthesis, and
           aspartate-semialdehyde dehydrogenase, an enzyme involved
           in the biosynthesis of various amino acids from
           aspartate. This family is also found in yeast and fungal
           Arg5,6 protein, which is cleaved into the enzymes
           N-acety-gamma-glutamyl-phosphate reductase and
           acetylglutamate kinase. These are also involved in
           arginine biosynthesis. All proteins in this entry
           contain a NAD binding region of semialdehyde
           dehydrogenase.
          Length = 123

 Score =  118 bits (297), Expect = 1e-32
 Identities = 52/125 (41%), Positives = 70/125 (56%), Gaps = 11/125 (8%)

Query: 41  SVAVVGVTGAVGQEFLSVLSD-RDFPYRSIKMLASKRSAGKQLSF---QDKAYTVEELTE 96
            VA+VG TG VGQE L +L++  DF   ++   AS RSAGK++S      K   V EL  
Sbjct: 1   KVAIVGATGYVGQELLRLLAEHPDFELTAL--AASSRSAGKKVSEAGPHLKGEVVLELDP 58

Query: 97  DSFD--GVDIALFSAGGSISK---KFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPE 151
             F+   VDI   +    +SK      P A   G++V+D SSAFRM ++VP  +PEVNPE
Sbjct: 59  PDFEELAVDIVFLALPHGVSKESAPLLPRAAAAGAVVIDLSSAFRMDDDVPYGLPEVNPE 118

Query: 152 AMSGI 156
           A+   
Sbjct: 119 AIKKA 123


>gnl|CDD|130806 TIGR01745, asd_gamma, aspartate-semialdehyde dehydrogenase,
           gamma-proteobacterial.  [Amino acid biosynthesis,
           Aspartate family].
          Length = 366

 Score =  102 bits (255), Expect = 2e-24
 Identities = 95/358 (26%), Positives = 152/358 (42%), Gaps = 39/358 (10%)

Query: 42  VAVVGVTGAVGQEFLS-VLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTE-DSF 99
           V +VG  G VG   +  +  +RDF        ++ +      SF     T+++  + D+ 
Sbjct: 3   VGLVGWRGMVGSVLMQRMQEERDFDAIRPVFFSTSQLGQAAPSFGGTTGTLQDAFDIDAL 62

Query: 100 DGVDIALFSAGGSISKKFGPIAVEKG--SIVVDNSSAFRMVENVPLVIPEVNPEAMSGIK 157
             +DI +   GG  + +  P   E G     +D +S+ RM ++  +++  VN +    I 
Sbjct: 63  KALDIIITCQGGDYTNEIYPKLRESGWQGYWIDAASSLRMKDDAVIILDPVNQDV---IT 119

Query: 158 VGMGKGA-LIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQT- 215
            G+  G       NC+  + LM+   L     V  + V+TYQAASG GA  M EL  Q  
Sbjct: 120 DGLNNGIRTFVGGNCTVSLMLMSLGGLFANDLVEWVSVATYQAASGGGARHMRELLTQMG 179

Query: 216 ---REVLE--GKPPTC------KIFSQQYAFNLFSHNAPV-------------LENGYNE 251
                V +    P +       K+     +  L   N  V             L+NG + 
Sbjct: 180 HLYGHVEDELATPSSAILDIERKVTKLTRSGELPVDNFGVPLAGSLIPWIDKQLDNGQSR 239

Query: 252 EEMKMVKETRKIWNDKDV-RVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNA-P 309
           EE K   ET KI        V   C+R+  +R H+++  ++ +K +  +T  +I++   P
Sbjct: 240 EEWKGQAETNKILGTSSTIPVDGLCVRIGALRCHSQAFTIKLKKDVSLETIEEIIRAHNP 299

Query: 310 GV-VVIDDRASNHFP-TPLEVSNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAA 365
            V VV +DR       TP  V+    + VGR+R+     G   L  F  GDQ+  GAA
Sbjct: 300 WVKVVPNDREITMRELTPAAVTGTLTIPVGRLRK--LNMGPEYLSAFTVGDQLLWGAA 355


>gnl|CDD|235839 PRK06598, PRK06598, aspartate-semialdehyde dehydrogenase; Reviewed.
          Length = 369

 Score =  101 bits (255), Expect = 3e-24
 Identities = 110/376 (29%), Positives = 156/376 (41%), Gaps = 74/376 (19%)

Query: 42  VAVVGVTGAVGQEFLSVLSDR-----DF----PY---RSIKMLASKRSAGKQLSFQDKAY 89
           V  VG  G VG    SVL  R     DF    P     S    A+    GK+ + QD A+
Sbjct: 4   VGFVGWRGMVG----SVLMQRMVEENDFDLIEPVFFSTSQAGGAAPSFGGKEGTLQD-AF 58

Query: 90  TVEELTEDSFDGVDIALFSAGGSISKKFGPIAVEKG--SIVVDNSSAFRMVENVPLVIPE 147
            ++ L +     +DI +   GG  + +  P     G     +D +S  RM ++  +++  
Sbjct: 59  DIDALKK-----LDIIITCQGGDYTNEVYPKLRAAGWQGYWIDAASTLRMKDDAIIILDP 113

Query: 148 VNPEAMSGIKVGMGKGALIAN---PNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAG 204
           VN      I   +  G  +      NC+  + LMA   L +   V  + V TYQAASGAG
Sbjct: 114 VN---RDVIDDALANG--VKTFVGGNCTVSLMLMALGGLFKNDLVEWVSVMTYQAASGAG 168

Query: 205 AAAMEEL-------------ELQT---------REVLE----GKPPTCKIFSQQYAFNLF 238
           A  M EL             EL           R+V E    G  PT   F    A +L 
Sbjct: 169 ARNMRELLTQMGALHGAVADELADPASAILDIDRKVTELMRSGDLPT-DNFGVPLAGSLI 227

Query: 239 SHNAP----VLENGYNEEEMKMVKETRKIWN--DKDVRVTATCIRVPVMRAHAESVNLQF 292
               P     L NG + EE K   ET KI       + V   C+RV  MR H++++ ++ 
Sbjct: 228 ----PWIDKDLGNGQSREEWKGQAETNKILGLTKNPIPVDGLCVRVGAMRCHSQALTIKL 283

Query: 293 EKPLDEDTARDILKNA-PGVVVI-DDR-ASNHFPTPLEVSNKDDVAVGRIRRDVSQDGNH 349
           +K +      +IL    P V V+ +DR A+    TP  V+    + VGR+R+     G  
Sbjct: 284 KKDVPLAEIEEILAAHNPWVKVVPNDREATMRELTPAAVTGTLTIPVGRLRK--LNMGPE 341

Query: 350 GLDIFVCGDQVRKGAA 365
            L  F  GDQ+  GAA
Sbjct: 342 YLSAFTVGDQLLWGAA 357


>gnl|CDD|223081 COG0002, ArgC, Acetylglutamate semialdehyde dehydrogenase [Amino
           acid transport and metabolism].
          Length = 349

 Score = 65.0 bits (159), Expect = 1e-11
 Identities = 76/365 (20%), Positives = 127/365 (34%), Gaps = 71/365 (19%)

Query: 39  APSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLS--------FQDKAYT 90
              V +VG +G  G E L +L+    P   + +++S+  AGK +S          D  + 
Sbjct: 2   MIKVGIVGASGYTGLELLRLLAGH--PDVELILISSRERAGKPVSDVHPNLRGLVDLPFQ 59

Query: 91  VEELTEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVE--------NVP 142
             +  +   D  D+   +    +S +  P  +E G  V+D S+ FR+ +           
Sbjct: 60  TIDPEKIELDECDVVFLALPHGVSAELVPELLEAGCKVIDLSADFRLKDPEVYEKWYGFT 119

Query: 143 LVIPEVNPEAMSGI------KVGMGKGA-LIANPNCSTIICLMAATPLHRRAKVTRM-VV 194
              PE+  +A+ G+      K+   +GA LIANP C     ++A  PL  +A +      
Sbjct: 120 HAGPELLEDAVYGLPELHREKI---RGAKLIANPGCYPTAAILALAPL-VKAGLLDPDSP 175

Query: 195 STYQAASGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEM 254
               A SG   A              G+  +          N F      L   Y     
Sbjct: 176 PIVDAKSGVSGA--------------GRKAS--------VKNHFPEVNDSL-RPYGLTGH 212

Query: 255 KMVKETR---KIWNDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDIL----KN 307
           +   E          + V V  T    P +R    ++ L+ +  +  +            
Sbjct: 213 RHTPEIEQHLGRLAGRKVPVIFTPHLGPFVRGILATIYLKLKDLVTLEELHAAYEEFYAG 272

Query: 308 APGVVVIDDRASNHFPTPLEV--SNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAA 365
            P V V+ +     +P    V  SN  D+        V +     + +    D + KGAA
Sbjct: 273 EPFVRVVPEGG---YPDTKAVAGSNFCDIGFA-----VDERTGR-VVVVSAIDNLVKGAA 323

Query: 366 LNAVQ 370
             AVQ
Sbjct: 324 GQAVQ 328


>gnl|CDD|234761 PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase;
           Validated.
          Length = 343

 Score = 58.2 bits (142), Expect = 2e-09
 Identities = 52/198 (26%), Positives = 78/198 (39%), Gaps = 44/198 (22%)

Query: 41  SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLS------FQDKAYTVEEL 94
            V +VG +G  G E L +L +   P   I  + S+ SAGK LS             +E L
Sbjct: 4   KVGIVGASGYTGGELLRLLLNH--PEVEIVAVTSRSSAGKPLSDVHPHLRGLVDLVLEPL 61

Query: 95  TEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMV-------------ENV 141
             +   G D+   +    +S    P  +E G  V+D S+ FR+                 
Sbjct: 62  DPEILAGADVVFLALPHGVSMDLAPQLLEAGVKVIDLSADFRLKDPEVYEKWYGFEHAAP 121

Query: 142 PLV------IPEVNPEAMSGIKVGMGKGA-LIANPNC---STIICLMAATPL--HRRAKV 189
            L+      +PE+N E +        KGA LIANP C   ++++ L    PL        
Sbjct: 122 ELLKEAVYGLPELNREEI--------KGARLIANPGCYPTASLLALA---PLLKAGLIDP 170

Query: 190 TRMVVSTYQAASGAGAAA 207
             +++      SGAG  A
Sbjct: 171 DSIIIDAKSGVSGAGRKA 188


>gnl|CDD|233597 TIGR01850, argC, N-acetyl-gamma-glutamyl-phosphate reductase,
           common form.  This model represents the more common of
           two related families of
           N-acetyl-gamma-glutamyl-phosphate reductase, an enzyme
           catalyzing the third step or Arg biosynthesis from Glu.
           The two families differ by phylogeny, similarity
           clustering, and the gap architecture in a multiple
           sequence alignment. Bacterial members of this family
           tend to be found within Arg biosynthesis operons [Amino
           acid biosynthesis, Glutamate family].
          Length = 346

 Score = 56.4 bits (137), Expect = 7e-09
 Identities = 85/366 (23%), Positives = 137/366 (37%), Gaps = 80/366 (21%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSIKML-ASKRSAGKQLSF---QDKAYTVEELTED 97
           VA+VG +G  G E L +L +   P   I  L +S+ SAGK +S      +      L   
Sbjct: 3   VAIVGASGYTGGELLRLLLNH--PEVEITYLVSSRESAGKPVSEVHPHLRGLVDLNLEPI 60

Query: 98  SFD----GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRM---------------- 137
             +      D+   +    +S +  P  +  G  V+D S+ FR+                
Sbjct: 61  DVEEILEDADVVFLALPHGVSAELAPELLAAGVKVIDLSADFRLKDPELYEKWYGFEHAG 120

Query: 138 ---VENVPLVIPEVNPEAMSGIKVGMGKGA-LIANPNCSTIICLMAATPL--HRRAKVTR 191
              ++     +PE++ E +        KGA LIANP C     L+A  PL        T 
Sbjct: 121 PELLQKAVYGLPELHREEI--------KGARLIANPGCYPTATLLALAPLLKEGLIDPTS 172

Query: 192 MVVSTYQAASGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSH-NAPVLENGYN 250
           ++V      SGAG  A E       EV E   P          + +  H + P       
Sbjct: 173 IIVDAKSGVSGAGRKASEANHFP--EVNENLRP----------YKVTGHRHTP------- 213

Query: 251 EEEMKMVKETRKIWNDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDIL----K 306
               ++ +E  +      V+V+ T   VP+ R    ++  + +  L E+  R +      
Sbjct: 214 ----EIEQELGR-LAGGKVKVSFTPHLVPMTRGILATIYAKLKDGLTEEDLRALYEEFYA 268

Query: 307 NAPGVVVIDDRASNHFPTPLEV--SNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGA 364
           + P V V+ +     +P+   V  SN  D+        V +     + +    D + KGA
Sbjct: 269 DEPFVRVLPEGG---YPSTKAVIGSNFCDIGF-----AVDERTGR-VVVVSAIDNLVKGA 319

Query: 365 ALNAVQ 370
           A  AVQ
Sbjct: 320 AGQAVQ 325


>gnl|CDD|235883 PRK06901, PRK06901, aspartate-semialdehyde dehydrogenase;
           Provisional.
          Length = 322

 Score = 55.1 bits (133), Expect = 2e-08
 Identities = 57/219 (26%), Positives = 101/219 (46%), Gaps = 21/219 (9%)

Query: 82  LSFQDKAYTVEELTEDSFDGVDIA-LFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVEN 140
           + F +KA  VE++  +  +  D   +F AG     +    A E G IV+D       + N
Sbjct: 46  IRFNNKA--VEQIAPEEVEWADFNYVFFAGKMAQAEHLAQAAEAGCIVIDLYGICAALAN 103

Query: 141 VPLVIPEVNPEAMSGIKVGMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAA 200
           VP+V+P VN E ++ ++    +  +++ P+       +A  P  +   ++++ V++   A
Sbjct: 104 VPVVVPSVNDEQLAELR----QRNIVSLPDPQVSQLALALAPFLQEQPLSQIFVTSLLPA 159

Query: 201 SGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKET 260
           S   A  +++L  QT  +L G P   +   Q+ AF++F  NA  LE            + 
Sbjct: 160 SYTDAETVKKLAGQTARLLNGIPLDEE--EQRLAFDVFPANAQNLEL-----------QL 206

Query: 261 RKIWNDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDED 299
           +KI+   +  VT   I+VPV    A+ V    E  LD +
Sbjct: 207 QKIFPQLE-NVTFHSIQVPVFYGLAQMVTALSEYELDIE 244


>gnl|CDD|215522 PLN02968, PLN02968, Probable N-acetyl-gamma-glutamyl-phosphate
           reductase.
          Length = 381

 Score = 42.5 bits (100), Expect = 2e-04
 Identities = 52/236 (22%), Positives = 84/236 (35%), Gaps = 41/236 (17%)

Query: 2   ATFSSHQTQTHFISKLPANKPRTKPMFTRVRMSYQESAPSVAVVGVTGAVGQEFLSVLSD 61
              S          +          + +       E    + V+G +G  G E   +L++
Sbjct: 3   FADSVGSKGLAS--RASVTSSPQSVVSSASSSVKSEEKKRIFVLGASGYTGAEVRRLLAN 60

Query: 62  RDFPYRSIKMLASKRSAGKQ-------LSFQDKAYTVEELTEDSFDGVDIALF-----SA 109
              P   I ++ + R AG+        L  QD    V     D F  VD A+F       
Sbjct: 61  H--PDFEITVMTADRKAGQSFGSVFPHLITQDLPNLVAVKDAD-FSDVD-AVFCCLPHGT 116

Query: 110 GGSISKKFGPIAVEKGSIVVDNSSAFRMVE--------NVPLVIPEVNPEAMSGI----K 157
              I K     A+ K   +VD S+ FR+ +          P   PE+  EA+ G+    +
Sbjct: 117 TQEIIK-----ALPKDLKIVDLSADFRLRDIAEYEEWYGHPHRAPELQKEAVYGLTELQR 171

Query: 158 VGMGKGALIANPNC--STIICLMAATPLHRRA--KVTRMVVSTYQAASGAGAAAME 209
             +    L+ANP C  + I   +   PL +    +   +++      SGAG  A E
Sbjct: 172 EEIKSARLVANPGCYPTGIQLPL--VPLVKAGLIEPDNIIIDAKSGVSGAGRGAKE 225


>gnl|CDD|179786 PRK04207, PRK04207, glyceraldehyde-3-phosphate dehydrogenase;
           Provisional.
          Length = 341

 Score = 35.6 bits (83), Expect = 0.033
 Identities = 13/50 (26%), Positives = 29/50 (58%)

Query: 267 KDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDD 316
            D+ +T   ++VP    H  SVN++ +KP+ ++   + L+N P ++++  
Sbjct: 203 PDLDITTMAVKVPTTLMHMHSVNVELKKPVTKEEVLEALENTPRILLVRA 252


>gnl|CDD|233598 TIGR01851, argC_other, N-acetyl-gamma-glutamyl-phosphate reductase,
           uncommon form.  This model represents the less common of
           two related families of
           N-acetyl-gamma-glutamyl-phosphate reductase, an enzyme
           catalyzing the third step or Arg biosynthesis from Glu.
           The two families differ by phylogeny, similarity
           clustering, and gap architecture in a multiple sequence
           alignment [Amino acid biosynthesis, Glutamate family].
          Length = 310

 Score = 33.7 bits (77), Expect = 0.13
 Identities = 38/172 (22%), Positives = 63/172 (36%), Gaps = 20/172 (11%)

Query: 39  APSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDS 98
           AP V + G  G  G +    LS RD     + +   +R        +D A   + L    
Sbjct: 1   APKVFIDGEAGTTGLQIRERLSGRD-DIELLSIAPDRR--------KDAAERAKLL---- 47

Query: 99  FDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKV 158
            +  D+A+       +++   +     + ++D S+A+R  ++     PE+ P     I+ 
Sbjct: 48  -NAADVAILCLPDDAAREAVSLVDNPNTCIIDASTAYRTADDWAYGFPELAPGQREKIR- 105

Query: 159 GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASG--AGAAAM 208
                  IANP C     +    PL     +      T  A SG   G  AM
Sbjct: 106 ---NSKRIANPGCYPTGFIALMRPLVEAGILPADFPITINAVSGYSGGGKAM 154


>gnl|CDD|238769 cd01492, Aos1_SUMO, Ubiquitin activating enzyme (E1) subunit Aos1.
           Aos1 is part of the heterodimeric activating enzyme
           (E1), specific for the SUMO family of ubiquitin-like
           proteins (Ubls). E1 enzymes are part of a conjugation
           cascade to attach Ub or Ubls, covalently to substrate
           proteins consisting of activating (E1), conjugating
           (E2), and/or ligating (E3) enzymes. E1 activates
           ubiquitin by C-terminal adenylation, and subsequently
           forms a highly reactive thioester bond between its
           catalytic cysteine and Ubls C-terminus. The E1 also
           associates with E2 and promotes ubiquitin transfer to
           the E2's catalytic cysteine. Post-translational
           modification by SUMO family of ubiquitin-like proteins
           (Ublps) is involved in cell division, nuclear transport,
           the stress response and signal transduction. Aos1
           contains part of the adenylation domain.
          Length = 197

 Score = 33.0 bits (76), Expect = 0.13
 Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 2/41 (4%)

Query: 39  APSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAG 79
           AP  AVVG  G + Q+ ++ LS R+ P  +  +   + S  
Sbjct: 156 APVAAVVG--GILAQDVINALSKRESPLNNFFVFDGETSEA 194


>gnl|CDD|187554 cd05243, SDR_a5, atypical (a) SDRs, subgroup 5.  This subgroup
           contains atypical SDRs, some of which are identified as
           putative NAD(P)-dependent epimerases, one as a putative
           NAD-dependent epimerase/dehydratase. Atypical SDRs are
           distinct from classical SDRs. Members of this subgroup
           have a glycine-rich NAD(P)-binding motif that is very
           similar to the extended SDRs, GXXGXXG, and binds NADP.
           Generally, this subgroup has poor conservation of the
           active site tetrad; however, individual sequences do
           contain matches to the YXXXK active site motif, the
           upstream Ser, and there is a highly conserved Asp in
           place of the usual active site Asn throughout the
           subgroup. Atypical SDRs generally lack the catalytic
           residues characteristic of the SDRs, and their
           glycine-rich NAD(P)-binding motif is often different
           from the forms normally seen in classical or extended
           SDRs. Atypical SDRs include biliverdin IX beta reductase
           (BVR-B,aka flavin reductase), NMRa (a negative
           transcriptional regulator of various fungi),
           progesterone 5-beta-reductase like proteins,
           phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 203

 Score = 32.6 bits (75), Expect = 0.21
 Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 10/88 (11%)

Query: 41  SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVE-ELTEDS- 98
            V VVG TG VG+  +  L DR +  R+   L    S  ++L     A  V  +LT+   
Sbjct: 1   KVLVVGATGKVGRHVVRELLDRGYQVRA---LVRDPSQAEKL-EAAGAEVVVGDLTDAES 56

Query: 99  ----FDGVDIALFSAGGSISKKFGPIAV 122
                +G+D  + +AG          AV
Sbjct: 57  LAAALEGIDAVISAAGSGGKGGPRTEAV 84


>gnl|CDD|130609 TIGR01546, GAPDH-II_archae, glyceraldehyde-3-phosphate
           dehydrogenase, type II.  This model describes the type
           II glyceraldehyde-3-phosphate dehydrogenases which are
           limited to archaea. These enzymes catalyze the
           interconversion of 1,3-diphosphoglycerate and
           glyceraldehyde-3-phosphate, a central step in glycolysis
           and gluconeogenesis. In archaea, either NAD or NADP may
           be utilized as the cofactor. The class I GAPDH's from
           bacteria and eukaryotes are covered by TIGR01534. All of
           the members of the seed are characterized. See, for
           instance. This model is very solid, there are no species
           falling between trusted and noise at this time. The
           closest relatives scoring in the noise are the class I
           GAPDH's.
          Length = 333

 Score = 32.9 bits (75), Expect = 0.21
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 268 DVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVID 315
           ++ +      VP    H  S+ ++ +KP+ +D   DIL+N P V++ +
Sbjct: 201 NLNIETMAFVVPTTLMHVHSIMVELKKPVTKDDIIDILENTPRVLLFE 248


>gnl|CDD|181721 PRK09242, PRK09242, tropinone reductase; Provisional.
          Length = 257

 Score = 31.3 bits (71), Expect = 0.62
 Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 13/87 (14%)

Query: 43  AVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAY------------- 89
           A  G+  A+ +EFL + +D     R    LA  R    +   + + +             
Sbjct: 17  ASKGIGLAIAREFLGLGADVLIVARDADALAQARDELAEEFPEREVHGLAADVSDDEDRR 76

Query: 90  TVEELTEDSFDGVDIALFSAGGSISKK 116
            + +  ED +DG+ I + +AGG+I K 
Sbjct: 77  AILDWVEDHWDGLHILVNNAGGNIRKA 103


>gnl|CDD|130833 TIGR01772, MDH_euk_gproteo, malate dehydrogenase, NAD-dependent.
           This model represents the NAD-dependent malate
           dehydrogenase found in eukaryotes and certain gamma
           proteobacteria. The enzyme is involved in the citric
           acid cycle as well as the glyoxalate cycle. Several
           isoforms exidt in eukaryotes. In S. cereviseae, for
           example, there are cytoplasmic, mitochondrial and
           peroxisomal forms. Although malate dehydrogenases have
           in some cases been mistaken for lactate dehydrogenases
           due to the similarity of these two substrates and the
           apparent ease with which evolution can toggle these
           activities, critical residues have been identified which
           can discriminate between the two activities. At the time
           of the creation of this model no hits above the trusted
           cutoff contained critical residues typical of lactate
           dehydrogenases [Energy metabolism, TCA cycle].
          Length = 312

 Score = 31.2 bits (71), Expect = 0.87
 Identities = 35/140 (25%), Positives = 61/140 (43%), Gaps = 25/140 (17%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAG--KQLSFQD-----KAYTVEEL 94
           VAV+G  G +GQ  LS+L         + +     +AG    LS        K ++ EE 
Sbjct: 2   VAVLGAAGGIGQP-LSLLLKLQPYVSELSLYDIAGAAGVAADLSHIPTAASVKGFSGEEG 60

Query: 95  TEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMS 154
            E++  G D+ +  AG  + +K G   + +  +   N+    +V+++   + E  P+AM 
Sbjct: 61  LENALKGADVVVIPAG--VPRKPG---MTRDDLFNVNAG---IVKDLVAAVAESCPKAMI 112

Query: 155 GIKVGMGKGALIANPNCSTI 174
            +         I NP  ST+
Sbjct: 113 LV---------ITNPVNSTV 123


>gnl|CDD|240360 PTZ00325, PTZ00325, malate dehydrogenase; Provisional.
          Length = 321

 Score = 31.2 bits (71), Expect = 0.94
 Identities = 25/77 (32%), Positives = 35/77 (45%), Gaps = 10/77 (12%)

Query: 42  VAVVGVTGAVGQEFLSVLSDRDFPYRS-IKMLASKRSAG--KQLSFQD-----KAYTVEE 93
           VAV+G  G +GQ  LS+L  +  P+ S + +     + G    LS  D       Y   E
Sbjct: 11  VAVLGAAGGIGQP-LSLLL-KQNPHVSELSLYDIVGAPGVAADLSHIDTPAKVTGYADGE 68

Query: 94  LTEDSFDGVDIALFSAG 110
           L E +  G D+ L  AG
Sbjct: 69  LWEKALRGADLVLICAG 85


>gnl|CDD|217227 pfam02780, Transketolase_C, Transketolase, C-terminal domain.  The
           C-terminal domain of transketolase has been proposed as
           a regulatory molecule binding site.
          Length = 124

 Score = 29.1 bits (66), Expect = 1.6
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 285 AESVNLQFEKPLDEDTARDILKNAPGVVVIDD 316
           AE ++L+  KPLDEDT  + +K    +VV+++
Sbjct: 38  AEVIDLRTVKPLDEDTILESVKKTGRLVVVEE 69


>gnl|CDD|205225 pfam13044, DUF3904, Protein of unknown function (DUF3904).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in viruses. Proteins in this
           family are typically between 437 and 448 amino acids in
           length.
          Length = 436

 Score = 30.0 bits (67), Expect = 2.3
 Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 7/37 (18%)

Query: 331 KDDVAVGRIR-------RDVSQDGNHGLDIFVCGDQV 360
           K D+ +G+I+        D+S+ G  GL +F+CG   
Sbjct: 107 KHDIVLGKIKVSEFGGDHDISKCGRKGLKVFICGGTT 143


>gnl|CDD|187569 cd05259, PCBER_SDR_a, phenylcoumaran benzylic ether reductase
           (PCBER) like, atypical (a) SDRs.  PCBER and
           pinoresinol-lariciresinol reductases are NADPH-dependent
           aromatic alcohol reductases, and are atypical members of
           the SDR family. Other proteins in this subgroup are
           identified as eugenol synthase. These proteins contain
           an N-terminus characteristic of NAD(P)-binding proteins
           and a small C-terminal domain presumed to be involved in
           substrate binding, but they do not have the conserved
           active site Tyr residue typically found in SDRs.
           Numerous other members have unknown functions. The
           glycine rich NADP-binding motif in this subgroup is of 2
           forms: GXGXXG and G[GA]XGXXG; it tends to be atypical
           compared with the forms generally seen in classical or
           extended SDRs. The usual SDR active site tetrad is not
           present, but a critical active site Lys at the usual SDR
           position has been identified in various members, though
           other charged and polar residues are found at this
           position in this subgroup. Atypical SDR-related proteins
           retain the Rossmann fold of the SDRs, but have limited
           sequence identity and generally lack the catalytic
           properties of the archetypical members. Atypical SDRs
           include biliverdin IX beta reductase (BVR-B,aka flavin
           reductase), NMRa (a negative transcriptional regulator
           of various fungi), progesterone 5-beta-reductase like
           proteins, phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 282

 Score = 29.6 bits (67), Expect = 2.5
 Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 10/71 (14%)

Query: 41  SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTV-------EE 93
            +A+ G TG +G   +S L     P  ++ +L    S             V       E 
Sbjct: 1   KIAIAGATGTLGGPIVSALLAS--PGFTVTVLTRPSSTSSNEFQPSGVKVVPVDYASHES 58

Query: 94  LTEDSFDGVDI 104
           L   +  GVD 
Sbjct: 59  LV-AALKGVDA 68


>gnl|CDD|235699 PRK06110, PRK06110, hypothetical protein; Provisional.
          Length = 322

 Score = 29.2 bits (66), Expect = 3.9
 Identities = 15/28 (53%), Positives = 17/28 (60%), Gaps = 1/28 (3%)

Query: 195 STYQAASGAGAAAMEELELQTREVLEGK 222
            T+  A GAGAAA+    LQ RE L GK
Sbjct: 270 DTHNVAEGAGAAALAAA-LQERERLAGK 296


>gnl|CDD|235470 PRK05444, PRK05444, 1-deoxy-D-xylulose-5-phosphate synthase;
           Provisional.
          Length = 580

 Score = 28.9 bits (66), Expect = 5.7
 Identities = 9/29 (31%), Positives = 17/29 (58%)

Query: 288 VNLQFEKPLDEDTARDILKNAPGVVVIDD 316
           V+ +F KPLDE+   ++      VV +++
Sbjct: 491 VDARFVKPLDEELLLELAAKHDLVVTVEE 519


>gnl|CDD|116361 pfam07747, MTH865, MTH865-like family.  This domain has an
          EF-hand like fold.
          Length = 75

 Score = 26.7 bits (59), Expect = 6.0
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 54 EFLSVLSDRDFPYRSIKMLASK 75
          E   +L+D DFP++S + LA  
Sbjct: 47 ELGKLLTDADFPFKSAEDLADT 68


>gnl|CDD|224076 COG1154, Dxs, Deoxyxylulose-5-phosphate synthase [Coenzyme
           metabolism / Lipid metabolism].
          Length = 627

 Score = 28.7 bits (65), Expect = 6.0
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 288 VNLQFEKPLDEDTARDILKNAPGVVVIDD 316
           V+ +F KPLDE    ++ K+   VV +++
Sbjct: 533 VDPRFVKPLDEALLLELAKSHDLVVTLEE 561


>gnl|CDD|223135 COG0057, GapA, Glyceraldehyde-3-phosphate
           dehydrogenase/erythrose-4-phosphate dehydrogenase
           [Carbohydrate transport and metabolism].
          Length = 335

 Score = 28.3 bits (64), Expect = 6.5
 Identities = 10/42 (23%), Positives = 17/42 (40%)

Query: 268 DVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAP 309
             ++T   IRVP        + ++ EK +  +     LK A 
Sbjct: 224 KGKLTGMAIRVPTPNVSVVDLTVELEKEVTVEEINAALKAAS 265


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.131    0.371 

Gapped
Lambda     K      H
   0.267   0.0807    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,959,292
Number of extensions: 1840845
Number of successful extensions: 1797
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1735
Number of HSP's successfully gapped: 51
Length of query: 376
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 278
Effective length of database: 6,590,910
Effective search space: 1832272980
Effective search space used: 1832272980
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.7 bits)