RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 017153
(376 letters)
>gnl|CDD|178009 PLN02383, PLN02383, aspartate semialdehyde dehydrogenase.
Length = 344
Score = 689 bits (1779), Expect = 0.0
Identities = 291/344 (84%), Positives = 319/344 (92%)
Query: 33 MSYQESAPSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVE 92
M+ E+ PSVA+VGVTGAVGQEFLSVL+DRDFPY S+KMLAS RSAGK+++F+ + YTVE
Sbjct: 1 MALTENGPSVAIVGVTGAVGQEFLSVLTDRDFPYSSLKMLASARSAGKKVTFEGRDYTVE 60
Query: 93 ELTEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEA 152
ELTEDSFDGVDIALFSAGGSISKKFGPIAV+KG++VVDNSSAFRM E VPLVIPEVNPEA
Sbjct: 61 ELTEDSFDGVDIALFSAGGSISKKFGPIAVDKGAVVVDNSSAFRMEEGVPLVIPEVNPEA 120
Query: 153 MSGIKVGMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELE 212
M IK+G GKGALIANPNCSTIICLMA TPLHR AKV RMVVSTYQAASGAGAAAMEELE
Sbjct: 121 MKHIKLGKGKGALIANPNCSTIICLMAVTPLHRHAKVKRMVVSTYQAASGAGAAAMEELE 180
Query: 213 LQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVT 272
QTREVLEGKPPTC IF+QQYAFNLFSHNAP+ ENGYNEEEMK+VKETRKIWND DV+VT
Sbjct: 181 QQTREVLEGKPPTCNIFAQQYAFNLFSHNAPMQENGYNEEEMKLVKETRKIWNDDDVKVT 240
Query: 273 ATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEVSNKD 332
ATCIRVPVMRAHAES+NLQFEKPLDE TAR+IL +APGV +IDDRA+N FPTPL+ SNKD
Sbjct: 241 ATCIRVPVMRAHAESINLQFEKPLDEATAREILASAPGVKIIDDRANNRFPTPLDASNKD 300
Query: 333 DVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
DVAVGRIR+D+SQDGN GLDIFVCGDQ+RKGAALNAVQIAE+LL
Sbjct: 301 DVAVGRIRQDISQDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 344
>gnl|CDD|237845 PRK14874, PRK14874, aspartate-semialdehyde dehydrogenase;
Provisional.
Length = 334
Score = 499 bits (1288), Expect = e-179
Identities = 188/337 (55%), Positives = 246/337 (72%), Gaps = 8/337 (2%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFDG 101
VAVVG TGAVG+E L++L +R+FP +++LAS RSAGK+LSF+ K VE+LT F G
Sbjct: 4 VAVVGATGAVGREMLNILEERNFPVDKLRLLASARSAGKELSFKGKELKVEDLTTFDFSG 63
Query: 102 VDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGMG 161
VDIALFSAGGS+SKK+ P A G++V+DNSSAFRM +VPLV+PEVNPEA++ +
Sbjct: 64 VDIALFSAGGSVSKKYAPKAAAAGAVVIDNSSAFRMDPDVPLVVPEVNPEALAEHR---K 120
Query: 162 KGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLEG 221
KG +IANPNCSTI ++A PLH A + R+VVSTYQA SGAG A MEEL QTR VL
Sbjct: 121 KG-IIANPNCSTIQMVVALKPLHDAAGIKRVVVSTYQAVSGAGKAGMEELFEQTRAVLNA 179
Query: 222 K--PPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTATCIRVP 279
P K F + AFN+ H +++GY +EEMKMV ET+KI D D++V+ATC+RVP
Sbjct: 180 AVDPVEPKKFPKPIAFNVIPHIDVFMDDGYTKEEMKMVNETKKILGDPDLKVSATCVRVP 239
Query: 280 VMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEVSNKDDVAVGRI 339
V H+ESVN++FE+P+ + AR+IL APGVV++DD + +PTPLE KD VGRI
Sbjct: 240 VFTGHSESVNIEFEEPISVEEAREILAEAPGVVLVDDPENGGYPTPLEAVGKDATFVGRI 299
Query: 340 RRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
R+D++ + +GL ++V D +RKGAALNAVQIAE+L+
Sbjct: 300 RKDLTVE--NGLHLWVVSDNLRKGAALNAVQIAELLI 334
>gnl|CDD|223214 COG0136, Asd, Aspartate-semialdehyde dehydrogenase [Amino acid
transport and metabolism].
Length = 334
Score = 396 bits (1020), Expect = e-138
Identities = 169/340 (49%), Positives = 224/340 (65%), Gaps = 14/340 (4%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQ-LSFQDKAYTVEELTEDSFD 100
VAV+G TGAVGQ L +L +R FP+ + +LAS RSAGK+ + F K+ V E D F
Sbjct: 4 VAVLGATGAVGQVLLELLEERHFPFEELVLLASARSAGKKYIEFGGKSIGVPEDAADEFV 63
Query: 101 --GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKV 158
VDI F+AGGS+SK+ P A E G +V+DNSSAFRM +VPLV+PEVNPE + +
Sbjct: 64 FSDVDIVFFAAGGSVSKEVEPKAAEAGCVVIDNSSAFRMDPDVPLVVPEVNPEHLIDYQ- 122
Query: 159 GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREV 218
+G +IANPNCSTI ++A PLH + R+VVSTYQA SGAGA EL QT +
Sbjct: 123 --KRGFIIANPNCSTIQLVLALKPLHDAFGIKRVVVSTYQAVSGAGAEGGVELAGQTDAL 180
Query: 219 LEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTATCIRV 278
L G P AFN+ H L+NGY +EE K+ ETRKI D D++V+ATC+RV
Sbjct: 181 LNGIPILPI--GYPLAFNVIPHIDGFLDNGYTKEEWKIEAETRKILGDPDIKVSATCVRV 238
Query: 279 PVMRAHAESVNLQFEKPLDEDTAR-DILKNAPGVVVIDDRASNHFP-TPLEVSNKDDVAV 336
PV H+E+V ++F+K +D + R ++L +APGVVV+D+ P TPL+ + D+V+V
Sbjct: 239 PVFYGHSEAVTVEFKKDVDPEEIREELLPSAPGVVVVDNP--EDRPQTPLDATGGDEVSV 296
Query: 337 GRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
GRIR+D+S GL ++V GD +RKGAALNAV IAE+LL
Sbjct: 297 GRIRKDLSGP--EGLKLWVVGDNLRKGAALNAVLIAELLL 334
>gnl|CDD|233347 TIGR01296, asd_B, aspartate-semialdehyde dehydrogenase
(peptidoglycan organisms). Two closely related families
of aspartate-semialdehyde dehydrogenase are found. They
differ by a deep split in phylogenetic and percent
identity trees and in gap patterns. This model
represents a branch more closely related to the USG-1
protein than to the other aspartate-semialdehyde
dehydrogenases represented in model TIGR00978 [Amino
acid biosynthesis, Aspartate family].
Length = 338
Score = 388 bits (999), Expect = e-135
Identities = 178/341 (52%), Positives = 236/341 (69%), Gaps = 12/341 (3%)
Query: 41 SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFD 100
+VA+VG TGAVGQE L +L +R+FP + +LAS RSAG++L+F+ K VEE +SF+
Sbjct: 1 NVAIVGATGAVGQEMLKLLEERNFPIDKLVLLASARSAGRKLTFKGKELEVEEAETESFE 60
Query: 101 GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGM 160
G+DIALFSAGGS+SK+F P A + G IV+DN+SAFRM +VPLV+PEVN E +
Sbjct: 61 GIDIALFSAGGSVSKEFAPKAAKAGVIVIDNTSAFRMDPDVPLVVPEVNFEDLKEFN--- 117
Query: 161 GKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLE 220
+IANPNCSTI ++ PLH AK+ R+VVSTYQA SGAG A +EEL QT+ VLE
Sbjct: 118 -PKGIIANPNCSTIQMVVVLKPLHDEAKIKRVVVSTYQAVSGAGNAGVEELYNQTKAVLE 176
Query: 221 GK------PPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTAT 274
G P F Q AFN H +++GY +EE KM+ ETRKI D++V+AT
Sbjct: 177 GAEQLPYIQPKANKFPYQIAFNAIPHIDSFVDDGYTKEEQKMLFETRKIMGIPDLKVSAT 236
Query: 275 CIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEVSNKDDV 334
C+RVPV H+ES+N++FEK + + AR++LKNAPGV +IDD + N +PTPL D+V
Sbjct: 237 CVRVPVFTGHSESINIEFEKEISPEDARELLKNAPGVQLIDDPSGNLYPTPLAAVGVDEV 296
Query: 335 AVGRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEML 375
VGRIR+D+ +GL ++V D +RKGAALN+VQIAE+L
Sbjct: 297 FVGRIRKDLPDG--NGLHLWVVADNLRKGAALNSVQIAELL 335
>gnl|CDD|136022 PRK06728, PRK06728, aspartate-semialdehyde dehydrogenase;
Provisional.
Length = 347
Score = 255 bits (652), Expect = 2e-82
Identities = 139/348 (39%), Positives = 213/348 (61%), Gaps = 16/348 (4%)
Query: 37 ESAPSVAVVGVTGAVGQEFLSVL-SDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELT 95
E VAVVG TGAVGQ+ + +L + F + +L+SKRSAGK + F+ + ++E
Sbjct: 3 EKGYHVAVVGATGAVGQKIIELLEKETKFNIAEVTLLSSKRSAGKTVQFKGREIIIQEAK 62
Query: 96 EDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSG 155
+SF+GVDIA FSAGG +S++F AV G+IV+DN+S +RM +VPLV+PEVN +
Sbjct: 63 INSFEGVDIAFFSAGGEVSRQFVNQAVSSGAIVIDNTSEYRMAHDVPLVVPEVNAHTLKE 122
Query: 156 IKVGMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQT 215
K +IA PNCS + + A P+ + + R++VSTYQA SG+G A++EL+ Q
Sbjct: 123 HK------GIIAVPNCSALQMVTALQPIRKVFGLERIIVSTYQAVSGSGIHAIQELKEQA 176
Query: 216 REVLEGKPPTCKIF-----SQQY--AFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKD 268
+ +L G+ I + Y AFN+ +N + EE+KM++ET+KI D +
Sbjct: 177 KSILAGEEVESTILPAKKDKKHYPIAFNVLPQVDIFTDNDFTFEEVKMIQETKKILEDPN 236
Query: 269 VRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLEV 328
+++ ATC+RVPV+ H+ESV ++ EK +++L +APGV++ D+ + +P PL
Sbjct: 237 LKMAATCVRVPVISGHSESVYIELEKEATVAEIKEVLFDAPGVILQDNPSEQLYPMPLYA 296
Query: 329 SNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
K D VGRIR+D D +G +++ D + KGAA N+VQIAE ++
Sbjct: 297 EGKIDTFVGRIRKD--PDTPNGFHLWIVSDNLLKGAAWNSVQIAETMV 342
>gnl|CDD|168165 PRK05671, PRK05671, aspartate-semialdehyde dehydrogenase; Reviewed.
Length = 336
Score = 200 bits (511), Expect = 1e-61
Identities = 115/336 (34%), Positives = 179/336 (53%), Gaps = 11/336 (3%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFDG 101
+AVVG TG VG+ + +L +RDFP ++ +LAS SAG + F K V E+ F
Sbjct: 7 IAVVGATGTVGEALVQILEERDFPVGTLHLLASSESAGHSVPFAGKNLRVREVDSFDFSQ 66
Query: 102 VDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGMG 161
V +A F+AG ++S+ F A G V+D S A P V+PEVN E ++ +
Sbjct: 67 VQLAFFAAGAAVSRSFAEKARAAGCSVIDLSGALP-SAQAPNVVPEVNAERLASLA---- 121
Query: 162 KGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLEG 221
L+++P+ S + +A PL + R+ V+ A S G + EL QT E+L
Sbjct: 122 APFLVSSPSASAVALAVALAPLKGLLDIQRVQVTACLAVSSLGREGVSELARQTAELLNA 181
Query: 222 KPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKIWNDKDVRVTATCIRVPVM 281
+P + F +Q AFNL + G+ E ++V E R++ +++++ TCI+VPV
Sbjct: 182 RPLEPRFFDRQVAFNLLAQVGAPDAQGHTALERRLVAELRQLLGLPELKISVTCIQVPVF 241
Query: 282 RAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPLE-VSNKDDVAVGRIR 340
+ SV LQ P+D L+ APG+ +++ +PTP+ +D V VGR+R
Sbjct: 242 FGDSLSVALQSAAPVDLAAVNAALEAAPGIELVEA---GDYPTPVGDAVGQDVVYVGRVR 298
Query: 341 RDVSQDGNHGLDIFVCGDQVRKGAALNAVQIAEMLL 376
V L++++ D VRKGAALNAVQ+AE+L+
Sbjct: 299 AGVDDPCQ--LNLWLTSDNVRKGAALNAVQVAELLI 332
>gnl|CDD|181205 PRK08040, PRK08040, putative semialdehyde dehydrogenase;
Provisional.
Length = 336
Score = 196 bits (499), Expect = 8e-60
Identities = 112/343 (32%), Positives = 185/343 (53%), Gaps = 24/343 (6%)
Query: 41 SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDSFD 100
++A++G TGAVG+ L +L++R FP + LAS+ SAG+ L F K+ TV++ E +
Sbjct: 6 NIALLGATGAVGEALLELLAERQFPVGELYALASEESAGETLRFGGKSVTVQDAAEFDWS 65
Query: 101 GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKVGM 160
+A F AG S + A G +V+D+S F + +VPLV+PEVNP ++ +
Sbjct: 66 QAQLAFFVAGREASAAYAEEATNAGCLVIDSSGLFALEPDVPLVVPEVNPFVLADYR--- 122
Query: 161 GKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLE 220
+IA + T L A PL +A ++R+ V+ +AS G AA++ L Q+ ++L
Sbjct: 123 -NRNIIAVADSLTSQLLTAIKPLIDQAGLSRLHVTNLLSASAHGKAAVDALAGQSAKLLN 181
Query: 221 GKPPTCKIFSQQYAFNLFSHNAPVL--ENGYNEEEMKMVKETRKIWNDKDVRVTATCIRV 278
G P F +Q AFN+ P+L G EE ++V + RKI D+ + ++ +C++
Sbjct: 182 GIPIEEGFFGRQLAFNML----PLLPDSEGSVREERRLVDQVRKILQDEGLPISVSCVQS 237
Query: 279 PVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDDRASNHFPTPL-EVSNKDDVAVG 337
PV HA+ V+ + +PL + ARD L+ +V+ ++ N +PT + + S +++G
Sbjct: 238 PVFYGHAQMVHFEALRPLAAEEARDALEQGEDIVLSEE---NDYPTQVGDASGNPHLSIG 294
Query: 338 RIRRDVSQDGNHG----LDIFVCGDQVRKGAALNAVQIAEMLL 376
+R D +G L + D VR G AL AV+ AE L+
Sbjct: 295 CVRND------YGMPEQLQFWSVADNVRFGGALMAVKTAEKLV 331
>gnl|CDD|236329 PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed.
Length = 349
Score = 188 bits (480), Expect = 1e-56
Identities = 116/389 (29%), Positives = 175/389 (44%), Gaps = 106/389 (27%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSIKML-ASKRSAGKQL--------------SFQD 86
V ++G TG VGQ F+ +L++ P+ + L AS+RSAGK D
Sbjct: 6 VGILGATGMVGQRFVQLLAN--HPWFEVTALAASERSAGKTYGEAVRWQLDGPIPEEVAD 63
Query: 87 KAYTVEELTEDSFDGVDIALFSA-----GGSISKKFGPIAVEKGSIVVDNSSAFRMVENV 141
V ++ D VDI +FSA G + ++F + G V N+SA RM +V
Sbjct: 64 --MEVVSTDPEAVDDVDI-VFSALPSDVAGEVEEEF----AKAGKPVFSNASAHRMDPDV 116
Query: 142 PLVIPEVNPEAMSGIKV----GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTY 197
PLVIPEVNPE + I+V G ++ NPNCSTI ++A PL + R+ V+T
Sbjct: 117 PLVIPEVNPEHLELIEVQRKRRGWDGFIVTNPNCSTIGLVLALKPL-MDFGIERVHVTTM 175
Query: 198 QAASGAG---AAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEM 254
QA SGAG +M+ ++ N+ + EEE
Sbjct: 176 QAISGAGYPGVPSMDIVD-----------------------NVIPY-------IGGEEE- 204
Query: 255 KMVKETRKIW--------NDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILK 306
K+ KET KI D ++ATC RVPV+ H E+V ++F++ +D + R+ L+
Sbjct: 205 KIEKETLKILGKFEGGKIVPADFPISATCHRVPVIDGHTEAVFVKFKEDVDPEEIREALE 264
Query: 307 NAPGVVVIDDRASNHFPT----PLEVSNKDD---------------VAVGRIRRDVSQDG 347
+ G+ P+ P+ + + D V+VGR+R +DG
Sbjct: 265 SFKGLPQELG-----LPSAPKKPIILFEEPDRPQPRLDRDAGDGMAVSVGRLR----EDG 315
Query: 348 NHGLDIFVC-GDQVRKGAALNAVQIAEML 375
+ FV G +GAA +V AE+L
Sbjct: 316 IFDIK-FVVLGHNTVRGAAGASVLNAELL 343
>gnl|CDD|233220 TIGR00978, asd_EA, aspartate-semialdehyde dehydrogenase
(non-peptidoglycan organisms). Two closely related
families of aspartate-semialdehyde dehydrogenase are
found. They differ by a deep split in phylogenetic and
percent identity trees and in gap patterns. Separate
models are built for the two types in order to exclude
the USG-1 protein, found in several species, which is
specifically related to the Bacillus subtilis type of
aspartate-semialdehyde dehydrogenase. Members of this
type are found primarily in organisms that lack
peptidoglycan [Amino acid biosynthesis, Aspartate
family].
Length = 341
Score = 156 bits (396), Expect = 2e-44
Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 87/380 (22%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSI-KMLASKRSAGKQL--------------SFQD 86
VAV+G TG VGQ+F+ +L+ PY + K++AS RSAGK+ +D
Sbjct: 3 VAVLGATGLVGQKFVKLLAKH--PYFELAKVVASPRSAGKRYGEAVKWIEPGDMPEYVRD 60
Query: 87 KAYTVEELTEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIP 146
+ E + D VDI + ++++ P E G V N+S RM +VPL+IP
Sbjct: 61 --LPIVEPEPIAEDDVDIVFSALPSEVAEEVEPKLAEAGKPVFSNASNHRMDPDVPLIIP 118
Query: 147 EVNPEAMSGIKV---GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGA 203
EVN + + +KV KG ++ NPNC+T +A PL + ++ V+T QA SGA
Sbjct: 119 EVNSDHLELLKVQKERGWKGFIVTNPNCTTAGLTLALKPLIDAFGIKKVHVTTMQAVSGA 178
Query: 204 GAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKETRKI 263
G + +++ N+ H EE K+ +ETRKI
Sbjct: 179 GYPGVPSMDILD--------------------NIIPH--------IGGEEEKIERETRKI 210
Query: 264 W----NDK----DVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKN-------- 307
N K V+AT RVPV+ H ESV+++F+K D + R+ LK+
Sbjct: 211 LGKLENGKIEPAPFSVSATTTRVPVLDGHTESVHVEFDKKFDIEEIREALKSFRGLPQKL 270
Query: 308 ----APGVVVI----DDRASNHFPTP-LEVSNKDD--VAVGRIRRDVSQDGNHGLDIFVC 356
AP +I +DR P P L+ V VGR+R + L V
Sbjct: 271 GLPSAPEKPIIVRDEEDR-----PQPRLDRDAGGGMAVTVGRLREE-----GGSLKYVVL 320
Query: 357 GDQVRKGAALNAVQIAEMLL 376
G + +GAA + AE+
Sbjct: 321 GHNLVRGAAGATLLNAELAY 340
>gnl|CDD|201603 pfam01118, Semialdhyde_dh, Semialdehyde dehydrogenase, NAD binding
domain. This Pfam entry contains the following members:
N-acetyl-glutamine semialdehyde dehydrogenase (AgrC)
Aspartate-semialdehyde dehydrogenase.
Length = 121
Score = 132 bits (334), Expect = 6e-38
Identities = 47/121 (38%), Positives = 70/121 (57%), Gaps = 7/121 (5%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKA------YTVEELT 95
VA+VG TG VGQE L +L P + ++AS RSAGK+++F +E++
Sbjct: 2 VAIVGATGYVGQELLRLL-AEHPPLELVALVASSRSAGKKVAFAYPILEGGKDLLLEDVD 60
Query: 96 EDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSG 155
+ VDI + +SK+ P +E G++V+D SSAFRM ++VP V+PEVN EA+
Sbjct: 61 PEDLKDVDIVFLALPAGVSKELAPKLLEAGAVVIDLSSAFRMDDDVPYVLPEVNREAIKK 120
Query: 156 I 156
Sbjct: 121 A 121
>gnl|CDD|217222 pfam02774, Semialdhyde_dhC, Semialdehyde dehydrogenase,
dimerisation domain. This Pfam entry contains the
following members: N-acetyl-glutamine semialdehyde
dehydrogenase (AgrC) Aspartate-semialdehyde
dehydrogenase.
Length = 167
Score = 126 bits (319), Expect = 3e-35
Identities = 67/186 (36%), Positives = 92/186 (49%), Gaps = 26/186 (13%)
Query: 182 PLH-RRAKVTRMVVSTYQAASGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSH 240
PL + R++V TYQA SGAG A +F A NL +
Sbjct: 3 PLRDALGGLERVIVDTYQAVSGAGKKAKPG-----------------VFGAPIADNLIPY 45
Query: 241 NAPVLENGYNE--EEMKMVKETRKIWNDKDVRVTATCIRVPVMRAHAESVNLQFE-KPLD 297
NG E EE+KMV ET+KI V+ATC+RVPV R H+E+V ++ + KP+D
Sbjct: 46 IDGEEHNGTPETREELKMVNETKKILGFTPK-VSATCVRVPVFRGHSETVTVKLKLKPID 104
Query: 298 EDTARDILKNAPGVVVIDDRASNHFPTPLEVSNK-DDVAVGRIRRDVSQDGNHGLDIFVC 356
+ + APGV V+ R +PTP V + V VGR+R+D DG+ GL +
Sbjct: 105 VEEVYEAFYAAPGVFVV-VRPEEDYPTPRAVRGGTNFVYVGRVRKDP--DGDRGLKLVSV 161
Query: 357 GDQVRK 362
D +RK
Sbjct: 162 IDNLRK 167
>gnl|CDD|214863 smart00859, Semialdhyde_dh, Semialdehyde dehydrogenase, NAD binding
domain. The semialdehyde dehydrogenase family is found
in N-acetyl-glutamine semialdehyde dehydrogenase (AgrC),
which is involved in arginine biosynthesis, and
aspartate-semialdehyde dehydrogenase, an enzyme involved
in the biosynthesis of various amino acids from
aspartate. This family is also found in yeast and fungal
Arg5,6 protein, which is cleaved into the enzymes
N-acety-gamma-glutamyl-phosphate reductase and
acetylglutamate kinase. These are also involved in
arginine biosynthesis. All proteins in this entry
contain a NAD binding region of semialdehyde
dehydrogenase.
Length = 123
Score = 118 bits (297), Expect = 1e-32
Identities = 52/125 (41%), Positives = 70/125 (56%), Gaps = 11/125 (8%)
Query: 41 SVAVVGVTGAVGQEFLSVLSD-RDFPYRSIKMLASKRSAGKQLSF---QDKAYTVEELTE 96
VA+VG TG VGQE L +L++ DF ++ AS RSAGK++S K V EL
Sbjct: 1 KVAIVGATGYVGQELLRLLAEHPDFELTAL--AASSRSAGKKVSEAGPHLKGEVVLELDP 58
Query: 97 DSFD--GVDIALFSAGGSISK---KFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPE 151
F+ VDI + +SK P A G++V+D SSAFRM ++VP +PEVNPE
Sbjct: 59 PDFEELAVDIVFLALPHGVSKESAPLLPRAAAAGAVVIDLSSAFRMDDDVPYGLPEVNPE 118
Query: 152 AMSGI 156
A+
Sbjct: 119 AIKKA 123
>gnl|CDD|130806 TIGR01745, asd_gamma, aspartate-semialdehyde dehydrogenase,
gamma-proteobacterial. [Amino acid biosynthesis,
Aspartate family].
Length = 366
Score = 102 bits (255), Expect = 2e-24
Identities = 95/358 (26%), Positives = 152/358 (42%), Gaps = 39/358 (10%)
Query: 42 VAVVGVTGAVGQEFLS-VLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTE-DSF 99
V +VG G VG + + +RDF ++ + SF T+++ + D+
Sbjct: 3 VGLVGWRGMVGSVLMQRMQEERDFDAIRPVFFSTSQLGQAAPSFGGTTGTLQDAFDIDAL 62
Query: 100 DGVDIALFSAGGSISKKFGPIAVEKG--SIVVDNSSAFRMVENVPLVIPEVNPEAMSGIK 157
+DI + GG + + P E G +D +S+ RM ++ +++ VN + I
Sbjct: 63 KALDIIITCQGGDYTNEIYPKLRESGWQGYWIDAASSLRMKDDAVIILDPVNQDV---IT 119
Query: 158 VGMGKGA-LIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAGAAAMEELELQT- 215
G+ G NC+ + LM+ L V + V+TYQAASG GA M EL Q
Sbjct: 120 DGLNNGIRTFVGGNCTVSLMLMSLGGLFANDLVEWVSVATYQAASGGGARHMRELLTQMG 179
Query: 216 ---REVLE--GKPPTC------KIFSQQYAFNLFSHNAPV-------------LENGYNE 251
V + P + K+ + L N V L+NG +
Sbjct: 180 HLYGHVEDELATPSSAILDIERKVTKLTRSGELPVDNFGVPLAGSLIPWIDKQLDNGQSR 239
Query: 252 EEMKMVKETRKIWNDKDV-RVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNA-P 309
EE K ET KI V C+R+ +R H+++ ++ +K + +T +I++ P
Sbjct: 240 EEWKGQAETNKILGTSSTIPVDGLCVRIGALRCHSQAFTIKLKKDVSLETIEEIIRAHNP 299
Query: 310 GV-VVIDDRASNHFP-TPLEVSNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAA 365
V VV +DR TP V+ + VGR+R+ G L F GDQ+ GAA
Sbjct: 300 WVKVVPNDREITMRELTPAAVTGTLTIPVGRLRK--LNMGPEYLSAFTVGDQLLWGAA 355
>gnl|CDD|235839 PRK06598, PRK06598, aspartate-semialdehyde dehydrogenase; Reviewed.
Length = 369
Score = 101 bits (255), Expect = 3e-24
Identities = 110/376 (29%), Positives = 156/376 (41%), Gaps = 74/376 (19%)
Query: 42 VAVVGVTGAVGQEFLSVLSDR-----DF----PY---RSIKMLASKRSAGKQLSFQDKAY 89
V VG G VG SVL R DF P S A+ GK+ + QD A+
Sbjct: 4 VGFVGWRGMVG----SVLMQRMVEENDFDLIEPVFFSTSQAGGAAPSFGGKEGTLQD-AF 58
Query: 90 TVEELTEDSFDGVDIALFSAGGSISKKFGPIAVEKG--SIVVDNSSAFRMVENVPLVIPE 147
++ L + +DI + GG + + P G +D +S RM ++ +++
Sbjct: 59 DIDALKK-----LDIIITCQGGDYTNEVYPKLRAAGWQGYWIDAASTLRMKDDAIIILDP 113
Query: 148 VNPEAMSGIKVGMGKGALIAN---PNCSTIICLMAATPLHRRAKVTRMVVSTYQAASGAG 204
VN I + G + NC+ + LMA L + V + V TYQAASGAG
Sbjct: 114 VN---RDVIDDALANG--VKTFVGGNCTVSLMLMALGGLFKNDLVEWVSVMTYQAASGAG 168
Query: 205 AAAMEEL-------------ELQT---------REVLE----GKPPTCKIFSQQYAFNLF 238
A M EL EL R+V E G PT F A +L
Sbjct: 169 ARNMRELLTQMGALHGAVADELADPASAILDIDRKVTELMRSGDLPT-DNFGVPLAGSLI 227
Query: 239 SHNAP----VLENGYNEEEMKMVKETRKIWN--DKDVRVTATCIRVPVMRAHAESVNLQF 292
P L NG + EE K ET KI + V C+RV MR H++++ ++
Sbjct: 228 ----PWIDKDLGNGQSREEWKGQAETNKILGLTKNPIPVDGLCVRVGAMRCHSQALTIKL 283
Query: 293 EKPLDEDTARDILKNA-PGVVVI-DDR-ASNHFPTPLEVSNKDDVAVGRIRRDVSQDGNH 349
+K + +IL P V V+ +DR A+ TP V+ + VGR+R+ G
Sbjct: 284 KKDVPLAEIEEILAAHNPWVKVVPNDREATMRELTPAAVTGTLTIPVGRLRK--LNMGPE 341
Query: 350 GLDIFVCGDQVRKGAA 365
L F GDQ+ GAA
Sbjct: 342 YLSAFTVGDQLLWGAA 357
>gnl|CDD|223081 COG0002, ArgC, Acetylglutamate semialdehyde dehydrogenase [Amino
acid transport and metabolism].
Length = 349
Score = 65.0 bits (159), Expect = 1e-11
Identities = 76/365 (20%), Positives = 127/365 (34%), Gaps = 71/365 (19%)
Query: 39 APSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLS--------FQDKAYT 90
V +VG +G G E L +L+ P + +++S+ AGK +S D +
Sbjct: 2 MIKVGIVGASGYTGLELLRLLAGH--PDVELILISSRERAGKPVSDVHPNLRGLVDLPFQ 59
Query: 91 VEELTEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVE--------NVP 142
+ + D D+ + +S + P +E G V+D S+ FR+ +
Sbjct: 60 TIDPEKIELDECDVVFLALPHGVSAELVPELLEAGCKVIDLSADFRLKDPEVYEKWYGFT 119
Query: 143 LVIPEVNPEAMSGI------KVGMGKGA-LIANPNCSTIICLMAATPLHRRAKVTRM-VV 194
PE+ +A+ G+ K+ +GA LIANP C ++A PL +A +
Sbjct: 120 HAGPELLEDAVYGLPELHREKI---RGAKLIANPGCYPTAAILALAPL-VKAGLLDPDSP 175
Query: 195 STYQAASGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEM 254
A SG A G+ + N F L Y
Sbjct: 176 PIVDAKSGVSGA--------------GRKAS--------VKNHFPEVNDSL-RPYGLTGH 212
Query: 255 KMVKETR---KIWNDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDIL----KN 307
+ E + V V T P +R ++ L+ + + +
Sbjct: 213 RHTPEIEQHLGRLAGRKVPVIFTPHLGPFVRGILATIYLKLKDLVTLEELHAAYEEFYAG 272
Query: 308 APGVVVIDDRASNHFPTPLEV--SNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGAA 365
P V V+ + +P V SN D+ V + + + D + KGAA
Sbjct: 273 EPFVRVVPEGG---YPDTKAVAGSNFCDIGFA-----VDERTGR-VVVVSAIDNLVKGAA 323
Query: 366 LNAVQ 370
AVQ
Sbjct: 324 GQAVQ 328
>gnl|CDD|234761 PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase;
Validated.
Length = 343
Score = 58.2 bits (142), Expect = 2e-09
Identities = 52/198 (26%), Positives = 78/198 (39%), Gaps = 44/198 (22%)
Query: 41 SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLS------FQDKAYTVEEL 94
V +VG +G G E L +L + P I + S+ SAGK LS +E L
Sbjct: 4 KVGIVGASGYTGGELLRLLLNH--PEVEIVAVTSRSSAGKPLSDVHPHLRGLVDLVLEPL 61
Query: 95 TEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMV-------------ENV 141
+ G D+ + +S P +E G V+D S+ FR+
Sbjct: 62 DPEILAGADVVFLALPHGVSMDLAPQLLEAGVKVIDLSADFRLKDPEVYEKWYGFEHAAP 121
Query: 142 PLV------IPEVNPEAMSGIKVGMGKGA-LIANPNC---STIICLMAATPL--HRRAKV 189
L+ +PE+N E + KGA LIANP C ++++ L PL
Sbjct: 122 ELLKEAVYGLPELNREEI--------KGARLIANPGCYPTASLLALA---PLLKAGLIDP 170
Query: 190 TRMVVSTYQAASGAGAAA 207
+++ SGAG A
Sbjct: 171 DSIIIDAKSGVSGAGRKA 188
>gnl|CDD|233597 TIGR01850, argC, N-acetyl-gamma-glutamyl-phosphate reductase,
common form. This model represents the more common of
two related families of
N-acetyl-gamma-glutamyl-phosphate reductase, an enzyme
catalyzing the third step or Arg biosynthesis from Glu.
The two families differ by phylogeny, similarity
clustering, and the gap architecture in a multiple
sequence alignment. Bacterial members of this family
tend to be found within Arg biosynthesis operons [Amino
acid biosynthesis, Glutamate family].
Length = 346
Score = 56.4 bits (137), Expect = 7e-09
Identities = 85/366 (23%), Positives = 137/366 (37%), Gaps = 80/366 (21%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSIKML-ASKRSAGKQLSF---QDKAYTVEELTED 97
VA+VG +G G E L +L + P I L +S+ SAGK +S + L
Sbjct: 3 VAIVGASGYTGGELLRLLLNH--PEVEITYLVSSRESAGKPVSEVHPHLRGLVDLNLEPI 60
Query: 98 SFD----GVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRM---------------- 137
+ D+ + +S + P + G V+D S+ FR+
Sbjct: 61 DVEEILEDADVVFLALPHGVSAELAPELLAAGVKVIDLSADFRLKDPELYEKWYGFEHAG 120
Query: 138 ---VENVPLVIPEVNPEAMSGIKVGMGKGA-LIANPNCSTIICLMAATPL--HRRAKVTR 191
++ +PE++ E + KGA LIANP C L+A PL T
Sbjct: 121 PELLQKAVYGLPELHREEI--------KGARLIANPGCYPTATLLALAPLLKEGLIDPTS 172
Query: 192 MVVSTYQAASGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSH-NAPVLENGYN 250
++V SGAG A E EV E P + + H + P
Sbjct: 173 IIVDAKSGVSGAGRKASEANHFP--EVNENLRP----------YKVTGHRHTP------- 213
Query: 251 EEEMKMVKETRKIWNDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDIL----K 306
++ +E + V+V+ T VP+ R ++ + + L E+ R +
Sbjct: 214 ----EIEQELGR-LAGGKVKVSFTPHLVPMTRGILATIYAKLKDGLTEEDLRALYEEFYA 268
Query: 307 NAPGVVVIDDRASNHFPTPLEV--SNKDDVAVGRIRRDVSQDGNHGLDIFVCGDQVRKGA 364
+ P V V+ + +P+ V SN D+ V + + + D + KGA
Sbjct: 269 DEPFVRVLPEGG---YPSTKAVIGSNFCDIGF-----AVDERTGR-VVVVSAIDNLVKGA 319
Query: 365 ALNAVQ 370
A AVQ
Sbjct: 320 AGQAVQ 325
>gnl|CDD|235883 PRK06901, PRK06901, aspartate-semialdehyde dehydrogenase;
Provisional.
Length = 322
Score = 55.1 bits (133), Expect = 2e-08
Identities = 57/219 (26%), Positives = 101/219 (46%), Gaps = 21/219 (9%)
Query: 82 LSFQDKAYTVEELTEDSFDGVDIA-LFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVEN 140
+ F +KA VE++ + + D +F AG + A E G IV+D + N
Sbjct: 46 IRFNNKA--VEQIAPEEVEWADFNYVFFAGKMAQAEHLAQAAEAGCIVIDLYGICAALAN 103
Query: 141 VPLVIPEVNPEAMSGIKVGMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAA 200
VP+V+P VN E ++ ++ + +++ P+ +A P + ++++ V++ A
Sbjct: 104 VPVVVPSVNDEQLAELR----QRNIVSLPDPQVSQLALALAPFLQEQPLSQIFVTSLLPA 159
Query: 201 SGAGAAAMEELELQTREVLEGKPPTCKIFSQQYAFNLFSHNAPVLENGYNEEEMKMVKET 260
S A +++L QT +L G P + Q+ AF++F NA LE +
Sbjct: 160 SYTDAETVKKLAGQTARLLNGIPLDEE--EQRLAFDVFPANAQNLEL-----------QL 206
Query: 261 RKIWNDKDVRVTATCIRVPVMRAHAESVNLQFEKPLDED 299
+KI+ + VT I+VPV A+ V E LD +
Sbjct: 207 QKIFPQLE-NVTFHSIQVPVFYGLAQMVTALSEYELDIE 244
>gnl|CDD|215522 PLN02968, PLN02968, Probable N-acetyl-gamma-glutamyl-phosphate
reductase.
Length = 381
Score = 42.5 bits (100), Expect = 2e-04
Identities = 52/236 (22%), Positives = 84/236 (35%), Gaps = 41/236 (17%)
Query: 2 ATFSSHQTQTHFISKLPANKPRTKPMFTRVRMSYQESAPSVAVVGVTGAVGQEFLSVLSD 61
S + + + E + V+G +G G E +L++
Sbjct: 3 FADSVGSKGLAS--RASVTSSPQSVVSSASSSVKSEEKKRIFVLGASGYTGAEVRRLLAN 60
Query: 62 RDFPYRSIKMLASKRSAGKQ-------LSFQDKAYTVEELTEDSFDGVDIALF-----SA 109
P I ++ + R AG+ L QD V D F VD A+F
Sbjct: 61 H--PDFEITVMTADRKAGQSFGSVFPHLITQDLPNLVAVKDAD-FSDVD-AVFCCLPHGT 116
Query: 110 GGSISKKFGPIAVEKGSIVVDNSSAFRMVE--------NVPLVIPEVNPEAMSGI----K 157
I K A+ K +VD S+ FR+ + P PE+ EA+ G+ +
Sbjct: 117 TQEIIK-----ALPKDLKIVDLSADFRLRDIAEYEEWYGHPHRAPELQKEAVYGLTELQR 171
Query: 158 VGMGKGALIANPNC--STIICLMAATPLHRRA--KVTRMVVSTYQAASGAGAAAME 209
+ L+ANP C + I + PL + + +++ SGAG A E
Sbjct: 172 EEIKSARLVANPGCYPTGIQLPL--VPLVKAGLIEPDNIIIDAKSGVSGAGRGAKE 225
>gnl|CDD|179786 PRK04207, PRK04207, glyceraldehyde-3-phosphate dehydrogenase;
Provisional.
Length = 341
Score = 35.6 bits (83), Expect = 0.033
Identities = 13/50 (26%), Positives = 29/50 (58%)
Query: 267 KDVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVIDD 316
D+ +T ++VP H SVN++ +KP+ ++ + L+N P ++++
Sbjct: 203 PDLDITTMAVKVPTTLMHMHSVNVELKKPVTKEEVLEALENTPRILLVRA 252
>gnl|CDD|233598 TIGR01851, argC_other, N-acetyl-gamma-glutamyl-phosphate reductase,
uncommon form. This model represents the less common of
two related families of
N-acetyl-gamma-glutamyl-phosphate reductase, an enzyme
catalyzing the third step or Arg biosynthesis from Glu.
The two families differ by phylogeny, similarity
clustering, and gap architecture in a multiple sequence
alignment [Amino acid biosynthesis, Glutamate family].
Length = 310
Score = 33.7 bits (77), Expect = 0.13
Identities = 38/172 (22%), Positives = 63/172 (36%), Gaps = 20/172 (11%)
Query: 39 APSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVEELTEDS 98
AP V + G G G + LS RD + + +R +D A + L
Sbjct: 1 APKVFIDGEAGTTGLQIRERLSGRD-DIELLSIAPDRR--------KDAAERAKLL---- 47
Query: 99 FDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMSGIKV 158
+ D+A+ +++ + + ++D S+A+R ++ PE+ P I+
Sbjct: 48 -NAADVAILCLPDDAAREAVSLVDNPNTCIIDASTAYRTADDWAYGFPELAPGQREKIR- 105
Query: 159 GMGKGALIANPNCSTIICLMAATPLHRRAKVTRMVVSTYQAASG--AGAAAM 208
IANP C + PL + T A SG G AM
Sbjct: 106 ---NSKRIANPGCYPTGFIALMRPLVEAGILPADFPITINAVSGYSGGGKAM 154
>gnl|CDD|238769 cd01492, Aos1_SUMO, Ubiquitin activating enzyme (E1) subunit Aos1.
Aos1 is part of the heterodimeric activating enzyme
(E1), specific for the SUMO family of ubiquitin-like
proteins (Ubls). E1 enzymes are part of a conjugation
cascade to attach Ub or Ubls, covalently to substrate
proteins consisting of activating (E1), conjugating
(E2), and/or ligating (E3) enzymes. E1 activates
ubiquitin by C-terminal adenylation, and subsequently
forms a highly reactive thioester bond between its
catalytic cysteine and Ubls C-terminus. The E1 also
associates with E2 and promotes ubiquitin transfer to
the E2's catalytic cysteine. Post-translational
modification by SUMO family of ubiquitin-like proteins
(Ublps) is involved in cell division, nuclear transport,
the stress response and signal transduction. Aos1
contains part of the adenylation domain.
Length = 197
Score = 33.0 bits (76), Expect = 0.13
Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
Query: 39 APSVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAG 79
AP AVVG G + Q+ ++ LS R+ P + + + S
Sbjct: 156 APVAAVVG--GILAQDVINALSKRESPLNNFFVFDGETSEA 194
>gnl|CDD|187554 cd05243, SDR_a5, atypical (a) SDRs, subgroup 5. This subgroup
contains atypical SDRs, some of which are identified as
putative NAD(P)-dependent epimerases, one as a putative
NAD-dependent epimerase/dehydratase. Atypical SDRs are
distinct from classical SDRs. Members of this subgroup
have a glycine-rich NAD(P)-binding motif that is very
similar to the extended SDRs, GXXGXXG, and binds NADP.
Generally, this subgroup has poor conservation of the
active site tetrad; however, individual sequences do
contain matches to the YXXXK active site motif, the
upstream Ser, and there is a highly conserved Asp in
place of the usual active site Asn throughout the
subgroup. Atypical SDRs generally lack the catalytic
residues characteristic of the SDRs, and their
glycine-rich NAD(P)-binding motif is often different
from the forms normally seen in classical or extended
SDRs. Atypical SDRs include biliverdin IX beta reductase
(BVR-B,aka flavin reductase), NMRa (a negative
transcriptional regulator of various fungi),
progesterone 5-beta-reductase like proteins,
phenylcoumaran benzylic ether and
pinoresinol-lariciresinol reductases, phenylpropene
synthases, eugenol synthase, triphenylmethane reductase,
isoflavone reductases, and others. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. In addition to the
Rossmann fold core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids, and typically have a
TGXXGXXG cofactor binding motif. Complex (multidomain)
SDRs such as ketoreductase domains of fatty acid
synthase have a GGXGXXG NAD(P)-binding motif and an
altered active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 203
Score = 32.6 bits (75), Expect = 0.21
Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 10/88 (11%)
Query: 41 SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTVE-ELTEDS- 98
V VVG TG VG+ + L DR + R+ L S ++L A V +LT+
Sbjct: 1 KVLVVGATGKVGRHVVRELLDRGYQVRA---LVRDPSQAEKL-EAAGAEVVVGDLTDAES 56
Query: 99 ----FDGVDIALFSAGGSISKKFGPIAV 122
+G+D + +AG AV
Sbjct: 57 LAAALEGIDAVISAAGSGGKGGPRTEAV 84
>gnl|CDD|130609 TIGR01546, GAPDH-II_archae, glyceraldehyde-3-phosphate
dehydrogenase, type II. This model describes the type
II glyceraldehyde-3-phosphate dehydrogenases which are
limited to archaea. These enzymes catalyze the
interconversion of 1,3-diphosphoglycerate and
glyceraldehyde-3-phosphate, a central step in glycolysis
and gluconeogenesis. In archaea, either NAD or NADP may
be utilized as the cofactor. The class I GAPDH's from
bacteria and eukaryotes are covered by TIGR01534. All of
the members of the seed are characterized. See, for
instance. This model is very solid, there are no species
falling between trusted and noise at this time. The
closest relatives scoring in the noise are the class I
GAPDH's.
Length = 333
Score = 32.9 bits (75), Expect = 0.21
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 268 DVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGVVVID 315
++ + VP H S+ ++ +KP+ +D DIL+N P V++ +
Sbjct: 201 NLNIETMAFVVPTTLMHVHSIMVELKKPVTKDDIIDILENTPRVLLFE 248
>gnl|CDD|181721 PRK09242, PRK09242, tropinone reductase; Provisional.
Length = 257
Score = 31.3 bits (71), Expect = 0.62
Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 13/87 (14%)
Query: 43 AVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAY------------- 89
A G+ A+ +EFL + +D R LA R + + + +
Sbjct: 17 ASKGIGLAIAREFLGLGADVLIVARDADALAQARDELAEEFPEREVHGLAADVSDDEDRR 76
Query: 90 TVEELTEDSFDGVDIALFSAGGSISKK 116
+ + ED +DG+ I + +AGG+I K
Sbjct: 77 AILDWVEDHWDGLHILVNNAGGNIRKA 103
>gnl|CDD|130833 TIGR01772, MDH_euk_gproteo, malate dehydrogenase, NAD-dependent.
This model represents the NAD-dependent malate
dehydrogenase found in eukaryotes and certain gamma
proteobacteria. The enzyme is involved in the citric
acid cycle as well as the glyoxalate cycle. Several
isoforms exidt in eukaryotes. In S. cereviseae, for
example, there are cytoplasmic, mitochondrial and
peroxisomal forms. Although malate dehydrogenases have
in some cases been mistaken for lactate dehydrogenases
due to the similarity of these two substrates and the
apparent ease with which evolution can toggle these
activities, critical residues have been identified which
can discriminate between the two activities. At the time
of the creation of this model no hits above the trusted
cutoff contained critical residues typical of lactate
dehydrogenases [Energy metabolism, TCA cycle].
Length = 312
Score = 31.2 bits (71), Expect = 0.87
Identities = 35/140 (25%), Positives = 61/140 (43%), Gaps = 25/140 (17%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAG--KQLSFQD-----KAYTVEEL 94
VAV+G G +GQ LS+L + + +AG LS K ++ EE
Sbjct: 2 VAVLGAAGGIGQP-LSLLLKLQPYVSELSLYDIAGAAGVAADLSHIPTAASVKGFSGEEG 60
Query: 95 TEDSFDGVDIALFSAGGSISKKFGPIAVEKGSIVVDNSSAFRMVENVPLVIPEVNPEAMS 154
E++ G D+ + AG + +K G + + + N+ +V+++ + E P+AM
Sbjct: 61 LENALKGADVVVIPAG--VPRKPG---MTRDDLFNVNAG---IVKDLVAAVAESCPKAMI 112
Query: 155 GIKVGMGKGALIANPNCSTI 174
+ I NP ST+
Sbjct: 113 LV---------ITNPVNSTV 123
>gnl|CDD|240360 PTZ00325, PTZ00325, malate dehydrogenase; Provisional.
Length = 321
Score = 31.2 bits (71), Expect = 0.94
Identities = 25/77 (32%), Positives = 35/77 (45%), Gaps = 10/77 (12%)
Query: 42 VAVVGVTGAVGQEFLSVLSDRDFPYRS-IKMLASKRSAG--KQLSFQD-----KAYTVEE 93
VAV+G G +GQ LS+L + P+ S + + + G LS D Y E
Sbjct: 11 VAVLGAAGGIGQP-LSLLL-KQNPHVSELSLYDIVGAPGVAADLSHIDTPAKVTGYADGE 68
Query: 94 LTEDSFDGVDIALFSAG 110
L E + G D+ L AG
Sbjct: 69 LWEKALRGADLVLICAG 85
>gnl|CDD|217227 pfam02780, Transketolase_C, Transketolase, C-terminal domain. The
C-terminal domain of transketolase has been proposed as
a regulatory molecule binding site.
Length = 124
Score = 29.1 bits (66), Expect = 1.6
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 285 AESVNLQFEKPLDEDTARDILKNAPGVVVIDD 316
AE ++L+ KPLDEDT + +K +VV+++
Sbjct: 38 AEVIDLRTVKPLDEDTILESVKKTGRLVVVEE 69
>gnl|CDD|205225 pfam13044, DUF3904, Protein of unknown function (DUF3904). This
family of proteins is functionally uncharacterized. This
family of proteins is found in viruses. Proteins in this
family are typically between 437 and 448 amino acids in
length.
Length = 436
Score = 30.0 bits (67), Expect = 2.3
Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 7/37 (18%)
Query: 331 KDDVAVGRIR-------RDVSQDGNHGLDIFVCGDQV 360
K D+ +G+I+ D+S+ G GL +F+CG
Sbjct: 107 KHDIVLGKIKVSEFGGDHDISKCGRKGLKVFICGGTT 143
>gnl|CDD|187569 cd05259, PCBER_SDR_a, phenylcoumaran benzylic ether reductase
(PCBER) like, atypical (a) SDRs. PCBER and
pinoresinol-lariciresinol reductases are NADPH-dependent
aromatic alcohol reductases, and are atypical members of
the SDR family. Other proteins in this subgroup are
identified as eugenol synthase. These proteins contain
an N-terminus characteristic of NAD(P)-binding proteins
and a small C-terminal domain presumed to be involved in
substrate binding, but they do not have the conserved
active site Tyr residue typically found in SDRs.
Numerous other members have unknown functions. The
glycine rich NADP-binding motif in this subgroup is of 2
forms: GXGXXG and G[GA]XGXXG; it tends to be atypical
compared with the forms generally seen in classical or
extended SDRs. The usual SDR active site tetrad is not
present, but a critical active site Lys at the usual SDR
position has been identified in various members, though
other charged and polar residues are found at this
position in this subgroup. Atypical SDR-related proteins
retain the Rossmann fold of the SDRs, but have limited
sequence identity and generally lack the catalytic
properties of the archetypical members. Atypical SDRs
include biliverdin IX beta reductase (BVR-B,aka flavin
reductase), NMRa (a negative transcriptional regulator
of various fungi), progesterone 5-beta-reductase like
proteins, phenylcoumaran benzylic ether and
pinoresinol-lariciresinol reductases, phenylpropene
synthases, eugenol synthase, triphenylmethane reductase,
isoflavone reductases, and others. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. In addition to the
Rossmann fold core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids, and typically have a
TGXXGXXG cofactor binding motif. Complex (multidomain)
SDRs such as ketoreductase domains of fatty acid
synthase have a GGXGXXG NAD(P)-binding motif and an
altered active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 282
Score = 29.6 bits (67), Expect = 2.5
Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 10/71 (14%)
Query: 41 SVAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKQLSFQDKAYTV-------EE 93
+A+ G TG +G +S L P ++ +L S V E
Sbjct: 1 KIAIAGATGTLGGPIVSALLAS--PGFTVTVLTRPSSTSSNEFQPSGVKVVPVDYASHES 58
Query: 94 LTEDSFDGVDI 104
L + GVD
Sbjct: 59 LV-AALKGVDA 68
>gnl|CDD|235699 PRK06110, PRK06110, hypothetical protein; Provisional.
Length = 322
Score = 29.2 bits (66), Expect = 3.9
Identities = 15/28 (53%), Positives = 17/28 (60%), Gaps = 1/28 (3%)
Query: 195 STYQAASGAGAAAMEELELQTREVLEGK 222
T+ A GAGAAA+ LQ RE L GK
Sbjct: 270 DTHNVAEGAGAAALAAA-LQERERLAGK 296
>gnl|CDD|235470 PRK05444, PRK05444, 1-deoxy-D-xylulose-5-phosphate synthase;
Provisional.
Length = 580
Score = 28.9 bits (66), Expect = 5.7
Identities = 9/29 (31%), Positives = 17/29 (58%)
Query: 288 VNLQFEKPLDEDTARDILKNAPGVVVIDD 316
V+ +F KPLDE+ ++ VV +++
Sbjct: 491 VDARFVKPLDEELLLELAAKHDLVVTVEE 519
>gnl|CDD|116361 pfam07747, MTH865, MTH865-like family. This domain has an
EF-hand like fold.
Length = 75
Score = 26.7 bits (59), Expect = 6.0
Identities = 9/22 (40%), Positives = 14/22 (63%)
Query: 54 EFLSVLSDRDFPYRSIKMLASK 75
E +L+D DFP++S + LA
Sbjct: 47 ELGKLLTDADFPFKSAEDLADT 68
>gnl|CDD|224076 COG1154, Dxs, Deoxyxylulose-5-phosphate synthase [Coenzyme
metabolism / Lipid metabolism].
Length = 627
Score = 28.7 bits (65), Expect = 6.0
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 288 VNLQFEKPLDEDTARDILKNAPGVVVIDD 316
V+ +F KPLDE ++ K+ VV +++
Sbjct: 533 VDPRFVKPLDEALLLELAKSHDLVVTLEE 561
>gnl|CDD|223135 COG0057, GapA, Glyceraldehyde-3-phosphate
dehydrogenase/erythrose-4-phosphate dehydrogenase
[Carbohydrate transport and metabolism].
Length = 335
Score = 28.3 bits (64), Expect = 6.5
Identities = 10/42 (23%), Positives = 17/42 (40%)
Query: 268 DVRVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAP 309
++T IRVP + ++ EK + + LK A
Sbjct: 224 KGKLTGMAIRVPTPNVSVVDLTVELEKEVTVEEINAALKAAS 265
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.131 0.371
Gapped
Lambda K H
0.267 0.0807 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,959,292
Number of extensions: 1840845
Number of successful extensions: 1797
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1735
Number of HSP's successfully gapped: 51
Length of query: 376
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 278
Effective length of database: 6,590,910
Effective search space: 1832272980
Effective search space used: 1832272980
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.7 bits)