RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8850
         (397 letters)



>gnl|CDD|184450 PRK14012, PRK14012, cysteine desulfurase; Provisional.
          Length = 404

 Score =  773 bits (1998), Expect = 0.0
 Identities = 276/399 (69%), Positives = 344/399 (86%), Gaps = 3/399 (0%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRV--QFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADP 58
           +DYSATTP+DPRV +KM+PYL +   FGNPASR+H +GW AE+A++ ARN++A LI ADP
Sbjct: 7   LDYSATTPVDPRVAEKMMPYLTMDGTFGNPASRSHRFGWQAEEAVDIARNQIADLIGADP 66

Query: 59  REIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKP 118
           REI++TSGATES+NLAIKGAAHFY+K+GKHIIT +TEHK+VLD+ R+LE +GF VTYL P
Sbjct: 67  REIVFTSGATESDNLAIKGAAHFYQKKGKHIITSKTEHKAVLDTCRQLEREGFEVTYLDP 126

Query: 119 KNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVG 178
           ++NG+I L +LE  +  DTILVS+M VNNEIGVIQ I  IGE+CR +G+IFH DAAQ+VG
Sbjct: 127 QSNGIIDLEKLEAAMRDDTILVSIMHVNNEIGVIQDIAAIGEICRERGIIFHVDAAQSVG 186

Query: 179 KIDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLAT 238
           K+ I++   KVDLMSFSAHKIYGPKGIGALY+RR PR R+EAQ+HGGGHE G+RSGTL T
Sbjct: 187 KVPIDLSKLKVDLMSFSAHKIYGPKGIGALYVRRKPRVRLEAQMHGGGHERGMRSGTLPT 246

Query: 239 HQIVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNISF 298
           HQIVGMGEA R+A+ EM  E +RI  L+ RL+NG+ + IEE+Y+NGD+  R+P NLN+SF
Sbjct: 247 HQIVGMGEAARIAKEEMATENERIRALRDRLWNGI-KDIEEVYLNGDLEQRVPGNLNVSF 305

Query: 299 NFIEGESLLMAIKDIAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFTIGRFTTEKE 358
           N++EGESL+MA+KD+AVSSGSACTSASLEPSYVLRALG NDELAHSSIRF++GRFTTE+E
Sbjct: 306 NYVEGESLIMALKDLAVSSGSACTSASLEPSYVLRALGLNDELAHSSIRFSLGRFTTEEE 365

Query: 359 IDFTIKLLKMKVKKLRELSPLWDMHKEGIDLNTIKWAIH 397
           ID+ I+L++  + KLRELSPLW+M KEG+DLN+I+WA H
Sbjct: 366 IDYAIELVRKSIGKLRELSPLWEMFKEGVDLNSIEWAHH 404


>gnl|CDD|131061 TIGR02006, IscS, cysteine desulfurase IscS.  This model represents
           IscS, one of several cysteine desulfurases from a larger
           protein family designated (misleadingly, in this case)
           class V aminotransferases. IscS is one of at least 6
           enzymes characteristic of the IscSUA-hscAB-fsx system of
           iron-sulfur cluster assembly. Scoring almost as well as
           proteobacterial sequences included in the model are
           mitochondrial cysteine desulfurases, apparently from an
           analogous system in eukaryotes. The sulfur, taken from
           cysteine, may be used in other systems as well, such as
           tRNA base modification and biosynthesis of other
           cofactors [Biosynthesis of cofactors, prosthetic groups,
           and carriers, Other, Protein synthesis, tRNA and rRNA
           base modification].
          Length = 402

 Score =  686 bits (1772), Expect = 0.0
 Identities = 279/397 (70%), Positives = 337/397 (84%), Gaps = 1/397 (0%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE 60
           +DY+ATTP+DPRV +KM+PYL  +FGNPASR+H +GW AE+A+E ARN+VA+LI AD RE
Sbjct: 7   LDYAATTPVDPRVAEKMMPYLTEKFGNPASRSHSFGWEAEEAVENARNQVAELIGADSRE 66

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKPKN 120
           I++TSGATESNNLAIKG AHFYK +G HIIT +TEHK+VLD+ R LE +GF VTYL PK+
Sbjct: 67  IVFTSGATESNNLAIKGIAHFYKSKGNHIITSKTEHKAVLDTCRYLEREGFEVTYLPPKS 126

Query: 121 NGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGKI 180
           NGLI L +L+  I  DTILVS+M VNNEIGVIQ I  IGE+CR + V FH DAAQ+VGKI
Sbjct: 127 NGLIDLEELKAAIRDDTILVSIMHVNNEIGVIQDIAAIGEICRERKVFFHVDAAQSVGKI 186

Query: 181 DINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLATHQ 240
            IN+   KVDLMS S HKIYGPKGIGALY+RR PR R+EA IHGGGHE G+RSGTL THQ
Sbjct: 187 PINVNELKVDLMSISGHKIYGPKGIGALYVRRKPRVRLEALIHGGGHERGMRSGTLPTHQ 246

Query: 241 IVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNISFNF 300
           IVGMGEAFR+A+ EM ++   +L L+ RL NG+ +SIEE+Y+NGD+ +R+P NLN+SFN+
Sbjct: 247 IVGMGEAFRIAKEEMAQDTAHVLALRDRLLNGI-KSIEEVYLNGDLEHRVPGNLNVSFNY 305

Query: 301 IEGESLLMAIKDIAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFTIGRFTTEKEID 360
           +EGESL+MA+KD+AVSSGSACTSASLEPSYVLRALG NDELAHSSIRFTIGRFTTE+EID
Sbjct: 306 VEGESLIMALKDLAVSSGSACTSASLEPSYVLRALGINDELAHSSIRFTIGRFTTEEEID 365

Query: 361 FTIKLLKMKVKKLRELSPLWDMHKEGIDLNTIKWAIH 397
           + +KL+K  + KLRELSPLW+M KEG+DLN+I+WA H
Sbjct: 366 YAVKLVKSAIDKLRELSPLWEMFKEGVDLNSIEWAAH 402


>gnl|CDD|224029 COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase
           and related enzymes [Amino acid transport and
           metabolism].
          Length = 386

 Score =  587 bits (1516), Expect = 0.0
 Identities = 224/385 (58%), Positives = 287/385 (74%), Gaps = 7/385 (1%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE 60
           +D +ATTP+DP V++ M+PYL   FGNP+S  H +G  A KA+E+AR ++A+L+ ADP E
Sbjct: 5   LDNAATTPVDPEVLEAMLPYLTEVFGNPSSL-HSFGREARKAVEEAREQIAKLLGADPEE 63

Query: 61  IIWTSGATESNNLAIKGAAHFY--KKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKP 118
           II+TSGATESNNLAIKGAA  Y   ++GKHIIT   EH +VL++ R LE  GF VTYL  
Sbjct: 64  IIFTSGATESNNLAIKGAALAYRNAQKGKHIITSAIEHPAVLNTCRYLERQGFEVTYLPV 123

Query: 119 KNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVG 178
            +NGL+ L QLEE +  DTILVS+M  NNE G IQPI EIGE+C+ +G++FH DA QAVG
Sbjct: 124 DSNGLVDLEQLEEALRPDTILVSIMHANNETGTIQPIAEIGEICKERGILFHVDAVQAVG 183

Query: 179 KIDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLAT 238
           KI I+++   VDL+SFSAHK  GPKGIGALY+R  P  R+E  IHGGG E GLRSGT   
Sbjct: 184 KIPIDLEELGVDLLSFSAHKFGGPKGIGALYVR--PGVRLEPLIHGGGQERGLRSGTENV 241

Query: 239 HQIVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNISF 298
             IVG G+A  +A  E++EE  R+  L+ RL +GL+E I ++Y+NGD   R+P+ LN SF
Sbjct: 242 PGIVGFGKAAEIAVEELEEENARLRKLRDRLEDGLLEIIPDVYLNGDDEPRLPNILNFSF 301

Query: 299 NFIEGESLLMA--IKDIAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFTIGRFTTE 356
             +EGESLL+A  +  IAVS+GSAC+S SLEPS+VLRA+G ++ELAH SIRF++GRFTTE
Sbjct: 302 PGVEGESLLLALDLAGIAVSTGSACSSGSLEPSHVLRAMGISEELAHGSIRFSLGRFTTE 361

Query: 357 KEIDFTIKLLKMKVKKLRELSPLWD 381
           +EID   + LK  +K+LRELSPLW 
Sbjct: 362 EEIDAAAEALKEIIKRLRELSPLWA 386


>gnl|CDD|178257 PLN02651, PLN02651, cysteine desulfurase.
          Length = 364

 Score =  553 bits (1426), Expect = 0.0
 Identities = 224/362 (61%), Positives = 273/362 (75%), Gaps = 2/362 (0%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE 60
           +D  ATTPIDPRV+D M+P+L   FGNP SR H+YGW +E A+EKAR +VA LI ADP+E
Sbjct: 3   LDMQATTPIDPRVLDAMLPFLIEHFGNPHSRTHLYGWESEDAVEKARAQVAALIGADPKE 62

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKPKN 120
           II+TSGATESNNLAIKG  HFYK + KH+IT QTEHK VLDS R L+ +GF VTYL  K+
Sbjct: 63  IIFTSGATESNNLAIKGVMHFYKDKKKHVITTQTEHKCVLDSCRHLQQEGFEVTYLPVKS 122

Query: 121 NGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGKI 180
           +GL+ L +L   I  DT LVSVM VNNEIGVIQP+ EIGELCR K V+FH DAAQAVGKI
Sbjct: 123 DGLVDLDELAAAIRPDTALVSVMAVNNEIGVIQPVEEIGELCREKKVLFHTDAAQAVGKI 182

Query: 181 DINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLATHQ 240
            +++ +  VDLMS S HKIYGPKG+GALY+RR PR R+E  + GGG E G RSGT  T  
Sbjct: 183 PVDVDDLGVDLMSISGHKIYGPKGVGALYVRRRPRVRLEPLMSGGGQERGRRSGTENTPL 242

Query: 241 IVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYING--DMNNRIPHNLNISF 298
           +VG+G A  LA  EM  + K +  L++RL NGL   +  + +NG  D   R P  LN+SF
Sbjct: 243 VVGLGAACELAMKEMDYDEKHMKALRERLLNGLRAKLGGVRVNGPRDPEKRYPGTLNLSF 302

Query: 299 NFIEGESLLMAIKDIAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFTIGRFTTEKE 358
            ++EGESLLM +K++AVSSGSACTSASLEPSYVLRALG  +E+AH S+R  +GRFTTE+E
Sbjct: 303 AYVEGESLLMGLKEVAVSSGSACTSASLEPSYVLRALGVPEEMAHGSLRLGVGRFTTEEE 362

Query: 359 ID 360
           +D
Sbjct: 363 VD 364


>gnl|CDD|132443 TIGR03402, FeS_nifS, cysteine desulfurase NifS.  Members of this
           protein family are NifS, one of several related families
           of cysteine desulfurase involved in iron-sulfur (FeS)
           cluster biosynthesis. NifS is part of the NIF system,
           usually associated with other nif genes involved in
           nitrogenase expression and nitrogen fixation. The
           protein family is given a fairly broad interpretation
           here. It includes a clade nearly always found in
           extended nitrogen fixation genomic regions, plus a
           second clade more closely related to the first than to
           IscS and also part of NifS-like/NifU-like systems. This
           model does not extend to a more distantly clade found in
           the epsilon proteobacteria such as Helicobacter pylori,
           also named NifS in the literature, built instead in
           TIGR03403.
          Length = 379

 Score =  477 bits (1230), Expect = e-169
 Identities = 179/381 (46%), Positives = 252/381 (66%), Gaps = 6/381 (1%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE 60
           +D +ATT +DP V++ M+PY    FGNP+S  H +G    KA+E+AR +VA+L+ A+P E
Sbjct: 3   LDNNATTRVDPEVLEAMLPYFTEYFGNPSSM-HSFGGEVGKAVEEAREQVAKLLGAEPDE 61

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKPKN 120
           II+TSG TES+N AIK A     ++ +HIIT   EH +VL   + LE  G++VTYL    
Sbjct: 62  IIFTSGGTESDNTAIKSALAAQPEK-RHIITTAVEHPAVLSLCQHLEKQGYKVTYLPVDE 120

Query: 121 NGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGKI 180
            G + L +L   I  DT LVSVM  NNE G I PI EIGE+ + +G +FH DA QAVGKI
Sbjct: 121 EGRLDLEELRAAITDDTALVSVMWANNETGTIFPIEEIGEIAKERGALFHTDAVQAVGKI 180

Query: 181 DINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLATHQ 240
            I+++   +D++S S HK++GPKG+GALYIR+   TR    + GG  E G R+GT     
Sbjct: 181 PIDLKEMNIDMLSLSGHKLHGPKGVGALYIRKG--TRFRPLLRGGHQERGRRAGTENVPG 238

Query: 241 IVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNISFNF 300
           IVG+G+A  LA   ++EE  R+  L+ RL  GL+  I +  +NGD   R+P+ +NISF +
Sbjct: 239 IVGLGKAAELATEHLEEENTRVRALRDRLEAGLLARIPDARLNGDPTKRLPNTVNISFEY 298

Query: 301 IEGESLLMAI--KDIAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFTIGRFTTEKE 358
           IEGE++L+ +  + I  SSGSACTS SLEPS+VLRA+G     AH SIRF++ R+ TE++
Sbjct: 299 IEGEAILLLLDMEGICASSGSACTSGSLEPSHVLRAMGVPHTAAHGSIRFSLSRYNTEED 358

Query: 359 IDFTIKLLKMKVKKLRELSPL 379
           ID+ +++L   + +LR +SP 
Sbjct: 359 IDYVLEVLPPIIARLRAMSPF 379


>gnl|CDD|163191 TIGR03235, DNA_S_dndA, cysteine desulfurase DndA.  This model
           describes DndA, a protein related to IscS and part of a
           larger family of cysteine desulfurases. It is encoded,
           typically, divergently from a conserved, sparsely
           distributed operon for sulfur modification of DNA. This
           modification system is designated dnd, after the
           phenotype of DNA degradation during electrophoresis. The
           system is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndB is described
           as a putative ATPase [DNA metabolism,
           Restriction/modification].
          Length = 353

 Score =  399 bits (1028), Expect = e-139
 Identities = 170/353 (48%), Positives = 239/353 (67%), Gaps = 6/353 (1%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE 60
           +D++ATTPIDP V + M+P+L  +FGNP+SR H +G  A+KA+E+AR +VA+ + AD  E
Sbjct: 2   LDHNATTPIDPAVAEAMLPWLLEEFGNPSSRTHEFGHNAKKAVERARKQVAEALGADTEE 61

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRG-KHIITVQTEHKSVLDSIRKLENDGFRVTYLKPK 119
           +I+TSGATESNNLAI G A   +++G KHIIT   EH +VL+ IR LE +GF VTYL   
Sbjct: 62  VIFTSGATESNNLAILGLARAGEQKGKKHIITSAIEHPAVLEPIRALERNGFTVTYLPVD 121

Query: 120 NNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGK 179
            +G I + +L + I  DT+LVS+M VNNE G IQPI EI E+  +    FH DAAQ VGK
Sbjct: 122 ESGRIDVDELADAIRPDTLLVSIMHVNNETGSIQPIREIAEVLEAHEAFFHVDAAQVVGK 181

Query: 180 IDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRN--PRTRIEAQIHGGGHEYGLRSGTLA 237
           I +++   ++DL+S S HKIYGPKGIGAL IR+   P+  ++  + GGG E GLR GTL 
Sbjct: 182 ITVDLSADRIDLISCSGHKIYGPKGIGALVIRKRGKPKAPLKPIMFGGGQERGLRPGTLP 241

Query: 238 THQIVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNIS 297
            H IVGMGEA  +AR   +    ++  ++ +L + L      + +NGD    IPH LN S
Sbjct: 242 VHLIVGMGEAAEIARRNAQAWEVKLRAMRNQLRDALQTL--GVKLNGDPAETIPHILNFS 299

Query: 298 FNFIEGESLLMAIK-DIAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFT 349
            + +  E+L++ ++ D AVS+GSAC+S+  EPS+VL+A+G + + A  +IRF+
Sbjct: 300 IDGVNSEALIVNLRADAAVSTGSACSSSKYEPSHVLQAMGLDTDRARGAIRFS 352


>gnl|CDD|132444 TIGR03403, nifS_epsilon, cysteine desulfurase, NifS family, epsilon
           proteobacteria type.  Members of this family are the
           NifS-like cysteine desulfurase of the epsilon division
           of the Proteobacteria, similar to the NifS protein of
           nitrogen-fixing bacteria. Like NifS, and unlike IscS,
           this protein is found as part of a system of just two
           proteins, a cysteine desulfurase and a scaffold, for
           iron-sulfur cluster biosynthesis. This protein is called
           NifS by Olsen, et al. (PMID:11123951), so we use this
           designation.
          Length = 382

 Score =  337 bits (866), Expect = e-114
 Identities = 170/382 (44%), Positives = 238/382 (62%), Gaps = 9/382 (2%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYA-DPR 59
           +D +ATT +DP+V + M P+    +GNP S  H +G     AI +A +++ + I A D  
Sbjct: 3   LDNNATTMLDPKVKELMDPFFCDIYGNPNSL-HQFGTATHPAIAEALDKLYKGINARDLD 61

Query: 60  EIIWTSGATESNNLAIKGA--AHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLK 117
           +II TS ATESNN  +KG       K    HIIT + EH +V  +   LE+ G  VTYL 
Sbjct: 62  DIIITSCATESNNWVLKGVYFDEILKGGKNHIITTEVEHPAVRATCAFLESLGVEVTYLP 121

Query: 118 PKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAV 177
               G IT  Q+ E I   T LVSVM  NNE G+I PI EIGE+C+ +GV+FH DA QA+
Sbjct: 122 INEQGTITAEQVREAITEKTALVSVMWANNETGMIFPIKEIGEICKERGVLFHTDAVQAI 181

Query: 178 GKIDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLA 237
           GKI +++Q   VD +SFSAHK +GPKG+G LYIR+     +    HGG H  G RSGTL 
Sbjct: 182 GKIPVDVQKAGVDFLSFSAHKFHGPKGVGGLYIRKG--VELTPLFHGGEHMGGRRSGTLN 239

Query: 238 THQIVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNIS 297
              IV MGEA RLA   +  E   +  L+ RL + L+E + ++++ GD  +R+P+ + IS
Sbjct: 240 VPYIVAMGEAMRLANEYLDFEKSHVRRLRDRLEDALLE-LPDVFVVGDREHRVPNTILIS 298

Query: 298 FNFIEGESLLMAIKD--IAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFTIGRFTT 355
              +EGE++L  +    IA S+GSAC S  LE + V+ A+G + ELAH++IR ++ RFTT
Sbjct: 299 IKGVEGEAMLWDLNKAGIAASTGSACASEDLEANPVMVAIGADKELAHTAIRLSLSRFTT 358

Query: 356 EKEIDFTIKLLKMKVKKLRELS 377
           E+EID+TI++ K  V++LR +S
Sbjct: 359 EEEIDYTIEVFKKAVQRLRAIS 380


>gnl|CDD|215829 pfam00266, Aminotran_5, Aminotransferase class-V.  This domain is
           found in amino transferases, and other enzymes including
           cysteine desulphurase EC:4.4.1.-.
          Length = 370

 Score =  297 bits (762), Expect = 2e-98
 Identities = 121/378 (32%), Positives = 177/378 (46%), Gaps = 32/378 (8%)

Query: 2   DYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPR-E 60
           D +ATT     V+D +  Y     GN     H  G  A +A E+AR +VA+ I A    E
Sbjct: 4   DSAATTQKPQAVLDALQEYYTDYNGNVHRGVHHLGKEATQAYEEAREKVAEFINAPSDEE 63

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLEND-GFRVTYLKPK 119
           II+TSG TE+ NL         K  G  I+  + EH + L   ++L    G  V  +   
Sbjct: 64  IIFTSGTTEAINLVAISLGRSLKP-GDEILVTEMEHHANLVPWQELAKRTGATVRVIPVD 122

Query: 120 NNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGK 179
            NGL+ L  LE+ +   T LV++  V+N  G + P+ EIG+L    G +   DAAQAVG 
Sbjct: 123 PNGLLDLDALEKLLTPRTKLVAITHVSNVTGTVNPVEEIGKLAHEYGALVVVDAAQAVGH 182

Query: 180 IDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGG-------HEYG-- 230
             I++Q   VD ++FS HK+YGP GIG LY RR+   ++   + GGG        E    
Sbjct: 183 RPIDVQALGVDFLAFSGHKLYGPTGIGVLYGRRDLLEKLPPLLGGGGMIDTVSLQETTFA 242

Query: 231 -----LRSGTLATHQIVGMGEAFR-LARIEMKEELKRILFLQKRLYNGLMESIEEIYING 284
                  +GT     I+G+G A   LA I ++   K    L   LY  L  +I  I + G
Sbjct: 243 DAPSKFEAGTPNIAGIIGLGAALDYLAEIGLEAIEKHERELAAYLYERL-LAIPGIRLYG 301

Query: 285 DMNNRIPHNLNISFNFIEGESL--LMAIKDIAVSSGSACTSASLEPSYVLRALGCNDELA 342
               R P  ++ +F  +    +  L+  + IAV SG  C     +P  V   L       
Sbjct: 302 PPAERRPSIISFNFPGVHPHDVATLLDERGIAVRSGHHCA----QPLMVRLGL------- 350

Query: 343 HSSIRFTIGRFTTEKEID 360
             ++R ++  + TE+E+D
Sbjct: 351 GGTLRASLYFYNTEEEVD 368


>gnl|CDD|179511 PRK02948, PRK02948, cysteine desulfurase; Provisional.
          Length = 381

 Score =  275 bits (705), Expect = 9e-90
 Identities = 136/368 (36%), Positives = 193/368 (52%), Gaps = 5/368 (1%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE 60
           +DY+ATTP+    +          FGN  S  H  G TA   ++  R   A++I  + + 
Sbjct: 4   LDYAATTPMSKEALQTYQKAASQYFGN-ESSLHDIGGTASSLLQVCRKTFAEMIGGEEQG 62

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKPKN 120
           I +TSG TESN LAI+   +   +  KHIIT   EH S+    + LE+ G+ VT +    
Sbjct: 63  IYFTSGGTESNYLAIQSLLNALPQNKKHIITTPMEHASIHSYFQSLESQGYTVTEIPVDK 122

Query: 121 NGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGKI 180
           +GLI L  LE  I  DT+L S+   N+EIG IQPI EIG L +   V+FH D  Q  GK+
Sbjct: 123 SGLIRLVDLERAITPDTVLASIQHANSEIGTIQPIAEIGALLKKYNVLFHSDCVQTFGKL 182

Query: 181 DINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLATHQ 240
            I++    +D +S SAHKIYGPKG+GA+YI  NP+ R +    G  HE G R GT+    
Sbjct: 183 PIDVFEMGIDSLSVSAHKIYGPKGVGAVYI--NPQVRWKPVFPGTTHEKGFRPGTVNVPG 240

Query: 241 IVGMGEAFRLARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNISFNF 300
           I     A       M+EE  R   L+      +      I + G   + +PH + ++   
Sbjct: 241 IAAFLTAAENILKNMQEESLRFKELRSYFLEQIQTLPLPIEVEGHSTSCLPHIIGVTIKG 300

Query: 301 IEGESLLMAI--KDIAVSSGSACTSASLEPSYVLRALGCNDELAHSSIRFTIGRFTTEKE 358
           IEG+  ++    + IA+S+GSAC     EPS  + A+G   E A   +RF+ G+ TT+ +
Sbjct: 301 IEGQYTMLECNRRGIAISTGSACQVGKQEPSKTMLAIGKTYEEAKQFVRFSFGQQTTKDQ 360

Query: 359 IDFTIKLL 366
           ID TI  L
Sbjct: 361 IDTTIHAL 368


>gnl|CDD|223594 COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 405

 Score =  197 bits (504), Expect = 2e-59
 Identities = 113/392 (28%), Positives = 174/392 (44%), Gaps = 42/392 (10%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPR- 59
           +D +AT+     V+D +  Y R    N     H     A    E AR  VA+ + AD   
Sbjct: 26  LDNAATSQKPQAVLDAVAEYYRRYNANVHRGAHTLAEEATDLYEAAREAVARFLNADSSD 85

Query: 60  EIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLEND-GFRVTYLKP 118
           EI++T G TE+ NL  +G      K G  I+    EH S +   ++L    G +V  +  
Sbjct: 86  EIVFTRGTTEALNLVARGLGR-SLKPGDEIVVSDLEHHSNIVPWQELAKRTGAKVRVIPL 144

Query: 119 KNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVG 178
            ++GL+ L  LE+ I   T LV++  V+N  G + P+ EI EL    G +   DAAQA G
Sbjct: 145 DDDGLLDLDALEKLITPKTKLVALSHVSNVTGTVNPVKEIAELAHEHGALVLVDAAQAAG 204

Query: 179 KIDINIQNYKVDLMSFSAHK-IYGPKGIGALYIRRNPRTRIEAQIHGGGH---------- 227
            + I++Q    D ++FS HK + GP GIG LY+R+     +E  + GGG           
Sbjct: 205 HLPIDVQELGCDFLAFSGHKWLLGPTGIGVLYVRKELLEELEPFLGGGGMIEYVSRDEGV 264

Query: 228 -----EYGLRSGTLATHQIVGMGEAFR-LARIEMKEELKRILFLQKRLYNGLMESIEEIY 281
                     +GT      +G+  A   L  I M+        L + L  GL   +  + 
Sbjct: 265 TLAELPLRFEAGTPNIAGAIGLAAALDYLLEIGMEAIEAHERELTEYLLEGL-SELPGVE 323

Query: 282 ING-DMNNRIPHNLNISFNFIEGES-----LLMAIKDIAVSSGSACTSASLEPSYVLRAL 335
           I G    +R      +SFN ++G        L+  K IAV +G  C         + R L
Sbjct: 324 IYGPPDADRGG---IVSFN-VKGIHPHDVATLLDEKGIAVRAGHHCAQP------LHRLL 373

Query: 336 GCNDELAHSSIRFTIGRFTTEKEIDFTIKLLK 367
           G       ++IR ++  + TE+++D  ++ LK
Sbjct: 374 G-----VDATIRASLHLYNTEEDVDRLLEALK 400


>gnl|CDD|99746 cd06453, SufS_like, Cysteine desulfurase (SufS)-like. This family
           belongs to the pyridoxal phosphate (PLP)-dependent
           aspartate aminotransferase superfamily (fold I). The
           major groups in this CD correspond to cysteine
           desulfurase (SufS) and selenocysteine lyase. SufS
           catalyzes the removal of elemental sulfur and selenium
           atoms from L-cysteine, L-cystine, L-selenocysteine, and
           L-selenocystine to produce L-alanine; and selenocysteine
           lyase catalyzes the decomposition of L-selenocysteine.
          Length = 373

 Score =  188 bits (479), Expect = 5e-56
 Identities = 112/387 (28%), Positives = 171/387 (44%), Gaps = 39/387 (10%)

Query: 2   DYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYA-DPRE 60
           D +AT+     V+D ++ Y R    N     H     A  A E AR +VA+ I A  P E
Sbjct: 4   DNAATSQKPQPVIDAIVDYYRHYNANVHRGVHELSARATDAYEAAREKVARFINAPSPDE 63

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLEND-GFRVTYLKPK 119
           II+T   TE+ NL   G      K G  I+T   EH S +   ++L    G ++  +   
Sbjct: 64  IIFTRNTTEAINLVAYGLG-RANKPGDEIVTSVMEHHSNIVPWQQLAERTGAKLKVVPVD 122

Query: 120 NNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGK 179
           ++G + L  LE+ +   T LV+V  V+N +G I P+ EIGE+    GV    D AQ+ G 
Sbjct: 123 DDGQLDLEALEKLLTERTKLVAVTHVSNVLGTINPVKEIGEIAHEAGVPVLVDGAQSAGH 182

Query: 180 IDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGH--------EYGL 231
           + +++Q+   D ++FS HK+ GP GIG LY +      +     GGG         E   
Sbjct: 183 MPVDVQDLGCDFLAFSGHKMLGPTGIGVLYGKEELLEEMP-PYGGGGEMIEEVSFEETTY 241

Query: 232 R-------SGTLATHQIVGMGEAFR-LARIEMKEELKRILFLQKRLYNGLMESIEEIYIN 283
                   +GT      +G+G A   L +I M+        L       L   I  + + 
Sbjct: 242 ADLPHKFEAGTPNIAGAIGLGAAIDYLEKIGMEAIAAHEHELTAYALERL-SEIPGVRVY 300

Query: 284 GDMNNRIPHNLNISFNF--IEGESLLMAI--KDIAVSSGSACTSASLEPSYVLRALGCND 339
           GD  +R      +SFN   I    +   +    IAV +G  C     +P  ++R LG   
Sbjct: 301 GDAEDRAG---VVSFNLEGIHPHDVATILDQYGIAVRAGHHCA----QP--LMRRLG--- 348

Query: 340 ELAHSSIRFTIGRFTTEKEIDFTIKLL 366
                ++R + G + TE+EID  ++ L
Sbjct: 349 --VPGTVRASFGLYNTEEEIDALVEAL 373


>gnl|CDD|131032 TIGR01977, am_tr_V_EF2568, cysteine desulfurase family protein.
           This model describes a subfamily of probable pyridoxal
           phosphate-dependent enzymes in the aminotransferase
           class V family. Related families contain members active
           as cysteine desulfurases, selenocysteine lyases, or
           both. The members of this family form a distinct clade
           and all are shorter at the N-terminus. The function of
           this subfamily is unknown [Unknown function, Enzymes of
           unknown specificity].
          Length = 376

 Score =  171 bits (435), Expect = 1e-49
 Identities = 120/404 (29%), Positives = 186/404 (46%), Gaps = 71/404 (17%)

Query: 2   DYSATT-PIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYA-DPR 59
           D +ATT P    V + M  + +   G+P    +     A + +E+ R  +A+L  A    
Sbjct: 4   DNAATTYPKPDEVYEAMADFYKNYGGSPGRGRYRLALRASREVEETRQLLAKLFNAPSSA 63

Query: 60  EIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLEND-GFRVTYLKP 118
            +++T+ AT + N+A+KG      K G H+IT   EH SV   +  L+   G  +T +K 
Sbjct: 64  HVVFTNNATTALNIALKGLL----KEGDHVITTPMEHNSVARPLECLKEQIGVEITIVKC 119

Query: 119 KNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVG 178
            N GLI+  +++  I ++T L+ V   +N  G I PI EIGEL +  G+ F  DAAQ  G
Sbjct: 120 DNEGLISPERIKRAIKTNTKLIVVSHASNVTGTILPIEEIGELAQENGIFFILDAAQTAG 179

Query: 179 KIDINIQNYKVDLMSFSAHK-IYGPKGIGALYIRRNPRTRIEAQIHGGGHEYG------- 230
            I I++    +D+++F+ HK + GP+G G LYIR     +++    GG   +        
Sbjct: 180 VIPIDMTELAIDMLAFTGHKGLLGPQGTGGLYIREG--IKLKPLKSGGTGSHSALIDQPS 237

Query: 231 -----LRSGTLATHQIVGMGEA------FRLARIEMKEELKRILFLQKRLYNGLMESIEE 279
                  SGTL T  I G+           +A I  KE +     L ++L NGL E I +
Sbjct: 238 ELPDRFESGTLNTPGIAGLNAGIKFIEKIGIANIAKKECM-----LTEKLLNGLRE-INK 291

Query: 280 IYINGDMN--NRIPHNLNISFNFIEGES------LLMAIKDIAVSSGSACTSASLEPSYV 331
           + I G  +  NR+     +SF  +EG        +L    DIA  +G  C          
Sbjct: 292 VKIYGPADPANRVG---VVSFT-VEGIDSEEVADILDEKFDIATRTGLHCAP-------- 339

Query: 332 LRALGCNDELAHSS--------IRFTIGRFTTEKEIDFTIKLLK 367
                    LAH +        IR ++G F TE+EI+  ++ L 
Sbjct: 340 ---------LAHKTIGTFATGTIRLSLGYFNTEEEIEKLLEALS 374


>gnl|CDD|131034 TIGR01979, sufS, cysteine desulfurases, SufSfamily.  This model
           represents a subfamily of NifS-related cysteine
           desulfurases involved in FeS cluster formation needed
           for nitrogen fixation among other vital functions. Many
           cysteine desulfurases are also active as selenocysteine
           lyase and/or cysteine sulfinate desulfinase. This
           subfamily is associated with the six-gene SUF system
           described in E. coli and Erwinia as an FeS cluster
           formation system during oxidative stress. The active
           site Cys is this subfamily resembles GHHC with one or
           both His conserved [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Other].
          Length = 403

 Score =  151 bits (383), Expect = 9e-42
 Identities = 110/393 (27%), Positives = 173/393 (44%), Gaps = 38/393 (9%)

Query: 4   SATTPIDPR-VVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYA-DPREI 61
           SA T   P+ V+D +  Y R    N     H     A +A E  R +VA+ I A    EI
Sbjct: 24  SAATSQKPQQVIDAVAEYYRNSNANVHRGIHTLSVRATEAYEAVREKVAKFINAASDEEI 83

Query: 62  IWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKL-ENDGFRVTYLKPKN 120
           ++T G TES NL          K G  I+  + EH + +   + L E  G  + ++   +
Sbjct: 84  VFTRGTTESINLVAYSWGDSNLKAGDEIVISEMEHHANIVPWQLLAERTGATLKFIPLDD 143

Query: 121 NGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVGKI 180
           +G + L  LE+ +   T LV++  V+N +G + P+ EI +L    G     D AQAV  +
Sbjct: 144 DGTLDLDDLEKLLTEKTKLVAITHVSNVLGTVNPVEEIAKLAHQVGAKVLVDGAQAVPHM 203

Query: 181 DINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGG------HE------ 228
            +++Q    D   FS HK+YGP GIG LY +     ++   + GG        E      
Sbjct: 204 PVDVQALDCDFYVFSGHKMYGPTGIGVLYGKEELLEQMPPFLGGGEMIAEVSFEETTYNE 263

Query: 229 --YGLRSGTLATHQIVGMGEAFR-LARIEMKEELKRILFLQKRLYNGLMESIEEIYINGD 285
             +   +GT     ++G+G A   L  I ++        L       L E I  + I G 
Sbjct: 264 APHKFEAGTPNIAGVIGLGAAIDYLEAIGLENIEAHEHELTAYALERLGE-IPGLRIYGP 322

Query: 286 MNNRIPHNLNISFNFIEGE-----SLLMAIKDIAVSSGSACTSASLEPSYVLRALGCNDE 340
            +      + ISFN +EG        ++  + IAV SG  C     +P  ++R  G    
Sbjct: 323 RDAEDRGGI-ISFN-VEGVHPHDVGTILDEEGIAVRSGHHCA----QP--LMRRFG---- 370

Query: 341 LAHSSIRFTIGRFTTEKEIDFTIKLLKMKVKKL 373
              ++ R +   + TE++ID  ++ LK KV+K 
Sbjct: 371 -VPATCRASFYIYNTEEDIDALVEALK-KVRKF 401


>gnl|CDD|132433 TIGR03392, FeS_syn_CsdA, cysteine desulfurase, catalytic subunit
           CsdA.  Members of this protein family are CsdS. This
           protein, found Escherichia coli, Yersinia pestis,
           Photorhabdus luminescens, and related species, and
           related to SufS, works together with and physically
           interacts with CsdE (a paralog of SufE). CsdA has
           cysteine desulfurase activity that is enhanced by CsdE,
           a sulfur acceptor protein. This gene pair, although
           involved in FeS cluster biosynthesis, is not found next
           to other such genes as are its paralogs from the Suf or
           Isc systems [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Other].
          Length = 398

 Score = 95.3 bits (237), Expect = 1e-21
 Identities = 64/211 (30%), Positives = 96/211 (45%), Gaps = 7/211 (3%)

Query: 4   SATTPIDPR-VVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYA-DPREI 61
           SA T + P+ V+D    + R+  G      H    +     E AR +VA+ + A D   I
Sbjct: 22  SAATALKPQAVIDATQQFYRLSSGTVHRSQHQQAQSLTARYELARQQVARFLNAPDAENI 81

Query: 62  IWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKL---ENDGFRVTYLKP 118
           +WT G TES NL  +  A    + G  II  + EH + L  I  L   +  G +V  L  
Sbjct: 82  VWTRGTTESINLVAQSYARPRLQPGDEIIVSEAEHHANL--IPWLMVAQQTGAKVVKLPI 139

Query: 119 KNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVG 178
             + L  + QL E +   T ++++  ++N  G    +     L    G +   D AQ V 
Sbjct: 140 GADLLPDIRQLPELLTPRTRILALGQMSNVTGGCPDLARAITLAHQYGAVVVVDGAQGVV 199

Query: 179 KIDINIQNYKVDLMSFSAHKIYGPKGIGALY 209
               ++Q   +D  +FS HK+YGP GIG LY
Sbjct: 200 HGPPDVQALDIDFYAFSGHKLYGPTGIGVLY 230


>gnl|CDD|182799 PRK10874, PRK10874, cysteine sulfinate desulfinase; Provisional.
          Length = 401

 Score = 94.3 bits (235), Expect = 3e-21
 Identities = 62/211 (29%), Positives = 95/211 (45%), Gaps = 7/211 (3%)

Query: 4   SATTPIDPR-VVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYA-DPREI 61
           SA T + P+ V++    +  +  GN                E AR +VAQL+ A D + I
Sbjct: 25  SAATALKPQAVIEATQQFYSLSAGNVHRSQFAAAQRLTARYEAAREQVAQLLNAPDAKNI 84

Query: 62  IWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKL---ENDGFRVTYLKP 118
           +WT G TES NL  +  A    + G  II  + EH + L  +  L   +  G +V  L  
Sbjct: 85  VWTRGTTESINLVAQSYARPRLQPGDEIIVSEAEHHANL--VPWLMVAQQTGAKVVKLPL 142

Query: 119 KNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAVG 178
             + L  +  L E I   T ++++  ++N  G    +     L    G++   D AQ   
Sbjct: 143 GADRLPDVDLLPELITPRTRILALGQMSNVTGGCPDLARAITLAHQAGMVVMVDGAQGAV 202

Query: 179 KIDINIQNYKVDLMSFSAHKIYGPKGIGALY 209
               ++Q   +D  +FS HK+YGP GIG LY
Sbjct: 203 HFPADVQALDIDFYAFSGHKLYGPTGIGVLY 233


>gnl|CDD|215460 PLN02855, PLN02855, Bifunctional selenocysteine lyase/cysteine
           desulfurase.
          Length = 424

 Score = 93.3 bits (232), Expect = 7e-21
 Identities = 105/402 (26%), Positives = 173/402 (43%), Gaps = 41/402 (10%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYA-DPR 59
           +D +AT+     V+D +  Y      N     H     A  A E AR +VA  I A   R
Sbjct: 36  LDNAATSQKPAAVLDALQDYYEEYNSNVHRGIHALSAKATDAYELARKKVAAFINASTSR 95

Query: 60  EIIWTSGATESNNLAIK--GAAHFYKKRGKHIITVQTEHKSVLDSIRKL-ENDGFRVTYL 116
           EI++T  ATE+ NL     G A+   K G  +I    EH S +   + + +  G  + ++
Sbjct: 96  EIVFTRNATEAINLVAYTWGLANL--KPGDEVILSVAEHHSNIVPWQLVAQKTGAVLKFV 153

Query: 117 KPKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQA 176
               + ++ + QL+E +   T LV+   V+N +G I P+ +I     + G     DA Q+
Sbjct: 154 GLTPDEVLDVEQLKELLSEKTKLVATHHVSNVLGSILPVEDIVHWAHAVGAKVLVDACQS 213

Query: 177 VGKIDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGG---------GH 227
           V  + +++Q    D +  S+HK+ GP GIG L+ + +    +   + GG           
Sbjct: 214 VPHMPVDVQTLGADFLVASSHKMCGPTGIGFLWGKSDLLESMPPFLGGGEMISDVFLDHS 273

Query: 228 EYG-----LRSGTLATHQIVGMGEAFR-LARIEMKEELKRILFLQKRLYNGLMESIEEIY 281
            Y        +GT A  + +G+G A   L+ I M    +  + L   LY  L  S+  + 
Sbjct: 274 TYAPPPSRFEAGTPAIGEAIGLGAAIDYLSEIGMDRIHEYEVELGTYLYEKL-SSVPGVR 332

Query: 282 INGDMNNRIPHNLNI-SFNFIEG------ESLLMAIKDIAVSSGSACTSASLEPSYVLRA 334
           I G   +       + +FN +EG       + L     +A+ SG  C     +P +  R 
Sbjct: 333 IYGPKPSEGVGRAALCAFN-VEGIHPTDLSTFLDQQHGVAIRSGHHCA----QPLH--RY 385

Query: 335 LGCNDELAHSSIRFTIGRFTTEKEIDFTIKLLKMKVKKLREL 376
           LG N   A +S+ F    + T++E+D  I  LK  +      
Sbjct: 386 LGVNAS-ARASLYF----YNTKEEVDAFIHALKDTIAFFSSF 422


>gnl|CDD|181766 PRK09295, PRK09295, bifunctional cysteine
           desulfurase/selenocysteine lyase; Validated.
          Length = 406

 Score = 90.6 bits (225), Expect = 5e-20
 Identities = 50/176 (28%), Positives = 80/176 (45%), Gaps = 2/176 (1%)

Query: 39  AEKAIEKARNEVAQLIYA-DPREIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHK 97
           A + +E  R + A  I A    E+++  G TE  NL      +   + G +II  + EH 
Sbjct: 65  ATEKMENVRKQAALFINARSAEELVFVRGTTEGINLVANSWGNSNVRAGDNIIISEMEHH 124

Query: 98  SVLDSIRKL-ENDGFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPIL 156
           + +   + L    G  +  +    +G + L  L       T L+++  V+N +G   P+ 
Sbjct: 125 ANIVPWQMLCARVGAELRVIPLNPDGTLQLETLPALFDERTRLLAITHVSNVLGTENPLA 184

Query: 157 EIGELCRSKGVIFHCDAAQAVGKIDINIQNYKVDLMSFSAHKIYGPKGIGALYIRR 212
           E+  L    G     D AQAV    +++Q    D   FS HK+YGP GIG LY++ 
Sbjct: 185 EMIALAHQHGAKVLVDGAQAVMHHPVDVQALDCDFYVFSGHKLYGPTGIGILYVKE 240


>gnl|CDD|233664 TIGR01976, am_tr_V_VC1184, cysteine desulfurase family protein,
           VC1184 subfamily.  This model describes a subfamily of
           probable pyridoxal phosphate-dependent enzymes in the
           aminotransferase class V family (pfam00266). The most
           closely related characterized proteins are active as
           cysteine desulfurases, selenocysteine lyases, or both;
           some are involved in FeS cofactor biosynthesis and are
           designated NifS. An active site Cys residue present in
           those sequences, in motifs resembling GHHC or GSAC, is
           not found in this family. The function of members of
           this family is unknown, but seems unlike to be as an
           aminotransferase [Unknown function, Enzymes of unknown
           specificity].
          Length = 397

 Score = 82.5 bits (204), Expect = 2e-17
 Identities = 88/399 (22%), Positives = 154/399 (38%), Gaps = 55/399 (13%)

Query: 1   MDYSATTPIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE 60
            D  A T I   V D +   L  +        +     A++ ++ AR  VA L+ ADP E
Sbjct: 21  FDNPAGTQIPQSVADAVSAALT-RSNANRGGAYESSRRADQVVDDAREAVADLLNADPPE 79

Query: 61  IIWTSGATESNNLAIKGAAHFYK--KRGKHIITVQTEHKSVLDS-IRKLENDG--FRVTY 115
           +++ + AT    L    +    +    G  +I  + +H++ +   ++  E  G   +   
Sbjct: 80  VVFGANATS---LTFLLSRAISRRWGPGDEVIVTRLDHEANISPWLQAAERAGAKVKWAR 136

Query: 116 LKPKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQ 175
           +      L     L   +   T LV+V   +N +G I  +  I EL  + G +   DA  
Sbjct: 137 VDEATGELHP-DDLASLLSPRTRLVAVTAASNTLGSIVDLAAITELVHAAGALVVVDAVH 195

Query: 176 AVGKIDINIQNYKVDLMSFSAHKIYGPKGIGALYIRRN----------------PRTRIE 219
                 I++Q    D ++ SA+K +GP  +G L+ R                     R E
Sbjct: 196 YAPHGLIDVQATGADFLTCSAYKFFGPH-MGILWGRPELLMNLPPYKLTFSYDTGPERFE 254

Query: 220 AQIHGGGHEYGLRSGTLATHQ-IVGMGE-AFRLARIEMKEELKRILFLQKRLYNGLMESI 277
                G  +Y L +G +A    + G+GE A    R  +    + I   + RL   L+  +
Sbjct: 255 L----GTPQYELLAGVVAAVDYLAGLGESANGSRRERLVASFQAIDAYENRLAEYLLVGL 310

Query: 278 EE-----IYINGDMNNRIPHNLNISFNFIEGES-----LLMAIKDIAVSSGSACTSASLE 327
            +     +Y    +  R+P    +SF  + G         +A + I   +G         
Sbjct: 311 SDLPGVTLYGVARLAARVP---TVSFT-VHGLPPQRVVRRLADQGIDAWAGHFYAVR--- 363

Query: 328 PSYVLRALGCNDELAHSSIRFTIGRFTTEKEIDFTIKLL 366
              +LR LG NDE     +R  +  + T +E+D  ++ L
Sbjct: 364 ---LLRRLGLNDE--GGVVRVGLAHYNTAEEVDRLLEAL 397


>gnl|CDD|99742 cd01494, AAT_I, Aspartate aminotransferase (AAT) superfamily (fold
           type I) of pyridoxal phosphate (PLP)-dependent enzymes.
           PLP combines with an alpha-amino acid to form a compound
           called a Schiff base or aldimine intermediate, which
           depending on the reaction, is the substrate in four
           kinds of reactions (1) transamination (movement of amino
           groups), (2) racemization (redistribution of
           enantiomers), (3) decarboxylation (removing COOH
           groups), and (4) various side-chain reactions depending
           on the enzyme involved. Pyridoxal phosphate (PLP)
           dependent enzymes were previously classified into alpha,
           beta and gamma classes, based on the chemical
           characteristics (carbon atom involved) of the reaction
           they catalyzed. The availability of several structures
           allowed a comprehensive analysis of  the evolutionary
           classification of PLP dependent enzymes, and it was
           found that the functional classification did not always
           agree with the evolutionary history of these enzymes.
           Structure and sequence analysis has revealed that the
           PLP dependent enzymes can be classified into four major
           groups of different evolutionary origin: aspartate
           aminotransferase superfamily (fold type I), tryptophan
           synthase beta superfamily (fold type II), alanine
           racemase superfamily (fold type III), and D-amino acid
           superfamily (fold type IV) and Glycogen phophorylase
           family (fold type V).
          Length = 170

 Score = 67.4 bits (165), Expect = 2e-13
 Identities = 34/157 (21%), Positives = 62/157 (39%), Gaps = 12/157 (7%)

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKPKN 120
            ++    T +N  A+          G  +I     H S        E  G +   +   +
Sbjct: 20  AVFVPSGTGANEAAL----LALLGPGDEVIVDANGHGSRYWVAA--ELAGAKPVPVPVDD 73

Query: 121 N--GLITLSQLEEKICSDTI-LVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAAQAV 177
              G + ++ LEE      + L+ +       GV+ P+ EI ++ +  G++   DAA A 
Sbjct: 74  AGYGGLDVAILEELKAKPNVALIVITPNTTSGGVLVPLKEIRKIAKEYGILLLVDAASAG 133

Query: 178 GKIDIN---IQNYKVDLMSFSAHKIYGPKGIGALYIR 211
           G        I     D+++FS HK  G +G G + ++
Sbjct: 134 GASPAPGVLIPEGGADVVTFSLHKNLGGEGGGVVIVK 170


>gnl|CDD|223154 COG0076, GadB, Glutamate decarboxylase and related PLP-dependent
           proteins [Amino acid transport and metabolism].
          Length = 460

 Score = 65.1 bits (159), Expect = 2e-11
 Identities = 50/211 (23%), Positives = 80/211 (37%), Gaps = 32/211 (15%)

Query: 64  TSGATESNNLAIKGA-AHFYKKRGKH---------IITVQTEHKSVLDSIRKLENDGFRV 113
           TSG TE+N LA+  A   + K+             I+  +T H S   + R L   G  +
Sbjct: 125 TSGGTEANLLALLAARERWRKRALAESGKPGGKPNIVCSETAHFSFEKAARYL---GLGL 181

Query: 114 TYLK-PKNNGLITLSQLEEKICSDTILVSVMLV--NNEIGVIQPILEIGELCRSKGVIFH 170
             +     +  I +  LEE I  +TI   V+      + G I  I E+ ++    G+  H
Sbjct: 182 RRVPTVPTDYRIDVDALEEAIDENTIGGVVVGTAGTTDTGSIDDIEELADIAEEYGIWLH 241

Query: 171 CDAA---QAV------GKIDINIQNYKVDLMSFSAHKI-YGPKGIGALYIRRNPRTRIEA 220
            DAA     +      G+ D  ++   VD ++   HK    P G G +   R+       
Sbjct: 242 VDAAFGGFLLPFLEPDGRWDFGLEG--VDSITVDGHKYGLAPIGCGVVLF-RDEEALRRI 298

Query: 221 QIHGGGHEYGLRSGTLATHQIVGMGEAFRLA 251
            I     +Y L  G +    I+G     +  
Sbjct: 299 LIF---ADYYLPGGGIPNFTILGSRPGRQAL 326


>gnl|CDD|234554 TIGR04343, egtE_PLP_lyase, ergothioneine biosynthesis PLP-dependent
           enzyme EgtE.  Members of this protein family are the
           pyridoxal phosphate-dependent enzyme EgtE, which
           catalyzes the final step in the biosynthesis of
           ergothioneine [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Glutathione and analogs].
          Length = 370

 Score = 59.9 bits (145), Expect = 6e-10
 Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 8/146 (5%)

Query: 110 GFRVTYLKPKNNGLITLSQLEEKICSDTI-LVSVMLVNNEIGVIQPILEIGELCRSKGVI 168
           GF V  L   ++G + +      + +D   LV + +V +  G +QP   + ELCR+ GV 
Sbjct: 116 GFEVRELPVDDDGRLDVDGAASALRADPPALVHLTVVGSHRGTVQPAAAVAELCRAAGVP 175

Query: 169 FHCDAAQAVGKIDINIQNYKVDLMSFSAHK-IYGPKGIGALYIRRNPRTRIEAQIHGGGH 227
              DAAQA+G +D        D    S+ K + GP+G+G L +R     R+  ++     
Sbjct: 176 LVVDAAQALGHLDC---AVGADAYYSSSRKWLAGPRGVGVLAVRPELLERLAPRLPPPDW 232

Query: 228 EYGLRSGTLATHQIVGMGEAFRLARI 253
                +  LA  Q +  GEA   AR+
Sbjct: 233 P---EASPLAVAQRLEHGEANIAARV 255


>gnl|CDD|213793 TIGR03301, PhnW-AepZ, 2-aminoethylphosphonate aminotransferase.
           This family includes a number of 2-aminoethylphosphonate
           aminotransferases, some of which are indicated to
           operate in the catabolism of 2-aminoethylphosphonate
           (AEP) and others which are involved in the biosynthesis
           of the same compound. The catabolic enzyme (PhnW, ) is
           known to use pyruvate:alanine as the transfer partner
           and is modeled by the equivalog-level TIGR02326. The
           PhnW family is apparently a branch of a larger tree
           including genes (AepZ) adjacent to others responsible
           for the biosynthesis of phosphonoacetaldehyde. The
           identity of the transfer partner is unknown for these
           enzymes and considering the reversed flux compared to
           PhnW, it may very well be different.
          Length = 355

 Score = 55.5 bits (134), Expect = 2e-08
 Identities = 31/102 (30%), Positives = 52/102 (50%), Gaps = 3/102 (2%)

Query: 114 TYLKPKNNGLITLSQLEEKICSDTILVSVMLVNNEI--GVIQPILEIGELCRSKGVIFHC 171
           T L         L+++EE + +D  +  V  V++E   G++ P+  I ++ RS G +   
Sbjct: 101 TDLNFSEYEPPDLNRIEEALAADPDITHVATVHHETTTGILNPLEAIAKVARSHGAVLIV 160

Query: 172 DAAQAVGKIDINIQNYKVDLMSFSAHK-IYGPKGIGALYIRR 212
           DA  + G I I+I+   VD +  SA+K + G  G G +  RR
Sbjct: 161 DAMSSFGAIPIDIEELDVDALIASANKCLEGVPGFGFVIARR 202


>gnl|CDD|99743 cd06450, DOPA_deC_like, DOPA decarboxylase family. This family
           belongs to pyridoxal phosphate (PLP)-dependent aspartate
           aminotransferase superfamily (fold I). The major groups
           in this CD correspond to DOPA/tyrosine decarboxylase
           (DDC), histidine decarboxylase (HDC), and glutamate
           decarboxylase (GDC). DDC is active as a dimer and
           catalyzes the decarboxylation of tyrosine. GDC catalyzes
           the decarboxylation of glutamate and HDC catalyzes the
           decarboxylation of histidine.
          Length = 345

 Score = 54.9 bits (133), Expect = 2e-08
 Identities = 48/183 (26%), Positives = 68/183 (37%), Gaps = 46/183 (25%)

Query: 62  IWTSGATESNNLAIKGAAHFYKKRGKH----------IITVQTEHKSV-------LDSIR 104
           ++TSG +ESN LA+  A    +KR K           I+     H SV          +R
Sbjct: 61  VFTSGGSESNLLALLAARDRARKRLKAGGGRGIDKLVIVCSDQAHVSVEKAAAYLDVKVR 120

Query: 105 KLENDGFRVTYLKPKNNGLITLSQLEEKICSD------TILVSVMLVNNEIGVIQPILEI 158
            +  D           +G +    LE  I  D       I+V       + G I P+ EI
Sbjct: 121 LVPVDE----------DGRMDPEALEAAIDEDKAEGLNPIMVVATAGTTDTGAIDPLEEI 170

Query: 159 GELCRSKGVIFHCDAAQAVGKI--------DINIQNYKVDLMSFSAHKIYG--PKGIGAL 208
            +L     +  H DAA     +        D  I+   VD +S   HK YG  P G  A+
Sbjct: 171 ADLAEKYDLWLHVDAAYGGFLLPFPEPRHLDFGIER--VDSISVDPHK-YGLVPLGCSAV 227

Query: 209 YIR 211
            +R
Sbjct: 228 LVR 230


>gnl|CDD|234361 TIGR03812, tyr_de_CO2_Arch, tyrosine decarboxylase MnfA.  Members
           of this protein family are the archaeal form, MnfA, of
           tyrosine decarboxylase, and are involved in methanofuran
           biosynthesis. Members show clear homology to the
           Enterococcus form, Tdc, that is involved in tyrosine
           decarboxylation for resistance to acidic conditions
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Other].
          Length = 373

 Score = 53.9 bits (130), Expect = 5e-08
 Identities = 30/113 (26%), Positives = 53/113 (46%), Gaps = 5/113 (4%)

Query: 64  TSGATESNNLAIKGAAHFY--KKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKPKNN 121
            SG TE+N  A++ A +    +KR  +II  ++ H S     +  E  G  + Y     +
Sbjct: 82  VSGGTEANIQAVRAAKNLAREEKRTPNIIVPESAHFSF---EKAAEMLGLELRYAPLDED 138

Query: 122 GLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAA 174
             + +  +E+ I  +TI +  +    E+G I  I E+ ++    G+  H DAA
Sbjct: 139 YTVDVKDVEDLIDDNTIGIVGIAGTTELGQIDDIEELSKIALENGIYLHVDAA 191


>gnl|CDD|223153 COG0075, COG0075, Serine-pyruvate aminotransferase/archaeal
           aspartate aminotransferase [Amino acid transport and
           metabolism].
          Length = 383

 Score = 51.1 bits (123), Expect = 5e-07
 Identities = 31/149 (20%), Positives = 61/149 (40%), Gaps = 18/149 (12%)

Query: 124 ITLSQLEEKICSDTILVSVMLVNNEI--GVIQPILEIGELCRSKGVIFHCDAAQAVGKID 181
           +   ++EE +  D  + +V +V+NE   GV+ P+ EI +  +  G +   DA  ++G   
Sbjct: 117 VDPEEVEEALDKDPDIKAVAVVHNETSTGVLNPLKEIAKAAKEHGALLIVDAVSSLGGEP 176

Query: 182 INIQNYKVDLMSFSAHK-IYGPKGIGALYIRRNPRTRIEAQIHGG-----------GHEY 229
           + +  + +D+    + K +  P G+  + +       IE + H               + 
Sbjct: 177 LKVDEWGIDVAITGSQKALGAPPGLAFVAVSERALEAIEERKHPSFYLDLKKWLKYMEKK 236

Query: 230 GLRSGTLATHQIVGMGEAFRLARIEMKEE 258
           G    T   + I  + EA  L    + EE
Sbjct: 237 GSTPYTPPVNLIYALREALDL----ILEE 261


>gnl|CDD|237409 PRK13520, PRK13520, L-tyrosine decarboxylase; Provisional.
          Length = 371

 Score = 50.3 bits (121), Expect = 8e-07
 Identities = 31/119 (26%), Positives = 56/119 (47%), Gaps = 19/119 (15%)

Query: 64  TSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKS------VLD-SIRKLE-NDGFRVTY 115
           TSG TE+N  A++ A +  K    +I+  ++ H S      +L   +R+   +D +RV  
Sbjct: 82  TSGGTEANIQAVRAARNLAKAEKPNIVVPESAHFSFDKAADMLGVELRRAPLDDDYRVD- 140

Query: 116 LKPKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCRSKGVIFHCDAA 174
                     +  +E+ I  +TI +  +    E+G + PI E+ ++    G+  H DAA
Sbjct: 141 ----------VKAVEDLIDDNTIGIVGIAGTTELGQVDPIPELSKIALENGIFLHVDAA 189


>gnl|CDD|224919 COG2008, GLY1, Threonine aldolase [Amino acid transport and
           metabolism].
          Length = 342

 Score = 49.6 bits (119), Expect = 1e-06
 Identities = 44/196 (22%), Positives = 75/196 (38%), Gaps = 43/196 (21%)

Query: 34  IYGWTAEKAIEKARNEVAQLIYADPREIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQ 93
           +YG   +         +A+L        ++    T++N LA+  AAH   + G+ +I  +
Sbjct: 30  VYG--EDPTTNALEQRIAELF--GKEAALFVPSGTQANQLAL--AAHC--QPGESVICHE 81

Query: 94  TEH--------KSVLDSIRKLENDGFRVTYLKPKNNGLITLSQLEEKICSDTI------L 139
           T H               +KL         + P  +G +T   +E  I  D I      L
Sbjct: 82  TAHIYTDECGAPEFFGGGQKL--------PIVPGADGKLTPEDVEAAIRPDDIHHAPTPL 133

Query: 140 VSVMLVNNEIGVIQP---ILEIGELCRSKGVIFHCDAAQ---AVGKIDINIQNYK--VDL 191
             +     E G + P   +  I  +C+  G+  H D A+   A+  + + ++  K  VD 
Sbjct: 134 AVLENTATEGGTVYPLDELEAISAVCKEHGLPLHMDGARLANALVALGVALKTIKSYVDS 193

Query: 192 MSFSAHKIYGPKGIGA 207
           +SF        KG GA
Sbjct: 194 VSFC-----LTKGGGA 204


>gnl|CDD|99744 cd06451, AGAT_like, Alanine-glyoxylate aminotransferase (AGAT)
           family. This family belongs to pyridoxal phosphate
           (PLP)-dependent aspartate aminotransferase superfamily
           (fold I). The major groups in this CD correspond to
           alanine-glyoxylate aminotransferase (AGAT),
           serine-glyoxylate aminotransferase (SGAT), and
           3-hydroxykynurenine transaminase (HKT). AGAT is a
           homodimeric protein, which catalyses the transamination
           of glyoxylate to glycine, and SGAT converts serine and
           glyoxylate to hydroxypyruvate and glycine. HKT catalyzes
           the PLP-dependent transamination of 3-hydroxykynurenine,
           a potentially toxic metabolite of the kynurenine
           pathway.
          Length = 356

 Score = 48.4 bits (116), Expect = 3e-06
 Identities = 38/186 (20%), Positives = 71/186 (38%), Gaps = 19/186 (10%)

Query: 110 GFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVNNE--IGVIQPILEIGELCRSKGV 167
           G  V  ++      ++  ++ E +    I  +V L +NE   GV+ P+  IG L +    
Sbjct: 97  GADVDVVEKPWGEAVSPEEIAEALEQHDI-KAVTLTHNETSTGVLNPLEGIGALAKKHDA 155

Query: 168 IFHCDAAQAVGKIDINIQNYKVDLMSFSAHKIYG-PKGIGALYIRRNPRTRIEAQ----- 221
           +   DA  ++G     +  + VD+    + K  G P G+G +        RI+ +     
Sbjct: 156 LLIVDAVSSLGGEPFRMDEWGVDVAYTGSQKALGAPPGLGPIAFSERALERIKKKTKPKG 215

Query: 222 ----IHGGGHEYGLRSGTLAT---HQIVGMGEAFRLARIEMKEELKRILFLQKRLYNGLM 274
               +      +G       T   + +  + EA  L     +E L+      +RL   L 
Sbjct: 216 FYFDLLLLLKYWGEGYSYPHTPPVNLLYALREALDLIL---EEGLENRWARHRRLAKALR 272

Query: 275 ESIEEI 280
           E +E +
Sbjct: 273 EGLEAL 278


>gnl|CDD|233915 TIGR02539, SepCysS, O-phospho-L-seryl-tRNA:Cys-tRNA synthase.
           Aminoacylation of tRNA(Cys) with Cys, and cysteine
           biosynthesis in the process, happens in
           Methanocaldococcus jannaschii and several other archaea
           by misacylation of tRNA(Cys) with O-phosphoserine (Sep),
           followed by modification of the phosphoserine to
           cysteine. In some species, direct tRNA-cys
           aminoacylation also occurs but this pathway is required
           for Cys biosynthesis. Members of this protein catalyze
           the second step in this two step pathway, using
           pyridoxal phosphate and a sulfur donor to synthesize Cys
           from Sep while attached to the tRNA.
          Length = 370

 Score = 42.1 bits (99), Expect = 3e-04
 Identities = 23/107 (21%), Positives = 53/107 (49%), Gaps = 4/107 (3%)

Query: 102 SIRKLENDGFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGEL 161
           +++++ + G     + P+  G + + ++E++     +L  +  V+ E G +    ++ ++
Sbjct: 113 NVKEVPHTGHPEYKVDPEGYGEV-IEEVEDESGKPPVLALLTHVDGEYGNLPDAGKVAKV 171

Query: 162 CRSKGVIFHCDAAQAVGKIDINIQNYKVDLMSFSAHK---IYGPKGI 205
           CR KGV    + A  VG++ ++ +    D +  S HK     GP G+
Sbjct: 172 CREKGVPLLLNCAYTVGRMPVSAKEIGADFIVGSGHKSMAASGPCGV 218


>gnl|CDD|216367 pfam01212, Beta_elim_lyase, Beta-eliminating lyase. 
          Length = 288

 Score = 41.1 bits (97), Expect = 5e-04
 Identities = 47/180 (26%), Positives = 68/180 (37%), Gaps = 35/180 (19%)

Query: 110 GFRVTYLKPKNNGLITLSQLEEKICSDTI-------LVSVMLVNNEIG--VIQP--ILEI 158
           G +   L     G + L  LE  I            L+S+   +N  G  V+    + EI
Sbjct: 95  GAQPVPLPGAEAGKLDLEDLEAAIRPVGDIHFPPTGLISLENTHNSAGGQVVSLEELREI 154

Query: 159 GELCRSKGVIFHCD------AAQAVGKIDINIQNYKVDLMSFSAHKIYGPKGIGAL---- 208
             + R  G+  H D      AA A+G I   I +Y  D +S S  K  G   +G++    
Sbjct: 155 RAIAREHGIPLHLDGARLANAAVALGVIVKEITSY-ADSVSMSLSKGLG-APVGSVLAGS 212

Query: 209 --YIRRNPRTRIEAQIHGGGHEYGLR-SGTLATHQIVGMGE-AFRLAR-IEMKEELKRIL 263
             +I    R R   +  GG    G+R +G LA   +  + E   RL R       L   L
Sbjct: 213 DDFIAYARRQR---KYLGG----GMRQAGVLAAAGLRALEEGVARLGRDHATARRLAEGL 265


>gnl|CDD|215384 PLN02724, PLN02724, Molybdenum cofactor sulfurase.
          Length = 805

 Score = 41.8 bits (98), Expect = 5e-04
 Identities = 56/234 (23%), Positives = 97/234 (41%), Gaps = 37/234 (15%)

Query: 25  FGNPASRNHIYGWTAEKAIEKARNEVAQLIYADPRE--IIWTSGATESNNLAIKGAAHFY 82
           +GNP S++     +++  IE AR +V +   A P +   ++TSGAT +  L + G    +
Sbjct: 63  YGNPHSQSDSSMRSSD-TIESARQQVLEYFNAPPSDYACVFTSGATAA--LKLVGETFPW 119

Query: 83  KKRGKHIITVQTEHKSVLDSIRK--LENDG------FRVTYLKPKNNGL---ITLSQLEE 131
                   T++  H SVL  IR+  LE               +P N+     +    L+ 
Sbjct: 120 SSESHFCYTLEN-HNSVL-GIREYALEKGAAAIAVDIEEAANQPTNSQGSVVVKSRGLQR 177

Query: 132 KICS---------DTI-LVSVMLVNNEIGVIQPILEIGELCRSKGVIFH--------CDA 173
           +  S         +   L +     N  G   P+  +  +  ++   F          DA
Sbjct: 178 RNTSKLQKREDDGEAYNLFAFPSECNFSGAKFPLDLVKLIKDNQHSNFSKSGRWMVLLDA 237

Query: 174 AQAVGKIDINIQNYKVDLMSFSAHKIYG-PKGIGALYIRRNPRTRIEAQIHGGG 226
           A+  G    ++  Y  D +  S +KI+G P G+GAL +RR+    ++ +  GGG
Sbjct: 238 AKGCGTSPPDLSRYPADFVVVSFYKIFGYPTGLGALLVRRDAAKLLKKKYFGGG 291


>gnl|CDD|234769 PRK00451, PRK00451, glycine dehydrogenase subunit 1; Validated.
          Length = 447

 Score = 40.5 bits (96), Expect = 0.001
 Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 8/94 (8%)

Query: 79  AHFYKKRGKHII--TVQTEHKSVLDSIRKLENDGFRVTYLKPKNNGLITLSQLEEKICSD 136
           A    KR K ++   V  E++ VL +   L+  G  V  + P  +G+  L  LE  +  D
Sbjct: 148 AVRITKRKKVLVSGAVHPEYREVLKTY--LKGQGIEVVEV-PYEDGVTDLEALEAAVDDD 204

Query: 137 TILVSVMLVN-NEIGVIQPILEIGELCRSKGVIF 169
           T   +V++   N  GVI+ + EI E+  + G +F
Sbjct: 205 T--AAVVVQYPNFFGVIEDLEEIAEIAHAGGALF 236


>gnl|CDD|224028 COG1103, COG1103, Archaea-specific pyridoxal phosphate-dependent
           enzymes [General function prediction only].
          Length = 382

 Score = 38.6 bits (90), Expect = 0.004
 Identities = 20/106 (18%), Positives = 47/106 (44%), Gaps = 4/106 (3%)

Query: 103 IRKLENDGFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELC 162
           + ++ N G+    + P+      + +++++      L  +  V+ E G +    ++ ++C
Sbjct: 124 VAEVPNTGYPEYKITPEGYAE-VIEEVKDEGGDPPALALLTHVDGEYGNLADAKKVAKIC 182

Query: 163 RSKGVIFHCDAAQAVGKIDINIQNYKVDLMSFSAHKIY---GPKGI 205
           R  GV    + A  VG++ ++ +    D +  S HK      P G+
Sbjct: 183 REYGVPLLLNCAYTVGRMPVSGKEIGADFIVGSGHKSMAASAPIGV 228


>gnl|CDD|235989 PRK07324, PRK07324, transaminase; Validated.
          Length = 373

 Score = 37.6 bits (88), Expect = 0.008
 Identities = 44/152 (28%), Positives = 66/152 (43%), Gaps = 31/152 (20%)

Query: 35  YGWTAEKAIEKA---RNEVAQLIYA--DPREIIWTSGATESNNLAIKGAAHFYKKRGKHI 89
           YGW     IE +   +  VA L Y    P  I+ T+GAT +N L +        + G H+
Sbjct: 58  YGW-----IEGSPEFKEAVASL-YQNVKPENILQTNGATGANFLVLYALV----EPGDHV 107

Query: 90  ITV----QTEHKSVLDSIRKLENDGFRVTY--LKPKNNGLITLSQLEEKICSDTILVSVM 143
           I+V    Q +   + +S+      G  V Y  LK +N  L  L +L   +  +T L+ + 
Sbjct: 108 ISVYPTYQ-QLYDIPESL------GAEVDYWQLKEENGWLPDLDELRRLVRPNTKLICIN 160

Query: 144 LVNNEIGVI--QPIL-EIGELCRSKGVIFHCD 172
             NN  G +  +  L EI E+ RS       D
Sbjct: 161 NANNPTGALMDRAYLEEIVEIARSVDAYVLSD 192


>gnl|CDD|184076 PRK13479, PRK13479, 2-aminoethylphosphonate--pyruvate transaminase;
           Provisional.
          Length = 368

 Score = 37.2 bits (87), Expect = 0.012
 Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 110 GFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVNNEI--GVIQPILEIGELCRSKGV 167
           G     L    +     +++E  + +D  +  V LV+ E   G++ P+ EI  + +  G 
Sbjct: 103 GIAHVVLDTGEDEPPDAAEVEAALAADPRITHVALVHCETTTGILNPLDEIAAVAKRHGK 162

Query: 168 IFHCDAAQAVGKIDINIQNYKVDLMSFSAHK-IYGPKGIGALYIRR 212
               DA  + G I I+I    +D +  SA+K I G  G G +  RR
Sbjct: 163 RLIVDAMSSFGAIPIDIAELGIDALISSANKCIEGVPGFGFVIARR 208


>gnl|CDD|99737 cd00613, GDC-P, Glycine cleavage system P-protein, alpha- and
           beta-subunits. This family consists of Glycine cleavage
           system P-proteins EC:1.4.4.2 from bacterial, mammalian
           and plant sources. The P protein is part of the glycine
           decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also
           annotated as glycine cleavage system or glycine
           synthase. GDC consists of four proteins P, H, L and T.
           The reaction catalysed by this protein is: Glycine +
           lipoylprotein <=> S-aminomethyldihydrolipoylprotein +
           CO2. Alpha-beta-type dimers associate to form an
           alpha(2)beta(2) tetramer, where the alpha- and
           beta-subunits are structurally similar and appear to
           have arisen by gene duplication and subsequent
           divergence with a loss of one active site. The members
           of this CD are widely dispersed among all three forms of
           cellular life.
          Length = 398

 Score = 37.2 bits (87), Expect = 0.014
 Identities = 43/198 (21%), Positives = 73/198 (36%), Gaps = 56/198 (28%)

Query: 78  AAHFYKKRGKHIITVQTEHKSVLDSIR-KLENDGFRVTYLKPKNNGLITLSQLEEKICSD 136
           A   Y KR K ++   + H +     R + E  G  V  +     G + L  L+E++  +
Sbjct: 101 AIRAYHKRNK-VLVPDSAHPTNPAVARTRGEPLGIEVVEVPSDEGGTVDLEALKEEVSEE 159

Query: 137 TILVSVMLVN-NEIGVI-QPILEIGELCRSKGVIFHCDAAQAVGKIDINIQ------NYK 188
               ++M+   N +GV    I EI ++  S G + + D        ++N+        Y 
Sbjct: 160 V--AALMVQYPNTLGVFEDLIKEIADIAHSAGALVYVDGD------NLNLTGLKPPGEYG 211

Query: 189 VDLMSFSAHKIYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLATHQ-------- 240
            D++  +  K   P                    HGGG   G  +G  A  +        
Sbjct: 212 ADIVVGNLQKTGVP--------------------HGGG---GPGAGFFAVKKELVRFLPG 248

Query: 241 -IVGMGE------AFRLA 251
            +VG+ +      AFRLA
Sbjct: 249 RLVGVTKDAEGNRAFRLA 266


>gnl|CDD|235292 PRK04366, PRK04366, glycine dehydrogenase subunit 2; Validated.
          Length = 481

 Score = 37.0 bits (87), Expect = 0.017
 Identities = 28/97 (28%), Positives = 47/97 (48%), Gaps = 15/97 (15%)

Query: 110 GFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVN-NEIGVIQP-ILEIGELCRSKGV 167
           GF+V  +    +GL+ L  L+  +  DT   ++ML N N +G+ +  ILEI E+    G 
Sbjct: 182 GFKVVEIPSNEDGLVDLEALKAAVGEDT--AALMLTNPNTLGLFERNILEIAEIVHEAGG 239

Query: 168 IFHCDAAQA---VGKI---DINIQNYKVDLMSFSAHK 198
           + + D A     +GK    D+       D++  + HK
Sbjct: 240 LLYYDGANLNAILGKARPGDMG-----FDVVHLNLHK 271


>gnl|CDD|99748 cd06502, TA_like, Low-specificity threonine aldolase (TA). This
           family belongs to pyridoxal phosphate (PLP)-dependent
           aspartate aminotransferase superfamily (fold I).  TA
           catalyzes the conversion of L-threonine or
           L-allo-threonine to glycine and acetaldehyde in a
           secondary glycine biosynthetic pathway.
          Length = 338

 Score = 36.5 bits (85), Expect = 0.021
 Identities = 41/183 (22%), Positives = 62/183 (33%), Gaps = 33/183 (18%)

Query: 35  YGWTAEKAIEKARNEVAQLIYADPREIIWTSGATESNNLAIKGAAHFYKK----RGKHII 90
           YG   +    K     A+L        ++    T +N LA+                HI 
Sbjct: 28  YG--EDPTTAKLEARAAELF--GKEAALFVPSGTAANQLALAAHTQPGGSVICHETAHIY 83

Query: 91  TVQTEHKSVLDSIRKLENDGFRVTYLKPKNNGLITLSQLEEKICSD-------TILVSVM 143
           T +      L  ++ L   G          NG +T   LE  I            LVS +
Sbjct: 84  TDEAGAPEFLSGVKLLPVPG---------ENGKLTPEDLEAAIRPRDDIHFPPPSLVS-L 133

Query: 144 LVNNEIGVIQP---ILEIGELCRSKGVIFHCDAAQ---AVGKIDINIQNYK--VDLMSFS 195
               E G + P   +  I  L +  G+  H D A+   A   + + ++ YK  VD +SF 
Sbjct: 134 ENTTEGGTVYPLDELKAISALAKENGLPLHLDGARLANAAAALGVALKTYKSGVDSVSFC 193

Query: 196 AHK 198
             K
Sbjct: 194 LSK 196


>gnl|CDD|215756 pfam00155, Aminotran_1_2, Aminotransferase class I and II. 
          Length = 357

 Score = 36.1 bits (84), Expect = 0.029
 Identities = 43/255 (16%), Positives = 82/255 (32%), Gaps = 35/255 (13%)

Query: 45  KARNEVAQLIYADP-------REIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHK 97
           + R  +A+ +   P         +++ SGA   N  A+             I+     + 
Sbjct: 45  ELREALAKFLGRSPVLKLDREAAVVFGSGAGA-NIEALIFLLRLNPGDA--ILVPAPTYP 101

Query: 98  SVLDSIRKLENDGFRVTY-LKPKNNGLITLSQLEEKICSDTILVSVMLVN---NEIGVIQ 153
           S +   R     G  V Y L   N+  +    LE  +   T    V+L     N  G + 
Sbjct: 102 SYIRIFRL--AGGEVVRYPLYSSNDFHLDFDALEAALKEATEGNKVVLHTSPHNPTGTVA 159

Query: 154 P---ILEIGELCRSKGVIFHCDAAQAVG--------KIDINIQNYKVDLMSFSAHKIYGP 202
               + ++ +L +   ++   D A A               +      L+  S  K +G 
Sbjct: 160 TLEELEKLLDLAKEHNILLLVDEAYAGFVFGSLDAVATRALLAEGPNLLVVGSFSKAFGL 219

Query: 203 KG--IGALYIRRNPRTRIEAQIHGGGHEYGLRSGTLATHQIVGMGEAFRLARIEMKEELK 260
            G  +G +       +++  ++    +     S T              L   E++E  +
Sbjct: 220 AGWRVGYILGNAAVISQL-RKLARPFY-----SSTHLQAAAAAALSDPLLVASELEEMRQ 273

Query: 261 RILFLQKRLYNGLME 275
           RI   +  L +GL  
Sbjct: 274 RIKERRDYLRDGLEA 288


>gnl|CDD|131379 TIGR02326, transamin_PhnW, 2-aminoethylphosphonate--pyruvate
           transaminase.  Members of this family are
           2-aminoethylphosphonate--pyruvate transaminase. This
           enzyme acts on the most common type of naturally
           occurring phosphonate. It interconverts
           2-aminoethylphosphonate plus pyruvate with
           2-phosphonoacetaldehyde plus alanine. The enzyme
           phosphonoacetaldehyde hydrolase (EC 3.11.1.1), usually
           encoded by an adjacent gene, then cleaves the C-P bond
           of phosphonoacetaldehyde, adding water to yield
           acetaldehyde plus inorganic phosphate. Species with this
           pathway generally have an identified phosphonate ABC
           transporter but do not also have the multisubunit C-P
           lysase complex as found in Escherichia coli [Central
           intermediary metabolism, Phosphorus compounds].
          Length = 363

 Score = 35.5 bits (82), Expect = 0.037
 Identities = 25/91 (27%), Positives = 45/91 (49%), Gaps = 3/91 (3%)

Query: 125 TLSQLEEKICSDTILVSVMLVNNEI--GVIQPILEIGELCRSKGVIFHCDAAQAVGKIDI 182
            + ++E  + +D  +  + LV+ E   G++ PI  + +L    G +   DA  + G I I
Sbjct: 116 DVVEVEAILAADPAITHIALVHCETTTGILNPIEAVAKLAHRHGKVTIVDAMSSFGGIPI 175

Query: 183 NIQNYKVDLMSFSAHK-IYGPKGIGALYIRR 212
           +I    +D +  SA+K I G  G G +  R+
Sbjct: 176 DIAELHIDYLISSANKCIQGVPGFGFVIARQ 206


>gnl|CDD|99745 cd06452, SepCysS, Sep-tRNA:Cys-tRNA synthase. This family belongs
           to the pyridoxal phosphate (PLP)-dependent aspartate
           aminotransferase superfamily (fold I). Cys-tRNA(Cys) is
           produced by O-phosphoseryl-tRNA synthetase which ligates
           O-phosphoserine (Sep) to tRNA(Cys), and
           Sep-tRNA:Cys-tRNA synthase (SepCysS) converts
           Sep-tRNA(Cys) to Cys-tRNA(Cys), in methanogenic archaea.
           SepCysS forms a dimer, each monomer is composed of a
           large and small domain; the larger, a typical pyridoxal
           5'-phosphate (PLP)-dependent-like enzyme fold.  In the
           active site of each monomer, PLP is covalently bound to
           a conserved Lys residue near the dimer interface.
          Length = 361

 Score = 35.1 bits (81), Expect = 0.050
 Identities = 48/251 (19%), Positives = 91/251 (36%), Gaps = 34/251 (13%)

Query: 64  TSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDS-------IRKLENDGFRVTYL 116
           T GA E          H   ++G  ++     H +   +       +R++ N G    ++
Sbjct: 65  TPGARE----GKFAVMHSLCEKGDWVVVDGLAHYTSYVAAERAGLNVREVPNTGHPEYHI 120

Query: 117 KPKNNGLITLSQLEEKICSDTILVSVML---VNNEIGVIQPILEIGELCRSKGVIFHCDA 173
            P+         +EE         ++ L   V+   G +    +I ++C   GV    + 
Sbjct: 121 TPEG----YAEVIEEVKDEFGKPPALALLTHVDGNYGNLHDAKKIAKVCHEYGVPLLLNG 176

Query: 174 AQAVGKIDINIQNYKVDLMSFSAHKIY---GPKGIGALYIRRNPRTRIEAQIHGGGHEYG 230
           A  VG++ ++ +    D +  S HK      P G+ A            +Q+     E  
Sbjct: 177 AYTVGRMPVSGKELGADFIVGSGHKSMAASAPIGVLATTEEWADIVFRTSQMF-KIKEVE 235

Query: 231 LRSGTLATHQIVGMGEAFR--LARIE-MKEELKRILFLQKRLYNGLMESIEEIYINGDMN 287
           L   TL    +V +  +F     R++   EE+++  +         +E IE I   G+  
Sbjct: 236 LLGCTLRGAPLVTLMASFPHVKERVKRWDEEVEKARWFVAE-----LEKIEGIKQLGEK- 289

Query: 288 NRIPHNLNISF 298
              P N ++ F
Sbjct: 290 ---PKNHDLMF 297


>gnl|CDD|235823 PRK06541, PRK06541, hypothetical protein; Provisional.
          Length = 460

 Score = 34.6 bits (80), Expect = 0.077
 Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 6/64 (9%)

Query: 34  IYGWTAEKAIEKARNEVAQLIYADPREIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQ 93
           ++ +    AIE A   +A L   D   + +T+G +E+   A K A  ++K  GK      
Sbjct: 89  LWSYAHPPAIELA-ERLAALAPGDLNRVFFTTGGSEAVESAWKLAKQYFKLTGKP----- 142

Query: 94  TEHK 97
            +HK
Sbjct: 143 GKHK 146


>gnl|CDD|226363 COG3844, COG3844, Kynureninase [Amino acid transport and
           metabolism].
          Length = 407

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 41/183 (22%), Positives = 74/183 (40%), Gaps = 15/183 (8%)

Query: 48  NEVAQLIYADPREIIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEH----KSVLDSI 103
           +++A LI A   E++ T   T S NL    AA    + G+ +I  + ++      + + +
Sbjct: 83  DKLAPLIGARAGEVVVTD--TTSINLFKVLAAALRPQEGRRVIVSEGDNFPTDLYIAEGL 140

Query: 104 RKLENDGFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQPILEIGELCR 163
             L   G+ +        G+I    LEE I  D  +V +  VN + G +  +  I  L  
Sbjct: 141 ADLLGIGYDLE-------GVIAPRALEEAITDDVAVVLLSHVNYKTGQLLDMRAITALAH 193

Query: 164 SKGVIFHCDAAQAVGKIDINIQNYKVDLMSFSAHKIY--GPKGIGALYIRRNPRTRIEAQ 221
             G +   D A + G + +++     D     ++K    GP     L++    R R    
Sbjct: 194 QHGALVGWDLAHSAGAVPVDLHAAGADFAIGCSYKYLNGGPGAPAGLFVAPRHRERSWPP 253

Query: 222 IHG 224
           + G
Sbjct: 254 LSG 256


>gnl|CDD|223513 COG0436, COG0436, Aspartate/tyrosine/aromatic aminotransferase
           [Amino acid transport and metabolism].
          Length = 393

 Score = 34.2 bits (79), Expect = 0.12
 Identities = 35/177 (19%), Positives = 57/177 (32%), Gaps = 23/177 (12%)

Query: 8   PIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQ-------LIYADPRE 60
           P    +++  I          A       +T    I + R  +A+       L      E
Sbjct: 41  PTPEHIIEAAIE---------ALEEGGTHYTPSAGIPELREAIAEKYKRRYGLDVDPEEE 91

Query: 61  IIWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTYLKPKN 120
           II T+GA E+  LA           G  ++     + S   +++        V   + +N
Sbjct: 92  IIVTAGAKEALFLAFL----ALLNPGDEVLIPDPGYPSYEAAVKLAGGKPVPVPLDEEEN 147

Query: 121 NGLITLSQLEEKICSDTILVSVMLVNNEIGVIQP---ILEIGELCRSKGVIFHCDAA 174
                L  LE  I   T  + +   NN  G +     +  I EL R   +I   D  
Sbjct: 148 GFKPDLEDLEAAITPKTKAIILNSPNNPTGAVYSKEELKAIVELAREHDIIIISDEI 204


>gnl|CDD|215836 pfam00282, Pyridoxal_deC, Pyridoxal-dependent decarboxylase
           conserved domain. 
          Length = 373

 Score = 32.7 bits (75), Expect = 0.27
 Identities = 43/178 (24%), Positives = 62/178 (34%), Gaps = 36/178 (20%)

Query: 62  IWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLD------------SIRKL-EN 108
           +   G++ESN LA+  A      R K I  ++   K  L             SI K    
Sbjct: 107 VLQGGSSESNLLALLAA------RTKWIRRMKAAGKPSLGKLVAYTSDQAHSSIEKAALI 160

Query: 109 DGFRVTYLKPKNNGLITLSQLEEKICSD-----TIL-VSVMLVNNEIGVIQPILEIGELC 162
            G  +  +    NG +    LE+ I  D         V   L     G   P+ E+G++C
Sbjct: 161 AGVELREIPTDENGKMRGMDLEKAIEEDKENGLIPFFVCATLGTTGSGAFDPLQELGDIC 220

Query: 163 RSKGVIFHCDAAQAVGKI--------DINIQNYKVDLMSFSAHKIYG-PKGIGALYIR 211
               +  H DAA A               I+    D  SF+ HK         AL++R
Sbjct: 221 NKYDLWLHVDAAYAGSAFICPEFRHWLFGIER--ADSFSFNPHKWMLVLLDCSALWVR 276


>gnl|CDD|224843 COG1932, SerC, Phosphoserine aminotransferase [Coenzyme metabolism
           / Amino acid transport and metabolism].
          Length = 365

 Score = 33.0 bits (76), Expect = 0.28
 Identities = 22/80 (27%), Positives = 37/80 (46%), Gaps = 9/80 (11%)

Query: 142 VMLVNNEI-GV-IQPILEIGELCRSKGVIFHCDAAQAVGKIDINIQNYKVDLMSFSAHKI 199
               N  I GV +  + +IG    S G++   DA+ A+    I++  Y  D++   A K 
Sbjct: 149 HFCWNETISGVEVPELPDIG----SDGLLV-ADASSAILSRPIDVSKY--DVIYAGAQKN 201

Query: 200 YGPKGIGALYIRRNPRTRIE 219
            GP G+  + +R +   R E
Sbjct: 202 LGPAGLTVVIVRPDLLERAE 221


>gnl|CDD|163511 TIGR03799, NOD_PanD_pyr, putative pyridoxal-dependent aspartate
           1-decarboxylase.  This enzyme is proposed here to be a
           form of aspartate 1-decarboxylase, pyridoxal-dependent,
           that represents a non-orthologous displacement to the
           more widely distributed pyruvoyl-dependent form
           (TIGR00223). Aspartate 1-decarboxylase makes
           beta-alanine, used usually in pathothenate biosynthesis,
           by decarboxylation from asparatate. A number of species
           with the PanB and PanC enzymes, however, lack PanD. This
           protein family occurs in a number of Proteobacteria that
           lack PanD. This enzyme family appears to be a
           pyridoxal-dependent enzyme (see pfam00282). The family
           was identified by Partial Phylogenetic Profiling;
           members in Geobacter sulfurreducens, G. metallireducens,
           and Pseudoalteromonas atlantica are clustered with the
           genes for PanB and PanC. We suggest the gene symbol panP
           (panthothenate biosynthesis enzyme, Pyridoxal-dependent)
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Pantothenate and coenzyme A].
          Length = 522

 Score = 32.7 bits (75), Expect = 0.33
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 17/107 (15%)

Query: 148 EIGVIQPILEIGELCRSKGVIFHCDAAQAVGKIDINIQNYK--------VDLMSFSAHK- 198
           E G I P+ E+ ++ +  G  FH DAA   G   +    Y+         D ++  AHK 
Sbjct: 277 ETGNIDPLDEMADIAQELGCHFHVDAAW--GGATLLSNTYRHLLKGIERADSVTIDAHKQ 334

Query: 199 IYGPKGIGALYIRRNPRTRIEAQIHGGGHEYGLRSGT--LATHQIVG 243
           +Y P G G + + ++P      + H    EY LR G+  L +H + G
Sbjct: 335 LYVPMGAG-MVLFKDPALMSAIEHHA---EYILRKGSKDLGSHTLEG 377


>gnl|CDD|99734 cd00609, AAT_like, Aspartate aminotransferase family. This family
           belongs to pyridoxal phosphate (PLP)-dependent aspartate
           aminotransferase superfamily (fold I). Pyridoxal
           phosphate combines with an alpha-amino acid to form a
           compound called a Schiff base or aldimine intermediate,
           which depending on the reaction, is the substrate in
           four kinds of reactions (1) transamination (movement of
           amino groups), (2) racemization (redistribution of
           enantiomers), (3) decarboxylation (removing COOH
           groups), and (4) various side-chain reactions depending
           on the enzyme involved. Pyridoxal phosphate (PLP)
           dependent enzymes were previously classified into alpha,
           beta and gamma classes, based on the chemical
           characteristics (carbon atom involved) of the reaction
           they catalyzed. The availability of several structures
           allowed a comprehensive analysis of  the evolutionary
           classification of PLP dependent enzymes, and it was
           found that the functional classification did not always
           agree with the evolutionary history of these enzymes.
           The major groups in this CD corresponds to Aspartate
           aminotransferase a, b and c, Tyrosine, Alanine,
           Aromatic-amino-acid, Glutamine phenylpyruvate,
           1-Aminocyclopropane-1-carboxylate synthase,
           Histidinol-phosphate, gene products of malY and cobC,
           Valine-pyruvate aminotransferase and Rhizopine
           catabolism regulatory protein.
          Length = 350

 Score = 32.3 bits (74), Expect = 0.41
 Identities = 34/178 (19%), Positives = 60/178 (33%), Gaps = 26/178 (14%)

Query: 8   PIDPRVVDKMIPYLRVQFGNPASRNHIYGWTAEKAIEKARNEVAQL------IYADPREI 61
           P  P V++ +            +    Y       + + R  +A+       +   P EI
Sbjct: 11  PPPPEVLEALA------AAALRAGLLGYYPDP--GLPELREAIAEWLGRRGGVDVPPEEI 62

Query: 62  IWTSGATESNNLAIKGAAHFYKKRGKHIITVQTEHKSVLDSIRKLENDGFRVTY--LKPK 119
           + T+GA E    A+          G  ++     +     + R     G  V    L  +
Sbjct: 63  VVTNGAQE----ALSLLLRALLNPGDEVLVPDPTYPGYEAAARLA---GAEVVPVPLDEE 115

Query: 120 NNGLITLSQLEEKICSDTILVSVMLVNNEIGVIQP---ILEIGELCRSKGVIFHCDAA 174
              L+ L  LE      T L+ +   NN  G +     + E+ EL +  G++   D A
Sbjct: 116 GGFLLDLELLEAAKTPKTKLLYLNNPNNPTGAVLSEEELEELAELAKKHGILIISDEA 173


>gnl|CDD|236469 PRK09331, PRK09331, Sep-tRNA:Cys-tRNA synthetase; Provisional.
          Length = 387

 Score = 31.8 bits (73), Expect = 0.63
 Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 3/52 (5%)

Query: 157 EIGELCRSKGVIFHCDAAQAVGKIDINIQNYKVDLMSFSAHKIY---GPKGI 205
           ++ ++    G+ F  + A  VG++ ++ +    D +  S HK      P G+
Sbjct: 179 KVAKVAHEYGIPFLLNGAYTVGRMPVDGKKLGADFIVGSGHKSMAASAPSGV 230


>gnl|CDD|147133 pfam04822, Takusan, Takusan.  This domain is named takusan, which
           is a Japanese word meaning 'many'. Members of this
           family regulate synaptic activity.
          Length = 84

 Score = 29.3 bits (66), Expect = 0.74
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 251 ARIEMKEELKRILFLQKRLYNGLMESIEEIYINGDMNNRIPHNLNI 296
             I+  EELK  L L  +  N L + I ++Y+NG +N R+     I
Sbjct: 16  KNIKELEELKLELRLITQERNELRD-ILDLYMNGSLNYRLNPEYEI 60


>gnl|CDD|223935 COG1003, GcvP, Glycine cleavage system protein P
           (pyridoxal-binding), C-terminal domain [Amino acid
           transport and metabolism].
          Length = 496

 Score = 31.5 bits (72), Expect = 0.88
 Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 110 GFRVTYLKPKNNGLITLSQLEEKICSDTILVSVMLVN-NEIGVIQP-ILEIGELCRSKGV 167
           GF+V  +K   NG + L  L  K   +    ++M+ N + +GV +  I EI E+    G 
Sbjct: 178 GFKVVVVKCDENGNVDLDDLRAKAEDNL--AALMITNPSTLGVFEEDIREICEIVHEAGG 235

Query: 168 IFHCDAA 174
             + D A
Sbjct: 236 QVYYDGA 242


>gnl|CDD|215475 PLN02880, PLN02880, tyrosine decarboxylase.
          Length = 490

 Score = 31.0 bits (70), Expect = 1.0
 Identities = 12/25 (48%), Positives = 19/25 (76%)

Query: 152 IQPILEIGELCRSKGVIFHCDAAQA 176
           + P+LE+G++ +S G+ FH DAA A
Sbjct: 255 VDPLLELGKIAKSNGMWFHVDAAYA 279


>gnl|CDD|220337 pfam09669, Phage_pRha, Phage regulatory protein Rha (Phage_pRha).
           Members of this protein family are found in temperate
           phage and bacterial prophage regions. Members include
           the product of the rha gene of the lambdoid phage
           phi-80, a late operon gene. The presence of this gene
           interferes with infection of bacterial strains that lack
           integration host factor (IHF), which regulates the rha
           gene. It is suggested that Rha is a phage regulatory
           protein.
          Length = 88

 Score = 27.9 bits (63), Expect = 2.5
 Identities = 7/26 (26%), Positives = 11/26 (42%)

Query: 95  EHKSVLDSIRKLENDGFRVTYLKPKN 120
            HK+VL  I  L  D   + +    +
Sbjct: 14  RHKNVLRDIENLLEDLKFLNFFIESS 39


>gnl|CDD|235650 PRK05939, PRK05939, hypothetical protein; Provisional.
          Length = 397

 Score = 29.7 bits (67), Expect = 2.8
 Identities = 21/92 (22%), Positives = 41/92 (44%), Gaps = 11/92 (11%)

Query: 84  KRGKHIITVQ---TEHKSVLDSIRKLENDGFRVTYLKPKNNGLITLSQLEEKICSDTILV 140
           + G H+++ Q       S+  ++R L   G  VT +   +     +  +   I  +T +V
Sbjct: 84  RAGDHLVSSQFLFGNTNSLFGTLRGL---GVEVTMVDATD-----VQNVAAAIRPNTRMV 135

Query: 141 SVMLVNNEIGVIQPILEIGELCRSKGVIFHCD 172
            V  + N    +  +  IG LCR +G+++  D
Sbjct: 136 FVETIANPGTQVADLAGIGALCRERGLLYVVD 167


>gnl|CDD|219376 pfam07332, DUF1469, Protein of unknown function (DUF1469).  This
           family consists of several hypothetical bacterial
           proteins of around 140 residues in length. The function
           of this family is unknown.
          Length = 121

 Score = 28.3 bits (64), Expect = 3.3
 Identities = 9/27 (33%), Positives = 13/27 (48%), Gaps = 1/27 (3%)

Query: 237 ATHQIVGMGEA-FRLARIEMKEELKRI 262
            T Q+  +      LA+ E+ EE KR 
Sbjct: 8   LTAQVSRLVRTEVELAKAELAEEAKRA 34


>gnl|CDD|187793 cd09662, Csm5_III-A, CRISPR/Cas system-associated RAMP superfamily
           protein Csm5.  CRISPR (Clustered Regularly Interspaced
           Short Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           RAMP superfamily protein.
          Length = 365

 Score = 29.4 bits (66), Expect = 3.8
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 1/47 (2%)

Query: 242 VGMGEAFRLARIEMKEELKRILFL-QKRLYNGLMESIEEIYINGDMN 287
           +G GE +      M  E  RI  +  ++LY  L+E   +   +GD N
Sbjct: 11  IGNGEEYTPVDYIMHVEGDRIYVIDFEKLYADLLEKKRKDEESGDNN 57


>gnl|CDD|132354 TIGR03311, Se_dep_Molyb_1, selenium-dependent molybdenum
           hydroxylase 1.  Members of this protein family show full
           length homology to the molybdenum-containing aldehyde
           oxido-reductase of Desulfovibrio gigas. Members,
           however, are found only within species that have, and
           near those genes that encode, a set of predicted
           accessory proteins for selenium-dependent molybdenum
           hydroxylases. The best known examples of such enzymes
           are forms of xanthine dehydrogenase and purine
           hydroxylase; this family appears to be another such
           enzyme.
          Length = 848

 Score = 29.4 bits (66), Expect = 4.5
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 3/60 (5%)

Query: 235 TLATHQIVGMGEAFRLARIEMKEEL--KRILFLQKRLYNGLMESIEEIYINGDMNNRIPH 292
           T A+ Q +  GEA R A  ++KE L  + +  L+   + G   SI +  +N D  N + H
Sbjct: 654 TTASRQSLFTGEATRRAAAKLKEALRGRTLAELEGEEFYGEYHSITD-PLNSDKENPVSH 712


>gnl|CDD|130881 TIGR01822, 2am3keto_CoA, 2-amino-3-ketobutyrate coenzyme A ligase. 
           This model represents a narrowly defined clade of animal
           and bacterial (almost exclusively Proteobacterial)
           2-amino-3-ketobutyrate--CoA ligase. This enzyme can act
           in threonine catabolism. The closest homolog from
           Bacillus subtilis, and sequences like it, may be
           functionally equivalent but were not included in the
           model because of difficulty in finding reports of
           function [Energy metabolism, Amino acids and amines].
          Length = 393

 Score = 29.0 bits (65), Expect = 4.7
 Identities = 25/93 (26%), Positives = 41/93 (44%), Gaps = 8/93 (8%)

Query: 89  IITVQTEHKSVLDSIRKLENDGFRVTYLKPKNNGLITL-SQLEE--KICSDTILVSVMLV 145
           II+    H S++D +R  +   +R       NN +  L +QL+E     +   L++   V
Sbjct: 125 IISDALNHASIIDGVRLCKAKRYRY-----ANNDMADLEAQLKEARAAGARHRLIATDGV 179

Query: 146 NNEIGVIQPILEIGELCRSKGVIFHCDAAQAVG 178
            +  GVI P+ EI +L      +   D   A G
Sbjct: 180 FSMDGVIAPLDEICDLADKYDALVMVDECHATG 212


>gnl|CDD|227174 COG4837, COG4837, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 106

 Score = 27.1 bits (60), Expect = 7.0
 Identities = 16/47 (34%), Positives = 22/47 (46%), Gaps = 1/47 (2%)

Query: 104 RKLENDGFRVTYLKPKNNGLIT-LSQLEEKICSDTILVSVMLVNNEI 149
           RK  N  F+ TY+   N  L     Q  EKI  D     +++V +EI
Sbjct: 38  RKYPNQPFKYTYIDITNPPLEDHDLQFAEKIEQDEYFYPLIVVEDEI 84


>gnl|CDD|221634 pfam12548, DUF3740, Sulfatase protein.  This domain family is found
           in eukaryotes, and is typically between 144 and 173
           amino acids in length. The family is found in
           association with pfam00884.
          Length = 144

 Score = 27.3 bits (61), Expect = 7.6
 Identities = 9/18 (50%), Positives = 13/18 (72%)

Query: 359 IDFTIKLLKMKVKKLREL 376
           ID  I+ LK K+K L+E+
Sbjct: 110 IDMEIETLKDKIKNLKEV 127


>gnl|CDD|200535 cd11274, Sema_plexin_A4, The Sema domain, a protein interacting
           module, of Plexin A4.  Plexin A4 forms a receptor
           complex with neuropilins (NRPs) and transduces signals
           for class 3 semaphorins in the nervous system. It
           regulates facial nerve development by functioning as a
           receptor for Sema3A/NRP1. Both plexins A3 and A4 are
           essential for normal sympathetic development. They
           function both cooperatively, to regulate the migration
           of sympathetic neurons, and differentially, to guide
           sympathetic axons. Plexin A4 is also expressed in
           lymphoid tissues and functions in the immune system. It
           negatively regulates T lymphocyte responses. The Sema
           domain is located at the N-terminus and contains four
           disulfide bonds formed by eight conserved cysteine
           residues. It serves as a ligand-recognition and -binding
           module.
          Length = 473

 Score = 28.4 bits (63), Expect = 8.1
 Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 7/56 (12%)

Query: 266 QKRLYNGLMESIEEIYINGDMNNRIPHNLNISFNFIEGESLL----MAIKDIAVSS 317
           QKR    L ES   I++  ++N+RI   L   +    GE  L    + +KDI  SS
Sbjct: 305 QKRKMKSLDESALCIFVLKEINDRIKDRLQSCY---RGEGTLDLAWLKVKDIPCSS 357


>gnl|CDD|140343 PTZ00322, PTZ00322,
           6-phosphofructo-2-kinase/fructose-2,6-biphosphatase;
           Provisional.
          Length = 664

 Score = 28.3 bits (63), Expect = 9.5
 Identities = 10/31 (32%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 7   TPIDPRVVDKMIPYLRVQFGNPASRNHIYGW 37
             ++P V D  + Y+R++     S N+I GW
Sbjct: 374 KSLNP-VTDCDLTYIRIEDTQTFSLNNISGW 403


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.137    0.397 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,527,134
Number of extensions: 2037191
Number of successful extensions: 2011
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1939
Number of HSP's successfully gapped: 75
Length of query: 397
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 298
Effective length of database: 6,546,556
Effective search space: 1950873688
Effective search space used: 1950873688
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.9 bits)