RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7357
         (302 letters)



>gnl|CDD|235432 PRK05367, PRK05367, glycine dehydrogenase; Provisional.
          Length = 954

 Score =  600 bits (1550), Expect = 0.0
 Identities = 196/327 (59%), Positives = 238/327 (72%), Gaps = 26/327 (7%)

Query: 1   MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
           MKLNA  EMIP ++P+  N+HPF P +QA GY +LI +LE  L EITGYD +S QPN+GA
Sbjct: 509 MKLNAAAEMIPITWPEFANLHPFAPAEQAAGYRELIDQLEAWLAEITGYDAVSLQPNAGA 568

Query: 61  QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
           QGEYAGL AI+ YH+++   HR+VCLIP SAHGTNPASA MAGM V  V+  ++G ID  
Sbjct: 569 QGEYAGLLAIRAYHESRGEGHRDVCLIPSSAHGTNPASAVMAGMKVVVVACDENGNIDLD 628

Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
           DL  K +++ + L+ +MITYPST GVFEE I ++CE++HEHGGQVYLDGANMNAQVGL R
Sbjct: 629 DLRAKAEEHADNLAAIMITYPSTHGVFEETIREICEIVHEHGGQVYLDGANMNAQVGLAR 688

Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSI--DSSIGAVSA 238
           PGD G+DVSHLNLHKTFCIPHGGGGPG+GPIGVK+HLAPFLP HP+     ++ IGAVSA
Sbjct: 689 PGDIGADVSHLNLHKTFCIPHGGGGPGVGPIGVKAHLAPFLPGHPVQIAGGETGIGAVSA 748

Query: 239 AHYGSASILPISWAYIR-----------------------RLESHYKTLFRSSRSGLVAH 275
           A +GSASILPISW YIR                       RL+ HY  L+    +G VAH
Sbjct: 749 APFGSASILPISWMYIRMMGAEGLRQATEVAILNANYIAKRLKDHYPVLYT-GANGRVAH 807

Query: 276 EFVIDVRDFKKSANIEAVDIAKRLMDY 302
           E ++D+R  K+S  I   DIAKRL+DY
Sbjct: 808 ECILDLRPLKESTGITVDDIAKRLIDY 834


>gnl|CDD|223935 COG1003, GcvP, Glycine cleavage system protein P
           (pyridoxal-binding), C-terminal domain [Amino acid
           transport and metabolism].
          Length = 496

 Score =  484 bits (1247), Expect = e-171
 Identities = 178/334 (53%), Positives = 218/334 (65%), Gaps = 37/334 (11%)

Query: 1   MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
           MKLN   EM P ++P+  N+HPF P++Q +GY +LI EL+  L EITG D +S QPN+GA
Sbjct: 76  MKLNPKAEMKPATWPEFANIHPFQPEEQVQGYLELIYELQEWLKEITGMDAVSLQPNAGA 135

Query: 61  QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
           QGEYAGL AI+ YH+++   HRN+CLIP SAHGTNPASA MAG  V  V   ++G +D  
Sbjct: 136 QGEYAGLLAIRAYHESRGEGHRNICLIPDSAHGTNPASAAMAGFKVVVVKCDENGNVDLD 195

Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
           DL  K + N   L+ LMIT PST GVFEE+I ++CE++HE GGQVY DGAN+NA VGL R
Sbjct: 196 DLRAKAEDN---LAALMITNPSTLGVFEEDIREICEIVHEAGGQVYYDGANLNAIVGLAR 252

Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPV---------HPLSSIDS 231
           PGD G DV HLNLHKTFCIPHGGGGPG GP+GVK+HLAPFLP          + L     
Sbjct: 253 PGDMGFDVVHLNLHKTFCIPHGGGGPGAGPVGVKAHLAPFLPGPVVYHDVGEYRLDYDGK 312

Query: 232 SIGAVSAAHYGSASILPISWAYIR-----------------------RLESHYKTLFRSS 268
               VSAA YGSASILPI+WAYIR                       RL+ +Y   +   
Sbjct: 313 KSIGVSAAPYGSASILPIAWAYIRMMGADGLKQASEVAVLNANYIARRLKGYYPVPYTGE 372

Query: 269 RSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
               VAHE ++D R  KK   + A+D+AKRL+DY
Sbjct: 373 N--RVAHECILDARPLKKETGVRALDVAKRLLDY 404


>gnl|CDD|178035 PLN02414, PLN02414, glycine dehydrogenase (decarboxylating).
          Length = 993

 Score =  481 bits (1240), Expect = e-164
 Identities = 185/333 (55%), Positives = 230/333 (69%), Gaps = 32/333 (9%)

Query: 1   MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
           MKLNATTEM+P ++P+  N+HPF P DQA+GY+++  +L   LCEITG+D  S QPN+GA
Sbjct: 535 MKLNATTEMMPVTWPEFANIHPFAPVDQAQGYQEMFEDLGDLLCEITGFDSFSLQPNAGA 594

Query: 61  QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
            GEYAGL  I+ YH ++  HHRNVC+IPVSAHGTNPASA M GM +  V     G I+  
Sbjct: 595 AGEYAGLMVIRAYHLSRGDHHRNVCIIPVSAHGTNPASAAMCGMKIVVVGTDAKGNINIE 654

Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
           +L    + +K+ L+ LM+TYPST GV+EE I ++C++IH++GGQVY+DGANMNAQVGL  
Sbjct: 655 ELRKAAEAHKDNLAALMVTYPSTHGVYEEGIDEICDIIHDNGGQVYMDGANMNAQVGLTS 714

Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSIDS--------S 232
           PG  G+DV HLNLHKTFCIPHGGGGPGMGPIGVK HLAPFLP HP+              
Sbjct: 715 PGFIGADVCHLNLHKTFCIPHGGGGPGMGPIGVKKHLAPFLPSHPVVPTGGIPRPEKTQP 774

Query: 233 IGAVSAAHYGSASILPISWAYI-----------------------RRLESHYKTLFRSSR 269
           +G +SAA +GSA ILPIS+ YI                       +RLE HY  LFR  +
Sbjct: 775 LGTISAAPWGSALILPISYTYIAMMGSEGLTDASKIAILNANYMAKRLEGHYPVLFR-GK 833

Query: 270 SGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
           +G  AHEF+ID+R FK +A IE  D+AKRLMDY
Sbjct: 834 NGTCAHEFIIDLRPFKNTAGIEPEDVAKRLMDY 866


>gnl|CDD|129553 TIGR00461, gcvP, glycine dehydrogenase (decarboxylating).  This
           apparently ubiquitous enzyme is found in bacterial,
           mammalian and plant sources. The enzyme catalyzes the
           reaction: GLYCINE + LIPOYLPROTEIN =
           S-AMINOMETHYL-DIHYDROLIPOYLPROTEIN + CO2. It is part of
           the glycine decarboxylase multienzyme complex (GDC)
           consisting of four proteins P, H, L and T. Active site
           in E.coli is located as the (K) residues at position 713
           of the SEED alignment [Energy metabolism, Amino acids
           and amines].
          Length = 939

 Score =  470 bits (1210), Expect = e-160
 Identities = 193/331 (58%), Positives = 234/331 (70%), Gaps = 30/331 (9%)

Query: 1   MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
           MKLNAT EM+P ++P+ +N+HPF P +Q  GY++LI +LE  LC ITG+D IS QPNSGA
Sbjct: 497 MKLNATAEMMPITWPEFSNIHPFQPSNQVEGYQELIAQLEKWLCSITGFDAISLQPNSGA 556

Query: 61  QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
           QGEYAGLR I+ YH+++  +HRN+CLIPVSAHGTNPASA MAGM V PV+  +DG ID  
Sbjct: 557 QGEYAGLRVIRSYHESRGENHRNICLIPVSAHGTNPASAAMAGMQVVPVNCDQDGNIDLV 616

Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
           DL+ K +++ + L+ +M+TYPST GVFE  I   C+++H  GGQVYLDGANMNAQVGL  
Sbjct: 617 DLKNKAEQHGDELAAVMVTYPSTHGVFEPTIQHACDIVHSFGGQVYLDGANMNAQVGLTS 676

Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSI------DSSIG 234
           PGD G+DV HLNLHKTFCIPHGGGGPGMGPIGVKSHL PFLP H + S+        SIG
Sbjct: 677 PGDLGADVCHLNLHKTFCIPHGGGGPGMGPIGVKSHLIPFLPKHDVVSMITGIGGSKSIG 736

Query: 235 AVSAAHYGSASILPISWAYIR-----------------------RLESHYKTLFRSSRSG 271
           +VSAA YGSASILPISW YI+                       RL+ HY  LF      
Sbjct: 737 SVSAAPYGSASILPISWMYIKMMGNEGLPKASVVAILNANYMATRLKDHYPILFV-GTLK 795

Query: 272 LVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
            VAHE ++D+R  K    IEA+D+AKRL DY
Sbjct: 796 HVAHECILDLRPLKAKTGIEAIDVAKRLQDY 826


>gnl|CDD|171585 PRK12566, PRK12566, glycine dehydrogenase; Provisional.
          Length = 954

 Score =  425 bits (1093), Expect = e-142
 Identities = 187/327 (57%), Positives = 229/327 (70%), Gaps = 26/327 (7%)

Query: 1   MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
           MKLNAT+EMIP ++P+   +HPF P++QA GY  +I ELE  LC ITG+D I  QPNSGA
Sbjct: 510 MKLNATSEMIPITWPEFAQLHPFAPREQAEGYRAMIDELEAWLCAITGFDAICMQPNSGA 569

Query: 61  QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
           QGEYAGL AI+ YH+++    R++CLIP SAHGTNPASAQMAGM V  V    DG +D  
Sbjct: 570 QGEYAGLLAIRRYHRSRGQSQRDICLIPSSAHGTNPASAQMAGMRVVIVECDPDGNVDLD 629

Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
           DL+ K     + LSCLMITYPST GV+EE I ++CE++H+HGGQVY+DGAN+NAQVGL R
Sbjct: 630 DLKAKAAAAGDRLSCLMITYPSTHGVYEEGIREICEVVHQHGGQVYMDGANLNAQVGLAR 689

Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFL---PVHPLSSIDSSIGAVS 237
           P D G+DVSH+NLHKTFCIPHGGGGPGMGPIGV++HLAPF+   PV P+   D + GAVS
Sbjct: 690 PADIGADVSHMNLHKTFCIPHGGGGPGMGPIGVRAHLAPFVANHPVVPVEGPDPNNGAVS 749

Query: 238 AAHYGSASILPISWAYI----------------------RRLESHYKTLFRSSRSGLVAH 275
           AA +GSASILPISW YI                       +L   +  L+R  R+  VAH
Sbjct: 750 AAPWGSASILPISWMYIAMMGPQLADASEVAILSANYLANQLGGAFPVLYR-GRNERVAH 808

Query: 276 EFVIDVRDFKKSANIEAVDIAKRLMDY 302
           E ++D+R  K    I   D+AKRLMDY
Sbjct: 809 ECILDLRPLKAQTGISEEDVAKRLMDY 835


>gnl|CDD|235292 PRK04366, PRK04366, glycine dehydrogenase subunit 2; Validated.
          Length = 481

 Score =  325 bits (835), Expect = e-109
 Identities = 137/337 (40%), Positives = 185/337 (54%), Gaps = 46/337 (13%)

Query: 1   MKLN--ATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNS 58
           MK N     ++     P    +HP  P++  +G  +L+ EL+  L EITG D ++ QP +
Sbjct: 80  MKYNPKINEKV--ARLPGFAELHPLQPEETVQGALELMYELQEWLKEITGMDAVTLQPAA 137

Query: 59  GAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTID 118
           GA GE  GL  I+ YH+A+    R   ++P SAHGTNPASA MAG  V  +   +DG +D
Sbjct: 138 GAHGELTGLLMIRAYHEARGDTKRTEVIVPDSAHGTNPASAAMAGFKVVEIPSNEDGLVD 197

Query: 119 FSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGL 178
              L+  V   ++T + LM+T P+T G+FE NI ++ E++HE GG +Y DGAN+NA +G 
Sbjct: 198 LEALKAAV--GEDT-AALMLTNPNTLGLFERNILEIAEIVHEAGGLLYYDGANLNAILGK 254

Query: 179 CRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPV----------HPLSS 228
            RPGD G DV HLNLHKTF  PHGGGGPG GP+GVK  LAPFLPV               
Sbjct: 255 ARPGDMGFDVVHLNLHKTFSTPHGGGGPGSGPVGVKEELAPFLPVPVVEKDGDRYRLDYD 314

Query: 229 IDSSIGAVSAAHYGSASILPISWAYIR-----------------------RLESHYKTLF 265
              SIG V A  YG+  +L  ++AYIR                       RL+  Y   +
Sbjct: 315 RPKSIGRVRA-FYGNFGVLVRAYAYIRSLGAEGLREVSEDAVLNANYLKARLKDIYDLPY 373

Query: 266 RSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
                    HEFV+  +  K++  +  +DIAKRL+DY
Sbjct: 374 DRPCM----HEFVLSGKKLKET-GVRTLDIAKRLLDY 405


>gnl|CDD|99737 cd00613, GDC-P, Glycine cleavage system P-protein, alpha- and
           beta-subunits. This family consists of Glycine cleavage
           system P-proteins EC:1.4.4.2 from bacterial, mammalian
           and plant sources. The P protein is part of the glycine
           decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also
           annotated as glycine cleavage system or glycine
           synthase. GDC consists of four proteins P, H, L and T.
           The reaction catalysed by this protein is: Glycine +
           lipoylprotein <=> S-aminomethyldihydrolipoylprotein +
           CO2. Alpha-beta-type dimers associate to form an
           alpha(2)beta(2) tetramer, where the alpha- and
           beta-subunits are structurally similar and appear to
           have arisen by gene duplication and subsequent
           divergence with a loss of one active site. The members
           of this CD are widely dispersed among all three forms of
           cellular life.
          Length = 398

 Score =  314 bits (808), Expect = e-106
 Identities = 110/351 (31%), Positives = 155/351 (44%), Gaps = 66/351 (18%)

Query: 1   MKLNATTEMIPCSFP-QLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDK--ISFQPN 57
            K N    +       +    +     + ++G  Q + EL+T LCE+TG D    S Q  
Sbjct: 30  YKHNPPAVIKRNILENEFYTAYTPYQPEISQGRLQALFELQTMLCELTGMDVANASLQDE 89

Query: 58  SGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMA----GMSVEPVSVRK 113
           + A  E AGL AI+ YH       RN  L+P SAH TNPA A+      G+ V  V   +
Sbjct: 90  ATAAAEAAGLAAIRAYH------KRNKVLVPDSAHPTNPAVARTRGEPLGIEVVEVPSDE 143

Query: 114 DGTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMN 173
            GT+D   L+ +V +    ++ LM+ YP+T GVFE+ I ++ ++ H  G  VY+DG N+N
Sbjct: 144 GGTVDLEALKEEVSEE---VAALMVQYPNTLGVFEDLIKEIADIAHSAGALVYVDGDNLN 200

Query: 174 AQVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSIDSSI 233
              GL  PG+YG+D+   NL KT  +PHGGGGPG G   VK  L  FLP   +     + 
Sbjct: 201 LT-GLKPPGEYGADIVVGNLQKTG-VPHGGGGPGAGFFAVKKELVRFLPGRLVGVTKDAE 258

Query: 234 G-------------------AVSAAHYGSASILPISWAYIR------------------- 255
           G                   A S    G A +  ++  YI                    
Sbjct: 259 GNRAFRLALQTREQHIRREKATSNICTGQALLALMAAMYIVYLGPEGLKEIAERAHLNAN 318

Query: 256 ----RLESHYKTLFRSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
               RL+     L     +G   HEFV+ +        I A D+AK L+D 
Sbjct: 319 YLAKRLKEVGGVLP---FNGPFFHEFVLRLPPL---YGIRAEDLAKALIDG 363


>gnl|CDD|99742 cd01494, AAT_I, Aspartate aminotransferase (AAT) superfamily (fold
           type I) of pyridoxal phosphate (PLP)-dependent enzymes.
           PLP combines with an alpha-amino acid to form a compound
           called a Schiff base or aldimine intermediate, which
           depending on the reaction, is the substrate in four
           kinds of reactions (1) transamination (movement of amino
           groups), (2) racemization (redistribution of
           enantiomers), (3) decarboxylation (removing COOH
           groups), and (4) various side-chain reactions depending
           on the enzyme involved. Pyridoxal phosphate (PLP)
           dependent enzymes were previously classified into alpha,
           beta and gamma classes, based on the chemical
           characteristics (carbon atom involved) of the reaction
           they catalyzed. The availability of several structures
           allowed a comprehensive analysis of  the evolutionary
           classification of PLP dependent enzymes, and it was
           found that the functional classification did not always
           agree with the evolutionary history of these enzymes.
           Structure and sequence analysis has revealed that the
           PLP dependent enzymes can be classified into four major
           groups of different evolutionary origin: aspartate
           aminotransferase superfamily (fold type I), tryptophan
           synthase beta superfamily (fold type II), alanine
           racemase superfamily (fold type III), and D-amino acid
           superfamily (fold type IV) and Glycogen phophorylase
           family (fold type V).
          Length = 170

 Score = 56.2 bits (136), Expect = 1e-09
 Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 26/185 (14%)

Query: 38  ELETDLCEIT--GYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTN 95
           ELE  L  +   G DK  F P SG     A L A+               ++  + HG+ 
Sbjct: 4   ELEEKLARLLQPGNDKAVFVP-SGTGANEAALLALLGPGD--------EVIVDANGHGSR 54

Query: 96  PASAQMAGMSVEPVSVRKD----GTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENI 151
              A       +PV V  D    G +D + LE    K K  ++ ++IT  +T G     +
Sbjct: 55  YWVAAELAG-AKPVPVPVDDAGYGGLDVAILEEL--KAKPNVALIVITPNTTSGGVLVPL 111

Query: 152 TDVCELIHEHGGQVYLDGANMNAQVG--LCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMG 209
            ++ ++  E+G  + +D A+             + G+DV   +LHK        GG G G
Sbjct: 112 KEIRKIAKEYGILLLVDAASAGGASPAPGVLIPEGGADVVTFSLHKNL------GGEGGG 165

Query: 210 PIGVK 214
            + VK
Sbjct: 166 VVIVK 170


>gnl|CDD|223480 COG0403, GcvP, Glycine cleavage system protein P
           (pyridoxal-binding), N-terminal domain [Amino acid
           transport and metabolism].
          Length = 450

 Score = 50.3 bits (121), Expect = 5e-07
 Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 2/94 (2%)

Query: 116 TIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQ 175
            +D  DL+     +   +  +++ YP+TFG+ EE++  + E  H  G  V +  A+  A 
Sbjct: 194 VVDADDLDDLESADDGDVFGVLVQYPNTFGIVEEDLRALIEAAHSAGALVIV-AADPLAL 252

Query: 176 VGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMG 209
             L  PG++G+D+   +  + F +P G GGP  G
Sbjct: 253 GLLKPPGEFGADIVVGSAQR-FGVPMGFGGPHAG 285


>gnl|CDD|216367 pfam01212, Beta_elim_lyase, Beta-eliminating lyase. 
          Length = 288

 Score = 48.8 bits (117), Expect = 1e-06
 Identities = 44/196 (22%), Positives = 75/196 (38%), Gaps = 23/196 (11%)

Query: 27  DQARGYEQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRA-IQCYHQAQDAHHRNVC 85
           D+  G +  +  LE  + E+ G +   F P SG       L A  +   +       ++ 
Sbjct: 25  DEVYGGDPTVNRLEDRVAELFGKEAALFVP-SGTMANQLALMAHCRRGDEVICGEPAHIY 83

Query: 86  LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVK------KNKETLSCLMIT 139
                   T    A++ G    P+   + G +D  DLE  ++           L  L  T
Sbjct: 84  F-----DETG-GHAELGGAQPVPLPGAEAGKLDLEDLEAAIRPVGDIHFPPTGLISLENT 137

Query: 140 YPSTFG--VFEENITDVCELIHEHGGQVYLDGANM-NAQVGL-CRPGDYGSDVSHLNLHK 195
           + S  G  V  E + ++  +  EHG  ++LDGA + NA V L     +  S    +    
Sbjct: 138 HNSAGGQVVSLEELREIRAIAREHGIPLHLDGARLANAAVALGVIVKEITSYADSV---- 193

Query: 196 TFCIPHGGGGPGMGPI 211
           +  +  G G P +G +
Sbjct: 194 SMSLSKGLGAP-VGSV 208


>gnl|CDD|234769 PRK00451, PRK00451, glycine dehydrogenase subunit 1; Validated.
          Length = 447

 Score = 47.4 bits (114), Expect = 4e-06
 Identities = 31/122 (25%), Positives = 54/122 (44%), Gaps = 25/122 (20%)

Query: 102 AGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEH 161
            G+ V  V   +DG  D   LE  V  + +T + +++ YP+ FGV E+ + ++ E+ H  
Sbjct: 178 QGIEVVEVPY-EDGVTDLEALEAAV--DDDT-AAVVVQYPNFFGVIED-LEEIAEIAHAG 232

Query: 162 GGQVYLDGANMNAQVGLCRPGDYGSDV---------SHLNLHKTFCIPHGGGGPGMGPIG 212
           G    + G +  +   L  PG+YG+D+           L+           GGP +G   
Sbjct: 233 GALFIV-GVDPVSLGLLKPPGEYGADIVVGEGQPLGIPLSF----------GGPYLGFFA 281

Query: 213 VK 214
            +
Sbjct: 282 TR 283


>gnl|CDD|224919 COG2008, GLY1, Threonine aldolase [Amino acid transport and
           metabolism].
          Length = 342

 Score = 47.3 bits (113), Expect = 5e-06
 Identities = 41/198 (20%), Positives = 67/198 (33%), Gaps = 28/198 (14%)

Query: 24  IPQDQARGYE--QLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAI----QCYHQAQ 77
                   Y        LE  + E+ G +   F P SG Q     L A     +     +
Sbjct: 23  ANAVGDDVYGEDPTTNALEQRIAELFGKEAALFVP-SGTQANQLALAAHCQPGESVICHE 81

Query: 78  DAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNK----ETL 133
            AH        +       A     G    P+    DG +   D+E  ++ +      T 
Sbjct: 82  TAH--------IYTDECG-APEFFGGGQKLPIVPGADGKLTPEDVEAAIRPDDIHHAPTP 132

Query: 134 SCLMITYPSTFGVF--EENITDVCELIHEHGGQVYLDGANM-NAQVGLCRP-GDYGSDVS 189
             ++    +  G     + +  +  +  EHG  +++DGA + NA V L        S V 
Sbjct: 133 LAVLENTATEGGTVYPLDELEAISAVCKEHGLPLHMDGARLANALVALGVALKTIKSYVD 192

Query: 190 HLNLHKTFCIPHGGGGPG 207
            +    +FC+  GGG P 
Sbjct: 193 SV----SFCLTKGGGAPV 206


>gnl|CDD|223594 COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 405

 Score = 45.8 bits (109), Expect = 2e-05
 Identities = 28/117 (23%), Positives = 45/117 (38%), Gaps = 8/117 (6%)

Query: 92  HGTNPASAQMA----GMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTFGVF 147
           H +N    Q      G  V  + +  DG +D   LE  +    + ++   ++  +  G  
Sbjct: 121 HHSNIVPWQELAKRTGAKVRVIPLDDDGLLDLDALEKLITPKTKLVALSHVS--NVTGT- 177

Query: 148 EENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTFCIPHGGG 204
              + ++ EL HEHG  V +D A   A        + G D    + HK    P G G
Sbjct: 178 VNPVKEIAELAHEHGALVLVDAAQ-AAGHLPIDVQELGCDFLAFSGHKWLLGPTGIG 233


>gnl|CDD|99748 cd06502, TA_like, Low-specificity threonine aldolase (TA). This
           family belongs to pyridoxal phosphate (PLP)-dependent
           aspartate aminotransferase superfamily (fold I).  TA
           catalyzes the conversion of L-threonine or
           L-allo-threonine to glycine and acetaldehyde in a
           secondary glycine biosynthetic pathway.
          Length = 338

 Score = 40.8 bits (96), Expect = 5e-04
 Identities = 45/193 (23%), Positives = 71/193 (36%), Gaps = 27/193 (13%)

Query: 27  DQARGYEQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAI-QCYHQA---QDAHHR 82
           D   G +    +LE    E+ G +   F P SG       L A  Q        + AH  
Sbjct: 25  DDVYGEDPTTAKLEARAAELFGKEAALFVP-SGTAANQLALAAHTQPGGSVICHETAH-- 81

Query: 83  NVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKN------KETLSCL 136
                 +       A   ++G+ + PV   ++G +   DLE  ++          +L  L
Sbjct: 82  ------IYTDEAG-APEFLSGVKLLPVP-GENGKLTPEDLEAAIRPRDDIHFPPPSLVSL 133

Query: 137 MITYPSTFGVFEENITDVCELIHEHGGQVYLDGANM-NAQVGL-CRPGDYGSDVSHLNLH 194
             T         + +  +  L  E+G  ++LDGA + NA   L      Y S V  +   
Sbjct: 134 ENTTEGGTVYPLDELKAISALAKENGLPLHLDGARLANAAAALGVALKTYKSGVDSV--- 190

Query: 195 KTFCIPHGGGGPG 207
            +FC+  GGG P 
Sbjct: 191 -SFCLSKGGGAPV 202


>gnl|CDD|216983 pfam02347, GDC-P, Glycine cleavage system P-protein.  This family
           consists of Glycine cleavage system P-proteins
           EC:1.4.4.2 from bacterial, mammalian and plant sources.
           The P protein is part of the glycine decarboxylase
           multienzyme complex EC:2.1.2.10 (GDC) also annotated as
           glycine cleavage system or glycine synthase. GDC
           consists of four proteins P, H, L and T. The reaction
           catalyzed by this protein is:- Glycine + lipoylprotein
           <=> S-aminomethyldihydrolipoylprotein + CO2.
          Length = 429

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 136 LMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHK 195
           +++ YP+T G  E+   ++ E+ H+HG  V +  A++ A   L  PG++G+D+  +   +
Sbjct: 201 VLVQYPNTEGRIED-YKELIEIAHQHGSLVVV-AADLLALTILKPPGEFGADIV-VGSAQ 257

Query: 196 TFCIPHGGGGPGMGPIGVKSHLAPFLP 222
            F +P G GGP  G   VK  L   +P
Sbjct: 258 RFGVPMGYGGPHAGFFAVKEELKRKMP 284


>gnl|CDD|234361 TIGR03812, tyr_de_CO2_Arch, tyrosine decarboxylase MnfA.  Members
           of this protein family are the archaeal form, MnfA, of
           tyrosine decarboxylase, and are involved in methanofuran
           biosynthesis. Members show clear homology to the
           Enterococcus form, Tdc, that is involved in tyrosine
           decarboxylation for resistance to acidic conditions
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Other].
          Length = 373

 Score = 35.0 bits (81), Expect = 0.035
 Identities = 26/110 (23%), Positives = 54/110 (49%), Gaps = 4/110 (3%)

Query: 62  GEYAGLRAIQCYHQAQDAHHRNVCLI-PVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
           G  A ++A++          R   +I P SAH +   +A+M G+ +    + +D T+D  
Sbjct: 85  GTEANIQAVRAAKNLAREEKRTPNIIVPESAHFSFEKAAEMLGLELRYAPLDEDYTVDVK 144

Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGA 170
           D+E  +  N  T+  + I   +  G   ++I ++ ++  E+G  +++D A
Sbjct: 145 DVEDLIDDN--TIGIVGIAGTTELGQI-DDIEELSKIALENGIYLHVDAA 191


>gnl|CDD|180392 PRK06084, PRK06084, O-acetylhomoserine
           aminocarboxypropyltransferase; Validated.
          Length = 425

 Score = 34.5 bits (79), Expect = 0.053
 Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 27/147 (18%)

Query: 118 DFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDV---CELIHEHGGQVYLDGANMNA 174
           D + LE  + +  + + C  I  P+       NI D+    +  H HG  + +D  N  A
Sbjct: 132 DIAALEALIDERTKAVFCESIGNPAG------NIIDIQALADAAHRHGVPLIVD--NTVA 183

Query: 175 QVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHP----LSSID 230
              LCRP ++G+D+   +L K      GG G  +G I V S   P+         L++ D
Sbjct: 184 TPVLCRPFEHGADIVVHSLTKYI----GGHGTSIGGIVVDSGKFPWAEHKERFALLNTPD 239

Query: 231 SSI-GAVSAAHYGSASILPISWAYIRR 256
            S  G      +G A       A+I R
Sbjct: 240 PSYHGVTYTEAFGPA-------AFIGR 259


>gnl|CDD|131034 TIGR01979, sufS, cysteine desulfurases, SufSfamily.  This model
           represents a subfamily of NifS-related cysteine
           desulfurases involved in FeS cluster formation needed
           for nitrogen fixation among other vital functions. Many
           cysteine desulfurases are also active as selenocysteine
           lyase and/or cysteine sulfinate desulfinase. This
           subfamily is associated with the six-gene SUF system
           described in E. coli and Erwinia as an FeS cluster
           formation system during oxidative stress. The active
           site Cys is this subfamily resembles GHHC with one or
           both His conserved [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Other].
          Length = 403

 Score = 34.2 bits (79), Expect = 0.067
 Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 7/89 (7%)

Query: 86  LIPVSAHGTNPASAQMA----GMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYP 141
           +I    H  N    Q+     G +++ + +  DGT+D  DLE  + +  + ++   IT+ 
Sbjct: 112 VISEMEHHANIVPWQLLAERTGATLKFIPLDDDGTLDLDDLEKLLTEKTKLVA---ITHV 168

Query: 142 STFGVFEENITDVCELIHEHGGQVYLDGA 170
           S        + ++ +L H+ G +V +DGA
Sbjct: 169 SNVLGTVNPVEEIAKLAHQVGAKVLVDGA 197


>gnl|CDD|223190 COG0112, GlyA, Glycine/serine hydroxymethyltransferase [Amino acid
           transport and metabolism].
          Length = 413

 Score = 34.1 bits (79), Expect = 0.070
 Identities = 43/181 (23%), Positives = 75/181 (41%), Gaps = 27/181 (14%)

Query: 33  EQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSA- 91
           E+L  E      ++ G +  + QP+SG+Q   A   A+      Q      +  + +S  
Sbjct: 74  EELAIER---AKKLFGAEYANVQPHSGSQANQAVYLAL-----LQ--PGDTIMGLDLSHG 123

Query: 92  -HGTNPASAQMAGMSVEPVS--VRKD-GTIDFSDLETKVKKNKETLSCLMITYPSTFGVF 147
            H T+ +    +G     VS  V  + G ID+ ++E   K+ K     L+I   S +   
Sbjct: 124 GHLTHGSPVNFSGKLFNVVSYGVDPETGLIDYDEVEKLAKEVKPK---LIIAGGSAY-SR 179

Query: 148 EENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNL-----HKTFCIPHG 202
             +     E+  E G  + +D A++    GL   G + + + H ++     HKT   P G
Sbjct: 180 PIDFKRFREIADEVGAYLMVDMAHV---AGLIAGGVHPNPLPHADVVTTTTHKTLRGPRG 236

Query: 203 G 203
           G
Sbjct: 237 G 237


>gnl|CDD|223154 COG0076, GadB, Glutamate decarboxylase and related PLP-dependent
           proteins [Amino acid transport and metabolism].
          Length = 460

 Score = 33.9 bits (78), Expect = 0.086
 Identities = 18/99 (18%), Positives = 41/99 (41%), Gaps = 6/99 (6%)

Query: 75  QAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRK-DGTIDFSDLETKVKKN--KE 131
           ++     +   +   +AH +   +A+  G+ +  V     D  ID   LE  + +N    
Sbjct: 150 ESGKPGGKPNIVCSETAHFSFEKAARYLGLGLRRVPTVPTDYRIDVDALEEAIDENTIGG 209

Query: 132 TLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGA 170
            +     T     G   ++I ++ ++  E+G  +++D A
Sbjct: 210 VVVGTAGTTD--TGSI-DDIEELADIAEEYGIWLHVDAA 245


>gnl|CDD|99743 cd06450, DOPA_deC_like, DOPA decarboxylase family. This family
           belongs to pyridoxal phosphate (PLP)-dependent aspartate
           aminotransferase superfamily (fold I). The major groups
           in this CD correspond to DOPA/tyrosine decarboxylase
           (DDC), histidine decarboxylase (HDC), and glutamate
           decarboxylase (GDC). DDC is active as a dimer and
           catalyzes the decarboxylation of tyrosine. GDC catalyzes
           the decarboxylation of glutamate and HDC catalyzes the
           decarboxylation of histidine.
          Length = 345

 Score = 33.7 bits (78), Expect = 0.089
 Identities = 19/91 (20%), Positives = 41/91 (45%), Gaps = 5/91 (5%)

Query: 84  VCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLS---CLMITY 140
           V +    AH +   +A    + V  V V +DG +D   LE  + ++K        ++ T 
Sbjct: 97  VIVCSDQAHVSVEKAAAYLDVKVRLVPVDEDGRMDPEALEAAIDEDKAEGLNPIMVVATA 156

Query: 141 PST-FGVFEENITDVCELIHEHGGQVYLDGA 170
            +T  G   + + ++ +L  ++   +++D A
Sbjct: 157 GTTDTGAI-DPLEEIADLAEKYDLWLHVDAA 186


>gnl|CDD|215756 pfam00155, Aminotran_1_2, Aminotransferase class I and II. 
          Length = 357

 Score = 33.8 bits (78), Expect = 0.093
 Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 5/88 (5%)

Query: 86  LIPVSAHGTNPASAQMAGMSVEPVSVR--KDGTIDFSDLETKVKKNKETLSCLMITYPS- 142
           L+P   + +     ++AG  V    +    D  +DF  LE  +K+  E    ++ T P  
Sbjct: 94  LVPAPTYPSYIRIFRLAGGEVVRYPLYSSNDFHLDFDALEAALKEATEGNKVVLHTSPHN 153

Query: 143 -TFGVF-EENITDVCELIHEHGGQVYLD 168
            T  V   E +  + +L  EH   + +D
Sbjct: 154 PTGTVATLEELEKLLDLAKEHNILLLVD 181


>gnl|CDD|99734 cd00609, AAT_like, Aspartate aminotransferase family. This family
           belongs to pyridoxal phosphate (PLP)-dependent aspartate
           aminotransferase superfamily (fold I). Pyridoxal
           phosphate combines with an alpha-amino acid to form a
           compound called a Schiff base or aldimine intermediate,
           which depending on the reaction, is the substrate in
           four kinds of reactions (1) transamination (movement of
           amino groups), (2) racemization (redistribution of
           enantiomers), (3) decarboxylation (removing COOH
           groups), and (4) various side-chain reactions depending
           on the enzyme involved. Pyridoxal phosphate (PLP)
           dependent enzymes were previously classified into alpha,
           beta and gamma classes, based on the chemical
           characteristics (carbon atom involved) of the reaction
           they catalyzed. The availability of several structures
           allowed a comprehensive analysis of  the evolutionary
           classification of PLP dependent enzymes, and it was
           found that the functional classification did not always
           agree with the evolutionary history of these enzymes.
           The major groups in this CD corresponds to Aspartate
           aminotransferase a, b and c, Tyrosine, Alanine,
           Aromatic-amino-acid, Glutamine phenylpyruvate,
           1-Aminocyclopropane-1-carboxylate synthase,
           Histidinol-phosphate, gene products of malY and cobC,
           Valine-pyruvate aminotransferase and Rhizopine
           catabolism regulatory protein.
          Length = 350

 Score = 33.1 bits (76), Expect = 0.17
 Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 4/88 (4%)

Query: 86  LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPS--T 143
           L+P   +    A+A++AG  V PV + ++G         +  K  +T   L +  P+  T
Sbjct: 87  LVPDPTYPGYEAAARLAGAEVVPVPLDEEGGFLLDLELLEAAKTPKT-KLLYLNNPNNPT 145

Query: 144 FGVF-EENITDVCELIHEHGGQVYLDGA 170
             V  EE + ++ EL  +HG  +  D A
Sbjct: 146 GAVLSEEELEELAELAKKHGILIISDEA 173


>gnl|CDD|178323 PLN02721, PLN02721, threonine aldolase.
          Length = 353

 Score = 32.7 bits (75), Expect = 0.20
 Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 26/167 (15%)

Query: 27  DQARGYEQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCL 86
           D   GY+     LE ++ +I G +   F P SG  G    +  + C  +  +    +   
Sbjct: 33  DDVLGYDPTALRLEEEMAKIFGKEAALFVP-SGTMGNLISV-LVHCDVRGSEVILGDNSH 90

Query: 87  IPVSAHGTNPASAQMAGMS----VEPVSV--RKDGTIDFSDLETKVKKNKET------LS 134
           I +  +G         G+S    V P +V   +DGT+D   +E  ++   +       L 
Sbjct: 91  IHLYENG---------GISTLGGVHPRTVKNNEDGTMDLDAIEAAIRPKGDDHFPTTRLI 141

Query: 135 CLMITYPSTFGVF--EENITDVCELIHEHGGQVYLDGANM-NAQVGL 178
           CL  T+ +  G     E    V EL   HG ++++DGA + NA V L
Sbjct: 142 CLENTHANCGGRCLSVEYTDKVGELAKRHGLKLHIDGARIFNASVAL 188


>gnl|CDD|235068 PRK02769, PRK02769, histidine decarboxylase; Provisional.
          Length = 380

 Score = 32.7 bits (75), Expect = 0.23
 Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 3/83 (3%)

Query: 91  AHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEEN 150
            H +    A++  +    ++   +G ID+ DL +K+K+NK     +     +T     +N
Sbjct: 119 THYSVSKIARLLRIKSRVITSLPNGEIDYDDLISKIKENKNQPPIIFANIGTTMTGAIDN 178

Query: 151 ITDVCELIHEHG-GQVYL--DGA 170
           I ++ E++ + G    Y+  D A
Sbjct: 179 IKEIQEILKKIGIDDYYIHADAA 201


>gnl|CDD|153332 cd07648, F-BAR_FCHO, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of FCH domain Only proteins.
            F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization. Proteins in
           this group have been named FCH domain Only (FCHO)
           proteins. Vertebrates have two members, FCHO1 and FCHO2.
           These proteins contain an F-BAR domain and a C-terminal
           domain of unknown function named SAFF which is also
           present in endophilin interacting protein 1. F-BAR
           domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 261

 Score = 32.3 bits (74), Expect = 0.27
 Identities = 13/52 (25%), Positives = 23/52 (44%), Gaps = 2/52 (3%)

Query: 112 RKDGTI--DFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEH 161
           R++     +    E K+KK ++    L+  Y +    FE  +TD C+   E 
Sbjct: 143 RRENASPKEIEKAEAKLKKAQDEYKALVEKYNNIRADFETKMTDSCKRFQEI 194


>gnl|CDD|215829 pfam00266, Aminotran_5, Aminotransferase class-V.  This domain is
           found in amino transferases, and other enzymes including
           cysteine desulphurase EC:4.4.1.-.
          Length = 370

 Score = 31.8 bits (73), Expect = 0.39
 Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 9/90 (10%)

Query: 86  LIPVSAHGTNPASAQMA----GMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYP 141
           L+    H  N    Q      G +V  + V  +G +D   LE  +    + ++   IT+ 
Sbjct: 92  LVTEMEHHANLVPWQELAKRTGATVRVIPVDPNGLLDLDALEKLLTPRTKLVA---ITHV 148

Query: 142 ST-FGVFEENITDVCELIHEHGGQVYLDGA 170
           S   G     + ++ +L HE+G  V +D A
Sbjct: 149 SNVTGTVNP-VEEIGKLAHEYGALVVVDAA 177


>gnl|CDD|99746 cd06453, SufS_like, Cysteine desulfurase (SufS)-like. This family
           belongs to the pyridoxal phosphate (PLP)-dependent
           aspartate aminotransferase superfamily (fold I). The
           major groups in this CD correspond to cysteine
           desulfurase (SufS) and selenocysteine lyase. SufS
           catalyzes the removal of elemental sulfur and selenium
           atoms from L-cysteine, L-cystine, L-selenocysteine, and
           L-selenocystine to produce L-alanine; and selenocysteine
           lyase catalyzes the decomposition of L-selenocysteine.
          Length = 373

 Score = 31.7 bits (73), Expect = 0.47
 Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 11/76 (14%)

Query: 99  AQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPST-FGVFEENITDV--- 154
           A+  G  ++ V V  DG +D   LE  + +  + ++   +T+ S   G     I  V   
Sbjct: 109 AERTGAKLKVVPVDDDGQLDLEALEKLLTERTKLVA---VTHVSNVLGT----INPVKEI 161

Query: 155 CELIHEHGGQVYLDGA 170
            E+ HE G  V +DGA
Sbjct: 162 GEIAHEAGVPVLVDGA 177


>gnl|CDD|236103 PRK07810, PRK07810, O-succinylhomoserine sulfhydrylase;
           Provisional.
          Length = 403

 Score = 31.2 bits (71), Expect = 0.59
 Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 2/39 (5%)

Query: 150 NITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDV 188
           +I  V EL H  G +V LD  N+ A   L R    G+DV
Sbjct: 173 DIAAVSELAHAAGAKVVLD--NVFATPLLQRGLPLGADV 209


>gnl|CDD|218796 pfam05889, SLA_LP_auto_ag, Soluble liver antigen/liver pancreas
           antigen (SLA/LP autoantigen).  This family consists of
           several eukaryotic and archaeal proteins which are
           related to the human soluble liver antigen/liver
           pancreas antigen (SLA/LP autoantigen). Autoantibodies
           are a hallmark of autoimmune hepatitis, but most are not
           disease specific. Autoantibodies to soluble liver
           antigen (SLA) and to liver and pancreas antigen (LP)
           have been described as disease specific, occurring in
           about 30% of all patients with autoimmune hepatitis. The
           function of SLA/LP is unknown, however, it has been
           suggested that the protein may function as a serine
           hydroxymethyltransferase and may be an important enzyme
           in the thus far poorly understood selenocysteine
           pathway. The archaeal sequences Methanopyrus kandleri
           spcS and MK0229 are annotated as being pyridoxal
           phosphate-dependent enzymes.
          Length = 386

 Score = 31.0 bits (70), Expect = 0.66
 Identities = 25/138 (18%), Positives = 54/138 (39%), Gaps = 10/138 (7%)

Query: 88  PVSAHGTNPASAQMAGMS---VEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTF 144
           P     ++  +A+ AG     VE V        D +D+ET +++  E +   +++  S F
Sbjct: 107 PRIDQKSSIKAAERAGFEPRLVETVLEGDYLITDVNDVETIIEEKGEEVILAVLSTTSCF 166

Query: 145 -GVFEENITDVCELIHEHGGQVYLDGA----NMNAQVGLCRPGDYGS-DVSHLNLHKTFC 198
                + + ++ ++  E+     ++GA    +      + +    G  D    +L K F 
Sbjct: 167 APRSPDRVKEIAKICAEYDVPHLVNGAYGIQSEKYIHLIQQAAKVGRIDAVVQSLDKNFI 226

Query: 199 IPHGGGG-PGMGPIGVKS 215
           +P GG          ++ 
Sbjct: 227 VPVGGAIIAAFDESFIQE 244


>gnl|CDD|191635 pfam06903, VirK, VirK protein.  This family consists of several
           bacterial VirK proteins of around 145 residues in
           length. The function of this family is unknown.
          Length = 101

 Score = 28.8 bits (65), Expect = 1.0
 Identities = 8/32 (25%), Positives = 15/32 (46%)

Query: 90  SAHGTNPASAQMAGMSVEPVSVRKDGTIDFSD 121
                 P++    G+ ++   +  DGT+ FSD
Sbjct: 31  PEGAGTPSTKTRGGVRIDAYRITSDGTLAFSD 62


>gnl|CDD|130884 TIGR01825, gly_Cac_T_rel, pyridoxal phosphate-dependent
           acyltransferase, putative.  This model represents an
           enzyme subfamily related to three known enzymes; it
           appears closest to glycine C-acteyltransferase, shows no
           overlap with it in species distribution, and may share
           that function. The three closely related enzymes are
           glycine C-acetyltransferase (2-amino-3-ketobutyrate
           coenzyme A ligase), 5-aminolevulinic acid synthase, and
           8-amino-7-oxononanoate synthase. All transfer the
           R-group (acetyl, succinyl, or 6-carboxyhexanoyl) from
           coenzyme A to an amino acid (Gly, Gly, Ala,
           respectively), with release of CO2 for the latter two
           reactions.
          Length = 385

 Score = 30.6 bits (69), Expect = 1.1
 Identities = 24/101 (23%), Positives = 41/101 (40%), Gaps = 13/101 (12%)

Query: 117 IDFSDLETKVKKNKETLSCLMITYPSTFGVFEEN-----ITDVCELIHEHGGQVYLDGAN 171
            D  DL+  +++N      L++T     GVF  +     + ++ EL   +G   Y+D A+
Sbjct: 147 ADMDDLDRVLRENPSYGKKLIVTD----GVFSMDGDVAPLPEIVELAERYGAVTYVDDAH 202

Query: 172 MNAQVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIG 212
            +  +G    G     V H  L     I  G     +G +G
Sbjct: 203 GSGVMGEAGRG----TVHHFGLEDKVDIQVGTLSKAIGVVG 239


>gnl|CDD|237409 PRK13520, PRK13520, L-tyrosine decarboxylase; Provisional.
          Length = 371

 Score = 30.2 bits (69), Expect = 1.2
 Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 10/96 (10%)

Query: 34  QLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHG 93
           +++GEL   L +  GY  I+    SG  G  A ++A++       A   N+ ++P SAH 
Sbjct: 66  EMLGEL-LHLPDAYGY--IT----SG--GTEANIQAVRAARNLAKAEKPNI-VVPESAHF 115

Query: 94  TNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKN 129
           +   +A M G+ +    +  D  +D   +E  +  N
Sbjct: 116 SFDKAADMLGVELRRAPLDDDYRVDVKAVEDLIDDN 151


>gnl|CDD|99799 cd06202, Nitric_oxide_synthase, The ferredoxin-reductase (FNR) like
           C-terminal domain of the nitric oxide synthase (NOS)
           fuses with a heme-containing N-terminal oxidase domain.
           The reductase portion is similar in structure to NADPH
           dependent cytochrome-450 reductase (CYPOR), having an
           inserted connecting sub-domain within the FAD binding
           portion of FNR. NOS differs from CYPOR in a requirement
           for the cofactor tetrahydrobiopterin and unlike most
           CYPOR is dimeric. Nitric oxide synthase produces nitric
           oxide in the conversion of L-arginine to L-citruline.
           NOS has been implicated in a variety of processes
           including cytotoxicity, anti-inflamation,
           neurotransmission, and vascular smooth muscle
           relaxation.
          Length = 406

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 9/63 (14%)

Query: 123 ETKVKKNKETLSCLMITY------PSTF--GVFEENITDVCELIHEHGGQVYLDG-ANMN 173
           ET+  KNK  L+ +          P T+   + +E    V + +   GG +Y+ G   M 
Sbjct: 303 ETEEAKNKGVLTEVYTALSREPGKPKTYVQDLLKEQAESVYDALVREGGHIYVCGDVTMA 362

Query: 174 AQV 176
             V
Sbjct: 363 EDV 365


>gnl|CDD|221493 pfam12256, TcdB_toxin_midN, Insecticide toxin TcdB
           middle/N-terminal region.  This domain family is found
           in bacteria and archaea, and is typically between 164
           and 180 amino acids in length. The family is found in
           association with pfam05593. This family is the
           N-terminal-sided middle region of the bacterial
           insecticide toxin TcdB. This region appears related to
           the FG-GAP repeat pfam01839.
          Length = 171

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 206 PGMGPIGVKSHLAPFLPVHPLSSIDSSIGAVSAAHYGSAS 245
           P + P   +       P + L SI +++GA +   Y S++
Sbjct: 45  PHLTPRHWRYDFPADKP-YLLKSITNNMGATTTLEYRSSA 83


>gnl|CDD|233359 TIGR01329, cysta_beta_ly_E, cystathionine beta-lyase, eukaryotic.
           This model represents cystathionine beta-lyase
           (alternate name: beta-cystathionase), one of several
           pyridoxal-dependent enzymes of cysteine, methionine, and
           homocysteine metabolism. This enzyme is involved in the
           biosynthesis of Met from Cys.
          Length = 378

 Score = 30.2 bits (68), Expect = 1.4
 Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 2/49 (4%)

Query: 150 NITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTFC 198
           +I  + E+ H     V +D   M+    LC P + G+D+ + +  K   
Sbjct: 149 DIRKISEMAHAQNALVVVDNTMMSPL--LCNPLELGADIVYHSATKFLA 195


>gnl|CDD|131199 TIGR02144, LysX_arch, Lysine biosynthesis enzyme LysX.  The family
           of proteins found in this equivalog include the
           characterized LysX from Thermus thermophilus which is
           part of a well-organized lysine biosynthesis gene
           cluster. LysX is believed to carry out an ATP-dependent
           acylation of the amino group of alpha-aminoadipate in
           the prokaryotic version of the fungal AAA lysine
           biosynthesis pathway. No species having a sequence in
           this equivalog contains the elements of the more common
           diaminopimelate lysine biosythesis pathway, and none has
           been shown to be a lysine auxotroph. These sequences
           have mainly recieved the name of the related enzyme,
           "ribosomal protein S6 modification protein RimK". RimK
           has been characterized in E. coli, and acts by
           ATP-dependent condensation of S6 with glutamate
           residues.
          Length = 280

 Score = 29.3 bits (66), Expect = 2.4
 Identities = 13/39 (33%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 264 LFRSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
           +F S   GL+ +E V  V +FK S  +  V++A  +++Y
Sbjct: 237 IFESKERGLLVNE-VNHVPEFKNSVRVTGVNVAGEILEY 274


>gnl|CDD|218685 pfam05673, DUF815, Protein of unknown function (DUF815).  This
           family consists of several bacterial proteins of unknown
           function.
          Length = 248

 Score = 29.0 bits (66), Expect = 2.4
 Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 5/43 (11%)

Query: 238 AAHYGSASILPISWAYIRRLESHYKTLFRSSRSGLVAHEFVID 280
           AA  G    LP+    +RR E+    L R  RSG  A +F+ D
Sbjct: 206 AARLG----LPLDEEELRR-EALQWALTRGGRSGRTARQFIDD 243


>gnl|CDD|182661 PRK10702, PRK10702, endonuclease III; Provisional.
          Length = 211

 Score = 28.4 bits (63), Expect = 4.1
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 4/46 (8%)

Query: 139 TYPSTFGVFE---ENITDVCE-LIHEHGGQVYLDGANMNAQVGLCR 180
           TY  T G++    EN+   C  L+ +H G+V  D A + A  G+ R
Sbjct: 74  TYIKTIGLYNSKAENVIKTCRILLEQHNGEVPEDRAALEALPGVGR 119


>gnl|CDD|216911 pfam02165, WT1, Wilm's tumour protein. 
          Length = 288

 Score = 28.3 bits (63), Expect = 4.5
 Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 3/49 (6%)

Query: 185 GSDVSHLN--LHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSIDS 231
           GSDV  L+  L     +P GGGG G+ P+G  S  AP L  HP S   S
Sbjct: 2   GSDVRDLSALLPPVSSLPGGGGGCGL-PVGGASQWAPVLDFHPGSPYSS 49


>gnl|CDD|237964 PRK15426, PRK15426, putative diguanylate cyclase YedQ; Provisional.
          Length = 570

 Score = 28.8 bits (65), Expect = 4.5
 Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 5/52 (9%)

Query: 221 LPVHPLS-SIDSSIGAVSAAHYGSASILP----ISWAYIRRLESHYKTLFRS 267
           + VH L   +    G++S A     ++      ISW  IRR+ S+   L  S
Sbjct: 342 VRVHTLREGVRGDFGSISIALTLLWALFTAMLLISWYVIRRMVSNMFVLQSS 393


>gnl|CDD|233947 TIGR02600, Verru_Chthon_A, Verru_Chthon cassette protein A.  In
           Verrucomicrobium spinosum and Chthoniobacter flavus, a
           four-gene operon that includes proteins with an
           N-terminal signal sequence for cleavage and methylation
           recurs many times. Each operon is likely to encode a
           membrane complex, the function of which is unknown. This
           model represents a long protein from this putative
           membrame complex, with members averaging about 1300
           amino acids. The N-terminal region includes an apparent
           signal sequence. The function is unknown. Most cassettes
           are adjacent to an unusually large protein with both an
           outer membrane autotransporter region and PEP-CTERM
           putative protein-sorting motif [Cell envelope, Surface
           structures].
          Length = 1265

 Score = 28.7 bits (64), Expect = 4.8
 Identities = 10/36 (27%), Positives = 15/36 (41%)

Query: 86  LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSD 121
                A     A++    M V  + V +DGT+D  D
Sbjct: 169 TSGTVAPPGTDATSPRLPMPVRWLYVLQDGTLDVPD 204


>gnl|CDD|168128 PRK05613, PRK05613, O-acetylhomoserine
           aminocarboxypropyltransferase; Validated.
          Length = 437

 Score = 28.3 bits (63), Expect = 5.2
 Identities = 21/62 (33%), Positives = 30/62 (48%), Gaps = 6/62 (9%)

Query: 150 NITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMG 209
           +I  V E+ H +  QV L   N  A   L RP + G+DV   +L K +     G G G+G
Sbjct: 173 DIPAVAEVAHRN--QVPLIVDNTIATAALVRPLELGADVVVASLTKFYT----GNGSGLG 226

Query: 210 PI 211
            +
Sbjct: 227 GV 228


>gnl|CDD|191647 pfam06947, DUF1290, Protein of unknown function (DUF1290).  This
           family consists of several bacterial small basic
           proteins of around 100 residues in length. The function
           of this family is unknown.
          Length = 88

 Score = 26.4 bits (59), Expect = 5.7
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 217 LAPFLPVHPLSSIDSSIGAVSAA 239
            +P+L V  L+++DS  G + A 
Sbjct: 6   YSPYLSVAVLAALDSVFGGIRAN 28


>gnl|CDD|214732 smart00583, SPK, domain in SET and PHD domain containing proteins
           and protein kinases. 
          Length = 114

 Score = 27.1 bits (61), Expect = 5.7
 Identities = 9/29 (31%), Positives = 14/29 (48%)

Query: 105 SVEPVSVRKDGTIDFSDLETKVKKNKETL 133
           ++EP+ V      +FS LE     + ET 
Sbjct: 16  AIEPLVVPLKVFEEFSKLEGNSLLSYETY 44


>gnl|CDD|130828 TIGR01767, MTRK, 5-methylthioribose kinase.  This enzyme is
           involved in the methionine salvage pathway in certain
           bacteria.
          Length = 370

 Score = 28.3 bits (63), Expect = 5.8
 Identities = 15/38 (39%), Positives = 16/38 (42%), Gaps = 3/38 (7%)

Query: 262 KTLFRSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRL 299
           KTLF SS   L        V  F    N E  DI +RL
Sbjct: 111 KTLFYSSDYALEPKVKKALVAQF---TNPELCDITERL 145


>gnl|CDD|233290 TIGR01141, hisC, histidinol-phosphate aminotransferase.  Alternate
           names: histidinol-phosphate transaminase; imidazole
           acetol-phosphate transaminase Histidinol-phosphate
           aminotransferase is a pyridoxal-phosphate dependent
           enzyme [Amino acid biosynthesis, Histidine family].
          Length = 346

 Score = 27.6 bits (62), Expect = 8.8
 Identities = 26/115 (22%), Positives = 45/115 (39%), Gaps = 17/115 (14%)

Query: 86  LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPS--T 143
           L+P   +     SA++ G  V  V + +DG +D  D+      +K  L  + +  P+  T
Sbjct: 99  LVPPPTYSMYEISAKIHGAEVVKVPLDEDGQLDLEDILV-AIDDKPKL--VFLCSPNNPT 155

Query: 144 FGVF-EENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTF 197
             +    +I  V E   E    V +D A           G++  + S L L   +
Sbjct: 156 GNLLSRSDIEAVLERTPE-DALVVVDEAY----------GEFSGEPSTLPLLAEY 199


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.135    0.411 

Gapped
Lambda     K      H
   0.267   0.0798    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,128,314
Number of extensions: 1406093
Number of successful extensions: 1337
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1302
Number of HSP's successfully gapped: 55
Length of query: 302
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 206
Effective length of database: 6,679,618
Effective search space: 1376001308
Effective search space used: 1376001308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.4 bits)