RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7357
(302 letters)
>gnl|CDD|235432 PRK05367, PRK05367, glycine dehydrogenase; Provisional.
Length = 954
Score = 600 bits (1550), Expect = 0.0
Identities = 196/327 (59%), Positives = 238/327 (72%), Gaps = 26/327 (7%)
Query: 1 MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
MKLNA EMIP ++P+ N+HPF P +QA GY +LI +LE L EITGYD +S QPN+GA
Sbjct: 509 MKLNAAAEMIPITWPEFANLHPFAPAEQAAGYRELIDQLEAWLAEITGYDAVSLQPNAGA 568
Query: 61 QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
QGEYAGL AI+ YH+++ HR+VCLIP SAHGTNPASA MAGM V V+ ++G ID
Sbjct: 569 QGEYAGLLAIRAYHESRGEGHRDVCLIPSSAHGTNPASAVMAGMKVVVVACDENGNIDLD 628
Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
DL K +++ + L+ +MITYPST GVFEE I ++CE++HEHGGQVYLDGANMNAQVGL R
Sbjct: 629 DLRAKAEEHADNLAAIMITYPSTHGVFEETIREICEIVHEHGGQVYLDGANMNAQVGLAR 688
Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSI--DSSIGAVSA 238
PGD G+DVSHLNLHKTFCIPHGGGGPG+GPIGVK+HLAPFLP HP+ ++ IGAVSA
Sbjct: 689 PGDIGADVSHLNLHKTFCIPHGGGGPGVGPIGVKAHLAPFLPGHPVQIAGGETGIGAVSA 748
Query: 239 AHYGSASILPISWAYIR-----------------------RLESHYKTLFRSSRSGLVAH 275
A +GSASILPISW YIR RL+ HY L+ +G VAH
Sbjct: 749 APFGSASILPISWMYIRMMGAEGLRQATEVAILNANYIAKRLKDHYPVLYT-GANGRVAH 807
Query: 276 EFVIDVRDFKKSANIEAVDIAKRLMDY 302
E ++D+R K+S I DIAKRL+DY
Sbjct: 808 ECILDLRPLKESTGITVDDIAKRLIDY 834
>gnl|CDD|223935 COG1003, GcvP, Glycine cleavage system protein P
(pyridoxal-binding), C-terminal domain [Amino acid
transport and metabolism].
Length = 496
Score = 484 bits (1247), Expect = e-171
Identities = 178/334 (53%), Positives = 218/334 (65%), Gaps = 37/334 (11%)
Query: 1 MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
MKLN EM P ++P+ N+HPF P++Q +GY +LI EL+ L EITG D +S QPN+GA
Sbjct: 76 MKLNPKAEMKPATWPEFANIHPFQPEEQVQGYLELIYELQEWLKEITGMDAVSLQPNAGA 135
Query: 61 QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
QGEYAGL AI+ YH+++ HRN+CLIP SAHGTNPASA MAG V V ++G +D
Sbjct: 136 QGEYAGLLAIRAYHESRGEGHRNICLIPDSAHGTNPASAAMAGFKVVVVKCDENGNVDLD 195
Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
DL K + N L+ LMIT PST GVFEE+I ++CE++HE GGQVY DGAN+NA VGL R
Sbjct: 196 DLRAKAEDN---LAALMITNPSTLGVFEEDIREICEIVHEAGGQVYYDGANLNAIVGLAR 252
Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPV---------HPLSSIDS 231
PGD G DV HLNLHKTFCIPHGGGGPG GP+GVK+HLAPFLP + L
Sbjct: 253 PGDMGFDVVHLNLHKTFCIPHGGGGPGAGPVGVKAHLAPFLPGPVVYHDVGEYRLDYDGK 312
Query: 232 SIGAVSAAHYGSASILPISWAYIR-----------------------RLESHYKTLFRSS 268
VSAA YGSASILPI+WAYIR RL+ +Y +
Sbjct: 313 KSIGVSAAPYGSASILPIAWAYIRMMGADGLKQASEVAVLNANYIARRLKGYYPVPYTGE 372
Query: 269 RSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
VAHE ++D R KK + A+D+AKRL+DY
Sbjct: 373 N--RVAHECILDARPLKKETGVRALDVAKRLLDY 404
>gnl|CDD|178035 PLN02414, PLN02414, glycine dehydrogenase (decarboxylating).
Length = 993
Score = 481 bits (1240), Expect = e-164
Identities = 185/333 (55%), Positives = 230/333 (69%), Gaps = 32/333 (9%)
Query: 1 MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
MKLNATTEM+P ++P+ N+HPF P DQA+GY+++ +L LCEITG+D S QPN+GA
Sbjct: 535 MKLNATTEMMPVTWPEFANIHPFAPVDQAQGYQEMFEDLGDLLCEITGFDSFSLQPNAGA 594
Query: 61 QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
GEYAGL I+ YH ++ HHRNVC+IPVSAHGTNPASA M GM + V G I+
Sbjct: 595 AGEYAGLMVIRAYHLSRGDHHRNVCIIPVSAHGTNPASAAMCGMKIVVVGTDAKGNINIE 654
Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
+L + +K+ L+ LM+TYPST GV+EE I ++C++IH++GGQVY+DGANMNAQVGL
Sbjct: 655 ELRKAAEAHKDNLAALMVTYPSTHGVYEEGIDEICDIIHDNGGQVYMDGANMNAQVGLTS 714
Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSIDS--------S 232
PG G+DV HLNLHKTFCIPHGGGGPGMGPIGVK HLAPFLP HP+
Sbjct: 715 PGFIGADVCHLNLHKTFCIPHGGGGPGMGPIGVKKHLAPFLPSHPVVPTGGIPRPEKTQP 774
Query: 233 IGAVSAAHYGSASILPISWAYI-----------------------RRLESHYKTLFRSSR 269
+G +SAA +GSA ILPIS+ YI +RLE HY LFR +
Sbjct: 775 LGTISAAPWGSALILPISYTYIAMMGSEGLTDASKIAILNANYMAKRLEGHYPVLFR-GK 833
Query: 270 SGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
+G AHEF+ID+R FK +A IE D+AKRLMDY
Sbjct: 834 NGTCAHEFIIDLRPFKNTAGIEPEDVAKRLMDY 866
>gnl|CDD|129553 TIGR00461, gcvP, glycine dehydrogenase (decarboxylating). This
apparently ubiquitous enzyme is found in bacterial,
mammalian and plant sources. The enzyme catalyzes the
reaction: GLYCINE + LIPOYLPROTEIN =
S-AMINOMETHYL-DIHYDROLIPOYLPROTEIN + CO2. It is part of
the glycine decarboxylase multienzyme complex (GDC)
consisting of four proteins P, H, L and T. Active site
in E.coli is located as the (K) residues at position 713
of the SEED alignment [Energy metabolism, Amino acids
and amines].
Length = 939
Score = 470 bits (1210), Expect = e-160
Identities = 193/331 (58%), Positives = 234/331 (70%), Gaps = 30/331 (9%)
Query: 1 MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
MKLNAT EM+P ++P+ +N+HPF P +Q GY++LI +LE LC ITG+D IS QPNSGA
Sbjct: 497 MKLNATAEMMPITWPEFSNIHPFQPSNQVEGYQELIAQLEKWLCSITGFDAISLQPNSGA 556
Query: 61 QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
QGEYAGLR I+ YH+++ +HRN+CLIPVSAHGTNPASA MAGM V PV+ +DG ID
Sbjct: 557 QGEYAGLRVIRSYHESRGENHRNICLIPVSAHGTNPASAAMAGMQVVPVNCDQDGNIDLV 616
Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
DL+ K +++ + L+ +M+TYPST GVFE I C+++H GGQVYLDGANMNAQVGL
Sbjct: 617 DLKNKAEQHGDELAAVMVTYPSTHGVFEPTIQHACDIVHSFGGQVYLDGANMNAQVGLTS 676
Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSI------DSSIG 234
PGD G+DV HLNLHKTFCIPHGGGGPGMGPIGVKSHL PFLP H + S+ SIG
Sbjct: 677 PGDLGADVCHLNLHKTFCIPHGGGGPGMGPIGVKSHLIPFLPKHDVVSMITGIGGSKSIG 736
Query: 235 AVSAAHYGSASILPISWAYIR-----------------------RLESHYKTLFRSSRSG 271
+VSAA YGSASILPISW YI+ RL+ HY LF
Sbjct: 737 SVSAAPYGSASILPISWMYIKMMGNEGLPKASVVAILNANYMATRLKDHYPILFV-GTLK 795
Query: 272 LVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
VAHE ++D+R K IEA+D+AKRL DY
Sbjct: 796 HVAHECILDLRPLKAKTGIEAIDVAKRLQDY 826
>gnl|CDD|171585 PRK12566, PRK12566, glycine dehydrogenase; Provisional.
Length = 954
Score = 425 bits (1093), Expect = e-142
Identities = 187/327 (57%), Positives = 229/327 (70%), Gaps = 26/327 (7%)
Query: 1 MKLNATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNSGA 60
MKLNAT+EMIP ++P+ +HPF P++QA GY +I ELE LC ITG+D I QPNSGA
Sbjct: 510 MKLNATSEMIPITWPEFAQLHPFAPREQAEGYRAMIDELEAWLCAITGFDAICMQPNSGA 569
Query: 61 QGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
QGEYAGL AI+ YH+++ R++CLIP SAHGTNPASAQMAGM V V DG +D
Sbjct: 570 QGEYAGLLAIRRYHRSRGQSQRDICLIPSSAHGTNPASAQMAGMRVVIVECDPDGNVDLD 629
Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCR 180
DL+ K + LSCLMITYPST GV+EE I ++CE++H+HGGQVY+DGAN+NAQVGL R
Sbjct: 630 DLKAKAAAAGDRLSCLMITYPSTHGVYEEGIREICEVVHQHGGQVYMDGANLNAQVGLAR 689
Query: 181 PGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFL---PVHPLSSIDSSIGAVS 237
P D G+DVSH+NLHKTFCIPHGGGGPGMGPIGV++HLAPF+ PV P+ D + GAVS
Sbjct: 690 PADIGADVSHMNLHKTFCIPHGGGGPGMGPIGVRAHLAPFVANHPVVPVEGPDPNNGAVS 749
Query: 238 AAHYGSASILPISWAYI----------------------RRLESHYKTLFRSSRSGLVAH 275
AA +GSASILPISW YI +L + L+R R+ VAH
Sbjct: 750 AAPWGSASILPISWMYIAMMGPQLADASEVAILSANYLANQLGGAFPVLYR-GRNERVAH 808
Query: 276 EFVIDVRDFKKSANIEAVDIAKRLMDY 302
E ++D+R K I D+AKRLMDY
Sbjct: 809 ECILDLRPLKAQTGISEEDVAKRLMDY 835
>gnl|CDD|235292 PRK04366, PRK04366, glycine dehydrogenase subunit 2; Validated.
Length = 481
Score = 325 bits (835), Expect = e-109
Identities = 137/337 (40%), Positives = 185/337 (54%), Gaps = 46/337 (13%)
Query: 1 MKLN--ATTEMIPCSFPQLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDKISFQPNS 58
MK N ++ P +HP P++ +G +L+ EL+ L EITG D ++ QP +
Sbjct: 80 MKYNPKINEKV--ARLPGFAELHPLQPEETVQGALELMYELQEWLKEITGMDAVTLQPAA 137
Query: 59 GAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTID 118
GA GE GL I+ YH+A+ R ++P SAHGTNPASA MAG V + +DG +D
Sbjct: 138 GAHGELTGLLMIRAYHEARGDTKRTEVIVPDSAHGTNPASAAMAGFKVVEIPSNEDGLVD 197
Query: 119 FSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGL 178
L+ V ++T + LM+T P+T G+FE NI ++ E++HE GG +Y DGAN+NA +G
Sbjct: 198 LEALKAAV--GEDT-AALMLTNPNTLGLFERNILEIAEIVHEAGGLLYYDGANLNAILGK 254
Query: 179 CRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPV----------HPLSS 228
RPGD G DV HLNLHKTF PHGGGGPG GP+GVK LAPFLPV
Sbjct: 255 ARPGDMGFDVVHLNLHKTFSTPHGGGGPGSGPVGVKEELAPFLPVPVVEKDGDRYRLDYD 314
Query: 229 IDSSIGAVSAAHYGSASILPISWAYIR-----------------------RLESHYKTLF 265
SIG V A YG+ +L ++AYIR RL+ Y +
Sbjct: 315 RPKSIGRVRA-FYGNFGVLVRAYAYIRSLGAEGLREVSEDAVLNANYLKARLKDIYDLPY 373
Query: 266 RSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
HEFV+ + K++ + +DIAKRL+DY
Sbjct: 374 DRPCM----HEFVLSGKKLKET-GVRTLDIAKRLLDY 405
>gnl|CDD|99737 cd00613, GDC-P, Glycine cleavage system P-protein, alpha- and
beta-subunits. This family consists of Glycine cleavage
system P-proteins EC:1.4.4.2 from bacterial, mammalian
and plant sources. The P protein is part of the glycine
decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also
annotated as glycine cleavage system or glycine
synthase. GDC consists of four proteins P, H, L and T.
The reaction catalysed by this protein is: Glycine +
lipoylprotein <=> S-aminomethyldihydrolipoylprotein +
CO2. Alpha-beta-type dimers associate to form an
alpha(2)beta(2) tetramer, where the alpha- and
beta-subunits are structurally similar and appear to
have arisen by gene duplication and subsequent
divergence with a loss of one active site. The members
of this CD are widely dispersed among all three forms of
cellular life.
Length = 398
Score = 314 bits (808), Expect = e-106
Identities = 110/351 (31%), Positives = 155/351 (44%), Gaps = 66/351 (18%)
Query: 1 MKLNATTEMIPCSFP-QLTNMHPFIPQDQARGYEQLIGELETDLCEITGYDK--ISFQPN 57
K N + + + + ++G Q + EL+T LCE+TG D S Q
Sbjct: 30 YKHNPPAVIKRNILENEFYTAYTPYQPEISQGRLQALFELQTMLCELTGMDVANASLQDE 89
Query: 58 SGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTNPASAQMA----GMSVEPVSVRK 113
+ A E AGL AI+ YH RN L+P SAH TNPA A+ G+ V V +
Sbjct: 90 ATAAAEAAGLAAIRAYH------KRNKVLVPDSAHPTNPAVARTRGEPLGIEVVEVPSDE 143
Query: 114 DGTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMN 173
GT+D L+ +V + ++ LM+ YP+T GVFE+ I ++ ++ H G VY+DG N+N
Sbjct: 144 GGTVDLEALKEEVSEE---VAALMVQYPNTLGVFEDLIKEIADIAHSAGALVYVDGDNLN 200
Query: 174 AQVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSIDSSI 233
GL PG+YG+D+ NL KT +PHGGGGPG G VK L FLP + +
Sbjct: 201 LT-GLKPPGEYGADIVVGNLQKTG-VPHGGGGPGAGFFAVKKELVRFLPGRLVGVTKDAE 258
Query: 234 G-------------------AVSAAHYGSASILPISWAYIR------------------- 255
G A S G A + ++ YI
Sbjct: 259 GNRAFRLALQTREQHIRREKATSNICTGQALLALMAAMYIVYLGPEGLKEIAERAHLNAN 318
Query: 256 ----RLESHYKTLFRSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
RL+ L +G HEFV+ + I A D+AK L+D
Sbjct: 319 YLAKRLKEVGGVLP---FNGPFFHEFVLRLPPL---YGIRAEDLAKALIDG 363
>gnl|CDD|99742 cd01494, AAT_I, Aspartate aminotransferase (AAT) superfamily (fold
type I) of pyridoxal phosphate (PLP)-dependent enzymes.
PLP combines with an alpha-amino acid to form a compound
called a Schiff base or aldimine intermediate, which
depending on the reaction, is the substrate in four
kinds of reactions (1) transamination (movement of amino
groups), (2) racemization (redistribution of
enantiomers), (3) decarboxylation (removing COOH
groups), and (4) various side-chain reactions depending
on the enzyme involved. Pyridoxal phosphate (PLP)
dependent enzymes were previously classified into alpha,
beta and gamma classes, based on the chemical
characteristics (carbon atom involved) of the reaction
they catalyzed. The availability of several structures
allowed a comprehensive analysis of the evolutionary
classification of PLP dependent enzymes, and it was
found that the functional classification did not always
agree with the evolutionary history of these enzymes.
Structure and sequence analysis has revealed that the
PLP dependent enzymes can be classified into four major
groups of different evolutionary origin: aspartate
aminotransferase superfamily (fold type I), tryptophan
synthase beta superfamily (fold type II), alanine
racemase superfamily (fold type III), and D-amino acid
superfamily (fold type IV) and Glycogen phophorylase
family (fold type V).
Length = 170
Score = 56.2 bits (136), Expect = 1e-09
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 26/185 (14%)
Query: 38 ELETDLCEIT--GYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHGTN 95
ELE L + G DK F P SG A L A+ ++ + HG+
Sbjct: 4 ELEEKLARLLQPGNDKAVFVP-SGTGANEAALLALLGPGD--------EVIVDANGHGSR 54
Query: 96 PASAQMAGMSVEPVSVRKD----GTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENI 151
A +PV V D G +D + LE K K ++ ++IT +T G +
Sbjct: 55 YWVAAELAG-AKPVPVPVDDAGYGGLDVAILEEL--KAKPNVALIVITPNTTSGGVLVPL 111
Query: 152 TDVCELIHEHGGQVYLDGANMNAQVG--LCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMG 209
++ ++ E+G + +D A+ + G+DV +LHK GG G G
Sbjct: 112 KEIRKIAKEYGILLLVDAASAGGASPAPGVLIPEGGADVVTFSLHKNL------GGEGGG 165
Query: 210 PIGVK 214
+ VK
Sbjct: 166 VVIVK 170
>gnl|CDD|223480 COG0403, GcvP, Glycine cleavage system protein P
(pyridoxal-binding), N-terminal domain [Amino acid
transport and metabolism].
Length = 450
Score = 50.3 bits (121), Expect = 5e-07
Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 2/94 (2%)
Query: 116 TIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQ 175
+D DL+ + + +++ YP+TFG+ EE++ + E H G V + A+ A
Sbjct: 194 VVDADDLDDLESADDGDVFGVLVQYPNTFGIVEEDLRALIEAAHSAGALVIV-AADPLAL 252
Query: 176 VGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMG 209
L PG++G+D+ + + F +P G GGP G
Sbjct: 253 GLLKPPGEFGADIVVGSAQR-FGVPMGFGGPHAG 285
>gnl|CDD|216367 pfam01212, Beta_elim_lyase, Beta-eliminating lyase.
Length = 288
Score = 48.8 bits (117), Expect = 1e-06
Identities = 44/196 (22%), Positives = 75/196 (38%), Gaps = 23/196 (11%)
Query: 27 DQARGYEQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRA-IQCYHQAQDAHHRNVC 85
D+ G + + LE + E+ G + F P SG L A + + ++
Sbjct: 25 DEVYGGDPTVNRLEDRVAELFGKEAALFVP-SGTMANQLALMAHCRRGDEVICGEPAHIY 83
Query: 86 LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVK------KNKETLSCLMIT 139
T A++ G P+ + G +D DLE ++ L L T
Sbjct: 84 F-----DETG-GHAELGGAQPVPLPGAEAGKLDLEDLEAAIRPVGDIHFPPTGLISLENT 137
Query: 140 YPSTFG--VFEENITDVCELIHEHGGQVYLDGANM-NAQVGL-CRPGDYGSDVSHLNLHK 195
+ S G V E + ++ + EHG ++LDGA + NA V L + S +
Sbjct: 138 HNSAGGQVVSLEELREIRAIAREHGIPLHLDGARLANAAVALGVIVKEITSYADSV---- 193
Query: 196 TFCIPHGGGGPGMGPI 211
+ + G G P +G +
Sbjct: 194 SMSLSKGLGAP-VGSV 208
>gnl|CDD|234769 PRK00451, PRK00451, glycine dehydrogenase subunit 1; Validated.
Length = 447
Score = 47.4 bits (114), Expect = 4e-06
Identities = 31/122 (25%), Positives = 54/122 (44%), Gaps = 25/122 (20%)
Query: 102 AGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEH 161
G+ V V +DG D LE V + +T + +++ YP+ FGV E+ + ++ E+ H
Sbjct: 178 QGIEVVEVPY-EDGVTDLEALEAAV--DDDT-AAVVVQYPNFFGVIED-LEEIAEIAHAG 232
Query: 162 GGQVYLDGANMNAQVGLCRPGDYGSDV---------SHLNLHKTFCIPHGGGGPGMGPIG 212
G + G + + L PG+YG+D+ L+ GGP +G
Sbjct: 233 GALFIV-GVDPVSLGLLKPPGEYGADIVVGEGQPLGIPLSF----------GGPYLGFFA 281
Query: 213 VK 214
+
Sbjct: 282 TR 283
>gnl|CDD|224919 COG2008, GLY1, Threonine aldolase [Amino acid transport and
metabolism].
Length = 342
Score = 47.3 bits (113), Expect = 5e-06
Identities = 41/198 (20%), Positives = 67/198 (33%), Gaps = 28/198 (14%)
Query: 24 IPQDQARGYE--QLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAI----QCYHQAQ 77
Y LE + E+ G + F P SG Q L A + +
Sbjct: 23 ANAVGDDVYGEDPTTNALEQRIAELFGKEAALFVP-SGTQANQLALAAHCQPGESVICHE 81
Query: 78 DAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNK----ETL 133
AH + A G P+ DG + D+E ++ + T
Sbjct: 82 TAH--------IYTDECG-APEFFGGGQKLPIVPGADGKLTPEDVEAAIRPDDIHHAPTP 132
Query: 134 SCLMITYPSTFGVF--EENITDVCELIHEHGGQVYLDGANM-NAQVGLCRP-GDYGSDVS 189
++ + G + + + + EHG +++DGA + NA V L S V
Sbjct: 133 LAVLENTATEGGTVYPLDELEAISAVCKEHGLPLHMDGARLANALVALGVALKTIKSYVD 192
Query: 190 HLNLHKTFCIPHGGGGPG 207
+ +FC+ GGG P
Sbjct: 193 SV----SFCLTKGGGAPV 206
>gnl|CDD|223594 COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase
[Posttranslational modification, protein turnover,
chaperones].
Length = 405
Score = 45.8 bits (109), Expect = 2e-05
Identities = 28/117 (23%), Positives = 45/117 (38%), Gaps = 8/117 (6%)
Query: 92 HGTNPASAQMA----GMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTFGVF 147
H +N Q G V + + DG +D LE + + ++ ++ + G
Sbjct: 121 HHSNIVPWQELAKRTGAKVRVIPLDDDGLLDLDALEKLITPKTKLVALSHVS--NVTGT- 177
Query: 148 EENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTFCIPHGGG 204
+ ++ EL HEHG V +D A A + G D + HK P G G
Sbjct: 178 VNPVKEIAELAHEHGALVLVDAAQ-AAGHLPIDVQELGCDFLAFSGHKWLLGPTGIG 233
>gnl|CDD|99748 cd06502, TA_like, Low-specificity threonine aldolase (TA). This
family belongs to pyridoxal phosphate (PLP)-dependent
aspartate aminotransferase superfamily (fold I). TA
catalyzes the conversion of L-threonine or
L-allo-threonine to glycine and acetaldehyde in a
secondary glycine biosynthetic pathway.
Length = 338
Score = 40.8 bits (96), Expect = 5e-04
Identities = 45/193 (23%), Positives = 71/193 (36%), Gaps = 27/193 (13%)
Query: 27 DQARGYEQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAI-QCYHQA---QDAHHR 82
D G + +LE E+ G + F P SG L A Q + AH
Sbjct: 25 DDVYGEDPTTAKLEARAAELFGKEAALFVP-SGTAANQLALAAHTQPGGSVICHETAH-- 81
Query: 83 NVCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKN------KETLSCL 136
+ A ++G+ + PV ++G + DLE ++ +L L
Sbjct: 82 ------IYTDEAG-APEFLSGVKLLPVP-GENGKLTPEDLEAAIRPRDDIHFPPPSLVSL 133
Query: 137 MITYPSTFGVFEENITDVCELIHEHGGQVYLDGANM-NAQVGL-CRPGDYGSDVSHLNLH 194
T + + + L E+G ++LDGA + NA L Y S V +
Sbjct: 134 ENTTEGGTVYPLDELKAISALAKENGLPLHLDGARLANAAAALGVALKTYKSGVDSV--- 190
Query: 195 KTFCIPHGGGGPG 207
+FC+ GGG P
Sbjct: 191 -SFCLSKGGGAPV 202
>gnl|CDD|216983 pfam02347, GDC-P, Glycine cleavage system P-protein. This family
consists of Glycine cleavage system P-proteins
EC:1.4.4.2 from bacterial, mammalian and plant sources.
The P protein is part of the glycine decarboxylase
multienzyme complex EC:2.1.2.10 (GDC) also annotated as
glycine cleavage system or glycine synthase. GDC
consists of four proteins P, H, L and T. The reaction
catalyzed by this protein is:- Glycine + lipoylprotein
<=> S-aminomethyldihydrolipoylprotein + CO2.
Length = 429
Score = 38.9 bits (91), Expect = 0.002
Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 3/87 (3%)
Query: 136 LMITYPSTFGVFEENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHK 195
+++ YP+T G E+ ++ E+ H+HG V + A++ A L PG++G+D+ + +
Sbjct: 201 VLVQYPNTEGRIED-YKELIEIAHQHGSLVVV-AADLLALTILKPPGEFGADIV-VGSAQ 257
Query: 196 TFCIPHGGGGPGMGPIGVKSHLAPFLP 222
F +P G GGP G VK L +P
Sbjct: 258 RFGVPMGYGGPHAGFFAVKEELKRKMP 284
>gnl|CDD|234361 TIGR03812, tyr_de_CO2_Arch, tyrosine decarboxylase MnfA. Members
of this protein family are the archaeal form, MnfA, of
tyrosine decarboxylase, and are involved in methanofuran
biosynthesis. Members show clear homology to the
Enterococcus form, Tdc, that is involved in tyrosine
decarboxylation for resistance to acidic conditions
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Other].
Length = 373
Score = 35.0 bits (81), Expect = 0.035
Identities = 26/110 (23%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 62 GEYAGLRAIQCYHQAQDAHHRNVCLI-PVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFS 120
G A ++A++ R +I P SAH + +A+M G+ + + +D T+D
Sbjct: 85 GTEANIQAVRAAKNLAREEKRTPNIIVPESAHFSFEKAAEMLGLELRYAPLDEDYTVDVK 144
Query: 121 DLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGA 170
D+E + N T+ + I + G ++I ++ ++ E+G +++D A
Sbjct: 145 DVEDLIDDN--TIGIVGIAGTTELGQI-DDIEELSKIALENGIYLHVDAA 191
>gnl|CDD|180392 PRK06084, PRK06084, O-acetylhomoserine
aminocarboxypropyltransferase; Validated.
Length = 425
Score = 34.5 bits (79), Expect = 0.053
Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 27/147 (18%)
Query: 118 DFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDV---CELIHEHGGQVYLDGANMNA 174
D + LE + + + + C I P+ NI D+ + H HG + +D N A
Sbjct: 132 DIAALEALIDERTKAVFCESIGNPAG------NIIDIQALADAAHRHGVPLIVD--NTVA 183
Query: 175 QVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHP----LSSID 230
LCRP ++G+D+ +L K GG G +G I V S P+ L++ D
Sbjct: 184 TPVLCRPFEHGADIVVHSLTKYI----GGHGTSIGGIVVDSGKFPWAEHKERFALLNTPD 239
Query: 231 SSI-GAVSAAHYGSASILPISWAYIRR 256
S G +G A A+I R
Sbjct: 240 PSYHGVTYTEAFGPA-------AFIGR 259
>gnl|CDD|131034 TIGR01979, sufS, cysteine desulfurases, SufSfamily. This model
represents a subfamily of NifS-related cysteine
desulfurases involved in FeS cluster formation needed
for nitrogen fixation among other vital functions. Many
cysteine desulfurases are also active as selenocysteine
lyase and/or cysteine sulfinate desulfinase. This
subfamily is associated with the six-gene SUF system
described in E. coli and Erwinia as an FeS cluster
formation system during oxidative stress. The active
site Cys is this subfamily resembles GHHC with one or
both His conserved [Biosynthesis of cofactors,
prosthetic groups, and carriers, Other].
Length = 403
Score = 34.2 bits (79), Expect = 0.067
Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 7/89 (7%)
Query: 86 LIPVSAHGTNPASAQMA----GMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYP 141
+I H N Q+ G +++ + + DGT+D DLE + + + ++ IT+
Sbjct: 112 VISEMEHHANIVPWQLLAERTGATLKFIPLDDDGTLDLDDLEKLLTEKTKLVA---ITHV 168
Query: 142 STFGVFEENITDVCELIHEHGGQVYLDGA 170
S + ++ +L H+ G +V +DGA
Sbjct: 169 SNVLGTVNPVEEIAKLAHQVGAKVLVDGA 197
>gnl|CDD|223190 COG0112, GlyA, Glycine/serine hydroxymethyltransferase [Amino acid
transport and metabolism].
Length = 413
Score = 34.1 bits (79), Expect = 0.070
Identities = 43/181 (23%), Positives = 75/181 (41%), Gaps = 27/181 (14%)
Query: 33 EQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSA- 91
E+L E ++ G + + QP+SG+Q A A+ Q + + +S
Sbjct: 74 EELAIER---AKKLFGAEYANVQPHSGSQANQAVYLAL-----LQ--PGDTIMGLDLSHG 123
Query: 92 -HGTNPASAQMAGMSVEPVS--VRKD-GTIDFSDLETKVKKNKETLSCLMITYPSTFGVF 147
H T+ + +G VS V + G ID+ ++E K+ K L+I S +
Sbjct: 124 GHLTHGSPVNFSGKLFNVVSYGVDPETGLIDYDEVEKLAKEVKPK---LIIAGGSAY-SR 179
Query: 148 EENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNL-----HKTFCIPHG 202
+ E+ E G + +D A++ GL G + + + H ++ HKT P G
Sbjct: 180 PIDFKRFREIADEVGAYLMVDMAHV---AGLIAGGVHPNPLPHADVVTTTTHKTLRGPRG 236
Query: 203 G 203
G
Sbjct: 237 G 237
>gnl|CDD|223154 COG0076, GadB, Glutamate decarboxylase and related PLP-dependent
proteins [Amino acid transport and metabolism].
Length = 460
Score = 33.9 bits (78), Expect = 0.086
Identities = 18/99 (18%), Positives = 41/99 (41%), Gaps = 6/99 (6%)
Query: 75 QAQDAHHRNVCLIPVSAHGTNPASAQMAGMSVEPVSVRK-DGTIDFSDLETKVKKN--KE 131
++ + + +AH + +A+ G+ + V D ID LE + +N
Sbjct: 150 ESGKPGGKPNIVCSETAHFSFEKAARYLGLGLRRVPTVPTDYRIDVDALEEAIDENTIGG 209
Query: 132 TLSCLMITYPSTFGVFEENITDVCELIHEHGGQVYLDGA 170
+ T G ++I ++ ++ E+G +++D A
Sbjct: 210 VVVGTAGTTD--TGSI-DDIEELADIAEEYGIWLHVDAA 245
>gnl|CDD|99743 cd06450, DOPA_deC_like, DOPA decarboxylase family. This family
belongs to pyridoxal phosphate (PLP)-dependent aspartate
aminotransferase superfamily (fold I). The major groups
in this CD correspond to DOPA/tyrosine decarboxylase
(DDC), histidine decarboxylase (HDC), and glutamate
decarboxylase (GDC). DDC is active as a dimer and
catalyzes the decarboxylation of tyrosine. GDC catalyzes
the decarboxylation of glutamate and HDC catalyzes the
decarboxylation of histidine.
Length = 345
Score = 33.7 bits (78), Expect = 0.089
Identities = 19/91 (20%), Positives = 41/91 (45%), Gaps = 5/91 (5%)
Query: 84 VCLIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLS---CLMITY 140
V + AH + +A + V V V +DG +D LE + ++K ++ T
Sbjct: 97 VIVCSDQAHVSVEKAAAYLDVKVRLVPVDEDGRMDPEALEAAIDEDKAEGLNPIMVVATA 156
Query: 141 PST-FGVFEENITDVCELIHEHGGQVYLDGA 170
+T G + + ++ +L ++ +++D A
Sbjct: 157 GTTDTGAI-DPLEEIADLAEKYDLWLHVDAA 186
>gnl|CDD|215756 pfam00155, Aminotran_1_2, Aminotransferase class I and II.
Length = 357
Score = 33.8 bits (78), Expect = 0.093
Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 5/88 (5%)
Query: 86 LIPVSAHGTNPASAQMAGMSVEPVSVR--KDGTIDFSDLETKVKKNKETLSCLMITYPS- 142
L+P + + ++AG V + D +DF LE +K+ E ++ T P
Sbjct: 94 LVPAPTYPSYIRIFRLAGGEVVRYPLYSSNDFHLDFDALEAALKEATEGNKVVLHTSPHN 153
Query: 143 -TFGVF-EENITDVCELIHEHGGQVYLD 168
T V E + + +L EH + +D
Sbjct: 154 PTGTVATLEELEKLLDLAKEHNILLLVD 181
>gnl|CDD|99734 cd00609, AAT_like, Aspartate aminotransferase family. This family
belongs to pyridoxal phosphate (PLP)-dependent aspartate
aminotransferase superfamily (fold I). Pyridoxal
phosphate combines with an alpha-amino acid to form a
compound called a Schiff base or aldimine intermediate,
which depending on the reaction, is the substrate in
four kinds of reactions (1) transamination (movement of
amino groups), (2) racemization (redistribution of
enantiomers), (3) decarboxylation (removing COOH
groups), and (4) various side-chain reactions depending
on the enzyme involved. Pyridoxal phosphate (PLP)
dependent enzymes were previously classified into alpha,
beta and gamma classes, based on the chemical
characteristics (carbon atom involved) of the reaction
they catalyzed. The availability of several structures
allowed a comprehensive analysis of the evolutionary
classification of PLP dependent enzymes, and it was
found that the functional classification did not always
agree with the evolutionary history of these enzymes.
The major groups in this CD corresponds to Aspartate
aminotransferase a, b and c, Tyrosine, Alanine,
Aromatic-amino-acid, Glutamine phenylpyruvate,
1-Aminocyclopropane-1-carboxylate synthase,
Histidinol-phosphate, gene products of malY and cobC,
Valine-pyruvate aminotransferase and Rhizopine
catabolism regulatory protein.
Length = 350
Score = 33.1 bits (76), Expect = 0.17
Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 4/88 (4%)
Query: 86 LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPS--T 143
L+P + A+A++AG V PV + ++G + K +T L + P+ T
Sbjct: 87 LVPDPTYPGYEAAARLAGAEVVPVPLDEEGGFLLDLELLEAAKTPKT-KLLYLNNPNNPT 145
Query: 144 FGVF-EENITDVCELIHEHGGQVYLDGA 170
V EE + ++ EL +HG + D A
Sbjct: 146 GAVLSEEELEELAELAKKHGILIISDEA 173
>gnl|CDD|178323 PLN02721, PLN02721, threonine aldolase.
Length = 353
Score = 32.7 bits (75), Expect = 0.20
Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 26/167 (15%)
Query: 27 DQARGYEQLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCL 86
D GY+ LE ++ +I G + F P SG G + + C + + +
Sbjct: 33 DDVLGYDPTALRLEEEMAKIFGKEAALFVP-SGTMGNLISV-LVHCDVRGSEVILGDNSH 90
Query: 87 IPVSAHGTNPASAQMAGMS----VEPVSV--RKDGTIDFSDLETKVKKNKET------LS 134
I + +G G+S V P +V +DGT+D +E ++ + L
Sbjct: 91 IHLYENG---------GISTLGGVHPRTVKNNEDGTMDLDAIEAAIRPKGDDHFPTTRLI 141
Query: 135 CLMITYPSTFGVF--EENITDVCELIHEHGGQVYLDGANM-NAQVGL 178
CL T+ + G E V EL HG ++++DGA + NA V L
Sbjct: 142 CLENTHANCGGRCLSVEYTDKVGELAKRHGLKLHIDGARIFNASVAL 188
>gnl|CDD|235068 PRK02769, PRK02769, histidine decarboxylase; Provisional.
Length = 380
Score = 32.7 bits (75), Expect = 0.23
Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 91 AHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTFGVFEEN 150
H + A++ + ++ +G ID+ DL +K+K+NK + +T +N
Sbjct: 119 THYSVSKIARLLRIKSRVITSLPNGEIDYDDLISKIKENKNQPPIIFANIGTTMTGAIDN 178
Query: 151 ITDVCELIHEHG-GQVYL--DGA 170
I ++ E++ + G Y+ D A
Sbjct: 179 IKEIQEILKKIGIDDYYIHADAA 201
>gnl|CDD|153332 cd07648, F-BAR_FCHO, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of FCH domain Only proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Proteins in
this group have been named FCH domain Only (FCHO)
proteins. Vertebrates have two members, FCHO1 and FCHO2.
These proteins contain an F-BAR domain and a C-terminal
domain of unknown function named SAFF which is also
present in endophilin interacting protein 1. F-BAR
domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 261
Score = 32.3 bits (74), Expect = 0.27
Identities = 13/52 (25%), Positives = 23/52 (44%), Gaps = 2/52 (3%)
Query: 112 RKDGTI--DFSDLETKVKKNKETLSCLMITYPSTFGVFEENITDVCELIHEH 161
R++ + E K+KK ++ L+ Y + FE +TD C+ E
Sbjct: 143 RRENASPKEIEKAEAKLKKAQDEYKALVEKYNNIRADFETKMTDSCKRFQEI 194
>gnl|CDD|215829 pfam00266, Aminotran_5, Aminotransferase class-V. This domain is
found in amino transferases, and other enzymes including
cysteine desulphurase EC:4.4.1.-.
Length = 370
Score = 31.8 bits (73), Expect = 0.39
Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 9/90 (10%)
Query: 86 LIPVSAHGTNPASAQMA----GMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYP 141
L+ H N Q G +V + V +G +D LE + + ++ IT+
Sbjct: 92 LVTEMEHHANLVPWQELAKRTGATVRVIPVDPNGLLDLDALEKLLTPRTKLVA---ITHV 148
Query: 142 ST-FGVFEENITDVCELIHEHGGQVYLDGA 170
S G + ++ +L HE+G V +D A
Sbjct: 149 SNVTGTVNP-VEEIGKLAHEYGALVVVDAA 177
>gnl|CDD|99746 cd06453, SufS_like, Cysteine desulfurase (SufS)-like. This family
belongs to the pyridoxal phosphate (PLP)-dependent
aspartate aminotransferase superfamily (fold I). The
major groups in this CD correspond to cysteine
desulfurase (SufS) and selenocysteine lyase. SufS
catalyzes the removal of elemental sulfur and selenium
atoms from L-cysteine, L-cystine, L-selenocysteine, and
L-selenocystine to produce L-alanine; and selenocysteine
lyase catalyzes the decomposition of L-selenocysteine.
Length = 373
Score = 31.7 bits (73), Expect = 0.47
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 11/76 (14%)
Query: 99 AQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPST-FGVFEENITDV--- 154
A+ G ++ V V DG +D LE + + + ++ +T+ S G I V
Sbjct: 109 AERTGAKLKVVPVDDDGQLDLEALEKLLTERTKLVA---VTHVSNVLGT----INPVKEI 161
Query: 155 CELIHEHGGQVYLDGA 170
E+ HE G V +DGA
Sbjct: 162 GEIAHEAGVPVLVDGA 177
>gnl|CDD|236103 PRK07810, PRK07810, O-succinylhomoserine sulfhydrylase;
Provisional.
Length = 403
Score = 31.2 bits (71), Expect = 0.59
Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 2/39 (5%)
Query: 150 NITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDV 188
+I V EL H G +V LD N+ A L R G+DV
Sbjct: 173 DIAAVSELAHAAGAKVVLD--NVFATPLLQRGLPLGADV 209
>gnl|CDD|218796 pfam05889, SLA_LP_auto_ag, Soluble liver antigen/liver pancreas
antigen (SLA/LP autoantigen). This family consists of
several eukaryotic and archaeal proteins which are
related to the human soluble liver antigen/liver
pancreas antigen (SLA/LP autoantigen). Autoantibodies
are a hallmark of autoimmune hepatitis, but most are not
disease specific. Autoantibodies to soluble liver
antigen (SLA) and to liver and pancreas antigen (LP)
have been described as disease specific, occurring in
about 30% of all patients with autoimmune hepatitis. The
function of SLA/LP is unknown, however, it has been
suggested that the protein may function as a serine
hydroxymethyltransferase and may be an important enzyme
in the thus far poorly understood selenocysteine
pathway. The archaeal sequences Methanopyrus kandleri
spcS and MK0229 are annotated as being pyridoxal
phosphate-dependent enzymes.
Length = 386
Score = 31.0 bits (70), Expect = 0.66
Identities = 25/138 (18%), Positives = 54/138 (39%), Gaps = 10/138 (7%)
Query: 88 PVSAHGTNPASAQMAGMS---VEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPSTF 144
P ++ +A+ AG VE V D +D+ET +++ E + +++ S F
Sbjct: 107 PRIDQKSSIKAAERAGFEPRLVETVLEGDYLITDVNDVETIIEEKGEEVILAVLSTTSCF 166
Query: 145 -GVFEENITDVCELIHEHGGQVYLDGA----NMNAQVGLCRPGDYGS-DVSHLNLHKTFC 198
+ + ++ ++ E+ ++GA + + + G D +L K F
Sbjct: 167 APRSPDRVKEIAKICAEYDVPHLVNGAYGIQSEKYIHLIQQAAKVGRIDAVVQSLDKNFI 226
Query: 199 IPHGGGG-PGMGPIGVKS 215
+P GG ++
Sbjct: 227 VPVGGAIIAAFDESFIQE 244
>gnl|CDD|191635 pfam06903, VirK, VirK protein. This family consists of several
bacterial VirK proteins of around 145 residues in
length. The function of this family is unknown.
Length = 101
Score = 28.8 bits (65), Expect = 1.0
Identities = 8/32 (25%), Positives = 15/32 (46%)
Query: 90 SAHGTNPASAQMAGMSVEPVSVRKDGTIDFSD 121
P++ G+ ++ + DGT+ FSD
Sbjct: 31 PEGAGTPSTKTRGGVRIDAYRITSDGTLAFSD 62
>gnl|CDD|130884 TIGR01825, gly_Cac_T_rel, pyridoxal phosphate-dependent
acyltransferase, putative. This model represents an
enzyme subfamily related to three known enzymes; it
appears closest to glycine C-acteyltransferase, shows no
overlap with it in species distribution, and may share
that function. The three closely related enzymes are
glycine C-acetyltransferase (2-amino-3-ketobutyrate
coenzyme A ligase), 5-aminolevulinic acid synthase, and
8-amino-7-oxononanoate synthase. All transfer the
R-group (acetyl, succinyl, or 6-carboxyhexanoyl) from
coenzyme A to an amino acid (Gly, Gly, Ala,
respectively), with release of CO2 for the latter two
reactions.
Length = 385
Score = 30.6 bits (69), Expect = 1.1
Identities = 24/101 (23%), Positives = 41/101 (40%), Gaps = 13/101 (12%)
Query: 117 IDFSDLETKVKKNKETLSCLMITYPSTFGVFEEN-----ITDVCELIHEHGGQVYLDGAN 171
D DL+ +++N L++T GVF + + ++ EL +G Y+D A+
Sbjct: 147 ADMDDLDRVLRENPSYGKKLIVTD----GVFSMDGDVAPLPEIVELAERYGAVTYVDDAH 202
Query: 172 MNAQVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMGPIG 212
+ +G G V H L I G +G +G
Sbjct: 203 GSGVMGEAGRG----TVHHFGLEDKVDIQVGTLSKAIGVVG 239
>gnl|CDD|237409 PRK13520, PRK13520, L-tyrosine decarboxylase; Provisional.
Length = 371
Score = 30.2 bits (69), Expect = 1.2
Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 10/96 (10%)
Query: 34 QLIGELETDLCEITGYDKISFQPNSGAQGEYAGLRAIQCYHQAQDAHHRNVCLIPVSAHG 93
+++GEL L + GY I+ SG G A ++A++ A N+ ++P SAH
Sbjct: 66 EMLGEL-LHLPDAYGY--IT----SG--GTEANIQAVRAARNLAKAEKPNI-VVPESAHF 115
Query: 94 TNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKN 129
+ +A M G+ + + D +D +E + N
Sbjct: 116 SFDKAADMLGVELRRAPLDDDYRVDVKAVEDLIDDN 151
>gnl|CDD|99799 cd06202, Nitric_oxide_synthase, The ferredoxin-reductase (FNR) like
C-terminal domain of the nitric oxide synthase (NOS)
fuses with a heme-containing N-terminal oxidase domain.
The reductase portion is similar in structure to NADPH
dependent cytochrome-450 reductase (CYPOR), having an
inserted connecting sub-domain within the FAD binding
portion of FNR. NOS differs from CYPOR in a requirement
for the cofactor tetrahydrobiopterin and unlike most
CYPOR is dimeric. Nitric oxide synthase produces nitric
oxide in the conversion of L-arginine to L-citruline.
NOS has been implicated in a variety of processes
including cytotoxicity, anti-inflamation,
neurotransmission, and vascular smooth muscle
relaxation.
Length = 406
Score = 30.4 bits (69), Expect = 1.3
Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 9/63 (14%)
Query: 123 ETKVKKNKETLSCLMITY------PSTF--GVFEENITDVCELIHEHGGQVYLDG-ANMN 173
ET+ KNK L+ + P T+ + +E V + + GG +Y+ G M
Sbjct: 303 ETEEAKNKGVLTEVYTALSREPGKPKTYVQDLLKEQAESVYDALVREGGHIYVCGDVTMA 362
Query: 174 AQV 176
V
Sbjct: 363 EDV 365
>gnl|CDD|221493 pfam12256, TcdB_toxin_midN, Insecticide toxin TcdB
middle/N-terminal region. This domain family is found
in bacteria and archaea, and is typically between 164
and 180 amino acids in length. The family is found in
association with pfam05593. This family is the
N-terminal-sided middle region of the bacterial
insecticide toxin TcdB. This region appears related to
the FG-GAP repeat pfam01839.
Length = 171
Score = 29.3 bits (66), Expect = 1.4
Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Query: 206 PGMGPIGVKSHLAPFLPVHPLSSIDSSIGAVSAAHYGSAS 245
P + P + P + L SI +++GA + Y S++
Sbjct: 45 PHLTPRHWRYDFPADKP-YLLKSITNNMGATTTLEYRSSA 83
>gnl|CDD|233359 TIGR01329, cysta_beta_ly_E, cystathionine beta-lyase, eukaryotic.
This model represents cystathionine beta-lyase
(alternate name: beta-cystathionase), one of several
pyridoxal-dependent enzymes of cysteine, methionine, and
homocysteine metabolism. This enzyme is involved in the
biosynthesis of Met from Cys.
Length = 378
Score = 30.2 bits (68), Expect = 1.4
Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 2/49 (4%)
Query: 150 NITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTFC 198
+I + E+ H V +D M+ LC P + G+D+ + + K
Sbjct: 149 DIRKISEMAHAQNALVVVDNTMMSPL--LCNPLELGADIVYHSATKFLA 195
>gnl|CDD|131199 TIGR02144, LysX_arch, Lysine biosynthesis enzyme LysX. The family
of proteins found in this equivalog include the
characterized LysX from Thermus thermophilus which is
part of a well-organized lysine biosynthesis gene
cluster. LysX is believed to carry out an ATP-dependent
acylation of the amino group of alpha-aminoadipate in
the prokaryotic version of the fungal AAA lysine
biosynthesis pathway. No species having a sequence in
this equivalog contains the elements of the more common
diaminopimelate lysine biosythesis pathway, and none has
been shown to be a lysine auxotroph. These sequences
have mainly recieved the name of the related enzyme,
"ribosomal protein S6 modification protein RimK". RimK
has been characterized in E. coli, and acts by
ATP-dependent condensation of S6 with glutamate
residues.
Length = 280
Score = 29.3 bits (66), Expect = 2.4
Identities = 13/39 (33%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 264 LFRSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRLMDY 302
+F S GL+ +E V V +FK S + V++A +++Y
Sbjct: 237 IFESKERGLLVNE-VNHVPEFKNSVRVTGVNVAGEILEY 274
>gnl|CDD|218685 pfam05673, DUF815, Protein of unknown function (DUF815). This
family consists of several bacterial proteins of unknown
function.
Length = 248
Score = 29.0 bits (66), Expect = 2.4
Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 5/43 (11%)
Query: 238 AAHYGSASILPISWAYIRRLESHYKTLFRSSRSGLVAHEFVID 280
AA G LP+ +RR E+ L R RSG A +F+ D
Sbjct: 206 AARLG----LPLDEEELRR-EALQWALTRGGRSGRTARQFIDD 243
>gnl|CDD|182661 PRK10702, PRK10702, endonuclease III; Provisional.
Length = 211
Score = 28.4 bits (63), Expect = 4.1
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 139 TYPSTFGVFE---ENITDVCE-LIHEHGGQVYLDGANMNAQVGLCR 180
TY T G++ EN+ C L+ +H G+V D A + A G+ R
Sbjct: 74 TYIKTIGLYNSKAENVIKTCRILLEQHNGEVPEDRAALEALPGVGR 119
>gnl|CDD|216911 pfam02165, WT1, Wilm's tumour protein.
Length = 288
Score = 28.3 bits (63), Expect = 4.5
Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 185 GSDVSHLN--LHKTFCIPHGGGGPGMGPIGVKSHLAPFLPVHPLSSIDS 231
GSDV L+ L +P GGGG G+ P+G S AP L HP S S
Sbjct: 2 GSDVRDLSALLPPVSSLPGGGGGCGL-PVGGASQWAPVLDFHPGSPYSS 49
>gnl|CDD|237964 PRK15426, PRK15426, putative diguanylate cyclase YedQ; Provisional.
Length = 570
Score = 28.8 bits (65), Expect = 4.5
Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 5/52 (9%)
Query: 221 LPVHPLS-SIDSSIGAVSAAHYGSASILP----ISWAYIRRLESHYKTLFRS 267
+ VH L + G++S A ++ ISW IRR+ S+ L S
Sbjct: 342 VRVHTLREGVRGDFGSISIALTLLWALFTAMLLISWYVIRRMVSNMFVLQSS 393
>gnl|CDD|233947 TIGR02600, Verru_Chthon_A, Verru_Chthon cassette protein A. In
Verrucomicrobium spinosum and Chthoniobacter flavus, a
four-gene operon that includes proteins with an
N-terminal signal sequence for cleavage and methylation
recurs many times. Each operon is likely to encode a
membrane complex, the function of which is unknown. This
model represents a long protein from this putative
membrame complex, with members averaging about 1300
amino acids. The N-terminal region includes an apparent
signal sequence. The function is unknown. Most cassettes
are adjacent to an unusually large protein with both an
outer membrane autotransporter region and PEP-CTERM
putative protein-sorting motif [Cell envelope, Surface
structures].
Length = 1265
Score = 28.7 bits (64), Expect = 4.8
Identities = 10/36 (27%), Positives = 15/36 (41%)
Query: 86 LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSD 121
A A++ M V + V +DGT+D D
Sbjct: 169 TSGTVAPPGTDATSPRLPMPVRWLYVLQDGTLDVPD 204
>gnl|CDD|168128 PRK05613, PRK05613, O-acetylhomoserine
aminocarboxypropyltransferase; Validated.
Length = 437
Score = 28.3 bits (63), Expect = 5.2
Identities = 21/62 (33%), Positives = 30/62 (48%), Gaps = 6/62 (9%)
Query: 150 NITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTFCIPHGGGGPGMG 209
+I V E+ H + QV L N A L RP + G+DV +L K + G G G+G
Sbjct: 173 DIPAVAEVAHRN--QVPLIVDNTIATAALVRPLELGADVVVASLTKFYT----GNGSGLG 226
Query: 210 PI 211
+
Sbjct: 227 GV 228
>gnl|CDD|191647 pfam06947, DUF1290, Protein of unknown function (DUF1290). This
family consists of several bacterial small basic
proteins of around 100 residues in length. The function
of this family is unknown.
Length = 88
Score = 26.4 bits (59), Expect = 5.7
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 217 LAPFLPVHPLSSIDSSIGAVSAA 239
+P+L V L+++DS G + A
Sbjct: 6 YSPYLSVAVLAALDSVFGGIRAN 28
>gnl|CDD|214732 smart00583, SPK, domain in SET and PHD domain containing proteins
and protein kinases.
Length = 114
Score = 27.1 bits (61), Expect = 5.7
Identities = 9/29 (31%), Positives = 14/29 (48%)
Query: 105 SVEPVSVRKDGTIDFSDLETKVKKNKETL 133
++EP+ V +FS LE + ET
Sbjct: 16 AIEPLVVPLKVFEEFSKLEGNSLLSYETY 44
>gnl|CDD|130828 TIGR01767, MTRK, 5-methylthioribose kinase. This enzyme is
involved in the methionine salvage pathway in certain
bacteria.
Length = 370
Score = 28.3 bits (63), Expect = 5.8
Identities = 15/38 (39%), Positives = 16/38 (42%), Gaps = 3/38 (7%)
Query: 262 KTLFRSSRSGLVAHEFVIDVRDFKKSANIEAVDIAKRL 299
KTLF SS L V F N E DI +RL
Sbjct: 111 KTLFYSSDYALEPKVKKALVAQF---TNPELCDITERL 145
>gnl|CDD|233290 TIGR01141, hisC, histidinol-phosphate aminotransferase. Alternate
names: histidinol-phosphate transaminase; imidazole
acetol-phosphate transaminase Histidinol-phosphate
aminotransferase is a pyridoxal-phosphate dependent
enzyme [Amino acid biosynthesis, Histidine family].
Length = 346
Score = 27.6 bits (62), Expect = 8.8
Identities = 26/115 (22%), Positives = 45/115 (39%), Gaps = 17/115 (14%)
Query: 86 LIPVSAHGTNPASAQMAGMSVEPVSVRKDGTIDFSDLETKVKKNKETLSCLMITYPS--T 143
L+P + SA++ G V V + +DG +D D+ +K L + + P+ T
Sbjct: 99 LVPPPTYSMYEISAKIHGAEVVKVPLDEDGQLDLEDILV-AIDDKPKL--VFLCSPNNPT 155
Query: 144 FGVF-EENITDVCELIHEHGGQVYLDGANMNAQVGLCRPGDYGSDVSHLNLHKTF 197
+ +I V E E V +D A G++ + S L L +
Sbjct: 156 GNLLSRSDIEAVLERTPE-DALVVVDEAY----------GEFSGEPSTLPLLAEY 199
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.135 0.411
Gapped
Lambda K H
0.267 0.0798 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,128,314
Number of extensions: 1406093
Number of successful extensions: 1337
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1302
Number of HSP's successfully gapped: 55
Length of query: 302
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 206
Effective length of database: 6,679,618
Effective search space: 1376001308
Effective search space used: 1376001308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.4 bits)