RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254780439|ref|YP_003064852.1| carbamoyl phosphate synthase
large subunit [Candidatus Liberibacter asiaticus str. psy62]
(1162 letters)
>gnl|CDD|35591 KOG0370, KOG0370, KOG0370, Multifunctional pyrimidine synthesis
protein CAD (includes carbamoyl-phophate synthetase,
aspartate transcarbamylase, and glutamine
amidotransferase) [General function prediction only].
Length = 1435
Score = 807 bits (2085), Expect = 0.0
Identities = 429/1161 (36%), Positives = 639/1161 (55%), Gaps = 117/1161 (10%)
Query: 4 RQDLKTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLADA 63
R ++K +L++G+G + IGQA EFDYSG+QA KALKEE IL+N N AT+ T LAD
Sbjct: 374 RVEVKKVLVLGSGGLSIGQAGEFDYSGSQAIKALKEENIFTILINPNIATVQTSKGLADK 433
Query: 64 TYTEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDRYGVEMIGAKPETI 123
Y P+TPE V K+I+ ERPD IL T GGQTALN + L + GV +YGV+++G +TI
Sbjct: 434 VYFLPVTPEYVTKVIKAERPDGILLTFGGQTALNCGVELDKAGVFAQYGVKVLGTPIQTI 493
Query: 124 DKAEDRSLFSKAMQNIPLATPKSILANATDIKEHDRKLHEEERENLKKTLSKEELDAALY 183
EDR LF++A+ I E + + + ++ AL
Sbjct: 494 ITTEDRDLFARALNEI--------------------------NEKIAPSEAVSTIEEALE 527
Query: 184 ALELKWNLEENDRKHRYICHAMAVAVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLE 243
A E +G P+I+R ++ LGG G G A N E +
Sbjct: 528 AAE---------------------------RLGYPVIVRAAYALGGLGSGFANNEEELQD 560
Query: 244 IVENGLHASPTTEVLIEESVLGWKEYELEMMRDIKGNCIVVCSIENLDPMGVHTGDSITV 303
+ L + + ++L+E+S+ GWKE E E++RD NCI VC++EN DP+G+HTGDSI V
Sbjct: 561 LAAQAL--ALSPQILVEKSLKGWKEVEYEVVRDAYDNCITVCNMENFDPLGIHTGDSIVV 618
Query: 304 APALTLTDKEYQLMRNAAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSAL 363
AP+ TL+D+EYQ++R AI V++ +GV G N+Q+A+NP + + +IE+N R+SRSSAL
Sbjct: 619 APSQTLSDEEYQMLRTTAIKVIRHLGV-VGECNIQYALNPYSLEYRIIEVNARLSRSSAL 677
Query: 364 ASKATGFPIAKIAAKLAVGYTLDELGNDITGGKTPASFEPSIDYIVTKIPRFTFEKFPGS 423
ASKATG+P+A AAKLA+G L EL N +T T A FEPS+DY V KIPR+ KF
Sbjct: 678 ASKATGYPLAYTAAKLALGIPLPELKNSVTK-TTTACFEPSLDYCVVKIPRWDLSKFQRV 736
Query: 424 DVTLTTSMKSVGEVMAIGRTFAESLQKALRGLETGLTGLDEIHIPSMESDNDPNALRSAI 483
+ +SMKSVGEVMAIGRTF E+ QKALR ++ L G + L +
Sbjct: 737 STEIGSSMKSVGEVMAIGRTFEEAFQKALRMVDPSLLGFMSTPFL--------DDLDEEL 788
Query: 484 SIPCPDRLRTVAQALRLGVSVEETHQSSNIDPWFIQQIKMIVDVEHRIREH---GLPKDF 540
S P R+ +A AL G SV+ H+ + ID WF+ ++ IV++ + H LPK+
Sbjct: 789 STPTDRRVFAIAAALAKGYSVDRIHELTRIDKWFLYKLMNIVNIYKLLESHSLSSLPKEL 848
Query: 541 QNLQAIKAMGFSDARLSILSGIPCNEIRKIRHQMGLHPVFKCVDTCAGEFSSPTSYMYST 600
L K +GFSD +++ G +R++R ++G+HP K +DT A EF + T+Y+Y+T
Sbjct: 849 --LLRAKKLGFSDKQIAKFIGSTELAVRRLRKELGIHPFVKQIDTVAAEFPAQTNYLYTT 906
Query: 601 YETNFINKPVSEDKVSDRKKIVILGGGPNRIGQGIEFDYCCCHASFSLKEAGFETIMINC 660
Y D + +++LG G RIG +EFD+C + +L++ G +TIM+N
Sbjct: 907 YNA------TEHDVDFNEHGVMVLGSGVYRIGSSVEFDWCAVGCARTLRKLGKKTIMVNY 960
Query: 661 NPETVSTDYDIADRLYFESLTEEDILEILRVEQQKGELVGIIVQFGGQTPLKLSKILEKN 720
NPETVSTDYD DRLYFE ++ E +++I +E +G II+ GGQ P ++ L +N
Sbjct: 961 NPETVSTDYDECDRLYFEEISYERVMDIYELENSEG----IIISVGGQLPNNIALKLHRN 1016
Query: 721 QIPILGTQPDSIDLAEDRDRFQKLLMELDLNQPRNGISHSVEHARLIACEIGFPLLIRPS 780
+ ILGT P+ ID AE+R +F ++L + ++QP S+E A+ A ++G+P+L+RPS
Sbjct: 1017 GVKILGTSPEMIDSAENRFKFSRMLDSIGVDQPAWKELTSLEEAKKFAEKVGYPVLVRPS 1076
Query: 781 YVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPLLFDSYL 840
YVL G AM +VYSE+ L+ YL VS + HP++ ++
Sbjct: 1077 YVLSGAAMNVVYSESDLKSYL--EQASAVSPD-------------------HPVVISKFI 1115
Query: 841 SDAMEIDVDALCQNDQVIVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELISQTKAL 900
A EIDVDA+ + +V+V I EH+E AG+HSGD+ LP + LS + + +
Sbjct: 1116 EGAKEIDVDAVASDGKVLVHAISEHVENAGVHSGDATLVLPPQDLSADTLERIKDIAAKV 1175
Query: 901 AGALNVIGLINVQYAIKDGKIYILEVNPRASRTVPFIAKAIGFPVAKVAARIIAGESLDA 960
A AL + G N+Q KD ++ ++E N RASR+ PF++K +G +A R I G +
Sbjct: 1176 AKALKITGPFNMQIIAKDNELKVIECNVRASRSFPFVSKTLGVDFIALATRAIMGVPV-- 1233
Query: 961 SIAAYGKRPDLSQIKHFAVKESVFPFNKFPGVDILLGPEMRSTGEVIGIDQDFPLAFAKS 1020
L + AVK F F++ G D +LG EM STGEV +D A+ K+
Sbjct: 1234 ------PPDLLLHPDYVAVKVPQFSFSRLAGADPVLGVEMASTGEVACFGEDRYEAYLKA 1287
Query: 1021 QLGIGVDLPHEGTVFVSVRDADKKRIVPIIQNFKKLGFKIMATEGTARFLESHG-LETQK 1079
L G +P + + +S+ + K ++P ++ KLG+K+ AT GTA F + E +
Sbjct: 1288 MLSTGFKIPKKN-ILISI-GSYKPELLPSARDLAKLGYKLYATNGTADFYLENKYAEVSE 1345
Query: 1080 INKVLEGRPHIEDAISNRQVHLVINTTEGKKAIEDSKSLRRATLIRKIPYYTTIAGADAV 1139
+ R + N + LVIN ++ RR + IP T + A
Sbjct: 1346 EPTNDKLRELLA----NYNIDLVINLPRPSSFVDHGYKTRRLAVDFSIPLITDVKCAKLF 1401
Query: 1140 FQAIQALKAGNLEVHSLQSYR 1160
+A++ + +++ +
Sbjct: 1402 VEALKEKLS-HVDSMTSFRPV 1421
Score = 146 bits (369), Expect = 4e-35
Identities = 147/540 (27%), Positives = 234/540 (43%), Gaps = 70/540 (12%)
Query: 605 FINKPVSEDKVSDRKKIVILGGGPNRIGQGIEFDYCCCHASFSLKEAGFETIMINCNPET 664
FI +P + KK+++LG G IGQ EFDY A +LKE TI+IN N T
Sbjct: 364 FITEPAKAAPRVEVKKVLVLGSGGLSIGQAGEFDYSGSQAIKALKEENIFTILINPNIAT 423
Query: 665 VSTDYDIADRLYFESLTEEDILEILRVEQQKGELVGIIVQFGGQT----PLKLSKI--LE 718
V T +AD++YF +T E + ++++ E+ GI++ FGGQT ++L K
Sbjct: 424 VQTSKGLADKVYFLPVTPEYVTKVIKAERPD----GILLTFGGQTALNCGVELDKAGVFA 479
Query: 719 KNQIPILGTQPDSIDLAEDRDRFQKLLMELDLNQPRNGISHSVEHARLIACEIGFPLLIR 778
+ + +LGT +I EDRD F + L E++ + ++E A A +G+P+++R
Sbjct: 480 QYGVKVLGTPIQTIITTEDRDLFARALNEINEKIAPSEAVSTIEEALEAAERLGYPVIVR 539
Query: 779 PSYVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPLLFDS 838
+Y LGG N ++ L D + QI L +
Sbjct: 540 AAYALGGLGSG---FAN-NEEELQDLA----------AQALALSPQI---------LVEK 576
Query: 839 YLSDAMEIDV----DALCQNDQVIVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELI 894
L E++ DA D I V +E+ + GIH+GDS PS++LS + L
Sbjct: 577 SLKGWKEVEYEVVRDAY---DNCITVCNMENFDPLGIHTGDSIVVAPSQTLSDEEYQMLR 633
Query: 895 SQTKALAGALNVIGLINVQYAIKDGKI--YILEVNPRASRTVPFIAKAIGFPVAKVAARI 952
+ + L V+G N+QYA+ + I+EVN R SR+ +KA G+P+A AA++
Sbjct: 634 TTAIKVIRHLGVVGECNIQYALNPYSLEYRIIEVNARLSRSSALASKATGYPLAYTAAKL 693
Query: 953 IAGESLDAS--IAAYGKRPDLSQIKHFAVKESV-FPFNKFPGVDILLGPEMRSTGEVIGI 1009
G L + V + + +KF V +G M+S GEV+ I
Sbjct: 694 ALGIPLPELKNSVTKTTTACFEPSLDYCVVKIPRWDLSKFQRVSTEIGSSMKSVGEVMAI 753
Query: 1010 DQDFPLAFAKSQLGIGVDL------------------PHEGTVFVSVRDADKKRIVPIIQ 1051
+ F AF K+ + L P + VF K V I
Sbjct: 754 GRTFEEAFQKALRMVDPSLLGFMSTPFLDDLDEELSTPTDRRVFAIAAALAKGYSVDRIH 813
Query: 1052 NFKKLG----FKIMATEGTARFLESHGLETQKINKVLEGRPHIEDAISNRQVHLVINTTE 1107
++ +K+M + LESH L + +L + + S++Q+ I +TE
Sbjct: 814 ELTRIDKWFLYKLMNIVNIYKLLESHSLSSLPKELLLRAK---KLGFSDKQIAKFIGSTE 870
>gnl|CDD|30806 COG0458, CarB, Carbamoylphosphate synthase large subunit (split
gene in MJ) [Amino acid transport and metabolism /
Nucleotide transport and metabolism].
Length = 400
Score = 543 bits (1400), Expect = e-154
Identities = 281/449 (62%), Positives = 331/449 (73%), Gaps = 56/449 (12%)
Query: 13 IGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLADATYTEPITPE 72
IG+GPIVIGQA EFDYSGTQACKALKEEGY ++LVNSNPATIMTDP+LAD Y EPIT E
Sbjct: 1 IGSGPIVIGQAAEFDYSGTQACKALKEEGYGVVLVNSNPATIMTDPELADKVYIEPITKE 60
Query: 73 VVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDRYGVEMIGAKPETIDKAEDRSLF 132
V KIIEKERPDAILPT GGQTALN AL LK GVL++YGVE++G+ PE I+ AED
Sbjct: 61 PVEKIIEKERPDAILPTLGGQTALNAALELKEKGVLEKYGVEVVGSDPEAIEIAED---- 116
Query: 133 SKAMQNIPLATPKSILANATDIKEHDRKLHEEERENLKKTLSKEELDAALYALELKWNLE 192
++ K+ + + +
Sbjct: 117 ---------------------------------KKLFKEAMREIGIPVP----------- 132
Query: 193 ENDRKHRYICHAMAVAVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVENGLHAS 252
I H++ A + DEIG P+I++PSF LGG+GGGIAYN E EI+E GL AS
Sbjct: 133 ------SRIAHSVEEADEIADEIGYPVIVKPSFGLGGSGGGIAYNEEELEEIIEEGLRAS 186
Query: 253 PTTEVLIEESVLGWKEYELEMMRDIKGNCIVVCSIENLDPMGVHTGDSITVAPALTLTDK 312
P EVLIEES++GWKE+E E++RD K NCIVVC++ENLDPMGVHTGDSITVAPA TLTDK
Sbjct: 187 PVEEVLIEESIIGWKEFEYEVVRDGKDNCIVVCNMENLDPMGVHTGDSITVAPAQTLTDK 246
Query: 313 EYQLMRNAAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPI 372
EYQ++R+AAI V++EIG+E GG N+QFAV+P G++ VIE+NPRVSRSSALASKATG+PI
Sbjct: 247 EYQMLRDAAIKVIREIGIE-GGCNIQFAVDPGGGELYVIEINPRVSRSSALASKATGYPI 305
Query: 373 AKIAAKLAVGYTLDELGNDITGGKTPASFEPSIDYIVTKIPRFTFEKFPGSDVTLTTSMK 432
AK+AAKLAVGYTLDE+ NDIT G+TPASFEPS+DY+VTKIPRF FEKFPG+D L T MK
Sbjct: 306 AKVAAKLAVGYTLDEIRNDIT-GRTPASFEPSLDYVVTKIPRFDFEKFPGADRRLGTQMK 364
Query: 433 SVGEVMAIGRTFAESLQKALRGLETGLTG 461
SVGEVMAIGRTF E+LQKALR LE GL G
Sbjct: 365 SVGEVMAIGRTFEEALQKALRSLEIGLAG 393
Score = 222 bits (567), Expect = 5e-58
Identities = 133/424 (31%), Positives = 221/424 (52%), Gaps = 46/424 (10%)
Query: 624 LGGGPNRIGQGIEFDYCCCHASFSLKEAGFETIMINCNPETVSTDYDIADRLYFESLTEE 683
+G GP IGQ EFDY A +LKE G+ +++N NP T+ TD ++AD++Y E +T+E
Sbjct: 1 IGSGPIVIGQAAEFDYSGTQACKALKEEGYGVVLVNSNPATIMTDPELADKVYIEPITKE 60
Query: 684 DILEILRVEQQKGELVGIIVQFGGQTPL------KLSKILEKNQIPILGTQPDSIDLAED 737
+ +I+ E+ I+ GGQT L K +LEK + ++G+ P++I++AED
Sbjct: 61 PVEKIIEKER----PDAILPTLGGQTALNAALELKEKGVLEKYGVEVVGSDPEAIEIAED 116
Query: 738 RDRFQKLLMELDLNQPRNGISHSVEHARLIACEIGFPLLIRPSYVLGGRAMQIVYSENML 797
+ F++ + E+ + P I+HSVE A IA EIG+P++++PS+ LGG I Y+E
Sbjct: 117 KKLFKEAMREIGIPVPS-RIAHSVEEADEIADEIGYPVIVKPSFGLGGSGGGIAYNE--- 172
Query: 798 QDYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPLLFDSYLSDAMEIDVDALC-QNDQ 856
EE++ +L + + E + + + D
Sbjct: 173 -------------EELEEIIEEG-----LRASPVEEVLIEESIIGWKEFEYEVVRDGKDN 214
Query: 857 VIVVGIIEHIEEAGIHSGDSACSLPSRSLS----QQLKDELISQTKALAGALNVIGLINV 912
IVV +E+++ G+H+GDS P+++L+ Q L+D I + + + G N+
Sbjct: 215 CIVVCNMENLDPMGVHTGDSITVAPAQTLTDKEYQMLRDAAIKVIREIG----IEGGCNI 270
Query: 913 QYAIKD--GKIYILEVNPRASRTVPFIAKAIGFPVAKVAARIIAGESLD---ASIAAYGK 967
Q+A+ G++Y++E+NPR SR+ +KA G+P+AKVAA++ G +LD I
Sbjct: 271 QFAVDPGGGELYVIEINPRVSRSSALASKATGYPIAKVAAKLAVGYTLDEIRNDITGRTP 330
Query: 968 RPDLSQIKHFAVKESVFPFNKFPGVDILLGPEMRSTGEVIGIDQDFPLAFAKSQLGIGVD 1027
+ + K F F KFPG D LG +M+S GEV+ I + F A K+ + +
Sbjct: 331 ASFEPSLDYVVTKIPRFDFEKFPGADRRLGTQMKSVGEVMAIGRTFEEALQKALRSLEIG 390
Query: 1028 LPHE 1031
L +
Sbjct: 391 LAGD 394
>gnl|CDD|145768 pfam02786, CPSase_L_D2, Carbamoyl-phosphate synthase L chain, ATP
binding domain. Carbamoyl-phosphate synthase catalyses
the ATP-dependent synthesis of carbamyl-phosphate from
glutamine or ammonia and bicarbonate. This important
enzyme initiates both the urea cycle and the
biosynthesis of arginine and/or pyrimidines. The
carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes
is a heterodimer of a small and large chain. The small
chain promotes the hydrolysis of glutamine to ammonia,
which is used by the large chain to synthesize carbamoyl
phosphate. See pfam00988. The small chain has a GATase
domain in the carboxyl terminus. See pfam00117. The ATP
binding domain (this one) has an ATP-grasp fold.
Length = 211
Score = 241 bits (616), Expect = 1e-63
Identities = 83/190 (43%), Positives = 115/190 (60%), Gaps = 7/190 (3%)
Query: 203 HAMAVAVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVENGLHASPTT----EVL 258
A+ A EIG P+II+ +F GG G GIA N E E+ L +P +VL
Sbjct: 25 ETEEEALAAAKEIGYPVIIKAAFGGGGLGMGIARNEEELAELFALALAEAPAAFGNPQVL 84
Query: 259 IEESVLGWKEYELEMMRDIKGNCIVVCSIENLDPMGVHTGDSITVAPALTLTDKEYQLMR 318
+E+S+ G K E +++RD GNCI VC+ E D T SI VAP+ TLTD+E Q++R
Sbjct: 85 VEKSLKGPKHIEYQVLRDAHGNCITVCNRECSDQR--RTQKSIEVAPSQTLTDEERQMLR 142
Query: 319 NAAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPIAKIAAK 378
AA+ + + +G G V+FA++P +G+ IEMN R+ S ALA KATG+ +AK AAK
Sbjct: 143 EAAVKIARHLGYVGAG-TVEFALDPFSGEYYFIEMNTRLQVSHALAEKATGYDLAKEAAK 201
Query: 379 LAVGYTLDEL 388
+A+GY L EL
Sbjct: 202 IALGYPLPEL 211
Score = 112 bits (282), Expect = 6e-25
Identities = 52/228 (22%), Positives = 90/228 (39%), Gaps = 24/228 (10%)
Query: 737 DRDRFQKLLMELDLNQPRN--GISHSVEHARLIACEIGFPLLIRPSYVLGGRAMQIVYSE 794
D+ F+ + E + G + E A A EIG+P++I+ ++ GG M I +E
Sbjct: 1 DKVLFKAAMKEAGVPTVPGTAGPVETEEEALAAAKEIGYPVIIKAAFGGGGLGMGIARNE 60
Query: 795 NMLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPLLFDSYLSDAMEIDVDALC-Q 853
L + L + G +L + L I+ L
Sbjct: 61 EELAELFALAL-----------------AEAPAAFGNPQVLVEKSLKGPKHIEYQVLRDA 103
Query: 854 NDQVIVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELISQTKALAGALNVIGLINVQ 913
+ I V E ++ + S PS++L+ + + L +A L +G V+
Sbjct: 104 HGNCITVCNRECSDQ--RRTQKSIEVAPSQTLTDEERQMLREAAVKIARHLGYVGAGTVE 161
Query: 914 YAI--KDGKIYILEVNPRASRTVPFIAKAIGFPVAKVAARIIAGESLD 959
+A+ G+ Y +E+N R + KA G+ +AK AA+I G L
Sbjct: 162 FALDPFSGEYYFIEMNTRLQVSHALAEKATGYDLAKEAAKIALGYPLP 209
>gnl|CDD|145769 pfam02787, CPSase_L_D3, Carbamoyl-phosphate synthetase large chain,
oligomerisation domain. Carbamoyl-phosphate synthase
catalyses the ATP-dependent synthesis of
carbamyl-phosphate from glutamine or ammonia and
bicarbonate. The carbamoyl-phosphate synthase (CPS)
enzyme in prokaryotes is a heterodimer of a small and
large chain.
Length = 122
Score = 160 bits (407), Expect = 2e-39
Identities = 55/123 (44%), Positives = 81/123 (65%), Gaps = 1/123 (0%)
Query: 479 LRSAISIPCPDRLRTVAQALRLGVSVEETHQSSNIDPWFIQQIKMIVDVEHRIREHGLPK 538
L + P +R+ +A+ALR G SVEE H+ + IDPWF+++IK IV++E ++ G P
Sbjct: 1 LLEKLRHPTDERIFAIAEALRRGYSVEEIHELTKIDPWFLEKIKEIVELEKELK-KGDPL 59
Query: 539 DFQNLQAIKAMGFSDARLSILSGIPCNEIRKIRHQMGLHPVFKCVDTCAGEFSSPTSYMY 598
L K +GFSDA+++ L G+ E+RK+R ++G+ PV+K VDTCA EF + T Y Y
Sbjct: 60 SDDLLLKAKKLGFSDAQIAKLLGVTEEEVRKLRKELGIRPVYKMVDTCAAEFEAKTPYYY 119
Query: 599 STY 601
STY
Sbjct: 120 STY 122
>gnl|CDD|29636 cd01424, MGS_CPS_II, Methylglyoxal synthase-like domain from type II
glutamine-dependent carbamoyl phosphate synthetase (CSP).
CSP, a CarA and CarB heterodimer, catalyzes the
production of carbamoyl phosphate which is subsequently
employed in the metabolic pathways responsible for the
synthesis of pyrimidine nucleotides or arginine. The
MGS-like domain is the C-terminal domain of CarB and
appears to play a regulatory role in CPS function by
binding allosteric effector molecules, including UMP and
ornithine..
Length = 110
Score = 158 bits (401), Expect = 9e-39
Identities = 56/110 (50%), Positives = 76/110 (69%)
Query: 1032 GTVFVSVRDADKKRIVPIIQNFKKLGFKIMATEGTARFLESHGLETQKINKVLEGRPHIE 1091
GTVF+SV D DK V I + +LGFK++ATEGTA++L+ G+ + +NKV EGRP+I
Sbjct: 1 GTVFISVADRDKPEAVEIAKRLAELGFKLVATEGTAKYLQEAGIPVEVVNKVSEGRPNIV 60
Query: 1092 DAISNRQVHLVINTTEGKKAIEDSKSLRRATLIRKIPYYTTIAGADAVFQ 1141
D I N ++ LVINT GK+AI D S+RRA L K+PY+TT+ A A +
Sbjct: 61 DLIKNGEIQLVINTPSGKRAIRDGFSIRRAALEYKVPYFTTLDTARAAVE 110
>gnl|CDD|145348 pfam02142, MGS, MGS-like domain. This domain composes the whole
protein of methylglyoxal synthetase and the domain is
also found in Carbamoyl phosphate synthetase (CPS) where
it forms a regulatory domain that binds to the allosteric
effector ornithine. This family also includes inosicase.
The known structures in this family show a common
phosphate binding site.
Length = 92
Score = 103 bits (259), Expect = 4e-22
Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 1045 RIVPIIQNFKKLGFKIMATEGTARFLESHGLETQKINKVLEGRPH----IEDAISNRQVH 1100
+V + + +LGFK+ AT GTA+FL+ G+ T+ +NK EGRP I D I N ++
Sbjct: 1 GLVELAKALVELGFKLYATGGTAKFLKEAGIPTEVVNKTGEGRPGGRVQIGDLIKNGEID 60
Query: 1101 LVINTTEGKKA-IEDSKSLRRATLIRKIPYYT 1131
LVINT KA + D ++RRA IP T
Sbjct: 61 LVINTLYPFKATVHDGYAIRRAAENIDIPLPT 92
>gnl|CDD|144029 pfam00289, CPSase_L_chain, Carbamoyl-phosphate synthase L chain,
N-terminal domain. Carbamoyl-phosphate synthase
catalyses the ATP-dependent synthesis of
carbamyl-phosphate from glutamine or ammonia and
bicarbonate. This important enzyme initiates both the
urea cycle and the biosynthesis of arginine and/or
pyrimidines. The carbamoyl-phosphate synthase (CPS)
enzyme in prokaryotes is a heterodimer of a small and
large chain. The small chain promotes the hydrolysis of
glutamine to ammonia, which is used by the large chain
to synthesize carbamoyl phosphate. See pfam00988. The
small chain has a GATase domain in the carboxyl
terminus. See pfam00117.
Length = 109
Score = 92.5 bits (231), Expect = 5e-19
Identities = 36/125 (28%), Positives = 49/125 (39%), Gaps = 24/125 (19%)
Query: 7 LKTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLADATYT 66
+K +L+ G I + +AL+E G + VNSNP T+ T LAD Y
Sbjct: 1 IKKVLVANRGEI-----------AVRIIRALRELGIETVAVNSNPDTVSTHVRLADEAYF 49
Query: 67 EP--------ITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDRYGVEMIGA 118
+ E + I EKE DAI P G L+ + G+ IG
Sbjct: 50 LGPGPASESYLNIERILDIAEKEGADAIHP---GYGFLSENAEFAEACE--KAGITFIGP 104
Query: 119 KPETI 123
PE I
Sbjct: 105 SPEAI 109
Score = 89.1 bits (222), Expect = 7e-18
Identities = 34/126 (26%), Positives = 50/126 (39%), Gaps = 30/126 (23%)
Query: 619 KKIVILGGGPNRIGQGIEFDYCCCHASFSLKEAGFETIMINCNPETVSTDYDIADRLYFE 678
KK+++ G +L+E G ET+ +N NP+TVST +AD YF
Sbjct: 2 KKVLVANRGEI-----------AVRIIRALRELGIETVAVNSNPDTVSTHVRLADEAYFL 50
Query: 679 S--------LTEEDILEILRVEQQKGELVGIIVQFGGQTPL----KLSKILEKNQIPILG 726
L E IL+I E G G L + ++ EK I +G
Sbjct: 51 GPGPASESYLNIERILDIAEKE-------GADAIHPGYGFLSENAEFAEACEKAGITFIG 103
Query: 727 TQPDSI 732
P++I
Sbjct: 104 PSPEAI 109
>gnl|CDD|29632 cd00532, MGS-like, MGS-like domain. This domain composes the whole
protein of methylglyoxal synthetase, which catalyzes the
enolization of dihydroxyacetone phosphate (DHAP) to
produce methylglyoxal. The family also includes the
C-terminal domain in carbamoyl phosphate synthetase (CPS)
where it catalyzes the last phosphorylation of a
coaboxyphosphate intermediate to form the product
carbamoyl phosphate and may also play a regulatory role.
This family also includes inosine monophosphate
cyclohydrolase. The known structures in this family show
a common phosphate binding site..
Length = 112
Score = 77.7 bits (191), Expect = 2e-14
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 1034 VFVSVRDADKKRIVPIIQNFKKLGFKIMATEGTARFLESHGLETQKINKVLE-GRPHIED 1092
VF+SV D K +V + GF + AT GT+R L G+ + ++K E G P ++
Sbjct: 2 VFLSVSDHVKAMLVDLAPKLSSDGFPLFATGGTSRVLADAGIPVRAVSKRHEDGEPTVDA 61
Query: 1093 AISNR-QVHLVINTTEG---KKAIEDSKSLRRATLIRKIPYYTTIAGADAV 1139
AI+ + + +VIN + + ED +L R + KIP T A A V
Sbjct: 62 AIAEKGKFDVVINLRDPRRDRCTDEDGTALLRLARLYKIPVTTPNATAMFV 112
>gnl|CDD|30788 COG0439, AccC, Biotin carboxylase [Lipid metabolism].
Length = 449
Score = 74.9 bits (184), Expect = 1e-13
Identities = 66/295 (22%), Positives = 113/295 (38%), Gaps = 36/295 (12%)
Query: 208 AVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVENGLHASPTT----EVLIEESV 263
A+ +EIG P+I++ + GG G + N E E + V +E+ +
Sbjct: 144 ALAIAEEIGYPVIVKAAAGGGGRGMRVVRNEEELEAAFEAARGEAEAAFGNPRVYLEKFI 203
Query: 264 LGWKEYELEMMRDIKGNCIVV----CSIENLDPMGVHTGDSITVAPALTLTDKEYQLMRN 319
G + E++++ D GN I + CSI+ + AP+ LT++ + +
Sbjct: 204 EGPRHIEVQVLGDGHGNVIHLGERDCSIQRRHQKVIEE------APSPLLTEELREKIGE 257
Query: 320 AAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPIAKIAAKL 379
AA+ K IG G V+F NG+ IEMN R+ + TG + K ++
Sbjct: 258 AAVRAAKLIG-YRGAGTVEFLY-DSNGEFYFIEMNTRLQVEHPVTEMVTGIDLVKEQIRI 315
Query: 380 AVGYTLDELGNDIT----------GGKTP-ASFEPS----IDYIVTKIPRFTFEKFPGSD 424
A G L DI + P +F PS Y P +
Sbjct: 316 AAGEPLSLKQEDIKFRGHAIECRINAEDPLGNFLPSPGKITRYAPPGGPGVRVDSGVYDG 375
Query: 425 VTLTTSMKS-VGEVMAIGRTFAESLQKALRGLE----TGLTGLDEIHIPSMESDN 474
+ S +G+V+ GRT E++ + R L+ G+ + + +
Sbjct: 376 YRVPPYYDSMIGKVIVHGRTRDEAIARMRRALDELVIDGIKTNIPLLQEILRDPD 430
Score = 73.0 bits (179), Expect = 4e-13
Identities = 56/300 (18%), Positives = 110/300 (36%), Gaps = 54/300 (18%)
Query: 712 KLSKILEKNQIPILGTQPDSIDLAEDRDRFQKLLMELDLNQP--RNGISHSVEHARLIAC 769
++ + + +G ++I D+ ++L+ + + +G E A IA
Sbjct: 90 AFAEACAEAGLTFIGPSAEAIRRMGDKITARRLMAKAGVPVVPGSDGAVADNEEALAIAE 149
Query: 770 EIGFPLLIRPSYVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLL 829
EIG+P++++ + GGR M++V +E L+
Sbjct: 150 EIGYPVIVKAAAGGGGRGMRVVRNEEELEA------------------------AFEAAR 185
Query: 830 GTHPLLFDSYLSDAMEIDVDALCQNDQVIVVGIIEHIEEAGIHSGDSACSL--------- 880
G F + + ++ + + I V ++ IH G+ CS+
Sbjct: 186 GEAEAAFGNP-----RVYLEKFIEGPRHIEVQVLGDGHGNVIHLGERDCSIQRRHQKVIE 240
Query: 881 --PSRSLSQQLKDELISQTKALAGALNVIGLINVQYAI-KDGKIYILEVNPRASRTVPFI 937
PS L+++L++++ A + G V++ +G+ Y +E+N R P
Sbjct: 241 EAPSPLLTEELREKIGEAAVRAAKLIGYRGAGTVEFLYDSNGEFYFIEMNTRLQVEHPVT 300
Query: 938 AKAIGFPVAKVAARIIAGESLDASIAAYGKRPDLSQIKH-----FAVKESVFPFNKFPGV 992
G + K RI AGE L K+ D+ H ++ + F PG
Sbjct: 301 EMVTGIDLVKEQIRIAAGEPLSL------KQEDIKFRGHAIECRINAEDPLGNFLPSPGK 354
>gnl|CDD|31240 COG1038, PycA, Pyruvate carboxylase [Energy production and
conversion].
Length = 1149
Score = 66.1 bits (161), Expect = 5e-11
Identities = 51/192 (26%), Positives = 91/192 (47%), Gaps = 18/192 (9%)
Query: 208 AVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVENGLHASPTT----EVLIEESV 263
A++ +E G P++I+ + GG G + + ++ E E + EV +E+ V
Sbjct: 150 ALEFAEEYGYPVMIKAAAGGGGRGMRVVRSEADLAEAFERAKSEAKAAFGNDEVYVEKLV 209
Query: 264 LGWKEYELEMMRDIKGNCIVV----CSIENLDPMGVHTGDSITVAPALTLTDKEYQLMRN 319
K E++++ D GN + + CS++ + VAPA L+ + + +
Sbjct: 210 ENPKHIEVQILGDTHGNVVHLFERDCSVQR------RHQKVVEVAPAPYLSPELRDEICD 263
Query: 320 AAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPIAKIAAKL 379
A+ + + IG + G V+F V+ ++GK IE+NPR+ + + TG I K +
Sbjct: 264 DAVKLARNIGYINAGT-VEFLVD-EDGKFYFIEVNPRIQVEHTITEEITGIDIVKAQIHI 321
Query: 380 AVGYTL--DELG 389
A G TL ELG
Sbjct: 322 AAGATLHTPELG 333
Score = 57.2 bits (138), Expect = 3e-08
Identities = 67/290 (23%), Positives = 122/290 (42%), Gaps = 45/290 (15%)
Query: 684 DILEILRVEQQKG-ELVGIIVQFGGQTPLKLSKILEKNQIPILGTQPDSIDLAEDRDRFQ 742
I EI+R+ ++ G + + F + P + ++ + I +G +P+ +D+ D+ + +
Sbjct: 68 SIDEIIRIAKRSGADAIHPGYGFLSENP-EFARACAEAGITFIGPKPEVLDMLGDKVKAR 126
Query: 743 KLLMELDLN--QPRNGISHSVEHARLIACEIGFPLLIRPSYVLGGRAMQIVYSENMLQDY 800
++ + +G ++E A A E G+P++I+ + GGR M++V SE L +
Sbjct: 127 NAAIKAGVPVIPGTDGPIETIEEALEFAEEYGYPVMIKAAAGGGGRGMRVVRSEADLAEA 186
Query: 801 LLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPLLFDSYLSDAMEIDVDALCQNDQVIVV 860
E K + ND E+ V+ L +N + I V
Sbjct: 187 FERA-----KSEAKAAFGND------------------------EVYVEKLVENPKHIEV 217
Query: 861 GIIEHIEEAGIHSGDSACSL-----------PSRSLSQQLKDELISQTKALAGALNVIGL 909
I+ +H + CS+ P+ LS +L+DE+ LA + I
Sbjct: 218 QILGDTHGNVVHLFERDCSVQRRHQKVVEVAPAPYLSPELRDEICDDAVKLARNIGYINA 277
Query: 910 INVQYAI-KDGKIYILEVNPRASRTVPFIAKAIGFPVAKVAARIIAGESL 958
V++ + +DGK Y +EVNPR + G + K I AG +L
Sbjct: 278 GTVEFLVDEDGKFYFIEVNPRIQVEHTITEEITGIDIVKAQIHIAAGATL 327
>gnl|CDD|35590 KOG0369, KOG0369, KOG0369, Pyruvate carboxylase [Energy production
and conversion].
Length = 1176
Score = 59.2 bits (143), Expect = 7e-09
Identities = 51/192 (26%), Positives = 91/192 (47%), Gaps = 20/192 (10%)
Query: 208 AVQALDEIGLPLIIRPSFTLGGTGGGIAYNRS------EFLEIVENGLHASPTTEVLIEE 261
A++ + E GLP+II+ ++ GG G G+ RS F L A + +E+
Sbjct: 176 ALEFVKEYGLPVIIKAAY--GGGGRGMRVVRSGEDVEEAFQRAYSEALAAFGNGTLFVEK 233
Query: 262 SVLGWKEYELEMMRDIKGNCIVV----CSIENLDPMGVHTGDSITVAPALTLTDKEYQLM 317
+ + E++++ D GN + + CS++ V +APA TL + +
Sbjct: 234 FLEKPRHIEVQLLGDKHGNVVHLYERDCSVQRRHQKVVE------IAPAKTLPPEVRDAI 287
Query: 318 RNAAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPIAKIAA 377
A+ + K +G E+ G V+F V+ K G+ IE+NPR+ + + TG + +
Sbjct: 288 LTDAVKLAKHVGYENAGT-VEFLVDQK-GRHYFIEVNPRLQVEHTVTEEITGVDLVQAQI 345
Query: 378 KLAVGYTLDELG 389
+A G +L +LG
Sbjct: 346 HVAEGASLPDLG 357
Score = 41.5 bits (97), Expect = 0.001
Identities = 66/295 (22%), Positives = 112/295 (37%), Gaps = 57/295 (19%)
Query: 760 SVEHARLIACEIGFPLLIRPSYVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIKRQYPN 819
+VE A E G P++I+ +Y GGR M++V S + V E +R Y
Sbjct: 172 TVEEALEFVKEYGLPVIIKAAYGGGGRGMRVVRS--------GED----VEEAFQRAYSE 219
Query: 820 DKTGQINTLLGTHPLLFDSYLSDAMEIDVDALCQNDQVIVVGIIEHIEEAGIHSGDSACS 879
G L + +L I+V L +V H + CS
Sbjct: 220 ALAA-----FGNGTLFVEKFLEKPRHIEVQLLGDKHGNVV------------HLYERDCS 262
Query: 880 L-----------PSRSLSQQLKDELISQTKALAGALNVIGLINVQYAI-KDGKIYILEVN 927
+ P+++L +++D +++ LA + V++ + + G+ Y +EVN
Sbjct: 263 VQRRHQKVVEIAPAKTLPPEVRDAILTDAVKLAKHVGYENAGTVEFLVDQKGRHYFIEVN 322
Query: 928 PRASRTVPFIAKAIGFPVAKVAARIIAGESLDASIAAYGKRPDLSQIKHFAVKESVF--- 984
PR + G + + + G SL G D + FA++ V
Sbjct: 323 PRLQVEHTVTEEITGVDLVQAQIHVAEGASLP----DLGLTQDKITTRGFAIQCRVTTED 378
Query: 985 PFNKF-PGVDILLGPEMRSTGEVIGIDQDFPLAFAKSQLGIGVDLPHEGTVFVSV 1038
P F P + E+ +GE +GI D AFA + + PH ++ V V
Sbjct: 379 PAKGFQPDTGRI---EVFRSGEGMGIRLDGASAFAGAVIS-----PHYDSLLVKV 425
>gnl|CDD|35459 KOG0238, KOG0238, KOG0238, 3-Methylcrotonyl-CoA carboxylase,
biotin-containing subunit/Propionyl-CoA carboxylase,
alpha chain/Acetyl-CoA carboxylase, biotin carboxylase
subunit [Lipid transport and metabolism, Amino acid
transport and metabolism].
Length = 670
Score = 58.4 bits (141), Expect = 1e-08
Identities = 47/194 (24%), Positives = 86/194 (44%), Gaps = 16/194 (8%)
Query: 208 AVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVENGLHASPTT----EVLIEESV 263
A + EIG P++I+ + GG G IA++ EF E +E+ + + +L+E+ +
Sbjct: 140 AKKVAREIGYPVMIKATAGGGGKGMRIAWSEEEFEEGLESAKQEAAKSFGDDGMLLEKFI 199
Query: 264 LGWKEYELEMMRDIKGNCIVV----CSIENLDPMGVHTGDSITVAPALTLTDKEYQLMRN 319
+ E+++ D GN + + CS++ I APA L ++ + +
Sbjct: 200 DNPRHIEVQVFGDKHGNAVHLGERDCSVQR------RNQKIIEEAPAPNLPEETRRALGE 253
Query: 320 AAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPIAKIAAKL 379
AA+ K +G G V+F V+ K +EMN R+ + TG + + ++
Sbjct: 254 AAVRAAKAVGYVGAG-TVEFIVDSK-DNFYFMEMNTRLQVEHPVTEMITGTDLVEWQIRV 311
Query: 380 AVGYTLDELGNDIT 393
A G L +I
Sbjct: 312 AAGEPLPLKQEEIP 325
Score = 58.4 bits (141), Expect = 1e-08
Identities = 45/217 (20%), Positives = 87/217 (40%), Gaps = 41/217 (18%)
Query: 755 NGISHSVEHARLIACEIGFPLLIRPSYVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIK 814
+G S E A+ +A EIG+P++I+ + GG+ M+I +SE ++ L +E
Sbjct: 131 HGEDQSDEEAKKVAREIGYPVMIKATAGGGGKGMRIAWSEEEFEEGLE-----SAKQEAA 185
Query: 815 RQYPNDKTGQINTLLGTHPLLFDSYLSDAMEIDVDALCQNDQVIVVGIIEHIEEAGIHSG 874
+ + +D +L + ++ + I+V + A +H G
Sbjct: 186 KSFGDDG------------MLLEKFIDNPRHIEVQVFG-----------DKHGNA-VHLG 221
Query: 875 DSACSL-----------PSRSLSQQLKDELISQTKALAGALNVIGLINVQYAI-KDGKIY 922
+ CS+ P+ +L ++ + L A A+ +G V++ + Y
Sbjct: 222 ERDCSVQRRNQKIIEEAPAPNLPEETRRALGEAAVRAAKAVGYVGAGTVEFIVDSKDNFY 281
Query: 923 ILEVNPRASRTVPFIAKAIGFPVAKVAARIIAGESLD 959
+E+N R P G + + R+ AGE L
Sbjct: 282 FMEMNTRLQVEHPVTEMITGTDLVEWQIRVAAGEPLP 318
>gnl|CDD|34383 COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit
[Lipid metabolism].
Length = 645
Score = 58.4 bits (141), Expect = 1e-08
Identities = 52/196 (26%), Positives = 86/196 (43%), Gaps = 16/196 (8%)
Query: 206 AVAVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVEN----GLHASPTTEVLIEE 261
A V +EIG P++I+ S GG G + EF E +E+ + V IE+
Sbjct: 142 AELVAIAEEIGYPVLIKASAGGGGKGMRVVETPEEFAEALESARREAKASFGDDRVFIEK 201
Query: 262 SVLGWKEYELEMMRDIKGNCIVV----CSIENLDPMGVHTGDSITVAPALTLTDKEYQLM 317
+ + E+++ D GN + + CS++ I APA LT++ + M
Sbjct: 202 YLDKPRHIEIQVFADQHGNVVHLGERDCSLQR------RHQKVIEEAPAPFLTEETREAM 255
Query: 318 RNAAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPIAKIAA 377
AA+A K +G G V+F V+ +G +EMN R+ + TG + +
Sbjct: 256 GEAAVAAAKAVGYVGAG-TVEFIVDA-DGNFYFLEMNTRLQVEHPVTELITGIDLVEWQI 313
Query: 378 KLAVGYTLDELGNDIT 393
++A G L +DI
Sbjct: 314 RVASGEKLPFTQDDIP 329
Score = 49.9 bits (119), Expect = 4e-06
Identities = 54/256 (21%), Positives = 95/256 (37%), Gaps = 43/256 (16%)
Query: 717 LEKNQIPILGTQPDSIDLAEDRDRFQKLLMELD--LNQPRNGISHSVEHARLIACEIGFP 774
+E + +G +I D+ +KL E +G IA EIG+P
Sbjct: 95 VEDAGLVFIGPSAGAIRAMGDKIAAKKLAAEAGVPTVPGYHGPIQDAAELVAIAEEIGYP 154
Query: 775 LLIRPSYVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPL 834
+LI+ S GG+ M++V + + L E K + +D+ +
Sbjct: 155 VLIKASAGGGGKGMRVVETPEEFAEAL-----ESARREAKASFGDDR------------V 197
Query: 835 LFDSYLSDAMEIDVDALCQNDQVIVVGIIEHIEEAGIHSGDSACSL-----------PSR 883
+ YL I++ +V H G+ CSL P+
Sbjct: 198 FIEKYLDKPRHIEIQVFADQHGNVV------------HLGERDCSLQRRHQKVIEEAPAP 245
Query: 884 SLSQQLKDELISQTKALAGALNVIGLINVQYAI-KDGKIYILEVNPRASRTVPFIAKAIG 942
L+++ ++ + A A A+ +G V++ + DG Y LE+N R P G
Sbjct: 246 FLTEETREAMGEAAVAAAKAVGYVGAGTVEFIVDADGNFYFLEMNTRLQVEHPVTELITG 305
Query: 943 FPVAKVAARIIAGESL 958
+ + R+ +GE L
Sbjct: 306 IDLVEWQIRVASGEKL 321
>gnl|CDD|29635 cd01423, MGS_CPS_I_III, Methylglyoxal synthase-like domain found in
pyr1 and URA1-like carbamoyl phosphate synthetases (CPS),
including ammonia-dependent CPS Type I, and
glutamine-dependent CPS Type III. These are multidomain
proteins, in which MGS is the C-terminal domain..
Length = 116
Score = 56.4 bits (136), Expect = 5e-08
Identities = 27/104 (25%), Positives = 48/104 (46%), Gaps = 5/104 (4%)
Query: 1034 VFVSVRDADKKRIVPIIQNFKKLGFKIMATEGTARFLESHGLETQKINKVLE----GRPH 1089
+ +S+ K ++P Q KLG+K+ ATEGTA FL +G+ + E +P
Sbjct: 3 ILISIGSYSKPELLPTAQKLSKLGYKLYATEGTADFLLENGIPVTPVAWPSEEPQNDKPS 62
Query: 1090 IEDAISNRQVHLVIN-TTEGKKAIEDSKSLRRATLIRKIPYYTT 1132
+ + ++ ++ LVIN + K + D+ + R T
Sbjct: 63 LRELLAEGKIDLVINLPSNRGKRVLDNDYVMRRAADDFAVPLIT 106
>gnl|CDD|35589 KOG0368, KOG0368, KOG0368, Acetyl-CoA carboxylase [Lipid transport
and metabolism].
Length = 2196
Score = 52.7 bits (126), Expect = 7e-07
Identities = 40/156 (25%), Positives = 77/156 (49%), Gaps = 15/156 (9%)
Query: 208 AVQALDEIGLPLIIRPSFTLGGTGGGI--AYNRSEFLEIVENGLHASPTTEVLIEESVLG 265
++A ++IG P++I+ S GG G GI N +F + + + P + + + +
Sbjct: 231 GLEAAEKIGFPVMIKASE--GGGGKGIRKVENEDDFKALFKQVQNEVPGSPIFLMKLADQ 288
Query: 266 WKEYELEMMRDIKGNCIVV----CSIENLDPMGVHTGDSITVAPALTLTDKEYQLMRNAA 321
+ E++++ D GN I + CSI+ + T+AP T ++ M AA
Sbjct: 289 ARHLEVQLLADQYGNVISLFGRDCSIQRRHQKIIEEA-PATIAPPET-----FKKMEQAA 342
Query: 322 IAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRV 357
+ + K +G S G V++ +P +G+ +E+NPR+
Sbjct: 343 VRLAKLVGYVSAG-TVEYLYSPDDGEYYFLELNPRL 377
>gnl|CDD|32413 COG2232, COG2232, Predicted ATP-dependent carboligase related to
biotin carboxylase [General function prediction only].
Length = 389
Score = 49.2 bits (117), Expect = 6e-06
Identities = 77/386 (19%), Positives = 140/386 (36%), Gaps = 69/386 (17%)
Query: 647 SLKEAGFETIMI------NCNPETVSTDYDIADRLY--FESLTEEDILEILRVEQQKGEL 698
S + GFE + + + +S + L FE+L E+ ++E E ++
Sbjct: 27 SASKLGFEVYSVQYYDPADLPGDAISYLRERPGELLGRFENLDEQKLIEAA--EDLAEDV 84
Query: 699 VGIIVQFGGQTPLKLSKILEKNQIPILGTQPDS-IDLAEDRDRFQKLLMELDLNQPRNGI 757
I+ F G L+ S + + G +P+ + A ++ +F + L L + +P
Sbjct: 85 DAPIIPFSGFEALRTSG---ELGCEVAGNEPEVKVVEASNKLKFYRKLEVLGMPEPSEKK 141
Query: 758 SHSVEHARLIACEIGFPLLIRPSYVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIKRQY 817
+E E L+++P GG + + E D PG + +E
Sbjct: 142 IEPLE-------EGEKTLILKPVSGAGGLVELVKFDEE-------DPPPGFIFQEFIEGR 187
Query: 818 PNDKTGQINTLLGTHPLLFDSYLSDAMEIDVDALCQNDQVIVVGIIEHIEEAGIHSGDSA 877
P + F S SDA+ + V NDQ+I + ++ G+
Sbjct: 188 PVSVS-------------FISNGSDALTLAV-----NDQII--DGLRGEYSQFVYKGNLT 227
Query: 878 CSLPSRSLSQQLKDELISQTKALAGALNVIGLINVQYAIKDGKIYILEVNPRASRTVPFI 937
P + + L + L L ++G V + + D Y++EVNPR T+ I
Sbjct: 228 ---PFPYEEVEEAERLAEE---LVEELGLVGSNGVDFVLNDKGPYVIEVNPRIQGTLECI 281
Query: 938 AKAIGFPVAKVAARIIAGESLDASIAAYGKRPDLSQIKHFAVKESVFPFNKFPGVDILLG 997
++ G + ++ + GE +RP + +A K ++ V IL
Sbjct: 282 ERSSGINLFRLHIQAFDGE--------LPERPK---PRGYACKRILYAPRTVR-VPILKL 329
Query: 998 PEMR---STGEVIGIDQDFPLAFAKS 1020
G VI + A S
Sbjct: 330 SWTHDIPRPGTVIEKGEPLCSVIASS 355
>gnl|CDD|31374 COG1181, DdlA, D-alanine-D-alanine ligase and related ATP-grasp
enzymes [Cell envelope biogenesis, outer membrane].
Length = 317
Score = 44.5 bits (105), Expect = 2e-04
Identities = 33/149 (22%), Positives = 61/149 (40%), Gaps = 13/149 (8%)
Query: 216 GLPLIIRPSFTLGGTGGGIAYNRSEFLEIVENGLHASPTTEVLIEESVLGWKEYEL-EMM 274
G PL ++P+ G + +E +VL E+ + G +E E+ +
Sbjct: 142 GFPLFVKPAREGSSVGRSPVNVEGDLQSALELAFKYD--RDVLREQGITG-REIEVGVLG 198
Query: 275 RDIKGNCIVVCSIENLDPM-------GVHTGDSITVAPALTLTDKEYQLMRNAAIAVLKE 327
D + + + I + TG + PA LTD+ ++ ++ A+ K
Sbjct: 199 NDYEEQALPLGEIPPKGEEFYDYEAKYLSTGGAQYDIPA-GLTDEIHEEIKELALRAYKA 257
Query: 328 IGVESGGANVQFAVNPKNGKMVVIEMNPR 356
+G G A V F V+ G+ V++E+N
Sbjct: 258 LGC-LGLARVDFFVDDDEGEFVLLEVNTN 285
Score = 42.2 bits (99), Expect = 0.001
Identities = 52/257 (20%), Positives = 90/257 (35%), Gaps = 44/257 (17%)
Query: 716 ILEKNQIPILGTQPDSIDLAEDRDRFQKLLMELDLNQPRNGISHSVEHARLIACEI---- 771
+LE IP +G + A D+ ++L L E++ +I E+
Sbjct: 82 LLELLGIPYVGKGVLASAGAMDKIVTKRLFKAEGLPVAPYVALTRDEYSSVIVEEVEEGL 141
Query: 772 GFPLLIRPSYVLGGRAMQIVYSENMLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGT 831
GFPL ++P+ V E LQ L E + D+ +
Sbjct: 142 GFPLFVKPAREGSSVGRSPVNVEGDLQSAL---------ELAFKY---DRDVLREQGI-- 187
Query: 832 HPLLFDSYLSDAMEIDVDALCQNDQVIVVGIIEHI---------EEAGIHSGDSACSLPS 882
EI+V L + + + + E E + +G + +P+
Sbjct: 188 ----------TGREIEVGVLGNDYEEQALPLGEIPPKGEEFYDYEAKYLSTGGAQYDIPA 237
Query: 883 RSLSQQLKDELISQTKALAGALNVIGLINVQYAIKD--GKIYILEVNPRASRT----VPF 936
L+ ++ +E+ AL +GL V + + D G+ +LEVN T P
Sbjct: 238 G-LTDEIHEEIKELALRAYKALGCLGLARVDFFVDDDEGEFVLLEVNTNPGMTAMSLFPK 296
Query: 937 IAKAIGFPVAKVAARII 953
A A G A + R +
Sbjct: 297 AAAAAGISFAILVLRFV 313
>gnl|CDD|176178 cd05188, MDR, Medium chain reductase/dehydrogenase
(MDR)/zinc-dependent alcohol dehydrogenase-like family.
The medium chain reductase/dehydrogenases
(MDR)/zinc-dependent alcohol dehydrogenase-like family,
which contains the zinc-dependent alcohol dehydrogenase
(ADH-Zn) and related proteins, is a diverse group of
proteins related to the first identified member, class I
mammalian ADH. MDRs display a broad range of activities
and are distinguished from the smaller short chain
dehydrogenases (~ 250 amino acids vs. the ~ 350 amino
acids of the MDR). The MDR proteins have 2 domains: a
C-terminal NAD(P) binding-Rossmann fold domain of a
beta-alpha form and an N-terminal catalytic domain with
distant homology to GroES. The MDR group contains a
host of activities, including the founding alcohol
dehydrogenase (ADH) , quinone reductase, sorbitol
dehydrogenase, formaldehyde dehydrogenase, butanediol
DH, ketose reductase, cinnamyl reductase, and numerous
others. The zinc-dependent alcohol dehydrogenases (ADHs)
catalyze the NAD(P)(H)-dependent interconversion of
alcohols to aldehydes or ketones. ADH-like proteins
typically form dimers (typically higher plants, mammals)
or tetramers (yeast, bacteria), and generally have 2
tightly bound zinc atoms per subunit, a catalytic zinc
at the active site and a structural zinc in a lobe of
the catalytic domain. The active site zinc is
coordinated by a histidine, two cysteines, and a water
molecule. The second zinc seems to play a structural
role, affects subunit interactions, and is typically
coordinated by 4 cysteines. Other MDR members have only
a catalytic zinc, and some contain no coordinated zinc.
Length = 271
Score = 43.1 bits (102), Expect = 5e-04
Identities = 33/134 (24%), Positives = 49/134 (36%), Gaps = 27/134 (20%)
Query: 8 KTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLA-----D 62
T+L++GAG + G Q KA G R+I+ + + + +LA D
Sbjct: 136 DTVLVLGAGGV--GLLA------AQLAKAA---GARVIVTDRSDEKL----ELAKELGAD 180
Query: 63 AT--YTEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDRYGVEMIGAKP 120
Y E E ++ D ++ GG L AL L R G R V +G
Sbjct: 181 HVIDYKEE-DLEEELRLTGGGGADVVIDAVGGPETLAQALRLLRPGG--RIVV--VGGTS 235
Query: 121 ETIDKAEDRSLFSK 134
+ R L K
Sbjct: 236 GGPPLDDLRRLLFK 249
>gnl|CDD|29633 cd01421, IMPCH, Inosine monophosphate cyclohydrolase domain. This is
the N-terminal domain in the purine biosynthesis pathway
protein ATIC (purH). The bifunctional ATIC protein
contains a C-terminal ATIC formylase domain that
formylates 5-aminoimidazole-4-carboxamide-ribonucleotide.
The IMPCH domain then converts the
formyl-5-aminoimidazole-4-carboxamide-ribonucleotide to
inosine monophosphate. This is the final step in de novo
purine production..
Length = 187
Score = 41.7 bits (98), Expect = 0.001
Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 13/75 (17%)
Query: 1034 VFVSVRDADKKRIVPIIQNFKKLGFKIMATEGTARFLESHGLETQKINKV------LEGR 1087
+SV +DK +V + +LG +I++T GTA+FL+ G+ ++ + L GR
Sbjct: 3 ALISV--SDKTGLVEFAKELVELGVEILSTGGTAKFLKEAGIPVTDVSDITGFPEILGGR 60
Query: 1088 -----PHIEDAISNR 1097
P I I R
Sbjct: 61 VKTLHPKIHGGILAR 75
>gnl|CDD|30487 COG0138, PurH, AICAR transformylase/IMP cyclohydrolase PurH (only IMP
cyclohydrolase domain in Aful) [Nucleotide transport and
metabolism].
Length = 515
Score = 39.4 bits (92), Expect = 0.005
Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 8/61 (13%)
Query: 1033 TVFVSVRDADKKRIVPIIQNFKKLGFKIMATEGTARFLESHGLETQKINKV------LEG 1086
+SV +DK IV + +LG +I++T GTA+ L G+ +++ + L+G
Sbjct: 4 RALLSV--SDKTGIVEFAKALVELGVEILSTGGTAKLLAEAGIPVTEVSDITGFPEMLDG 61
Query: 1087 R 1087
R
Sbjct: 62 R 62
>gnl|CDD|145680 pfam02655, ATP-grasp_3, ATP-grasp domain. No functional
information or experimental verification of function is
known in this family. This family appears to be an
ATP-grasp domain (Pers. obs. A Bateman).
Length = 160
Score = 38.1 bits (89), Expect = 0.013
Identities = 27/93 (29%), Positives = 43/93 (46%), Gaps = 14/93 (15%)
Query: 838 SYLSDAMEIDVDALCQNDQVIVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELISQT 897
S LSD L N Q +I++ +++G PSR+ +LK+EL
Sbjct: 77 SLLSDG--EKALPLSVNRQ-----LIDNAGSGFVYAG---NLTPSRT---ELKEELEELA 123
Query: 898 KALAGAL-NVIGLINVQYAIKDGKIYILEVNPR 929
+ + AL + G + V + D Y++EVNPR
Sbjct: 124 EEVVEALPGLRGYVGVDLVLTDNGPYVIEVNPR 156
Score = 33.1 bits (76), Expect = 0.42
Identities = 35/172 (20%), Positives = 64/172 (37%), Gaps = 22/172 (12%)
Query: 188 KWNLEENDRKHRYICHAMAVAVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVEN 247
K + + A + +E I++P GG G N E E +EN
Sbjct: 4 KLKTYKALKNAGVPTPETLSAEEPTEE-EKKYIVKPRDGCGGEGVRFVENGREDEEFIEN 62
Query: 248 GLHASPTTEVLIEESVLGWKEYELEMMRDIKGNCIVVCS---IENLDPMGVHTGDSITVA 304
V+I+E + G + + ++ D + + + I+N V+ G+
Sbjct: 63 ---------VIIQEFIEG-EPLSVSLLSDGEKALPLSVNRQLIDNAGSGFVYAGNLT--- 109
Query: 305 PALTLTDKEYQLMRNAAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPR 356
P+ T +E + A V++ + G V + + VIE+NPR
Sbjct: 110 PSRTELKEE---LEELAEEVVEALPGLRGYVGVDLVLT--DNGPYVIEVNPR 156
>gnl|CDD|30949 COG0604, Qor, NADPH:quinone reductase and related Zn-dependent
oxidoreductases [Energy production and conversion /
General function prediction only].
Length = 326
Score = 36.9 bits (85), Expect = 0.032
Identities = 28/108 (25%), Positives = 44/108 (40%), Gaps = 13/108 (12%)
Query: 8 KTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDL-ADAT-- 64
+T+L+ GA V G+ A + K G ++ V S+ + +L AD
Sbjct: 144 ETVLVHGAAGGV----------GSAAIQLAKALGATVVAVVSSSEKLELLKELGADHVIN 193
Query: 65 YTEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDRYG 112
Y E E V ++ + D +L T GG T + +L G L G
Sbjct: 194 YREEDFVEQVRELTGGKGVDVVLDTVGGDTFAASLAALAPGGRLVSIG 241
>gnl|CDD|30377 COG0027, PurT, Formate-dependent phosphoribosylglycinamide
formyltransferase (GAR transformylase) [Nucleotide
transport and metabolism].
Length = 394
Score = 36.0 bits (83), Expect = 0.055
Identities = 53/254 (20%), Positives = 99/254 (38%), Gaps = 44/254 (17%)
Query: 734 LAEDRDRFQKLLME-LDLNQPRNGISHSVEHARLIACEIGFPLLIRPSYVLGGRAMQIVY 792
L +R+ ++L E L L + + S+E R +IGFP +++P G+ +V
Sbjct: 110 LTMNREGIRRLAAEELGLPTSKYRFADSLEELRAAVEKIGFPCVVKPVMSSSGKGQSVVR 169
Query: 793 SENMLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPLLFDSYLSDAMEIDVDALC 852
S E++++ + + Q G+ ++ + ++ EI + +
Sbjct: 170 SP----------------EDVEKAW---EYAQQGGRGGSGRVIVEEFVKFDFEITLLTVR 210
Query: 853 QNDQV-IVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELISQTKALAGALNVIGLIN 911
D I H +E GD S + +S+ +E S K + AL GL
Sbjct: 211 AVDGTGSFCAPIGHRQE----DGDYRESWQPQEMSEAALEEAQSIAKRVTDALGGRGLFG 266
Query: 912 VQYAIKDGKIYILEVNPRASRT--VPFI----------AKAI-GFPVAKV------AARI 952
V+ +K ++ EV+PR T V I +AI G P+ ++ A+ +
Sbjct: 267 VELFVKGDEVIFSEVSPRPHDTGMVTLISQDLSEFALHVRAILGLPIPEIRQISPAASAV 326
Query: 953 IAGESLDASIAAYG 966
I + + G
Sbjct: 327 ILAQETSQAPTFDG 340
>gnl|CDD|143431 cd07113, ALDH_PADH_NahF, Escherichia coli NAD+-dependent
phenylacetaldehyde dehydrogenase PadA-like.
NAD+-dependent, homodimeric, phenylacetaldehyde
dehydrogenase (PADH, EC=1.2.1.39) PadA of Escherichia
coli involved in the catabolism of 2-phenylethylamine,
and other related sequences, are present in this CD.
Also included is the Pseudomonas fluorescens ST StyD
PADH involved in styrene catabolism, the Sphingomonas
sp. LB126 FldD protein involved in fluorene degradation,
and the Novosphingobium aromaticivorans NahF
salicylaldehyde dehydrogenase involved in the
NAD+-dependent conversion of salicylaldehyde to
salicylate.
Length = 477
Score = 34.7 bits (80), Expect = 0.15
Identities = 42/125 (33%), Positives = 58/125 (46%), Gaps = 20/125 (16%)
Query: 297 TGDSITVAPALT--LTDKEYQLMRNAAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMN 354
TG +I + P+ LT L+R A +A KE G+ G NV VN K + +
Sbjct: 169 TGCTIVIKPSEFTPLT-----LLRVAELA--KEAGIPDGVLNV---VNGKGAVGAQLISH 218
Query: 355 PRVSRSSALASKATGFPIAKIAAKLAVGYTLDELGNDITGGKTPASF--EPSIDYIVTKI 412
P V++ S S ATG I + AA TL EL GGK A+F + ID++V +
Sbjct: 219 PDVAKVSFTGSVATGKKIGRQAASDLTRVTL-EL-----GGKNAAAFLKDADIDWVVEGL 272
Query: 413 PRFTF 417
F
Sbjct: 273 LTAGF 277
>gnl|CDD|30500 COG0151, PurD, Phosphoribosylamine-glycine ligase [Nucleotide
transport and metabolism].
Length = 428
Score = 34.8 bits (80), Expect = 0.15
Identities = 61/300 (20%), Positives = 93/300 (31%), Gaps = 75/300 (25%)
Query: 208 AVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFL----EIVENGLHASPTTEVLIEESV 263
A +DE G P++++ G G +A E E++E S V+IEE +
Sbjct: 130 AKAYIDEKGAPIVVKADGLAAGKGVIVAMTLEEAEAAVDEMLEGNAFGSAGARVVIEEFL 189
Query: 264 LGWKEYELEMMRDIKGNCIVVCSIENLDPM--------------GVHTGDSITVAPALTL 309
G +E+ L + + PM G +TG +PA +
Sbjct: 190 DG-EEFSL----------QAFVDGKTVIPMPTAQDHKRAYDGDTGPNTGGMGAYSPAPFI 238
Query: 310 TDKEYQLMRNAAI-----AVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVS------ 358
TD+ + + + KE G + K VIE N R
Sbjct: 239 TDEVVERAVEEIVEPTVEGMAKEGYPFRGVLYAGLMLTADGPK--VIEFNARFGDPETQV 296
Query: 359 ---------------------RSSALASKATGFPIAKIAAKLAVGYTLD-ELGNDITGGK 396
+ G + + A A GY D E G+ ITG +
Sbjct: 297 VLPLLESDLVELLLAAVDGKLDEVEILFWDKGAAVGVVLA--AEGYPGDPEKGDVITGDE 354
Query: 397 TPASFEPSIDYIVTKIPRFTFEKFPGSDVTLTTSMKSVGEVMAIGRTFAESLQKALRGLE 456
+ + K L TS V V+ G T E+ +KA LE
Sbjct: 355 EAEEEGAKVFHAGVK---------LDDGGQLVTSGGRVLAVVGTGDTLEEAQEKAYEALE 405
>gnl|CDD|37766 KOG2555, KOG2555, KOG2555, AICAR transformylase/IMP
cyclohydrolase/methylglyoxal synthase [Nucleotide
transport and metabolism].
Length = 588
Score = 34.6 bits (79), Expect = 0.18
Identities = 22/112 (19%), Positives = 44/112 (39%), Gaps = 18/112 (16%)
Query: 1042 DKKRIVPIIQNFKKLGFKIMATEGTARFLESHGLETQKINKV------LEGR-----PHI 1090
DK ++ + + +LG ++A+ GTA+ L GL + ++++ L GR P +
Sbjct: 13 DKTGLLDLAKGLVELGVTLVASGGTAKMLRDAGLPVRDVSEITHFPEMLGGRVKTLHPAV 72
Query: 1091 EDAISNRQVHLVINTTEGKKAI-EDSKSLRRATLIRKIPYYTTIAGADAVFQ 1141
I R + +K + E + P+ T+A +
Sbjct: 73 HGGILAR------DIESDEKDLKEQGIDKVDVVVCNLYPFKETVAKPGVTVE 118
>gnl|CDD|176188 cd05285, sorbitol_DH, Sorbitol dehydrogenase. Sorbitol and aldose
reductase are NAD(+) binding proteins of the polyol
pathway, which interconverts glucose and fructose.
Sorbitol dehydrogenase is tetrameric and has a single
catalytic zinc per subunit. Aldose reductase catalyzes
the NADP(H)-dependent conversion of glucose to sorbital,
and SDH uses NAD(H) in the conversion of sorbitol to
fructose. NAD(P)(H)-dependent oxidoreductases are the
major enzymes in the interconversion of alcohols and
aldehydes, or ketones. The medium chain alcohol
dehydrogenase family (MDR) have a NAD(P)(H)-binding
domain in a Rossmann fold of a beta-alpha form. The
N-terminal region typically has an all-beta catalytic
domain. These proteins typically form dimers (typically
higher plants, mammals) or tetramers (yeast, bacteria),
and have 2 tightly bound zinc atoms per subunit.
Length = 343
Score = 34.4 bits (80), Expect = 0.18
Identities = 24/112 (21%), Positives = 48/112 (42%), Gaps = 25/112 (22%)
Query: 8 KTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNP----------ATIMTD 57
T+L+ GAGP IG KA ++++ + +P AT +
Sbjct: 164 DTVLVFGAGP--IGLLT------AAVAKAFG--ATKVVVTDIDPSRLEFAKELGATHTVN 213
Query: 58 PDLADATYTEPITPEVVAKIIEKERPDAILPTTGGQTALNTAL-SLKRMGVL 108
D P + E +A+++ + PD ++ TG ++ + TA+ + + G +
Sbjct: 214 VRTEDT----PESAEKIAELLGGKGPDVVIECTGAESCIQTAIYATRPGGTV 261
>gnl|CDD|176244 cd08284, FDH_like_2, Glutathione-dependent formaldehyde
dehydrogenase related proteins, child 2.
Glutathione-dependent formaldehyde dehydrogenases (FDHs)
are members of the zinc-dependent/medium chain alcohol
dehydrogenase family. Formaldehyde dehydrogenase (FDH)
is a member of the zinc-dependent/medium chain alcohol
dehydrogenase family. FDH converts formaldehyde and NAD
to formate and NADH. The initial step in this process
the spontaneous formation of a
S-(hydroxymethyl)glutathione adduct from formaldehyde
and glutathione, followed by FDH-mediated oxidation (and
detoxification) of the adduct to S-formylglutathione.
These tetrameric FDHs have a catalytic zinc that resides
between the catalytic and NAD(H)binding domains and a
structural zinc in a lobe of the catalytic domain. The
medium chain alcohol dehydrogenase family (MDR) has a
NAD(P)(H)-binding domain in a Rossmann fold of a
beta-alpha form. The N-terminal region typically has an
all-beta catalytic domain. These proteins typically form
dimers (typically higher plants, mammals) or tetramers
(yeast, bacteria), and have 2 tightly bound zinc atoms
per subunit.
Length = 344
Score = 34.2 bits (79), Expect = 0.24
Identities = 31/113 (27%), Positives = 46/113 (40%), Gaps = 23/113 (20%)
Query: 9 TLLIIGAGPIVIGQACEFDYSGTQACKALKEEGY-RIILVNSNPATIMTDPDLADATYTE 67
T+ +IG GP+ G A + + G R+ V+ P + + A A E
Sbjct: 170 TVAVIGCGPV-----------GLCAVLSAQVLGAARVFAVDPVPERL----ERAAALGAE 214
Query: 68 PITPE---VVAKIIEKER---PDAILPTTGGQTALNTALSL-KRMGVLDRYGV 113
PI E V ++ E D +L GG AL+ A L + GV+ GV
Sbjct: 215 PINFEDAEPVERVREATEGRGADVVLEAVGGAAALDLAFDLVRPGGVISSVGV 267
>gnl|CDD|30376 COG0026, PurK, Phosphoribosylaminoimidazole carboxylase (NCAIR
synthetase) [Nucleotide transport and metabolism].
Length = 375
Score = 33.7 bits (77), Expect = 0.30
Identities = 43/207 (20%), Positives = 82/207 (39%), Gaps = 37/207 (17%)
Query: 729 PDSIDLAEDRDRFQKLLMELDLNQPRNGISHSVEHARLIACEIGFPLLIRPSYVLG--GR 786
PD++ +A+DR ++ L + L + S E A ++GFP +++ G G+
Sbjct: 91 PDALRIAQDRLVEKQFLDKAGLPVAPFQVVDSAEELDAAAADLGFPAVLKTRR-GGYDGK 149
Query: 787 AMQIVYSENMLQ--DYLLDTLPGLVSEEIKRQYPNDKTGQINTLLGTHPLLFDSYLSDAM 844
+ S+ L+ L V EE ++
Sbjct: 150 GQWRIRSDADLELRAAGLAEGGVPVLEE--------------------------FVPFER 183
Query: 845 EIDV-DALCQNDQVIVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELISQTKALAGA 903
EI V A + +V + E++ GI A P+R + L+ + K +A
Sbjct: 184 EISVIVARSNDGEVAFYPVAENVHRNGILRTSIA---PAR-IPDDLQAQAEEMAKKIAEE 239
Query: 904 LNVIGLINVQ-YAIKDGKIYILEVNPR 929
L+ +G++ V+ + DG++ + E+ PR
Sbjct: 240 LDYVGVLAVEFFVTPDGELLVNEIAPR 266
>gnl|CDD|173842 cd01153, ACAD_fadE5, Putative acyl-CoA dehydrogenases similar to
fadE5. Putative acyl-CoA dehydrogenase (ACAD).
Mitochondrial acyl-CoA dehydrogenases (ACAD) catalyze
the alpha,beta dehydrogenation of the corresponding
trans-enoyl-CoA by FAD, which becomes reduced. The
reduced form of ACAD is reoxidized in the oxidative
half-reaction by electron-transferring flavoprotein
(ETF), from which the electrons are transferred to the
mitochondrial respiratory chain coupled with ATP
synthesis. The ACD family includes the eukaryotic
beta-oxidation, as well as amino acid catabolism
enzymes. These enzymes share high sequence similarity,
but differ in their substrate specificities. The
mitochondrial ACD's are generally homotetramers and have
an active site glutamate at a conserved position.
Length = 407
Score = 33.5 bits (77), Expect = 0.31
Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 8/49 (16%)
Query: 216 GLPLIIRPSFTLGGTGGGIAYNRSEFLEIVEN--GLHASPTTEVLIEES 262
GL L + P F G G+ R +E GLH SPT E++ + +
Sbjct: 184 GLSLFLVPKFLDDGERNGVTVAR------IEEKMGLHGSPTCELVFDNA 226
>gnl|CDD|176184 cd05281, TDH, Threonine dehydrogenase. L-threonine dehydrogenase
(TDH) catalyzes the zinc-dependent formation of
2-amino-3-ketobutyrate from L-threonine via NAD(H)-
dependent oxidation. THD is a member of the
zinc-requiring, medium chain NAD(H)-dependent alcohol
dehydrogenase family (MDR). MDRs have a
NAD(P)(H)-binding domain in a Rossmann fold of a
beta-alpha form. NAD(P)(H)-dependent oxidoreductases are
the major enzymes in the interconversion of alcohols and
aldehydes, or ketones. The N-terminal region typically
has an all-beta catalytic domain. These proteins
typically form dimers (typically higher plants, mammals)
or tetramers (yeast, bacteria) and have 2 tightly bound
zinc atoms per subunit. Sorbitol and aldose reductase
are NAD(+) binding proteins of the polyol pathway, which
interconverts glucose and fructose.
Length = 341
Score = 33.0 bits (76), Expect = 0.45
Identities = 24/102 (23%), Positives = 36/102 (35%), Gaps = 14/102 (13%)
Query: 8 KTLLIIGAGPI---VIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLADAT 64
K++LI G GPI I A +G A YR+ L A ++ +P D
Sbjct: 165 KSVLITGCGPIGLMAIAVA---KAAGASLVIASDPNPYRLELAKKMGADVVINPREED-- 219
Query: 65 YTEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMG 106
V + + D +L +G A+ L G
Sbjct: 220 ------VVEVKSVTDGTGVDVVLEMSGNPKAIEQGLKALTPG 255
>gnl|CDD|33705 COG3919, COG3919, Predicted ATP-grasp enzyme [General function
prediction only].
Length = 415
Score = 33.1 bits (75), Expect = 0.46
Identities = 32/178 (17%), Positives = 61/178 (34%), Gaps = 18/178 (10%)
Query: 892 ELISQTKALAGALNVIGLINV--QYAIKDGKIYILEVNPRASRTVPFIAKAIGFPVAKVA 949
++I + ++ GL+ V +Y +DG +L+VNPR R + A G+ + +
Sbjct: 248 QVIQAARDFLESIEHTGLVEVEFKYDPRDGSYKLLDVNPRPWRWF-GLVTAAGYNLGRYL 306
Query: 950 ARIIAGESLDASIAAYGKRPDLSQIKHFAVKESVFPFNKFPGVDILLGPEMRSTGEVIGI 1009
+ K + V++ V P + + S G
Sbjct: 307 WADRINNE-TFTETIARKGTSWW----YLVRDKVAAKYLNPSGKEEIRRYINSFTGNRG- 360
Query: 1010 DQDFPLAFAKSQLGIGVDLPHEGTVFVSVRDADKKRIVPIIQNFKKLGFKIMATEGTA 1067
+ L ++ +D P ++R RI+ + +K G K TA
Sbjct: 361 ---WALFYSNDPSPGLIDKP-----LTAMRVL-FHRILGAFKTWKCYGHKRDGVSSTA 409
>gnl|CDD|176201 cd08239, THR_DH_like, L-threonine dehydrogenase (TDH)-like.
MDR/AHD-like proteins, including a protein annotated as
a threonine dehydrogenase. L-threonine dehydrogenase
(TDH) catalyzes the zinc-dependent formation of
2-amino-3-ketobutyrate from L-threonine via
NAD(H)-dependent oxidation. The zinc-dependent alcohol
dehydrogenases (ADHs) catalyze the NAD(P)(H)-dependent
interconversion of alcohols to aldehydes or ketones.
Zinc-dependent ADHs are medium chain
dehydrogenase/reductase type proteins (MDRs) and have a
NAD(P)(H)-binding domain in a Rossmann fold of an
beta-alpha form. The N-terminal region typically has an
all-beta catalytic domain. In addition to alcohol
dehydrogenases, this group includes quinone reductase,
sorbitol dehydrogenase, formaldehyde dehydrogenase,
butanediol DH, ketose reductase, cinnamyl reductase, and
numerous others. These proteins typically form dimers
(typically higher plants, mammals) or tetramers (yeast,
bacteria), and generally have 2 tightly bound zinc atoms
per subunit. The active site zinc is coordinated by a
histidine, two cysteines, and a water molecule. The
second zinc seems to play a structural role, affects
subunit interactions, and is typically coordinated by 4
cysteines.
Length = 339
Score = 32.7 bits (75), Expect = 0.65
Identities = 23/104 (22%), Positives = 42/104 (40%), Gaps = 15/104 (14%)
Query: 9 TLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYR-IILVNSNPATIMTDPDL-ADATYT 66
T+L++GAGP+ +G +AL G +I V+ +P + L AD
Sbjct: 166 TVLVVGAGPVGLG--------ALMLARAL---GAEDVIGVDPSPERLELAKALGADFVIN 214
Query: 67 -EPITPEVVAKIIEKERPDAILPTTGGQTALNTAL-SLKRMGVL 108
+ + ++ D + +G A AL +++ G L
Sbjct: 215 SGQDDVQEIRELTSGAGADVAIECSGNTAARRLALEAVRPWGRL 258
>gnl|CDD|32019 COG1834, COG1834, N-Dimethylarginine dimethylaminohydrolase [Amino
acid transport and metabolism].
Length = 267
Score = 32.5 bits (74), Expect = 0.76
Identities = 28/134 (20%), Positives = 46/134 (34%), Gaps = 17/134 (12%)
Query: 10 LLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLA-----DAT 64
+L+ G + IG + + G + +A EEGY + LV + + D A
Sbjct: 123 VLMDGGDTVYIGYSFRTNLEGIEQLQAWLEEGYEVSLVRLDERYLHLDTVFNPLAEGLAL 182
Query: 65 YTEPITPEVVAKIIEKERPDAILPTTGGQTAL---------NTALSLKRMGV---LDRYG 112
P E ++++ + I AL N ++L R L G
Sbjct: 183 AYPPAFSEGANDVLKERGFELIEVPEEEAFALGCNVVSLGPNVVIALPRTPKAEQLAAAG 242
Query: 113 VEMIGAKPETIDKA 126
E+I I K
Sbjct: 243 FEVIEVDLTEILKG 256
>gnl|CDD|176192 cd08230, glucose_DH, Glucose dehydrogenase. Glucose dehydrogenase
(GlcDH), a member of the medium chain
dehydrogenase/zinc-dependent alcohol dehydrogenase-like
family, catalyzes the NADP(+)-dependent oxidation of
glucose to gluconate, the first step in the
Entner-Doudoroff pathway, an alternative to or
substitute for glycolysis or the pentose phosphate
pathway. The medium chain dehydrogenases/reductase
(MDR)/zinc-dependent alcohol dehydrogenase-like family,
which contains the zinc-dependent alcohol dehydrogenase
(ADH-Zn) and related proteins, is a diverse group of
proteins related to the first identified member, class I
mammalian ADH. MDRs display a broad range of activities
and are distinguished from the smaller short chain
dehydrogenases (~ 250 amino acids vs. the ~ 350 amino
acids of the MDR). The MDR proteins have 2 domains: a
C-terminal NAD(P) binding-Rossman fold domain of a
beta-alpha form and an N-terminal catalytic domain with
distant homology to GroES. The MDR group contains a
host of activities, including the founding alcohol
dehydrogenase (ADH), quinone reductase, sorbitol
dehydrogenase, formaldehyde dehydrogenase, butanediol
DH, ketose reductase, cinnamyl reductase, and numerous
others. The zinc-dependent alcohol dehydrogenases (ADHs)
catalyze the NAD(P)(H)-dependent interconversion of
alcohols to aldehydes or ketones. Active site zinc has a
catalytic role, while structural zinc aids in stability.
Length = 355
Score = 32.2 bits (74), Expect = 0.83
Identities = 26/110 (23%), Positives = 43/110 (39%), Gaps = 23/110 (20%)
Query: 8 KTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLAD---AT 64
+ L++GAGPI G A L+ G+ + ++N D+ + AT
Sbjct: 174 RRALVLGAGPI-----------GLLAALLLRLRGFEVYVLNRRDPPDPK-ADIVEELGAT 221
Query: 65 Y-TEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDRYGV 113
Y TP K++ + D I+ TG L+ + + L GV
Sbjct: 222 YVNSSKTPVAEVKLVGE--FDLIIEATG-----VPPLAFEALPALAPNGV 264
>gnl|CDD|176193 cd08231, MDR_TM0436_like, Hypothetical enzyme TM0436 resembles the
zinc-dependent alcohol dehydrogenases (ADH). This group
contains the hypothetical TM0436 alcohol dehydrogenase
from Thermotoga maritima, proteins annotated as
5-exo-alcohol dehydrogenase, and other members of the
medium chain dehydrogenases/reductase
(MDR)/zinc-dependent alcohol dehydrogenase-like family.
MDR, which contains the zinc-dependent alcohol
dehydrogenase (ADH-Zn) and related proteins, is a
diverse group of proteins related to the first
identified member, class I mammalian ADH. MDRs display
a broad range of activities and are distinguished from
the smaller short chain dehydrogenases (~ 250 amino
acids vs. the ~ 350 amino acids of the MDR). The MDR
proteins have 2 domains: a C-terminal NAD(P)
binding-Rossmann fold domain of a beta-alpha form and an
N-terminal catalytic domain with distant homology to
GroES. The MDR group contains a host of activities,
including the founding alcohol dehydrogenase (ADH),
quinone reductase, sorbitol dehydrogenase, formaldehyde
dehydrogenase, butanediol DH, ketose reductase, cinnamyl
reductase, and numerous others. The zinc-dependent
alcohol dehydrogenases (ADHs) catalyze the
NAD(P)(H)-dependent interconversion of alcohols to
aldehydes or ketones. Active site zinc has a catalytic
role, while structural zinc aids in stability.
Length = 361
Score = 32.2 bits (74), Expect = 0.95
Identities = 24/106 (22%), Positives = 40/106 (37%), Gaps = 18/106 (16%)
Query: 8 KTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGY-RIILVNSNPA------TIMTDPDL 60
T+++ GAGP+ G A A K G R+I+++ +P D +
Sbjct: 179 DTVVVQGAGPL-----------GLYAVAAAKLAGARRVIVIDGSPERLELAREFGADATI 227
Query: 61 ADATYTEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMG 106
+P +V I D ++ +G A+ L L R G
Sbjct: 228 DIDELPDPQRRAIVRDITGGRGADVVIEASGHPAAVPEGLELLRRG 273
>gnl|CDD|32006 COG1821, COG1821, Predicted ATP-utilizing enzyme (ATP-grasp
superfamily) [General function prediction only].
Length = 307
Score = 31.9 bits (72), Expect = 1.1
Identities = 24/113 (21%), Positives = 44/113 (38%), Gaps = 11/113 (9%)
Query: 847 DVDALCQNDQVIVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELISQTKALAGALNV 906
V L N Q I+ E + G D L ++ +E I + + G +
Sbjct: 184 KVLPLSVNRQFIIFAGSELVYNGGRTPIDH-------ELKREAFEEAIRAVECIPG---L 233
Query: 907 IGLINVQYAIKDGKIYILEVNPRASRTVPFIAKAIGFPVAKVAARIIAGESLD 959
G + V + D Y++E+NPR + +++ VA++ G+ L
Sbjct: 234 NGYVGVDLVLSDE-PYVIEINPRPTTPTVGLSRVTPESVAELLLEGPTGKVLR 285
>gnl|CDD|31263 COG1063, Tdh, Threonine dehydrogenase and related Zn-dependent
dehydrogenases [Amino acid transport and metabolism /
General function prediction only].
Length = 350
Score = 31.5 bits (71), Expect = 1.3
Identities = 22/102 (21%), Positives = 38/102 (37%), Gaps = 14/102 (13%)
Query: 9 TLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLADATYT-- 66
T++++GAGP IG K L +I+V+ +P + + A
Sbjct: 171 TVVVVGAGP--IGLL------AIALAKLLGAS--VVIVVDRSPERLELAKEAGGADVVVN 220
Query: 67 --EPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMG 106
E + ++ D ++ G AL+ AL R G
Sbjct: 221 PSEDDAGAEILELTGGRGADVVIEAVGSPPALDQALEALRPG 262
>gnl|CDD|113104 pfam04321, RmlD_sub_bind, RmlD substrate binding domain.
L-rhamnose is a saccharide required for the virulence
of some bacteria. Its precursor, dTDP-L-rhamnose, is
synthesized by four different enzymes the final one of
which is RmlD. The RmlD substrate binding domain is
responsible for binding a sugar nucleotide.
Length = 284
Score = 31.4 bits (72), Expect = 1.5
Identities = 19/77 (24%), Positives = 31/77 (40%), Gaps = 24/77 (31%)
Query: 10 LLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLADATYTEPI 69
+L+ GA GQ G + + L E G ++ ++ DL D
Sbjct: 1 ILVTGAN----GQL------GRELTRLLAERGVEVVALDRPEL------DLTD------- 37
Query: 70 TPEVVAKIIEKERPDAI 86
PE VA ++ + RPD +
Sbjct: 38 -PEAVAALVREARPDVV 53
>gnl|CDD|176257 cd08297, CAD3, Cinnamyl alcohol dehydrogenases (CAD). These
alcohol dehydrogenases are related to the cinnamyl
alcohol dehydrogenases (CAD), members of the medium
chain dehydrogenase/reductase family.
NAD(P)(H)-dependent oxidoreductases are the major
enzymes in the interconversion of alcohols and
aldehydes, or ketones. Cinnamyl alcohol dehydrogenases
(CAD) reduce cinnamaldehydes to cinnamyl alcohols in the
last step of monolignal metabolism in plant cells walls.
CAD binds 2 zinc ions and is NADPH- dependent. CAD
family members are also found in non-plant species, e.g.
in yeast where they have an aldehyde reductase activity.
The medium chain dehydrogenases/reductase
(MDR)/zinc-dependent alcohol dehydrogenase-like family,
which contains the zinc-dependent alcohol dehydrogenase
(ADH-Zn) and related proteins, is a diverse group of
proteins related to the first identified member, class I
mammalian ADH. MDRs display a broad range of activities
and are distinguished from the smaller short chain
dehydrogenases (~ 250 amino acids vs. the ~ 350 amino
acids of the MDR). The MDR proteins have 2 domains: a
C-terminal NAD(P) binding-Rossmann fold domain of a
beta-alpha form and an N-terminal catalytic domain with
distant homology to GroES. The MDR group contains a
host of activities, including the founding alcohol
dehydrogenase (ADH), quinone reductase, sorbitol
dehydrogenase, formaldehyde dehydrogenase, butanediol
DH, ketose reductase, cinnamyl reductase, and numerous
others. The zinc-dependent alcohol dehydrogenases (ADHs)
catalyze the NAD(P)(H)-dependent interconversion of
alcohols to aldehydes or ketones. Active site zinc has
a catalytic role, while structural zinc aids in
stability. ADH-like proteins typically form dimers
(typically higher plants, mammals) or tetramers (yeast,
bacteria), and generally have 2 tightly bound zinc atoms
per subunit. The active site zinc is coordinated by a
histidine, two cysteines, and a water molecule. The
second zinc seems to play a structural role, affects
subunit interactions, and is typically coordinated by 4
cysteines.
Length = 341
Score = 30.6 bits (70), Expect = 2.3
Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 15/87 (17%)
Query: 30 GTQACKALKEEGYRIILVNSNPATIMTDPDL-----ADAT--YTEPITPEVVAKIIEKER 82
G Q KA+ G R+I ++ + +L ADA + + E V ++
Sbjct: 182 GVQYAKAM---GLRVIAIDVGDEKL----ELAKELGADAFVDFKKSDDVEAVKELTGGGG 234
Query: 83 PDAILPTTGGQTALNTALS-LKRMGVL 108
A++ T A AL L+ G L
Sbjct: 235 AHAVVVTAVSAAAYEQALDYLRPGGTL 261
>gnl|CDD|37175 KOG1964, KOG1964, KOG1964, Nuclear pore complex, rNup107 component
(sc Nup84) [Nuclear structure, Intracellular
trafficking, secretion, and vesicular transport].
Length = 800
Score = 30.4 bits (68), Expect = 2.9
Identities = 24/105 (22%), Positives = 43/105 (40%), Gaps = 13/105 (12%)
Query: 796 MLQDYLLDTLPGLVSEEIKRQYPNDKTGQINTLL---GTHPLLFDSYLSDAMEIDVDALC 852
+ +LD + L+ E + D+ GQ+ L TH +LF +S + +
Sbjct: 391 LQNHLILDLIKELI-ESVVEWLEKDRNGQVPPHLLRIMTHLVLF-LRISGLEVNEDGS-- 446
Query: 853 QNDQVIVVGIIEHIEEAGIHSGDSACSLPSRSLSQQLKDELISQT 897
I++ IE + +G H A +R L + L+ E S+
Sbjct: 447 ---AKIILTYIELLARSG-HKNLIA--FYTRFLPEDLQLEAYSRF 485
>gnl|CDD|143468 cd07150, ALDH_VaniDH_like, Pseudomonas putida vanillin
dehydrogenase-like. Vanillin dehydrogenase (Vdh,
VaniDH) involved in the metabolism of ferulic acid and
other related sequences are included in this CD. The
E. coli vanillin dehydrogenase (LigV) preferred NAD+ to
NADP+ and exhibited a broad substrate preference,
including vanillin, benzaldehyde, protocatechualdehyde,
m-anisaldehyde, and p-hydroxybenzaldehyde.
Length = 451
Score = 30.4 bits (69), Expect = 3.2
Identities = 25/77 (32%), Positives = 34/77 (44%), Gaps = 12/77 (15%)
Query: 324 VLKEIGVESGGANVQFAVNPKNGKMVVIEM--NPRVSRSSALASKATGFPIAKIAAKLAV 381
+++E G+ G NV G V E+ +PRV + S A G IA+ A +
Sbjct: 168 IMEEAGLPKGVFNVV----TGGGAEVGDELVDDPRVRMVTFTGSTAVGREIAEKAGRHLK 223
Query: 382 GYTLDELGNDITGGKTP 398
TL EL GGK P
Sbjct: 224 KITL-EL-----GGKNP 234
>gnl|CDD|32398 COG2216, KdpB, High-affinity K+ transport system, ATPase chain B
[Inorganic ion transport and metabolism].
Length = 681
Score = 30.2 bits (68), Expect = 3.5
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
Query: 648 LKEAGFETIMINC-NPETVSTDYDIA--DRLYFESLTEEDILEILRVEQQKGELVGI 701
L++ G +T+MI NP T + A D E+ T ED L ++R EQ +G LV +
Sbjct: 459 LRKMGIKTVMITGDNPLTAAAIAAEAGVDDFIAEA-TPEDKLALIRQEQAEGRLVAM 514
>gnl|CDD|31342 COG1148, HdrA, Heterodisulfide reductase, subunit A and related
polyferredoxins [Energy production and conversion].
Length = 622
Score = 30.3 bits (68), Expect = 3.6
Identities = 15/57 (26%), Positives = 26/57 (45%), Gaps = 11/57 (19%)
Query: 607 NKPVSEDKVSDRKKIVILGGGPNRIGQGIEFDYCCCHASFSLKEAGFETIMINCNPE 663
+P+ E KV K ++++GGG GI A+ L + GF+ ++ P
Sbjct: 113 LEPLEEIKVEVSKSVLVIGGGV----AGIT-------AALELADMGFKVYLVEKEPS 158
>gnl|CDD|31284 COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis,
outer membrane].
Length = 329
Score = 30.1 bits (68), Expect = 4.1
Identities = 12/70 (17%), Positives = 28/70 (40%), Gaps = 18/70 (25%)
Query: 28 YSGTQACKALKEEGYRII----LVNSNPATIMTDP------DLADATYTEPITPEVVAKI 77
Y G+ + L + G+ ++ L N + ++ DL D ++ +
Sbjct: 11 YIGSHTVRQLLKTGHEVVVLDNLSNGHKIALLKLQFKFYEGDLLDR--------ALLTAV 62
Query: 78 IEKERPDAIL 87
E+ + DA++
Sbjct: 63 FEENKIDAVV 72
>gnl|CDD|31264 COG1064, AdhP, Zn-dependent alcohol dehydrogenases [General
function prediction only].
Length = 339
Score = 29.8 bits (67), Expect = 5.0
Identities = 25/113 (22%), Positives = 42/113 (37%), Gaps = 13/113 (11%)
Query: 8 KTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPDLADATYTE 67
K + ++GAG + Q KA+ G +I + + + L
Sbjct: 168 KWVAVVGAGGLGH--------MAVQYAKAM---GAEVIAITRSEEKLELAKKLGADHVIN 216
Query: 68 PITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDRYGVEMIGAKP 120
+ + + KE DAI+ T G T + +L+R G L G+ G P
Sbjct: 217 SSDSDALEAV--KEIADAIIDTVGPATLEPSLKALRRGGTLVLVGLPGGGPIP 267
>gnl|CDD|35960 KOG0741, KOG0741, KOG0741, AAA+-type ATPase [Posttranslational
modification, protein turnover, chaperones].
Length = 744
Score = 29.5 bits (66), Expect = 5.0
Identities = 55/263 (20%), Positives = 97/263 (36%), Gaps = 55/263 (20%)
Query: 273 MMRDIKGNCIVVCSIENLDPMGVHTGDSIT----VAPALTLTDKEYQLMRNAAIAVLKEI 328
M R +K V ++ + V GD + V PA +++++ + +
Sbjct: 456 MNRHVKAGGKVEVDPVAIENLKVTRGDFLNALEDVKPAFGISEEDLERFVMNGMINWGPP 515
Query: 329 ---GVESGGANVQFAVNPKNGKMV-VIEMNPRVSRSSALASKA---TGFPIAKI-AAKLA 380
++ G VQ N + +V V+ P S +ALA+K + FP KI + +
Sbjct: 516 VTRILDDGKLLVQQVKNSERSPLVSVLLEGPPGSGKTALAAKIALSSDFPFVKIISPEDM 575
Query: 381 VGYTLDELGNDITGG-----KTPAS------FEPSIDYIVTKIPRFT----------FEK 419
+G + I K+P S E +DY+ PRF+ +K
Sbjct: 576 IGLSESAKCAHIKKIFEDAYKSPLSIIVVDDIERLLDYVPIG-PRFSNLVLQALLVLLKK 634
Query: 420 FPGSDVTL----TTSMKSVGEVMAIGRTFAESLQKALRGLETGLTGLDEIHIPSMESDND 475
P L TTS + V + M I F+ IH+P++ +
Sbjct: 635 QPPKGRKLLIFGTTSRREVLQEMGILDCFS-----------------STIHVPNLTTGEQ 677
Query: 476 PNALRSAISIPCPDRLRTVAQAL 498
+ ++I D +R +A+ L
Sbjct: 678 LLEVLEELNIFSDDEVRAIAEQL 700
>gnl|CDD|145188 pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide
adenosyltransferase. This is a family of proteins,
previously known as DUF62, found in archaebacteria and
bacteria. The structure of proteins in this family is
similar to that of a bacterial fluorinating enzyme.
S-adenosyl-l-methionine hydroxide adenosyltransferases
utilizes a rigorously conserved amino acid side chain
triad (Asp-Arg-His) which may have a role in activating
water to hydroxide ion. This family used to be known as
DUF62.
Length = 258
Score = 29.8 bits (68), Expect = 5.0
Identities = 24/91 (26%), Positives = 35/91 (38%), Gaps = 19/91 (20%)
Query: 373 AKIAAKLAVGYTLDELGNDITGG--KTPASFEPSID---------YI------VTKIPRF 415
A AA LA G L+ELG +I EPS++ YI +T I R
Sbjct: 131 APAAAHLARGGPLEELGPEIDPDSLVRLPLPEPSVEDDGIEGEVIYIDHFGNVITNITRE 190
Query: 416 TFEKFPGSDVTLTTSMKSVGEVMAIGRTFAE 446
F G V + + ++ RT+ +
Sbjct: 191 LFGLGKGDRVEVRIRGEK--LILPFVRTYGD 219
>gnl|CDD|146510 pfam03915, AIP3, Actin interacting protein 3.
Length = 422
Score = 29.6 bits (67), Expect = 5.1
Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 2/93 (2%)
Query: 256 EVLIEESVLGWKEYELEMMRDIKGNCIVVCSIENLDPMGVHTGDSI-TVAPALTLTDKEY 314
++ I++ V G YELE + DIK ++V ++E LD + H DS+ ++ + +
Sbjct: 41 DIYIQDPVSG-VSYELEELSDIKDGSVLVLNVEELDEVKKHIDDSLGGLSEKIKSLKQAL 99
Query: 315 QLMRNAAIAVLKEIGVESGGANVQFAVNPKNGK 347
+N + LKE+ + + A P
Sbjct: 100 DDQQNTILQRLKELQQNAAKEVARPAAAPPPSS 132
>gnl|CDD|176222 cd08261, Zn_ADH7, Alcohol dehydrogenases of the MDR family. This
group contains members identified as related to
zinc-dependent alcohol dehydrogenase and other members
of the MDR family. The medium chain
dehydrogenases/reductase (MDR)/zinc-dependent alcohol
dehydrogenase-like family, which contains the
zinc-dependent alcohol dehydrogenase (ADH-Zn) and
related proteins, is a diverse group of proteins related
to the first identified member, class I mammalian ADH.
MDRs display a broad range of activities and are
distinguished from the smaller short chain
dehydrogenases (~ 250 amino acids vs. the ~ 350 amino
acids of the MDR). The MDR proteins have 2 domains: a
C-terminal NAD(P)-binding Rossmann fold domain of a
beta-alpha form and an N-terminal catalytic domain with
distant homology to GroES. The MDR group includes
various activities, including the founding alcohol
dehydrogenase (ADH), quinone reductase, sorbitol
dehydrogenase, formaldehyde dehydrogenase, butanediol
DH, ketose reductase, cinnamyl reductase, and numerous
others. The zinc-dependent alcohol dehydrogenases (ADHs)
catalyze the NAD(P)(H)-dependent interconversion of
alcohols to aldehydes or ketones. Active site zinc has a
catalytic role, while structural zinc aids in stability.
ADH-like proteins typically form dimers (typically
higher plants, mammals) or tetramers (yeast, bacteria),
and generally have 2 tightly bound zinc atoms per
subunit. The active site zinc is coordinated by a
histidine, two cysteines, and a water molecule. The
second zinc seems to play a structural role, affects
subunit interactions, and is typically coordinated by 4
cysteines.
Length = 337
Score = 29.5 bits (67), Expect = 5.3
Identities = 23/104 (22%), Positives = 42/104 (40%), Gaps = 28/104 (26%)
Query: 9 TLLIIGAGPIVIGQACEFDYSGTQACKALKEEGYRIILVNSNPATIMTDPD--------L 60
T+L++GAGPI +G + K G R+I+V+ D +
Sbjct: 162 TVLVVGAGPIGLG-----------VIQVAKARGARVIVVD-------IDDERLEFARELG 203
Query: 61 ADATY--TEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSL 102
AD T + + ++ + E D ++ TG ++ A+ L
Sbjct: 204 ADDTINVGDEDVAARLRELTDGEGADVVIDATGNPASMEEAVEL 247
>gnl|CDD|110097 pfam01071, GARS_A, Phosphoribosylglycinamide synthetase, ATP-grasp
(A) domain. Phosphoribosylglycinamide synthetase
catalyses the second step in the de novo biosynthesis of
purine. The reaction catalysed by
Phosphoribosylglycinamide synthetase is the ATP-
dependent addition of 5-phosphoribosylamine to glycine
to form 5'phosphoribosylglycinamide. This domain is
related to the ATP-grasp domain of biotin
carboxylase/carbamoyl phosphate synthetase (see
pfam02786).
Length = 193
Score = 29.5 bits (67), Expect = 6.1
Identities = 34/172 (19%), Positives = 61/172 (35%), Gaps = 36/172 (20%)
Query: 208 AVQALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFLEIVENGLHASPTTE----VLIEESV 263
A + E G P +++ G G +A + E ++ V+ L E V+IEE +
Sbjct: 29 AKSYIREAGFPAVVKADGLAAGKGVIVAMDNEEAIKAVDEILEQKKFGEAGEPVVIEEFL 88
Query: 264 LGWKEYELEMMRDIKGNCIVVCSIENLDPM--------------GVHTGDSITVAPALTL 309
G E+ ++ + G + P+ G +TG +PA L
Sbjct: 89 EG---EEVSVLAFVDGKTVK--------PLPPAQDHKRLGEGDTGPNTGGMGAYSPAPVL 137
Query: 310 TDKEYQLMRNAAI-----AVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPR 356
T + + ++ + + KE G + K V+E N R
Sbjct: 138 TPELLERIKETIVEPTVDGLRKEGIPYKGVLYAGLMLTKDGPK--VLEFNCR 187
>gnl|CDD|176468 cd01596, Aspartase_like, aspartase (L-aspartate ammonia-lyase) and
fumarase class II enzymes. This group contains
aspartase (L-aspartate ammonia-lyase), fumarase class II
enzymes, and related proteins. It is a member of the
Lyase class I family. Members of this family for the
most part catalyze similar beta-elimination reactions in
which a C-N or C-O bond is cleaved with the release of
fumarate as one of the products. These proteins are
active as tetramers. The four active sites of the
homotetrameric enzyme are each formed by residues from
three different subunits. Aspartase catalyzes the
reversible deamination of aspartic acid. Fumarase
catalyzes the reversible hydration/dehydration of
fumarate to L-malate during the Krebs cycle.
Length = 450
Score = 29.3 bits (67), Expect = 6.2
Identities = 22/85 (25%), Positives = 37/85 (43%), Gaps = 9/85 (10%)
Query: 393 TGGKTPASFEPSIDYIVTKIPRFTFEKFPGSD--VTLTTSMKSVGEVMAIGRTFAESLQK 450
TG P + + + ++ T F + T + ++ EV +T A SL K
Sbjct: 229 TGLNAPPGY---AEKVAAELAELTGLPFVTAPNLFEATAAHDALVEVSGALKTLAVSLSK 285
Query: 451 A---LRGLETG-LTGLDEIHIPSME 471
LR L +G GL EI++P+ +
Sbjct: 286 IANDLRLLSSGPRAGLGEINLPANQ 310
>gnl|CDD|34435 COG4826, COG4826, Serine protease inhibitor [Posttranslational
modification, protein turnover, chaperones].
Length = 410
Score = 29.2 bits (65), Expect = 6.4
Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 4/41 (9%)
Query: 403 PSIDYIVTKIPRFTFEKFPGSDVTLTTSMKSVGEVMAIGRT 443
D + +IP+F FE + L ++ +G V A T
Sbjct: 292 EDQDEVEVEIPKFKFE----TKTELKDALIEMGVVDAFENT 328
>gnl|CDD|107353 cd06358, PBP1_NHase, Type I periplasmic-binding protein of the
nitrile hydratase (NHase) system that selectively
converts nitriles to corresponding amides. This group
includes the type I periplasmic-binding protein of the
nitrile hydratase (NHase) system that selectively
converts nitriles to corresponding amides, which are
subsequently converted by amidases to yield free
carboxylic acids and ammonia. NHases from bacteria and
fungi have been purified and characterized. In
Rhodococcus sp., the nitrile hydratase operon consists
of six genes encoding NHase regulator 2, NHase regulator
1, amidase, NHase alpha subunit, NHase beta subunit, and
NHase activator. The operon produces a constitutive
hydratase that has a broad substrate spectrum: aliphatic
and aromatic nitriles, mononitriles and dinitriles,
hydroxynitriles and amino-nitriles, and a constitutive
amidase of equally low substrate specificity. NHases are
metalloenzymes containing either cobalt or iron, and
therefore can be classified into two subgroups: ferric
NHases and cobalt NHases.
Length = 333
Score = 29.1 bits (66), Expect = 7.1
Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 1/39 (2%)
Query: 72 EVVAKIIEKERPDAILPTTGGQTALNTALSLKRMGVLDR 110
V+ +I DA+L T GQ A+ G+ DR
Sbjct: 178 SVLERI-AASGADAVLSTLVGQDAVAFNRQFAAAGLRDR 215
>gnl|CDD|176198 cd08236, sugar_DH, NAD(P)-dependent sugar dehydrogenases. This
group contains proteins identified as sorbitol
dehydrogenases and other sugar dehydrogenases of the
medium-chain dehydrogenase/reductase family (MDR), which
includes zinc-dependent alcohol dehydrogenase and
related proteins. Sorbitol and aldose reductase are
NAD(+) binding proteins of the polyol pathway, which
interconverts glucose and fructose. Sorbitol
dehydrogenase is tetrameric and has a single catalytic
zinc per subunit. NAD(P)(H)-dependent oxidoreductases
are the major enzymes in the interconversion of alcohols
and aldehydes, or ketones. Related proteins include
threonine dehydrogenase, formaldehyde dehydrogenase, and
butanediol dehydrogenase. The medium chain alcohol
dehydrogenase family (MDR) has a NAD(P)(H)-binding
domain in a Rossmann fold of a beta-alpha form. The
N-terminal region typically has an all-beta catalytic
domain. These proteins typically form dimers (typically
higher plants, mammals) or tetramers (yeast, bacteria),
and have 2 tightly bound zinc atoms per subunit. Horse
liver alcohol dehydrogenase is a dimeric enzyme and each
subunit has two domains. The NAD binding domain is in a
Rossmann fold and the catalytic domain contains a zinc
ion to which substrates bind. There is a cleft between
the domains that closes upon formation of the ternary
complex.
Length = 343
Score = 29.1 bits (66), Expect = 7.4
Identities = 28/102 (27%), Positives = 44/102 (43%), Gaps = 14/102 (13%)
Query: 8 KTLLIIGAGPIVIGQACEFDYSGTQACKALKEEGY-RIILVNSNPATIMTDPDL-ADATY 65
T+++IGAG I G A + LK G R+I V+ + + +L AD T
Sbjct: 161 DTVVVIGAGTI-----------GLLAIQWLKILGAKRVIAVDIDDEKLAVARELGADDTI 209
Query: 66 -TEPITPEVVAKIIEKERPDAILPTTGGQTALNTALSLKRMG 106
+ E V ++ E D ++ G + AL+L R G
Sbjct: 210 NPKEEDVEKVRELTEGRGADLVIEAAGSPATIEQALALARPG 251
>gnl|CDD|31945 COG1759, COG1759, ATP-utilizing enzymes of ATP-grasp superfamily
(probably carboligases) [General function prediction
only].
Length = 361
Score = 29.0 bits (65), Expect = 7.6
Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 11/88 (12%)
Query: 187 LKWNLEENDRKHRYICHAMAVAV----QALDEIGLPLIIRPSFTLGGTGGGIAYNRSEFL 242
L+W EE+ + + + + ++ +EI P+I++ GG G IA + EF
Sbjct: 119 LRW--EEDRKLEYKLLEKAGLRIPKKYKSPEEIDRPVIVKLPGAKGGRGYFIASSPEEFY 176
Query: 243 EIVENGLHASPTTE-----VLIEESVLG 265
E E L TE IEE V+G
Sbjct: 177 EKAERLLKRGVITEEDLKNARIEEYVVG 204
>gnl|CDD|30854 COG0508, AceF, Pyruvate/2-oxoglutarate dehydrogenase complex,
dihydrolipoamide acyltransferase (E2) component, and
related enzymes [Energy production and conversion].
Length = 404
Score = 28.8 bits (64), Expect = 8.1
Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 1021 QLGIGVDLPHEGTVFVSVRDADKKRIVPIIQNFKKL 1056
+GI VD P G V +RDADKK + I + K L
Sbjct: 268 NIGIAVDTPR-GLVVPVIRDADKKSLAEIAKEIKDL 302
>gnl|CDD|31216 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
production and conversion].
Length = 472
Score = 28.8 bits (64), Expect = 9.9
Identities = 28/87 (32%), Positives = 35/87 (40%), Gaps = 8/87 (9%)
Query: 320 AAIAVLKEIGVESGGANVQFAVNPKNGKMVVIEMNPRVSRSSALASKATGFPIAKIAAKL 379
A + E G+ +G NV + G +V +P V S S A G IA AA
Sbjct: 178 ALAELAAEAGLPAGVLNVVTGGGAEVGDALVA--HPDVDAISFTGSTAVGRAIAAAAAAN 235
Query: 380 AVGYTLDELGNDITGGKTPASFEPSID 406
TL EL GGK+PA D
Sbjct: 236 LKPVTL-EL-----GGKSPAIVLEDAD 256
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.318 0.136 0.386
Gapped
Lambda K H
0.267 0.0772 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 13,770,662
Number of extensions: 761962
Number of successful extensions: 2297
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2235
Number of HSP's successfully gapped: 93
Length of query: 1162
Length of database: 6,263,737
Length adjustment: 104
Effective length of query: 1058
Effective length of database: 4,016,401
Effective search space: 4249352258
Effective search space used: 4249352258
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.0 bits)