RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780268|ref|YP_003064681.1| acetyl-CoA carboxylase biotin carboxylase subunit [Candidatus Liberibacter asiaticus str. psy62] (443 letters) >gnl|CDD|30788 COG0439, AccC, Biotin carboxylase [Lipid metabolism]. Length = 449 Score = 599 bits (1546), Expect = e-172 Identities = 278/444 (62%), Positives = 347/444 (78%), Gaps = 4/444 (0%) Query: 1 MISKILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSKDSY 60 M KILIANRGEIA+RI+RAC+ELGI TVAV+S AD+ A+HV LADE+VCIGP S DSY Sbjct: 1 MFKKILIANRGEIAVRIIRACRELGIETVAVYSEADADALHVALADEAVCIGPAPSADSY 60 Query: 61 LNIQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAK 120 LNI I+AA E TGADAIHPGYGFLSENA FAE + + FIGPS+E I+ MGDKITA+ Sbjct: 61 LNIDAIIAAAEETGADAIHPGYGFLSENAAFAEACAEAGLTFIGPSAEAIRRMGDKITAR 120 Query: 121 KTAQQLGIPVVPGSGEVYPHT--AMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEA 178 + + G+PVVPGS A+ IA+EIG+PV+VKA+AGGGGRGMR+ +E +L A Sbjct: 121 RLMAKAGVPVVPGSDGAVADNEEALAIAEEIGYPVIVKAAAGGGGRGMRVVRNEEELEAA 180 Query: 179 IDQARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIWE 238 + AR EA AAFGN VY+EK++E PRHIEVQ+ GDG GN IH GERDCS+QRR+QK+ E Sbjct: 181 FEAARGEAEAAFGNPRVYLEKFIEGPRHIEVQVLGDGHGNVIHLGERDCSIQRRHQKVIE 240 Query: 239 EAHSPVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLYE-NGQFYFIEMNTRLQVEHPVT 297 EA SP+++ + R +IGE V+A K I YRGAGT+EFLY+ NG+FYFIEMNTRLQVEHPVT Sbjct: 241 EAPSPLLTEELREKIGEAAVRAAKLIGYRGAGTVEFLYDSNGEFYFIEMNTRLQVEHPVT 300 Query: 298 EAITGIDLVHEQIYVASENRLSVQQKDITFSGHAIECRINAEDPE-NFIPNPGEITYFHA 356 E +TGIDLV EQI +A+ LS++Q+DI F GHAIECRINAEDP NF+P+PG+IT + Sbjct: 301 EMVTGIDLVKEQIRIAAGEPLSLKQEDIKFRGHAIECRINAEDPLGNFLPSPGKITRYAP 360 Query: 357 PGGLGIRMDSASYQGYTVPSYYDSLIAKLIVHGKNRKECMMRLNRALNEIIIDGIKTTIP 416 PGG G+R+DS Y GY VP YYDS+I K+IVHG+ R E + R+ RAL+E++IDGIKT IP Sbjct: 361 PGGPGVRVDSGVYDGYRVPPYYDSMIGKVIVHGRTRDEAIARMRRALDELVIDGIKTNIP 420 Query: 417 LFQKLIKNEDIIEGNYDIHWLENK 440 L Q+++++ D + G+ D H+LE Sbjct: 421 LLQEILRDPDFLAGDLDTHFLETH 444 >gnl|CDD|34383 COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]. Length = 645 Score = 591 bits (1525), Expect = e-169 Identities = 239/437 (54%), Positives = 313/437 (71%), Gaps = 4/437 (0%) Query: 1 MISKILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSKDSY 60 M SKILIANRGEIA R++R ++LGI TVAV+S AD+ A+HVR+ADE+V IGP + +SY Sbjct: 1 MFSKILIANRGEIACRVIRTARDLGIRTVAVYSDADADALHVRMADEAVHIGPAPAAESY 60 Query: 61 LNIQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAK 120 L+I +I+ A TGA AIHPGYGFLSENA FA+ +ED + FIGPS+ I+ MGDKI AK Sbjct: 61 LDIDKIIDAARRTGAQAIHPGYGFLSENADFAQAVEDAGLVFIGPSAGAIRAMGDKIAAK 120 Query: 121 KTAQQLGIPVVPGS-GEVY-PHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEA 178 K A + G+P VPG G + + IA+EIG+PVL+KASAGGGG+GMR+ + + +EA Sbjct: 121 KLAAEAGVPTVPGYHGPIQDAAELVAIAEEIGYPVLIKASAGGGGKGMRVVETPEEFAEA 180 Query: 179 IDQARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIWE 238 ++ AR EA A+FG+D V+IEKYL+ PRHIE+Q+F D GN +H GERDCS+QRR+QK+ E Sbjct: 181 LESARREAKASFGDDRVFIEKYLDKPRHIEIQVFADQHGNVVHLGERDCSLQRRHQKVIE 240 Query: 239 EAHSPVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLY-ENGQFYFIEMNTRLQVEHPVT 297 EA +P ++ + R +GE V A K + Y GAGT+EF+ +G FYF+EMNTRLQVEHPVT Sbjct: 241 EAPAPFLTEETREAMGEAAVAAAKAVGYVGAGTVEFIVDADGNFYFLEMNTRLQVEHPVT 300 Query: 298 EAITGIDLVHEQIYVASENRLSVQQKDITFSGHAIECRINAEDPE-NFIPNPGEITYFHA 356 E ITGIDLV QI VAS +L Q DI +GHAIE RI AEDP F+P+ G +T + Sbjct: 301 ELITGIDLVEWQIRVASGEKLPFTQDDIPLNGHAIEARIYAEDPARGFLPSTGRLTRYRP 360 Query: 357 PGGLGIRMDSASYQGYTVPSYYDSLIAKLIVHGKNRKECMMRLNRALNEIIIDGIKTTIP 416 P G G+R+DS +G + +YD +IAKLIVHG +R+E + RL RAL E ++GI T IP Sbjct: 361 PAGPGVRVDSGVREGDEISPFYDPMIAKLIVHGADREEALDRLRRALAEFEVEGIATNIP 420 Query: 417 LFQKLIKNEDIIEGNYD 433 + L+ + G+ D Sbjct: 421 FLRALMADPRFRGGDLD 437 >gnl|CDD|35459 KOG0238, KOG0238, KOG0238, 3-Methylcrotonyl-CoA carboxylase, biotin-containing subunit/Propionyl-CoA carboxylase, alpha chain/Acetyl-CoA carboxylase, biotin carboxylase subunit [Lipid transport and metabolism, Amino acid transport and metabolism]. Length = 670 Score = 541 bits (1395), Expect = e-154 Identities = 225/442 (50%), Positives = 309/442 (69%), Gaps = 5/442 (1%) Query: 5 ILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSKDSYLNIQ 64 ILIANRGEIA R++R K++GI TVAV+S AD ++HV++ADE+VCIGP + SYL + Sbjct: 1 ILIANRGEIACRVIRTAKKMGIRTVAVYSDADRNSLHVKMADEAVCIGPAPAAQSYLRMD 60 Query: 65 QIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQ 124 +I+ A + TGA AIHPGYGFLSENA+FAE+ ED I FIGP I+ MGDK T+K+ + Sbjct: 61 KIIDAAKRTGAQAIHPGYGFLSENAEFAELCEDAGITFIGPPPSAIRDMGDKSTSKQIMK 120 Query: 125 QLGIPVVPG-SGEVYP-HTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQA 182 G+P+VPG GE A +A+EIG+PV++KA+AGGGG+GMRIA+SE + E ++ A Sbjct: 121 AAGVPLVPGYHGEDQSDEEAKKVAREIGYPVMIKATAGGGGKGMRIAWSEEEFEEGLESA 180 Query: 183 RSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIWEEAHS 242 + EA +FG+D + +EK+++NPRHIEVQ+FGD GNA+H GERDCSVQRRNQKI EEA + Sbjct: 181 KQEAAKSFGDDGMLLEKFIDNPRHIEVQVFGDKHGNAVHLGERDCSVQRRNQKIIEEAPA 240 Query: 243 PVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLY-ENGQFYFIEMNTRLQVEHPVTEAIT 301 P + + R +GE V+A K + Y GAGT+EF+ FYF+EMNTRLQVEHPVTE IT Sbjct: 241 PNLPEETRRALGEAAVRAAKAVGYVGAGTVEFIVDSKDNFYFMEMNTRLQVEHPVTEMIT 300 Query: 302 GIDLVHEQIYVASENRLSVQQKDITFSGHAIECRINAEDP-ENFIPNPGEITYFHAPGGL 360 G DLV QI VA+ L ++Q++I +GHA E RI AEDP + F+P+ G + Y+ PG Sbjct: 301 GTDLVEWQIRVAAGEPLPLKQEEIPLNGHAFEARIYAEDPYKGFLPSAGRLVYYSFPGHS 360 Query: 361 -GIRMDSASYQGYTVPSYYDSLIAKLIVHGKNRKECMMRLNRALNEIIIDGIKTTIPLFQ 419 G+R+D+ G V +YD +IAKL+V GK+R+E + +L AL+ +I G+ T I + Sbjct: 361 PGVRVDTGVRSGDEVSIHYDPMIAKLVVWGKDREEALNKLKDALDNYVIRGVPTNIDFLR 420 Query: 420 KLIKNEDIIEGNYDIHWLENKY 441 +I + + +GN ++ Sbjct: 421 DIISHPEFAKGNVSTKFIPEHQ 442 >gnl|CDD|31240 COG1038, PycA, Pyruvate carboxylase [Energy production and conversion]. Length = 1149 Score = 500 bits (1290), Expect = e-142 Identities = 220/451 (48%), Positives = 302/451 (66%), Gaps = 12/451 (2%) Query: 1 MISKILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSK-DS 59 I K+L+ANRGEIA+R+ RA ELGI TVA++S D ++H ADES IG ++ Sbjct: 6 KIKKVLVANRGEIAIRVFRAANELGIKTVAIYSEEDRLSLHRFKADESYLIGEGKGPVEA 65 Query: 60 YLNIQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITA 119 YL+I +I+ + +GADAIHPGYGFLSEN +FA + I FIGP E + ++GDK+ A Sbjct: 66 YLSIDEIIRIAKRSGADAIHPGYGFLSENPEFARACAEAGITFIGPKPEVLDMLGDKVKA 125 Query: 120 KKTAQQLGIPVVPGS-GEV-YPHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSE 177 + A + G+PV+PG+ G + A+ A+E G+PV++KA+AGGGGRGMR+ SE DL+E Sbjct: 126 RNAAIKAGVPVIPGTDGPIETIEEALEFAEEYGYPVMIKAAAGGGGRGMRVVRSEADLAE 185 Query: 178 AIDQARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIW 237 A ++A+SEA AAFGND VY+EK +ENP+HIEVQI GD GN +H ERDCSVQRR+QK+ Sbjct: 186 AFERAKSEAKAAFGNDEVYVEKLVENPKHIEVQILGDTHGNVVHLFERDCSVQRRHQKVV 245 Query: 238 EEAHSPVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLYEN-GQFYFIEMNTRLQVEHPV 296 E A +P +S + R EI + VK + I Y AGT+EFL + G+FYFIE+N R+QVEH + Sbjct: 246 EVAPAPYLSPELRDEICDDAVKLARNIGYINAGTVEFLVDEDGKFYFIEVNPRIQVEHTI 305 Query: 297 TEAITGIDLVHEQIYVASENRLSV------QQKDITFSGHAIECRINAEDPEN-FIPNPG 349 TE ITGID+V QI++A+ L QQKDI G+AI+CRI EDPEN FIP+ G Sbjct: 306 TEEITGIDIVKAQIHIAAGATLHTPELGLPQQKDIRTHGYAIQCRITTEDPENGFIPDTG 365 Query: 350 EITYFHAPGGLGIRMDSA-SYQGYTVPSYYDSLIAKLIVHGKNRKECMMRLNRALNEIII 408 IT + + GG G+R+D +Y G + YYDSL+ K+ G +E + ++ RAL E I Sbjct: 366 RITAYRSAGGFGVRLDGGNAYAGAVITPYYDSLLVKVTCWGSTFEEAIRKMIRALREFRI 425 Query: 409 DGIKTTIPLFQKLIKNEDIIEGNYDIHWLEN 439 G+KT IP + ++ + D G Y +++ Sbjct: 426 RGVKTNIPFLEAVLNHPDFRSGRYTTSFIDT 456 >gnl|CDD|35590 KOG0369, KOG0369, KOG0369, Pyruvate carboxylase [Energy production and conversion]. Length = 1176 Score = 449 bits (1156), Expect = e-127 Identities = 211/448 (47%), Positives = 287/448 (64%), Gaps = 13/448 (2%) Query: 3 SKILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIG---PPSSKDS 59 +K+L+ANRGEIA+R+ RA EL + TVA++S D +MH + ADE+ IG PP + Sbjct: 34 NKVLVANRGEIAIRVFRAATELSMRTVAIYSEQDRLSMHRQKADEAYLIGKGLPPVG--A 91 Query: 60 YLNIQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITA 119 YL I +I++ + DA+HPGYGFLSE + FA+ ++D I+FIGPS E I MGDK+ A Sbjct: 92 YLAIDEIISIAKKHNVDAVHPGYGFLSERSDFAQAVQDAGIRFIGPSPEVIDSMGDKVAA 151 Query: 120 KKTAQQLGIPVVPGS-GEV-YPHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSE 177 + A + G+PVVPG+ G + A+ KE G PV++KA+ GGGGRGMR+ S D+ E Sbjct: 152 RAIAIEAGVPVVPGTPGPITTVEEALEFVKEYGLPVIIKAAYGGGGRGMRVVRSGEDVEE 211 Query: 178 AIDQARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIW 237 A +A SEALAAFGN +++EK+LE PRHIEVQ+ GD GN +H ERDCSVQRR+QK+ Sbjct: 212 AFQRAYSEALAAFGNGTLFVEKFLEKPRHIEVQLLGDKHGNVVHLYERDCSVQRRHQKVV 271 Query: 238 EEAHSPVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLYEN-GQFYFIEMNTRLQVEHPV 296 E A + + + R I VK K + Y AGT+EFL + G+ YFIE+N RLQVEH V Sbjct: 272 EIAPAKTLPPEVRDAILTDAVKLAKHVGYENAGTVEFLVDQKGRHYFIEVNPRLQVEHTV 331 Query: 297 TEAITGIDLVHEQIYVA---SENRLSVQQKDITFSGHAIECRINAEDP-ENFIPNPGEIT 352 TE ITG+DLV QI+VA S L + Q IT G AI+CR+ EDP + F P+ G I Sbjct: 332 TEEITGVDLVQAQIHVAEGASLPDLGLTQDKITTRGFAIQCRVTTEDPAKGFQPDTGRIE 391 Query: 353 YFHAPGGLGIRMDSAS-YQGYTVPSYYDSLIAKLIVHGKNRKECMMRLNRALNEIIIDGI 411 F + G+GIR+D AS + G + +YDSL+ K+I HG + ++ RAL E I G+ Sbjct: 392 VFRSGEGMGIRLDGASAFAGAVISPHYDSLLVKVICHGSTYEIAARKMIRALIEFRIRGV 451 Query: 412 KTTIPLFQKLIKNEDIIEGNYDIHWLEN 439 KT IP ++ N +EG D +++ Sbjct: 452 KTNIPFLLNVLTNPVFLEGTVDTTFIDE 479 >gnl|CDD|35589 KOG0368, KOG0368, KOG0368, Acetyl-CoA carboxylase [Lipid transport and metabolism]. Length = 2196 Score = 271 bits (695), Expect = 2e-73 Identities = 150/500 (30%), Positives = 244/500 (48%), Gaps = 68/500 (13%) Query: 2 ISKILIANRGEIALRILRACK---------ELGIPTVAVHSTAD--SGAMHVRLADESVC 50 I +ILIAN G A++ +R+ + E I V + + D + A ++R+AD+ V Sbjct: 54 IKRILIANNGIAAVKCIRSIRKWAYETFGNERAIQFVCMATPDDLRANAEYIRMADQYVE 113 Query: 51 IGPPSSKDSYLNIQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHI 110 + ++ ++Y N+ IV E T DA+ G+G SEN + E L + I FIGP + + Sbjct: 114 VPGGTNNNNYANVDLIVDIAERTDVDAVWAGWGHASENPELPERLSANGIIFIGPPASAM 173 Query: 111 KIMGDKITAKKTAQQLGIPVVPGSG-------------------EVY-------PHTAMP 144 + +GDKI + AQ G+P +P SG ++Y + Sbjct: 174 RALGDKIASTIIAQSAGVPTLPWSGSGVKVEHIEDKTNLVSVPEDLYEKACVRNVEEGLE 233 Query: 145 IAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGNDAVYIEKYLENP 204 A++IGFPV++KAS GGGG+G+R +E+D Q ++E +++ K + Sbjct: 234 AAEKIGFPVMIKASEGGGGKGIRKVENEDDFKALFKQVQNE----VPGSPIFLMKLADQA 289 Query: 205 RHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIWEEAHSPVISAQDRTEIGETCVKAMKKI 264 RH+EVQ+ D GN I RDCS+QRR+QKI EEA + + + ++ + V+ K + Sbjct: 290 RHLEVQLLADQYGNVISLFGRDCSIQRRHQKIIEEAPATIAPPETFKKMEQAAVRLAKLV 349 Query: 265 DYRGAGTIEFLY--ENGQFYFIEMNTRLQVEHPVTEAITGIDLVHEQIYVA--------- 313 Y AGT+E+LY ++G++YF+E+N RLQVEHP TE I+ ++L Q+ +A Sbjct: 350 GYVSAGTVEYLYSPDDGEYYFLELNPRLQVEHPTTEMISDVNLPAAQLQIAMGIPLHRIP 409 Query: 314 -------------SENRLSVQQKDITFSGHAIECRINAEDP-ENFIPNPGEITYFHAPGG 359 S + GH I RI +EDP + F P+ G + + Sbjct: 410 DIRRLYGLEPTGDSPIDFENAKLPCP-KGHCIAARITSEDPDDGFKPSSGTVQELNFRSS 468 Query: 360 LGIRMDSASYQGYTVPSYYDSLIAKLIVHGKNRKECMMRLNRALNEIIIDG-IKTTIPLF 418 + + G + + DS + G++R+E + + AL E+ I G +TT+ Sbjct: 469 SNVWGYFSVGNGGGIHEFADSQFGHIFAFGESRQEAIANMVVALKELSIRGDFRTTVEYL 528 Query: 419 QKLIKNEDIIEGNYDIHWLE 438 L++ ED D WL+ Sbjct: 529 IDLLETEDFESNKIDTGWLD 548 >gnl|CDD|145768 pfam02786, CPSase_L_D2, Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosynthesis of arginine and/or pyrimidines. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain. The small chain promotes the hydrolysis of glutamine to ammonia, which is used by the large chain to synthesize carbamoyl phosphate. See pfam00988. The small chain has a GATase domain in the carboxyl terminus. See pfam00117. The ATP binding domain (this one) has an ATP-grasp fold. Length = 211 Score = 261 bits (668), Expect = 4e-70 Identities = 94/210 (44%), Positives = 127/210 (60%), Gaps = 6/210 (2%) Query: 115 DKITAKKTAQQLGIPVVPGSGEVYPHT---AMPIAKEIGFPVLVKASAGGGGRGMRIAYS 171 DK+ K ++ G+P VPG+ T A+ AKEIG+PV++KA+ GGGG GM IA + Sbjct: 1 DKVLFKAAMKEAGVPTVPGT-AGPVETEEEALAAAKEIGYPVIIKAAFGGGGLGMGIARN 59 Query: 172 ENDLSEAIDQARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQR 231 E +L+E A +EA AAFGN V +EK L+ P+HIE Q+ D GN I R+CS QR Sbjct: 60 EEELAELFALALAEAPAAFGNPQVLVEKSLKGPKHIEYQVLRDAHGNCITVCNRECSDQR 119 Query: 232 RNQKIWEEAHSPVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLYE--NGQFYFIEMNTR 289 R QK E A S ++ ++R + E VK + + Y GAGT+EF + +G++YFIEMNTR Sbjct: 120 RTQKSIEVAPSQTLTDEERQMLREAAVKIARHLGYVGAGTVEFALDPFSGEYYFIEMNTR 179 Query: 290 LQVEHPVTEAITGIDLVHEQIYVASENRLS 319 LQV H + E TG DL E +A L Sbjct: 180 LQVSHALAEKATGYDLAKEAAKIALGYPLP 209 >gnl|CDD|144029 pfam00289, CPSase_L_chain, Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosynthesis of arginine and/or pyrimidines. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain. The small chain promotes the hydrolysis of glutamine to ammonia, which is used by the large chain to synthesize carbamoyl phosphate. See pfam00988. The small chain has a GATase domain in the carboxyl terminus. See pfam00117. Length = 109 Score = 176 bits (448), Expect = 1e-44 Identities = 68/109 (62%), Positives = 84/109 (77%) Query: 2 ISKILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSKDSYL 61 I K+L+ANRGEIA+RI+RA +ELGI TVAV+S D+ + HVRLADE+ +GP + +SYL Sbjct: 1 IKKVLVANRGEIAVRIIRALRELGIETVAVNSNPDTVSTHVRLADEAYFLGPGPASESYL 60 Query: 62 NIQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHI 110 NI++I+ E GADAIHPGYGFLSENA+FAE E I FIGPS E I Sbjct: 61 NIERILDIAEKEGADAIHPGYGFLSENAEFAEACEKAGITFIGPSPEAI 109 >gnl|CDD|145767 pfam02785, Biotin_carb_C, Biotin carboxylase C-terminal domain. Biotin carboxylase is a component of the acetyl-CoA carboxylase multi-component enzyme which catalyses the first committed step in fatty acid synthesis in animals, plants and bacteria. Most of the active site residues reported in reference are in this C-terminal domain. Length = 107 Score = 163 bits (414), Expect = 1e-40 Identities = 54/107 (50%), Positives = 73/107 (68%), Gaps = 1/107 (0%) Query: 333 ECRINAEDPE-NFIPNPGEITYFHAPGGLGIRMDSASYQGYTVPSYYDSLIAKLIVHGKN 391 E RI AEDP F+P+PG+IT + PGG G+R+DS Y+G V YYDS+IAKLIVHG + Sbjct: 1 EARIYAEDPANGFLPSPGKITRYRFPGGPGVRVDSGVYEGDEVSPYYDSMIAKLIVHGPD 60 Query: 392 RKECMMRLNRALNEIIIDGIKTTIPLFQKLIKNEDIIEGNYDIHWLE 438 R+E + RL RAL E I+G+KT IP + ++++ D G D +LE Sbjct: 61 REEAIARLRRALAETRIEGVKTNIPFLRAILEHPDFRAGEVDTGFLE 107 >gnl|CDD|30806 COG0458, CarB, Carbamoylphosphate synthase large subunit (split gene in MJ) [Amino acid transport and metabolism / Nucleotide transport and metabolism]. Length = 400 Score = 89.2 bits (221), Expect = 2e-18 Identities = 100/400 (25%), Positives = 160/400 (40%), Gaps = 46/400 (11%) Query: 17 ILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSKDSYLNIQQIVAACEVTGAD 76 +A KE G V V+S + LAD+ P +K+ I E D Sbjct: 21 ACKALKEEGYGVVLVNSNPATIMTDPELADKVYIE--PITKEPVEKI------IEKERPD 72 Query: 77 AIHPGYG-----FLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLGIPVV 131 AI P G + K +LE + ++ +G E I+I DK K+ +++GIPV Sbjct: 73 AILPTLGGQTALNAALELKEKGVLEKYGVEVVGSDPEAIEIAEDKKLFKEAMREIGIPVP 132 Query: 132 PGSGEVYPHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFG 191 A IA EIG+PV+VK S G GG G IAY+E +L E I E L A Sbjct: 133 SRIAHSVEE-ADEIADEIGYPVIVKPSFGLGGSGGGIAYNEEELEEII----EEGLRASP 187 Query: 192 NDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIWEEAHSPVIS-AQDR 250 + V IE+ + + E ++ DG N I C+++ + S ++ AQ Sbjct: 188 VEEVLIEESIIGWKEFEYEVVRDGKDNCIV----VCNMENLDPMGVHTGDSITVAPAQTL 243 Query: 251 TEIGETCVKAM-----KKIDYRGAGTIEFLYEN--GQFYFIEMNTRLQVEHPVTEAITGI 303 T+ ++ ++I G I+F + G+ Y IE+N R+ + TG Sbjct: 244 TDKEYQMLRDAAIKVIREIGIEGGCNIQFAVDPGGGELYVIEINPRVSRSSALASKATGY 303 Query: 304 DLVHEQIYVASENRLSVQQKDITFSGHAIECRINAEDPENFIPNPGEITYFHAPGGLGIR 363 + +A L + DIT A + + IP + A LG + Sbjct: 304 PIAKVAAKLAVGYTLDEIRNDITGRTPASF-EPSLDYVVTKIPRFDFEKFPGADRRLGTQ 362 Query: 364 MDSASYQGYTVPSYYDSLIAKLIVHGKNRKECMMRLNRAL 403 M S + +++ G+ +E + + R+L Sbjct: 363 MKS---------------VGEVMAIGRTFEEALQKALRSL 387 >gnl|CDD|31374 COG1181, DdlA, D-alanine-D-alanine ligase and related ATP-grasp enzymes [Cell envelope biogenesis, outer membrane]. Length = 317 Score = 66.1 bits (161), Expect = 2e-11 Identities = 50/224 (22%), Positives = 86/224 (38%), Gaps = 22/224 (9%) Query: 81 GYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLGIPVVPG----SGE 136 +G E+ +LE I ++G DKI K+ + G+PV P E Sbjct: 69 LHGPYGEDGTIQGLLELLGIPYVGKGVLASAGAMDKIVTKRLFKAEGLPVAPYVALTRDE 128 Query: 137 VYPHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGNDAVY 196 + + +GFP+ VK + G G E DL A++ LA + V Sbjct: 129 YSSVIVEEVEEGLGFPLFVKPAREGSSVGRSPVNVEGDLQSALE------LAFKYDRDVL 182 Query: 197 IEKYLENPRHIEVQIFGD-------GMGNAIHFGERDCSVQRRNQKIWEEAHSPV--ISA 247 E+ + R IEV + G+ +G GE + + + ++ Sbjct: 183 REQGI-TGREIEVGVLGNDYEEQALPLGEIPPKGEEFYDYEAKYLSTGGAQYDIPAGLTD 241 Query: 248 QDRTEIGETCVKAMKKIDYRGAGTIEFLY--ENGQFYFIEMNTR 289 + EI E ++A K + G ++F + G+F +E+NT Sbjct: 242 EIHEEIKELALRAYKALGCLGLARVDFFVDDDEGEFVLLEVNTN 285 >gnl|CDD|30376 COG0026, PurK, Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) [Nucleotide transport and metabolism]. Length = 375 Score = 61.0 bits (148), Expect = 6e-10 Identities = 60/289 (20%), Positives = 112/289 (38%), Gaps = 26/289 (8%) Query: 3 SKILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSKDSYLN 62 + I G++ + A LGI + + AD+ A ++AD + ++ D Sbjct: 2 KTVGILGGGQLGRMMALAAARLGIKVIVLDPDADAPA--AQVADRVIV----AAYDDPEA 55 Query: 63 IQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKT 122 ++++ A C+V + F + A+ E L + PS + ++I D++ K+ Sbjct: 56 LRELAAKCDVITYE-------FENVPAEALEKLAASV--KVFPSPDALRIAQDRLVEKQF 106 Query: 123 AQQLGIPVVPGSGEVYPHTAMPIAKEIGFPVLVKASAGG-GGRGMRIAYSENDLSEAIDQ 181 + G+PV P A ++GFP ++K GG G+G S+ DL Sbjct: 107 LDKAGLPVAPFQVVDSAEELDAAAADLGFPAVLKTRRGGYDGKGQWRIRSDADLELRAAG 166 Query: 182 ARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIWEEAH 241 +E+++ R I V I + F +V R A Sbjct: 167 LAEG-------GVPVLEEFVPFEREISV-IVARSNDGEVAFYPVAENVHRNGILRTSIAP 218 Query: 242 SPVISAQDRTEIGETCVKAMKKIDYRGAGTIE-FLYENGQFYFIEMNTR 289 + I + + E K +++DY G +E F+ +G+ E+ R Sbjct: 219 AR-IPDDLQAQAEEMAKKIAEELDYVGVLAVEFFVTPDGELLVNEIAPR 266 >gnl|CDD|30377 COG0027, PurT, Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) [Nucleotide transport and metabolism]. Length = 394 Score = 53.7 bits (129), Expect = 9e-08 Identities = 44/200 (22%), Positives = 86/200 (43%), Gaps = 15/200 (7%) Query: 4 KILIANRGEIALRILRACKELGIPTVAVHSTADSGAMHVRLADESVCIGPPSSKDSYLNI 63 K+++ GE+ + + LG+ +AV A++ AM V A S I L+ Sbjct: 14 KVMLLGSGELGKEVAIEAQRLGVEVIAVDRYANAPAMQV--AHRSYVI-------DMLDG 64 Query: 64 QQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKTA 123 + A E D I P ++ +A LE+ + P++ K+ ++ ++ A Sbjct: 65 DALRAVVEREKPDYIVPEIEAIATDALVE--LEEEGYTVV-PNARATKLTMNREGIRRLA 121 Query: 124 -QQLGIPVVPGSGEVYPHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQA 182 ++LG+P ++IGFP +VK G+G + S D+ +A + A Sbjct: 122 AEELGLPTSKYRFADSLEELRAAVEKIGFPCVVKPVMSSSGKGQSVVRSPEDVEKAWEYA 181 Query: 183 RSEALAAFGNDAVYIEKYLE 202 + G+ V +E++++ Sbjct: 182 QQGGRG--GSGRVIVEEFVK 199 >gnl|CDD|35591 KOG0370, KOG0370, KOG0370, Multifunctional pyrimidine synthesis protein CAD (includes carbamoyl-phophate synthetase, aspartate transcarbamylase, and glutamine amidotransferase) [General function prediction only]. Length = 1435 Score = 51.5 bits (123), Expect = 4e-07 Identities = 48/218 (22%), Positives = 85/218 (38%), Gaps = 15/218 (6%) Query: 95 LEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLGIPVVPGSGEVYPHTAMPIAKEIGFPVL 154 L + +K +G S E I ++ + +G+ A A+++G+PVL Sbjct: 1013 LHRNGVKILGTSPEMIDSAENRFKFSRMLDSIGVDQPAWKELTSLEEAKKFAEKVGYPVL 1072 Query: 155 VKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGND-AVYIEKYLENPRHIEVQIFG 213 V+ S G M + YSE+DL ++QA +A D V I K++E + I+V Sbjct: 1073 VRPSYVLSGAAMNVVYSESDLKSYLEQA-----SAVSPDHPVVISKFIEGAKEIDVDAVA 1127 Query: 214 DGMGNAIHFGERDCSVQRRNQKIWEEAHSPVISAQDRT-EIGETCVKAMKKI----DYRG 268 +H S N + + V+ QD + + E K+ G Sbjct: 1128 SDGKVLVHA----ISEHVENAGVHSGDATLVLPPQDLSADTLERIKDIAAKVAKALKITG 1183 Query: 269 AGTIEFLYENGQFYFIEMNTRLQVEHPVTEAITGIDLV 306 ++ + ++ + IE N R P G+D + Sbjct: 1184 PFNMQIIAKDNELKVIECNVRASRSFPFVSKTLGVDFI 1221 Score = 39.9 bits (93), Expect = 0.001 Identities = 54/290 (18%), Positives = 112/290 (38%), Gaps = 39/290 (13%) Query: 18 LRACKELGIPTVAVHSTADSGAMHVRLADE--SVCIGPPSSKDSYLNIQQIVAACEVTGA 75 ++A KE I T+ ++ + LAD+ + + P + G Sbjct: 404 IKALKEENIFTILINPNIATVQTSKGLADKVYFLPVTPEYVTKVIKAERPDGILLTFGGQ 463 Query: 76 DAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLGIPVVPGSG 135 A++ G L + FA+ + +K +G + I D+ + ++ + P Sbjct: 464 TALNCGV-ELDKAGVFAQ----YGVKVLGTPIQTIITTEDRDLFARALNEINEKIAPSEA 518 Query: 136 EVYPHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGNDAV 195 A+ A+ +G+PV+V+A+ GG G A +E +L + QA + + + Sbjct: 519 VSTIEEALEAAERLGYPVIVRAAYALGGLGSGFANNEEELQDLAAQALALS------PQI 572 Query: 196 YIEKYLENPRHIEVQIFGDGMGNA-------------IHFGERDCSVQRRNQKIWEEAHS 242 +EK L+ + +E ++ D N IH G+ + Sbjct: 573 LVEKSLKGWKEVEYEVVRDAYDNCITVCNMENFDPLGIHTGDSIVVAPSQ---------- 622 Query: 243 PVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLY--ENGQFYFIEMNTRL 290 +S ++ + T +K ++ + G I++ + ++ IE+N RL Sbjct: 623 -TLSDEEYQMLRTTAIKVIRHLGVVGECNIQYALNPYSLEYRIIEVNARL 671 >gnl|CDD|30500 COG0151, PurD, Phosphoribosylamine-glycine ligase [Nucleotide transport and metabolism]. Length = 428 Score = 50.6 bits (121), Expect = 9e-07 Identities = 50/219 (22%), Positives = 83/219 (37%), Gaps = 29/219 (13%) Query: 92 AEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLGIPVVPGSGEVY--PHTAMPIAKEI 149 + L I GP+ ++ G K AK ++ GIP EV+ P A E Sbjct: 80 VDALRAAGIPVFGPTKAAAQLEGSKAFAKDFMKKYGIPT--AEYEVFTDPEEAKAYIDEK 137 Query: 150 GFPVLVKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGNDAVYIEKYLENPRHIEV 209 G P++VKA G+G+ +A + + A+D+ V IE++L+ + Sbjct: 138 GAPIVVKADGLAAGKGVIVAMTLEEAEAAVDEMLEGNAFGSAGARVVIEEFLDGEE-FSL 196 Query: 210 QIFGDGMGNAIHFGERDCSVQRRNQKIWEE------------AHSPVISAQD----RTEI 253 Q F DG I + Q +++ ++ + +P I+ + EI Sbjct: 197 QAFVDG-KTVIPMP----TAQ-DHKRAYDGDTGPNTGGMGAYSPAPFITDEVVERAVEEI 250 Query: 254 GETCVKAMKK--IDYRGAGTIEFLYENGQFYFIEMNTRL 290 E V+ M K +RG + IE N R Sbjct: 251 VEPTVEGMAKEGYPFRGVLYAGLMLTADGPKVIEFNARF 289 >gnl|CDD|145401 pfam02222, ATP-grasp, ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity. Length = 171 Score = 47.6 bits (114), Expect = 6e-06 Identities = 35/171 (20%), Positives = 66/171 (38%), Gaps = 16/171 (9%) Query: 123 AQQLGIPVVPGSGEVYPHTAMPIAKEIGFPVLVKASAGG-GGRGMRIAYSENDLSEAIDQ 181 Q+LG+P + +E+G+P ++KA GG G+G + SE D+ Sbjct: 1 LQKLGLPTPRFMAAESLEELIEAGQELGYPCVLKARRGGYDGKGQYVVRSEADI------ 54 Query: 182 ARSEALAAFGNDAVYIEKYLENPRHIEVQI--FGDGMGNAIHFGERDCSVQRRNQKIWEE 239 +A G V +E+++ + + V + DG F ++Q Sbjct: 55 --PQAWEELGGGPVIVEEFVPFDKELSVLVVRSVDG---ETAFYPPVETIQEDGICHESV 109 Query: 240 AHSPVISAQDRTEIGETCVKAMKKIDYRGAGTIE-FLYENGQFYFIEMNTR 289 A + V + + E K + ++ G +E F+ +G E+ R Sbjct: 110 APARV-PDSQQAKAQEIAKKIVDELGGVGIFGVELFVLPDGDLLVNELAPR 159 >gnl|CDD|30538 COG0189, RimK, Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis]. Length = 318 Score = 44.2 bits (104), Expect = 8e-05 Identities = 45/221 (20%), Positives = 86/221 (38%), Gaps = 19/221 (8%) Query: 74 GADAIHPGYG-FLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLGIPVVP 132 D I +F + E + I + I+ +K+ + + GIPV P Sbjct: 78 ELDVIIMRKDPPFDFATRFLRLAERKGVPVI-NDPQSIRRCRNKLYTTQLLAKAGIPV-P 135 Query: 133 GSGEVY--PHTAMPIAKEIGFPVLVKASAGGGGRG-MRIAYSENDLSEAIDQARSEALAA 189 + A +A+ +GFPV++K G GGRG + ++ +L + E L Sbjct: 136 PTLITRDPDEAAEFVAEHLGFPVVLKPLDGSGGRGVFLVEDADPELLSLL-----ETLTQ 190 Query: 190 FGNDAVYIEKYLENPRHIEVQIF-GDGMGNAIHFGERDCSVQ--RRNQKIWEEAHSPVIS 246 G + +++Y+ + + ++ G G AI+ R + R N A ++ Sbjct: 191 EGRKLIIVQEYIPKAKRDDRRVLVGGGEVVAIYALARIPASGDFRSNLARGGRAEPCELT 250 Query: 247 AQDRTEIGETCVKAMKKIDYRGAGTIEFLYENGQFYFIEMN 287 ++ E VKA + G ++ + + Y E+N Sbjct: 251 EEEE----ELAVKAAPALGLGLVG-VDIIEDKDGLYVTEVN 286 >gnl|CDD|35458 KOG0237, KOG0237, KOG0237, Glycinamide ribonucleotide synthetase (GARS)/Aminoimidazole ribonucleotide synthetase (AIRS) [Nucleotide transport and metabolism]. Length = 788 Score = 38.0 bits (88), Expect = 0.006 Identities = 29/125 (23%), Positives = 50/125 (40%), Gaps = 2/125 (1%) Query: 92 AEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLGIPVVPGSGEVYPHTAMPIAKEIGF 151 A++L I GPS + ++ K +K + IP P A + Sbjct: 85 ADVLRSAGIPCFGPSKQAAQLEASKNFSKDFMHRHNIPTAKYKTFTDPEEAKSFIQSATD 144 Query: 152 P-VLVKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGNDAVYIEKYLENPRHIEVQ 210 +++KA G+G+ +A S+ + EA+D + + V IE+ LE + Sbjct: 145 KALVIKADGLAAGKGVIVAKSKEEAFEAVDSILVKKVFGSAGKTVVIEELLEG-EEVSFL 203 Query: 211 IFGDG 215 F DG Sbjct: 204 AFTDG 208 >gnl|CDD|30394 COG0045, SucC, Succinyl-CoA synthetase, beta subunit [Energy production and conversion]. Length = 387 Score = 36.7 bits (85), Expect = 0.012 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 5/68 (7%) Query: 119 AKKTAQQLGIPVVPGSGEVYPHTAMPIAKEIG-FPVLVKASAGGGGRG----MRIAYSEN 173 AK+ + GIPV PG P A AKE+G PV+VKA GGRG +++A S Sbjct: 8 AKELFAKYGIPVPPGYVATSPEEAEEAAKELGGGPVVVKAQVHAGGRGKAGGVKLAKSPE 67 Query: 174 DLSEAIDQ 181 + EA ++ Sbjct: 68 EAKEAAEE 75 >gnl|CDD|32413 COG2232, COG2232, Predicted ATP-dependent carboligase related to biotin carboxylase [General function prediction only]. Length = 389 Score = 35.0 bits (80), Expect = 0.045 Identities = 23/108 (21%), Positives = 43/108 (39%), Gaps = 8/108 (7%) Query: 252 EIGETCVKAMKKIDYRGAGTIEFLYENGQFYFIEMNTRLQVEHPVTEAITGIDLVHEQIY 311 E + ++++ G+ ++F+ + Y IE+N R+Q E +GI+L I Sbjct: 236 EAERLAEELVEELGLVGSNGVDFVLNDKGPYVIEVNPRIQGTLECIERSSGINLFRLHI- 294 Query: 312 VASENRL-------SVQQKDITFSGHAIECRINAEDPENFIPNPGEIT 352 A + L K I ++ + I + IP PG + Sbjct: 295 QAFDGELPERPKPRGYACKRILYAPRTVRVPILKLSWTHDIPRPGTVI 342 >gnl|CDD|110097 pfam01071, GARS_A, Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain. Phosphoribosylglycinamide synthetase catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by Phosphoribosylglycinamide synthetase is the ATP- dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide. This domain is related to the ATP-grasp domain of biotin carboxylase/carbamoyl phosphate synthetase (see pfam02786). Length = 193 Score = 34.5 bits (80), Expect = 0.056 Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 1/69 (1%) Query: 147 KEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGNDAVYIEKYLENPRH 206 +E GFP +VKA G+G+ +A + +A+D+ + + V IE++LE Sbjct: 34 REAGFPAVVKADGLAAGKGVIVAMDNEEAIKAVDEILEQKKFGEAGEPVVIEEFLEGE-E 92 Query: 207 IEVQIFGDG 215 + V F DG Sbjct: 93 VSVLAFVDG 101 >gnl|CDD|99985 cd03814, GT1_like_2, This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. The members of this family are found mainly in bacteria and eukaryotes.. Length = 364 Score = 31.0 bits (71), Expect = 0.67 Identities = 13/30 (43%), Positives = 17/30 (56%) Query: 5 ILIANRGEIALRILRACKELGIPTVAVHST 34 + IA G + L LRA + LGIP V + T Sbjct: 87 VHIATPGPLGLAALRAARRLGIPVVTSYHT 116 >gnl|CDD|33705 COG3919, COG3919, Predicted ATP-grasp enzyme [General function prediction only]. Length = 415 Score = 31.1 bits (70), Expect = 0.69 Identities = 38/176 (21%), Positives = 76/176 (43%), Gaps = 15/176 (8%) Query: 121 KTAQQLGIPVVPGSGEVYPHTAMPIAKEIGFPVLVKASAGGG----GRGMRIAYSENDLS 176 A++LG+P P + V + E+ FPV++K GG R ++N Sbjct: 120 NRAEELGLPY-PKTYLV-NSEIDTLVDELTFPVILKPGMGGSVHFEARAKAFTAADN--- 174 Query: 177 EAIDQARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKI 236 E + A A G D V +++++ P E Q + + H + + +R Q Sbjct: 175 EEMKLALHRAYEEIGPDNVVVQEFI--PGGGENQFSYAALWDKGH-PVAEFTARRLRQYP 231 Query: 237 WEEAH-SPVISAQDRTEIGETCVKAMKKIDYRGAGTIEFLYE--NGQFYFIEMNTR 289 + + S V+ D ++ + ++ I++ G +EF Y+ +G + +++N R Sbjct: 232 VDFGYTSTVVEVVDNQQVIQAARDFLESIEHTGLVEVEFKYDPRDGSYKLLDVNPR 287 >gnl|CDD|30719 COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion transport and metabolism]. Length = 653 Score = 31.0 bits (70), Expect = 0.70 Identities = 27/155 (17%), Positives = 49/155 (31%), Gaps = 24/155 (15%) Query: 114 GDKITAKKTAQQLGIPVVPGSGEVYPHTAMPIAKEIGFPVLVKASAGGGGRGMRIAYSEN 173 G +I +K ++ LG+PVVP + + I K + G I Sbjct: 125 GIRIDIEKLSKLLGVPVVPTVA-KRGEGLEELKRAIIELAESKTTPREVDYGEEIEEEIK 183 Query: 174 DLSEAIDQARSEALAAFGNDAVYIEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRN 233 +L + R A+ D +E L+ P V+ Sbjct: 184 ELEALSEDPRWLAIKLL-EDDELVEAVLKEPE---------------------KRVEELL 221 Query: 234 QKIWEEAHSPVISAQDR-TEIGETCVKAMKKIDYR 267 +++ EE ++ A R I +K+ + Sbjct: 222 EELSEEEGHLLLIADARYALIERILRSVVKQEEEE 256 >gnl|CDD|88405 cd05008, SIS_GlmS_GlmD_1, SIS (Sugar ISomerase) domain repeat 1 found in Glucosamine 6-phosphate synthase (GlmS) and Glucosamine-6-phosphate deaminase (GlmD). The SIS domain is found in many phosphosugar isomerases and phosphosugar binding proteins. GlmS contains a N-terminal glutaminase domain and two C-terminal SIS domains and catalyzes the first step in hexosamine metabolism, converting fructose 6-phosphate into glucosamine 6-phosphate using glutamine as nitrogen source. The glutaminase domain hydrolyzes glutamine to glutamate and ammonia. Ammonia is transferred through a channel to the isomerase domain for glucosamine 6-phosphate synthesis. The end product of the pathway is N-acetylglucosamine, which plays multiple roles in eukaryotic cells including being a building block of bacterial and fungal cell walls. In the absence of glutamine, GlmS catalyzes the isomerization of fructose 6-phosphate into glucose 6- phosphate (PGI-like activity). Glucosamine-6-phosphate deaminase (GlmD) contains two SIS domains and catalyzes the deamination and isomerization of glucosamine-6-phosphate into fructose-6-phosphate with the release of ammonia; in presence of high ammonia concentration, GlmD can catalyze the reverse reaction.. Length = 126 Score = 30.9 bits (70), Expect = 0.77 Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%) Query: 5 ILIANRGEIA--LRILRACKELGIPTVAVHSTADSGAMHVRLADESVCI--GPPSSKDS 59 I I+ GE A L LR KE G TVA+ T G+ R AD + + GP S + Sbjct: 51 IAISQSGETADTLAALRLAKEKGAKTVAI--TNVVGSTLAREADYVLYLRAGPEISVAA 107 >gnl|CDD|31137 COG0794, GutQ, Predicted sugar phosphate isomerase involved in capsule formation [Cell envelope biogenesis, outer membrane]. Length = 202 Score = 30.6 bits (69), Expect = 0.95 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 4/49 (8%) Query: 5 ILIANRGEIA--LRILRACKELGIPTVAVHSTADSGAMHVRLADESVCI 51 I I+ GE L + K LG +A+ S DS + AD + I Sbjct: 91 IAISGSGETKELLNLAPKAKRLGAKLIAITSNPDSSL--AKAADVVLVI 137 >gnl|CDD|29354 cd01170, THZ_kinase, 4-methyl-5-beta-hydroxyethylthiazole (Thz) kinase catalyzes the phosphorylation of the hydroxylgroup of Thz. A reaction that allows cells to recycle Thz into the thiamine biosynthesis pathway, as an alternative to its synthesis from cysteine, tyrosine and 1-deoxy-D-xylulose-5-phosphate.. Length = 242 Score = 29.8 bits (67), Expect = 1.4 Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 14/93 (15%) Query: 105 PSSEHIKIMGDKITAKKTAQQLGIPVV--P---GSGEVYPHTAMPIAKEIGFPVL----- 154 +SE I+ M + A K A QLG PVV P G+ A + E V+ Sbjct: 60 LTSEQIEAM---LKAGKAANQLGKPVVLDPVGVGATSFRTEVAKELLAEGQPTVIRGNAS 116 Query: 155 -VKASAGGGGRGMRIAYSENDLSEAIDQARSEA 186 + A AG G G + S +D +A++ A++ A Sbjct: 117 EIAALAGLTGLGKGVDSSSSDEEDALELAKALA 149 >gnl|CDD|144821 pfam01370, Epimerase, NAD dependent epimerase/dehydratase family. This family of proteins utilize NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions. Length = 235 Score = 29.6 bits (67), Expect = 1.7 Identities = 15/79 (18%), Positives = 28/79 (35%), Gaps = 9/79 (11%) Query: 6 LIANRGEIALRILRACKELGIPTVAVHSTADS--GAMHVRLADESVCIGPPS----SKDS 59 I L +L A + G+ V +++ G + E +GP S +K Sbjct: 88 FIRANVLGTLNLLEAARRAGVKRF-VFASSSEVYGDVADPPITEDTPLGPLSPYAAAK-- 144 Query: 60 YLNIQQIVAACEVTGADAI 78 + ++A G A+ Sbjct: 145 LAGERLVLAYARAYGLRAV 163 >gnl|CDD|111971 pfam03133, TTL, Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis. Length = 291 Score = 29.6 bits (67), Expect = 1.9 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 11/50 (22%) Query: 155 VKASAGGGGRGMRIAYSENDLSEAIDQARSEALAAFGNDAVYIEKYLENP 204 VK SA GRG+RI NDLS+ + Q +S L ++KY+E P Sbjct: 71 VKPSASARGRGIRIT---NDLSQILKQIQSRPL--------VVQKYIERP 109 >gnl|CDD|29951 cd00958, DhnA, Class I fructose-1,6-bisphosphate (FBP) aldolases of the archaeal type (DhnA homologs) found in bacteria and archaea. Catalysis of the enzymes proceeds via a Schiff-base mechanism like other class I aldolases, although this subfamily is clearly divergent based on sequence similarity to other class I and class II (metal dependent) aldolase subfamilies.. Length = 235 Score = 29.4 bits (66), Expect = 2.2 Identities = 13/71 (18%), Positives = 23/71 (32%), Gaps = 17/71 (23%) Query: 16 RILRACKELGIPTVAV----------HSTADSGAMHVRLADES----VCIGPPSSKDSYL 61 R+ + G+P +A D A R+ E V +S+ Sbjct: 113 RVAAEAHKYGLPLIAWMYPRGPAVKNEKDPDLIAYAARIGAELGADIVKTKYTGDAESF- 171 Query: 62 NIQQIVAACEV 72 +++V C V Sbjct: 172 --KEVVEGCPV 180 >gnl|CDD|30844 COG0498, ThrC, Threonine synthase [Amino acid transport and metabolism]. Length = 411 Score = 29.1 bits (65), Expect = 2.4 Identities = 36/141 (25%), Positives = 54/141 (38%), Gaps = 15/141 (10%) Query: 22 KELGIPTVAVHSTADSGA---MHVRLADESVCIGPPSSKDSYLNIQQIVAACEVTGADAI 78 KELG T+ S+ ++GA + A V + P K S + Q++ GA I Sbjct: 122 KELGAKTILCASSGNTGASAAAYAARAGLKVFVLYPKGKVSPGKLAQMLT----LGAHVI 177 Query: 79 HPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKITAKKTAQQLG--IP---VVP- 132 F E + S ++ G K A + A+QLG P VVP Sbjct: 178 AVDGNFDDAQELVKEAANREGLLSAVNSINPYRLEGQKTYAFEIAEQLGWKAPDHVVVPV 237 Query: 133 GSGEVYPHTAMPIAKEIGFPV 153 G+G A+ + G P+ Sbjct: 238 GNGGNL--LAIYKGFKEGLPI 256 >gnl|CDD|145523 pfam02421, FeoB_N, Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB is a large 700-800 amino acid integral membrane protein. The N terminus contains a P-loop motif suggesting that iron transport may be ATP dependent. Length = 188 Score = 29.0 bits (66), Expect = 2.4 Identities = 23/73 (31%), Positives = 32/73 (43%), Gaps = 15/73 (20%) Query: 113 MGDKITAKKTAQQLGIPVVPGSGEVYPHTAMPIAKEIGFPVLVKA----SAGGGGRGMRI 168 G KI KK ++ LG+PVVP S K G L A + G ++I Sbjct: 120 KGIKIDIKKLSELLGVPVVPTSA----------RKGEGIDELKDAIIEVAEGKVKPPLKI 169 Query: 169 AYSENDLSEAIDQ 181 Y E ++ EAI + Sbjct: 170 NYGE-EIEEAISE 181 >gnl|CDD|133280 cd01879, FeoB, Ferrous iron transport protein B (FeoB) subfamily. E. coli has an iron(II) transport system, known as feo, which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB is a large 700-800 amino acid integral membrane protein. The N terminus contains a P-loop motif suggesting that iron transport may be ATP dependent. Length = 158 Score = 28.6 bits (65), Expect = 3.5 Identities = 11/22 (50%), Positives = 14/22 (63%) Query: 114 GDKITAKKTAQQLGIPVVPGSG 135 G KI K ++ LG+PVVP S Sbjct: 118 GIKIDLDKLSELLGVPVVPTSA 139 >gnl|CDD|173931 cd08172, GlyDH-like1, Glycerol dehydrogenases-like. Glycerol dehydrogenases-like. The proteins in this family have not been characterized, but they show sequence homology with glycerol dehydrogenase. Glycerol dehydrogenases (GlyDH) is a key enzyme in the glycerol dissimilation pathway. In anaerobic conditions, many microorganisms utilize glycerol as a source of carbon through coupled oxidative and reductive pathways. One of the pathways involves the oxidation of glycerol to dihydroxyacetone with the reduction of NAD+ to NADH catalyzed by glycerol dehydrogenases. Dihydroxyacetone is then phosphorylated by dihydroxyacetone kinase and enters the glycolytic pathway for further degradation. The activity of GlyDH is zinc-dependent. The zinc ion plays a role in stabilizing an alkoxide intermediate at the active site. Length = 347 Score = 28.3 bits (64), Expect = 4.1 Identities = 20/75 (26%), Positives = 28/75 (37%), Gaps = 33/75 (44%) Query: 59 SYLNIQQIVAACEVTGADAIHPGYGFLSENAKFAEILEDHHIKFIGPSSEHIKIMGDKI- 117 S NI+++ A + GAD I I I G K+ Sbjct: 61 SEENIERLAAQAKENGADVI-------------------------------IGIGGGKVL 89 Query: 118 -TAKKTAQQLGIPVV 131 TAK A +LG+PV+ Sbjct: 90 DTAKAVADRLGVPVI 104 >gnl|CDD|38938 KOG3734, KOG3734, KOG3734, Predicted phosphoglycerate mutase [Carbohydrate transport and metabolism]. Length = 272 Score = 28.0 bits (62), Expect = 4.6 Identities = 13/69 (18%), Positives = 23/69 (33%), Gaps = 4/69 (5%) Query: 344 FIPNPGEITYFHAPGGLG----IRMDSASYQGYTVPSYYDSLIAKLIVHGKNRKECMMRL 399 PG P I D + G+ V YD + + G++ ++C R+ Sbjct: 122 IRVEPGLFEPEKWPKDGKFPFFISPDELKFPGFPVDLNYDPVYKETPRWGESLEDCNDRI 181 Query: 400 NRALNEIII 408 + I Sbjct: 182 QKVFKAIAD 190 >gnl|CDD|31527 COG1336, COG1336, Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) [DNA replication, recombination, and repair]. Length = 298 Score = 28.0 bits (62), Expect = 4.8 Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 5/63 (7%) Query: 129 PVVPGSGEVYPHTAMPIAKE--IGFPVLVKASAGGGGRGMRIAY---SENDLSEAIDQAR 183 PV G+G+ +PI +E GFP++ +S G R +A E++L E I++ Sbjct: 12 PVHIGAGQGLGVIDLPIQRERHTGFPIIPGSSLKGALRSFLLAKLEDKEDELGEKIEKCV 71 Query: 184 SEA 186 + Sbjct: 72 FGS 74 >gnl|CDD|39212 KOG4009, KOG4009, KOG4009, NADH-ubiquinone oxidoreductase, subunit NDUFB10/PDSW [Energy production and conversion]. Length = 156 Score = 27.8 bits (61), Expect = 5.6 Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 4/51 (7%) Query: 197 IEKYLENPRHIEVQIFGDGMGNAIHFGERDCSVQRRNQKIWEEAHSPVISA 247 +E+Y + R+ V+ +GD G + R ++++++ IWE H P +A Sbjct: 105 VEQYEDATRNYFVK-YGDLGGYS---SARKAYMKQKHRMIWERRHGPKPAA 151 >gnl|CDD|30414 COG0065, LeuC, 3-isopropylmalate dehydratase large subunit [Amino acid transport and metabolism]. Length = 423 Score = 27.8 bits (62), Expect = 5.9 Identities = 18/74 (24%), Positives = 29/74 (39%), Gaps = 10/74 (13%) Query: 75 ADAIHP--GYGFLSENAKFAEILEDHHIK----FIGPSS----EHIKIMGDKITAKKTAQ 124 A + P + +N +E IK FIG + E ++ + + +K A Sbjct: 266 ASDLEPQVAWPTNPDNVVPVSEVEPDPIKIDQVFIGSCTNGRIEDLRAAAEILKGRKVAP 325 Query: 125 QLGIPVVPGSGEVY 138 + VVPGS V Sbjct: 326 GVRAIVVPGSRRVK 339 >gnl|CDD|37018 KOG1807, KOG1807, KOG1807, Helicases [Replication, recombination and repair]. Length = 1025 Score = 27.7 bits (61), Expect = 6.2 Identities = 11/24 (45%), Positives = 17/24 (70%) Query: 92 AEILEDHHIKFIGPSSEHIKIMGD 115 AE+LE H I + P +EH+ ++GD Sbjct: 730 AEVLEAHIIAALTPHTEHVILIGD 753 >gnl|CDD|48077 cd01976, Nitrogenase_MoFe_alpha, Nitrogenase_MoFe_alpha_II: Nitrogenase MoFe protein, beta subunit. A group of proteins similar to the alpha subunit of the MoFe protein of the molybdenum (Mo-) nitrogenase. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Mo-nitrogenase is the most widespread and best characterized of these systems. Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2). MoFe is an alpha2beta2 tetramer. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster. Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction.. Length = 421 Score = 27.5 bits (61), Expect = 7.5 Identities = 13/31 (41%), Positives = 21/31 (67%), Gaps = 2/31 (6%) Query: 110 IKIMGDKI--TAKKTAQQLGIPVVPGSGEVY 138 + ++GD I A+K +++LGIPVVP E + Sbjct: 111 VGLIGDDIEAVARKASKELGIPVVPVRCEGF 141 >gnl|CDD|31945 COG1759, COG1759, ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) [General function prediction only]. Length = 361 Score = 27.5 bits (61), Expect = 7.5 Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 1/61 (1%) Query: 146 AKEIGFPVLVKASAGGGGRGMRIAYSENDLSEAIDQ-ARSEALAAFGNDAVYIEKYLENP 204 +EI PV+VK GGRG IA S + E ++ + + IE+Y+ Sbjct: 146 PEEIDRPVIVKLPGAKGGRGYFIASSPEEFYEKAERLLKRGVITEEDLKNARIEEYVVGA 205 Query: 205 R 205 Sbjct: 206 P 206 >gnl|CDD|37481 KOG2270, KOG2270, KOG2270, Serine/threonine protein kinase involved in cell cycle control [Signal transduction mechanisms, Cell cycle control, cell division, chromosome partitioning]. Length = 520 Score = 27.3 bits (60), Expect = 7.7 Identities = 15/58 (25%), Positives = 32/58 (55%), Gaps = 8/58 (13%) Query: 245 ISAQDRTEIGETCVKAMKKIDYRGAGTI-----EF--LYENGQFYFIEMNTRLQVEHP 295 +S E+ + CV+ M+++ Y+ + EF LY +G+ Y I+++ ++ +HP Sbjct: 261 LSTSKARELYQQCVRIMRRL-YQKCRLVHADLSEFNLLYHDGKLYIIDVSQSVEHDHP 317 >gnl|CDD|185679 cd02064, FAD_synthetase_N, FAD synthetase, N-terminal domain of the bifunctional enzyme. FAD synthetase_N. N-terminal domain of the bifunctional riboflavin biosynthesis protein riboflavin kinase/FAD synthetase. These enzymes have both ATP:riboflavin 5'-phosphotransferase and ATP:FMN-adenylyltransferase activities. The N-terminal domain is believed to play a role in the adenylylation reaction of FAD synthetases. The C-terminal domain is thought to have kinase activity. FAD synthetase is present among all kingdoms of life. However, the bifunctional enzyme is not found in mammals, which use separate enzymes for FMN and FAD formation. Length = 180 Score = 27.1 bits (61), Expect = 9.1 Identities = 7/21 (33%), Positives = 10/21 (47%) Query: 266 YRGAGTIEFLYENGQFYFIEM 286 +G E L E G+ Y E+ Sbjct: 113 KGRSGDAELLKELGKKYGFEV 133 >gnl|CDD|144739 pfam01255, Prenyltransf, Putative undecaprenyl diphosphate synthase. Previously known as uncharacterized protein family UPF0015, a single member of this family has been identified as an undecaprenyl diphosphate synthase. Length = 222 Score = 27.1 bits (61), Expect = 9.2 Identities = 8/20 (40%), Positives = 10/20 (50%) Query: 12 EIALRILRACKELGIPTVAV 31 E IL C ELGI + + Sbjct: 25 EALREILEWCLELGIKYLTL 44 >gnl|CDD|144518 pfam00946, Paramyx_RNA_pol, Paramyxovirus RNA dependent RNA polymerase. Paramyxoviridae, like other non-segmented negative strand RNA viruses, have an RNA-dependent RNA polymerase composed of two subunits, a large protein L and a phosphoprotein P. This is a protein family of the L protein. The L protein confers the RNA polymerase activity on the complex. The P protein acts as a transcription factor. Length = 1161 Score = 27.2 bits (61), Expect = 9.5 Identities = 11/43 (25%), Positives = 14/43 (32%), Gaps = 5/43 (11%) Query: 294 HPVTEAITGIDLVHEQIYVASENRLSVQQKDITFSGHAIECRI 336 HP+ +A D V E + L K A C I Sbjct: 353 HPMLDAREAADKVREHMCAPKVLDLETLLK-----LLAFFCGI 390 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.318 0.137 0.399 Gapped Lambda K H 0.267 0.0668 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 5,434,235 Number of extensions: 294924 Number of successful extensions: 1036 Number of sequences better than 10.0: 1 Number of HSP's gapped: 992 Number of HSP's successfully gapped: 60 Length of query: 443 Length of database: 6,263,737 Length adjustment: 97 Effective length of query: 346 Effective length of database: 4,167,664 Effective search space: 1442011744 Effective search space used: 1442011744 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 59 (26.6 bits)