RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780157|ref|YP_003064570.1| phosphoribosylaminoimidazole carboxylase catalytic subunit protein [Candidatus Liberibacter asiaticus str. psy62] (165 letters) >gnl|CDD|130230 TIGR01162, purE, phosphoribosylaminoimidazole carboxylase, PurE protein. Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. This model represents PurK, an N5-CAIR mutase. Length = 156 Score = 215 bits (550), Expect = 4e-57 Identities = 91/157 (57%), Positives = 117/157 (74%), Gaps = 2/157 (1%) Query: 7 VAIIMGSQSDWKIMKYAADMLDTLGIDYEARIISAHRTPDRLIEFAKNARFEGFKLIIAG 66 V IIMGS SD MK AAD+L+ GI YE R++SAHRTP+ ++E+AK A G K+IIAG Sbjct: 1 VGIIMGSDSDLPTMKKAADILEEFGIPYELRVVSAHRTPELMLEYAKEAEERGIKVIIAG 60 Query: 67 AGGAAHLPGMIAAMTSLPVLGVPIISQTLGGIDSLLSIVQMPAGVPVGTMAIGQSGAINA 126 AGGAAHLPGM+AA+T LPV+GVP+ S+ L G+DSLLSIVQMP+GVPV T+AIG +G NA Sbjct: 61 AGGAAHLPGMVAALTPLPVIGVPVPSKALSGLDSLLSIVQMPSGVPVATVAIGNAG--NA 118 Query: 127 SLLAVAILALDDKELTDRLNEWRTQQTISISEYPKDI 163 +LLA IL + D EL ++L E+R Q + + K + Sbjct: 119 ALLAAQILGIKDPELAEKLKEYRENQKEEVLKKNKKL 155 >gnl|CDD|178534 PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase. Length = 577 Score = 209 bits (534), Expect = 3e-55 Identities = 83/145 (57%), Positives = 107/145 (73%), Gaps = 2/145 (1%) Query: 4 APPVAIIMGSQSDWKIMKYAADMLDTLGIDYEARIISAHRTPDRLIEFAKNARFEGFKLI 63 P V IIMGS SD MK AA++LD+ G+ YE I+SAHRTP+R+ +A++A G ++I Sbjct: 410 TPLVGIIMGSDSDLPTMKDAAEILDSFGVPYEVTIVSAHRTPERMFSYARSAHSRGLQVI 469 Query: 64 IAGAGGAAHLPGMIAAMTSLPVLGVPIISQTLGGIDSLLSIVQMPAGVPVGTMAIGQSGA 123 IAGAGGAAHLPGM+A+MT LPV+GVP+ + L G+DSLLSIVQMP GVPV T+AIG A Sbjct: 470 IAGAGGAAHLPGMVASMTPLPVIGVPVKTSHLDGLDSLLSIVQMPRGVPVATVAIGN--A 527 Query: 124 INASLLAVAILALDDKELTDRLNEW 148 NA LLAV +L D +L D++ + Sbjct: 528 TNAGLLAVRMLGASDPDLLDKMEAY 552 >gnl|CDD|179386 PRK02227, PRK02227, hypothetical protein; Provisional. Length = 238 Score = 32.2 bits (74), Expect = 0.060 Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 8/71 (11%) Query: 11 MGSQSDWKIMKYAAD------MLDTLGIDYEARIISAHRTPDRLIEFAKNARFEGFKLII 64 +GS S + AAD MLDT D + + H + L EF AR G + Sbjct: 128 VGSVSPLSLPAIAADAGFDGAMLDTAIKD--GKSLFDHMDEEELAEFVAEARSHGLMSAL 185 Query: 65 AGAGGAAHLPG 75 AG+ +P Sbjct: 186 AGSLKFEDIPA 196 >gnl|CDD|178242 PLN02635, PLN02635, disproportionating enzyme. Length = 538 Score = 30.1 bits (68), Expect = 0.30 Identities = 20/59 (33%), Positives = 28/59 (47%), Gaps = 2/59 (3%) Query: 99 DSLLSIVQMPAGVPVGTMAIGQSGAINASLLAVAI--LALDDKELTDRLNEWRTQQTIS 155 D LL ++P VPVG + + L+A A L L D EL + L ++R IS Sbjct: 107 DGLLEEDELPEPVPVGKVDFSAVAELKDPLIAKAAERLLLSDGELKEELEDFRKDPEIS 165 >gnl|CDD|161922 TIGR00549, mevalon_kin, mevalonate kinase. Paracoccus exhibits two genes within the phosphomevalonate/mevalonate kinase family, one of which falls between trusted and noise cutoffs of this model. The degree of divergence is high, but if the trees created from this model are correct, the proper names of these genes have been swapped. Length = 273 Score = 30.0 bits (68), Expect = 0.36 Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 13/61 (21%) Query: 18 KIMKYAADMLDTLGIDYEARIISAHRTPDRLIEFAKNARFEGFKLIIAGAGGAAHLPGMI 77 ++M +L LG+ +H D+L+E A+ A G KL AG GG MI Sbjct: 225 ELMNINQGLLKALGV--------SHPKLDQLVETARKAGALGAKLTGAGGGGC-----MI 271 Query: 78 A 78 A Sbjct: 272 A 272 >gnl|CDD|183856 PRK13055, PRK13055, putative lipid kinase; Reviewed. Length = 334 Score = 29.6 bits (67), Expect = 0.47 Identities = 24/56 (42%), Positives = 28/56 (50%), Gaps = 8/56 (14%) Query: 18 KIMKYAADMLDTL-GIDYEARIISAHRT---PDRLIEFAKNARFEGFKLIIAGAGG 69 + K AD+LD L YE SA +T P+ AK A GF LIIA AGG Sbjct: 17 IMKKNVADILDILEQAGYET---SAFQTTPEPNSAKNEAKRAAEAGFDLIIA-AGG 68 >gnl|CDD|184615 PRK14315, glmM, phosphoglucosamine mutase; Provisional. Length = 448 Score = 29.5 bits (67), Expect = 0.47 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 8/41 (19%) Query: 37 RIISAHRTPDRLIEFAK-----NARFEGFKLIIAGAGGAAH 72 RI AH R IEFAK + R +G ++++ A GAA+ Sbjct: 152 RIDDAH---GRYIEFAKRTLPRDLRLDGLRVVVDCANGAAY 189 >gnl|CDD|184102 PRK13511, PRK13511, 6-phospho-beta-galactosidase; Provisional. Length = 469 Score = 28.4 bits (64), Expect = 0.88 Identities = 10/20 (50%), Positives = 15/20 (75%) Query: 15 SDWKIMKYAADMLDTLGIDY 34 D++I+K A D+ D LGI+Y Sbjct: 279 EDFEILKAAKDLNDFLGINY 298 >gnl|CDD|179412 PRK02308, uvsE, putative UV damage endonuclease; Provisional. Length = 315 Score = 28.0 bits (63), Expect = 1.3 Identities = 9/21 (42%), Positives = 15/21 (71%) Query: 18 KIMKYAADMLDTLGIDYEARI 38 K ++Y A +LD +GID ++I Sbjct: 130 KDLEYHAKLLDLMGIDDSSKI 150 >gnl|CDD|181021 PRK07534, PRK07534, methionine synthase I; Validated. Length = 336 Score = 27.8 bits (62), Expect = 1.3 Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 6/67 (8%) Query: 39 ISAHRTPDRLIEFAKNARFEGFKLIIAGAGGA--AHLPGMIAAMTSLPVLGVPI---ISQ 93 I TP+ + E+A AR G + II G G HL M AA+ + P P I + Sbjct: 251 IHYDGTPELMAEYAVLARDAGAR-IIGGCCGTMPEHLAAMRAALDARPRGPRPSLETIVE 309 Query: 94 TLGGIDS 100 LG S Sbjct: 310 RLGPFSS 316 >gnl|CDD|178820 PRK00046, murB, UDP-N-acetylenolpyruvoylglucosamine reductase; Provisional. Length = 334 Score = 27.4 bits (62), Expect = 1.9 Identities = 14/41 (34%), Positives = 24/41 (58%), Gaps = 2/41 (4%) Query: 29 TLGIDYEAR-IISAHRTPDRLIEFAKNARFEGFKLIIAGAG 68 T GID AR ++ A + ++L+E +AR G +++ G G Sbjct: 14 TFGIDARARHLVEAE-SEEQLLEALADARAAGLPVLVLGGG 53 >gnl|CDD|161876 TIGR00433, bioB, biotin synthetase. Catalyzes the last step of the biotin biosynthesis pathway. Length = 296 Score = 27.0 bits (60), Expect = 2.2 Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 5/61 (8%) Query: 34 YEARIISAHRTPDRLIEFAKNARFEGFKLIIAGAGGAAH----LPGMIAAMTSLPVLGVP 89 + + IIS H DR+ +NA+ G K+ G G G+ A+ +LP VP Sbjct: 147 FYSNIISTHTYDDRVDTL-ENAKKAGLKVCSGGIFGLGETVEDRIGLALALANLPPESVP 205 Query: 90 I 90 I Sbjct: 206 I 206 >gnl|CDD|147831 pfam05893, LuxC, Acyl-CoA reductase (LuxC). This family consists of several bacterial Acyl-CoA reductase (LuxC) proteins. The channelling of fatty acids into the fatty aldehyde substrate for the bacterial bioluminescence reaction is catalysed by a fatty acid reductase multienzyme complex, which channels fatty acids through the thioesterase (LuxD), synthetase (LuxE) and reductase (LuxC) components. Length = 400 Score = 26.9 bits (60), Expect = 2.3 Identities = 5/28 (17%), Positives = 13/28 (46%) Query: 137 DDKELTDRLNEWRTQQTISISEYPKDIP 164 D ++ + RT QT+ ++ + + Sbjct: 332 DIFDVVRYVTPNRTLQTVGLAPWESRLK 359 >gnl|CDD|177291 PHA00684, PHA00684, hypothetical protein. Length = 128 Score = 26.7 bits (59), Expect = 3.0 Identities = 12/35 (34%), Positives = 16/35 (45%) Query: 46 DRLIEFAKNARFEGFKLIIAGAGGAAHLPGMIAAM 80 +R I +A F++ G G A HL IA M Sbjct: 63 NRFIAYATAHPHLNFQVTRVGCGLAGHLDADIAPM 97 >gnl|CDD|180468 PRK06205, PRK06205, acetyl-CoA acetyltransferase; Provisional. Length = 404 Score = 26.1 bits (58), Expect = 4.2 Identities = 18/39 (46%), Positives = 20/39 (51%), Gaps = 2/39 (5%) Query: 109 AGVPVGTMAIGQSGAINASLLAVAILALDDKELTDRLNE 147 AGV M IG A LA A L LDD +L + LNE Sbjct: 289 AGVEPSRMGIGPVPATEK-ALARAGLTLDDIDLIE-LNE 325 >gnl|CDD|185096 PRK15174, PRK15174, Vi polysaccharide export protein VexE; Provisional. Length = 656 Score = 25.9 bits (56), Expect = 4.9 Identities = 18/80 (22%), Positives = 32/80 (40%), Gaps = 5/80 (6%) Query: 14 QSDWKIMKYAADMLDTLGIDY-EARIISAHRTPDRLIEFAKNARFEGFKLIIAGAG---G 69 + +W+ + D +D+ E R A RL + + +F +L + G Sbjct: 426 RDEWERRAKWGYLADNFLLDWLECRGEQADEPMYRLADISHVEQFFQ-RLQLDQRGCIIV 484 Query: 70 AAHLPGMIAAMTSLPVLGVP 89 +AHL M A L +L + Sbjct: 485 SAHLGAMYAGPMILSLLEMN 504 >gnl|CDD|185331 PRK15433, PRK15433, branched-chain amino acid transport system 2 carrier protein BrnQ; Provisional. Length = 439 Score = 26.1 bits (57), Expect = 5.0 Identities = 15/42 (35%), Positives = 24/42 (57%) Query: 75 GMIAAMTSLPVLGVPIISQTLGGIDSLLSIVQMPAGVPVGTM 116 G + LPVL V +++ GG+DSL + + AGV + T+ Sbjct: 47 GFLITAVGLPVLTVVALAKVGGGVDSLSTPIGKVAGVLLATV 88 >gnl|CDD|137505 PRK09751, PRK09751, putative ATP-dependent helicase Lhr; Provisional. Length = 1490 Score = 26.0 bits (57), Expect = 5.4 Identities = 9/39 (23%), Positives = 18/39 (46%) Query: 75 GMIAAMTSLPVLGVPIISQTLGGIDSLLSIVQMPAGVPV 113 G++ A P L G+D ++ +++ AGV + Sbjct: 1065 GVLPATDGSPALFASTSPGVYEGVDGVMRVIEQLAGVGL 1103 >gnl|CDD|132150 TIGR03106, trio_M42_hydro, hydrolase, peptidase M42 family. This model describes a subfamily of MEROPS peptidase family M42, a glutamyl aminopeptidase family that also includes the cellulase CelM from Clostridium thermocellum and deblocking aminopeptidases that can remove acylated amino acids. Members of this family occur in a three gene cassette with an amidotransferase (TIGR03104)in the asparagine synthase (glutamine-hydrolyzing) family, and a probable acetyltransferase (TIGR03103) in the GNAT family. Length = 343 Score = 25.5 bits (56), Expect = 6.8 Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 8/40 (20%) Query: 18 KIMKYAADMLDTLGIDYE--------ARIISAHRTPDRLI 49 +++Y A+ L+ LGI+YE A + TP R + Sbjct: 24 AVVRYVAERLEDLGIEYELTRRGAIRATLPGREATPARAV 63 >gnl|CDD|183786 PRK12843, PRK12843, putative FAD-binding dehydrogenase; Reviewed. Length = 578 Score = 25.5 bits (56), Expect = 7.1 Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 3/42 (7%) Query: 45 PDRLIEFAKNARFEGFKLIIAGAGGAAHLPGMIAAMTSLPVL 86 L +A F+ +I+ GAG A + AA+ L VL Sbjct: 5 VSELSPERWDAEFD---VIVIGAGAAGMSAALFAAIAGLKVL 43 >gnl|CDD|115546 pfam06896, DUF1268, Protein of unknown function (DUF1268). This family consists of several bacterial and phage proteins of around 115 residues in length. The function of this family is unknown. Length = 114 Score = 25.2 bits (55), Expect = 8.1 Identities = 10/28 (35%), Positives = 16/28 (57%), Gaps = 1/28 (3%) Query: 132 AILALDDKELTDRLNEWRTQQTISISEY 159 AIL L+ KE D+L + Q+T+ + Sbjct: 64 AILNLNKKE-YDKLEDLENQETMEVVSK 90 >gnl|CDD|169701 PRK09189, PRK09189, uroporphyrinogen-III synthase; Validated. Length = 240 Score = 25.4 bits (56), Expect = 8.4 Identities = 11/28 (39%), Positives = 15/28 (53%) Query: 52 AKNARFEGFKLIIAGAGGAAHLPGMIAA 79 A+ AR GF+ +I G G L +AA Sbjct: 86 AEAARELGFRHVIEGGGDGVRLAETVAA 113 >gnl|CDD|148297 pfam06605, DUF1142, Protein of unknown function (DUF1142). This family consists of hypothetical bacterial and viral proteins of unknown function. Length = 318 Score = 25.3 bits (56), Expect = 8.5 Identities = 10/18 (55%), Positives = 12/18 (66%) Query: 28 DTLGIDYEARIISAHRTP 45 + LGID E RI+S R P Sbjct: 269 EPLGIDTEVRIVSITRYP 286 >gnl|CDD|162965 TIGR02665, molyb_mobA, molybdopterin-guanine dinucleotide biosynthesis protein A, proteobacterial. In many molybdopterin-containing enzymes, including nitrate reductase and dimethylsulfoxide reductase, the cofactor is molybdopterin-guanine dinucleotide. The family described here contains MobA, molybdopterin-guanine dinucleotide biosynthesis protein A, from the Proteobacteria only. MobA can reconstitute molybdopterin-guanine dinucleotide biosynthesis without the product of the neighboring gene MobB. The probable MobA proteins of other lineages differ sufficiently that they are not included in scope of this family. Length = 186 Score = 25.3 bits (56), Expect = 8.6 Identities = 14/46 (30%), Positives = 20/46 (43%), Gaps = 10/46 (21%) Query: 38 IISAHRTPDRLIEFAKNARFEGFKLII---AGAGGAAHLPGMIAAM 80 ISA+R P+R GF L + A A L G++A + Sbjct: 45 AISANRNPER-------YAQAGFGLPVVPDALADFPGPLAGILAGL 83 >gnl|CDD|181303 PRK08224, ligC, ATP-dependent DNA ligase; Reviewed. Length = 350 Score = 25.2 bits (56), Expect = 9.4 Identities = 10/19 (52%), Positives = 14/19 (73%) Query: 126 ASLLAVAILALDDKELTDR 144 AS +A +LAL D++LT R Sbjct: 114 ASFVAFDLLALGDRDLTGR 132 >gnl|CDD|181969 PRK09584, tppB, putative tripeptide transporter permease; Reviewed. Length = 500 Score = 25.1 bits (55), Expect = 9.9 Identities = 19/70 (27%), Positives = 29/70 (41%), Gaps = 7/70 (10%) Query: 15 SDWKIMKYAADMLDTLGIDYEARIISAHRTPDRLIEFAKNARFEGFKLIIAGAGGAAHLP 74 +W I Y + L I + A P RL+ F + F L AGA A + Sbjct: 382 VNWLIASYGLQSIGELMISGLGLAMVAQLVPQRLMGFIMGSWF----LTTAGA---ALIA 434 Query: 75 GMIAAMTSLP 84 G +A + ++P Sbjct: 435 GYVANLMAVP 444 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.320 0.136 0.389 Gapped Lambda K H 0.267 0.0618 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 2,791,686 Number of extensions: 180295 Number of successful extensions: 523 Number of sequences better than 10.0: 1 Number of HSP's gapped: 521 Number of HSP's successfully gapped: 62 Length of query: 165 Length of database: 5,994,473 Length adjustment: 86 Effective length of query: 79 Effective length of database: 4,136,185 Effective search space: 326758615 Effective search space used: 326758615 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 53 (24.4 bits)