RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254781162|ref|YP_003065575.1| GCN5-related N-acetyltransferase [Candidatus Liberibacter asiaticus str. psy62] (192 letters) >gnl|CDD|140351 PTZ00330, PTZ00330, acetyltransferase; Provisional. Length = 147 Score = 45.6 bits (108), Expect = 8e-06 Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 4/60 (6%) Query: 97 KITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGS-QVIVLVGD--IAYYSKLGFQA 153 K GH+ +VV P Y+ +G+GR LIS + A G +VI+ + +A+Y KLGF+A Sbjct: 80 KCVGHI-EDVVVDPSYRGQGLGRALISDLCEIARSSGCYKVILDCTEDMVAFYKKLGFRA 138 >gnl|CDD|162430 TIGR01575, rimI, ribosomal-protein-alanine acetyltransferase. Members of this model belong to the GCN5-related N-acetyltransferase (GNAT) superfamily. This model covers prokarotes and the archaea. The seed contains a characterized accession for Gram negative E. coli. An untraceable characterized accession (PIR|S66013) for Gram positive B. subtilis scores well (205.0) in the full alignment. Characterized members are lacking in the archaea. Noise cutoff (72.4) was set to exclude M. loti paralog of rimI. Trusted cutoff (80.0) was set at next highest scoring member in the mini-database. Length = 131 Score = 43.1 bits (102), Expect = 5e-05 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 10/81 (12%) Query: 80 EGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL 139 G ++VG + I H+L I V P YQ +GIGR L+ +D A+ +G I L Sbjct: 38 IGGKVVGYAGV---QIVLDEAHILN-IAVKPEYQGQGIGRALLRELIDEAKGRGVNEIFL 93 Query: 140 ------VGDIAYYSKLGFQAV 154 + A Y KLGF + Sbjct: 94 EVRVSNIAAQALYKKLGFNEI 114 >gnl|CDD|181103 PRK07757, PRK07757, acetyltransferase; Provisional. Length = 152 Score = 42.5 bits (101), Expect = 7e-05 Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 10/88 (11%) Query: 76 FLCAEGKRIVGSVRMTPISIEKITGHLLGPI---VVHPLYQNKGIGRKLISMSVDAAEKK 132 ++ E IVG + I L I V Y+ +GIGR L+ ++ A + Sbjct: 44 YVAEEEGEIVGCCALH------ILWEDLAEIRSLAVSEDYRGQGIGRMLVEACLEEAREL 97 Query: 133 G-SQVIVLVGDIAYYSKLGFQAVPWKSL 159 G +V L ++ KLGF+ V ++L Sbjct: 98 GVKRVFALTYQPEFFEKLGFREVDKEAL 125 >gnl|CDD|181904 PRK09491, rimI, ribosomal-protein-alanine N-acetyltransferase; Provisional. Length = 146 Score = 38.8 bits (91), Expect = 0.001 Identities = 21/55 (38%), Positives = 27/55 (49%), Gaps = 6/55 (10%) Query: 106 IVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL------VGDIAYYSKLGFQAV 154 I V P YQ +G+GR L+ +D EK+G + L IA Y LGF V Sbjct: 69 IAVDPDYQRQGLGRALLEHLIDELEKRGVATLWLEVRASNAAAIALYESLGFNEV 123 >gnl|CDD|179611 PRK03624, PRK03624, putative acetyltransferase; Provisional. Length = 140 Score = 35.7 bits (83), Expect = 0.008 Identities = 32/102 (31%), Positives = 45/102 (44%), Gaps = 30/102 (29%) Query: 70 HDLSLSFLCAE-GKRIVGSVRMTPISIEKITGH-----LLGPIVVHPLYQNKGIGRKLIS 123 HD SL FL AE G +VG+V M GH L VHP ++ +GIGR L++ Sbjct: 42 HDPSL-FLVAEVGGEVVGTV-MGGYD-----GHRGWAYYLA---VHPDFRGRGIGRALVA 91 Query: 124 MSVDAAEKK----GS---QVIVLVGD---IAYYSKLGFQAVP 155 EKK G + V + + +Y LG++ Sbjct: 92 ----RLEKKLIARGCPKINLQVREDNDAVLGFYEALGYEEQD 129 >gnl|CDD|179965 PRK05218, PRK05218, heat shock protein 90; Provisional. Length = 613 Score = 34.3 bits (80), Expect = 0.020 Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 4/40 (10%) Query: 93 ISIEKITGHLLGPIVVHPLYQNKGIG-RKLISMSVDAAEK 131 E LL +++H LY NK I R+LIS + DA +K Sbjct: 8 FQAE--VKQLL-HLMIHSLYSNKEIFLRELISNASDAIDK 44 >gnl|CDD|178308 PLN02706, PLN02706, glucosamine 6-phosphate N-acetyltransferase. Length = 150 Score = 34.3 bits (79), Expect = 0.021 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 4/55 (7%) Query: 100 GHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL---VGDIAYYSKLGF 151 GH+ +VV + KG+G+K+I + A G ++L + A+Y K G+ Sbjct: 86 GHI-EDVVVDSAARGKGLGKKIIEALTEHARSAGCYKVILDCSEENKAFYEKCGY 139 >gnl|CDD|178776 PLN03238, PLN03238, probable histone acetyltransferase MYST; Provisional. Length = 290 Score = 32.5 bits (74), Expect = 0.070 Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 3/55 (5%) Query: 79 AEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKG 133 G IVG +S E + L I+ P YQ KG G+ LIS + + ++++G Sbjct: 137 DHGSHIVGYFSKEKVSAE---DYNLACILTLPPYQRKGYGKFLISFAYELSKREG 188 >gnl|CDD|117022 pfam08445, FR47, FR47-like protein. The members of this family are similar to the C-terminal region of the D. melanogaster hypothetical protein FR47. This protein has been found to consist of two N-acyltransferase-like domains swapped with the C-terminal strands. Length = 86 Score = 31.9 bits (73), Expect = 0.11 Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 5/56 (8%) Query: 103 LGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVG-----DIAYYSKLGFQA 153 LG + P ++ +G+G +L++ ++G +V Y KLGF+ Sbjct: 24 LGALQTLPEHRRRGLGSRLVAALARGIAERGITPFAVVVAGNTPSRRLYEKLGFRK 79 >gnl|CDD|177721 PLN00104, PLN00104, MYST -like histone acetyltransferase; Provisional. Length = 450 Score = 31.6 bits (72), Expect = 0.14 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 10/62 (16%) Query: 77 LC---AEGKRIVGSVRMTPISIEKIT--GHLLGPIVVHPLYQNKGIGRKLISMSVDAAEK 131 LC G +VG S EK + + L I+ P YQ KG G+ LI+ S + +++ Sbjct: 283 LCECDDRGCHMVGY-----FSKEKHSEEDYNLACILTLPPYQRKGYGKFLIAFSYELSKR 337 Query: 132 KG 133 +G Sbjct: 338 EG 339 >gnl|CDD|132147 TIGR03103, trio_acet_GNAT, GNAT-family acetyltransferase TIGR03103. Members of this protein family belong to the GNAT family of acetyltransferases. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and an asparagine synthetase family protein. Length = 547 Score = 30.9 bits (70), Expect = 0.20 Identities = 29/112 (25%), Positives = 49/112 (43%), Gaps = 18/112 (16%) Query: 58 VRAAVLLREQGMHDLSLSFLCAE---GKRIVGSVRMTPISIEKI-----TGHLLGPIVVH 109 VR +L + H ++++L AE I+G+V + K G L + V Sbjct: 109 VRVDFVLDHR--HSRAITYLVAEDEASGAIIGTV--MGVDHRKAFNDPEHGSSLWCLAVD 164 Query: 110 PLYQNKGIGRKLISMSVDAAEKKGSQVIVL--VGD----IAYYSKLGFQAVP 155 P + G+G L+ + + +G + L + D IA Y KLGF+ +P Sbjct: 165 PQAAHPGVGEALVRALAEHFQSRGCAYMDLSVMHDNEQAIALYEKLGFRRIP 216 >gnl|CDD|132489 TIGR03448, mycothiol_MshD, mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species. Length = 292 Score = 30.1 bits (68), Expect = 0.43 Identities = 36/128 (28%), Positives = 51/128 (39%), Gaps = 25/128 (19%) Query: 57 FVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKG 116 +R LRE G + + + IVG + P T + +VVHP ++ +G Sbjct: 34 VLRG---LREPG-AGHTRHLVAVDSDPIVGYANLVPARG---TDPAMAELVVHPAHRRRG 86 Query: 117 IGRKLISMSVDAA-EKKGSQVIV-----LVGDIAYYSKLGFQ--------AVPWKSLILP 162 IGR LI A K G ++ V L A S+LG P + L LP Sbjct: 87 IGRALI----RALLAKGGGRLRVWAHGDLPAARALASRLGLVPTRELLQMRRPLRDLELP 142 Query: 163 APVDPNRV 170 P P+ V Sbjct: 143 EPQVPDGV 150 >gnl|CDD|162253 TIGR01211, ELP3, histone acetyltransferase, ELP3 family. The Saccharomyces cerevisiae member YPL086C has been characterized in vitro as an N-terminal acetyltransferase for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases. Length = 522 Score = 29.7 bits (67), Expect = 0.44 Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 12/62 (19%) Query: 101 HLLGPIVVHPL-------YQNKGIGRKLISMSVDAAEKKGSQ---VIVLVGDIAYYSKLG 150 H+ G V P+ +Q++G GR+L+ + A ++GS+ VI +G YY KLG Sbjct: 453 HVYGSEV--PIGERGDDEWQHRGYGRRLLEEAERIAAEEGSEKILVISGIGVREYYRKLG 510 Query: 151 FQ 152 ++ Sbjct: 511 YE 512 >gnl|CDD|181170 PRK07922, PRK07922, N-acetylglutamate synthase; Validated. Length = 169 Score = 29.5 bits (67), Expect = 0.56 Identities = 17/63 (26%), Positives = 32/63 (50%), Gaps = 7/63 (11%) Query: 106 IVVHPLYQNKGIGRKLISMSVDAAEKKG-SQVIVLVGDIAYYSKLGFQAVPWKSLILPAP 164 + V P + +G+G ++ +D A + G S+V VL ++ ++++ GF I P Sbjct: 76 VAVDPAARGRGVGHAIVERLLDVARELGLSRVFVLTFEVEFFARHGFVE------IDGTP 129 Query: 165 VDP 167 V P Sbjct: 130 VTP 132 >gnl|CDD|184454 PRK14016, PRK14016, cyanophycin synthetase; Provisional. Length = 727 Score = 29.0 bits (66), Expect = 0.74 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 7/41 (17%) Query: 105 PIVVHPLYQNKGIG-------RKLISMSVDAAEKKGSQVIV 138 P+VV PL N G G R+ I + A K+ S VIV Sbjct: 251 PVVVKPLDGNHGRGVTVNITTREEIEAAYAVASKESSDVIV 291 >gnl|CDD|182510 PRK10514, PRK10514, putative acetyltransferase; Provisional. Length = 145 Score = 28.0 bits (63), Expect = 1.5 Identities = 15/69 (21%), Positives = 29/69 (42%), Gaps = 15/69 (21%) Query: 92 PISIEKITGHLLGPIVVHPLYQNKGIGRKLIS--------MSVDAAEKKGSQVIVLVGDI 143 P+ ++G + + V P + G+GR L+ ++ D E+ V Sbjct: 61 PVGFMLLSGGHMEALFVDPDVRGCGVGRMLVEHALSLHPELTTDVNEQNEQAV------- 113 Query: 144 AYYSKLGFQ 152 +Y K+GF+ Sbjct: 114 GFYKKMGFK 122 >gnl|CDD|173359 PTZ00064, PTZ00064, histone acetyltransferase; Provisional. Length = 552 Score = 27.7 bits (61), Expect = 2.2 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 7/56 (12%) Query: 80 EGKRIVGSVRMTPISIEKIT--GHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKG 133 EG IVG S EK++ + L I+ P YQ KG G+ L+ +S + K+G Sbjct: 367 EGCHIVGY-----FSKEKVSLLHYNLACILTLPCYQRKGYGKLLVDLSYKLSLKEG 417 >gnl|CDD|177369 PHA02528, 43, DNA polymerase; Provisional. Length = 881 Score = 27.3 bits (61), Expect = 2.6 Identities = 14/40 (35%), Positives = 23/40 (57%), Gaps = 6/40 (15%) Query: 126 VDAAEKKGSQVIVLVGDIAYYSKLGFQAV-----PWKSLI 160 VD + K ++I LV +AYY+K+ F+ V W ++I Sbjct: 317 VDRLDDK-RKLIELVLSMAYYAKINFEDVFSPIKTWDAII 355 >gnl|CDD|162681 TIGR02068, cya_phycin_syn, cyanophycin synthetase. Cyanophycin synthesis is analogous to polyhydroxyalkanoic acid (PHA) biosynthesis, except that PHA polymers lack nitrogen and may be made under nitrogen-limiting conditions. Length = 864 Score = 26.7 bits (59), Expect = 4.1 Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 7/42 (16%) Query: 104 GPIVVHPLYQNKGIG-------RKLISMSVDAAEKKGSQVIV 138 P+V+ P N G G R I + +AA ++ S VIV Sbjct: 249 YPVVIKPYDGNHGRGVTINILTRDEIESAYEAAVEESSGVIV 290 >gnl|CDD|152258 pfam11822, DUF3342, Domain of unknown function (DUF3342). This family of proteins are functionally uncharacterized. This family is found in bacteria. This presumed domain is typically between 170 to 303 amino acids in length. The N-terminal half of this family is a BTB-like domain. Length = 302 Score = 26.2 bits (58), Expect = 4.7 Identities = 12/63 (19%), Positives = 25/63 (39%), Gaps = 3/63 (4%) Query: 1 MHDNNIELRKFLSIAFFEFICWRRSRWQKIGA---FFLERLEHDSSICAMHADSFGPGRF 57 + D + ++ +++S E W + W+ G+ + R H S+ S+ P F Sbjct: 214 VADKSWDVLEYISWLTEELESWAKVYWRIWGSINYLYCRRCNHLFSLTDTTHCSYHPAAF 273 Query: 58 VRA 60 Sbjct: 274 SFP 276 >gnl|CDD|183948 PRK13285, PRK13285, flagellar assembly protein FliW; Provisional. Length = 148 Score = 26.0 bits (58), Expect = 6.1 Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 3/32 (9%) Query: 92 PISIEKITGHLLGPIVVHPLYQNKGIGRKLIS 123 P EKIT +L PI+++ K G+++I Sbjct: 99 PEDPEKITANLQAPIIINK---KKRKGKQVIL 127 >gnl|CDD|184245 PRK13688, PRK13688, hypothetical protein; Provisional. Length = 156 Score = 26.1 bits (58), Expect = 6.2 Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 6/49 (12%) Query: 108 VHPLYQNKGIGRKLISMSVDAAEKKGS--QVIVLVGDIAYYSKLGFQAV 154 V P YQN+G G L VD A+ + I ++ KLGF V Sbjct: 87 VLPKYQNRGYGEML----VDFAKSFQLPIKTIARNKSKDFWLKLGFTPV 131 >gnl|CDD|177387 PHA02553, 6, baseplate wedge subunit; Provisional. Length = 611 Score = 25.8 bits (57), Expect = 8.3 Identities = 17/74 (22%), Positives = 27/74 (36%), Gaps = 1/74 (1%) Query: 74 LSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKG 133 +SF + V S + S E + + IV + GIG+ +I D G Sbjct: 478 ISFGNQIKEGSVKSSGFSFPSEEDDFSYDVR-IVSTDDDTSTGIGKIVIGPFADKTAIDG 536 Query: 134 SQVIVLVGDIAYYS 147 + VG + Y Sbjct: 537 RDLYYEVGTVDYVK 550 >gnl|CDD|183959 PRK13300, PRK13300, tRNA CCA-pyrophosphorylase; Provisional. Length = 447 Score = 25.6 bits (57), Expect = 8.6 Identities = 10/24 (41%), Positives = 12/24 (50%), Gaps = 8/24 (33%) Query: 157 KSLILPAPVDPNRVLFLPLVQNVA 180 L++ PVDPNR NVA Sbjct: 221 DPLVVIDPVDPNR--------NVA 236 >gnl|CDD|149336 pfam08217, DUF1712, Fungal domain of unknown function (DUF1712). The function of this family of proteins is unknown. Length = 468 Score = 25.4 bits (56), Expect = 9.9 Identities = 11/37 (29%), Positives = 15/37 (40%), Gaps = 4/37 (10%) Query: 156 WKSLILPAPVDPNRVL----FLPLVQNVAQNIKGIVR 188 W+ L LP D L L L + I+G +R Sbjct: 140 WQDLKLPPECDLLDELGTVQNLLLDKENYLGIQGFIR 176 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.327 0.142 0.434 Gapped Lambda K H 0.267 0.0769 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 3,237,353 Number of extensions: 203857 Number of successful extensions: 496 Number of sequences better than 10.0: 1 Number of HSP's gapped: 488 Number of HSP's successfully gapped: 34 Length of query: 192 Length of database: 5,994,473 Length adjustment: 88 Effective length of query: 104 Effective length of database: 4,092,969 Effective search space: 425668776 Effective search space used: 425668776 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 54 (24.5 bits)