RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254781093|ref|YP_003065506.1| hypothetical protein CLIBASIA_04975 [Candidatus Liberibacter asiaticus str. psy62] (225 letters) >gnl|CDD|132144 TIGR03100, hydr1_PEP, hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily. Length = 274 Score = 40.6 bits (95), Expect = 4e-04 Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 7/89 (7%) Query: 57 GFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQ 116 GF LRF++RG+G SEGE +G +D AAA+D + P + G A ++ Sbjct: 57 GFPVLRFDYRGMGDSEGENLGFEGIDADIAAAIDAFREAAPHLRRIVAWGLCDAASAALL 116 Query: 117 LLMRRPEINGFISVAP-------QPKSYD 138 + G + + P Q S Sbjct: 117 YAPADLRVAGLVLLNPWVRTEAAQAASRI 145 >gnl|CDD|162143 TIGR00976, /NonD, putative hydrolase, CocE/NonD family. This model represents a protein subfamily that includes the cocaine esterase CocE, several glutaryl-7-ACA acylases, and the putative diester hydrolase NonD of Streptomyces griseus (all hydrolases). This family shows extensive, low-level similarity to a family of xaa-pro dipeptidyl-peptidases, and local similarity by PSI-BLAST to many other hydrolases. Length = 550 Score = 38.2 bits (89), Expect = 0.002 Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 1/41 (2%) Query: 53 FQQRGFVSLRFNFRGIGRSEGEFDYGD-GELSDAAAALDWV 92 F +G+ + + RG G SEGEFD E +D +DW+ Sbjct: 49 FVAQGYAVVIQDTRGRGASEGEFDLLGSDEAADGYDLVDWI 89 >gnl|CDD|163354 TIGR03611, RutD, pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067. Length = 257 Score = 34.3 bits (79), Expect = 0.028 Identities = 27/88 (30%), Positives = 38/88 (43%), Gaps = 10/88 (11%) Query: 52 LFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGA 111 + QR F + ++ RG GRS GE G A L + +LN E G++ G Sbjct: 35 VLTQR-FHVVTYDHRGTGRSPGELPPGYSIAHMADDVLQLLDALNIER--FHFVGHALGG 91 Query: 112 WISMQLLMRRPE-------INGFISVAP 132 I +QL +R PE IN + P Sbjct: 92 LIGLQLALRYPERLLSLVLINAWSRPDP 119 >gnl|CDD|177511 PHA03061, PHA03061, putative DNA-binding virion core protein; Provisional. Length = 311 Score = 30.8 bits (70), Expect = 0.26 Identities = 17/68 (25%), Positives = 26/68 (38%), Gaps = 21/68 (30%) Query: 163 SDVKDLVNKLMNQKGISITHKVIPDANHFFIG---------KVDELINECAHYLDNSLDE 213 SDVK L +L A+ F K+++LIN+ LDE Sbjct: 125 SDVKVLAARLKT------------SASSFEFNGHTYHLENDKIEDLINQLVKNGAIILDE 172 Query: 214 KFTLLKSI 221 K ++ S+ Sbjct: 173 KSSIKDSM 180 >gnl|CDD|163135 TIGR03101, hydr2_PEP, hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily. Length = 266 Score = 30.6 bits (69), Expect = 0.31 Identities = 29/129 (22%), Positives = 45/129 (34%), Gaps = 15/129 (11%) Query: 8 GPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMND--NIVYQLFYLFQQRGFVSLRFNF 65 P G Y P P ++++ P F MN +V F GF L+ + Sbjct: 7 APHGFRFCLYHPPVA-VGPRGVVIYLPP-FAEEMNKSRRMVALQARAFAAGGFGVLQIDL 64 Query: 66 RGIGRSEGEFDYG--DGELSDAAAALDWVQSLNPESKSC---WIAGYSFGAWISMQLLMR 120 G G S G+F D D AAA W+ + + G GA +++ Sbjct: 65 YGCGDSAGDFAAARWDVWKEDVAAAYRWL-----IEQGHPPVTLWGLRLGALLALDAANP 119 Query: 121 RP-EINGFI 128 + N + Sbjct: 120 LAAKCNRLV 128 >gnl|CDD|165939 PLN02298, PLN02298, hydrolase, alpha/beta fold family protein. Length = 330 Score = 29.7 bits (67), Expect = 0.66 Identities = 40/142 (28%), Positives = 59/142 (41%), Gaps = 14/142 (9%) Query: 6 FNGPSGR--LEGRYQPSTNPNAPIALILHPHPRFGGTMND-NIVYQLFYLF-QQRGFVSL 61 F P G + PS + + P ALI H G ND + +Q +F Q GF Sbjct: 37 FTSPRGLSLFTRSWLPS-SSSPPRALIFMVH----GYGNDISWTFQSTAIFLAQMGFACF 91 Query: 62 RFNFRGIGRSEGEFDYG---DGELSDAAAALDWVQSLNP-ESKSCWIAGYSFGAWISMQL 117 + G GRSEG Y D + D + + V+ + ++ G S G I + + Sbjct: 92 ALDLEGHGRSEGLRAYVPNVDLVVEDCLSFFNSVKQREEFQGLPRFLYGESMGGAICLLI 151 Query: 118 LMRRPE-INGFISVAPQPKSYD 138 + PE +G + VAP K D Sbjct: 152 HLANPEGFDGAVLVAPMCKISD 173 >gnl|CDD|183825 PRK12903, secA, preprotein translocase subunit SecA; Reviewed. Length = 925 Score = 29.3 bits (66), Expect = 0.76 Identities = 22/75 (29%), Positives = 30/75 (40%), Gaps = 15/75 (20%) Query: 150 LIING-SNDTVATTSDVKDLVNKLMNQ--------KGISITHKVIPDANHFFIGKVDELI 200 LII+G ++ V L K IS+T K I AN FF K+ L Sbjct: 219 LIISGGQSNDSNLYLAADQFVRTLKEDDYKIDEETKAISLTEKGIKKANKFF--KLKNLY 276 Query: 201 N----ECAHYLDNSL 211 + E H + N+L Sbjct: 277 DIENSELVHRIQNAL 291 >gnl|CDD|131480 TIGR02427, protocat_pcaD, 3-oxoadipate enol-lactonase. Members of this family are 3-oxoadipate enol-lactonase. Note that the substrate is known as 3-oxoadipate enol-lactone, 2-oxo-2,3-dihydrofuran-5-acetate, 4,5-Dihydro-5-oxofuran-2-acetate, and 5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the catalyzes the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate all can be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this model were chosen from within protocatechuate degradation operons of at least three genes of the pathway, from genomes with the complete pathway through beta-ketoadipate. Length = 251 Score = 28.9 bits (65), Expect = 1.0 Identities = 24/71 (33%), Positives = 31/71 (43%), Gaps = 7/71 (9%) Query: 56 RGFVSLRFNFRGIGRSE-GEFDYGDGELS-DAAAALDWVQSLNPESKSCWIAGYSFGAWI 113 F LR++ RG G S+ E Y +L+ D A LD L E G S G I Sbjct: 38 PDFRVLRYDKRGHGLSDAPEGPYSIEDLADDVLALLD---HLGIERAV--FCGLSLGGLI 92 Query: 114 SMQLLMRRPEI 124 + L RRP+ Sbjct: 93 AQGLAARRPDR 103 >gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional. Length = 371 Score = 28.4 bits (64), Expect = 1.7 Identities = 18/59 (30%), Positives = 27/59 (45%), Gaps = 7/59 (11%) Query: 67 GIGRSEGEFDYGD-GELSDA-AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE 123 G G S G EL+ A A LD +L E + G+S G ++++L R P+ Sbjct: 167 GHGASSKAVGAGSLDELAAAVLAFLD---ALGIER--AHLVGHSMGGAVALRLAARAPQ 220 >gnl|CDD|184688 PRK14457, PRK14457, ribosomal RNA large subunit methyltransferase N; Provisional. Length = 345 Score = 28.0 bits (63), Expect = 1.9 Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 6/35 (17%) Query: 174 NQKGISITHKVIPDANHFFIGKVDELINECAHYLD 208 NQK + +IP A ++ I + L+ +C HY+ Sbjct: 222 NQK---LRETLIPSAKNYPI---ENLLEDCRHYVA 250 >gnl|CDD|151520 pfam11075, DUF2780, Protein of unknown function VcgC/VcgE (DUF2780). This is a bacterial family of uncharacterized proteins. Length = 168 Score = 26.9 bits (60), Expect = 3.9 Identities = 11/24 (45%), Positives = 15/24 (62%) Query: 156 NDTVATTSDVKDLVNKLMNQKGIS 179 VA T++ DLV LM+Q G+S Sbjct: 36 AAQVAATTETSDLVGSLMSQLGVS 59 >gnl|CDD|181029 PRK07560, PRK07560, elongation factor EF-2; Reviewed. Length = 731 Score = 27.1 bits (61), Expect = 4.0 Identities = 8/11 (72%), Positives = 8/11 (72%) Query: 192 FIGKVDELINE 202 FI KVD LI E Sbjct: 145 FINKVDRLIKE 155 >gnl|CDD|151780 pfam11339, DUF3141, Protein of unknown function (DUF3141). This family of proteins with unknown function appears to be restricted to Proteobacteria. Length = 581 Score = 26.8 bits (60), Expect = 4.8 Identities = 18/52 (34%), Positives = 25/52 (48%) Query: 86 AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSY 137 AA L V L+P++ + G W + L RP++ G I VA P SY Sbjct: 126 AAFLREVIELHPDAPKPAVIGNCQAGWALLMLAALRPDLFGPIIVAGAPLSY 177 >gnl|CDD|185393 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L; Provisional. Length = 337 Score = 26.3 bits (58), Expect = 6.4 Identities = 9/22 (40%), Positives = 16/22 (72%) Query: 177 GISITHKVIPDANHFFIGKVDE 198 GI+I K+IP A ++F+G+ + Sbjct: 260 GITIRDKLIPYAVYWFLGEAID 281 >gnl|CDD|179529 PRK03057, PRK03057, hypothetical protein; Provisional. Length = 180 Score = 25.9 bits (57), Expect = 7.7 Identities = 8/16 (50%), Positives = 10/16 (62%) Query: 21 TNPNAPIALILHPHPR 36 NPN P+ LILH + Sbjct: 87 LNPNEPLYLILHFPGK 102 >gnl|CDD|152581 pfam12146, Hydrolase_4, Putative lysophospholipase. This domain is found in bacteria and eukaryotes and is approximately 110 amino acids in length. It is found in association with pfam00561. Many members are annotated as being lysophospholipases, and others as alpha-beta hydrolase fold-containing proteins. Length = 79 Score = 26.0 bits (58), Expect = 8.0 Identities = 15/66 (22%), Positives = 27/66 (40%), Gaps = 8/66 (12%) Query: 16 RYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQ-LFYLFQQRGFVSLRFNFRGIGRSEGE 74 R+ P+ P A + +++H G Y L +G+ ++ RG GRS G+ Sbjct: 8 RWLPAGPPKAVV-VLVH---GGGEHSG---RYAELAEELAAQGYAVYAYDHRGHGRSPGK 60 Query: 75 FDYGDG 80 + Sbjct: 61 RGHVPS 66 >gnl|CDD|173608 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional. Length = 593 Score = 25.9 bits (57), Expect = 8.2 Identities = 15/73 (20%), Positives = 29/73 (39%), Gaps = 10/73 (13%) Query: 112 WIS--MQLLMRRPEINGFISVAPQPKSYDFS----FLAPCPSSGLIINGSN----DTVAT 161 WI ++ L+++ E + + P PK + + A GL+ N T Sbjct: 432 WIESKIRFLIKKLETLNNLKIRPYPKFFKYQDDGWDYASSFFIGLVFFSKNVYNNSTFDL 491 Query: 162 TSDVKDLVNKLMN 174 ++D V+ + N Sbjct: 492 RYAIRDFVDIINN 504 >gnl|CDD|150036 pfam09220, LA-virus_coat, L-A virus, major coat protein. Members of this family form the major coat protein of the Saccharomyces cerevisiae L-A virus. Length = 436 Score = 25.8 bits (56), Expect = 8.6 Identities = 25/114 (21%), Positives = 44/114 (38%), Gaps = 13/114 (11%) Query: 64 NFRGIGRSEGEFDY---GDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR 120 N G G + + GD L A A DW ++ + + + ++ L +R Sbjct: 321 NIPKFGSIRGRYPFLLSGDAALIQATALEDW-SAIMAKPELVFTYAMQVSVALNTGLYLR 379 Query: 121 RPEINGFISVAP---------QPKSYDFSFLAPCPSSGLIINGSNDTVATTSDV 165 R + GF + QP+++ + LA C +NG +D T D+ Sbjct: 380 RVKKTGFGTTIDDSYEDGAFLQPETFVQAALACCTGQDAPLNGMSDVYVTYPDL 433 >gnl|CDD|140308 PTZ00285, PTZ00285, glucosamine-6-phosphate isomerase; Provisional. Length = 253 Score = 25.9 bits (57), Expect = 9.5 Identities = 8/23 (34%), Positives = 10/23 (43%), Gaps = 1/23 (4%) Query: 185 IPDAN-HFFIGKVDELINECAHY 206 I + N H G +L EC Y Sbjct: 99 IKEENRHILNGTAPDLEEECRRY 121 >gnl|CDD|178176 PLN02562, PLN02562, UDP-glycosyltransferase. Length = 448 Score = 25.6 bits (56), Expect = 9.5 Identities = 10/26 (38%), Positives = 14/26 (53%), Gaps = 4/26 (15%) Query: 89 LDWVQSLNPESKSCWIAGYSFGAWIS 114 L W+Q P S + SFG+W+S Sbjct: 264 LGWLQEQKPNS----VIYISFGSWVS 285 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.321 0.139 0.426 Gapped Lambda K H 0.267 0.0779 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 3,839,980 Number of extensions: 241372 Number of successful extensions: 484 Number of sequences better than 10.0: 1 Number of HSP's gapped: 484 Number of HSP's successfully gapped: 23 Length of query: 225 Length of database: 5,994,473 Length adjustment: 90 Effective length of query: 135 Effective length of database: 4,049,753 Effective search space: 546716655 Effective search space used: 546716655 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 55 (24.9 bits)