RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780395|ref|YP_003064808.1| organic solvent tolerance protein [Candidatus Liberibacter asiaticus str. psy62] (762 letters) >gnl|CDD|179645 PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional. Length = 778 Score = 57.6 bits (140), Expect = 1e-08 Identities = 44/160 (27%), Positives = 70/160 (43%), Gaps = 13/160 (8%) Query: 144 TASSAQIIGQ--RTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYL 201 A + GQ TI + G++T+C NS W V I +R+ + Sbjct: 149 KADLMKQRGQNRYTILENGSFTSCLPGD--NS----WSVVGSEIIHDREEEVAEIWNARF 202 Query: 202 EIFGNSIFYFPLIEIP--DETVVRKTGFLTPLFSSGEKQRFGVGIPYYLVISDNSDATFT 259 ++ G +FY P +++P D+ R++GFL P K F +PYY I+ N DAT T Sbjct: 203 KVGGVPVFYSPYLQLPIGDK---RRSGFLIPNAKYSSKNGFEFELPYYWNIAPNYDATIT 259 Query: 260 FSPHPKKGILGEMELRKYFHSGKHTLHAAYMYNNNVESGE 299 ++G E E R +G + Y+ ++ V E Sbjct: 260 PHYMSRRGWQWENEFRYLTQAGAGLMAGEYLPSDRVYEDE 299 Score = 28.4 bits (64), Expect = 7.9 Identities = 14/37 (37%), Positives = 17/37 (45%) Query: 535 IPNEDSHSLVLNSTSLFTQNRFSGFDRIEGGNRTNLG 571 I N DS L + LF +SG DRI N+ G Sbjct: 521 IYNYDSTLLQSDYYGLFRDRTYSGLDRIASANQVTTG 557 >gnl|CDD|179846 PRK04423, PRK04423, organic solvent tolerance protein; Provisional. Length = 798 Score = 57.6 bits (139), Expect = 1e-08 Identities = 65/295 (22%), Positives = 116/295 (39%), Gaps = 22/295 (7%) Query: 35 TSTPSKIKKNETNRHSE-LDISSDEIVLNSEGSTTTAV--GNVKIEYKGYHLSARDITFN 91 P+ K R DI D++ G++TT GNV ++ L A ++ + Sbjct: 42 DGAPAADPKAAEMRQQLPTDIEGDQLS----GTSTTPQYQGNVALKRGDQFLGADNLRMD 97 Query: 92 HKNHRIIASGNIKLIEPDKRQIHAEYLDITDDFTNGIIKNLTIKIPADETYLTASSAQII 151 + IA GN++ + R + A+ + D I N+ ++ + A S + Sbjct: 98 TETGNYIAEGNVRYQDTSIRMV-ADRAEGNQDTDTHKITNIQYQLVSRRGNGDAESVDLQ 156 Query: 152 GQRTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYLEIFGNSIFYF 211 GQ + TYT C + P W +++ ++ L+I + YF Sbjct: 157 GQVGQMHRSTYTTC------DPSQPIWRLRAPEIDVDNDEGFGTARNAVLQIGKVPVLYF 210 Query: 212 PLIEIPDETVVRKTGFLTPLFSSGEKQRFGVGIPYYLVISDNSDATFTFSPHPKKGILGE 271 P + P + R+TG L P F + F P YL ++ N DAT K+G + Sbjct: 211 PWFKFPIDDR-RQTGLLFPQFGLSGRNGFDYLQPIYLNLAPNYDATLLPRYMSKRGFMFG 269 Query: 272 MELRKYFHSGKHTLHAAYMYNNNVESGEERHQAMLASIAEFEINPIW----NLGW 322 E R + G+ + Y+ N+ + ++R + + +N W +L W Sbjct: 270 TEFRYLYDGGRGEVTGNYLPNDKLRD-KDRGRVFYSGY--HNVNSHWQARASLSW 321 >gnl|CDD|182134 PRK09897, PRK09897, hypothetical protein; Provisional. Length = 534 Score = 29.7 bits (67), Expect = 2.7 Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 3/39 (7%) Query: 297 SGEERHQAMLASIAEFEINPIWN--LGWHLKKQTSGQLS 333 S EE + MLA+IA EI PI+ L W L+KQ L Sbjct: 45 SDEENSKMMLANIASIEIPPIYCTYLEW-LQKQEDSHLQ 82 >gnl|CDD|169557 PRK08706, PRK08706, lipid A biosynthesis lauroyl acyltransferase; Provisional. Length = 289 Score = 29.4 bits (66), Expect = 3.6 Identities = 16/71 (22%), Positives = 29/71 (40%), Gaps = 11/71 (15%) Query: 460 FTPIANIRG-----DLHYLSFNRDLSSDTISNNPNFVASKMLTAGLDIRYPIVAV--TQK 512 + P ++ + HYL I P+F A +M L+ P++++ QK Sbjct: 80 YAPAGRLKSLVRYRNKHYLDDALAAGEKVIILYPHFTAFEMAVYALNQDVPLISMYSHQK 139 Query: 513 S----RHILEG 519 + IL+G Sbjct: 140 NKILDEQILKG 150 >gnl|CDD|115849 pfam07220, DUF1420, Protein of unknown function (DUF1420). This family consists of several hypothetical putative lipoproteins which seem to be found specifically in the bacterium Leptospira interrogans. Members of this family are typically around 670 resides in length and their function is unknown. Length = 672 Score = 29.1 bits (65), Expect = 4.9 Identities = 11/57 (19%), Positives = 20/57 (35%), Gaps = 3/57 (5%) Query: 319 NLGWHLKKQTSGQLSYNYYSDALSKRININ-QIYLTGTGE--KNSFDMRALHYHIQE 372 N+ + +KK + N + K IN + G E K F + ++ Sbjct: 93 NICFFIKKGKKNKDIINISKQNIDKSINFHIFFDKIGNDEKIKKPFFLFKEIGFFKD 149 >gnl|CDD|181879 PRK09464, pdhR, transcriptional regulator PdhR; Reviewed. Length = 254 Score = 28.8 bits (65), Expect = 4.9 Identities = 13/24 (54%), Positives = 16/24 (66%), Gaps = 3/24 (12%) Query: 513 SRHILEGIAQVYAA---TDEKYIK 533 +RH LEGIA YAA TDE + + Sbjct: 103 TRHALEGIAAYYAALRGTDEDFER 126 >gnl|CDD|177276 PHA00431, PHA00431, internal virion protein C. Length = 746 Score = 28.2 bits (63), Expect = 8.3 Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 2/55 (3%) Query: 287 AAYMYNNNVESGEERHQAMLASIAEFEINPIWNLGWHLKKQTSGQLSYNYYSDAL 341 A Y ++N S + + A+L S E+N + N L+ SG+ NY + L Sbjct: 179 ALYGAHDNFLSDQAQKGAILNS--RVELNGVLNDPDVLRSPESGEFFMNYIDNGL 231 >gnl|CDD|180882 PRK07205, PRK07205, hypothetical protein; Provisional. Length = 444 Score = 28.1 bits (63), Expect = 9.4 Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 14/73 (19%) Query: 464 ANIRGDLH-----YLSFNRDLSSDTISNNPNFVASKMLTAGLDIRYPIVAVTQKSRHILE 518 NI GD+ LSFN ++ TI+ + + +DIR P++A +K L Sbjct: 291 LNIFGDIEDEPSGKLSFN--IAGLTITKEKSEI-------RIDIRIPVLADKEKLVQQLS 341 Query: 519 GIAQVYAATDEKY 531 AQ Y T E++ Sbjct: 342 QKAQEYGLTYEEF 354 >gnl|CDD|151448 pfam11001, DUF2841, Protein of unknown function (DUF2841). This family of proteins with unknown function are all present in yeast. Length = 126 Score = 28.0 bits (63), Expect = 9.7 Identities = 11/25 (44%), Positives = 14/25 (56%) Query: 90 FNHKNHRIIASGNIKLIEPDKRQIH 114 N R+IA IKLIEP K+ + Sbjct: 20 LQQLNCRVIAKAWIKLIEPKKQAKY 44 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.317 0.134 0.392 Gapped Lambda K H 0.267 0.0594 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 12,399,098 Number of extensions: 803843 Number of successful extensions: 1332 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1331 Number of HSP's successfully gapped: 15 Length of query: 762 Length of database: 5,994,473 Length adjustment: 101 Effective length of query: 661 Effective length of database: 3,812,065 Effective search space: 2519774965 Effective search space used: 2519774965 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 61 (27.6 bits)