RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddB
21,608 sequences; 5,994,473 total letters
Searching..................................................done
Query= gi|254780395|ref|YP_003064808.1| organic solvent tolerance
protein [Candidatus Liberibacter asiaticus str. psy62]
(762 letters)
>gnl|CDD|179645 PRK03761, PRK03761, LPS assembly outer membrane complex protein
LptD; Provisional.
Length = 778
Score = 57.6 bits (140), Expect = 1e-08
Identities = 44/160 (27%), Positives = 70/160 (43%), Gaps = 13/160 (8%)
Query: 144 TASSAQIIGQ--RTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYL 201
A + GQ TI + G++T+C NS W V I +R+ +
Sbjct: 149 KADLMKQRGQNRYTILENGSFTSCLPGD--NS----WSVVGSEIIHDREEEVAEIWNARF 202
Query: 202 EIFGNSIFYFPLIEIP--DETVVRKTGFLTPLFSSGEKQRFGVGIPYYLVISDNSDATFT 259
++ G +FY P +++P D+ R++GFL P K F +PYY I+ N DAT T
Sbjct: 203 KVGGVPVFYSPYLQLPIGDK---RRSGFLIPNAKYSSKNGFEFELPYYWNIAPNYDATIT 259
Query: 260 FSPHPKKGILGEMELRKYFHSGKHTLHAAYMYNNNVESGE 299
++G E E R +G + Y+ ++ V E
Sbjct: 260 PHYMSRRGWQWENEFRYLTQAGAGLMAGEYLPSDRVYEDE 299
Score = 28.4 bits (64), Expect = 7.9
Identities = 14/37 (37%), Positives = 17/37 (45%)
Query: 535 IPNEDSHSLVLNSTSLFTQNRFSGFDRIEGGNRTNLG 571
I N DS L + LF +SG DRI N+ G
Sbjct: 521 IYNYDSTLLQSDYYGLFRDRTYSGLDRIASANQVTTG 557
>gnl|CDD|179846 PRK04423, PRK04423, organic solvent tolerance protein; Provisional.
Length = 798
Score = 57.6 bits (139), Expect = 1e-08
Identities = 65/295 (22%), Positives = 116/295 (39%), Gaps = 22/295 (7%)
Query: 35 TSTPSKIKKNETNRHSE-LDISSDEIVLNSEGSTTTAV--GNVKIEYKGYHLSARDITFN 91
P+ K R DI D++ G++TT GNV ++ L A ++ +
Sbjct: 42 DGAPAADPKAAEMRQQLPTDIEGDQLS----GTSTTPQYQGNVALKRGDQFLGADNLRMD 97
Query: 92 HKNHRIIASGNIKLIEPDKRQIHAEYLDITDDFTNGIIKNLTIKIPADETYLTASSAQII 151
+ IA GN++ + R + A+ + D I N+ ++ + A S +
Sbjct: 98 TETGNYIAEGNVRYQDTSIRMV-ADRAEGNQDTDTHKITNIQYQLVSRRGNGDAESVDLQ 156
Query: 152 GQRTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYLEIFGNSIFYF 211
GQ + TYT C + P W +++ ++ L+I + YF
Sbjct: 157 GQVGQMHRSTYTTC------DPSQPIWRLRAPEIDVDNDEGFGTARNAVLQIGKVPVLYF 210
Query: 212 PLIEIPDETVVRKTGFLTPLFSSGEKQRFGVGIPYYLVISDNSDATFTFSPHPKKGILGE 271
P + P + R+TG L P F + F P YL ++ N DAT K+G +
Sbjct: 211 PWFKFPIDDR-RQTGLLFPQFGLSGRNGFDYLQPIYLNLAPNYDATLLPRYMSKRGFMFG 269
Query: 272 MELRKYFHSGKHTLHAAYMYNNNVESGEERHQAMLASIAEFEINPIW----NLGW 322
E R + G+ + Y+ N+ + ++R + + +N W +L W
Sbjct: 270 TEFRYLYDGGRGEVTGNYLPNDKLRD-KDRGRVFYSGY--HNVNSHWQARASLSW 321
>gnl|CDD|182134 PRK09897, PRK09897, hypothetical protein; Provisional.
Length = 534
Score = 29.7 bits (67), Expect = 2.7
Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 297 SGEERHQAMLASIAEFEINPIWN--LGWHLKKQTSGQLS 333
S EE + MLA+IA EI PI+ L W L+KQ L
Sbjct: 45 SDEENSKMMLANIASIEIPPIYCTYLEW-LQKQEDSHLQ 82
>gnl|CDD|169557 PRK08706, PRK08706, lipid A biosynthesis lauroyl acyltransferase;
Provisional.
Length = 289
Score = 29.4 bits (66), Expect = 3.6
Identities = 16/71 (22%), Positives = 29/71 (40%), Gaps = 11/71 (15%)
Query: 460 FTPIANIRG-----DLHYLSFNRDLSSDTISNNPNFVASKMLTAGLDIRYPIVAV--TQK 512
+ P ++ + HYL I P+F A +M L+ P++++ QK
Sbjct: 80 YAPAGRLKSLVRYRNKHYLDDALAAGEKVIILYPHFTAFEMAVYALNQDVPLISMYSHQK 139
Query: 513 S----RHILEG 519
+ IL+G
Sbjct: 140 NKILDEQILKG 150
>gnl|CDD|115849 pfam07220, DUF1420, Protein of unknown function (DUF1420). This
family consists of several hypothetical putative
lipoproteins which seem to be found specifically in the
bacterium Leptospira interrogans. Members of this family
are typically around 670 residues in length and their
function is unknown.
Length = 672
Score = 29.1 bits (65), Expect = 4.9
Identities = 11/57 (19%), Positives = 20/57 (35%), Gaps = 3/57 (5%)
Query: 319 NLGWHLKKQTSGQLSYNYYSDALSKRININ-QIYLTGTGE--KNSFDMRALHYHIQE 372
N+ + +KK + N + K IN + G E K F + ++
Sbjct: 93 NICFFIKKGKKNKDIINISKQNIDKSINFHIFFDKIGNDEKIKKPFFLFKEIGFFKD 149
>gnl|CDD|181879 PRK09464, pdhR, transcriptional regulator PdhR; Reviewed.
Length = 254
Score = 28.8 bits (65), Expect = 4.9
Identities = 13/24 (54%), Positives = 16/24 (66%), Gaps = 3/24 (12%)
Query: 513 SRHILEGIAQVYAA---TDEKYIK 533
+RH LEGIA YAA TDE + +
Sbjct: 103 TRHALEGIAAYYAALRGTDEDFER 126
>gnl|CDD|177276 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 28.2 bits (63), Expect = 8.3
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 2/55 (3%)
Query: 287 AAYMYNNNVESGEERHQAMLASIAEFEINPIWNLGWHLKKQTSGQLSYNYYSDAL 341
A Y ++N S + + A+L S E+N + N L+ SG+ NY + L
Sbjct: 179 ALYGAHDNFLSDQAQKGAILNS--RVELNGVLNDPDVLRSPESGEFFMNYIDNGL 231
>gnl|CDD|180882 PRK07205, PRK07205, hypothetical protein; Provisional.
Length = 444
Score = 28.1 bits (63), Expect = 9.4
Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 14/73 (19%)
Query: 464 ANIRGDLH-----YLSFNRDLSSDTISNNPNFVASKMLTAGLDIRYPIVAVTQKSRHILE 518
NI GD+ LSFN ++ TI+ + + +DIR P++A +K L
Sbjct: 291 LNIFGDIEDEPSGKLSFN--IAGLTITKEKSEI-------RIDIRIPVLADKEKLVQQLS 341
Query: 519 GIAQVYAATDEKY 531
AQ Y T E++
Sbjct: 342 QKAQEYGLTYEEF 354
>gnl|CDD|151448 pfam11001, DUF2841, Protein of unknown function (DUF2841). This
family of proteins with unknown function are all present
in yeast.
Length = 126
Score = 28.0 bits (63), Expect = 9.7
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 90 FNHKNHRIIASGNIKLIEPDKRQIH 114
N R+IA IKLIEP K+ +
Sbjct: 20 LQQLNCRVIAKAWIKLIEPKKQAKY 44
Database: CddB
Posted date: Feb 4, 2011 9:54 PM
Number of letters in database: 5,994,473
Number of sequences in database: 21,608
Lambda K H
0.317 0.134 0.392
Gapped
Lambda K H
0.267 0.0594 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21608
Number of Hits to DB: 12,399,098
Number of extensions: 803843
Number of successful extensions: 1332
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1331
Number of HSP's successfully gapped: 15
Length of query: 762
Length of database: 5,994,473
Length adjustment: 101
Effective length of query: 661
Effective length of database: 3,812,065
Effective search space: 2519774965
Effective search space used: 2519774965
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.6 bits)