BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781214|ref|YP_003065627.1| hypothetical protein CLIBASIA_05605 [Candidatus Liberibacter asiaticus str. psy62] (110 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781214|ref|YP_003065627.1| hypothetical protein CLIBASIA_05605 [Candidatus Liberibacter asiaticus str. psy62] gi|254040891|gb|ACT57687.1| hypothetical protein CLIBASIA_05605 [Candidatus Liberibacter asiaticus str. psy62] gi|317120679|gb|ADV02502.1| hypothetical protein SC1_gp110 [Liberibacter phage SC1] gi|317120823|gb|ADV02644.1| hypothetical protein SC1_gp110 [Candidatus Liberibacter asiaticus] Length = 110 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 110/110 (100%), Positives = 110/110 (100%) Query: 1 MSFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGK 60 MSFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGK Sbjct: 1 MSFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGK 60 Query: 61 KEEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFTSSSLLSGVHVR 110 KEEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFTSSSLLSGVHVR Sbjct: 61 KEEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFTSSSLLSGVHVR 110 >gi|197294457|ref|YP_001798998.1| hypothetical protein PAa_0382 [Candidatus Phytoplasma australiense] gi|171853784|emb|CAM11717.1| Conserved hypothetical protein [Candidatus Phytoplasma australiense] Length = 1164 Score = 38.5 bits (88), Expect = 0.31, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 8/56 (14%) Query: 6 SSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAG--------LYSEWDTIGGFFG 53 +++ G S +G +G+ G G IG +GGAAG + S+WD + +FG Sbjct: 1108 AAINIGVSTKAGVMVGATLGSAAGPIGTFLGGAAGGVISAAVAVASKWDVVKTWFG 1163 >gi|159043458|ref|YP_001532252.1| binding-protein-dependent transport systems inner membrane component [Dinoroseobacter shibae DFL 12] gi|157911218|gb|ABV92651.1| binding-protein-dependent transport systems inner membrane component [Dinoroseobacter shibae DFL 12] Length = 244 Score = 38.1 bits (87), Expect = 0.40, Method: Composition-based stats. Identities = 15/29 (51%), Positives = 21/29 (72%) Query: 13 SLSSGFKLGSIFGPVGGVIGGLMGGAAGL 41 SL SGFKL +++ P+G VIG +G + GL Sbjct: 170 SLGSGFKLAAVYAPIGAVIGEWVGSSQGL 198 >gi|162452218|ref|YP_001614585.1| hypothetical protein sce3945 [Sorangium cellulosum 'So ce 56'] gi|161162800|emb|CAN94105.1| putative membrane protein [Sorangium cellulosum 'So ce 56'] Length = 87 Score = 37.7 bits (86), Expect = 0.53, Method: Composition-based stats. Identities = 16/38 (42%), Positives = 24/38 (63%) Query: 17 GFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGS 54 GF + ++ G G ++GG +G A GLY + + GGFF S Sbjct: 28 GFVVTALLGVAGSLLGGFIGRALGLYPSYQSSGGFFTS 65 >gi|119382790|ref|YP_913846.1| binding-protein-dependent transport systems inner membrane component [Paracoccus denitrificans PD1222] gi|119372557|gb|ABL68150.1| binding-protein-dependent transport systems inner membrane component [Paracoccus denitrificans PD1222] Length = 243 Score = 37.7 bits (86), Expect = 0.55, Method: Composition-based stats. Identities = 14/29 (48%), Positives = 22/29 (75%) Query: 13 SLSSGFKLGSIFGPVGGVIGGLMGGAAGL 41 SL++G +L +++ P+G VIG +GGA GL Sbjct: 168 SLATGLRLAAVYAPIGAVIGEWVGGARGL 196 >gi|293382313|ref|ZP_06628252.1| putative M23 peptidase domain protein [Enterococcus faecalis R712] gi|293389425|ref|ZP_06633883.1| putative M23 peptidase domain protein [Enterococcus faecalis S613] gi|312907445|ref|ZP_07766436.1| peptidase, M23 family [Enterococcus faecalis DAPTO 512] gi|312910063|ref|ZP_07768910.1| peptidase, M23 family [Enterococcus faecalis DAPTO 516] gi|291080258|gb|EFE17622.1| putative M23 peptidase domain protein [Enterococcus faecalis R712] gi|291081312|gb|EFE18275.1| putative M23 peptidase domain protein [Enterococcus faecalis S613] gi|310626473|gb|EFQ09756.1| peptidase, M23 family [Enterococcus faecalis DAPTO 512] gi|311289336|gb|EFQ67892.1| peptidase, M23 family [Enterococcus faecalis DAPTO 516] Length = 1722 Score = 37.7 bits (86), Expect = 0.57, Method: Composition-based stats. Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 9/51 (17%) Query: 16 SGFKLGSI----FGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKKE 62 +G KLG++ FGP+G VIGG++G AAG + G G KE E Sbjct: 671 AGAKLGAVAGTTFGPIGTVIGGILGTAAG-----SSFGSVLGKKLKENWPE 716 >gi|317407404|gb|EFV87367.1| hypothetical protein HMPREF0005_05331 [Achromobacter xylosoxidans C54] Length = 1092 Score = 37.3 bits (85), Expect = 0.65, Method: Composition-based stats. Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 3/49 (6%) Query: 15 SSGFKLGSIFGPVGGVIGGLMGGAAG-LYSEWDTIGGFFGSSQKEGKKE 62 S+G GS FGP+G VIG +G A G + SE ++ G SQ EG+ E Sbjct: 301 SAGVLAGSFFGPLGVVIGSQVGAAIGTVLSEQESTGT--SRSQAEGRSE 347 >gi|323969108|gb|EGB64413.1| hypothetical protein ERHG_04812 [Escherichia coli TA007] Length = 571 Score = 36.9 bits (84), Expect = 0.84, Method: Composition-based stats. Identities = 30/77 (38%), Positives = 38/77 (49%), Gaps = 9/77 (11%) Query: 3 FLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAG-LYSEWDTIGGFFGSSQKEGKK 61 F ++K GS SG +GSIF VIG L+G A G L T+ G F S + +K Sbjct: 432 FSSMALKVGSYAMSGSAIGSIFP----VIGSLVGSALGALVGIIMTVAGIFTSKASKIRK 487 Query: 62 EEEGVRPLEGDELAEVR 78 + VR D L EVR Sbjct: 488 AQGKVR----DRLEEVR 500 >gi|77458888|ref|YP_348394.1| hypothetical protein Pfl01_2663 [Pseudomonas fluorescens Pf0-1] gi|77382891|gb|ABA74404.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1] Length = 543 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 5/65 (7%) Query: 2 SFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKK 61 SFL + G SGF +G+ F +G ++G ++GG G+++ G F G K+ + Sbjct: 410 SFL---LSVGGYALSGFSIGTFFPVIGNIVGAIVGGVIGVFAA--AFGYFLGGRGKKIAE 464 Query: 62 EEEGV 66 +E V Sbjct: 465 AQEQV 469 >gi|271965069|ref|YP_003339265.1| hypothetical protein Sros_3588 [Streptosporangium roseum DSM 43021] gi|270508244|gb|ACZ86522.1| hypothetical protein Sros_3588 [Streptosporangium roseum DSM 43021] Length = 610 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 38/90 (42%), Gaps = 23/90 (25%) Query: 26 PVGGVIGGLMGGAAGLYSEWDTI-GGFFGSSQKEGKKEEEGVRPLEGDELAEVRRQESLR 84 P GG G L GG G D + GG G + G+ EE + PL+ +RRQ++LR Sbjct: 29 PPGGFRGALPGGEPG-----DGLTGGEPGGAPAAGESGEEALAPLQRPAALRLRRQQALR 83 Query: 85 AY-----------------EMNRIPIPARR 97 + E R PI ARR Sbjct: 84 QWPQQGQRGVGRGHRRDPIETGRDPIEARR 113 >gi|311105269|ref|YP_003978122.1| tape measure domain-containing protein [Achromobacter xylosoxidans A8] gi|310759958|gb|ADP15407.1| tape measure domain protein [Achromobacter xylosoxidans A8] Length = 782 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 2/46 (4%) Query: 26 PVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKKEEEGVRPLEG 71 P+G +IG ++ G ++ WD+I FFG + E KK +G L+G Sbjct: 590 PLGRIIGLVLAGLLLIWQNWDSISKFFGETWDEFKK--KGKEALDG 633 >gi|156100839|ref|XP_001616113.1| hypothetical protein [Plasmodium vivax SaI-1] gi|148804987|gb|EDL46386.1| hypothetical protein, conserved [Plasmodium vivax] Length = 5707 Score = 35.4 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 8/55 (14%) Query: 10 FGSSLSSGFKLGSIFGPV----GGVIGGLMGGAAGLY-SEWDTIGGFFGSSQKEG 59 FGS+ SG G +FG GG+ GG GL+ S + GG FGS+Q+ G Sbjct: 2487 FGSAQQSG---GGLFGGAQQSGGGLFGGAQQSGGGLFGSAQQSGGGLFGSAQQSG 2538 >gi|78186455|ref|YP_374498.1| hypothetical protein Plut_0572 [Chlorobium luteolum DSM 273] gi|78166357|gb|ABB23455.1| conserved hypothetical protein [Chlorobium luteolum DSM 273] Length = 163 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 18/44 (40%), Positives = 26/44 (59%) Query: 12 SSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSS 55 S+++ G G+ FG +GG++ GLM G GL+ T GGF S Sbjct: 43 SAVTLGRNAGTGFGFLGGMLCGLMTGDMGLHMLAGTTGGFMAGS 86 >gi|117920477|ref|YP_869669.1| hypothetical protein Shewana3_2032 [Shewanella sp. ANA-3] gi|117612809|gb|ABK48263.1| conserved hypothetical protein [Shewanella sp. ANA-3] Length = 542 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 20/109 (18%) Query: 7 SVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAA-----GLYSEWDTIGGFFGSSQKEGKK 61 + + GS SGF +GS F PV G I G +GGA GL+ G F S +K + Sbjct: 409 AFQVGSYALSGFTIGSAF-PVIGNIAGAIGGAVLGIVMGLF-------GLFTSKEKRIRN 460 Query: 62 EEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFT--SSSLLSGVH 108 + VR E E E L E N I AR + S LS V+ Sbjct: 461 AKSEVR-----EKIEAVCDEKLATIESNVAKIVARVHSEVESKALSAVN 504 >gi|317500213|ref|ZP_07958444.1| hypothetical protein HMPREF1026_00386 [Lachnospiraceae bacterium 8_1_57FAA] gi|331087559|ref|ZP_08336490.1| hypothetical protein HMPREF1025_00073 [Lachnospiraceae bacterium 3_1_46FAA] gi|316898384|gb|EFV20424.1| hypothetical protein HMPREF1026_00386 [Lachnospiraceae bacterium 8_1_57FAA] gi|330400699|gb|EGG80303.1| hypothetical protein HMPREF1025_00073 [Lachnospiraceae bacterium 3_1_46FAA] Length = 561 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 9/73 (12%) Query: 1 MSFLDSSVKFGSSLSSGFK-----LGSIFGPVGGVIGGLMGGAAG----LYSEWDTIGGF 51 +S + +SV FG +G K LG +G GG +G ++G AAG L+ W GG Sbjct: 178 VSIVGASVLFGMGSRAGEKAGEELLGPAYGAAGGTLGTVIGAAAGLLFLLFVLWIFKGGI 237 Query: 52 FGSSQKEGKKEEE 64 Q++ K +E Sbjct: 238 RRQIQRDHTKGKE 250 >gi|153816464|ref|ZP_01969132.1| hypothetical protein RUMTOR_02717 [Ruminococcus torques ATCC 27756] gi|145846206|gb|EDK23124.1| hypothetical protein RUMTOR_02717 [Ruminococcus torques ATCC 27756] Length = 575 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 9/73 (12%) Query: 1 MSFLDSSVKFGSSLSSGFK-----LGSIFGPVGGVIGGLMGGAAG----LYSEWDTIGGF 51 +S + +SV FG +G K LG +G GG +G ++G AAG L+ W GG Sbjct: 192 VSIVGASVLFGMGSRAGEKAGEELLGPAYGAAGGTLGTVIGAAAGLLFLLFVLWIFKGGI 251 Query: 52 FGSSQKEGKKEEE 64 Q++ K +E Sbjct: 252 RRQIQRDHTKGKE 264 >gi|330983522|gb|EGH81625.1| hypothetical protein PLA107_00730 [Pseudomonas syringae pv. lachrymans str. M301315] Length = 124 Score = 33.9 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 12/28 (42%), Positives = 21/28 (75%) Query: 16 SGFKLGSIFGPVGGVIGGLMGGAAGLYS 43 +G +G++ GP G +GGL+GGA G+++ Sbjct: 86 AGVAMGAVAGPAGMAVGGLVGGAIGVFA 113 >gi|302693961|ref|XP_003036659.1| hypothetical protein SCHCODRAFT_80348 [Schizophyllum commune H4-8] gi|300110356|gb|EFJ01757.1| hypothetical protein SCHCODRAFT_80348 [Schizophyllum commune H4-8] Length = 557 Score = 33.9 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 21/40 (52%) Query: 15 SSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGS 54 + G GS FG G GGL GG A + +T GG FGS Sbjct: 64 NQGASTGSAFGQTGATSGGLFGGGAANSNNTNTGGGLFGS 103 >gi|91976470|ref|YP_569129.1| gene transfer agent (GTA) orfg15 [Rhodopseudomonas palustris BisB5] gi|91682926|gb|ABE39228.1| gene transfer agent (GTA) orfg15 [Rhodopseudomonas palustris BisB5] Length = 1283 Score = 33.9 bits (76), Expect = 8.0, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 34/76 (44%), Gaps = 8/76 (10%) Query: 16 SGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKKEEEGVRPLEGDELA 75 +G +G++FGPVG + G + G AG + +GG G + EG R + D +A Sbjct: 9 AGGAVGALFGPVGAIAGRIAGALAGSMLDHALLGG--------GDRNVEGPRLADLDIMA 60 Query: 76 EVRRQESLRAYEMNRI 91 R Y R+ Sbjct: 61 STEGAPIPRVYGRARL 76 >gi|167588612|ref|ZP_02381000.1| hypothetical protein BuboB_24974 [Burkholderia ubonensis Bu] Length = 217 Score = 33.9 bits (76), Expect = 8.8, Method: Composition-based stats. Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 2/43 (4%) Query: 20 LGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGF-FGSSQKEGKK 61 LG + G +G V+ GL GG AG + E+D GF F EG + Sbjct: 50 LGRVLGGIGNVVAGLTGGGAG-FDEYDEGDGFEFADEADEGDE 91 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.315 0.137 0.390 Lambda K H 0.267 0.0423 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,000,104,939 Number of Sequences: 14124377 Number of extensions: 39259546 Number of successful extensions: 143221 Number of sequences better than 10.0: 138 Number of HSP's better than 10.0 without gapping: 74 Number of HSP's successfully gapped in prelim test: 64 Number of HSP's that attempted gapping in prelim test: 142807 Number of HSP's gapped (non-prelim): 374 length of query: 110 length of database: 4,842,793,630 effective HSP length: 78 effective length of query: 32 effective length of database: 3,741,092,224 effective search space: 119714951168 effective search space used: 119714951168 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.0 bits) S2: 76 (33.8 bits)