BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781207|ref|YP_003065620.1| hypothetical protein CLIBASIA_05570 [Candidatus Liberibacter asiaticus str. psy62] (80 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781207|ref|YP_003065620.1| hypothetical protein CLIBASIA_05570 [Candidatus Liberibacter asiaticus str. psy62] gi|254040884|gb|ACT57680.1| hypothetical protein CLIBASIA_05570 [Candidatus Liberibacter asiaticus str. psy62] Length = 80 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 80/80 (100%), Positives = 80/80 (100%) Query: 1 MGFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMRE 60 MGFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMRE Sbjct: 1 MGFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMRE 60 Query: 61 QEEEEKKLLKKGLKKRQKNY 80 QEEEEKKLLKKGLKKRQKNY Sbjct: 61 QEEEEKKLLKKGLKKRQKNY 80 >gi|317120672|gb|ADV02495.1| hypothetical protein SC1_gp060 [Liberibacter phage SC1] gi|317120816|gb|ADV02637.1| hypothetical protein SC1_gp060 [Candidatus Liberibacter asiaticus] Length = 1340 Score = 60.9 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 63/84 (75%), Positives = 66/84 (78%), Gaps = 13/84 (15%) Query: 1 MGFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMH---- 56 MGFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMH Sbjct: 1 MGFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMRE 60 Query: 57 ---------KMREQEEEEKKLLKK 71 + R +EE EK L+ K Sbjct: 61 QEEEEKKASEKRIKEETEKLLISK 84 >gi|254933575|ref|ZP_05266934.1| conserved hypothetical protein [Listeria monocytogenes HPB2262] gi|293585136|gb|EFF97168.1| conserved hypothetical protein [Listeria monocytogenes HPB2262] Length = 1599 Score = 40.1 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 4/70 (5%) Query: 1 MGFWNSITSIAATVGA----VVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMH 56 +G + +TS+AA GA V GT AA LA G + A + GAA++G G L H Sbjct: 473 LGLGSKLTSLAAGFGATTTAVEGTSLAAAGLAGSFGALPAVIGLAGAALIGVGIYALDKH 532 Query: 57 KMREQEEEEK 66 + +E +E+ Sbjct: 533 ISKIEESKER 542 >gi|328474314|gb|EGF45119.1| hypothetical protein VP10329_16445 [Vibrio parahaemolyticus 10329] Length = 336 Score = 39.4 bits (90), Expect = 0.15, Method: Composition-based stats. Identities = 17/38 (44%), Positives = 24/38 (63%) Query: 8 TSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAV 45 T+ + VG V G + +ATP+GWVGA V GV ++V Sbjct: 265 TAASTVVGTVSGIAISLFVVATPVGWVGALVLGVASSV 302 >gi|156975567|ref|YP_001446474.1| hypothetical protein VIBHAR_03299 [Vibrio harveyi ATCC BAA-1116] gi|156527161|gb|ABU72247.1| hypothetical protein VIBHAR_03299 [Vibrio harveyi ATCC BAA-1116] Length = 338 Score = 39.4 bits (90), Expect = 0.19, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 28/53 (52%) Query: 8 TSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMRE 60 T+++ VG G V + LATP+GW+GA GV + + + + + +E Sbjct: 267 TAVSTVVGLAGGAVISLVVLATPVGWIGALALGVTSGIASYASGKVVANVYKE 319 >gi|315302848|ref|ZP_07873606.1| gp15 [Listeria ivanovii FSL F6-596] gi|313628795|gb|EFR97170.1| gp15 [Listeria ivanovii FSL F6-596] Length = 844 Score = 38.2 bits (87), Expect = 0.41, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 29/56 (51%) Query: 11 AATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEEK 66 AT AV GT AA LA G + A + GAA++G G L H + +E EE+ Sbjct: 487 GATTTAVEGTSLAAAGLAGSFGALPAVIGLAGAALIGVGIYALDKHISKIEESEER 542 >gi|269966193|ref|ZP_06180282.1| hypothetical protein VMC_17120 [Vibrio alginolyticus 40B] gi|269829108|gb|EEZ83353.1| hypothetical protein VMC_17120 [Vibrio alginolyticus 40B] Length = 347 Score = 37.8 bits (86), Expect = 0.51, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 27/53 (50%) Query: 8 TSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMRE 60 T+ + VG G V + +ATP+GW+GA GV +A + + K +E Sbjct: 276 TAASTVVGLAGGAVISLVVMATPVGWIGAIALGVVSASASYASGKVVADKYKE 328 >gi|313635396|gb|EFS01657.1| gp15 [Listeria seeligeri FSL N1-067] Length = 1337 Score = 36.3 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 29/56 (51%) Query: 11 AATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEEK 66 AT AV GT AA LA G + A + GAA++G G L H + +E +E+ Sbjct: 487 GATTTAVEGTSLAAAGLAGSFGALPAVITVAGAALLGVGIYALDKHISKIEESKER 542 >gi|224500922|ref|ZP_03669229.1| hypothetical protein LmonFR_00125 [Listeria monocytogenes FSL R2-561] Length = 1599 Score = 36.3 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 29/56 (51%) Query: 11 AATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEEK 66 AT AV GT AA LA G + A + GAA++G G L H + +E +E+ Sbjct: 487 GATTTAVEGTSLAAAGLAGSFGALPAVITVAGAALLGVGIYALDKHISKIEESKER 542 >gi|317033872|ref|XP_001395607.2| efflux pump antibiotic resistance protein [Aspergillus niger CBS 513.88] Length = 571 Score = 35.9 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 19/33 (57%), Positives = 24/33 (72%) Query: 18 VGTVATAAALATPIGWVGAAVAGVGAAVVGAGA 50 VGT+ AAA ++P+ VG AVAG+GAA V GA Sbjct: 97 VGTILCAAANSSPMFIVGRAVAGLGAAGVLQGA 129 >gi|134080328|emb|CAK46250.1| unnamed protein product [Aspergillus niger] Length = 569 Score = 35.9 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 19/33 (57%), Positives = 24/33 (72%) Query: 18 VGTVATAAALATPIGWVGAAVAGVGAAVVGAGA 50 VGT+ AAA ++P+ VG AVAG+GAA V GA Sbjct: 97 VGTILCAAANSSPMFIVGRAVAGLGAAGVLQGA 129 >gi|308809207|ref|XP_003081913.1| CBS domain-containing protein / transporter associated domain-containing protein (ISS) [Ostreococcus tauri] gi|116060380|emb|CAL55716.1| CBS domain-containing protein / transporter associated domain-containing protein (ISS) [Ostreococcus tauri] Length = 520 Score = 35.5 bits (80), Expect = 2.2, Method: Composition-based stats. Identities = 28/73 (38%), Positives = 34/73 (46%), Gaps = 5/73 (6%) Query: 8 TSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAA---VVGAGASDLAMHKMREQ--E 62 T IA AV VATA +ATP+ W+ V VG +V AG S + E Sbjct: 133 TEIAPKSVAVQHAVATAKVIATPVYWLSLIVYPVGRIFQWIVNAGFSLFGVETSAEPFVS 192 Query: 63 EEEKKLLKKGLKK 75 EEE KL+ G K Sbjct: 193 EEELKLVLAGATK 205 >gi|242813635|ref|XP_002486206.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] gi|242813642|ref|XP_002486207.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] gi|242813648|ref|XP_002486208.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] gi|218714545|gb|EED13968.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] gi|218714546|gb|EED13969.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] gi|218714547|gb|EED13970.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] Length = 527 Score = 35.5 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 18/33 (54%), Positives = 23/33 (69%) Query: 18 VGTVATAAALATPIGWVGAAVAGVGAAVVGAGA 50 VGT+ AAA ++P+ VG A+AG GAA V GA Sbjct: 186 VGTILCAAATSSPMFIVGRAIAGFGAAGVLQGA 218 >gi|242813630|ref|XP_002486205.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] gi|218714544|gb|EED13967.1| efflux pump antibiotic resistance protein, putative [Talaromyces stipitatus ATCC 10500] Length = 642 Score = 35.5 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 18/33 (54%), Positives = 23/33 (69%) Query: 18 VGTVATAAALATPIGWVGAAVAGVGAAVVGAGA 50 VGT+ AAA ++P+ VG A+AG GAA V GA Sbjct: 186 VGTILCAAATSSPMFIVGRAIAGFGAAGVLQGA 218 >gi|103486523|ref|YP_616084.1| hypothetical protein Sala_1034 [Sphingopyxis alaskensis RB2256] gi|98976600|gb|ABF52751.1| protein of unknown function DUF1295 [Sphingopyxis alaskensis RB2256] Length = 269 Score = 35.1 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 4/67 (5%) Query: 16 AVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEEKKLLKKGLKK 75 A +G A+A IGWVGAA A G A G D +H R + ++L GL + Sbjct: 130 AQIGIWASAGEGVGIIGWVGAAAALTGIAFESIG--DAQLHAFRRNPANKGRVLDTGLWR 187 Query: 76 --RQKNY 80 R NY Sbjct: 188 YTRHPNY 194 >gi|260437285|ref|ZP_05791101.1| phosphoenolpyruvate-protein phosphotransferase [Butyrivibrio crossotus DSM 2876] gi|292810595|gb|EFF69800.1| phosphoenolpyruvate-protein phosphotransferase [Butyrivibrio crossotus DSM 2876] Length = 554 Score = 34.7 bits (78), Expect = 3.8, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 36/76 (47%), Gaps = 1/76 (1%) Query: 2 GFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGV-GAAVVGAGASDLAMHKMRE 60 G NS T+I A A+ V T L I V A V G G + L + K R+ Sbjct: 183 GSLNSHTAILARTMAIPALVNTPLPLDEEIDGVMAVVDGTKGVIYIDPDCETLELMKKRK 242 Query: 61 QEEEEKKLLKKGLKKR 76 EE+EK++L + LK + Sbjct: 243 AEEDEKRVLLQTLKGK 258 >gi|291518950|emb|CBK74171.1| hypothetical protein CIY_13670 [Butyrivibrio fibrisolvens 16/4] Length = 185 Score = 34.7 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 11/71 (15%) Query: 7 ITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMH-KMREQEEEE 65 ITS AT GAVVG A + G AVAGVG L+ H K ++ +E+ Sbjct: 75 ITSGLATAGAVVGGGMAAGIFVLAVPIAGLAVAGVG----------LSSHLKNKQLNQEK 124 Query: 66 KKLLKKGLKKR 76 ++L K+ L+K+ Sbjct: 125 ERLYKEALQKQ 135 >gi|261333538|emb|CBH16533.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] Length = 636 Score = 34.4 bits (77), Expect = 5.3, Method: Composition-based stats. Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 10/69 (14%) Query: 6 SITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEE 65 SI A AVV A A P+ VAG GA A+DLA+ QEEE+ Sbjct: 241 SIVPEALAFSAVVPHTANALQAQIPV------VAGTGAVGPAVSANDLAL----AQEEED 290 Query: 66 KKLLKKGLK 74 +K L+K +K Sbjct: 291 RKQLEKIIK 299 >gi|71748994|ref|XP_827836.1| hypothetical protein [Trypanosoma brucei TREU927] gi|70833220|gb|EAN78724.1| hypothetical protein, conserved [Trypanosoma brucei] Length = 535 Score = 34.4 bits (77), Expect = 5.3, Method: Composition-based stats. Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 10/69 (14%) Query: 6 SITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEE 65 SI A AVV A A P+ VAG GA A+DLA+ QEEE+ Sbjct: 140 SIVPEALAFSAVVPHTANALQAQIPV------VAGTGAVGPAVSANDLAL----AQEEED 189 Query: 66 KKLLKKGLK 74 +K L+K +K Sbjct: 190 RKQLEKIIK 198 >gi|134102689|ref|YP_001108350.1| sodium:solute symporter [Saccharopolyspora erythraea NRRL 2338] gi|291004625|ref|ZP_06562598.1| sodium:solute symporter [Saccharopolyspora erythraea NRRL 2338] gi|133915312|emb|CAM05425.1| sodium:solute symporter [Saccharopolyspora erythraea NRRL 2338] Length = 560 Score = 34.4 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 1/49 (2%) Query: 3 FWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGAS 51 FW +T A +G V+GT+A G V + G GA+ VGAGA+ Sbjct: 443 FWKRMTPTAGWLGLVLGTLAAVTVFGLSEGGV-LDLPGQGASFVGAGAA 490 >gi|326331219|ref|ZP_08197513.1| lipolytic enzyme, G-D-S-L [Nocardioidaceae bacterium Broad-1] gi|325950989|gb|EGD43035.1| lipolytic enzyme, G-D-S-L [Nocardioidaceae bacterium Broad-1] Length = 404 Score = 34.4 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 18/55 (32%), Positives = 27/55 (49%) Query: 4 WNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKM 58 W ++ A V + TV AA A P+G GA+ VG AG +DL++ + Sbjct: 4 WTRAFAVLAAVLTLASTVPAQAAPAHPVGTWGASADEVGGTTGAAGLADLSVRNL 58 >gi|197301512|ref|ZP_03166590.1| hypothetical protein RUMLAC_00243 [Ruminococcus lactaris ATCC 29176] gi|197299401|gb|EDY33923.1| hypothetical protein RUMLAC_00243 [Ruminococcus lactaris ATCC 29176] Length = 275 Score = 34.4 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 30/50 (60%), Gaps = 2/50 (4%) Query: 18 VGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEEKK 67 + +V +++A++ P+ G A+AG G A+ G+ ++ M +R Q+ E K Sbjct: 104 IASVGSSSAISIPVATEGIAIAGTGLAISGSALAN--MFDVRIQKSESNK 151 >gi|284028588|ref|YP_003378519.1| transport system permease protein [Kribbella flavida DSM 17836] gi|283807881|gb|ADB29720.1| transport system permease protein [Kribbella flavida DSM 17836] Length = 329 Score = 33.6 bits (75), Expect = 8.7, Method: Composition-based stats. Identities = 18/34 (52%), Positives = 21/34 (61%) Query: 15 GAVVGTVATAAALATPIGWVGAAVAGVGAAVVGA 48 A+V TA ALA PIG+VG AV A+VGA Sbjct: 237 AAIVLLAGTATALAGPIGFVGLAVPHAARALVGA 270 >gi|310816689|ref|YP_003964653.1| putative transcriptional regulator [Ketogulonicigenium vulgare Y25] gi|308755424|gb|ADO43353.1| putative transcriptional regulator [Ketogulonicigenium vulgare Y25] Length = 189 Score = 33.6 bits (75), Expect = 8.7, Method: Composition-based stats. Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 1/65 (1%) Query: 10 IAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMREQEEEEKKLL 69 +AA + V AAL T G + VG A A DLA H MRE+ EE +KL Sbjct: 15 LAARLPEGVDIQGALAALLTHYAGRGVELVRVGEAYAFRTAGDLA-HLMREEVEETRKLS 73 Query: 70 KKGLK 74 + G++ Sbjct: 74 RAGIE 78 >gi|229819861|ref|YP_002881387.1| hypothetical protein Bcav_1366 [Beutenbergia cavernae DSM 12333] gi|229565774|gb|ACQ79625.1| hypothetical protein Bcav_1366 [Beutenbergia cavernae DSM 12333] Length = 741 Score = 33.6 bits (75), Expect = 9.0, Method: Composition-based stats. Identities = 20/64 (31%), Positives = 29/64 (45%) Query: 1 MGFWNSITSIAATVGAVVGTVATAAALATPIGWVGAAVAGVGAAVVGAGASDLAMHKMRE 60 +G ++T V A+ + AALA +G V A V GV VGA A H + Sbjct: 191 LGVVAALTDGLGVVAALTDGLGVVAALAAGLGVVAAPVGGVARCPVGAPADRACAHGVLS 250 Query: 61 QEEE 64 +E + Sbjct: 251 EENQ 254 >gi|225869651|ref|YP_002745598.1| phage minor tail protein [Streptococcus equi subsp. equi 4047] gi|225699055|emb|CAW92182.1| putative phage minor tail protein [Streptococcus equi subsp. equi 4047] Length = 1086 Score = 33.6 bits (75), Expect = 9.2, Method: Composition-based stats. Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 19/68 (27%) Query: 27 LATPIGWVGAAVAGVGAAVVGAGA---------SDLAMHKMREQE---EEEKKL---LKK 71 L PIGWV VAGVG A+V AG SD +EQE E KKL +K+ Sbjct: 429 LTGPIGWV---VAGVG-ALVAAGVSLWSWLTRESDETKKLKKEQEGLVESNKKLRDSVKE 484 Query: 72 GLKKRQKN 79 G+++R+KN Sbjct: 485 GVQERKKN 492 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.313 0.126 0.355 Lambda K H 0.267 0.0398 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 601,017,376 Number of Sequences: 14124377 Number of extensions: 19514291 Number of successful extensions: 168764 Number of sequences better than 10.0: 204 Number of HSP's better than 10.0 without gapping: 91 Number of HSP's successfully gapped in prelim test: 113 Number of HSP's that attempted gapping in prelim test: 168328 Number of HSP's gapped (non-prelim): 455 length of query: 80 length of database: 4,842,793,630 effective HSP length: 51 effective length of query: 29 effective length of database: 4,122,450,403 effective search space: 119551061687 effective search space used: 119551061687 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.0 bits) S2: 75 (33.5 bits)