BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781083|ref|YP_003065496.1| hypothetical protein CLIBASIA_04925 [Candidatus Liberibacter asiaticus str. psy62] (160 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781083|ref|YP_003065496.1| hypothetical protein CLIBASIA_04925 [Candidatus Liberibacter asiaticus str. psy62] gi|254040760|gb|ACT57556.1| hypothetical protein CLIBASIA_04925 [Candidatus Liberibacter asiaticus str. psy62] Length = 160 Score = 332 bits (852), Expect = 8e-90, Method: Composition-based stats. Identities = 160/160 (100%), Positives = 160/160 (100%) Query: 1 MTYLSVTWNDLVQVSILVFCFVSHFVQRKSFADVHEALERYKKVARVVGPHIPNSNVFSS 60 MTYLSVTWNDLVQVSILVFCFVSHFVQRKSFADVHEALERYKKVARVVGPHIPNSNVFSS Sbjct: 1 MTYLSVTWNDLVQVSILVFCFVSHFVQRKSFADVHEALERYKKVARVVGPHIPNSNVFSS 60 Query: 61 SVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGP 120 SVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGP Sbjct: 61 SVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGP 120 Query: 121 SVIDVVYPHSHLVKEDKVDEKNSSSELKVDLNKLNAPGSP 160 SVIDVVYPHSHLVKEDKVDEKNSSSELKVDLNKLNAPGSP Sbjct: 121 SVIDVVYPHSHLVKEDKVDEKNSSSELKVDLNKLNAPGSP 160 >gi|296127974|ref|YP_003635224.1| 5-methyltetrahydropteroyltriglutamate/homocystei neS-methyltransferase [Cellulomonas flavigena DSM 20109] gi|296019789|gb|ADG73025.1| 5-methyltetrahydropteroyltriglutamate/homocystei neS-methyltransferase [Cellulomonas flavigena DSM 20109] Length = 783 Score = 39.3 bits (90), Expect = 0.16, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 11/88 (12%) Query: 67 WSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGPSVIDVV 126 +SE G+I+ A+D DV SIE L D+ A A YP+ VGP V D+ Sbjct: 673 YSEFGEIMEAIDGLDADVTSIEAARSKMEILGDIAA---------AGYPRAVGPGVYDIH 723 Query: 127 YPHSHLVKEDKVDEKNSSSELKVDLNKL 154 P + +D+V E + + +D+++L Sbjct: 724 SP--RVPSQDEVTELLTEAVRTIDVDQL 749 >gi|322371540|ref|ZP_08046086.1| DNA mismatch repair protein MutS [Haladaptatus paucihalophilus DX253] gi|320548831|gb|EFW90499.1| DNA mismatch repair protein MutS [Haladaptatus paucihalophilus DX253] Length = 895 Score = 36.6 bits (83), Expect = 1.0, Method: Composition-based stats. Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 4/104 (3%) Query: 45 ARVVGPHIPNSNVFSSSVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAHR 104 A V PN+ +RR Y E DII++ +++L+ +++ E +T +S L+ R Sbjct: 414 AEAVAEDPPNTVTQGGLLRRGYDEELDDIIDSHEENLEWFDTLADREKSRTGISHLQVDR 473 Query: 105 RSMSFYFAKYPKKVGPSVIDVVYPHSHLVKEDKVDEKNSSSELK 148 Y+ +VG S D V VK K E+ + EL+ Sbjct: 474 NKTDGYYL----QVGKSDTDKVPEEYEAVKTLKNSERYVTDELR 513 >gi|332672206|ref|YP_004455214.1| 5-methyltetrahydropteroyltriglutamate/homocysteine S-methyltransferase [Cellulomonas fimi ATCC 484] gi|332341244|gb|AEE47827.1| 5-methyltetrahydropteroyltriglutamate/homocysteine S-methyltransferase [Cellulomonas fimi ATCC 484] Length = 773 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 11/88 (12%) Query: 67 WSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGPSVIDVV 126 +SE G+II A+D DV SIE L D+ A YP+ VGP V D+ Sbjct: 663 YSEFGEIIGAIDGLDADVTSIEAARSKMEILGDIAG---------AGYPRAVGPGVYDIH 713 Query: 127 YPHSHLVKEDKVDEKNSSSELKVDLNKL 154 P + E +V+E + + + L++L Sbjct: 714 SP--RVPSEGEVEELLTEAVRSIALDQL 739 >gi|81427920|ref|YP_394919.1| putative drug resistance ABC transporter, two ATP-binding subunits [Lactobacillus sakei subsp. sakei 23K] gi|78609561|emb|CAI54607.1| Putative drug resistance ABC transporter, two ATP-binding subunits [Lactobacillus sakei subsp. sakei 23K] Length = 514 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 17/41 (41%), Positives = 23/41 (56%) Query: 74 INALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKY 114 I LDQ + + +FE+LKT +DL A MS Y+A Y Sbjct: 68 IGYLDQYAKLAPGLSIFEFLKTAFADLYAQEERMSQYYADY 108 >gi|328476062|gb|EGF46775.1| low specificity L-allo-threonine aldolase [Listeria monocytogenes 220] Length = 351 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 10/90 (11%) Query: 39 ERYKKVARV-VGPHIPNSNVFSSSVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRL 97 ER+ + V P +P SN+F Y+ + D I A+ +QD + + YL+ + Sbjct: 271 ERFNSCSGVKTVPEVPVSNMF-----HVYFEKSADEIGAILTKIQDETGVGISGYLQEKS 325 Query: 98 SDLEAHRRSMSFYFAKYPKKVGPSVIDVVY 127 +D+ A S+ FA+ P + ++DVV+ Sbjct: 326 ADVCAFEVSVGDAFAEIPVE----ILDVVF 351 >gi|226222940|ref|YP_002757047.1| low specificity L-allo-threonine aldolase [Listeria monocytogenes Clip81459] gi|254932539|ref|ZP_05265898.1| threonine aldolase [Listeria monocytogenes HPB2262] gi|225875402|emb|CAS04099.1| Putative low specificity L-allo-threonine aldolase [Listeria monocytogenes serotype 4b str. CLIP 80459] gi|293584096|gb|EFF96128.1| threonine aldolase [Listeria monocytogenes HPB2262] gi|332310730|gb|EGJ23825.1| Threonine aldolase family protein [Listeria monocytogenes str. Scott A] Length = 362 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 10/90 (11%) Query: 39 ERYKKVARV-VGPHIPNSNVFSSSVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRL 97 ER+ + V P +P SN+F Y+ + D I A+ +QD + + YL+ + Sbjct: 271 ERFNSCSGVKTVPEVPVSNMF-----HVYFEKSADEIGAILTKIQDETGVGISGYLQEKS 325 Query: 98 SDLEAHRRSMSFYFAKYPKKVGPSVIDVVY 127 +D+ A S+ FA+ P + ++DVV+ Sbjct: 326 ADVCAFEVSVGDAFAEIPVE----ILDVVF 351 >gi|195454310|ref|XP_002074183.1| GK12757 [Drosophila willistoni] gi|194170268|gb|EDW85169.1| GK12757 [Drosophila willistoni] Length = 293 Score = 35.1 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 10/104 (9%) Query: 35 HEALERYKKVARVVGPHIPNSNVFSSSVRRTYWSERGDIINALDQDLQDVESIEVFEYLK 94 +EALE+Y+ V+ I ++ +R + I+ A L+ ++ E+ EYLK Sbjct: 95 YEALEQYEDAEEVLNAIIAKDETNAAPRKR-----KIAILKACGNRLEAIK--ELNEYLK 147 Query: 95 TRLSDLEAHRRSMSFYFAKYPKKVGPSVIDVVY---PHSHLVKE 135 +SD EA S Y A+ ++ V PHSHL+ + Sbjct: 148 KFMSDQEAWHELCSLYLAEAEYGKAAFCMEEVLLHNPHSHLIHQ 191 >gi|254994494|ref|ZP_05276684.1| hypothetical protein LmonocytoFSL_17037 [Listeria monocytogenes FSL J2-064] Length = 253 Score = 34.7 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 6/80 (7%) Query: 39 ERYKKVARV-VGPHIPNSNVFSSSVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRL 97 ER+ + V P +P SN+F Y+ + D I A+ +QD + + YL+ + Sbjct: 164 ERFNSCSGVKTVPEVPVSNMF-----HVYFEKSADEIGAILTKIQDETGVGISGYLQEKS 218 Query: 98 SDLEAHRRSMSFYFAKYPKK 117 +D+ A S+ FA+ P K Sbjct: 219 ADVCAFEVSVGDAFAEIPAK 238 >gi|91216043|ref|ZP_01253012.1| hypothetical protein P700755_15951 [Psychroflexus torquis ATCC 700755] gi|91186020|gb|EAS72394.1| hypothetical protein P700755_15951 [Psychroflexus torquis ATCC 700755] Length = 516 Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 50/94 (53%), Gaps = 6/94 (6%) Query: 64 RTYWSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAH-RRSMSFYFAKYPKKVGPSV 122 +TY+ + D + L DL ++++ ++ E + L L+ + ++ FY ++ + P + Sbjct: 51 KTYYQQEADFASQLLTDLAEIDNSQLSETEQISLEMLKFKLKETVDFY--EFEAYLNPLL 108 Query: 123 IDVVYPHSHLVKEDKVDEKNSSSELKVDLNKLNA 156 D + HL +V + N+ ++KV LNKLNA Sbjct: 109 SDSGF---HLSLSYEVRDLNTYEQVKVYLNKLNA 139 >gi|224436996|ref|ZP_03657977.1| hypothetical protein HcinC1_03440 [Helicobacter cinaedi CCUG 18818] gi|313143469|ref|ZP_07805662.1| conserved hypothetical protein [Helicobacter cinaedi CCUG 18818] gi|313128500|gb|EFR46117.1| conserved hypothetical protein [Helicobacter cinaedi CCUG 18818] Length = 340 Score = 34.3 bits (77), Expect = 5.5, Method: Composition-based stats. Identities = 18/59 (30%), Positives = 31/59 (52%), Gaps = 2/59 (3%) Query: 92 YLKTRLSDLEAHRRSMSFYFAKYPKKVGPSVIDVVYPHSHLVKEDKVDEKNSSSELKVD 150 YLK RL +E+ R + F Y + V+D+ HL K++K + +N +++K D Sbjct: 118 YLKPRLDSMESFNRKIDSLFETYKDVMSGKVVDL--KKYHLSKDNKFNLQNLKNKIKAD 174 >gi|255025192|ref|ZP_05297178.1| low specificity L-allo-threonine aldolase [Listeria monocytogenes FSL J2-003] Length = 362 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 5/69 (7%) Query: 50 PHIPNSNVFSSSVRRTYWSERGDIINALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSF 109 P +P SN+F Y+ + D + A+ +QD + + YL+ + +D+ A S+ Sbjct: 283 PEVPVSNMF-----HVYFEKSADEVGAILTKIQDETRVGISGYLQEKSADVCAFEVSVGD 337 Query: 110 YFAKYPKKV 118 FAK P ++ Sbjct: 338 AFAKIPVEI 346 >gi|70947800|ref|XP_743481.1| hypothetical protein [Plasmodium chabaudi chabaudi] gi|56522998|emb|CAH75639.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi] Length = 1076 Score = 33.5 bits (75), Expect = 8.9, Method: Composition-based stats. Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 12/84 (14%) Query: 74 INALDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKYPKKV--GPSVIDVVYPHSH 131 IN L DL +SI++ +YL + L + MSFYFA K P ++D++Y Sbjct: 41 INRLMNDLPLHKSIKIMKYLSIKKMALCININHMSFYFAAEICKSFNSPEIVDILY---- 96 Query: 132 LVKEDKVDEKNSSSELK-VDLNKL 154 ++EKN+ LK D+ K+ Sbjct: 97 -----YLNEKNTEILLKECDIKKI 115 >gi|325189599|emb|CCA24084.1| conserved hypothetical protein [Albugo laibachii Nc14] Length = 616 Score = 33.5 bits (75), Expect = 9.5, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 35/64 (54%), Gaps = 2/64 (3%) Query: 77 LDQDLQDVESIEVFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGPSVIDVVYPHSHLVKED 136 L Q+L VES + +++ ++ E R SM YF Y + +V+++ + H H+V+ Sbjct: 74 LVQNLLTVESWK--QFILPKIESKENERASMRLYFIMYHEATLSNVLEIAFHHQHIVESF 131 Query: 137 KVDE 140 + +E Sbjct: 132 QDEE 135 >gi|302663572|ref|XP_003023427.1| hypothetical protein TRV_02437 [Trichophyton verrucosum HKI 0517] gi|291187423|gb|EFE42809.1| hypothetical protein TRV_02437 [Trichophyton verrucosum HKI 0517] Length = 1628 Score = 33.5 bits (75), Expect = 9.5, Method: Composition-based stats. Identities = 25/84 (29%), Positives = 46/84 (54%), Gaps = 5/84 (5%) Query: 77 LDQDLQDVESIE-VFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGPSVIDVVYPHSHLVKE 135 L+Q Q +E +E + L+ R+SD+E +S+ +Y +K+G ++ +SHL + Sbjct: 1036 LEQAAQIMERLEGEKKELQERISDMEKQLKSVESTHGEYDEKIGALNQEINTLNSHLAMK 1095 Query: 136 DK----VDEKNSSSELKVDLNKLN 155 DK ++ K SS+ ++DL N Sbjct: 1096 DKKLQDLEAKLLSSDQQLDLELAN 1119 >gi|302505838|ref|XP_003014876.1| hypothetical protein ARB_06633 [Arthroderma benhamiae CBS 112371] gi|291178447|gb|EFE34236.1| hypothetical protein ARB_06633 [Arthroderma benhamiae CBS 112371] Length = 2406 Score = 33.5 bits (75), Expect = 9.5, Method: Composition-based stats. Identities = 25/84 (29%), Positives = 46/84 (54%), Gaps = 5/84 (5%) Query: 77 LDQDLQDVESIE-VFEYLKTRLSDLEAHRRSMSFYFAKYPKKVGPSVIDVVYPHSHLVKE 135 L+Q Q +E +E + L+ R+SD+E +S+ +Y +K+G ++ +SHL + Sbjct: 1036 LEQAAQIMERLEGEKKELQERISDMEKQLKSVESTHGEYDEKIGALNQEINTLNSHLAMK 1095 Query: 136 DK----VDEKNSSSELKVDLNKLN 155 DK ++ K SS+ ++DL N Sbjct: 1096 DKKLQDLEAKLLSSDQQLDLELAN 1119 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.318 0.133 0.386 Lambda K H 0.267 0.0409 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,435,630,839 Number of Sequences: 14124377 Number of extensions: 52521164 Number of successful extensions: 141070 Number of sequences better than 10.0: 28 Number of HSP's better than 10.0 without gapping: 2 Number of HSP's successfully gapped in prelim test: 26 Number of HSP's that attempted gapping in prelim test: 141064 Number of HSP's gapped (non-prelim): 28 length of query: 160 length of database: 4,842,793,630 effective HSP length: 122 effective length of query: 38 effective length of database: 3,119,619,636 effective search space: 118545546168 effective search space used: 118545546168 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 75 (33.5 bits)