BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781136|ref|YP_003065549.1| hypothetical protein CLIBASIA_05190 [Candidatus Liberibacter asiaticus str. psy62] (114 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781136|ref|YP_003065549.1| hypothetical protein CLIBASIA_05190 [Candidatus Liberibacter asiaticus str. psy62] gi|254040813|gb|ACT57609.1| hypothetical protein CLIBASIA_05190 [Candidatus Liberibacter asiaticus str. psy62] Length = 114 Score = 229 bits (585), Expect = 7e-59, Method: Composition-based stats. Identities = 114/114 (100%), Positives = 114/114 (100%) Query: 1 MSQFLRNQYFDSNNLSGEYISSKDQNFLYFPTEKQQLQENAFSILHNFWPTTKGPIMRGG 60 MSQFLRNQYFDSNNLSGEYISSKDQNFLYFPTEKQQLQENAFSILHNFWPTTKGPIMRGG Sbjct: 1 MSQFLRNQYFDSNNLSGEYISSKDQNFLYFPTEKQQLQENAFSILHNFWPTTKGPIMRGG 60 Query: 61 CRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYDVTDASSMQSATVCVCRE 114 CRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYDVTDASSMQSATVCVCRE Sbjct: 61 CRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYDVTDASSMQSATVCVCRE 114 >gi|315122528|ref|YP_004063017.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495930|gb|ADR52529.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 532 Score = 143 bits (361), Expect = 7e-33, Method: Composition-based stats. Identities = 64/114 (56%), Positives = 84/114 (73%), Gaps = 3/114 (2%) Query: 1 MSQFLRNQYFDSNNLSGE---YISSKDQNFLYFPTEKQQLQENAFSILHNFWPTTKGPIM 57 MS ++N+Y + N+ E + S + NFL+FP QQL++N +L NFWPT+ GP + Sbjct: 1 MSYLMKNRYHNQKNIGTEQSSFFSREQHNFLHFPVGTQQLEDNISCVLRNFWPTSNGPSI 60 Query: 58 RGGCRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYDVTDASSMQSATVCV 111 RGGCR+ICTL+ S+IISAFSY SS+QQR+FLANHREIYDVT+AS MQ A C+ Sbjct: 61 RGGCRRICTLNHRSNIISAFSYCSSSQQRIFLANHREIYDVTNASLMQPAIRCL 114 >gi|227822437|ref|YP_002826409.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] gi|227341438|gb|ACP25656.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] Length = 536 Score = 44.3 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 20/56 (35%), Positives = 32/56 (57%) Query: 43 SILHNFWPTTKGPIMRGGCRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYDV 98 ++L NF PT G +RGG RK+ +I SAF Y+ +++F+A IY++ Sbjct: 51 TVLTNFLPTLAGCKIRGGSRKVGLAADGGAIRSAFKYKFGNNEKLFMATATAIYNM 106 >gi|150397022|ref|YP_001327489.1| hypothetical protein Smed_1819 [Sinorhizobium medicae WSM419] gi|150028537|gb|ABR60654.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 538 Score = 42.4 bits (98), Expect = 0.022, Method: Composition-based stats. Identities = 19/61 (31%), Positives = 35/61 (57%) Query: 38 QENAFSILHNFWPTTKGPIMRGGCRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYD 97 Q + ++L NF+PT G +RGG ++ I SAF Y+ + +++F+A + IY+ Sbjct: 48 QPGSATVLRNFFPTLMGCKIRGGSQRKGLAADGGDIRSAFKYKYGSNEKLFMATNAGIYN 107 Query: 98 V 98 + Sbjct: 108 M 108 >gi|316933870|ref|YP_004108852.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] gi|315601584|gb|ADU44119.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] Length = 558 Score = 37.8 bits (86), Expect = 0.52, Method: Composition-based stats. Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 3/66 (4%) Query: 39 ENAFSILHNFWPTTKGPIMRGGCRKICTL--DSNSSIISAFSYRSSTQQRVFLANHREIY 96 + AF +L NF+P G MR G + D + ++S FSY + ++F A +IY Sbjct: 47 QGAF-VLDNFFPEATGLRMRRGSESYAQVGADGSQPVLSLFSYINGANAKLFAATATDIY 105 Query: 97 DVTDAS 102 DV+ + Sbjct: 106 DVSSPA 111 >gi|13470677|ref|NP_102246.1| hypothetical protein mll0452 [Mesorhizobium loti MAFF303099] gi|14021419|dbj|BAB48032.1| mll0452 [Mesorhizobium loti MAFF303099] Length = 528 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 19/60 (31%), Positives = 33/60 (55%), Gaps = 1/60 (1%) Query: 44 ILHNFWPTTKGPIMRGGCRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYDVTDASS 103 +L N++PT+ G +RGG + T+ + ++S SY +F A+ I+DVT +S Sbjct: 46 VLENWFPTSTGIRLRGGAARHATIGT-IPVVSMMSYDGPAGSFMFAADGTNIFDVTTPAS 104 >gi|110632595|ref|YP_672803.1| hypothetical protein Meso_0234 [Mesorhizobium sp. BNC1] gi|110283579|gb|ABG61638.1| hypothetical protein Meso_0234 [Chelativorans sp. BNC1] Length = 632 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 2/56 (3%) Query: 44 ILHNFWPTTKGPIMRGGCRKICTLDSNSSIISAFSYRSSTQQRVFLANHREIYDVT 99 +L NF+PT G I+R G +K L + + S F Y + +F + IYDVT Sbjct: 47 VLENFFPTATGAILRRGRQKHAELP--TEVRSLFKYVVGNNRHLFASTVTTIYDVT 100 >gi|257092814|ref|YP_003166455.1| NTPase (NACHT family)-like protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257045338|gb|ACV34526.1| NTPase (NACHT family)-like protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 1416 Score = 34.3 bits (77), Expect = 6.0, Method: Composition-based stats. Identities = 18/41 (43%), Positives = 23/41 (56%) Query: 50 PTTKGPIMRGGCRKICTLDSNSSIISAFSYRSSTQQRVFLA 90 P TK I R GCR + S S + +AF R ST QR+ +A Sbjct: 258 PATKQEIFRQGCRLLADELSQSRLQAAFRGRLSTDQRLAIA 298 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.320 0.130 0.386 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,080,231,017 Number of Sequences: 14124377 Number of extensions: 41049744 Number of successful extensions: 75322 Number of sequences better than 10.0: 8 Number of HSP's better than 10.0 without gapping: 5 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 75316 Number of HSP's gapped (non-prelim): 8 length of query: 114 length of database: 4,842,793,630 effective HSP length: 81 effective length of query: 33 effective length of database: 3,698,719,093 effective search space: 122057730069 effective search space used: 122057730069 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 76 (33.9 bits)