BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780699|ref|YP_003065112.1| hypothetical protein CLIBASIA_02930 [Candidatus Liberibacter asiaticus str. psy62] (97 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780699|ref|YP_003065112.1| hypothetical protein CLIBASIA_02930 [Candidatus Liberibacter asiaticus str. psy62] gi|254040376|gb|ACT57172.1| hypothetical protein CLIBASIA_02930 [Candidatus Liberibacter asiaticus str. psy62] Length = 97 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 97/97 (100%), Positives = 97/97 (100%) Query: 1 MLFDNQFIENILKEDREDNNIKSLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSR 60 MLFDNQFIENILKEDREDNNIKSLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSR Sbjct: 1 MLFDNQFIENILKEDREDNNIKSLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSR 60 Query: 61 AWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKNKKI 97 AWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKNKKI Sbjct: 61 AWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKNKKI 97 >gi|315121864|ref|YP_004062353.1| hypothetical protein CKC_00570 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495266|gb|ADR51865.1| hypothetical protein CKC_00570 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 101 Score = 116 bits (291), Expect = 9e-25, Method: Composition-based stats. Identities = 61/95 (64%), Positives = 81/95 (85%), Gaps = 1/95 (1%) Query: 1 MLFDNQFIENILKEDREDNNIKSLAQ-LQRIAIRTVLANRTKNIKSITKEFMEYWIAHSS 59 M FDNQF+ENILK+ R++N KSL+Q LQ IA+ +LANRTKNIKSITKEF+EYW+ ++S Sbjct: 1 MAFDNQFLENILKKSRKENLNKSLSQALQGIALHAILANRTKNIKSITKEFIEYWMTYNS 60 Query: 60 RAWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKN 94 RAWR+S+ R YLN+HI+S+ KK+P+ I +GNI+KN Sbjct: 61 RAWRSSQARAYLNIHISSRGKKVPIYISLGNIEKN 95 >gi|222148620|ref|YP_002549577.1| hypothetical protein Avi_2192 [Agrobacterium vitis S4] gi|221735606|gb|ACM36569.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 95 Score = 43.9 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 2/57 (3%) Query: 23 SLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQS 79 SL L A+R LA+ ++ TKE M + H +RAWR S+P ++LH+A S Sbjct: 31 SLNPLHEAAMR--LADGINKPRARTKELMSLLLCHGARAWRYSQPEANIHLHVAPHS 85 >gi|15888831|ref|NP_354512.1| hypothetical protein Atu1506 [Agrobacterium tumefaciens str. C58] gi|15156591|gb|AAK87297.1| hypothetical protein Atu1506 [Agrobacterium tumefaciens str. C58] Length = 86 Score = 43.2 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 30/45 (66%) Query: 44 KSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSIYI 88 K+ T++ + ++H +RAWR ++P ++LH+A +S + P+ I I Sbjct: 41 KAKTRDLVSLLLSHGARAWRANQPEARIHLHVAHRSGRAPIHIRI 85 >gi|222086065|ref|YP_002544597.1| hypothetical protein Arad_2493 [Agrobacterium radiobacter K84] gi|221723513|gb|ACM26669.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 91 Score = 42.0 bits (97), Expect = 0.025, Method: Composition-based stats. Identities = 18/51 (35%), Positives = 32/51 (62%), Gaps = 1/51 (1%) Query: 38 NRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSIYI 88 NR+++ +S TK+ + + H +RAWR S+P + LHI S + + PV + + Sbjct: 41 NRSRS-RSKTKDLIGILLCHGARAWRYSQPEANIRLHITSPNGQAPVQLRV 90 >gi|325292860|ref|YP_004278724.1| hypothetical protein AGROH133_06134 [Agrobacterium sp. H13-3] gi|325060713|gb|ADY64404.1| hypothetical protein AGROH133_06134 [Agrobacterium sp. H13-3] Length = 86 Score = 42.0 bits (97), Expect = 0.031, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 29/45 (64%) Query: 44 KSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSIYI 88 K+ T++ + ++H +RAWR ++P ++LH+ +S + PV I I Sbjct: 41 KAKTRDLVSLLLSHGARAWRANQPEANIHLHVGRRSGRAPVHIRI 85 >gi|15965163|ref|NP_385516.1| hypothetical protein SMc01068 [Sinorhizobium meliloti 1021] gi|307309176|ref|ZP_07588847.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] gi|307321506|ref|ZP_07600901.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15074343|emb|CAC45989.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021] gi|306892813|gb|EFN23604.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306900322|gb|EFN30938.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] Length = 89 Score = 40.9 bits (94), Expect = 0.065, Method: Composition-based stats. Identities = 14/43 (32%), Positives = 29/43 (67%) Query: 44 KSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86 K+ T++ + + H +RAWR+++PR ++LH+A+ + P+ I Sbjct: 44 KAKTRDLVGMLLTHGARAWRSTQPRAGIHLHVAAPGRSRPLRI 86 >gi|150396260|ref|YP_001326727.1| hypothetical protein Smed_1039 [Sinorhizobium medicae WSM419] gi|150027775|gb|ABR59892.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 89 Score = 39.3 bits (90), Expect = 0.18, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 6/75 (8%) Query: 15 DREDNNIK-SLAQLQRIAIRT--VLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYL 71 DR+ N + L L A+R + NR+K + T++ + + H +RAWR+++PR + Sbjct: 15 DRKANPARVRLHPLHEAAMRIADIGLNRSK---AKTRDLVGMLLTHGARAWRSTQPRADI 71 Query: 72 NLHIASQSKKIPVSI 86 +LH+ + + P+ I Sbjct: 72 HLHVTALGRSHPLRI 86 >gi|209549454|ref|YP_002281371.1| hypothetical protein Rleg2_1860 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|241204781|ref|YP_002975877.1| hypothetical protein Rleg_2060 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|209535210|gb|ACI55145.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|240858671|gb|ACS56338.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 91 Score = 39.3 bits (90), Expect = 0.20, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 2/66 (3%) Query: 23 SLAQLQRIAIR--TVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSK 80 SL L A+R + R K T++ + + H +RAWR S+P ++LH+ S Sbjct: 23 SLHPLHEAAMRLAEIGLQRPKAKSPKTRDLINLLLCHGARAWRYSQPEARIHLHVTSPDG 82 Query: 81 KIPVSI 86 PV + Sbjct: 83 SAPVQL 88 >gi|86357828|ref|YP_469720.1| hypothetical protein RHE_CH02212 [Rhizobium etli CFN 42] gi|86281930|gb|ABC90993.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 91 Score = 38.9 bits (89), Expect = 0.21, Method: Composition-based stats. Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 2/66 (3%) Query: 23 SLAQLQRIAIR--TVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSK 80 SL L A+R + R + + T++ + + H +RAWR S+P ++LH+ S Sbjct: 23 SLHPLHEAALRLAEIGLQRPRAKSAKTRDLINLLLCHGARAWRYSQPEARIHLHVTSPDG 82 Query: 81 KIPVSI 86 PV + Sbjct: 83 SAPVQL 88 >gi|116252288|ref|YP_768126.1| hypothetical protein RL2542 [Rhizobium leguminosarum bv. viciae 3841] gi|115256936|emb|CAK08030.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 91 Score = 38.5 bits (88), Expect = 0.32, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 2/66 (3%) Query: 23 SLAQLQRIAIR--TVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSK 80 SL L A+R + R K T++ + + H +RAWR S+P ++LH+ S Sbjct: 23 SLHPLHEAALRLAEIGLPRPKAKSPKTRDLINLLLCHGARAWRYSQPEARIHLHVTSPDG 82 Query: 81 KIPVSI 86 PV + Sbjct: 83 SAPVQL 88 >gi|218515844|ref|ZP_03512684.1| hypothetical protein Retl8_20236 [Rhizobium etli 8C-3] Length = 82 Score = 38.2 bits (87), Expect = 0.37, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 2/58 (3%) Query: 29 RIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86 R+A + R K+ K T++ + + H +RAWR S+P ++LH+ S PV + Sbjct: 24 RLAEMGLQRPRAKSAK--TRDLINLLLCHGARAWRYSQPEARIHLHVTSPDGSAPVQL 79 >gi|218462488|ref|ZP_03502579.1| hypothetical protein RetlK5_24823 [Rhizobium etli Kim 5] gi|218674959|ref|ZP_03524628.1| hypothetical protein RetlG_27770 [Rhizobium etli GR56] Length = 91 Score = 38.2 bits (87), Expect = 0.37, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 2/58 (3%) Query: 29 RIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86 R+A + R K+ K T++ + + H +RAWR S+P ++LH+ S PV + Sbjct: 33 RLAEMGLQRPRAKSAK--TRDLINLLLCHGARAWRYSQPEARIHLHVTSPDGSAPVQL 88 >gi|190891892|ref|YP_001978434.1| hypothetical protein RHECIAT_CH0002301 [Rhizobium etli CIAT 652] gi|190697171|gb|ACE91256.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 91 Score = 38.2 bits (87), Expect = 0.37, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 2/58 (3%) Query: 29 RIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86 R+A + R K+ K T++ + + H +RAWR S+P ++LH+ S PV + Sbjct: 33 RLAEMGLQRPRAKSAK--TRDLINLLLCHGARAWRYSQPEARIHLHVTSPDGSAPVQL 88 >gi|292615491|ref|XP_686042.3| PREDICTED: c-type mannose receptor 2-like isoform 1 [Danio rerio] Length = 368 Score = 35.5 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 26/40 (65%), Gaps = 2/40 (5%) Query: 58 SSRAWRTSKPRTYLNLHIAS--QSKKIPVSIYIGNIQKNK 95 S R WRT +PR+ N H A+ Q+ + SIY+ NI+KN+ Sbjct: 202 SFRYWRTGEPRSQTNNHRAAVEQTSQGQWSIYMYNIEKNR 241 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.319 0.131 0.367 Lambda K H 0.267 0.0402 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 779,133,230 Number of Sequences: 14124377 Number of extensions: 22359681 Number of successful extensions: 63904 Number of sequences better than 10.0: 16 Number of HSP's better than 10.0 without gapping: 14 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 63889 Number of HSP's gapped (non-prelim): 16 length of query: 97 length of database: 4,842,793,630 effective HSP length: 66 effective length of query: 31 effective length of database: 3,910,584,748 effective search space: 121228127188 effective search space used: 121228127188 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 75 (33.5 bits)