BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781146|ref|YP_003065559.1| hypothetical protein CLIBASIA_05265 [Candidatus Liberibacter asiaticus str. psy62] (122 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781146|ref|YP_003065559.1| hypothetical protein CLIBASIA_05265 [Candidatus Liberibacter asiaticus str. psy62] gi|254040823|gb|ACT57619.1| hypothetical protein CLIBASIA_05265 [Candidatus Liberibacter asiaticus str. psy62] Length = 122 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 122/122 (100%), Positives = 122/122 (100%) Query: 1 MSPLTLLPIRKKYPCINRLSHKEISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQYLRL 60 MSPLTLLPIRKKYPCINRLSHKEISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQYLRL Sbjct: 1 MSPLTLLPIRKKYPCINRLSHKEISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQYLRL 60 Query: 61 ISQILPREKIKQDGITNGDQLTDEQLCEIIRSLEKELQIFTDFKNKDAHSRETKETTKSA 120 ISQILPREKIKQDGITNGDQLTDEQLCEIIRSLEKELQIFTDFKNKDAHSRETKETTKSA Sbjct: 61 ISQILPREKIKQDGITNGDQLTDEQLCEIIRSLEKELQIFTDFKNKDAHSRETKETTKSA 120 Query: 121 SS 122 SS Sbjct: 121 SS 122 >gi|150397046|ref|YP_001327513.1| hypothetical protein Smed_1843 [Sinorhizobium medicae WSM419] gi|150028561|gb|ABR60678.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 126 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 29/76 (38%), Positives = 47/76 (61%), Gaps = 2/76 (2%) Query: 37 DFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEIIRSLEKE 96 DF+++G V+ +REEKPE YL+L+S ILP++ G D+L+DEQL E I++L+ Sbjct: 46 DFAQHGVGVLARIREEKPETYLKLVSSILPKDLSAATGC--ADELSDEQLIERIQALDAA 103 Query: 97 LQIFTDFKNKDAHSRE 112 ++ K + R+ Sbjct: 104 IRPLISGKKRAGGMRK 119 >gi|15965773|ref|NP_386126.1| hypothetical protein SMc04191 [Sinorhizobium meliloti 1021] gi|307311307|ref|ZP_07590950.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] gi|307318840|ref|ZP_07598272.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075042|emb|CAC46599.1| Hypothetical protein SMc04191 [Sinorhizobium meliloti 1021] gi|306895561|gb|EFN26315.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899608|gb|EFN30236.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] Length = 120 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 46/76 (60%), Gaps = 2/76 (2%) Query: 37 DFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEIIRSLEKE 96 DF+++G VI +REEKPE YL+L++ +LP++ G D L+DEQ+ + IR+L+ Sbjct: 40 DFARHGVNVIARIREEKPEAYLKLVASVLPKDLSAATG--GVDDLSDEQIIDRIRALDAA 97 Query: 97 LQIFTDFKNKDAHSRE 112 ++ + + R+ Sbjct: 98 IRPLLSLRKRAGGLRK 113 >gi|315122537|ref|YP_004063026.1| hypothetical protein CKC_03945 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495939|gb|ADR52538.1| hypothetical protein CKC_03945 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 117 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 71/106 (66%), Positives = 87/106 (82%) Query: 12 KYPCINRLSHKEISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIK 71 KY N+L H++ISLQ LY++VL DFSK+GRKV+KNLR+EKPEQYLRLI+QILP+E IK Sbjct: 12 KYSEKNQLFHQDISLQRLYKNVLIKDFSKHGRKVVKNLRKEKPEQYLRLIAQILPKENIK 71 Query: 72 QDGITNGDQLTDEQLCEIIRSLEKELQIFTDFKNKDAHSRETKETT 117 ++ NGD LTDEQLCEIIRSLEKELQ+ TD KN+D + K+TT Sbjct: 72 EEETINGDPLTDEQLCEIIRSLEKELQLVTDSKNQDVCPTKVKKTT 117 >gi|227821701|ref|YP_002825671.1| hypothetical protein NGR_c11330 [Sinorhizobium fredii NGR234] gi|227340700|gb|ACP24918.1| hypothetical protein NGR_c11330 [Sinorhizobium fredii NGR234] Length = 125 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. Identities = 35/85 (41%), Positives = 50/85 (58%), Gaps = 3/85 (3%) Query: 34 LTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEIIRSL 93 + DF ++G I +REEKP+QYL++I+ ILP++ I N D LTD+QL E IRSL Sbjct: 42 MHADFEQHGVAAIVRVREEKPDQYLKVIASILPKDL--NVNINNMDDLTDDQLIERIRSL 99 Query: 94 EKELQIFTDFKNKDAHSRET-KETT 117 + ++ F D + T ETT Sbjct: 100 DSAIRPFLDAQGASGSVGGTGPETT 124 >gi|227822453|ref|YP_002826425.1| hypothetical protein NGR_c19080 [Sinorhizobium fredii NGR234] gi|227341454|gb|ACP25672.1| hypothetical protein NGR_c19080 [Sinorhizobium fredii NGR234] Length = 151 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 26/69 (37%), Positives = 44/69 (63%), Gaps = 2/69 (2%) Query: 37 DFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEIIRSLEKE 96 DF+ +G VI +R EKPE YL+L++ +LP++ G T D+L+D+QL E IR+L+ Sbjct: 70 DFAAHGVGVIARIRAEKPETYLKLVASLLPKDLNAAAGGT--DELSDDQLIERIRALDSA 127 Query: 97 LQIFTDFKN 105 ++ + + Sbjct: 128 IRPLFNARK 136 >gi|260463787|ref|ZP_05811984.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030384|gb|EEW31663.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 131 Score = 95.9 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 2/73 (2%) Query: 30 YQSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEI 89 + + + DF +G VI +R +KP+QYL+++ +LP++ I D L+DE++ Sbjct: 44 FLAAVRADFRTHGAGVIAEVRADKPDQYLKIVLSVLPKD--FDVSINQLDGLSDEEIRSR 101 Query: 90 IRSLEKELQIFTD 102 IR+LE L+ F + Sbjct: 102 IRALETALRPFLE 114 >gi|13471715|ref|NP_103282.1| hypothetical protein mll1773 [Mesorhizobium loti MAFF303099] gi|14022459|dbj|BAB49068.1| mll1773 [Mesorhizobium loti MAFF303099] Length = 130 Score = 95.9 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 2/73 (2%) Query: 30 YQSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEI 89 + + + DF +G VI +R +KP+QYL+++ +LPR+ I + D L+DE++ Sbjct: 43 FLAAVRADFRAHGAGVIAEVRADKPDQYLKIVLSVLPRD--FDVAINHLDALSDEEIRSR 100 Query: 90 IRSLEKELQIFTD 102 IR LE L+ F + Sbjct: 101 IRGLETVLRPFLE 113 >gi|158422461|ref|YP_001523753.1| putative bacteriophage protein [Azorhizobium caulinodans ORS 571] gi|158329350|dbj|BAF86835.1| putative bacteriophage protein [Azorhizobium caulinodans ORS 571] Length = 220 Score = 75.1 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 4/68 (5%) Query: 30 YQSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEI 89 + + L DDF + G + I+ +REEKP+QYL++++ ILP+E + L+D++L Sbjct: 133 FIAALHDDFIENGPETIQRVREEKPDQYLKVVASILPKELKVTTEV----DLSDDELDRR 188 Query: 90 IRSLEKEL 97 IR L L Sbjct: 189 IRQLAAAL 196 >gi|298292906|ref|YP_003694845.1| bacteriophage protein [Starkeya novella DSM 506] gi|296929417|gb|ADH90226.1| putative bacteriophage protein [Starkeya novella DSM 506] Length = 106 Score = 60.9 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 20/59 (33%), Positives = 34/59 (57%), Gaps = 2/59 (3%) Query: 32 SVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEII 90 S L +F ++ VI +RE KPE YL++++ +LPRE + +L+D +L +I Sbjct: 28 SDLYINFRQHEPSVIAKVRETKPEVYLKVVASLLPREVKLDA--PDLRELSDAELMAVI 84 >gi|148557329|ref|YP_001264911.1| hypothetical protein Swit_4435 [Sphingomonas wittichii RW1] gi|148502519|gb|ABQ70773.1| hypothetical protein Swit_4435 [Sphingomonas wittichii RW1] Length = 99 Score = 43.5 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 14/58 (24%), Positives = 32/58 (55%), Gaps = 2/58 (3%) Query: 34 LTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEIIR 91 L ++++G I+ +R E+P YLRL++ L + + + L+D+++ + +R Sbjct: 29 LQASWARHGAATIEQVRNERPHDYLRLMASSLAK--RAEGKSDAIEALSDDEIADELR 84 >gi|302308347|ref|NP_985233.2| AER377Cp [Ashbya gossypii ATCC 10895] gi|299789411|gb|AAS53057.2| AER377Cp [Ashbya gossypii ATCC 10895] Length = 1012 Score = 38.9 bits (89), Expect = 0.25, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 9/81 (11%) Query: 19 LSHKEISL--QYLYQSVLTDDFSK------YGRKVIKN-LREEKPEQYLRLISQILPREK 69 L H+E S+ QYL +++LTDD G KVI + E L+ I ++LP Sbjct: 488 LKHEERSMVIQYLTETILTDDNKNIYIACLEGLKVISRNYEADVHEVSLKRILELLPESA 547 Query: 70 IKQDGITNGDQLTDEQLCEII 90 + I NG++ + + + ++I Sbjct: 548 SAETVIVNGEETSVKHILKVI 568 >gi|126338697|ref|XP_001363409.1| PREDICTED: similar to dynamin 1-like, isoform 2 [Monodelphis domestica] Length = 712 Score = 37.8 bits (86), Expect = 0.55, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 36/72 (50%) Query: 9 IRKKYPCINRLSHKEISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPRE 68 +RK+ P N + H ++++ Y + DF+ + N+ E++ + R ++ +PR+ Sbjct: 485 LRKRLPVTNEMVHNLVAIELAYINTKHPDFADACGLMNNNIEEQRRNRLARELTSTVPRD 544 Query: 69 KIKQDGITNGDQ 80 K+ G GD Sbjct: 545 KVAPAGSGVGDG 556 >gi|126338699|ref|XP_001363490.1| PREDICTED: similar to dynamin 1-like, isoform 3 [Monodelphis domestica] Length = 699 Score = 37.8 bits (86), Expect = 0.55, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 36/72 (50%) Query: 9 IRKKYPCINRLSHKEISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPRE 68 +RK+ P N + H ++++ Y + DF+ + N+ E++ + R ++ +PR+ Sbjct: 472 LRKRLPVTNEMVHNLVAIELAYINTKHPDFADACGLMNNNIEEQRRNRLARELTSTVPRD 531 Query: 69 KIKQDGITNGDQ 80 K+ G GD Sbjct: 532 KVAPAGSGVGDG 543 >gi|300778872|ref|ZP_07088730.1| metallo-beta-lactamase superfamily protein [Chryseobacterium gleum ATCC 35910] gi|300504382|gb|EFK35522.1| metallo-beta-lactamase superfamily protein [Chryseobacterium gleum ATCC 35910] Length = 473 Score = 37.4 bits (85), Expect = 0.72, Method: Composition-based stats. Identities = 12/31 (38%), Positives = 20/31 (64%) Query: 8 PIRKKYPCINRLSHKEISLQYLYQSVLTDDF 38 P+R+ P INRL E+ L+Y++++ DF Sbjct: 29 PLRETQPYINRLKEDEVELKYIFETHFHADF 59 >gi|325963726|ref|YP_004241632.1| condensin subunit Smc [Arthrobacter phenanthrenivorans Sphe3] gi|323469813|gb|ADX73498.1| condensin subunit Smc [Arthrobacter phenanthrenivorans Sphe3] Length = 1194 Score = 35.5 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 38/73 (52%), Gaps = 5/73 (6%) Query: 24 ISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQY--LRLISQILPREKIKQDGITNGDQL 81 +SLQY+ QSV D +++ R + + RE++ + +R +++L RE + D+L Sbjct: 852 LSLQYVDQSV---DLARHERDLAEENREKRDQALAEIRTANELLARELADLTDNVHRDEL 908 Query: 82 TDEQLCEIIRSLE 94 Q I +LE Sbjct: 909 ARAQQQARIEALE 921 >gi|294674640|ref|YP_003575256.1| ABC transporter permease [Prevotella ruminicola 23] gi|294474339|gb|ADE83728.1| ABC transporter, permease protein [Prevotella ruminicola 23] Length = 806 Score = 35.5 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 24/56 (42%) Query: 57 YLRLISQILPREKIKQDGITNGDQLTDEQLCEIIRSLEKELQIFTDFKNKDAHSRE 112 Y+RL I P + Q D L ++L + L+ L+ D+ +D +R Sbjct: 241 YVRLAEGITPDDLKAQIAKMRRDNLPADELKKAGVDLDYSLRPLADYHTQDDATRR 296 >gi|240275770|gb|EER39283.1| RecQ family helicase MusN [Ajellomyces capsulatus H143] Length = 1559 Score = 35.5 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 11/89 (12%) Query: 31 QSVLTDDFSKYGRKVIKNLREEK-------PEQYLRLISQILPREKIKQDGITNGD---- 79 QS++ D+F Y + ++ +K P+ LR ++ LPR+K + I N D Sbjct: 1306 QSIIVDEFMFYAKNTCHDIMMKKSLRNQPFPDAILREMAISLPRDKSQLLKIPNIDPDKV 1365 Query: 80 QLTDEQLCEIIRSLEKELQIFTDFKNKDA 108 +Q +IR E Q +++D Sbjct: 1366 DRYGDQFLRLIRRAEDRYQELLAERDRDG 1394 >gi|325093139|gb|EGC46449.1| RecQ family helicase MusN [Ajellomyces capsulatus H88] Length = 1559 Score = 35.1 bits (79), Expect = 2.9, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 11/89 (12%) Query: 31 QSVLTDDFSKYGRKVIKNLREEK-------PEQYLRLISQILPREKIKQDGITNGD---- 79 QS++ D+F Y + ++ +K P+ LR ++ LPR+K + I N D Sbjct: 1306 QSIIVDEFMFYAKNTCHDIMMKKSLRNQPFPDAILREMAISLPRDKSQLLKIPNIDPDKV 1365 Query: 80 QLTDEQLCEIIRSLEKELQIFTDFKNKDA 108 +Q +IR E Q +++D Sbjct: 1366 DRYGDQFLRLIRRAEDRYQELLAERDRDG 1394 >gi|225563190|gb|EEH11469.1| RecQ family helicase MusN [Ajellomyces capsulatus G186AR] Length = 1559 Score = 35.1 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 11/89 (12%) Query: 31 QSVLTDDFSKYGRKVIKNLREEK-------PEQYLRLISQILPREKIKQDGITNGD---- 79 QS++ D+F Y + ++ +K P+ LR ++ LPR+K + I N D Sbjct: 1306 QSIIVDEFMFYAKNTCHDIMMKKSLRNQPFPDAILREMAISLPRDKSQLLKIPNIDPDKV 1365 Query: 80 QLTDEQLCEIIRSLEKELQIFTDFKNKDA 108 +Q +IR E Q +++D Sbjct: 1366 DRYGDQFLRLIRRAEDRYQELLAERDRDG 1394 >gi|315611813|ref|ZP_07886733.1| conserved hypothetical protein [Streptococcus sanguinis ATCC 49296] gi|315316095|gb|EFU64127.1| conserved hypothetical protein [Streptococcus sanguinis ATCC 49296] Length = 538 Score = 35.1 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 35/92 (38%), Gaps = 1/92 (1%) Query: 31 QSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQDGITNGDQLTDEQLCEII 90 Q D +YG VI L + Y L+S+ +P+ K + L E L I Sbjct: 44 QGAAYDSGKQYGMNVITPLLKGAI-MYTELVSEAVPKLPSKYRSEVGDEDLDSEVLESEI 102 Query: 91 RSLEKELQIFTDFKNKDAHSRETKETTKSASS 122 RSLE L N T +T S+ S Sbjct: 103 RSLEASLHSIRGMYNAMVGDESTNASTLSSLS 134 >gi|156740746|ref|YP_001430875.1| nucleotidyl transferase [Roseiflexus castenholzii DSM 13941] gi|156232074|gb|ABU56857.1| Nucleotidyl transferase [Roseiflexus castenholzii DSM 13941] Length = 354 Score = 33.9 bits (76), Expect = 7.0, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 34/86 (39%), Gaps = 15/86 (17%) Query: 13 YPCINRLSHKEISLQYLYQSVLTDDFSKYGRKVIKNLREEKPEQYLRLISQILPREKIKQ 72 +P ++R+ + +Y ++ + +G ++ LR + P + L Sbjct: 189 WPALDRIQRSPVKGEYYLTDLVALAIADHGIGAVQALRADDPSETL-------------- 234 Query: 73 DGITNGDQLTDEQLCEIIRSLEKELQ 98 G+ + QL + +R L+ ++ Sbjct: 235 -GVNDRVQLAQAEHVLRLRLLDALMR 259 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.311 0.131 0.347 Lambda K H 0.267 0.0409 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 909,527,935 Number of Sequences: 14124377 Number of extensions: 28894446 Number of successful extensions: 82984 Number of sequences better than 10.0: 25 Number of HSP's better than 10.0 without gapping: 13 Number of HSP's successfully gapped in prelim test: 12 Number of HSP's that attempted gapping in prelim test: 82962 Number of HSP's gapped (non-prelim): 25 length of query: 122 length of database: 4,842,793,630 effective HSP length: 89 effective length of query: 33 effective length of database: 3,585,724,077 effective search space: 118328894541 effective search space used: 118328894541 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 75 (33.5 bits)