BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781220|ref|YP_003065633.1| hypothetical protein CLIBASIA_05635 [Candidatus Liberibacter asiaticus str. psy62] (130 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781220|ref|YP_003065633.1| hypothetical protein CLIBASIA_05635 [Candidatus Liberibacter asiaticus str. psy62] gi|254040897|gb|ACT57693.1| hypothetical protein CLIBASIA_05635 [Candidatus Liberibacter asiaticus str. psy62] gi|317120684|gb|ADV02507.1| hypothetical protein SC1_gp135 [Liberibacter phage SC1] gi|317120726|gb|ADV02548.1| hypothetical protein SC2_gp135 [Liberibacter phage SC2] gi|317120787|gb|ADV02608.1| hypothetical protein SC2_gp135 [Liberibacter phage SC2] gi|317120828|gb|ADV02649.1| hypothetical protein SC1_gp135 [Liberibacter phage SC1] Length = 130 Score = 174 bits (441), Expect = 3e-42, Method: Composition-based stats. Identities = 130/130 (100%), Positives = 130/130 (100%) Query: 1 MGQLKQYYLEEIEANYEFLSAVNPRMGIEPESNYEVEVIEKLERALKTSKKLIHFRDRTI 60 MGQLKQYYLEEIEANYEFLSAVNPRMGIEPESNYEVEVIEKLERALKTSKKLIHFRDRTI Sbjct: 1 MGQLKQYYLEEIEANYEFLSAVNPRMGIEPESNYEVEVIEKLERALKTSKKLIHFRDRTI 60 Query: 61 RTHILEDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCSEHPDQE 120 RTHILEDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCSEHPDQE Sbjct: 61 RTHILEDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCSEHPDQE 120 Query: 121 HKEDYYASQI 130 HKEDYYASQI Sbjct: 121 HKEDYYASQI 130 >gi|254780122|ref|YP_003064535.1| hypothetical protein CLIBASIA_00005 [Candidatus Liberibacter asiaticus str. psy62] gi|254039799|gb|ACT56595.1| hypothetical protein CLIBASIA_00005 [Candidatus Liberibacter asiaticus str. psy62] gi|317120736|gb|ADV02558.1| hypothetical protein SC2_gp185 [Liberibacter phage SC2] gi|317120797|gb|ADV02618.1| hypothetical protein SC2_gp185 [Liberibacter phage SC2] Length = 123 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 41/134 (30%), Positives = 68/134 (50%), Gaps = 15/134 (11%) Query: 1 MGQLKQYYLEEIEANYEFLSAVNPRMGIEPESNYEVEVIEK---LERALKTSKKLIHFRD 57 MG LK ++ +EI N+ F S N +P+ + E+++ E L+ + ++ Sbjct: 1 MGALKNHFHDEINENFYFHSHPNA----DPDISIEMQISENQRYLDEEISQCNAVVDVFK 56 Query: 58 RTIRTHILE-DLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCSEH 116 R+ T + + D ++++ I L +A + L+ + E W E D E E EH Sbjct: 57 RSDSTILDKLDAMDDLKTYISLLQATAKNLKSLL---KEYW----EESLDGEDDEEIYEH 109 Query: 117 PDQEHKEDYYASQI 130 PDQEH+EDYYA+QI Sbjct: 110 PDQEHREDYYANQI 123 >gi|317120693|gb|ADV02516.1| hypothetical protein SC1_gp185 [Liberibacter phage SC1] gi|317120837|gb|ADV02658.1| hypothetical protein SC1_gp185 [Liberibacter phage SC1] Length = 123 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 41/134 (30%), Positives = 68/134 (50%), Gaps = 15/134 (11%) Query: 1 MGQLKQYYLEEIEANYEFLSAVNPRMGIEPESNYEVEVIEK---LERALKTSKKLIHFRD 57 MG LK ++ +EI N+ F S N +P+ + E+++ E L+ + ++ Sbjct: 1 MGALKNHFHDEINENFYFHSHPNA----DPDISIEMQISENQRYLDEEISQCNAVVDVFK 56 Query: 58 RTIRTHILE-DLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCSEH 116 R+ T + + D ++++ I L +A + L+ + E W E D E E EH Sbjct: 57 RSDSTILDKLDAVDDLKTYISLLQATAKNLKSLL---KEYW----EESLDGEDDEEIYEH 109 Query: 117 PDQEHKEDYYASQI 130 PDQEH+EDYYA+QI Sbjct: 110 PDQEHREDYYANQI 123 >gi|315121961|ref|YP_004062450.1| hypothetical protein CKC_01055 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122929|ref|YP_004063418.1| hypothetical protein CKC_05925 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495363|gb|ADR51962.1| hypothetical protein CKC_01055 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496331|gb|ADR52930.1| hypothetical protein CKC_05925 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 128 Score = 64.8 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 14/136 (10%) Query: 1 MGQLKQYYLEEIEANYEFLSAVNPRMGIEPESNYEV-----EVIEKLERALKTSKKLIHF 55 MG+LKQ+Y+EEIEANYEF NPR ++PE + E+I L++ L+ Sbjct: 1 MGKLKQHYMEEIEANYEFNCLNNPRFNLDPEPEPSIKEEVEELIRSLDQRFLWCCDLVRD 60 Query: 56 RDRTIRTHILE-DLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCS 114 R+ T + + D +EE+ + + + RLE + E E E Sbjct: 61 LQRSDYTILDKLDAMEEMRMYLSVLRPTANRLEDLLKESWEEPEKEPEEEP--------R 112 Query: 115 EHPDQEHKEDYYASQI 130 EHPDQEH + YYA QI Sbjct: 113 EHPDQEHADAYYADQI 128 >gi|327192599|gb|EGE59544.1| transcriptional regulator domain-containing protein [Rhizobium etli CNPAF512] Length = 511 Score = 37.4 bits (85), Expect = 0.60, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 32/94 (34%) Query: 6 QYYLEEIEANYEFLSAVNPRMGIEPESNYEVEVIEKLERALKTSKKLIHFRDRTIRTHIL 65 ++ + I + F + R+ E E+ I++ R S + R + I Sbjct: 212 DHFEDGIGGMFAFQDHITERVASTVEPTIEIAEIQRSRRDRPNSPAVYDLYLRALAAIID 271 Query: 66 EDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRL 99 E + +L +A E + + W L Sbjct: 272 ESIENNAIAYGLLQQALAIEPENPMILSHAAWAL 305 >gi|225556604|gb|EEH04892.1| tubulin-tyrosine ligase [Ajellomyces capsulatus G186AR] Length = 1188 Score = 35.1 bits (79), Expect = 2.9, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 45/118 (38%), Gaps = 15/118 (12%) Query: 4 LKQYYLEEIEANYEFLSAVNPRMG--IEPESNYEVEVIEKLERALKTSKKLIHFRDRTIR 61 ++++YL +N+ + + ++P ++E++ E L+ AL + +L Sbjct: 812 IRKHYLSNSVSNW-VAKHPDSILNAHVKPAVHFELDYAEFLDEALVEAYELQESFQANEG 870 Query: 62 TH-------ILEDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCEN 112 IL+ + + R I L E + E E W ED E EN Sbjct: 871 KEEAEKEWWILKPGMSDRGRGIRLF-----NSESALREIFEGWEDDQPDSEDGEDSEN 923 >gi|145489920|ref|XP_001430961.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398063|emb|CAK63563.1| unnamed protein product [Paramecium tetraurelia] Length = 425 Score = 35.1 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 44/116 (37%), Gaps = 8/116 (6%) Query: 9 LEEIEANYEFLSAVNP---RMGIEPESNYEVEVIEKLERALKTSKKLIHFRDRTIRTHIL 65 LE I NY+F + I+ EV+V +KL++ + + I D+ + Sbjct: 184 LESILENYQFHQGRYVGLIKSFIDTLQGTEVQVTQKLDQQTQEKQLFIQTVDKATKKFSA 243 Query: 66 EDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCSEHPDQEH 121 ED H +L L+ + E + R + +G + E QEH Sbjct: 244 ED-----QEYARNVDGHDYKLFLQELYEKEYQSGFAQNRFNEKGQKLVYESQAQEH 294 >gi|325087612|gb|EGC40922.1| tubulin-tyrosine ligase [Ajellomyces capsulatus H88] Length = 1206 Score = 34.7 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 45/118 (38%), Gaps = 15/118 (12%) Query: 4 LKQYYLEEIEANYEFLSAVNPRMG--IEPESNYEVEVIEKLERALKTSKKLIHFRDRTIR 61 ++++YL +N+ + + ++P ++E++ E L+ AL + +L Sbjct: 830 IRKHYLSNSVSNW-VAKHPDSILDAHVKPAVHFELDYAEFLDEALVEAYELQESFQANEG 888 Query: 62 TH-------ILEDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCEN 112 IL+ + + R I L E + E E W ED E EN Sbjct: 889 KEEAEKEWWILKPGMSDRGRGIRLF-----NSESALREIFEGWEDDQPDSEDGEDSEN 941 >gi|240281460|gb|EER44963.1| tubulin-tyrosine ligase [Ajellomyces capsulatus H143] Length = 1229 Score = 34.7 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 45/118 (38%), Gaps = 15/118 (12%) Query: 4 LKQYYLEEIEANYEFLSAVNPRMG--IEPESNYEVEVIEKLERALKTSKKLIHFRDRTIR 61 ++++YL +N+ + + ++P ++E++ E L+ AL + +L Sbjct: 853 IRKHYLSNSVSNW-VAKHPDSILDAHVKPAVHFELDYAEFLDEALVEAYELQESFQANEG 911 Query: 62 TH-------ILEDLIEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEAREDCEGCEN 112 IL+ + + R I L E + E E W ED E EN Sbjct: 912 KEEAEKEWWILKPGMSDRGRGIRLF-----NSESALREIFEGWEDDQPDSEDGEDSEN 964 >gi|224106974|ref|XP_002314329.1| predicted protein [Populus trichocarpa] gi|222863369|gb|EEF00500.1| predicted protein [Populus trichocarpa] Length = 260 Score = 34.3 bits (77), Expect = 4.9, Method: Composition-based stats. Identities = 22/98 (22%), Positives = 42/98 (42%) Query: 9 LEEIEANYEFLSAVNPRMGIEPESNYEVEVIEKLERALKTSKKLIHFRDRTIRTHILEDL 68 EI+ N+ LS R+ +SN + + +E+L +K K+LI DR I+ ++ Sbjct: 15 HGEIKDNFRALSNGFQRLNNIKDSNRQSKQLEELTGRMKECKRLIKEFDREIKVEESKNP 74 Query: 69 IEEVNRIIVLAKAHKRRLELKIFEDNEVWRLLDEARED 106 E ++ ++ + L + LD R + Sbjct: 75 PEVNKQLNDEKQSMIKELNSYVQLRKTYMNSLDNKRVE 112 >gi|171679545|ref|XP_001904719.1| hypothetical protein [Podospora anserina S mat+] gi|170939398|emb|CAP64626.1| unnamed protein product [Podospora anserina S mat+] Length = 947 Score = 34.0 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 24/106 (22%), Positives = 42/106 (39%), Gaps = 1/106 (0%) Query: 20 SAVNPRMGIEPESNYEVEVIEKLERALKTSKKLIHFRDRTIRTHILEDLIEEVNRIIVLA 79 +AVNP + E + + E IE + + R+ +EDL++EV +I Sbjct: 383 NAVNPALVTEIDDDLVKEAIENWDSVVVDPTLDEVLGKRSDGEKEVEDLLDEVTDMIETL 442 Query: 80 KAHKRRLELKIFEDNEVWRLLDEAREDCEGCENCSEHPDQEHKEDY 125 + +R L + + + D D N S+ P +E Y Sbjct: 443 SSFQRNRNLTVPTSQDRYS-ADPNNGDMLRNGNASQQPSEEEMMTY 487 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.312 0.128 0.337 Lambda K H 0.267 0.0379 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 864,499,118 Number of Sequences: 14124377 Number of extensions: 19335672 Number of successful extensions: 92662 Number of sequences better than 10.0: 53 Number of HSP's better than 10.0 without gapping: 11 Number of HSP's successfully gapped in prelim test: 42 Number of HSP's that attempted gapping in prelim test: 92632 Number of HSP's gapped (non-prelim): 57 length of query: 130 length of database: 4,842,793,630 effective HSP length: 96 effective length of query: 34 effective length of database: 3,486,853,438 effective search space: 118553016892 effective search space used: 118553016892 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 75 (33.6 bits)