BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781124|ref|YP_003065537.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] (101 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781124|ref|YP_003065537.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] gi|254040801|gb|ACT57597.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] Length = 101 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG 60 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG Sbjct: 1 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG 60 Query: 61 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK 101 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK Sbjct: 61 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK 101 >gi|254780203|ref|YP_003064616.1| hypothetical protein CLIBASIA_00440 [Candidatus Liberibacter asiaticus str. psy62] gi|254039880|gb|ACT56676.1| hypothetical protein CLIBASIA_00440 [Candidatus Liberibacter asiaticus str. psy62] Length = 82 Score = 94.4 bits (233), Expect = 5e-18, Method: Composition-based stats. Identities = 28/62 (45%), Positives = 45/62 (72%), Gaps = 3/62 (4%) Query: 15 QYRPLRKEVRNAFPKILKSIEKALEPY-VDPLIEPVEIGKEGMVDEGYRLHICALIKLLE 73 +YRP ++EVR+AFP+++K++EK L + V+PL PV+ +G E Y H+C LI+LL+ Sbjct: 5 KYRPSKEEVRDAFPELIKALEKGLSTFDVEPLKNPVKSHGKGY--ESYISHVCELIELLK 62 Query: 74 NR 75 N+ Sbjct: 63 NK 64 >gi|254781189|ref|YP_003065602.1| hypothetical protein CLIBASIA_05480 [Candidatus Liberibacter asiaticus str. psy62] gi|254040866|gb|ACT57662.1| hypothetical protein CLIBASIA_05480 [Candidatus Liberibacter asiaticus str. psy62] Length = 252 Score = 41.2 bits (95), Expect = 0.042, Method: Composition-based stats. Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 7/61 (11%) Query: 14 NQYRPLRKEVRNAFP-KILKSIEKALEPYVDPLIEPVEIGKEGMVDEGYRLHICALIKLL 72 N+YRP + +R P K++K E + YVDPL + Y H CAL+ L Sbjct: 178 NKYRPSAEAMRTICPTKLMKIFEDTISLYVDPLT------PRDISFTQYEKHACALVNWL 231 Query: 73 E 73 E Sbjct: 232 E 232 >gi|145323938|ref|NP_001077558.1| unknown protein [Arabidopsis thaliana] gi|332191615|gb|AEE29736.1| uncharacterized protein [Arabidopsis thaliana] Length = 1014 Score = 38.9 bits (89), Expect = 0.26, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 45/92 (48%), Gaps = 12/92 (13%) Query: 20 RKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEGYRLHICALIKLLENRSNHQ 79 RK V +A ++L +E YVDP ++ + K+ + + +C+ I++L+ ++ + Sbjct: 874 RKLVFDAVNEMLGKKLAFVESYVDPWMKQAKARKKVLSAQNLLKELCSEIEILQKQAKKR 933 Query: 80 T------------KNDINKCVLSEEFTIQNNK 99 + + D KC+L E+ IQ+ K Sbjct: 934 SENLLLLEEEEEEEEDFLKCILDEDMAIQSEK 965 >gi|15221824|ref|NP_173297.1| unknown protein [Arabidopsis thaliana] gi|9795593|gb|AAF98411.1|AC026238_3 Unknown protein [Arabidopsis thaliana] gi|20856609|gb|AAM26675.1| At1g18620/F25I16_13 [Arabidopsis thaliana] gi|32306507|gb|AAP78937.1| At1g18620 [Arabidopsis thaliana] gi|332191614|gb|AEE29735.1| uncharacterized protein [Arabidopsis thaliana] Length = 978 Score = 38.5 bits (88), Expect = 0.28, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 45/92 (48%), Gaps = 12/92 (13%) Query: 20 RKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEGYRLHICALIKLLENRSNHQ 79 RK V +A ++L +E YVDP ++ + K+ + + +C+ I++L+ ++ + Sbjct: 838 RKLVFDAVNEMLGKKLAFVESYVDPWMKQAKARKKVLSAQNLLKELCSEIEILQKQAKKR 897 Query: 80 T------------KNDINKCVLSEEFTIQNNK 99 + + D KC+L E+ IQ+ K Sbjct: 898 SENLLLLEEEEEEEEDFLKCILDEDMAIQSEK 929 >gi|91216863|ref|ZP_01253827.1| hypothetical protein P700755_04997 [Psychroflexus torquis ATCC 700755] gi|91185024|gb|EAS71403.1| hypothetical protein P700755_04997 [Psychroflexus torquis ATCC 700755] Length = 349 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 18/54 (33%), Positives = 31/54 (57%), Gaps = 7/54 (12%) Query: 37 ALEPYVDPLIEPVE-----IGKEGMVDEGYRLHICALIKLLENRSN--HQTKND 83 ALEP++DP+I+ V K G V + + LH+ L+ + +N+ N H K++ Sbjct: 108 ALEPFIDPVIQKVSASVSSSFKRGQVLQSHLLHLKRLLAIQKNKFNTLHNFKSE 161 >gi|261332578|emb|CBH15573.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] Length = 1366 Score = 35.0 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 31/74 (41%), Gaps = 1/74 (1%) Query: 7 MYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYV-DPLIEPVEIGKEGMVDEGYRLHI 65 +YAP + RP + R+++ + + + PY+ P+ + G +D L Sbjct: 902 LYAPSTSHVLRPTAADTRDSYVSKILKLARQTSPYILKPIARENSKNRRGFMDNSTTLRE 961 Query: 66 CALIKLLENRSNHQ 79 +L RS H+ Sbjct: 962 FQATRLAIQRSLHE 975 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.134 0.380 Lambda K H 0.267 0.0414 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 941,334,900 Number of Sequences: 14124377 Number of extensions: 34517233 Number of successful extensions: 68969 Number of sequences better than 10.0: 9 Number of HSP's better than 10.0 without gapping: 3 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 68962 Number of HSP's gapped (non-prelim): 9 length of query: 101 length of database: 4,842,793,630 effective HSP length: 70 effective length of query: 31 effective length of database: 3,854,087,240 effective search space: 119476704440 effective search space used: 119476704440 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.6 bits) S2: 75 (33.5 bits)