BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781124|ref|YP_003065537.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] (101 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781124|ref|YP_003065537.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] gi|254040801|gb|ACT57597.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] Length = 101 Score = 207 bits (528), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG 60 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG Sbjct: 1 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG 60 Query: 61 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK 101 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK Sbjct: 61 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK 101 >gi|254780203|ref|YP_003064616.1| hypothetical protein CLIBASIA_00440 [Candidatus Liberibacter asiaticus str. psy62] gi|254039880|gb|ACT56676.1| hypothetical protein CLIBASIA_00440 [Candidatus Liberibacter asiaticus str. psy62] Length = 82 Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 28/66 (42%), Positives = 46/66 (69%), Gaps = 3/66 (4%) Query: 12 EGNQYRPLRKEVRNAFPKILKSIEKALEPY-VDPLIEPVEIGKEGMVDEGYRLHICALIK 70 + +YRP ++EVR+AFP+++K++EK L + V+PL PV+ +G E Y H+C LI+ Sbjct: 2 DDRKYRPSKEEVRDAFPELIKALEKGLSTFDVEPLKNPVKSHGKGY--ESYISHVCELIE 59 Query: 71 LLENRS 76 LL+N+ Sbjct: 60 LLKNKD 65 >gi|238487886|ref|XP_002375181.1| calpain-like protein [Aspergillus flavus NRRL3357] gi|220700060|gb|EED56399.1| calpain-like protein [Aspergillus flavus NRRL3357] Length = 572 Score = 34.7 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 21/42 (50%) Query: 3 PHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDP 44 P G++ G Y RKE R A +I+K E+ + Y DP Sbjct: 68 PKGVIQGQDAGKSYEEARKECRRAVDRIVKECERLNQKYTDP 109 >gi|317143265|ref|XP_001819363.2| calpain-like protein [Aspergillus oryzae RIB40] Length = 859 Score = 34.3 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 21/42 (50%) Query: 3 PHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDP 44 P G++ G Y RKE R A +I+K E+ + Y DP Sbjct: 111 PKGVIQGQDAGKSYEEARKECRRAVDRIVKECERLNQKYTDP 152 >gi|83767222|dbj|BAE57361.1| unnamed protein product [Aspergillus oryzae] Length = 816 Score = 34.3 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 21/42 (50%) Query: 3 PHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDP 44 P G++ G Y RKE R A +I+K E+ + Y DP Sbjct: 68 PKGVIQGQDAGKSYEEARKECRRAVDRIVKECERLNQKYTDP 109 Searching..................................................done Results from round 2 CONVERGED! >gi|254781124|ref|YP_003065537.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] gi|254040801|gb|ACT57597.1| hypothetical protein CLIBASIA_05130 [Candidatus Liberibacter asiaticus str. psy62] Length = 101 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG 60 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG Sbjct: 1 MNPHGLMYAPKEGNQYRPLRKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEG 60 Query: 61 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK 101 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK Sbjct: 61 YRLHICALIKLLENRSNHQTKNDINKCVLSEEFTIQNNKGK 101 >gi|254780203|ref|YP_003064616.1| hypothetical protein CLIBASIA_00440 [Candidatus Liberibacter asiaticus str. psy62] gi|254039880|gb|ACT56676.1| hypothetical protein CLIBASIA_00440 [Candidatus Liberibacter asiaticus str. psy62] Length = 82 Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 28/66 (42%), Positives = 46/66 (69%), Gaps = 3/66 (4%) Query: 12 EGNQYRPLRKEVRNAFPKILKSIEKALEPY-VDPLIEPVEIGKEGMVDEGYRLHICALIK 70 + +YRP ++EVR+AFP+++K++EK L + V+PL PV+ +G E Y H+C LI+ Sbjct: 2 DDRKYRPSKEEVRDAFPELIKALEKGLSTFDVEPLKNPVKSHGKGY--ESYISHVCELIE 59 Query: 71 LLENRS 76 LL+N+ Sbjct: 60 LLKNKD 65 >gi|254781189|ref|YP_003065602.1| hypothetical protein CLIBASIA_05480 [Candidatus Liberibacter asiaticus str. psy62] gi|254040866|gb|ACT57662.1| hypothetical protein CLIBASIA_05480 [Candidatus Liberibacter asiaticus str. psy62] Length = 252 Score = 42.4 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 29/62 (46%), Gaps = 7/62 (11%) Query: 13 GNQYRPLRKEVRNAFP-KILKSIEKALEPYVDPLIEPVEIGKEGMVDEGYRLHICALIKL 71 N+YRP + +R P K++K E + YVDPL + Y H CAL+ Sbjct: 177 DNKYRPSAEAMRTICPTKLMKIFEDTISLYVDPLT------PRDISFTQYEKHACALVNW 230 Query: 72 LE 73 LE Sbjct: 231 LE 232 >gi|145323938|ref|NP_001077558.1| unknown protein [Arabidopsis thaliana] gi|332191615|gb|AEE29736.1| uncharacterized protein [Arabidopsis thaliana] Length = 1014 Score = 37.7 bits (86), Expect = 0.50, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 45/92 (48%), Gaps = 12/92 (13%) Query: 20 RKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEGYRLHICALIKLLENRSNHQ 79 RK V +A ++L +E YVDP ++ + K+ + + +C+ I++L+ ++ + Sbjct: 874 RKLVFDAVNEMLGKKLAFVESYVDPWMKQAKARKKVLSAQNLLKELCSEIEILQKQAKKR 933 Query: 80 T------------KNDINKCVLSEEFTIQNNK 99 + + D KC+L E+ IQ+ K Sbjct: 934 SENLLLLEEEEEEEEDFLKCILDEDMAIQSEK 965 >gi|15221824|ref|NP_173297.1| unknown protein [Arabidopsis thaliana] gi|9795593|gb|AAF98411.1|AC026238_3 Unknown protein [Arabidopsis thaliana] gi|20856609|gb|AAM26675.1| At1g18620/F25I16_13 [Arabidopsis thaliana] gi|32306507|gb|AAP78937.1| At1g18620 [Arabidopsis thaliana] gi|332191614|gb|AEE29735.1| uncharacterized protein [Arabidopsis thaliana] Length = 978 Score = 37.7 bits (86), Expect = 0.54, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 45/92 (48%), Gaps = 12/92 (13%) Query: 20 RKEVRNAFPKILKSIEKALEPYVDPLIEPVEIGKEGMVDEGYRLHICALIKLLENRSNHQ 79 RK V +A ++L +E YVDP ++ + K+ + + +C+ I++L+ ++ + Sbjct: 838 RKLVFDAVNEMLGKKLAFVESYVDPWMKQAKARKKVLSAQNLLKELCSEIEILQKQAKKR 897 Query: 80 T------------KNDINKCVLSEEFTIQNNK 99 + + D KC+L E+ IQ+ K Sbjct: 898 SENLLLLEEEEEEEEDFLKCILDEDMAIQSEK 929 >gi|71896325|ref|NP_001025537.1| elongation factor Tu GTP binding domain containing 2 [Xenopus (Silurana) tropicalis] gi|60618366|gb|AAH90572.1| eftud2 protein [Xenopus (Silurana) tropicalis] gi|159155738|gb|AAI54880.1| eftud2 protein [Xenopus (Silurana) tropicalis] Length = 974 Score = 37.3 bits (85), Expect = 0.73, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 40/84 (47%), Gaps = 3/84 (3%) Query: 5 GLMYAPKEGNQYRPLRKEVRNAFPKILKSI-EKALEPYVDPLIEPVEIGKEGMVDEGYRL 63 G+ E +Y P +E+ P++ + E+ +P +P+I+PV+ K M+++G Sbjct: 56 GMEVVLHEDKKYYPTAEEIYG--PEVETIVQEEDTQPLTEPIIKPVKAKKFSMMEQGLPA 113 Query: 64 HICALIKLLENRSNHQTKNDINKC 87 + + L + N + ++ C Sbjct: 114 TVYEMDFLADLMDNPELIRNVTLC 137 >gi|160420302|ref|NP_001080281.1| U5 snRNP-specific protein, 116 kD [Xenopus laevis] gi|27469685|gb|AAH41724.1| Snrp116-pending-prov protein [Xenopus laevis] Length = 974 Score = 37.3 bits (85), Expect = 0.73, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 40/84 (47%), Gaps = 3/84 (3%) Query: 5 GLMYAPKEGNQYRPLRKEVRNAFPKILKSI-EKALEPYVDPLIEPVEIGKEGMVDEGYRL 63 G+ E +Y P +E+ P++ + E+ +P +P+I+PV+ K M+++G Sbjct: 56 GMEVVLHEDKKYYPTAEEIYG--PEVETIVQEEDTQPLTEPIIKPVKAKKFSMMEQGLPA 113 Query: 64 HICALIKLLENRSNHQTKNDINKC 87 + + L + N + ++ C Sbjct: 114 TVYEMDFLADLMDNPELIRNVTLC 137 >gi|134085912|ref|NP_001076865.1| 116 kDa U5 small nuclear ribonucleoprotein component [Bos taurus] gi|166231746|sp|A4FUD3|U5S1_BOVIN RecName: Full=116 kDa U5 small nuclear ribonucleoprotein component; AltName: Full=Elongation factor Tu GTP-binding domain protein 2; AltName: Full=U5 snRNP-specific protein, 116 kDa; Short=U5-116 kDa gi|133777447|gb|AAI14718.1| EFTUD2 protein [Bos taurus] Length = 972 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 40/84 (47%), Gaps = 3/84 (3%) Query: 5 GLMYAPKEGNQYRPLRKEVRNAFPKILKSI-EKALEPYVDPLIEPVEIGKEGMVDEGYRL 63 G+ E +Y P +EV P++ + E+ +P +P+I+PV+ K ++++ + Sbjct: 54 GMEVVLHEDKKYYPTAEEVYG--PEVETIVQEEDTQPLTEPIIKPVKTKKFTLMEQTLPV 111 Query: 64 HICALIKLLENRSNHQTKNDINKC 87 + + L + N + ++ C Sbjct: 112 TVYEMDSLADLMDNSELIRNVTLC 135 >gi|320095806|ref|ZP_08027448.1| methylmalonate-semialdehyde dehydrogenase [Actinomyces sp. oral taxon 178 str. F0338] gi|319977266|gb|EFW08967.1| methylmalonate-semialdehyde dehydrogenase [Actinomyces sp. oral taxon 178 str. F0338] Length = 508 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 3/41 (7%) Query: 12 EGNQYRPLRKEVRNAF---PKILKSIEKALEPYVDPLIEPV 49 +G YRP +E + F P IL ++++ L+ Y + + PV Sbjct: 356 DGRGYRPAGEEYADGFWLGPTILDNVDRGLQVYREEVFGPV 396 >gi|327275796|ref|XP_003222658.1| PREDICTED: 116 kDa U5 small nuclear ribonucleoprotein component-like [Anolis carolinensis] Length = 972 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 38/77 (49%), Gaps = 3/77 (3%) Query: 12 EGNQYRPLRKEVRNAFPKILKSI-EKALEPYVDPLIEPVEIGKEGMVDEGYRLHICALIK 70 E +Y P +EV P++ + E+ +P +P+I+PV+ K ++++ + + + Sbjct: 61 EDKKYYPTAEEVYG--PEVETIVQEEDTQPLTEPIIKPVKTKKFSLMEQTLPVTVYEMDF 118 Query: 71 LLENRSNHQTKNDINKC 87 L + N + ++ C Sbjct: 119 LADLMDNSELIRNVTLC 135 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.309 0.134 0.380 Lambda K H 0.267 0.0416 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,892,242,224 Number of Sequences: 14124377 Number of extensions: 68986286 Number of successful extensions: 133257 Number of sequences better than 10.0: 10 Number of HSP's better than 10.0 without gapping: 8 Number of HSP's successfully gapped in prelim test: 14 Number of HSP's that attempted gapping in prelim test: 133244 Number of HSP's gapped (non-prelim): 22 length of query: 101 length of database: 4,842,793,630 effective HSP length: 70 effective length of query: 31 effective length of database: 3,854,087,240 effective search space: 119476704440 effective search space used: 119476704440 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.6 bits) S2: 76 (33.9 bits)