BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780963|ref|YP_003065376.1| hypothetical protein CLIBASIA_04320 [Candidatus Liberibacter asiaticus str. psy62] (215 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780963|ref|YP_003065376.1| hypothetical protein CLIBASIA_04320 [Candidatus Liberibacter asiaticus str. psy62] gi|254040640|gb|ACT57436.1| hypothetical protein CLIBASIA_04320 [Candidatus Liberibacter asiaticus str. psy62] Length = 215 Score = 443 bits (1140), Expect = e-123, Method: Compositional matrix adjust. Identities = 215/215 (100%), Positives = 215/215 (100%) Query: 1 MVRVFCAIIFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTPLSLDLD 60 MVRVFCAIIFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTPLSLDLD Sbjct: 1 MVRVFCAIIFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTPLSLDLD 60 Query: 61 YKHRVYLDTYKSFSINLGFETFNEIVNPTTMRVLVLPVLSMHKTWNNNFDDSYFFKKIGV 120 YKHRVYLDTYKSFSINLGFETFNEIVNPTTMRVLVLPVLSMHKTWNNNFDDSYFFKKIGV Sbjct: 61 YKHRVYLDTYKSFSINLGFETFNEIVNPTTMRVLVLPVLSMHKTWNNNFDDSYFFKKIGV 120 Query: 121 GVVASTGFNTGDKWLGAEMGMSFYVYPTPWLILQSDFAIRHASSDVVVCMRYQAKFLITD 180 GVVASTGFNTGDKWLGAEMGMSFYVYPTPWLILQSDFAIRHASSDVVVCMRYQAKFLITD Sbjct: 121 GVVASTGFNTGDKWLGAEMGMSFYVYPTPWLILQSDFAIRHASSDVVVCMRYQAKFLITD 180 Query: 181 SIGILYRNVSAVSAAVDKNIGLGVTKIGLDYVYKF 215 SIGILYRNVSAVSAAVDKNIGLGVTKIGLDYVYKF Sbjct: 181 SIGILYRNVSAVSAAVDKNIGLGVTKIGLDYVYKF 215 >gi|254781154|ref|YP_003065567.1| hypothetical protein CLIBASIA_05305 [Candidatus Liberibacter asiaticus str. psy62] gi|254040831|gb|ACT57627.1| hypothetical protein CLIBASIA_05305 [Candidatus Liberibacter asiaticus str. psy62] Length = 65 Score = 36.6 bits (83), Expect = 2.7, Method: Compositional matrix adjust. Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 12/72 (16%) Query: 148 TPWLILQSDFAIRHASSDVVVCMRYQAKFLITDSIGILYRNVSAVSAAVDKNI----GLG 203 T WL LQ DF + R Q + LITD N+SA+ ++ DKNI G Sbjct: 2 TSWLKLQRDFGVVLIGDKAFNTYRSQLRILITD-------NLSALISS-DKNITESGKEG 53 Query: 204 VTKIGLDYVYKF 215 + K+ L YVY F Sbjct: 54 IVKMDLSYVYTF 65 Searching..................................................done Results from round 2 CONVERGED! >gi|254780963|ref|YP_003065376.1| hypothetical protein CLIBASIA_04320 [Candidatus Liberibacter asiaticus str. psy62] gi|254040640|gb|ACT57436.1| hypothetical protein CLIBASIA_04320 [Candidatus Liberibacter asiaticus str. psy62] Length = 215 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 215/215 (100%), Positives = 215/215 (100%) Query: 1 MVRVFCAIIFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTPLSLDLD 60 MVRVFCAIIFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTPLSLDLD Sbjct: 1 MVRVFCAIIFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTPLSLDLD 60 Query: 61 YKHRVYLDTYKSFSINLGFETFNEIVNPTTMRVLVLPVLSMHKTWNNNFDDSYFFKKIGV 120 YKHRVYLDTYKSFSINLGFETFNEIVNPTTMRVLVLPVLSMHKTWNNNFDDSYFFKKIGV Sbjct: 61 YKHRVYLDTYKSFSINLGFETFNEIVNPTTMRVLVLPVLSMHKTWNNNFDDSYFFKKIGV 120 Query: 121 GVVASTGFNTGDKWLGAEMGMSFYVYPTPWLILQSDFAIRHASSDVVVCMRYQAKFLITD 180 GVVASTGFNTGDKWLGAEMGMSFYVYPTPWLILQSDFAIRHASSDVVVCMRYQAKFLITD Sbjct: 121 GVVASTGFNTGDKWLGAEMGMSFYVYPTPWLILQSDFAIRHASSDVVVCMRYQAKFLITD 180 Query: 181 SIGILYRNVSAVSAAVDKNIGLGVTKIGLDYVYKF 215 SIGILYRNVSAVSAAVDKNIGLGVTKIGLDYVYKF Sbjct: 181 SIGILYRNVSAVSAAVDKNIGLGVTKIGLDYVYKF 215 >gi|254781154|ref|YP_003065567.1| hypothetical protein CLIBASIA_05305 [Candidatus Liberibacter asiaticus str. psy62] gi|254040831|gb|ACT57627.1| hypothetical protein CLIBASIA_05305 [Candidatus Liberibacter asiaticus str. psy62] Length = 65 Score = 36.1 bits (82), Expect = 3.5, Method: Composition-based stats. Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 12/72 (16%) Query: 148 TPWLILQSDFAIRHASSDVVVCMRYQAKFLITDSIGILYRNVSAVSAAVDKNI----GLG 203 T WL LQ DF + R Q + LITD N+SA+ ++ DKNI G Sbjct: 2 TSWLKLQRDFGVVLIGDKAFNTYRSQLRILITD-------NLSALISS-DKNITESGKEG 53 Query: 204 VTKIGLDYVYKF 215 + K+ L YVY F Sbjct: 54 IVKMDLSYVYTF 65 >gi|88602489|ref|YP_502667.1| phage integrase [Methanospirillum hungatei JF-1] gi|88187951|gb|ABD40948.1| phage integrase [Methanospirillum hungatei JF-1] Length = 301 Score = 35.7 bits (81), Expect = 4.0, Method: Composition-based stats. Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 8/89 (8%) Query: 2 VRVFCAIIFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTPLSLDLDY 61 +R F I++ +F G+ S+ EH++ L++ G +D+ + +IS Y LSLD Y Sbjct: 16 IRRFGQFIWLTRSFRGDLSKKWEHKELLRLETG------VDVSASDISRYLEFLSLDRQY 69 Query: 62 KHRVYLDTYKSFSINLGFETFNEIV--NP 88 Y S S F E+V NP Sbjct: 70 HATTYNRILSSLSSFYRFLLMQEVVETNP 98 >gi|317402017|gb|EFV82616.1| diacylglycerol kinase [Achromobacter xylosoxidans C54] Length = 330 Score = 35.4 bits (80), Expect = 5.5, Method: Composition-based stats. Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 4/59 (6%) Query: 9 IFVLITFIGEFSQALEHEDELKVNFGLMRRVMIDLWSREISSYRTP--LSLDLDYKHRV 65 +F++ +G + Q LE + K FG R ++ LWS ++ R P LSL LDY+ RV Sbjct: 155 LFLVNASLGLYPQLLEDREAYKQRFG--RSRLVALWSGLVTLMRAPRQLSLRLDYEGRV 211 >gi|156974061|ref|YP_001444968.1| hypothetical protein VIBHAR_01772 [Vibrio harveyi ATCC BAA-1116] gi|156525655|gb|ABU70741.1| hypothetical protein VIBHAR_01772 [Vibrio harveyi ATCC BAA-1116] Length = 461 Score = 35.4 bits (80), Expect = 6.0, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 40/88 (45%), Gaps = 15/88 (17%) Query: 41 IDLWSREISSYRTPLSLDLDYKHRVYLDTYKSFSINLGFETFNEIVNPTTMRVLV----- 95 +D+W RE SSY L+ K YL K IN+ F +EI+ + ++ Sbjct: 134 LDIWVRESSSYHKSLT-----KANNYLSIKKKLLINIMF--LDEIIQDYEILEMISAGFI 186 Query: 96 -LPVLSMHKT--WNNNFDDSYFFKKIGV 120 + VL HKT W N F D F ++ V Sbjct: 187 DVTVLDQHKTGMWGNIFPDVIFHNQLTV 214 >gi|118398810|ref|XP_001031732.1| hypothetical protein TTHERM_00756030 [Tetrahymena thermophila] gi|89286065|gb|EAR84069.1| hypothetical protein TTHERM_00756030 [Tetrahymena thermophila SB210] Length = 3732 Score = 34.6 bits (78), Expect = 8.8, Method: Composition-based stats. Identities = 26/110 (23%), Positives = 54/110 (49%), Gaps = 12/110 (10%) Query: 29 LKVNFGLMRRVMIDLWSREISSYRTPLSLDLDYKHRVYLDTYK-SFSINL-------GFE 80 +K FGL + ++ +++ +S +LD ++ ++ Y S+ IN+ + Sbjct: 1937 IKKGFGL--QSVLKVYNISLSEIAEKRDHNLDIRNVIFKRKYTYSYDINIIAVIDKVSLQ 1994 Query: 81 TFNEI--VNPTTMRVLVLPVLSMHKTWNNNFDDSYFFKKIGVGVVASTGF 128 T I +NPT + +++ L M WN+ + D F++KI + ++S F Sbjct: 1995 TIRTIKCLNPTILNIIISEKLDMIFIWNDYYKDKVFYQKIAIFNISSGLF 2044 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.326 0.141 0.428 Lambda K H 0.267 0.0450 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,093,339,261 Number of Sequences: 14124377 Number of extensions: 251840420 Number of successful extensions: 663213 Number of sequences better than 10.0: 106 Number of HSP's better than 10.0 without gapping: 2 Number of HSP's successfully gapped in prelim test: 114 Number of HSP's that attempted gapping in prelim test: 663196 Number of HSP's gapped (non-prelim): 121 length of query: 215 length of database: 4,842,793,630 effective HSP length: 133 effective length of query: 82 effective length of database: 2,964,251,489 effective search space: 243068622098 effective search space used: 243068622098 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 78 (34.5 bits)