BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781226|ref|YP_003065639.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] (129 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781226|ref|YP_003065639.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] gi|254040903|gb|ACT57699.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] Length = 129 Score = 248 bits (632), Expect = 3e-64, Method: Compositional matrix adjust. Identities = 129/129 (100%), Positives = 129/129 (100%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY Sbjct: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYKRG Sbjct: 121 IEELIYKRG 129 >gi|317120691|gb|ADV02514.1| hypothetical protein SC1_gp170 [Liberibacter phage SC1] gi|317120835|gb|ADV02656.1| hypothetical protein SC1_gp170 [Candidatus Liberibacter asiaticus] Length = 129 Score = 236 bits (601), Expect = 9e-61, Method: Compositional matrix adjust. Identities = 123/129 (95%), Positives = 126/129 (97%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 M RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI+E+ HQYYLKNKDKY Sbjct: 1 MERKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIKENNHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYK+ Sbjct: 121 IEELIYKKA 129 >gi|317120733|gb|ADV02555.1| hypothetical protein SC2_gp170 [Liberibacter phage SC2] gi|317120794|gb|ADV02615.1| hypothetical protein SC2_gp170 [Candidatus Liberibacter asiaticus] Length = 129 Score = 234 bits (597), Expect = 3e-60, Method: Compositional matrix adjust. Identities = 123/129 (95%), Positives = 125/129 (96%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 M RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI+E HQYYLKNKDKY Sbjct: 1 MERKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIQEYSHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYK+ Sbjct: 121 IEELIYKKA 129 >gi|315121957|ref|YP_004062446.1| hypothetical protein CKC_01035 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122925|ref|YP_004063414.1| hypothetical protein CKC_05905 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495359|gb|ADR51958.1| hypothetical protein CKC_01035 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496327|gb|ADR52926.1| hypothetical protein CKC_05905 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 31 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 25/31 (80%), Positives = 27/31 (87%) Query: 99 MGMTKAEIEALEREIARLKAKPIEELIYKRG 129 M MTK EIEALE +IARLKAKP+EELIYKR Sbjct: 1 MEMTKGEIEALEAKIARLKAKPLEELIYKRA 31 >gi|157953109|ref|YP_001498001.1| hypothetical protein NY2A_B805R [Paramecium bursaria Chlorella virus NY2A] gi|155123336|gb|ABT15204.1| hypothetical protein NY2A_B805R [Paramecium bursaria Chlorella virus NY2A] Length = 340 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 31/73 (42%), Positives = 47/73 (64%), Gaps = 11/73 (15%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR-----------EYK 64 +E KR+Y +N++ ILE+ R+ Y +NK++I+E QYY NKDK R EYK Sbjct: 51 QEKKRQYREENKEHILEKARQYYEENKEEIQEKNRQYYEDNKDKIRQYREDNKDKILEYK 110 Query: 65 RRYYLKNRDKMRE 77 R+YY NR+K+++ Sbjct: 111 RQYYEDNREKIQQ 123 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 27/68 (39%), Positives = 49/68 (72%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y+R+Y +N++ ILE+ R+ Y +NK++I+E QY +NK+ E R+YY +N++++ Sbjct: 21 KDYQRQYREENKEHILEKARQYYEENKEEIQEKKRQYREENKEHILEKARQYYEENKEEI 80 Query: 76 REKARQSY 83 +EK RQ Y Sbjct: 81 QEKNRQYY 88 Score = 41.6 bits (96), Expect = 0.040, Method: Compositional matrix adjust. Identities = 24/58 (41%), Positives = 42/58 (72%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMRE 77 R+YY +N+++I E++R+ +NK+ I E QYY +NK++ +E R+YY N+DK+R+ Sbjct: 40 RQYYEENKEEIQEKKRQYREENKEHILEKARQYYEENKEEIQEKNRQYYEDNKDKIRQ 97 >gi|167044410|gb|ABZ09087.1| hypothetical protein ALOHA_HF4000APKG6D3ctg6g4 [uncultured marine crenarchaeote HF4000_APKG6D3] Length = 308 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 33/74 (44%), Positives = 49/74 (66%) Query: 5 VLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 VL+ EER RRE R++ +N +KI E R+ KN +KI E+ +Y +N +K RE + Sbjct: 2 VLSDEERKRKRRELTRKWKKENPEKIREGGRKYRAKNPEKISEAKKKYNRENPEKVREKR 61 Query: 65 RRYYLKNRDKMREK 78 R+YY +NR+K+REK Sbjct: 62 RKYYEENREKIREK 75 Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust. Identities = 30/74 (40%), Positives = 48/74 (64%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 +E + ++Y +NR+KI ERRR+ Y +N +KI E + Y KN +K E +R+Y +N +K+ Sbjct: 88 KEARSKHYEENREKINERRRKDYEENHEKINERRRKDYKKNPEKINERRRKYRKENPEKV 147 Query: 76 REKARQSYRKLYSK 89 RE R+ Y K + K Sbjct: 148 RESKRKDYEKNHEK 161 Score = 40.4 bits (93), Expect = 0.091, Method: Compositional matrix adjust. Identities = 28/65 (43%), Positives = 44/65 (67%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E +R+ Y +N +KI ERRR+ Y KN +KI E +Y +N +K RE KR+ Y KN +K++ Sbjct: 104 ERRRKDYEENHEKINERRRKDYKKNPEKINERRRKYRKENPEKVRESKRKDYEKNHEKIK 163 Query: 77 EKARQ 81 E+ ++ Sbjct: 164 ERVKK 168 Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust. Identities = 26/73 (35%), Positives = 50/73 (68%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E K++Y +N +K+ E+RR+ Y +N++KIRE +Y +N +K +E + ++Y +NR+K+ Sbjct: 44 EAKKKYNRENPEKVREKRRKYYEENREKIREKQIKYLEENPEKIKEARSKHYEENREKIN 103 Query: 77 EKARQSYRKLYSK 89 E+ R+ Y + + K Sbjct: 104 ERRRKDYEENHEK 116 Score = 39.7 bits (91), Expect = 0.13, Method: Compositional matrix adjust. Identities = 26/70 (37%), Positives = 50/70 (71%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 RE +R+YY +NR+KI E++ + +N +KI+E+ ++Y +N++K E +R+ Y +N +K+ Sbjct: 58 REKRRKYYEENREKIREKQIKYLEENPEKIKEARSKHYEENREKINERRRKDYEENHEKI 117 Query: 76 REKARQSYRK 85 E+ R+ Y+K Sbjct: 118 NERRRKDYKK 127 >gi|315121958|ref|YP_004062447.1| hypothetical protein CKC_01040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122926|ref|YP_004063415.1| hypothetical protein CKC_05910 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495360|gb|ADR51959.1| hypothetical protein CKC_01040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496328|gb|ADR52927.1| hypothetical protein CKC_05910 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 209 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 33/73 (45%), Positives = 47/73 (64%), Gaps = 5/73 (6%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R EY+R+YY N++ +E +R+ KNKDKI E Y Q+Y KNKDK E R+YY + ++ Sbjct: 58 RNEYQRQYYQNNKEARIEYQRQYLQKNKDKINEYYRQHYQKNKDKLTENSRQYYQRKKE- 116 Query: 75 MREKARQSYRKLY 87 AR Y++LY Sbjct: 117 ----ARIEYQRLY 125 Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust. Identities = 35/84 (41%), Positives = 49/84 (58%), Gaps = 15/84 (17%) Query: 17 EYKRRYYLKNRDKILERRR--------------RRYLKNKDKIRESYHQYYL-KNKDKYR 61 EY R+Y KNRDK+ E +R R+Y +N + R Y + YL KNKDK Sbjct: 30 EYYRQYQQKNRDKLTENKRLYRQRKKEERNEYQRQYYQNNKEARIEYQRQYLQKNKDKIN 89 Query: 62 EYKRRYYLKNRDKMREKARQSYRK 85 EY R++Y KN+DK+ E +RQ Y++ Sbjct: 90 EYYRQHYQKNKDKLTENSRQYYQR 113 Score = 38.1 bits (87), Expect = 0.37, Method: Compositional matrix adjust. Identities = 27/54 (50%), Positives = 36/54 (66%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY 67 R EY+R+Y KN+DKI E R+ Y KNKDK+ E+ QYY + K+ EY+R Y Sbjct: 72 ARIEYQRQYLQKNKDKINEYYRQHYQKNKDKLTENSRQYYQRKKEARIEYQRLY 125 >gi|261367712|ref|ZP_05980595.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] gi|282570507|gb|EFB76042.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] Length = 100 Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust. Identities = 21/52 (40%), Positives = 30/52 (57%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK 70 +R +Y NRDK+ ++R Y N+DK+ Y N+DK EY R+YY K Sbjct: 43 QRAWYEANRDKVAAQQRAWYEANRDKVAAQQRAYREANRDKVAEYNRKYYAK 94 >gi|157953770|ref|YP_001498661.1| hypothetical protein AR158_C580L [Paramecium bursaria Chlorella virus AR158] gi|156068418|gb|ABU44125.1| hypothetical protein AR158_C580L [Paramecium bursaria Chlorella virus AR158] Length = 310 Score = 39.7 bits (91), Expect = 0.15, Method: Compositional matrix adjust. Identities = 28/71 (39%), Positives = 43/71 (60%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK 78 R+Y N +KI ER R+R +N +K RE + KN +KYREY+R+Y + N +K +E Sbjct: 28 NRKYCTNNAEKIKERGRKRRAENPEKFRERDRKRREKNPEKYREYERKYRIANPEKFKEY 87 Query: 79 ARQSYRKLYSK 89 R+ + +Y K Sbjct: 88 ERKRHATIYRK 98 >gi|255947256|ref|XP_002564395.1| Pc22g03540 [Penicillium chrysogenum Wisconsin 54-1255] gi|211591412|emb|CAP97642.1| Pc22g03540 [Penicillium chrysogenum Wisconsin 54-1255] Length = 669 Score = 37.7 bits (86), Expect = 0.49, Method: Compositional matrix adjust. Identities = 29/72 (40%), Positives = 39/72 (54%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMRE 77 Y RRYY KN KI + R+ KN DKIR +Y KN D+ + R + KN DK++ Sbjct: 402 YSRRYYRKNAGKIKAKNRKYQRKNADKIRAKRREYERKNADRIKARLREWRRKNADKIKA 461 Query: 78 KARQSYRKLYSK 89 + R+ RK K Sbjct: 462 RKREWRRKNADK 473 Score = 37.0 bits (84), Expect = 0.86, Method: Compositional matrix adjust. Identities = 26/65 (40%), Positives = 37/65 (56%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 R+Y KN DKI +RR KN D+I+ ++ KN DK + KR + KN DK++ Sbjct: 419 RKYQRKNADKIRAKRREYERKNADRIKARLREWRRKNADKIKARKREWRRKNADKIKANN 478 Query: 80 RQSYR 84 R+ YR Sbjct: 479 REYYR 483 Score = 35.0 bits (79), Expect = 3.0, Method: Compositional matrix adjust. Identities = 24/70 (34%), Positives = 36/70 (51%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 R Y +N D+I R R KN D+I S ++ KN D+ KR ++ KN D++ + Sbjct: 524 REYQRENADRIKARNREYQRKNADRISASIQEWRRKNADRVSASKREWHRKNADRVNARQ 583 Query: 80 RQSYRKLYSK 89 R+ RK K Sbjct: 584 RERRRKNAGK 593 Score = 34.7 bits (78), Expect = 4.5, Method: Compositional matrix adjust. Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 11/78 (14%) Query: 15 RREYKRR-----------YYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREY 63 RREY+R+ + KN DKI R+R KN DKI+ + +YY +N D+ + Sbjct: 433 RREYERKNADRIKARLREWRRKNADKIKARKREWRRKNADKIKANNREYYRENADRIKAN 492 Query: 64 KRRYYLKNRDKMREKARQ 81 R Y +N D+++ + R+ Sbjct: 493 NREYQRENADRIKARNRE 510 >gi|2947224|gb|AAC39306.1| ORF4 [Lactococcus phage phi31] Length = 246 Score = 37.7 bits (86), Expect = 0.59, Method: Compositional matrix adjust. Identities = 27/85 (31%), Positives = 46/85 (54%) Query: 13 LCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 +CR+ + Y ++KILE+++R Y KN DKI+E YY +NK K R+ ++ + N Sbjct: 65 VCRKAKDKARYEAKKEKILEQKKRYYEKNADKIKERQLGYYNENKGKCRQSEKDWCKNNP 124 Query: 73 DKMREKARQSYRKLYSKDSWIAPEE 97 + R +S Y +S + +E Sbjct: 125 TRRRMTCAKSRTLKYGSESTLTEKE 149 >gi|222624901|gb|EEE59033.1| hypothetical protein OsJ_10782 [Oryza sativa Japonica Group] Length = 850 Score = 37.4 bits (85), Expect = 0.75, Method: Composition-based stats. Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 10/95 (10%) Query: 9 EERMLCRREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYRE 62 E RML Y R L R++I L+R R +LK D+ + +H+ KNK + Sbjct: 40 EIRML----YARMDELLYREEIMWLQRSRVAWLKEGDRNTKFFHRQAAWRSKKNKITRLK 95 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 + +++N+++M AR+ ++ LY KD+ + P E Sbjct: 96 AEDSRFVENKEEMEHMAREFFQTLYLKDNSVDPRE 130 >gi|218188462|gb|EEC70889.1| hypothetical protein OsI_02427 [Oryza sativa Indica Group] Length = 1477 Score = 37.4 bits (85), Expect = 0.76, Method: Composition-based stats. Identities = 26/100 (26%), Positives = 51/100 (51%), Gaps = 6/100 (6%) Query: 15 RREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYREYKRRYY 68 R+ Y R L R+++ L+R R +LK D+ + +H+ KNK + + Y Sbjct: 564 RKLYARLDELLYREEMMWLQRSRVSWLKEGDRNTKFFHRQAAWRAKKNKINRLKDEDSRY 623 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEA 108 ++N+ M + R ++KLY++D + P E + + ++A Sbjct: 624 VENKGDMEKLTRDFFQKLYARDEGVDPGELVDLFDVRVDA 663 >gi|108708055|gb|ABF95850.1| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 940 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 10/95 (10%) Query: 9 EERMLCRREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYRE 62 E RML Y R L R++I L+R R +LK D+ + +H+ KNK + Sbjct: 40 EIRML----YARMDELLYREEIMWLQRSRVAWLKEGDRNTKFFHRQAAWRSKKNKITRLK 95 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 + +++N+++M AR+ ++ LY KD+ + P E Sbjct: 96 AEDSRFVENKEEMEHMAREFFQTLYLKDNSVDPRE 130 >gi|315185937|gb|EFU19701.1| permease YjgP/YjgQ family protein [Spirochaeta thermophila DSM 6578] Length = 424 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 4/61 (6%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN----RDKMREKA 79 +K R+++LE++RR + K +D +R S YL E R +L N ++REK Sbjct: 262 MKEREQVLEQKRRSHEKERDDLRRSVSSLYLTTLSLSNESIRSRFLGNLRSSLQRLREKE 321 Query: 80 R 80 R Sbjct: 322 R 322 >gi|218192800|gb|EEC75227.1| hypothetical protein OsI_11496 [Oryza sativa Indica Group] Length = 1059 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 10/95 (10%) Query: 9 EERMLCRREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYRE 62 E RML Y R L R+++ L+R R +LK D+ + +H+ KNK + Sbjct: 75 EIRML----YARMDELLYREEMMWLQRSRVAWLKEGDRNTKFFHRQAAWRSKKNKITRLK 130 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 + +++N+++M AR+ ++ LY KD+ + P E Sbjct: 131 AEDSRFVENKEEMEHMAREFFQTLYLKDNSVDPRE 165 >gi|307718798|ref|YP_003874330.1| hypothetical protein STHERM_c11120 [Spirochaeta thermophila DSM 6192] gi|306532523|gb|ADN02057.1| hypothetical protein STHERM_c11120 [Spirochaeta thermophila DSM 6192] Length = 424 Score = 35.4 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 18/61 (29%), Positives = 32/61 (52%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 +K R+++LE++RR + K +D +R S + YL E R +L + ++ RQ Sbjct: 262 MKEREQVLEQKRRSHEKERDDLRRSVSRLYLTTLSLSNESIRSRFLGSLRSSLQRLRQKE 321 Query: 84 R 84 R Sbjct: 322 R 322 >gi|295399960|ref|ZP_06809940.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|312110792|ref|YP_003989108.1| hypothetical protein GY4MC1_1723 [Geobacillus sp. Y4.1MC1] gi|294977739|gb|EFG53337.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|311215893|gb|ADP74497.1| hypothetical protein GY4MC1_1723 [Geobacillus sp. Y4.1MC1] Length = 574 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 47/90 (52%), Gaps = 3/90 (3%) Query: 36 RRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ-SYRKLYSKDSWIA 94 +R ++ + ++Y YY K + +EYK+ YYL RD+ RQ ++R L + D + Sbjct: 437 KRMIRQYNVPSKAYFSYY-KPILESKEYKKNYYLLLRDQGYSPIRQVTFRHLETGDKTLT 495 Query: 95 PEEPMGMTKAEIEALEREIARLKAKPIEEL 124 EP + + +EAL+ ++ A + EL Sbjct: 496 IIEPQQI-DSFVEALKADLMEESASTMLEL 524 >gi|73541607|ref|YP_296127.1| hypothetical protein Reut_A1918 [Ralstonia eutropha JMP134] gi|72119020|gb|AAZ61283.1| Uncharacterized protein UPF0065 [Ralstonia eutropha JMP134] Length = 333 Score = 35.0 bits (79), Expect = 3.8, Method: Compositional matrix adjust. Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 4/52 (7%) Query: 65 RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARL 116 R L N RE + + D+W+ P G+ KA ++AL RE+AR+ Sbjct: 238 RSALLPNVPTFREAGYKGFEP----DAWMGLMFPAGVPKARVDALSREVARI 285 >gi|307187162|gb|EFN72405.1| Death-inducer obliterator 1 [Camponotus floridanus] Length = 2322 Score = 34.7 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 8/70 (11%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 K RDK ER+ R +DK RE + KDK R+ + +R + RE+ RQ + Sbjct: 1541 KTRDKSKERKSRNKESRRDKDRE-------REKDKDRDRNKDRDKTSRKESRERGRQKEK 1593 Query: 85 KLY-SKDSWI 93 + S DSW+ Sbjct: 1594 DRHKSSDSWM 1603 >gi|301309131|ref|ZP_07215075.1| putative nuclease sbcCD, subunit C [Bacteroides sp. 20_3] gi|300832813|gb|EFK63439.1| putative nuclease sbcCD, subunit C [Bacteroides sp. 20_3] Length = 993 Score = 34.7 bits (78), Expect = 4.7, Method: Compositional matrix adjust. Identities = 22/71 (30%), Positives = 39/71 (54%), Gaps = 7/71 (9%) Query: 54 LKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWI--APEEPMGMTKAEIEALER 111 L ++Y E RR + KN + A+++Y +LY++ I EE + ++ + ALE Sbjct: 193 LTGTEQYSEISRRIFAKNTE-----AKEAYEQLYARVQGIELLSEEEVEASQTRLSALEV 247 Query: 112 EIARLKAKPIE 122 E+ARL+ +E Sbjct: 248 ELARLEKAKVE 258 >gi|298375067|ref|ZP_06985024.1| nuclease sbcCD subunit C [Bacteroides sp. 3_1_19] gi|298267567|gb|EFI09223.1| nuclease sbcCD subunit C [Bacteroides sp. 3_1_19] Length = 993 Score = 34.7 bits (78), Expect = 4.8, Method: Compositional matrix adjust. Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 7/66 (10%) Query: 54 LKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWI--APEEPMGMTKAEIEALER 111 L ++Y E RR ++KN + A+++Y +LY++ I EE + ++ + ALE Sbjct: 193 LTGTEQYSEISRRIFVKNAE-----AKEAYEQLYARVQGIELLSEEEVEASQTRLSALEG 247 Query: 112 EIARLK 117 E+ARL+ Sbjct: 248 ELARLE 253 >gi|150008355|ref|YP_001303098.1| ATP-dependent exonuclease sbcC [Parabacteroides distasonis ATCC 8503] gi|256839357|ref|ZP_05544866.1| ATP-dependent exonuclease sbcC [Parabacteroides sp. D13] gi|149936779|gb|ABR43476.1| ATP-dependent exonuclease sbcC [Parabacteroides distasonis ATCC 8503] gi|256738287|gb|EEU51612.1| ATP-dependent exonuclease sbcC [Parabacteroides sp. D13] Length = 993 Score = 34.3 bits (77), Expect = 5.1, Method: Compositional matrix adjust. Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 7/66 (10%) Query: 54 LKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYS--KDSWIAPEEPMGMTKAEIEALER 111 L ++Y E RR + KN + A+++Y +LY+ +D + EE + ++ + ALE Sbjct: 193 LTGTEQYSEISRRIFAKNAE-----AKEAYEQLYARVQDIELLSEEEVEASQTRLSALEV 247 Query: 112 EIARLK 117 E+ARL+ Sbjct: 248 ELARLE 253 >gi|255015511|ref|ZP_05287637.1| ATP-dependent exonuclease sbcC [Bacteroides sp. 2_1_7] Length = 993 Score = 34.3 bits (77), Expect = 5.2, Method: Compositional matrix adjust. Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 7/66 (10%) Query: 54 LKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYS--KDSWIAPEEPMGMTKAEIEALER 111 L ++Y E RR + KN + A+++Y +LY+ +D + EE + ++ + ALE Sbjct: 193 LTGTEQYSEISRRIFAKNAE-----AKEAYEQLYARVQDIELLSEEEVEASQTRLSALEV 247 Query: 112 EIARLK 117 E+ARL+ Sbjct: 248 ELARLE 253 >gi|229497039|ref|ZP_04390744.1| peptidase, M23/M37 family [Porphyromonas endodontalis ATCC 35406] gi|229316141|gb|EEN82069.1| peptidase, M23/M37 family [Porphyromonas endodontalis ATCC 35406] Length = 357 Score = 33.9 bits (76), Expect = 6.8, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 3/80 (3%) Query: 41 NKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMG 100 N D SY + + NK+K EYK+ +R++ +EKA Q++R + DS + + G Sbjct: 274 NTDNSGPSYGELFANNKEKVTEYKQS--ANSRERAKEKAPQTHR-IKEGDSLSSIAKRHG 330 Query: 101 MTKAEIEALEREIARLKAKP 120 T A++ L A+ +P Sbjct: 331 TTVAKLCKLNNMTAKATLRP 350 >gi|221103613|ref|XP_002170145.1| PREDICTED: similar to ATP-dependent DNA helicase PIF1 [Hydra magnipapillata] Length = 734 Score = 33.9 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 25/34 (73%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKD 43 +R C R+ ++++Y KN++KIL +R R Y+K ++ Sbjct: 613 KRRECERKREKKHYTKNKNKILYKRAREYIKGEN 646 >gi|309360547|emb|CAP31028.2| hypothetical protein CBG_11979 [Caenorhabditis briggsae AF16] Length = 898 Score = 33.9 bits (76), Expect = 7.9, Method: Compositional matrix adjust. Identities = 22/80 (27%), Positives = 39/80 (48%) Query: 33 RRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSW 92 R RRYLK K RE Y +Y ++ K L N+D+ + Q+ +L++K Sbjct: 185 RVYRRYLKINPKAREDYVEYLIERDQIDEAAKELTTLVNQDQNVSEKGQTSHQLWTKLCT 244 Query: 93 IAPEEPMGMTKAEIEALERE 112 + E P+ + ++A+ R+ Sbjct: 245 LISENPVKIFSLNVDAIIRQ 264 >gi|268560714|ref|XP_002646274.1| Hypothetical protein CBG11979 [Caenorhabditis briggsae] Length = 865 Score = 33.5 bits (75), Expect = 9.2, Method: Compositional matrix adjust. Identities = 21/77 (27%), Positives = 38/77 (49%) Query: 36 RRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAP 95 RRYLK K RE Y +Y ++ K L N+D+ + Q+ +L++K + Sbjct: 186 RRYLKINPKAREDYVEYLIERDQIDEAAKELTTLVNQDQNVSEKGQTSHQLWTKLCTLIS 245 Query: 96 EEPMGMTKAEIEALERE 112 E P+ + ++A+ R+ Sbjct: 246 ENPVKIFSLNVDAIIRQ 262 >gi|160885908|ref|ZP_02066911.1| hypothetical protein BACOVA_03913 [Bacteroides ovatus ATCC 8483] gi|156108721|gb|EDO10466.1| hypothetical protein BACOVA_03913 [Bacteroides ovatus ATCC 8483] Length = 197 Score = 33.5 bits (75), Expect = 9.6, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 19/75 (25%) Query: 22 YYLKNRDKILERRR-------------------RRYLKNKDKIRESYHQYYLKNKDKYRE 62 Y ++NRDK+L RR+ RRY NK+KI E +Y L +K + RE Sbjct: 123 YRIRNRDKLLARRKELRNANIEHYRELERASYKRRYKVNKEKILEKNRKYQLAHKSEIRE 182 Query: 63 YKRRYYLKNRDKMRE 77 Y + YY KN+ K ++ Sbjct: 183 YMKVYYQKNKSKWKQ 197 >gi|167768327|ref|ZP_02440380.1| hypothetical protein CLOSS21_02883 [Clostridium sp. SS2/1] gi|167709851|gb|EDS20430.1| hypothetical protein CLOSS21_02883 [Clostridium sp. SS2/1] Length = 409 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 36/69 (52%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 +K D I R+ + + ++ ++Y +YY +N D Y + LK +++M + + + Sbjct: 58 IKKTDDINHSRQAKNRNERKQLSKAYQEYYKENIDLYDKLDMTKILKQKEEMLQYHPKGF 117 Query: 84 RKLYSKDSW 92 K + KD++ Sbjct: 118 MKKFVKDNY 126 Searching..................................................done Results from round 2 >gi|254781226|ref|YP_003065639.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] gi|254040903|gb|ACT57699.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] Length = 129 Score = 143 bits (361), Expect = 7e-33, Method: Composition-based stats. Identities = 129/129 (100%), Positives = 129/129 (100%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY Sbjct: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYKRG Sbjct: 121 IEELIYKRG 129 >gi|317120691|gb|ADV02514.1| hypothetical protein SC1_gp170 [Liberibacter phage SC1] gi|317120835|gb|ADV02656.1| hypothetical protein SC1_gp170 [Candidatus Liberibacter asiaticus] Length = 129 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 123/129 (95%), Positives = 126/129 (97%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 M RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI+E+ HQYYLKNKDKY Sbjct: 1 MERKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIKENNHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYK+ Sbjct: 121 IEELIYKKA 129 >gi|317120733|gb|ADV02555.1| hypothetical protein SC2_gp170 [Liberibacter phage SC2] gi|317120794|gb|ADV02615.1| hypothetical protein SC2_gp170 [Candidatus Liberibacter asiaticus] Length = 129 Score = 140 bits (353), Expect = 6e-32, Method: Composition-based stats. Identities = 123/129 (95%), Positives = 125/129 (96%) Query: 1 MGRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY 60 M RKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKI+E HQYYLKNKDKY Sbjct: 1 MERKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIQEYSHQYYLKNKDKY 60 Query: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP Sbjct: 61 REYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEALEREIARLKAKP 120 Query: 121 IEELIYKRG 129 IEELIYK+ Sbjct: 121 IEELIYKKA 129 >gi|157953109|ref|YP_001498001.1| hypothetical protein NY2A_B805R [Paramecium bursaria Chlorella virus NY2A] gi|155123336|gb|ABT15204.1| hypothetical protein NY2A_B805R [Paramecium bursaria Chlorella virus NY2A] Length = 340 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 27/68 (39%), Positives = 49/68 (72%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 ++Y+R+Y +N++ ILE+ R+ Y +NK++I+E QY +NK+ E R+YY +N++++ Sbjct: 21 KDYQRQYREENKEHILEKARQYYEENKEEIQEKKRQYREENKEHILEKARQYYEENKEEI 80 Query: 76 REKARQSY 83 +EK RQ Y Sbjct: 81 QEKNRQYY 88 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 31/73 (42%), Positives = 47/73 (64%), Gaps = 11/73 (15%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR-----------EYK 64 +E KR+Y +N++ ILE+ R+ Y +NK++I+E QYY NKDK R EYK Sbjct: 51 QEKKRQYREENKEHILEKARQYYEENKEEIQEKNRQYYEDNKDKIRQYREDNKDKILEYK 110 Query: 65 RRYYLKNRDKMRE 77 R+YY NR+K+++ Sbjct: 111 RQYYEDNREKIQQ 123 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 25/61 (40%), Positives = 43/61 (70%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E R+YY +N+++I E++R+ +NK+ I E QYY +NK++ +E R+YY N+DK+R Sbjct: 37 EKARQYYEENKEEIQEKKRQYREENKEHILEKARQYYEENKEEIQEKNRQYYEDNKDKIR 96 Query: 77 E 77 + Sbjct: 97 Q 97 Score = 39.2 bits (90), Expect = 0.21, Method: Composition-based stats. Identities = 28/75 (37%), Positives = 44/75 (58%), Gaps = 1/75 (1%) Query: 12 MLCRREYKRRYYLKNRDKILERRRRRYLK-NKDKIRESYHQYYLKNKDKYREYKRRYYLK 70 MLC R K L + + +R+Y + NK+ I E QYY +NK++ +E KR+Y + Sbjct: 1 MLCIRCNKPHNELTKTCRPCKDYQRQYREENKEHILEKARQYYEENKEEIQEKKRQYREE 60 Query: 71 NRDKMREKARQSYRK 85 N++ + EKARQ Y + Sbjct: 61 NKEHILEKARQYYEE 75 Score = 34.6 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 11/59 (18%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIR-----------ESYHQYYLKNKDKYREYK 64 E R+YY +N+++I E+ R+ Y NKDKIR E QYY N++K ++Y+ Sbjct: 67 EKARQYYEENKEEIQEKNRQYYEDNKDKIRQYREDNKDKILEYKRQYYEDNREKIQQYR 125 >gi|157953770|ref|YP_001498661.1| hypothetical protein AR158_C580L [Paramecium bursaria Chlorella virus AR158] gi|156068418|gb|ABU44125.1| hypothetical protein AR158_C580L [Paramecium bursaria Chlorella virus AR158] Length = 310 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 28/70 (40%), Positives = 43/70 (61%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 R+Y N +KI ER R+R +N +K RE + KN +KYREY+R+Y + N +K +E Sbjct: 29 RKYCTNNAEKIKERGRKRRAENPEKFRERDRKRREKNPEKYREYERKYRIANPEKFKEYE 88 Query: 80 RQSYRKLYSK 89 R+ + +Y K Sbjct: 89 RKRHATIYRK 98 >gi|2947224|gb|AAC39306.1| ORF4 [Lactococcus phage phi31] Length = 246 Score = 41.5 bits (96), Expect = 0.037, Method: Composition-based stats. Identities = 27/89 (30%), Positives = 48/89 (53%) Query: 13 LCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNR 72 +CR+ + Y ++KILE+++R Y KN DKI+E YY +NK K R+ ++ + N Sbjct: 65 VCRKAKDKARYEAKKEKILEQKKRYYEKNADKIKERQLGYYNENKGKCRQSEKDWCKNNP 124 Query: 73 DKMREKARQSYRKLYSKDSWIAPEEPMGM 101 + R +S Y +S + +E + + Sbjct: 125 TRRRMTCAKSRTLKYGSESTLTEKEWLEI 153 >gi|255947256|ref|XP_002564395.1| Pc22g03540 [Penicillium chrysogenum Wisconsin 54-1255] gi|211591412|emb|CAP97642.1| Pc22g03540 [Penicillium chrysogenum Wisconsin 54-1255] Length = 669 Score = 41.1 bits (95), Expect = 0.050, Method: Composition-based stats. Identities = 28/68 (41%), Positives = 38/68 (55%) Query: 18 YKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMRE 77 Y RRYY KN KI + R+ KN DKIR +Y KN D+ + R + KN DK++ Sbjct: 402 YSRRYYRKNAGKIKAKNRKYQRKNADKIRAKRREYERKNADRIKARLREWRRKNADKIKA 461 Query: 78 KARQSYRK 85 + R+ RK Sbjct: 462 RKREWRRK 469 Score = 40.0 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 26/66 (39%), Positives = 38/66 (57%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 R+Y KN DKI +RR KN D+I+ ++ KN DK + KR + KN DK++ Sbjct: 419 RKYQRKNADKIRAKRREYERKNADRIKARLREWRRKNADKIKARKREWRRKNADKIKANN 478 Query: 80 RQSYRK 85 R+ YR+ Sbjct: 479 REYYRE 484 Score = 40.0 bits (92), Expect = 0.12, Method: Composition-based stats. Identities = 23/66 (34%), Positives = 38/66 (57%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 R + KN DKI R+R KN DKI+ + +YY +N D+ + R Y +N D+++ + Sbjct: 449 REWRRKNADKIKARKREWRRKNADKIKANNREYYRENADRIKANNREYQRENADRIKARN 508 Query: 80 RQSYRK 85 R+ R+ Sbjct: 509 REYQRE 514 Score = 37.3 bits (85), Expect = 0.67, Method: Composition-based stats. Identities = 21/67 (31%), Positives = 37/67 (55%) Query: 19 KRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREK 78 KR + KN DKI R Y +N D+I+ + +Y +N D+ + R Y +N D+++ + Sbjct: 463 KREWRRKNADKIKANNREYYRENADRIKANNREYQRENADRIKARNREYQRENADRIKAR 522 Query: 79 ARQSYRK 85 R+ R+ Sbjct: 523 NREYQRE 529 Score = 36.9 bits (84), Expect = 0.96, Method: Composition-based stats. Identities = 24/62 (38%), Positives = 34/62 (54%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 L+N D+ RR Y KN KI+ +Y KN DK R +R Y KN D+++ + R+ Sbjct: 393 LQNTDREDSYSRRYYRKNAGKIKAKNRKYQRKNADKIRAKRREYERKNADRIKARLREWR 452 Query: 84 RK 85 RK Sbjct: 453 RK 454 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 35/66 (53%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 R YY +N D+I R +N D+I+ +Y +N D+ + R Y +N D+++ + Sbjct: 479 REYYRENADRIKANNREYQRENADRIKARNREYQRENADRIKARNREYQRENADRIKARN 538 Query: 80 RQSYRK 85 R+ RK Sbjct: 539 REYQRK 544 Score = 34.6 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 23/66 (34%), Positives = 35/66 (53%) Query: 20 RRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKA 79 R Y +N D+I R R KN D+I S ++ KN D+ KR ++ KN D++ + Sbjct: 524 REYQRENADRIKARNREYQRKNADRISASIQEWRRKNADRVSASKREWHRKNADRVNARQ 583 Query: 80 RQSYRK 85 R+ RK Sbjct: 584 RERRRK 589 >gi|315121957|ref|YP_004062446.1| hypothetical protein CKC_01035 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122925|ref|YP_004063414.1| hypothetical protein CKC_05905 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495359|gb|ADR51958.1| hypothetical protein CKC_01035 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496327|gb|ADR52926.1| hypothetical protein CKC_05905 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 31 Score = 40.0 bits (92), Expect = 0.097, Method: Composition-based stats. Identities = 25/31 (80%), Positives = 27/31 (87%) Query: 99 MGMTKAEIEALEREIARLKAKPIEELIYKRG 129 M MTK EIEALE +IARLKAKP+EELIYKR Sbjct: 1 MEMTKGEIEALEAKIARLKAKPLEELIYKRA 31 >gi|9632057|ref|NP_048846.1| hypothetical protein PBCV1_A490L [Paramecium bursaria Chlorella virus 1] gi|1620161|gb|AAC96857.1| Lys-, Glu-rich [Paramecium bursaria Chlorella virus 1] Length = 310 Score = 40.0 bits (92), Expect = 0.097, Method: Composition-based stats. Identities = 24/65 (36%), Positives = 43/65 (66%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 RE+ ++ Y K+ +K ER+R+ +N DK RE + Y ++ +K+REY R+Y+ ++ +K Sbjct: 25 REWDKKRYAKDPEKFRERKRKEREENIDKFRERARERYAEDPEKFREYDRKYHAEDPEKK 84 Query: 76 REKAR 80 RE+ R Sbjct: 85 RERNR 89 >gi|157953051|ref|YP_001497943.1| hypothetical protein NY2A_B747L [Paramecium bursaria Chlorella virus NY2A] gi|155123278|gb|ABT15146.1| hypothetical protein NY2A_B747L [Paramecium bursaria Chlorella virus NY2A] Length = 309 Score = 38.4 bits (88), Expect = 0.33, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 43/73 (58%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 REY+ Y KN++K+ + ++ Y +NK+ +++ + Y +NKD + + Y +N++ + Sbjct: 9 REYQEEYRTKNKEKLQQNKKDYYQQNKELLKKKRNDRYHQNKDVILQQQNEYCQQNKEMI 68 Query: 76 REKARQSYRKLYS 88 ++ R+ LY+ Sbjct: 69 KKNQRKRNDILYT 81 >gi|218188462|gb|EEC70889.1| hypothetical protein OsI_02427 [Oryza sativa Indica Group] Length = 1477 Score = 37.7 bits (86), Expect = 0.51, Method: Composition-based stats. Identities = 26/100 (26%), Positives = 51/100 (51%), Gaps = 6/100 (6%) Query: 15 RREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYREYKRRYY 68 R+ Y R L R+++ L+R R +LK D+ + +H+ KNK + + Y Sbjct: 564 RKLYARLDELLYREEMMWLQRSRVSWLKEGDRNTKFFHRQAAWRAKKNKINRLKDEDSRY 623 Query: 69 LKNRDKMREKARQSYRKLYSKDSWIAPEEPMGMTKAEIEA 108 ++N+ M + R ++KLY++D + P E + + ++A Sbjct: 624 VENKGDMEKLTRDFFQKLYARDEGVDPGELVDLFDVRVDA 663 >gi|222624901|gb|EEE59033.1| hypothetical protein OsJ_10782 [Oryza sativa Japonica Group] Length = 850 Score = 37.7 bits (86), Expect = 0.55, Method: Composition-based stats. Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 10/95 (10%) Query: 9 EERMLCRREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYRE 62 E RML Y R L R++I L+R R +LK D+ + +H+ KNK + Sbjct: 40 EIRML----YARMDELLYREEIMWLQRSRVAWLKEGDRNTKFFHRQAAWRSKKNKITRLK 95 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 + +++N+++M AR+ ++ LY KD+ + P E Sbjct: 96 AEDSRFVENKEEMEHMAREFFQTLYLKDNSVDPRE 130 >gi|167044410|gb|ABZ09087.1| hypothetical protein ALOHA_HF4000APKG6D3ctg6g4 [uncultured marine crenarchaeote HF4000_APKG6D3] Length = 308 Score = 37.7 bits (86), Expect = 0.58, Method: Composition-based stats. Identities = 33/74 (44%), Positives = 49/74 (66%) Query: 5 VLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYK 64 VL+ EER RRE R++ +N +KI E R+ KN +KI E+ +Y +N +K RE + Sbjct: 2 VLSDEERKRKRRELTRKWKKENPEKIREGGRKYRAKNPEKISEAKKKYNRENPEKVREKR 61 Query: 65 RRYYLKNRDKMREK 78 R+YY +NR+K+REK Sbjct: 62 RKYYEENREKIREK 75 Score = 37.3 bits (85), Expect = 0.65, Method: Composition-based stats. Identities = 32/88 (36%), Positives = 53/88 (60%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 +++ EE +E + ++Y +NR+KI ERRR+ Y +N +KI E + Y KN +K Sbjct: 74 EKQIKYLEENPEKIKEARSKHYEENREKINERRRKDYEENHEKINERRRKDYKKNPEKIN 133 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSK 89 E +R+Y +N +K+RE R+ Y K + K Sbjct: 134 ERRRKYRKENPEKVRESKRKDYEKNHEK 161 Score = 36.1 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 26/70 (37%), Positives = 50/70 (71%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKM 75 RE +R+YY +NR+KI E++ + +N +KI+E+ ++Y +N++K E +R+ Y +N +K+ Sbjct: 58 REKRRKYYEENREKIREKQIKYLEENPEKIKEARSKHYEENREKINERRRKDYEENHEKI 117 Query: 76 REKARQSYRK 85 E+ R+ Y+K Sbjct: 118 NERRRKDYKK 127 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 26/73 (35%), Positives = 50/73 (68%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E K++Y +N +K+ E+RR+ Y +N++KIRE +Y +N +K +E + ++Y +NR+K+ Sbjct: 44 EAKKKYNRENPEKVREKRRKYYEENREKIREKQIKYLEENPEKIKEARSKHYEENREKIN 103 Query: 77 EKARQSYRKLYSK 89 E+ R+ Y + + K Sbjct: 104 ERRRKDYEENHEK 116 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 28/65 (43%), Positives = 44/65 (67%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 E +R+ Y +N +KI ERRR+ Y KN +KI E +Y +N +K RE KR+ Y KN +K++ Sbjct: 104 ERRRKDYEENHEKINERRRKDYKKNPEKINERRRKYRKENPEKVRESKRKDYEKNHEKIK 163 Query: 77 EKARQ 81 E+ ++ Sbjct: 164 ERVKK 168 >gi|295399960|ref|ZP_06809940.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|312110792|ref|YP_003989108.1| hypothetical protein GY4MC1_1723 [Geobacillus sp. Y4.1MC1] gi|294977739|gb|EFG53337.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|311215893|gb|ADP74497.1| hypothetical protein GY4MC1_1723 [Geobacillus sp. Y4.1MC1] Length = 574 Score = 37.3 bits (85), Expect = 0.74, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 47/90 (52%), Gaps = 3/90 (3%) Query: 36 RRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ-SYRKLYSKDSWIA 94 +R ++ + ++Y YY K + +EYK+ YYL RD+ RQ ++R L + D + Sbjct: 437 KRMIRQYNVPSKAYFSYY-KPILESKEYKKNYYLLLRDQGYSPIRQVTFRHLETGDKTLT 495 Query: 95 PEEPMGMTKAEIEALEREIARLKAKPIEEL 124 EP + + +EAL+ ++ A + EL Sbjct: 496 IIEPQQI-DSFVEALKADLMEESASTMLEL 524 >gi|108708055|gb|ABF95850.1| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 940 Score = 37.3 bits (85), Expect = 0.78, Method: Composition-based stats. Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 10/95 (10%) Query: 9 EERMLCRREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYRE 62 E RML Y R L R++I L+R R +LK D+ + +H+ KNK + Sbjct: 40 EIRML----YARMDELLYREEIMWLQRSRVAWLKEGDRNTKFFHRQAAWRSKKNKITRLK 95 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 + +++N+++M AR+ ++ LY KD+ + P E Sbjct: 96 AEDSRFVENKEEMEHMAREFFQTLYLKDNSVDPRE 130 >gi|170676322|ref|YP_001742083.1| putative endonuclease protein [Salmonella phage E1] gi|170321632|emb|CAM33151.1| phage endonuclease protein [Salmonella phage Vi II-E1] Length = 334 Score = 36.5 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 22/60 (36%), Positives = 36/60 (60%), Gaps = 1/60 (1%) Query: 14 CRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRY-YLKNR 72 C+ E+++ Y + +I E+ +R Y KNK+K E Q+ KNK R ++R+Y YLK + Sbjct: 208 CQAEFRKNIYKLKKAEINEKSKRYYDKNKEKHNEKSKQWRQKNKGVVRYHQRKYKYLKRQ 267 >gi|169628899|ref|YP_001702548.1| bacteriophage protein [Mycobacterium abscessus ATCC 19977] gi|169240866|emb|CAM61894.1| Bacteriophage protein [Mycobacterium abscessus] Length = 215 Score = 36.5 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 19/51 (37%), Positives = 30/51 (58%) Query: 23 YLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRD 73 Y ++R+K L R RR N ++ E+ +YY NK + R + +RYY N+D Sbjct: 70 YKRDREKFLARNRRWRADNYERELETNRKYYHDNKAQVRAWHKRYYEANKD 120 >gi|218192800|gb|EEC75227.1| hypothetical protein OsI_11496 [Oryza sativa Indica Group] Length = 1059 Score = 36.5 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 10/95 (10%) Query: 9 EERMLCRREYKRRYYLKNRDKI--LERRRRRYLKNKDKIRESYHQYYL----KNKDKYRE 62 E RML Y R L R+++ L+R R +LK D+ + +H+ KNK + Sbjct: 75 EIRML----YARMDELLYREEMMWLQRSRVAWLKEGDRNTKFFHRQAAWRSKKNKITRLK 130 Query: 63 YKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 + +++N+++M AR+ ++ LY KD+ + P E Sbjct: 131 AEDSRFVENKEEMEHMAREFFQTLYLKDNSVDPRE 165 >gi|315121958|ref|YP_004062447.1| hypothetical protein CKC_01040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122926|ref|YP_004063415.1| hypothetical protein CKC_05910 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495360|gb|ADR51959.1| hypothetical protein CKC_01040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496328|gb|ADR52927.1| hypothetical protein CKC_05910 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 209 Score = 36.5 bits (83), Expect = 1.4, Method: Composition-based stats. Identities = 31/69 (44%), Positives = 43/69 (62%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR 76 EY R+Y KNRDK+ E +R + K++ E QYY NK+ EY+R+Y KN+DK+ Sbjct: 30 EYYRQYQQKNRDKLTENKRLYRQRKKEERNEYQRQYYQNNKEARIEYQRQYLQKNKDKIN 89 Query: 77 EKARQSYRK 85 E RQ Y+K Sbjct: 90 EYYRQHYQK 98 Score = 36.1 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 32/79 (40%), Positives = 46/79 (58%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 K L + + R EY+R+YY N++ +E +R+ KNKDKI E Y Q+Y KNKDK Sbjct: 45 ENKRLYRQRKKEERNEYQRQYYQNNKEARIEYQRQYLQKNKDKINEYYRQHYQKNKDKLT 104 Query: 62 EYKRRYYLKNRDKMREKAR 80 E R+YY + ++ E R Sbjct: 105 ENSRQYYQRKKEARIEYQR 123 Score = 35.7 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 29/67 (43%), Positives = 43/67 (64%) Query: 15 RREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDK 74 R EY+R+Y KN+DKI E R+ Y KNKDK+ E+ QYY + K+ EY+R Y + +++ Sbjct: 73 RIEYQRQYLQKNKDKINEYYRQHYQKNKDKLTENSRQYYQRKKEARIEYQRLYRQRKKEE 132 Query: 75 MREKARQ 81 + RQ Sbjct: 133 RKAYRRQ 139 >gi|315185937|gb|EFU19701.1| permease YjgP/YjgQ family protein [Spirochaeta thermophila DSM 6578] Length = 424 Score = 36.1 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 4/61 (6%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKN----RDKMREKA 79 +K R+++LE++RR + K +D +R S YL E R +L N ++REK Sbjct: 262 MKEREQVLEQKRRSHEKERDDLRRSVSSLYLTTLSLSNESIRSRFLGNLRSSLQRLREKE 321 Query: 80 R 80 R Sbjct: 322 R 322 >gi|254563699|ref|YP_003070794.1| DNA helicase-like protein [Methylobacterium extorquens DM4] gi|254270977|emb|CAX26983.1| putative DNA helicase related protein [Methylobacterium extorquens DM4] Length = 1938 Score = 36.1 bits (82), Expect = 1.8, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 38/72 (52%), Gaps = 12/72 (16%) Query: 70 KNRDKMREKARQSY----RKLYSKDSWIAPEEPMGMTKAEIEALEREI--------ARLK 117 ++RD++R+ + + +++S D + PEE + T A IEA + E+ AR + Sbjct: 1663 RDRDRLRQAVLEDHGWIIHRIWSTDWFQRPEEQLERTVAAIEAAKAELDARLEFGGARAR 1722 Query: 118 AKPIEELIYKRG 129 A P+E + +R Sbjct: 1723 AVPVEVVTIERA 1734 >gi|237741330|ref|ZP_04571811.1| surface-layer protein [Fusobacterium sp. 4_1_13] gi|229430862|gb|EEO41074.1| surface-layer protein [Fusobacterium sp. 4_1_13] Length = 1498 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 43/161 (26%), Positives = 69/161 (42%), Gaps = 39/161 (24%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKN----------KDKIRESYHQ 51 + LT E + EY++ + +N+ K LE++ KN KD ++E+ + Sbjct: 507 EKNFLTQENAEWNKYEYEKEHESENKIKELEKKIAEKNKNILEAKKNSIPKDILKEAGNY 566 Query: 52 Y---YLKNKDK--------YREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMG 100 Y Y+ N+ K Y + KR L+ K E+ R+ Y+ + +++ PE P Sbjct: 567 YLNKYIPNEIKNKKNEIGAYGDPKRDEKLRKEIKELEEKREKYKNIPAEN---IPEAP-E 622 Query: 101 MTKAEIEALEREIA--------------RLKAKPIEELIYK 127 K I +LE+EI R KAK EE I K Sbjct: 623 SIKYLISSLEKEIEKDKKLLRQFKDTGMRTKAKEKEEEIAK 663 >gi|294784092|ref|ZP_06749393.1| 168 kDa surface-layer protein [Fusobacterium sp. 3_1_27] gi|294488162|gb|EFG35507.1| 168 kDa surface-layer protein [Fusobacterium sp. 3_1_27] Length = 1498 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 43/161 (26%), Positives = 69/161 (42%), Gaps = 39/161 (24%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKN----------KDKIRESYHQ 51 + LT E + EY++ + +N+ K LE++ KN KD ++E+ + Sbjct: 507 EKNFLTQENAEWNKYEYEKEHESENKIKELEKKIAEKNKNILEAKKNSIPKDILKEAGNY 566 Query: 52 Y---YLKNKDK--------YREYKRRYYLKNRDKMREKARQSYRKLYSKDSWIAPEEPMG 100 Y Y+ N+ K Y + KR L+ K E+ R+ Y+ + +++ PE P Sbjct: 567 YLNKYIPNEIKNKKNEIGAYGDPKRDEKLRKEIKELEEKREKYKNIPAEN---IPEAP-E 622 Query: 101 MTKAEIEALEREIA--------------RLKAKPIEELIYK 127 K I +LE+EI R KAK EE I K Sbjct: 623 SIKYLISSLEKEIEKDKKLLRQFKDTGMRTKAKEKEEEIAK 663 >gi|301625882|ref|XP_002942130.1| PREDICTED: hypothetical protein LOC100486353 [Xenopus (Silurana) tropicalis] Length = 2359 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 7/73 (9%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY-REYKRRYYLKNRDK 74 R YK+RY L+ ER RRY K + + Y Q Y +++Y R YK+RY L+ Sbjct: 2258 RRYKQRYKLRYDQGYRERYSRRY---KQRYKLRYDQGY---RERYSRRYKQRYKLRYDQG 2311 Query: 75 MREKARQSYRKLY 87 RE+ + Y++ Y Sbjct: 2312 YRERYSRRYKQRY 2324 Score = 35.0 bits (79), Expect = 4.0, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 7/76 (9%) Query: 16 REYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKY-REYKRRYYLKNRDK 74 R YK+RY L+ ER RRY K + + Y Q Y +++Y R YK+RY L+ Sbjct: 2278 RRYKQRYKLRYDQGYRERYSRRY---KQRYKLRYDQGY---RERYSRRYKQRYKLRYDQG 2331 Query: 75 MREKARQSYRKLYSKD 90 RE+ + Y+ Y ++ Sbjct: 2332 YRERYSRRYKLRYDQE 2347 Score = 33.4 bits (75), Expect = 10.0, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 3/78 (3%) Query: 15 RREYKRRYYLKNRDKILERRRRRY-LKNKDKIRESY-HQYYLKNKDKYRE-YKRRYYLKN 71 + Y RRY L+ ER RRY L+ +E Y +Y L+ + Y+E Y RRY L+ Sbjct: 2117 KERYSRRYKLRYEQGYKERYSRRYKLRYDQGYKERYSRRYKLRYEQGYKERYSRRYKLRY 2176 Query: 72 RDKMREKARQSYRKLYSK 89 +E+ + Y+ Y + Sbjct: 2177 DQGYKERYSRRYKLRYDQ 2194 >gi|221103613|ref|XP_002170145.1| PREDICTED: similar to ATP-dependent DNA helicase PIF1 [Hydra magnipapillata] Length = 734 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 36/64 (56%), Gaps = 5/64 (7%) Query: 10 ERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYL 69 +R C R+ ++++Y KN++KIL +R R Y+K E++ ++Y + E++ + L Sbjct: 613 KRRECERKREKKHYTKNKNKILYKRAREYIKG-----ENHDEWYGECGKHNEEFQDVFEL 667 Query: 70 KNRD 73 D Sbjct: 668 LRSD 671 >gi|160885908|ref|ZP_02066911.1| hypothetical protein BACOVA_03913 [Bacteroides ovatus ATCC 8483] gi|156108721|gb|EDO10466.1| hypothetical protein BACOVA_03913 [Bacteroides ovatus ATCC 8483] Length = 197 Score = 35.0 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 19/77 (24%) Query: 20 RRYYLKNRDKILERRR-------------------RRYLKNKDKIRESYHQYYLKNKDKY 60 + Y ++NRDK+L RR+ RRY NK+KI E +Y L +K + Sbjct: 121 KAYRIRNRDKLLARRKELRNANIEHYRELERASYKRRYKVNKEKILEKNRKYQLAHKSEI 180 Query: 61 REYKRRYYLKNRDKMRE 77 REY + YY KN+ K ++ Sbjct: 181 REYMKVYYQKNKSKWKQ 197 >gi|307718798|ref|YP_003874330.1| hypothetical protein STHERM_c11120 [Spirochaeta thermophila DSM 6192] gi|306532523|gb|ADN02057.1| hypothetical protein STHERM_c11120 [Spirochaeta thermophila DSM 6192] Length = 424 Score = 35.0 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 18/61 (29%), Positives = 32/61 (52%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 +K R+++LE++RR + K +D +R S + YL E R +L + ++ RQ Sbjct: 262 MKEREQVLEQKRRSHEKERDDLRRSVSRLYLTTLSLSNESIRSRFLGSLRSSLQRLRQKE 321 Query: 84 R 84 R Sbjct: 322 R 322 >gi|238776850|ref|NP_001154919.1| nipped-B-like protein [Danio rerio] Length = 2856 Score = 35.0 bits (79), Expect = 4.0, Method: Composition-based stats. Identities = 26/62 (41%), Positives = 38/62 (61%), Gaps = 15/62 (24%) Query: 25 KNRDKILERRRRRYL-KNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 KN+DK LE+ R + K++DK+RE K++DK RE K+RDK+REK R+ Sbjct: 790 KNQDKELEKDRDKVRDKDRDKVRE-------KDRDKVRE-------KDRDKLREKDREKI 835 Query: 84 RK 85 R+ Sbjct: 836 RE 837 >gi|307187162|gb|EFN72405.1| Death-inducer obliterator 1 [Camponotus floridanus] Length = 2322 Score = 35.0 bits (79), Expect = 4.1, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 8/70 (11%) Query: 25 KNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYR 84 K RDK ER+ R +DK RE + KDK R+ + +R + RE+ RQ + Sbjct: 1541 KTRDKSKERKSRNKESRRDKDRE-------REKDKDRDRNKDRDKTSRKESRERGRQKEK 1593 Query: 85 KLY-SKDSWI 93 + S DSW+ Sbjct: 1594 DRHKSSDSWM 1603 >gi|82915119|ref|XP_728968.1| ubiquitin carboxyl-terminal hydrolase [Plasmodium yoelii yoelii str. 17XNL] gi|23485702|gb|EAA20533.1| Ubiquitin carboxyl-terminal hydrolase family 2, putative [Plasmodium yoelii yoelii] Length = 2033 Score = 34.2 bits (77), Expect = 5.5, Method: Composition-based stats. Identities = 24/53 (45%), Positives = 33/53 (62%), Gaps = 2/53 (3%) Query: 43 DKIRESYHQY-YLKNKDKYREYKR-RYYLKNRDKMREKARQSYRKLYSKDSWI 93 D IR +Y Q L+N+DKY + KR R+ K++DK REK R YR S ++I Sbjct: 19 DVIRTNYSQNNILRNRDKYIDEKRGRHNGKDKDKFREKDRMKYRDKISYTNYI 71 >gi|302693336|ref|XP_003036347.1| hypothetical protein SCHCODRAFT_81569 [Schizophyllum commune H4-8] gi|300110043|gb|EFJ01445.1| hypothetical protein SCHCODRAFT_81569 [Schizophyllum commune H4-8] Length = 1937 Score = 34.2 bits (77), Expect = 5.8, Method: Composition-based stats. Identities = 19/59 (32%), Positives = 36/59 (61%), Gaps = 4/59 (6%) Query: 17 EYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLK--NRD 73 E +RR +K+RD+ ER +R +++N+D+ + +Y ++ R+Y+ R Y + NRD Sbjct: 500 ELERR--IKDRDERKEREKREWIRNRDRANQGSGRYRDRDDSYGRDYESRPYSRGANRD 556 >gi|148229035|ref|NP_001086537.1| chromosome 3 open reading frame 63 [Xenopus laevis] gi|49903384|gb|AAH76767.1| MGC83207 protein [Xenopus laevis] Length = 1094 Score = 34.2 bits (77), Expect = 6.6, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 1/57 (1%) Query: 28 DKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMR-EKARQSY 83 + ++E+ R YLK + +I +Y LK D +YK+ Y+ R+K E A +SY Sbjct: 34 NDVVEKHSRVYLKRRAEIPHKSKEYILKPYDSRLDYKKSLYVAPRNKGHIESALRSY 90 >gi|75253936|sp|Q67VD7|BURP9_ORYSJ RecName: Full=BURP domain-containing protein 9; Short=OsBURP09; Flags: Precursor gi|51535170|dbj|BAD37882.1| putative dehydration-responsive protein [Oryza sativa Japonica Group] Length = 515 Score = 33.8 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 18/112 (16%) Query: 1 MGRKVLTPEERMLCRREYKRR-YYLKNR------DKILERRRRRYLKNKDKIRESYHQ-- 51 M + L E + R + KR+ YL N D+ +E+ YL NKD+ ++ Q Sbjct: 89 MEKFYLYHEGKTNDRDDQKRKNIYLYNEGHANGDDQTMEKF---YLFNKDQAKDGDDQKM 145 Query: 52 --YYLKNKDKYREYK----RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 +YL NKD+ ++ R+YL N+ E Q+ K Y + A EE Sbjct: 146 GKFYLYNKDQANDWDDQKMERFYLYNKGHANEGDDQTMEKFYLYNKGHANEE 197 >gi|167768327|ref|ZP_02440380.1| hypothetical protein CLOSS21_02883 [Clostridium sp. SS2/1] gi|167709851|gb|EDS20430.1| hypothetical protein CLOSS21_02883 [Clostridium sp. SS2/1] Length = 409 Score = 33.8 bits (76), Expect = 7.4, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 43/84 (51%) Query: 24 LKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSY 83 +K D I R+ + + ++ ++Y +YY +N D Y + LK +++M + + + Sbjct: 58 IKKTDDINHSRQAKNRNERKQLSKAYQEYYKENIDLYDKLDMTKILKQKEEMLQYHPKGF 117 Query: 84 RKLYSKDSWIAPEEPMGMTKAEIE 107 K + KD++ ++ + K++ E Sbjct: 118 MKKFVKDNYEHLQKRVEEIKSDKE 141 >gi|222635287|gb|EEE65419.1| hypothetical protein OsJ_20766 [Oryza sativa Japonica Group] Length = 447 Score = 33.8 bits (76), Expect = 8.1, Method: Composition-based stats. Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 18/112 (16%) Query: 1 MGRKVLTPEERMLCRREYKRR-YYLKNR------DKILERRRRRYLKNKDKIRESYHQ-- 51 M + L E + R + KR+ YL N D+ +E+ YL NKD+ ++ Q Sbjct: 21 MEKFYLYHEGKTNDRDDQKRKNIYLYNEGHANGDDQTMEKF---YLFNKDQAKDGDDQKM 77 Query: 52 --YYLKNKDKYREYK----RRYYLKNRDKMREKARQSYRKLYSKDSWIAPEE 97 +YL NKD+ ++ R+YL N+ E Q+ K Y + A EE Sbjct: 78 GKFYLYNKDQANDWDDQKMERFYLYNKGHANEGDDQTMEKFYLYNKGHANEE 129 >gi|46125097|ref|XP_387102.1| hypothetical protein FG06926.1 [Gibberella zeae PH-1] Length = 2022 Score = 33.8 bits (76), Expect = 8.7, Method: Composition-based stats. Identities = 27/105 (25%), Positives = 51/105 (48%), Gaps = 15/105 (14%) Query: 29 KILERRRRRYLKNKDK-------IRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQ 81 +I ERR+ + + +K+K +RE + + K ++KYR R+ K R++ + KA + Sbjct: 1885 RIEERRQSKLVGDKEKDATMLAKLREKHEKEVAKQEEKYRREMRKLEEK-RERDQRKAEE 1943 Query: 82 SYRKLYSKDSWIAPEEPMGMTKAE-------IEALEREIARLKAK 119 RK ++ + +AE IE LE ++ L+A+ Sbjct: 1944 RRRKAAEQEEKNKLSLELERVRAERDVAQRQIELLEGQVGELQAQ 1988 >gi|118580982|ref|YP_902232.1| SMC domain-containing protein [Pelobacter propionicus DSM 2379] gi|118503692|gb|ABL00175.1| SMC domain protein [Pelobacter propionicus DSM 2379] Length = 1230 Score = 33.8 bits (76), Expect = 8.9, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 40/89 (44%) Query: 2 GRKVLTPEERMLCRREYKRRYYLKNRDKILERRRRRYLKNKDKIRESYHQYYLKNKDKYR 61 R +L ER LC RE + R +R ++E+R RY + E L+ DK R Sbjct: 839 QRALLAETERALCDREQQLREMTAHRGALVEQRLERYGERDPDKEEKRLADALRQADKRR 898 Query: 62 EYKRRYYLKNRDKMREKARQSYRKLYSKD 90 E R + +D++ +Q R S+D Sbjct: 899 EQALREQSRLQDELGGLKQQIERMTSSRD 927 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.313 0.141 0.414 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,374,876,069 Number of Sequences: 14124377 Number of extensions: 93694990 Number of successful extensions: 527534 Number of sequences better than 10.0: 1702 Number of HSP's better than 10.0 without gapping: 440 Number of HSP's successfully gapped in prelim test: 2796 Number of HSP's that attempted gapping in prelim test: 513457 Number of HSP's gapped (non-prelim): 13509 length of query: 129 length of database: 4,842,793,630 effective HSP length: 95 effective length of query: 34 effective length of database: 3,500,977,815 effective search space: 119033245710 effective search space used: 119033245710 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 75 (33.5 bits)