BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780219|ref|YP_003064632.1| hypothetical protein CLIBASIA_00520 [Candidatus Liberibacter asiaticus str. psy62] (117 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done Results from round 1 >gi|254780219|ref|YP_003064632.1| hypothetical protein CLIBASIA_00520 [Candidatus Liberibacter asiaticus str. psy62] gi|254039896|gb|ACT56692.1| hypothetical protein CLIBASIA_00520 [Candidatus Liberibacter asiaticus str. psy62] Length = 117 Score = 234 bits (598), Expect = 2e-60, Method: Compositional matrix adjust. Identities = 117/117 (100%), Positives = 117/117 (100%) Query: 1 MHFKIKRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKIN 60 MHFKIKRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKIN Sbjct: 1 MHFKIKRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKIN 60 Query: 61 TVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 TVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS Sbjct: 61 TVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 >gi|242005653|ref|XP_002423678.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212506847|gb|EEB10940.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 928 Score = 38.1 bits (87), Expect = 0.37, Method: Compositional matrix adjust. Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 9/83 (10%) Query: 37 SQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAI--SKMESELKELLSDILLKRHPD 94 +Q+S+ LSDA F+ Y D+M KIN ++ L AI +E L++++ +L +P Sbjct: 126 AQQSRPPLSDAEFRKYLDSMGKIN----QMKELRLAIYYGGVEPGLRKVVWKHILNVYPI 181 Query: 95 EI---DKINPIKNSANEISKLKE 114 + ++IN IKN + E LKE Sbjct: 182 GMSGKERINYIKNKSREYEILKE 204 >gi|171912987|ref|ZP_02928457.1| cytosolic axial filament protein cafA and ribonuclease E [Verrucomicrobium spinosum DSM 4136] Length = 529 Score = 37.7 bits (86), Expect = 0.52, Method: Composition-based stats. Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Query: 26 PIVQFKQMKYESQESKKSLSDALFKTYPDTM--DKINTVQTALRNLHNAISKMESELKEL 83 PI QF M+ Q +SLSDA+++ P +I T T L +S + S L E Sbjct: 410 PISQFGLMEMTRQRLHESLSDAMYEECPHCKGHGQIKTPLTMSVELQRRLSSIMSRLPEH 469 Query: 84 LSDILLKRHPDEIDKI 99 D+L+ HPD + ++ Sbjct: 470 DRDVLVVIHPDVMQRL 485 >gi|325576895|ref|ZP_08147510.1| ErpY protein [Haemophilus parainfluenzae ATCC 33392] gi|325161101|gb|EGC73219.1| ErpY protein [Haemophilus parainfluenzae ATCC 33392] Length = 80 Score = 37.0 bits (84), Expect = 0.81, Method: Compositional matrix adjust. Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 8/70 (11%) Query: 6 KRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESK----KSLSDALFKTYPDTM----D 57 K L PL LL SC + DPIV+F ESK K D +K YP+ D Sbjct: 7 KNILIPLFTLLTSCTSDVNDPIVRFWNDGIRPSESKMSAIKECLDKAYKIYPNEKEGYDD 66 Query: 58 KINTVQTALR 67 +I V + ++ Sbjct: 67 RIGYVDSCMK 76 >gi|312371970|gb|EFR20025.1| hypothetical protein AND_20771 [Anopheles darlingi] Length = 1185 Score = 37.0 bits (84), Expect = 0.81, Method: Composition-based stats. Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 7/87 (8%) Query: 13 LALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDK--INTVQTALRNLH 70 LALLG C P+++ Q++ + +L + L T P ++DK + TA+R L Sbjct: 493 LALLGRCL-----PLLEVPQVEEHFDDLWNALKEQLLPTPPTSVDKELLEAGLTAIRELL 547 Query: 71 NAISKMESELKELLSDILLKRHPDEID 97 SK E+ K+LL ILL D D Sbjct: 548 KQASKDETAAKQLLDQILLSVMADLTD 574 >gi|328716212|ref|XP_003245867.1| PREDICTED: hypothetical protein LOC100570925 [Acyrthosiphon pisum] Length = 300 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 10/85 (11%) Query: 11 PLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYP--------DTMDKINTV 62 P + ++GS +D PK +V F +KY S K+ D FK Y +++D + Sbjct: 216 PCILVIGSLND-PKQILVYFDNIKYVVFSSSKAF-DICFKIYHVFNIEYPMESIDVWQFI 273 Query: 63 QTALRNLHNAISKMESELKELLSDI 87 QT N+H K S +K++ +++ Sbjct: 274 QTFFYNIHTKYDKTSSLVKQVTAEL 298 >gi|229917181|ref|YP_002885827.1| peptidase M23 [Exiguobacterium sp. AT1b] gi|229468610|gb|ACQ70382.1| Peptidase M23 [Exiguobacterium sp. AT1b] Length = 498 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 43/83 (51%), Gaps = 8/83 (9%) Query: 34 KYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHP 93 K E Q+ K D++ K +T KI + + + ++KM+++L+++++D+ +KR Sbjct: 32 KQEQQQKVKEKRDSVKKDQSETSSKIEVNKEEISKVQAEVNKMDAQLQDIINDVAMKRQ- 90 Query: 94 DEIDKINPIKNSANEISKLKEDL 116 IK + +I L+ D+ Sbjct: 91 -------EIKRTEMKIEDLEADI 106 >gi|126326037|ref|XP_001375700.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 641 Score = 34.7 bits (78), Expect = 4.2, Method: Compositional matrix adjust. Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 1/72 (1%) Query: 10 FPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLS-DALFKTYPDTMDKINTVQTALRN 68 F +L + S NP+ P+V +KQ K Q K S + F+ P T++ T+L++ Sbjct: 390 FKILTSIDSIISNPRAPVVIYKQSKGSFQSYNKINSQECGFEHLPLTIEMWADQLTSLKD 449 Query: 69 LHNAISKMESEL 80 LH ++ K+ EL Sbjct: 450 LHRSLEKLSEEL 461 >gi|255564107|ref|XP_002523051.1| conserved hypothetical protein [Ricinus communis] gi|223537708|gb|EEF39330.1| conserved hypothetical protein [Ricinus communis] Length = 116 Score = 34.7 bits (78), Expect = 4.4, Method: Compositional matrix adjust. Identities = 18/60 (30%), Positives = 35/60 (58%), Gaps = 1/60 (1%) Query: 37 SQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDEI 96 SQE K SL + + K + ++ TA+RN +I +E+++ + LS ++++R PD + Sbjct: 20 SQEKKPSLEELMMKFIATSENRFQQTDTAIRNQQASIQNLETQIGQ-LSRMMVERQPDTL 78 >gi|302761126|ref|XP_002963985.1| hypothetical protein SELMODRAFT_166499 [Selaginella moellendorffii] gi|300167714|gb|EFJ34318.1| hypothetical protein SELMODRAFT_166499 [Selaginella moellendorffii] Length = 813 Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 30/51 (58%) Query: 49 FKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKI 99 + T PDT++ +N + AL ++H ++ + L D+LL +H +++ +I Sbjct: 364 YVTCPDTLEWLNVTKVALHHVHGTVAVPTANLIVQFYDVLLDKHANQVCQI 414 >gi|302768517|ref|XP_002967678.1| hypothetical protein SELMODRAFT_88585 [Selaginella moellendorffii] gi|300164416|gb|EFJ31025.1| hypothetical protein SELMODRAFT_88585 [Selaginella moellendorffii] Length = 812 Score = 34.7 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 30/51 (58%) Query: 49 FKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKI 99 + T PDT++ +N + AL ++H ++ + L D+LL +H +++ +I Sbjct: 363 YVTCPDTLEWLNVTKVALHHVHGTVAVPTANLIVQFYDVLLDKHANQVCQI 413 >gi|320532536|ref|ZP_08033347.1| tetratricopeptide repeat protein [Actinomyces sp. oral taxon 171 str. F0337] gi|320135250|gb|EFW27387.1| tetratricopeptide repeat protein [Actinomyces sp. oral taxon 171 str. F0337] Length = 195 Score = 33.9 bits (76), Expect = 7.6, Method: Compositional matrix adjust. Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 12/87 (13%) Query: 31 KQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSD---I 87 + +KY S+K L D T+P T+ +NT+ A R+ N I S +++L+D I Sbjct: 52 QNLKY----SRKLLGD----THPATLASLNTLANASRDSGN-IEDAASLFEKILADRIRI 102 Query: 88 LLKRHPDEIDKINPIKNSANEISKLKE 114 L HPD + N + + E+ KL+E Sbjct: 103 LGPDHPDILTSRNNLAGTLQEVGKLEE 129 >gi|300785323|ref|YP_003765614.1| cytochrome P450 [Amycolatopsis mediterranei U32] gi|299794837|gb|ADJ45212.1| cytochrome P450 [Amycolatopsis mediterranei U32] Length = 396 Score = 33.9 bits (76), Expect = 7.7, Method: Composition-based stats. Identities = 21/75 (28%), Positives = 34/75 (45%) Query: 23 PKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKE 82 P I + + YE +E + LS A F + + + +L L + + K E + Sbjct: 145 PSLTICELLGVSYEDREDFQRLSTARFDLFGGAGASLGAMSESLTYLLDIVKKQREEPGD 204 Query: 83 LLSDILLKRHPDEID 97 L +L+K H DEID Sbjct: 205 GLLGMLIKEHGDEID 219 >gi|298491784|ref|YP_003721961.1| alanyl-tRNA synthetase ['Nostoc azollae' 0708] gi|298233702|gb|ADI64838.1| alanyl-tRNA synthetase ['Nostoc azollae' 0708] Length = 880 Score = 33.9 bits (76), Expect = 8.5, Method: Composition-based stats. Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%) Query: 44 LSDALFKTYPDTM-DKINTVQTALRNLHNAISKMESELKELLSDILLK 90 LSD L K P+ + D+I T+QT LRN I ++S+L + SD LLK Sbjct: 719 LSDRL-KVKPEEIPDRIRTLQTELRNSEKEIQTLKSQLAIVKSDSLLK 765 >gi|291562400|emb|CBL41216.1| Superfamily I DNA and RNA helicases and helicase subunits [butyrate-producing bacterium SS3/4] Length = 972 Score = 33.5 bits (75), Expect = 9.5, Method: Compositional matrix adjust. Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%) Query: 37 SQESKKSLSDALFKTYPDTMDKINTVQTAL-RNLHNAISKMESELKELLSDILLKRHPDE 95 ++E K L D LF PDT +I TVQ + RN+ N IS++ E K + D+ Sbjct: 740 TKEPKYRLFDLLFNVLPDTHKQILTVQYRMRRNIGNLISQVFYEGK-------IATEVDD 792 Query: 96 IDKINPIKNSAN 107 ++++PIK A Sbjct: 793 KNRMHPIKKFAG 804 Searching..................................................done Results from round 2 CONVERGED! >gi|254780219|ref|YP_003064632.1| hypothetical protein CLIBASIA_00520 [Candidatus Liberibacter asiaticus str. psy62] gi|254039896|gb|ACT56692.1| hypothetical protein CLIBASIA_00520 [Candidatus Liberibacter asiaticus str. psy62] Length = 117 Score = 215 bits (548), Expect = 1e-54, Method: Composition-based stats. Identities = 117/117 (100%), Positives = 117/117 (100%) Query: 1 MHFKIKRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKIN 60 MHFKIKRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKIN Sbjct: 1 MHFKIKRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKIN 60 Query: 61 TVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 TVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS Sbjct: 61 TVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 >gi|242005653|ref|XP_002423678.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212506847|gb|EEB10940.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 928 Score = 40.5 bits (93), Expect = 0.079, Method: Composition-based stats. Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 9/83 (10%) Query: 37 SQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAI--SKMESELKELLSDILLKRHPD 94 +Q+S+ LSDA F+ Y D+M KIN ++ L AI +E L++++ +L +P Sbjct: 126 AQQSRPPLSDAEFRKYLDSMGKINQ----MKELRLAIYYGGVEPGLRKVVWKHILNVYPI 181 Query: 95 EI---DKINPIKNSANEISKLKE 114 + ++IN IKN + E LKE Sbjct: 182 GMSGKERINYIKNKSREYEILKE 204 >gi|171912987|ref|ZP_02928457.1| cytosolic axial filament protein cafA and ribonuclease E [Verrucomicrobium spinosum DSM 4136] Length = 529 Score = 40.1 bits (92), Expect = 0.10, Method: Composition-based stats. Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Query: 26 PIVQFKQMKYESQESKKSLSDALFKTYPDTM--DKINTVQTALRNLHNAISKMESELKEL 83 PI QF M+ Q +SLSDA+++ P +I T T L +S + S L E Sbjct: 410 PISQFGLMEMTRQRLHESLSDAMYEECPHCKGHGQIKTPLTMSVELQRRLSSIMSRLPEH 469 Query: 84 LSDILLKRHPDEIDKI 99 D+L+ HPD + ++ Sbjct: 470 DRDVLVVIHPDVMQRL 485 >gi|312371970|gb|EFR20025.1| hypothetical protein AND_20771 [Anopheles darlingi] Length = 1185 Score = 39.7 bits (91), Expect = 0.12, Method: Composition-based stats. Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 7/87 (8%) Query: 13 LALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDK--INTVQTALRNLH 70 LALLG C P+++ Q++ + +L + L T P ++DK + TA+R L Sbjct: 493 LALLGRCL-----PLLEVPQVEEHFDDLWNALKEQLLPTPPTSVDKELLEAGLTAIRELL 547 Query: 71 NAISKMESELKELLSDILLKRHPDEID 97 SK E+ K+LL ILL D D Sbjct: 548 KQASKDETAAKQLLDQILLSVMADLTD 574 >gi|325576895|ref|ZP_08147510.1| ErpY protein [Haemophilus parainfluenzae ATCC 33392] gi|325161101|gb|EGC73219.1| ErpY protein [Haemophilus parainfluenzae ATCC 33392] Length = 80 Score = 38.5 bits (88), Expect = 0.31, Method: Composition-based stats. Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 8/70 (11%) Query: 6 KRFLFPLLALLGSCDDNPKDPIVQFKQMKYESQESK----KSLSDALFKTYPDTM----D 57 K L PL LL SC + DPIV+F ESK K D +K YP+ D Sbjct: 7 KNILIPLFTLLTSCTSDVNDPIVRFWNDGIRPSESKMSAIKECLDKAYKIYPNEKEGYDD 66 Query: 58 KINTVQTALR 67 +I V + ++ Sbjct: 67 RIGYVDSCMK 76 >gi|221129849|ref|XP_002157295.1| PREDICTED: hypothetical protein [Hydra magnipapillata] Length = 938 Score = 38.1 bits (87), Expect = 0.39, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 56/113 (49%), Gaps = 18/113 (15%) Query: 17 GSCDDNPKDPIVQFKQ----MKYESQ--------ESKKSLSDALFKTYPDTMDKINTVQT 64 G D+N D + K M+ E+Q ES K++SD +K++ ++ Sbjct: 250 GHIDNNQSDESLHVKNNEENMQRETQLISDNQMPESLKNISD----ITQTKSEKLSDIEQ 305 Query: 65 ALRNLHNAISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 ++NL A+SK + K ++ ++ + PD + +N +KN N SKL E+L+ Sbjct: 306 QIQNLQLALSKEKQNYKNIVEMVMTQSSPDNLKFLNSLKN--NLYSKLIENLN 356 >gi|229917181|ref|YP_002885827.1| peptidase M23 [Exiguobacterium sp. AT1b] gi|229468610|gb|ACQ70382.1| Peptidase M23 [Exiguobacterium sp. AT1b] Length = 498 Score = 37.8 bits (86), Expect = 0.47, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 43/83 (51%), Gaps = 8/83 (9%) Query: 34 KYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHP 93 K E Q+ K D++ K +T KI + + + ++KM+++L+++++D+ +KR Sbjct: 32 KQEQQQKVKEKRDSVKKDQSETSSKIEVNKEEISKVQAEVNKMDAQLQDIINDVAMKRQ- 90 Query: 94 DEIDKINPIKNSANEISKLKEDL 116 IK + +I L+ D+ Sbjct: 91 -------EIKRTEMKIEDLEADI 106 >gi|9634297|ref|NP_037836.1| ORF76 cg30 [Spodoptera exigua MNPV] gi|6960535|gb|AAF33605.1|AF169823_76 ORF76 cg30 [Spodoptera exigua MNPV] Length = 461 Score = 37.4 bits (85), Expect = 0.69, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 48/92 (52%), Gaps = 7/92 (7%) Query: 25 DPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELL 84 D ++ +++YE ++ K ++D +T D+IN L+N + ++ E+ ++ Sbjct: 261 DAAIELSRIQYEREKIIKEINDNF-----ETQDRINVTNDILKNFRDFMNTKNEEIAKMK 315 Query: 85 SDI--LLKRHPDEIDKINPIKNSANEISKLKE 114 +DI L K+ D + I +++ NE +LK+ Sbjct: 316 TDIENLNKKKNDLVKDIANLQSKLNENEQLKK 347 >gi|255564107|ref|XP_002523051.1| conserved hypothetical protein [Ricinus communis] gi|223537708|gb|EEF39330.1| conserved hypothetical protein [Ricinus communis] Length = 116 Score = 37.4 bits (85), Expect = 0.74, Method: Composition-based stats. Identities = 18/60 (30%), Positives = 35/60 (58%), Gaps = 1/60 (1%) Query: 37 SQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDEI 96 SQE K SL + + K + ++ TA+RN +I +E+++ + LS ++++R PD + Sbjct: 20 SQEKKPSLEELMMKFIATSENRFQQTDTAIRNQQASIQNLETQIGQ-LSRMMVERQPDTL 78 >gi|320532536|ref|ZP_08033347.1| tetratricopeptide repeat protein [Actinomyces sp. oral taxon 171 str. F0337] gi|320135250|gb|EFW27387.1| tetratricopeptide repeat protein [Actinomyces sp. oral taxon 171 str. F0337] Length = 195 Score = 37.0 bits (84), Expect = 0.88, Method: Composition-based stats. Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 12/87 (13%) Query: 31 KQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSD---I 87 + +KY S+K L D T+P T+ +NT+ A R+ N I S +++L+D I Sbjct: 52 QNLKY----SRKLLGD----THPATLASLNTLANASRDSGN-IEDAASLFEKILADRIRI 102 Query: 88 LLKRHPDEIDKINPIKNSANEISKLKE 114 L HPD + N + + E+ KL+E Sbjct: 103 LGPDHPDILTSRNNLAGTLQEVGKLEE 129 >gi|308163064|gb|EFO65427.1| DNA repair and recombination protein Rhp26p [Giardia lamblia P15] Length = 930 Score = 37.0 bits (84), Expect = 0.94, Method: Composition-based stats. Identities = 22/96 (22%), Positives = 50/96 (52%), Gaps = 7/96 (7%) Query: 12 LLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHN 71 L A++G CD++ ++P+VQ K++ E ++ + + P + ++ +L++L + Sbjct: 826 LFAIMGYCDEDEENPVVQPKKVSKEDLQTSIKEVMEIACSAPSNTVTLYAIKESLQHLTD 885 Query: 72 AISK-------MESELKELLSDILLKRHPDEIDKIN 100 + K ++S +EL + LL+ ++KIN Sbjct: 886 STQKDLLDHLFLKSSSEELQTHTLLELKDTIVNKIN 921 >gi|224025591|ref|ZP_03643957.1| hypothetical protein BACCOPRO_02331 [Bacteroides coprophilus DSM 18228] gi|224018827|gb|EEF76825.1| hypothetical protein BACCOPRO_02331 [Bacteroides coprophilus DSM 18228] Length = 580 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 11/105 (10%) Query: 24 KDPIVQFKQM--KYESQ--ESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESE 79 ++ I F+Q Y+ Q E+ K LF Y D DK+NT+ + N I+K+E+ Sbjct: 96 EEQIASFEQQIANYKKQLAENMKDYQAQLFTNYEDAADKVNTINGEIANNLYLIAKLENG 155 Query: 80 LKELLSDILLKRHPDEID------KINPIKN-SANEISKLKEDLS 117 + + I + DE+ KI K S + +S LK +L+ Sbjct: 156 VASAKAAIAELNYTDELTIAENKAKIEAYKQMSGSSLSDLKSELA 200 >gi|153870308|ref|ZP_01999736.1| hypothetical protein BGP_1042 [Beggiatoa sp. PS] gi|152073226|gb|EDN70264.1| hypothetical protein BGP_1042 [Beggiatoa sp. PS] Length = 333 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 53/107 (49%), Gaps = 18/107 (16%) Query: 27 IVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNL----------HNAISKM 76 I++ KQMKY+ +E + D F+ P+ + K+ T Q AL + H I ++ Sbjct: 221 ILEVKQMKYQIREETWPVIDQFFREMPEAIGKLPTFQEALASTQKQGIQQGVHHVVIRQL 280 Query: 77 ESELKELLSDILLKRHPDEIDKINPIKN------SANEISKLKEDLS 117 + ++ + ++ +H + ++ + N SANE++++ ++S Sbjct: 281 RRKFPQISTGLV--QHIEATSDMDQLDNWLDQIMSANELAEIDFNIS 325 >gi|291562400|emb|CBL41216.1| Superfamily I DNA and RNA helicases and helicase subunits [butyrate-producing bacterium SS3/4] Length = 972 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 25/73 (34%), Positives = 39/73 (53%), Gaps = 8/73 (10%) Query: 37 SQESKKSLSDALFKTYPDTMDKINTVQTAL-RNLHNAISKMESELKELLSDILLKRHPDE 95 ++E K L D LF PDT +I TVQ + RN+ N IS++ E K + D+ Sbjct: 740 TKEPKYRLFDLLFNVLPDTHKQILTVQYRMRRNIGNLISQVFYEGK-------IATEVDD 792 Query: 96 IDKINPIKNSANE 108 ++++PIK A + Sbjct: 793 KNRMHPIKKFAGK 805 >gi|57238867|ref|YP_180003.1| phenylalanyl-tRNA synthetase subunit alpha [Ehrlichia ruminantium str. Welgevonden] gi|58578796|ref|YP_197008.1| phenylalanyl-tRNA synthetase alpha chain [Ehrlichia ruminantium str. Welgevonden] gi|58616855|ref|YP_196054.1| phenylalanyl-tRNA synthetase alpha chain [Ehrlichia ruminantium str. Gardel] gi|75356985|sp|Q5FH45|SYFA_EHRRG RecName: Full=Phenylalanyl-tRNA synthetase alpha chain; AltName: Full=Phenylalanine--tRNA ligase alpha chain; Short=PheRS gi|81353053|sp|Q5HC40|SYFA_EHRRW RecName: Full=Phenylalanyl-tRNA synthetase alpha chain; AltName: Full=Phenylalanine--tRNA ligase alpha chain; Short=PheRS gi|57160946|emb|CAH57852.1| phenylalanyl-tRNA synthetase alpha chain [Ehrlichia ruminantium str. Welgevonden] gi|58416467|emb|CAI27580.1| Phenylalanyl-tRNA synthetase alpha chain [Ehrlichia ruminantium str. Gardel] gi|58417422|emb|CAI26626.1| Phenylalanyl-tRNA synthetase alpha chain [Ehrlichia ruminantium str. Welgevonden] Length = 344 Score = 36.2 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 40/74 (54%), Gaps = 5/74 (6%) Query: 39 ESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDK 98 E +KS+ + + Y + IN+ +T L + +++++L + DI L P +I K Sbjct: 53 EERKSVGNVVNTIYSELKSLINSHRTKLHQI-----QIDNQLLQDKVDISLPIRPQKIGK 107 Query: 99 INPIKNSANEISKL 112 ++PI N NE+ ++ Sbjct: 108 LHPISNVLNEVKRI 121 >gi|198415074|ref|XP_002123447.1| PREDICTED: similar to LOC397908 protein [Ciona intestinalis] Length = 852 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 9/92 (9%) Query: 31 KQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLH------NAISKMESELKELL 84 K + + + ++K AL K Y D +D++ ALR H +M ++KEL Sbjct: 201 KNINNKPRVNQKLTKRALIKEYDDEIDRLRREVNALREKHGIYLDPKQYEEMVEKIKELT 260 Query: 85 SDILLKRHPDEIDKINPI-KNSANEISKLKED 115 S L++ E+D+ + KNS+NE++++K++ Sbjct: 261 ST--LQQRTQEMDEFKFLFKNSSNELNEIKQE 290 >gi|149541812|ref|XP_001506568.1| PREDICTED: similar to hepatic multiple inositol polyphosphate phosphatase, partial [Ornithorhynchus anatinus] Length = 131 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 17/124 (13%) Query: 1 MHFKIKRFLFPLLALLGSCDDNPKDPIVQFK-----QMKYESQESKKSLSDALFKTY--- 52 +HF L PLL+L+G D K+P+ + Q K+ S S+ +F Y Sbjct: 10 LHFGHAETLLPLLSLMGYFKD--KEPLTAYNFKEQAQRKFRSGRIVPYASNLIFVLYHCE 67 Query: 53 -PDTMDKINTVQTALRN-----LHNA-ISKMESELKELLSDILLKRHPDEIDKINPIKNS 105 T ++ VQ L +H+ + + +LK+ DIL +PD+ K+ + N+ Sbjct: 68 HATTPEEEYQVQLLLNEKLLPFIHSQETTSLYGDLKDHYKDILQSCNPDQECKLPKVNNT 127 Query: 106 ANEI 109 A+E+ Sbjct: 128 ADEL 131 >gi|126326037|ref|XP_001375700.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 641 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 1/72 (1%) Query: 10 FPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDAL-FKTYPDTMDKINTVQTALRN 68 F +L + S NP+ P+V +KQ K Q K S F+ P T++ T+L++ Sbjct: 390 FKILTSIDSIISNPRAPVVIYKQSKGSFQSYNKINSQECGFEHLPLTIEMWADQLTSLKD 449 Query: 69 LHNAISKMESEL 80 LH ++ K+ EL Sbjct: 450 LHRSLEKLSEEL 461 >gi|123498266|ref|XP_001327365.1| hypothetical protein [Trichomonas vaginalis G3] gi|121910293|gb|EAY15142.1| hypothetical protein TVAG_392590 [Trichomonas vaginalis G3] Length = 252 Score = 35.4 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 9/84 (10%) Query: 36 ESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDE 95 + Q K S ++ + K D +N +QT NL I ++S+LKE D L K Sbjct: 15 QVQNEKSSATNKIEKLQSD----LNRIQTEKANLEKEIQTLKSQLKEKERDDLPKEF--- 67 Query: 96 IDKINPIKNSA--NEISKLKEDLS 117 + KI+ +KNS N+I K E++S Sbjct: 68 LSKISELKNSEDFNQIYKFFEEIS 91 >gi|298491784|ref|YP_003721961.1| alanyl-tRNA synthetase ['Nostoc azollae' 0708] gi|298233702|gb|ADI64838.1| alanyl-tRNA synthetase ['Nostoc azollae' 0708] Length = 880 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 2/48 (4%) Query: 44 LSDALFKTYPDTM-DKINTVQTALRNLHNAISKMESELKELLSDILLK 90 LSD L K P+ + D+I T+QT LRN I ++S+L + SD LLK Sbjct: 719 LSDRL-KVKPEEIPDRIRTLQTELRNSEKEIQTLKSQLAIVKSDSLLK 765 >gi|157103376|ref|XP_001647951.1| tropomyosin, putative [Aedes aegypti] gi|108884174|gb|EAT48399.1| tropomyosin, putative [Aedes aegypti] Length = 448 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 60/121 (49%), Gaps = 23/121 (19%) Query: 17 GSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMD---KINTVQTALRN----- 68 G DD ++ +++ + KY+ Q S K DAL K + + +D K+ T++ AL + Sbjct: 223 GMTDDKLREEVLKLQDEKYQYQNSAK---DALRKVHQERLDATHKVATIEKALCSSEDEC 279 Query: 69 --LHNAISKMESELKELLS--DILLKRHPDEID--------KINPIKNSANEISKLKEDL 116 L ++K + +L+E+ D L ++ + + K + I N +EI+ L+E L Sbjct: 280 SLLREQLNKTQLQLQEVTGRLDALQNQYDERVASSEEQLKVKESEITNLGHEINMLQEKL 339 Query: 117 S 117 S Sbjct: 340 S 340 >gi|328716212|ref|XP_003245867.1| PREDICTED: hypothetical protein LOC100570925 [Acyrthosiphon pisum] Length = 300 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 10/85 (11%) Query: 11 PLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYP--------DTMDKINTV 62 P + ++GS +D PK +V F +KY S K+ D FK Y +++D + Sbjct: 216 PCILVIGSLND-PKQILVYFDNIKYVVFSSSKAF-DICFKIYHVFNIEYPMESIDVWQFI 273 Query: 63 QTALRNLHNAISKMESELKELLSDI 87 QT N+H K S +K++ +++ Sbjct: 274 QTFFYNIHTKYDKTSSLVKQVTAEL 298 >gi|45190651|ref|NP_984905.1| AER045Cp [Ashbya gossypii ATCC 10895] gi|44983630|gb|AAS52729.1| AER045Cp [Ashbya gossypii ATCC 10895] Length = 1292 Score = 35.1 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 43/83 (51%), Gaps = 8/83 (9%) Query: 40 SKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLS-------DILLKRH 92 ++K+ ++ LF+ + DK+NT + L+ +S +SEL E+ S D++ R Sbjct: 1028 NQKAKAEGLFQERAELKDKLNTSEKQLQESSQKLSNAQSELNEIRSRLKANEHDLITSRQ 1087 Query: 93 PDE-IDKINPIKNSANEISKLKE 114 E + K N ++S +I KL E Sbjct: 1088 EAEKLKKQNQQQSSKKDIHKLDE 1110 >gi|270003693|gb|EFA00141.1| hypothetical protein TcasGA2_TC002962 [Tribolium castaneum] Length = 11744 Score = 35.1 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 50/98 (51%), Gaps = 4/98 (4%) Query: 24 KDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKEL 83 ++ ++ +QM E ++ S + K+ + + ++ + T L+N+ N + E + K Sbjct: 7395 QESLLNSEQMSIEESQTLPSATVEQVKSLSEPLHQLTAIVTDLKNVANKPTVQEQKPKVH 7454 Query: 84 LSDILLKRHPDEIDKINP----IKNSANEISKLKEDLS 117 + + LK +EI++I P +K SAN + L+ LS Sbjct: 7455 KAVLALKCKLEEIEQIIPDVKSLKQSANAVQSLRAKLS 7492 >gi|187777645|ref|ZP_02994118.1| hypothetical protein CLOSPO_01237 [Clostridium sporogenes ATCC 15579] gi|187774573|gb|EDU38375.1| hypothetical protein CLOSPO_01237 [Clostridium sporogenes ATCC 15579] Length = 663 Score = 35.1 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 8/84 (9%) Query: 42 KSLSDALFKTY------PDTMDKINTVQTALRNLHNAISKMESELKEL--LSDILLKRHP 93 K + D L K Y DT DKIN A + +I K+E ++ + I+ Sbjct: 544 KEVKDILDKMYETIELEKDTKDKINITDNAFNTIRESIDKLEESIRNVNESQKIIYNNKN 603 Query: 94 DEIDKINPIKNSANEISKLKEDLS 117 D ++KIN + + EI+ E+++ Sbjct: 604 DILNKINEASSVSEEIAATTEEIT 627 >gi|302761126|ref|XP_002963985.1| hypothetical protein SELMODRAFT_166499 [Selaginella moellendorffii] gi|300167714|gb|EFJ34318.1| hypothetical protein SELMODRAFT_166499 [Selaginella moellendorffii] Length = 813 Score = 34.7 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 30/51 (58%) Query: 49 FKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKI 99 + T PDT++ +N + AL ++H ++ + L D+LL +H +++ +I Sbjct: 364 YVTCPDTLEWLNVTKVALHHVHGTVAVPTANLIVQFYDVLLDKHANQVCQI 414 >gi|302768517|ref|XP_002967678.1| hypothetical protein SELMODRAFT_88585 [Selaginella moellendorffii] gi|300164416|gb|EFJ31025.1| hypothetical protein SELMODRAFT_88585 [Selaginella moellendorffii] Length = 812 Score = 34.7 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 30/51 (58%) Query: 49 FKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDEIDKI 99 + T PDT++ +N + AL ++H ++ + L D+LL +H +++ +I Sbjct: 363 YVTCPDTLEWLNVTKVALHHVHGTVAVPTANLIVQFYDVLLDKHANQVCQI 413 >gi|219113019|ref|XP_002186093.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|209582943|gb|ACI65563.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 565 Score = 34.7 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 16/93 (17%) Query: 26 PIVQFKQMKYESQES-----KKSLSDALFKTYPDTMDKINTVQTALRN-----------L 69 PI ++K+ QES K + D L PD+ D ++ +Q+ L+ Sbjct: 318 PIELMAKLKFRRQESVSLFFKPAPDDPLIYMCPDSGDVVHQIQSVLKRHGVKGKHTNAAA 377 Query: 70 HNAISKMESELKELLSDILLKRHPDEIDKINPI 102 H AI++ S ++++ + L +H +D++N I Sbjct: 378 HRAINEALSLVQDIQTKELALKHDPTVDRVNDI 410 >gi|300785323|ref|YP_003765614.1| cytochrome P450 [Amycolatopsis mediterranei U32] gi|299794837|gb|ADJ45212.1| cytochrome P450 [Amycolatopsis mediterranei U32] Length = 396 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 21/75 (28%), Positives = 34/75 (45%) Query: 23 PKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKE 82 P I + + YE +E + LS A F + + + +L L + + K E + Sbjct: 145 PSLTICELLGVSYEDREDFQRLSTARFDLFGGAGASLGAMSESLTYLLDIVKKQREEPGD 204 Query: 83 LLSDILLKRHPDEID 97 L +L+K H DEID Sbjct: 205 GLLGMLIKEHGDEID 219 >gi|170088222|ref|XP_001875334.1| predicted protein [Laccaria bicolor S238N-H82] gi|164650534|gb|EDR14775.1| predicted protein [Laccaria bicolor S238N-H82] Length = 914 Score = 34.7 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 44/76 (57%), Gaps = 7/76 (9%) Query: 13 LALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKT-YPDTMDKINTVQTALRNLHN 71 LA++G+C++ I+ +Q E + L A+ T +PD+ D I V+ A ++ + Sbjct: 25 LAVIGACEE-----IIFVEQQVGELKNRVCCLMIAIVNTLHPDSSD-IEQVKRASEDIRS 78 Query: 72 AISKMESELKELLSDI 87 I ++ES+LKE++ D+ Sbjct: 79 EIGQLESDLKEIVKDL 94 >gi|168181490|ref|ZP_02616154.1| methyl-accepting chemotaxis protein [Clostridium botulinum Bf] gi|182675390|gb|EDT87351.1| methyl-accepting chemotaxis protein [Clostridium botulinum Bf] Length = 663 Score = 34.7 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 45/94 (47%), Gaps = 5/94 (5%) Query: 29 QFKQMKYESQESKKSLSDALFKTY---PDTMDKINTVQTALRNLHNAISKMESELKEL-- 83 + +++ Y + K + D +++T DT DKI A + +I K+E ++ + Sbjct: 534 EIEKVVYNINKEVKDILDKMYETIELEKDTKDKIGITDNAFNTIRKSIDKLEESIRNVNE 593 Query: 84 LSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 I+ D ++KIN + + EI+ E+++ Sbjct: 594 SQKIIYNNKNDILNKINEASSVSEEIAATTEEIT 627 >gi|237793884|ref|YP_002861436.1| methyl-accepting chemotaxis protein [Clostridium botulinum Ba4 str. 657] gi|229264020|gb|ACQ55053.1| methyl-accepting chemotaxis protein [Clostridium botulinum Ba4 str. 657] Length = 663 Score = 34.7 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 45/94 (47%), Gaps = 5/94 (5%) Query: 29 QFKQMKYESQESKKSLSDALFKTY---PDTMDKINTVQTALRNLHNAISKMESELKEL-- 83 + +++ Y + K + D +++T DT DKI A + +I K+E ++ + Sbjct: 534 EIEKVVYNINKEVKDILDKMYETIELEKDTKDKIGITDNAFNTIRKSIDKLEESIRNVNE 593 Query: 84 LSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 I+ D ++KIN + + EI+ E+++ Sbjct: 594 SQKIIYNNKNDILNKINEASSVSEEIAATTEEIT 627 >gi|291395222|ref|XP_002714148.1| PREDICTED: retinoic acid induced 14-like, partial [Oryctolagus cuniculus] Length = 970 Score = 34.7 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 7/82 (8%) Query: 36 ESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDILLKRHPDE 95 ES S L+D L K+ D+ +I +Q L++L + E+E K+L ++ L R PD Sbjct: 399 ESISSPSVLTDLLGKSTADSDVRIQQLQEVLQDLQKKLESSEAERKQLQAE-LQSRRPD- 456 Query: 96 IDKINPIKNSANEISKLKEDLS 117 P+ + EIS+ DLS Sbjct: 457 -----PVGLNNTEISENGSDLS 473 >gi|170759292|ref|YP_001785899.1| methyl-accepting chemotaxis protein [Clostridium botulinum A3 str. Loch Maree] gi|169406281|gb|ACA54692.1| methyl-accepting chemotaxis protein [Clostridium botulinum A3 str. Loch Maree] Length = 663 Score = 34.3 bits (77), Expect = 5.3, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 46/94 (48%), Gaps = 5/94 (5%) Query: 29 QFKQMKYESQESKKSLSDALFKTY---PDTMDKINTVQTALRNLHNAISKMESELKEL-- 83 + +++ Y + K + D +++T DT DKI+ A + +I K+E ++ + Sbjct: 534 EIERVVYNINKEVKDILDKMYETIELEKDTGDKIDITDNAFNTIRKSIDKLEESIRNVNE 593 Query: 84 LSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 I+ D ++KIN + + EI+ E+++ Sbjct: 594 SQKIIYNNKNDILNKINEASSVSEEIAATTEEIT 627 >gi|326913042|ref|XP_003202851.1| PREDICTED: myosin-6-like [Meleagris gallopavo] Length = 1938 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 11/69 (15%) Query: 25 DPIVQFKQMKYE--------SQESKKSLSDALFK---TYPDTMDKINTVQTALRNLHNAI 73 D I+ + KYE SQ+ +SLS LFK Y +T+D + T++ +NL I Sbjct: 1452 DKIINDWKQKYEESQAELEASQKEARSLSTELFKLKNAYEETLDHLETLKRENKNLQEEI 1511 Query: 74 SKMESELKE 82 S + +++ E Sbjct: 1512 SDLTNQISE 1520 >gi|45382109|ref|NP_990097.1| myosin heavy chain, cardiac muscle isoform [Gallus gallus] gi|7248371|dbj|BAA92710.1| myosin heavy chain [Gallus gallus] Length = 1937 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 11/69 (15%) Query: 25 DPIVQFKQMKYE--------SQESKKSLSDALFK---TYPDTMDKINTVQTALRNLHNAI 73 D I+ + KYE SQ+ +SLS LFK Y +T+D + T++ +NL I Sbjct: 1451 DKIINDWKQKYEESQAELEASQKEARSLSTELFKLKNAYEETLDHLETLKRENKNLQEEI 1510 Query: 74 SKMESELKE 82 S + +++ E Sbjct: 1511 SDLTNQISE 1519 >gi|320039889|gb|EFW21823.1| vacuolar protein sorting-associated protein 27 [Coccidioides posadasii str. Silveira] Length = 701 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 47/106 (44%), Gaps = 1/106 (0%) Query: 12 LLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHN 71 L L+ P I++ Q++ E ES +L L +TY +TM K +T+ L Sbjct: 363 LATLVDRLQHQPPGTILREPQIQ-ELYESIGALRPKLARTYGETMSKYDTLLDLHAKLST 421 Query: 72 AISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 + + L+E LS+ ++ D + P ++S N + +S Sbjct: 422 VVRYYDRMLEERLSNTYAQQTLGAYDSLQPAQSSTNLYPTMSSHIS 467 >gi|303311859|ref|XP_003065941.1| VHS domain containing protein [Coccidioides posadasii C735 delta SOWgp] gi|240105603|gb|EER23796.1| VHS domain containing protein [Coccidioides posadasii C735 delta SOWgp] Length = 659 Score = 34.3 bits (77), Expect = 6.4, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 47/106 (44%), Gaps = 1/106 (0%) Query: 12 LLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHN 71 L L+ P I++ Q++ E ES +L L +TY +TM K +T+ L Sbjct: 300 LATLVDRLQHQPPGTILREPQIQ-ELYESIGALRPKLARTYGETMSKYDTLLDLHAKLST 358 Query: 72 AISKMESELKELLSDILLKRHPDEIDKINPIKNSANEISKLKEDLS 117 + + L+E LS+ ++ D + P ++S N + +S Sbjct: 359 VVRYYDRMLEERLSNTYAQQTLGAYDSLQPAQSSTNLYPTMSSHIS 404 >gi|218184546|gb|EEC66973.1| hypothetical protein OsI_33633 [Oryza sativa Indica Group] Length = 1033 Score = 33.9 bits (76), Expect = 6.6, Method: Composition-based stats. Identities = 21/67 (31%), Positives = 39/67 (58%), Gaps = 6/67 (8%) Query: 57 DKINTVQTALRNLHNAISKMESELKELL-----SDILLKRHPDEIDKINPIKNS-ANEIS 110 +K+N ++ NL NA+S+++ E++ L S++L++ DEI+ + KN NEI Sbjct: 60 EKLNELKQTKVNLENAVSELKKEVENLTEQNRSSELLIQELRDEINSLKDSKNELQNEIQ 119 Query: 111 KLKEDLS 117 L+ +S Sbjct: 120 SLRSTIS 126 >gi|227537156|ref|ZP_03967205.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC 33300] gi|227243087|gb|EEI93102.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC 33300] Length = 995 Score = 33.9 bits (76), Expect = 7.2, Method: Composition-based stats. Identities = 20/53 (37%), Positives = 31/53 (58%) Query: 33 MKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLS 85 +K ++S K SD L +T +I++++ AL NL+N+ISK S EL S Sbjct: 105 IKTNKEKSTKIKSDTLINNLRETKKEIDSLKIALFNLNNSISKWNSLKSELDS 157 >gi|118369957|ref|XP_001018181.1| hypothetical protein TTHERM_00283250 [Tetrahymena thermophila] gi|89299948|gb|EAR97936.1| hypothetical protein TTHERM_00283250 [Tetrahymena thermophila SB210] Length = 745 Score = 33.9 bits (76), Expect = 8.3, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 47/98 (47%), Gaps = 9/98 (9%) Query: 18 SCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTV--------QTALRNL 69 SC +N + ++ KQ KY +K++++D FK + + +N V +T L+++ Sbjct: 427 SCLENKQKNQIEQKQDKYNQVNTKQNINDYHFKNQQENQEILNDVCKSPQNQEKTVLKDI 486 Query: 70 HNAISKMESELKELLSDILLKRHPDEIDKINPIKNSAN 107 S ++ +L +L +H + D +N + S N Sbjct: 487 QQTSSNQINQF-DLSKSYILDKHHKQKDNMNKNQQSKN 523 >gi|254578970|ref|XP_002495471.1| ZYRO0B12144p [Zygosaccharomyces rouxii] gi|238938361|emb|CAR26538.1| ZYRO0B12144p [Zygosaccharomyces rouxii] Length = 2005 Score = 33.5 bits (75), Expect = 8.7, Method: Composition-based stats. Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 15/97 (15%) Query: 30 FKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDI-- 87 F + E Q + SLS +L + ++ +++ ++ L L K+ SE +ELL + Sbjct: 926 FSESYKELQNERDSLSRSLHEASSNSSNELTALKNELATLSAEKVKLNSEKQELLEQVSS 985 Query: 88 -------LLKRHPDEIDKINPIKNSANEISKLKEDLS 117 L KRH DE +N E+ KLKE +S Sbjct: 986 LNSSYESLSKRHQDETSDLNV------EVEKLKETVS 1016 >gi|222528290|ref|YP_002572172.1| chromosome segregation ATPase-like protein [Caldicellulosiruptor bescii DSM 6725] gi|222455137|gb|ACM59399.1| Chromosome segregation ATPase-like protein [Caldicellulosiruptor bescii DSM 6725] Length = 1350 Score = 33.5 bits (75), Expect = 8.7, Method: Composition-based stats. Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 14/82 (17%) Query: 38 QESKKSLSDALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELLSDIL--LKRHPDE 95 QES +LS++ ++ D+++ ++ +QT L+NL E++ D+L LK+ DE Sbjct: 216 QESLPALSESDLRSLSDSIENMDRIQTELKNL------------EIIKDVLEKLKKVYDE 263 Query: 96 IDKINPIKNSANEISKLKEDLS 117 +K+ ++ N + K KE S Sbjct: 264 YNKLLLAQSILNVLEKSKEYFS 285 >gi|291228352|ref|XP_002734143.1| PREDICTED: CDC42 binding protein kinase alpha-like [Saccoglossus kowalevskii] Length = 1949 Score = 33.5 bits (75), Expect = 9.2, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 47/90 (52%), Gaps = 7/90 (7%) Query: 35 YESQESKKSLSD-ALFKTYPDTMDKINTVQTALRNLHNAISKMESELKELL--SD---IL 88 YESQ S+ + + T D +D+++ +Q + R L + K+E EL SD L Sbjct: 901 YESQLSELRVKHHSALHTANDNVDRVSMLQDSERKLKGKVEKLEKELNTRCRQSDEQITL 960 Query: 89 LKRHPDEID-KINPIKNSANEISKLKEDLS 117 L+ E++ K +K+S + I L+++LS Sbjct: 961 LQEEKKEVETKFEKLKDSCSVIQDLEDELS 990 >gi|145548479|ref|XP_001459920.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427747|emb|CAK92523.1| unnamed protein product [Paramecium tetraurelia] Length = 973 Score = 33.5 bits (75), Expect = 9.9, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 53/111 (47%), Gaps = 3/111 (2%) Query: 8 FLFPLLALLGSCDDNPKDPIVQFKQMKYESQESKKSLSDALFKTYPDTMDKINTVQTALR 67 F FP+ + +N D I Q + E ++ L+D L+ ++I ++ + Sbjct: 109 FPFPIHNQPLAIQNNNNDKIKQLQNKIKEKEQEIDKLNDDLYDLTRVKNNEIKKLENKIL 168 Query: 68 NLHNAISKMESELKELLSDILLKRHPDEIDK-INPIKNSANEISKLKEDLS 117 NL+N I ++ E + +IL R +E+D+ + +KN + K+DL+ Sbjct: 169 NLNNQIDELSREEQNKDREILKLR--NELDQALQKLKNLEQDFKTQKDDLT 217 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.315 0.132 0.361 Lambda K H 0.267 0.0413 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,808,690,521 Number of Sequences: 13984884 Number of extensions: 55302510 Number of successful extensions: 204073 Number of sequences better than 10.0: 97 Number of HSP's better than 10.0 without gapping: 30 Number of HSP's successfully gapped in prelim test: 412 Number of HSP's that attempted gapping in prelim test: 203760 Number of HSP's gapped (non-prelim): 801 length of query: 117 length of database: 4,792,584,752 effective HSP length: 84 effective length of query: 33 effective length of database: 3,617,854,496 effective search space: 119389198368 effective search space used: 119389198368 T: 11 A: 40 X1: 15 ( 6.8 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.0 bits) S2: 75 (33.5 bits)