BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= 537021.9.peg.1078_1
         (62 letters)

Database: nr
           13,984,884 sequences; 4,792,584,752 total letters

Searching..................................................done

Results from round 1

>gi|315122536|ref|YP_004063025.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1]
 gi|313495938|gb|ADR52537.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1]
          Length = 455

 Score = 101 bits (252), Expect = 3e-20, Method: Compositional matrix adjust.
 Identities = 46/57 (80%), Positives = 51/57 (89%)

Query: 6   ECQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62
           +CQEWLDEF QYHR EGR+IKEK+DLICASRY LMMK FS+S+P  SSWKYTPRKVI
Sbjct: 399 DCQEWLDEFRQYHRREGRIIKEKEDLICASRYGLMMKRFSVSRPVCSSWKYTPRKVI 455

>gi|307315429|ref|ZP_07594994.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C]
 gi|306898808|gb|EFN29464.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C]
          Length = 477

 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats.
Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|15965769|ref|NP_386122.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] gi|15075038|emb|CAC46595.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] Length = 477 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|307318836|ref|ZP_07598268.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] gi|306895557|gb|EFN26311.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] Length = 477 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|227822449|ref|YP_002826421.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227341450|gb|ACP25668.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 454 Score = 77.0 bits (188), Expect = 8e-13, Method: Composition-based stats. 
Identities = 31/56 (55%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK ++ + G ++W +T RKV+ Sbjct: 399 CAEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRYARANNGNANWNFTARKVL 454 >gi|227821702|ref|YP_002825672.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227340701|gb|ACP24919.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 416 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +G+V+KE+DD+I ASRYALMMK F+ K ++WK++ RKVI Sbjct: 361 CGEWFEEFRLYHRKDGKVVKERDDVISASRYALMMKRFARVKADAAAWKFSERKVI 416 >gi|150397042|ref|YP_001327509.1| hypothetical protein Smed_1839 [Sinorhizobium medicae WSM419] gi|150028557|gb|ABR60674.1| protein of unknown function DUF264 [Sinorhizobium medicae WSM419] Length = 477 Score = 75.5 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 31/56 (55%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDL+ ASRYALMMK + + G ++WK+T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLLAASRYALMMKRHARAIGGNANWKFTARKVL 477 >gi|158422463|ref|YP_001523755.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] gi|158329352|dbj|BAF86837.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] Length = 251 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. 
Identities = 21/42 (50%), Positives = 34/42 (80%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKP 49 ++W+ EF YHR +G+++KE+DDL+ A+RYA+MMK F+ +P Sbjct: 189 EDWMSEFRLYHRKDGKIVKERDDLMSATRYAIMMKRFASPEP 230 >gi|148557334|ref|YP_001264916.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] gi|148502524|gb|ABQ70778.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] Length = 276 Score = 53.5 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 21/43 (48%), Positives = 30/43 (69%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPG 50 ++W EF YHR +G V+K DD + ASRYA+MMK F+++ P Sbjct: 218 EDWFAEFRLYHRKDGSVVKTNDDRLSASRYAMMMKRFAVTAPA 260 >gi|273810450|ref|YP_003344921.1| TerL [Xylella phage Xfas53] gi|257097825|gb|ACV41131.1| TerL [Xylella phage Xfas53] Length = 470 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 19/37 (51%), Positives = 29/37 (78%) Query: 9 EWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFS 45 EW +EF YHR +GR++K DDL+ A+RYA+MM+ ++ Sbjct: 417 EWFEEFRLYHREDGRIVKHHDDLLSATRYAMMMRRYA 453 >gi|71897556|ref|ZP_00679801.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|71732459|gb|EAO34512.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] Length = 471 Score = 52.0 bits (123), Expect = 3e-05, Method: Composition-based stats. 
Identities = 19/37 (51%), Positives = 29/37 (78%) Query: 9 EWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFS 45 EW +EF YHR +GR++K DDL+ A+RYA+MM+ ++ Sbjct: 418 EWFEEFRLYHREDGRIVKHHDDLLSATRYAMMMRRYA 454 >gi|71274675|ref|ZP_00650963.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71901596|ref|ZP_00683677.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|170730087|ref|YP_001775520.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] gi|71164407|gb|EAO14121.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71728644|gb|EAO30794.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|167964880|gb|ACA11890.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] Length = 472 Score = 47.4 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 20/42 (47%), Positives = 29/42 (69%), Gaps = 1/42 (2%) Query: 9 EWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFS-ISKP 49 EW +E YHR GR+ K DDL+ A+RYA+MM+ ++ I+ P Sbjct: 419 EWFEECSLYHRDNGRITKRHDDLLSATRYAMMMRRYAKITNP 460 >gi|167600439|ref|YP_001671939.1| terminase large subunit [Pseudomonas phage LUZ24] gi|161168302|emb|CAP45467.1| terminase large subunit [Pseudomonas phage LUZ24] Length = 482 Score = 44.3 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 17/35 (48%), Positives = 24/35 (68%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMM 41 C +L E YHR +G++I DD+I A+RYAL+M Sbjct: 416 CTNFLKEMKMYHRKDGKIIDRNDDMISATRYALLM 450 >gi|167041080|gb|ABZ05841.1| hypothetical protein ALOHA_HF400048F7ctg1g8 [uncultured marine microorganism HF4000_48F7] Length = 504 Score = 43.9 bits (102), Expect = 0.008, Method: Composition-based stats. 
Identities = 15/30 (50%), Positives = 23/30 (76%) Query: 9 EWLDEFHQYHRCEGRVIKEKDDLICASRYA 38 +W EF YHR +G+V+++ DDL+ A+RYA Sbjct: 428 QWFQEFRMYHRKDGKVVRKHDDLMSATRYA 457 >gi|27476053|ref|NP_775255.1| terminase [Pseudomonas phage PaP3] gi|27414483|gb|AAL85569.1| terminase [Pseudomonas phage PaP3] Length = 482 Score = 43.5 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 16/35 (45%), Positives = 24/35 (68%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMM 41 C +L E YHR +G+++ DD+I A+RYAL+M Sbjct: 416 CTNFLKEMKMYHRKDGKIVDRNDDMISATRYALLM 450 >gi|30061789|ref|NP_835960.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] gi|30040031|gb|AAP15765.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] Length = 124 Score = 42.4 bits (98), Expect = 0.022, Method: Compositional matrix adjust. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 60 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 99 >gi|300920006|ref|ZP_07136465.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] gi|300412953|gb|EFJ96263.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] Length = 498 Score = 42.0 bits (97), Expect = 0.027, Method: Composition-based stats. Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFSI 46 C+ + +EF YHR E G+++K DD++ A RY MM+ F+I Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDILSAVRYGYMMRRFAI 475 >gi|281599695|gb|ADA72679.1| Gp2-like protein [Shigella flexneri 2002017] Length = 441 Score = 42.0 bits (97), Expect = 0.028, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 377 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 416 >gi|327251967|gb|EGE63639.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] gi|327254495|gb|EGE66117.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] Length = 499 Score = 42.0 bits (97), Expect = 0.029, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|315299781|gb|EFU59021.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 16-3] Length = 499 Score = 42.0 bits (97), Expect = 0.029, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|323967108|gb|EGB62533.1| terminase [Escherichia coli M863] Length = 499 Score = 42.0 bits (97), Expect = 0.029, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|331657716|ref|ZP_08358678.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] gi|331055964|gb|EGI27973.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] Length = 499 Score = 42.0 bits (97), Expect = 0.031, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|293410725|ref|ZP_06654301.1| DNA-packaging protein gp2 [Escherichia coli B354] gi|291471193|gb|EFF13677.1| DNA-packaging protein gp2 [Escherichia coli B354] Length = 499 Score = 42.0 bits (97), Expect = 0.031, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|218549377|ref|YP_002383168.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|307311077|ref|ZP_07590721.1| protein of unknown function DUF264 [Escherichia coli W] gi|331669066|ref|ZP_08369914.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] gi|218356918|emb|CAQ89550.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|306908583|gb|EFN39080.1| protein of unknown function DUF264 [Escherichia coli W] gi|312945545|gb|ADR26372.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli O83:H1 str. NRG 857C] gi|315061655|gb|ADT75982.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli W] gi|323377763|gb|ADX50031.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli KO11] gi|324117758|gb|EGC11657.1| terminase [Escherichia coli E1167] gi|331064260|gb|EGI36171.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] Length = 499 Score = 42.0 bits (97), Expect = 0.031, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|110804280|ref|YP_687800.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] gi|110613828|gb|ABF02495.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] Length = 354 Score = 42.0 bits (97), Expect = 0.032, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 290 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 329 >gi|62178924|ref|YP_215341.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62126557|gb|AAX64260.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322713379|gb|EFZ04950.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 499 Score = 41.6 bits (96), Expect = 0.032, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|281599578|gb|ADA72562.1| putative terminase large subunit [Shigella flexneri 2002017] Length = 351 Score = 41.6 bits (96), Expect = 0.033, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 287 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 326 >gi|24111660|ref|NP_706170.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|24050435|gb|AAN41877.1| putative terminase large subunit [Shigella flexneri 2a str. 
301] gi|313646707|gb|EFS11166.1| DNA packaging gp2 domain protein [Shigella flexneri 2a str. 2457T] Length = 300 Score = 41.2 bits (95), Expect = 0.042, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 236 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 275 >gi|137993|sp|P16938|VG2_BPLP7 RecName: Full=Protein GP2 gi|75884|pir||Z2BPL7 gene 2 protein - phage LP-7 (fragment) gi|553003|gb|AAA88220.1| packaging glycoprotein [Enterobacteria phage LP7] Length = 475 Score = 40.8 bits (94), Expect = 0.062, Method: Compositional matrix adjust. Identities = 16/38 (42%), Positives = 27/38 (71%), Gaps = 1/38 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKI 43 C+ + +EF YHR E G+++K DD++ A+RY MM++ Sbjct: 431 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGYMMRL 468 >gi|89885991|ref|YP_516188.1| phage terminase large subunit [Sodalis phage phiSG1] gi|89191726|dbj|BAE80473.1| phage terminase large subunit [Sodalis phage phiSG1] gi|125470018|gb|ABN42210.1| gp02 [Sodalis phage phiSG1] Length = 475 Score = 38.9 bits (89), Expect = 0.24, Method: Composition-based stats. Identities = 16/40 (40%), Positives = 30/40 (75%), Gaps = 1/40 (2%) Query: 8 QEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFSI 46 +++ DE++ YHR E R++K +DD++ A RYA MM+ +++ Sbjct: 414 RDFFDEYNFYHRDEKSRIVKMRDDILDAVRYAYMMRRYAV 453 >gi|215304|gb|AAA72960.1| unnamed protein product [Enterobacteria phage P22] Length = 101 Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 37 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 69 >gi|219681243|ref|YP_002455888.1| Gp2 [Salmonella enterica bacteriophage SE1] gi|66473858|gb|AAY46504.1| Gp2 [Salmonella phage SE1] Length = 499 Score = 34.7 bits (78), Expect = 3.9, Method: Composition-based stats. 
Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|168240109|ref|ZP_02665041.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194451817|ref|YP_002044341.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194410121|gb|ACF70340.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|205340165|gb|EDZ26929.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] Length = 499 Score = 34.7 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|24371583|ref|NP_720326.1| gp2 [Enterobacteria phage ST64T] gi|24250810|gb|AAL15523.1| gp2 [Salmonella phage ST64T] Length = 517 Score = 34.7 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 453 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 485 >gi|60476789|gb|AAX21426.1| gp2 [Enterobacteria phage L] Length = 499 Score = 34.7 bits (78), Expect = 4.0, Method: Composition-based stats. 
Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|51236724|ref|YP_063734.1| terminase large subunit [Enterobacteria phage P22] gi|137879|sp|P26745|TERL_BPP22 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein gp2; AltName: Full=Terminase large subunit gi|21914414|gb|AAM81379.1|AF527608_1 terminase large subunit [Salmonella phage P22-pbi] gi|553005|gb|AAA72959.1| DNA pacaging [Enterobacteria phage P22] gi|8439622|gb|AAF75044.1| terminase large subunit [Enterobacteria phage P22] gi|28394263|tpg|DAA00977.1| TPA_inf: terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 34.7 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|318065950|ref|YP_004123808.1| Gp2 [Salmonella phage ST160] gi|289066936|gb|ADC81147.1| Gp2 [Salmonella phage ST160] Length = 517 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 453 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 485 >gi|46358697|ref|YP_006405.1| Gp2 [Enterobacteria phage ST104] gi|46357933|dbj|BAD15212.1| Gp2 [Enterobacteria phage ST104] gi|312911340|dbj|BAJ35314.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 499 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. 
Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|238912312|ref|ZP_04656149.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|261245593|emb|CBG23388.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 499 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|221328620|ref|YP_002533461.1| Terminase, large subunit [Salmonella phage epsilon34] gi|255252684|ref|YP_003090219.1| Terminase, large subunit [Salmonella phage c341] gi|193244688|gb|ACF16628.1| Terminase, large subunit [Salmonella phage epsilon34] gi|223697657|gb|ACN18281.1| Terminase, large subunit [Salmonella phage g341c] Length = 499 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|198245578|ref|YP_002214540.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940094|gb|ACH77427.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 499 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|161504537|ref|YP_001571649.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. 
RSK2980] gi|160865884|gb|ABX22507.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 499 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|326622293|gb|EGE28638.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 482 Score = 34.7 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 418 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 450 >gi|197363441|ref|YP_002143078.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|197094918|emb|CAR60455.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|320086843|emb|CBY96615.1| DNA packaging protein gp2 Terminase large subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 499 Score = 34.7 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|157734711|dbj|BAF80717.1| terminase large subunit [Enterobacteria phage P22] gi|169658843|dbj|BAG12600.1| terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 34.7 bits (78), Expect = 4.3, Method: Composition-based stats. 
Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 467 >gi|94317806|gb|ABF15069.1| terminase large subunit Gp2 [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 278 Score = 34.7 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 218 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 250 >gi|321225021|gb|EFX50082.1| Phage terminase, large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 267 Score = 34.3 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYA 38 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 203 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYG 235 Searching..................................................done Results from round 2 >gi|315122536|ref|YP_004063025.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495938|gb|ADR52537.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 455 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 46/57 (80%), Positives = 51/57 (89%) Query: 6 ECQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 +CQEWLDEF QYHR EGR+IKEK+DLICASRY LMMK FS+S+P SSWKYTPRKVI Sbjct: 399 DCQEWLDEFRQYHRREGRIIKEKEDLICASRYGLMMKRFSVSRPVCSSWKYTPRKVI 455 >gi|307315429|ref|ZP_07594994.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] gi|306898808|gb|EFN29464.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] Length = 477 Score = 107 bits (268), Expect = 5e-22, Method: Composition-based stats. 
Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|15965769|ref|NP_386122.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] gi|15075038|emb|CAC46595.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] Length = 477 Score = 107 bits (267), Expect = 5e-22, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|307318836|ref|ZP_07598268.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] gi|306895557|gb|EFN26311.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] Length = 477 Score = 107 bits (267), Expect = 5e-22, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|227822449|ref|YP_002826421.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227341450|gb|ACP25668.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 454 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. 
Identities = 31/56 (55%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK ++ + G ++W +T RKV+ Sbjct: 399 CAEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRYARANNGNANWNFTARKVL 454 >gi|150397042|ref|YP_001327509.1| hypothetical protein Smed_1839 [Sinorhizobium medicae WSM419] gi|150028557|gb|ABR60674.1| protein of unknown function DUF264 [Sinorhizobium medicae WSM419] Length = 477 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 31/56 (55%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDL+ ASRYALMMK + + G ++WK+T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLLAASRYALMMKRHARAIGGNANWKFTARKVL 477 >gi|227821702|ref|YP_002825672.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227340701|gb|ACP24919.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 416 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +G+V+KE+DD+I ASRYALMMK F+ K ++WK++ RKVI Sbjct: 361 CGEWFEEFRLYHRKDGKVVKERDDVISASRYALMMKRFARVKADAAAWKFSERKVI 416 >gi|273810450|ref|YP_003344921.1| TerL [Xylella phage Xfas53] gi|257097825|gb|ACV41131.1| TerL [Xylella phage Xfas53] Length = 470 Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 19/40 (47%), Positives = 29/40 (72%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSIS 47 EW +EF YHR +GR++K DDL+ A+RYA+MM+ ++ Sbjct: 416 TEWFEEFRLYHREDGRIVKHHDDLLSATRYAMMMRRYAKP 455 >gi|71897556|ref|ZP_00679801.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|71732459|gb|EAO34512.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] Length = 471 Score = 78.2 bits (191), Expect = 4e-13, Method: Composition-based stats. 
Identities = 19/40 (47%), Positives = 29/40 (72%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSIS 47 EW +EF YHR +GR++K DDL+ A+RYA+MM+ ++ Sbjct: 417 TEWFEEFRLYHREDGRIVKHHDDLLSATRYAMMMRRYAKP 456 >gi|158422463|ref|YP_001523755.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] gi|158329352|dbj|BAF86837.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] Length = 251 Score = 77.8 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 21/42 (50%), Positives = 34/42 (80%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKP 49 ++W+ EF YHR +G+++KE+DDL+ A+RYA+MMK F+ +P Sbjct: 189 EDWMSEFRLYHRKDGKIVKERDDLMSATRYAIMMKRFASPEP 230 >gi|148557334|ref|YP_001264916.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] gi|148502524|gb|ABQ70778.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] Length = 276 Score = 77.4 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 21/44 (47%), Positives = 30/44 (68%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGY 51 ++W EF YHR +G V+K DD + ASRYA+MMK F+++ P Sbjct: 218 EDWFAEFRLYHRKDGSVVKTNDDRLSASRYAMMMKRFAVTAPAA 261 >gi|71274675|ref|ZP_00650963.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71901596|ref|ZP_00683677.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|170730087|ref|YP_001775520.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] gi|71164407|gb|EAO14121.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71728644|gb|EAO30794.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|167964880|gb|ACA11890.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] Length = 472 Score = 74.3 bits (181), Expect = 5e-12, Method: Composition-based stats. 
Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 2/51 (3%) Query: 9 EWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFS-ISKPGY-SSWKYT 57 EW +E YHR GR+ K DDL+ A+RYA+MM+ ++ I+ P + ++YT Sbjct: 419 EWFEECSLYHRDNGRITKRHDDLLSATRYAMMMRRYAKITNPVQIAVYEYT 469 >gi|167041080|gb|ABZ05841.1| hypothetical protein ALOHA_HF400048F7ctg1g8 [uncultured marine microorganism HF4000_48F7] Length = 504 Score = 65.1 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 6/52 (11%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPR 59 +W EF YHR +G+V+++ DDL+ A+RYA ++ + + PR Sbjct: 427 DQWFQEFRMYHRKDGKVVRKHDDLMSATRYACQSLRYATTA------NFQPR 472 >gi|300920006|ref|ZP_07136465.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] gi|300412953|gb|EFJ96263.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] Length = 498 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFSI 46 C+ + +EF YHR E G+++K DD++ A RY MM+ F+I Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDILSAVRYGYMMRRFAI 475 >gi|281599695|gb|ADA72679.1| Gp2-like protein [Shigella flexneri 2002017] Length = 441 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 377 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 416 >gi|327251967|gb|EGE63639.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] gi|327254495|gb|EGE66117.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] Length = 499 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|315299781|gb|EFU59021.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 16-3] Length = 499 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|331657716|ref|ZP_08358678.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] gi|331055964|gb|EGI27973.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] Length = 499 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|323967108|gb|EGB62533.1| terminase [Escherichia coli M863] Length = 499 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|293410725|ref|ZP_06654301.1| DNA-packaging protein gp2 [Escherichia coli B354] gi|291471193|gb|EFF13677.1| DNA-packaging protein gp2 [Escherichia coli B354] Length = 499 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|218549377|ref|YP_002383168.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|307311077|ref|ZP_07590721.1| protein of unknown function DUF264 [Escherichia coli W] gi|331669066|ref|ZP_08369914.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] gi|218356918|emb|CAQ89550.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|306908583|gb|EFN39080.1| protein of unknown function DUF264 [Escherichia coli W] gi|312945545|gb|ADR26372.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli O83:H1 str. NRG 857C] gi|315061655|gb|ADT75982.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli W] gi|323377763|gb|ADX50031.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli KO11] gi|324117758|gb|EGC11657.1| terminase [Escherichia coli E1167] gi|331064260|gb|EGI36171.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] Length = 499 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|62178924|ref|YP_215341.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62126557|gb|AAX64260.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322713379|gb|EFZ04950.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 499 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 474 >gi|110804280|ref|YP_687800.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] gi|110613828|gb|ABF02495.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] Length = 354 Score = 62.4 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 290 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 329 >gi|281599578|gb|ADA72562.1| putative terminase large subunit [Shigella flexneri 2002017] Length = 351 Score = 62.4 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 287 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 326 >gi|24111660|ref|NP_706170.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|24050435|gb|AAN41877.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|313646707|gb|EFS11166.1| DNA packaging gp2 domain protein [Shigella flexneri 2a str. 2457T] Length = 300 Score = 61.6 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 236 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 275 >gi|30061789|ref|NP_835960.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] gi|30040031|gb|AAP15765.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] Length = 124 Score = 60.1 bits (144), Expect = 9e-08, Method: Composition-based stats. 
Identities = 18/40 (45%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFS 45 C+ + +EF YHR E G+++K DD++ A RYA MM+ F+ Sbjct: 60 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFA 99 >gi|27476053|ref|NP_775255.1| terminase [Pseudomonas phage PaP3] gi|27414483|gb|AAL85569.1| terminase [Pseudomonas phage PaP3] Length = 482 Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMM-KIFSISKPGYSSWKY 56 C +L E YHR +G+++ DD+I A+RYAL+M + +S Y Sbjct: 416 CTNFLKEMKMYHRKDGKIVDRNDDMISATRYALLMASRHARPGAVRNSGYY 466 >gi|167600439|ref|YP_001671939.1| terminase large subunit [Pseudomonas phage LUZ24] gi|161168302|emb|CAP45467.1| terminase large subunit [Pseudomonas phage LUZ24] Length = 482 Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMM-KIFSISKPGYSSWKY 56 C +L E YHR +G++I DD+I A+RYAL+M + +S Y Sbjct: 416 CTNFLKEMKMYHRKDGKIIDRNDDMISATRYALLMASRHARPGAVRNSGYY 466 >gi|137993|sp|P16938|VG2_BPLP7 RecName: Full=Protein GP2 gi|75884|pir||Z2BPL7 gene 2 protein - phage LP-7 (fragment) gi|553003|gb|AAA88220.1| packaging glycoprotein [Enterobacteria phage LP7] Length = 475 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 16/37 (43%), Positives = 26/37 (70%), Gaps = 1/37 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMK 42 C+ + +EF YHR E G+++K DD++ A+RY MM+ Sbjct: 431 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGYMMR 467 >gi|89885991|ref|YP_516188.1| phage terminase large subunit [Sodalis phage phiSG1] gi|89191726|dbj|BAE80473.1| phage terminase large subunit [Sodalis phage phiSG1] gi|125470018|gb|ABN42210.1| gp02 [Sodalis phage phiSG1] Length = 475 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. 
Identities = 16/46 (34%), Positives = 30/46 (65%), Gaps = 1/46 (2%) Query: 9 EWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFSISKPGYSS 53 ++ DE++ YHR E R++K +DD++ A RYA MM+ +++ + Sbjct: 415 DFFDEYNFYHRDEKSRIVKMRDDILDAVRYAYMMRRYAVRYADVKN 460 >gi|219681243|ref|YP_002455888.1| Gp2 [Salmonella enterica bacteriophage SE1] gi|66473858|gb|AAY46504.1| Gp2 [Salmonella phage SE1] Length = 499 Score = 51.2 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|60476789|gb|AAX21426.1| gp2 [Enterobacteria phage L] Length = 499 Score = 51.2 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|24371583|ref|NP_720326.1| gp2 [Enterobacteria phage ST64T] gi|24250810|gb|AAL15523.1| gp2 [Salmonella phage ST64T] Length = 517 Score = 51.2 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 453 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 486 >gi|238912312|ref|ZP_04656149.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|261245593|emb|CBG23388.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. 
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|168240109|ref|ZP_02665041.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194451817|ref|YP_002044341.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194410121|gb|ACF70340.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|205340165|gb|EDZ26929.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|46358697|ref|YP_006405.1| Gp2 [Enterobacteria phage ST104] gi|46357933|dbj|BAD15212.1| Gp2 [Enterobacteria phage ST104] gi|312911340|dbj|BAJ35314.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|318065950|ref|YP_004123808.1| Gp2 [Salmonella phage ST160] gi|289066936|gb|ADC81147.1| Gp2 [Salmonella phage ST160] Length = 517 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. 
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 453 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 486 >gi|221328620|ref|YP_002533461.1| Terminase, large subunit [Salmonella phage epsilon34] gi|255252684|ref|YP_003090219.1| Terminase, large subunit [Salmonella phage c341] gi|193244688|gb|ACF16628.1| Terminase, large subunit [Salmonella phage epsilon34] gi|223697657|gb|ACN18281.1| Terminase, large subunit [Salmonella phage g341c] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|51236724|ref|YP_063734.1| terminase large subunit [Enterobacteria phage P22] gi|137879|sp|P26745|TERL_BPP22 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein gp2; AltName: Full=Terminase large subunit gi|21914414|gb|AAM81379.1|AF527608_1 terminase large subunit [Salmonella phage P22-pbi] gi|553005|gb|AAA72959.1| DNA pacaging [Enterobacteria phage P22] gi|8439622|gb|AAF75044.1| terminase large subunit [Enterobacteria phage P22] gi|28394263|tpg|DAA00977.1| TPA_inf: terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|326622293|gb|EGE28638.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 482 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. 
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 418 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 451 >gi|197363441|ref|YP_002143078.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|197094918|emb|CAR60455.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|320086843|emb|CBY96615.1| DNA packaging protein gp2 Terminase large subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|198245578|ref|YP_002214540.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940094|gb|ACH77427.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|161504537|ref|YP_001571649.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|160865884|gb|ABX22507.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. 
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|157734711|dbj|BAF80717.1| terminase large subunit [Enterobacteria phage P22] gi|169658843|dbj|BAG12600.1| terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|94317806|gb|ABF15069.1| terminase large subunit Gp2 [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 278 Score = 50.8 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 218 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 251 >gi|215304|gb|AAA72960.1| unnamed protein product [Enterobacteria phage P22] Length = 101 Score = 50.4 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 37 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 70 >gi|321225021|gb|EFX50082.1| Phage terminase, large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 267 Score = 50.1 bits (118), Expect = 1e-04, Method: Composition-based stats. 
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRCE-GRVIKEKDDLICASRYAL 39 C+ + +EF YHR E G+++K DD++ A+RY Sbjct: 203 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 236 >gi|167583562|ref|YP_001671752.1| terminase large subunit [Enterobacteria phage phiEco32] gi|164375400|gb|ABY52808.1| terminase large subunit [Enterobacteria phage phiEco32] Length = 513 Score = 42.4 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 14/28 (50%), Positives = 20/28 (71%) Query: 11 LDEFHQYHRCEGRVIKEKDDLICASRYA 38 +E +YHR G++IKE DDL+ A RY+ Sbjct: 453 FEEKARYHRKVGKIIKEHDDLMDAMRYS 480 >gi|264678784|ref|YP_003278691.1| phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] gi|262209297|gb|ACY33395.1| putative phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] Length = 434 Score = 38.9 bits (89), Expect = 0.22, Method: Composition-based stats. Identities = 11/33 (33%), Positives = 23/33 (69%), Gaps = 1/33 (3%) Query: 9 EWLDEFHQYHRCE-GRVIKEKDDLICASRYALM 40 +WL+E+ Y R + G+++K+ D + A+RY ++ Sbjct: 218 DWLNEYRIYRRDDKGQIVKKDDHAMDATRYLIV 250 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.310 0.134 0.411 Lambda K H 0.267 0.0420 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 
1,234,438,582
Number of Sequences: 13984884
Number of extensions: 32966553
Number of successful extensions: 91789
Number of sequences better than 10.0: 49
Number of HSP's better than 10.0 without gapping: 63
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 91662
Number of HSP's gapped (non-prelim): 96
length of query: 62
length of database: 4,792,584,752
effective HSP length: 34
effective length of query: 28
effective length of database: 4,317,098,696
effective search space: 120878763488
effective search space used: 120878763488
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (20.8 bits)
S2: 76 (33.8 bits)
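The footer figures above can be cross-checked by hand. The following is a minimal sketch, not part of the BLAST output, and it rests on several assumptions: that the effective lengths are obtained by subtracting the effective HSP length from the query and from every database sequence, that the first "Lambda K H" row holds the ungapped parameters and the second the gapped ones, and that raw scores convert to bits as (lambda * S - ln K) / ln 2. The function and constant names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 62
DB_LEN = 4_792_584_752
NUM_SEQS = 13_984_884
HSP_LEN = 34

# Assumption: first "Lambda K H" row is ungapped, second is gapped.
UNGAPPED = (0.310, 0.134)
GAPPED = (0.267, 0.0420)


def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # Assumption: the HSP length is subtracted once per database sequence.
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X drop-off value to bits: lambda * X / ln 2."""
    return lam * raw_dropoff / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LEN, NUM_SEQS, HSP_LEN
)
print("effective length of query:", eff_query)
print("effective length of database:", eff_db)
print("effective search space:", space)
print("S1 bits: %.1f" % bit_score(40, *UNGAPPED))
print("S2 bits: %.1f" % bit_score(76, *GAPPED))
print("X1 bits: %.1f" % dropoff_bits(16, UNGAPPED[0]))
print("X2 bits: %.1f" % dropoff_bits(38, GAPPED[0]))
print("X3 bits: %.1f" % dropoff_bits(64, GAPPED[0]))
```

Under these assumptions the three integer results match the footer (28, 4,317,098,696 and 120878763488), and the S1, S2 and X1 to X3 conversions agree with the bit values printed in parentheses to the one decimal shown. The per-hit bit scores in the alignments are not expected to match as closely, since those hits are reported with compositional adjustment.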