BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1078_1 (62 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done >gi|307315429|ref|ZP_07594994.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] gi|306898808|gb|EFN29464.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] Length = 477 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|15965769|ref|NP_386122.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] gi|15075038|emb|CAC46595.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] Length = 477 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|307318836|ref|ZP_07598268.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] gi|306895557|gb|EFN26311.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] Length = 477 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK + + GY++W +T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRHARANNGYANWNFTARKVL 477 >gi|227821702|ref|YP_002825672.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227340701|gb|ACP24919.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 416 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. Identities = 32/56 (57%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +G+V+KE+DD+I ASRYALMMK F+ K ++WK++ RKVI Sbjct: 361 CGEWFEEFRLYHRKDGKVVKERDDVISASRYALMMKRFARVKADAAAWKFSERKVI 416 >gi|227822449|ref|YP_002826421.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227341450|gb|ACP25668.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 454 Score = 99.7 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 31/56 (55%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDLI ASRYALMMK ++ + G ++W +T RKV+ Sbjct: 399 CAEWFEEFRLYHRKDGRIVKERDDLISASRYALMMKRYARANNGNANWNFTARKVL 454 >gi|150397042|ref|YP_001327509.1| hypothetical protein Smed_1839 [Sinorhizobium medicae WSM419] gi|150028557|gb|ABR60674.1| protein of unknown function DUF264 [Sinorhizobium medicae WSM419] Length = 477 Score = 96.6 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 31/56 (55%), Positives = 43/56 (76%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 C EW +EF YHR +GR++KE+DDL+ ASRYALMMK + + G ++WK+T RKV+ Sbjct: 422 CTEWFEEFRLYHRKDGRIVKERDDLLAASRYALMMKRHARAIGGNANWKFTARKVL 477 >gi|315122536|ref|YP_004063025.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495938|gb|ADR52537.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 455 Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 46/57 (80%), Positives = 51/57 (89%) Query: 6 ECQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSSWKYTPRKVI 62 +CQEWLDEF QYHR EGR+IKEK+DLICASRY LMMK FS+S+P SSWKYTPRKVI Sbjct: 399 DCQEWLDEFRQYHRREGRIIKEKEDLICASRYGLMMKRFSVSRPVCSSWKYTPRKVI 455 >gi|167041080|gb|ABZ05841.1| hypothetical protein ALOHA_HF400048F7ctg1g8 [uncultured marine microorganism HF4000_48F7] Length = 504 Score = 80.5 bits (197), Expect = 7e-14, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 26/45 (57%) Query: 9 EWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYSS 53 +W EF YHR +G+V+++ DDL+ A+RYA ++ + Sbjct: 428 QWFQEFRMYHRKDGKVVRKHDDLMSATRYACQSLRYATTANFQPR 472 >gi|71897556|ref|ZP_00679801.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|71732459|gb|EAO34512.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] Length = 471 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 19/45 (42%), Positives = 30/45 (66%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYS 52 EW +EF YHR +GR++K DDL+ A+RYA+MM+ ++ + Sbjct: 417 TEWFEEFRLYHREDGRIVKHHDDLLSATRYAMMMRRYAKPPRVAT 461 >gi|273810450|ref|YP_003344921.1| TerL [Xylella phage Xfas53] gi|257097825|gb|ACV41131.1| TerL [Xylella phage Xfas53] Length = 470 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 19/40 (47%), Positives = 29/40 (72%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSIS 47 EW +EF YHR +GR++K DDL+ A+RYA+MM+ ++ Sbjct: 416 TEWFEEFRLYHREDGRIVKHHDDLLSATRYAMMMRRYAKP 455 >gi|158422463|ref|YP_001523755.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] gi|158329352|dbj|BAF86837.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] Length = 251 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 21/45 (46%), Positives = 34/45 (75%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYS 52 ++W+ EF YHR +G+++KE+DDL+ A+RYA+MMK F+ +P Sbjct: 189 EDWMSEFRLYHRKDGKIVKERDDLMSATRYAIMMKRFASPEPTGP 233 >gi|148557334|ref|YP_001264916.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] gi|148502524|gb|ABQ70778.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] Length = 276 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 21/45 (46%), Positives = 30/45 (66%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKPGYS 52 ++W EF YHR +G V+K DD + ASRYA+MMK F+++ P Sbjct: 218 EDWFAEFRLYHRKDGSVVKTNDDRLSASRYAMMMKRFAVTAPAAR 262 >gi|71274675|ref|ZP_00650963.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71901596|ref|ZP_00683677.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|170730087|ref|YP_001775520.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] gi|71164407|gb|EAO14121.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71728644|gb|EAO30794.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|167964880|gb|ACA11890.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] Length = 472 Score = 74.3 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 18/42 (42%), Positives = 26/42 (61%) Query: 8 QEWLDEFHQYHRCEGRVIKEKDDLICASRYALMMKIFSISKP 49 EW +E YHR GR+ K DDL+ A+RYA+MM+ ++ Sbjct: 418 AEWFEECSLYHRDNGRITKRHDDLLSATRYAMMMRRYAKITN 459 >gi|281599695|gb|ADA72679.1| Gp2-like protein [Shigella flexneri 2002017] Length = 441 Score = 73.5 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 377 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 421 >gi|327251967|gb|EGE63639.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] gi|327254495|gb|EGE66117.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] Length = 499 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 479 >gi|315299781|gb|EFU59021.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 16-3] Length = 499 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 479 >gi|331657716|ref|ZP_08358678.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] gi|331055964|gb|EGI27973.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] Length = 499 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 479 >gi|293410725|ref|ZP_06654301.1| DNA-packaging protein gp2 [Escherichia coli B354] gi|291471193|gb|EFF13677.1| DNA-packaging protein gp2 [Escherichia coli B354] Length = 499 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 479 >gi|62178924|ref|YP_215341.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62126557|gb|AAX64260.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322713379|gb|EFZ04950.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 499 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 479 >gi|323967108|gb|EGB62533.1| terminase [Escherichia coli M863] Length = 499 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 479 >gi|218549377|ref|YP_002383168.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|307311077|ref|ZP_07590721.1| protein of unknown function DUF264 [Escherichia coli W] gi|331669066|ref|ZP_08369914.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] gi|218356918|emb|CAQ89550.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|306908583|gb|EFN39080.1| protein of unknown function DUF264 [Escherichia coli W] gi|312945545|gb|ADR26372.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli O83:H1 str. NRG 857C] gi|315061655|gb|ADT75982.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli W] gi|323377763|gb|ADX50031.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli KO11] gi|324117758|gb|EGC11657.1| terminase [Escherichia coli E1167] gi|331064260|gb|EGI36171.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] Length = 499 Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 479 >gi|281599578|gb|ADA72562.1| putative terminase large subunit [Shigella flexneri 2002017] Length = 351 Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 287 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 331 >gi|110804280|ref|YP_687800.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] gi|110613828|gb|ABF02495.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] Length = 354 Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 290 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 334 >gi|24111660|ref|NP_706170.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|24050435|gb|AAN41877.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|313646707|gb|EFS11166.1| DNA packaging gp2 domain protein [Shigella flexneri 2a str. 2457T] Length = 300 Score = 71.6 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 236 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 280 >gi|300920006|ref|ZP_07136465.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] gi|300412953|gb|EFJ96263.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] Length = 498 Score = 71.6 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RY MM+ F+I Sbjct: 435 CEPFFEEFRLYHRDENGKIVKLNDDILSAVRYGYMMRRFAIQMRD 479 >gi|30061789|ref|NP_835960.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] gi|30040031|gb|AAP15765.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] Length = 124 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMKIFSISKPG 50 C+ + +EF YHR G+++K DD++ A RYA MM+ F+ Sbjct: 60 CEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAKMMRD 104 >gi|27476053|ref|NP_775255.1| terminase [Pseudomonas phage PaP3] gi|27414483|gb|AAL85569.1| terminase [Pseudomonas phage PaP3] Length = 482 Score = 68.9 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 16/43 (37%), Positives = 25/43 (58%), Gaps = 1/43 (2%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMM-KIFSISK 48 C +L E YHR +G+++ DD+I A+RYAL+M + Sbjct: 416 CTNFLKEMKMYHRKDGKIVDRNDDMISATRYALLMASRHARPG 458 >gi|167600439|ref|YP_001671939.1| terminase large subunit [Pseudomonas phage LUZ24] gi|161168302|emb|CAP45467.1| terminase large subunit [Pseudomonas phage LUZ24] Length = 482 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 1/43 (2%) Query: 7 CQEWLDEFHQYHRCEGRVIKEKDDLICASRYALMM-KIFSISK 48 C +L E YHR +G++I DD+I A+RYAL+M + Sbjct: 416 CTNFLKEMKMYHRKDGKIIDRNDDMISATRYALLMASRHARPG 458 >gi|137993|sp|P16938|VG2_BPLP7 RecName: Full=Protein GP2 gi|75884|pir||Z2BPL7 gene 2 protein - phage LP-7 (fragment) gi|553003|gb|AAA88220.1| packaging glycoprotein [Enterobacteria phage LP7] Length = 475 Score = 63.9 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 25/37 (67%), Gaps = 1/37 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYALMMK 42 C+ + +EF YHR G+++K DD++ A+RY MM+ Sbjct: 431 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGYMMR 467 >gi|89885991|ref|YP_516188.1| phage terminase large subunit [Sodalis phage phiSG1] gi|89191726|dbj|BAE80473.1| phage terminase large subunit [Sodalis phage phiSG1] gi|125470018|gb|ABN42210.1| gp02 [Sodalis phage phiSG1] Length = 475 Score = 61.2 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 16/39 (41%), Positives = 29/39 (74%), Gaps = 1/39 (2%) Query: 9 EWLDEFHQYHRCE-GRVIKEKDDLICASRYALMMKIFSI 46 ++ DE++ YHR E R++K +DD++ A RYA MM+ +++ Sbjct: 415 DFFDEYNFYHRDEKSRIVKMRDDILDAVRYAYMMRRYAV 453 >gi|219681243|ref|YP_002455888.1| Gp2 [Salmonella enterica bacteriophage SE1] gi|66473858|gb|AAY46504.1| Gp2 [Salmonella phage SE1] Length = 499 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|60476789|gb|AAX21426.1| gp2 [Enterobacteria phage L] Length = 499 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|24371583|ref|NP_720326.1| gp2 [Enterobacteria phage ST64T] gi|24250810|gb|AAL15523.1| gp2 [Salmonella phage ST64T] Length = 517 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 453 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 486 >gi|238912312|ref|ZP_04656149.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|261245593|emb|CBG23388.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 499 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|221328620|ref|YP_002533461.1| Terminase, large subunit [Salmonella phage epsilon34] gi|255252684|ref|YP_003090219.1| Terminase, large subunit [Salmonella phage c341] gi|193244688|gb|ACF16628.1| Terminase, large subunit [Salmonella phage epsilon34] gi|223697657|gb|ACN18281.1| Terminase, large subunit [Salmonella phage g341c] Length = 499 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|46358697|ref|YP_006405.1| Gp2 [Enterobacteria phage ST104] gi|46357933|dbj|BAD15212.1| Gp2 [Enterobacteria phage ST104] gi|312911340|dbj|BAJ35314.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 499 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|326622293|gb|EGE28638.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 482 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 418 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 451 >gi|318065950|ref|YP_004123808.1| Gp2 [Salmonella phage ST160] gi|289066936|gb|ADC81147.1| Gp2 [Salmonella phage ST160] Length = 517 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 453 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 486 >gi|197363441|ref|YP_002143078.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|197094918|emb|CAR60455.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|320086843|emb|CBY96615.1| DNA packaging protein gp2 Terminase large subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 499 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|198245578|ref|YP_002214540.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940094|gb|ACH77427.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 499 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|168240109|ref|ZP_02665041.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194451817|ref|YP_002044341.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194410121|gb|ACF70340.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|205340165|gb|EDZ26929.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] Length = 499 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|51236724|ref|YP_063734.1| terminase large subunit [Enterobacteria phage P22] gi|137879|sp|P26745|TERL_BPP22 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein gp2; AltName: Full=Terminase large subunit gi|21914414|gb|AAM81379.1|AF527608_1 terminase large subunit [Salmonella phage P22-pbi] gi|553005|gb|AAA72959.1| DNA pacaging [Enterobacteria phage P22] gi|8439622|gb|AAF75044.1| terminase large subunit [Enterobacteria phage P22] gi|28394263|tpg|DAA00977.1| TPA_inf: terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|157734711|dbj|BAF80717.1| terminase large subunit [Enterobacteria phage P22] gi|169658843|dbj|BAG12600.1| terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|94317806|gb|ABF15069.1| terminase large subunit Gp2 [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 278 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 218 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 251 >gi|161504537|ref|YP_001571649.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|160865884|gb|ABX22507.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 499 Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 435 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 468 >gi|215304|gb|AAA72960.1| unnamed protein product [Enterobacteria phage P22] Length = 101 Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 37 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 70 >gi|321225021|gb|EFX50082.1| Phage terminase, large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 267 Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 CQEWLDEFHQYHRC-EGRVIKEKDDLICASRYAL 39 C+ + +EF YHR G+++K DD++ A+RY Sbjct: 203 CEPFFEEFRLYHRDENGKIVKTNDDVLDATRYGY 236 >gi|167583562|ref|YP_001671752.1| terminase large subunit [Enterobacteria phage phiEco32] gi|164375400|gb|ABY52808.1| terminase large subunit [Enterobacteria phage phiEco32] Length = 513 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 14/34 (41%), Positives = 20/34 (58%) Query: 11 LDEFHQYHRCEGRVIKEKDDLICASRYALMMKIF 44 +E +YHR G++IKE DDL+ A RY+ Sbjct: 453 FEEKARYHRKVGKIIKEHDDLMDAMRYSACSVTH 486 >gi|264678784|ref|YP_003278691.1| phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] gi|262209297|gb|ACY33395.1| putative phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] Length = 434 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 11/30 (36%), Positives = 21/30 (70%), Gaps = 1/30 (3%) Query: 9 EWLDEFHQYHRCE-GRVIKEKDDLICASRY 37 +WL+E+ Y R + G+++K+ D + A+RY Sbjct: 218 DWLNEYRIYRRDDKGQIVKKDDHAMDATRY 247 >gi|309804997|ref|ZP_07699054.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 09V1-c] gi|308165656|gb|EFO67882.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 09V1-c] Length = 409 Score = 38.1 bits (87), Expect = 0.40, Method: Composition-based stats. Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 2/33 (6%) Query: 9 EWLDEFHQY--HRCEGRVIKEKDDLICASRYAL 39 W +E +QY + G +KE DD + A RYA+ Sbjct: 360 NWFNEVYQYIWNEKTGEPVKENDDSMDAMRYAI 392 >gi|309804260|ref|ZP_07698337.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 11V1-d] gi|309808639|ref|ZP_07702531.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 01V1-a] gi|309809485|ref|ZP_07703343.1| phage terminase, large subunit, PBSX family [Lactobacillus iners SPIN 2503V10-D] gi|308163663|gb|EFO65933.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 11V1-d] gi|308168113|gb|EFO70239.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 01V1-a] gi|308170157|gb|EFO72192.1| phage terminase, large subunit, PBSX family [Lactobacillus iners SPIN 2503V10-D] Length = 409 Score = 37.3 bits (85), Expect = 0.63, Method: Composition-based stats. Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 2/33 (6%) Query: 9 EWLDEFHQY--HRCEGRVIKEKDDLICASRYAL 39 W +E +QY + G +KE DD + A RYA+ Sbjct: 360 NWFNEVYQYIWNEKTGEPVKENDDSMDAMRYAI 392 >gi|49146380|ref|YP_025488.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] gi|40458348|gb|AAR87096.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] Length = 474 Score = 36.9 bits (84), Expect = 0.98, Method: Composition-based stats. Identities = 15/41 (36%), Positives = 19/41 (46%), Gaps = 2/41 (4%) Query: 9 EWLDEFHQYHRCE-GRVIKEKDDLICASRYALMM-KIFSIS 47 L EF Y R E G V K D L+ RY +M + + S Sbjct: 416 NTLKEFRMYSRDEKGHVKKGNDHLMDCLRYIVMSGERLAKS 456 >gi|309806821|ref|ZP_07700810.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 03V1-b] gi|308166795|gb|EFO68985.1| phage terminase, large subunit, PBSX family [Lactobacillus iners LactinV 03V1-b] Length = 409 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 2/33 (6%) Query: 9 EWLDEFHQY--HRCEGRVIKEKDDLICASRYAL 39 WL+E +QY + G +KE DD + A RYA+ Sbjct: 360 NWLNEVYQYIWNEKTGEPVKENDDSMDAMRYAI 392 >gi|325912773|ref|ZP_08175152.1| phage terminase, large subunit, PBSX family [Lactobacillus iners UPII 60-B] gi|325477904|gb|EGC81037.1| phage terminase, large subunit, PBSX family [Lactobacillus iners UPII 60-B] Length = 409 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 2/33 (6%) Query: 9 EWLDEFHQY--HRCEGRVIKEKDDLICASRYAL 39 WL+E +QY + G +KE DD + A RYA+ Sbjct: 360 NWLNEVYQYIWNEKTGEPVKENDDSMDAMRYAI 392 >gi|329919929|ref|ZP_08276833.1| phage terminase, large subunit, PBSX family [Lactobacillus iners SPIN 1401G] gi|328936867|gb|EGG33301.1| phage terminase, large subunit, PBSX family [Lactobacillus iners SPIN 1401G] Length = 409 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 2/33 (6%) Query: 9 EWLDEFHQY--HRCEGRVIKEKDDLICASRYAL 39 WL+E +QY + G +KE DD + A RYA+ Sbjct: 360 NWLNEVYQYIWNEKTGEPVKENDDSMDAMRYAI 392 >gi|300766295|ref|ZP_07076256.1| hypothetical protein LMHG_12799 [Listeria monocytogenes FSL N1-017] gi|300513003|gb|EFK40089.1| hypothetical protein LMHG_12799 [Listeria monocytogenes FSL N1-017] Length = 439 Score = 33.9 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 2/31 (6%) Query: 10 WLDEFHQYHRCE--GRVIKEKDDLICASRYA 38 WL E Y R E G+ + + + + SRYA Sbjct: 398 WLQEMGMYVRDENSGKPVDKNNHAMDTSRYA 428 >gi|47097311|ref|ZP_00234868.1| phage terminase, large subunit, PBSX family [Listeria monocytogenes str. 1/2a F6854] gi|293596726|ref|ZP_05263865.2| phage terminase [Listeria monocytogenes J2818] gi|47014321|gb|EAL05297.1| phage terminase, large subunit, PBSX family [Listeria monocytogenes str. 1/2a F6854] gi|293591870|gb|EFG00205.1| phage terminase [Listeria monocytogenes J2818] Length = 442 Score = 33.5 bits (75), Expect = 9.9, Method: Composition-based stats. Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 2/31 (6%) Query: 10 WLDEFHQYHRCE--GRVIKEKDDLICASRYA 38 WL E Y R E G+ + + + + SRYA Sbjct: 401 WLQEMGMYVRDENSGKPVDKNNHAMDTSRYA 431 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.311 0.136 0.433 Lambda K H 0.267 0.0427 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 647,519,515 Number of Sequences: 13984884 Number of extensions: 20172721 Number of successful extensions: 76394 Number of sequences better than 10.0: 60 Number of HSP's better than 10.0 without gapping: 48 Number of HSP's successfully gapped in prelim test: 12 Number of HSP's that attempted gapping in prelim test: 76312 Number of HSP's gapped (non-prelim): 60 length of query: 62 length of database: 4,792,584,752 effective HSP length: 34 effective length of query: 28 effective length of database: 4,317,098,696 effective search space: 120878763488 effective search space used: 120878763488 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.8 bits) S2: 76 (33.8 bits)