BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1080_1 (110 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done Results from round 1 >gi|315122536|ref|YP_004063025.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495938|gb|ADR52537.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 455 Score = 172 bits (436), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 79/100 (79%), Positives = 94/100 (94%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAA+HL+G YP WW G+RF++PIVMVAGSV+YELTRDGIQRLLLGE Sbjct: 41 MAGNQLGKTLAGAAEAAIHLTGFYPPWWLGHRFVKPIVMVAGSVSYELTRDGIQRLLLGE 100 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSG 100 PMS D+QGSGMIPA+ ++NMTRR N+AGAY+TVT++H+SG Sbjct: 101 PMSLDRQGSGMIPAHTIVNMTRRFNVAGAYTTVTIKHVSG 140 >gi|15965769|ref|NP_386122.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] gi|15075038|emb|CAC46595.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] Length = 477 Score = 140 bits (352), Expect = 8e-32, Method: Compositional matrix adjust. Identities = 64/101 (63%), Positives = 82/101 (81%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPDWWQGRRFDRPIVMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGR 101 P++ ++ G+G +P + TRR+ +GA +VTVRH+SGR Sbjct: 123 PLNEEEWGTGFLPKAAIKATTRRAGASGALDSVTVRHVSGR 163 >gi|307315429|ref|ZP_07594994.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] gi|306898808|gb|EFN29464.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] Length = 477 Score = 139 bits (351), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 64/101 (63%), Positives = 82/101 (81%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPDWWQGRRFDRPIVMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGR 101 P++ ++ G+G +P + TRR+ +GA +VTVRH+SGR Sbjct: 123 PLNEEEWGTGFLPKAAIKATTRRAGASGALDSVTVRHVSGR 163 >gi|307318836|ref|ZP_07598268.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] gi|306895557|gb|EFN26311.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] Length = 477 Score = 139 bits (350), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 64/101 (63%), Positives = 81/101 (80%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPDWWQGRRFDRPIVMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGR 101 P+ ++ G+G +P + TRR+ +GA +VTVRH+SGR Sbjct: 123 PLHEEEWGTGFLPKAAIKATTRRAGASGALDSVTVRHVSGR 163 >gi|150397042|ref|YP_001327509.1| hypothetical protein Smed_1839 [Sinorhizobium medicae WSM419] gi|150028557|gb|ABR60674.1| protein of unknown function DUF264 [Sinorhizobium medicae WSM419] Length = 477 Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 62/100 (62%), Positives = 80/100 (80%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +P+ M+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPEWWQGRRFDRPVAMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSG 100 P++ D+ G+G +P + TRRS +GA +VTVRH++G Sbjct: 123 PLNEDEWGTGFVPKATIQATTRRSGASGALDSVTVRHVAG 162 >gi|227822449|ref|YP_002826421.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227341450|gb|ACP25668.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 454 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 65/100 (65%), Positives = 80/100 (80%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP+WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 40 MAGNQLGKTLAGAAEAAMHLTGRYPNWWQGRRFDKPIVMLAGSESYELTRDGVQRLLVGP 99 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSG 100 P++ G+G IP + TRRS +GA +VTVRH+SG Sbjct: 100 PLNEADWGTGFIPKATIRATTRRSGASGALDSVTVRHVSG 139 >gi|227821702|ref|YP_002825672.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227340701|gb|ACP24919.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 416 Score = 136 bits (343), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 65/101 (64%), Positives = 81/101 (80%), Gaps = 1/101 (0%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW G RF +PIVM+AGS +YELTRDG+QRL++G Sbjct: 1 MAGNQLGKTLAGAAEAAMHLTGRYPDWWDGRRFDKPIVMLAGSESYELTRDGVQRLMVGP 60 Query: 61 PMSPDQQGSGMIPANKVLNM-TRRSNIAGAYSTVTVRHLSG 100 PM+ + G+G IP ++ TRRS ++GA +VTVRH+SG Sbjct: 61 PMNEEDWGTGCIPKAAIVGTPTRRSGVSGALDSVTVRHVSG 101 >gi|260463788|ref|ZP_05811985.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030385|gb|EEW31664.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 209 Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 53/103 (51%), Positives = 64/103 (62%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKT AG AE AMHL+G YP WW+G F P+ + A VT E TRD QR+L+G Sbjct: 26 MAGNQLGKTRAGGAEWAMHLTGRYPDWWQGKVFDTPVRLWAAGVTGEGTRDNPQRVLIGP 85 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDI 103 P G+GMIPA+ +L T GA +V VRH G D+ Sbjct: 86 PQQQAAWGTGMIPADAILQTTMGRGAPGALDSVVVRHGGGGDV 128 >gi|13471714|ref|NP_103281.1| hypothetical protein mll1771 [Mesorhizobium loti MAFF303099] gi|14022458|dbj|BAB49067.1| mll1771 [Mesorhizobium loti MAFF303099] Length = 254 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 52/103 (50%), Positives = 65/103 (63%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKT AG AE AMHL+G YP+WW+G F P+ + A VT E TRD QR+L+G Sbjct: 71 MAGNQLGKTRAGGAEWAMHLTGRYPAWWQGKVFDTPVRLWAAGVTGEGTRDNPQRVLVGP 130 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDI 103 P G+GMIPA+ + N+ GA +V VRH G D+ Sbjct: 131 PQQQAAWGTGMIPADAIRQTIMGRNVPGAIDSVVVRHGGGGDV 173 >gi|148557330|ref|YP_001264912.1| bacteriophage terminase large (ATPase) subunit-like protein [Sphingomonas wittichii RW1] gi|148502520|gb|ABQ70774.1| Bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Sphingomonas wittichii RW1] Length = 225 Score = 100 bits (248), Expect = 9e-20, Method: Compositional matrix adjust. Identities = 50/100 (50%), Positives = 63/100 (63%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKT+AG+ E AMHL+G YP WWRG RF P T TRD +Q+LLLG+ Sbjct: 55 MAGNQLGKTVAGSFEIAMHLTGRYPGWWRGRRFDAPGRYWVAGETRISTRDTVQKLLLGD 114 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSG 100 P P+ G+G IP + R S +A A T+TV H++G Sbjct: 115 PERPEAWGTGAIPGAAIRTTHRASGVANAIDTLTVAHVAG 154 >gi|167041080|gb|ABZ05841.1| hypothetical protein ALOHA_HF400048F7ctg1g8 [uncultured marine microorganism HF4000_48F7] Length = 504 Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 44/102 (43%), Positives = 63/102 (61%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGN++GKT +GA E A HL+G YP WW G+RF + I A ++ TRD +Q L+GE Sbjct: 67 MAGNKVGKTFSGAMELAYHLTGKYPDWWTGHRFDRAIHAWAAGKSHYATRDIVQSELIGE 126 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRD 102 P P+ G+G IP + ++ R + A V V+H+SGR+ Sbjct: 127 PGDPESFGTGAIPKDLIVKTERNPGVPNALGFVLVKHVSGRN 168 >gi|264678785|ref|YP_003278692.1| DNA packaging protein GP3 [Comamonas testosteroni CNB-2] gi|262209298|gb|ACY33396.1| putative DNA packaging protein GP3 [Comamonas testosteroni CNB-2] Length = 189 Score = 90.5 bits (223), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 43/101 (42%), Positives = 62/101 (61%), Gaps = 1/101 (0%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGN++GKT+A E A HL+G YP WW G+RF +P+ + T+E TRD +Q LLG Sbjct: 67 MAGNRVGKTMAAGTELAYHLTGRYPWWWAGHRFTKPVRALISGDTHETTRDILQLKLLGS 126 Query: 61 PMS-PDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSG 100 P+ G+G+IP + + + RS++ GA +RH SG Sbjct: 127 TTDKPENFGTGLIPGDSITGIVARSHVKGAVERAMIRHESG 167 >gi|27476053|ref|NP_775255.1| terminase [Pseudomonas phage PaP3] gi|27414483|gb|AAL85569.1| terminase [Pseudomonas phage PaP3] Length = 482 Score = 89.7 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 44/109 (40%), Positives = 61/109 (55%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 M GN+ GKT GA A HL+G YP WW G +F +P+ A ++ + TRD +Q LLG+ Sbjct: 48 MTGNRCGKTYTGAFIMACHLTGRYPEWWTGRKFDKPVNCWAAGISTDTTRDILQSELLGD 107 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIES 109 +P+ G+GMIP ++ RR G V VRH+SG I +S Sbjct: 108 WKNPEAFGTGMIPKEDIVKTERREGKPGCVQAVMVRHVSGGLSSLIFKS 156 >gi|167600439|ref|YP_001671939.1| terminase large subunit [Pseudomonas phage LUZ24] gi|161168302|emb|CAP45467.1| terminase large subunit [Pseudomonas phage LUZ24] Length = 482 Score = 86.3 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 42/109 (38%), Positives = 60/109 (55%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 M GN+ GKT GA A HL+G YP WW G ++ +P+ A ++ + TRD +Q LLG+ Sbjct: 48 MTGNRCGKTYTGAFIMACHLTGRYPEWWTGRKYDRPVNCWAAGISTDTTRDILQSELLGD 107 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIES 109 +P+ G+GMIP ++ RR G V V+H SG I +S Sbjct: 108 WKNPEAFGTGMIPKEDIVETIRREGKPGCVQAVVVKHTSGGLSSLIFKS 156 >gi|273810450|ref|YP_003344921.1| TerL [Xylella phage Xfas53] gi|257097825|gb|ACV41131.1| TerL [Xylella phage Xfas53] Length = 470 Score = 83.6 bits (205), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 48/110 (43%), Positives = 63/110 (57%), Gaps = 3/110 (2%) Query: 2 AGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG-E 60 A NQ GKTL E AMHL+G YP WW G RF + +AGS T ELTR G+QR+LLG + Sbjct: 58 AANQSGKTLCAGHEVAMHLTGRYPQWWEGKRFERSNHGLAGSETGELTRRGVQRILLGRD 117 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P + + G+G IP + +T + TV VRH+SG ++S Sbjct: 118 PKT--EMGTGAIPGECIEGVTWARGVPELVDTVYVRHVSGERSSISLKSF 165 >gi|71897556|ref|ZP_00679801.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|71732459|gb|EAO34512.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] Length = 471 Score = 83.6 bits (205), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 48/110 (43%), Positives = 63/110 (57%), Gaps = 3/110 (2%) Query: 2 AGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG-E 60 A NQ GKTL E AMHL+G YP WW G RF + +AGS T ELTR G+QR+LLG + Sbjct: 59 AANQSGKTLCAGHEVAMHLTGRYPQWWEGKRFERSNHGLAGSETGELTRRGVQRILLGRD 118 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P + + G+G IP + +T + TV VRH+SG ++S Sbjct: 119 PKT--EMGTGAIPGECIEGVTWARGVPELVDTVYVRHVSGERSSISLKSF 166 >gi|158422462|ref|YP_001523754.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] gi|158329351|dbj|BAF86836.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] Length = 203 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 45/102 (44%), Positives = 60/102 (58%), Gaps = 5/102 (4%) Query: 1 MAGNQLGKTLA-GAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG 59 MA N++GKT G E +HL+G YP WW G RF PI A T E TRD +Q +L G Sbjct: 32 MAANRVGKTYGVGGYETVLHLTGRYPDWWEGRRFDHPIEAWAAGDTGETTRDIVQSVLFG 91 Query: 60 EPMSPDQQGSGMIPANKVLNM-TRRSNIAGAYSTVTVRHLSG 100 + D G+G+IPA+ ++ +RR+ I GA T +RH SG Sbjct: 92 K---IDDLGTGLIPADDIVGEPSRRAGITGAIDTAAIRHRSG 130 >gi|71274675|ref|ZP_00650963.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71901596|ref|ZP_00683677.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|170730087|ref|YP_001775520.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] gi|71164407|gb|EAO14121.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71728644|gb|EAO30794.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|167964880|gb|ACA11890.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] Length = 472 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 45/101 (44%), Positives = 61/101 (60%), Gaps = 3/101 (2%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG- 59 +A NQ GKTL EAA+HL+G YP WW+G RF +AGS T ELTR G+QR+LLG Sbjct: 59 IAANQSGKTLCAGYEAAIHLTGRYPDWWQGKRFTSANHGLAGSETGELTRRGVQRVLLGR 118 Query: 60 EPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSG 100 +P + + G+G IP + +T + T+ VRH +G Sbjct: 119 DPKT--ELGTGAIPGACIDAVTWARGVPELVDTIYVRHCTG 157 >gi|219681243|ref|YP_002455888.1| Gp2 [Salmonella enterica bacteriophage SE1] gi|66473858|gb|AAY46504.1| Gp2 [Salmonella phage SE1] Length = 499 Score = 70.5 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|46358697|ref|YP_006405.1| Gp2 [Enterobacteria phage ST104] gi|46357933|dbj|BAD15212.1| Gp2 [Enterobacteria phage ST104] gi|312911340|dbj|BAJ35314.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 499 Score = 70.5 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|60476789|gb|AAX21426.1| gp2 [Enterobacteria phage L] Length = 499 Score = 70.5 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|318065950|ref|YP_004123808.1| Gp2 [Salmonella phage ST160] gi|289066936|gb|ADC81147.1| Gp2 [Salmonella phage ST160] Length = 517 Score = 70.5 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 77 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 136 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 137 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 185 >gi|24371583|ref|NP_720326.1| gp2 [Enterobacteria phage ST64T] gi|24250810|gb|AAL15523.1| gp2 [Salmonella phage ST64T] Length = 517 Score = 70.5 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 77 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 136 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 137 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 185 >gi|168240109|ref|ZP_02665041.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194451817|ref|YP_002044341.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194410121|gb|ACF70340.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|205340165|gb|EDZ26929.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] Length = 499 Score = 70.1 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|51236724|ref|YP_063734.1| terminase large subunit [Enterobacteria phage P22] gi|137879|sp|P26745|TERL_BPP22 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein gp2; AltName: Full=Terminase large subunit gi|21914414|gb|AAM81379.1|AF527608_1 terminase large subunit [Salmonella phage P22-pbi] gi|553005|gb|AAA72959.1| DNA pacaging [Enterobacteria phage P22] gi|8439622|gb|AAF75044.1| terminase large subunit [Enterobacteria phage P22] gi|28394263|tpg|DAA00977.1| TPA_inf: terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 70.1 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|238912312|ref|ZP_04656149.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|261245593|emb|CBG23388.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 499 Score = 70.1 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|161504537|ref|YP_001571649.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|160865884|gb|ABX22507.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 499 Score = 70.1 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|221328620|ref|YP_002533461.1| Terminase, large subunit [Salmonella phage epsilon34] gi|255252684|ref|YP_003090219.1| Terminase, large subunit [Salmonella phage c341] gi|193244688|gb|ACF16628.1| Terminase, large subunit [Salmonella phage epsilon34] gi|223697657|gb|ACN18281.1| Terminase, large subunit [Salmonella phage g341c] Length = 499 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|197363441|ref|YP_002143078.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|197094918|emb|CAR60455.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|320086843|emb|CBY96615.1| DNA packaging protein gp2 Terminase large subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 499 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|157734711|dbj|BAF80717.1| terminase large subunit [Enterobacteria phage P22] gi|169658843|dbj|BAG12600.1| terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|198245578|ref|YP_002214540.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940094|gb|ACH77427.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 499 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|326622293|gb|EGE28638.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 482 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 42 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 101 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 102 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 150 >gi|293410725|ref|ZP_06654301.1| DNA-packaging protein gp2 [Escherichia coli B354] gi|291471193|gb|EFF13677.1| DNA-packaging protein gp2 [Escherichia coli B354] Length = 499 Score = 69.3 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|218549377|ref|YP_002383168.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|307311077|ref|ZP_07590721.1| protein of unknown function DUF264 [Escherichia coli W] gi|331669066|ref|ZP_08369914.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] gi|218356918|emb|CAQ89550.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|306908583|gb|EFN39080.1| protein of unknown function DUF264 [Escherichia coli W] gi|312945545|gb|ADR26372.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli O83:H1 str. NRG 857C] gi|315061655|gb|ADT75982.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli W] gi|323377763|gb|ADX50031.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli KO11] gi|324117758|gb|EGC11657.1| terminase [Escherichia coli E1167] gi|331064260|gb|EGI36171.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] Length = 499 Score = 69.3 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|62178924|ref|YP_215341.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62126557|gb|AAX64260.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322713379|gb|EFZ04950.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 499 Score = 69.3 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|167583562|ref|YP_001671752.1| terminase large subunit [Enterobacteria phage phiEco32] gi|164375400|gb|ABY52808.1| terminase large subunit [Enterobacteria phage phiEco32] Length = 513 Score = 69.3 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 31/79 (39%), Positives = 51/79 (64%), Gaps = 2/79 (2%) Query: 2 AGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEP 61 A N++GK+ + A E A H++G YP+WW GY+F +PI+ A +T + TR +Q+ L G P Sbjct: 66 AANRVGKSYSEAYEFACHVTGRYPTWWTGYKFKRPILAWAVGITGDSTRKVLQKELFGTP 125 Query: 62 MSPDQQ--GSGMIPANKVL 78 + D G+G+IP + ++ Sbjct: 126 IGKDTNLLGTGVIPRDAIV 144 >gi|300920006|ref|ZP_07136465.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] gi|300412953|gb|EFJ96263.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] Length = 498 Score = 69.3 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWEGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|281599695|gb|ADA72679.1| Gp2-like protein [Shigella flexneri 2002017] Length = 441 Score = 68.9 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 1 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 60 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 61 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 109 >gi|323967108|gb|EGB62533.1| terminase [Escherichia coli M863] Length = 499 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|327251967|gb|EGE63639.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] gi|327254495|gb|EGE66117.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] Length = 499 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|331657716|ref|ZP_08358678.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] gi|331055964|gb|EGI27973.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] Length = 499 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|49146380|ref|YP_025488.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] gi|40458348|gb|AAR87096.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] Length = 474 Score = 65.5 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 36/96 (37%), Positives = 55/96 (57%), Gaps = 5/96 (5%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 +AGN+ GKT G AE+ MHL+G YP WW G RF +PI A SVT LT + +++ L E Sbjct: 44 LAGNRTGKTYCGVAESVMHLTGYYPQWWIGKRFTRPIKAWAASVTTALTAEVLEKAYL-E 102 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVR 96 ++ D +I +++ + + G YS +T + Sbjct: 103 MIAEDL----VIGVDRLRHSYKIDYKTGGYSELTFK 134 >gi|315299781|gb|EFU59021.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 16-3] Length = 499 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 37/110 (33%), Positives = 50/110 (45%), Gaps = 14/110 (12%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGY-------------RFLQPIVMVAGSVTYE 47 MAGNQLGK+ GAAE A HL+G YP +GY RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPG-TKGYPADGKYGGEGEGKRFYEPVVFWMGGETNE 117 Query: 48 LTRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 118 TVTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|321225020|gb|EFX50081.1| Phage terminase, large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 134 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 31/71 (43%), Positives = 37/71 (52%), Gaps = 12/71 (16%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYP------------SWWRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLG 59 QR+L G Sbjct: 119 VTKTTQRILCG 129 >gi|89885991|ref|YP_516188.1| phage terminase large subunit [Sodalis phage phiSG1] gi|89191726|dbj|BAE80473.1| phage terminase large subunit [Sodalis phage phiSG1] gi|125470018|gb|ABN42210.1| gp02 [Sodalis phage phiSG1] Length = 475 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 34/100 (34%), Positives = 48/100 (48%), Gaps = 1/100 (1%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 +A N++GKT A+H G YP W GYRF V+ + E RD +Q LLG Sbjct: 55 IAANRVGKTDTATYVDAVHALGDYPEAWSGYRFSHAPVIWCLGYSGEKCRDLLQTPLLGR 114 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSG 100 QG G+IP ++ + + A T +RH+SG Sbjct: 115 KTDNGWQG-GLIPGERIADTEAMTGTTNAVRTAYIRHVSG 153 >gi|284008126|emb|CBA74349.1| DNA packaging protein gp2 [Arsenophonus nasoniae] Length = 137 Score = 40.8 bits (94), Expect = 0.067, Method: Compositional matrix adjust. Identities = 23/93 (24%), Positives = 39/93 (41%), Gaps = 12/93 (12%) Query: 17 AMHLSGCYP---------SW---WRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEPMSP 64 + H +G YP +W W+G F +P+V G T E QR+L G Sbjct: 2 SFHFTGRYPGTKSYPEDGAWKGKWKGKIFSEPVVFWIGGETNETVTKTTQRILCGRIEEN 61 Query: 65 DQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 ++ G G+IP +++ + + +RH Sbjct: 62 NEPGYGLIPKEDIISWKKSPFYPNLVDHLLIRH 94 >gi|156392062|ref|XP_001635868.1| predicted protein [Nematostella vectensis] gi|156222966|gb|EDO43805.1| predicted protein [Nematostella vectensis] Length = 892 Score = 37.7 bits (86), Expect = 0.46, Method: Composition-based stats. Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 9/66 (13%) Query: 28 WRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEPMSPDQQGSGMIPANK------VLNMT 81 W+GY FL +V++ +VT L R +R +L EP + G+ ++P+NK +LN T Sbjct: 4 WKGYVFLWTVVLLFPAVTNGLVR---ERRVLNEPGDVGKNGNSLMPSNKRQQIIYLLNKT 60 Query: 82 RRSNIA 87 + ++++ Sbjct: 61 KTNDLS 66 >gi|330995725|ref|ZP_08319623.1| conserved domain protein [Paraprevotella xylaniphila YIT 11841] gi|329574784|gb|EGG56345.1| conserved domain protein [Paraprevotella xylaniphila YIT 11841] Length = 320 Score = 37.7 bits (86), Expect = 0.49, Method: Compositional matrix adjust. Identities = 19/48 (39%), Positives = 26/48 (54%) Query: 51 DGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHL 98 D I+ +LLG P+ P+ S IPA + N+A A S +T RHL Sbjct: 256 DSIRNILLGYPVEPETIDSLKIPAYPQAQKAEKDNVATALSFLTYRHL 303 Searching..................................................done Results from round 2 >gi|27476053|ref|NP_775255.1| terminase [Pseudomonas phage PaP3] gi|27414483|gb|AAL85569.1| terminase [Pseudomonas phage PaP3] Length = 482 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 44/110 (40%), Positives = 61/110 (55%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 M GN+ GKT GA A HL+G YP WW G +F +P+ A ++ + TRD +Q LLG+ Sbjct: 48 MTGNRCGKTYTGAFIMACHLTGRYPEWWTGRKFDKPVNCWAAGISTDTTRDILQSELLGD 107 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 +P+ G+GMIP ++ RR G V VRH+SG I +S Sbjct: 108 WKNPEAFGTGMIPKEDIVKTERREGKPGCVQAVMVRHVSGGLSSLIFKSY 157 >gi|167600439|ref|YP_001671939.1| terminase large subunit [Pseudomonas phage LUZ24] gi|161168302|emb|CAP45467.1| terminase large subunit [Pseudomonas phage LUZ24] Length = 482 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 42/110 (38%), Positives = 60/110 (54%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 M GN+ GKT GA A HL+G YP WW G ++ +P+ A ++ + TRD +Q LLG+ Sbjct: 48 MTGNRCGKTYTGAFIMACHLTGRYPEWWTGRKYDRPVNCWAAGISTDTTRDILQSELLGD 107 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 +P+ G+GMIP ++ RR G V V+H SG I +S Sbjct: 108 WKNPEAFGTGMIPKEDIVETIRREGKPGCVQAVVVKHTSGGLSSLIFKSY 157 >gi|15965769|ref|NP_386122.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] gi|15075038|emb|CAC46595.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] Length = 477 Score = 178 bits (451), Expect = 2e-43, Method: Composition-based stats. Identities = 64/110 (58%), Positives = 85/110 (77%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPDWWQGRRFDRPIVMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P++ ++ G+G +P + TRR+ +GA +VTVRH+SGR + ++ Sbjct: 123 PLNEEEWGTGFLPKAAIKATTRRAGASGALDSVTVRHVSGRASTLLFKAY 172 >gi|307315429|ref|ZP_07594994.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] gi|306898808|gb|EFN29464.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] Length = 477 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 64/110 (58%), Positives = 85/110 (77%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPDWWQGRRFDRPIVMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P++ ++ G+G +P + TRR+ +GA +VTVRH+SGR + ++ Sbjct: 123 PLNEEEWGTGFLPKAAIKATTRRAGASGALDSVTVRHVSGRASTLLFKAY 172 >gi|227822449|ref|YP_002826421.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227341450|gb|ACP25668.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 454 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. Identities = 65/110 (59%), Positives = 83/110 (75%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP+WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 40 MAGNQLGKTLAGAAEAAMHLTGRYPNWWQGRRFDKPIVMLAGSESYELTRDGVQRLLVGP 99 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P++ G+G IP + TRRS +GA +VTVRH+SG + ++ Sbjct: 100 PLNEADWGTGFIPKATIRATTRRSGASGALDSVTVRHVSGGASTLLFKAY 149 >gi|307318836|ref|ZP_07598268.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] gi|306895557|gb|EFN26311.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] Length = 477 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. Identities = 64/110 (58%), Positives = 84/110 (76%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +PIVM+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPDWWQGRRFDRPIVMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P+ ++ G+G +P + TRR+ +GA +VTVRH+SGR + ++ Sbjct: 123 PLHEEEWGTGFLPKAAIKATTRRAGASGALDSVTVRHVSGRASTLLFKAY 172 >gi|150397042|ref|YP_001327509.1| hypothetical protein Smed_1839 [Sinorhizobium medicae WSM419] gi|150028557|gb|ABR60674.1| protein of unknown function DUF264 [Sinorhizobium medicae WSM419] Length = 477 Score = 175 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 62/110 (56%), Positives = 83/110 (75%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW+G RF +P+ M+AGS +YELTRDG+QRLL+G Sbjct: 63 MAGNQLGKTLAGAAEAAMHLTGRYPEWWQGRRFDRPVAMLAGSESYELTRDGVQRLLIGP 122 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P++ D+ G+G +P + TRRS +GA +VTVRH++G + ++ Sbjct: 123 PLNEDEWGTGFVPKATIQATTRRSGASGALDSVTVRHVAGGASTLLFKAY 172 >gi|227821702|ref|YP_002825672.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227340701|gb|ACP24919.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 416 Score = 172 bits (436), Expect = 1e-41, Method: Composition-based stats. Identities = 65/111 (58%), Positives = 84/111 (75%), Gaps = 1/111 (0%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAAMHL+G YP WW G RF +PIVM+AGS +YELTRDG+QRL++G Sbjct: 1 MAGNQLGKTLAGAAEAAMHLTGRYPDWWDGRRFDKPIVMLAGSESYELTRDGVQRLMVGP 60 Query: 61 PMSPDQQGSGMIPANKVLNM-TRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 PM+ + G+G IP ++ TRRS ++GA +VTVRH+SG + ++ Sbjct: 61 PMNEEDWGTGCIPKAAIVGTPTRRSGVSGALDSVTVRHVSGGVSILLFKAY 111 >gi|167041080|gb|ABZ05841.1| hypothetical protein ALOHA_HF400048F7ctg1g8 [uncultured marine microorganism HF4000_48F7] Length = 504 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 45/110 (40%), Positives = 65/110 (59%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGN++GKT +GA E A HL+G YP WW G+RF + I A ++ TRD +Q L+GE Sbjct: 67 MAGNKVGKTFSGAMELAYHLTGKYPDWWTGHRFDRAIHAWAAGKSHYATRDIVQSELIGE 126 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P P+ G+G IP + ++ R + A V V+H+SGR+ +S Sbjct: 127 PGDPESFGTGAIPKDLIVKTERNPGVPNALGFVLVKHVSGRNSRLQFKSY 176 >gi|315122536|ref|YP_004063025.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495938|gb|ADR52537.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 455 Score = 166 bits (420), Expect = 9e-40, Method: Composition-based stats. Identities = 79/110 (71%), Positives = 98/110 (89%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKTLAGAAEAA+HL+G YP WW G+RF++PIVMVAGSV+YELTRDGIQRLLLGE Sbjct: 41 MAGNQLGKTLAGAAEAAIHLTGFYPPWWLGHRFVKPIVMVAGSVSYELTRDGIQRLLLGE 100 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 PMS D+QGSGMIPA+ ++NMTRR N+AGAY+TVT++H+SG ++++ Sbjct: 101 PMSLDRQGSGMIPAHTIVNMTRRFNVAGAYTTVTIKHVSGGTSVLLLKAY 150 >gi|13471714|ref|NP_103281.1| hypothetical protein mll1771 [Mesorhizobium loti MAFF303099] gi|14022458|dbj|BAB49067.1| mll1771 [Mesorhizobium loti MAFF303099] Length = 254 Score = 162 bits (410), Expect = 1e-38, Method: Composition-based stats. Identities = 53/115 (46%), Positives = 67/115 (58%), Gaps = 5/115 (4%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKT AG AE AMHL+G YP+WW+G F P+ + A VT E TRD QR+L+G Sbjct: 71 MAGNQLGKTRAGGAEWAMHLTGRYPAWWQGKVFDTPVRLWAAGVTGEGTRDNPQRVLVGP 130 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIG-----FIIESI 110 P G+GMIPA+ + N+ GA +V VRH G D+ +S Sbjct: 131 PQQQAAWGTGMIPADAIRQTIMGRNVPGAIDSVVVRHGGGGDVQAGESVLSFKSF 185 >gi|260463788|ref|ZP_05811985.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030385|gb|EEW31664.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 209 Score = 160 bits (405), Expect = 5e-38, Method: Composition-based stats. Identities = 54/115 (46%), Positives = 66/115 (57%), Gaps = 5/115 (4%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKT AG AE AMHL+G YP WW+G F P+ + A VT E TRD QR+L+G Sbjct: 26 MAGNQLGKTRAGGAEWAMHLTGRYPDWWQGKVFDTPVRLWAAGVTGEGTRDNPQRVLIGP 85 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIG-----FIIESI 110 P G+GMIPA+ +L T GA +V VRH G D+ +S Sbjct: 86 PQQQAAWGTGMIPADAILQTTMGRGAPGALDSVVVRHGGGGDVQAGESVLSFKSF 140 >gi|148557330|ref|YP_001264912.1| bacteriophage terminase large (ATPase) subunit-like protein [Sphingomonas wittichii RW1] gi|148502520|gb|ABQ70774.1| Bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Sphingomonas wittichii RW1] Length = 225 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 50/110 (45%), Positives = 66/110 (60%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGNQLGKT+AG+ E AMHL+G YP WWRG RF P T TRD +Q+LLLG+ Sbjct: 55 MAGNQLGKTVAGSFEIAMHLTGRYPGWWRGRRFDAPGRYWVAGETRISTRDTVQKLLLGD 114 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P P+ G+G IP + R S +A A T+TV H++G + ++ Sbjct: 115 PERPEAWGTGAIPGAAIRTTHRASGVANAIDTLTVAHVAGGASTLLFKAY 164 >gi|71897556|ref|ZP_00679801.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|71732459|gb|EAO34512.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] Length = 471 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 48/110 (43%), Positives = 62/110 (56%), Gaps = 3/110 (2%) Query: 2 AGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG-E 60 A NQ GKTL E AMHL+G YP WW G RF + +AGS T ELTR G+QR+LLG + Sbjct: 59 AANQSGKTLCAGHEVAMHLTGRYPQWWEGKRFERSNHGLAGSETGELTRRGVQRILLGRD 118 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P + G+G IP + +T + TV VRH+SG ++S Sbjct: 119 PKTE--MGTGAIPGECIEGVTWARGVPELVDTVYVRHVSGERSSISLKSF 166 >gi|273810450|ref|YP_003344921.1| TerL [Xylella phage Xfas53] gi|257097825|gb|ACV41131.1| TerL [Xylella phage Xfas53] Length = 470 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 48/110 (43%), Positives = 62/110 (56%), Gaps = 3/110 (2%) Query: 2 AGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG-E 60 A NQ GKTL E AMHL+G YP WW G RF + +AGS T ELTR G+QR+LLG + Sbjct: 58 AANQSGKTLCAGHEVAMHLTGRYPQWWEGKRFERSNHGLAGSETGELTRRGVQRILLGRD 117 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 P + G+G IP + +T + TV VRH+SG ++S Sbjct: 118 PKTE--MGTGAIPGECIEGVTWARGVPELVDTVYVRHVSGERSSISLKSF 165 >gi|71274675|ref|ZP_00650963.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71901596|ref|ZP_00683677.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|170730087|ref|YP_001775520.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] gi|71164407|gb|EAO14121.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71728644|gb|EAO30794.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|167964880|gb|ACA11890.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] Length = 472 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 46/111 (41%), Positives = 63/111 (56%), Gaps = 3/111 (2%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG- 59 +A NQ GKTL EAA+HL+G YP WW+G RF +AGS T ELTR G+QR+LLG Sbjct: 59 IAANQSGKTLCAGYEAAIHLTGRYPDWWQGKRFTSANHGLAGSETGELTRRGVQRVLLGR 118 Query: 60 EPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 +P + G+G IP + +T + T+ VRH +G ++S Sbjct: 119 DPKTE--LGTGAIPGACIDAVTWARGVPELVDTIYVRHCTGARSSVSLKSF 167 >gi|158422462|ref|YP_001523754.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] gi|158329351|dbj|BAF86836.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] Length = 203 Score = 147 bits (370), Expect = 7e-34, Method: Composition-based stats. Identities = 46/112 (41%), Positives = 62/112 (55%), Gaps = 5/112 (4%) Query: 1 MAGNQLGKTLA-GAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLG 59 MA N++GKT G E +HL+G YP WW G RF PI A T E TRD +Q +L G Sbjct: 32 MAANRVGKTYGVGGYETVLHLTGRYPDWWEGRRFDHPIEAWAAGDTGETTRDIVQSVLFG 91 Query: 60 EPMSPDQQGSGMIPANKVLNM-TRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 + D G+G+IPA+ ++ +RR+ I GA T +RH SG +S Sbjct: 92 KI---DDLGTGLIPADDIVGEPSRRAGITGAIDTAAIRHRSGGTSLIGFKSY 140 >gi|264678785|ref|YP_003278692.1| DNA packaging protein GP3 [Comamonas testosteroni CNB-2] gi|262209298|gb|ACY33396.1| putative DNA packaging protein GP3 [Comamonas testosteroni CNB-2] Length = 189 Score = 143 bits (360), Expect = 9e-33, Method: Composition-based stats. Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 1/104 (0%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 MAGN++GKT+A E A HL+G YP WW G+RF +P+ + T+E TRD +Q LLG Sbjct: 67 MAGNRVGKTMAAGTELAYHLTGRYPWWWAGHRFTKPVRALISGDTHETTRDILQLKLLGS 126 Query: 61 PMS-PDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDI 103 P+ G+G+IP + + + RS++ GA +RH SG + Sbjct: 127 TTDKPENFGTGLIPGDSITGIVARSHVKGAVERAMIRHESGGES 170 >gi|318065950|ref|YP_004123808.1| Gp2 [Salmonella phage ST160] gi|289066936|gb|ADC81147.1| Gp2 [Salmonella phage ST160] Length = 517 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 77 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 136 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 137 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 185 >gi|219681243|ref|YP_002455888.1| Gp2 [Salmonella enterica bacteriophage SE1] gi|66473858|gb|AAY46504.1| Gp2 [Salmonella phage SE1] Length = 499 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|46358697|ref|YP_006405.1| Gp2 [Enterobacteria phage ST104] gi|46357933|dbj|BAD15212.1| Gp2 [Enterobacteria phage ST104] gi|312911340|dbj|BAJ35314.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 499 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|24371583|ref|NP_720326.1| gp2 [Enterobacteria phage ST64T] gi|24250810|gb|AAL15523.1| gp2 [Salmonella phage ST64T] Length = 517 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 77 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 136 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 137 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 185 >gi|89885991|ref|YP_516188.1| phage terminase large subunit [Sodalis phage phiSG1] gi|89191726|dbj|BAE80473.1| phage terminase large subunit [Sodalis phage phiSG1] gi|125470018|gb|ABN42210.1| gp02 [Sodalis phage phiSG1] Length = 475 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 35/110 (31%), Positives = 49/110 (44%), Gaps = 1/110 (0%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 +A N++GKT A+H G YP W GYRF V+ + E RD +Q LLG Sbjct: 55 IAANRVGKTDTATYVDAVHALGDYPEAWSGYRFSHAPVIWCLGYSGEKCRDLLQTPLLGR 114 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 QG G+IP ++ + + A T +RH+SG S Sbjct: 115 KTDNGWQG-GLIPGERIADTEAMTGTTNAVRTAYIRHVSGLLSKIQFWSY 163 >gi|300920006|ref|ZP_07136465.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] gi|300412953|gb|EFJ96263.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] Length = 498 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWEGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|60476789|gb|AAX21426.1| gp2 [Enterobacteria phage L] Length = 499 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|198245578|ref|YP_002214540.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940094|gb|ACH77427.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 499 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|51236724|ref|YP_063734.1| terminase large subunit [Enterobacteria phage P22] gi|137879|sp|P26745|TERL_BPP22 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein gp2; AltName: Full=Terminase large subunit gi|21914414|gb|AAM81379.1|AF527608_1 terminase large subunit [Salmonella phage P22-pbi] gi|553005|gb|AAA72959.1| DNA pacaging [Enterobacteria phage P22] gi|8439622|gb|AAF75044.1| terminase large subunit [Enterobacteria phage P22] gi|28394263|tpg|DAA00977.1| TPA_inf: terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|221328620|ref|YP_002533461.1| Terminase, large subunit [Salmonella phage epsilon34] gi|255252684|ref|YP_003090219.1| Terminase, large subunit [Salmonella phage c341] gi|193244688|gb|ACF16628.1| Terminase, large subunit [Salmonella phage epsilon34] gi|223697657|gb|ACN18281.1| Terminase, large subunit [Salmonella phage g341c] Length = 499 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|168240109|ref|ZP_02665041.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194451817|ref|YP_002044341.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194410121|gb|ACF70340.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|205340165|gb|EDZ26929.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] Length = 499 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|157734711|dbj|BAF80717.1| terminase large subunit [Enterobacteria phage P22] gi|169658843|dbj|BAG12600.1| terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|238912312|ref|ZP_04656149.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|261245593|emb|CBG23388.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 499 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|161504537|ref|YP_001571649.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|160865884|gb|ABX22507.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 499 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|197363441|ref|YP_002143078.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|197094918|emb|CAR60455.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|320086843|emb|CBY96615.1| DNA packaging protein gp2 Terminase large subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 499 Score = 137 bits (344), Expect = 6e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|326622293|gb|EGE28638.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 482 Score = 136 bits (343), Expect = 8e-31, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 42 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 101 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 102 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 150 >gi|293410725|ref|ZP_06654301.1| DNA-packaging protein gp2 [Escherichia coli B354] gi|291471193|gb|EFF13677.1| DNA-packaging protein gp2 [Escherichia coli B354] Length = 499 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|218549377|ref|YP_002383168.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|307311077|ref|ZP_07590721.1| protein of unknown function DUF264 [Escherichia coli W] gi|331669066|ref|ZP_08369914.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] gi|218356918|emb|CAQ89550.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|306908583|gb|EFN39080.1| protein of unknown function DUF264 [Escherichia coli W] gi|312945545|gb|ADR26372.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli O83:H1 str. NRG 857C] gi|315061655|gb|ADT75982.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli W] gi|323377763|gb|ADX50031.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli KO11] gi|324117758|gb|EGC11657.1| terminase [Escherichia coli E1167] gi|331064260|gb|EGI36171.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] Length = 499 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|62178924|ref|YP_215341.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62126557|gb|AAX64260.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322713379|gb|EFZ04950.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 499 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|327251967|gb|EGE63639.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] gi|327254495|gb|EGE66117.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] Length = 499 Score = 134 bits (337), Expect = 4e-30, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|331657716|ref|ZP_08358678.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] gi|331055964|gb|EGI27973.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] Length = 499 Score = 134 bits (337), Expect = 4e-30, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|323967108|gb|EGB62533.1| terminase [Escherichia coli M863] Length = 499 Score = 134 bits (337), Expect = 5e-30, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 50/109 (45%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 118 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 119 VTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|281599695|gb|ADA72679.1| Gp2-like protein [Shigella flexneri 2002017] Length = 441 Score = 134 bits (336), Expect = 5e-30, Method: Composition-based stats. Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 12/109 (11%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 1 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNET 60 Query: 49 TRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G D+ G G IP +++ + + V+H Sbjct: 61 VTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 109 >gi|167583562|ref|YP_001671752.1| terminase large subunit [Enterobacteria phage phiEco32] gi|164375400|gb|ABY52808.1| terminase large subunit [Enterobacteria phage phiEco32] Length = 513 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 34/99 (34%), Positives = 57/99 (57%), Gaps = 5/99 (5%) Query: 2 AGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEP 61 A N++GK+ + A E A H++G YP+WW GY+F +PI+ A +T + TR +Q+ L G P Sbjct: 66 AANRVGKSYSEAYEFACHVTGRYPTWWTGYKFKRPILAWAVGITGDSTRKVLQKELFGTP 125 Query: 62 MSPDQQ--GSGMIPANKVL-NMTRRSNIAGAYSTVTVRH 97 + D G+G+IP + ++ + R V ++H Sbjct: 126 IGKDTNLLGTGVIPRDAIVIDTIERDG--NKLQIVQIKH 162 >gi|315299781|gb|EFU59021.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 16-3] Length = 499 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 37/110 (33%), Positives = 50/110 (45%), Gaps = 14/110 (12%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGY-------------RFLQPIVMVAGSVTYE 47 MAGNQLGK+ GAAE A HL+G YP +GY RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGT-KGYPADGKYGGEGEGKRFYEPVVFWMGGETNE 117 Query: 48 LTRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 QR+L G + G G IP +++ + + V+H Sbjct: 118 TVTKTTQRILCGRIEENGEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKH 167 >gi|49146380|ref|YP_025488.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] gi|40458348|gb|AAR87096.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] Length = 474 Score = 117 bits (293), Expect = 5e-25, Method: Composition-based stats. Identities = 37/105 (35%), Positives = 56/105 (53%), Gaps = 5/105 (4%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE 60 +AGN+ GKT G AE+ MHL+G YP WW G RF +PI A SVT LT + +++ L E Sbjct: 44 LAGNRTGKTYCGVAESVMHLTGYYPQWWIGKRFTRPIKAWAASVTTALTAEVLEKAYL-E 102 Query: 61 PMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGF 105 ++ D +I +++ + + G YS +T + F Sbjct: 103 MIAEDL----VIGVDRLRHSYKIDYKTGGYSELTFKSYEQGRKKF 143 >gi|284008126|emb|CBA74349.1| DNA packaging protein gp2 [Arsenophonus nasoniae] Length = 137 Score = 97.0 bits (240), Expect = 7e-19, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 38/94 (40%), Gaps = 12/94 (12%) Query: 16 AAMHLSGCYPS---------W---WRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEPMS 63 + H +G YP W W+G F +P+V G T E QR+L G Sbjct: 1 MSFHFTGRYPGTKSYPEDGAWKGKWKGKIFSEPVVFWIGGETNETVTKTTQRILCGRIEE 60 Query: 64 PDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRH 97 ++ G G+IP +++ + + +RH Sbjct: 61 NNEPGYGLIPKEDIISWKKSPFYPNLVDHLLIRH 94 >gi|321225020|gb|EFX50081.1| Phage terminase, large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 134 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 31/71 (43%), Positives = 37/71 (52%), Gaps = 12/71 (16%) Query: 1 MAGNQLGKTLAGAAEAAMHLSGCYPSW------------WRGYRFLQPIVMVAGSVTYEL 48 MAGNQLGK+ GAAE A HL+G YP W+G RF +P+V G T E Sbjct: 59 MAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNET 118 Query: 49 TRDGIQRLLLG 59 QR+L G Sbjct: 119 VTKTTQRILCG 129 >gi|182681090|ref|YP_001829250.1| bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa M23] gi|182631200|gb|ACB91976.1| Bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa M23] Length = 291 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 16/39 (41%) Query: 72 IPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 +P + MT + TV VRH SG ++S Sbjct: 6 VPGACIDGMTWAPGVPELVDTVYVRHCSGVRSSVSLKSF 44 >gi|71898835|ref|ZP_00681003.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71731421|gb|EAO33484.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 291 Score = 53.9 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 11/39 (28%), Positives = 16/39 (41%) Query: 72 IPANKVLNMTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 +P + MT + TV VRH SG ++S Sbjct: 6 VPGACIDAMTWARGVPELVDTVYVRHCSGVRSSVSLKSF 44 >gi|307579537|gb|ADN63506.1| bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa subsp. fastidiosa GB514] Length = 278 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 10/31 (32%), Positives = 13/31 (41%) Query: 80 MTRRSNIAGAYSTVTVRHLSGRDIGFIIESI 110 MT + TV VRH SG ++S Sbjct: 1 MTWAPGVPELVDTVYVRHCSGVRSSVSLKSF 31 >gi|137993|sp|P16938|VG2_BPLP7 RecName: Full=Protein GP2 gi|75884|pir||Z2BPL7 gene 2 protein - phage LP-7 (fragment) gi|553003|gb|AAA88220.1| packaging glycoprotein [Enterobacteria phage LP7] Length = 475 Score = 40.8 bits (94), Expect = 0.067, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 1/36 (2%) Query: 41 AGSVTYELTRDGIQRLLLGEPMSPDQQGSGMIPANK 76 G T E QR+L G D+ G G P Sbjct: 111 IGGETNETVTKTTQRILCGRIEENDEPGYGS-PKED 145 >gi|313895672|ref|ZP_07829228.1| phage/plasmid primase, P4 family, C-terminal domain protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312975798|gb|EFR41257.1| phage/plasmid primase, P4 family, C-terminal domain protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 759 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 31/86 (36%), Gaps = 11/86 (12%) Query: 29 RGYRFLQPIVMVAGSVTYELTRDGIQRLLLGE--------PMSPDQQGSGMIPANKVLNM 80 +G+R + + M T + + LLG + G+G+I ++ Sbjct: 471 KGWRMKKALFMYGAGDTGKSRLKCLVEQLLGRGNYVGIDLREIEARFGTGLIYGMRLAGS 530 Query: 81 TRRSNIAGAYSTV-TVRHLSGRDIGF 105 + S I + T + +G D F Sbjct: 531 SDMSFIT--VDELKTFKKCTGGDSIF 554 >gi|298674384|ref|YP_003726134.1| 4Fe-4S ferredoxin iron-sulfur-binding domain-containing protein [Methanohalobium evestigatum Z-7303] gi|298287372|gb|ADI73338.1| 4Fe-4S ferredoxin iron-sulfur binding domain protein [Methanohalobium evestigatum Z-7303] Length = 353 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 30/73 (41%) Query: 28 WRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIA 87 + G RF + SV +++G+ + D+ + I ++ T S IA Sbjct: 81 YNGMRFNAADALWTASVNGFTQESMNAPVIIGDGLMGDESVTVEINGEELKQTTVASAIA 140 Query: 88 GAYSTVTVRHLSG 100 A S + + H+ G Sbjct: 141 KADSMIVLSHVKG 153 >gi|308159700|gb|EFO62222.1| Spindle pole protein, putative [Giardia lamblia P15] Length = 2263 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 4/61 (6%) Query: 21 SGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEPMSPDQQGSGMIPANKVLNM 80 +G Y + G +PI G R R LLG+P S + QG+ P N ++ Sbjct: 309 TGRYDNGTSGRTLDEPI----GPRPEYHFRKTRHRKLLGQPDSQECQGALTAPTNSIVTT 364 Query: 81 T 81 Sbjct: 365 V 365 >gi|260795723|ref|XP_002592854.1| hypothetical protein BRAFLDRAFT_201634 [Branchiostoma floridae] gi|229278078|gb|EEN48865.1| hypothetical protein BRAFLDRAFT_201634 [Branchiostoma floridae] Length = 1438 Score = 35.0 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 25/123 (20%), Positives = 44/123 (35%), Gaps = 27/123 (21%) Query: 9 TLAGAAEAAMHLSGCYPSWWRGYRFLQP----------------IVMVAGSVTYELTRDG 52 + +G + A H G + W + P I AG+ T EL + Sbjct: 260 SFSGTGQFASHWLGDNKAAWEDMAWSIPGILEFGLFGIPHIGADICGFAGNTTEELCQRW 319 Query: 53 IQ-----------RLLLGEPMSPDQQGSGMIPANKVLNMTRRSNIAGAYSTVTVRHLSGR 101 +Q + G P P G MI +++ + MTR + + Y+ H++G Sbjct: 320 MQLGAFYPFSRNHNTMNGNPQDPGSFGKAMIDSSRDVMMTRYTLLPYLYTLFYHAHVAGT 379 Query: 102 DIG 104 + Sbjct: 380 TVA 382 >gi|327190910|gb|EGE57964.1| hypothetical protein RHECNPAF_3500011 [Rhizobium etli CNPAF512] Length = 683 Score = 34.2 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 25/68 (36%), Gaps = 3/68 (4%) Query: 14 AEAAMHLSGCYPSWWRGYRFLQPIVMVAGSVTYELTRDGIQRLLLGEPMSPD-QQGSGMI 72 A A+HL G P F +A + D +Q + G+ + D G G I Sbjct: 143 AALALHLVGRLPG--ADRHFADATHGLAVGRDDRESADIVQDVFGGDRLLADTAFGKGDI 200 Query: 73 PANKVLNM 80 ++ M Sbjct: 201 LGDRRRQM 208 >gi|330997689|ref|ZP_08321534.1| carboxymuconolactone decarboxylase family protein [Paraprevotella xylaniphila YIT 11841] gi|329570217|gb|EGG51957.1| carboxymuconolactone decarboxylase family protein [Paraprevotella xylaniphila YIT 11841] Length = 272 Score = 34.2 bits (77), Expect = 5.7, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 25/68 (36%), Gaps = 6/68 (8%) Query: 2 AGNQLGKTLAGAAEAAMHLSGCYPSW---WRGYRFLQPIVMVAGSVTYELTRDGIQRLLL 58 + + G T AE H+ G Y W W F + A VT E + QR ++ Sbjct: 87 SAKKNGITRTEIAEIITHI-GFYAGWPKAWAA--FNLAKNVWAEDVTGEDAKAAFQREMI 143 Query: 59 GEPMSPDQ 66 P+ Sbjct: 144 FPIGEPNT 151 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.308 0.137 0.421 Lambda K H 0.267 0.0421 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,115,677,086 Number of Sequences: 13984884 Number of extensions: 81101742 Number of successful extensions: 194282 Number of sequences better than 10.0: 56 Number of HSP's better than 10.0 without gapping: 98 Number of HSP's successfully gapped in prelim test: 7 Number of HSP's that attempted gapping in prelim test: 194117 Number of HSP's gapped (non-prelim): 106 length of query: 110 length of database: 4,792,584,752 effective HSP length: 78 effective length of query: 32 effective length of database: 3,701,763,800 effective search space: 118456441600 effective search space used: 118456441600 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 76 (33.8 bits)