BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781208|ref|YP_003065621.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] (578 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|268589382|ref|ZP_06123603.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131] gi|291315409|gb|EFE55862.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131] Length = 818 Score = 429 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 107/585 (18%), Positives = 207/585 (35%), Gaps = 46/585 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + SFS GE++P L R DL+ ++ + K N I +YG + + P + Sbjct: 1 MA-YSIIQPSFSGGEIAPSL-YGRIDLAKYSTALRKCSNFIVRQYGGIENRPGTKFIAAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKD 117 + + R+ F L GDK ++++ ++ TPY D Sbjct: 59 KYPNKKCRLIPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEYKGEIFELATPYKEAD 118 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 +L++ VH D+PP L D + ++ P+ K Sbjct: 119 LFNLKFTQSADVMTIVHADYPPMELQRYDHDD---WKLVPVETRNGPFEDINTDKERK-- 173 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADD 233 A T +++ IF G+ I + P W + +I A Sbjct: 174 ---LYVSASTGDVTLSATHNIFGAELVGKQIYIEQQAIDAVPVWETDKTTNINDQRRAGA 230 Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293 YR+ T G+SG +W + SG Sbjct: 231 NYYRANTAGKSGTLRPSHTEGM-------SWDGWGGDAGIQWEYLHSGFGIVKINSVSTD 283 Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 ++ G+ + P + W S W + +GYPS V ++ RL F+GS+ Sbjct: 284 GLTATGKVVLYIPS--NAVGEENATYKWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYPQ 341 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 +++ S G + DF + + + I + G ++ + Sbjct: 342 TIWASRSGDYKDFGKNNPIQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYQ 397 Query: 414 LSISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 ++ + S F +G PP++V + +++ G ++ ++ S + G++ Sbjct: 398 ITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQG 457 Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 ++T +A+HLF +I+ + P+SI W + + +LL + E + FAW Sbjct: 458 TDLTIMANHLFQRHQIIDWAFSIVPYSIAWCIRD-----DGKLLSLTYLREQQ-VFAWAP 511 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 + + S +++ +V G + RL+ Sbjct: 512 QETDGQFESTCSVS----EGNEDAVYFIVCRKVGGGTVRYIERLS 552 >gi|212710810|ref|ZP_03318938.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM 30120] gi|212686507|gb|EEB46035.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM 30120] Length = 818 Score = 421 bits (1082), Expect = e-115, Method: Composition-based stats. Identities = 108/585 (18%), Positives = 208/585 (35%), Gaps = 46/585 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + SFS GE++P L R DL+ ++ + K N + +YG + + P + Sbjct: 1 MA-YSIIQPSFSGGEIAPSL-YGRIDLAKYSTALRKCENFLVRQYGGIENRPGTKFIAAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKD 117 + + R+ F L GDK ++++ ++ TPY D Sbjct: 59 KYPNKKCRLIPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEHKGEIFELTTPYKEAD 118 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 +L++ VH D+PP L D + ++ P+ + K Sbjct: 119 LFNLKFTQSADVMTIVHADYPPMELQRYDHDD---WKLVPVETRNGPFEDINVDKERK-- 173 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADD 233 A T +T+ IF G+ I + P W + A Sbjct: 174 ---VYVSASTGEVTLTATHNIFGAELVGKQIYIEQQAVDAVPVWETDKTTIKNDQRRAGS 230 Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293 YR+ T+G+SG +W + SG Sbjct: 231 NYYRANTSGKSGTLRPSHTEGM-------SWDGWGGDTGIQWEYLHSGFGIVKINSVSTD 283 Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 ++ G+ IS P + W S W + +GYPS V ++ RL F+GS+ Sbjct: 284 GLTATGKVISYIPS--NAVGESNATYKWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYPQ 341 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 +++ S G + DF + + + I + G ++ + Sbjct: 342 TIWASRSGDYKDFGKNNPIQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYQ 397 Query: 414 LSISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 ++ + S F +G PP++V + +++ G ++ ++ S + G++ Sbjct: 398 ITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQG 457 Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 ++T +A+HLF +I+ + P+SI W + + +LL + E + FAW Sbjct: 458 TDLTIMANHLFQRHQIIDWAFTIVPYSIAWCIRD-----DGKLLSLTYLREQQ-VFAWAP 511 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSAG-EERSFTVRLN 572 + + S +++ +V G + RL+ Sbjct: 512 QDTDGQFESTCSIS----EGNEDAVYFIVCRKVGDGTVRYIERLS 552 >gi|227355852|ref|ZP_03840245.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164171|gb|EEI49068.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 820 Score = 419 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 111/580 (19%), Positives = 207/580 (35%), Gaps = 45/580 (7%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 + + SFS GE++P L R DL+ ++ + K N I +YG + + P + + + Sbjct: 4 SLIQPSFSGGEIAPSL-YGRVDLAKYSTALRKCHNFIVRQYGGVENRPGTRFIAETKYQN 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKDNKSL 121 + +R+ F L FGD+ +++ ++ TPY D L Sbjct: 63 KKSRLIPFQFSTVQTYALEFGDRYIRVFKDGGQVLYADGEHKGEVFELATPYKEADLFDL 122 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 +Y VH D+PP L D + ++ P+ A Sbjct: 123 KYTQSADVMTIVHTDYPPMELQRYDHDD---WKLVSVETKNGPFEDINTDK-----AMKV 174 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYR 237 + A T +TS IF G+ L P W + ++ AD YR Sbjct: 175 YASASTGQITLTSTHDIFGSEQIGKQFYLEQRDIDAVPVWETDKTTNLNDQRRADSNYYR 234 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSK 297 + + G++G +W + SG + Sbjct: 235 ANSGGKTGTLRPSHTEGM-------SWDGWGGDTGIQWEYLHSGFGIVKIETVSEDGKTA 287 Query: 298 DGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 G+ +S P + W + W + +GYPS V ++ RL F+GS+ +++ Sbjct: 288 TGKVLSYIPS--NAVGEDNASHKWARAVWNDVDGYPSTVVYYQQRLFFAGSRAYPQTIWA 345 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 S G + DF + + + I + G ++ + ++ Sbjct: 346 SRSGDYKDFGRNNPIQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYQITGD 401 Query: 418 LS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEIT 473 + S +G PP+SV + +++ G ++ +S S + G++ ++T Sbjct: 402 QNKVLTPSSFSMSSQGANGSSDLPPISVANIALYIQEKGSAVRDLSYSFDVDGYQGTDLT 461 Query: 474 QLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 LA+HLF RI+ + P+SI W + + +L + E + FAW Sbjct: 462 MLANHLFQRHRIVDWSFTTVPYSIAWCIRD-----DGLMLALTYLREQQ-VFAWAPQSTE 515 Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 K + S S + +V + G++ + RL Sbjct: 516 GKFESTCSIS----EGNEDSAYFIVQRTVNGKQVRYVERL 551 >gi|218886166|ref|YP_002435487.1| hypothetical protein DvMF_1065 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757120|gb|ACL08019.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 692 Score = 408 bits (1048), Expect = e-111, Method: Composition-based stats. Identities = 119/579 (20%), Positives = 200/579 (34%), Gaps = 75/579 (12%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M TT ++SF+AGELSP L+ +R D + +A G RN++ +GP P ++ C Sbjct: 1 MARTTLIQNSFNAGELSP-LMAARGDQARYASGCRVLRNMLLHPHGPAFRRPGLRFMGAC 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R+ F +G +L F ++L++ R PY + + Sbjct: 60 VDETVPPRLVPFVFNEGQAYVLEFAPERLRVWW-RGGLVLGEGGAPLVVPAPYAAEHLPT 118 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V P L D + + F P G+ S + Sbjct: 119 LRWCQSADVLYLVTPHAAPRKLERHGHAD---WRLVAVNFGPRVATPTGLRSTGAPSGTR 175 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 T+ + T + + L A+ + ++ V YR Sbjct: 176 QHRYVITAVSVDTGEESLPTAE-------LAVTAGTPAEGSAVNLAWTAVEGASEYRVYK 228 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 G +G A + D + Sbjct: 229 AGGGASVYGLLGTAATGET--------------------------------YADTGRTPD 256 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 P+ + F+ + YPS V F RL F+GS+ +++ S Sbjct: 257 FAEGPPEHRNPFEG--------------TDDYPSSVQFWQQRLCFAGSRSHPQTIWASRT 302 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 G + + + A+T + + S + WM P + + VG W LS S+ Sbjct: 303 GCYENMDVSRPLQT---DDAVTVTIASETVSAVRWMMPARKLL-VGTGGGEWTLSGQGSE 358 Query: 421 GLSI---DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 S S G PP++VGD ++ V GR ++ S + G+ + T LA Sbjct: 359 PFSPLSCLLEFQSARGSAELPPLAVGDGVLAVQRGGRAVRDFRYSLDVDGYSGADQTILA 418 Query: 477 DHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 +H+ I+ YQ+ PHS+VW ++ + G AE + WH H Sbjct: 419 EHMLRGRNIVDWAYQQSPHSVVWCAMD-----DGTMAGLTLIAEHQ-VAGWHRHDTGGAV 472 Query: 536 YVLSAASFPNDN-RGGTSLWMLVALS-AGEERSFTVRLN 572 L P + GG LW++V G +R + RL+ Sbjct: 473 EALCVVPGPPSDPAGGDELWLVVRRDVDGVQRRYIERLD 511 >gi|254781208|ref|YP_003065621.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] gi|254040885|gb|ACT57681.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] gi|317120673|gb|ADV02496.1| hypothetical protein SC1_gp080 [Liberibacter phage SC1] gi|317120817|gb|ADV02638.1| hypothetical protein SC1_gp080 [Candidatus Liberibacter asiaticus] Length = 578 Score = 407 bits (1046), Expect = e-111, Method: Composition-based stats. Identities = 578/578 (100%), Positives = 578/578 (100%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC Sbjct: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS Sbjct: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL Sbjct: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT Sbjct: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR Sbjct: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF Sbjct: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK Sbjct: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF Sbjct: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA Sbjct: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK Sbjct: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578 >gi|89152436|ref|YP_512269.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10] gi|74055459|gb|AAZ95908.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10] Length = 823 Score = 402 bits (1032), Expect = e-109, Method: Composition-based stats. Identities = 108/582 (18%), Positives = 211/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + + Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESL-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARITA-VNGTT 284 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 IS P + + W AW GYP V ++ RL F+ S +++ Sbjct: 285 ATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|300898435|ref|ZP_07116776.1| conserved domain protein [Escherichia coli MS 198-1] gi|300357902|gb|EFJ73772.1| conserved domain protein [Escherichia coli MS 198-1] Length = 823 Score = 400 bits (1028), Expect = e-109, Method: Composition-based stats. Identities = 108/582 (18%), Positives = 211/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARITA-VNGTT 284 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 IS P + + W AW GYP V ++ RL F+ S +++ Sbjct: 285 ATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ +++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGSDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|327252176|gb|EGE63848.1| phage protein [Escherichia coli STEC_7v] Length = 823 Score = 400 bits (1028), Expect = e-109, Method: Composition-based stats. Identities = 109/582 (18%), Positives = 210/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARISA-ANGTT 284 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 IS P + + W AW GYP V ++ RL F+ S +++ Sbjct: 285 ATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|294493191|gb|ADE91947.1| conserved hypothetical protein [Escherichia coli IHE3034] Length = 823 Score = 400 bits (1028), Expect = e-109, Method: Composition-based stats. Identities = 108/582 (18%), Positives = 211/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARITA-VNGTT 284 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 IS P + + W AW GYP V ++ RL F+ S +++ Sbjct: 285 ATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ ++ + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVINRTVNGQTVRYIERLS 550 >gi|332344346|gb|AEE57680.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 823 Score = 400 bits (1027), Expect = e-109, Method: Composition-based stats. Identities = 109/582 (18%), Positives = 210/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARISA-ANGTT 284 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 IS P + + W AW GYP V ++ RL F+ S +++ Sbjct: 285 ATAEVISYIPSQ--VVGEDNASYKWAKYAWDSINGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLAPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|301046400|ref|ZP_07193560.1| conserved domain protein [Escherichia coli MS 185-1] gi|300301626|gb|EFJ58011.1| conserved domain protein [Escherichia coli MS 185-1] Length = 821 Score = 400 bits (1027), Expect = e-109, Method: Composition-based stats. Identities = 109/582 (18%), Positives = 210/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARISA-ANGTT 284 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 IS P + + W AW GYP V ++ RL F+ S +++ Sbjct: 285 ATAEVISYIPSQ--VVGEDNASYKWAKYAWNSINGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKALTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|298485990|ref|ZP_07004064.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159467|gb|EFI00514.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 716 Score = 399 bits (1025), Expect = e-109, Method: Composition-based stats. Identities = 104/588 (17%), Positives = 205/588 (34%), Gaps = 58/588 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T + +F+AGELSPR+L R D++ + G N PL +G + Sbjct: 1 MAKLTLIQTNFTAGELSPRML-GRVDIARYQNGAKVIENAWPLVHGGVTRRNGTLFCAAA 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ + ++ FGD ++I G +PY + Sbjct: 60 KFPDRRARLVPYVFNTEQAYMIEFGDFYIRIYYPNG------GWTGVELASPYGQTMLAA 113 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 LEY T H P + L I + + ++ F+ P+ GM A Sbjct: 114 LEYVQGADTMFLFHGRVPIYRLKRISNTE---WSLAPAPFVTTPFEERGMDFAF---AMA 167 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAK---NTNYSIGAYIVADDKVYR 237 + A + + +T F D GR I G + + S+ +Y Sbjct: 168 ITNPAAGAASTVTPGAPAFFISDVGREIWAGSGIARITAFGSSGSVSVLVINAFSQTLYP 227 Query: 238 SLT---------TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYV 288 + + T + G + T + SG + V Sbjct: 228 TWSLKGSPQTTCTASAFSPVGATVTLTLGAAGWRPEDVGKFVKLNGGLFQISGFTSSTVV 287 Query: 289 WGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348 I+ + + ++ A S S W + +GYPS T + RL+ +GS Sbjct: 288 NAVIRSI------------ATSVVAAPAGAWSLEASVWNDFDGYPSTGTLYEQRLVAAGS 335 Query: 349 KGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCD 408 +++ S G + +F L + A++ V+ + I + ++ Sbjct: 336 PNYPQTIWESRTGEYLNFELGTK-----DDDAMSFNVSSDQINPIMHVGQVKA-LVTLTY 389 Query: 409 TSLWLLSIS---LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE- 464 + ++ +I + S G P+ +G+ L FV GR+++ ++ + Sbjct: 390 GGEFTVTGGVEKPITPTNIQIKNQSVYGCNGVRPIRIGNELYFVQRAGRKLRAMAYKYDS 449 Query: 465 QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 + +++ L++H ++ + +Q+EP SI+++V S + + + Sbjct: 450 DSYGSPDMSVLSEHATKSGVVDMAFQQEPESILFMVR-----SDGVMATMTVDRD-QDVV 503 Query: 525 AWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 W + + S A P+ +W +V + G+ + R Sbjct: 504 GWARQVTDGAY--ESVAVIPSAEG--DQVWAVVRRTVNGQNVRYLERF 547 >gi|304398395|ref|ZP_07380269.1| conserved hypothetical protein [Pantoea sp. aB] gi|304354261|gb|EFM18634.1| conserved hypothetical protein [Pantoea sp. aB] Length = 824 Score = 398 bits (1022), Expect = e-108, Method: Composition-based stats. Identities = 104/582 (17%), Positives = 199/582 (34%), Gaps = 40/582 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N + + SF+ GE+SP + R DL+ ++ + + RN I +YG L + P + + Sbjct: 1 MSN-SLIQPSFAGGEISPN-VYGRVDLAKYSIALRRCRNFIVRQYGGLENRPGTRFIAEA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG +++ TPY D Sbjct: 59 KYPDRKCRLIPFQFSTVQTYALEFGHNYMRVYKDGGQVLDGN-NQVYELATPYQEADLFE 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L+ HK + P L S+ E+ P+ + + Sbjct: 118 LKITQSADVMTICHKAYAPRELRRFGHA---SWELVEVVTKNGPFEDINI-----DPSVK 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + + + ++ IF G+ L P W + ++G A D Y Sbjct: 170 VYASSYQGNITLNANASIFGSEQVGKLFYLEQVNVDSTPVWETDKAVAVGMTRRAGDNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 +LT G++G W + + + E + D Sbjct: 230 VALTAGKTGTLRPSHTEGAAWD----GWGSNGDNDTGIQWEYQHSGFGIARITSVSSDGY 285 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 + + + + W AW + GYP VT++ RL+F+ S +++ Sbjct: 286 IAAAVVQTYMPNDAVGPTK-ASYKWAKFAWNQVNGYPGTVTYYQQRLIFAASIKYPQTIW 344 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ + + Sbjct: 345 CSKTGDYKDFGKTSPIA---DDDRIVYTYAGKQVNEIRHLIDVGS-LVALTSGGQFQIVG 400 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + + F G + P++V + +F+ G ++ ++ S + G++ +++ Sbjct: 401 DQNKTLTPTAFSFSSQGADGASSVAPITVSNIALFIQEKGSVVRDLAYSFDVDGYQGSDL 460 Query: 473 TQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLFN R++ + P+S W V S LL + E + FAW Sbjct: 461 TVLANHLFNGYRLVDWTFSVVPYSAGWAVR-----SDGMLLCLTYLREQQ-VFAWAPQPG 514 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 K + S +++ V + G + + RL+ Sbjct: 515 EGKFESTCSIS----EGTEDAVYFSVQRTVNGASKRYIERLS 552 >gi|323156125|gb|EFZ42284.1| phage protein [Escherichia coli EPECa14] Length = 823 Score = 398 bits (1022), Expect = e-108, Method: Composition-based stats. Identities = 109/582 (18%), Positives = 210/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIHPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T++ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTANASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARISA-ANGTT 284 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 IS P + + W AW GYP V ++ RL F+ S +++ Sbjct: 285 ATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|221201505|ref|ZP_03574544.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207939|ref|ZP_03580945.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2] gi|221172124|gb|EEE04565.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2] gi|221178773|gb|EEE11181.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 767 Score = 398 bits (1022), Expect = e-108, Method: Composition-based stats. Identities = 126/611 (20%), Positives = 208/611 (34%), Gaps = 58/611 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF AGELSP LL +R D++ + G N I GP V + Sbjct: 1 MPKAAAQQVSFDAGELSP-LLGARVDIAKYPNGCKVMENFIATVQGPAVRRGGKRFVAAV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + + F + DG +L FGD ++ V R A TPY D Sbjct: 60 KDSSKQAWLLPFIVSDGIAYMLEFGDHYIRFYVDRGQL--VNAGGPVEIATPYALADLVT 117 Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 ++ T H +PP LL +F+ ++ F+ P+ GV Sbjct: 118 EDGTFAIRATQSADTMYLFHGAYPPQKLLRTSA---TTFSLQQVTFVSGPFQTINSDEGV 174 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPP-----EWAKNTNYSIGAYI 229 + T +T+ +F D G L + T G Sbjct: 175 -----TVKASGQTGAVTLTATAPVFSQADVGALFYLEQNDNTSVLPWSVHGTILETGLVR 229 Query: 230 VADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI-----TVLNLSSKTSRESASGAVA 284 D+ Y S G + + S+ T+ + + E A Sbjct: 230 RVGDRTYVSTAIGPTAPQVTGSETPTHTRGRRYDGDLTDLANDNYGTIGIEWEYQHSGYA 289 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQ---AGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + G + P + W + + +GYP TF N Sbjct: 290 TVLITSVSDSQHATGTVTTNNPTDPCIIPQSIVDTGTYKWAHALFNAADGYPQMGTFWRN 349 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGE 401 RL + + S F +F+ D A+ + + + WM + Sbjct: 350 RLWMMRDRW----LVGSVSADFENFASKDADQQTDD-SAIVQQLNARQLNKLAWMVES-D 403 Query: 402 GVLVGCDTSLWLLS----ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK 457 +++G W++ +++ R + G PV VG ++FV GR+++ Sbjct: 404 SLIIGMTGDEWVIGPANASQPVSATNLNAARRTSYGSKRIQPVQVGGTIMFVQKAGRKLR 463 Query: 458 YISGST-EQGFRFNEITQLADHLFNQ------RILQLVYQEEPHSIVWVVLEPKDNSFPR 510 F ++T+LADH+ I+ L +Q+EPHSIVW + + Sbjct: 464 DFKYDFSSDNFVSTDVTKLADHITRGRSGTNNGIMSLCFQQEPHSIVWAAR-----ADGQ 518 Query: 511 LLGCRFSAEG--EGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSF 567 L+GC + E + WH H ++ V AS P + LW++V G+ + Sbjct: 519 LIGCTYDEEAGRSDVYGWHRHPDANGF-VECVASMPAPDGASDDLWLIVRRQINGQTVRY 577 Query: 568 TVRLN--LLDD 576 LN L DD Sbjct: 578 VEYLNPALQDD 588 >gi|120601703|ref|YP_966103.1| hypothetical protein Dvul_0653 [Desulfovibrio vulgaris DP4] gi|120561932|gb|ABM27676.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 699 Score = 397 bits (1020), Expect = e-108, Method: Composition-based stats. Identities = 117/582 (20%), Positives = 199/582 (34%), Gaps = 77/582 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T ++SF+AGELSP L+ +R D + + G A N++ +G P ++ Sbjct: 1 MARATIVRNSFNAGELSP-LMAARVDQARYPNGCASLCNMLLHPHGGAWRRPGLRFMGLA 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 R+ F + +L FG + L+I +TP+ + + Sbjct: 60 ADPAGPVRLIPFVFSEAQAYVLEFGPRSLRIWHGG-GLVLGGDGEPFRLETPWAGEQLTA 118 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V PP L D + ++ FLP +G+ VK Sbjct: 119 LRWCQSADMLYLVSHAGPPRRLERHGHAD---WRLVDVSFLPGVSPPEGLHCTVKPAGSR 175 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + + T+ R + + + P + P ++ + ++ V D YR Sbjct: 176 TWTYVVTAVHRESGEESLPTPPLQVT------GPDALSQTASVTLAWTPVQDAGEYRVYR 229 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 G +G+ ++GA Y G D Sbjct: 230 AGGGASVYGFLG--------------------------SAGAGETYTDTGRTPDFDAG-- 261 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 P+++ F +PS F RL F+G++ +++ S Sbjct: 262 ----PPEARNPFSGEG--------------DWPSCAVFWQQRLCFAGTRNGPQTIWASRS 303 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 GA+ +FS+ A+T + + S + W+ P + VG W LS + Sbjct: 304 GAYGNFSVSRPLR---DDDAVTVTIAADTVSAVRWLMPARRLL-VGTGGGEWTLSGQGEQ 359 Query: 421 GLSI---DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 S R S G P+SVGD ++ + GR ++ S + G+ ++T LA Sbjct: 360 PFSPLSCSLERQSSRGSGDVQPLSVGDAVLALQRGGRVVREFRYSLDVDGYAGTDLTILA 419 Query: 477 DHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 +HL RI+ +Q+ P VW V L+ E E WH H+ Sbjct: 420 EHLTRGRRIIDWAWQQSPSGTVWCV-----TEDGGLIAMTRIPEHE-VAGWHRHVTDGAV 473 Query: 536 YVLSAASFPNDNRGGTSLWMLVALSAGEERS-FTVRLNLLDD 576 + G LW+ V G RL+ D Sbjct: 474 LSVCTIPGT----AGDELWVAVRREGGGMVRCCIERLDPPFD 511 >gi|331648168|ref|ZP_08349258.1| conserved hypothetical protein [Escherichia coli M605] gi|331043028|gb|EGI15168.1| conserved hypothetical protein [Escherichia coli M605] Length = 823 Score = 397 bits (1019), Expect = e-108, Method: Composition-based stats. Identities = 107/582 (18%), Positives = 208/582 (35%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARITAANGTTA 285 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 + Q + W AW GYP V ++ RL F+ S +++ Sbjct: 286 TAEVISYIPSQVVGE---DNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|298381710|ref|ZP_06991309.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279152|gb|EFI20666.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 823 Score = 397 bits (1019), Expect = e-108, Method: Composition-based stats. Identities = 107/582 (18%), Positives = 208/582 (35%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARITAANGTTA 285 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 + Q + W AW GYP V ++ RL F+ S +++ Sbjct: 286 TAEVISYIPSQVVGE---DNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|218700982|ref|YP_002408611.1| hypothetical protein ECIAI39_2672 [Escherichia coli IAI39] gi|218370968|emb|CAR18795.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] gi|323948677|gb|EGB44582.1| hypothetical protein ERKG_04900 [Escherichia coli H252] Length = 823 Score = 397 bits (1019), Expect = e-108, Method: Composition-based stats. Identities = 107/582 (18%), Positives = 209/582 (35%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ + IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASVSIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARITAANGTTA 285 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 + Q + W AW GYP V ++ RL F+ S +++ Sbjct: 286 TAEVISYIPSQVVGE---DNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|324008552|gb|EGB77771.1| conserved domain protein [Escherichia coli MS 57-2] Length = 823 Score = 397 bits (1019), Expect = e-108, Method: Composition-based stats. Identities = 107/582 (18%), Positives = 208/582 (35%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T W + + E + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSW----DGWGGSGDDDTGIEWEYLHSGFGIARITAANGTTA 285 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 + Q + W AW GYP V ++ RL F+ S +++ Sbjct: 286 TAEVISYIPSQVVGE---DNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|117624704|ref|YP_853617.1| hypothetical protein APECO1_4049 [Escherichia coli APEC O1] gi|115513828|gb|ABJ01903.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 823 Score = 396 bits (1016), Expect = e-108, Method: Composition-based stats. Identities = 104/582 (17%), Positives = 211/582 (36%), Gaps = 42/582 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 1 MA-ISWIQPSFAGGEIGPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG + ++++ + + + TPYT D Sbjct: 59 KYPNRKCRLIPFQFSTVQTYALEFGHQYMRVIK-DGALVLNSSNVIYEIATPYTEADLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + V Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESV-----T 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + + SIG AD Y Sbjct: 170 VYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R++T G++G T + + + + + + + Sbjct: 230 RAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDIGIEWEYLH-------SGFGIARITAANG 282 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 + ++ + + W AW GYP V ++ RL F+ S +++ Sbjct: 283 TTATAEVISYIPSQVVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIW 342 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++ ++++ Sbjct: 343 ASRTGDYKDFGKSNPTQ---DDDRIIYTYAGRQVNEIRHLIDVGS-LVALTSGGEYVITG 398 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + S F +G PP++V + +FV G ++ ++ S + G++ N++ Sbjct: 399 DQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDL 458 Query: 473 TQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 459 TILANHLFQKHSIVDWCFSIVPYSSAFCIRD-----DGKLLVMTYLRDQQ-VFAWAPQSS 512 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 513 TGKYESTCSIS----EGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|48697202|ref|YP_024932.1| hypothetical protein BcepC6B_gp12 [Burkholderia phage BcepC6B] gi|47779008|gb|AAT38371.1| gp12 [Burkholderia phage BcepC6B] Length = 768 Score = 393 bits (1008), Expect = e-107, Method: Composition-based stats. Identities = 127/614 (20%), Positives = 211/614 (34%), Gaps = 63/614 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF AGELSP LL +R DL+ + G N I GP + + Sbjct: 1 MPKAAPQQVSFDAGELSP-LLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAAT 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + + + F + DG +L FGD ++ V R A TPY D Sbjct: 60 KDSTKQSWLLPFIVADGIAYMLEFGDHYIRFFVNRGQL--VNAGAPVEIATPYALADLTT 117 Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 ++ T H +P LL +F+ + F+ P+ Sbjct: 118 EDGTFAIRATQSADTMYLFHGGYPTQKLLRTSA---TTFSLQPVTFVGGPFAAVN----- 169 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIV 230 N + A T + + +F+P D G L W + Sbjct: 170 SDNNVRVHASAGTGAVTLVASASVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELRR 229 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY--- 287 D+VY G + + ++ T + W S T + GA Y Sbjct: 230 VGDRVYLCTAVGTATPQVTGTE--TPTHTSGSRWDGTGQDESATDEYGSIGAEWEYQHSG 287 Query: 288 -----VWGDIKDVSKDGRSISVAPQSQTLFQAG----VSVVSWFMSAWGEQEGYPSHVTF 338 + G D G + P + W S + +G+P TF Sbjct: 288 YGTVLITGYTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGTF 347 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL + + +S F F D A+ + + + WM Sbjct: 348 WRNRLCLMRDRW----LAMSVSADFETFKTKDADQQTDD-SAIVQQLNARQLNKLAWMVE 402 Query: 399 FGEGVLVGCDTSLWLLS----ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGR 454 + +L+G W++ +++ R + G PV VG ++FV GR Sbjct: 403 S-DSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGSKRIQPVQVGGTIMFVQKAGR 461 Query: 455 RIKYISGST-EQGFRFNEITQLADHLFNQ------RILQLVYQEEPHSIVWVVLEPKDNS 507 +++ + ++T++ADH+ I+ L +Q+EPHS+VW + Sbjct: 462 KLRDFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAAR-----A 516 Query: 508 FPRLLGCRFSAEG--EGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEE 564 +L+GC + E + WH H ++ V AS P + LW++V G+ Sbjct: 517 DGQLIGCTYDEEAGRSDVYGWHRHPDANGF-VECVASMPAPDGASDDLWVIVRRQVNGQT 575 Query: 565 RSFTVRLN--LLDD 576 + LN L DD Sbjct: 576 VRYVEYLNPALQDD 589 >gi|330007163|ref|ZP_08305905.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3] gi|328535510|gb|EGF61970.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3] Length = 825 Score = 393 bits (1008), Expect = e-107, Method: Composition-based stats. Identities = 105/590 (17%), Positives = 202/590 (34%), Gaps = 48/590 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + S + GE+SP L R DL + + + RN I + G + + P + Sbjct: 1 MA-YSLVQPSLAGGEISPSL-YGRIDLEKYQTSLRRCRNFIVRQSGGIENRPGFRFLGSA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R +R+ F L GD ++ + TP+ Sbjct: 59 KYADRYSRLIPFQFSVSQTYALELGDHYFRVWSN--GALVTDGGSPVEVATPWPVSVISE 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L++ H D+PP + + D + + P+ ++ Sbjct: 117 LKFTQSADVMTVCHNDYPPLEIRRYGEAD---WRTAAVTTTSGPFQDLNT-----DDSVT 168 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + T + +T+ IFK G+ + W + + +G + Y Sbjct: 169 VYASGRTGSVTLTASSPIFKSQHVGKLFYMEQKAVDSVGRWETDKDIGVGDECRYQENFY 228 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITV--LNLSSKTSRESASGA----VAPYYVWG 290 R + G + G + +W + R SG + G Sbjct: 229 RCVDGGSN----GTTGTVAPTHTTGDSWDGWGLGGRNGVLWRYLHSGFGVCRITAIAGDG 284 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350 R + + + W AW + +GYP VT++ RL+F GS+ Sbjct: 285 LTATADVVPRQDGEIELPAQVVGSTFATYKWAHYAWNDTDGYPGTVTYYQQRLIFGGSRA 344 Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS 410 +++ S G +++F A+T + I + G+ ++V Sbjct: 345 FPQTIWCSRTGDYHNFYRSNPKV---DDDAITYNYAGRQLNKILHLLDVGQ-LIVLTSGG 400 Query: 411 LWLLSISLSKGLS----IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-Q 465 + ++ + L+ S +G P++VG ++V G I+ + S + Sbjct: 401 EFKVTGDSNGNLTGTGGFAMSGQSFNGSSDLAPINVGSVALYVQQKGSIIRDLFYSFDQD 460 Query: 466 GFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 ++ +++T LA HLFN I +P S+ W S LLG + E + + Sbjct: 461 SYQSSDLTLLASHLFNGYSIRDWALSVQPFSVAWCAR-----SDGMLLGLTYLREQQ-VY 514 Query: 525 AWHTHM-ISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 AWH H + + + S +++ L+ + G + RLN Sbjct: 515 AWHPHPMTNGYVESICSIS----EGQEDAVYALIRRTVNGSTVRYIERLN 560 >gi|221213947|ref|ZP_03586920.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166124|gb|EED98597.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 766 Score = 392 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 127/610 (20%), Positives = 210/610 (34%), Gaps = 57/610 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF AGELSP LL +R DL+ +A G N I GP V + Sbjct: 1 MPKAAAQQVSFDAGELSP-LLGARVDLAKYANGCLLLENFIATVQGPAVRRGGKRYVSAI 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + + F + DG +L FGD+ ++ V R A TPY D Sbjct: 60 KDSGKQAWLLPFIVSDGIAYMLEFGDQYIRFYVNRGQLVNDSA--PVEIATPYALADLVT 117 Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 ++ T H +P L +F + F+ P+ Sbjct: 118 EDGTFAIRATQSADTMYLFHGAYPTQKLSRTSA---TTFELQPVTFVGGPFATVN----- 169 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIV 230 +N+ + + +T++ +F+ D G + P WA + + Sbjct: 170 DNNSIRVQASGQSGDVTLTANADVFRASDVGTLFYVEQEQPTGIVPWAVHAESHVNDIRR 229 Query: 231 ADDKVYRSLTTGRSGDRFGY-----SKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 D+ YR G + + + S E A Sbjct: 230 VGDRTYRCTQIGLNAPQVTGQETPIHTEGRRWDGDGRDPDGDTYGSIGVEWEYQHSGYAT 289 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQ---AGVSVVSWFMSAWGEQEGYPSHVTFHNNR 342 + G + + P + W S + +G+P TF +NR Sbjct: 290 VLITGFVNARQVSATVTTNNPNDPCMIPKPVVDSGTYKWARSLFNSTDGFPQMGTFWSNR 349 Query: 343 LLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEG 402 L + + +S F +F D A+ + + + WM + Sbjct: 350 LCVMRDRW----IAMSVSADFENFKTKDADQQTDD-SAIVQQLNARRLNKLAWMVES-DS 403 Query: 403 VLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY 458 +LVG W++ S + ++ RR + G PV VG ++FV GR+++ Sbjct: 404 LLVGMTGDEWVIGKSNASLALSATNMSARRRTSYGSKRLQPVEVGGTILFVQKAGRKLRD 463 Query: 459 ISGST-EQGFRFNEITQLADHLFNQ------RILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 + ++T++ADH+ I+ L YQ+EPHSIVW + +L Sbjct: 464 FKYDFSSDNYVSTDVTKIADHVTRGRSGTNSGIMSLCYQQEPHSIVWAAR-----ADGQL 518 Query: 512 LGCRFSAEG--EGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFT 568 +GC + E + WH H + V AS P + LWM+V G+ + Sbjct: 519 IGCTYDEEAGRSDVYGWHRHPDVNGF-VECVASMPAPDGASDDLWMIVRRQINGQSVRYV 577 Query: 569 VRLN--LLDD 576 LN L DD Sbjct: 578 EYLNQSLQDD 587 >gi|30387391|ref|NP_848220.1| hypothetical protein epsilon15p12 [Enterobacteria phage epsilon15] gi|30266046|gb|AAO06075.1| 12 [Salmonella phage epsilon15] Length = 825 Score = 391 bits (1005), Expect = e-106, Method: Composition-based stats. Identities = 104/583 (17%), Positives = 206/583 (35%), Gaps = 42/583 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D+S + + K N I +YG + + P + Sbjct: 1 MA-FSWIQPSFAGGEIGPSL-YGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG ++++ + + PY D Sbjct: 59 KYPDRKCRLIPFQFSTVQTYALEFGHNYMRVIKDGAYVLTTS-NVIYELAMPYADTDLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + VK Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQIVDVTTKNGPFEDINVDETVK----- 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + +I AD Y Sbjct: 170 VYASASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI-KDV 295 R+ T+G++G W + + E + + Sbjct: 230 RANTSGKTGTLRPSHTEGMSWD----GWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGL 285 Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 + +S P + + + W AW GYPS V ++ RL F+ S ++ Sbjct: 286 TATADVVSFIPSQ--VVGSANASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTI 343 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 + S G + DF + + + I + G ++ + +S Sbjct: 344 WASRTGDYKDFGKNNPIQ---DDDRIIYTYAGRQVNEIRHLIDVGN-LVALTSGGEYTIS 399 Query: 416 ISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471 + + F +G PP++V + +F+ G ++ ++ S + G++ + Sbjct: 400 GDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTD 459 Query: 472 ITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 +T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 460 LTILANHLFQKHSIVDWSFCIVPYSSAFCIRD-----DGKLLVLTYLRDQQ-VFAWAPQS 513 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + K+ + S +++ +V + G+ + RL+ Sbjct: 514 SAGKYESTCSIS----EGSEDAVYFVVNRTINGQTVRYIERLS 552 >gi|215487813|ref|YP_002330244.1| hypothetical protein E2348C_2746 [Escherichia coli O127:H6 str. E2348/69] gi|215265885|emb|CAS10294.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 825 Score = 390 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 106/583 (18%), Positives = 206/583 (35%), Gaps = 42/583 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SF+ GE+ P L R D+S + + K N I +YG + + P + Sbjct: 1 MA-FSWIQPSFAGGEIGPSL-YGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG ++++ + + PY D Sbjct: 59 KYPDRKCRLIPFQFSTVQTYALEFGHNYMRVIK-DGEYVLTTSNVIYELAMPYADTDLFR 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +++ VH +PP L ++ ++ P+ + VK Sbjct: 118 IKFTQSADVLTLVHPAYPPKELRRYAHD---NWQIVDVTTKNGPFEDINVDDTVK----- 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +T+ IF G+ L P W + +I AD Y Sbjct: 170 VYASASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD-V 295 R+ T G++G W + + E + D + Sbjct: 230 RANTAGKTGTLRPSHTEGMSWD----GWGGTGSDDTGIQWEYLHSGFGIAKITAVSGDGL 285 Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 + +S P + + + W AW GYPS V ++ RL F+ S ++ Sbjct: 286 TATADVVSFIPSQ--VVGSANASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTI 343 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 + S G + DF + + + I + G ++ + +S Sbjct: 344 WASRTGDYKDFGKNNPIQ---DDDRIIYTYAGRQVNEIRHLIDVGN-LVALTSGGEYTIS 399 Query: 416 ISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471 + + F +G PP++V + +F+ G ++ ++ S + G++ + Sbjct: 400 GDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTD 459 Query: 472 ITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 +T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 460 LTILANHLFQKHSIVDWSFCIVPYSSAFCIRD-----DGKLLVLTYLRDQQ-VFAWAPQS 513 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 S K+ + S +++ +V + G+ + RL+ Sbjct: 514 SSGKYESTCSIS----EGSEDAVYFVVNRNINGQTVRYIERLS 552 >gi|78357587|ref|YP_389036.1| hypothetical protein Dde_2545 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219992|gb|ABB39341.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 700 Score = 380 bits (974), Expect = e-103, Method: Composition-based stats. Identities = 113/590 (19%), Positives = 189/590 (32%), Gaps = 89/590 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRD- 59 M T T++SF+ GELSP LL SR D + G RN+ +G V P M+ Sbjct: 1 MSRITLTRNSFNGGELSP-LLSSRIDQQRYTAGCRTLRNMTVYPHGAAVRRPGMRHMGTG 59 Query: 60 ---CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 + R+ F +L G+ +++ + +TP+ Sbjct: 60 LSLQPAGSAAVRLVPFVFSQEQAYVLELGEGVMRVWKDDGLVVSADG-SPVCVETPWKGD 118 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 +SL+Y V + P L D + ++F G+ + Sbjct: 119 ALQSLQYCQSADVMYLVCRQCAPRKLARHAHDD---WRITLLEFGAGLPAPQGLTAAAGG 175 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 A+ + T+ A + + + + ++ + V Y Sbjct: 176 AAEREYAYVVTAVAPDGGEESLPSEA-----VNVTAAASLNVRDM-VRLTWQPVEGAGAY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 + G +GY A V +D Sbjct: 230 CVYKSIAGGGSYGYIGKAAGVPA--------------------------------YEDRG 257 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 + P+ + F +P V F+ RL F+G+ +++ Sbjct: 258 AEPDFGQGPPEYRNPFDGEG--------------RWPGCVQFYQQRLCFAGTDEKPQTIW 303 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S + ++ A+T + + I WM P + VG W LS Sbjct: 304 CSQSANYESMNISSPLR---DDDAVTVTIAADRVNRIRWMMPARRLL-VGTAGGEWQLSG 359 Query: 417 SLSKGLSI---DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 S L+ RR + G P+ +G ++FV GR ++ + E G+ ++ Sbjct: 360 SGDAPLTPVDAQLRRDTMHGSAGLMPLVIGQSILFVQRDGRTVREFRYALESDGYDAGDL 419 Query: 473 TQLADHLFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T LA+HL R I+ YQ+ P S+VW L S L F E E WH H Sbjct: 420 TILAEHLMRGRRIVSWCYQQSPASVVWCAL-----SDGTLAAMTFLREHE-VVGWHRHDT 473 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALS------AGE---ERSFTVRLN 572 +V + + P D +W+ V G E RL Sbjct: 474 DG--FVEAVTAIPGDEG--DEVWLSVRRVRVLHDENGTRQEEVRSIERLE 519 >gi|292670776|ref|ZP_06604202.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541] gi|292647397|gb|EFF65369.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541] Length = 762 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 119/609 (19%), Positives = 215/609 (35%), Gaps = 79/609 (12%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 K SF+ GEL+P L R DL + G + +N+I LRYG P + + + Sbjct: 8 PLKPSFAGGELTPAL-YGRTDLQKYDVGASTLKNMIVLRYGGATRRPGFRHVAKTQG-GK 65 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125 R+ F +L F +++ A T YT D ++Y Sbjct: 66 RARLIPFQYSTEQSYVLEFTAGCIRVFTKGGIVVKDDA--PLVIPTSYTEADLSDIKYTQ 123 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 VH +HPP L D + F+ + P+ + + Sbjct: 124 SADVLFLVHVNHPPMTLTRYGVTD---WKFERMDIAGGPFEDPNTK-----DGLKIGASG 175 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSG 245 + + + F G IRLG + + + I V R + +G Sbjct: 176 VQGEITLKASVDYFTEDMVGSLIRLGH-------TMSGQLKSGIPTTPLVVRCVPSGTVY 228 Query: 246 DR-FGYSKGATYVKDNNITWIT------------------VLNLSSKTSRESASGAVAPY 286 FG+ G+ V+ ++ + T N Sbjct: 229 VESFGFWNGSFIVEKHDKSTDTWIALQEQHANRTQNYTLNYTNKGDDIVEYRVRSEKFDT 288 Query: 287 YVWGD--------------------IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 VW + + ++ + S A + + +SAW Sbjct: 289 SVWSNENERQRGYVTIQTFAQDYYGVARITAVNSATSAAATVTRELADTEATNDFSLSAW 348 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 ++GYP V+F +RL+F+GS+ + + S G +Y+F ++ D A+T ++ Sbjct: 349 SAKKGYPQAVSFFEDRLVFAGSRAKPQTYWASQSGDYYNFWVNTPQQDSD---AITGTLS 405 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGD 444 + I + PFGE +++ + + + G+ PV +G Sbjct: 406 GGQMNGIRAIIPFGEMLML-TSGGEYKVGGGNETFTPTNQKAEPQEYRGINNLTPVVIGG 464 Query: 445 CLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLE 502 +V+V G I+ ++ S + + ++++ LA HLF I+ L YQ+ P+++VW V E Sbjct: 465 RIVYVQHQGSVIRDLTYSYDVDKYTGDDVSLLAAHLFEGHTIVALAYQQTPNTVVWCVRE 524 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAG 562 LLG + E + +AWH H + K + S LW +V Sbjct: 525 -----DGALLGMTYIKE-QDVYAWHKHTTAGKFTDVCTISGDR----EEELWAVVERDGA 574 Query: 563 EERSFTVRL 571 + ++ Sbjct: 575 ---HYVEQM 580 >gi|262043557|ref|ZP_06016670.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039091|gb|EEW40249.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 511 Score = 377 bits (967), Expect = e-102, Method: Composition-based stats. Identities = 107/536 (19%), Positives = 194/536 (36%), Gaps = 35/536 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +W + SFS GE++P L R D++ + + K N I +YG + + P Q Sbjct: 1 MA-VSWIQPSFSGGEIAPSL-YGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTQFIAAA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ F L FG ++++ + TPYT D Sbjct: 59 KYPDRKCRLIPFQFSTVQTYALEFGHNYMRVIK-DGGLVLTTGDVIYELATPYTENDVFG 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L++ VH +PP L ++ +++ P+ + +K Sbjct: 118 LKFTQSADVMTIVHPSYPPKELRRYAHD---NWQIVDVQTTNGPFEDINVDE-----SKT 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVY 236 + A T T +TS IF G+ L P W + + SI AD Y Sbjct: 170 VWASAPTGTITLTSSSAIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIEDIRRADSNYY 229 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R+ T G++G T + SG ++ Sbjct: 230 RANTAGKTGTLRPSHTEGMAWDGWGGTGDDDTGVQ---WEYLHSGFGIVRITAVAGDGLT 286 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 +S P+ + A + W AW GYP+ V ++ RL F+ S +++ Sbjct: 287 ATADVVSRIPE--NVVGADKASYKWARYAWNSVNGYPATVVYYQQRLYFAASPAYPQTIW 344 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S G + DF + + I + G ++V ++++ Sbjct: 345 ASRTGDYKDFGKSNPTQ---DDDRIVYTYAGRQVNEIRHLIDVGS-LVVLTSGGEFVVTG 400 Query: 417 SLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 + + +G PP++V + +F+ G ++ ++ S + GF+ N++ Sbjct: 401 DQNKVLTPSAFSLSSQGSNGCSDVPPIAVSNIALFIQEKGSVVRDLAYSFDVDGFQGNDL 460 Query: 473 TQLADHLFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527 T LA+HLF +R I+ + P S + V + +LL + + + FAW Sbjct: 461 TILANHLFQKRSIVDWAFCIVPFSSAFCVRD-----DGKLLVLTYLRDQQ-VFAWS 510 >gi|309702804|emb|CBJ02135.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 807 Score = 375 bits (963), Expect = e-101, Method: Composition-based stats. Identities = 97/563 (17%), Positives = 200/563 (35%), Gaps = 40/563 (7%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYA 80 + R D++ + + K N I +YG + + P + + + R R+ F Sbjct: 1 MYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGEAKYPTRKCRLIPFQFSTVQTY 60 Query: 81 LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140 L FG ++++ + + + PY D +++ VH +PP Sbjct: 61 ALEFGHNYMRVIK-DGAYVLNSSNVIYELAMPYADTDLFRIKFTQSADVLTLVHPAYPPK 119 Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFK 200 L ++ ++ P+ + VK + A T T +T+ IF Sbjct: 120 ELRRYAHD---NWQIVDVTTKNGPFEDINVDETVK-----VYASASTGTITLTASSAIFG 171 Query: 201 PLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATY 256 G+ L P W + +I AD YR+ T+G++G Sbjct: 172 AEQVGKLFYLEQPAIDSVPVWETSKTTAINDVRRADSNYYRANTSGKTGTLRPSHTEGMS 231 Query: 257 VKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD-VSKDGRSISVAPQSQTLFQAG 315 W + + E + D ++ +S P + + Sbjct: 232 WD----GWGGTGDSDTGIQWEYLHSGFGIARITAVSSDGLTATATVVSYIPSQ--VVGSA 285 Query: 316 VSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCY 375 W AW GYPS V ++ RL F+ S +++ S G + DF + Sbjct: 286 NGSYKWARYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQ-- 343 Query: 376 DPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS---KGLSIDFRRVSGS 432 + + I + G ++ + +S + + F + Sbjct: 344 -DDDRIIYTYAGRQVNEIRHLIDVGN-LVALTSGGEYTISGDQNKVLTPSAFSFSSQGNN 401 Query: 433 GVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQ 490 G PP++V + +F+ G ++ ++ S + G++ ++T LA+HLF +R I+ + Sbjct: 402 GSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTILANHLFQKRSIVDWSFC 461 Query: 491 EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGG 550 P+S + + + +LL + + + FAW + K+ + S Sbjct: 462 IVPYSSAFCIRD-----DGKLLVLTYLRDQQ-VFAWAPQSSTGKYESTCSIS----EGSE 511 Query: 551 TSLWMLVALS-AGEERSFTVRLN 572 +++ +V + G+ + + RL+ Sbjct: 512 DAVYFVVNRTINGQTKRYIERLS 534 >gi|320175038|gb|EFW50151.1| 12 [Shigella dysenteriae CDC 74-1112] Length = 799 Score = 366 bits (938), Expect = 7e-99, Method: Composition-based stats. Identities = 96/555 (17%), Positives = 192/555 (34%), Gaps = 40/555 (7%) Query: 28 SLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDK 87 + + + K N I +YG + + P + + R R+ F L FG + Sbjct: 2 AKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQTYALEFGHQ 61 Query: 88 KLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQD 147 ++++ + + + TPYT D +++ VH +PP L Sbjct: 62 YMRVIK-DGALVLNSSNVIYEIATPYTEADLFRIKFTQSADVLTLVHPAYPPKELRRYAH 120 Query: 148 GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS 207 ++ ++ P+ + V + A T T +T+ IF G+ Sbjct: 121 D---NWQLVDVVTKNGPFEDINIDESV-----TVYASASTGTITLTASASIFGAEQVGKL 172 Query: 208 IRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNIT 263 L P W + + SIG AD YR++T G++G T Sbjct: 173 FYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAVTAGKTGTLRPSHTEGTSW----DG 228 Query: 264 WITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFM 323 W + + E + + + Q + W Sbjct: 229 WGGSGDDDTGIEWEYLHSGFGIARITAANGTTATAEVISYIPSQVVGE---DNASYKWAK 285 Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383 W GYP V ++ RL F+ S +++ S G + DF + Sbjct: 286 YTWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNPTQ---DDDRIIY 342 Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS---KGLSIDFRRVSGSGVYACPPV 440 + I + G ++ ++++ + S F +G PP+ Sbjct: 343 TYAGRQVNEIRHLIDVGS-LVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPI 401 Query: 441 SVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVW 498 +V + +FV G ++ ++ S + G++ N++T LA+HLF I+ + P+S + Sbjct: 402 AVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAF 461 Query: 499 VVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + + +LL + + + FAW + K+ + S +++ +V Sbjct: 462 CIRD-----DGKLLVMTYLRDQQ-VFAWAPQSSTGKYESTCSIS----EGNEDAVYFVVN 511 Query: 559 LS-AGEERSFTVRLN 572 + G+ + RL+ Sbjct: 512 RTVNGQTVRYIERLS 526 >gi|303328570|ref|ZP_07359005.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861336|gb|EFL84275.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 696 Score = 360 bits (924), Expect = 3e-97, Method: Composition-based stats. Identities = 99/576 (17%), Positives = 190/576 (32%), Gaps = 76/576 (13%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 ++ + GE++P L++ R D + G + RN +P+ G + P + D Sbjct: 6 IQNVLNGGEITP-LMRGRVDQPRYGTGAREMRNFVPMPQGGVTRRPGTRFLGMAHGDA-- 62 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 R+ F +L FGDK L++ + K +++PY D L +A Sbjct: 63 ARLIPFVFSATQGRMLEFGDKTLRVWLPDGRLVADENGEPKVFESPYAVGDLHELRFAQS 122 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 H+ + P L D D + + E+ F+P D + V + Sbjct: 123 ADVVYLAHQGYAPRRLSRHADDD---WRWSELAFVPAIAAPDNVSLQVIDRGYNGDNATR 179 Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD 246 T +T+ + + G + + A+ + Y + Sbjct: 180 VYTYAVTA-VDEKTGQESGAGAEVSITAKALNSVSYIIRAAWPAVEGAAYYRVYK----- 233 Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 + G + D + + P Sbjct: 234 ----------------------------KKYGVFGYIGRSDAECSFDDENIGADTEDTPP 265 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366 + + F + +PS V FH RL ++ + ++++LS G F Sbjct: 266 EHKNPFASEG--------------DWPSQVFFHQQRLGWAATANRPITIWLSRPGDFEIM 311 Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDF 426 + A+ + A+ I W+ P + + G + S W LS L+ Sbjct: 312 AASTPPK---DDDAIEATLAATQANRIVWLQPDRQSLTFGTEGSEWTLSAGEGVALTPSN 368 Query: 427 R----RVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481 + + G A VSVG ++++ G+ ++ + + + ++T LA H+ Sbjct: 369 VSFEMQTANGGDNATQAVSVGGGVLYLQRGGKAVRQFAYNYSADKYLGQDVTILARHILR 428 Query: 482 QRI-LQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + +Q+EP++++W L S L G + E + WH H + ++A Sbjct: 429 DAVVTAWAFQQEPYAVLWCAL-----SDGTLAGLTYMPE-QDVMGWHRHDTDGRFEDVAA 482 Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576 W LV G RL+ D Sbjct: 483 MPGTP----DDQTWFLVRRGCG---LCVERLDSFFD 511 >gi|46580124|ref|YP_010932.1| hypothetical protein DVU1714 [Desulfovibrio vulgaris str. Hildenborough] gi|46449540|gb|AAS96191.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233883|gb|ADP86737.1| hypothetical protein Deval_1582 [Desulfovibrio vulgaris RCH1] Length = 697 Score = 353 bits (906), Expect = 4e-95, Method: Composition-based stats. Identities = 117/584 (20%), Positives = 190/584 (32%), Gaps = 75/584 (12%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + +F+ GE+SP LL +R D + G RN +PL GP+ P ++ Sbjct: 1 MGTIYPVQQAFNGGEISP-LLTARADQIRYQTGALTMRNAVPLAQGPVTRRPGLRFMGAA 59 Query: 61 RLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119 + R+ SF L FG +++ + S +PY D Sbjct: 60 KEQGAGPVRLVSFVFSAAQSRALEFGPGYVRVWMDAG--LVSKNGQPYEVASPYGAADIA 117 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPP-WLGDGMISGVKSNA 178 L +A ++HPP L D D T + P L G + Sbjct: 118 GLRFAQSADVIYIASRNHPPRKLSRHADDDWRFITPTFMPTQAAPGALTLGTLGTTPGPG 177 Query: 179 KLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 + S T+ + T + + +G W + + ++ + R Sbjct: 178 NETYSYKVTAVSATTGEESL--ASPEGTITTTAMSSTYWVRVSWAAVPGAVEYRVYKRRY 235 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 G G G D + Sbjct: 236 GVFGFIGRAVGGDTF--------------------------------------FDDRNIG 257 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 + P+++ F A YP V F RL F+GS L+V+LS Sbjct: 258 ADTEDTVPEAKNPFTAAGE--------------YPGLVFFWQQRLGFAGSDKRPLTVWLS 303 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS--- 415 AF + + D + + + W + +G + W LS Sbjct: 304 QSAAFENLAASRPPQDDDG---IEATLAGQRQNRFVW-IEGDRTLCLGTEGGEWTLSGQE 359 Query: 416 ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQ 474 S+ F+ G P V GD L++V G ++ + S E G+ ++T Sbjct: 360 GGPVTPTSLQFQSHGVRGSEGVPAVRAGDSLLYVQRGGGVVREFTYSFERDGYVAPDLTL 419 Query: 475 LADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 L L +++ YQ+ PHSIVW VL+ L F E WH H Sbjct: 420 LTGVLRGRKVRAWAYQQSPHSIVWCVLD-----DGTLAALTFLREH-DVVGWHRHDTDGV 473 Query: 535 HYVLSAASFPNDNRGG-TSLWMLVALS-AGEERSFTVRLNLLDD 576 ++ + GG ++WMLV + G+ER + R+ D Sbjct: 474 VEDVTVIPGGDATAGGTDTVWMLVRRTVGGQERRYVERMAPFFD 517 >gi|167032763|ref|YP_001667994.1| hypothetical protein PputGB1_1755 [Pseudomonas putida GB-1] gi|166859251|gb|ABY97658.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 774 Score = 352 bits (902), Expect = 1e-94, Method: Composition-based stats. Identities = 94/576 (16%), Positives = 193/576 (33%), Gaps = 83/576 (14%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 T + SFSAGE++P +R DL+ + + RN + L G + + + + Sbjct: 2 TEVIQPSFSAGEVAPA-TYARVDLARYYTALKTCRNFVVLPEGGAQNRSGTRFITEVKDS 60 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R+ F +L FG+ ++ + + + +PYT L++ Sbjct: 61 AARTRLIPFQFSTEQTYILEFGNLYIRFISMGGQV--VSGVTPYEIASPYTTAQLPDLKF 118 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNA---KL 180 VH DHPP L + ++T I F P G+++ ++ Sbjct: 119 TQSADVMTIVHPDHPPRELSRLAP---TNWTLTAITFEPGIAAPTGLVATARTGGSGDTT 175 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 T+ + I+ + P + A+ + ++ Sbjct: 176 EYQYKVTAVSSISEGSVESWASNTATVNSFDDKP--------GATLAWTAVAGADHYNVY 227 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 +S FG+ + V N+I + Sbjct: 228 KNKSSGVFGFIGQSAGVTFNDI---------------------------------NITPA 254 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + + P F G + PS V ++ R+ F+ S+ + +V++S Sbjct: 255 TDNTVPIGYNPFADGNN---------------PSVVGYYQQRMAFAASRANPQTVWMSRT 299 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 G F++F D + + + I + E + + + + S S Sbjct: 300 GDFHNFGYSDPNKDDDG---IEFVIASRQVNQIRHLVSLRELLAMTSGAEIAITGSSDSG 356 Query: 421 GLSIDFRR--VSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLAD 477 + S G P + +++ G ++ ++ + GF+ +++ L+ Sbjct: 357 ITPANVSAVEQSYFGSSDVIPAIYANTALYIQARGGKLSTLAYNYVSDGFQPQDVSVLSS 416 Query: 478 HLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 HL I + P+ ++W+V LLG F + + + W H Sbjct: 417 HLLRGFTIQDQAFALAPNGVLWLVRN-----DGMLLGFTFLPDQQ-VYGWSWHDTDGA-- 468 Query: 537 VLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 V + AS P D+ +L+M+V + G + + R+ Sbjct: 469 VEAVASVPEDD--EDALYMIVRRTINGVTKRYIERM 502 >gi|212703239|ref|ZP_03311367.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098] gi|212673505|gb|EEB33988.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098] Length = 694 Score = 351 bits (900), Expect = 2e-94, Method: Composition-based stats. Identities = 104/582 (17%), Positives = 187/582 (32%), Gaps = 79/582 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T++ + GE+SP LL+ R D ++ G + RN +P+ G + P + Sbjct: 1 MP-VFHTQNVLNGGEISP-LLRGRVDQPRYSTGAREMRNFVPMPQGGVTRRPGTRYLGTA 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 D R+ F +L FGD+ +++ + K +++P+ D ++ Sbjct: 59 LGDGG--RLVPFVFSATQGRMLEFGDRAMRVWLPDGRVVADEEGAPKIFESPFAAADLRA 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + YA F H + P L D D + + E+ F+P + Sbjct: 117 VRYAQSADVIYFAHPGYAPRKLARHADDD---WRWSELTFMPAIATPKKPALSTVGTPEG 173 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 T D + SI Sbjct: 174 DKKTDYTYCVTAIDDKGQESSPSEPASISAQA---------------------------- 205 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + + ++ T V G I D + Sbjct: 206 ----LNSVDFHIRISWEAVEGATGYRVYKKKMGVFGYIGKGGADET----YIDDKNIGAD 257 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + P+ + F+ + YPS V FH RL F+ S ++++LS Sbjct: 258 TEDTPPEYEDPFEGEGN--------------YPSQVFFHQQRLGFAASNSRPITIWLSRS 303 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-- 418 G F + A+ + AS I W+ P + G + S W L S Sbjct: 304 GEFESMAKSTPPK---DDDAIEVTLAATQASRIVWLQPDRSALAFGTEGSEWTLEPSEGV 360 Query: 419 --SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475 + + + + G A +SVG +++V I+ + + + ++ L Sbjct: 361 ALTPATASFQLQTTNGGSDAVAALSVGGSVLYVQRGAGAIREFAYNYSADKYLGQDLNIL 420 Query: 476 ADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 A H+ ++ +Q+EP++++W VL S L G + E E WH H + Sbjct: 421 ARHMLRDVDVVAWSWQQEPYAVLWSVL-----SDGTLAGLTYMKEQE-IVGWHRHTTAGD 474 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576 ++ +W LV + F RL D Sbjct: 475 FVDVAGIPGTP----DDQVWFLVRRGG---QVFVERLEPFFD 509 >gi|262043403|ref|ZP_06016528.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039229|gb|EEW40375.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 664 Score = 350 bits (897), Expect = 5e-94, Method: Composition-based stats. Identities = 98/576 (17%), Positives = 187/576 (32%), Gaps = 88/576 (15%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D+ +A G N + + G ++ P Q + Sbjct: 2 RANLIKTNFTAGEISPRLM-GRVDIDRYANGAKTLENSVVVVQGGVMRRPGSQFVAATKY 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + +R+ + +L FGD L+I + +PYT S+ Sbjct: 61 GDKKSRLIPYVFNRTQAYILEFGDGYLRIYQ-DGKQLVNDDNTPYEIASPYTSDMLPSVN 119 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y T VH+D P+ L D + + F+ P+ Sbjct: 120 YVQGADTMFLVHQDVKPYRLQRRGQTD---WVLEPAPFIVEPF----------------D 160 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 DT +K F G I L E Sbjct: 161 EVRDTPQKWCKPSVKEF----VGSEITLTLSDDE-------------------------- 190 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 G + D + + + + G I+ + Sbjct: 191 ---PPEGSEDPPPFTGDGWVPEDVGSYVRINSGLVLIKSVTSAQVAVGTIRTDLSATQ-- 245 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 A + S W ++ GYP VT + RL+ +GS +++ S G Sbjct: 246 ----------AASPGAWTREDSVWTDEFGYPGAVTLYQQRLVLAGSPRYPQTIWWSESGV 295 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS--- 419 + F L D A++ ++ + I + ++ + ++ Sbjct: 296 YLSFELGT-----DDDDAISFTLSSDQLNPIVHLAQMNT-LIALTYGGEFTITAGNDAAI 349 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLAD 477 +I + S G PV VG ++FV GR++ ++ + + N++T LA+ Sbjct: 350 TPTNISVKNPSPYGCNGIRPVRVGTEIMFVQRSGRKLYAVAYDPDSYVAYSANDMTVLAE 409 Query: 478 HLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537 H+ ++ + YQ++P + W+V ++ + AW + S Sbjct: 410 HITEGGVIDMAYQQQPDAFTWLVRN-----DGVMVTMAIDR-AQNVVAWSRQITSGAF-- 461 Query: 538 LSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 S A+ P+ ++ +V + G+ + + Sbjct: 462 ESVATIPSAT--DDVVYAIVRRTVNGQTVRYVEMFS 495 >gi|262043657|ref|ZP_06016766.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038995|gb|EEW40157.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 758 Score = 348 bits (891), Expect = 2e-93, Method: Composition-based stats. Identities = 106/602 (17%), Positives = 185/602 (30%), Gaps = 51/602 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K SF+AG LSP ++ + D A V +N IPL GP Q Sbjct: 1 MSKIRPIKRSFNAGILSP-VMYGQVDFDKWASAVKYMKNFIPLPQGPARRRGGTQYAGSV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + SF +L FG ++ + TP+ D Sbjct: 60 KNSSDRVWLASFQFSTTEAFILEFGPGYIRFWFNHAQLL-DDENNILEVSTPWGAGDLTR 118 Query: 119 ---KSLEYAVFGSTAVF--VHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISG 173 L + ++P + L +++ E F P+ Sbjct: 119 NGKFGLSLQQSADVIYITCTNGNYPVYKLTR---NTNTNWSLAEASFSGGPFADINSDKS 175 Query: 174 V------------KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNT 221 N + TS IT++ IF+ L G + +T Sbjct: 176 SVVYTDQFRIWSEDGNDLPDGTPTTTSLCNITANTDIFQALHVGCLFYIEASTDAVDDDT 235 Query: 222 --NYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT----VLNLSSKTS 275 + I A+ + + + RS ++ T + TW + + Sbjct: 236 GHSGYIPAWAAGTTETFSTGVFCRSDGKYYEDMDGTKTGNTQPTWTAGAHRDGSGGDASL 295 Query: 276 RESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH 335 + G + S G+ ++ P ++ + W + YP Sbjct: 296 WRYSGGGWGIIEITAVNSATSATGKIVTELP--PSVRNTVGKTYKYAFGDWSDVLRYPQF 353 Query: 336 VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395 F RL+F+G ++ S G +FS + ++ + D T+ W Sbjct: 354 AAFFRGRLVFAG----RQKIWSSVAGDLQNFSPMTNGYEAESDDSINDRIDDTQ-DTMQW 408 Query: 396 MHPFGEGVLVGCDTSLWLLSISLSKGL----SIDFRRVSGSGVYACPPVSVGDCLVFVCG 451 + + +G + + + S G + D + FV Sbjct: 409 LVASAGKIFIGTAGYEFSYGEQSLTSVFGAGNTKVELNSTIGSNEVQAERLFDRVAFVQR 468 Query: 452 VGRRIKYISGST-EQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPR 510 GR++ + + F LA HLF I+ L YQ+EP+ I+WV+LE Sbjct: 469 AGRKVMIAAYDSGSDSFSATNSCILAPHLFTSEIIALAYQQEPNRILWVLLEEGKLLGLT 528 Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569 + WH H V S P+ + G LWM+V + G + Sbjct: 529 ------YDAEQNITGWHEHATGGA--VESIKVIPDIDGGRDELWMVVKRTINGATVRYLE 580 Query: 570 RL 571 + Sbjct: 581 YM 582 >gi|225157020|ref|ZP_03724959.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2] gi|224802748|gb|EEG20999.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2] Length = 773 Score = 339 bits (870), Expect = 6e-91, Method: Composition-based stats. Identities = 102/616 (16%), Positives = 189/616 (30%), Gaps = 75/616 (12%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 ++F+AGE +P+L R DL + + N+ + YG + +R Sbjct: 7 NNFTAGEWTPKL-DGRSDLQKYDAACRRLENMRVMPYGGARFRSAFGYVAKTKSAATPSR 65 Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 + F +L + L++ ++ +PY +++Y Sbjct: 66 LMPFQFSTEQKFMLEWAHLALRVYSAGAAPALLQ-----EIASPYPAAAVFAIQYRQIND 120 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 VH D+P L D D + + + + PP L + + KLS+S D Sbjct: 121 VVYLVHPDYPVQRLARHADAD---WRLEAVDWAFPPMLDENVTET-----KLSLSAVDGV 172 Query: 189 TARITSDMKIFKPLDKGRSIRL--------GCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 +T+ +F+P G L + A V D S Sbjct: 173 NVTMTASAALFQPGHVGSYWELRHLKEAASTSVSLATTSGGPFHSAAISVQGDWTANSTE 232 Query: 241 T--GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD---- 294 G G T+ T + N+S+ +E + Y GD Sbjct: 233 RWYGTLSIERSLDGGTTWETVRKFTAESDRNISASGHQEELAQFRLKYQPTGDPFGAGVW 292 Query: 295 ---------------------------VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWG 327 V+ S V + W SAW Sbjct: 293 VGKAPTNYVKARAMLETTDAYVTALVKVTAYTDSTHVKVTVIDKAATVAATDIWCESAWS 352 Query: 328 EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTD 387 G+P + + RL+F G++ +++ S F +F D A+ Sbjct: 353 PYRGFPRTIGLYEQRLIFGGTRHQPNTMWGSKTDDFENFKYGE-----DDDAAVAYTFAA 407 Query: 388 FSASTIHWMHPFGEGVLVGCDTSLWLLSISLS---KGLSIDFRRVSGSGVYACPPVSVGD 444 + + W+ + + + +I R S +G PV V D Sbjct: 408 SEQNNVQWVESLKRIQAATTAREFTVAAGNTDEPLTPSNIVVRSESANGAAHLQPVLVND 467 Query: 445 CLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEP 503 +++V R++ ++ S E G+ ++T LA + + QL + +P ++ V Sbjct: 468 AILYVERQSRKVMEMAYSIEKDGYASVDLTLLAAPVTESGVKQLAFARQPDPLLLAV--- 524 Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AG 562 L + + AW + + ++ +W +V + G Sbjct: 525 --TENGNLAVLTYDRP-QDVTAWARWITNGAFESVATLQGTP----EDEIWAVVRRTIGG 577 Query: 563 EERSFTVRLNLLDDFK 578 RL D K Sbjct: 578 VPVRTIERLTPETDSK 593 >gi|212703338|ref|ZP_03311466.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098] gi|212673248|gb|EEB33731.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098] Length = 703 Score = 338 bits (866), Expect = 2e-90, Method: Composition-based stats. Identities = 106/592 (17%), Positives = 185/592 (31%), Gaps = 97/592 (16%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 H+F+ GE+SP +L +R DLS + V N++P +G + P Sbjct: 2 RIALHNFTGGEVSP-ILAARYDLSRYGSSVQCMENMLPGLHGDVRRRPGTLFLGSL---E 57 Query: 65 RSNRVFSFSIPD--GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + FS +LV L I + + + TPY + + Sbjct: 58 GEAVLLPFSFNALAEQNFVLVLSGHSLCIADIHGFDRQT--GALPRLPTPYEARHLLEIC 115 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDK-------------ISFTFDEIKFLPPPWLGDG 169 A G T H +P H L+ D +T + + Sbjct: 116 AAQVGDTVYLAHTAYPLHKLVRSTYSDPEAPLPDNAIRSHGYRWTLEAVALNSSLPAPQA 175 Query: 170 MISGV----KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI 225 + + ++ K + G G HP +W I Sbjct: 176 PDCTFVRGNNDDDAGLGYTLRYKIVAVDANGKQSLASEAGSC--DGKHPSDWVVGNRTDI 233 Query: 226 GAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 V Y +G+ ++ + Sbjct: 234 SWTAVEGATEYNIYREE--AGYYGFIGVSSGTTFS------------------------- 266 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLF 345 D + + + F G + PS V FH R++ Sbjct: 267 --------DNNYQADTADTPREDWDPFADGNN---------------PSVVAFHQQRMVL 303 Query: 346 SGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV 405 +G++ + YLS G F +F + + S I W FG+ + + Sbjct: 304 AGTRDSPQAFYLSRSGDFENFRKSRPLQ---DDDPVEYLIASGSIDAIAWAASFGDLL-L 359 Query: 406 GCDTSLWLLSISLSKGLS--IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST 463 G S + S + S I S G P+ +G+ ++ V G ++ + S Sbjct: 360 GTSGSEYKASGNGSAITPGNITITAQSYWGSAGLAPIIIGNAILHVQRHGAHVRDLFYSL 419 Query: 464 E-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGE 521 E G+ N+++ LA HLF R+ Q YQ+ P S++W+V + LL + E Sbjct: 420 EKDGYAGNDLSILAPHLFEGHRLRQWAYQQTPGSVLWIVRD-----DGLLLALTYLKEH- 473 Query: 522 GDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571 + W H + + + + S P+ L ++V + G R RL Sbjct: 474 DIWGWSRHPTAGEVLSVCSISGPDS----DELLLVVRRRDADGGSRYCLERL 521 >gi|332160974|ref|YP_004297551.1| hypothetical protein YE105_C1352 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665204|gb|ADZ41848.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862130|emb|CBX72294.1| hypothetical protein YEW_AK02310 [Yersinia enterocolitica W22703] Length = 657 Score = 338 bits (865), Expect = 2e-90, Method: Composition-based stats. Identities = 91/575 (15%), Positives = 185/575 (32%), Gaps = 93/575 (16%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D++ +A G N + + +G ++ P + + Sbjct: 2 RANLIKTNFTAGEISPRLM-GRVDIARYANGAKTVENAVCVIHGGVMRRPGSRFAAKAKF 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + R+ + +L FG+ ++ + +PYT SL Sbjct: 61 GDQKARLIPYVFNRSQAYVLEFGNGYVRFYQNGAQI--GAGSTPYEIASPYTSAMLSSLN 118 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y T VH+D PP+ L D + + F+ P+ Sbjct: 119 YVQGADTMFLVHQDVPPYRLQRKGQTD---WVLEPAPFIVKPFDEIR------------- 162 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 DT +K F G +I L E Sbjct: 163 ---DTPEKWCKPSVKEF----VGSAITLTLSDAE-------------------------- 189 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 G + + + + G I+ V Sbjct: 190 ---------SGGALTGAGWVGADVGSYVRINSGLVHIQAVTSAAVATGVIRTVLSA---- 236 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 + + + + W + GYP T + RL+ +GS ++++S G Sbjct: 237 --------VQSSSPGAWTREDAVWSAEFGYPGAATLYQQRLVLAGSPKYPQTIWMSETGI 288 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI--SLSK 420 + F L D A++ V+ + I + + + + S Sbjct: 289 YLSFELGT-----DDDDAISFTVSSDQINPIVHLAQMNTLIALTSTGEFTITGGGESAIT 343 Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLADH 478 +I + S G + PV VG ++F+ R++ ++ + + N+++ L++H Sbjct: 344 PTNISVKNPSPYGCNSIKPVRVGTEIMFMQRANRKLFAVAYDPDSFVAYSANDLSVLSEH 403 Query: 479 LFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 + + + YQ+EP + +W+ + +L + AW + + + Sbjct: 404 ITLSGAVDMAYQQEPDAFIWMTR-----ADGQLAVATIDR-AQDVIAWSRQVTTGAY--E 455 Query: 539 SAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 S + P +++LV G+ + + Sbjct: 456 SVVTIPAST--NDVVYVLVKRVINGQIVRYVEVFD 488 >gi|187476936|ref|YP_784960.1| phage protein [Bordetella avium 197N] gi|115421522|emb|CAJ48031.1| phage protein [Bordetella avium 197N] Length = 681 Score = 335 bits (858), Expect = 1e-89, Method: Composition-based stats. Identities = 93/577 (16%), Positives = 178/577 (30%), Gaps = 81/577 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N + SF GE+SP + R D + G+A RN + GP+ + R+ Sbjct: 1 MSNVRVLQRSFGGGEISPE-MFGRIDDVKYQSGLAICRNFVVKPQGPVENRAGFSFVREV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F+ ++ G + + PYT D S Sbjct: 60 KDSTKKVRLIPFTYSVTQTMVIELGAGYFRFHTDGGTLL--NGDTPYEIANPYTEADLFS 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK- 179 + Y VH ++ P L I D + I F+ + G+ + + Sbjct: 118 IHYVQSADVLTLVHPNYAPRELRRIGATD---WQLATIAFMSSVAMPTGVTATSNNKGTD 174 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + T+ C + +I + Y Sbjct: 175 YTYRYVVTALDAEGKTESAPSSAGI-------CANNLFTNGGANTIAWSAASGASRYNVY 227 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + T + D+NI + + +A+G Sbjct: 228 KEQGGLYGYIGQTTGTSLVDDNIAPDLSVTPPIYDAVFNAAG------------------ 269 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 YP+ V++ R F+G+ +++++ Sbjct: 270 -------------------------------DYPAAVSYFEQRRCFAGTINKPQNIWMTR 298 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS--LWLLSIS 417 G S + V A+ I + P E +L+ + ++ Sbjct: 299 SGTESAMSYSLPVR---SDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355 Query: 418 LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 +I R S G PV V + ++ G ++ ++ + + GF +++ Sbjct: 356 AVTPTTISVRPQSYVGATDVQPVVVNNTAIYGAARGGHVRELAYNWQANGFVTGDLSLRC 415 Query: 477 DHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 HLF+ IL + Y + P IVW + +S +LLG + E + AWH H Sbjct: 416 AHLFDNLNILDMAYAKAPQPIVWFI-----SSSGKLLGLTYVPEQQ-IGAWHQHDTEGVF 469 Query: 536 YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRL 571 + + L+++V G+E + R+ Sbjct: 470 ESCAVVA----EGNEDRLYVVVRRIIGGKEVRYIERM 502 >gi|282848883|ref|ZP_06258273.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC 17745] gi|282581388|gb|EFB86781.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC 17745] Length = 772 Score = 333 bits (854), Expect = 4e-89, Method: Composition-based stats. Identities = 101/611 (16%), Positives = 211/611 (34%), Gaps = 68/611 (11%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 ++ +F+ GE+SP + SR DL + + ++ N++ YG + Q + Sbjct: 5 IYISQLAFTTGEVSPD-VSSRFDLEQYKSALLEAENVVIRPYGAVAKRQGSQYVGQVKYS 63 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 + R+F F+ +L FGDK +++ G TP+T L Sbjct: 64 DKPTRLFEFTTNTNNSFMLEFGDKYIRVWNYGVY-------TGIEVTTPFTSDILFDLNC 116 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 + G +P L D D + + K P+ V S ++ Sbjct: 117 SQSGDVMFICSGKYPIQTLSRYSDTD---WRLEAYKLTEQPYDTINTD--VNSTVTVTGD 171 Query: 184 QADTS-------------------TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS 224 +S A T + + RS G + N NY+ Sbjct: 172 TIRSSKDLFNADMVGMVMQLGYFVAAVHTKNTGTVVEKKEKRSFMGGFNKWNEYNNINYN 231 Query: 225 IGAYIVADDKVYRSLT----TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE--- 277 + +Y D ++ T TG + + G T+ + N++ E Sbjct: 232 VESYSTDQDLAWKFTTHGTWTGTVKLQITTNNGTTWKDYRTYSSNNDYNVTDAGKIEPNA 291 Query: 278 -------------SASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS 324 + ++ PY WG ++ + + + W M Sbjct: 292 KLRIQSDIKSGECNVDLSILPYTTWGIVEFKEFVDSKTMKINILNGIVENE-ATSKWKMG 350 Query: 325 AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTA 384 +WG GYP TF+ +R + + + + +++S G + +F ++ G ++T Sbjct: 351 SWGRSNGYPKLCTFYQDRFVVAATNKNPNYIWMSRTGDYPNFGVEKVEGTITDDSSITLP 410 Query: 385 VTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS-LSKGLSIDFRRVSGSGVYACPPVSVG 443 V + I + P + +++ + W++S + + + + G +C P +G Sbjct: 411 VINRKMYEIRHLVPAND-LIILTSGNEWIVSGDKTITPTNCNLKTQTQRGALSCEPQFIG 469 Query: 444 DCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRIL-QLVYQEEPHSIVWVVL 501 + VFV G ++ + S E + ++T + Y ++P SI++ + Sbjct: 470 NRCVFVQERGGTVRDMGYSYESDNYTGQDLTLFVKTRVRGYLTITSAYAQDPDSIIYYIR 529 Query: 502 EPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS- 560 + + E + + W + + K+ + S SL+ L+ + Sbjct: 530 N-----DGEINCLTYIPE-QKVYGWSHFVTNGKYLYCESVS----EGEQDSLYTLIERTL 579 Query: 561 AGEERSFTVRL 571 G++ R+ Sbjct: 580 QGKKVKCIERM 590 >gi|295096862|emb|CBK85952.1| hypothetical protein ENC_24250 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 662 Score = 333 bits (852), Expect = 7e-89, Method: Composition-based stats. Identities = 96/576 (16%), Positives = 187/576 (32%), Gaps = 90/576 (15%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D++ +A G N + + G +V P + + Sbjct: 2 RANLIKTNFTAGEVSPRLM-GRVDIARYANGAKIIENAVVVVQGGVVRRPGTRFAAATKH 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + +R+ + +L FGD ++I + +PYT ++ Sbjct: 61 GDKKSRLIPYVFNRSQAYMLEFGDGYMRIFQ-NGKQLVNEDNTPYEIASPYTADMLPAVN 119 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y T VH+ PH L D + + F+ P+ Sbjct: 120 YVQGADTMFLVHQSVKPHRLQRRGQTD---WVLEPAPFIVEPF----------------D 160 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 DT +K F G I L + Sbjct: 161 EVRDTPQKWCKPSVKEF----VGSEITLTLSDAD-------------------------- 190 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 GD ++ +N + S VA + D+ Sbjct: 191 -PGDNETPPFTGAGWVAQDVGSYVRINEGLVLIKSITSAQVAVGTIRSDLSATQAAS--- 246 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 + S W + GYP VT + RL+ +GS +++ S G Sbjct: 247 -------------PGSWTREDSVWTNEFGYPGAVTLYQQRLVLAGSPKYPQTIWWSETGV 293 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS--- 419 + F + + A++ ++ + I + ++ + ++ Sbjct: 294 YLSFEIGT-----EDDDAISFTLSSDQLNPIVHLAQMNT-LIALTYGGEFTITSGNDAAI 347 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLAD 477 +I + S G PV VG ++FV GR++ ++ + + N++T LA+ Sbjct: 348 TPTNISVKNPSPYGCNGIRPVRVGTEIMFVQRAGRKLYAVAYDPDSFVSYSANDMTVLAE 407 Query: 478 HLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537 H+ +L + YQ++P + +W+V + + AW + + Sbjct: 408 HITAGGVLDMAYQQQPDAFIWMVRADG------VAVTMAIDRAQDVIAWSRQVTAGAF-- 459 Query: 538 LSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 S A+ P+D ++ +V G+ + + Sbjct: 460 ESVATIPSDT--DDVVYAIVRREINGQTVRYVEVFD 493 >gi|294648405|ref|ZP_06725904.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825710|gb|EFF84414.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 706 Score = 333 bits (852), Expect = 8e-89, Method: Composition-based stats. Identities = 116/578 (20%), Positives = 194/578 (33%), Gaps = 55/578 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K++F++GELSP + R DL + G + N +P+ G L + Sbjct: 1 MAKINLIKNNFTSGELSPHIWM-RTDLQQYRNGTKEMLNFLPIIEGGLKRRGGTE---AL 56 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F I LL+F ++ ++ + + K+ TPYT +D K Sbjct: 57 AITAGAIRILPFIISHSTAYLLIFKPNQIDVLDINGTVV-------KSLSTPYTAQDIKE 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + Y H HP L D ++++D F PP V++ A Sbjct: 110 ISYTQNRYQFYIAHSKHPLAWLR--ASEDLTNWSYDPFDFYVPPLEE------VETPALP 161 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 S + T + D + + G N Y A + T Sbjct: 162 LKSNEKNAGKVATLTASPYNIYDNSKRYQAGEICHHTINNVKYYFRALRITQGNTPSFGT 221 Query: 241 TGRSGDRFGYSKGATYVKDNNITW-ITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 +G Y + T + T + V+P V G+I Sbjct: 222 SGPEASPDYYWETTTVTEAQAFTAADVDKFVFINEGIVRIDTYVSPSTVTGEILVKLST- 280 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 +A + + + GYP VT + RL+ +G+K V+LS Sbjct: 281 -----------DIEAIANAWTLKQDIFEVSLGYPRAVTMYQQRLVIAGTKTYPNYVWLSR 329 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL- 418 G +F + T + + + + + G+ V S ++S Sbjct: 330 VGDVTNFLP-----TTSDGDSFTVSASSDQLTNVLHLAQSR-GICVMTGGSELVISSQNS 383 Query: 419 -SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 + + S P+ VG L+FV RI+ + NE+T LA Sbjct: 384 MTPTNTSILEHTSFGSTENIKPIKVGSELIFVQRGAERIRTLLYDYSIDSLTSNELTVLA 443 Query: 477 DHLFN--QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 H+ ++VY EP SI+W VL +L + AW TH I Sbjct: 444 SHIAKKSGGFKEMVYCAEPDSIIWFVL-----GNGKLASLT-LNREQSVIAWSTHDIGGT 497 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 VLS S P+ L+ LV + + ++ Sbjct: 498 --VLSLTSLPSTTGA-DRLYFLVNRNGTVQ---IEQMK 529 >gi|303327644|ref|ZP_07358084.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302862005|gb|EFL84939.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 681 Score = 329 bits (842), Expect = 9e-88, Method: Composition-based stats. Identities = 102/569 (17%), Positives = 184/569 (32%), Gaps = 84/569 (14%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 +F+ GE++P L +R DL +A + N +P +G P + + Sbjct: 7 NFTGGEVTPTL-SARYDLGRYANSLKIMENFLPNLHGDAYRRPGTYFLENL---GEGCVL 62 Query: 70 FSFSIPDG--GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 FS L FG+K L+IV V ++PY D + YA G Sbjct: 63 LPFSFNAEAGQNFALAFGEKSLRIVNVNGYVVAE------AMESPYALADVPEISYAQVG 116 Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187 HKD+ H ++ +++ + + + Sbjct: 117 DVVYLAHKDYALHKVVRTGSAPAYAWSIGTVALNTSLAAPAAPTAAWQGGGGSYTL--RY 174 Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDR 247 + + +D K P G + G +P +W + + + V Y Sbjct: 175 KVSAVDADGKESLPSAVGSTAS-GKYPTDWTEGNHCVLSWQAVEGAAEYNIYRESAGYYG 233 Query: 248 FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQ 307 F T D N Sbjct: 234 FIGIAQGTSFDDQNYEADIA---------------------------------------- 253 Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 W A G P VTFH R++ +G++ S Y+S G F +F Sbjct: 254 -------DTPKEDWDPFADGNN---PGTVTFHQQRMVLAGTRNSPQSFYMSRTGDFENFR 303 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL--SID 425 + + + I W FG+ + +G ++ + + + Sbjct: 304 KSRPLQ---DDDPVEYQLASGTVDGIVWAASFGDLL-LGTASAEYKATGDNGAITAKNCT 359 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQ-R 483 S G P+ +G+ ++ G R++ + S E G+ N+++ LA HLF+ Sbjct: 360 ITAQSYWGSAKIAPIIIGNSVMHCQRHGSRVRDLYYSLEKDGYAGNDLSVLAPHLFDGHT 419 Query: 484 ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 I Q +Q+ P S++W+V + LL + E + + W + + ++A S Sbjct: 420 IRQWAFQQTPGSVLWLVRD-----DGVLLALTYMKE-QDIWGWSRQITDGRVRSVAALSG 473 Query: 544 PNDNRGGTSLWMLVALS-AGEERSFTVRL 571 N L ++V S G + + RL Sbjct: 474 ENA----DELLLVVERSVDGARKYYLERL 498 >gi|85059168|ref|YP_454870.1| hypothetical protein SG1190 [Sodalis glossinidius str. 'morsitans'] gi|84779688|dbj|BAE74465.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 662 Score = 326 bits (835), Expect = 7e-87, Method: Composition-based stats. Identities = 93/576 (16%), Positives = 182/576 (31%), Gaps = 90/576 (15%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D+ +A G +N + + G ++ P + + Sbjct: 2 RANLIKTNFTAGEVSPRLM-GRVDIMRYANGAKAIQNGVVVVQGGVMRRPGTRFAAAAKY 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 R R+ + +L FGD L++ + + +PY+ S+ Sbjct: 61 SDRPARLIPYVFNRSQAYVLEFGDGYLRVYQ-KGKPVVNANNTPYEIASPYSADRLPSVN 119 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y T VH P+ L + P P++ + + Sbjct: 120 YVQGADTMFLVHPAVKPYRLQRRGQT--------DWVLEPAPFIVEPFDEIRE------- 164 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 T K F G + L + Sbjct: 165 ----TPKKWCRPSAKEF----VGSEVTLTLSDAD-------------------------- 190 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 G+ ++ +N + S VA + D+ Sbjct: 191 -PGENRNPPFTGAGWVAQDVGAYVRINGGLVLIQRIDSAQVAVGTLRSDLNAKQAAS--- 246 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 + S W + GYP VT + RL+ +GS +++ S GA Sbjct: 247 -------------PGSWTREESVWTDNLGYPGAVTLYQQRLVLAGSPKYPQTIWWSETGA 293 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS--- 419 + F L + A++ ++ + I + ++ + ++ Sbjct: 294 YLSFELGTK-----DDAAISFTLSSDQLNPIVHLAQMNT-LIALTYGGEFTITSGNDAAI 347 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLAD 477 +I + S G P+ VG ++F+ GR++ ++ + + N++T LA+ Sbjct: 348 TPTNISVKNPSPYGCNRIRPLRVGTEILFIQRAGRKLYAVAYDPDSFVSYAANDLTVLAE 407 Query: 478 HLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537 H+ + + YQ++P ++W+V E + + AW M Sbjct: 408 HITAGGVRDMAYQQQPDGLIWLVRE-----DGVAVTVTMDR-AQDVVAWSRQMTEGAF-- 459 Query: 538 LSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 S S P++ L+ LV G + + Sbjct: 460 ESVTSIPSER--DDVLYALVRRHINGHTVRYVEVFD 493 >gi|220903983|ref|YP_002479295.1| hypothetical protein Ddes_0709 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868282|gb|ACL48617.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 689 Score = 324 bits (830), Expect = 2e-86, Method: Composition-based stats. Identities = 103/586 (17%), Positives = 191/586 (32%), Gaps = 92/586 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M ++F+ GE++P L +R DL+ + ++ N++P +G P + + Sbjct: 1 MP-IRIACNNFTGGEIAPTL-SARYDLARYRNCLSCMENMLPGLHGDTARRPGTRFVANL 58 Query: 61 RLDPRSNRVFSFSIP--DGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN 118 + + FS +LVFG L I + +TPY + Sbjct: 59 ---DGHSVLIPFSFNALTSQNFVLVFGSHCLHIAGEQGLE------NIPVIETPYAPGEL 109 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDK--------ISFTFDEIKFLPPPWLGDGM 170 + + YA G T H +HP H ++ + +++ +++ + Sbjct: 110 QDISYAQVGDTVYLAHSNHPLHKVVRRDAPENRTQFEEAAYAWSLEKVALNASLAAPELP 169 Query: 171 ISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 +A +Y++ + Sbjct: 170 SVTFSGSA------------------------------------------GSYTLRYKVA 187 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 A D + R A + V S+ S + GAV Sbjct: 188 AVD----------AAGRESLPSPAGQCANGRHPSDWVQGNSAAISWAAVEGAVEYNIYRE 237 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350 + G S + Q + + YP V FH R++ + + Sbjct: 238 EAGYFGFIGVSGGLNFNDQNYQADTADTPKEDWDPFADGN-YPGIVAFHQQRMVLAATPK 296 Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS 410 + + Y+S G F +F + + S + W FG+ + +G S Sbjct: 297 NPQAFYMSRVGDFENFRKSRPLQ---DDDPVEYLIASGSIDAVTWAASFGDLL-IGTSGS 352 Query: 411 LWLLSISL---SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QG 466 + S +I S G P+ +G+ ++ V G R++ + S E G Sbjct: 353 EYKASGGDGASITAGNISITAQSYWGSAGLAPIIIGNSILHVQRHGSRVRDLFYSLEKDG 412 Query: 467 FRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525 + N+++ +A HLF ILQ YQ+ P S +W V + LL + E + Sbjct: 413 YAGNDLSIMAPHLFEGHTILQWAYQQTPGSTIWCVRD-----DGLLLAFTYMKEH-DIWG 466 Query: 526 WHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 W + + +A S +G T + + G+ R F RL Sbjct: 467 WSRQITQGRVLSAAAISG---EKGDTLMLVTERRIDGQPRIFLERL 509 >gi|303257570|ref|ZP_07343582.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] gi|302859540|gb|EFL82619.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] Length = 687 Score = 321 bits (823), Expect = 2e-85, Method: Composition-based stats. Identities = 90/585 (15%), Positives = 170/585 (29%), Gaps = 79/585 (13%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 T + SF+ GE+SP + R D + + G+ N + GP+ + P + R+ + Sbjct: 5 TKVLQRSFAGGEISPE-MFGRTDDTKYQTGLETCLNFLCRPQGPIENRPGFEFVREVKDS 63 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 + R+ F ++ G K + ++ TP+ D LEY Sbjct: 64 SKKVRLIPFIFNAQQTFVIELGHKYARFHSFGATL--MNGNQPYEITTPWDEDDLFELEY 121 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 H+D+ P + + D + I F S + + ++ Sbjct: 122 VQSNDIITVTHEDYAPTEIRRYSNTD---WRLATISF----------SSTLATPTNVTAV 168 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGR 243 + T+ + K E + + + Y Sbjct: 169 RETTTGNEDKNADKYTFQYKVSCLNADKTIESEPSAAVSCTANLYATGTTIKISCSAVSG 228 Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 + Y R Sbjct: 229 ASYYRFYKNQG------------------GIYGYLGDSETTSIIDDNIAPKTDITPRRYD 270 Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 S YPS V + R F+G K D V + G Sbjct: 271 SVVSSGN---------------------YPSAVGYFEQRRWFAGFKTDPQRVVATRSGTE 309 Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS--ISLSKG 421 D + + + + I + P +L+ + + + Sbjct: 310 SDMTYSLP---SKDDDRINFRIAATEFNKILHISPLSHLILLTTGSEIRISPQNSDAITP 366 Query: 422 LSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLF 480 SI R S +G P+ + L+F ++ ++ + GF ++ + HLF Sbjct: 367 SSISARPQSYNGATTVRPLVYNNNLIFASARDGHVRELAYQYQAGGFVSGDLCLRSQHLF 426 Query: 481 N-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLS 539 + + I Q+ P+ I+W V +S LLG + E + +WH H Sbjct: 427 DFKTIKDATAQKAPYPIMWFV-----SSDGNLLGLTYIPEQQ-VGSWHRHNTDGVFESCC 480 Query: 540 AASFPNDNRGGTSLWMLVALS-AGEERSFTVRL------NLLDDF 577 A S +L+ ++ + G ++ + R+ NL D F Sbjct: 481 AVS----EGVEDALYCVIRRTINGSQKRYVERMRTRNFKNLADAF 521 >gi|317152064|ref|YP_004120112.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2] gi|316942315|gb|ADU61366.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2] Length = 698 Score = 316 bits (810), Expect = 5e-84, Method: Composition-based stats. Identities = 91/585 (15%), Positives = 168/585 (28%), Gaps = 80/585 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M TT + +F+AGE+SPRL R DLS + G N +G + Sbjct: 1 MSITTPSLTNFTAGEISPRL-AGRIDLSRYFNGCRTLENFHVHPHGGATRRCGFRFVTQA 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGD-----KKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 R+ + F +L FG+ ++++ PY Sbjct: 60 LNPDRAGLLVPFESNADTAYVLEFGEDAAGQGRMRVF--SGHGVVMAGDAPYALDVPYRA 117 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL-GDGMISGV 174 +L YA G + H HP L + + ++++F+ P +G V Sbjct: 118 DQLDTLRYAQSGDELILAHPAHPVRRLTRLAHDQ---WQLEDMEFIGCPETWTEGNHPSV 174 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 + + + A T T L + + + Sbjct: 175 VAFFEQRLVLAATPDKPGT----------------LWFSRTGGIGDFRLRTREVPLDGWR 218 Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD 294 + G R G + + D + + Y G Sbjct: 219 DREITDSNSDGLRDGKAGDTFLLLDGD-----GFEKLDGLKGQHPDRTTRYYRYKGAANL 273 Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354 + A + + Sbjct: 274 TASGADKTVTFR--HEPEGAQIEPIRDAEGELN-------------------------NG 306 Query: 355 VYLS-SFGAFYDFSLDGEYGCY-DPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 + G + G A+ ++ A+ I ++ + VG W Sbjct: 307 FWECFEPGD----RTEAPAGEAPLDDDAIEVTLSGRQANAIEFLVA-RGKLWVGTAGGEW 361 Query: 413 LLS---ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFR 468 L SI + G A P +VG +++ GR+I+ ++ E + Sbjct: 362 TLGGSLGDPVTPESIKASQEGSCGASATRPEAVGFATLYIQRAGRKIREMAYRYESDAYV 421 Query: 469 FNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 ++T L++H+ + Q+ Y +EP SI++ V L+ + + E AW Sbjct: 422 SRDLTILSEHITKPGLTQMAYVQEPDSILYCVR-----GDGALIALTYEPDQE-VAAWSR 475 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + V A+ N LW ++ + G ER + L Sbjct: 476 MLTDGA--VECVAAVYNQAGKRDVLWAVIRRTVNGLERRYVEFLE 518 >gi|220918520|ref|YP_002493824.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans 2CP-1] gi|219956374|gb|ACL66758.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans 2CP-1] Length = 825 Score = 314 bits (804), Expect = 2e-83, Method: Composition-based stats. Identities = 103/626 (16%), Positives = 183/626 (29%), Gaps = 83/626 (13%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS- 66 + SF+AGEL PRL R DL+ + G+ ++RN G ++ P R+ + Sbjct: 8 QGSFAAGELGPRL-HGRHDLAKYQVGLRRARNFFLSPEGAALNRPGTPFVREAKDSAAGV 66 Query: 67 ---NRVFSFSIPDG--GYALLVFGDKKLQIVVVRSSTKW-SPALFGKTYKTPYTFKDNKS 120 R+ F + L FG ++ V ++ + TPY D Sbjct: 67 DRGARLIPFIFSEDLGQAYELEFGQGYVRFHVGGATIADPLNSAQPYELATPYLAADLPR 126 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L+YA G K + P L + P + G+ + Sbjct: 127 LKYAQQGDVVTLTCKGYDPRELRRLAHDSWELVPLSFDVPAPNGVVYLGVEALENVADAT 186 Query: 181 SIS-------------------------QADTSTARITSDMKIFKPLDKGRSIRLGCHPP 215 + + + F G Sbjct: 187 HPARQWAWQVTEIWEDESGLQWETSPLRVRKIAVGAGATWHTGFTYPLGACVSYAGQFWQ 246 Query: 216 EWAKNTNYSIGAYIVADD----KVYRSLTTGRSGDRFGYSKGATYVK------------D 259 + + ++ D G D F + Sbjct: 247 SVIADNRGHVPEAVMVGDPPAATYPYWTPVGAVPDPFAVYESNAPTDVVLFPDRTIKLWA 306 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV---APQSQTLFQAGV 316 + + G V Y ++ + G + + PQ + F Sbjct: 307 SGAWTGVDGSRLVGRRVYRGRGTVFGYVGEFEVAEFRDTGDTPDLSYSPPQGRNPFTVFG 366 Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376 PS VTFH R G+ +LS G +Y+F Sbjct: 367 PAGEVVRLEQ------PSVVTFHAERRSLLGTAQRPAHAFLSRTGDYYNFDRHTPALV-- 418 Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL---SISLSKGLSIDFRRVSGSG 433 A + + W +L+G + +W + S + S +G Sbjct: 419 -DDAFELELAGRLREEVRWAV-GAAALLIGTQSGVWAIRPPSGEVLGPGKATAVPQSSAG 476 Query: 434 VYACPPVSV----GDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLFNQ-RILQL 487 P+ V GD +++V G ++ + QGF ++++ LA HLF I Sbjct: 477 SSYLDPLVVPSAVGDAVLYVRTKGSGVRDLVYDDGRQGFVGSDLSLLAKHLFTGYSIKAW 536 Query: 488 VYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDN 547 +QE+P S+ W+V S +LL + + E +AW H + A Sbjct: 537 TFQEDPWSVAWLVR-----SDGKLLSLTYVRDQE-VWAWAWHDTQGIVEDVCAI----PE 586 Query: 548 RGGTSLWMLVAL--SAGEERSFTVRL 571 +++++V G + R+ Sbjct: 587 GTEDAVYLIVKRQIGDGTWHRYVERM 612 >gi|146276492|ref|YP_001166651.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC 17025] gi|145554733|gb|ABP69346.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC 17025] Length = 754 Score = 299 bits (766), Expect = 7e-79, Method: Composition-based stats. Identities = 101/597 (16%), Positives = 177/597 (29%), Gaps = 64/597 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T+ + +FS+GEL P LL R D G+AK + +PL G + P Sbjct: 1 MTRTSPPQVAFSSGELDP-LLHRRFDYQRFQTGLAKCQGFLPLAQGGVTRAPGTIYRGRT 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 R D + FS +L F ++++ TP+ S Sbjct: 60 RGD-ARCVLVPFSFAANDSCILEFTPGRMRVWRY--GALVMSGGAPYELVTPFDETSLSS 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V P L + ++T P+ + Sbjct: 117 LSWVQSADVVYMVDGRQPMQRLARLALD---NWTIGAQALRKGPFRVQNTDEAI-----T 168 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAY-------- 228 + A T +T+ F G ++L W + Y + Sbjct: 169 LTASAAKGTITLTASAAFFTADHVGSLMQLRPKDNTSVPAWTADEEYGSETWGGPLVGFE 228 Query: 229 --------IVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 Y + ++G Y+ D++ T ++ R Sbjct: 229 TEPPADVLRRYGANTYLLVQGTKAGSTPPIHTEGDYMVDSDPTVWRFISDDVGIVR---- 284 Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN 340 + + P GV W AW ++ GYPS V + Sbjct: 285 -------ITQILSPTQARAAVTRTIPTGCI----GVPTYRWSEGAWSKRYGYPSTVEIYE 333 Query: 341 NRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400 RL + + + +V+ S+ G F DF G D T S + I + Sbjct: 334 QRLAAAATPSEPRTVWFSAVGDFQDF----LDGTEDDQSFAYTVAGSTSVNRIINLQRGA 389 Query: 401 EGVLVGCDTSLWLLSISL----SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G+ + + + F SG G P++ +F+ +R+ Sbjct: 390 AGLHIFALGEEYSTRSETRSSVIGPKNAVFGLDSGVGSSTAKPITPSGNPIFISRDRKRV 449 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCR 515 + S + +++ A H+ Q+V+Q P W+ L L+ Sbjct: 450 LEMVYSLDQDRPVSRVLSRTAQHVGGAGFEQIVWQAAPEPTAWLRL-----GTGELVAMV 504 Query: 516 FSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRL 571 + + E W ++ V + A +P G L M V G+ L Sbjct: 505 YDPDEE-VLGWAPVPVAGGF-VDALAVYPAAGGGSDILTMAVLREIDGQTVRMIEEL 559 >gi|323699364|ref|ZP_08111276.1| hypothetical protein DND132_1955 [Desulfovibrio sp. ND132] gi|323459296|gb|EGB15161.1| hypothetical protein DND132_1955 [Desulfovibrio desulfuricans ND132] Length = 698 Score = 297 bits (759), Expect = 4e-78, Method: Composition-based stats. Identities = 99/583 (16%), Positives = 174/583 (29%), Gaps = 76/583 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T +F+AGE+SPRL + R DLS + G N +G + + Sbjct: 1 MSIATPAITNFTAGEISPRL-EGRTDLSKYFNGCRTLLNFHVHPHGGTSRRAGFRFVAES 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVF-----GDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 + + F G +L F G ++++ PYT Sbjct: 60 LGQAKPVLLIPFEYSAGQTYVLEFAEDAAGQGRMRVF--SGHGLVLSDGAPYVRDIPYTA 117 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL-GDGMISGV 174 + L+YA + + VH DHP ++ + D +T +E+ FL P G+ Sbjct: 118 DEFDELDYAQSAGSLILVHPDHPVREMVRVDHDD---WTLEEMTFLGQPEAWGENDYPSA 174 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 + + A T + T L + + + Sbjct: 175 VCFYEQRLVLAATRSRPAT----------------LWLSRTGEFSDFRLRTREVPLDGWR 218 Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD 294 G R G + + N + G+ Y G Sbjct: 219 DLEIADANGDGLRDGKAGDNVLLLAGN-----GFEARDALKGQHPDGSTRYYRYKGTGNY 273 Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354 + + Q W + + + Sbjct: 274 ATVNSNVTLTFAAEPGANQLEA---IWDEDGVLDDAAW--------DCFGVGDRTDGP-- 320 Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414 A+ ++ A+ I ++ P + +G W L Sbjct: 321 ----------------AGAEPLEDDAIEVTLSGRQANAIEFIVP-RRALWIGTAGGEWTL 363 Query: 415 ---SISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFN 470 S ++ + G P +VG ++V GR+I+ +S E + Sbjct: 364 SASSSDPLTPSNVKAAQEGTGGASGVRPEAVGFAALYVQRAGRKIREMSYRYESDAYVSK 423 Query: 471 EITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 ++T L++H+ + QL Y +EP SI++ V L+ + + E AW + Sbjct: 424 DLTLLSEHITEGGLTQLAYVQEPDSILYGVR-----GDGILVALTYVPDQE-VAAWSRIV 477 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 V AAS ND LW+ V + GE R + L Sbjct: 478 TDG--VVERAASVYNDAEKRDELWITVLRTVNGETRRYVEYLE 518 >gi|315121933|ref|YP_004062422.1| hypothetical protein CKC_00915 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495335|gb|ADR51934.1| hypothetical protein CKC_00915 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 588 Score = 295 bits (754), Expect = 1e-77, Method: Composition-based stats. Identities = 220/583 (37%), Positives = 333/583 (57%), Gaps = 25/583 (4%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +TK SF+ GE+SP+++QSR DL LH+QG+++ N+IPL G LV P + Y Sbjct: 1 MPKGAYTKRSFAGGEVSPQIIQSRSDLELHSQGLSQCFNMIPLSDGSLVRRPPLHRYEHI 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P+++R+ SF++ L +FG+KK+ + V + K + Y TPY+F++ + Sbjct: 61 DLPPKASRILSFALGGDEAVLFIFGEKKM-VYVEVTGIKPPQF--IRFYGTPYSFREAEQ 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK- 179 L+ A G+ V VH H P+ + + + G F+++ F PPPWLG + G K +AK Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAGVI----FEKMVFAPPPWLGRREVGGKKHDAKL 173 Query: 180 -LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 +++S +TS + IFKP D GR + LG P +W NT Y A++ KVYR Sbjct: 174 RVTLSATRKGKITVTSTLPIFKPKDVGRMLCLGWLPKDWTANTLYPENAFMQMYGKVYRC 233 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWIT-------VLNLSSKTSRESASGAVAPYYVWGD 291 +T G SG F ++ TY++D +TW ++ K++ + PYYVWG+ Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 I + + + + + S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D Sbjct: 294 IVNCTGAKTVEVMLHEGFCV-TDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +VY S + F DFS D G D K+L+ A+TD + S I W P +G+++G DTSL Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412 Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471 W++ + +G ++ RR++G GVY PP+S+GD L+FV G GRRI+ I G++EQGF+F E Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 +TQ DHL + RI QL YQE+P+S++WV+ N+ LL C A + +WHTH Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLSCSLHANSKEKGSWHTHKS 527 Query: 532 SDKH-YVLSAASFPNDNRGGTSLWMLVALS--AGEERSFTVRL 571 ++S +S ++G T++W LV+ + G RL Sbjct: 528 GGGWVKIMSLSSCLCLDQGETTIWFLVSRTNEDGVSSIGLERL 570 >gi|119386474|ref|YP_917529.1| hypothetical protein Pden_3767 [Paracoccus denitrificans PD1222] gi|119377069|gb|ABL71833.1| conserved hypothetical protein [Paracoccus denitrificans PD1222] Length = 679 Score = 294 bits (752), Expect = 3e-77, Method: Composition-based stats. Identities = 90/579 (15%), Positives = 175/579 (30%), Gaps = 80/579 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + +F++G L P L R DL+ + + K RN+ +G + + P ++ + Sbjct: 1 MPAAR-IQPTFASGVLGPALW-GRIDLARYDSALRKGRNVFVHAHGGVSNRPGLRFVCEV 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 +R+ F ++L+ G ++ V + + T TP+T ++ Sbjct: 59 MDSAHRHRLLPFVREADDASILIMGQNEMGFVKNGARLQ--SGGVDYTIATPWTATQAQA 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L+ H+ P ++ + D T Sbjct: 117 LDAVQSVDVIFAAHRQVAPRRIMRNGETDWSIATV---------------------PINP 155 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 +++ S+ + Y V + Sbjct: 156 TVAAPTISSVTPRNSGDE-----------------------TYRYRVTAVVGGVESFASA 192 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + S + R G + D + Sbjct: 193 PLATTAAELLSIEGAWNDIAFSAVTGATEYRVYRMRNGVPGYIGFTTGTSFRDD-NISPD 251 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 S P +LF A YPS V+ + RL F S +V+LS Sbjct: 252 STVTPPVQASLFDAAGK--------------YPSVVSIYQQRLAFGASDAQPETVWLSRV 297 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS- 419 G + +F+ D + + + I M E ++ + Sbjct: 298 GDYLNFTRSQNMTSSDRAE---FDMAGEQLNRIRAMLQLRELLVFTSAGEFSVSGPDGGF 354 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478 L+ + G P+ D ++FV GR ++ + + E G+ N++ A H Sbjct: 355 DALNPIVTQHGYIGSATVKPLVADDTVLFVDRSGRGVRDLRYAYESDGYSGNDLAIFASH 414 Query: 479 LFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537 R I+ + P SI+WVVL+ +LL + E + +AW I Sbjct: 415 FLQGRRIVGWAMAKNPWSIIWVVLD-----NGKLLALTYKREHQ-VWAWTEMDIDGAVES 468 Query: 538 LSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLNLLD 575 ++ + +++V G++R + R + D Sbjct: 469 VACI----PEGASDATYLIVRRLIDGQQRRYVERFDDRD 503 >gi|315122895|ref|YP_004063384.1| hypothetical protein CKC_05755 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496297|gb|ADR52896.1| hypothetical protein CKC_05755 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 588 Score = 293 bits (749), Expect = 6e-77, Method: Composition-based stats. Identities = 218/583 (37%), Positives = 333/583 (57%), Gaps = 25/583 (4%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +TK SF+ GE+SP+++QSR DL LH+QG+++ N+IPL+ G LV P + Y Sbjct: 1 MPKGAYTKRSFAGGEVSPQIMQSRSDLELHSQGLSQCFNMIPLQDGSLVRRPPLYRYEHI 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P+++R+ SF++ L +FG+KK+ + V + K + TPY+F++ + Sbjct: 61 DLPPKASRILSFALGGDDAVLFIFGEKKM-VYVEVTGIKPPQFIRFYD--TPYSFREAEQ 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK- 179 L+ A G+ V VH H P+ + + + G F+++ F PPPWLG + G K +AK Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAGVI----FEKMVFAPPPWLGLREVGGKKHDAKL 173 Query: 180 -LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 +++S +TS + IFK D GR +RLG P +W NT Y A++ KVYR Sbjct: 174 RVTLSATRKGKITVTSTLPIFKTKDVGRMLRLGWLPKDWTANTLYPENAFMQMYGKVYRC 233 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWIT-------VLNLSSKTSRESASGAVAPYYVWGD 291 +T G SG F ++ TY++D +TW ++ K++ + PYYVWG+ Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 I + + + + + S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D Sbjct: 294 IVNCTGAKTVEVMLHEGFCV-TDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +VY S + F DFS D G D K+L+ A+TD + S I W P +G+++G DTSL Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412 Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471 W++ + +G ++ RR++G GVY PP+S+GD L+FV G GRRI+ I G++EQGF+F E Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 +TQ DHL + RI QL YQE+P+S++WV+ N+ LLGC A + +WH H + Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLGCSLHANSKEKGSWHVHKL 527 Query: 532 SD-KHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571 ++S +S ++G T++W+L+ G RL Sbjct: 528 GGRGVKIMSLSSCLCLDQGETTVWLLLRRMNEDGVSSIGLERL 570 >gi|169795391|ref|YP_001713184.1| phage-like protein [Acinetobacter baumannii AYE] gi|169148318|emb|CAM86183.1| hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 697 Score = 292 bits (746), Expect = 1e-76, Method: Composition-based stats. Identities = 109/578 (18%), Positives = 186/578 (32%), Gaps = 68/578 (11%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K++ S+GELSP L R D+ +A G K N +PL G P + Sbjct: 7 RQWILKNNLSSGELSPLLWT-RTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRSIFA- 64 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD-NKSL 121 + R+ F LL+ G L++ R+ TPY + + Sbjct: 65 --GALRLIPFIANSENTYLLILGVSFLKVYNPRTYAVV------YETVTPYNTAQKVREV 116 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 +YA FV D P L + D ++ F F P G + Sbjct: 117 QYAHTKYRMYFVQGDTPVQRL--LCSADFTNWQFAAFTFGVNPNDELG-----STPNVAL 169 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTT 241 I+ F P W+ Y G ++ + K +R+ Sbjct: 170 SPSGTEVGKVISLTASSF---------------PNWSNTETYLTGDRVIHNSKTWRATAD 214 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 + + W V N ++ ++ G++ D Sbjct: 215 NKGVEP----------SATTPEWEEVTNEAANVFTPASVGSIVEINGGQVKITEYVDPSR 264 Query: 302 ISVAPQSQTLFQAGVSVVSW--FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 ++ + SW A+ + GYP V F RL+F+ +K ++ S Sbjct: 265 VNGEVLVKLTSDVQAIAKSWVLKSIAFSAEAGYPKAVCFFKQRLVFANTKTSPNQMWFSR 324 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL- 418 G +F A + A + + I + GV+ + +L++ Sbjct: 325 IGDDGNF-----LETTQDADAFSIASSSAQSDNILHL-SQRGGVVALTGGAEFLINSQGP 378 Query: 419 -SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 + + S P VG+ L+FV G R++ +S E G E++Q+A Sbjct: 379 LTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPELSQIA 438 Query: 477 DHLFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 H+ I +L +Q+ P+SIVW+V+ S L + AW H + Sbjct: 439 PHIPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ 492 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 + A G +ML G + Sbjct: 493 VLSICA---LPTGLGEDQCFMLTNR-NGSTV--LEEFS 524 >gi|332875218|ref|ZP_08443051.1| carbohydrate binding domain protein [Acinetobacter baumannii 6014059] gi|332736662|gb|EGJ67656.1| carbohydrate binding domain protein [Acinetobacter baumannii 6014059] Length = 692 Score = 290 bits (741), Expect = 5e-76, Method: Composition-based stats. Identities = 110/578 (19%), Positives = 184/578 (31%), Gaps = 68/578 (11%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K++ S+GELSP L R D+ +A G K N +PL G P + Sbjct: 2 RQWILKNNLSSGELSPLLWT-RTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRSIFA- 59 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD-NKSL 121 + R+ F LL+ G L++ R+ TPY + + Sbjct: 60 --GALRLIPFIANSENTYLLILGVSFLKVYNPRTYAVV------YEAVTPYNTAQKVREV 111 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 +YA FV D P L + D ++ F F P G + Sbjct: 112 QYAHTKYRMYFVQGDTPVQRL--LCSADFTNWQFAAFTFGVNPNDELG-----STPNVAL 164 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTT 241 I+ F P W+ Y G ++ K +R+ Sbjct: 165 SPSGTEVGKVISLTASSF---------------PNWSNTETYLTGDRVIHTSKTWRATID 209 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 + + W V N ++ S+ G++ D Sbjct: 210 NKGVEP----------SATTSEWEEVTNEAANVFTPSSVGSIVEINGGQVKITQYVDPSR 259 Query: 302 ISVAPQSQTLFQAGVSVVSW--FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 ++ + SW A+ GYP V F RL+F+ +K ++ S Sbjct: 260 VNGEVLVKLTSTVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSR 319 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL- 418 G +F A + A + + I + GV+ + +L++ Sbjct: 320 IGDDGNF-----LETTQDADAFSIASSSAQSDNILHL-SQRGGVVALTGGAEFLINSQGP 373 Query: 419 -SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 + + S P VG+ L+FV G R++ +S E G E++Q+A Sbjct: 374 LTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPELSQIA 433 Query: 477 DHLFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 H+ I +L +Q+ P+SIVW+V+ S L + AW H + Sbjct: 434 PHIPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ 487 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 + A G +ML G + Sbjct: 488 VLSICA---LPTGLGEDQCFMLTNR-NGSTV--LEEFS 519 >gi|293609614|ref|ZP_06691916.1| predicted protein [Acinetobacter sp. SH024] gi|292828066|gb|EFF86429.1| predicted protein [Acinetobacter sp. SH024] Length = 692 Score = 288 bits (736), Expect = 2e-75, Method: Composition-based stats. Identities = 110/578 (19%), Positives = 183/578 (31%), Gaps = 68/578 (11%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K++ S+GELSP L R D+ +A G K N +PL G P + Sbjct: 2 RQWILKNNLSSGELSPLLWT-RTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRSIFA- 59 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD-NKSL 121 + R+ F LL+ G L++ R+ TPY + + Sbjct: 60 --GALRLIPFIANSENTYLLILGVSFLKVYNPRTYAVV------YETVTPYNTAQKVREV 111 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 +YA FV D P L + D ++ F F P G + Sbjct: 112 QYAHTKYRMYFVQGDTPVQRL--LCSADFTNWQFAAFTFGVNPNDELG-----STPNVAL 164 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTT 241 I+ F P W+ Y G ++ K +R+ Sbjct: 165 SPSGTEVGKVISLTASSF---------------PNWSNTETYLTGDRVIHSGKTWRATID 209 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 + + W V N ++ S G++ D Sbjct: 210 NKGVEP----------TATTSEWEEVTNEAANVFTPSNVGSIIEINGGQVKITQYVDPSR 259 Query: 302 ISVAPQSQTLFQAGVSVVSW--FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 ++ + SW A+ GYP V F RL+F+ +K ++ S Sbjct: 260 VNGEVLVKLTSAVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSR 319 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL- 418 G +F A + A + + I + GV+ + +L++ Sbjct: 320 IGDDGNF-----LETTQDADAFSIASSSAQSDNILHL-SQRGGVVALTGGAEFLINSQGP 373 Query: 419 -SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 + + S P VG+ L+FV G R++ +S E G E++Q+A Sbjct: 374 LTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLISPELSQIA 433 Query: 477 DHLFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 H+ I +L +Q+ P+SIVW+V+ S L + AW H + Sbjct: 434 PHIPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ 487 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 + A G +ML G + Sbjct: 488 VLSICA---LPTGLGEDQCFMLTIR-NGSTV--LEEFS 519 >gi|118590938|ref|ZP_01548338.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614] gi|118436460|gb|EAV43101.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614] Length = 810 Score = 283 bits (724), Expect = 4e-74, Method: Composition-based stats. Identities = 93/648 (14%), Positives = 197/648 (30%), Gaps = 107/648 (16%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 + +FS GEL P L+ R DL L +A+ RN + L+ G L + + + R Sbjct: 5 LQATFSRGELDPELIY-RSDLELFRSSLAECRNFLTLKRGGLRRRGGTKFIAELKDSSRQ 63 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 + F +G Y +L FG ++ TPY+ L++ Sbjct: 64 GWLIPFEFGNGQYYMLEFGHHIFRVFTSEGRVGTV------EVATPYSSGVLPRLKFVQS 117 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL------ 180 T P L + + S+ + + F P+L + A Sbjct: 118 TDTLFIAGGGVAPQALKRLSEL---SWAIEPMSFRDGPYLDVNISPTNLKPAATGNAVPK 174 Query: 181 ----SISQADTSTARITSDMKIFKPLDKGRSIRLGCH----------------------- 213 + S + ++ +G+++ Sbjct: 175 MTSNTAPSGTVSASNGSASAWQLFNRSEGKTVLSSGATGWVQYQFPGSVVIDAYMLQAPN 234 Query: 214 --------PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 P +W + + + + D + + + + + + Sbjct: 235 DNSQNDDMPWQWNIEASNNGSDWTILDTQDGQDTWSSNEWREYDFHNETAFTHYRLSFTQ 294 Query: 266 TVLNLSSKT------------------------------------SRESASGAVAPYYVW 289 + S + W Sbjct: 295 GGGSASDNSAIGQLVFHRAGNDQSPFTLTASGTGGINGGAGFQPSDVGRHIRFRGSDGFW 354 Query: 290 GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSK 349 + S+ + Q + W + AW G+P + +H NRL F+G+ Sbjct: 355 RWFRIHSRQSATSVKVQLFGQALQDTKAQSIWRLGAWSGTTGWPETIGWHKNRLAFAGTS 414 Query: 350 GDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDT 409 + ++ S F +FS+ + A+T + + I W+ + ++VG Sbjct: 415 EEPQKIWESQTEDFTNFSVSHVLK---ASDAVTAGILSGQVNRIQWLVDDND-LIVGTTR 470 Query: 410 SLWLLS----ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-E 464 ++ + ++D + + G P+ VG L++ G ++ ++ Sbjct: 471 AVRAVGKATDQDPYGPENVDQKPETNFGANDVSPIKVGSVLIYYGPYGTDMREMAYDFGS 530 Query: 465 QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 G ++++ HLF I YQ+ P S++W + + +G + + + + Sbjct: 531 DGRVSQAVSEVQSHLFQSGIAGACYQQYPDSVIW-----QWDQKGSGIGFTYERQQQ-VY 584 Query: 525 AWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 H V A G ++WM+V + G+ R + + Sbjct: 585 GMQRHDFGG--VVECMADLSGA--GADTVWMIVKRTIDGQTRRYIEIM 628 >gi|195541813|gb|ACF98016.1| hypothetical protein [uncultured bacterium 878] Length = 926 Score = 283 bits (723), Expect = 6e-74, Method: Composition-based stats. Identities = 99/655 (15%), Positives = 183/655 (27%), Gaps = 92/655 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 MV + ++F AGE +P + + R DLS + N +P GP P Sbjct: 1 MVRASPNFNAFDAGEFAP-ITEGRTDLSRYGFACRILENFMPRVVGPAARRPGTSFIAST 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 R + + F ++ FG+ ++ + + Sbjct: 60 RYPEKDALLVRFEYSTEQAYVMEFGNLYVRFYRNDGPLLEVTRPITGATQANPVVLTVAN 119 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK- 179 + V + + ++ + TF+ P G+G + Sbjct: 120 HGWLNGDDIEVSGVTGMTQLNGRRFRVANRTASTFELNDQHGAPINGNGYSAFAAGGTAA 179 Query: 180 ------LSISQADTSTARITSDMKIFKPLD------------------------------ 203 + AD + + I Sbjct: 180 RVYTLPTTYQDADLAQMKFAQSADILYIAHTEYVPRKLQRYGPTNWVLSQIDFQDGPYLP 239 Query: 204 ------KGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDR---------F 248 + T+ +I R + Sbjct: 240 VNGAQTVLTPSAASGAGITISSATSVAITGAANNGAGAVRITSANHGWKTGDKIDITGIV 299 Query: 249 GYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP-----------------YYVWGD 291 G ++ + T S + ASG A WG Sbjct: 300 GTTEANATWTVTRVNANTYDLNGSTFANAYASGGTAKPHIFESTDLGRLIRIQHASTWGY 359 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 K + A + F + +W + + + GYPS VTF+ RL + G Sbjct: 360 AKITAYTSAVSVTA-DVLSNFGGTAASSAWRLGLYSQGGGYPSCVTFYEGRLFWGGCPLA 418 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 V S + FS A+ + + + WM +G+LVG Sbjct: 419 PTRVDGSMSSNYETFSPSSTASVVADDNAVAYPLDSGDVNNVLWMKDDEKGLLVGTKGGE 478 Query: 412 WL-----LSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-Q 465 W+ L+ +L+ R + PV G ++FV R+++ ++ + E Sbjct: 479 WVVRANTLNGALTPTNVKATRATTYGSYEGSQPVRTGKDIIFVQRKRRKVRNLNYTYEID 538 Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525 GF ++T L+ H+ QL +Q EP VW+ +L + + + Sbjct: 539 GFNAGDLTILSGHIGRLEFGQLAFQSEPEGWVWMTR-----GDGQLPVLTYDRDEQKI-G 592 Query: 526 WHTHMISDKH--------YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRL 571 W ++ V S S P+ N +W++V G+ + Sbjct: 593 WSRQIMGGYQDAARRRPPIVRSVCSIPDPNDARDEVWLIVQRMIDGKTERYVELF 647 >gi|317120716|gb|ADV02538.1| hypothetical protein SC2_gp080 [Liberibacter phage SC2] gi|317120777|gb|ADV02598.1| hypothetical protein SC2_gp080 [Candidatus Liberibacter asiaticus] Length = 590 Score = 281 bits (717), Expect = 3e-73, Method: Composition-based stats. Identities = 154/590 (26%), Positives = 251/590 (42%), Gaps = 41/590 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K+SF++GE+SP + QS +L ++ +A N IPLR G L+ P + Y Sbjct: 1 MTKAIHFKNSFASGEVSPFVHQSGSNLKIYQSCLAHCHNYIPLRTGALMRRPGTRIYHVF 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R+FSF ++V G KL I R T + PY +D Sbjct: 61 DDVDKPQRLFSFVKDAYTAYIIVLGYLKLHIFERRMGG----CSKVTTIEVPYKKEDVDE 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +E A T VH HPP L + F E+ F P L + I K + L Sbjct: 117 IEVAQNIDTLWMVHPKHPPCQLELKGKD----WEFKEVLFKHVPPLKEQFIDDKKVSINL 172 Query: 181 SIS-----QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 T + +D ++FK +D GR + LG P W +T Y +Y+V +D++ Sbjct: 173 KTPFENTETGKTGMVSVEADGEMFKEMDIGRELNLGFRPQRWIPDTWYLDNSYVVHNDRL 232 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD-IKD 294 + + G+S +T ++ ES G +W + Sbjct: 233 LKCINKGKS--------QSTEWTFSDKEHQQKDGSCLWEKVESTKGNARNLLIWVTGVIK 284 Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354 K + + + + Q + W + WG++EGYPS +TF NRL+ SG K + + Sbjct: 285 RFKTAKCVLLELKGAFPLQNDLPTKHWLLGEWGQKEGYPSCITFFGNRLVLSGGKHNPQT 344 Query: 355 VYLSSFGAFYDFSLDGEYGCYDP-TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 V+ S F DF+ E G T + + + I W+ G+LVG +++LWL Sbjct: 345 VHFSKLDDFTDFNQISEQGGNTDLTSSFSVLLGSDVRQGIQWLSHTDSGLLVGTESALWL 404 Query: 414 LSISL----SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI-----SGSTE 464 ++ + ++ R + G A P+ VG VF+ GR + + + +T+ Sbjct: 405 ITQTSQNEVVSKATVAIRSIGNFGSIAVSPILVGSHCVFIKDTGRDLISLVGNRSADNTK 464 Query: 465 QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 +RF ++ A+H+ + + + V Q+ P+SI+WVVL RL+GC F + E Sbjct: 465 TEYRFRDLNLFAEHILTKGVWEAVLQQSPYSIIWVVL-----RDGRLVGCTFDPDNE-VC 518 Query: 525 AWHTHMISDKH-YVLSAASFPNDNRGGTSLWMLVALSA--GEERSFTVRL 571 AWHTH + + + S S + G LW+LV G + +L Sbjct: 519 AWHTHDLGGFYTQIHSLTSCASFLDGQDDLWLLVERLDDTGRKTRSLEKL 568 >gi|260549511|ref|ZP_05823729.1| Bbp13 [Acinetobacter sp. RUH2624] gi|260407304|gb|EEX00779.1| Bbp13 [Acinetobacter sp. RUH2624] Length = 678 Score = 278 bits (711), Expect = 2e-72, Method: Composition-based stats. Identities = 84/571 (14%), Positives = 172/571 (30%), Gaps = 79/571 (13%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 ++SF+ G +SP + R D + + GVAK +N+ +G LV + Sbjct: 2 QYSFNGGVISPD-MFGRIDQAKYQTGVAKCKNMYVELFGGLVYRAGFRYVHHYPKTLGKM 60 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 R+ F + +L F + R + + PY + L YA Sbjct: 61 RLIRFVFSEEQAVVLAFRAGAVNFF-ARGGMLLNNVGEPLEVELPYAEEHLMQLRYAQSA 119 Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187 H D+PP ++ + +S+ T Sbjct: 120 DVVTITHPDYPPRKIIRKGATEWS-------------------------TEVVSVGYGLT 154 Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDR 247 + + I +G H ++ +Y + A ++ + S Sbjct: 155 PPQNVAATAHIEDKYKEG----GNMHDSYIERDYSYQVTAVDEQNE-------SAASTKV 203 Query: 248 FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD--GRSISVA 305 + N ITW V + + SG + + + Sbjct: 204 TVKNDITLAGNYNTITWDVVTGATRYNIFKLRSGLASYIGETTETSFTDDNIETNGSITP 263 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYD 365 P + F+ P+ V +H R ++ G + +S + Sbjct: 264 PLIRNPFEF-----------------NPTAVAYHGQRKVYGGGYQSPQWIRMSRTATDDN 306 Query: 366 FSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSI 424 F D ++ + + + + +++ ++W LS + S+ Sbjct: 307 FGYHIPTQDTD---SIQIRFAARDGNGVKHLITLNDLLVL-TSGAMWKLSSDGAMTAASV 362 Query: 425 DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY---ISGSTEQGFRFNEITQLADHLFN 481 + + +G PV V VF + SG ++ +++ + LF+ Sbjct: 363 NMNKQYSTGANDVTPVEVDGAAVFASDQTGHVHEASLASGYNASYYQTLDLSIMCPQLFD 422 Query: 482 Q-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 +I+ P +I++ V + LL + + + +AW H K LS Sbjct: 423 GHKIIDCAAIRNPLNIIYFVRD-----DGVLLSLTYEPQQQ-VWAWAEHHTDGKF--LSV 474 Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 A P + L+ + + R+ Sbjct: 475 AEIP--EENQSVLYAFIERNG---FYTIERM 500 >gi|260557972|ref|ZP_05830184.1| Bbp13 [Acinetobacter baumannii ATCC 19606] gi|260408482|gb|EEX01788.1| Bbp13 [Acinetobacter baumannii ATCC 19606] Length = 678 Score = 276 bits (704), Expect = 1e-71, Method: Composition-based stats. Identities = 81/571 (14%), Positives = 170/571 (29%), Gaps = 79/571 (13%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 ++SF+ G +SP + R D + + GVAK +NL +G +V + Sbjct: 2 QYSFNGGVISPD-MFGRIDQAKYQTGVAKCKNLYVELFGGVVYRAGFRYVHHYPKTMGKM 60 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 R+ F + +L + + PY L YA Sbjct: 61 RLIRFVFSEEQAVVLAIRAGAINFF-ADGGMLLNENNEPLEVAVPYAEDHLMQLRYAQSA 119 Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187 H ++PP ++ + +++ Sbjct: 120 DVVTITHPNYPPRKIIRKSATEW-------------------------ITELVTVGYGVG 154 Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDR 247 + + + I G H ++ +Y + A ++ + S Sbjct: 155 TPQNVAATAHIEDKYKPG----GSMHDSYIERDYSYQVTAVDEQNE-------SAASLKV 203 Query: 248 FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD--GRSISVA 305 + N ITW V + + SG + + + Sbjct: 204 VVQNDLTLAGNYNTITWDAVTGANRYNIFKLRSGLASFIGETTETSFTDDNIETNGSITP 263 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYD 365 P + F+ YP+ V +H R ++ G + +S + Sbjct: 264 PLIRNPFEF-----------------YPTAVAYHGQRKVYGGGYKSPQWIRMSRTATDDN 306 Query: 366 FSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSI 424 F D ++ + + + + +++ +LW +S + S+ Sbjct: 307 FGYHIPTQDTD---SIQIRFAARDGNGVKHLVTMSDLLIL-TSGALWKMSADGAVTAASV 362 Query: 425 DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI---SGSTEQGFRFNEITQLADHLFN 481 + + +G PV V +F + I SG ++ +++ + LF+ Sbjct: 363 NMNKQYSTGANDVTPVEVDGATIFSSDQTGHVHEISLASGYNASFYQTIDLSIMCPQLFD 422 Query: 482 Q-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 +I+ P +I++ V LL + + + +AW H + K LS Sbjct: 423 GQKIIDCALLRNPLNIIYFVR-----GDGVLLSLTYEPKQQ-VWAWAEHHTNGKF--LSI 474 Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 A P D+ + L+ + R+ Sbjct: 475 AEIPEDD--QSVLYAFIERDG---FYTIERM 500 >gi|158425207|ref|YP_001526499.1| tail tubular protein B [Azorhizobium caulinodans ORS 571] gi|158332096|dbj|BAF89581.1| tail tubular protein B [Azorhizobium caulinodans ORS 571] Length = 785 Score = 274 bits (701), Expect = 2e-71, Method: Composition-based stats. Identities = 71/582 (12%), Positives = 148/582 (25%), Gaps = 46/582 (7%) Query: 18 PRLLQSRKDLS---LHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD-PRSNRVFSFS 73 P L+ A + N L P + P + V + Sbjct: 9 PNLINGVSQQPFALRLASQAEEQINGFSSIVEGLTKRPPTRHVAKLINSLPENAHVHIIN 68 Query: 74 IPDGGYALLVFGDKKLQIVVVRSSTKWSPALFG----KTYKTPYTFKDNKSLEYAVFGST 129 ++V + L++ + G + + + Sbjct: 69 RDAAERYVVVAFNGDLRVYGFDGVERTVNFPHGKGYLANTSASFGAVTVADYTFFLNKDV 128 Query: 130 AVFVHKDH----PPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 V + + PP +++++ G+ + + + I+ + S+ Sbjct: 129 TVAMSPETKAGRPPEGIVFVRQGNYAC----KYRIIVDGQAVAEKITSQTDPNDIQSSKI 184 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTN-YSIGAYIVADDKVYRSLTTGRS 244 A I + G +I + T S+G + Sbjct: 185 AQDLAAIINSWGSMVASVIGSTIHIRRADSLGFSLTTEDSLGDTGLVCMTKQTQTFANLP 244 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 + N + S + +G G + Sbjct: 245 ARAVQGYQVEISGTPGNPYDNFWVEYDQAGSGGN-NGVWREIAAPGRQIAFDPATMPHVL 303 Query: 305 APQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYL 357 ++ F + + E PS V F+ NRL F + V Sbjct: 304 VREANGSFTFKQADWEKCAAGSDETTPRPSFVGQRISDIFFYRNRLGFISDES----VIF 359 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 S F++F + T + + S + PF E +L+ D + ++L Sbjct: 360 SRSAKFFNFWRETA-TDLLDTDPIDITTSHVKVSILRHAIPFNESLLLFSDQTQFMLGAG 418 Query: 418 L--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQG---FRFNEI 472 + + PV G + F G + N++ Sbjct: 419 EVLTPSGVSLDQVTEFETSSRAKPVGAGQFVYFCTSRGEFTGVREYYIDGSTKTNNANDV 478 Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 T ++ +L +V L D + S + + +W + Sbjct: 479 TNHVPRYIRGKVFKLCASTNEDMLV--ALSDTDRDTLYVYKYYNSGQEKVQSSWSRWKLQ 536 Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLL 574 N ++LW++V + G + RLN+ Sbjct: 537 PG------DVILNAEFIESTLWLIVRRADGV---YLDRLNIE 569 >gi|265525004|gb|ACY75867.1| tail tubular protein B [Enterobacteria phage T7] Length = 794 Score = 266 bits (680), Expect = 5e-69, Method: Composition-based stats. Identities = 65/587 (11%), Positives = 144/587 (24%), Gaps = 50/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + + K G Y T Sbjct: 55 GDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TANP 112 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G +I + Sbjct: 113 RNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGK 172 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 D S ++ ++ I + ++ Sbjct: 173 DVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDS 232 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A Sbjct: 233 FTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWT 292 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W V + ++ + F S + +PS V Sbjct: 293 ETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVF 352 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F D + AV+ + + + Sbjct: 353 FFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDD-DPIDVAVSTNRIAILKYAV 407 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 408 PFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 467 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL D S + Sbjct: 468 TSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SVLSHGDPSKIFM 525 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W + VL+ S + +++++ Sbjct: 526 YKFLYLNEELRQQSWSHWDFGENVQVLACQSIS------SDMYVILR 566 >gi|9627472|ref|NP_042000.1| tail tubular protein B [Enterobacteria phage T7] gi|139659|sp|P03747|VTTB_BPT7 RecName: Full=Tail tubular protein B gi|15606|emb|CAA24430.1| unnamed protein product [Enterobacteria phage T7] gi|37956682|gb|AAP33952.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 266 bits (680), Expect = 6e-69, Method: Composition-based stats. Identities = 65/587 (11%), Positives = 144/587 (24%), Gaps = 50/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + + K G Y T Sbjct: 55 GDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TANP 112 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G +I + Sbjct: 113 RNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGK 172 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 D S ++ ++ I + ++ Sbjct: 173 DVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDS 232 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A Sbjct: 233 FTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWT 292 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W V + ++ + F S + +PS V Sbjct: 293 ETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVF 352 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F D + AV+ + + + Sbjct: 353 FFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDD-DPIDVAVSTNRIAILKYAV 407 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 408 PFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 467 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL D S + Sbjct: 468 TSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SVLSHGDPSKIFM 525 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W + VL+ S + +++++ Sbjct: 526 YKFLYLNEELRQQSWSHWDFGENVQVLACQSIS------SDMYVILR 566 >gi|194100399|ref|YP_002003974.1| gp12 [Enterobacteria phage 13a] gi|193201446|gb|ACF15923.1| gp12 [Enterobacteria phage 13a] Length = 794 Score = 266 bits (680), Expect = 7e-69, Method: Composition-based stats. Identities = 65/587 (11%), Positives = 146/587 (24%), Gaps = 50/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + + K G Y T Sbjct: 55 GYNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLAGNEKQVRYPNGSNYIN--TANP 112 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G +I + Sbjct: 113 RNDLRMVTVADYTFIVNRNVVAQKNTNSVNLPNYNPNQDGLINVRGGQYGRELIVHINGK 172 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 D S ++ ++ + I + ++ Sbjct: 173 DVAKYKIPDGSKPEHVNNTDAQWLAEELANQMRTNLSDWTVNVGQGFIHVTAPSGQQIDS 232 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A Sbjct: 233 FTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDDERKVWT 292 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W V + ++ + F S + +PS V Sbjct: 293 ETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVF 352 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F D + AV+ + + + Sbjct: 353 FFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDD-DPIDVAVSTNRIAILKYAV 407 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 408 PFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 467 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL D S + Sbjct: 468 TSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SVLSHGDPSKIFM 525 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W + VL+ S + +++++ Sbjct: 526 YKFLYLNEELRQQSWSHWDFGENVQVLACQSI------NSDMYVILR 566 >gi|37956735|gb|AAP34004.1| gene 12 [Enterobacteria phage T7] gi|37956785|gb|AAP34053.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 266 bits (679), Expect = 9e-69, Method: Composition-based stats. Identities = 64/587 (10%), Positives = 144/587 (24%), Gaps = 50/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + + K G Y T Sbjct: 55 GDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TANP 112 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G +I + Sbjct: 113 RNDLRMVTVADYTFIVNRNIVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGK 172 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 D S ++ ++ I + ++ Sbjct: 173 DVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDS 232 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A Sbjct: 233 FTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWT 292 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W V + ++ + F S + +PS V Sbjct: 293 ETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVF 352 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F D + AV+ + + + Sbjct: 353 FFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDD-DPIDVAVSTNRIAILKYAV 407 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 408 PFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 467 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL + S + Sbjct: 468 TSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SVLSHGNPSKIFM 525 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W + VL+ S + +++++ Sbjct: 526 YKFLYLNEELRQQSWSHWDFGENVQVLACQSIS------SDMYVVLR 566 >gi|37956840|gb|AAP34107.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 266 bits (678), Expect = 1e-68, Method: Composition-based stats. Identities = 66/587 (11%), Positives = 145/587 (24%), Gaps = 50/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + + K G Y T Sbjct: 55 GDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TANP 112 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G +I + Sbjct: 113 RNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNSNQDGLINVRGGQYGRELIVHINGK 172 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 D S ++ ++ I + ++ Sbjct: 173 DVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVIAPSGQQIDS 232 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ S + + K +++ A Sbjct: 233 FTTKDGYADQLINSVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWT 292 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W V + ++ + F S + +PS V Sbjct: 293 ETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVF 352 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F D + AV+ + + + Sbjct: 353 FFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDD-DPIDVAVSTNRIAILKYAV 407 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 408 PFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASSRSSF 467 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL D S + Sbjct: 468 TSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SVLSHGDPSKIFM 525 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W + VL+ S + +++++ Sbjct: 526 YKFLYLNEELRQQSWSHWDFGENVQVLACQSIS------SDMYVILR 566 >gi|37956893|gb|AAP34159.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 266 bits (678), Expect = 1e-68, Method: Composition-based stats. Identities = 66/587 (11%), Positives = 145/587 (24%), Gaps = 50/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + + K G Y T Sbjct: 55 GDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TANP 112 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G +I + Sbjct: 113 RNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNSNQDGLINVRGGQYGRELIVHINGK 172 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 D S ++ ++ I + ++ Sbjct: 173 DVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVIAPSGQQIDS 232 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ S + + K +++ A Sbjct: 233 FTTKDGYADQLINSVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWT 292 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W V + ++ + F S + +PS V Sbjct: 293 ETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVF 352 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F D + AV+ + + + Sbjct: 353 FFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDD-DPIDVAVSTNRIAILKYAV 407 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 408 PFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASSRPSF 467 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL D S + Sbjct: 468 TSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SVLSHGDPSKIFM 525 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W + VL+ S + +++++ Sbjct: 526 YKFLYLNEELRQQSWSHWDFGENVQVLACQSIS------SDMYVILR 566 >gi|242278913|ref|YP_002991042.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638] gi|242121807|gb|ACS79503.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638] Length = 698 Score = 262 bits (670), Expect = 9e-68, Method: Composition-based stats. Identities = 101/593 (17%), Positives = 180/593 (30%), Gaps = 94/593 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + +FSAGELSPRL R DL+ ++ G+A+ N+ +G + Sbjct: 1 MS-VSLIMTNFSAGELSPRL-GGRVDLAKYSNGLAELENMFTHPHGGASRRTGFR----- 53 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 F V G +L + S+ W+ Sbjct: 54 -----------FIRE-------VMGRNQLPSASLDSAINWTVGNGWTVAS---------- 85 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 D + Sbjct: 86 -----------------------ANASCDGSQTDESTLSRNLELVADRIYEISFNVTG-- 120 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + + + + + D + R + + V +V Sbjct: 121 -FNSGAVCVSAGSDSLSEYVAADGSYTFRSKADADGLLSIIADADFSGAVEAVQVREINP 179 Query: 241 TGRSGDRFGYSKGATYVKDNNIT----WITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 R ++ A ++ + + + + S + G S Sbjct: 180 ATRLIPFEFSTEQAYVLEFTDRNIRIFKNGGIVVDDQGSPVEIQSPYTETDLPGIRFTQS 239 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFM---------SAWGEQEGYPSHVTFHNNRLLFSG 347 D + V W M W ++G+PS VTF RL F+ Sbjct: 240 ADVMYLVHPEVQPYKLSRTSHV-DWKMELVAFSSPPQEWNSEKGFPSCVTFFEERLCFAA 298 Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407 S + ++++S G++ DF++ A T ++ + I WM + +++G Sbjct: 299 SPSNPQTIWMSKAGSYEDFAVSSP---VVDDDACTYTLSADQVNAIRWMVSAKK-LIMGT 354 Query: 408 DTSLWLLSI----SLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST 463 W LS S+ RR + G A PPV VG ++F+ GR I+ +S S Sbjct: 355 SGGEWWLSGGSSLDSVTPNSVMVRRETTHGSAAIPPVVVGGVMLFLQREGRTIRELSYSF 414 Query: 464 E-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGE 521 E G+ ++T LA+HL I + YQ+ P S++W+ + ++G + E E Sbjct: 415 EADGYTAPDLTILAEHLTRSNSITEWAYQQSPDSVIWMTRD-----DGVMVGLTYQREHE 469 Query: 522 GDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLL 574 +H H K + P + G R + R+ Sbjct: 470 -VVGFHRHTTDGKFRSVCTVPGPTQEEVWVVVERE---VGGISRKYVERMENQ 518 >gi|30387490|ref|NP_848299.1| tail protein [Yersinia pestis phage phiA1122] gi|30314127|gb|AAP20535.1| tail protein [Yersinia pestis phage phiA1122] Length = 794 Score = 262 bits (670), Expect = 9e-68, Method: Composition-based stats. Identities = 63/587 (10%), Positives = 147/587 (25%), Gaps = 50/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTL 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + + VF +++ + + K G Y T Sbjct: 55 GDNGALGQAPYIHLINRDENEQYYAVFTGTGIRVFDLAGNEKQVRYPNGSNYIK--TANP 112 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G +I + Sbjct: 113 RSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGLINIRGGQYGRELIVHINGK 172 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 + D S ++ ++ I + ++ Sbjct: 173 DVATYKIPDGSKPEHVNNTDAQWLAERLAKQMRINLSGWTVNVGQGFIHVTAPSGQQIDS 232 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A Sbjct: 233 FTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWT 292 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W V + ++ + F S + +PS V Sbjct: 293 ETLGWNTENQVLLETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVF 352 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F + + AV+ + + + Sbjct: 353 FFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSND-DPIDVAVSTNRIAILKYAV 407 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 408 PFSEELLIWSDEAQFVLTASGTLTSRSVELNLTTQFDVQDRARPYGIGRNVYFASPRSSY 467 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL D S + Sbjct: 468 TSIHRYYAVQDVSSVKNSEDITSHVPNYIPNGVFSICGSGTENFC--SVLSHGDPSKIFM 525 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W + VL+ S + +++++ Sbjct: 526 YKFLYLNEELRQQSWSHWDFGENVQVLACQSIS------SDMYVILR 566 >gi|41179374|ref|NP_958682.1| Bbp13 [Bordetella phage BPP-1] gi|45569506|ref|NP_996575.1| hypothetical protein BMP-1p12 [Bordetella phage BMP-1] gi|45580757|ref|NP_996623.1| hypothetical protein BIP-1p12 [Bordetella phage BIP-1] gi|40950113|gb|AAR97679.1| Bbp13 [Bordetella phage BPP-1] Length = 681 Score = 261 bits (665), Expect = 4e-67, Method: Composition-based stats. Identities = 88/576 (15%), Positives = 172/576 (29%), Gaps = 79/576 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N + SF GE+SP + R D + G+A RN + GP + R+ Sbjct: 1 MSNVRVLQRSFGGGEISPE-MFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F+ + T G Sbjct: 60 KDSAKKVRLIPFTYSV-------------------TQTMVIELGAGY------------- 87 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + G T + P+ + Sbjct: 88 FRFHTNGGTLL----------------------------DGAVPYEIANPYAEADLFNIH 119 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + AD T + + +L T S+ A Y Sbjct: 120 YVQSADVLTLVHPNYAPRELRRLGATNWQLATIAFTSPVATPTSVTATSNNKGTDYTYRY 179 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + D G ++ A + ++ + ++SGA G+ Sbjct: 180 VVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGLYGYIGQ 239 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + + + + + + YP+ V++ R F+G+ +++++ Sbjct: 240 TTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRS 299 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS--LWLLSISL 418 G S + V A+ I + P E +L+ + ++ Sbjct: 300 GTESAMSYSLPVR---DDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSDA 356 Query: 419 SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLAD 477 +I R S G PV V + ++ G ++ ++ + + GF +++ A Sbjct: 357 VTPTTISVRPQSYVGATDVQPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRAA 416 Query: 478 HLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 HLF+ IL + Y + P IVW + +S +LLG + E + AWH H Sbjct: 417 HLFDNLDILDMAYAKAPQPIVWFI-----SSSGKLLGLTYVPEQQ-IGAWHQHDTDGVFE 470 Query: 537 VLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 + + L+ +V + G E + R+ Sbjct: 471 SCAVVA----EGNEDRLYAVVRRTIGGNEVRYVERM 502 >gi|326633075|ref|YP_004306686.1| predicted tail tubular protein B [Salmonella phage Vi06] gi|301170548|emb|CBV65236.1| predicted tail tubular protein B [Salmonella phage Vi06] Length = 795 Score = 256 bits (653), Expect = 9e-66, Method: Composition-based stats. Identities = 59/587 (10%), Positives = 144/587 (24%), Gaps = 49/587 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P M + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTL 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + VF +++ + + + + T Sbjct: 55 GGSDTLGPAPYIHLINRDESEQYYAVFTGTGIRVFDLAGNERQVRYTTDGSTYI-NTNNP 113 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + + D + + G + + N Sbjct: 114 RNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGGQYGRTLQIIINGN 173 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 + + D S ++ ++ P I ++ Sbjct: 174 TQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDS 233 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A + Sbjct: 234 LTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWS 293 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W + + ++ + F+ S + +PS V Sbjct: 294 ETLGWNVNDQLLFETMPHALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVF 353 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL + + LS +++F + AV+ + + + Sbjct: 354 FFRNRLGLLSGEN----IILSRTAKYFNFYPAS-IATLSDDDPIDVAVSTNRIAILKYAV 408 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 PF E +L+ D + ++L+ S + P +G + F Sbjct: 409 PFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 468 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 Q +IT + + + + VL D S + Sbjct: 469 TSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFC--AVLSQGDQSKIFM 526 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + E +W VL+ + +++++ Sbjct: 527 YKFLYLNEELRQQSWSHWDFGSNVQVLACQCI------NSDMYVILR 567 >gi|325272824|ref|ZP_08139161.1| tail tubular protein B [Pseudomonas sp. TJI-51] gi|324102029|gb|EGB99538.1| tail tubular protein B [Pseudomonas sp. TJI-51] Length = 781 Score = 253 bits (645), Expect = 7e-65, Method: Composition-based stats. Identities = 65/602 (10%), Positives = 155/602 (25%), Gaps = 54/602 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + +F G + + A + N I L+ P Sbjct: 1 MSLISSSIPNFVNG------VSQQPFTLRLASQLDAQENGISTVSEGLMKRPPTTHLARV 54 Query: 61 RLDPRSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119 P + + + + L++ V S + G +Y Sbjct: 55 TASPLESAFVHTINRDSTERYQVAITNGGLRVFAVDGSERTVSFPDGTSYLA--ASDPAS 112 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 V+K + + + + + G + Sbjct: 113 DFTAITVADYTFIVNKAITVANRAAVSGTRGP----EALISVIQGNYGRTYGVILNGVTV 168 Query: 180 LSISQADTSTARITSDMKIFKPL--------DKGRSIRLGCHPPEWAKNTNYSIGAYIVA 231 + + D S A T+ G + +++I Y Sbjct: 169 ATYATPDGSDATKTALASTDYIATELVAGIQSAGFTCVRAGSCLYITSTADFTIDCYDGF 228 Query: 232 DDKVYRSLTT-----GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 ++ ++ + G + + + ++G Sbjct: 229 NNNAMKAYKKVVQSFSTLPSNCTQAGGCLFEITGDPGDSSDDYYVYYDVGTDSTGVWREC 288 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTFH 339 G + ++ + F + + ++ + PS V F+ Sbjct: 289 VGPGVALGLDGSTMPHTLVRNADGTFTFQAATWTDRVAGDADTNEDPSFVGRTINDVVFY 348 Query: 340 NNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 NRL F + V S G +++F + + + T + + F Sbjct: 349 RNRLGFLADEA----VIFSESGKYWNFYRTTV-TELLDSDPIDVSSTYTKVAILKHAVSF 403 Query: 400 GEGVLVGCDTSLWLL-SISLSKGLSIDFRRVSGS-GVYACPPVSVGDCLVFVCGVGRRIK 457 + +L+ D +L+ + +I + + P SVG + F Sbjct: 404 NKQLLLFSDEVQFLIDNGDTLTPKTISIKPSTEFVCNALTTPQSVGKNVYFASDRENWTA 463 Query: 458 YISGSTEQGFRFNEITQLADH---LFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGC 514 T+ N+ T +A H + ++ + VL D + Sbjct: 464 IREYFTDTNDVSNDSTDVASHVPQYIPSGVFKIASSSSEDML--CVLTTGDRHSIYVYKF 521 Query: 515 RFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLL 574 + + + +W D + N + +++ + + G + +L + Sbjct: 522 YWDGDTKVQSSWSKWTFPDT------DTILNAEFLDSEVFLAINRADG---LYFEKLTVA 572 Query: 575 DD 576 D Sbjct: 573 TD 574 >gi|26989008|ref|NP_744433.1| tail tubular protein B [Pseudomonas putida KT2440] gi|24983829|gb|AAN67897.1|AE016421_9 tail tubular protein B [Pseudomonas putida KT2440] Length = 781 Score = 251 bits (641), Expect = 2e-64, Method: Composition-based stats. Identities = 63/602 (10%), Positives = 154/602 (25%), Gaps = 54/602 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + +F G + + + + N I L+ P Sbjct: 1 MSLISSSIPNFVNG------VSQQPFTLRLSSQLDAQENGISTVSEGLMKRPPTTHLARV 54 Query: 61 RLDPRSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119 P + + + + L++ V + + G Y Sbjct: 55 TASPLESAFVHTINRDASERYQVAITNGGLRVFAVDGTERTVSFPDGTGYLA--ASDPAS 112 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 V+K + + + + + G + Sbjct: 113 DFTAITVADYTFIVNKAITVANRAAVSAPRGP----EALISVIQGNYGRTYGVILNGVTV 168 Query: 180 LSISQADTSTARITSDMKIFKPL--------DKGRSIRLGCHPPEWAKNTNYSIGAYIVA 231 + + D S A TS G + +++I Y Sbjct: 169 ATYATPDGSDATKTSLASTDYIATELVAGIQSAGFTCVRAGSCLYITSTADFTIDCYDGF 228 Query: 232 DDKVYRSLTT-----GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 ++ ++ + G + + + ++G Sbjct: 229 NNNAMKAYKKVVQSFSTLPSNCTQAGGCLFEITGDPGDSSDDYYVYYDVGTDSTGVWREC 288 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTFH 339 G + ++ + F + + ++ + PS V F+ Sbjct: 289 VGPGVALGLDGSTMPHTLVRNADGTFTFQAATWTDRVAGDADTNEDPSFVGRTINDVVFY 348 Query: 340 NNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 NRL F + V S G +++F + + + T + + F Sbjct: 349 RNRLGFLADEA----VIFSESGKYWNFYRTTV-TELLDSDPIDVSSTYTKVAILKHAVSF 403 Query: 400 GEGVLVGCDTSLWLL-SISLSKGLSIDFRRVSGS-GVYACPPVSVGDCLVFVCGVGRRIK 457 + +L+ D +L+ + +I + + P SVG + F Sbjct: 404 NKQLLLFSDEVQFLIDNGDTLTPKTISIKPSTEFVCNALTTPQSVGKNVYFASDRENWTA 463 Query: 458 YISGSTEQGFRFNEITQLADH---LFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGC 514 T+ N+ T +A H + ++ + VL D + Sbjct: 464 IREYFTDTNDVSNDSTDVASHVPQYIPSGVFKIASSSSEDML--CVLTTGDRHSIYVYKF 521 Query: 515 RFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLL 574 + + + +W D + + + +++ + + G + +L + Sbjct: 522 YWDGDTKVQSSWSKWTFPDT------DTILSAEFLDSEVFLAINRADG---LYFEKLTVA 572 Query: 575 DD 576 D Sbjct: 573 TD 574 >gi|194100345|ref|YP_002003775.1| gp12 [Enterobacteria phage EcoDS1] gi|193201340|gb|ACF15819.1| gp12 [Enterobacteria phage EcoDS1] Length = 785 Score = 248 bits (633), Expect = 2e-63, Method: Composition-based stats. Identities = 73/604 (12%), Positives = 148/604 (24%), Gaps = 50/604 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T + + G + + D+ + + N L P R Sbjct: 1 MPLITQSIKNLKGG------ISQQPDILRFSDQGEEQVNCWSSESDGLQKRPPTVFKRRL 54 Query: 61 RLDPRSNRVFSFSIPDGG-YALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119 +D SN F D +VF +Q+V + + + Sbjct: 55 NIDVGSNPKFHLINRDEQEQYYIVFNGSNIQVVDLSGNQYSVSGEVDYVKSS----NPRD 110 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 + V++ I + + Sbjct: 111 DIRVVTVADYTFIVNRKVVVKGGSEKSHSGYNRKARALINLRGGQYGRTLKVGINGGVKV 170 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 A + R + + +P + + + + Sbjct: 171 SHKLPAGNDAENDPPKVDAQAIGAALRDLLVAAYPTFTFDLGSGFLLITAPSGTDINSVE 230 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VAPY 286 T ++ T + + K E+ S A Sbjct: 231 TEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEYYVMYDSNTKTWKET 290 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTFH 339 G + ++ QS F+ S S + PS V F+ Sbjct: 291 VEPGVVTGFDNTTMPHALVRQSDGSFEFKTLDWSKRGSGNDDTNPMPSFVDATINDVFFY 350 Query: 340 NNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 NRL F + V +S +++ F D + AV+ S + + PF Sbjct: 351 RNRLGFLSGEN----VIMSRSASYFAFFPKSAATLSDD-DPIDVAVSHPRISILKYAVPF 405 Query: 400 GEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV--YACPPVSVGDCLVFVCGVGRRIK 457 E +L+ D ++++ S V P +VG + F G Sbjct: 406 SEQLLLWSDEVQFVMTSSGVLTAKSIQLDVGSEFSLGDNARPFAVGRSVFFSAPRGSFTS 465 Query: 458 YISG----STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLG 513 ++ T + + + I V + + Sbjct: 466 IKRYFAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYI--CVNSTGAYNRIYIYK 523 Query: 514 CRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573 F + +W +L++AS G++++++ G + + Sbjct: 524 FLFKDGVQLQASWSHWEFPKADKILASASI------GSTMFIVRQHQGGVDLEHLKFIKE 577 Query: 574 LDDF 577 DF Sbjct: 578 ATDF 581 >gi|77118200|ref|YP_338122.1| tail tube [Enterobacteria phage K1F] gi|72527944|gb|AAZ72996.1| tail tube [Enterobacteria phage K1F] gi|83308152|emb|CAJ29385.1| gp12 protein [Enterobacteria phage K1F] Length = 785 Score = 245 bits (624), Expect = 2e-62, Method: Composition-based stats. Identities = 73/604 (12%), Positives = 147/604 (24%), Gaps = 50/604 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T + + G + + D+ + N L P R Sbjct: 1 MPLITQSIKNLKGG------ISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRL 54 Query: 61 RLDPRSNRVFSFSIPDGG-YALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119 +D SN F D +VF +QIV + + + Sbjct: 55 NIDVGSNPKFHLINRDEQEQYYIVFNGSNIQIVDLSGNQYSVSGSVDYVKSS----NPRD 110 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 + V++ I + + Sbjct: 111 DIRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQYGRTLKVGINGGVKV 170 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 A + R + + +P + + + + Sbjct: 171 SHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTFDLGSGFLLITAPSGTDINSVE 230 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VAPY 286 T ++ T + + K E+ S A Sbjct: 231 TEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEYYVMYDSNTKTWKET 290 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTFH 339 G + ++ QS F+ S + + PS V F+ Sbjct: 291 VEPGVVTGFDNTTMPHALVRQSDGSFEFKALDWSKRGAGNDDTNPMPSFVDATINDVFFY 350 Query: 340 NNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 NRL F + V +S +++ F D + AV+ S + + PF Sbjct: 351 RNRLGFLSGEN----VIMSRSASYFAFFPKSVATLSDD-DPIDVAVSHPRISILKYAVPF 405 Query: 400 GEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV--YACPPVSVGDCLVFVCGVGRRIK 457 E +L+ D ++++ S V P +VG + F G Sbjct: 406 SEQLLLWSDEVQFVMTSSGVLTSKSIQLDVGSEFALGDNARPFAVGRSVFFSAPRGSFTS 465 Query: 458 YISG----STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLG 513 ++ T + + + I V + + Sbjct: 466 IKRYFAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYI--CVNSTGAYNRIYIYK 523 Query: 514 CRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573 F + +W +L++AS G++++++ G + + Sbjct: 524 FLFKDSVQLQASWSHWEFPKDDKILASASI------GSTMFIVRQHQGGVDIEHLKFIKE 577 Query: 574 LDDF 577 DF Sbjct: 578 ATDF 581 >gi|212671415|ref|YP_002308415.1| tubular tail protein B [Kluyvera phage Kvp1] gi|211997259|gb|ACJ14576.1| tubular tail protein B [Kluyvera phage Kvp1] Length = 793 Score = 242 bits (616), Expect = 2e-61, Method: Composition-based stats. Identities = 67/589 (11%), Positives = 146/589 (24%), Gaps = 56/589 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + A+ N L P + Sbjct: 1 MALISQSVKNLKGG------ISQQPDILRFPEQGAEQINGWSSETEGLQKRPPFIFTKTI 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D +VF +++ + Sbjct: 55 GDAGFLGGAPLVHLINRDSIEQYYVVFTGSGVKVFDLNGREYAVHGDTSYA----NCANP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS- 176 L V++ I + + + G + Sbjct: 111 RDDLRMVTVADYTFVVNRSKVVQ--ANKDPIYTIREDGECLINIRGGQYGRTFTIRLNGI 168 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWA-KNTNYSIGAYIVADDKV 235 +A I+ + +D + G + W I D+ + Sbjct: 169 SASYKIADGANAPEVEQTDAQWLVKKMAQLLREGGANTWGWTVNEGAGYIHVVSRGDEPI 228 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA------------- 282 ++ G + + T + + S + +++ + Sbjct: 229 WKVEVEDGYGGQLMSAVMHTSQSFSKLPAEAPNGYSVQIVGDTSKTSDAFYVQYDAARKV 288 Query: 283 VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH------- 335 WG K ++ ++ QS F+ PS Sbjct: 289 WKEVAGWGVQKGLNNGTMPHALIRQSDGSFKMEALPWDERKCGDMNTNPDPSIVDQKIND 348 Query: 336 VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395 V F NRL F + + +S ++ D + AV+ ST+ + Sbjct: 349 VFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD-DPIDVAVSHNRISTLKY 403 Query: 396 MHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVG 453 PF E +L+ D + ++LS S S P +G + F Sbjct: 404 AVPFSEELLLWSDQAQFVLSASGILSPKSVELNLTTEFDVSDKARPYGIGRGVYFASPRA 463 Query: 454 RRIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFP 509 Q +++ + + + + VL S Sbjct: 464 SYTSINRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFSIRGSGTENFV--SVLSANAPSKI 521 Query: 510 RLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 + + E +W + VL+ S G+++++L+ Sbjct: 522 FMYKFLYLNEENVQQSWSHWELGSNVTVLACDSI------GSTMYLLLR 564 >gi|323512066|gb|ADX87527.1| tail tubular protein B [Vibrio phage ICP3_2009_B] Length = 794 Score = 240 bits (613), Expect = 3e-61, Method: Composition-based stats. Identities = 67/571 (11%), Positives = 148/571 (25%), Gaps = 43/571 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ ++ +K N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHVKRL 54 Query: 61 RLD---PRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFK 116 + + + + F +++ + K A G +Y T + Sbjct: 55 TDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SSN 112 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 K L ++++ F + + G V Sbjct: 113 PRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVNG 172 Query: 177 NAKLSISQADTSTARITSDMKIFKPLD------KGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + S + I +D R + + + + ++ + Sbjct: 173 SVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVLINSL 232 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNN----ITWITVLNLSSKTSRESASG-AVAP 285 + Y + + NN ++ LN + AS Sbjct: 233 EVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVKFDASRNVWTE 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTF 338 D +KD + ++ F + + + + YPS + F Sbjct: 293 CPAPNIKADYNKDTMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSINDIFF 352 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + V LS G +++F + T + AV+ S + + P Sbjct: 353 FRNRLGFLSGEN----VILSGSGNYFNFFPESVA-VLTDTDPIDVAVSTNRISILKYAVP 407 Query: 399 FGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 F E +++ D + ++LS + P +G + FV + Sbjct: 408 FSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSPRAKFS 467 Query: 457 KYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 Q +I+ + ++ + +L + Sbjct: 468 SVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTEN--FLTILTEGNEQRVYFY 525 Query: 513 GCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 + E +W VL Sbjct: 526 KFLYLQEQLVQQSWSHWDFGVNCRVLCCDMI 556 >gi|323512115|gb|ADX87575.1| tail tubular protein B [Vibrio phage ICP3_2009_A] Length = 794 Score = 240 bits (612), Expect = 4e-61, Method: Composition-based stats. Identities = 67/571 (11%), Positives = 148/571 (25%), Gaps = 43/571 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ ++ +K N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHVKRL 54 Query: 61 RLD---PRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFK 116 + + + + F +++ + K A G +Y T + Sbjct: 55 TDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SSN 112 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 K L ++++ F + + G V Sbjct: 113 PRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVNG 172 Query: 177 NAKLSISQADTSTARITSDMKIFKPLD------KGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + S + I +D R + + + + ++ + Sbjct: 173 SVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINSL 232 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNN----ITWITVLNLSSKTSRESASG-AVAP 285 + Y + + NN ++ LN + AS Sbjct: 233 EVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVKFDASRNVWTE 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTF 338 D +KD + ++ F + + + + YPS + F Sbjct: 293 CPAPNIKADYNKDTMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSINDIFF 352 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + V LS G +++F + T + AV+ S + + P Sbjct: 353 FRNRLGFLSGEN----VILSGSGNYFNFFPESVA-VLTDTDPIDVAVSTNRISILKYAVP 407 Query: 399 FGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 F E +++ D + ++LS + P +G + FV + Sbjct: 408 FSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSPRAKFS 467 Query: 457 KYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 Q +I+ + ++ + +L + Sbjct: 468 SVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTEN--FLTILTEGNEQRVYFY 525 Query: 513 GCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 + E +W VL Sbjct: 526 KFLYLQEQLVQQSWSHWDFGVNCRVLCCDMI 556 >gi|281416310|ref|YP_003347550.1| tail tubular protein B [Klebsiella phage KP32] gi|262410429|gb|ACY66694.1| tail tubular protein B [Klebsiella phage KP32] Length = 791 Score = 240 bits (612), Expect = 4e-61, Method: Composition-based stats. Identities = 63/606 (10%), Positives = 142/606 (23%), Gaps = 52/606 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + ++ + + N L P M + Sbjct: 1 MALVSQSIKNLKGG------ISQQPEILRYPEQGTLQVNGWSSETEGLQKRPPMVFIKSL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + Sbjct: 55 GGRGYLGEDPYIHLINRDEYEQYYAVFTGNNVRVFDLSGYEYQVRGDRSYVTVN----NP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 +L V++ + +G D + + G + + Sbjct: 111 KDNLRMVTVADYTFIVNRTRQVRESQNLTNGGTFRDNVDALINVRGGQYGRKLEVNINGV 170 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 + + + + HP I A + Sbjct: 171 WVSHQLPPGDNAKDDPPKVDAQAIAEAIAVLLRTAHPTWTFNVGTGFIHCIAPAGTTIDI 230 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A Sbjct: 231 LETKDGYADQLINPVTHYVQSFSKLPLNAPDGYMVKIVGDTSKTADQYYVKYDKSQKVWK 290 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VT 337 W + ++ + F G + + PS V Sbjct: 291 ETVGWNISIGLDYTTMPWTLVRAADGNFDLGYHDWKDRRAGDEDTNPQPSFVNSTITDVF 350 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + +S +++F D L AV+ S + + Sbjct: 351 FFRNRLGFISGEN----IVMSRTSKYFEFYPPSVANYTDD-DPLDVAVSHNRVSVLKYAV 405 Query: 398 PFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 F E +L+ D + ++LS + S + P +G + + Sbjct: 406 SFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQFDVSDRARPYGIGRNIYYASPRSSF 465 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 + Q ++T + + + + VL S + Sbjct: 466 TSIMRYYAVQDVSSVKNAEDMTAHVPNYIPNGVYSINGSGTENFA--CVLTKGAPSKVFI 523 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 + E +W D V++A ++++ML+ + + Sbjct: 524 YKFLYMDENIRQQSWSHWDFGDGVEVMAANCI------NSTMYMLMRNAYNVWIAAVDFK 577 Query: 572 NLLDDF 577 DF Sbjct: 578 KNSTDF 583 >gi|323512212|gb|ADX87670.1| tail tubular protein B [Vibrio phage ICP3_2007_A] Length = 794 Score = 240 bits (612), Expect = 5e-61, Method: Composition-based stats. Identities = 66/571 (11%), Positives = 147/571 (25%), Gaps = 43/571 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ ++ +K N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHVKRL 54 Query: 61 RLD---PRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFK 116 + + + + F +++ + K A G +Y T + Sbjct: 55 TDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SSN 112 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 K L ++++ F + + G V Sbjct: 113 PRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVNG 172 Query: 177 NAKLSISQADTSTARITSDMKIFKPLD------KGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + S + I +D R + + + + ++ + Sbjct: 173 SVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINSL 232 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNN----ITWITVLNLSSKTSRESASG-AVAP 285 + Y + + NN ++ LN + AS Sbjct: 233 EVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVKFDASRNVWTE 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTF 338 D +K + ++ F + + + + YPS + F Sbjct: 293 CPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSINDIFF 352 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + V LS G +++F + T + AV+ S + + P Sbjct: 353 FRNRLGFLSGEN----VILSGSGNYFNFFPESVA-VLTDTDPIDVAVSTNRISILKYAVP 407 Query: 399 FGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 F E +++ D + ++LS + P +G + FV + Sbjct: 408 FSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSPRAKFS 467 Query: 457 KYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 Q +I+ + ++ + +L + Sbjct: 468 SVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTEN--FLTILTEGNEQRVYFY 525 Query: 513 GCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 + E +W VL Sbjct: 526 KFLYLQEQLVQQSWSHWDFGVNCRVLCCDMI 556 >gi|323512164|gb|ADX87623.1| tail tubular protein B [Vibrio phage ICP3_2008_A] Length = 795 Score = 240 bits (612), Expect = 5e-61, Method: Composition-based stats. Identities = 66/571 (11%), Positives = 147/571 (25%), Gaps = 43/571 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ ++ +K N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHVKRL 54 Query: 61 RLD---PRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFK 116 + + + + F +++ + K A G +Y T + Sbjct: 55 TDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SSN 112 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 K L ++++ F + + G V Sbjct: 113 PRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVNG 172 Query: 177 NAKLSISQADTSTARITSDMKIFKPLD------KGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + S + I +D R + + + + ++ + Sbjct: 173 SVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINSL 232 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNN----ITWITVLNLSSKTSRESASG-AVAP 285 + Y + + NN ++ LN + AS Sbjct: 233 EVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVKFDASRNVWTE 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTF 338 D +K + ++ F + + + + YPS + F Sbjct: 293 CPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSINDIFF 352 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + V LS G +++F + T + AV+ S + + P Sbjct: 353 FRNRLGFLSGEN----VILSGSGNYFNFFPESVA-VLTDTDPIDVAVSTNRISILKYAVP 407 Query: 399 FGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 F E +++ D + ++LS + P +G + FV + Sbjct: 408 FSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSPRAKFS 467 Query: 457 KYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 Q +I+ + ++ + +L + Sbjct: 468 SVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTEN--FLTILTEGNEQRVYFY 525 Query: 513 GCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 + E +W VL Sbjct: 526 KFLYLQEQLVQQSWSHWDFGVNCRVLCCDMI 556 >gi|187736306|ref|YP_001878418.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC BAA-835] gi|187426358|gb|ACD05637.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC BAA-835] Length = 822 Score = 239 bits (610), Expect = 8e-61, Method: Composition-based stats. Identities = 113/668 (16%), Positives = 198/668 (29%), Gaps = 124/668 (18%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF+AGEL+P L R DL ++G ++ N + +G L P + Sbjct: 1 MAKQVLQRLSFTAGELTPWL-AGRADLDPVSRGASRLINFLVSPFGGLRRRPGTRLVARA 59 Query: 61 RLDPRSNRVFS--------FSIPDGGYALLVFGD------------------------KK 88 R+ S F + G + F + Sbjct: 60 GCREGMVRLVSFKYSTGVQFMLEVGRGYVRYFKNGALLTDTEGGVLETLTPWKTDEQVSN 119 Query: 89 LQIVVVRSSTKWSPALFGKTYKTPYTFKD--NKSLEYAVFGSTAVFVHKDHPPHHLLYIQ 146 L++ + Y D ++LE++ + ++ ++ Sbjct: 120 LRMQQLNDVIYCVEPSTPPMTLARYADDDWRLEALEFSGIPYESSLLNAVRLECRMVREG 179 Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 +++ T D+ F P + + K ++ T ++K G Sbjct: 180 GVNRLLATADDDVFTPEMEGKEFLRITRKYGETVAEGNQMPFYHLTTLSRDLYK----GE 235 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKV--------YRSLTTGRSGDRFGYSKGATYVK 258 + + + I + D Y + + + Sbjct: 236 TFSMNREDGW--RQAYTCIRDFSRESDYQEGVDRPERYTAFFEKGADASTRIYVNGAWTL 293 Query: 259 DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV----------------------- 295 + TW + S P VW +K Sbjct: 294 ETTGTWDAEWEICRGYPDGSNYLPNRPELVWHSVKSFQQREGFRNNFTLSGNEEEMSYYK 353 Query: 296 -------SKDGRSISVAPQSQTLFQ---------------------------AGVSVVSW 321 V S F + W Sbjct: 354 IRLMAYKDGSSAGTPVFRASAGSFNHEVVVEEYVSPRSAYLASALHLSYHTLSDCDTNDW 413 Query: 322 FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381 A+G + GYP V FH RL F G+ G +++ S F F+ + Sbjct: 414 SFGAFGVRNGYPCTVEFHQGRLWFGGTPGQPQTLWASRVDDFSAFTPGIPA-----DSPM 468 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID---FRRVSGSGVYACP 438 + + I W+ G+++G W LS + S+GL+ F R SG G + Sbjct: 469 ILTMAASQQNRISWIASLR-GLMIGTSEGEWRLSATNSEGLNASNAGFERHSGVGSASLD 527 Query: 439 PVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIV 497 +SV + L+FV G +++ + S E G++ +++ L+DHL + I+ Q V Sbjct: 528 ALSVENSLLFVQQGGMKVRELFYSLEADGYQTRDVSLLSDHLLGEGIVDWTVQRSTAFHV 587 Query: 498 WVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGT-SLWML 556 W VL + + AWH H + +LS AS +W Sbjct: 588 WCVL-----GDGSAVCMT-LNREQNVVAWHAHRLEHG-RILSVASLRGSRNTPDEEVWFA 640 Query: 557 VALSAGEE 564 VA GEE Sbjct: 641 VARGEGEE 648 >gi|312436378|gb|ADQ83187.1| tail tubular protein B [Yersinia phage Yep-phi] Length = 792 Score = 239 bits (609), Expect = 1e-60, Method: Composition-based stats. Identities = 65/588 (11%), Positives = 149/588 (25%), Gaps = 55/588 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSVKNLKGG------ISQQPDILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTI 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + + +VF + +++ + Sbjct: 55 GDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLDGKEYSVKGDLSYVKV----GNP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + D + + G + + + Sbjct: 111 RDDLRMVTVADYTFIVNRNMVVR--PDTTPLYTLKENGDCLINIRGGMYGRTLAFTINNT 168 Query: 178 -AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 I+ D +D + G + I ++ ++ Sbjct: 169 KIAYEIAHGDVPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQIN 228 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------V 283 T D+ + T + + + K +++ + Sbjct: 229 SLSTEDGYADQLMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNLKKVW 288 Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------V 336 WG K ++ D ++ Q+ FQ + + PS V Sbjct: 289 KEVAGWGVQKGLNGDTMPHALVRQADGSFQMQALPWAQRTCGDMDTNPTPSIVDQTINDV 348 Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM 396 F NRL F + + +S ++ D + AV+ S + + Sbjct: 349 FFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD-DPIDVAVSHNRISILKYA 403 Query: 397 HPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGR 454 PF E +L+ D + ++LS S P VG + F Sbjct: 404 VPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRAS 463 Query: 455 RIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPR 510 Q +++ + + + I VL S Sbjct: 464 YTSLNRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFSIRGSSTENFI--SVLSSNAPSRIF 521 Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 L + E +W + VL+ S G+++++++ Sbjct: 522 LYKFLYLNEEIAQQSWSHWELGSNVTVLACDSI------GSTMYLVLR 563 >gi|325171313|ref|YP_004251284.1| tail tubular protein B [Vibrio phage ICP3] gi|323512019|gb|ADX87481.1| tail tubular protein B [Vibrio phage ICP3] Length = 794 Score = 239 bits (608), Expect = 1e-60, Method: Composition-based stats. Identities = 66/571 (11%), Positives = 147/571 (25%), Gaps = 43/571 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ ++ +K N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEIEGLQKRPPSVHVKRL 54 Query: 61 RLD---PRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFK 116 + + + + F +++ + K A G +Y T + Sbjct: 55 TDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SSN 112 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 K L ++++ F + + G V Sbjct: 113 PRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVNG 172 Query: 177 NAKLSISQADTSTARITSDMKIFKPLD------KGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + S + I +D R + + + + ++ + Sbjct: 173 SVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINSL 232 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNN----ITWITVLNLSSKTSRESASG-AVAP 285 + Y + + NN ++ LN + AS Sbjct: 233 EVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVKFDASRNVWTE 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTF 338 D +K + ++ F + + + + YPS + F Sbjct: 293 CPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSINDIFF 352 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + V LS G +++F + T + AV+ S + + P Sbjct: 353 FRNRLGFLSGEN----VILSGSGNYFNFFPESVA-VLTDTDPIDVAVSTNRISILKYAVP 407 Query: 399 FGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 F E +++ D + ++LS + P +G + FV + Sbjct: 408 FSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSPRAKFS 467 Query: 457 KYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 Q +I+ + ++ + +L + Sbjct: 468 SVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTEN--FLTILTEGNEQRVYFY 525 Query: 513 GCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 + E +W VL Sbjct: 526 KFLYLQEQLVQQSWSHWDFGVNCRVLCCDMI 556 >gi|68299742|ref|YP_249591.1| Tail tubular protein B [Vibriophage VP4] gi|66473281|gb|AAY46290.1| tail tubular protein B [Vibriophage VP4] Length = 794 Score = 239 bits (608), Expect = 1e-60, Method: Composition-based stats. Identities = 67/571 (11%), Positives = 148/571 (25%), Gaps = 43/571 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ ++ +K N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRL 54 Query: 61 RLD---PRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFK 116 + + + + F +++ + K A G +Y + + Sbjct: 55 TDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVS--SSN 112 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 K L ++++ F + + G V Sbjct: 113 PRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRIKVNG 172 Query: 177 NAKLSISQADTSTARITSDMKIFKPLD------KGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + S + I +D + + + + + S+ + Sbjct: 173 SVEASFETPLGDQVAHAKQIDIAYIIDQLAAGLINKGWAVTKGSGYFYFSKSGSVIINSL 232 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNN----ITWITVLNLSSKTSRESASG-AVAP 285 + Y + + NN ++ LN R AS Sbjct: 233 EVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVRFDASRNVWTE 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTF 338 D +K + ++ F + + + E YPS + F Sbjct: 293 CPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDETNPYPSFIGNSINDIFF 352 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + V LS G +++F + T + AV+ S + + P Sbjct: 353 FRNRLGFLSGEN----VILSGSGNYFNFFPESVA-VLTDTDPIDVAVSTNRISILKYAVP 407 Query: 399 FGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 F E +++ D + ++LS + P +G + FV + Sbjct: 408 FSEELILWSDQAQFVLSSDGGLTPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSPRAKFS 467 Query: 457 KYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 Q +I+ + + ++ + +L + Sbjct: 468 SVRRFYAVQDVTQVKNAEDISAHVPYYVENGVFKMSGSSTEN--FLTILTEGNEQRVYFY 525 Query: 513 GCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 + E +W VL Sbjct: 526 KFLYLQEQLVQQSWSHWDFGVNCRVLCCDMI 556 >gi|326536137|ref|YP_004300571.1| gp12 [Enterobacteria phage 285P] gi|256861526|gb|ACV32482.1| gp12 [Enterobacteria phage 285P] Length = 795 Score = 239 bits (608), Expect = 1e-60, Method: Composition-based stats. Identities = 64/591 (10%), Positives = 147/591 (24%), Gaps = 58/591 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + ++ + ++ N L P + Sbjct: 1 MALISQSVKNLKGG------ISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTI 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + + +VF + +++ + Sbjct: 55 GDQNALGAKPLVHLINRDSVEQYYVVFTGQGIRVFDLNGKEYAVKGDLSYVKV----GNP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS- 176 L V+++ + D + + G + + Sbjct: 111 RDDLRMVTVADYTFIVNRNMVVR--ADTAPLYDLKENGDCLINVRGGQYGRTLAFTINGV 168 Query: 177 --NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEW-AKNTNYSIGAYIVADD 233 K+ D + + + R +W I + Sbjct: 169 RIAYKIHNGVGDGAEQAVQETDAQWLVKKLAGLARAHGSFKDWKFNEGPGFIHVIAPGNS 228 Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA----------- 282 ++ T ++ + T + + + K +++ + Sbjct: 229 QINSLSTEDGYANQLMNAVMHTSQSFSKLPLEAPNGYTVKIVGDTSKTSDQFYVQYDNVK 288 Query: 283 --VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH----- 335 WG K ++ ++ QS FQ S + PS Sbjct: 289 KVWKEVAGWGVQKGLNGGTMPHALVRQSDGSFQMQALPWSQRTCGDMDTNPTPSIVDQSI 348 Query: 336 --VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393 V F NRL F + + +S ++ D + AV+ S + Sbjct: 349 NDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD-DPIDVAVSHNRISIL 403 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCG 451 + PF E +L+ D + ++LS S P VG + F Sbjct: 404 KYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASP 463 Query: 452 VGRRIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNS 507 Q +++ + + + I VL S Sbjct: 464 RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFSIRGSGTENFI--SVLSANAPS 521 Query: 508 FPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 L + E +W + VL+ S G+++++++ Sbjct: 522 KIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSI------GSTMYLVLR 566 >gi|281416199|ref|YP_003347934.1| tail tubular protein B [Vibrio phage N4] gi|237701506|gb|ACR16499.1| tail tubular protein B [Vibrio phage N4] Length = 794 Score = 238 bits (606), Expect = 2e-60, Method: Composition-based stats. Identities = 65/560 (11%), Positives = 145/560 (25%), Gaps = 43/560 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ ++ +K N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHVKRL 54 Query: 61 RLD---PRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFK 116 + + + + F +++ + K A G +Y T + Sbjct: 55 TDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SSN 112 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 K L ++++ F + + G V Sbjct: 113 PRKDLRMVTVADYTFILNRNVSTAQGTTNTPRGLAPFGHFGLVVIRGGQYGRTYRVKVNG 172 Query: 177 NAKLSISQADTSTARITSDMKIFKPLD------KGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + S + I +D R + + + + S+ + Sbjct: 173 SVEASFETPLGDQVEHAKQIDIAYIIDQLAARLINRGWAVTKGSGYFYFSKSGSVIIKSL 232 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNN----ITWITVLNLSSKTSRESASG-AVAP 285 + Y + + NN ++ LN + AS Sbjct: 233 EVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVKFDASRNVWTE 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------VTF 338 D +K + ++ F + + + + YPS + F Sbjct: 293 CPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSINDIFF 352 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + V LS G +++F + T + AV+ S + + P Sbjct: 353 FRNRLGFLSGEN----VILSGSGNYFNFFPESVA-VLTDTDPIDVAVSTNRISILKYAVP 407 Query: 399 FGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 F E +++ D + ++LS + P +G + FV + Sbjct: 408 FSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQARPFGIGRGVYFVSPRAKFS 467 Query: 457 KYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 Q +I+ + ++ + +L + Sbjct: 468 SVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTEN--FLTILTEGNEQRVYFY 525 Query: 513 GCRFSAEGEGDFAWHTHMIS 532 + E +W Sbjct: 526 KFLYLQEQLVQQSWSHWDFG 545 >gi|194100290|ref|YP_002003488.1| gp12 [Enterobacteria phage BA14] gi|193201285|gb|ACF15765.1| gp12 [Enterobacteria phage BA14] Length = 795 Score = 237 bits (605), Expect = 3e-60, Method: Composition-based stats. Identities = 64/591 (10%), Positives = 147/591 (24%), Gaps = 58/591 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + ++ + ++ N L P + Sbjct: 1 MALISQSVKNLKGG------ISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTI 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + + +VF + +++ + Sbjct: 55 GDQNALGAKPLVHLINRDSVEQYYVVFTGQGVRVFDLNGKEYAVKGDLSYVKV----GNP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS- 176 L V+++ + D + + G + + Sbjct: 111 RDDLRMVTVADYTFIVNRNMVVR--ADTAPLYNLKENGDCLINVRGGQYGRTLAFTINGV 168 Query: 177 --NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEW-AKNTNYSIGAYIVADD 233 K+ D + + + R +W I + Sbjct: 169 RIAYKIHNGVGDGAEQAVQETDAQWLVKKLAGLARAHGSFKDWKFDEGPGFIHVIAPGNS 228 Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA----------- 282 ++ T ++ + T + + + K +++ + Sbjct: 229 QINSLSTEDGYANQLMNAVMHTSQSFSKLPLEAPNGYTVKIVGDTSKTSDQFYVQYDNVK 288 Query: 283 --VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH----- 335 WG K ++ ++ QS FQ S + PS Sbjct: 289 KVWKEVAGWGVQKGLNGGTMPHALVRQSDGSFQMQALPWSQRTCGDMDTNPTPSIVDQTI 348 Query: 336 --VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393 V F NRL F + + +S ++ D + AV+ S + Sbjct: 349 NDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD-DPIDVAVSHNRISIL 403 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCG 451 + PF E +L+ D + ++LS S P VG + F Sbjct: 404 KYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASP 463 Query: 452 VGRRIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNS 507 Q +++ + + + I VL S Sbjct: 464 RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFSIRGSGTENFI--SVLSANAPS 521 Query: 508 FPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 L + E +W + VL+ S G+++++++ Sbjct: 522 KIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSI------GSTMYLVLR 566 >gi|119637778|ref|YP_919014.1| Tubular tail protein B [Yersinia phage Berlin] gi|119391809|emb|CAJ70682.1| hypothetical protein [Yersinia phage Berlin] Length = 792 Score = 235 bits (599), Expect = 2e-59, Method: Composition-based stats. Identities = 64/588 (10%), Positives = 150/588 (25%), Gaps = 55/588 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + ++ + ++ N L P + Sbjct: 1 MALISQSVKNLKGG------ISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTI 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + + +VF + +++ + Sbjct: 55 GDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLNGKEYDVKGDLSYVKV----ENP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + D + + G + + + Sbjct: 111 RDDLRMVTVADYTFIVNRNMVVR--PDTTPLYTLKENGDCLINIRGGMYGRTLAFTINNT 168 Query: 178 -AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 I+ D +D + G + I ++ ++ Sbjct: 169 KIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQIN 228 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------V 283 T D+ + T + + + K +++ + Sbjct: 229 SLSTEDGYADQLMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNMKKVW 288 Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------V 336 WG K ++ ++ Q+ FQ V + + PS V Sbjct: 289 KEVAGWGVQKGLNGGTMPHALVRQADGSFQMQVLPWTQRTCGDMDTNPTPSIVDQKINDV 348 Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM 396 F NRL F + + +S ++ D + AV+ S + + Sbjct: 349 FFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD-DPIDVAVSHNRISILKYA 403 Query: 397 HPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGR 454 PF E +L+ D + ++LS S P VG + F Sbjct: 404 VPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRAS 463 Query: 455 RIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPR 510 Q +++ + + + + I VL S Sbjct: 464 YTSLNRYYAVQDVSSVKSAEDMSAHVPNYIPNGVFSIRGSSTENFI--SVLSSNAPSRIF 521 Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 L + E +W + VL+ S G+++++++ Sbjct: 522 LYKFLYLNEEIAQQSWSHWELGSNVTVLACDSI------GSTMYLVLR 563 >gi|194100501|ref|YP_002003346.1| gp12 [Yersinia phage Yepe2] gi|193201234|gb|ACF15715.1| gp12 [Yersinia phage Yepe2] Length = 792 Score = 235 bits (598), Expect = 2e-59, Method: Composition-based stats. Identities = 64/588 (10%), Positives = 149/588 (25%), Gaps = 55/588 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + ++ + ++ N L P + Sbjct: 1 MALISQSVKNLKGG------ISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTI 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + + +VF + +++ + Sbjct: 55 GDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLNGKEYDVKGDLSYVKV----ENP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + D + + G + + + Sbjct: 111 RDDLRMVTVADYTFIVNRNMVVR--PDTTPLYTLKENGDCLINIRGGMYGRTLAFTINNT 168 Query: 178 -AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 I+ D +D + G + I ++ ++ Sbjct: 169 KIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQIN 228 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------V 283 T D+ + T + + + K +++ + Sbjct: 229 SLSTEDGYADQLMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNLKKVW 288 Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH-------V 336 WG K ++ D ++ Q+ FQ + + PS V Sbjct: 289 KEVAGWGVQKGLNGDTMPHALVRQADGSFQMQALPWAQRTCGDMDTNPTPSIVDQTINDV 348 Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM 396 F NRL F + + +S ++ D + AV+ S + + Sbjct: 349 FFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD-DPIDVAVSHNRISILKYA 403 Query: 397 HPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGR 454 PF E +L+ D + ++LS S P VG + F Sbjct: 404 VPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRAS 463 Query: 455 RIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPR 510 Q +++ + + + I VL S Sbjct: 464 YTSLNRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFSIRGSSTENFI--AVLSSNAPSRIF 521 Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 L + E +W + VL+ S G+++++++ Sbjct: 522 LYKFLYLNEEISQQSWSHWELGSNVTVLACDSI------GSTMYLVLR 563 >gi|288959382|ref|YP_003449723.1| hypothetical protein AZL_025410 [Azospirillum sp. B510] gi|288911690|dbj|BAI73179.1| hypothetical protein AZL_025410 [Azospirillum sp. B510] Length = 665 Score = 233 bits (593), Expect = 7e-59, Method: Composition-based stats. Identities = 87/579 (15%), Positives = 163/579 (28%), Gaps = 105/579 (18%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T +++F+ GE+SPR ++ R DL V + N++ + GP P + Sbjct: 1 MSRATPAQYAFTGGEISPR-IKGRTDLERIRNAVEEMTNMVAVPEGPSERRPGTRFANST 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + D + + F ++ + Y+ D Sbjct: 60 KGDASAV-LIPFEFSTQQAYIIEATAGAFRFYRDGGQI--VSGSSPYEVTHAYSAADLPF 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V HPP L ++ E P+L + S Sbjct: 117 LRWTQSADVLFLVCPGHPPRTLSRTGH---TAWNLAEWVMRDGPYLD------LNSGPTT 167 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLG-CHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + + +T+ +F D GR +RL + W + T + + A + Sbjct: 168 LTPSGTSGSVTLTASAALFAATDVGRLVRLRIANVWGWCRITAFGSVTSVTATVEAAWGG 227 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 TT + R G T +T+ + S + ++ + Sbjct: 228 TTATAFWRLGAWGATTGTWPTAVTFHENRLAFAALQTVWLSCSGDFDNFGPTTENGTVAA 287 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 + + + W SA+G +L +G+ G ++ SS Sbjct: 288 DNAITLTAADDQVNV----IRWLRSAFG---------------VLIAGTSGGPFAIQASS 328 Query: 360 FGA----FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 + + A S L+ + + + Sbjct: 329 LREALTPINATMPRVHVAGAADVQPVRVATNLVFPSR-----SRRRLHLL---NAEFAAA 380 Query: 416 ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL 475 + L++ ++ +K ++ Sbjct: 381 GYSAPDLALVASHITRH----------------------AVKAMAY-------------- 404 Query: 476 ADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK- 534 Q+EP S++W+VL+ L G + E AWH H + Sbjct: 405 --------------QQEPWSVMWLVLD-----DGTLAGVTYVPEL-DILAWHRHPLGGTA 444 Query: 535 HYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 VLS A P + LW++V AG R L Sbjct: 445 VKVLSVACIPAAD--RDELWLVVERVVAGGIRRHVEILE 481 >gi|317487276|ref|ZP_07946071.1| hypothetical protein HMPREF0179_03434 [Bilophila wadsworthia 3_1_6] gi|316921466|gb|EFV42757.1| hypothetical protein HMPREF0179_03434 [Bilophila wadsworthia 3_1_6] Length = 794 Score = 232 bits (590), Expect = 2e-58, Method: Composition-based stats. Identities = 68/599 (11%), Positives = 157/599 (26%), Gaps = 62/599 (10%) Query: 18 PRLLQSRKDLS---LHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV--FSF 72 P L+ + N L P + R P +N + Sbjct: 10 PNLISGVSQQPWNVRLPTQAEEQVNCQSSVTDFLKRRPATRHLARIRDTPAANGIASHHI 69 Query: 73 SIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVF 132 + + ++ + + + + K N+ L + Sbjct: 70 NRDETEQYIVTADASGINVFDLEGNAKTVSVTGTGAAYLAAATAPNRDLRFLTINDYTFV 129 Query: 133 VHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARI 192 +++ L + + + N + A Sbjct: 130 LNRRVAVKTLPDLSPKR------QPEAIVFIKQASYNTTYELILNGTTHAFTTEDGIAPA 183 Query: 193 TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSK 252 LD ++I ++ T+ S D + + Sbjct: 184 DEPADKLSSLDICKAIADQIPKDAFSVQTSNSTIWIRRHDGGDFTVKVQDSRSNTHTSVC 243 Query: 253 GATYVKDNNITWITVLNLSSKTSRESA--------------------SGAVAPYYVWGDI 292 + +++ + ++ +++ SG G Sbjct: 244 KGKVQRFSDLPTVAPRGFVTEIIGDASSSFDNYFCVFEPSDAGDAFGSGTWKETVKPGIP 303 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLF 345 + ++ Q+ F G + + +PS V F+ NRL F Sbjct: 304 CKLDPATLPHALIRQADGTFTFGPLEWGERICGDEDSAPFPSFVGRTLNGLFFYRNRLSF 363 Query: 346 SGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV 405 + V +S G F++F L D + A + +S +H F G+L+ Sbjct: 364 LSGEN----VVMSEVGEFFNFFLTTVTTLVDS-DVVDVAASHTKSSILHHAVTFSGGLLL 418 Query: 406 GCDTSLWLLSISLSKGL-SIDFRRVS-GSGVYACPPVSVGDCLVFVCGVGRRIKYISG-- 461 D S ++L ++ + V+ PVS G + F G Sbjct: 419 FSDQSQFVLEHDTVLSNATVSIKPVTEFEASMKAAPVSSGKTVFFATDKGEWGGVREYIT 478 Query: 462 --STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAE 519 ++IT + +L ++ VL + + L ++ Sbjct: 479 LPDNSDQNDASDITAHVPRYVRGNVSRLECSTNEDMLL--VLSEEMRTSLWLYKYFWNGS 536 Query: 520 GEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578 + AW + + + T +++++ G + ++++ +K Sbjct: 537 EKIQSAWSRWDMCGEVLSAAI--------LNTGVYLIMQYGDGV---YLEKMDITPGYK 584 >gi|194100452|ref|YP_002003825.1| gp12 [Klebsiella phage K11] gi|193201391|gb|ACF15869.1| gp12 [Klebsiella phage K11] Length = 791 Score = 230 bits (585), Expect = 6e-58, Method: Composition-based stats. Identities = 64/606 (10%), Positives = 141/606 (23%), Gaps = 52/606 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + ++ + + N L P M + Sbjct: 1 MALVSQSIKNLKGG------ISQQPEILRYPEQGTLQVNGWSSETEGLQKRPPMVFIKSL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + D VF +++ + Sbjct: 55 GPRGYLGEDPYIHLINRDEYEQYYAVFTGNDVRVFDLSGYEYQVRGDRSYISVV----NP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 +L V++ + +G D I + G + + Sbjct: 111 KDNLRMITVADYTFIVNRTRQVRENQNVTNGGTFRDNVDGIVNVRGGQYGRKLEVNINGV 170 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 + + + HP I A + Sbjct: 171 WVSHQLPPGDNAKDDPPKVDAQAIAAALADLLRVAHPTWTFNVGTGYIHCIAPAGVTLDE 230 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA-------------VA 284 T D+ + + K +++ A Sbjct: 231 FQTRDGYADQLINPVTHYVQSFSKLPLNAPDGYMVKIVGDTSKTADQYYVKYDASQKVWK 290 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-------T 337 W + ++ + F G + + PS V Sbjct: 291 ETVGWNISVGLEYHTMPWTLVRAADGNFDLGYHEWRDRRAGDDDTNPQPSFVNSTITDVF 350 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 F NRL F + + LS +++F D L AV+ S + + Sbjct: 351 FFRNRLGFISGEN----IVLSRTSKYFEFYPPSVANYTDD-DPLDVAVSHNRVSVLKYAV 405 Query: 398 PFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 F E +L+ D + ++LS + S + P +G + + Sbjct: 406 SFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQFDVSDRARPYGIGRNIYYASPRSSF 465 Query: 456 IKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 + Q ++T + + + + VL S + Sbjct: 466 TSIMRYYAVQDVSSVKNAEDMTAHVPNYIPNGVYSINGSGTENFA--CVLTKGAPSKVFI 523 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 + E +W D V++A +++++L+ + + Sbjct: 524 YKFLYMDENIRQQSWSHWDFGDGVEVMAANCI------NSTMYLLMRNAYNVWIAAVDFK 577 Query: 572 NLLDDF 577 DF Sbjct: 578 KESTDF 583 >gi|189427235|ref|YP_001949785.1| gp12 [Salmonella phage phiSG-JL2] gi|189085888|gb|ACD75703.1| gp12 [Salmonella phage phiSG-JL2] Length = 801 Score = 226 bits (575), Expect = 9e-57, Method: Composition-based stats. Identities = 65/584 (11%), Positives = 137/584 (23%), Gaps = 55/584 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ A+ + N L P M + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTL 54 Query: 61 RLDP---RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 V + + +VF + +++ + T Sbjct: 55 GAAGYVGAQPYVHLINRDEFEQYFVVFTGEDIKVFDLDGKEYQVRGDRSYVR----TANP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 + L ++ + D + + G + Sbjct: 111 REDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYGRRLSIEFNGA 170 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSI--------RLGCHPPEW-AKNTNYSIGAY 228 + ++ D S +++ +K P +W I Sbjct: 171 ERAAVQLPDGSQPAHVNEVDGQAIAEKLAVQLRNNLGNPNNEQDPNKWRFNVGPGFIHIL 230 Query: 229 IVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA------ 282 +D V+ T D+ + K +++ A Sbjct: 231 APNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVR 290 Query: 283 -------VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH 335 W + ++ S F YPS Sbjct: 291 FDLNRKVWVETIGWNTRTHLHYHTMPWALVRASDGNFDFKYLEWGARTVGDDTTNPYPSF 350 Query: 336 -------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDF 388 + F NRL F + + LS +++F D + AV+ Sbjct: 351 TGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFPASVSNYSDD-DPIDVAVSHN 405 Query: 389 SASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCL 446 ST+ + PF E +L+ D + ++L+ S P VG + Sbjct: 406 RVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARPHGVGRNV 465 Query: 447 VFVCGVGRRIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLE 502 F Q ++T + + + + +L Sbjct: 466 YFASPRASFTSINRYYAVQDVSSVKNAKDMTAHVPNYIPNGVFSISGTTAENFA--AILT 523 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 + + + E +W D V +A + Sbjct: 524 SGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINST 567 >gi|9634037|ref|NP_052111.1| tail tubular protein B [Yersinia phage phiYeO3-12] gi|6599028|emb|CAB63632.1| tail tubular protein B [Yersinia phage phiYeO3-12] Length = 801 Score = 225 bits (574), Expect = 1e-56, Method: Composition-based stats. Identities = 66/584 (11%), Positives = 139/584 (23%), Gaps = 55/584 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ A+ + N L P M + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTL 54 Query: 61 R---LDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 V + + +VF + +++ + T Sbjct: 55 GPAGYVGAQPYVHLINRDEFEQYFVVFTGEDIKVFDLDGKEYQVRGDRSYVR----TANP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 + L ++ + D + + G + Sbjct: 111 REDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYGRRLSIEFNGA 170 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSI--------RLGCHPPEW-AKNTNYSIGAY 228 + ++ D S +++ +K + P +W I Sbjct: 171 ERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHIL 230 Query: 229 IVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA------ 282 +D V+ T D+ + K +++ A Sbjct: 231 APNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVR 290 Query: 283 -------VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH 335 W + ++ S F V YPS Sbjct: 291 FDLNRKVWVETIGWNTRTHLYYHTMPWALVRASDGNFDFKVLEWGARTVGDDTTNPYPSF 350 Query: 336 -------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDF 388 + F NRL F + + LS +++F D + AV+ Sbjct: 351 TGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFPASVSNYSDD-DPIDVAVSHN 405 Query: 389 SASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCL 446 ST+ + PF E +L+ D + ++L+ S P VG + Sbjct: 406 RVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARPHGVGRNV 465 Query: 447 VFVCGVGRRIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLE 502 F Q ++T + + + + +L Sbjct: 466 YFASPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFA--AILT 523 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 + + + E +W D V +A + Sbjct: 524 SGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINST 567 >gi|17570828|ref|NP_523337.1| tail tubular protein B [Enterobacteria phage T3] gi|17384312|emb|CAC86300.1| tail tubular protein B [Enterobacteria phage T3] Length = 801 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 63/584 (10%), Positives = 138/584 (23%), Gaps = 55/584 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + + N + P M + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTL 54 Query: 61 RLDP---RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 V + + +VF + +++ + T Sbjct: 55 GTAGYVGAQPYVHLINRDEFEQYFVVFTGEDIKVFDLDGKEYQVRGDRSYVR----TANP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 + L ++ + D + + G + Sbjct: 111 REDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYGRRLSIEFNGA 170 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSI--------RLGCHPPEW-AKNTNYSIGAY 228 + ++ D S +++ +K + P +W I Sbjct: 171 ERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHIL 230 Query: 229 IVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA------ 282 +D V+ T D+ + K +++ A Sbjct: 231 APNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVR 290 Query: 283 -------VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH 335 W + ++ S F YPS Sbjct: 291 FDLNRKVWVETIGWNTRTHLHYHTMPWALVRASDGNFDFKYLEWGARTVGDDTTNPYPSF 350 Query: 336 -------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDF 388 + F NRL F + + LS +++F D + AV+ Sbjct: 351 TGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFPASVSNYSDD-DPIDVAVSHD 405 Query: 389 SASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYACPPVSVGDCL 446 ST+ + PF E +L+ D + ++L+ S P VG + Sbjct: 406 RVSTLKYAVPFSEELLLWSDQAQFVLTASDILSSRSVGLNLTTQFDVQDRARPHGVGRNV 465 Query: 447 VFVCGVGRRIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLE 502 F Q ++T + + + + + +L Sbjct: 466 YFSSPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFV--AILT 523 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 + + + E +W D V +A + Sbjct: 524 SGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINST 567 >gi|326424995|ref|YP_004286217.1| virion structural protein [Pseudomonas phage phi15] gi|325048399|emb|CBZ42012.1| virion structural protein [Pseudomonas phage phi15] Length = 793 Score = 223 bits (568), Expect = 6e-56, Method: Composition-based stats. Identities = 71/613 (11%), Positives = 147/613 (23%), Gaps = 60/613 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M ++ + + G + + D+ + A+ N L P + + Sbjct: 1 MPLSSQSIKNLKGG------ISQQPDVLRYPNQGAQQINGWSSETKGLQKRPPLVFIKRL 54 Query: 61 RLDP--RSNRVFSFSIPDG-GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + D L+F + L I + + T Sbjct: 55 AESGHFGTKPLVHLINRDAFEQYQLIFHNGALTIFDLAGNNYPVSGSLSYIA----TANP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 + L +++ + + + G + Sbjct: 111 REDLRLLTVADYTFILNRTKTVEMSSELTHTGYPALNSRALVSCRGGQYGRTLRIRANGV 170 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA---------- 227 S D T K +D ++ T+ A Sbjct: 171 ELASYELPDGLAENNTELSKEVAAMDAQAIVKELVKRVNAGTATHGFSAAEGPSHLVIYG 230 Query: 228 -----YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE----- 277 + + Y + + S Sbjct: 231 NGQPINNIYTEDGYADQLISGLIYQVQTTTKLPITAPAGYLVEITGEASRSGDNYWVRYD 290 Query: 278 SASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPS--- 334 A+ G I ++ ++ Q+ F G + + E PS Sbjct: 291 GAAKVWKETVKPGIISGINPGTMPHALIRQADGTFSFGPLTWAKRTAGDDETNPMPSLVD 350 Query: 335 ----HVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSA 390 V F NRL F + + +S ++ D + AV+ Sbjct: 351 NKLNDVFFFRNRLGFLSGEN----IIMSKTAKYFQLFPSSVAASADD-DPIDVAVSHSRI 405 Query: 391 STIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVF 448 S + + PF E +L+ D + + L+ S + + V P +G + F Sbjct: 406 SILKYAVPFSEQLLLWSDQAQFTLTSSGVLSAKTAQLDLTTEFDVLDAARPYGLGRGVYF 465 Query: 449 VCGVGRRIKYISG----STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPK 504 R +++ ++ + + + VL Sbjct: 466 AAPRARFCSIKRYYAVADVSNVKNAEDVSGHVPTYIPNKVHNVNGSGTENFV--SVLTDG 523 Query: 505 DNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEE 564 D S + + E +W K +LS S G+ + ++ + G Sbjct: 524 DPSKVFIYKFLYQDENLAQQSWSHWTF-GKCKILSMFSI------GSYTYTIMDRAEGVV 576 Query: 565 RSFTVRLNLLDDF 577 N DF Sbjct: 577 LERLEFTNDTVDF 589 >gi|29366731|ref|NP_813776.1| tail tubular protein B [Pseudomonas phage gh-1] gi|29243590|gb|AAO73169.1|AF493143_30 tail tubular protein B [Pseudomonas phage gh-1] Length = 808 Score = 221 bits (562), Expect = 3e-55, Method: Composition-based stats. Identities = 60/620 (9%), Positives = 141/620 (22%), Gaps = 71/620 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + A N L P + Sbjct: 1 MGLVSQSVKNLKGG------ISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRL 54 Query: 61 RLDP--RSNRVFS-FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + + + + F L + ++ + G Sbjct: 55 QNKGFLGTKPLVHLINRDAQEQYFVGFSGTGLAVWDLKGNNYTVRGYNGYA----NCANP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V+++ + I + G + + + Sbjct: 111 RTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIINVRGGQYGRTLSITINGD 170 Query: 178 AKLSISQAD-----------------TSTARITSDMKIFKPLDKGRSIRLGCHPPEW-AK 219 S QA ++ + + R + + W + Sbjct: 171 GTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSLGGSGWSFQ 230 Query: 220 NTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESA 279 I A+D V + T D + + + ESA Sbjct: 231 AGTGWILINAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPPGYLVEITGESA 290 Query: 280 SGA-------------VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 I + ++ + F + Sbjct: 291 RSGDNYWVQYDASGKVWKETAKPKIIAGFNNATLPHALVRAADGQFDWTPLTWDGRNAGD 350 Query: 327 GEQEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK 379 + PS V F NRL F + V +S +++F D Sbjct: 351 DDTNPMPSFVGATINDVFFFRNRLGFLSGEN----VVMSRTSKYFNFFPSSVATLSDD-D 405 Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID--FRRVSGSGVYAC 437 + A++ S + + PF E +L+ D + ++LS Sbjct: 406 PIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSSKTILSSKTIELDLTTEFDVSDGA 465 Query: 438 PPVSVGDCLVFVCGVGRRIKYISG----STEQGFRFNEITQLADHLFNQRILQLVYQEEP 493 P +G + F +++ + + Sbjct: 466 RPYGIGRGVYFAAPRASFTSLKRYYAIQDVSDVKSAEDVSAHVPSYITNTVHAIHGSGTE 525 Query: 494 HSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSL 553 + + +L + + + E ++ D + + G+ Sbjct: 526 NFV--SILSDGSPNKVFIYKFLYLDEILQQQSFSHWEFGDA----ATTRVLAASCIGSYC 579 Query: 554 WMLVALSAGEERSFTVRLNL 573 ++++ G R+ Sbjct: 580 YLMIDRPEG---LCLERMEF 596 >gi|42526655|ref|NP_971753.1| hypothetical protein TDE1145 [Treponema denticola ATCC 35405] gi|41816848|gb|AAS11634.1| hypothetical protein TDE_1145 [Treponema denticola ATCC 35405] Length = 647 Score = 220 bits (559), Expect = 7e-55, Method: Composition-based stats. Identities = 72/572 (12%), Positives = 166/572 (29%), Gaps = 86/572 (15%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 +F+ GE+S L R DL ++ V++ N ++ G + + + R Sbjct: 4 TNFAGGEVSKNL-YGRIDLPIYQNSVSRLENFDIMQTGGIKRRGGTERIGKLK---GYAR 59 Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTP----YTFKDNKSLEYA 124 + F + + + G + ++I S + TP Y D ++YA Sbjct: 60 LIPFIVNNTLSFIFEIGSEYIRIWK-NGSLLTLAGFPVEFSPTPDLPLYQKSDLSEIQYA 118 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 + H+ + P+ + + + Sbjct: 119 QTYDSLYLAHRHYKPYVIKWQGGDAFT-------------------FGSLNITGNAHKLP 159 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGC--HPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 S + L +GR P + + + + D V ++ Sbjct: 160 FQGSD-----NYPSCVALFQGRLFFASTIREPQKIWASKVFEYENFTYFDTVVSKTTQLK 214 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 R +K VKD+++ + + Sbjct: 215 NPDLRVFSAK---AVKDSDVLTELTKDFTD------------------ITNITDYYVSGH 253 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 P+ + + A ++E + N + S Sbjct: 254 KGIPKDTKVLSVTSDSMKISKPATVDKEDIVLSIHLWRN----ADSP---------QADD 300 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL 422 + D + P A + I W+ P + +++G ++S W++S Sbjct: 301 YKD--TEIINNVTAPDHAFYFEIGSDKNDKIKWITPSKD-LIIGTESSEWVMS-DGVTAQ 356 Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADH-LF 480 I+ + S GV +G ++++ GR ++ + ++ ++ ++TQ A H L Sbjct: 357 RIEVQLQSRYGVADLQGSLIGRSVIYIGQGGRSLRDYAYDFQEHTYKSIDLTQAASHLLI 416 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + Y P +++ LE + G AW ++ + + Sbjct: 417 ESKAVDFDYTNSPVQKIYLSLEDGS------ACVLLYDKNTGIAAWTKIVL-GNGKIKNI 469 Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 + P ++ V + ++ Sbjct: 470 VTVPGLKG-FDDVYFEVERKG---IFYLEKIT 497 >gi|194473836|ref|YP_002048660.1| tail tubular protein B [Morganella phage MmP1] gi|194307057|gb|ACF42039.1| tail tubular protein B [Morganella phage MmP1] Length = 819 Score = 218 bits (555), Expect = 2e-54, Method: Composition-based stats. Identities = 70/597 (11%), Positives = 143/597 (23%), Gaps = 71/597 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + A N L P + + Sbjct: 1 MALVSQSTKNLKGG------ISQQPDILRYPDQGAAQVNGWSSETEGLQKRPPLVFVKQL 54 Query: 61 RLDP--RSNRVFSFS-IPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 S+ + + + L+ F +++ + K P Sbjct: 55 GGKNYLGSDPLVHYINRSEDEKYLVAFSGTGVKVFDMEGKEYTVHNNNAAYLKAP---NP 111 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS- 176 + L V+++ + G + D + + G + + Sbjct: 112 KQDLRMVTVADYTFIVNRNITVKNRSEKSTGGTFNPKSDCLIAVRGSQYGRTIKVTINGV 171 Query: 177 ------------NAKLSISQAD----------TSTARITSDMKIFKPLDKGRSIRLGCHP 214 + D T+ + G + P Sbjct: 172 DRVNFTLHDGAEAWQGRTISTDKVIRYIVDQLTTGKTTEGQGSLPGLGHYGVFEYVTTTP 231 Query: 215 ---PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 K + + A ++ TT D+ Y + N Sbjct: 232 LPSGWTVKGMDGFVYIKAPAGQQIDTITTTDGYSDQLVYPVTHYVQTTAKLPLNAPDNYY 291 Query: 272 SKTSRESASGA-------------VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSV 318 K E+ A W I KD ++ +S F+ Sbjct: 292 IKVVGEAEGTADQYYLKFDKDARVWREAIGWNAILGFQKDTMPHALIRRSDGNFEVKALD 351 Query: 319 VSWFMSAWGEQEG-------YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGE 371 S + + S V F NRL F + + +S G ++ Sbjct: 352 WSDKEAGDDDTNPDVSLVDRTISDVFFFRNRLGFVSGEN----IVMSRTGRYFKLYPASV 407 Query: 372 YGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRV 429 D + AV+ + + PF E +L+ + + ++L+ S Sbjct: 408 AAISDD-DPIDVAVSYNRVVDLQFAVPFTEELLLWANGAQFILTAQGILSPKTVELNLST 466 Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQG----FRFNEITQLADHLFNQRIL 485 S PV +G + + T Q + +T + + Sbjct: 467 QFSVHTGARPVGIGRNVYYASPRATFTSINRYFTVQDVSGVKDSDNMTAHVPNYIPNGVF 526 Query: 486 QLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAAS 542 L + V+ S L F +W + V + Sbjct: 527 SLGGSSTEN--YLSVITTGAPSRVYLFKFLFDNGEAIQQSWSHWDFGENITVRAFTV 581 >gi|326536942|ref|YP_004306349.1| tail tubular protein B [Pseudomonas phage phiIBB-PF7A] gi|318054518|gb|ADV35694.1| tail tubular protein B [Pseudomonas phage phiIBB-PF7A] Length = 807 Score = 217 bits (551), Expect = 6e-54, Method: Composition-based stats. Identities = 69/614 (11%), Positives = 143/614 (23%), Gaps = 67/614 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ A+ N L P + Sbjct: 1 MGLVSQSVKNLKGG------ISQQPDILRFPNQGAQQINGWSSETQGLQKRPPTTFVKRL 54 Query: 61 RLDP--RSNRVFSFS-IPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + + +VF + I ++ + G Sbjct: 55 GAPGAWGAKPLVHLVNRDASEQYYMVFTGSGVAISDLKGNLYQVRGYDGYA----NCPDP 110 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L V++ P + + G + V + Sbjct: 111 RGDLRLITVADYTFVVNRRTPVQMGSELTHAGYRKLNTRALVPCRGGQYGRTITVEVLID 170 Query: 178 AK-------LSISQADTSTARITSDMKIFKP----LDKGRSIRLGCHPPEWAKN-TNYSI 225 S T+ + + + + + P + + Sbjct: 171 VTWVKLAELALPSGVGTNQDEVAKMVAKVDAQNMIKELVTQVNVNGAPWKITAGEYPGCM 230 Query: 226 GAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA--- 282 + + T D+ N + + + E+ Sbjct: 231 LLHRDDGGEFNGIRTKDGYADQLINGFIYQVQSFNKLPAQAPEGYLVEITGEATRSGDNY 290 Query: 283 ----------VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGY 332 G I +++ + + F V + E Sbjct: 291 WVRYDGAGRVWKETVKPGIIAGLNRATMPRGLVRAADGQFDWKVLDWNNRGCGDDETNPL 350 Query: 333 PSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAV 385 PS V F NRL F + V +S +++F D L AV Sbjct: 351 PSFVGGTINDVFFFRNRLGFLSGEN----VIMSRSSRYFNFFPPSVAALSDD-DPLDIAV 405 Query: 386 TDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVG 443 + S + + PF E +L+ D + ++LS S P +G Sbjct: 406 SHNRISILKYAVPFSEQLLLWSDQAQFVLSSQGILSPKTVELNLTTEFDVQDTARPFGIG 465 Query: 444 DCLVFVCGVGRRIKYISG----STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWV 499 + F +++ R+ + + + Sbjct: 466 RGVYFSAPRAAYTSLKRYYAVQDVSDVKNAEDVSAHVPSYIENRVFNIHGSGTENYV--T 523 Query: 500 VLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL 559 +L L + AE +W +L AAS G+ +++L+ Sbjct: 524 LLSDGAPGIVYLYKFLYMAEDIAQQSWSHWEFGQNVNILGAASI------GSYMYLLMDR 577 Query: 560 SAGEERSFTVRLNL 573 G R+ Sbjct: 578 PEGIV---LERMEF 588 >gi|259419134|ref|ZP_05743051.1| hypothetical protein SCH4B_4402 [Silicibacter sp. TrichCH4B] gi|259345356|gb|EEW57210.1| hypothetical protein SCH4B_4402 [Silicibacter sp. TrichCH4B] Length = 715 Score = 208 bits (528), Expect = 2e-51, Method: Composition-based stats. Identities = 89/588 (15%), Positives = 175/588 (29%), Gaps = 50/588 (8%) Query: 1 MVN--TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYR 58 M T + FS G++ P Q R D+ L A+ V + N + L G + M+ Sbjct: 1 MARRKETIWQKDFSLGQVRPEA-QERDDIDLVARSVKEGLNCVVLSTGQMEGRSGMRFLN 59 Query: 59 DCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN 118 + +G L F L + ++ +++ + + Sbjct: 60 ATASSQGREV----DLGEGRVFDLHFVPSGLILYDSNNTVEYTGNITWTAAPKKWGIYTF 115 Query: 119 KSLEYAVFGS----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 + + V + + + P L+ + S++F E+ F Sbjct: 116 DEISFWVVADPDSSSILIGSQHFPIQALI---LNEDGSWSFGEMAFATGLAGAIHQSYWR 172 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 + A T +T+ I+ +G +IR + ++ V ++ Sbjct: 173 YNETVSIQPSARTGAITVTASEAIWTADHEGMAIRYQNREIILGTLVSSTVINAAVTEEL 232 Query: 235 --VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI 292 Y + S + G + + + I ++ + + G + Sbjct: 233 PPTYDITVSSVSNYQVGEAVEHSVLGGQGIITGIAGSVITVMATSRYDGFDT-------V 285 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 + + + + V W M GY + H +R+ G Sbjct: 286 ASPKLVAPNAAQPISAVAAAATPAATVIWEMQMQSPVHGYAGYAVRHLSRVFLCDFPGAP 345 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 + S GA DF + + V S T+ +M + + + Sbjct: 346 QAFAASVVGAINDFKM-----GSEDADGFVDTVGADSGGTLRFMASVEDLLFLTSKGIYS 400 Query: 413 LLSISLSKGLSIDFRRV--SGSGVYACPPVSVGDCLVFVCGVGRRIKY--ISGSTEQGFR 468 + S R V S G + P++V D VFV VG+RI ++G +R Sbjct: 401 HQTRDGSAITPATIRPVRFSRVGCASVEPIAVDDGCVFVDAVGQRIYAATLAGDIYTKWR 460 Query: 469 FNEITQLADHLFNQRIL---QLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525 +T L L + E S V+VV NS + ++ E + Sbjct: 461 AEPMTSLHPQLIKDAVYLGATSSGSENAESFVYVV-----NSDGSVALGQWDRSNE-IIS 514 Query: 526 WHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEE-RSFTVRLN 572 W + + +V + E + R + Sbjct: 515 WLPWETDGNFLSIYQCFGVS--------HAVVDRTVNETSVRYRERFD 554 >gi|315518952|dbj|BAJ51829.1| putative tail tubular protein B [Ralstonia phage RSB2] Length = 788 Score = 206 bits (524), Expect = 7e-51, Method: Composition-based stats. Identities = 83/608 (13%), Positives = 143/608 (23%), Gaps = 54/608 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T T + G + + D+ N L P + Sbjct: 1 MPLITQTIKNLKGG------ISQQPDILRFPDQGQAQINGFSSEVEGLQKRPPSVHIKKL 54 Query: 61 -RLDPRSNRVFSFSIPDGGYALLVFGDKK-LQIVVVRSSTKWSPALFGKTYKTPYTFKDN 118 V + F L ++ + K A G Y T Sbjct: 55 DTKHNGKPFVKLINRDQFERYYASFHPGGSLTVIDLDGVQKTVNAPQGFGYIN--TANPR 112 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNA 178 L ++K +Q + + Sbjct: 113 TDLRMVTVADFTFVINKAVAVTMNG-VQSFPGYRTNGRALVNVKGGQYSRTYSIEFNGGV 171 Query: 179 KLSISQADTSTARITSDMKIFKPLDK------------GRSIRLGCHPPEWAKNTNYSIG 226 + S + + S + + + G I +G + + S+ Sbjct: 172 QASYTTPNGSDPSHAAQIDTQYIAQQLGNALVAALGPSGWGIDVGPNYIFIEAPSASSVF 231 Query: 227 AYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNI----TWITVLNLSSKTSRESASGA 282 + D L G + ++ +D I + A G Sbjct: 232 NLKIRDG-FNNGLMAGCIFEVQRFNMLPAQARDGYIVKVLGDPGSGADDYYARFDLARGV 290 Query: 283 VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV------ 336 G + +K ++ ++ F S + PS V Sbjct: 291 WVECQAPGTVGQFTKATMPHALVREANGTFTFREVDWQERPSGDADTSPEPSFVGQKIND 350 Query: 337 -TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395 F NRL + V LS+ G F+ F T + AV+ ST+H Sbjct: 351 IFFFRNRLGILAGEN----VILSASGEFFKFWPKSVV-TAADTDPIDVAVSHNRVSTLHH 405 Query: 396 MHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVG 453 F E +L+ D + ++L S PV+ G + F Sbjct: 406 AVSFAEELLLWSDQTQFILKSDGILSTKTVKVDTATEFESAIDARPVAAGRGVYFAAPRA 465 Query: 454 RRIKYISGSTEQG----FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFP 509 Q +I+ + L + V VL S Sbjct: 466 SFTSVRRYYAVQDTSAVKNAEDISAHVPSYIPNGVFFLGSSTTEN--VVTVLTEGAESRL 523 Query: 510 RLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTV 569 L + E AW VL+ +++LV +G Sbjct: 524 YLYKYLYLQEQLVQQAWSHWEFGPGSRVLACDLIGAI------MYILVDAPSGTFLESVE 577 Query: 570 RLNLLDDF 577 DF Sbjct: 578 FTQNTKDF 585 >gi|167041089|gb|ABZ05850.1| hypothetical protein ALOHA_HF400048F7ctg1g17 [uncultured marine microorganism HF4000_48F7] Length = 999 Score = 206 bits (523), Expect = 1e-50, Method: Composition-based stats. Identities = 81/543 (14%), Positives = 154/543 (28%), Gaps = 93/543 (17%) Query: 100 WSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTF---- 155 T YT + + VH DH P L Sbjct: 164 NVNLSQRFEVTTTYTASQVNDIAFTQSADVLFLVHPDHVPARLERNATNSWALTNLLPSL 223 Query: 156 -DEIKFLPPPWLGDGMISGVKSNAKLSISQ-ADTSTARITSDMKIFKPLDKGR------- 206 P L DG + + A S + + G Sbjct: 224 ISGTYTRPTTVLTDGPFKAMNTTDTTLTVALAANSDFTTSFSNGSLSLEEVGTVSPSNVD 283 Query: 207 ---------------------------------------SIRLGCHPPEWAKNTNYSIGA 227 + + T+ Sbjct: 284 VATNAFTLANHPLVNGQTVQFSSIPSGFASTPTLSATTDYFVVSATQNTFKLATSAGGTP 343 Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287 + LT +S T I T + + +AP Sbjct: 344 VDITAAPTSADLTVNKSFVDKDVYIKVTASATTGINDDTGFQTTDVGRYIRLNTEIAPQI 403 Query: 288 VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSG 347 G + V + ++ + Q +T + W + ++ GYP V + RL+F+G Sbjct: 404 KHGYGEIVERTSTTVVLV-QLKTAIAGVGATTEWQLGSFSGTTGYPRTVQLYQQRLVFAG 462 Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDP----------------TKALTTAVTDFSAS 391 + + +++ S F++FS G A++ ++ + Sbjct: 463 TAEESQTIFFSKTADFFNFSATEPLGQQTGQRDSSGRSIVGEQIFEDAAISLTISSDTVD 522 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGL----SIDFRRVSGSGVYACP-PVSVGDCL 446 I W + + +G ++ L S + +VS P VG+ L Sbjct: 523 QIEW-ISEDQRLTIGTSGGIYQLYGSTDDLTLTPFNFSITKVSAWACDPTALPAKVGNNL 581 Query: 447 VFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505 ++V GR+++ ++ + + ++T ++ + ++ YQ++P+S++W + Sbjct: 582 LYVQNNGRKLRELAFDKVQDQYSAADLTLRSEDISESGLIATAYQDQPYSVLWCLRN--- 638 Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKH---------YVLSAASFPNDNRGGTSLWML 556 RL G + + AWH H I H V S AS P L+M+ Sbjct: 639 --DGRLAGLTYVDLLQ-MRAWHRHTIGGAHYDDTHGSQAKVESIASIPR--GTHDQLYMI 693 Query: 557 VAL 559 V Sbjct: 694 VKR 696 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 43/332 (12%), Positives = 85/332 (25%), Gaps = 15/332 (4%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 + SF+ G++SPR +Q +L + +A N++ L G L P + Sbjct: 2 RIQALQSSFADGQISPR-MQGMVELESYKSSLATLENMVVLPQGSLTRRPGTFFAATTKA 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 R+ FS G +L FG+ ++ + + T + Sbjct: 61 -NGQARLIPFSRGQGTSLVLEFGNLYIRFFANDGPVRTDDIAATYSQTTTTVTVTKSTHG 119 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y+ + + + N Sbjct: 120 YSASDEVYLDFTSG--------NGVDGFYTIATVADANTFTVTSTTSQTTSGNVNLSQRF 171 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 T TA +D+ + D + P +N S + + + T Sbjct: 172 EVTTTYTASQVNDIAFTQSADVLFLVHPDHVPARLERNATNSWALTNLLPSLISGTYTRP 231 Query: 243 RSGDRFG-----YSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSK 297 + G + T + S+ + G V+P V + Sbjct: 232 TTVLTDGPFKAMNTTDTTLTVALAANSDFTTSFSNGSLSLEEVGTVSPSNVDVATNAFTL 291 Query: 298 DGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQ 329 + Q + +SA + Sbjct: 292 ANHPLVNGQTVQFSSIPSGFASTPTLSATTDY 323 >gi|290968641|ref|ZP_06560179.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781294|gb|EFD93884.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 1039 Score = 206 bits (523), Expect = 1e-50, Method: Composition-based stats. Identities = 54/295 (18%), Positives = 115/295 (38%), Gaps = 17/295 (5%) Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344 P+ G I+ + + V V ++ S+W ++ GYP F +RL+ Sbjct: 540 PFENEGIIEITDIVSPKEIKYTAIEPVIPN-VPVDAFAFSSWNDRNGYPKLSCFFQDRLV 598 Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL 404 F+G+K + S++ S G + +FS++ G A+ + + I + P + ++ Sbjct: 599 FAGTKKEPYSLWFSRTGDYNNFSVEKAEGTVTEDSAIKLDLIVRNLYEIRHLVPSND-LI 657 Query: 405 VGCDTSLWLLSISL-SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST 463 V + W++S + + G C P +G+ L++V G I+ S Sbjct: 658 VLTSGNEWIISGDTAITPTKCTPKVQTMRGASNCKPWHIGNRLIYVQRDGGTIRDFGYSY 717 Query: 464 E-QGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGE 521 + + +E+ A HL +++ Y + P+S ++ V E ++ E + Sbjct: 718 DSDNYNGDELNLFASHLTKRHQMVSSAYCQNPYSTLYFVRE-----DGEIICLMLIKE-Q 771 Query: 522 GDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA--GEERSFTVRLNLL 574 AW K+ + G L+++V + + + + +L Sbjct: 772 NVCAWTHWNTHGKYLDCCSV----LENGKDYLYVIVERTNREAQIVRYLEKFDLS 822 Score = 146 bits (369), Expect = 8e-33, Method: Composition-based stats. Identities = 45/324 (13%), Positives = 94/324 (29%), Gaps = 17/324 (5%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N T++SF+ GE+SP + R DL + + ++ N + YG + + Sbjct: 1 MQNVFITQNSFTTGEISPE-VAERTDLEKYKSALLQAENAVVSPYGSVSRRTGSKYIGAI 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + + F LL G K +++ + + TP+ + K Sbjct: 60 KYADKEAVLVPFMDSSDRSYLLEVGYKYIRVWKDETMEQ--------EIDTPF--EYPKE 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGM-ISGVKSNAK 179 L + G TA +P + LL+ + + F + F + + + Sbjct: 110 LNFTQSGDTAFICSGRYPVYELLHGRYWELRKFDIPKPYFDDIISAIENVSDVNYTESDT 169 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 SQ T + + + G + +Y+ Y Sbjct: 170 PVFSQTKAGDYTFTPTVSGLY-----KIVLFGGAGGKKGTIEHYAGSTKHDEAIYHYEYG 224 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 G G + + +I K + A G + + Sbjct: 225 VAGNEGQKKIVTVKLKAKTTYSIHVGKGGEDGDKHKKGIARGWEEGDVYNSFLNGGPGED 284 Query: 300 RSISVAPQSQTLFQAGVSVVSWFM 323 ++ + G + + Sbjct: 285 TTVKGNSDGVNIVAKGGATFTGSK 308 >gi|313892508|ref|ZP_07826097.1| tail tubular protein B family protein [Dialister microaerophilus UPII 345-E] gi|313119087|gb|EFR42290.1| tail tubular protein B family protein [Dialister microaerophilus UPII 345-E] Length = 807 Score = 205 bits (520), Expect = 2e-50, Method: Composition-based stats. Identities = 76/612 (12%), Positives = 163/612 (26%), Gaps = 67/612 (10%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC-- 60 T T S +G + + D+ + + + N L P +D Sbjct: 2 RITQTIKSIVSG------ISQQPDILRFPEQLEEQTNGFSTESSGLQKRPPTLFIKDLGV 55 Query: 61 ---RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 ++ + + +++F + + + ++ K+ + T Sbjct: 56 HTTTTQAKNYACHTVDRDEEEKYIMLFTGEDILVYDLKGKQYKVTYEDEKSKQYITTENP 115 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 + L+ V+ + + + + G + Sbjct: 116 REELKMVTIADHTFVVNTEVVVKMSEDKVPWKWS--DHEALIHIQKGNYGREYSIKINGK 173 Query: 178 AKLSISQADTSTARITSDMKIFKPLDK-GRSIRLG---------CHPPEWAKNTNYSIGA 227 + D A D G +I+ + + T Y+ Sbjct: 174 KVAKYTTPDGGEASDIKYTDTNYIRDILGNAIQTEEVLYTDGKYHNQSSGWQVTYYNSAF 233 Query: 228 YIVADDKVYRSL-TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGA---- 282 I D S + ++ K N++ + K + +G Sbjct: 234 KIYHPDYYINSFEVSDGFNGEAMHAIKHAVQKFNHLPADAPDGYTVKVIGDKHTGTDDYY 293 Query: 283 ---------VAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYP 333 K + + QS F+ + + E P Sbjct: 294 VTFDGKEHVWKECAKPNISKGFDAETMPHILVRQSDGTFKLKKANWDERKAGDEESNEPP 353 Query: 334 SHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 S V NRL F + + LS +F++F L T + AV+ Sbjct: 354 SFVDNTINDIFLFRNRLGFLSGEN----IILSRSASFFNFWLASAV-ELQDTDTIDLAVS 408 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGD 444 + S S + F E +L+ + + ++++ + + + S P+ G Sbjct: 409 NNSVSILEHAVLFNEELLLFSNNAQFIMTSEGILTPQKASVYFATSFPSATEVVPIKAGR 468 Query: 445 CLVFVCGVGRRIKYISG----STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500 + F T +IT L I +L ++ SI+ +V Sbjct: 469 RVYFPVKRALYSGIREYYTLEDTRGSKDAQDITAHVPSLIPNGIHKL-WECTNESII-LV 526 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS 560 + FSA +W A G++ +ML Sbjct: 527 ASNATPDSLYVYKYLFSAGTRLQASWSKWHFKG-------AEIIGGGFFGSTFYMLSRR- 578 Query: 561 AGEER-SFTVRL 571 G+++ ++ Sbjct: 579 -GKDKHIVLEKM 589 >gi|144898783|emb|CAM75647.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 635 Score = 204 bits (518), Expect = 4e-50, Method: Composition-based stats. Identities = 77/572 (13%), Positives = 142/572 (24%), Gaps = 118/572 (20%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 N T K +F+AGELS +L R DL+ + G + RN+ Sbjct: 5 NITLAKTNFTAGELSLDML-GRGDLAAYGNGAKRLRNV---------------FIAPIGG 48 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 R + + I + +TY T L Sbjct: 49 VSRRPGLRH-----------------VDIARGKGRLIAFEFNTEQTYLLVLT-----DLH 86 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 ++ H D P +T +++ + Sbjct: 87 LDIYADGVAVAHVDTP--------------WTEAQLQQIN-------------------- 112 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 T+D + + W + A V ++ Sbjct: 113 -------WTQTADTLLIVHPEVAPRKLTRTAHSAWTISNWMFHEADGVLFQPYHKFAADE 165 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 + S T + + + Sbjct: 166 VTLQPSATSGSITLTA-------SAAFFVAGHVGTRLRLQQKEVEITAIASATQASATVK 218 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 + W A G+P V FH +RL+ GS+ ++LS Sbjct: 219 Q-------NLVNTSAHKDWEEQALSAVRGWPVSVCFHQDRLVIGGSRDQPNRLWLSKSSD 271 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL 422 ++F G +A+ A+ + I + + V + W++S Sbjct: 272 LFNFD----LGEALDDEAIEFALLSDQVNAIRHVFSGRH-LQVFTSGAEWMVSGQPLTPS 326 Query: 423 SIDFRRVSGSGVYACPPV---SVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADH 478 SI R + G V V +FV G+ ++ EQ ++ ++ LA H Sbjct: 327 SIQLTRQTRVGSPIDRTVPPRDVDGATLFVSRNGKDLREFLFADVEQAYQSGDLAMLAKH 386 Query: 479 LFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 + + Q L L E AW H+ + + + Sbjct: 387 VMLAPVDQ-------DYDAGRRLFHVVMGDGGLATVTVYR-SEKVTAWTGHVTAGRFLAV 438 Query: 539 SAASFPNDNRGGTSLWMLVALSAGEERSFTVR 570 + +++LV Sbjct: 439 AVVEG--------EVYVLVEREGIVSVECFDE 462 >gi|291336965|gb|ADD96491.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C1587] Length = 474 Score = 203 bits (515), Expect = 8e-50, Method: Composition-based stats. Identities = 62/549 (11%), Positives = 138/549 (25%), Gaps = 87/549 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + +F+ GE+ P LL++R D++ + + ++RN+I G + P +Q + Sbjct: 1 MSRAVSIQSNFTTGEVDP-LLRARIDINQYYNALEQARNVIVQPQGGIERRPGLQFIFEV 59 Query: 61 ---RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 ++ F +L+F ++ I + + T T Sbjct: 60 PSAANPQNGMKLVPFEFSTTQSYMLLFVHNRMYIFKDKELVTNINSSGNDYLTTTITSTV 119 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 ++++ T + V +D P ++ ++T +I F P + Sbjct: 120 LATMDHTQSADTLIVVQEDMAPKKIVRGAA--HNTWTISDISFEFIPK--FNFTQSETTI 175 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 + A IT+ +F + + I D Sbjct: 176 NQTITPSAVDGNITITAGGNVFASGNLNQYIEAN--------------------DGMGRA 215 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSK 297 +T S + I S S Sbjct: 216 RITRFVSATSVEAIVEIPFFNTTAIASGGTFIDGGYEDSWSGSKGYPRT----------- 264 Query: 298 DGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 + +G + P+ + + Sbjct: 265 -------------------ATFHEGRLYFGGVKSRPNTIF-----------ASRVARFFD 294 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 + G D D T A+T + + +L + Sbjct: 295 FNPGEALDDDSIELTISTDSTNAITGMFSGRDLQ------------IFTKGGEFFLPQST 342 Query: 418 LSKGLSIDFR---RVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG-STEQGFRFNEIT 473 L + PV +F+ G+ ++ E + N I+ Sbjct: 343 LDPITPTNVVVNGATRRGSQEGIKPVGAESGTLFIQRAGKSLREFLFSDVELSYISNNIS 402 Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533 L+ + + ++ + +L +++ L G+ A Sbjct: 403 LLSS-HLLKSPSDMALRKATSTTDGDLLLLTNSTDGSLATYSILR-GQNVIAPSLSTTDG 460 Query: 534 KHYVLSAAS 542 + + Sbjct: 461 EFINVGVDV 469 >gi|307946248|ref|ZP_07661583.1| hypothetical protein TRICHSKD4_4953 [Roseibium sp. TrichSKD4] gi|307769912|gb|EFO29138.1| hypothetical protein TRICHSKD4_4953 [Roseibium sp. TrichSKD4] Length = 681 Score = 201 bits (510), Expect = 3e-49, Method: Composition-based stats. Identities = 92/573 (16%), Positives = 158/573 (27%), Gaps = 73/573 (12%) Query: 2 VNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCR 61 + +F+AGEL P LL R L + G N++ + G + + Sbjct: 3 ARPGRLQSAFTAGELDP-LLHERSQLKYFSTGADHMENVVSIPQGGFGLRGGLLDIGAV- 60 Query: 62 LDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSL 121 DP ++R+F F DG LVF K++ + + L Sbjct: 61 -DPAASRLFDFKASDGSAYDLVFAPGKMEAWGNSGKLQDLAIPA-------LSETMLPGL 112 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 A T + +H D P I+ +++ D + P G + Sbjct: 113 NDAQQRDTMILLHADLQPQ---RIKHAGPQAWSADAVPLTGLPSYDYG----------AT 159 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTT 241 S + R+ F LD L ++ SIG Sbjct: 160 YSNGVAAVWRL-----EFVGLDANSIFTLT-----ISQEETVSIGYTTAM---------- 199 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 G R + I+ + + + A + V G++ + + Sbjct: 200 GTLASRVRTAVQDLPNVAPGISVASAGGSKIAVTFSGENNAGDGWAVSGNVINKADAAIL 259 Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFG 361 + G+P F+N RLL G KG + S G Sbjct: 260 AAKTTVGVAP----------GEPVISSVRGWPRCGAFYNQRLLLGGFKGLPNAWMFSLQG 309 Query: 362 AFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKG 421 +++F D + + + V + + P + W+ LS+ Sbjct: 310 DYFNF--DERFSAANGPALIPMDVDGGEV--VEQIVPSRNLAIFTNGAEYWIAERGLSRT 365 Query: 422 LSIDFRRVSGSGVYACPPVSVG-DCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADHL 479 + + GV P+ L FV G I E F +I+ L HL Sbjct: 366 EPPNHVQAGERGVKNGVPIVANEGALNFVSSTGSVIGEFRYTDVEGNFVSRDISLLGSHL 425 Query: 480 FNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS-DKHYVL 538 + + S L + + A+ + Sbjct: 426 IID-VKDQAMRRAEKSTSGN-LNGIVLEDGQ-ARLATLLREQDVTAFSRMTSDSGHFKAV 482 Query: 539 SAASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 S G + +V AG RL Sbjct: 483 SV-------NGRNEMSWIVDRPAG---RRLERL 505 >gi|291335597|gb|ADD95206.1| tail tubular protein B [uncultured phage MedDCM-OCT-S04-C650] Length = 845 Score = 200 bits (509), Expect = 4e-49, Method: Composition-based stats. Identities = 68/624 (10%), Positives = 144/624 (23%), Gaps = 88/624 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T T +F G + + D V + N P L+ P M+ Sbjct: 1 MPAITQTIPNFLGG------VSRQNDDKKLINQVTECVNGYPDPTYGLLKRPGMEHVNVL 54 Query: 61 RLDPR---------SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT 111 + F + G + + + T + G Y Sbjct: 55 KKADGTAFSKTELADAAWFFIDRDNAGSYIGAIKGTNIYVWTKEDGTFCTVNNTGTAY-- 112 Query: 112 PYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDG---------DKISFTFDEIKFLP 162 T + V +K + ++ D I + Sbjct: 113 -LTGTQQSDYHFRSVQDVTVITNKTVTTAMQATPAAAVKSVGTLKLNSVTDGLDYIVTIQ 171 Query: 163 PPWLGDGMISGVKSNAKLSISQADTST---------ARITSDMKIFKPL----------- 202 S + L +D +T A I + Sbjct: 172 GIATSISAQSHTTFDDMLVYDSSDVNTNHHLVDAIKATIEAQHSASNADFDGVWSLEAYT 231 Query: 203 -------DKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSG----DRFGYS 251 + G + + + T ++I A + ++ Sbjct: 232 NSLVIKRNAGTNAVVTDYTAPTGAATAFTIEAKGGLGNAGIEVFQDSVGSSAELSVESFN 291 Query: 252 KGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTL 311 V++ N + G + + T Sbjct: 292 GHHVKVRNTNSADDDYYLEFEAFNGTRGKGFWKEAKGVDVSPGLDAATMPFQLENVGATT 351 Query: 312 FQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFY 364 F + + PS + F+NNR ++L + Sbjct: 352 FNFKPIPWTARLVGDTNSNPDPSFIGYKITSTFFYNNRFGVLSEDN----IFLGVANDSF 407 Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL----SK 420 +F + D + V ++ + P +G+L+ + + + + Sbjct: 408 NFFVKSALTQVDS-DPIDLNVASVRPVVLNDVLPSPQGLLLFSARQQFQVYSASATTMTP 466 Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST---EQGFRFNEITQLAD 477 ++ + PV VG FV V K + EQ +I+++ Sbjct: 467 KTTVIRSISNYEMSSDISPVDVGTTAAFVNRVPGYSKLFTLQLREIEQSPLVVDISKVVL 526 Query: 478 HLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537 + L + ++ L S+ L + E + AW + Sbjct: 527 EWIPDTVDALTVSPQNSVVM---LTDTQTSYVYLYRFYNNGEKDLFQAWVKWQLPGT--- 580 Query: 538 LSAASFPNDNRGGTSLWMLVALSA 561 + + ++ Sbjct: 581 -----IQAADIIDDDVTVVSQHED 599 >gi|291336928|gb|ADD96456.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787] Length = 138 Score = 197 bits (501), Expect = 3e-48, Method: Composition-based stats. Identities = 29/140 (20%), Positives = 49/140 (35%), Gaps = 3/140 (2%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +F+ GELSPRL R DL+ + G N+I +G Q + Sbjct: 1 MARVAVQLTNFTGGELSPRL-DGRNDLAKYPTGCKTLENMIVFPHGSAARRSGTQFVAEV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F +L FG++ ++ +PY + Sbjct: 60 KDSSKETRLIPFEFSTTQTYMLEFGNQYIRFYKDNGQIL--SGGSAYEISSPYLEAELFD 117 Query: 121 LEYAVFGSTAVFVHKDHPPH 140 ++YA H +HP Sbjct: 118 IKYAQSADVMYICHPNHPVK 137 >gi|226940469|ref|YP_002795543.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9] gi|226715396|gb|ACO74534.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9] Length = 874 Score = 195 bits (494), Expect = 2e-47, Method: Composition-based stats. Identities = 64/398 (16%), Positives = 134/398 (33%), Gaps = 25/398 (6%) Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS-----L 239 + +IT ++ + P G + + +I V D + Sbjct: 312 GAIAAGKITIELSVSDPTGSGARLSATVGSVACDGYSVTAIKTVTVIDGGKGYTSPSIVT 371 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + G + + TV + + S + +G Sbjct: 372 VVKQDGRPITGWGPIHATYSVSTSPNTVQLAVTDSGGGSGAALEPVIIDGAITAVNVING 431 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 S AP + G S ++ YP V++ R F+G+ +++++ Sbjct: 432 GSGYFAPVVSVSYAGGGSGATFGQPVVKSSGDYPGAVSYFEQRRCFAGTTRKPQNIWMTK 491 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS---I 416 G + + V+ A+TI + P + +L+ + W ++ Sbjct: 492 SGTESNMGYSLPVR---DDDRIAFRVSAREANTIRHIVPLAQLLLLTSS-AEWRVTSVNS 547 Query: 417 SLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475 SI R S G PV + + L++ G ++ ++ + + GF +++ Sbjct: 548 DAITPRSISVRPQSYIGASNVQPVIINNTLIYASARGGHVRELAYNWQAGGFVTGDLSIR 607 Query: 476 ADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 A HLF+ I+ + + + P +VW V +S L+G + E + AWH H Sbjct: 608 APHLFDDFEIVDMAFGKSPQPVVWFV-----SSSGCLIGLTYVPEQQ-VGAWHWHDTDGV 661 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 +A + L+ ++ + G R + R+ Sbjct: 662 FESCAAVA----EGAEDVLYCVIRRTVNGCSRRYVERM 695 Score = 182 bits (460), Expect = 2e-43, Method: Composition-based stats. Identities = 42/313 (13%), Positives = 85/313 (27%), Gaps = 10/313 (3%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF+ GE++P R D + + G+A RN + +GP ++ R+ Sbjct: 1 MATVKLLQRSFAGGEVTPEFF-GRIDDAKYQSGLAVCRNFVLAPHGPAMNRAGFAFVREV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPAL-FGKTYKTPYTFKDNK 119 + R+ F+ ++ G + ++ A PY + Sbjct: 60 KDSNLKVRLIPFTYSTTQTMVIELGAGYFRFHTQGATLMQPDAPDSPYEVSNPYREDELF 119 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 L Y VH +HPP L + ++ + P + + ++ Sbjct: 120 DLHYVQSADVMTLVHPNHPPQELRRLGA---TNWELKPVSLQPVIAPPENAAASTAGCSE 176 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 TA + + + + +I A Y Sbjct: 177 AKYDYEYVVTAVMVDLVNESAASNVATVR-----SNVYETGCTNTISWSASAGAYRYNVY 231 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + + D+NI+ + S +G + V Sbjct: 232 KKEGGVYGYIGQTAGLSLVDDNISPDLSKTPPIYDNVFSVAGQIESVPVTAGGSFYGTHT 291 Query: 300 RSISVAPQSQTLF 312 I + Sbjct: 292 GIIQSVTVLNGVL 304 >gi|254251749|ref|ZP_04945067.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158] gi|124894358|gb|EAY68238.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158] Length = 545 Score = 191 bits (484), Expect = 3e-46, Method: Composition-based stats. Identities = 61/375 (16%), Positives = 116/375 (30%), Gaps = 28/375 (7%) Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 + H + S D + + F + T Sbjct: 23 TLQSVNAHGLSVGQQFVLSGFESAGLDGLYTVATVPDATHITFNF----TGTLLEGSVLG 78 Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQ---TLFQAGVSVVSWF 322 + + ++ G I+ S + + A Sbjct: 79 ALYPYGLGQAWRASDVGSYVTLNGGLIEITQVVDASKAYGRIVKELSATITAPPDGWMLK 138 Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALT 382 W +GYP V+ + RL +GS G V+ S+ G +YDF+ D + Sbjct: 139 TFMWNPTDGYPCAVSLYQQRLYAAGSSGYPERVWASATGLYYDFTPGT-----DDGDGFS 193 Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI---SLSKGLSIDFRRVSGSGVYACPP 439 V + I + + V + + +I+ R S G P Sbjct: 194 YDVASDQVNQIMHLASSR-ILTVLTQGEEFTIDGGSVGSITPTNINVRSQSIYGTARPRP 252 Query: 440 VSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVW 498 V VG+ L+F ++I+ ++ FR +T+LA H+ ++ + +Q EP +VW Sbjct: 253 VRVGNELIFPQRAAKKIRSMAYDFNTDSFRSQNLTRLAAHITESGVVDIAFQAEPTPVVW 312 Query: 499 VVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 +V + L+ + + E + H + + G L+ +V Sbjct: 313 MVR-----ADGVLISMTYDRD-ENVCGFARHTTDGAFKSVCCIPGAD----GDVLFAVVQ 362 Query: 559 LS-AGEERSFTVRLN 572 + G RL+ Sbjct: 363 RTINGNVVQNVERLD 377 >gi|257139843|ref|ZP_05588105.1| hypothetical protein BthaA_11681 [Burkholderia thailandensis E264] Length = 489 Score = 190 bits (483), Expect = 4e-46, Method: Composition-based stats. Identities = 56/335 (16%), Positives = 120/335 (35%), Gaps = 24/335 (7%) Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI--- 302 F ++ + T V + + G ++ ++ + S Sbjct: 3 SDFTFTFSFGGQLISGGTLGAVYEYGVGQAWRAQDVGSYVEINGGLVQLIAFESASRIFG 62 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 + + + A S + S W +GYP+ V+ RL +GS G + V+ S G Sbjct: 63 VIKRELASTLTAPASGWALKSSMWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGL 122 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL---S 419 + DF+ + +A + + + + + + ++ Sbjct: 123 YLDFTPGTK-----DGEAFGYDMASDQVNQTVHLASA-KILAALTQGEEFTVTGGSAGAI 176 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478 +I+ S G PV VG+ +V+V G++++ ++ +R +T+LA H Sbjct: 177 TPTNINVDSQSVYGCARARPVRVGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAH 236 Query: 479 LFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 + I+ + +Q EP +VW+V + L+ + + E + H+ + Sbjct: 237 VTESGIVDVAFQAEPTPVVWMVR-----ADGVLVSMTYDRD-ENVCGFARHVTDGLFKSV 290 Query: 539 SAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 P D L+ +V + G + RL+ Sbjct: 291 CC--IPGDEG--DVLFAVVQRTINGATVQYVERLD 321 >gi|209966375|ref|YP_002299290.1| hypothetical protein RC1_3113 [Rhodospirillum centenum SW] gi|209959841|gb|ACJ00478.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 638 Score = 188 bits (477), Expect = 2e-45, Method: Composition-based stats. Identities = 73/475 (15%), Positives = 139/475 (29%), Gaps = 32/475 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K +F+ GELSP LL R DL + G RN++ L G + P Sbjct: 1 MTRLRSVKAAFTGGELSPDLL-GRGDLRSYETGALALRNVLILPTGGVTRRPGTAYLATL 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 P R+ +F+ LL F D++L++ ++ +TP+T Sbjct: 60 ---PGPGRLAAFAFDTEQAYLLAFTDRRLEVFRDGATEAV--------LETPWTAGQLAQ 108 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDK--ISFTFDEIKFLPPPWLGDGMISGVKSNA 178 L + + H D PP ++ D ++ F +K L A Sbjct: 109 LAWTQSADVLLVCHPDVPPRRIVRSGDRRWRCEAWRFSTVKTADGRALQRLPFHRFADAA 168 Query: 179 KLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV---ADDKV 235 R+ + +F GR RL + ++ + D Sbjct: 169 VTLTPSGTRGRVRVRASAPVFDGAHAGRPFRLRRRQGLVVAVRSPTLAEIDLLEDVPDAE 228 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 + + + + +L ++ + G+ + Sbjct: 229 PSIDWDEPAFSPLRGWPVSACFHQDRLVIGGSRDLPNRLWLSRSGDLFDFDPGEGEDDEA 288 Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFH----NNRLLFSGSKGD 351 + + +F V + W G P +R+ + Sbjct: 289 IEFAILSDQVNAIRQVFSGRHLQVFTTGAEW-AVTGEPLTPKEVRLDRQSRVGSGPGRQI 347 Query: 352 E------LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL- 404 +++ GA +F Y T A A + P +L Sbjct: 348 PAREVDGATLFAGRDGAVREFLWTDLESSYSTTDLTLAAGHLCRAPVELDVDPGRRLLLA 407 Query: 405 VGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVF-VCGVGRRIKY 458 V D + L++ ++ ++ R + V + V + + V GR + Sbjct: 408 VQADGGVAALTLDRAEQVTGWTRLETDGAVRSLAVVR--GEVHWLVERQGRWMLE 460 Score = 165 bits (417), Expect = 2e-38, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 77/265 (29%), Gaps = 27/265 (10%) Query: 311 LFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG 370 + W A+ G+P FH +RL+ GS+ ++LS G +DF Sbjct: 223 DVPDAEPSIDWDEPAFSPLRGWPVSACFHQDRLVIGGSRDLPNRLWLSRSGDLFDFDP-- 280 Query: 371 EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVS 430 G + +A+ A+ + I + + V + W ++ + R S Sbjct: 281 --GEGEDDEAIEFAILSDQVNAIRQVFSGRH-LQVFTTGAEWAVTGEPLTPKEVRLDRQS 337 Query: 431 GSGVYACPPVS---VGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQL 487 G + V +F G +++ E + ++T A HL + Sbjct: 338 RVGSGPGRQIPAREVDGATLFAGRDGAVREFLWTDLESSYSTTDLTLAAGHLCRAPVE-- 395 Query: 488 VYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDN 547 +P + + + + + + W L+ Sbjct: 396 -LDVDPGRRLLLAV----QADGGVAALTLDRAEQ-VTGWTRLETDGAVRSLAVVRG---- 445 Query: 548 RGGTSLWMLVALSAGEERSFTVRLN 572 + LV R + Sbjct: 446 ----EVHWLVERQG---RWMLEQWE 463 >gi|296532340|ref|ZP_06895077.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267336|gb|EFH13224.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 626 Score = 185 bits (468), Expect = 3e-44, Method: Composition-based stats. Identities = 52/389 (13%), Positives = 112/389 (28%), Gaps = 29/389 (7%) Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDR 247 ++ S + + + + + + + Sbjct: 90 GDVQVASLAGPWTAAMLDAIAWTQSADTLLLLHPDMVPQRVTRSSNTSWSIAPWSFVREP 149 Query: 248 FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQ 307 F + T +V +S + + V + + G V+ + S Sbjct: 150 FYRFASPGVTLAPSATSGSVTLTASAAAFQPGHAGVR-FRLGGKRVLVTAVASATSATAS 208 Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + + W +A+ G+P FH +RL+ GS+ ++LS G ++F Sbjct: 209 VEETLPGTAASADWDEAAFSAVRGWPVTACFHQDRLVLGGSRDLPNRLWLSRSGDLFNFD 268 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFR 427 G +A+ + + I + + V + W+++ SI Sbjct: 269 ----LGSGLDDQAIEFGLLSDQVNAIRAVFSGRH-LQVFTSGAEWMVTGEPMTPASIQLH 323 Query: 428 RVSGSGVYACP---PVSVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADHLFNQR 483 R + G PV V +FV G+ + + +Q ++ N++ +A HL Sbjct: 324 RQTRIGSPVARIIPPVDVDGSTIFVARSGQAVHEYAYTDVQQAYQANDLALVARHLVQTP 383 Query: 484 ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 + + Y + L L + AW L+ Sbjct: 384 V-SMAYDQTRR------LLHVAMQGGWLATLTLYRAEQ-VTAWTRQDTDGAFRALA---- 431 Query: 544 PNDNRGGTSLWMLVALSAGEERSFTVRLN 572 ++W V + R + Sbjct: 432 ----EIDGTVWCAVERAGAMR---LERFD 453 Score = 179 bits (453), Expect = 1e-42, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 76/212 (35%), Gaps = 21/212 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M TK SF+AGEL +LL R DL + G + RN+ G L P ++ + Sbjct: 1 MAAGRSTKTSFTAGELGDQLL-GRGDLRAYENGARRLRNVFIQPTGGLTRRPGLRHVAEL 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 P R+ +F L+V + L++ + P+T + Sbjct: 60 ---PGPARLIAFEFNTEQTYLVVLTHQGLRVFLGDVQVASLAG--------PWTAAMLDA 108 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + + T + +H D P + + S++ F+ P+ S Sbjct: 109 IAWTQSADTLLLLHPDMVPQRVTRSSN---TSWSIAPWSFVREPFYR------FASPGVT 159 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGC 212 A + + +T+ F+P G RLG Sbjct: 160 LAPSATSGSVTLTASAAAFQPGHAGVRFRLGG 191 >gi|83720451|ref|YP_441475.1| hypothetical protein BTH_I0919 [Burkholderia thailandensis E264] gi|83654276|gb|ABC38339.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 405 Score = 184 bits (467), Expect = 3e-44, Method: Composition-based stats. Identities = 48/253 (18%), Positives = 98/253 (38%), Gaps = 21/253 (8%) Query: 325 AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTA 384 W +GYP+ V+ RL +GS G + V+ S G + DF+ + +A Sbjct: 1 MWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTK-----DGEAFGYD 55 Query: 385 VTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL---SKGLSIDFRRVSGSGVYACPPVS 441 + + + + + + ++ +I+ S G PV Sbjct: 56 MASDQVNQTVHLASA-KILAALTQGEEFTVTGGSAGAITPTNINVDSQSVYGCARARPVR 114 Query: 442 VGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500 VG+ +V+V G++++ ++ +R +T+LA H+ I+ + +Q EP +VW+V Sbjct: 115 VGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEPTPVVWMV 174 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS 560 + L+ + + E + H+ + P D L+ +V + Sbjct: 175 R-----ADGVLVSMTYDRD-ENVCGFARHVTDGLFKSVCC--IPGDEG--DVLFAVVQRT 224 Query: 561 -AGEERSFTVRLN 572 G + RL+ Sbjct: 225 INGATVQYVERLD 237 >gi|83313369|ref|YP_423633.1| hypothetical protein amb4270 [Magnetospirillum magneticum AMB-1] gi|82948210|dbj|BAE53074.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 634 Score = 182 bits (461), Expect = 1e-43, Method: Composition-based stats. Identities = 49/375 (13%), Positives = 101/375 (26%), Gaps = 30/375 (8%) Query: 196 MKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK-----VYRSLTTGRSGDRFGY 250 + + + + + + Sbjct: 99 ETPWSTAQVAQLSWTQSADTLLVVHPDVEPRKITRTGANSWVLETWSYYQEDGILYVPTH 158 Query: 251 SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQT 310 V + L++ + A+ A + V G +S + + + Sbjct: 159 KFAKDAVTLTPSGTSGTITLTASEAVFDAAHAGCRFRVGGKQVLISAVTSATQAQAEVKQ 218 Query: 311 LFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG 370 + W ++ G+P V FH RL GS+G ++LS ++F Sbjct: 219 TLGGTAATEDWEEQSFSPLRGWPVSVCFHQGRLAIGGSRGLPNRLWLSKSMDLFNFD--- 275 Query: 371 EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVS 430 G +A+ ++ I + + V + W++ S I R + Sbjct: 276 -LGTGLDDEAIEFSLLSTQVDAIRAVFSGRH-LQVFTSGAEWMVVGSPLTPTKIQLNRQT 333 Query: 431 GSGVY---ACPPVSVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADHLFNQRILQ 486 G + PP V FV GR ++ +Q ++ N+++ +A H+ N + Q Sbjct: 334 RVGSPVDRSVPPRDVDGATHFVSRSGRDLREFLFADVDQAYQANDLSMVAKHVMNTPVDQ 393 Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 L + + E AW ++ Sbjct: 394 -------DYDASRRLFHVVMADGLMATLTVYR-AEKVTAWTVFETQGAFRSVAVVDG--- 442 Query: 547 NRGGTSLWMLVALSA 561 +LV Sbjct: 443 -----DTHVLVERGG 452 Score = 182 bits (460), Expect = 2e-43, Method: Composition-based stats. Identities = 72/472 (15%), Positives = 137/472 (29%), Gaps = 32/472 (6%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 TK SF+AGE+ L R DL+L+A G RN++ G + P ++ R Sbjct: 8 TKTSFTAGEVDVDL-AGRGDLALYANGAKSLRNVVVAPIGGVRRRPGLRHVAPAR---GP 63 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 R+ +F LL D ++ I + +TP++ L + Sbjct: 64 GRLIAFEFNTEQTYLLALSDHRMDIYADGAKV--------AELETPWSTAQVAQLSWTQS 115 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 T + VH D P + S+ + + + +A Sbjct: 116 ADTLLVVHPDVEPRKITRTGA---NSWVLETWSYYQEDGILYVPTHKFAKDAVTLTPSGT 172 Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG---R 243 + T +T+ +F G R+G + T+ + V + T + Sbjct: 173 SGTITLTASEAVFDAAHAGCRFRVGGKQVLISAVTSATQAQAEVKQTLGGTAATEDWEEQ 232 Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 S + + L ++ + G + + + Sbjct: 233 SFSPLRGWPVSVCFHQGRLAIGGSRGLPNRLWLSKSMDLFNFDLGTGLDDEAIEFSLLST 292 Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN-NRLLFSGSKGDEL--------- 353 + +F V + W G P T NR GS D Sbjct: 293 QVDAIRAVFSGRHLQVFTSGAEWM-VVGSPLTPTKIQLNRQTRVGSPVDRSVPPRDVDGA 351 Query: 354 SVYLSSFG-AFYDFSLDGEYGCYDPTK-ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 + ++S G +F Y ++ + + +V D + Sbjct: 352 THFVSRSGRDLREFLFADVDQAYQANDLSMVAKHVMNTPVDQDYDASRRLFHVVMADGLM 411 Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST 463 L++ ++ ++ + + V GD V V G + T Sbjct: 412 ATLTVYRAEKVTAWTVFETQGAFRSVAVVD-GDTHVLVERGGSHVIECFDDT 462 >gi|288959323|ref|YP_003449664.1| hypothetical protein AZL_024820 [Azospirillum sp. B510] gi|288911631|dbj|BAI73120.1| hypothetical protein AZL_024820 [Azospirillum sp. B510] Length = 632 Score = 180 bits (455), Expect = 7e-43, Method: Composition-based stats. Identities = 47/238 (19%), Positives = 73/238 (30%), Gaps = 15/238 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K +F+AGE+S RLL R DL + G RNL G + + Sbjct: 2 MGRLHQVKTNFTAGEVSRRLL-GRGDLKAYDNGALALRNLFIDPTGGVTRRSGLAF---T 57 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P R+ +F LLVF D+++ + + P+T Sbjct: 58 ALAPGDGRLVAFERNSEQTYLLVFTDRRIDVFQ--------GGSRLASVAAPWTLTQLAQ 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + + T + H D PP L DG + E F L A Sbjct: 110 ITWTQSADTLLVCHPDLPPRKLTRGDDGG---WALAEWAFAVEGGLVRTPFHRFGDPAVT 166 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 +T+ +F P G +R+ + + V + Sbjct: 167 VTPSGTGGAITVTASAPVFDPRQDGTRLRIRGKQLLVTGVVSATQVNATVKETLADTQ 224 Score = 174 bits (441), Expect = 3e-41, Method: Composition-based stats. Identities = 56/394 (14%), Positives = 112/394 (28%), Gaps = 33/394 (8%) Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR-----SLTTG 242 +R+ S + + + + DD + G Sbjct: 91 GGSRLASVAAPWTLTQLAQITWTQSADTLLVCHPDLPPRKLTRGDDGGWALAEWAFAVEG 150 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 + G V + +++ + G V+ + Sbjct: 151 GLVRTPFHRFGDPAVTVTPSGTGGAITVTASAPVFDPRQDGTRLRIRGKQLLVTGVVSAT 210 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 V + W A+ G+P FH +RL+ GS+ ++LS Sbjct: 211 QVNATVKETLADTQPTPQWEEQAFSALRGWPVSAAFHQDRLVIGGSRDLPNRLWLSRSAQ 270 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL 422 ++F G +A+ + + + + + V + ++++ Sbjct: 271 IWNFD----LGEGLDDQAIEFGILSDQVNAVRAVFSGRH-LQVFTSGAEYMVTGDPLTPQ 325 Query: 423 SIDFRRVSGSGVY---ACPPVSVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADH 478 S+ +R + G A PP V +FV R I+ TE ++ N++ LA H Sbjct: 326 SMQVKRQTRIGSPMDRAIPPRDVEGATLFVPRNRREIREFLFTDTEAAYQANDLALLARH 385 Query: 479 LFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 L Y + +++V +E L E AW + Sbjct: 386 LVASP-RDQDYDQN-RRLLFVAME-----DGTLGALTAYR-AEDVTAWTLLETDGAVRSV 437 Query: 539 SAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 +A G ++ LV R + Sbjct: 438 AAV--------GDEVYALVERRGFWT---IERFD 460 >gi|54302254|ref|YP_132247.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9] gi|46915675|emb|CAG22447.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9] Length = 919 Score = 179 bits (453), Expect = 1e-42, Method: Composition-based stats. Identities = 59/380 (15%), Positives = 116/380 (30%), Gaps = 26/380 (6%) Query: 197 KIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATY 256 + + + + + + SL T Sbjct: 293 SVTDARHAICEVLVRLPDSVVGGERSKLTWNFPGETTQRTFSLATPPLTSNTMKDFTVKL 352 Query: 257 VKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGV 316 V T S + V P G + R + V Q+ Sbjct: 353 VGTTTKTLQFPNEYSIDFDAKRLDLYVNPGVTSGSGSSSTTTARDVDVVQQA-----TSR 407 Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376 S W + W GYP T+ RL + + +V+LS +F DFS Sbjct: 408 STYKWAIEIWRNSTGYPRCGTYFQQRLSMANTISHPQTVWLSRTDSFNDFSKTRPI---L 464 Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL----SISLSKGLSIDFRRVSGS 432 ++ + + I + P + + LW L + S + + Sbjct: 465 ADDSMRYDINSLQVNEIFNIVPLNSLL-LFTSGGLWSLAQDQQGAFSAESPPSVKMQNYE 523 Query: 433 GVYACPPVSVGDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLFNQ-RILQLVYQ 490 G P+ G ++V R ++ I S + F ++T A HLF R+++ Y Sbjct: 524 GANKLRPIVAGSTAIYVQQGDRIVRDIQFSWSSDSFEGVDLTVRASHLFKHKRVVEWAYA 583 Query: 491 EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGG 550 + P ++WV+ + + E + + W H + K+ +++ + Sbjct: 584 KNPDKLIWVIFD-----DGTAATLTYMKEQQ-IWGWCPHTTNGKYKNVASV----EEGSR 633 Query: 551 TSLWMLVAL-SAGEERSFTV 569 +S++ +V G + Sbjct: 634 SSIYFVVERIINGAPVNVIE 653 Score = 165 bits (416), Expect = 2e-38, Method: Composition-based stats. Identities = 47/330 (14%), Positives = 97/330 (29%), Gaps = 18/330 (5%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 ++ S SAGELSP + R D + G+AK+ N +G + + P Sbjct: 5 LSQPSMSAGELSPE-MYGRVDTDHYRIGLAKAENFFVNYHGGISNRPGTT-LSYITARNE 62 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125 + F +L FG + +++ + + + TPY + L Y Sbjct: 63 VVALIPFQFSAFDSFMLEFGTEYMRV-MSKGKYITDNSGVKIQVVTPYLAGEILDLSYTQ 121 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 H++H + + D + + + P+ + A + Sbjct: 122 SADVLTIFHRNHAIQQIKRYSNID---WRVEPLINKLGPFESININESQFMYADKNGD-- 176 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYRSLTT 241 + S+ F G+ + L +W + + G Y Sbjct: 177 VGEQITLISNFDAFTSDLVGKMVYLDQEETGDISQWMQRYEVAEGDQTYNAGNYYICTKA 236 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY--YVWGDIKDVSKDG 299 + + V W +R++ G Y +G +K +S Sbjct: 237 ELYNGKKAQTGDIAPVHSTGERWDGPGKFLPDDNRDANIGVRWAYLNSGYGVVKIISVTD 296 Query: 300 RSIS----VAPQSQTLFQAGVSVVSWFMSA 325 + + ++ S ++W Sbjct: 297 ARHAICEVLVRLPDSVVGGERSKLTWNFPG 326 >gi|31711676|ref|NP_853594.1| tail protein [Enterobacteria phage SP6] gi|31505680|gb|AAP48773.1| gp34 [Enterobacteria phage SP6] gi|40787051|gb|AAR90025.1| 33 [Enterobacteria phage SP6] Length = 803 Score = 178 bits (452), Expect = 2e-42, Method: Composition-based stats. Identities = 49/598 (8%), Positives = 127/598 (21%), Gaps = 63/598 (10%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIP---DG 77 + + ++ N++P S + Sbjct: 14 ISQQPPAVRLDGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYGEDDMAVHHYRRGGEGE 73 Query: 78 GYALLVFGDKKL-QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD 136 + ++ +I + + + +++ +++ Sbjct: 74 EEYFFIMKKGQVPEIFDKQGRKCMVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNR- 132 Query: 137 HPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDM 196 ++ + I F+ G + D + A D+ Sbjct: 133 ---KKIVKARPERSPQVGSTAIVFMAYGQYGTHYKIIIDGVVAAGYKTRDGAEAHHIEDI 189 Query: 197 KIFKPLD--KGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGA 254 + + SI + T + + + Sbjct: 190 RTESIAYNLYQSLQSWDKIADYEIQLDGTSIYITRRDGSTTFDITTEDGAKGKDLVAIKY 249 Query: 255 TYVKDNNITWITVLNLSSKTS----------------RESASGAVAPYYVWGDIKDVSKD 298 + + + + + + K Sbjct: 250 KVASTDLLPSRAPEGYKVQVWPTGSKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDKS 309 Query: 299 GRSISVAPQ----SQTLFQAGVSVVSWFMSAWGEQEGYPSHV-----------TFHNNRL 343 + F+ PS + NRL Sbjct: 310 TMPYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGMFMVQNRL 369 Query: 344 LFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGV 403 + + V + F+DF T + Sbjct: 370 CVTAGEA----VIATRTSYFFDFFRYTAVSAV-ATDPFDVFSDASEVYQLKHAVTLDGST 424 Query: 404 LVGCDTSLWLLSIS-LSKGLSIDFRR-VSGSGVYACPPVSVGDCLVFVCGVGRR--IKYI 459 ++ D S ++L + ++ + + PV+ G+ ++F G I+ Sbjct: 425 VLFADKSQFILPGDKPLEKSNVLLKPVTTFEVNNNVKPVATGESVMFATSEGAYSGIREF 484 Query: 460 SGS-TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSA 518 + IT + L ++ + + ++ VL K + + Sbjct: 485 YTDSYSDTKKAQAITSHVNKLLEGNVIMMSASTNVNRLL--VLTDKYRNIIYCYDWLWQG 542 Query: 519 EGEGDFAWHTHMIS-DKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLD 575 AWH G L++L+ G + R+++ D Sbjct: 543 TERVQAAWHKWEWPLGTF-------IRGMFYSGEHLYLLIER--GSTGVYLERMDMGD 591 >gi|83571759|ref|YP_425011.1| putative tail tubular B protein [Enterobacteria phage K1E] gi|83308210|emb|CAJ29442.1| gp33 protein [Enterobacteria phage K1E] Length = 800 Score = 178 bits (451), Expect = 2e-42, Method: Composition-based stats. Identities = 58/597 (9%), Positives = 128/597 (21%), Gaps = 64/597 (10%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIP--DGG 78 + + N++P S + N Sbjct: 14 ISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDE 73 Query: 79 YALLVFGDKKL-QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH 137 ++ +I + + +++ +++ Sbjct: 74 EYFFTLKKGQVPEIFDKHGRKCNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNR-- 131 Query: 138 PPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT---- 193 ++ + + I F G + S D +A Sbjct: 132 --RKVVKVSNRKSPKVGDKAIVFCAYGQYGTSYSIIINGTTAASFKTPDGGSAEHVEQIR 189 Query: 194 ---------------SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 S + ++ G SI + + T + Sbjct: 190 TERITSELYSKLQQWSGVNDYEIQRDGTSIFIERRDGKSFTVTTTDGAKGKDLVAIKNKV 249 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 +T R + + S + K Sbjct: 250 SSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVS--WKETIAADVLLGFDKG 307 Query: 299 GRSISVAPQS--QTLFQAGVSVVSWFMSAWGE--QEGYPSHV-----------TFHNNRL 343 + + Q + W G+ PS + NRL Sbjct: 308 TMPYIIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRL 367 Query: 344 LFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGV 403 F+ + V S F+DF T + Sbjct: 368 CFTAGEA----VIASRTSYFFDFFRYTVIS-ALATDPFDIFSDASEVYQLKHAVTLDGAT 422 Query: 404 LVGCDTSLWLLSIS-LSKGLSIDFRR-VSGSGVYACPPVSVGDCLVFVCGVGRR--IKYI 459 ++ D S ++L + + + + PV G+ ++F G ++ Sbjct: 423 VLFSDKSQFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDGSYSGVREF 482 Query: 460 SGS-TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSA 518 + IT + L I + + ++ V K + + Sbjct: 483 YTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLL--VTTDKYRNIIYCYDWLWQG 540 Query: 519 EGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLD 575 AWH V G L++L+ G + ++++ D Sbjct: 541 TDRVQSAWHVWEWPMGTKV------RGMFYSGELLYLLLERGDGV---YLEKMDMGD 588 >gi|108862018|ref|YP_654134.1| 33 [Enterobacteria phage K1-5] gi|40787104|gb|AAR90075.1| 33 [Enterobacteria phage K1-5] Length = 800 Score = 176 bits (445), Expect = 1e-41, Method: Composition-based stats. Identities = 59/595 (9%), Positives = 123/595 (20%), Gaps = 60/595 (10%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI--PDGG 78 + + N+IP S + Sbjct: 14 ISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGTDDMATHHYRRGDGDE 73 Query: 79 YALLVFGDKKL-QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH 137 ++ +I + + +++ +++ Sbjct: 74 EYFFTLKKGQVPEIFDKYGRKCNVTSQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRK 133 Query: 138 PPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST---ARITS 194 I F G + S D + Sbjct: 134 VVKASSRKSPKVGN----KAIVFCAYGQYGTSYSIVINGANAASFKTPDGGSADHVEQIR 189 Query: 195 DMKIFKPLDKGRSIRLGCHPPEWAK-----------NTNYSIGAYIVADDKVYRSLTTGR 243 +I L G E + +++I A K ++ Sbjct: 190 TERITSELYSKLQQWSGVSDYEIQRDGTSIFIERRDGASFTITTTDGAKGKDLVAIKNKV 249 Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASG---AVAPYYVWGDIKDVSKDGR 300 S S+ K + E G + + K Sbjct: 250 SSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTM 309 Query: 301 SISVAPQSQTL----FQAGVSVVSWFMSAWGEQEGYPSHV-----------TFHNNRLLF 345 + F+ PS + NRL F Sbjct: 310 PYIIERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCF 369 Query: 346 SGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV 405 + + V S F+DF T + ++ Sbjct: 370 TAGEA----VIASRTSYFFDFFRYTVIS-ALATDPFDIFSDASEVYQLKHAVTLDGATVL 424 Query: 406 GCDTSLWLLSIS-LSKGLSIDFRR-VSGSGVYACPPVSVGDCLVFVCGVGRR--IKYISG 461 D S ++L + + + + PV G+ ++F G ++ Sbjct: 425 FSDKSQFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDGSYSGVREFYT 484 Query: 462 S-TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEG 520 + IT + L I + + ++ V K + + Sbjct: 485 DSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLL--VTTDKYRNIIYCYDWLWQGTD 542 Query: 521 EGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLD 575 AWH V G L++L+ G + ++++ D Sbjct: 543 RVQSAWHVWKWPIGTKV------RGMFYSGELLYLLLERGDGV---YLEKMDMGD 588 >gi|311875239|emb|CBX44498.1| putative tail tubular protein B [Erwinia phage phiEa1H] gi|311875360|emb|CBX45101.1| putative tail tubular protein B protein [Erwinia phage phiEa100] Length = 806 Score = 173 bits (438), Expect = 6e-41, Method: Composition-based stats. Identities = 55/593 (9%), Positives = 135/593 (22%), Gaps = 58/593 (9%) Query: 29 LHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD-PRSNRVFSFSI-PDGGYALLVFGD 86 V N +P L + + ++ + + D ++ Sbjct: 22 RLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAEEYFVILQP 81 Query: 87 KKLQI-VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYI 145 ++ + V + ++ + G +++ P + Sbjct: 82 GQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDV 141 Query: 146 Q---------DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARI---- 192 +F+F + + + + + + D ++ Sbjct: 142 TPSLDNKGLVYVAYANFSFTYQILINGQVAAEHKTASSEDVKNEDLVRTDYVAGKLLENF 201 Query: 193 ---TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 T+ F G + + T ++ +R Sbjct: 202 NSRTASFPGFSMYQDGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLPNRAP 261 Query: 250 YSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQ 309 + + + G K + + +S Sbjct: 262 VGYKVQVWPTGSKPESRYWLQAESQDGSKVT--WVETIAPGVRKGWNAATMPHVLVRESL 319 Query: 310 T-----LFQAGVSVVSWFMSAWGEQEGYPS-----------HVTFHNNRLLFSGSKGDEL 353 F +PS + NRL+ + + Sbjct: 320 NANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLMLTSGEA--- 376 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 V S F+DF D I W V++ + Sbjct: 377 -VVASRTSRFFDFFRYTVLATVDT-DPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFT 434 Query: 414 LSIS-LSKGLSIDFRRVSGS-GVYACPPVSVGDCLVFVCGVGRR--IKYISGS-TEQGFR 468 L S R V+ P GD ++F G I+ + Sbjct: 435 LPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDTKK 494 Query: 469 FNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 T D ++L+L + ++ D + + + + AWH Sbjct: 495 AQPATSHVDKYIRGKVLELSASSSFNRA--FIITSSDRNILYVYDWLYEGTEKVQNAWHK 552 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSA---GEERSFTVRLNLLDDFK 578 + + + L++++ + G + +++ D+ + Sbjct: 553 WSFPAGTVLHAV------SYSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELE 599 >gi|125999999|ref|YP_001039670.1| tail tubular protein B-like protein [Erwinia amylovora phage Era103] gi|121621855|gb|ABM63429.1| tail tubular protein B-like protein [Enterobacteria phage Era103] Length = 806 Score = 173 bits (438), Expect = 6e-41, Method: Composition-based stats. Identities = 55/593 (9%), Positives = 135/593 (22%), Gaps = 58/593 (9%) Query: 29 LHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD-PRSNRVFSFSI-PDGGYALLVFGD 86 V N +P L + + ++ + + D ++ Sbjct: 22 RLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAEEYFVILQP 81 Query: 87 KKLQI-VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYI 145 ++ + V + ++ + G +++ P + Sbjct: 82 GQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDV 141 Query: 146 Q---------DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARI---- 192 +F+F + + + + + + D ++ Sbjct: 142 TPSLDNKGLVYVAYANFSFTYQILINGQVAAEHKTASSEDVKNEDLVRTDYVAGKLLENF 201 Query: 193 ---TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 T+ F G + + T ++ +R Sbjct: 202 NSRTASFPGFSMYQDGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLPNRAP 261 Query: 250 YSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQ 309 + + + G K + + +S Sbjct: 262 VGYKVQVWPTGSKPESRYWLQAESQDGSKVT--WVETIAPGVRKGWNAATMPHVLVRESL 319 Query: 310 T-----LFQAGVSVVSWFMSAWGEQEGYPS-----------HVTFHNNRLLFSGSKGDEL 353 F +PS + NRL+ + + Sbjct: 320 NANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLMLTSGEA--- 376 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 V S F+DF D I W V++ + Sbjct: 377 -VVASRTSRFFDFFRYTVLATVDT-DPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFT 434 Query: 414 LSIS-LSKGLSIDFRRVSGS-GVYACPPVSVGDCLVFVCGVGRR--IKYISGS-TEQGFR 468 L S R V+ P GD ++F G I+ + Sbjct: 435 LPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDTKK 494 Query: 469 FNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 T D ++L+L + ++ D + + + + AWH Sbjct: 495 AQPATSHVDKYIRGKVLELSASSSFNRA--FIITSPDRNILYVYDWLYEGTEKVQNAWHK 552 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSA---GEERSFTVRLNLLDDFK 578 + + + L++++ + G + +++ D+ + Sbjct: 553 WSFPAGTVLHAV------SYSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELE 599 >gi|61806431|ref|YP_214208.1| T7-like tail tubular protein B [Prochlorococcus phage P-SSP7] gi|61374356|gb|AAX44210.1| T7-like tail tubular protein B [Prochlorococcus phage P-SSP7] gi|265525468|gb|ACY76234.1| predicted protein [Prochlorococcus phage P-SSP7] Length = 976 Score = 165 bits (416), Expect = 2e-38, Method: Composition-based stats. Identities = 61/509 (11%), Positives = 126/509 (24%), Gaps = 48/509 (9%) Query: 86 DKKLQIVVVRSSTKWSPALFGKTYKT-----PYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140 + +I V S ++ T TF G Sbjct: 277 NLYFRIRTVGQSVPFTTGSGSSATTTYQARYTTTFDLLYGGTGWQEGDYFYV-------- 328 Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFK 200 + L + + ++ F Sbjct: 329 -WMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAIIATGNFT 387 Query: 201 PLDKGRS-IRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 + + L P N + ++ S V + Sbjct: 388 SANVQQIGTGLYVTRPSGTFNVTAPSSDLLRVMSGEVANVDDLPSQC---KHGYVVKVAN 444 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + + G + K I + Q+ F + Sbjct: 445 SEADADDYYVKFFGHNNRDGDGVWEECAKPSRNIEFDKGTMPIQLVRQANGTFTVSQATW 504 Query: 320 SWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 PS V F NRL+F + V +S G F++F Sbjct: 505 QNAEVGDELTNPNPSFVGKTINQLVFFRNRLVFLSDEN----VIMSRPGEFFNFWSKTA- 559 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRR---V 429 + P + + + + ++ G+L+ ++L+ + Sbjct: 560 TTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVS 619 Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEI---TQLADHLFNQRILQ 486 S + PVS+G + F+ + ++ S ++ +++ L ++ I Sbjct: 620 SYNFNEKTHPVSLGTTVAFIDNANQFTRFFEMSNVVRQGEPDVVDQSKVISRLLDKNISL 679 Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 + E +S+V+ KD S E AW T I+ Sbjct: 680 VSVSRE-NSVVFF--SQKDTDKIYCFRYFTSGEKRLLQAWTTWTITGNIQYHCM------ 730 Query: 547 NRGGTSLWMLVALSAGEERSFTVRLNLLD 575 +L+ +V + +++ L L D Sbjct: 731 --LDDALY-VVTRNNNKDQIVKYSLKLDD 756 Score = 76.5 bits (186), Expect = 1e-11, Method: Composition-based stats. Identities = 31/337 (9%), Positives = 76/337 (22%), Gaps = 23/337 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + T T + + G L + D V+ + N+IP L+ P + Sbjct: 1 MASVTQTIPTLTGG------LSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGKLVASI 54 Query: 61 RL-------DPRSNRVFSFSIPDGGYALLVFG-DKKLQIVV-VRSSTKWSPALFGKTYKT 111 + + FS+ + + + + G Sbjct: 55 SDNGTAALNSQTNGKWFSYYRDETESYIGQVSRSGDINMWRCSDGQAMTVNYDSGTATAL 114 Query: 112 PYTFKDNK--SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDG 169 ++ ++ ++ D Sbjct: 115 TTYLTHTNDEDIQTLTLNDYTFLTNRTKTVAMSSTVEPVRPPEVFIDLKATAYARQYAVN 174 Query: 170 MISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYI 229 + + A ++++ D + +++ R+ R + Sbjct: 175 LFDNTTTTAVSTVTRIDVELIKSSNNYCDSNGAMVARTSRPSNSTRCDDSAGDGRDAYAP 234 Query: 230 VADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVW 289 KV+ D + + + +V + R G P+ Sbjct: 235 NVGTKVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQSVPFTTG 294 Query: 290 GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 S + + + T F W + Sbjct: 295 ------SGSSATTTYQARYTTTFDLLYGGTGWQEGDY 325 >gi|291335792|gb|ADD95393.1| tail tubular protein B [uncultured phage MedDCM-OCT-S05-C532] Length = 647 Score = 163 bits (412), Expect = 7e-38, Method: Composition-based stats. Identities = 53/445 (11%), Positives = 120/445 (26%), Gaps = 34/445 (7%) Query: 146 QDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205 + L + + ++ S + D F + Sbjct: 4 GYYKITVEAISTTQIQANLGLIRPNPTPFDTETTVTASGILGDIRQAIIDTGNFTSSNVK 63 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG-YSKGATYVKDNNITW 264 + +G + +++ + KV S V ++ Sbjct: 64 Q---IGNGIYVTRPSGTFNVTSPTSDLLKVMSSEVKNVDDLPDQCKHGYVVKVANSEADE 120 Query: 265 ITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS 324 + G G + K I + Q+ F + Sbjct: 121 DDYFVKFYGNNDRDGDGVWEECAKPGRNIEFDKGTMPIQLVRQANGTFTVSQATWENADV 180 Query: 325 AWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377 PS V F NRL+F + V +S G F++F + P Sbjct: 181 GDTLTNPNPSFVGKTVNQLVFFRNRLVFLSDEN----VIMSRPGEFFNFWSKTA-TTFTP 235 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRR---VSGSGV 434 + + + + ++ G+L+ ++L+ + S + Sbjct: 236 QDVIDLSCSSEYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKLNAVASYNFN 295 Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEI---TQLADHLFNQRILQLVYQE 491 PVS+G + F+ + ++ S ++ +++ L ++ I LV + Sbjct: 296 EKTNPVSLGTTVAFIDNANKYTRFFEMSNVLRQGEPDVVDQSKVISRLLDKDI-SLVSES 354 Query: 492 EPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGT 551 +S V+ + D S + AW T ++ Sbjct: 355 RENSAVFFSKKGTD--EIYCFRYFNSGDKRLLQAWCTWTLAGNIQYHCML---------D 403 Query: 552 SLWMLVALSAGEERSFTVRLNLLDD 576 ++ + +++ L L D+ Sbjct: 404 DALFVITRNNNKDQMVKYSLKLDDN 428 >gi|310005866|gb|ADP00251.1| tail tube protein B [Cyanophage Syn26] Length = 977 Score = 160 bits (405), Expect = 5e-37, Method: Composition-based stats. Identities = 61/509 (11%), Positives = 133/509 (26%), Gaps = 46/509 (9%) Query: 85 GDKKLQIVVVRSSTKWSPALFGKTYKT-----PYTFKDNKSLEYAVFGSTAVFVHKDHPP 139 + +I S ++ + T TF G D Sbjct: 277 TNLYFRIRTTGQSVPFTTGAGNEQVTTYQARYTTTFDLLYGGSGWQQGDYFYVWMDDGYY 336 Query: 140 HHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 ++ +I I+ P P+ + I+ + + DT Sbjct: 337 KVVIEAISTTQIQANLGLIRPNPTPFDTETTITASGILGDIRQAIIDTGNFT-------- 388 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 + L P N + +S+ V + Sbjct: 389 SANVQQIGNGLYITRPSGTFNATAPTSDLLKVMSSEVKSVDDLPDQC---KHGYVVKVAN 445 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + + G G + K I + Q+ F + Sbjct: 446 SEADEDDYYVKFFGNNDRDGDGVWEECAKPGRNIEFDKGTMPIQLVRQANGTFLVSQATW 505 Query: 320 SWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 PS V F NRL+F + V +S G F++F Sbjct: 506 ENAEVGDDLTNPNPSFVGKTVNQLVFFRNRLVFLSDEN----VIMSRPGEFFNFWSKTA- 560 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRR---V 429 + P + + + + ++ G+L+ ++L+ + Sbjct: 561 TTFTPMDVIDLSCSSEYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKLNAVA 620 Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEI---TQLADHLFNQRILQ 486 S + P+++G + F+ + ++ S ++ +++ L ++ I Sbjct: 621 SYNFNEKTNPINLGTTVAFIDNANQFTRFFEMSNVLRQGEPDVVDQSKVISRLLDKDISL 680 Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 + E ++ + KD S E AW T + Sbjct: 681 VSESRENSAVFF---SKKDTDTIYCFRYFTSGEKRLLQAWCTWTVVGNIQYHCM------ 731 Query: 547 NRGGTSLWMLVALSAGEERSFTVRLNLLD 575 +L+ ++ + +++ L L D Sbjct: 732 --LDDALY-VITRNNNKDQMVKYSLKLDD 757 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 50/545 (9%), Positives = 119/545 (21%), Gaps = 64/545 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + T T + + G L + D V+ + N+IP L+ P Q Sbjct: 1 MSSVTQTIPTLTGG------LSQQPDELKIPGQVSIATNVIPDVTHGLLKRPGGQLVASI 54 Query: 61 RL-------DPRSNRVFSFSIPDGGYALLVF---GDKKL-QIVVVRSSTKWSPALFGKTY 109 + + FS+ + + GD + + + + Sbjct: 55 SDNGTSALNSQTNGKWFSYYRDETESYIGQVSRSGDINMWRCSDGAAMVVNYDSGTASAL 114 Query: 110 KTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHH---------------LLYIQDGDKISFT 154 T T +++ ++ ++ L + + Sbjct: 115 ATYLTHTNDQDIQTLTLNDFTFITNRTKTVAMSSTVETVRPPEVFIDLRATAYARQYAVN 174 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQAD------------TSTARITSDMK-IFKP 201 + + + ++ + +D T I + Sbjct: 175 LYDNTNTTTETTATRISVDLVKSSNNYCNASDGTLPSRANRISATGRCTINAGDGRDAYA 234 Query: 202 LDKGRSIR-----LGCHPPEWAKNTNYSIGAYIVADDKV------YRSLTTGRSGDRFGY 250 + G I + N Y+I V Y + T F Sbjct: 235 PNVGTRIFDIDDGASLTDEALSGNHTYTIDVKAANGSSVNRGTNLYFRIRTTGQSVPFTT 294 Query: 251 SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQT 310 G V + T +L S + G K V + + + Sbjct: 295 GAGNEQVTTYQARYTTTFDLLYGGSGWQQGDYFYVWMDDGYYKVVIEAISTTQIQANLGL 354 Query: 311 LFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG 370 + + G + + +Y++ ++ Sbjct: 355 IRPNPTPFDTETTITASGILGDIRQAIIDTGNFTSANVQQIGNGLYITRPSGTFN--ATA 412 Query: 371 EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVS 430 + D + + ++ + Sbjct: 413 PTSDLLKVMSSEVKSVDDLPDQCKHGYVVKVANSEADEDDYYVKFFGNNDRDGDGVW--- 469 Query: 431 GSGVYACPPVSVGDCLVFVCGVGRRIKYISGS---TEQGFRFNEITQLADHLFNQRILQL 487 + + + V + S E +++T + + QL Sbjct: 470 EECAKPGRNIEFDKGTMPIQLVRQANGTFLVSQATWENAEVGDDLTNPNPSFVGKTVNQL 529 Query: 488 VYQEE 492 V+ Sbjct: 530 VFFRN 534 >gi|83721618|ref|YP_441474.1| gp12 [Burkholderia thailandensis E264] gi|257139844|ref|ZP_05588106.1| gp12, putative [Burkholderia thailandensis E264] gi|83655443|gb|ABC39506.1| gp12, putative [Burkholderia thailandensis E264] Length = 188 Score = 157 bits (395), Expect = 7e-36, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 48/152 (31%), Gaps = 4/152 (2%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T + + +AGELSP L + DL +A GV N IP G ++ Sbjct: 1 MAKITTIQSNLNAGELSPPL-EGHIDLDRYANGVKTMLNAIPQIEGGARRRFGFRQVAAT 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F + GD + + + TP++ Sbjct: 60 K-TTGATRLVPFVFSKSQAYFVELGDAYARFYTDSGQIQQ--SGVPIELATPWSASQLFE 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS 152 LEY T H+ + + Sbjct: 117 LEYTQNSDTMFIAHRHDQRRARVRGRHARCSV 148 >gi|291334666|gb|ADD94313.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] Length = 189 Score = 156 bits (393), Expect = 1e-35, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 79/181 (43%), Gaps = 13/181 (7%) Query: 397 HPFGEGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGV 452 +++G + +S + +I ++ S +G ++VG+ +F+ Sbjct: 1 MTATRTLIIGTAGGEFAVSGGGTDIAITPTNILIKKQSNNGAANVDALAVGNATLFLQRA 60 Query: 453 GRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 R+++ ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V +L Sbjct: 61 RRKLRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRN-----DGQL 115 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVR 570 +G + E + AWH H+ S A+ P D+ W++ + G + + Sbjct: 116 VGLTYQREQQ-VVAWHRHIFGGSAVCESVATIPTDD-SEYQTWVINKRTINGSTKRYVEY 173 Query: 571 L 571 + Sbjct: 174 I 174 >gi|148724484|ref|YP_001285450.1| tail tube B [Cyanophage Syn5] gi|145588129|gb|ABP87948.1| tail tube B [Synechococcus phage Syn5] Length = 905 Score = 153 bits (387), Expect = 6e-35, Method: Composition-based stats. Identities = 55/474 (11%), Positives = 125/474 (26%), Gaps = 43/474 (9%) Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 G T D +L +D + + + + L I Q Sbjct: 248 QNGGTGF-RKGDMITVNLN-GRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQ 305 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT-TGR 243 + + + G I + + + Sbjct: 306 ITAGLVNSVNLISNYSAQAVGNVIEIERTDGRDFNLGVRGGATNRAMTAIKGTANSIVDL 365 Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 G F + +N + + S SG+ G + + + Sbjct: 366 PGQCFDGFELKVINTENAESDDYYVVFRSAAEGIPGSGSWEETVAPGIERGFNTSTMPHA 425 Query: 304 VAPQSQTLFQ-------AGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSK 349 + Q+ F ++ + + PS V F+NNRL F Sbjct: 426 LIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFVGRGISDMFFYNNRLGFLS-- 483 Query: 350 GDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDT 409 E +V +S G +++F + D + + + + +G+++ + Sbjct: 484 --EDAVIMSQPGDYFNFFVTSAITISDS-DPIDVTASSTKPAILRAAIGAPKGLILFAEN 540 Query: 410 SLWLLSISLSKGLSIDFRRVS---GSGVYACPPVSVGDCLVFVCGVGRRIKYISG---ST 463 S +LL+ + + PVS G + FV K S Sbjct: 541 SQFLLASQEVVFSTATIKLTEISDYFYRSLAKPVSTGVSIAFVSEADTYSKIFEMSIDSV 600 Query: 464 EQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGD 523 + + +IT++ + V +++ +++ + Sbjct: 601 DNRPQVADITRIVPEYVPTGLTWSVSTPNNSMMLF----GDNSNTAYIFKFFNQGNERQV 656 Query: 524 FAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSF-TVRLNLLDD 576 W ++ + + + + ++ S+ + LLDD Sbjct: 657 AGWSKWILPGEQRMCG--------FFADTGYFVLY--DSTTGSYVLSAMELLDD 700 Score = 88.1 bits (216), Expect = 4e-15, Method: Composition-based stats. Identities = 28/314 (8%), Positives = 71/314 (22%), Gaps = 24/314 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + G + + D V ++ N+ P + + Sbjct: 1 MGAVLQKIPNLLGG------VSQQPDPVKLPGQVREAENVYLDPTFGCRKRPATKFVGEL 54 Query: 61 RLD-PRSNRVFSFSIPDGGYALLVF-----GDKKLQIVVVRSSTKWSPALFGKTYKTPYT 114 + P R F G + G+ ++++ +++ + + T Sbjct: 55 ATNLPSDTRWFPIFRDAGERYAVALYKDGSGNTQVRVWDMQTGAERTVTPDATATAYLAT 114 Query: 115 FKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 +L + + +K+ + + + + Sbjct: 115 TN-LNNLNWLTVADYTLLSNKERIVTMSGASEVDSNQR------ALVEINAISYNTTYSI 167 Query: 175 K-SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLG-CHPPEWAKNTNYSIGAYIVAD 232 S + F+ D G + ++ + + Sbjct: 168 DLDRDGASQQVKVYRAKALEISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRVQ 227 Query: 233 DKVY--RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 Y + R G T + ++ LN R + V Y G Sbjct: 228 CAAYLENNEYRSRYNVSVVLQNGGTGFRKGDMIT-VNLNGRDYNIRVTQEEFVYTYASDG 286 Query: 291 DIKDVSKDGRSISV 304 + + Sbjct: 287 TAAHTTPQDSTAGT 300 >gi|291334457|gb|ADD94111.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] Length = 206 Score = 153 bits (386), Expect = 7e-35, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 82/185 (44%), Gaps = 18/185 (9%) Query: 403 VLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY 458 V++G + +S + +I ++ S +G ++VG+ +F+ R+++ Sbjct: 6 VIIGTAGGEFAVSGGGTDIAITPTNILIKKQSNNGAANVDALAVGNATLFLQRARRKLRE 65 Query: 459 ISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFS 517 ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V +L+G + Sbjct: 66 LAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRN-----DGQLVGLTYQ 120 Query: 518 AEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV-----RL 571 E + AWH H+ S A+ P D+ W++ + G + + + Sbjct: 121 REQQ-VVAWHRHIFGGSAVCESVATIPTDD-SEYQTWVINKRTINGSTKRYVEYIHQYKF 178 Query: 572 NLLDD 576 + DD Sbjct: 179 DETDD 183 >gi|291334273|gb|ADD93936.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 229 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 69/238 (28%), Gaps = 16/238 (6%) Query: 1 MVNTT----WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQE 56 M T K +F +GEL P L+ R D + +A G K +N+ G + Sbjct: 1 MAKTRSILRQLKTTFQSGELDP-LMNLRSDTTAYANGAKKMQNVSLFSQGGFKRRNGTKR 59 Query: 57 YRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 Y P + R+ F D + F + ++ I + + +L P+T Sbjct: 60 YASL---PGNARLVGFDFDDNEQYICAFSNNRVDIYYLS-----NDSLTQTITSCPWTTS 111 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 +++ G T + H + F Sbjct: 112 ILFEMQFTQAGDTMIITHPSMATQVITRTSLTAFSR---SNYTFDSDSENVYQPYYKFAG 168 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 + + T + ITS F +++ +TN + + Sbjct: 169 SGVTLSASGTTGSVTITSSADHFSSDYVNVYLKIEDTTLLITGHTNATTVTATILGTL 226 >gi|254505331|ref|ZP_05117479.1| hypothetical protein SADFL11_PLAS29 [Labrenzia alexandrii DFL-11] gi|222436175|gb|EEE42857.1| hypothetical protein SADFL11_PLAS29 [Labrenzia alexandrii DFL-11] Length = 683 Score = 148 bits (372), Expect = 3e-33, Method: Composition-based stats. Identities = 48/386 (12%), Positives = 100/386 (25%), Gaps = 31/386 (8%) Query: 193 TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS-----IGAYIVADDKVYRSLTTGRSGDR 247 T D + + +Y+ ++ K Sbjct: 97 TEQTVQAYDADVQTYLSQIPENLSFVTVADYTFVVNRTTEVVMDPSKTAPGTFRDSVQLF 156 Query: 248 FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQ 307 AT I+ SA G++ + + Sbjct: 157 SDLPGSATDGDVYRISNGASPLDDYYVKYVSADTEWVECAKPGEVIGFDAKTMPHQIVRE 216 Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSF 360 F S E PS V F NRL F + + S Sbjct: 217 EDGSFSVSRVEWSDRQVGDAESVKDPSFVGRAFKDIFFFKNRLGFVSDENT----FFSQA 272 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-- 418 F++ D D + A + + + W+ PF + + D + + L+ S Sbjct: 273 ADFFNLWPDQANVVGDS-DPVDIAASTTKVTILQWVVPFRRALFLSADLAQFELASSDFM 331 Query: 419 SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ---GFRFNEITQL 475 + S C P ++GD L F + + ++T+ Sbjct: 332 TPTSVAVDLATSYEATNLCRPTTLGDELYFAAEKQGKTVIYEYFYDDDTLSNTAIDVTKH 391 Query: 476 ADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 A+ R+ + ++++ V D++ ++ + + AW + + Sbjct: 392 AEGYIPGRVYLMEGSAIANTLLCVA--DGDSASMYTYRVFWNGQEKIQSAWSRWTFDNSY 449 Query: 536 YVLSAASFPNDNRGGTSLWMLVALSA 561 + ++LV + Sbjct: 450 -------IDGVKVINDTAYVLVTHND 468 >gi|254503713|ref|ZP_05115864.1| hypothetical protein SADFL11_3752 [Labrenzia alexandrii DFL-11] gi|222439784|gb|EEE46463.1| hypothetical protein SADFL11_3752 [Labrenzia alexandrii DFL-11] Length = 634 Score = 147 bits (370), Expect = 5e-33, Method: Composition-based stats. Identities = 48/386 (12%), Positives = 100/386 (25%), Gaps = 31/386 (8%) Query: 193 TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS-----IGAYIVADDKVYRSLTTGRSGDR 247 T D + + +Y+ ++ K Sbjct: 48 TEQTVQAYDADVQTYLSQIPENLSFVTVADYTFVVNRTTEVVMDPSKTAPGTFRDSVQLF 107 Query: 248 FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQ 307 AT I+ SA G++ + + Sbjct: 108 SDLPGSATDGDVYRISNGASPLDDYYVKYVSADTEWVECAKPGEVIGFDAKTMPHQIVRE 167 Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSF 360 F S E PS V F NRL F + + S Sbjct: 168 EDGSFSVSRVEWSDRQVGDAESVKDPSFVGRAFKDIFFFKNRLGFVSDENT----FFSQA 223 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-- 418 F++ D D + A + + + W+ PF + + D + + L+ S Sbjct: 224 ADFFNLWPDQANVVGDS-DPVDIAASTTKVTILQWVVPFRRALFLSADLAQFELASSDFM 282 Query: 419 SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ---GFRFNEITQL 475 + S C P ++GD L F + + ++T+ Sbjct: 283 TPTSVAVDLATSYEATNLCRPTTLGDELYFAAEKQGKTVIYEYFYDDDTLSNTAIDVTKH 342 Query: 476 ADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 A+ R+ + ++++ V D++ ++ + + AW + + Sbjct: 343 AEGYIPGRVYLMEGSAIANTLLCVA--DGDSASMYTYRVFWNGQEKIQSAWSRWTFDNSY 400 Query: 536 YVLSAASFPNDNRGGTSLWMLVALSA 561 + ++LV + Sbjct: 401 -------IDGVKVINDTAYVLVTHND 419 >gi|320158424|ref|YP_004190802.1| tail tubular protein B [Vibrio vulnificus MO6-24/O] gi|319933736|gb|ADV88599.1| tail tubular protein B [Vibrio vulnificus MO6-24/O] Length = 931 Score = 142 bits (357), Expect = 2e-31, Method: Composition-based stats. Identities = 60/546 (10%), Positives = 125/546 (22%), Gaps = 75/546 (13%) Query: 91 IVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH---------PPHH 141 + ++ YT+ Y G Sbjct: 234 VYFEVAADVSVSITDNSHATVEYTYHQ----TYWESGDRKWVTKTAKWAETEPLTGLTQM 289 Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 + IQ + ++ V + TS Sbjct: 290 MASIQTAPLTPSSPQGFVWIRQADYSVNYDITVNGTKCSITTPEATSDQARAGLNSSKMT 349 Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261 D I + + AD++ + + T + Sbjct: 350 DDLVAQINKATSTHGCVASRIGNTIHIRAADNQEFDLEVSDGLYGEALKMAKGTVEDQTD 409 Query: 262 ITWITVL-------------NLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQS 308 + V N + +G + + + Sbjct: 410 LPPDGVGDHVLHVVGKADSENDGYYVKWVDKTSMWTESTAYGLANEFNPASMPHILRRHQ 469 Query: 309 QTLFQAGVSVVS---------WFMSAWGEQ--------------------EGYPSHVTFH 339 + + + W G++ E Y S + F Sbjct: 470 DSSKVSVDNPYGIYFKLEQGVWSKRTVGDELSAPIPSFVSTQDESGAMTQERYISAMAFF 529 Query: 340 NNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 RL G S G ++F D A TIH P Sbjct: 530 RGRLWLLGG----DYACGSVVGDKFNFFRSTALTVLDDDPIDGYTDLTGQAETIHAAIPS 585 Query: 400 GEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK 457 +G++V + +L+S S R S + C PV +GD + F Sbjct: 586 SDGLVVFTERGQYLISSQGMMSPTTFEFTRIASYATDNRCDPVLIGDRISFATKTSEYTS 645 Query: 458 YISGSTEQG---FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFP--RLL 512 + NE+T + +L+ ++ ++ + + Sbjct: 646 VSEMYVADTTGVRKANEVTSHCPTYIEGSVHRLLANATSNTEFLIMRGQGETLTGRMFIY 705 Query: 513 GCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWML-VALSAGEERSF-TVR 570 + AW + V + + L+++ V ++ +++ R Sbjct: 706 DFLMNGNERVQSAWSQWTFNGAVVVDGVLT-------SSELYLVMVRATSDKDKRMTVER 758 Query: 571 LNLLDD 576 ++L+ D Sbjct: 759 IDLVQD 764 Score = 68.0 bits (164), Expect = 4e-09, Method: Composition-based stats. Identities = 22/270 (8%), Positives = 55/270 (20%), Gaps = 26/270 (9%) Query: 18 PRLLQSRKDLS---LHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP---RSNRVFS 71 P ++ + + N + P + D + Sbjct: 8 PDMIGGVSQQAPLMRFPNQAEEQINCKNSPVTGVSKRPNTKHVADIAGSFLDYARMKTHI 67 Query: 72 FSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTA 130 + L+ D +++ + Y + ++ + G Sbjct: 68 IDRDETERYLIGILDGEIRAWDLMTGVQYDIEGGQNVNYLRAGSVPARQAYKAMTLGDDT 127 Query: 131 VFVHKDHPPHHLLYIQD-------------GDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 ++ P ++ + P G + Sbjct: 128 FILNTTMPVTMDYTKREGVPETEAKTKHMRIAFSGIDVSKPVASNPYDYGRNTYNSFSVL 187 Query: 178 AKLSISQADTSTARITSDMKIFKPLD-KGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 + + + +M I + W T+ Sbjct: 188 SAMYSGVIYVGDKTLPYNMPINDNSPRVILEMLKKGGINAWLNGTSVYFEVAA-----DV 242 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 T S Y+ TY + + W+T Sbjct: 243 SVSITDNSHATVEYTYHQTYWESGDRKWVT 272 >gi|281306691|ref|YP_003345497.1| predicted phage tail tubular protein B [Pseudomonas phage phi-2] gi|271277996|emb|CBH51602.1| predicted phage tail tubular protein B [Pseudomonas phage phi-2] Length = 777 Score = 134 bits (336), Expect = 4e-29, Method: Composition-based stats. Identities = 52/591 (8%), Positives = 142/591 (24%), Gaps = 48/591 (8%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYA 80 + + + N+ L ++ V ++ GG A Sbjct: 15 VSQQAAQDRLPGQLQAQINMTSDLVAGLRRRASVEAVTAVGTFTDVKSVRQYNTDIGGTA 74 Query: 81 LLVF---GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH 137 + + + +++V + + +SL + + Sbjct: 75 VSLICDAVNGTIKVVEEATGVALADFQHDY-----LKAAVARSLRLVTLNDAVWLCNVEQ 129 Query: 138 PPHHLL---YIQDGDKISFTFDEIKFLPPPWLGD-GMISGVKSNAKLSISQADTSTARIT 193 P + + D + + + + + T + + Sbjct: 130 KPVVSVAADRSKYPDPSHWGYYYVAAGAFQKAYTLTITDRSVDPPTSNTVTYTTPVSTVA 189 Query: 194 SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGY--- 250 F R + + K S T G + R Sbjct: 190 EATPEFITNRLAELARAAWTAYGVTITVEGTFASIQCTTAKPTISTTAGSAYMRCSNAMS 249 Query: 251 ----SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 ++ + I S+ + A + +D+ Sbjct: 250 IRDAAELPARLPLVMNNIIVATGASNTKVFYRYNDAEKRWIEDASWEDLKDLSNLPLRMT 309 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYL 357 Q +T + + + A G+++ P + + RL+F ++ L Sbjct: 310 QDETTDEYKLEAPVYERRAAGDEKSNPLLKFITQGITGMAAFQGRLVFLSNE----YACL 365 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 S+ F + A + + F + +++ ++ + Sbjct: 366 SASDNPLRFFRST-LSTVADNDPIEVAAQGSLTAPYEYALNFNKDLVMFSRHYQGIIPGN 424 Query: 418 L--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVG-RRIKYISG----STEQGFRFN 470 + + + P + G + F + + + + Sbjct: 425 SMVTPRTANVALMTRYEVDTSAEPTAAGRSIFFGAPRSLGYVGVHEMTPSQYADSQYVAD 484 Query: 471 EITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 ++T +V + +V D++ + ++ + +WH Sbjct: 485 DVTSHIPRYIQGPWRFMVSSTTSN---IMVAGTADHNELVIHEYLWNQSEKVHQSWHKWK 541 Query: 531 ISDKHYVL--SAASFPNDNRGGTSLWML---VALSAGEERSFTVRLNLLDD 576 + S SL++ + AG+ RL+ + Sbjct: 542 FAWPVIDAYFSGDVLICLFGVEGSLYLCRIDLQRGAGDISPTVPRLDFFTE 592 >gi|291335885|gb|ADD95480.1| T7-like tail tubular protein B [uncultured phage MedDCM-OCT-S08-C41] Length = 914 Score = 133 bits (333), Expect = 1e-28, Method: Composition-based stats. Identities = 58/445 (13%), Positives = 126/445 (28%), Gaps = 45/445 (10%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS-TARITSDMKIFKPLDKG 205 D + ++ + ++ T+ ++ F+ + G Sbjct: 289 DYPINVDKIETVQVRASIKAVRPDPTPFDQQTNVTPDSILGGITSELSGTNINFEVIGNG 348 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG-YSKGATYVKDNNITW 264 + N+++ A V +G F V +++ T Sbjct: 349 IYFY--------SNTVNFTVEAQNTDLMSVITDQVNDVTGLPFQCKHGYIVKVSNSSSTD 400 Query: 265 ITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS 324 S G+ G ++ + Q F + Sbjct: 401 DDYYLRFEGNGGGSGPGSWVECAEPGIADTINPLTVPPVIQRQGNGQFIVKRFGYAQRTV 460 Query: 325 AWGEQEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377 PS+ V F NRL F + V LS G +F ++ Sbjct: 461 GDTNTNPEPSYIGKTINKVLFFRNRLAFLSDEN----VILSQPGDLGNFFVNTAL-TVSG 515 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRR---VSGSGV 434 T + + + + + G++V +LL+ R + + Sbjct: 516 TDPIDISCSSKYPAILFDAIEVNTGLIVFAANQQFLLATDSDILNPETARLSSISTYNYN 575 Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGST---EQGFRFNEITQLADHLFNQRILQLVYQE 491 A PP S+G F+ G ++ S E NE++++ ++ I L Sbjct: 576 TAVPPFSLGTVAGFLDNAGSHSRFFVMSNVAREGEPNVNELSKVVSTALSKNIDLLADSR 635 Query: 492 EPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGT 551 E +I + K+++ A+ + +W ++ Sbjct: 636 ENTTIFF---GKKNSAEVFGYKYFNVADKQIQSSWFRWKLARPVVYHCCV---------N 683 Query: 552 SLWMLVALSAGEERSFTVRLNLLDD 576 ++ V +++F ++NL+ D Sbjct: 684 DTYIFVD-----DQNFLQKINLIRD 703 Score = 81.1 bits (198), Expect = 5e-13, Method: Composition-based stats. Identities = 50/343 (14%), Positives = 85/343 (24%), Gaps = 38/343 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + T T +F G + D + V + N IP L P Sbjct: 1 MPSITQTIPNFFGG------ISKVPDSQMGQGQVKDALNCIPDLNKGLYKRPGAMRVGTS 54 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L ++ F + V S A G Y + Sbjct: 55 ALSGATSTGVWFHY-----YRDEIEGSYIGQVQSNGSVNMWDADTGNAITVNYESGQQSN 109 Query: 121 --------------LEYAVFGSTAVFVHKDHPPHHL-LYIQDGDKISFTFDEIKFLPPPW 165 L++ + V+++ + D+ T+ L Sbjct: 110 LQSYLSNGTIGTETLQFTTINDSTFVVNRNVTAAMQPTSVSKTDEKPHTYSAFIELKRTQ 169 Query: 166 LGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI 225 G + S + T+T + G H P T Sbjct: 170 NGRQYGLNIHDPTSSSTTTIATATQVAANPTGEGYSSFGG----NTGHCP--FVGTKVFT 223 Query: 226 GAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 A + V+R TG+ G G++ + D T+ L+L + A Sbjct: 224 KNQGSATNLVFRLTVTGQQGPTPGHNDESPEAADYTCTYSHRLDLLHGGEGWAVGSAGTV 283 Query: 286 YY------VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWF 322 + D + + SI T F +V Sbjct: 284 TLEGKDYPINVDKIETVQVRASIKAVRPDPTPFDQQTNVTPDS 326 >gi|18640503|ref|NP_570344.1| tail protein A [Synechococcus phage P60] gi|18478733|gb|AAL73282.1| tail protein A [Synechococcus phage P60] Length = 680 Score = 130 bits (326), Expect = 6e-28, Method: Composition-based stats. Identities = 44/392 (11%), Positives = 91/392 (23%), Gaps = 35/392 (8%) Query: 166 LGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI 225 +S S S T + + + F G IR+ P S Sbjct: 289 TAQFTTPVDQSGGGASTSDIVTGLSAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSA 348 Query: 226 GAYIVADDKVYRSLTTGRSG------DRFGYSKGATYVKDNNITWITVLNLSSKTSRESA 279 + + + + Sbjct: 349 RGGTSGTGLESIKYSVDTLAELPTKCWNDYQVAVRNTQDTEVDDYYVKFETDVEDADVPG 408 Query: 280 SGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAG-------VSVVSWFMSAWGEQEGY 332 SG GD + D + + F + + + Sbjct: 409 SGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTNPH 468 Query: 333 PSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383 P+ + + NRL F +V +S G +++F D + Sbjct: 469 PTFTESGNGIYGMFMYKNRLGFLTQ----DAVIMSQVGDYFNFYATSGVTISDA-DPIDM 523 Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI---SLSKGLSIDFRRVSGSGVYACPPV 440 A +D + G ++ + + + LS S + + + + PV Sbjct: 524 ATSDTKPVKLEAAISSTSGAILFGNQAQFRLSSPDESFGPKTATLDKISNYTYESKADPV 583 Query: 441 SVGDCLVFVCGVGRRIKYISGSTEQGFRFN---EITQLADHLFNQRILQLVYQEEPHSIV 497 G ++F +G STE + +++ L + ++ Sbjct: 584 QTGVSMIFPTNMGTYSSVYELSTESAKGTPVIEDSSRVIPRLIPSGLTWSTASMNNDTV- 642 Query: 498 WVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTH 529 K + + W T Sbjct: 643 -FFGNAKKGRNVYVFRFFNEGQERKVAGWTTW 673 Score = 97.7 bits (241), Expect = 5e-18, Method: Composition-based stats. Identities = 28/332 (8%), Positives = 79/332 (23%), Gaps = 23/332 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + G + + D V ++RN+ + P + Sbjct: 1 MAAVEQMVPNLLGG------ISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQV 54 Query: 61 RLDPRSNRVFSFSIPDGGYALLVF--------GDKKLQIVVVRSSTKWSPALFGKTY--K 110 P+ + + + GD ++++ +++ + + + G Sbjct: 55 TGIPKRAKWIPIMRDAREHYYVAIYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEY 114 Query: 111 TPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGM 170 P D +++ G + + P I Sbjct: 115 FPGDETDWEAIRSLTIGDYTFLSNPNVQPTTWSRSFSRRPE--GLVTIGAAGYGTSYIVD 172 Query: 171 ISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + S + + + + P + G + + + A++V Sbjct: 173 FATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLV 232 Query: 231 ADDKVY-----RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 D + Y +T G+ V + W + ++ + G Sbjct: 233 DDGEEYGHNYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQF 292 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317 + ++ + + Sbjct: 293 TTPVDQSGGGASTSDIVTGLSAAINGLGTFTA 324 >gi|291334514|gb|ADD94167.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291336446|gb|ADD96001.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 153 Score = 126 bits (317), Expect = 8e-27, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 62/137 (45%), Gaps = 14/137 (10%) Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505 +F+ R+++ ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V Sbjct: 1 MFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRN--- 57 Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEE 564 +L+G + E + AWH H+ S A+ P D+ W++ + G Sbjct: 58 --DGQLVGLTYQREQQ-VVAWHRHIFGGSAVCESVATIPTDD-SEYQTWVINKRTINGST 113 Query: 565 RSFTV-----RLNLLDD 576 + + + + DD Sbjct: 114 KRYVEYIHQYKFDETDD 130 >gi|291334718|gb|ADD94364.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] Length = 135 Score = 126 bits (316), Expect = 1e-26, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 62/137 (45%), Gaps = 14/137 (10%) Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505 +F+ R+++ ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V Sbjct: 1 MFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRN--- 57 Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEE 564 +L+G + E + AWH H+ S A+ P D+ W++ + G Sbjct: 58 --DGQLVGLTYQREQQ-VVAWHRHIFGGSAVCESVATIPTDD-SEYQTWVINKRTINGST 113 Query: 565 RSFTV-----RLNLLDD 576 + + + + DD Sbjct: 114 KRYVEYIHQYKFDETDD 130 >gi|332800733|emb|CBY88573.1| hypothetical protein [Pantoea phage LIMEzero] Length = 808 Score = 123 bits (307), Expect = 1e-25, Method: Composition-based stats. Identities = 56/600 (9%), Positives = 130/600 (21%), Gaps = 71/600 (11%) Query: 18 PRLLQSRKDL---SLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS------NR 68 P LL V N+ L P + D + Sbjct: 9 PTLLGGVSQQVYTERQVGQVETQVNMTSDTVRGLRKRPGTRLVLDVSGEDTQWSLGNTGH 68 Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTY--KTPYT-FKDNKSLEYAV 125 + F+ L +G + + + PY + + +A Sbjct: 69 LRQFTAD------LGWGQTSFVVNTITGTVSAIQEADVMQVLGTKPYLVTSNPSDIVFAT 122 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 GS + D P + + F+ G V + Sbjct: 123 VGSELYVGNCDVLPATVTNESRWNP---RLGGYFFVLSGAYGKVYSVTVSWGTVSYTASY 179 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPPEW------------------AKNTNYSIGA 227 T A T + + + + Sbjct: 180 TTPQASDTDASNQSTGEYIINQLVNSLSSQVSSSVLNLASDGSYLSFRLQSGYDSDDVLL 239 Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287 + Y + S D I + Sbjct: 240 VTTSTGSTYAIASKAHSAKSTDDLPARVPFNDGFIMTVGDTGSYQYFQWLVGESRWQECG 299 Query: 288 VWGDIKDVSKDGRSISVAPQSQ-TLFQAGVSVVSWFMSAWGEQEGYPSH----------V 336 +G + + + + + + P + Sbjct: 300 KYGSPTGLDPGTMPLKIIASDTENQHEFSAVEWGGREAGDDDNNETPQFLLEDGVGMTGM 359 Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM 396 + RL+ S+ A+ + + T FS ++ + Sbjct: 360 SAFQGRLIIFSG---PYISMSSNVRAYRTYFYRTTVTQVLDGDRIEFTSTSFSGASFRYG 416 Query: 397 HPFGEGVLVGCDTSLWLLSISL---SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVG 453 PF +++ +T ++ + + + P+ G L + Sbjct: 417 VPFNSDLILASETHQGVIPGRNQVLTPNNATAVLTSAYQMNTDVSPLVCGRSLYYSYPRS 476 Query: 454 RR---IKYI--SGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSF 508 IK + SG T+ + ++T + + +V + + + Sbjct: 477 TSSFAIKELTPSGYTDLQYVSQDVTDHIPTYLEGAASYICSSTTNNIVV--IGSTTELNT 534 Query: 509 PRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFT 568 + +SA+ + +WH + + +L +L+ + Sbjct: 535 LYVNEYMWSADSKVQSSWHKWTFNGTIHC--------AWFVRENLLLLIEQDNAMHLVYL 586 >gi|310005669|gb|ADP00057.1| tail tube B [Cyanophage 9515-10a] Length = 1000 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 53/437 (12%), Positives = 114/437 (26%), Gaps = 54/437 (12%) Query: 161 LPPPWLGDGMISGVKSNAKLSISQAD---TSTARITSDMKIFKPLDKGRSIRLGCHPPEW 217 +S K S A + SD+ G + L Sbjct: 334 TYQGVSSIAYYKTAQSPDKGRASMATILKGLETAVNSDLANVTAEIYGSGLYLYGSAAPN 393 Query: 218 AKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE 277 ++ + + + S + GY ++ + + Sbjct: 394 VNFLGGAVNEAMNVFGNTAQDVARLPSMCKHGYIVQVANSENVD--ADNYYVKFLADNGS 451 Query: 278 SASGAVAPYYVWGD--------IKDVSKDGRSISVAPQSQTLFQAGVSV----------- 318 SG + +K + ++ F Sbjct: 452 GGSGKWEETVRPHNFSSGSDPMVKGLDPATMPHALVNNRNGTFTFKKLDETTANADNTDN 511 Query: 319 -VSWFMSAWGEQEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG 370 + E +PS + FH NRL F ++ V +S G +++F + Sbjct: 512 YWKYREVGDDETNPFPSFKGLEIQKIFFHRNRLGFVANEQ----VVMSRPGDYFNFFVVS 567 Query: 371 EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRR-- 428 D + V+D + I+ + P +GV++ D ++L R Sbjct: 568 AITTSDDN-PIDITVSDIKPAFINHVLPVQKGVMMFSDNGQFILFTESDIFSPKTARLKK 626 Query: 429 -VSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEI---TQLADHLFNQRI 484 S A PV +G ++F V + + +I T++ + + Sbjct: 627 ISSYECDDALQPVDMGTSVMFSSSVSAYTRTYEATVVDDDVPPKIVEQTRVVPEFLPKTV 686 Query: 485 LQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFP 544 S+ V + + S E AW++ + + + Sbjct: 687 DTTA---NTTSLGIVSYGETNTNELYHYKYFDSGERRDQSAWYSWTLQGTLQYMVYTAG- 742 Query: 545 NDNRGGTSLWMLVALSA 561 + +++ Sbjct: 743 -------TFYVVTKQDN 752 Score = 83.4 bits (204), Expect = 1e-13, Method: Composition-based stats. Identities = 31/312 (9%), Positives = 67/312 (21%), Gaps = 19/312 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +F G + + D + N +P L+ P + + Sbjct: 1 MAAINQRIPNFLGG------VSQQPDTIKFPGQLRVCDNAVPDVTFGLMKRPPGEFVKTL 54 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKK-------LQIVVVRSSTKWSPALFGKTYKTPY 113 S + L+ ++I + + + S Y Sbjct: 55 TNANASGYWYDILRDGDEKYLVQMTASSSYSGTKPIRIWNLLTGVEQSLTNSNGDSLFQY 114 Query: 114 --TFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMI 171 K + + P + + + F + Sbjct: 115 MQQTGTTKPYAIQSVQDYTIITN----PQKTIGTDGNTAVPLNSGDYAFARLDTIAYNTE 170 Query: 172 SGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVA 231 + + + S + T+ G + +A +S Sbjct: 171 YVLYTGSAPSANTYYRVTSLKVDYTNTQGGSAVGSTWDDTNEDGRYAGQLGFSFSGGSAV 230 Query: 232 DDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 + T G + +D + S+ G V+ Y Sbjct: 231 TIPGGQVATEDVEGTLLINGQSFITSQDPQYQADDSSSTSTSGDGSDFIGYVSDYDTRYT 290 Query: 292 IKDVSKDGRSIS 303 K+G I Sbjct: 291 ATVTLKNGGIIK 302 >gi|167841461|ref|ZP_02468145.1| tail tubular protein B [Burkholderia thailandensis MSMB43] Length = 853 Score = 120 bits (301), Expect = 6e-25, Method: Composition-based stats. Identities = 66/638 (10%), Positives = 143/638 (22%), Gaps = 108/638 (16%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPL---MQE- 56 M + S + G + + H + + N+I P Sbjct: 1 MAKVVGSYASVTRG------VSEQVPQDRHPGQMWEQVNMISDPVVGCARRPGSLLTDYK 54 Query: 57 -------YRDCRLDPRSNRVFSFSIPDGGYALL-------------VF-----GDKK--- 88 + D R R F+F YALL F D + Sbjct: 55 VLTAASSLDSLKADIRMYRTFTFFHNSKEYALLYRSDVAACPAALPAFLCYCKTDSRFLS 114 Query: 89 LQIVVVRSSTKWSPALFG---------KTYKTPYTFKDNKSLEYA--VFGSTAVFVHKDH 137 + + W + +A A + Sbjct: 115 VVLADPDGMAPWVTGGVSALCTVGDYIAIAANKLGPGYSLDDRFAGHNMRGVAWVRGGAY 174 Query: 138 P---PHHLLYIQDGDKISFTFDEIKF------------LPPPWLGDGMISGVKSNAKLSI 182 + DG + + + + P + V Sbjct: 175 SRTYTLKITRRSDGVQFTAAYTTMASSYPYLLNTSDIPSSAPDYQKQINDRVNDYNSKVN 234 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA-YIVADDKVYRSLTT 241 + A I K +S +I Sbjct: 235 QWIGDAQASIQPQNIAEKLRAALQSQGFTNCDRRGGTVILDNISFMSCDDGGDGTTFRAV 294 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD---------- 291 + D G + ++ +G W + Sbjct: 295 FNTLDDVGKLSSIHWNDKPIQIKSNTQVDPYYMVFKTDTGEGYGTGKWVEGPAQVVQPGQ 354 Query: 292 ---IKDVSKDGRSIS--VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346 + ++KDG + + P + V + S G+++ + F R+ Sbjct: 355 VFAVGGITKDGDTFAIGSGPAQLNAYSTDFQVPKFAGSVCGDKDQTGAIPYFFGKRISLL 414 Query: 347 GSKGDE------LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400 D +V++S G +++F +D + + I + Sbjct: 415 AMFQDRLVIVSDGTVFMSRTGDYFNFFRKTMLSVHDD-DPIQAYALGAADDVITRCVTYN 473 Query: 401 EGVLVGCDTSLWLLSIS--LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY 458 + + + + + + + S + C PV G+ + + V Sbjct: 474 KNLFLFGLRNQYTIPGNVAASPANITISPVAAERDAILCQPVVHGNIVFYGSQVASN-GD 532 Query: 459 ISGS----------TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSF 508 + S + +I++ R ++ P +L D Sbjct: 533 VPYSGIINQFQLGLFQDIPETFQISKQLSRYIKGRPTEMATVSAPP----ALLVRADGYD 588 Query: 509 PRLLGCRFSA----EGEGDFAWHTHMISDKHYVLSAAS 542 + + +W SD +++ S Sbjct: 589 NGFYVYTYLDAPGTQQREFDSWSRWEFSDALGIVAGVS 626 >gi|310005781|gb|ADP00167.1| tail tube protein B [Cyanophage NATL2A-133] Length = 985 Score = 120 bits (299), Expect = 9e-25, Method: Composition-based stats. Identities = 67/494 (13%), Positives = 138/494 (27%), Gaps = 58/494 (11%) Query: 86 DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN---KSLEYAVFGSTAVFVHKDHPPHHL 142 D + + + + + YKT YT + L STA+ H D + Sbjct: 241 DSNTANYDGGGTAQSNFLGYTQNYKTRYTAQIVLKDGGLIKTGSESTALSRHHDITIEGI 300 Query: 143 LYIQDGD-----KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMK 197 Y + I F P D + + + ++S +TS++ Sbjct: 301 SYRVKVKAVEEVDTYESVSGIAFHRTPKNPDKGKLSMTNLISALHASINSSLNNVTSEV- 359 Query: 198 IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYV 257 G + L ++ + + ++ S + GY Sbjct: 360 ------IGSGMYLYGSAAPTVNFLGGAVNENMNIIGNTAQDVSRLPSQCKHGYIAQIANS 413 Query: 258 KDNNITWITVLNLSSKTSRESASGAVAPYYVWGD--------IKDVSKDGRSISVAPQSQ 309 ++ + + SG+ + +K + ++ Sbjct: 414 ENVD--ADNYYVKFYADNGVQGSGSWEECVRPHNFSAGSDPMVKGLDPANMPHALVNNRN 471 Query: 310 TLFQAGVSV------------VSWFMSAWGEQEGYPSH-------VTFHNNRLLFSGSKG 350 F + +PS + FH NRL ++ Sbjct: 472 GTFTFKKLDETTANADSNDNYWKYREVGDDITNPFPSFKGLKISKIFFHRNRLGLIANEQ 531 Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS 410 V +S G +++F + D + V+D + I+ + P +GV++ D Sbjct: 532 ----VVMSRPGDYFNFQIVSAITTSDDN-PVDITVSDIKPAFINHVLPIQKGVMMFSDNG 586 Query: 411 LWLLSISLSKGLSIDFR---RVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGF 467 +LL R S A P+ +G ++F V + + Sbjct: 587 QFLLFTESDIFSPKTARLKKLSSYETYPALDPIDMGTSVMFTSNVSAYARAFEATIVDDD 646 Query: 468 RFNEI---TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 +I T++ + I I V K++S + + Sbjct: 647 IPPKIIEQTRVVPEFIPKDITISTVSSA---IGIVSFGKKNSSEIYHYKYYDAGDRRDQS 703 Query: 525 AWHTHMISDKHYVL 538 AW++ + K Sbjct: 704 AWYSWTVQGKLQHC 717 Score = 87.3 bits (214), Expect = 6e-15, Method: Composition-based stats. Identities = 31/304 (10%), Positives = 68/304 (22%), Gaps = 16/304 (5%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +F G + + D + + N +P L+ P + + Sbjct: 1 MPAINQRIPNFLGG------VSQQPDTIKYPGQLRVCDNAVPDVTFGLMKRPPGEFVKTL 54 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGD-------KKLQIVVVRSSTKWSPALFGKTYKTPY 113 + L+ K ++I + + + S Y Sbjct: 55 TNANADGYWYEILRDGDEKYLVQMTALSSYSGTKPIRIWNLLTGVEQSLTNSNGDSLFSY 114 Query: 114 TFKDNKSLEYA--VFGSTAVFVHKDHPPHHLLYI-QDGDKISFTFDEIKFLPPPWLGDGM 170 + ++ YA + + ++ F + + Sbjct: 115 MEQSGTTIPYATQTIQDYTIISNPHKTVTTTGTTDAPLANGNYAFARLDTIAYNTEYILY 174 Query: 171 ISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 A S + T+D + +K G V Sbjct: 175 TGSTAPAANKYYRVTALSVDKGTNDGNTWDDTNKDGRYAGLAQFSFSDSLCEDVEGHVTV 234 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 S T G S Y ++ + + L ++ S + A Sbjct: 235 NAASYVDSNTANYDGGGTAQSNFLGYTQNYKTRYTAQIVLKDGGLIKTGSESTALSRHHD 294 Query: 291 DIKD 294 + Sbjct: 295 ITIE 298 >gi|282554622|ref|YP_003347639.1| tail tubular protein B [Klebsiella phage KP34] gi|262410455|gb|ACY66719.1| tail tubular protein B [Klebsiella phage KP34] Length = 786 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 49/563 (8%), Positives = 138/563 (24%), Gaps = 48/563 (8%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCR-LDPRSNRVFSF--SIPDG 77 + + + N++ + P + + +P + +F+ Sbjct: 16 VSQQVPRERQPGQLGAQLNMLSDPVSGIRRRPPGEIVWESTIDNPGLDSLFTEYVERGTD 75 Query: 78 GYALLV-FGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD 136 G LL+ + ++ T + T SL+ A ++ + Sbjct: 76 GRHLLINTSNGNWWLLAKNGKTILNSGNDPYFVTTVGQT----SLQTASIAGLTYILNTE 131 Query: 137 HPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDM 196 P+ + S T + S + + + Sbjct: 132 MAPNTTVDNTGRIDPSTTGFFYVKSAAFQKRWNVTVTSAGVD-YSGDYTAPAAGSTSGNA 190 Query: 197 KIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATY 256 + + +R GAY+ +++ G S + Sbjct: 191 EEVSGAYVAQQLRDSLVANGLPAGNVSVRGAYLFFYGLSNCVVSSDAGDTYAGVSNQSRV 250 Query: 257 VKDNNITWITVLNLSSKTSRESASGAVAPY-------YVWGDIKDVSKDGRSISVAPQSQ 309 ++ ++ R + + + W ++ + ++ + Sbjct: 251 DQEQDLPAQLPAQADGAMCRVGTASSETAWYQFSYSTRTWSEVGAYGSITKITNMPRELA 310 Query: 310 TLFQAGVSVVSWFMSAWGE--------QEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFG 361 ++ + + GY + + RL+ V +S+ G Sbjct: 311 ADDNIIARDWEGRLAGNDDNNSDPGFVENGYITGIAAFQGRLVLLSGSS----VDMSASG 366 Query: 362 AFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS--LS 419 + F T ++ + S F +++ ++ ++ S L+ Sbjct: 367 LYQRFYRSTV-TSLLDTDRISISSASAQDSVYRTAVQFNRDLVLFANSMQAVVPGSVVLT 425 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVC-GVGRRIKYIS----GSTEQGFRFNEITQ 474 + + PV G +++ + T + + T Sbjct: 426 PTNASISITSTYDCDSRVTPVMAGQTVIYPNKRNDSYAGILELIPSPYTAAQYTTQDATV 485 Query: 475 LADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRF--SAEGEGDFAWHTHMIS 532 R+LQ+ + + + + + S + AWH Sbjct: 486 HLPRYIPGRVLQMQNSSVTNMA--FSRMSGERNSLLVYEFMWGGSDGAKMQAAWHKWSFP 543 Query: 533 DKHYVLSAASFPNDNRGGTSLWM 555 + +++ Sbjct: 544 --------YPILSVQALEDEVFL 558 >gi|13186158|emb|CAC33469.1| hypothetical protein [Legionella pneumophila] Length = 818 Score = 117 bits (293), Expect = 4e-24, Method: Composition-based stats. Identities = 67/547 (12%), Positives = 140/547 (25%), Gaps = 84/547 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M ++F+ GEL P L +R DL ++ +G K RN+I L G P Sbjct: 1 MP-IRSISNTFNRGELDPTLF-ARDDLDIYDKGARKLRNMIALWTGAARIAPGTIYVDMM 58 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + L K + Y Sbjct: 59 VD------------------------------RENGNAVIQDPLMVKGFDFTYDAD--AE 86 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + Y + ++ + + + Sbjct: 87 ITYTI----------------IIRKSGTNIAFDIYYADAL-------------QTTVTST 117 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + + + L + IR A ++++S+ + Y Sbjct: 118 AYLATQIQDIHVAAAHDRVLILHENVQIRQLK---RGASHSSWSLTTFEPRVYPTYDFSV 174 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 G + + ++ + + + + + S Sbjct: 175 IGEATNYQSFTFTLSATTGSITITSSSAVFTHNHVGGLFRSLGGTARITAVASTTSASAT 234 Query: 301 SISVAPQSQTLFQAGVSVVS-W----FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 + + W + G+P+ F+ NRL+ S + V Sbjct: 235 VLDNFTGTSCAGNLSSLAEKLWNSDTTTAPVSANRGWPARGVFYLNRLILGRSLAVKNLV 294 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 LS+ G + +F D A + ++ + + +L L+ S Sbjct: 295 NLSTAGVYDNFD----DADLDGLVAFSVTFNGKGEQSVQSIVA-DDSILFTTANKLFAQS 349 Query: 416 I---SLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQG-FRFNE 471 S ++ F S S + S+ + +FV ++ ST G + Sbjct: 350 PLVESPITINNVYFAPQSQSPATSIEAASIDNQTLFVSSDRTKVMQAMYSTADGKYITLP 409 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T L++ + + + EP I L +L + + W Sbjct: 410 ATMLSNSIVDY--INSNGTWEPAGIS-TRLYLATQDNGTMLLYSTL-QTQNVAGWSLRTT 465 Query: 532 SDKHYVL 538 + K + Sbjct: 466 TGKFRQV 472 >gi|308071881|emb|CBW54802.1| putative tail tubular protein B [Pantoea phage LIMElight] Length = 774 Score = 116 bits (290), Expect = 9e-24, Method: Composition-based stats. Identities = 53/558 (9%), Positives = 130/558 (23%), Gaps = 44/558 (7%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPL--MQEYRDCRLDPRSNRVFSFSI--PD 76 + + V+ N++ L P + +N + D Sbjct: 16 VSQQVPRLRLDGQVSTQENMLADPVTSLRRRPGAPLTVIHSLGTITDTNLYTQYVERGSD 75 Query: 77 GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD 136 G ++ ++ ++ + + SL+ G ++ Sbjct: 76 GRTLIINTSTGNWWVMNKDATAVLKSGQDAYFIASGGSS----SLQSTSVGGETFILNIQ 131 Query: 137 HPPHHLLYIQ--DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITS 194 P + D + F ++ + G + + A + Sbjct: 132 QAPQAIASTTKRDPSTTGWYFTKVGAFDKDYTLTIQRGGTTQTFTYHTPSSTDANAVAQT 191 Query: 195 DMKIFKPLDKGRS----IRLGCHPPEWAKNTNYSIGAYIVADDKVY-RSLTTGRSGDRFG 249 + I + ++ + S + Sbjct: 192 SPVYITSQLVQQMQAAGIEVHQQDMYIYVVGAATLVVTSTSGTSYVGYSGRHNVALITDL 251 Query: 250 YSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQ 309 + + T N + E AS + +G + ++ Sbjct: 252 PAVIPAGGDGILTSVGTDANALTWYRWEQASNSWVEDSSYGSPAALR------NMPRVLA 305 Query: 310 TLFQAGVSVVSWFMSAWGEQEGYPSH--------VTFHNNRLLFSGSKGDELSVYLSSFG 361 ++ P+ +T + RL+ + +S G Sbjct: 306 ADDTITAPDFEGRLAGDDLTNEIPTFLDQGVITGMTTYQGRLVLLSGA----FLTMSKSG 361 Query: 362 AFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKG 421 Y F + + + S + F +++ D ++S + Sbjct: 362 NPYRFYRSTV-TELQNSDRIDIGIGSSQNSILRRGIQFNRDLVLFGDAVQAVVSGGGNIL 420 Query: 422 LSIDF---RRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI-----SGSTEQGFRFNEIT 473 S V P+ G +++ + S T + + T Sbjct: 421 TPSTAAISLTSEESCVSKIAPMQAGQTVLYPFKRSSGYSGMLELIPSQYTSSQYVSQDAT 480 Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533 F + + V+ +D S + ++S++G+ AWH + Sbjct: 481 GHIPEYFAGDVRVTAASNVVNMCVFT--GSRDTSVIYVHEYQWSSDGKVQAAWHRWTMPQ 538 Query: 534 KHYVLSAASFPNDNRGGT 551 L A Sbjct: 539 PVVSLHFAREKLVIFTAD 556 >gi|282857736|ref|ZP_06266945.1| putative tail tubular protein B [Pyramidobacter piscolens W5455] gi|282584406|gb|EFB89765.1| putative tail tubular protein B [Pyramidobacter piscolens W5455] Length = 865 Score = 116 bits (290), Expect = 9e-24, Method: Composition-based stats. Identities = 58/437 (13%), Positives = 108/437 (24%), Gaps = 37/437 (8%) Query: 140 HHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS--QADTSTARITSDMK 197 Y + F ++ + P + S A A + +R+T Sbjct: 231 RGNTYSSLTTWKNKNFTDLPTIAPEGFACCISGSTGSAADDYYVRFVASGAASRLTWQNA 290 Query: 198 IFKPLDKGRSIRLGCHP------PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251 + + I + TN + + V TG + Sbjct: 291 EYPVGGVKKRIYVRSSEEPLFTENRLVSCTNLHGFTTRIKNVGVQVVTPTGGTPQYRYLV 350 Query: 252 KGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA-PQSQT 310 + T DN N ++T + G + + Sbjct: 351 EFETKFPDN--AGNLRFNTGTQTITGLSRGTWEECVAPDIPNKFVNATMPHLLVHDLEED 408 Query: 311 LFQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAF 363 ++ + S E +PS + + NRL F SV LS+ G Sbjct: 409 MWVFKPVNWAARSSGDAESAPWPSFIGKKITALFLYRNRLGFVAG----DSVSLSAAGDL 464 Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKG 421 F + + +V+ S I + + + + S S Sbjct: 465 ERFFPETV-QTLTDADPIDLSVSVDDYSDIRATVTVQDKLFFFSNRRQYTFSSPDALSPK 523 Query: 422 LSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR--IKYISGS-TEQGFRFNEITQLADH 478 + + S + VGD L F + ++ N +T Sbjct: 524 TAAVLPSTAYSCLPDIGLPVVGDRLYFATAYSAKMQVREYGVDPYTDNKTANPVTAHVAQ 583 Query: 479 LFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 L + + P + + L C S + AW + Sbjct: 584 LIPKG-ANMCLVASPTADCLAFFSSVYPNTLFLYQCYISGGNKLQSAWSRQTFN------ 636 Query: 539 SAASFPNDNRGGTSLWM 555 A+ N LW+ Sbjct: 637 --ATILNMAFRDNVLWL 651 Score = 82.7 bits (202), Expect = 2e-13, Method: Composition-based stats. Identities = 32/298 (10%), Positives = 70/298 (23%), Gaps = 17/298 (5%) Query: 18 PRLLQSRKDLS---LHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P L+ + N + L + P ++ F++ Sbjct: 8 PNLIGGISQQPAALRLNNQLEDQLNFVSSPAAGLQNRPALKYVSSSPYTGGGAF---FTL 64 Query: 75 PDGGYAL--LVFGDKKLQIVVVRSSTKWS--PALFGKTYKTPYTFKDNKSLEYAVFGSTA 130 L G L+I ++ + K P S + Sbjct: 65 DRDEQVRHNLWIGPDGLRIEDLQGNVKTVQYQGNALAYLSLPAGADKKNSYRILNIADSC 124 Query: 131 VFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTA 190 V++ P I + + LG ++ + +DTS + Sbjct: 125 FIVNRTKTPQ----IDQNSITESKNHALIHIKQVALGTTWSVTLQGKSVSYGYSSDTSLS 180 Query: 191 RITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGY 250 T + + + S+ + D + + G+ + Sbjct: 181 VSTEQVANELAN---ALLGDSTISAAFNIVHASSVISIERKDGGSFSIGLSDSRGNTYSS 237 Query: 251 SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQS 308 ++ I + S + S A Y + S+ + P Sbjct: 238 LTTWKNKNFTDLPTIAPEGFACCISGSTGSAADDYYVRFVASGAASRLTWQNAEYPVG 295 >gi|310005690|gb|ADP00077.1| tail tube protein B [Cyanophage NATL1A-7] Length = 1056 Score = 116 bits (290), Expect = 1e-23, Method: Composition-based stats. Identities = 50/411 (12%), Positives = 114/411 (27%), Gaps = 52/411 (12%) Query: 199 FKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK-VYRSLTTGRSGDRFGYSKGATYV 257 F I + + ++ +++ + +T+G + D F T Sbjct: 427 FSAEGIAEDIDQTGTYARSSNTITVTAASHGLSNGDQIILDITSGGATDGFYTIANVTTN 486 Query: 258 K----DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ 313 D+ I+ S T G G ++ I++ F Sbjct: 487 TFTVTDSASGTISAGETCSFTPARFGEGVWEEVVQPGKDIEIDNTTMPIALTRVLPGSFS 546 Query: 314 -------------AGVSVVSWFMSAWGE--QEGYPSHV-------TFHNNRLLFSGSKGD 351 S W+ G+ PS + F NR+ ++ Sbjct: 547 INGGGSQTYSNGAFRFSYPDWYKRDCGDDITNPEPSFIGQTIQKMVFFRNRIALLSAEN- 605 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 V LS FY+F + + + + ++ G+++ + Sbjct: 606 ---VILSRVNDFYNFWNKTAMAISNA-DPIDLQSSSTYPTKLYDAVEQAGGLVIFSASEQ 661 Query: 412 WLLSISL----SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG---STE 464 +LLS + + S + P+ +G + F+ + ++ S Sbjct: 662 FLLSSGAEALLTPETAKISYVSSHAFNPDTSPIELGTTIGFLNSTAKNTRFFEMAAVSQR 721 Query: 465 QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEP--KDNSFPRLLGCRFSAEGEG 522 + E ++ +LF + E +++ V ++ S Sbjct: 722 EEPTIVEQSKSIYNLFPVNTSMMTGSVENQMVLFGVDSTLYTASNEVWGYKFYVSEGRRS 781 Query: 523 DFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573 AW + + S ++ V + G +F + ++ Sbjct: 782 QSAWFRWTLPNNLVYHSII---------DDVYYAVLNT-GSTFTF-EKFDI 821 Score = 78.0 bits (190), Expect = 4e-12, Method: Composition-based stats. Identities = 40/411 (9%), Positives = 100/411 (24%), Gaps = 19/411 (4%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + T ++ G + + D V N +P L P Sbjct: 1 MASVTQKIPNYVLG------ISQQPDEKKFPGQVNDLVNGLPDVVEQLTKRPGSHLISAI 54 Query: 61 -RLDPRSNRVFSFSIPDGGYALL-VFGDKKLQIVVVRSST--------KWSPALFGKTYK 110 +++ F+ D + V D ++I + Sbjct: 55 SPSTAANSKWFTIYTRDDESYIGQVAADGGVKIFRCSDGVEIPVDYANIAGSGVATYLDN 114 Query: 111 TPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGM 170 T + + + ++ T FV++ I+ + Sbjct: 115 TALSDEKSSDIQALTINETTFFVNRRKTVEMKRDAASKSPTQPFEAYIQLDSIAYGKQYA 174 Query: 171 ISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + ++S ++ D+ + G + + + Sbjct: 175 LDIYDPSDNSTVSYTRATSIAADEDVSLDGTSSTGANQPGNGDCDGAGREYVTVSTGTSI 234 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNL--SSKTSRESASGAVAPYYV 288 + G++ R+ T D++ T + + + + Sbjct: 235 HSTSPPNASAGGKTNLRYEMDARCTPQPDDDHTDSEAQDNYHDTYQTYAKLQFGGEGWTT 294 Query: 289 WGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348 + S+ G + +V + ++ A L + Sbjct: 295 NDTHQHTSEKGLTTTVKITNHVTITTRANIAMVRPEATSSNAEEHVSADGILGELKATLD 354 Query: 349 KGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 + + G ++G P K L ++T +TI + Sbjct: 355 AISGTGITCTKVGNGLHLYRATKFGVTTPEKTL-MSITTSEVNTIADLPST 404 >gi|291336926|gb|ADD96454.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787] Length = 158 Score = 116 bits (290), Expect = 1e-23, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 63/142 (44%), Gaps = 6/142 (4%) Query: 369 DGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS----KGLSI 424 D +G ++ + + I +M +++G + +S + +I Sbjct: 3 DNYHGTVADDDSIIYTIASNQVNAIRFMTATRT-LIIGTAGGEFAVSGGGTDIAITPTNI 61 Query: 425 DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR 483 ++ S +G ++VG+ +F+ R+++ ++ + + G+ ++T LA+H+ Sbjct: 62 LIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGG 121 Query: 484 ILQLVYQEEPHSIVWVVLEPKD 505 QL YQ+EP+ ++W V Sbjct: 122 FKQLSYQQEPNQVIWGVRNDGQ 143 >gi|77734533|emb|CAI59394.2| hypothetical protein pSG3.03 [Sodalis glossinidius] Length = 517 Score = 113 bits (282), Expect = 8e-23, Method: Composition-based stats. Identities = 31/121 (25%), Positives = 56/121 (46%), Gaps = 13/121 (10%) Query: 454 RRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 ++ ++ S + GF+ N++T LA+H F ++L + P S+VW V L Sbjct: 189 SAVRDLAYSFDVDGFQGNDLTVLANHFFTGFQLLDWAFTITPLSVVWCVRN-----DGTL 243 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVR 570 LG + E + AWH H + K+ + + S +L+ +V + G+ R + R Sbjct: 244 LGLTYLREQQ-VAAWHQHPAAGKYEAVCSIS----EGTEDALYCVVNRTIQGQPRRYVER 298 Query: 571 L 571 L Sbjct: 299 L 299 >gi|89886023|ref|YP_516220.1| hypothetical protein SGPHI_0042 [Sodalis phage phiSG1] gi|89191758|dbj|BAE80505.1| conserved hypothetical protein [Sodalis phage phiSG1] gi|125470053|gb|ABN42245.1| gp40 [Sodalis phage phiSG1] Length = 517 Score = 113 bits (281), Expect = 1e-22, Method: Composition-based stats. Identities = 31/121 (25%), Positives = 56/121 (46%), Gaps = 13/121 (10%) Query: 454 RRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRL 511 ++ ++ S + GF+ N++T LA+H F ++L + P S+VW V L Sbjct: 189 SAVRDLAYSFDVDGFQGNDLTVLANHFFTGFQLLDWAFTITPLSVVWCVRN-----DGTL 243 Query: 512 LGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVR 570 LG + E + AWH H + K+ + + S +L+ +V + G+ R + R Sbjct: 244 LGLTYLREQQ-VAAWHQHPAAGKYEAVCSIS----EGTEDALYCVVNRTIQGQPRRYVER 298 Query: 571 L 571 L Sbjct: 299 L 299 >gi|325971691|ref|YP_004247882.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy] gi|324026929|gb|ADY13688.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy] Length = 551 Score = 113 bits (281), Expect = 1e-22, Method: Composition-based stats. Identities = 51/344 (14%), Positives = 108/344 (31%), Gaps = 22/344 (6%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 +++ GE+SP+L R DL ++ QG ++ + G + P ++ Sbjct: 2 NQLVNNWMYGEISPKL-GGRLDLEMNTQGCEILKDFRNMLQGGITRRPPLKHVAQ----T 56 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKDNKSL 121 R F++ G L+ +KKL++ ++ T Y D S+ Sbjct: 57 VRGRTIPFTLSSGESFLVELSNKKLRVWRKGVLGFYTVTFLPSGNDYLPTDYLEADVWSI 116 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 +YA + VHKD+ PH ++Y + + S E G V + Sbjct: 117 QYAQYYDRLYLVHKDYQPHVVVYAAEAFQFSPFTAETDAGKQLGKSTGYYPSVVGICQNR 176 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTT 241 + + T+ + + ++ + ++ D + T Sbjct: 177 LWFSAAILKPYTTWVSRPP-------YDGSNNHHDFTTFDVIEVNTEVIKDPSTWPKTTN 229 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 + + +S + +V+ +N E ASG + ++ + Sbjct: 230 EQGDEMIDFSDSSKFVETVKEIEEV-INAKCAMEIELASGRNDTIKWVAGMDNIFIGTEA 288 Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH---VTFHNNR 342 + + +S++G P F R Sbjct: 289 NEWMCPFDIDPTKQSASM---LSSYGSLPIQPQTLHDGIFFLQR 329 Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats. Identities = 39/301 (12%), Positives = 85/301 (28%), Gaps = 58/301 (19%) Query: 312 FQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF--------GAF 363 F + YPS V NRL FS + + ++S F Sbjct: 146 FSPFTAETDAGKQLGKSTGYYPSVVGICQNRLWFSAAILKPYTTWVSRPPYDGSNNHHDF 205 Query: 364 YDFSLDGEYGCY-------------DPTKALTTAVTDFSASTIHWM-------------- 396 F + + + + + T+ + Sbjct: 206 TTFDVIEVNTEVIKDPSTWPKTTNEQGDEMIDFSDSSKFVETVKEIEEVINAKCAMEIEL 265 Query: 397 ----------HPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + + +G + + W+ + +S G P ++ D + Sbjct: 266 ASGRNDTIKWVAGMDNIFIGTEANEWMCPFDIDP-TKQSASMLSSYGSLPIQPQTLHDGI 324 Query: 447 VFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDN 506 F+ G R++ ++ ++ G N+++ ADH+ I QL + P +++ +L Sbjct: 325 FFLQR-GNRLREMT-RSQNGSISNDLSFTADHILFAGIRQLATLKNPDPMIFCLLN---- 378 Query: 507 SFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERS 566 L + G W + P ++ G ++ V Sbjct: 379 -DGTLAVLCYDKNY-GMQGWSRWSTQGEF----MCLAPYEDEDGQKMFAHVRRGNDYSIE 432 Query: 567 F 567 + Sbjct: 433 Y 433 >gi|167565012|ref|ZP_02357928.1| tail tubular protein B [Burkholderia oklahomensis EO147] Length = 776 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 57/542 (10%), Positives = 132/542 (24%), Gaps = 49/542 (9%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLR-YGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGY 79 + + L + + N +P G L + P + G Sbjct: 18 VSRQAPLLRSPSQMDEIVNFLPSVDIGGLADRVGTTCIANLAAAPYKS---------EGT 68 Query: 80 ALLVFGDKKLQIVVVRSSTKW------SPALFGKTYKTPYTFKDNKS---LEYAVFGSTA 130 + D + + + R+ + P+ S L++ T Sbjct: 69 YMFRTTDGQRWMFIRRADAGYPEIRNMVNGALAAVTCGPFVQNYINSASRLKFLSMSDTT 128 Query: 131 VFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTA 190 + ++ D + K + I+ L + + S S A + A T Sbjct: 129 LVLNPDVATRFVAPSAGITKTR-AYAVIRKLSSNYQTFYLNSDAGSAATVYDGSAGVKTR 187 Query: 191 RITSDMK-IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 + I +D + + Sbjct: 188 EWVAQRLMEQCIAHMPGLTISRVANVVRISGPEAIINTLNGGNDWDETAFVLIKGRVSAA 247 Query: 250 YSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV------SKDGRSIS 303 A ++ + + Y + + + Sbjct: 248 SDLPAQMFPGESVMVDLENGATKSAYWVTYDRTTNSYKETAWLDNFANAGNWDASTMPVR 307 Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQE------GYP-SHVTFHNNRLLFSGSKGDELSVY 356 + F+ + G P + + RL FS + V Sbjct: 308 IHQTGVNSFEIQPVDWVPRKVGDNDSNAPAPFNGAPITDMALWKGRLWFSSASW----VV 363 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 S ++F D + + + ++ + F + ++V + L Sbjct: 364 GSQPDDLFNFWQDSA-REVVASDPVKVQ-AEADLGSVSHLAGFRDNLMVFLRGAQCSLDG 421 Query: 417 SL--SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ---GFRFNE 471 S + ACPP VG+ +++ R EQ + Sbjct: 422 SQPVKPDTAALGVATRYDVDAACPPSVVGNVMLYTGSQEGRSVLWEYQFEQATENNYAED 481 Query: 472 ITQLADHLFNQRILQLVYQ-EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 +++ + ++V + + +W L D + + + A+ AW+ Sbjct: 482 LSKHIPRYCPGSVRRIVGSAQSGRTFLWSSL---DAATLYVHSSYWQAQQRAQNAWNKLT 538 Query: 531 IS 532 + Sbjct: 539 FA 540 >gi|225626361|ref|YP_002727857.1| putative tail tubular protein B [Pseudomonas phage phikF77] gi|225594870|emb|CAX63155.1| putative tail tubular protein B [Pseudomonas phage phikF77] Length = 826 Score = 111 bits (277), Expect = 3e-22, Method: Composition-based stats. Identities = 55/591 (9%), Positives = 121/591 (20%), Gaps = 89/591 (15%) Query: 18 PRLLQS---RKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P LL + +++ N++ L ++ + F Sbjct: 9 PNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQP-WPRPFLY 67 Query: 75 PDG----GYALLVFGD-KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGST 129 A+LV +L + R Y D + L A Sbjct: 68 HTNLGGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDY---LKAADYRQLRAATVADD 124 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + P G + M VK NA + + Sbjct: 125 LFIANLSVKPEADRTDVKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATY 184 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 + + +G + + + K Y + + Sbjct: 185 VTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDTAAATVA 244 Query: 250 YSKGATYVKDN------------NITWITVLNLSSKTSRESASGAVA------PYYVWGD 291 V+D ++ N + S + G Sbjct: 245 GYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGT 304 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA-------------------------- 325 + + ++ F+ + W A Sbjct: 305 GVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSL 364 Query: 326 -----------WGEQEGYPSHVTF-------HNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + + VT RL+ + V +S+ + + Sbjct: 365 NELDYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQE----YVCMSASNNPHRWF 420 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSID 425 + + A F + ++V ++ + ++ Sbjct: 421 KKSAAA-LNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVI 479 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGV-----GRRIKYISGSTEQGFRFNEITQLADHLF 480 P G + F G S ST+ + ++T Sbjct: 480 SITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 Y + S ++V + + A+H + Sbjct: 540 PGPAE---YIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTL 587 >gi|195546741|ref|YP_002117819.1| tail tubular protein B [Pseudomonas phage PT2] gi|165880750|gb|ABY71005.1| tail tubular protein B [Pseudomonas phage PT2] Length = 826 Score = 110 bits (275), Expect = 5e-22, Method: Composition-based stats. Identities = 59/620 (9%), Positives = 128/620 (20%), Gaps = 97/620 (15%) Query: 18 PRLLQS---RKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P LL + +++ N++ L ++ R + F Sbjct: 9 PNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQP-WPRPFLY 67 Query: 75 PDG----GYALLVFGD-KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGST 129 A+LV +L + R Y D + L A Sbjct: 68 HTNLGGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDY---LKANDYRQLRAATVADD 124 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + P G + M VK NA + + Sbjct: 125 LFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATY 184 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 + + +G + + + K Y + + Sbjct: 185 VTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANAATIA 244 Query: 250 YSKGATYVKDN------------NITWITVLNLSSKTSRESASGAVA------PYYVWGD 291 V+D ++ N + S + G Sbjct: 245 GYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGV 304 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA-------------------------- 325 + + ++ F+ + W A Sbjct: 305 GVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSL 364 Query: 326 -----------WGEQEGYPSHVTF-------HNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + + VT RL+ + V +S+ + + Sbjct: 365 NELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQE----YVCMSASNNPHRWF 420 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSID 425 + + A F + ++V ++ + ++ Sbjct: 421 KKSAAA-LNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVI 479 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGV-----GRRIKYISGSTEQGFRFNEITQLADHLF 480 P G + F G S ST+ + ++T Sbjct: 480 SITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Y + S ++V + + A+H + Sbjct: 540 PGP---AEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLR-------- 588 Query: 541 ASFPNDNRGGTSLWMLVALS 560 G +L +L+ Sbjct: 589 HQIIGAYFTGDNLMVLIQKG 608 >gi|195546679|ref|YP_002117760.1| tail tubular protein B [Pseudomonas phage PT5] gi|158187640|gb|ABW23117.1| tail tubular protein B [Pseudomonas phage PT5] Length = 826 Score = 110 bits (273), Expect = 9e-22, Method: Composition-based stats. Identities = 57/620 (9%), Positives = 126/620 (20%), Gaps = 97/620 (15%) Query: 18 PRLLQS---RKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P LL + +++ N++ L ++ + F Sbjct: 9 PNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQP-WPRPFLY 67 Query: 75 PDG----GYALLVFGD-KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGST 129 A+LV +L + R Y D + L A Sbjct: 68 HTNLGGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDY---LKANDYRQLRAATVADD 124 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + P G + M VK NA + + Sbjct: 125 LFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATY 184 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 + + +G + + + K Y + + Sbjct: 185 VTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANAATIA 244 Query: 250 YSKGATYVKDN------------NITWITVLNLSSKTSRESASGAVA------PYYVWGD 291 V+D ++ N + S + G Sbjct: 245 GYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGV 304 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA-------------------------- 325 + + ++ F+ + W A Sbjct: 305 GVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSL 364 Query: 326 -----------WGEQEGYPSHVTF-------HNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + + VT RL+ + V +S+ + + Sbjct: 365 NELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQE----YVCMSASNNPHRWF 420 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSID 425 + + A F + ++V ++ + ++ Sbjct: 421 KKSAAA-LNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVI 479 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGV-----GRRIKYISGSTEQGFRFNEITQLADHLF 480 P G + F G S ST+ + ++T Sbjct: 480 SITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Y + S ++V + + A+H + Sbjct: 540 PGP---AEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLR-------- 588 Query: 541 ASFPNDNRGGTSLWMLVALS 560 +L +L+ Sbjct: 589 HQIIGAYFTDDNLMVLIQKG 608 >gi|33300845|ref|NP_877473.1| tail tubular protein B [Pseudomonas phage phiKMV] gi|33284816|emb|CAD44225.1| tail tubular protein B [Enterobacteria phage phiKMV] Length = 826 Score = 110 bits (273), Expect = 9e-22, Method: Composition-based stats. Identities = 58/620 (9%), Positives = 127/620 (20%), Gaps = 97/620 (15%) Query: 18 PRLLQS---RKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P LL + +++ N++ L ++ + F Sbjct: 9 PNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQP-WPRPFLY 67 Query: 75 PDG----GYALLVFGD-KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGST 129 A+LV +L + R Y D + L A Sbjct: 68 HTNLGGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDY---LKANDYRQLRAATVADD 124 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + P G + M VK NA + + Sbjct: 125 LFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATY 184 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 + + +G + + + K Y + + Sbjct: 185 VTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANAATIA 244 Query: 250 YSKGATYVKDN------------NITWITVLNLSSKTSRESASGAVA------PYYVWGD 291 V+D ++ N + S + G Sbjct: 245 GYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGV 304 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA-------------------------- 325 + + ++ F+ + W A Sbjct: 305 GVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSL 364 Query: 326 -----------WGEQEGYPSHVTF-------HNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + + VT RL+ + V +S+ + + Sbjct: 365 NELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQE----YVCMSASNNPHRWF 420 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSID 425 + + A F + ++V ++ + ++ Sbjct: 421 KKSAAA-LNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVI 479 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGV-----GRRIKYISGSTEQGFRFNEITQLADHLF 480 P G + F G S ST+ + ++T Sbjct: 480 SITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Y + S ++V + + A+H + Sbjct: 540 PGP---AEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLR-------- 588 Query: 541 ASFPNDNRGGTSLWMLVALS 560 G +L +L+ Sbjct: 589 HQIIGAYFTGDNLMVLIQKG 608 >gi|167600480|ref|YP_001671979.1| tail tubular protein B [Pseudomonas phage LUZ19] gi|161168343|emb|CAP45507.1| tail tubular protein B [Pseudomonas phage LUZ19] Length = 826 Score = 110 bits (273), Expect = 9e-22, Method: Composition-based stats. Identities = 58/620 (9%), Positives = 127/620 (20%), Gaps = 97/620 (15%) Query: 18 PRLLQS---RKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P LL + +++ N++ L ++ + F Sbjct: 9 PNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQP-WPRPFLY 67 Query: 75 PDG----GYALLVFGD-KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGST 129 A+LV +L + R Y D + L A Sbjct: 68 HTNLGGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDY---LKANDYRQLRAATVADD 124 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + P G + M VK NA + + Sbjct: 125 LFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATY 184 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 + + +G + + + K Y + + Sbjct: 185 VTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANAATIA 244 Query: 250 YSKGATYVKDN------------NITWITVLNLSSKTSRESASGAVA------PYYVWGD 291 V+D ++ N + S + G Sbjct: 245 GYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGAPGV 304 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA-------------------------- 325 + + ++ F+ + W A Sbjct: 305 GVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSL 364 Query: 326 -----------WGEQEGYPSHVTF-------HNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + + VT RL+ + V +S+ + + Sbjct: 365 NELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQE----YVCMSASNNPHRWF 420 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSID 425 + + A F + ++V ++ + ++ Sbjct: 421 KKSAAA-LNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVI 479 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGV-----GRRIKYISGSTEQGFRFNEITQLADHLF 480 P G + F G S ST+ + ++T Sbjct: 480 SITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Y + S ++V + + A+H + Sbjct: 540 PGP---AEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLR-------- 588 Query: 541 ASFPNDNRGGTSLWMLVALS 560 G +L +L+ Sbjct: 589 HQIIGAYFTGDNLMVLIQKG 608 >gi|158345061|ref|YP_001522826.1| putative tail tubular protein B [Pseudomonas phage LKD16] gi|114796414|emb|CAK25970.1| putative tail tubular protein B [Pseudomonas phage LKD16] Length = 826 Score = 108 bits (268), Expect = 4e-21, Method: Composition-based stats. Identities = 60/620 (9%), Positives = 131/620 (21%), Gaps = 97/620 (15%) Query: 18 PRLLQSRKDL---SLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P LL +++ N++ L ++ + + Sbjct: 9 PNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQP-WPRPYLY 67 Query: 75 PDG----GYALLVFGD-KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGST 129 A+LV +L + + Y D + L A Sbjct: 68 HTNLGGRSIAMLVAQHRGELYLFDEKDGRLLMGQPLVHDY---LKASDYRQLRAATVADD 124 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + + P G S T + VK NA + + Sbjct: 125 LFIANLEVRPEADKADVLGVDPSKTGWLYIKAGQYSKAFSLTIKVKDNATGTTYSHTATY 184 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFG 249 + + +G + + + K Y + + Sbjct: 185 VTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFFGAPEYTLPNSTKKYPKVDPDPAAATVA 244 Query: 250 YSKGATYVKDN-----NITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 V+D I V + + + D+ + + Sbjct: 245 GYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGT 304 Query: 305 APQ-------------SQTLFQAGVSVVSWFMSAW------------------------- 326 Q + F + W A Sbjct: 305 GVQFMDGAIMATGSTKAPVYFAWDAANRRWAERAAYGTDWVLKKMPLALRWDESTDTYSL 364 Query: 327 ----------GEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 G++E P+ +T RL+ + V +S+ + + Sbjct: 365 NELEYDRRGSGDEETNPTFNFVKRGITGMTTFQGRLVLLSQE----YVCMSASNNPHRWF 420 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSID 425 + + A F + ++V ++ + ++ Sbjct: 421 KKSAAA-LNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVI 479 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGV-----GRRIKYISGSTEQGFRFNEITQLADHLF 480 P G + F G S ST+ + ++T Sbjct: 480 SITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Y + S ++V + + A+H + Sbjct: 540 PGP---AEYIQAAASSGYLVFGTSAADEMICHQYLWQGNEKVQNAYHRWTLR-------- 588 Query: 541 ASFPNDNRGGTSLWMLVALS 560 G +L +L+ Sbjct: 589 HQIIGAYFTGDNLMVLIQKG 608 >gi|229604955|ref|YP_002875655.1| putative tail tubular protein B [Vibrio phage VP93] gi|227977000|gb|ACP44102.1| putative tail tubular protein B [Vibrio phage VP93] Length = 780 Score = 108 bits (268), Expect = 4e-21, Method: Composition-based stats. Identities = 60/559 (10%), Positives = 133/559 (23%), Gaps = 40/559 (7%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQE--YRDCRLDPRSNRVFS--FSIPD 76 + + A + N++ + P D + +++ Sbjct: 16 VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 Query: 77 GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD 136 G L++ + ++ R + D +S++ G ++ + Sbjct: 76 DGRHLVINTNTGGWWLLDREAKNIVSEGNLSYL----LAADRRSIQTTSMGGVTYILNTE 131 Query: 137 HPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDM 196 P D T F+ V + T D Sbjct: 132 KRPSATTDNSDKKDPKTT--GFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDA 189 Query: 197 KIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATY 256 P R + Y +T+ GYS + Sbjct: 190 DQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELITGTDLKITSTSGSPYIGYSNQSQV 249 Query: 257 VKDNNITWITVLNLSSKTSRESAS-------GAVAPYYVWGDIKDVSKDGRSISVAPQSQ 309 + ++ + S + VW + D + P Sbjct: 250 NLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYKI 309 Query: 310 TLFQAGVSVVSWFMSAWGEQEGYPSHVTF--------HNNRLLFSGSKGDELSVYLSSFG 361 ++ ++ P+ + RL+ V +S+ G Sbjct: 310 VDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGA----YVCMSATG 365 Query: 362 AFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI--SLS 419 F DPT + A S F + +++ D++ ++ L Sbjct: 366 EPDRFFRSTV-SSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLL 424 Query: 420 KGLSIDFRRVSG-SGVYACPPVSVGDCLVFVCGVG---RRIKYISGS--TEQGFRFNEIT 473 + S + PV+ L++ + + S T + ++T Sbjct: 425 APDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVT 484 Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533 + + ++ DN F+++G+ AWH + Sbjct: 485 THIPRYIEGEARFMQSASAANIVLMA--TTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPY 542 Query: 534 KHYVLSAASFPNDNRGGTS 552 + L A Sbjct: 543 RVASLHFARDRVVLFAADD 561 >gi|48696643|ref|YP_024422.1| hypothetical protein VP2p15 [Vibrio phage VP2] gi|40950041|gb|AAR97632.1| hypothetical protein [Vibrio phage VP2] Length = 594 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 57/436 (13%), Positives = 120/436 (27%), Gaps = 46/436 (10%) Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSD-MKIFKPLDKGRSIRLGCH 213 F + F + +S SI A + + G Sbjct: 4 FSQTSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDGEVR 63 Query: 214 PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273 S + + + + + + + Sbjct: 64 LFRLPAVDAPSNDVIVEVGNTNIAVWVND--VRQVVANTPSEWRNTIDRIQTA------- 114 Query: 274 TSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYP 333 G A G + V + + + +Q + W YP Sbjct: 115 ---YDTIGDDAGAANTGRLIMVHPALQPKRLYRDNNNAWQFVNMHTGAVPAEWSPSN-YP 170 Query: 334 SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393 V NR+ + GS + + G D + DP + Sbjct: 171 QTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGT-----P 225 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVC 450 W+ + + +G + + L+ S + RR S G A + + ++F Sbjct: 226 CWIIASSDVLTIGTTINDYQLAASTGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCS 285 Query: 451 GVGRRIKYISGSTE-QGFRFNEITQLADHLFN-------QRILQLVYQEEPHSIVWVVLE 502 ++ ++ E + +E++ A HLF + ++ Y + +WVVLE Sbjct: 286 RNKSKVYAMNYVREQDNWIPDEMSSQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLE 345 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMI-SDKHYVLSAASFPNDNRGGTSLWMLVALS- 560 + C AW + K ++AA P+ ++ V S Sbjct: 346 NGQIN----YCCF--DRTTDTKAWTQLELSGGKVIDIAAAFNPDS----DYAYVAVVRSK 395 Query: 561 --AGEERSF--TVRLN 572 G ++++ +++ Sbjct: 396 AINGVQKNYTVLEKIS 411 Score = 65.3 bits (157), Expect = 3e-08, Method: Composition-based stats. Identities = 35/304 (11%), Positives = 75/304 (24%), Gaps = 26/304 (8%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 ++ SF G ++PRL + + + + + + N + G L++ +E C+ Sbjct: 5 SQTSFKGGVIAPRLQFNEYESA-YHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQD--GE 61 Query: 67 NRVF--SFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPAL-----FGKTYKTPYTFKDNK 119 R+F ++ G+ + + V + +T Y Sbjct: 62 VRLFRLPAVDAPSNDVIVEVGNTNIAVWVNDVRQVVANTPSEWRNTIDRIQTAY--DTIG 119 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS-----------FTFDEIKFLPPPWLGD 168 A + VH P L + ++ + Sbjct: 120 DDAGAANTGRLIMVHPALQPKRLYRDNNNAWQFVNMHTGAVPAEWSPSNYPQTVGIFQNR 179 Query: 169 GMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAY 228 G + + I + P +++ Sbjct: 180 VWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGTPCWIIASSDVLTIGT 239 Query: 229 IVADDKVYRSLTTGRSGDR---FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 + D ++ S + S T V+ S S+ A V Sbjct: 240 TINDYQLAASTGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCSRNKSKVYAMNYVRE 299 Query: 286 YYVW 289 W Sbjct: 300 QDNW 303 >gi|50282960|ref|YP_053016.1| hypothetical protein VP5_gp14 [Vibrio phage VP5] Length = 594 Score = 106 bits (263), Expect = 2e-20, Method: Composition-based stats. Identities = 47/312 (15%), Positives = 98/312 (31%), Gaps = 33/312 (10%) Query: 277 ESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV 336 G G + V + + + ++ + W YP V Sbjct: 115 YDTIGDDLGAANTGRLIMVHPALQPKRLYRDNNNAWKFVNMHTGAVPAEWSSSN-YPQTV 173 Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM 396 NR+ + GS + + G D + DP + W+ Sbjct: 174 GIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGT-----PCWI 228 Query: 397 HPFGEGVLVGCDTSLWLLSISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVG 453 + + +G + + L+ S + RR S G A + + ++F Sbjct: 229 IASSDVLTIGTTINDYQLAASTGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCSRNK 288 Query: 454 RRIKYISGSTE-QGFRFNEITQLADHLFN-------QRILQLVYQEEPHSIVWVVLEPKD 505 ++ ++ E + +E++ A HLF + ++ Y + +WVVLE Sbjct: 289 SKVYAMNYVREQDNWIPDEMSSQAQHLFTPISSARGASVRRVAYISDAAKSLWVVLENGK 348 Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMI-SDKHYVLSAASFPNDNRGGTSLWMLVALS---A 561 + C AW + K ++AA P+ ++ V S Sbjct: 349 IN----YCCF--DRTTDTKAWTQLELSGGKVIDIAAAFNPDS----DYAYVAVVRSKVVN 398 Query: 562 GEERSF--TVRL 571 G ++++ ++ Sbjct: 399 GAQKNYTVLEKI 410 Score = 62.6 bits (150), Expect = 2e-07, Method: Composition-based stats. Identities = 35/304 (11%), Positives = 75/304 (24%), Gaps = 26/304 (8%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 ++ SF G ++PRL + + + + + + N + G L++ +E C+ Sbjct: 5 SQTSFKGGVIAPRLQFNEYESA-YHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQD--GE 61 Query: 67 NRVF--SFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPAL-----FGKTYKTPYTFKDNK 119 R+F ++ G+ + + V + +T Y Sbjct: 62 VRLFRLPAIDAPSNDIIVEVGNANIAVWVNDVRQVVAATPSEWRNTLDRIQTAY--DTIG 119 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS-----------FTFDEIKFLPPPWLGD 168 A + VH P L + ++ + Sbjct: 120 DDLGAANTGRLIMVHPALQPKRLYRDNNNAWKFVNMHTGAVPAEWSSSNYPQTVGIFQNR 179 Query: 169 GMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAY 228 G + + I + P +++ Sbjct: 180 VWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGTPCWIIASSDVLTIGT 239 Query: 229 IVADDKVYRSLTTGRSGDR---FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 + D ++ S + S T V+ S S+ A V Sbjct: 240 TINDYQLAASTGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCSRNKSKVYAMNYVRE 299 Query: 286 YYVW 289 W Sbjct: 300 QDNW 303 >gi|158345179|ref|YP_001522886.1| putative tail tubular protein B [Enterobacteria phage LKA1] gi|114796475|emb|CAK25013.1| putative tail tubular protein B [Pseudomonas phage LKA1] Length = 777 Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats. Identities = 68/540 (12%), Positives = 140/540 (25%), Gaps = 41/540 (7%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFS--FSIPDGG 78 + + V N+ + D +NR+ + Sbjct: 15 VSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAYSLATFSGRE 74 Query: 79 YALLVFG-DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH 137 LLV D L I+ + T +S+ +A + + + Sbjct: 75 VLLLVDTLDGTLTILDDATGEVLFTGTNSY-----LTAGTGRSIRFAALDDSVFVANTEV 129 Query: 138 PPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGM-ISGVKSNAKLSISQADTSTARITSDM 196 P L+ T ++ +S ++ S T++A S Sbjct: 130 IPQTQLWSGASAYPDPTRAGYLYVVAGAFSKQYRLSITNQVTGVTTSVDVTTSATEASQA 189 Query: 197 KIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATY 256 + + R+ A Y + + SG F + A Sbjct: 190 TGEYVITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAIAVSTDSGSNFLRASNAAS 249 Query: 257 VKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV-SKDGRSISVAPQSQTLFQAG 315 ++D + + + + Y+ W D++ +D + A + Sbjct: 250 IRDAAELPAKLPADADGFIIATGAAKNKTYFRWVDLERKWDEDASRGAQAELIDMPLRIT 309 Query: 316 VSVVSW-------FMSAWGEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYLSS 359 S ++ A G+ P S +T RL+ + V +S+ Sbjct: 310 YSAPNFSLTALNYERRASGDATSNPALKFTEQGISGMTTMQGRLVLLAGE----YVCMSA 365 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL- 418 G + D + A T AS + F + +++ T L+ + Sbjct: 366 SGNPLRWFRASVSTQSDD-DPIEVAATAPVASPYEYAVAFNKDLVLFAKTHQGLVPGANL 424 Query: 419 -SKGLSIDFRRVSGSGVYACPPVSVGDCLVFVC-GVGRRIKYISG----STEQGFRFNEI 472 + + S +C PV G + F G T+ ++ Sbjct: 425 LTSRNATAAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWEMLPSQYTDAQVEASDS 484 Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 T + L + V+ + + + + AWH Sbjct: 485 TSHLPKYIAGPVRFLATSSTTSIV---VVGTSNLRELVVHEYLWQGGEKVHAAWHKWSFP 541 >gi|312062879|gb|ADQ12741.1| putative tail tubular protein B [Acinetobacter phage phiAB1] Length = 763 Score = 101 bits (252), Expect = 3e-19, Method: Composition-based stats. Identities = 49/553 (8%), Positives = 131/553 (23%), Gaps = 49/553 (8%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 SF G + + + NL+ L ++ P S+ Sbjct: 8 PSFLKG------VSQQTPQERSDGQLGAQLNLLSDAVTGLRRRGGVKFQAKLTGIPNSSY 61 Query: 69 VFSFSIPDGGYALLVFG-DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 + I Y ++V L+I S S+ V Sbjct: 62 IRLIDINGVNYIMIVDTVTGTLKIYNFDGSLL-----KAHQTDYLKASNGKASIRSTVSR 116 Query: 128 STAVFVHKDHPPHHLLYIQDGD-KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 + ++ + T I + + + LS Sbjct: 117 NNCFVLNTEQVITKTPTGGTNPIPNPSTMGYISIRSGQFSKMYSVDIKSGSYTLSFGVGT 176 Query: 187 TSTARITSDMKIFKPLDKGR-----------SIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 + + + + + R ++ + ++ Sbjct: 177 SGSEAWQATPEWVATEMENRIKEDTTLNARYTVVREGSTVALKAKSAIDTNLLVIESGTG 236 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 + T S G + + +I + ++ + + + G + Sbjct: 237 STYIQTSNSSRVQGKQDIIANLPNILDKYIIAVGTVGNSAYYQYNATTSTWKECGVYEAP 296 Query: 296 S-KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTF-------HNNRLLFSG 347 I Q + + + P V F + +RL+ Sbjct: 297 YKFTNEPIYWYFDDTDTIQVKSLDIQPRTAGDDDNNPLPKFVDFGITGISAYQSRLVLLS 356 Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407 V +S+ F + + + + T SA+ + P+ + +++ Sbjct: 357 G----SYVNMSATADF-NVYMRTTVEELQDDDPIEVSSTALSAAQFEYAVPYNKDLVLLA 411 Query: 408 DTSLWLLSISLSKGLSIDFR---RVSGSGVYACPPVSVGDCLVFVCGVGR---RIKYISG 461 ++ + + + A P V L + G ++ + Sbjct: 412 QNQQAVIPANSTVLTPKTAVIYPSSKANISMASEPQVVSRSLYYTYQRGTDYYQVGEMIP 471 Query: 462 S--TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAE 519 + ++ + + + + V+ D + ++ E Sbjct: 472 NAYSDAQYYAQNLADHIPLYATGVCTSITGSTTDNMAVF----SSDQKELLVHQYLWAGE 527 Query: 520 GEGDFAWHTHMIS 532 ++H + Sbjct: 528 DRPLMSFHKWELP 540 >gi|291334458|gb|ADD94112.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] gi|291334665|gb|ADD94312.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291336445|gb|ADD96000.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 121 Score = 99.2 bits (245), Expect = 2e-18, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 37/107 (34%), Gaps = 5/107 (4%) Query: 81 LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140 +L FG++ ++ + +PY + ++YA H +HP Sbjct: 1 MLEFGNQYIRFYKDNGQIL--SSGSAYEISSPYLEAELFDIKYAQSADVMYLCHPNHPVK 58 Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187 L S+T + F P++ + + + + + Q T Sbjct: 59 KLARTGH---TSWTLTSVDFQNGPFMDHNIETTTITASHTNAGQTGT 102 >gi|291334515|gb|ADD94168.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] Length = 99 Score = 95.4 bits (235), Expect = 3e-17, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 34/100 (34%), Gaps = 5/100 (5%) Query: 81 LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140 +L FG++ ++ + +PY + ++YA H +HP Sbjct: 1 MLEFGNQYIRFYKDNGQIL--SSGSAYEISSPYLEAELFDIKYAQSADVMYLCHPNHPVK 58 Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L S+T + F P++ + + + + Sbjct: 59 KLARTGH---TSWTLTSVDFQNGPFMDHNIETTTITASHT 95 >gi|148747829|ref|YP_001285795.1| tail tubular protein B [Phormidium phage Pf-WMP3] gi|146230062|gb|ABQ12470.1| tail tubular protein B [Phormidium phage Pf-WMP3] Length = 1027 Score = 88.8 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 55/450 (12%), Positives = 118/450 (26%), Gaps = 34/450 (7%) Query: 128 STAVFVHKDHPP--HHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 T + +P + D S T W + + S + Sbjct: 262 DTIQGTYGRYPMLLYKTATFNDTYTFSNTGQPANADSYGWGDGSVYNVGASAYLNTSPFF 321 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSG 245 T T + + + R L + A N + A Y S G + Sbjct: 322 ATFGDTRTPTPQPPETVHLLRQRELRFNYGNGATGANLRVTVDGTALSANYSSTVAGTNR 381 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 Y T + + + S + AV V + Sbjct: 382 AYALYKADGTLCTSASDLAYYIAFTGATPLGISPTAAVTITNVDRTYIGSAAT------- 434 Query: 306 PQSQTLFQAGVSVVSWFMSAWGE--QEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 Q+ + G + + W +P T + +RL+ G D V S+ G Sbjct: 435 -QTDNAYVQGGYFKVYGLGLWANYGTGQFPRIATVYQSRLVLGGFTNDPTRVVFSATGDT 493 Query: 364 ------YDFSLDGEYGCYDPTKALTTAVTDFSAST-IHWMHPFGEGVLVGCDTSLWLLSI 416 Y+F + + V+ A + + + + V + + + Sbjct: 494 VEGGVKYNFFQVTDDLDGLDSDPFDLVVSSSQADDYVTGLVEWQSSLFVLTRRATFRANG 553 Query: 417 SLSKGLSI---DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQG-FRFNEI 472 + S V V + ++ G + ++ E G ++ E Sbjct: 554 GDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDSG--VFNLTPRVEDGEYQAIEK 611 Query: 473 TQLADHLFNQRILQLVYQEE------PHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + +F + V +++V L + ++ + +W Sbjct: 612 SIKIRKVFGKTTSTAVSSAAWMSFDQNRKVLYVALPRGSETTVASALYVYNTFRD---SW 668 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWML 556 + + + + G + L M+ Sbjct: 669 TQYDTLGGFKTYTGHPYVDTVLGDSFLLMV 698 >gi|289976625|gb|ADD21670.1| tail tubular protein B [Caulobacter phage Cd1] Length = 857 Score = 86.5 bits (212), Expect = 1e-14, Method: Composition-based stats. Identities = 45/438 (10%), Positives = 118/438 (26%), Gaps = 24/438 (5%) Query: 135 KDHPPHHLLYIQ--DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARI 192 D+ L + ++ + + P + + + + S +S +++ Sbjct: 222 PDYQKKVLDRTNAYNSAVTAWIGEAAEDSTPENIANKLAAQFTSQGVTGVSVINSTVIVD 281 Query: 193 TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSK 252 + D G + E S + + ++ + Sbjct: 282 NAQFVEASGDDGGDGTLMRAVGNEVTALDLVSTVHW----GGKVVKVRPKKNNGEDAFYL 337 Query: 253 GATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLF 312 A + N ++ + V V G + S + +A + Sbjct: 338 QAELKEGNGPWGEVSWKETAGYEMKPVEVFVQGTVVGGTLYLASTAAKLTEIAGGVHPDY 397 Query: 313 QAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 +A V ++ +RL+ +++ S G ++++ Sbjct: 398 KANVVGDDISCPLPYFFGKSIDYLGMFQDRLVIGSGA----TLFFSRPGDYFNWFRTSVL 453 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVS 430 D + TI + +L+ + ++ + + + Sbjct: 454 -TVDDRDPIEMYALGSEDDTIKTSTTYDRNILLFGKRMQYTVNGRQPLTPKSASIVILSA 512 Query: 431 GSGVYACPPVSVGDCLVFVC-GVG-RRIKYISG-STEQGFRFNEITQLADHLFNQRILQL 487 P + G+ + + G + I ++Q D + +Q+ Sbjct: 513 HEDAIDADPQNSGNFVFYGKWRNGVSSLHQIQMGMLADSPESFNVSQQLDQYLQGKPVQI 572 Query: 488 VYQEEPHSIVWVVLEPKDNSFPRLLGCRFSA----EGEGDFAWHTHMISDKHYVLSAASF 543 V P+++V D S L + AW + V++A + Sbjct: 573 VALTSPNTVVL----RTDASRNTLYTYTYLDTPAGSERLFDAWSKWTWDETLGVVTALAR 628 Query: 544 PNDNRGGTSLWMLVALSA 561 + + +L V + Sbjct: 629 HDGDILSFTLRKGVDRTG 646 >gi|291335873|gb|ADD95469.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C304] Length = 147 Score = 83.0 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 41/136 (30%), Gaps = 12/136 (8%) Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDE 352 I + Q+ F + PS V F NRL+F + Sbjct: 1 MPIQLVRQANGTFTVSQATWENADVGDTLTNPNPSFVGKTVNQLVFFRNRLVFLSDEN-- 58 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 V +S G F++F + P + + + + ++ G+L+ + Sbjct: 59 --VIMSRPGEFFNFWSKTA-TTFTPQDVIDLSCSSEYPAIVYDGIQVNAGLLLFTKNQQF 115 Query: 413 LLSISLSKGLSIDFRR 428 +L+ + Sbjct: 116 MLTTDSDILSPETAKL 131 >gi|291334275|gb|ADD93938.1| hypothetical protein BTH_I0919 [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 323 Score = 82.3 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 46/161 (28%), Gaps = 19/161 (11%) Query: 415 SISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG-STEQGFRFNEIT 473 + + RR + G ++ +FV GR ++ + E + I+ Sbjct: 9 RTNSLTPSNFTARRQTTHGCSHVNVKTLEGGALFVQKHGRAVRELLFTDLELSYSATNIS 68 Query: 474 QLADHLFNQRILQLVYQEE---PHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 LA HL + + Q P S + G + E W Sbjct: 69 LLASHLVQTPVDMTILQGTAERPESYAIFINSDGT------AGVFHAVRAEKLAGWTEWK 122 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 + A+F + G+ L+ V + + Sbjct: 123 TTTG------ATFKSIEAVGSRLFFTVYRD---STYYIEEM 154 >gi|197935887|ref|YP_002213723.1| tail tuber protein B [Ralstonia phage RSB1] gi|197927050|dbj|BAG70392.1| tail tuber protein B [Ralstonia phage RSB1] Length = 861 Score = 79.6 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 42/503 (8%), Positives = 117/503 (23%), Gaps = 45/503 (8%) Query: 85 GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 G + + + + YT + Y T+ D + Sbjct: 173 GGAYSRTYKLVIRGEPDNYPGTPVFTATYTTMASS---YPNLLDTSDIAQSDPEYQKKVN 229 Query: 145 IQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDK 204 + S + + A+LS + Sbjct: 230 DRVNAYNSAVNKWVGDALASTQPQNI------AAQLSGQLVAGGYNNLAVVGGSIFMDHI 283 Query: 205 GRSIRLGCHPPEWAKNTNYSIGA----YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDN 260 + + + D+ + + + + T Sbjct: 284 LDMTCDDSGDGTLFRAVFNEVDDPAKLSTIHGDQKIVRVKPKGTDETYYMRAVKTDTAAA 343 Query: 261 NITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 + + + +++ A+A S + ++ + ++ Sbjct: 344 HFGPVQWVEGAAQVVTPGQVFAIASITSTTLTLANSPAQLATAIGSPVPGYAASVCGDMT 403 Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380 + SH+ +R++ + + +S G ++++ + D Sbjct: 404 DKGAVPYFFGRKVSHMAMFQDRMVIVSN----GVILMSRTGDYFNWFRKSKLR-VDDDDP 458 Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL--SKGLSIDFRRVSGSGVYACP 438 + I + + + + + + L + + Sbjct: 459 VEAFALGSEDDIISQSSSYNKDLFLFGERGQYALPGRSAITPKTISITQVAGERDAMLAR 518 Query: 439 PVSVGDCLVFV-------CGVGRRIKYISGSTEQG-FRFNEITQLADH----LFNQRILQ 486 P+ VG+ L + + + G F+ T A R+++ Sbjct: 519 PIPVGNLLFYGKYEAKPDQSGPSKYAASLNQFQLGLFQDTPETYNASQQLDGYLQGRVIE 578 Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF----AWHTHMISDKHYVLSAAS 542 L +P++ V D L RF + +W + + Sbjct: 579 LASLPKPYT----VFCRTDGLDTGLYTYRFIDQQGTQARQFDSWSRWEWDAR-----VGT 629 Query: 543 FPNDNRGGTSLWMLVALSAGEER 565 +L+ V + + Sbjct: 630 LIGLTTYKATLYAYVMRTNAQGV 652 >gi|297171931|gb|ADI22918.1| hypothetical protein [uncultured Rhizobium sp. HF0500_35F13] Length = 336 Score = 75.3 bits (183), Expect = 3e-11, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 38/95 (40%), Gaps = 13/95 (13%) Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH-----YVLSAA 541 + YQEEP SI++ V E L+ + + + AWH H+ S A Sbjct: 1 MAYQEEPLSIIYAVRE-----DGELVALTYQRDQQ-VVAWHRHIFGGAFGTGNAVCESIA 54 Query: 542 SFPNDNRGGTSLWMLVALS-AGEERSFTVRLNLLD 575 P + +++++ + G + + LN D Sbjct: 55 VIP-TDLDEYEVYVIIKRTINGATKRYVEVLNTFD 88 >gi|291335793|gb|ADD95394.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C532] Length = 295 Score = 75.0 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 29/296 (9%), Positives = 74/296 (25%), Gaps = 26/296 (8%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + T T + + G L + D V+ + N+IP L+ P Q Sbjct: 1 MASVTQTIPTLTGG------LSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGQLVASI 54 Query: 61 RL-------DPRSNRVFSFSIPDGGYALLVF---GDKKL-QIVVVRSSTKWSPALFGKTY 109 + + FS+ + + GD + + + T Sbjct: 55 SDNGTAALNSQTNGKWFSYYRDETESYIGQISRTGDVNMWRCSDGAAMTVNYDGATSSAL 114 Query: 110 KTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDG 169 T + D++ ++ ++ ++ D Sbjct: 115 ATYLSHSDDQDIQTLTLNDYTFITNRTKTVAMSSTVETVRPPEVFIDLRATAYARQYALN 174 Query: 170 MISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYI 229 + + + + ++ + +++ G + R + Sbjct: 175 LYDNTNTTTETTATRISVDLVKSSNNYCDSNGGMVGHASRPSQ---------STRCDDTA 225 Query: 230 VADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 Y R D + N T+ + ++ +S + + Sbjct: 226 GDGRDAYAPNVGTRIFDIDDGASLTDEANSGNYTYTIDVKAANGSSVNRGTNLYSE 281 >gi|291335767|gb|ADD95369.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C429] Length = 364 Score = 73.0 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 58/173 (33%), Gaps = 21/173 (12%) Query: 404 LVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST 463 ++ D+ ++ S + + + A P+S+G + F+ G+ ++ + Sbjct: 1 MLTTDSDVF------SPTTAKINALSTYNFNSATNPISLGTTIGFLDNAGKFSRFFEMAQ 54 Query: 464 EQGFRFNEI---TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEG 520 Q EI + + LF + + + E + I + + S + Sbjct: 55 LQREGEPEIIEQSAVVSDLFEKDLKIISNSRENNVIFF---SEEGTSTLYGYKYFDNIRE 111 Query: 521 EGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 AW ++ +L+++V + + + ++++ Sbjct: 112 RKLAAWFKWTLTGTIQYHCV--------QDDNLFVVVRNNNKDQLLKYAIKMD 156 >gi|291335769|gb|ADD95371.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C429] Length = 100 Score = 70.3 bits (170), Expect = 9e-10, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 28/98 (28%), Gaps = 11/98 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N T T + + G + + D V N +P L+ P + Sbjct: 1 MANVTQTIPNITQG------ISQQPDEYKVPGQVKDMVNALPDVTHGLLKRPAGKFVASL 54 Query: 61 RL----DPRSNRVFSFSIPDGGYALLVF-GDKKLQIVV 93 + R F + + + + +++ Sbjct: 55 SDGTNNSTTNGRWFHYYRDETEQYIGQIAQNGVIKMWD 92 >gi|46581000|ref|YP_011808.1| hypothetical protein DVU2596 [Desulfovibrio vulgaris str. Hildenborough] gi|46450421|gb|AAS97068.1| hypothetical protein DVU_2596 [Desulfovibrio vulgaris str. Hildenborough] Length = 259 Score = 66.5 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 22/81 (27%), Gaps = 11/81 (13%) Query: 497 VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWML 556 +W V L+ E E WH H+ + G LW+ Sbjct: 1 MWCV-----TEDGGLIAMTRIPEHE-VAGWHRHVTDGAVLSVCTIPGT----AGDELWVA 50 Query: 557 VALSAGEERS-FTVRLNLLDD 576 V G RL+ D Sbjct: 51 VRREGGGMVRCCIERLDPPFD 71 >gi|224164141|ref|XP_002338648.1| predicted protein [Populus trichocarpa] gi|222873077|gb|EEF10208.1| predicted protein [Populus trichocarpa] Length = 350 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 8/67 (11%), Positives = 21/67 (31%), Gaps = 6/67 (8%) Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGE-E 564 S LL + + +G W H ++ +++ +V + G Sbjct: 3 RSDGTLLSLTYVKD-QGVLGWARHTTDGTFESVAVI----PEGTEDAVYAVVKRTIGSRT 57 Query: 565 RSFTVRL 571 + ++ Sbjct: 58 VRYVEKI 64 >gi|291335768|gb|ADD95370.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C429] Length = 274 Score = 65.7 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 17/243 (6%), Positives = 45/243 (18%), Gaps = 44/243 (18%) Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251 I G + + +G + + Sbjct: 10 IKKSTAFNASTSVGELLNV-------VSGKVMDVGDLPTQCKHGMVVKVVNSEAEEDDHY 62 Query: 252 KGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTL 311 + + GA G + + + + + Sbjct: 63 VKFFGSLKSGGNPDNDADYLDG------EGAWEECAEPGRKIRLKRSTMPVILIRTADGN 116 Query: 312 F-------------------QAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLF 345 F + + PS + F NR Sbjct: 117 FRLTELDGSSYTVTTASGNVTSSAPQWDDALVGDDVTNPEPSFIGKTISKLMFFRNRFSI 176 Query: 346 SGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV 405 + + +S G F +F + + + + + + G+++ Sbjct: 177 LSDE----YIVMSRPGDFTNFFAKSAIQLI-ASDPIDISASSEYPAVLFDGIQTNTGLIL 231 Query: 406 GCD 408 Sbjct: 232 FTK 234 >gi|291335686|gb|ADD95291.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C139] Length = 190 Score = 60.3 bits (144), Expect = 9e-07, Method: Composition-based stats. Identities = 10/112 (8%), Positives = 27/112 (24%), Gaps = 12/112 (10%) Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-------TFHNNRLLFSGSKGDELSVY 356 + + + PS + F NR + + Sbjct: 44 TVTTASGNVTSSAPQWDDALVGDDVTNPEPSFIGKTISKLMFFRNRFSILSDE----YIV 99 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCD 408 +S G F +F + + + + + + G+++ Sbjct: 100 MSRPGDFTNFFAKSAIQLI-ASDPIDISASSEYPAVLFDGIQTNTGLILFTK 150 >gi|285809804|gb|ADC36195.1| tail tubular protein B [Acinetobacter phage phiAB2] Length = 383 Score = 59.9 bits (143), Expect = 1e-06, Method: Composition-based stats. Identities = 38/391 (9%), Positives = 87/391 (22%), Gaps = 36/391 (9%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 SF G + + + NL+ L ++ P S+ Sbjct: 8 PSFLKG------VSQQTPQERSDGQLGAQLNLLSDAVTGLRRRGGVKFQAKLTGIPNSSY 61 Query: 69 VFSFSIPDGGYALLVFG-DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 + I Y ++V L+I S S+ V Sbjct: 62 IRLIDINGVNYIMIVDTVTGTLKIYNFDGSLL-----KAHQTDYLKASNGKASIRSTVSR 116 Query: 128 STAVFVHKDHPPHHLLYIQDGD-KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 + ++ + T I + + + LS Sbjct: 117 NNCFVLNTEQVITKTPTGGTNPIPNPSTMGYISIRSGQFSKMYSVDIKSGSYTLSFGVGT 176 Query: 187 TSTARITSDMKIFKPLDKGR-----------SIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 + +A + + + R ++ + ++ Sbjct: 177 SGSAAWQATPEWVATEMENRIKEDTTLNARYTVVREGSTVALKAKSAIDTNLLVIESGTG 236 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 + T S G + + +I + ++ + + + G + Sbjct: 237 STYIQTSNSSRVQGKQDIIANLPNILDKYIIAVGTVGNSAYYQYNATTSTWKECGVYEAP 296 Query: 296 S-KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTF-------HNNRLLFSG 347 I Q + + + P V F + +RL+ Sbjct: 297 YKFTNEPIYWYFDDTDTIQVKSLDIQPRTAGDDDNNPLPKFVDFGITGISAYQSRLVLLS 356 Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDPT 378 V +S+ F + D Sbjct: 357 G----SYVNMSATADFNVYMRTTVEELQDDD 383 >gi|227485219|ref|ZP_03915535.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC 51172] gi|227236799|gb|EEI86814.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC 51172] Length = 75 Score = 56.5 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 23/80 (28%), Gaps = 11/80 (13%) Query: 494 HSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSL 553 S +W+V L E AW + S AS P+ L Sbjct: 1 DSFIWLVRN-----DGILATMAVDRAQE-VIAWSRQTTLGAY--ESVASIPSA--NNDVL 50 Query: 554 WMLVAL-SAGEERSFTVRLN 572 + LV G+ + + Sbjct: 51 YALVRRQVNGQTVRYVEVFD 70 >gi|291334274|gb|ADD93937.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 119 Score = 56.1 bits (133), Expect = 2e-05, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 39/113 (34%), Gaps = 5/113 (4%) Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAG 315 + D + I+ N + + +G+ + D + G + + + Sbjct: 3 GLADGSTIVISGANTVDTITASNINGSRTITVLNEDSYSFTAGGSANADNTDAGGGVSIF 62 Query: 316 VSVV-----SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 V+ W + GYP+ TFH+ RL F GS V+ S F Sbjct: 63 VTSPNQPNSQWQEQTYSTIRGYPASATFHDGRLWFGGSSSLPDWVWASKVDEF 115 >gi|256845613|ref|ZP_05551071.1| predicted protein [Fusobacterium sp. 3_1_36A2] gi|256719172|gb|EEU32727.1| predicted protein [Fusobacterium sp. 3_1_36A2] Length = 637 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 55/423 (13%), Positives = 123/423 (29%), Gaps = 37/423 (8%) Query: 1 MVNTTWTKHS-FSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRD 59 M K + F GE+ RL R + ++ Q K NLI G ++ + Sbjct: 1 MERV--FKSNMFVYGEVGERLSGIR-ESEIYQQSAQKIENLIINEMG------NLKIAKK 51 Query: 60 CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119 + + + + V D + + K + + Y P T K Sbjct: 52 LEGTNFQHNLIQLIDTKYNFYVGVTKDNNV-----ATYGKTNNDIGNLLYTHPIT---VK 103 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 ++ +FV D I++G+ + ++ LP + V K Sbjct: 104 NIRIIKMCDERLFVIGDITEVFEFNIENGEIGKSNYLDLIKLPI-KERKNVSFDVYRVYK 162 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + T+ + D+ +I + K S+ + + + Sbjct: 163 VGSDYRVALIGTFTNPTLSYNENDRTVTIGNSVKVEVFYKIYKASVSKENIDPNLLRDGF 222 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 T + Y T + + I + + S + G+ APY + D + Sbjct: 223 TFAVFKNYLPYVGYKTSINEKKIGRVIEKSHIIGNSEVNF-GSNAPYSIANGKYDSTYGS 281 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 + + G + + + V +R++ +Y S Sbjct: 282 TYFIINRKVDGEISYGKLL---------NIKQNITTVGIFQDRMVILND----GYLYFSK 328 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419 ++DF D + + + + G+ + + ++++S + Sbjct: 329 KSDYFDFRNDTKI----DSAFFFKPTPINNIYPEMYDIYVGDKIFIPTSHGVYVVSTNNI 384 Query: 420 KGL 422 Sbjct: 385 LTS 387 >gi|302339301|ref|YP_003804507.1| hypothetical protein Spirs_2810 [Spirochaeta smaragdinae DSM 11293] gi|301636486|gb|ADK81913.1| hypothetical protein Spirs_2810 [Spirochaeta smaragdinae DSM 11293] Length = 570 Score = 54.5 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 20/56 (35%), Gaps = 4/56 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQE 56 M F+ G +SPR++ R D + V++ + L G + Sbjct: 1 MSRQRILVTDFTRGIVSPRMVP-RIDQTK---AVSELTGFVVLPDGGIRRREGTIY 52 >gi|290996598|ref|XP_002680869.1| predicted protein [Naegleria gruberi] gi|284094491|gb|EFC48125.1| predicted protein [Naegleria gruberi] Length = 1407 Score = 53.4 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 49/443 (11%), Positives = 108/443 (24%), Gaps = 62/443 (13%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT----------------PYTF 115 F +G + + + +++V V + + A G T Y Sbjct: 173 FVSSNGNVYISEYQNHYIRMVNVSTGVITTVAGNGTQIGTSGTGLGFGYNGDGIPATYAR 232 Query: 116 KDNKSLEYAVFGSTAVFVH-KDHPPHHLLYIQDGDKISFTFDE-------IKFLPPPWLG 167 N + + + +L ++ T +E + Sbjct: 233 LTNPQGIFVTSNNEIYIADAGNFRIRKVLTNGTIITVAGTGEEGYNGDGMLATAAKLDYP 292 Query: 168 DGMISGVKSNA------------KLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPP 215 G+ L+ T TS +K + + + Sbjct: 293 YGVSVDSNGEIWIAELGSNRLRKVLTNGTIVTIAGTGTSSYTNYKDNVQANLVNVSPIRV 352 Query: 216 EWAKNTNYSIGAY----IVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 I ++ + G G F + N+ Sbjct: 353 FSTSPGEVFISDNMRLRRISTSTGIITTVAGIGGSTFSGDGSQATKATFKFMTNQLANVV 412 Query: 272 SKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG 331 ++ + I+ V +G I++A + + M A Q Sbjct: 413 KTSNGQYLIAD----TGNHRIRKVFANGTIITIAGTGVAGYNSDY------MDASTAQLN 462 Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 YPS V N + S S + F + ++ G + + D + Sbjct: 463 YPSSVFEFKNEVYISDSVNRRIRKI------FTNGTIVTIAGTGSQPPSSGY-LGDDGVN 515 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSI-----DFRRVSGSGVYACPPVSVGDCL 446 + F G+ V ++++ L + ++ + S P S + + Sbjct: 516 ALSARLYFPTGIFVTSANEVFIVDNFLIRKINSNGIITNVAGTISSESTFIPGSSQANSV 575 Query: 447 VFVCGVGRRIKYISGSTEQGFRF 469 G + + Sbjct: 576 TISVDGGIYVSPTGFYFLAYYNS 598 >gi|291335874|gb|ADD95470.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C304] Length = 326 Score = 51.5 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 34/106 (32%), Gaps = 12/106 (11%) Query: 471 EITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 + +++ L ++ I LV + +S V+ + D S + AW T Sbjct: 14 DQSKVISRLLDKDI-SLVSESRENSAVFFSKKGTD--EIYCFRYFNSGDKRLLQAWCTWT 70 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576 ++ ++ + +++ L L D+ Sbjct: 71 LAGNIQYHCML---------DDALFVITRNNNKDQMVKYSLKLDDN 107 >gi|290972086|ref|XP_002668792.1| predicted protein [Naegleria gruberi] gi|284082314|gb|EFC36048.1| predicted protein [Naegleria gruberi] Length = 679 Score = 49.1 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 29/301 (9%), Positives = 70/301 (23%), Gaps = 28/301 (9%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE-----YAVF 126 F + + F + +++ ++ + + N + Sbjct: 17 FVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSS 76 Query: 127 GSTAVFVH-KDHPPHHLLYIQDG----DKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 + +H +L + + F S S+ Sbjct: 77 KNEVYITDYSNHSIRKILENGNIITIAGNGTVGFSGDSGPATNAQLYNPSSVFVSSKNEV 136 Query: 182 ISQAD-----------TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + I + D G + + P ++ + Sbjct: 137 YFTDQHNNRIRKILENGNIITIAGNGTYGFSGDNGPATNAQLYNPYSVFVSSNNEVYITD 196 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 + R + + + + DN LN + + ++ Sbjct: 197 YSNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQLNRPNSVFVSNNEVYISD-QSNQ 255 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350 I+ + ++G I++A F A Q P+ V NN + S Sbjct: 256 RIRKILENGNIITIAGNGNYGFSGDNG------PATNAQLNRPNSVFVSNNEVYISDQSN 309 Query: 351 D 351 Sbjct: 310 Q 310 >gi|290986743|ref|XP_002676083.1| predicted protein [Naegleria gruberi] gi|284089683|gb|EFC43339.1| predicted protein [Naegleria gruberi] Length = 733 Score = 47.2 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 28/296 (9%), Positives = 75/296 (25%), Gaps = 18/296 (6%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY------AV 125 F + + + + +++ ++ + + N + Y + Sbjct: 17 FVSSNNEVYIADYSNHRIRKILKNGNIATIAGKGTCGFSGDNGPATNAQIYYPSSVFVSS 76 Query: 126 FGSTAVFVHKDHPPHHLLYIQDG-DKISFTFDEIKFLPPPWLGDGMIS-----GVKSNAK 179 + +H +L + P + +N Sbjct: 77 NNEVYIADQSNHRIRKILENGNIVTIAGNGIGGFSGDNGPATNAQIYYPYSVFVSSNNVV 136 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + + +I + I G S G + P N +G ++ ++++VY + Sbjct: 137 YIVDYGNNRVRKILGNGNIVTIAGNGTSGFSGDNGPATNAQLNNPVGVFVSSNNEVYIAD 196 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + + + + N N + ++ + + ++ V Sbjct: 197 QSNHRIRKILENGNIVTIAGNGTGGFGGDNGPATNAQLYIP--YSVFVSNNEVYIVDYGN 254 Query: 300 RSISVAPQSQTLF----QAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 I + + A Q PS V NN + + Sbjct: 255 NRIRKILGNGNIVTIAGNGTSGFSGDNGPATNAQLNRPSSVFVSNNEVYIADLNNH 310 >gi|225559312|gb|EEH07595.1| endochitinase [Ajellomyces capsulatus G186AR] Length = 859 Score = 47.2 bits (110), Expect = 0.008, Method: Composition-based stats. Identities = 30/343 (8%), Positives = 64/343 (18%), Gaps = 5/343 (1%) Query: 101 SPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF 160 + ++ T Y Y T D + Sbjct: 377 TDTVYPTGTDTAYPTGT--DTAYPTGTDTVYPTGTDTAYPTGTDTAYPTGTDTVY---PT 431 Query: 161 LPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN 220 G + + + +D D P Sbjct: 432 GTDTAYPTGTDTVYPTGTDTVYPTGTDTVYPTGTDTVYPTGTDTVYPTGTDTAYPTGTDT 491 Query: 221 TNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 + T + G D T + T + Sbjct: 492 VYPTGTDTAYPTGTDTVYPTGTDTVYPTGTDTVYPTGTDTVYPTGTDTAYPTGTEIVYPT 551 Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN 340 + Y D + + T++ + + G + YP+ + Sbjct: 552 DSETSYPTANPTDDYPTGYPTGTYPVSPGTVYPTAYPTDTETVYPTGTESSYPTETMYPT 611 Query: 341 NRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400 + + + + G Y T + T + + +P G Sbjct: 612 GSETVHPTNSETNYPTANPTDDYPTGYPTGTYPVGSGTVNPISTETAYPTARPTDAYPTG 671 Query: 401 EGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVG 443 L DT + + S + + Sbjct: 672 TETLCPTDTESSYPTETAYPTGSETASSTAYPSDHYPTAYPTD 714 >gi|156839191|ref|XP_001643289.1| hypothetical protein Kpol_1027p5 [Vanderwaltozyma polyspora DSM 70294] gi|156113893|gb|EDO15431.1| hypothetical protein Kpol_1027p5 [Vanderwaltozyma polyspora DSM 70294] Length = 884 Score = 46.8 bits (109), Expect = 0.009, Method: Composition-based stats. Identities = 42/450 (9%), Positives = 88/450 (19%), Gaps = 43/450 (9%) Query: 54 MQEYRDCRLDPRSNRVFSFSIPDGGYALLVFG-DKKLQIVVVRSSTKWSPALFGKTYKTP 112 + V + D L G D ++I V + T Sbjct: 67 TIRLGSLNDVNTRSSVLCMTRSDDEKYLFSAGADSLVRIWSV-GEMNGDSYIQINENATI 125 Query: 113 YTFKDNKS---LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDG 169 YT D + Y T ++ L +Q+ K + F P Sbjct: 126 YTITDIGDIFSIRYLDSLDTLFIGCQNASMLFLDNLQERIKSEDFNSDTDFERLPHRRYD 185 Query: 170 MISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYI 229 S+ + ++ + I + N I + Sbjct: 186 KFFDSNGPGGNLKSKEKIDSPLFSTSSPENLINN---CILEIPSENIISYAHNGFIYSIY 242 Query: 230 VADDKVYRSLTTG---------RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 + GD T I+ + Sbjct: 243 RLQHTFLENNDKSIVAKEFIITGGGDGLSKLWKVTKDSIGQISVDLDPEFFDNDDSVLSQ 302 Query: 281 GAVAPYYVWGDIKD--------------VSKDGRSISVAPQS-------QTLFQAGVSVV 319 P+ G + + S Sbjct: 303 TFEFPFLYCGLSDGVLKIWDLNTRQLVSTLRTPDPYDIISLSIYHNHVFAINESGITHFY 362 Query: 320 SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK 379 W +G + L+ ++ S G K Sbjct: 363 DNKFHNWDPNQGKILSSEVFERKCNVCNKPVSLLTGGNDGSLTLWNLSHLMNIGDSTENK 422 Query: 380 ALTTAVTDFSASTIHW---MHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYA 436 +++I + + +L + ++S + + + Sbjct: 423 YTEHQCIRERSNSITYYKPAVLDNDSMLDTVRELIAFQTVSQNPDTTQQMDSRRCANHLQ 482 Query: 437 CPPVSVGDCL--VFVCGVGRRIKYISGSTE 464 V G +F G + + + + Sbjct: 483 QLFVEFGASKTQIFPASTGNPVVFAQFNGD 512 >gi|296283200|ref|ZP_06861198.1| VCBS [Citromicrobium bathyomarinum JL354] Length = 1533 Score = 46.4 bits (108), Expect = 0.013, Method: Composition-based stats. Identities = 49/373 (13%), Positives = 85/373 (22%), Gaps = 13/373 (3%) Query: 86 DKKLQIVVVRSSTKWSPALFGKTYK---TPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHL 142 D L+IV T + A Y + E V + P Sbjct: 793 DSALRIVDSAGQTISANATASVIDPGSDFTYDAYLTHTFEAGGTYYIEVTNERGEMPAGS 852 Query: 143 LYIQDGDKISFTFDEIKFLPPPWLGDGMI-SGVKSNAKLSISQADTSTARI------TSD 195 Y + T + + G G V L + A T + T Sbjct: 853 SYTMNVSLTGATIPSLSAAATVYGGTGDDVYEVAGAGDLLVEYAGGGTDTVLSRVSYTLG 912 Query: 196 MKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGAT 255 I S + + + ++ L D G Sbjct: 913 ANIENLTLVSGSGAVEAAGNDLDNLLRGNAADNVIRGGAGDDILVGSGGNDAIDGGAGTD 972 Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAG 315 + + + + SG ++ + DG A + F+ Sbjct: 973 TAVFSGNRSDYTIFNIANGQVQQISGPDGVDTLFSVERLAFDDGIYALGAQAGELQFRYD 1032 Query: 316 VSVVSWFMSAWGEQEGYPSHVTFHN--NRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373 W E YP V N R G V L + G G Sbjct: 1033 QFGAGDAAGGWSSNERYPRTVADVNGDGRADLIGFASSGTFVALGQANGTFAPLQLGIAG 1092 Query: 374 CYDPTKALTTAVTDFSASTIHWMH-PFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGS 432 A A D T+ ++ ++ ++ S + ++G Sbjct: 1093 FGSADAAGGWADGDRFPRTMGDVNGDGRADIIGFGSGGTYVSYGQASGTFAAPVLALAGF 1152 Query: 433 GVYACPPVSVGDC 445 G + + Sbjct: 1153 GSADAAGGWLDNT 1165 >gi|290991612|ref|XP_002678429.1| predicted protein [Naegleria gruberi] gi|284092041|gb|EFC45685.1| predicted protein [Naegleria gruberi] Length = 992 Score = 45.7 bits (106), Expect = 0.020, Method: Composition-based stats. Identities = 55/489 (11%), Positives = 132/489 (26%), Gaps = 58/489 (11%) Query: 55 QEYRDCRLDPRSNRVFSFSIPDGGYALL---------VFGDKKL--QIVVVRSSTKWSPA 103 + S+ V S G + VF + + + S A Sbjct: 190 EYISALSTQLPSSMVIGISQSTGELYIGMSGDILVRKVFTNGTIVSIVKKDNSLVDTITA 249 Query: 104 LFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPP 163 L YT + + L+Y++ T + + + + + T + + Sbjct: 250 LTVSNSSVYYTESNRRVLQYSIENGTTTVIGGSLDIFNSNFQDNILATTATLQNTRGIAV 309 Query: 164 PWLGDGMISGVKSNAK-------LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE 216 GD S T+ + + + P Sbjct: 310 SETGDVYFSESSEFYSNGRVRKIKPDGYIVTTAGNMLDLNSGYNGDNILAVNAKLKSPES 369 Query: 217 WAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSR 276 + + + + ++ + L+ G+ G N+ ++ L+ + Sbjct: 370 VVVSNSGEVYISDTGNSRIRKILSNGQIVTVVGRGNF-----RNSPSYNGDYILAINANI 424 Query: 277 ESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE-----G 331 ++ SG + I D + S+ + + G Sbjct: 425 KNPSGILLSSTNELYIADTENYRIRKVLTN---GTIVTIAGTGSYTEDTFVDLATNIGIG 481 Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 P + N + F +K + LS+ YDP L F + Sbjct: 482 QPKALALFGNEIYF-STKSHRVKKILSNGTLIT--YAGTGIYGYDPGDVLAVNTKLFFPN 538 Query: 392 TIHWMHPFGEGVL----------VGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVS 441 + ++P G+ ++ V + ++ ++ + ++ + D + + + Sbjct: 539 GL-DVYPNGDLLIADSSNHVIRKVLTNGTVIRVAGTGTRAYNGDNILAVNAHLSEPSGIH 597 Query: 442 V--GDCLVFVCGVGRRIKYISGS---------TEQGFRFNEITQLADHLFNQRILQLVYQ 490 + ++F R++ I + G+ + L+ F + L Sbjct: 598 ILSNGEILFSDKYNYRVRKILTNGTIITIAGIGTYGYNGENLPALSTKFF--GVTGLALS 655 Query: 491 EEPHSIVWV 499 SI Sbjct: 656 PVDGSIYLA 664 >gi|327404334|ref|YP_004345172.1| hypothetical protein Fluta_2348 [Fluviicola taffensis DSM 16823] gi|327319842|gb|AEA44334.1| hypothetical protein Fluta_2348 [Fluviicola taffensis DSM 16823] Length = 818 Score = 45.3 bits (105), Expect = 0.029, Method: Composition-based stats. Identities = 28/260 (10%), Positives = 58/260 (22%), Gaps = 16/260 (6%) Query: 54 MQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY 113 F +LV D +L+++ +P PY Sbjct: 194 TTHSVPLIAFTGQTVYIGFRNNSNDKFILVIDDIELRVINNFDLEVTTPTQN------PY 247 Query: 114 TFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISG 173 T L +H + F S Sbjct: 248 TLAPANQLTTTQNLKLEAVIHNQGIQAM---------TNVALGCRVFKDGLLETTVTSSI 298 Query: 174 VKSNAKLSISQADTSTARITSDMK-IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVAD 232 + S A + S T+ TS+ FK + ++ +A Sbjct: 299 LPSLASGAASAPMTANYTPTSNGVYTFKYFPIATEADMSTSNDTILSTIPITVTDAEMAR 358 Query: 233 DKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI 292 D G G+ +++ ++ + + + + A+ G Sbjct: 359 DNGVIVGQLGIGSGTGGFMGQVFNIENTTSLKEVKVHFTRGYTGKKLATAIFNTNGSGVP 418 Query: 293 KDVSKDGRSISVAPQSQTLF 312 ++ S + Sbjct: 419 TTFLASTDTLLYIDDSARTY 438 >gi|118469414|ref|YP_886504.1| HNH endonuclease domain-containing protein [Mycobacterium smegmatis str. MC2 155] gi|118170701|gb|ABK71597.1| HNH endonuclease domain protein [Mycobacterium smegmatis str. MC2 155] Length = 544 Score = 44.5 bits (103), Expect = 0.048, Method: Composition-based stats. Identities = 11/157 (7%), Positives = 32/157 (20%), Gaps = 6/157 (3%) Query: 53 LMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTP 112 + V +F +L + L I + + + Sbjct: 130 GTRTVARIAGPGGPRMVTTFVRTPADTVMLAHTNGYLTINKATPTAETVGMFAPAETRDA 189 Query: 113 YTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMIS 172 S + + P D + + G++ Sbjct: 190 TGGPVPSSYLVKQLAPQLYVLAEVIDPR-----PDTAWPVYVDPPLHLTGAGGAPLGLLD 244 Query: 173 GVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIR 209 + + A ++ + + G ++ Sbjct: 245 SFADSVSSLANTATSAVKTA-ASATVSGAKAVGSFVK 280 >gi|290999745|ref|XP_002682440.1| predicted protein [Naegleria gruberi] gi|284096067|gb|EFC49696.1| predicted protein [Naegleria gruberi] Length = 731 Score = 44.1 bits (102), Expect = 0.063, Method: Composition-based stats. Identities = 38/403 (9%), Positives = 98/403 (24%), Gaps = 26/403 (6%) Query: 73 SIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY--------- 123 G + + +++ +++ + + Y Y+ L Y Sbjct: 107 VNDLGEVYIADTYNHRIRKILLNGTIITVAGVGSAGYSGDYSTAMQAKLNYPHGIYVKKV 166 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 G+ +GD + T + L + + I Sbjct: 167 FSNGTIITIAGNGEGDADGYGKYNGDNMLATLSSLNLPTTVALNSLNEVFIADSQNHRIR 226 Query: 184 QADTSTARIT---SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + S T + + + + P ++N +I + ++ Sbjct: 227 KVSNSGIISTVAGTGVSGYSGDGIPANTTKLNTPNGITIDSNDNIIIADRNNHRIRLISN 286 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + + + + L+ + + + I+ V +G Sbjct: 287 SSGIISTLAGNGTTGSRDEEVLATSAKLSRPADVTIGYDGELIITDTDNFVIRIVKLNGM 346 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 ++A F + + +PS + F + L+F + Sbjct: 347 ISTIAGTGFERFNGDRAT------SLSTLINHPSSMAFKDGELIFCDRSNHRVRRISKDG 400 Query: 361 GAFYDFSLDGEYGCYDPTKALTTA------VTDFSASTIHWMHPFG-EGVLVGCDTSLWL 413 D A+ V S I+ + +V + ++ Sbjct: 401 SVKTIAGNGIGGYNGDGMLAIDAQLNYPHGVASDSIGNIYISDSYNHRVRIVFTNGTIST 460 Query: 414 LSI-SLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 ++ S + S Y G+ +F+ Sbjct: 461 IAGNGNSGFNKDGIQATSSQLNYPFGIALNGNDELFISDRSNH 503 >gi|300697024|ref|YP_003747685.1| hypothetical protein RCFBP_mp10482 [Ralstonia solanacearum CFBP2957] gi|299073748|emb|CBJ53269.1| conserved exported protein of unknown function, RHS repeat [Ralstonia solanacearum CFBP2957] Length = 795 Score = 43.8 bits (101), Expect = 0.074, Method: Composition-based stats. Identities = 37/366 (10%), Positives = 85/366 (23%), Gaps = 10/366 (2%) Query: 85 GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 G + + + ++ T YT + T +L Sbjct: 75 GAAPMTVFNYDGQDRVRQVTDPRSLVTTYTVDGLGNTTRQQSPDTGTSNATYDVAGNLTR 134 Query: 145 IQDG--DKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPL 202 D + +D + + G + D SD Sbjct: 135 RTDARGKITRYRYDAVNRMTHAVFASGTPIAFTYDGGKHPEPNDIGHLTHISDESGQ--- 191 Query: 203 DKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNI 262 R K + + Y T+G S + Sbjct: 192 ---TRWRFNGFGNVVRKTQSTTANGETKKQVVAYAYGTSGSSTGHVTSMTYPSSSVIG-- 246 Query: 263 TWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWF 322 + + +A G+VA G + F + + Sbjct: 247 YSYDAGGRIAGLTLTTAHGSVALLSNIQYQPFGKPAGWTWGNGTAYTRSFDLSGRLTQFP 306 Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALT 382 + A P+ ++ N S + S G+ + + +G D + ++ Sbjct: 307 LGATSGTGATPNGLSRTVNYDAASRITAYTHADTSGSTGSSTATAANQTFGYDDQDRLIS 366 Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSV 442 + S S + + G +G + + + ++ + + + A Sbjct: 367 YLPANSSQSYSYDANGNRTGQTIGGSSYTQTVDPASNRQTASTGPTPTKNSYDAAGNQIG 426 Query: 443 GDCLVF 448 + Sbjct: 427 DGSTTY 432 >gi|300697031|ref|YP_003747692.1| hypothetical protein RCFBP_mp10489 [Ralstonia solanacearum CFBP2957] gi|299073755|emb|CBJ53276.1| conserved exported protein of unknown function [Ralstonia solanacearum CFBP2957] Length = 796 Score = 43.8 bits (101), Expect = 0.075, Method: Composition-based stats. Identities = 37/366 (10%), Positives = 85/366 (23%), Gaps = 10/366 (2%) Query: 85 GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 G + + + ++ T YT + T +L Sbjct: 75 GAAPMTVFNYDGQDRVRQVTDPRSLVTTYTVDGLGNTTRQQSPDTGTSNATYDVAGNLTR 134 Query: 145 IQDG--DKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPL 202 D + +D + + G + D SD Sbjct: 135 RTDARGKITRYRYDAVNRMTHAVFASGTPIAFTYDGGKHPEPNDIGHLTHISDESGQ--- 191 Query: 203 DKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNI 262 R K + + Y T+G S + Sbjct: 192 ---TRWRFNGFGNVVRKTQSTTANGETKKQVVAYAYGTSGSSTGHVTSMTYPSSSVIG-- 246 Query: 263 TWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWF 322 + + +A+G+VA G + F + + Sbjct: 247 YSYDAGGRIAGLTLTTANGSVALLSNIQYQPFGKPAGWTWGNGTAYTRSFDLSGRLTQFP 306 Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALT 382 + A P+ ++ N S + S G+ + + +G D + ++ Sbjct: 307 LGATSGTGATPNGLSRTVNYDAASRITAYTHADTSGSTGSSTAATANQTFGYDDQGRLIS 366 Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSV 442 S S + + G +G + + + ++ + + + A Sbjct: 367 YLPGSSSQSYSYDANGNRTGQTIGGSSYTQTVDPASNRQTASTGPTPTKNSYDAAGNQIG 426 Query: 443 GDCLVF 448 + Sbjct: 427 DGSTTY 432 >gi|290995436|ref|XP_002680301.1| predicted protein [Naegleria gruberi] gi|284093921|gb|EFC47557.1| predicted protein [Naegleria gruberi] Length = 699 Score = 43.8 bits (101), Expect = 0.076, Method: Composition-based stats. Identities = 29/278 (10%), Positives = 68/278 (24%), Gaps = 22/278 (7%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE-----YAVF 126 F + + + +++ ++ + K N L + Sbjct: 17 FVSSNNEVYIADCFNNRIRKILENGTIVTIAGNGTKGSSGDNGLATNAQLNRPYSVFVSS 76 Query: 127 GSTAVFV-HKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 + ++ +L + G+G+ N + +Q Sbjct: 77 NNEVYIADQGNNRIRKILENGNI--------------ITIAGNGIHGFSGDNGLATNAQL 122 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSG 245 T + S D+G + D+ + + S Sbjct: 123 YTPCSVFVSSNNEVYIADQGNHRIRKILENGNIVTIAGNGIHGFSGDNGLATNAQLNSSY 182 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 F S Y+ D I + + + +G + ++ G I Sbjct: 183 SVFVSSNNEVYIADYFNNRIRKILENGNIITIAGNGTHGFNGDNENGNIITIAGNGIHGF 242 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRL 343 L + + E Y + ++NNR+ Sbjct: 243 NGDNGLATNARLNHPFSVFVSSNNEVYIAD--YYNNRI 278 Score = 41.8 bits (96), Expect = 0.29, Method: Composition-based stats. Identities = 26/271 (9%), Positives = 62/271 (22%), Gaps = 36/271 (13%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE-----YAVF 126 F + + G+ +++ ++ + + N L + Sbjct: 73 FVSSNNEVYIADQGNNRIRKILENGNIITIAGNGIHGFSGDNGLATNAQLYTPCSVFVSS 132 Query: 127 GSTAVFV-HKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 + +H +L + + S Sbjct: 133 NNEVYIADQGNHRIRKILENGNI---------------------VTIAGNGIHGFSGDNG 171 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSG 245 + A++ S +F + I + N +I + + Sbjct: 172 LATNAQLNSSYSVFVSSNNEVYIADYFNNRIRKILENGNIITIAGNGTHGFNGDNENGNI 231 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 + + DN + LN S + Y I+ + ++G I++A Sbjct: 232 ITIAGNGIHGFNGDNGLATNARLNHPFSVFVSSNNEVYIADYYNNRIRKILENGNIITIA 291 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV 336 F + YP Sbjct: 292 GNGTAGFSGDSPF---------DIRTYPHIG 313 >gi|290971766|ref|XP_002668650.1| predicted protein [Naegleria gruberi] gi|284082136|gb|EFC35906.1| predicted protein [Naegleria gruberi] Length = 728 Score = 43.8 bits (101), Expect = 0.077, Method: Composition-based stats. Identities = 25/267 (9%), Positives = 69/267 (25%), Gaps = 21/267 (7%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE-----YAVF 126 F + + +G+ +++ ++ + + L + Sbjct: 17 FVSSNNEVYIADYGNHRIRKILENGNIVTIAGNGTAGFSGDNGIATKAQLNGPVGVFVSS 76 Query: 127 GSTAVFVH-KDHPPHHLLYIQDG-------------DKISFTFDEIKFLPPPWLGDGMIS 172 + +H +L + D T +++ F ++ Sbjct: 77 NNEVYIADYDNHRIRKILENGNIVIIAGKGTAGFSGDNGLATKEKLNFPRCVFVSSNNEV 136 Query: 173 GVKSNAKLSISQAD--TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + I + + I + D G + + P ++ + Sbjct: 137 YIADQINHRIRKILENGNIVTIAGNGPYGFCGDNGLATNAQLNSPAGVFVSSNNEIYIAD 196 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 D+ R + + A + DN + LN S + + Sbjct: 197 YDNHRIRKILENGNIVTIAGKGTAGFSGDNGLATKEKLNFPRCVFVSSNNEVYIADQINH 256 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVS 317 I+ + ++G +++A F Sbjct: 257 RIRKILENGNIVTIAGNGPYGFCGDNG 283 >gi|114762697|ref|ZP_01442131.1| RTX toxin, putative [Pelagibaca bermudensis HTCC2601] gi|114544607|gb|EAU47613.1| RTX toxin, putative [Roseovarius sp. HTCC2601] Length = 1769 Score = 43.4 bits (100), Expect = 0.099, Method: Composition-based stats. Identities = 28/348 (8%), Positives = 75/348 (21%), Gaps = 20/348 (5%) Query: 84 FGDKKL-QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHL 142 + Q+ + + K T D ++ +A H Sbjct: 850 LTPNYIAQVSGDDTGSVTEDTAQTTGGKLDVTDPDEGQAQFVPM-PSAAGAHGTFAVQ-- 906 Query: 143 LYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPL 202 ++T+ P + +S T +T + Sbjct: 907 ------PDGTWTYTLDNDQPAVQALTSGGRQLTDTVTVSTIDGTTQQITVTINGTDDGAQ 960 Query: 203 DKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNI 262 G ++ + + + + + F T+ + Sbjct: 961 ITGTAVGTVTEDTHLTTSGKLDVTDPDAGEAAFVPMPSAAGAHGTFTVDADGTW--SYQL 1018 Query: 263 TWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWF 322 + + + + V G ++ IS + L S Sbjct: 1019 DNSQAAVQALGPNSAPLTDTLTVTSVDGTSHVLTVT---ISGTNDAPGLTATTASATEDG 1075 Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY-GCYDPTKAL 381 G P + L ++ + L G++ F L Sbjct: 1076 AQVTGSL---PGTDVDTGDSLSYAVTGATPAGFTLDPDGSWS-FDPSNAAYQSLAEGLGL 1131 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRV 429 + + + V +++ +++ + + Sbjct: 1132 PVQIAVSVTDSAGATTASTLTITVTGTDDQPVVAGAVTLPGGPEDQTQ 1179 >gi|327183554|gb|AEA32001.1| hypothetical protein LAB52_05270 [Lactobacillus amylovorus GRL 1118] Length = 403 Score = 43.4 bits (100), Expect = 0.11, Method: Composition-based stats. Identities = 33/265 (12%), Positives = 69/265 (26%), Gaps = 7/265 (2%) Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227 D K +I+ S S+ K+F + E S + Sbjct: 9 DNYTWTFKPTVTYNIASTTASAGLSGSNRKVFDGSGVTTAQINHGGSIEVTFTYPGSTDS 68 Query: 228 YI--VADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 + + D + + + G + + I +S T ++ Sbjct: 69 SMYKLQDGDYTWNTSDHNAPKNVGIYTITLTDSGSATSEIIAKPISGVTISDNDQSKTYD 128 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL- 344 G D + +V+ + + S W+ ++ + + P++V + RL Sbjct: 129 GQAAGLDLDALSISGTDTVSGTALSDTGIQASDFDWYYASGNKLDEVPNNVGTYEARLTD 188 Query: 345 ---FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL-TTAVTDFSASTIHWMHPFG 400 + + + G T S S I + Sbjct: 189 RALAALQNANPNYSFSEVNGTIKYMINPKVATDKLGNSGTKTYNGQGTSVSDIINSVTWN 248 Query: 401 EGVLVGCDTSLWLLSISLSKGLSID 425 G LV D W+ + + Sbjct: 249 PGGLVTGDDYEWMTKNTDGTYSVMT 273 >gi|291335687|gb|ADD95292.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C139] Length = 290 Score = 43.0 bits (99), Expect = 0.12, Method: Composition-based stats. Identities = 5/89 (5%), Positives = 28/89 (31%), Gaps = 11/89 (12%) Query: 485 LQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFP 544 +++ +++++ + S + AW ++ Sbjct: 4 FKIISNSRENNVIFF--SEEGTSTLYGYKYFDNIRERKLAAWFKWTLTGTIQYHCV---- 57 Query: 545 NDNRGGTSLWMLVALSA-GEERSFTVRLN 572 +L+++V + + + ++++ Sbjct: 58 ----QDDNLFVVVRNNNKDQLLKYAIKMD 82 >gi|207341183|gb|EDZ69306.1| YOR098Cp-like protein [Saccharomyces cerevisiae AWRI1631] Length = 1076 Score = 42.6 bits (98), Expect = 0.17, Method: Composition-based stats. Identities = 38/311 (12%), Positives = 74/311 (23%), Gaps = 12/311 (3%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 SF P P LG + + +K + S ST + Sbjct: 765 SNSPTSFFDGSASSTPIPVLGKPTDATGDTTSKSAFSFGTASTNGTNASANSTSFSFNAP 824 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G TN + + D+ S T +G FG+S T + Sbjct: 825 ATGNGTTTASNTSGTNIAGTFNVGKPDQSIASGNTNGAGSAFGFSSSGTAATGAASNQSS 884 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 ++ + + + +K + + + F + + + Sbjct: 885 FNFGNNGAGGLNPFTSATSST-NANAGLFNKPPSTNAQNINVPSAFNFTGNNSTPGGGSV 943 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 G + T +F+GS SF F+ T Sbjct: 944 FNMNGNTNANT------VFAGSNNQPHQSQTPSFNTNSSFTPSTVPNINFSGLNGGITNT 997 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + + S ++ S + G P +G Sbjct: 998 ATNTLRPSDIFGANA-----ASGSNSNVTNPSSIFGGAGGVPTTSFGQPQSAPNQMGMGT 1052 Query: 447 VFVCGVGRRIK 457 +G + Sbjct: 1053 NNGMSMGGGVM 1063 >gi|290975761|ref|XP_002670610.1| predicted protein [Naegleria gruberi] gi|284084171|gb|EFC37866.1| predicted protein [Naegleria gruberi] Length = 308 Score = 42.6 bits (98), Expect = 0.18, Method: Composition-based stats. Identities = 23/270 (8%), Positives = 61/270 (22%), Gaps = 21/270 (7%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE-----YAVF 126 F + + F + +++ ++ + + N + Sbjct: 17 FVSSNNEVYIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSS 76 Query: 127 GSTAVFVH-KDHPPHHLLYIQDG----DKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 + +H +L + + F S S+ Sbjct: 77 KNEVYITDYSNHRIRKILENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEV 136 Query: 182 ISQA-----------DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + I + D G + + P ++ + Sbjct: 137 YIADFCNHRIRKILENGNIVTIAGNGNYGFSGDNGPATNAQFNYPCSVFVSSKNEVYITD 196 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 + R + + + + DN L S S + Sbjct: 197 YSNHRIRKILENGNIITIAGNGTVGFSGDNGPATNAQLYNPSSVFVSSNNEVYFTDQHNN 256 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 I+ + ++G I++A F + Sbjct: 257 RIRKILENGNIITIAGNGNYGFSGDNGPAT 286 >gi|325114611|emb|CBZ50167.1| conserved hypothetical protein [Neospora caninum Liverpool] Length = 1314 Score = 42.2 bits (97), Expect = 0.21, Method: Composition-based stats. Identities = 46/400 (11%), Positives = 90/400 (22%), Gaps = 36/400 (9%) Query: 148 GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS 207 D + T + F P P + +S+S A + + Sbjct: 128 WDSETATPVKTIFRPHPTGVQAVDITPDGRFIVSLSAAIPRELIVEAGNGPNPGSKGATH 187 Query: 208 IRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITV 267 E + T S + S R + + + Sbjct: 188 GSGKGEQGEDGETTKSSKTDGREGGANERTATDANASTARSSDRSSLESSERTGRSTQST 247 Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS--- 324 N S+ + + V +V W Sbjct: 248 WNGQSELDASDGTRSPQDPQVSSASSQERGSFGKRQTYQSV--------AVWDWREPGNA 299 Query: 325 ----AWGEQEGYPSHVTFHNN--RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPT 378 A V F++ + + K + ++ F Sbjct: 300 PICVAVIATPDLQHSVLFNSTDVHEILTNGKRRVFFWFWEETSDYFHFYSPALQAKDFKQ 359 Query: 379 KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP 438 K + F ++ +G +V W LS+ + D RR Sbjct: 360 KIGDFTRSIFLPNSTKAATGTVDGDVVL-----WDLSLIVDGLSRPDERRAVRILNLNRA 414 Query: 439 PVSV-----GDCLVFVCGVGRRIKYISGSTE-----QGFRFNEITQLADHLFNQRILQLV 488 V+ + +V G I + GF I ++ R + Sbjct: 415 AVTFLFVHDENWIVAGFADG-VIGFYDFQFRISRWFDGFNAGPINSISFDYMPSRGYLKL 473 Query: 489 YQEEPHSIV---WVVLEPKDNSFPRLLGCRFSAEGEGDFA 525 + H ++ W + +L+G F + Sbjct: 474 WNYHTHRLIVNHWFEKLSPNVGDGKLMGVGFGNGQVKIYG 513 >gi|197302833|ref|ZP_03167885.1| hypothetical protein RUMLAC_01562 [Ruminococcus lactaris ATCC 29176] gi|197298070|gb|EDY32618.1| hypothetical protein RUMLAC_01562 [Ruminococcus lactaris ATCC 29176] Length = 2612 Score = 42.2 bits (97), Expect = 0.23, Method: Composition-based stats. Identities = 32/448 (7%), Positives = 84/448 (18%), Gaps = 29/448 (6%) Query: 37 SRNLIPLRYGPLVSMPLMQEYRDC-RLDPRSNR---VFSFSIPDGGYALLVFGDKKLQIV 92 N G + L D + Sbjct: 1676 IENSTVTAKGG-NLRSGTDYIPGIGKNSSGRASEIGKIQILNSTVESFRLEEKDGTNYVY 1734 Query: 93 V---------VRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLL 143 + + ++ + L Sbjct: 1735 DKLHTKELPGIPAENITICGSTVNGKTIDHSPDEYGKCALCDKYDLGYCYEHGLLTLEGL 1794 Query: 144 YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLD 203 D + + A I + +T + F Sbjct: 1795 TDCAHDGSEKKLTGLSHQTGENKTKQLTENTDYTA---IYSNNVHPYTLTPGDEGFDSKK 1851 Query: 204 KGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNIT 263 + G ++I A + G + Sbjct: 1852 APKVTLYGTGNYCGKAEHYFTISENAAAAPTITTDTLPGGKVGEAYSQTLSATGTTPITW 1911 Query: 264 WITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFM 323 I NL + + + A+G ++ + + + + + + + + Sbjct: 1912 GIDSGNLPAGLTLDEATGEISGTPTAAGTASFTVKAENSAGSDTKELSITITKAAPAEYT 1971 Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383 + G + + SGS + G ++ G ++ Sbjct: 1972 VRFNANGGGGTMA----DVTGVSGSYTLPSCGFTEPEGKQFNGWSTSADGSV-----ISG 2022 Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVG 443 + S+ T + + + + + ++ + A Sbjct: 2023 TTYEVSSDTTFYAIWESKEYSIIVTDGKATIGAGSEISKAAQGTTITLTANAAPDGKVFD 2082 Query: 444 DCLVFVCGVGRRIKYISGSTEQGFRFNE 471 +V G + S F + Sbjct: 2083 K---WVVESGNTTLEDANSETTTFIMPD 2107 >gi|190407430|gb|EDV10697.1| nucleoporin NUP1 [Saccharomyces cerevisiae RM11-1a] Length = 1076 Score = 42.2 bits (97), Expect = 0.23, Method: Composition-based stats. Identities = 37/311 (11%), Positives = 74/311 (23%), Gaps = 12/311 (3%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 SF P P LG + + +K + S +T + Sbjct: 765 SNSPTSFFDGSASSTPIPVLGKPTDATGDTTSKSAFSFGTANTNGTNASANSTSFSFNAP 824 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G TN + + D+ S T +G FG+S T + Sbjct: 825 ATGNGTTTASNTSGTNIAGTFNVGKPDQSIASGNTNGAGSAFGFSSSGTAATGAASNQSS 884 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 ++ + + + +K + + + F + + + Sbjct: 885 FNFGNNGAGGLNPFTSATSST-NANAGLFNKPPSTNAQNINVPSAFNFTGNNSTPGGGSV 943 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 G + T +F+GS SF F+ T Sbjct: 944 FNMNGNTNANT------VFAGSNNQPHQSQTPSFNTNSSFTPSTVPNINFSGLNGGITNT 997 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + + S ++ S + G P +G Sbjct: 998 ATNTLRPSDIFGANA-----ASGSNSNVTNPSSIFGGAGGVPTTSFGQPQSAPNQMGMGT 1052 Query: 447 VFVCGVGRRIK 457 +G + Sbjct: 1053 NNGMSMGGGVM 1063 >gi|323335507|gb|EGA76792.1| Nup1p [Saccharomyces cerevisiae Vin13] Length = 1076 Score = 42.2 bits (97), Expect = 0.24, Method: Composition-based stats. Identities = 37/311 (11%), Positives = 74/311 (23%), Gaps = 12/311 (3%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 SF P P LG + + +K + S +T + Sbjct: 765 SNSPTSFFDGSASSTPIPVLGKPTDATGDTTSKSAFSFGTANTNGTNASANSTSFSFNAP 824 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G TN + + D+ S T +G FG+S T + Sbjct: 825 ATGNGTTTASNTSGTNIAGTFNVGKPDQSIASGNTNGAGSAFGFSSSGTAATGAASNQSS 884 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 ++ + + + +K + + + F + + + Sbjct: 885 FNFGNNGAGGLNPFTSATSST-NANAGLFNKPPSTNAQNINVPSAFNFTGNNSTPGGGSV 943 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 G + T +F+GS SF F+ T Sbjct: 944 FNMNGNTNANT------VFAGSNNQPHQSQTPSFNTNSSFTPSTVPNINFSGLNGGITNT 997 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + + S ++ S + G P +G Sbjct: 998 ATNTLRPSDIFGANA-----ASGSNSNVTNPSSIFGGAGGVPTTSFGQPQSAPNQMGMGT 1052 Query: 447 VFVCGVGRRIK 457 +G + Sbjct: 1053 NNGMSMGGGVM 1063 >gi|256272978|gb|EEU07942.1| Nup1p [Saccharomyces cerevisiae JAY291] Length = 1076 Score = 42.2 bits (97), Expect = 0.24, Method: Composition-based stats. Identities = 37/311 (11%), Positives = 74/311 (23%), Gaps = 12/311 (3%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 SF P P LG + + +K + S +T + Sbjct: 765 SNSPTSFFDGSASSTPIPVLGKPTDATGDTTSKSAFSFGTANTNGTNASANSTSFSFNAP 824 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G TN + + D+ S T +G FG+S T + Sbjct: 825 ATGNGTTTASNTSGTNIAGTFNVGKPDQSIASGNTNGAGSAFGFSSSGTAATGAASNQSS 884 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 ++ + + + +K + + + F + + + Sbjct: 885 FNFGNNGAGGLNPFTSATSST-NANAGLFNKPPSTNAQNINVPSAFNFTGNNSTPGGGSV 943 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 G + T +F+GS SF F+ T Sbjct: 944 FNMNGNTNANT------VFAGSNNQPHQSQTPSFNTNSSFTPSTVPNINFSGLNGGITNT 997 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + + S ++ S + G P +G Sbjct: 998 ATNTLRPSDIFGANA-----ASGSNSNVTNPSSIFGGAGGVPTTSFGQPQSAPNQMGMGT 1052 Query: 447 VFVCGVGRRIK 457 +G + Sbjct: 1053 NNGMSMGGGVM 1063 >gi|259149580|emb|CAY86384.1| Nup1p [Saccharomyces cerevisiae EC1118] Length = 1076 Score = 42.2 bits (97), Expect = 0.24, Method: Composition-based stats. Identities = 37/311 (11%), Positives = 74/311 (23%), Gaps = 12/311 (3%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 SF P P LG + + +K + S +T + Sbjct: 765 SNSPTSFFDGSASSTPIPVLGKPTDATGDTTSKSAFSFGTANTNGTNASANSTSFSFNAP 824 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G TN + + D+ S T +G FG+S T + Sbjct: 825 ATGNGTTTASNTSGTNIAGTFNVGKPDQSIASGNTNGAGSAFGFSSSGTAATGAASNQSS 884 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 ++ + + + +K + + + F + + + Sbjct: 885 FNFGNNGAGGLNPFTSATSST-NANAGLFNKPPSTNAQNINVPSAFNFTGNNSTPGGGSV 943 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 G + T +F+GS SF F+ T Sbjct: 944 FNMNGNTNANT------VFAGSNNQPHQSQTPSFNTNSSFTPSTVPNINFSGLNGGITNT 997 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + + S ++ S + G P +G Sbjct: 998 ATNTLRPSDIFGANA-----ASGSNSNVTNPSSIFGGAGGVPTTSFGQPQSAPNQMGMGT 1052 Query: 447 VFVCGVGRRIK 457 +G + Sbjct: 1053 NNGMSMGGGVM 1063 >gi|83746022|ref|ZP_00943077.1| Hypothetical Protein RRSL_04046 [Ralstonia solanacearum UW551] gi|83727205|gb|EAP74328.1| Hypothetical Protein RRSL_04046 [Ralstonia solanacearum UW551] Length = 757 Score = 42.2 bits (97), Expect = 0.25, Method: Composition-based stats. Identities = 43/366 (11%), Positives = 83/366 (22%), Gaps = 11/366 (3%) Query: 92 VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDG--D 149 + ++ T YT + T +L D Sbjct: 82 FNYDGQDRVRQVTDPRSLVTTYTVDGLGNTTRQQSPDTGTTNATYDVAGNLTRRTDARGK 141 Query: 150 KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIR 209 + +D + + G + D SD R Sbjct: 142 ITRYRYDAVNRMTHAVFASGTPIAFTYDGGKHPEPNDIGHLTHISDESGQ------TRWR 195 Query: 210 LGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLN 269 K + + Y T+G S Sbjct: 196 FNGFGNVVRKTQSTTANGETKKQVVAYAYGTSGSSTGHI--ISMTYPSSSVIGYSYDAGG 253 Query: 270 LSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQ 329 + + +A G+VA G + F + + + A Sbjct: 254 RIAGLTLTTAHGSVALLSNIQYQPFGKPAGWTWGNGTAYTRSFDLSGRLTQFPLGATSGT 313 Query: 330 EGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFS 389 P+ ++ N S + S G+ + + +G D + ++ + S Sbjct: 314 GATPNGLSRTVNYDAASRITAYTHTDTSGSTGSSTATAANQTFGYDDQGRLISYLPANSS 373 Query: 390 ASTIHWMHPFGEGVLV-GCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVF 448 S + + G + G + + S S + S + S A G Sbjct: 374 QSYSYDANGNRTGQTIGGSSYTQTVDSASNRQTASTGPTATTNSYDAAGNQTGDGSTTYS 433 Query: 449 VCGVGR 454 GR Sbjct: 434 YSDRGR 439 >gi|326912092|ref|XP_003202388.1| PREDICTED: activating transcription factor 7-interacting protein 1-like [Meleagris gallopavo] Length = 1086 Score = 42.2 bits (97), Expect = 0.27, Method: Composition-based stats. Identities = 21/214 (9%), Positives = 44/214 (20%), Gaps = 11/214 (5%) Query: 132 FVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191 V HP + P +S S A Sbjct: 688 AVSTTHPVAQTTRTSLPTVGTSGLHNSTSSRGPIHMKIPLSAFNSTAPTEPPTITAPRVE 747 Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251 + R+ + + + + + D+ S + ++ + Sbjct: 748 NQTSRPPTDSSANKRTAEGTTQSGKVTGSDSGGVIDLTLDDEDDVSSQAEAKKQNQTPPT 807 Query: 252 KGA-----------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + T + + S+T+ A V + Sbjct: 808 AQSIPAQPLSRPLQTLQPNPLQQTGVPTSGPSQTTIHVLPTAPTTVNVTHRPVTQTAAKL 867 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPS 334 I P + + + S G PS Sbjct: 868 PIPRTPSNHQVVYTTIPAPPAQNSVRGAVMPSPS 901 >gi|290447212|emb|CBK19441.1| C. elegans protein F20C5.2e, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 1124 Score = 42.2 bits (97), Expect = 0.27, Method: Composition-based stats. Identities = 18/182 (9%), Positives = 52/182 (28%), Gaps = 10/182 (5%) Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344 + V S ++ + S + + G P+ F + RL+ Sbjct: 588 EWNVNAFQSTSSNSSTPLNNTIEVNEDGVFTRSSGADSGVSVSGGNGTPATSQFLDKRLV 647 Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYG--CYDPTKALTTAVTDFSASTIHWMHPF-GE 401 + +S+ ++ ++ + + A+ F GE Sbjct: 648 ATPGCRRPMSMC-------ERMLVETAREQFGAQRRPPISGSGSFVEATIPEETIRFCGE 700 Query: 402 GVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG 461 V+V ++ ++ S + + + + +++ V V + + + Sbjct: 701 NVVVFSALERFVPEVTDSDPSTFSNSMMMSARRPSIENLTIDASKVLVPILNQSTMILKY 760 Query: 462 ST 463 Sbjct: 761 VF 762 >gi|71986820|ref|NP_001023139.1| Kinesin-Like Protein family member (klp-11) [Caenorhabditis elegans] gi|21615432|emb|CAD36488.1| C. elegans protein F20C5.2b, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 1130 Score = 42.2 bits (97), Expect = 0.27, Method: Composition-based stats. Identities = 18/182 (9%), Positives = 52/182 (28%), Gaps = 10/182 (5%) Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344 + V S ++ + S + + G P+ F + RL+ Sbjct: 574 EWNVNAFQSTSSNSSTPLNNTIEVNEDGVFTRSSGADSGVSVSGGNGTPATSQFLDKRLV 633 Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYG--CYDPTKALTTAVTDFSASTIHWMHPF-GE 401 + +S+ ++ ++ + + A+ F GE Sbjct: 634 ATPGCRRPMSMC-------ERMLVETAREQFGAQRRPPISGSGSFVEATIPEETIRFCGE 686 Query: 402 GVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG 461 V+V ++ ++ S + + + + +++ V V + + + Sbjct: 687 NVVVFSALERFVPEVTDSDPSTFSNSMMMSARRPSIENLTIDASKVLVPILNQSTMILKY 746 Query: 462 ST 463 Sbjct: 747 VF 748 >gi|255070605|ref|XP_002507384.1| predicted protein [Micromonas sp. RCC299] gi|226522659|gb|ACO68642.1| predicted protein [Micromonas sp. RCC299] Length = 937 Score = 41.8 bits (96), Expect = 0.33, Method: Composition-based stats. Identities = 29/293 (9%), Positives = 62/293 (21%), Gaps = 32/293 (10%) Query: 58 RDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 D S + G LLV + + TP + Sbjct: 51 ATLTDDGGGKHGAILSFSEDGSRLLVGSN----FYPYHVDVYEWQSGSSAW--TPLGSRI 104 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISF-----------TFDEIKFLPPPWL 166 + + + D + + + + ++ + Sbjct: 105 SPPQGFIQSA----CLSGDGKVVAISDYDNDGIVGWWTVTVYHYASGSWQRVGSDILGSS 160 Query: 167 GDGMISGVKSNAKLSISQADTSTARITS-DMKIFKPLDKGRSI-------RLGCHPPEWA 218 +G ++ V ++ + + ++S D F GR L W Sbjct: 161 SEGYVAKVSLSSDGKVLAIGNNDQTLSSYDSTAFNATRTGRVRIYQWPASDLTASGVAWT 220 Query: 219 KNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRES 278 + V+ + G + G I T S Sbjct: 221 QMGETIEAWSTVSGTTD--DFSFGPYSRKVYADTGTLSGDGKRIAVFTPDGYSQNGYVYE 278 Query: 279 ASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS-VVSWFMSAWGEQE 330 + +++ S + + V W AW Sbjct: 279 WKSSSWSVVGDSITLSLAESTVSAASVSYDGNVVAGSYGYVYKWSSGAWSSIR 331 >gi|61097891|ref|NP_001012831.1| activating transcription factor 7-interacting protein 1 [Gallus gallus] gi|82233722|sp|Q5ZIE8|MCAF1_CHICK RecName: Full=Activating transcription factor 7-interacting protein 1; AltName: Full=MBD1-containing chromatin-associated factor 1 gi|53136222|emb|CAG32495.1| hypothetical protein RCJMB04_27g4 [Gallus gallus] Length = 1085 Score = 41.8 bits (96), Expect = 0.34, Method: Composition-based stats. Identities = 12/169 (7%), Positives = 34/169 (20%) Query: 132 FVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191 V HP + P +S S A Sbjct: 687 AVSTTHPVAQTTRTSLPTVGTSGLHNSTSSRGPIHMKIPLSAFNSTAPTEPPTITAPRVE 746 Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251 + R+ + + + + + D+ S + ++ + Sbjct: 747 NQTSRPPTDSSANKRTAEGPTQSVKVTGSDSGGVIDLTLDDEDDVSSQAEAKKQNQTAST 806 Query: 252 KGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + + + + + + SG + + Sbjct: 807 AQSIPTQPLSRPLPPLQPNPLQQTGVPTSGPSQTTIHVLPTAPTTVNVT 855 >gi|326806946|tpe|CBL80809.2| TPA: mucin-5B [Bos taurus] Length = 6724 Score = 41.4 bits (95), Expect = 0.37, Method: Composition-based stats. Identities = 22/230 (9%), Positives = 52/230 (22%), Gaps = 4/230 (1%) Query: 102 PALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF- 160 + TP + S+ ST V ++ Sbjct: 5227 STATTERVSTPTSVTGLSSMVTTERTSTPTSVPGPSSTATTERTSTPTSVTGPSSTATTE 5286 Query: 161 -LPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAK 219 + P G S + + + ++ +T + G S + + + Sbjct: 5287 RVSTPTSVPGSSSTATTERTSTHTSVTVPSSTVTMERTSTSTSVTGPSSTVTTE--KVST 5344 Query: 220 NTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESA 279 T+ + + V + V + + +T + S + + Sbjct: 5345 PTSVTGPSSTVTTEGVSTPTSVTGPSSTATTERTSTPTSVTGPSSTVTTEGVSTPTSVTG 5404 Query: 280 SGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQ 329 + A V+ + + S G S Sbjct: 5405 PSSTATTERTSTPTSVTGSSSTATTERVSTPTSVMGPSSTVTTERVSTPT 5454 >gi|255305655|ref|ZP_05349827.1| toxin A [Clostridium difficile ATCC 43255] gi|144926|gb|AAA23283.1| toxin A [Clostridium difficile] Length = 2710 Score = 41.4 bits (95), Expect = 0.42, Method: Composition-based stats. Identities = 45/496 (9%), Positives = 104/496 (20%), Gaps = 41/496 (8%) Query: 71 SFSIPDGGYALL-VF-GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 F + G L VF G + ++ + Y++ + + K + Sbjct: 1882 HFYFNNDGVMQLGVFKGPDGFEYFAPANTQNNNIEGQAIVYQSKFLTLNGKKYYFDNNSK 1941 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 + + ++ + + + S +++ + Sbjct: 1942 AVT----GWRIINNEKYYFNPNNAIAAVGLQVIDNNKYYFNPDTAIISKGWQTVNGSRYY 1997 Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG---RSG 245 T+ G+ + S G A Y + G Sbjct: 1998 FDTDTAIAFNGYKTIDGKHFYFDSDCVVKIGVFSTSNGFEYFAPANTYNNNIEGQAIVYQ 2057 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 +F G Y DNN +T + W I + + Sbjct: 2058 SKFLTLNGKKYYFDNNSKAVTGWQTIDSKKYYFNTNTAEAATGWQTIDGKKYYFNTNTAE 2117 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-TFHNNRLLFSGSKGDELSVYLSSFGAFY 364 G + + S T N + + + G F Sbjct: 2118 ------AATGWQTIDGKKYYFNTNTAIASTGYTIINGKHFYFNTDGIMQIGVFKGPNGFE 2171 Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG-----CDTSLWLLSISLS 419 F+ +A+ + + + + G + + +++ Sbjct: 2172 YFAPANTDANNIEGQAILYQNEFLTLNGKKYYFGSDSKAVTGWRIINNKKYYFNPNNAIA 2231 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHL 479 Y G + F N +++ + Sbjct: 2232 AIHLCTINNDKYYFSYDGILQ-----------NGYITIERNNFY---FDANNESKMVTGV 2277 Query: 480 FNQR----ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 F + ++ F + + W I K Sbjct: 2278 FKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDNDSKAVTGW--QTIDGKK 2335 Query: 536 YVLSAASFPNDNRGGT 551 Y + + T Sbjct: 2336 YYFNLNTAEAATGWQT 2351 >gi|171691236|ref|XP_001910543.1| hypothetical protein [Podospora anserina S mat+] gi|170945566|emb|CAP71679.1| unnamed protein product [Podospora anserina S mat+] Length = 944 Score = 41.4 bits (95), Expect = 0.45, Method: Composition-based stats. Identities = 33/341 (9%), Positives = 68/341 (19%), Gaps = 21/341 (6%) Query: 94 VRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISF 153 +S + + PY+ ++A G+ AV S Sbjct: 153 PANSPANTVVPITISTDLPYSTSQVVPGDFAQIGTVAVGSSPTTAAATDGRPNPLRSEST 212 Query: 154 TFD-EIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG------- 205 + + + G + + K T T Sbjct: 213 ATEPPLTQPSSNFADTGTQPAIGDSGKFGQDGTQTITEAAPDANPAGFIGIVLPSTTLDE 272 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 + ++ V T T + + Sbjct: 273 VVSTVTKETTIIGVPATTTVIGVTTESGLVLSFTETRTVDQVVTLVPSPTTIFNVVTAVS 332 Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325 S G + S + +++ + TL VV Sbjct: 333 PSFVTLSVEILSDDDGTPTVTVINTPPPVFSPEVITVTDSRGVPTLTVTTDVVVPPRTKV 392 Query: 326 WGEQEGYP-SHVTFHNNRLLFSGSKGDELS----VYLSSFGAFYDFSLDGEYGCYDPTKA 380 +G P + +T + ++S F F L Sbjct: 393 VTNFQGVPTATITEFP---TVPTDTPKPQAEVSVYFISRAQYFVGFFLPTILAVMLTIPI 449 Query: 381 LTTAVTDFSASTIHWM-----HPFGEGVLVGCDTSLWLLSI 416 + H + P E + + ++S Sbjct: 450 RMIDMAAKQYQPWHALTQRMGVPAEESLCLRTGGFHGIVSS 490 >gi|242019932|ref|XP_002430412.1| DNA-binding protein Ewg, putative [Pediculus humanus corporis] gi|212515542|gb|EEB17674.1| DNA-binding protein Ewg, putative [Pediculus humanus corporis] Length = 526 Score = 41.1 bits (94), Expect = 0.47, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 49/193 (25%), Gaps = 2/193 (1%) Query: 111 TPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGM 170 T + ++ + V + P L I + D P L DG Sbjct: 287 TKVIAAAQAQITFSPTHNALAQVQTSYAPAVLQTISNPDGTVSIIQVDPNNPIITLPDGT 346 Query: 171 ISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + V+ A + SQ D + T ++ + G S+ + + A Sbjct: 347 TAQVQGVATIHASQGDGTQTVHT--VQTIQDSVTGESVAVDLNNVTEATLNQDGQIILTG 404 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 D Y +G S T V + + V P G Sbjct: 405 EDGHGYPVSVSGMITVPVSASMYQTVVANIQHLTQASDGTMQVVTPVVQVPKVEPSNENG 464 Query: 291 DIKDVSKDGRSIS 303 +I Sbjct: 465 VETITVTSSGNIV 477 >gi|1351266|sp|P16154|TOXA_CLODI RecName: Full=Toxin A gi|40441|emb|CAA36094.1| unnamed protein product [Clostridium difficile] gi|1770135|emb|CAA63564.1| tcdA [Clostridium difficile] Length = 2710 Score = 41.1 bits (94), Expect = 0.48, Method: Composition-based stats. Identities = 46/496 (9%), Positives = 105/496 (21%), Gaps = 41/496 (8%) Query: 71 SFSIPDGGYALL-VF-GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 F + G L VF G + ++ + Y++ + + K + Sbjct: 1882 HFYFNNDGVMQLGVFKGPDGFEYFAPANTQNNNIEGQAIVYQSKFLTLNGKKYYFDNNSK 1941 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 + + ++ + + + S +++ + Sbjct: 1942 AVT----GWRIINNEKYYFNPNNAIAAVGLQVIDNNKYYFNPDTAIISKGWQTVNGSRYY 1997 Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG---RSG 245 T+ G+ + S G A Y + G Sbjct: 1998 FDTDTAIAFNGYKTIDGKHFYFDSDCVVKIGVFSTSNGFEYFAPANTYNNNIEGQAIVYQ 2057 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 +F G Y DNN +T L + W I + + Sbjct: 2058 SKFLTLNGKKYYFDNNSKAVTGLQTIDSKKYYFNTNTAEAATGWQTIDGKKYYFNTNTAE 2117 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-TFHNNRLLFSGSKGDELSVYLSSFGAFY 364 G + + S T N + + + G F Sbjct: 2118 ------AATGWQTIDGKKYYFNTNTAIASTGYTIINGKHFYFNTDGIMQIGVFKGPNGFE 2171 Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG-----CDTSLWLLSISLS 419 F+ +A+ + + + + G + + +++ Sbjct: 2172 YFAPANTDANNIEGQAILYQNEFLTLNGKKYYFGSDSKAVTGWRIINNKKYYFNPNNAIA 2231 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHL 479 Y G + F N +++ + Sbjct: 2232 AIHLCTINNDKYYFSYDGILQ-----------NGYITIERNNFY---FDANNESKMVTGV 2277 Query: 480 FNQR----ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 F + ++ F + + W I K Sbjct: 2278 FKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDNDSKAVTGW--QTIDGKK 2335 Query: 536 YVLSAASFPNDNRGGT 551 Y + + T Sbjct: 2336 YYFNLNTAEAATGWQT 2351 >gi|223935789|ref|ZP_03627704.1| NHL repeat containing protein [bacterium Ellin514] gi|223895390|gb|EEF61836.1| NHL repeat containing protein [bacterium Ellin514] Length = 755 Score = 41.1 bits (94), Expect = 0.52, Method: Composition-based stats. Identities = 42/408 (10%), Positives = 91/408 (22%), Gaps = 25/408 (6%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK-----DNKSLEYAVFGST 129 DG + G+ ++++ S G T T + A G+ Sbjct: 217 SDGNIYVADTGNGTIRVIPPGGSVTTLAGSPGNYGSTNGTGSAAQFYQPMGVAVAANGTV 276 Query: 130 AVFVHKDHPPHHLL-------------YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 V + +H + D G + Sbjct: 277 YVADNLNHTIRAVTSGGVVTTLAGLAGNYGSKDGTGSNARFYAPQGVAVSGSTVFVVDTG 336 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 N + + + + I G S + P A + + ++ + + Sbjct: 337 NGTIRQISSGGAVTTLAGSASIGNADGTGGSAKFYW-PKGTAVDASGNVFVSDTFNHTIR 395 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITV----LNLSSKTSRESASGAVAPYYVWGDI 292 + G G + + + + + + A A Sbjct: 396 KITAAGTVSTLAGTAGSSGTNNGVGGGAQFYAPQGIAVDTGGNAYVADTANNVIRKVTSG 455 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 V+ + V Q ++ G Y S H R + G Sbjct: 456 GTVTTLAGTAGVEGQGDGTGSNAQFSGPQAVALDGAANVYVSDTGNHTIRKISPGGAVTT 515 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 + + G + AV S + + D S+ Sbjct: 516 FAGFPGHPGNLDSNMDNNGTNTARFYSPSGLAVDS-SGNVYVADTGNHTIRKITADGSVS 574 Query: 413 LLSISLSKGLSID-FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI 459 L+ + D R + + L + ++ + Sbjct: 575 TLAGLPGVWGNADGTNRDARFFQPEGISIDSQGNLFVMDSGNHTMRML 622 >gi|119962248|ref|YP_948356.1| hypothetical protein AAur_2638 [Arthrobacter aurescens TC1] gi|119949107|gb|ABM08018.1| conserved hypothetical protein [Arthrobacter aurescens TC1] Length = 282 Score = 41.1 bits (94), Expect = 0.52, Method: Composition-based stats. Identities = 18/197 (9%), Positives = 41/197 (20%), Gaps = 12/197 (6%) Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + F + E G P VT + RL + + G + S Sbjct: 27 VKGRFDVVTHDEPFVRIEISEITGDPLTVTLVDGRLEVRHQLQGPQGWFRNLMGTVNNTS 86 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFR 427 + + S + + + L + ++ Sbjct: 87 SNAVIVGIALPSGVDVEAGTVSGDGMVSGISGRTRLNTVSGSVLADSTSGELHVNTVSGE 146 Query: 428 RVSGSGVYACPPVSVGDCLVFVC-----GVGRRIKYISGSTEQ-------GFRFNEITQL 475 ++ + SV + +S + ++T Sbjct: 147 VIARNHDGVLTAKSVSGEVTASGKFKNVRASTVSGDLSFDLQDYTNDLGANSVSGDLTIR 206 Query: 476 ADHLFNQRILQLVYQEE 492 H I+ Sbjct: 207 LPHDVGLDIVAKSASGT 223 >gi|20090615|ref|NP_616690.1| cell surface protein [Methanosarcina acetivorans C2A] gi|19915655|gb|AAM05170.1| cell surface protein [Methanosarcina acetivorans C2A] Length = 906 Score = 41.1 bits (94), Expect = 0.53, Method: Composition-based stats. Identities = 32/309 (10%), Positives = 80/309 (25%), Gaps = 14/309 (4%) Query: 98 TKWSPALFGKTYKTPY------TFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKI 151 T + KT Y + A + Sbjct: 372 TVTNDGGSDSEVKTDYITVSESSTPTEPEPVAAFTADVTNGTVPLTVNFTDQSTEAPTSW 431 Query: 152 SFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLG 211 ++ FD + ++++ A+ + + + Sbjct: 432 AWDFDNDGTVDSTEQNPSYTYTSAGTYTVNLTVANAEGSDSEVKIDYITVSESSTPTEPE 491 Query: 212 CHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 A TN ++ + D +S + S + ++ + T+ + N + Sbjct: 492 PVAAFIADVTNGTVPLTVNFTD---QSTGSPTSWLWDFGDNTSATEQNPSHTYNSAGNYT 548 Query: 272 SKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG 331 + S SG + V D VS+ P + V ++ + G Sbjct: 549 VNLTVISESGNSSE--VKADYITVSESSTPTEPEPVAAFTADVTNGTVPLTVNFTDQSTG 606 Query: 332 YPSHVTF-HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP--TKALTTAVTDF 388 P+ + +N ++ + Y + + ++ E G + +T + Sbjct: 607 MPTSWAWDFDNDGNMDSTEQNPSYTYTAEGNYTVNLTVSSEVGSDSEVKVEYITVTDSST 666 Query: 389 SASTIHWMH 397 + + Sbjct: 667 TPEARPDLI 675 >gi|291231773|ref|XP_002735837.1| PREDICTED: egg bindin receptor 1-like [Saccoglossus kowalevskii] Length = 1328 Score = 41.1 bits (94), Expect = 0.56, Method: Composition-based stats. Identities = 36/374 (9%), Positives = 74/374 (19%), Gaps = 25/374 (6%) Query: 95 RSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS--TAVFVHKDHPPHHLLYIQDGDKIS 152 S T + D+ L+ +S Sbjct: 351 SGSLFPIGVTTVTYTATDASSNTALCTFVVTVTDIEVPFVACPDNIEPPLVTDASTAFVS 410 Query: 153 FTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGC 212 ++ L + S + SD Sbjct: 411 WSPPTATDNSLAVLTESTNYATPSGWFPIGTTTVYYNFTDPSDNTASCAFQITVIDLQRP 470 Query: 213 HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSS 272 + G + + T S S Sbjct: 471 KITYCPSDIVGQTGDSSIEVSWTVPTATDNSGEVPAITSNHDPPYDCPLGVTNVEYIFSD 530 Query: 273 KTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGY 332 SG + D+ + ++ + V+W + + G Sbjct: 531 G------SGNTIACSFTVTVDDIGPPTVTNCPDDITEATSSSKTIAVTWSEPSATDNSGI 584 Query: 333 PSHVTFHNNRLLFSGS--KGDELSVYLSSFGAFYDF-------SLDGEYGCYDPTKALTT 383 P V R G +V + A +F +++ A Sbjct: 585 PVTV----ERTNIPGDAFPVGMTTVTYTFTDASSNFAKCNFVVTVEDSLMTTTEVIADNA 640 Query: 384 AVTDFSASTIHWMHPFGEGVL---VGCDTSLWLLSISLSKGLSIDFRRVSGSG-VYACPP 439 + ST+ + + + + + S+ S+ S +S Sbjct: 641 SDNTQPTSTLFPVLCITGDMCDTSLTIEEVERMSSVDDSELTSNSLLLISRWMLENNASS 700 Query: 440 VSVGDCLVFVCGVG 453 + V + F G Sbjct: 701 LDVANETYFTISHG 714 >gi|110637563|ref|YP_677770.1| xyloglucanase [Cytophaga hutchinsonii ATCC 33406] gi|110280244|gb|ABG58430.1| CHU large protein; candidate xyloglucanase, glycoside hydrolase family 74 protein [Cytophaga hutchinsonii ATCC 33406] Length = 1288 Score = 41.1 bits (94), Expect = 0.60, Method: Composition-based stats. Identities = 27/242 (11%), Positives = 54/242 (22%), Gaps = 3/242 (1%) Query: 95 RSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154 S A G ++ A + Sbjct: 889 AGSALSLAANTGTGLTYQWSNAAGTISGATASTYAATVAGTYKVTVTNTATTCSATSADK 948 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214 LP + S +A + + T S+ + Sbjct: 949 TITATALPTAAITTTASSFCAGSALSLAANSGTGLTYQWSNAAGTISGATASTYAANVAG 1008 Query: 215 PEWAKNTNYSIGAYIVADDKVYRSL---TTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 TN + + DK + T + + G+ N S Sbjct: 1009 TYKVTVTNSATTCSATSADKTITATALPTAAITTTANSFCAGSALSLAANSGTGLTYQWS 1068 Query: 272 SKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG 331 + S + A V+ + + + S + ++W+ A G+ +G Sbjct: 1069 NAAGTISGATASTYAVNVAGTYKVTVTNSATTCSATSADKTVTVTNSLTWYEDADGDGKG 1128 Query: 332 YP 333 P Sbjct: 1129 DP 1130 >gi|170088711|ref|XP_001875578.1| predicted protein [Laccaria bicolor S238N-H82] gi|164648838|gb|EDR13080.1| predicted protein [Laccaria bicolor S238N-H82] Length = 1496 Score = 40.7 bits (93), Expect = 0.63, Method: Composition-based stats. Identities = 43/374 (11%), Positives = 95/374 (25%), Gaps = 34/374 (9%) Query: 78 GYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH 137 DK +++ V++ L G Y S S Sbjct: 1014 QAYCFWIYDKTVRVWDVQTGQSAMDPLKGHD---HYVTSVAFSPNGKHIASGCY------ 1064 Query: 138 PPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV-------------KSNAKLSISQ 184 D + + + P G G+ + + + Sbjct: 1065 ---------DKTVRVWDAQTGQSVVDPLKGHGVYVTSVAFSPDSRHIVSGSDDKTVRVWD 1115 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 A T + +T + + + ++ + + G Sbjct: 1116 AQTGQSVMTPFEG--HDDYVTSVAFSPDGRHIVSGSDDKTVRVWDAQTGQSVMDPLKGHG 1173 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 + + ++ + + + +SA + + + S DGR I+ Sbjct: 1174 SSVTSVAFSPDGRHIVSGSYDKTVRVWDVQTGQSAMDPIKGHDHYVTSVAFSPDGRHIAS 1233 Query: 305 APQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-TFHNNRLLFSGSKGDELSVYLSSFGAF 363 +T+ + + Y + V + R + SGS + V+ + F Sbjct: 1234 GCYDKTVRVWDAQTGQIVVDPLKGHDLYVTSVACSPDGRHIISGSDDKTVRVWDAQTVTF 1293 Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS 423 G D T + A T S H G + ++S S + + Sbjct: 1294 SPDGRHVVSGSDDKTVRVWDAQTGQSVMDPLKGHGDGVTSVAFSSDGRHIVSGSGDETVR 1353 Query: 424 IDFRRVSGSGVYAC 437 + ++S Sbjct: 1354 VWDAQISSRITDPV 1367 >gi|328858331|gb|EGG07444.1| hypothetical protein MELLADRAFT_35562 [Melampsora larici-populina 98AG31] Length = 1510 Score = 40.7 bits (93), Expect = 0.68, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 52/215 (24%), Gaps = 24/215 (11%) Query: 364 YDFSLDGEYGCYDPTKALTT-AVTDFSASTIHWMHPF--GEGVLVGCD----TSLWLLSI 416 YDFS D T+A V + + + GE + V + ++L Sbjct: 968 YDFSKSNAAIASDTTQAFGILDVETRRENQVPSVLNSHAGEQLSVHTGLPMGRNPFMLQR 1027 Query: 417 SLSKGLSIDFRRVS-----GSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471 S D Y P G + + + + + + Sbjct: 1028 FSSTYEGHDANIRYLLERVFLDSYREPLTEFGPVVSEGIPRKKAFRSMFSIRNRASTSSN 1087 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T L HL P V + + + + H Sbjct: 1088 GT-LVGHLVEHTARISSIAVSPD----FVFFVTGSHDGTVKVWDSIRLEKNVTSKSRHTY 1142 Query: 532 SDKHYVLSAASFPNDNRGGT-----SLWMLVALSA 561 + + + + + + +LW V Sbjct: 1143 TQGGKITCVCALEHSHCVASASTNGTLW--VHRID 1175 >gi|320120601|gb|EFE29168.2| S-layer y domain-containing protein [Filifactor alocis ATCC 35896] Length = 1384 Score = 40.7 bits (93), Expect = 0.70, Method: Composition-based stats. Identities = 45/417 (10%), Positives = 103/417 (24%), Gaps = 28/417 (6%) Query: 57 YRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTK-WSPALFGKTYKTPYTF 115 + + + F D G + L+ + A+ T++T Sbjct: 353 IGEGKSSIEIDSTHPFKFADTGEYV------TLENIKNGGKIVPADAAICSVTFETGDGA 406 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 + + P + + D + F + D Sbjct: 407 TEV--------APQGINKGGKIKPTVTPVRKGYRFAGWQKDGLPFDISTAILDDTTLTAI 458 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 N+ + + PL+ G ++ G +T++ I D + Sbjct: 459 WNSLPDTEYQGEGDVTVELAGSEYYPLEPGHTVLDGGTWVVVDDDTSFHERITIKGDVNI 518 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 G + A + ++ ++ A A V+ Sbjct: 519 I---------LTDGKTLTANKGIAVTSKDHSKFSVYAQNQGTGALKAFPDETVYSAGIGG 569 Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 + RS QA S + + G + + ++ GS+ ++ Sbjct: 570 DEGKRSCGTINIYGGRIQASGSDLGAAIGGSAFGNG--GTIGIYGGQVDVQGSRNYGEAI 627 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALT-TAVTDFSASTIHWMHPFGEGVLV-GCDTSLWL 413 S G + D + + + F + L+ G +T + Sbjct: 628 GFSYAGGVEPHNADITLSWTRESDYIRLYPAPGGQPARYKGNVTFSKKFLLDGTNTRAFW 687 Query: 414 LSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFN 470 S + + + + VG + V + I+ +G Sbjct: 688 KSANNNIDNRKIVPKTKMLWSDIQEKLDVGGSIKLTSNVSAKSGDIALVVPEGKNAT 744 >gi|303239417|ref|ZP_07325944.1| cell wall/surface repeat protein [Acetivibrio cellulolyticus CD2] gi|302592980|gb|EFL62701.1| cell wall/surface repeat protein [Acetivibrio cellulolyticus CD2] Length = 2467 Score = 40.7 bits (93), Expect = 0.70, Method: Composition-based stats. Identities = 41/390 (10%), Positives = 89/390 (22%), Gaps = 21/390 (5%) Query: 70 FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGST 129 ++ Y + FG + + K G + Y + + Sbjct: 1146 VPYTFDGNLYYFVEFG-GYIG---STGTIKKIANDPGSPDALLTVSASATVVTYTITYNL 1201 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + D+ P S T + G + NA +IS DT Sbjct: 1202 NDGTNPDNAP-----TGYTHGTSVTLPTPTKSNFTFGGWFDNESLTGNAVTTISTTDTGN 1256 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEW----------AKNTNYSIGAYIVADDKVYRSL 239 + I G + + A + + D Sbjct: 1257 KAFWAKWSIIPITAAGVTGMVAPSAGGTPIAVGSLTAEAGTYTVTSLTWKNNDGTAATLT 1316 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + Y + + + + +G V+ V G+ Sbjct: 1317 PEEKFKADTIYKAEIELTSAVGNKFQASGFTPTVNAGTAGAGTVSGGDVEGNKLTFMVTF 1376 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWG--EQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 + + + + +S+ S G G T+++ + Y Sbjct: 1377 DTTAAQSVTGIGVTIQPTKMSYTESTDGILALNGMAITETYNDGSTGTVTFTDGTAAGYT 1436 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 +S + ++ T + + P + S + S Sbjct: 1437 ASPVNGDTLTNAAHNNIKVTITHTASSQTAQTVNLTVNPVPDTQATPSFSPASDAIAFGS 1496 Query: 418 LSKGLSIDFRRVSGSGVYACPPVSVGDCLV 447 S + + P +VG + Sbjct: 1497 TVTITSAGADHIYYTTDGTNPATTVGGSTL 1526 >gi|258541252|ref|YP_003186685.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-01] gi|256632330|dbj|BAH98305.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-01] gi|256635387|dbj|BAI01356.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-03] gi|256638442|dbj|BAI04404.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-07] gi|256641496|dbj|BAI07451.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-22] gi|256644551|dbj|BAI10499.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-26] gi|256647606|dbj|BAI13547.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-32] gi|256650659|dbj|BAI16593.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-01-42C] gi|256653650|dbj|BAI19577.1| outer membrane protein [Acetobacter pasteurianus IFO 3283-12] Length = 1051 Score = 40.7 bits (93), Expect = 0.70, Method: Composition-based stats. Identities = 48/464 (10%), Positives = 102/464 (21%), Gaps = 39/464 (8%) Query: 85 GDKKLQIVVVRSSTKWS--PALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHL 142 + + + +S + S ++ + + Sbjct: 212 SNGNMDVYSGGTSISATLKEPDATLNLSGGNASGTLLSAGAVNVYTSGTLTNTTVQSGII 271 Query: 143 LYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPL 202 I + G + + S + IS T+T+ I S Sbjct: 272 NLSGGSATIVNATHGSGGIIVNEGGRLTSAMLASGGYVHISAGGTATSDIVSSSGTEYVD 331 Query: 203 DKGRSIR---LGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 + G SI L + + ++ A I++ T +G + + Sbjct: 332 NGGSSISAQILTSNANIIVSSGGFATDAKIISGYATVYDNGTMVNGSIQSGIITVSGGRV 391 Query: 260 NNITWITVLNLSSKTSRESA-----SGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQA 314 NI SA + Y Sbjct: 392 ANINADNGGGFDVSGGNVSALHINTGSFINLYNGGSATDITGSGSNLSDGNGGVNVFGGT 451 Query: 315 GVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGC 374 S G +VTF++N F+ + + ++ Sbjct: 452 LTGASFQDGSTLSATGGTIQNVTFNSNGYGFASNATLTSTTINANGNLVVYDGATTNNTV 511 Query: 375 YDPTKALT-TAVTDFSASTIHW-------------MHPFGEGVLVGCDTSLWLLSISLSK 420 T A + S + I ++P E S +LS + + Sbjct: 512 VSGTNAFEAVSAGGSSINAIISDSGNEYANAGATIINPTAESGGAITIHSQGVLSNATIQ 571 Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 + G + + G ++ G + + + L Sbjct: 572 NGASLSIESGGQLSGSVTLQNGGTAAIYSDAGGTIV----MDGDTTNTG----LVISGLT 623 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 I+ V +S +L + + Sbjct: 624 EGGIVSTVISG-------FNGTSGGDSDGIVLDGIKEGDVQDVS 660 >gi|290973961|ref|XP_002669715.1| predicted protein [Naegleria gruberi] gi|284083266|gb|EFC36971.1| predicted protein [Naegleria gruberi] Length = 710 Score = 40.7 bits (93), Expect = 0.73, Method: Composition-based stats. Identities = 25/274 (9%), Positives = 62/274 (22%), Gaps = 46/274 (16%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE-----YAVF 126 + + + F +++++ V+ + K + N L + Sbjct: 115 YVSSNNEVYIADFCNQRIRKVLQNGNIITIAGNGTKGFSGDNGPATNAQLNGPAGVFVSN 174 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 + + +H + Sbjct: 175 NEVYIADYSNHVIRKISQNGTI-------------------------------------- 196 Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD 246 I + K D G + + P ++ + + V R + + Sbjct: 197 ---VTIAGNGKPGFSGDNGLATNAQLYNPSGTFVSSNNEVYISDCFNHVIRKILQNGTIV 253 Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 + + DN + L S + I+ V +G +++A Sbjct: 254 TIAGNGKGGFSGDNGLATNAQLYSPLGVFVSSNNEVYISDCFNHRIRKVLHNGNIVTIAG 313 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN 340 F +G + FH+ Sbjct: 314 NGTPGFSGDSPFDISLYPHFGNSSSLTRRIEFHS 347 >gi|255088519|ref|XP_002506182.1| predicted protein [Micromonas sp. RCC299] gi|226521453|gb|ACO67440.1| predicted protein [Micromonas sp. RCC299] Length = 609 Score = 40.7 bits (93), Expect = 0.75, Method: Composition-based stats. Identities = 33/282 (11%), Positives = 76/282 (26%), Gaps = 13/282 (4%) Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227 +G + V ++ + + + A + S A Sbjct: 167 EGYYAEVSLSSDGKVLAIGNNNQTL---SSYDSSDHNATMTGRVRIYQWPASDLTASGVA 223 Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY- 286 + + + T+ FG Y ++ KT+ + G V + Sbjct: 224 WTQMGEPIEAWSTSSGFYPWFGPYSQKVYADAGKLSGDGKRVALFKTNGWAQKGYVYEWK 283 Query: 287 -YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE----GYPSHVTFHNN 341 W + D + S++ + + +V W AW GY + ++ Sbjct: 284 SSSWSIVGDSIDLEGTASISYDGNVVAGSYGNVYKWSSGAWSSIRTEYFGYYTSLSRDGT 343 Query: 342 RLLFSGSKGDELSV---YLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 R+ ++ S + + + + S ++ + GE ++ + + Sbjct: 344 RVAYADSWNEGVVLVHQWDSEAESWGRMLDIRGESASDQVGAMVSLSGDGSRVAVFSDGA 403 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPP 439 VG + + S G S C Sbjct: 404 KHTRVFEVGTTCDTSVAPPNASVGNCPAKLASGSSCQPTCNS 445 >gi|171909629|ref|ZP_02925099.1| hypothetical protein VspiD_00615 [Verrucomicrobium spinosum DSM 4136] Length = 5664 Score = 40.7 bits (93), Expect = 0.76, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 52/191 (27%), Gaps = 5/191 (2%) Query: 161 LPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN 220 + + + + A S T + G + + Sbjct: 3253 SSGSVPQYYLTTSTAGSWNYGDASASYSG---TVSSHPEPNAETGNLDWSYSYTGSSSFT 3309 Query: 221 TNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 ++Y IG+ + + ++ S + S ++ T + + + Sbjct: 3310 SSYQIGSSTCDETGAWSGTSSNYSFGEWI-SGPISWPAWAPSTSGMPTSEAPTERSYNRD 3368 Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN 340 A + G + P+ + + + ++W+ AW PS Sbjct: 3369 SQEATVSLSGVYSTGDLITDVHASFPEYKFVGEFQNP-IAWYEGAWSWSGYTPSTSILFA 3427 Query: 341 NRLLFSGSKGD 351 +RLL+ Sbjct: 3428 SRLLWLNDSTY 3438 >gi|325171208|ref|YP_004251180.1| hypothetical protein ViPhICP2p09 [Vibrio phage ICP2] gi|323512234|gb|ADX87691.1| conserved hypothetical protein [Vibrio phage ICP2] gi|323512306|gb|ADX87762.1| hypothetical protein TU12-16_00040 [Vibrio phage ICP2_2006_A] Length = 734 Score = 40.3 bits (92), Expect = 0.82, Method: Composition-based stats. Identities = 46/435 (10%), Positives = 89/435 (20%), Gaps = 47/435 (10%) Query: 73 SIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-KSLEYAVFGSTAV 131 S +G +L + R + ++ + S+ Sbjct: 30 SFREGENFILSKANA----WERRKGLGLEDSGTLYPSYVDFSDQTLVSSVHVWQ------ 79 Query: 132 FVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191 H P L+ F + + + ++ Sbjct: 80 -THYSAIPEILVVQFGDKLHFFDTSVDPLSNGKLFINN--QEFLTTEGTTEDIISGASVE 136 Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNT----------NYSIGAYIVADDKVYRSLTT 241 I A+ A K + Sbjct: 137 GIFVFATQDADPISLQIMDIQSDSITARTKIVVDRKVLFLETRDVWGRSAPSKERPKTLS 196 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 T ++ I + A G + Sbjct: 197 SDYLYELINQGWDTKKINSTYATIGAYPSGYDIWWLYKTTAGTDANAIGKFTPSRMKDST 256 Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE--------- 352 + Q + A + G PS + R+ ++G + Sbjct: 257 TTGIGQERQNTPAPRGSTVASLQVLAS--GKPSCIQTFAGRVFYAGFQATPRKIDDVRPD 314 Query: 353 --LSVYLSS---FGAFYDFSLDGEYGCYDPTKALTTAVTD----FSASTIHWMHPFGEGV 403 V+ S A + + AL +A I M G+ Sbjct: 315 FRNHVFFSQLVKSNAEINKCYQFADPTSEVDSALVDTDGGFIKINAARKIVAMEEVSSGL 374 Query: 404 LVGCDTSLWLLSISLSKGLSID---FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYIS 460 + + +WLLS + S +++ G + V VF I Sbjct: 375 FIIAENGVWLLSGTSDGLFSATGYHVDKITDYGCVSPRSVVAYGDTVFYWAEEGIIVLSP 434 Query: 461 GSTEQGFRFNEITQL 475 T +T+L Sbjct: 435 DQTTGKHSAQNLTEL 449 >gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi] gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi] Length = 928 Score = 40.3 bits (92), Expect = 0.83, Method: Composition-based stats. Identities = 23/246 (9%), Positives = 62/246 (25%), Gaps = 17/246 (6%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAV 131 F + + +G+++++ ++ + ++ N L Sbjct: 17 FVSSNNEVYIADYGNQRIRKILKNGNIVTIAGNGTAGFRGDNGPATNAQL---------- 66 Query: 132 FVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191 + P+ + + + F + G + S + A+ Sbjct: 67 -----YNPYSVFVSSNNEVYIADFSNHRIRKILENGKIVTIAGNGTGGFSGDNGPATNAQ 121 Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251 + + +F + I + N +I + G + + + Sbjct: 122 LNNPYSVFVSSNNEVYIVDYNNHRIRKILKNGNIVTIAGNGTGGFS-GDNGPATNAQLNN 180 Query: 252 KGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTL 311 +V NN +I + + + +G + G I P Sbjct: 181 PMGVFVSSNNEVYIADY-YNHRIRKILENGNIVTIAGNGTAGFSGDSPFDIRTYPHIGNK 239 Query: 312 FQAGVS 317 G Sbjct: 240 LLTGNG 245 >gi|320529456|ref|ZP_08030543.1| fagellar hook-basal body protein [Selenomonas artemidis F0399] gi|320138293|gb|EFW30188.1| fagellar hook-basal body protein [Selenomonas artemidis F0399] Length = 661 Score = 40.3 bits (92), Expect = 0.84, Method: Composition-based stats. Identities = 21/258 (8%), Positives = 50/258 (19%), Gaps = 1/258 (0%) Query: 56 EYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 + R +F G ++ +Q + S K + K P Sbjct: 84 FVVKKGNETYYTRNGAFEFDADGNYVMPGSGHYVQGWMANSEGKLITSGNVGNIKIPKGK 143 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 N + + + ++ D + + P + Sbjct: 144 SMNSEPTTTATYTNNLNASTKRSIVKSVVVRYADGTTENVTDYTPPPEDGKP-SVSVTTT 202 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 K+++ + S + + DD Sbjct: 203 GGTKITVDSTADYDFASAATGTPLNGKKLWTSTVDSVTQTATGQIKKMVLEGGTGNDDDP 262 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 T S + TY ++ ++ + + Sbjct: 263 LTRFATAGSTLSLTAVENGTYKIGGTYKLTGTIDTATLQADGTIKLTFQAATPPAVTPPD 322 Query: 296 SKDGRSISVAPQSQTLFQ 313 S + F Sbjct: 323 VIVPAPPSGTYKHGDTFT 340 >gi|255009828|ref|ZP_05281954.1| hypothetical protein Bfra3_11876 [Bacteroides fragilis 3_1_12] gi|313147614|ref|ZP_07809807.1| predicted protein [Bacteroides fragilis 3_1_12] gi|313136381|gb|EFR53741.1| predicted protein [Bacteroides fragilis 3_1_12] Length = 1465 Score = 40.3 bits (92), Expect = 0.87, Method: Composition-based stats. Identities = 27/298 (9%), Positives = 69/298 (23%), Gaps = 15/298 (5%) Query: 55 QEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYT 114 + D P + +++ + + V + ++ S A + Sbjct: 915 KHVGDTWYTPTNKKLYFYVKGNVSQFKNVISKNGIHFWKKGTAPTISNAPASSWNTSTLK 974 Query: 115 FKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 L Y K + + + + L ++ ++S + Sbjct: 975 EAHVSDLYYNTAAKKLYIYSKK--VEYDNNGNPITSYYWNEKDDENLL--FVSVKVVSYL 1030 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 L I+ +T T ++ + + EW TN D Sbjct: 1031 ADGVTLFINTPNTYTIGDCFIQDLYIKIANTTRTTGSYNSSEWTTKTNVLYYWQRSVDQT 1090 Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK-----------TSRESASGAV 283 + K +V + + + + Sbjct: 1091 ALDAYEAASKAQDTADGKRRVFVSTPYAPYDIGDLWVNGADLRRCQTAKVVGQSYSINDW 1150 Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + + K V G S Q + ++ ++ + + + N Sbjct: 1151 VIAVNYDNTKTVIDGGIVTSGTVQLAGSGGSILAGITGEGTEASSVRFWAGASKENRN 1208 >gi|298707033|emb|CBJ29835.1| probable extracellular nuclease [Ectocarpus siliculosus] Length = 1053 Score = 40.3 bits (92), Expect = 0.96, Method: Composition-based stats. Identities = 41/436 (9%), Positives = 84/436 (19%), Gaps = 34/436 (7%) Query: 99 KWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEI 158 G Y+ L + G + + + ++ Sbjct: 197 TTDDGGHGGAIFAAYST-----LVFDGSGDATLTTNSA------SRDGGAIYVLWSDISW 245 Query: 159 KFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWA 218 + D + S + G ++ W Sbjct: 246 ESSESNVFSDNVADRNGGAIYTHGSTVSWDG---DGTHLSYNSGTLGGAVYAYDSTVSWN 302 Query: 219 KNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRES 278 + Y +Y +T + + + + Sbjct: 303 GDGTYLTSNSANDGGAIYADASTVSWDGDATEFSHNSADSQGGVIHAAPGSTVYWDGDGT 362 Query: 279 ASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTF 338 Y G I T F + A+ + T Sbjct: 363 KFSFNLAYSDGGAIYTHLST----VYWDGDDTEFTNNYGGQGGSIRAYDSNMSWIGDGTQ 418 Query: 339 HNNRLLFSGSKGDELSVYLSSFGA---FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395 ++ G LS G F + S G ++ + + + + + Sbjct: 419 FSSSSSSEGGAMYVTRTNLSWDGNGTHFSNISASFAGGAIRAGDSILSWHGEMTFFSNNS 478 Query: 396 MHPFGEGVLVGCDTSLW----------LLSISLSKGLSIDFRRVSGSGVYACPPVSVGDC 445 G + + SLW + I + G Sbjct: 479 ASDDGGAINMDSAGSLWCDGNTIFSNNIAGGDGGALSVILVQAQDY---LIPVVHMSGGA 535 Query: 446 LVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505 V G E + F +IT ++ N + E + Sbjct: 536 FVGNTAAGDGGATYISDIEDRYNFEDITYESNSATNGGAVAASRAEATGTFSRCSFLGNT 595 Query: 506 NSFPRLLGCRFSAEGE 521 S F + Sbjct: 596 ASKNGGAVETFDGSEQ 611 >gi|290985545|ref|XP_002675486.1| predicted protein [Naegleria gruberi] gi|284089082|gb|EFC42742.1| predicted protein [Naegleria gruberi] Length = 819 Score = 40.3 bits (92), Expect = 1.0, Method: Composition-based stats. Identities = 21/267 (7%), Positives = 61/267 (22%), Gaps = 21/267 (7%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD------NKSLEYAV 125 F + FG+ +++ ++ + + Y ++ + Sbjct: 99 FVSSTNEVYISDFGNYRIRKILRNGNIVTIAGTGEEGYSGDGGPAINAQISAVNNIFVSQ 158 Query: 126 FGSTAVFVHKDHPPHHLLYIQDG-DKISFTFDEIKFLPPPWLGDGMIS-----GVKSNAK 179 ++H +L P + + + ++ Sbjct: 159 NDEVYFSDFRNHRIRKILRNGTIVTIAGTGEQGFSGDGGPAINAKLNTPCGVFVSNNDEV 218 Query: 180 LSISQA---------DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + D + I + D G + P ++ + Sbjct: 219 YIVDYKSHRIRKMLQDGTIITIAGTGEQGFGGDGGPATSAQLSHPCGVFVSSTNEVYITD 278 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 + + R + + + Y D + ++ Sbjct: 279 SYNYRIRKILRNGNITTIAGTGVKGYSGDGGLAINAQISYVENIFVSQNDEVYIADTNNH 338 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVS 317 I+ + KDG ++A + F Sbjct: 339 RIRKILKDGTIETIAGNGEKGFGGDSP 365 Score = 39.5 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 25/267 (9%), Positives = 64/267 (23%), Gaps = 21/267 (7%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF----- 126 F G+ +++ ++ + K Y N + Y Sbjct: 482 FVSSTNEVFFADSGNYRIRKILRNGNIVTIAGTGEKGYSGDGRPAINAQISYVQNIFVSQ 541 Query: 127 GSTAVFVH-KDHPPHHLLYIQDGDKISFTFD-EIKFLPPPWLGDGMIS-----GVKSNAK 179 F +H +L I+ T + P + S ++ Sbjct: 542 NDEIYFSDFGNHRIRKILRNGTIVTIAGTGEKGFSGDGGPATSAQLDSPCGVFVSNNDEV 601 Query: 180 LSISQA---------DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 + + I + D G +I + P ++ + + Sbjct: 602 YIVDYNNHRIRKILRNGIINTIAGTGEEGFSGDGGPAINAQVNHPCGVFVSSTNEVYIMN 661 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 + + R + + + Y D + ++ Sbjct: 662 SGNYRIRKILRNANITTIAGTGVKGYSGDGGLAINAQISYVDNIFVSRNDEVYIADTENH 721 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVS 317 I+ + ++G ++A + F Sbjct: 722 RIRKILRNGTIKTIAGNGEEGFGGDSP 748 >gi|254974271|ref|ZP_05270743.1| toxin A [Clostridium difficile QCD-66c26] gi|255313396|ref|ZP_05354979.1| toxin A [Clostridium difficile QCD-76w55] gi|255516083|ref|ZP_05383759.1| toxin A [Clostridium difficile QCD-97b34] gi|255649180|ref|ZP_05396082.1| toxin A [Clostridium difficile QCD-37x79] gi|260682356|ref|YP_003213641.1| toxin A [Clostridium difficile CD196] gi|260685955|ref|YP_003217088.1| toxin A [Clostridium difficile R20291] gi|260208519|emb|CBA61156.1| toxin A [Clostridium difficile CD196] gi|260211971|emb|CBE02483.1| toxin A [Clostridium difficile R20291] Length = 2710 Score = 39.9 bits (91), Expect = 1.0, Method: Composition-based stats. Identities = 41/471 (8%), Positives = 99/471 (21%), Gaps = 39/471 (8%) Query: 71 SFSIPDGGYALL-VF-GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 F + G L VF G + ++ + Y++ + + K + Sbjct: 1882 HFYFNNNGVMQLGVFKGPDGFEYFAPANTQNNNIEGQAIVYQSKFLTLNGKKYYFDNDSK 1941 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 + + ++ + + + S +++ + Sbjct: 1942 AVT----GWRIINNEKYYFNPNNAIAAVGLQVIDNNKYYFNPDTAIISKGWQTVNGSRYY 1997 Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG---RSG 245 T+ G+ + S G A Y + G Sbjct: 1998 FDTDTAIAFNGYKTIDGKHFYFDSDCVVKIGVFSGSNGFEYFAPANTYNNNIEGQAIVYQ 2057 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 +F G Y DNN +T + W I + + Sbjct: 2058 SKFLTLNGKKYYFDNNSKAVTGWQTIDSKKYYFNTNTAEAATGWQTIDGKKYYFNTNTAE 2117 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-TFHNNRLLFSGSKGDELSVYLSSFGAFY 364 G + + S T N + + + G F Sbjct: 2118 ------AATGWQTIDGKKYYFNTNTSIASTGYTIINGKYFYFNTDGIMQIGVFKVPNGFE 2171 Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG-----CDTSLWLLSISLS 419 F+ + +A+ + + + + G + + +++ Sbjct: 2172 YFAPANTHNNNIEGQAILYQNKFLTLNGKKYYFGSDSKAITGWQTIDGKKYYFNPNNAIA 2231 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHL 479 Y G + F N +++ + Sbjct: 2232 ATHLCTINNDKYYFSYDGILQ-----------NGYITIERNNFY---FDANNESKMVTGV 2277 Query: 480 FNQR----ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 F + ++ F + + W Sbjct: 2278 FKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDNDSKAVTGW 2328 >gi|126698240|ref|YP_001087137.1| toxin A [Clostridium difficile 630] gi|115249677|emb|CAJ67494.1| Toxin A [Clostridium difficile] Length = 2710 Score = 39.9 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 45/496 (9%), Positives = 104/496 (20%), Gaps = 41/496 (8%) Query: 71 SFSIPDGGYALL-VF-GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 F + G L VF G + ++ + Y++ + + K + Sbjct: 1882 HFYFNNDGVMQLGVFKGPDGFEYFAPANTQNNNIEGQAIVYQSKFLTLNGKKYYFDNDSK 1941 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 + + ++ + + + S +++ + Sbjct: 1942 AVT----GWRIINNEKYYFNPNNAIAAVGLQVIDNNKYYFNPDTAIISKGWQTVNGSRYY 1997 Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG---RSG 245 T+ G+ + S G A Y + G Sbjct: 1998 FDTDTAIAFNGYKTIDGKHFYFDSDCVVKIGVFSTSNGFEYFAPANTYNNNIEGQAIVYQ 2057 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 +F G Y DNN +T + W I + + Sbjct: 2058 SKFLTLNGKKYYFDNNSKAVTGWQTIDSKKYYFNTNTAEAATGWQTIDGKKYYFNTNTAE 2117 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHV-TFHNNRLLFSGSKGDELSVYLSSFGAFY 364 G + + S T N + + + G F Sbjct: 2118 ------AATGWQTIDGKKYYFNTNTAIASTGYTIINGKHFYFNTDGIMQIGVFKGPNGFE 2171 Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG-----CDTSLWLLSISLS 419 F+ +A+ + + + + G + + +++ Sbjct: 2172 YFAPANTDANNIEGQAILYQNEFLTLNGKKYYFGSDSKAVTGWRIINNKKYYFNPNNAIA 2231 Query: 420 KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHL 479 Y G + F N +++ + Sbjct: 2232 AIHLCTINNDKYYFSYDGILQ-----------NGYITIERNNFY---FDANNESKMVTGV 2277 Query: 480 FNQR----ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 F + ++ F + + W I K Sbjct: 2278 FKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDNDSKAVTGW--QTIDGKK 2335 Query: 536 YVLSAASFPNDNRGGT 551 Y + + T Sbjct: 2336 YYFNLNTAEAATGWQT 2351 >gi|83312376|ref|YP_422640.1| RTX toxins and related Ca2+-binding protein [Magnetospirillum magneticum AMB-1] gi|82947217|dbj|BAE52081.1| RTX toxins and related Ca2+-binding protein [Magnetospirillum magneticum AMB-1] Length = 1139 Score = 39.9 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 40/386 (10%), Positives = 87/386 (22%), Gaps = 29/386 (7%) Query: 94 VRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISF 153 ++ + ++ + ++ G F Y + Sbjct: 405 DGTTGGTVAVKDFGSASMVWSNTSYATFSFSTVGDKLYFS---------PYTSTYGAEPW 455 Query: 154 TFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCH 213 D S A + T+ + + Sbjct: 456 VSDGTTAGTILLKDIVAGGTTAGYPASGNSSASGGFFQWTAGDGKVYFTTQSGDLYSTDG 515 Query: 214 PPEWAKNTNY--SIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 N S+ + + +Y G +G+ + +I + + Sbjct: 516 TAAGTAKVNGISSVYGFESSTATMYLGGNDGTNGNELLSWDRTSLGLIKDINSGSSSAMP 575 Query: 272 SKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG 331 ++ + P+ + S +G S + Sbjct: 576 VYLTKMGGNFYFTPFQLNDSNGAELWKSDGTSGGTALVKDINSGSSGSNIA--------- 626 Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT-DFSA 390 +T +NN+L FS + G S+ D L + + Sbjct: 627 ---SITVYNNKLYFSARSAQPNTTPSFVTGTAQSLSVAFNGAAVDLKSYLHVSDSDSSQT 683 Query: 391 STIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVC 450 T G L + S ++ G +I + +G V V D Sbjct: 684 ETWSQSVAPSHGTLSFSSATATSGSTDVTPGGTITYTPTTGYSGSDTFTVQVSDG----- 738 Query: 451 GVGRRIKYISGSTEQGFRFNEITQLA 476 G + + + +T A Sbjct: 739 NGGTATRVFNVTVASNVSPTFVTATA 764 >gi|223939715|ref|ZP_03631587.1| Immunoglobulin I-set domain protein [bacterium Ellin514] gi|223891586|gb|EEF58075.1| Immunoglobulin I-set domain protein [bacterium Ellin514] Length = 727 Score = 39.9 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 29/276 (10%), Positives = 62/276 (22%), Gaps = 15/276 (5%) Query: 52 PLMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFG----DKKLQIVVVRSSTKWSPALFGK 107 P N++F + +V D Q+ V S + G Sbjct: 141 PGTYRIGVSASSNAPNQIFPIDLATNTDYQVVVSYNTADSYAQLWVNPLSFSDTSVSTGD 200 Query: 108 TYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG 167 KT +S + S F + ++ + + Sbjct: 201 PVKT---QVYLQSFGFRQASSFGNFFCSVSNLATATTFDEAATNVWSLTPVA-PVILYQP 256 Query: 168 DGMISGVKSNAKLSISQADTSTARITSD---MKIFKPLDKGRSIRLGCHPPEWAKNTNYS 224 + + + A LS+ A + + G + + Y+ Sbjct: 257 KNVTNFTGNPATLSVVANGQGLAGLNYQWQKGGVNISNPAGNANTFTISSLALTDSGFYT 316 Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVA 284 + + + V T +TV + +A+GA Sbjct: 317 VVVSNPTTGLSVT----SAAAYISANNNPIPPVISQQPTNLTVYYGQTANFSVNANGAQP 372 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 Y W +++ + Sbjct: 373 ITYQWLYNNSPIGGATDATLSILNVNTNNGTTGTYK 408 >gi|34328219|ref|NP_038751.2| podocalyxin precursor [Mus musculus] gi|17369446|sp|Q9R0M4|PODXL_MOUSE RecName: Full=Podocalyxin; AltName: Full=Podocalyxin-like protein 1; Short=PC; Short=PCLP-1; Flags: Precursor gi|16755123|gb|AAL27890.1|AF290208_1 podocalyxin [Mus musculus] gi|9937467|gb|AAG02458.1| podocalyxin [Mus musculus] gi|30851371|gb|AAH52442.1| Podocalyxin-like [Mus musculus] gi|32451600|gb|AAH54530.1| Podocalyxin-like [Mus musculus] gi|148681765|gb|EDL13712.1| podocalyxin-like [Mus musculus] Length = 503 Score = 39.9 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 44/194 (22%), Gaps = 9/194 (4%) Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 +T+ V HP L + + P S S Sbjct: 41 QSATTSTEVTTGHPVASTLASTQPSNPTPFTTSTQSPSMPTSTPNPTSNQSGGNLTSSVS 100 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 T + F + G + ++G V+ + T+ Sbjct: 101 EVDKTKTSSPSSTAFTSSSGQTASSGGKSGDSFTTAPTTTLGLINVSSQPTDLNTTSKL- 159 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 +T DN + ++ S T+ S D +++ Sbjct: 160 --------LSTPTTDNTTSPQQPVDSSPSTASHPVGQHTPAAVPSSSGSTPSTDNSTLTW 211 Query: 305 APQSQTLFQAGVSV 318 P + + Sbjct: 212 KPTTHKPLGTSEAT 225 >gi|89891494|ref|ZP_01202999.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7] gi|89516268|gb|EAS18930.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7] Length = 1788 Score = 39.9 bits (91), Expect = 1.2, Method: Composition-based stats. Identities = 30/279 (10%), Positives = 70/279 (25%), Gaps = 28/279 (10%) Query: 85 GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 G+ L++ +T + YT +N G + ++ Sbjct: 43 GENYLRVYDPSGTTLLD----LCNPASCYTGANNSYSTSVNMG--CLSDANNYSIRMYDR 96 Query: 145 IQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDK 204 D+ + T + + G ++ S + +A + S + Sbjct: 97 YG--DQWNGTGANVTITSGGNVVLSTNHGGGGSSTASFNVYGGGSACV-SGPQEIDIYGN 153 Query: 205 GRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITW 264 G I P+ T+ I V+ +G + S N Sbjct: 154 GSLISDNDTTPDTIDGTDLGIIEGAGTLSSVFTITNSGSNDLVLTGSPRVEITGIN---- 209 Query: 265 ITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS 324 + +G + + + + + ++ Sbjct: 210 -AADFSVVTQPNATITGGSSEDVTINFSRTTAGTSNATVTILSNDGNEATYNFDITAQSV 268 Query: 325 AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 A P + ++ N + S + S+ G+F Sbjct: 269 A-------PQYTMYYEN-------FDNGASGWTSNTGSF 293 >gi|290983204|ref|XP_002674319.1| nucleoporin Nup153 [Naegleria gruberi] gi|284087908|gb|EFC41575.1| nucleoporin Nup153 [Naegleria gruberi] Length = 1192 Score = 39.9 bits (91), Expect = 1.2, Method: Composition-based stats. Identities = 26/244 (10%), Positives = 52/244 (21%), Gaps = 15/244 (6%) Query: 154 TFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCH 213 T + + F + + S G G Sbjct: 276 TGNTLSFGFTQEESTTKPFSFNFGSSTTTEPTTGSFNFAKPSEPEKPKESVG-GFNPGTG 334 Query: 214 PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273 S + + + + G+ F + K AT D++ + + Sbjct: 335 QVLSFGFNPGSGSSSSSSTGFNPGNTSLGKGAVPFSFGKLATNNDDDSSSSDESSSEPVP 394 Query: 274 TSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTL----FQAGVSVVSWFMSAWGEQ 329 T ++ + +S + AP + Sbjct: 395 TKAPTSFSFGNTSSEPQSFPSFGFNNKSETTAPVINFPMNPQISKTYEDMEDDEQPITSI 454 Query: 330 EGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFS 389 P V SK +V G+ +DF + L F+ Sbjct: 455 TSTPKAV----------RSKKPPNTVSYVKAGSKFDFKKKVDEEDLLDNDPLVLEEDKFA 504 Query: 390 ASTI 393 + Sbjct: 505 MNRP 508 >gi|124006721|ref|ZP_01691552.1| conserved hypothetical protein [Microscilla marina ATCC 23134] gi|123987629|gb|EAY27329.1| conserved hypothetical protein [Microscilla marina ATCC 23134] Length = 3079 Score = 39.9 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 24/233 (10%), Positives = 51/233 (21%), Gaps = 4/233 (1%) Query: 91 IVVVRSSTKWSPAL---FGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQD 147 + + + A T YT D + + V+ D L Sbjct: 311 VYTFDLTAANADATLTQVSFTTAGTYTASDINAFTLWFSADNTLDVNTDQAIASLTTALG 370 Query: 148 GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS 207 +FT + + V A ++ + T G + Sbjct: 371 AGVHTFTAFTQAINGGTTGYFFVTTNVAPLATVNNTIEVTPAITTADLTFSGVVNKLGTA 430 Query: 208 IRLGCHPPEW-AKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G N + + + +V + G D + N T Sbjct: 431 VAGGTQTIVACNAPDNVTNLSATALNTEVLLNWVNGLCYDEILVVAKSGSTVTNVPTGDG 490 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + + + V+ + +F +V Sbjct: 491 SAYTADAAFGSGTDLGASEFVVFKGTATSETITSLTNNTTYFFKVFGRKGTVW 543 >gi|42523973|ref|NP_969353.1| hypothetical protein Bd2548 [Bdellovibrio bacteriovorus HD100] gi|39576181|emb|CAE80346.1| hypothetical protein Bd2548 [Bdellovibrio bacteriovorus HD100] Length = 1660 Score = 39.9 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 27/331 (8%), Positives = 73/331 (22%), Gaps = 15/331 (4%) Query: 93 VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS 152 + A+ T + +V G+ + + Sbjct: 617 DAQGRVTSGAAVAAADITTALGYTPVNKAGDSVTGNLIF------DNTKGSEYKGTSANT 670 Query: 153 FTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGC 212 T + + + ++ + +T + G Sbjct: 671 ATLTGPNAAIGTSYVLRLPATQGTANQVMSVDGSGNLGWMT----LGSLATSGTVNNSNW 726 Query: 213 HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSS 272 + + + +G + ++ Sbjct: 727 SGTALSIANGGTGATTQAGAANAVLPSQSTNAGKYLTTNGTDVSWAAVPTVTYGTTAGTA 786 Query: 273 KTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGY 332 ++ +G V ++ + + AP + + + +W SA + Sbjct: 787 LQGNQTFAGDVTGTVGVMKVEKLQNRS-VAATAPTNGQVLKWNNGTSTWEPSADTDTNTT 845 Query: 333 PSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAST 392 + T L +G+ +V L++ G + T T Sbjct: 846 YTAGT----GLSLAGTVFSVDTVPLANGGTGATTQAGAQTALGIGTAGTKDTGTISGKVP 901 Query: 393 IHWMHPFGEGVLVGCDTSLWLLSISLSKGLS 423 + + + D + L+ S S Sbjct: 902 LIGLTGITANSMCTSDGTSSLVCNSPIPTGS 932 >gi|261331074|emb|CBH14063.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] Length = 548 Score = 39.5 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 53/226 (23%) Query: 93 VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS 152 W + +P+T +S+ Y A V PP + Sbjct: 195 DPSDRVTWFDDDDDFGHISPFTNVRGRSIYYFKVLCEASAVPTPLPPASPHSENTTADEN 254 Query: 153 FTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGC 212 T D DG + + + AD +T + G G Sbjct: 255 TTADGNTTADGNITTDGNTNADGNTNADGNTTADGNTTADGNTNADGNTTADGNITTDGN 314 Query: 213 HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSS 272 + + + A + + D + T N T + Sbjct: 315 TNADGNTTADGNTTADGNTTADGNTNADGNTTTDENTTADENTNADGNTTTDGNTNADGN 374 Query: 273 KTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSV 318 T+ + + D + + A + T + + Sbjct: 375 TTADGNITTDGNTNADGNTTADGNTTADGNTNADGNTTTDENTTAD 420 >gi|6324672|ref|NP_014741.1| Nup1p [Saccharomyces cerevisiae S288c] gi|128907|sp|P20676|NUP1_YEAST RecName: Full=Nucleoporin NUP1; AltName: Full=Nuclear pore protein NUP1 gi|172056|gb|AAA34822.1| nucleoporin (NUP1) (put.); putative [Saccharomyces cerevisiae] gi|1164945|emb|CAA64020.1| YOR3182c [Saccharomyces cerevisiae] gi|1420275|emb|CAA99295.1| NUP1 [Saccharomyces cerevisiae] gi|285814982|tpg|DAA10875.1| TPA: Nup1p [Saccharomyces cerevisiae S288c] Length = 1076 Score = 39.5 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 38/311 (12%), Positives = 75/311 (24%), Gaps = 12/311 (3%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 SF P P LG + + +K + S +T + Sbjct: 765 SNSPTSFFDGSASSTPIPVLGKPTDATGNTTSKSAFSFGTANTNGTNASANSTSFSFNAP 824 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G TN + + D+ S T +G FG+S T + Sbjct: 825 ATGNGTTTTSNTSGTNIAGTFNVGKPDQSIASGNTNGAGSAFGFSSSGTAATGAASNQSS 884 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 ++ + + + +K + + + F + + + Sbjct: 885 FNFGNNGAGGLNPFTSATSST-NANAGLFNKPPSTNAQNVNVPSAFNFTGNNSTPGGGSV 943 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 G + T +F+GS SF F+ T Sbjct: 944 FNMNGNTNANT------VFAGSNNQPHQSQTPSFNTNSSFTPSTVPNINFSGLNGGITNT 997 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 +A + S ++ S + G P +G Sbjct: 998 ATNALRPSDIFGANA-----ASGSNSNVTNPSSIFGGAGGVPTTSFGQPQSAPNQMGMGT 1052 Query: 447 VFVCGVGRRIK 457 +G + Sbjct: 1053 NNGMSMGGGVM 1063 >gi|16755124|gb|AAL27891.1| podocalyxin [Mus musculus] Length = 465 Score = 39.5 bits (90), Expect = 1.7, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 44/194 (22%), Gaps = 9/194 (4%) Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 +T+ V HP L + + P S S Sbjct: 41 QSATTSTEVTTGHPVASTLASTQPSNPTPFTTSTQSPSMPTSTPNPTSNQSGGNLTSSVS 100 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 T + F + G + ++G V+ + T+ Sbjct: 101 EVDKTKTSSPSSTAFTSSSGQTASSGGKSGDSFTTAPTTTLGLINVSSQPTDLNTTSKL- 159 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 +T DN + ++ S T+ S D +++ Sbjct: 160 --------LSTPTTDNTTSPQQPVDSSPSTASHPVGQHTPAAVPSSSGSTPSTDNSTLTW 211 Query: 305 APQSQTLFQAGVSV 318 P + + Sbjct: 212 KPTTHKPLGTSEAT 225 >gi|257487378|ref|ZP_05641419.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 1627 Score = 39.5 bits (90), Expect = 1.8, Method: Composition-based stats. Identities = 49/377 (12%), Positives = 96/377 (25%), Gaps = 3/377 (0%) Query: 98 TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDE 157 T + + + + TA V D+ S Sbjct: 706 TLDNTGFTNASGNAGSGVTSSNNYAIDTLRPTATIVVADNALAVGETSLVTITFSEAVSG 765 Query: 158 IKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEW 217 + + S+ ++ + T TA ITS + G + G Sbjct: 766 FTNADLSVANGTLSAVSSSDGGITWTATLTPTAGITSASNSVTLNNGGVTDLAGNAGSGL 825 Query: 218 AKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE 277 + NY+I V + V + + + V N + T Sbjct: 826 TLSNNYAIDQTRPTASIVIADNALSAGETSLVTITFSEAVSGFDNSDLNVPNGTLSTVNS 885 Query: 278 SASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVT 337 + G + V+ IS+ T +++ PS Sbjct: 886 NDGGITWTATFTPNAN-VNASTGQISLNSAGVTDLAGNAGSGIISSASFTVDTTRPSATI 944 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 + L +G + + F + L G + +T + T + Sbjct: 945 VVADNALSAGETTLVTFTFSQAVSGFSNADLSVANGTLSAVSSSDGGITWTATFTPNANV 1004 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK 457 ++ +T + S S G + + + V D L+ + R Sbjct: 1005 TDAGNLITLDNTGVTNASGSTGSGTTASNN-YTIDTQRPTATIVVTDSLLAIGETSRVTI 1063 Query: 458 YISGSTEQGFRFNEITQ 474 S GF ++T Sbjct: 1064 TFS-EAVSGFSNADLTV 1079 >gi|6448471|dbj|BAA86912.1| podocalyxin-like protein 1 [Mus musculus] Length = 503 Score = 39.5 bits (90), Expect = 1.8, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 44/194 (22%), Gaps = 9/194 (4%) Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 +T+ V HP L + + P S S Sbjct: 41 QSATTSTEVTTGHPVASTLASTQPSNPTPFTTSTQSPFMPTSTPNPTSNQSGGNLTSSVS 100 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 T + F + G + ++G V+ + T+ Sbjct: 101 EVDKTKTSSPSSTAFTSSSGQTASSGGKSGDSFTTAPTTTLGLINVSSQPTDLNTTSKL- 159 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 +T DN + ++ S T+ S D +++ Sbjct: 160 --------LSTPTTDNTTSPQQPVDSSPSTASHPVGQHTPAAVPSSSGSTPSTDNSTLTW 211 Query: 305 APQSQTLFQAGVSV 318 P + + Sbjct: 212 KPTTHKPLGTSEAT 225 >gi|229816885|ref|ZP_04447167.1| hypothetical protein BIFANG_02133 [Bifidobacterium angulatum DSM 20098] gi|229785630|gb|EEP21744.1| hypothetical protein BIFANG_02133 [Bifidobacterium angulatum DSM 20098] Length = 1043 Score = 39.1 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 21/247 (8%), Positives = 49/247 (19%), Gaps = 6/247 (2%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVH 134 D + + K ++ TA Sbjct: 627 SDTTVYAHWAIKSYIVAFDSAGGSAVDAQKVQYGSKVVSPAAPTRTGHTFQGWYTARNGG 686 Query: 135 KDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITS 194 + + +T + G + T+T + Sbjct: 687 SKYDFGQAVTGDITLYAHWTVNSYTLTFDGNGGKPTETSRTVAYGSPYGTMPTATRTGYT 746 Query: 195 DMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGA 254 + G + + + Y Y + G K Sbjct: 747 FEGWYTAKSGGSQVYMS------TAMGASNATVYAHWTANTYTATFDSNGGSAVASQKVQ 800 Query: 255 TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQA 314 + N T + + + +G + DV+ R + + T Sbjct: 801 YGSRINRPADPTRTGYTFQGWYTAKNGGTRYDFDKAVTGDVTLYARWAVITFRDVTSSTP 860 Query: 315 GVSVVSW 321 + ++W Sbjct: 861 HSADIAW 867 >gi|146313045|ref|YP_001178119.1| outer membrane autotransporter [Enterobacter sp. 638] gi|145319921|gb|ABP62068.1| outer membrane autotransporter barrel domain [Enterobacter sp. 638] Length = 863 Score = 39.1 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 33/305 (10%), Positives = 71/305 (23%), Gaps = 18/305 (5%) Query: 81 LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF----GSTAVFVHKD 136 + FGD + + + +L A G+ + H Sbjct: 252 VDAFGDVAIGMYGTTHDSLVLNNSTVTGDIGAINENGATTLSLANNSVVKGNVTLEGHSA 311 Query: 137 HPPHHLLYIQDGDKISFTFDE---IKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT 193 + DG+ + I + + +G + + + + Sbjct: 312 NDLLVDNSTVDGNVNASQNSGNTTITLQNNAAVNGDITTGKGDDTLVLTNNSRVDGNVDG 371 Query: 194 SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKG 253 D +D G SI E T+ + + +D L G Sbjct: 372 GDGSDTLSMDAGSSISGQISQFETVNTTSNNSISIDKINDTTTWDLQNGSRLVAQSTGSN 431 Query: 254 ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ 313 AT + + + +S + I + + + F Sbjct: 432 ATVTMSTDSFVDFGTITGANNAVVVSSITASARDQKNVILGTFNTASTNTPQAYAGATFT 491 Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373 G V A+ +NN L + + + + ++ G Sbjct: 492 NGQQSVENRSGAYN-----------YNNELNIAAADSAPQTQRAADNSQSWNIEFTSAKG 540 Query: 374 CYDPT 378 Sbjct: 541 SLASD 545 >gi|328683463|ref|NP_112612.2| low-density lipoprotein receptor-related protein 4 precursor [Rattus norvegicus] gi|328671584|dbj|BAD18061.2| LDL receptor-related protein 4 [Rattus norvegicus] Length = 1905 Score = 39.1 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 22/266 (8%), Positives = 57/266 (21%), Gaps = 10/266 (3%) Query: 87 KKLQIVVVRSS--TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 K + P+ T P F+ S A + + + + Sbjct: 698 GKNRCGDNNGGCTHLCLPSGQNYTCACPTGFRKINSHACAQSLDKFLLFARRMDIRRISF 757 Query: 145 IQDGD-----KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 + ++ + + V ++ T + + Sbjct: 758 DTEDLSDDVIPLADVRSAVALDWDSRDDHVYWTDVSTDTISRAKWDGTGQKVV---VDTS 814 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 G +I + W I + R + Sbjct: 815 LESPAGLAIDWVTNKLYWTDAGTDRIEVANTDGSMRTVLIWENLDRPRDIVVEPMGGYMY 874 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + + + ++ W + + + + A + Sbjct: 875 WTDWGASPKIERAGMDASNRQVIISSNLTWPNGLAIDYGSQRLYWADAGMKTIEFAGLDG 934 Query: 320 SWFMSAWGEQEGYPSHVTFHNNRLLF 345 S G Q +P +T + R+ + Sbjct: 935 SKRKVLIGSQLPHPFGLTLYGQRIYW 960 >gi|47116978|sp|Q9QYP1|LRP4_RAT RecName: Full=Low-density lipoprotein receptor-related protein 4; Short=LRP-4; AltName: Full=Multiple epidermal growth factor-like domains 7; Flags: Precursor Length = 1905 Score = 39.1 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 22/266 (8%), Positives = 57/266 (21%), Gaps = 10/266 (3%) Query: 87 KKLQIVVVRSS--TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 K + P+ T P F+ S A + + + + Sbjct: 698 GKNRCGDNNGGCTHLCLPSGQNYTCACPTGFRKINSHACAQSLDKFLLFARRMDIRRISF 757 Query: 145 IQDGD-----KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 + ++ + + V ++ T + + Sbjct: 758 DTEDLSDDVIPLADVRSAVALDWDSRDDHVYWTDVSTDTISRAKWDGTGQKVV---VDTS 814 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 G +I + W I + R + Sbjct: 815 LESPAGLAIDWVTNKLYWTDAGTDRIEVANTDGSMRTVLIWENLDRPRDIVVEPMGGYMY 874 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + + + ++ W + + + + A + Sbjct: 875 WTDWGASPKIERAGMDASNRQVIISSNLTWPNGLAIDYGSQRLYWADAGMKTIEFAGLDG 934 Query: 320 SWFMSAWGEQEGYPSHVTFHNNRLLF 345 S G Q +P +T + R+ + Sbjct: 935 SKRKVLIGSQLPHPFGLTLYGQRIYW 960 >gi|149022634|gb|EDL79528.1| low density lipoprotein receptor-related protein 4, isoform CRA_b [Rattus norvegicus] Length = 1414 Score = 39.1 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 22/266 (8%), Positives = 57/266 (21%), Gaps = 10/266 (3%) Query: 87 KKLQIVVVRSS--TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 K + P+ T P F+ S A + + + + Sbjct: 698 GKNRCGDNNGGCTHLCLPSGQNYTCACPTGFRKINSHACAQSLDKFLLFARRMDIRRISF 757 Query: 145 IQDGD-----KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 + ++ + + V ++ T + + Sbjct: 758 DTEDLSDDVIPLADVRSAVALDWDSRDDHVYWTDVSTDTISRAKWDGTGQKVV---VDTS 814 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 G +I + W I + R + Sbjct: 815 LESPAGLAIDWVTNKLYWTDAGTDRIEVANTDGSMRTVLIWENLDRPRDIVVEPMGGYMY 874 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + + + ++ W + + + + A + Sbjct: 875 WTDWGASPKIERAGMDASNRQVIISSNLTWPNGLAIDYGSQRLYWADAGMKTIEFAGLDG 934 Query: 320 SWFMSAWGEQEGYPSHVTFHNNRLLF 345 S G Q +P +T + R+ + Sbjct: 935 SKRKVLIGSQLPHPFGLTLYGQRIYW 960 >gi|317053337|ref|YP_004119104.1| outer membrane autotransporter barrel domain-containing protein [Pantoea sp. At-9b] gi|316953076|gb|ADU72548.1| outer membrane autotransporter barrel domain protein [Pantoea sp. At-9b] Length = 1409 Score = 39.1 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 38/320 (11%), Positives = 90/320 (28%), Gaps = 13/320 (4%) Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFK 200 ++ + D+ + + IS +++ + Sbjct: 60 TVVSGAGVSQTLNNGDDAENVTVTSNARQYISAGAEATLTTVTNSGNQVIYSGGLAYSTT 119 Query: 201 PLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDN 260 D G + W N Y+ A VY ++ + + A + N Sbjct: 120 LSDSGSYQYVNSGAEAWFTTVNNEATQYVSAGGYVYWTILSSGGTLELTPNASAYDITVN 179 Query: 261 NITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 + + S+ ++ GA+ G +++ + Sbjct: 180 SGGRAHIAGGSAGWITLNSGGALTVTAGGVATAISQLAGGALTADTSTTLDGNNS----- 234 Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCY--DPT 378 + A+ G +++ NN + S G+ + + S G S G Sbjct: 235 --LGAFSVSGGQANNLLLENNGIFSVLSGGNATNTTVGSAGLAVVMSGGTADGTTVNSGG 292 Query: 379 KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW----LLSISLSKGLSIDFRRVSGSGV 434 + + + +AS ++ EG G + + + + L+ G +I+ Sbjct: 293 RQIIYSGGSATASILNGGLETVEGTATGTTINQYGEQDVNTGGLAIGTTINSTGTQYVYG 352 Query: 435 YACPPVSVGDCLVFVCGVGR 454 A + + +V G Sbjct: 353 TATSAIVNSGGVQYVQSDGS 372 >gi|322708086|gb|EFY99663.1| prefoldin subunit 3, putative [Metarhizium anisopliae ARSEF 23] Length = 2275 Score = 39.1 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 28/258 (10%), Positives = 55/258 (21%), Gaps = 15/258 (5%) Query: 91 IVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAV--FVHKDHP----PHHLLY 144 I S T + G TP S Y++ G T + P L+ Sbjct: 1693 IFDPTSQTSSQTGVPGSGTTTP-ATGTLASQSYSITGPTTTPIVTTRQFPLNTTVASLVT 1751 Query: 145 IQDGDKISFTFDEIK-----FLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 + N + S D + + Sbjct: 1752 GGGDLTTALGASATTSGAQFITSRQNNSIPTTQSDPFNTATTQSTTDQYGSTGNTLGSSA 1811 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 I P + ++ + ++ + +++ + G S+ V Sbjct: 1812 TTTSVKEFITSSQSEPTTSASSTDFSTSSAPSNGQTTQNVPGSTT---IGNSEPTATVST 1868 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 N ++ P + +S A S T Sbjct: 1869 LTSGPPQTSNTGDASNPTGLPTVTLPGLTTTSSSTEQQGSQSTGTATSSPTTTITVTPTG 1928 Query: 320 SWFMSAWGEQEGYPSHVT 337 +P+ T Sbjct: 1929 QPDSKVPTAFSSFPTATT 1946 >gi|124009915|ref|ZP_01694581.1| conserved hypothetical protein [Microscilla marina ATCC 23134] gi|123984066|gb|EAY24439.1| conserved hypothetical protein [Microscilla marina ATCC 23134] Length = 768 Score = 39.1 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 27/261 (10%), Positives = 53/261 (20%), Gaps = 16/261 (6%) Query: 76 DGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS------- 128 G + F + K++ V + + + S AV + Sbjct: 347 AGDMYIPEFTNGKIRKVAYPDLNLKTTSSLAVGATHDFGSATVGSNTGAVTFTAENLGSG 406 Query: 129 --TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 T D + + V + A+ + + Sbjct: 407 NLTLTGSAGSFATLGGTNAGDFSISQASLTSPIAESGNKTFTVTFTPVAAGARSATLTIN 466 Query: 187 T-------STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + T ++T +D G S A + Sbjct: 467 SDDPNENPYTIKLTGTATACNAVDAGSIGSAQTICSGGTPALLTSTTAASGGNGSFTYQW 526 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + G F GAT + G+ V + Sbjct: 527 QSSGDGTNFSNVSGATSATYQPPALSQNTYYRRTATSGGGCGSANSANVLLTVNAPQAPT 586 Query: 300 RSISVAPQSQTLFQAGVSVVS 320 SI+ T+ + Sbjct: 587 VSITSDDADNTIAPGTKVTFT 607 >gi|55793857|gb|AAV65851.1| CD45 precursor [Ictalurus punctatus] Length = 1645 Score = 39.1 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 32/290 (11%), Positives = 58/290 (20%), Gaps = 19/290 (6%) Query: 138 PPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMK 197 P L Q + + LP + + S A TS + TS Sbjct: 226 PCTPLHQHQSHTTSNERENYTTGLPTSTPSHQHPNTTVTVTTAENSSASTSDSARTSSPM 285 Query: 198 IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYV 257 S + + ++ Sbjct: 286 NPISPPLTTSTATDNDDIGTPARNSSHSNITTAGAVTEMNIT---GFPPSTPLHQHQSHT 342 Query: 258 KDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317 N +S S + + V S R+ S + Sbjct: 343 TSNERENYMTGLPTSTPSHQHPNTTVTVTTAENSSASTSDSARTSSPMNPISPPLTTSTA 402 Query: 318 VVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377 G P+ + H+N + +G++ + + S Sbjct: 403 T-------DNNDTGTPARNSSHSN-ITTAGAENYTAGL-PTITSEHQHQSYITLNTTVAD 453 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFR 427 T +T H + + S S S S Sbjct: 454 N-----NTTALLPNTTKHQHS--RATVTMTTAEIVSTSTSDSPRTSSTMN 496 >gi|269104660|ref|ZP_06157356.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] gi|268161300|gb|EEZ39797.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] Length = 3986 Score = 39.1 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 67/250 (26%), Gaps = 10/250 (4%) Query: 95 RSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154 T + + K +T + + S+ + D +++ Sbjct: 1107 DGKTVGTTTVENHDGKLTWTAQVDGSVLEHASADSVKAT-VTTTDAAGNRATATDDHTYS 1165 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214 D V ++ S + + G+++ Sbjct: 1166 IDTDIAAKITISSIATDDVVNADEAHSKVPVTGTVGADVKAGDTVTVIVDGKTVGTTTVE 1225 Query: 215 P-----EWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLN 269 W + S+ + D TT +G+R + Y D +IT + Sbjct: 1226 NHDGKLTWTAQVDGSVLEHASTDSVKATVTTTDAAGNRATATDDHLYSIDTDITAKITIT 1285 Query: 270 LSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS----WFMSA 325 + +A A + V G + K G +++V +T+ V W Sbjct: 1286 SIATDDVINADEAHSKVPVTGTVGADVKAGDTVTVIVDGKTVGTTTVENHDGKLTWTAQV 1345 Query: 326 WGEQEGYPSH 335 G + S Sbjct: 1346 DGSVLEHASA 1355 Score = 38.7 bits (88), Expect = 3.0, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 67/250 (26%), Gaps = 10/250 (4%) Query: 95 RSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154 T + + K +T + + S+ + D +++ Sbjct: 1323 DGKTVGTTTVENHDGKLTWTAQVDGSVLEHASADSVKAT-VTTTDAAGNRATATDDHTYS 1381 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214 D V ++ S + + G+++ Sbjct: 1382 IDTDIAAKITITSIATDDVVNADEAHSKVPVTGTVGADVKAGDTVTVIVDGKTVGTTTVE 1441 Query: 215 P-----EWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLN 269 W + S+ + AD TT +G+R + Y D +I + Sbjct: 1442 NRDGKLTWTAQVDGSVLEHASADSVKATVTTTDAAGNRATATDDHLYSIDTDIAAKITIT 1501 Query: 270 LSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS----WFMSA 325 + +A A + V G + K G +++V +T+ V W Sbjct: 1502 SIATDDVINADEAHSKVPVTGTVGADVKAGDTVTVIVDGKTVGTTTVENHDGKLTWTAQV 1561 Query: 326 WGEQEGYPSH 335 G + S Sbjct: 1562 DGSVLEHAST 1571 Score = 38.4 bits (87), Expect = 3.8, Method: Composition-based stats. Identities = 33/249 (13%), Positives = 66/249 (26%), Gaps = 8/249 (3%) Query: 95 RSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154 T + + K +T + + S+ + I D Sbjct: 2079 DGKTVGTATVENHDGKLTWTAQVDGSVLEHASADSVKATVTTTDAAGNRAIATDDHTYSI 2138 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG----RSIRL 210 +I A + T A + + + +D ++ Sbjct: 2139 DTDIAAKITISSIATDDVVNADEAHSKVPVTGTVGADVKAGDTVTVIVDGKTVGTTTVEN 2198 Query: 211 GCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNL 270 W + S+ + D TT +G+ + TY D +I + Sbjct: 2199 HDGKLTWTAQVDGSVLEHASTDSVKATVTTTDAAGNSATATDDHTYSIDTDIAAKITITS 2258 Query: 271 SSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS----WFMSAW 326 + +A A + V G + K G +++V +T+ V W Sbjct: 2259 IATDDVVNADEAHSKVPVTGTVGADVKAGDTVTVIVDGKTVGTTTVENHDGKLTWTAQVD 2318 Query: 327 GEQEGYPSH 335 G + S Sbjct: 2319 GSVLEHASA 2327 >gi|73669489|ref|YP_305504.1| cell surface protein [Methanosarcina barkeri str. Fusaro] gi|72396651|gb|AAZ70924.1| cell surface protein [Methanosarcina barkeri str. Fusaro] Length = 1842 Score = 39.1 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 26/310 (8%), Positives = 65/310 (20%), Gaps = 16/310 (5%) Query: 92 VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKI 151 +T ++ + P T E + L D Sbjct: 1312 YYYEGATGFTTPTWNSVACYPLTAAPVADFE----ADVTSGIGPMIVKFTDLSTSSPDTW 1367 Query: 152 SFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLG 211 ++ FD G + N + + T T +T + Sbjct: 1368 AWDFDND----------GTADSTEQNPSYTYTSVGTYTVNLTVANANGTDSEVKTDYITV 1417 Query: 212 CHPPEWAKNTNYSIGAYIVADD-KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNL 270 P A+ + N Sbjct: 1418 SEPSTPAEPVAAFTADVTAGTAPLTVNFTDQSTGTPTSWIWEFGDGANSTEQKPSHTYNE 1477 Query: 271 SSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE 330 + + ++ + P + + V ++ + Sbjct: 1478 AGNYTVNLTVKNSIGSNSTVKTNYITVSSTPVEPEPVAAFIADVTSGTVPLIVNFMDQST 1537 Query: 331 GYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSA 390 G P+ + + ++ + + Y ++ + ++ E G K V+ S+ Sbjct: 1538 GSPTS-WIWDFGDGTNATEQNPVHTYTATGTYTVNLTVSNEDGNDSDIKTGYIKVSSQSS 1596 Query: 391 STIHWMHPFG 400 + Sbjct: 1597 AKPVAAFTAS 1606 >gi|330806900|ref|YP_004351362.1| hypothetical protein PSEBR_a225 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327375008|gb|AEA66358.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 2412 Score = 39.1 bits (89), Expect = 2.2, Method: Composition-based stats. Identities = 40/374 (10%), Positives = 95/374 (25%), Gaps = 3/374 (0%) Query: 101 SPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF 160 + + T ++ + TA V D Q + Sbjct: 1593 NTGVSDAAGNTGAGTTNSTNYAIDTQVPTATIVVADTSLSIGETSQVTITFNEAVSGFDN 1652 Query: 161 LPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN 220 + + S+ ++ + T +A I+ + + G G + Sbjct: 1653 SDLTISNGTLSNVSSSDGGVTWTATFTPSASISDTSNLITLDNTGVVNVSGNAGVGTTDS 1712 Query: 221 TNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 NY++ V V ++V N + S+ Sbjct: 1713 NNYAVDTVRPTATIVVADTAIAAGETSLVTITFNEAVTGFTDADLSVANGTLSG-LSSSD 1771 Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN 340 G + + V+ I++A + + + PS + Sbjct: 1772 GGITWTATFTPTSGVTDTSNVITLANSGVADLAGNAGSGTTDSNNYSVDSQRPSATIVLS 1831 Query: 341 NRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400 + +L G + + F + L G + +T + T Sbjct: 1832 DSVLKPGETAQVTITFSEAVTGFSNADLSVANGTLSAVSSSDGGLTWTATFTPTLGVTDT 1891 Query: 401 EGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYIS 460 ++ +T + + + G + + + V D + + + S Sbjct: 1892 SNLITLDNTGVSDAAGNTGTGTTDSAN-YAVETQVPTATIVVADSALRIGETSQVTITFS 1950 Query: 461 GSTEQGFRFNEITQ 474 GF +++T Sbjct: 1951 -EAVSGFDNSDLTI 1963 >gi|20089735|ref|NP_615810.1| cell surface protein [Methanosarcina acetivorans C2A] gi|19914669|gb|AAM04290.1| cell surface protein [Methanosarcina acetivorans C2A] Length = 2566 Score = 38.7 bits (88), Expect = 2.4, Method: Composition-based stats. Identities = 25/253 (9%), Positives = 55/253 (21%), Gaps = 12/253 (4%) Query: 101 SPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF 160 + + + + + + + G D I Sbjct: 820 ISTGSVNSVAWDFNNDGITDSTF-QNPVYTFETNGIYTVNLTVTGPSGSDSEVKRDYINV 878 Query: 161 LPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN 220 + L + + +++ T+ S L G ++ + Sbjct: 879 ISNVDLTVSTNPTLYPSNNNTVTATVTNIGTENSPAFSVNFLIDGINMTAEAAGLAGGSS 938 Query: 221 TNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 T S+ V + + T ++ N + + + Sbjct: 939 TTVSVVDIKRHLGDVVNITVKADPENTVAETNETNNEYTTTATVVSSGNYYTGGRFYTGN 998 Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE-GYPSHVTFH 339 Y G+I G S W + P+ VT Sbjct: 999 DLETGAYQEGNIAVKYSQGDSGYK----------SGGGWYSTTVHWTNTDLPIPADVTVK 1048 Query: 340 NNRLLFSGSKGDE 352 RL S + + Sbjct: 1049 EARLYQSYTWNNP 1061 >gi|73669308|ref|YP_305323.1| hypothetical protein Mbar_A1802 [Methanosarcina barkeri str. Fusaro] gi|72396470|gb|AAZ70743.1| hypothetical protein Mbar_A1802 [Methanosarcina barkeri str. Fusaro] Length = 2036 Score = 38.7 bits (88), Expect = 2.4, Method: Composition-based stats. Identities = 16/237 (6%), Positives = 42/237 (17%), Gaps = 4/237 (1%) Query: 88 KLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQD 147 L + + + TP + + Sbjct: 1678 NLTVANANGTDSEVKTDYITVSSTPVEPEPVAAF----IADVTSGTVPLIVNFMDQSTSS 1733 Query: 148 GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS 207 + F + + L++S D S + + + + Sbjct: 1734 PTSWLWDFGDGTNATEQNPVHTYTATGTYTVNLTVSNEDGSDSEVKTGYIKVSSQSSAKP 1793 Query: 208 IRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITV 267 + P K G F + Y K T Sbjct: 1794 VAAFTASPTSGKTPLKVKFTDTSTGSPTSWFWKFGDGSKSFLQNPIHKYSKAGTYTVNLT 1853 Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS 324 + + + + + + + + + + W Sbjct: 1854 VKNAKGKNTVTKTEYIKVITKPVANFSANPTSGKAPLKVKFTDTSTGTPAKWIWDFG 1910 >gi|167515828|ref|XP_001742255.1| hypothetical protein [Monosiga brevicollis MX1] gi|163778879|gb|EDQ92493.1| predicted protein [Monosiga brevicollis MX1] Length = 399 Score = 38.7 bits (88), Expect = 2.4, Method: Composition-based stats. Identities = 34/373 (9%), Positives = 83/373 (22%), Gaps = 53/373 (14%) Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251 + + + F + + + + Y D + + + + S Sbjct: 48 VVTTLAQFASSVFAADLNNDGYLDILSATVRGKVEWYRNHADGTFSNPISISTIMSRTQS 107 Query: 252 KGATYVKDNNITWITVLNLSSK-TSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQT 310 A + ++ + +++ +G V + + Sbjct: 108 VYAADLDNDGSLDVLSGSINDNNVVWWRNNGNGTFMNEMLISDAVDFTSMVYAADLNNDG 167 Query: 311 LFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG 370 + AW G +F + R++ + G Sbjct: 168 RLDVLSASRDDNKVAWYPNNG---EGSFSDQRIITLNALGASSVY--------------- 209 Query: 371 EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVS 430 D L + + W G G L + + + + Sbjct: 210 -AADLDGDGHLDVLSASSGDNKLAWYRNDGNG---TFSGELAITTEADDAVTAHAADLDG 265 Query: 431 GSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQ 490 + D +V+ G F I+ Sbjct: 266 DGHLDVLGASVGDDRVVWYRNQGNGT-----------------------FTGPIVITTTA 302 Query: 491 EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGG 550 P S+ V L+ ++E + AW+ + + G Sbjct: 303 SNPSSLYAVDLDNDGRLD-----VLGTSELDNKVAWYRNNGDGTFSSENV--ISTAAAGA 355 Query: 551 TSLWMLVALSAGE 563 +S++ + G Sbjct: 356 SSVYAADLDNDGS 368 >gi|118576014|ref|YP_875757.1| hypothetical protein CENSYa_0820 [Cenarchaeum symbiosum A] gi|118194535|gb|ABK77453.1| hypothetical protein CENSYa_0820 [Cenarchaeum symbiosum A] Length = 11910 Score = 38.7 bits (88), Expect = 2.5, Method: Composition-based stats. Identities = 39/413 (9%), Positives = 102/413 (24%), Gaps = 32/413 (7%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWS--PALFGKTYKTPYTFKDNKSLEYAVFGST 129 F G L V GD ++ + ++ A+F +Y T L ++ G Sbjct: 7643 FEFSSDGTLLFVLGDSNKRLYRYDLAAPYAAHTAVFNASYSLSNTVGRVSGLAFSEIGLF 7702 Query: 130 AVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST 189 + I + + + ++++ T Sbjct: 7703 YYLSEQGGMTVR-------RFIVASELFVPSPAIGGGFYNLSGQGIRPTEVNVENNGTVM 7755 Query: 190 ARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTN--YSIGAYIVADDKVYRSLTTGRSGDR 247 + D G + P + + + + R Sbjct: 7756 FVLDRDSAFVHGYSLGAQDDVRSASPSSMLDVSAYATAATGMAFSGDGLRIFVLDGGNST 7815 Query: 248 FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQ 307 + ++ L+++ A + + + + +++ Sbjct: 7816 VHRFDMLYPYDLSGAAYVDSLDIAIAGGNTHDVAFSADGLLMFAVGAIDDTVYTFALSTP 7875 Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + A G G P+ + + + + +S G Y Sbjct: 7876 YDITPSLYAPGID----ADGGAPGEPAVIAVSSGGHVAAA---------ISGTGDIYWRE 7922 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS--LWLLSISLSKGLSID 425 L + A + + S + + + + + + L+ ++ Sbjct: 7923 LAVPHNLDTAGPASSVPLGIGSPAGLAFSTNGARMFAADTNGTIFQYTLAEDYDLSTAVP 7982 Query: 426 FRR-VSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLAD 477 +G G ++ L+FV ++ S F +I+ Sbjct: 7983 DTTWQTGVGDVCGISLASEGSLIFVASGDDSVR--RYSLASSF---DISAAGP 8030 >gi|9630489|ref|NP_046920.1| gp25 [Enterobacteria phage N15] gi|3192708|gb|AAC19061.1| gp25 [Enterobacteria phage N15] Length = 470 Score = 38.7 bits (88), Expect = 2.6, Method: Composition-based stats. Identities = 32/270 (11%), Positives = 67/270 (24%), Gaps = 20/270 (7%) Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRF 248 ++ I L G + P ++ + + + + G + Sbjct: 104 QQVFSAAGNITVKLPDGTTFTGPSWPSVISQTSTLNGKTGGLVQGSLL-VTPGDSIGVKS 162 Query: 249 GYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD----------IKDVSKD 298 G T V N+ + V + + SG V GD I D Sbjct: 163 GTGGDKTIVLVNSPSDGPVGTYVNSIAGNYYSGNWRMGAVRGDGVDVSRVQLNIYDGVSS 222 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLF---SGSKGDELSV 355 S P + + + AWG G+ +++F+ L G Sbjct: 223 SASFMFYPNELFKASSCGAPGDFRGDAWGVLNGWAKNISFYRENLSSPNNGGFVPFGRWN 282 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 S G + L + + + + + + Sbjct: 283 SYCSGGYYSTAGLGSLATGPQSFADIVMTTLCDAGN------AGQRTFYFQTTSGDIVTT 336 Query: 416 ISLSKGLSIDFRRVSGSGVYACPPVSVGDC 445 + + + F + + + V D Sbjct: 337 GAGAAPGNYIFSKQANCDITLKHNVKYDDG 366 >gi|293341112|ref|XP_001076773.2| PREDICTED: mCG6879-like [Rattus norvegicus] Length = 1704 Score = 38.7 bits (88), Expect = 2.6, Method: Composition-based stats. Identities = 22/281 (7%), Positives = 58/281 (20%), Gaps = 8/281 (2%) Query: 100 WSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIK 159 + T + + ++ D ++ T Sbjct: 803 TASQTVLTEESTTWRSSSISTETAVAPETSFSTALTDVSTTSPARTASTNETHGTVTSQT 862 Query: 160 FLPPPWLGDGMISGVKSNAKLS-ISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWA 218 P S A S S T+ ++I P Sbjct: 863 GFTPGSATFPTSSWSTEPAVTSETSYTSADNEASTASPSTVISTQATQTIGTSQTVPTQE 922 Query: 219 KNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRES 278 T + + TT + + + + +T+ Sbjct: 923 STTLPTESVSTETAGSPPMTHTTSLTETSTASPGAPISTQGTQTSEKPQTIFTQETTTYP 982 Query: 279 ASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTF 338 + V D + + + + Q + + + E +P Sbjct: 983 HTTISTETAVPPDTSPSTAVTGTFTTSTTVPVSTQETQATDTSQTALTQESTTFPPSTLS 1042 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK 379 + + + ++ ++ S +P + Sbjct: 1043 -------TDTSVPPDILLSTALSDYFTTSPTITVSTQEPRE 1076 >gi|290982352|ref|XP_002673894.1| predicted protein [Naegleria gruberi] gi|284087481|gb|EFC41150.1| predicted protein [Naegleria gruberi] Length = 2807 Score = 38.7 bits (88), Expect = 2.6, Method: Composition-based stats. Identities = 39/407 (9%), Positives = 96/407 (23%), Gaps = 29/407 (7%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV------FGS 128 G + F + +++ + + Y N L Y G Sbjct: 549 SSGEIYIADFNNHRIRKINISGYISTIAGTGSVGYSGDGGLATNAQLYYPQTVAVSSSGE 608 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFD-------EIKFLPPPWLGDGMISG------VK 175 + +H + I+ T + + + + Sbjct: 609 IYIADAYNHRIRKINTSGYISTIAGTGSVGYSGDGGLATSAQLYYPFSVAISSVGEIYIA 668 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 I + +TS T + S+G + D Sbjct: 669 DTYNHRIRKINTSGYISTISGTGSGGYSGDGGLATSAQLNYPFSVAVSSVGEIYIVDTNN 728 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 YR SG + T N I + + + V Sbjct: 729 YRIRKINTSGY--ISTIAGTGTGGYNGDSILATSAQLNYPYGLTISSTSEIIVADYYNHR 786 Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQ---EGYPSHVTFHNNRLLFSGSKGDE 352 + + F G + F+SA+ + G +N+R+ + G Sbjct: 787 IRKINTSGYISTIAGGFGDGDMATTSFISAYSFEFTLNGEIIIADSNNHRIRKITTLGYI 846 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 ++ + + + + + + V + Sbjct: 847 STISGTGTAGYNGDEILATNSQLNNPNGIALSSNSE---IYIADTNNHRIRKVNASGYIS 903 Query: 413 LLSISLSKGLSIDFRRVSGSGVYACPPVSV--GDCLVFVCGVGRRIK 457 ++ + + G + D + + + +++ ++ RI+ Sbjct: 904 TIAGTGTGGYNGDGVLATSAQLNYPNGIAIQENGEILIADNNNHRIR 950 >gi|146301913|ref|YP_001196504.1| glycoside hydrolase family protein [Flavobacterium johnsoniae UW101] Length = 1332 Score = 38.7 bits (88), Expect = 2.7, Method: Composition-based stats. Identities = 40/396 (10%), Positives = 88/396 (22%), Gaps = 21/396 (5%) Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 S V + ++F + + I K T S+ + Sbjct: 600 NSSANVAKYVRNVTEQYDVLFFNTQTSI-EDAGLFKNQTNKILIDVYTTAPVGTVVSMNF 658 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 ++ ++P ++ + F G + + L + Sbjct: 659 ENSAASL---PANYPTGRNSNYVAITTKQNQWETLTFYYNSSPDAGTSNLAVNQMVLLFN 715 Query: 184 QADTSTARI------TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 + + K+ G + VA+ Sbjct: 716 SGSYTNDTYYFDNIRIASTKLPDTFTPGVVYEDYQNTHNITFRDAIGTYTANVANPSAGG 775 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSK 297 T+ G S N T + + + T + + + Sbjct: 776 INTSSNVGRYVRKSTELYDNFSFNTTLNNIGDFKAGTKKFAMDVYTSAPVGSIISWQAES 835 Query: 298 DGRSISVAPQSQTLF--QAGVSVVSWFMSAWGEQ-EGYPSHVTFHNNRLLFSGSK--GDE 352 S P + +W + S NR +F Sbjct: 836 SASIPSNYPVGRHSIYQGVVKQTNTWHTITFTYVSTPDASTADNDVNRFVFLFEPGTNSG 895 Query: 353 LSVYLSSFGAFYDFSLDGEYG-----CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407 + Y + A S + G A+T A ++ + G + Sbjct: 896 NTYYFDNLRALNLVSTETPAGLPSPWISTDLGAVTPAGEATHSNGTFTIKGSGTDIWETS 955 Query: 408 DTSLWLLSI-SLSKGLSIDFRRVSGSGVYACPPVSV 442 D ++ + + ++ + YA V Sbjct: 956 DQFQYVNQPITGDAEIIAKVNSLTNTNTYAKAGVMF 991 >gi|328872857|gb|EGG21224.1| hypothetical protein DFA_01099 [Dictyostelium fasciculatum] Length = 1339 Score = 38.7 bits (88), Expect = 2.8, Method: Composition-based stats. Identities = 29/328 (8%), Positives = 61/328 (18%), Gaps = 40/328 (12%) Query: 31 AQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQ 90 A + N L + SFS +L ++ Sbjct: 768 ATSLDSVTNFWTLPT-----------MGAVNIVNYGTTWVSFSYSSNNGRVLGANTFAIR 816 Query: 91 IV--------------VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV---FGSTAVFV 133 + T + + V Sbjct: 817 VNGVLSTNTTCSSSTSCYVGGLTAGSTPSISILSTNNGETSITPGTASQKLYNSVNTLTV 876 Query: 134 HKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT 193 I + + V+ + LS T A +T Sbjct: 877 TPSLQTSSSFSISYSSLEGIPGQTTYLVLLDDVSYPSCPTVQGDCSLSPLSPKTYNATVT 936 Query: 194 SDMKIFK---PLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGY 250 + + + +P E + + + + G + Y Sbjct: 937 ATNDGLVLVKTIMVLVTTHPSMNPIEVGEYGTTWV---------EFDYSSIGGTAGGNSY 987 Query: 251 SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQT 310 + ++ +S ++ Y + P S Sbjct: 988 TINVAGSDITTVSCKQGPYCKIVGLTAGSSVVISIYVTNNGEDSSTVSTTVTLYKPTSPP 1047 Query: 311 LFQAGVSVVSWFMSAWGEQEGYPSHVTF 338 + +W E +G P F Sbjct: 1048 TITLSRISATTLNVSWVENDGVPGQSLF 1075 >gi|332669695|ref|YP_004452703.1| hypothetical protein Celf_1181 [Cellulomonas fimi ATCC 484] gi|332338733|gb|AEE45316.1| protein of unknown function UPF0182 [Cellulomonas fimi ATCC 484] Length = 1019 Score = 38.7 bits (88), Expect = 3.0, Method: Composition-based stats. Identities = 43/365 (11%), Positives = 82/365 (22%), Gaps = 24/365 (6%) Query: 191 RITSDMKIFKPLDKGRSIRLGCHPPEWAKN--TNYSIGAYIVADDKVYRSLTTGRSGDRF 248 I + + + D S E + + + D +V R Sbjct: 354 NIDATLAAYGLEDVQTSEYNAKVTTEAGALRADADTTASVRLLDPQVVSPSFKQLQQIRG 413 Query: 249 GYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQS 308 Y + D + G W D + V Sbjct: 414 FYHFPDSLSVDRYEVEGESRDTVIAVRELDLDGLDDQQRNW--TNDTTVYTHGFGVVAAY 471 Query: 309 QTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL 368 + W + + R+ F + S+ + G ++F Sbjct: 472 GNTTAGRGAPDFWEGGIPSR-----GSMGEYEPRIYF-SPQAPTYSIVGAPSGDGWEFDY 525 Query: 369 DGEYGCYDPTKALTTAVTDFSA------STIHWMHPFGEGVLVGCDTSLWLLSISLSKGL 422 + T + + + FG+ LV + + I + Sbjct: 526 PSDDAAGQELTRFPTQDVSAGPSIGNPWNKLLYALKFGDEQLVFSNRVTDVSQILYDRNP 585 Query: 423 SIDFRRVSGSGVYACP--PVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 +V+ P V + ++ S + T+ A L Sbjct: 586 RDRVAKVAPYLTLDGRVYPAVVDGRVKWIVDGYTTSDQYPYSAGRSLESA--TRDA--LT 641 Query: 481 NQRILQLVYQEEP-HSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLS 539 Q + + I V D + + E AW + LS Sbjct: 642 ETTETIQALQPKTVNYIRNSVKATVDAYDGSVDLYAWDPEDPVLAAWSE-VFPTSLQPLS 700 Query: 540 AASFP 544 S P Sbjct: 701 EISGP 705 >gi|290989086|ref|XP_002677176.1| predicted protein [Naegleria gruberi] gi|284090782|gb|EFC44432.1| predicted protein [Naegleria gruberi] Length = 2103 Score = 38.4 bits (87), Expect = 3.1, Method: Composition-based stats. Identities = 33/263 (12%), Positives = 65/263 (24%), Gaps = 8/263 (3%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVH 134 G + +G+ +++ V Y N A S V Sbjct: 564 SSGELYIADYGNHRIRKVSNNGIITTIAGNGNTIY--------NGDGIDAANASLYSPVD 615 Query: 135 KDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITS 194 ++ +YI D G G N S + ++ + + Sbjct: 616 VSIGANNEIYIADAGNYRIRKIFTNGTIVTIAGTGTNGFSGDNGLGSNATIGYPSSVLFN 675 Query: 195 DMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGA 254 ++ IR + + D + S G Sbjct: 676 SGNVYFTDIVYCVIRKIYSNGTITTISGKAGTCTYGGDGGKASNAQLSYPAGIAISSTGD 735 Query: 255 TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQA 314 Y+ DN I V++ + A + Y G + ++ + + S Sbjct: 736 IYISDNYNHRIRVISSVTGIISNIAGTGRSEYNGDGLHESITNFAYPVGLTFDSSENLIV 795 Query: 315 GVSVVSWFMSAWGEQEGYPSHVT 337 + SW + G S + Sbjct: 796 CETTSSWKIRKILATTGMVSTIA 818 >gi|222431108|gb|ABQ07185.2| Candidate beta-1,3-glucanase; Glycoside hydrolase family 16 [Flavobacterium johnsoniae UW101] Length = 1316 Score = 38.4 bits (87), Expect = 3.2, Method: Composition-based stats. Identities = 40/396 (10%), Positives = 88/396 (22%), Gaps = 21/396 (5%) Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 S V + ++F + + I K T S+ + Sbjct: 584 NSSANVAKYVRNVTEQYDVLFFNTQTSI-EDAGLFKNQTNKILIDVYTTAPVGTVVSMNF 642 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 ++ ++P ++ + F G + + L + Sbjct: 643 ENSAASL---PANYPTGRNSNYVAITTKQNQWETLTFYYNSSPDAGTSNLAVNQMVLLFN 699 Query: 184 QADTSTARI------TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 + + K+ G + VA+ Sbjct: 700 SGSYTNDTYYFDNIRIASTKLPDTFTPGVVYEDYQNTHNITFRDAIGTYTANVANPSAGG 759 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSK 297 T+ G S N T + + + T + + + Sbjct: 760 INTSSNVGRYVRKSTELYDNFSFNTTLNNIGDFKAGTKKFAMDVYTSAPVGSIISWQAES 819 Query: 298 DGRSISVAPQSQTLF--QAGVSVVSWFMSAWGEQ-EGYPSHVTFHNNRLLFSGSK--GDE 352 S P + +W + S NR +F Sbjct: 820 SASIPSNYPVGRHSIYQGVVKQTNTWHTITFTYVSTPDASTADNDVNRFVFLFEPGTNSG 879 Query: 353 LSVYLSSFGAFYDFSLDGEYG-----CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407 + Y + A S + G A+T A ++ + G + Sbjct: 880 NTYYFDNLRALNLVSTETPAGLPSPWISTDLGAVTPAGEATHSNGTFTIKGSGTDIWETS 939 Query: 408 DTSLWLLSI-SLSKGLSIDFRRVSGSGVYACPPVSV 442 D ++ + + ++ + YA V Sbjct: 940 DQFQYVNQPITGDAEIIAKVNSLTNTNTYAKAGVMF 975 >gi|73669306|ref|YP_305321.1| hypothetical protein Mbar_A1800 [Methanosarcina barkeri str. Fusaro] gi|72396468|gb|AAZ70741.1| hypothetical protein Mbar_A1800 [Methanosarcina barkeri str. Fusaro] Length = 2272 Score = 38.4 bits (87), Expect = 3.2, Method: Composition-based stats. Identities = 32/351 (9%), Positives = 81/351 (23%), Gaps = 32/351 (9%) Query: 143 LYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPL 202 G S+ +D + +++ + S+ K Sbjct: 1528 TDTSSGSPASWAWDFENDGTVDSTEQNPSYTYNAAGNYTVNLTVINANGTDSEAKTDYIT 1587 Query: 203 DKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNI 262 + A T ++ + D+ + T+ G + V Sbjct: 1588 VSSTPVEPEPIAAFTADVTRGTVPLTVNFTDQSTGTPTSWLWDFGDGTNATEQNVSH--- 1644 Query: 263 TWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWF 322 T+I+ N + + +A G + + + + V + Sbjct: 1645 TYISAGNYTVNLTVANADGNDSE-VKTDYVVVSEPLPGAPVANFTANVTTGTAPLTVEFT 1703 Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFY-DFSLDGEYGCYDPTKAL 381 + G G+ N+ + ++ + S+ G + + ++ G K Sbjct: 1704 DISTGSPTGWQWD---FNDDGIIDSTEQNP-VYTYSTVGNYTVNLTVVNADGNDSEVKTE 1759 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVS 441 V++ + + + +G+ Sbjct: 1760 YIVVSEPLPGAPVANFTA-------------TPTSGNAPLTVNFTDQSTGNISSYAWDFD 1806 Query: 442 VGDCLVFVCGV--------GRRIKYISGSTEQGFRFNEIT--QLADHLFNQ 482 + G ++ S E G T + L Sbjct: 1807 NDGTVDSTEQNPIYTYSVAGTYTVNLTVSNEDGNDSEVKTEYIIVSELLPG 1857 Score = 37.6 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 33/300 (11%), Positives = 77/300 (25%), Gaps = 16/300 (5%) Query: 88 KLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQD 147 L ++ + + + TP + + Sbjct: 1568 NLTVINANGTDSEAKTDYITVSSTPVEPEPIAAFT----ADVTRGTVPLTVNFTDQSTGT 1623 Query: 148 GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS 207 + F + + IS L+++ AD + + + +D + G Sbjct: 1624 PTSWLWDFGDGTNATEQNVSHTYISAGNYTVNLTVANADGNDSEVKTDYVVVSEPLPGAP 1683 Query: 208 IRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITV 267 + + N V + TG D + ++ T+ TV Sbjct: 1684 V------ANFTANVTTGTAPLTVEFTDISTGSPTGWQWDFNDDGIIDSTEQNPVYTYSTV 1737 Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWG 327 N + + +A G + I + + V++ + G Sbjct: 1738 GNYTVNLTVVNADGNDSE-VKTEYIVVSEPLPGAPVANFTATPTSGNAPLTVNFTDQSTG 1796 Query: 328 EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFY-DFSLDGEYGCYDPTKALTTAVT 386 Y +N ++ + + S G + + ++ E G K V+ Sbjct: 1797 NISSYAWD---FDNDGTVDSTEQNPI-YTYSVAGTYTVNLTVSNEDGNDSEVKTEYIIVS 1852 >gi|148263538|ref|YP_001230244.1| YD repeat-containing protein [Geobacter uraniireducens Rf4] gi|146397038|gb|ABQ25671.1| YD repeat protein [Geobacter uraniireducens Rf4] Length = 1600 Score = 38.4 bits (87), Expect = 3.3, Method: Composition-based stats. Identities = 36/379 (9%), Positives = 83/379 (21%), Gaps = 23/379 (6%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFV- 133 G +L D + + GKTY YT + + + Sbjct: 467 NVDGSYVLTDVDGTVNNFNQNGKISATVEPSGKTYGFAYTADSVTVTDPYNKSTIFSILY 526 Query: 134 -HKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARI 192 + +P L + + + + +S + + Sbjct: 527 YNSINPMTGTLGTGWSHNYEIALQDQGNGAILFKDGQLSRLYTRSGDTYVSPPGDYSTLV 586 Query: 193 TSDMKIFKPLDKGRSIRLGCHPPEWAK--NTNYSIGAYIVADDKVYRSLTTGRSGDRFGY 250 + F +K + N + + + F Y Sbjct: 587 KNTDGTFVITEKDGLNHNFDQWGRILSRLDKNGTAMTFAYDGGNLSGVTDGAGRTVTFAY 646 Query: 251 SKGATYVKDNNITWITVLNLSSKTSRESAS----GAVAPYYVWGDIKDVSKDGRSISVAP 306 + + + + + G + Y D V Sbjct: 647 DGTNKLLSVTDPKGNAYTFGYDGGNLITVTNPDSGQWSYTYDPAGFLLTKADPGGNVVTY 706 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366 + S + Y + V GS + + + G + + Sbjct: 707 VYDDTHRVISGTDPEGRSRDLD---YAASV---------PGSDTAKTTTFKEKDGGEWQY 754 Query: 367 SLDGEYGCYDP-TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID 425 + D G T L + S + ++G T + + ++ Sbjct: 755 TYDTSAGTLTSKTDPLGNTTSFTYDSRKKML--TKTEPVIGTTTYSYDANDYMTSLTDPL 812 Query: 426 FRRVSGSGVYACPPVSVGD 444 S + ++V Sbjct: 813 SNTTSYTYNSRGQVLTVSG 831 >gi|320162846|gb|EFW39745.1| receptor-linked protein tyrosine phosphatase [Capsaspora owczarzaki ATCC 30864] Length = 2156 Score = 38.4 bits (87), Expect = 3.3, Method: Composition-based stats. Identities = 29/365 (7%), Positives = 71/365 (19%), Gaps = 8/365 (2%) Query: 83 VFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHL 142 VF + + V + + + +T + D Sbjct: 211 VFTGRTYSVAPVIGTALAAGTITATQVPLSWTVTSSGQESNLALAQVLSRNGVDLTTLPA 270 Query: 143 LYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP- 201 F P+ + A S S ++ + T+ Sbjct: 271 GTASYTGSFPQPFSFTDSGLSPYTPYTYSIRATTVAGNSTSSPVSALSVTTASAPPTVAF 330 Query: 202 -LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDN 260 + + +Y++ A S + + + Sbjct: 331 LTTAPYITQNSVTDLDPCTLYSYTLTATTNDGQTFTTSAKSFTTLADKAVLSPTVTSLNY 390 Query: 261 NITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 + +S S SG + Y + + + + + S Sbjct: 391 TYNAFSFAWTNSALSPCPGSGGTSGYQLSLSVNSGAATLVNPTTTTSYSLSAGVLPSSTY 450 Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTK 379 F + S F+ + ++ F G Sbjct: 451 AFYLRFTNTNNNVSADALLTTFTTFANTPTVTALGVTANSSNSLTFQWTGTANGGGPLFY 510 Query: 380 ALTTAVTD-----FSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV 434 + A ++ + T S + ++ + Sbjct: 511 KVDRTSPSVLSIKNFADSLTSATDNTGLLPFTDYTYSVQARNSQTPTANLSTVATATFKT 570 Query: 435 YACPP 439 A Sbjct: 571 AASQA 575 >gi|32471663|ref|NP_864656.1| fibrinogen-binding protein [Rhodopirellula baltica SH 1] gi|32397034|emb|CAD72337.1| probable fibrinogen-binding protein homolog-possibly involved in cell-cell attachment [Rhodopirellula baltica SH 1] Length = 4630 Score = 38.4 bits (87), Expect = 3.4, Method: Composition-based stats. Identities = 33/395 (8%), Positives = 68/395 (17%), Gaps = 32/395 (8%) Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 VF D A D ++ + SL G+ Sbjct: 1047 VFDLDFDDQDRAYFSTYDSDYRVYRLGQLNYPETIPSNTQIDVVENDAATVSLSGVADGN 1106 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA--- 185 + L ++++ + + + S Sbjct: 1107 ETAASNGSFTVAQTLAAATDTTLTYSVSGTAKSGDDYSTLDGTVTIAAGTTSSTISVPVF 1166 Query: 186 ------DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVAD-DKVYRS 238 T + +T + N ++G Sbjct: 1167 DDLIVEGTESVTVTLTGITNSSPGVSIETGANTASIDIVDNDTATVGFVGSGPFSFESAD 1226 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 T S + A + T ++ + + S Sbjct: 1227 GTFNPSLFQSTIVNDAFWQSHRFEVVGTSTTIADVGGYFRNTDPASATLFAAITALTSDS 1286 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 S + + + V +S + + D LS Sbjct: 1287 DYPDSNDLSTTDVVASTTFSVPGNLSGGDVMTPF--------------SATLDPGWYALS 1332 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF------GEGVLVGCDTSLW 412 + G D + + + + P Sbjct: 1333 FGTNAFG-GPSAGSGTTDGVGMIVLSNDLAPSQFPFSIQPGIRFNNTNAATRFVVTGEES 1391 Query: 413 LLSISLSKGLSIDFRRVSGSGVYAC-PPVSVGDCL 446 + I S SVG Sbjct: 1392 TRASESGPTNGIVHLTQSAEATADTVVTYSVGGTA 1426 >gi|241765878|ref|ZP_04763812.1| Fibronectin type III domain protein [Acidovorax delafieldii 2AN] gi|241364202|gb|EER59392.1| Fibronectin type III domain protein [Acidovorax delafieldii 2AN] Length = 739 Score = 38.4 bits (87), Expect = 3.4, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 42/198 (21%), Gaps = 4/198 (2%) Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNA----KLSI 182 T ++ I F P G + + A + Sbjct: 14 AYTFTVTARNTAGSGAASTASAAVTPKANQTITFANPGAQNFGTSPTLTATASSGLTPTF 73 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 S T ITS + +I + V Sbjct: 74 SSITTGVCTITSGGALTFVTAGSCTINADQAGNGTYLAATTVGRTFTVNAVVPGAPTGAV 133 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 + S T + IT +++ +ASG +P V G Sbjct: 134 GTAGGGQVSVAFTAPVFTGGSAITGYTVTASPGGATASGVASPLIVTGLTNGTPYTFTVT 193 Query: 303 SVAPQSQTLFQAGVSVVS 320 + + V+ Sbjct: 194 ATNLAGTGAASTASATVT 211 Score = 37.6 bits (85), Expect = 5.7, Method: Composition-based stats. Identities = 42/369 (11%), Positives = 80/369 (21%), Gaps = 18/369 (4%) Query: 91 IVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG--STAVFVHKDHPPHHLLYIQDG 148 + S+ T + G T + Sbjct: 149 VFTGGSAITGYTVTASPGGATASGVASPLIVTGLTNGTPYTFTVTATNLAGTGAASTASA 208 Query: 149 DKISFTFDEIKFLPPPWLGDGMISGVKSNAK----LSISQADTSTARITSDMKIFKPLDK 204 I F P G + + A + + + T IT + Sbjct: 209 TVTPKGTQTITFANPGAQNFGTTPTLSATASSGLIPTFTSSTTGVCTITFGGALTFVTTG 268 Query: 205 GRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITW 264 +I + V + S T + Sbjct: 269 TCTINADQAGDGTYGAATTVSRTFTVNPVVPGAPTGVVGTAGAAQASVAFTAPVFTGGSA 328 Query: 265 ITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS 324 IT +++ +ASG +P V G + + V Sbjct: 329 ITGYTVTASPGGATASGVASPLIVTGLTNGTPYTFTVTATNLAGTGAASTASTAV----- 383 Query: 325 AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTA 384 P +TF N G+ + +S G F+ C + T Sbjct: 384 ----TPKAPQTITFGNPGTQILGAPLTLTA--TASSGLTVTFTSSTPGVCTVTPAGVVTY 437 Query: 385 VTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGD 444 ++ + S G + + ++ S L+ S S + + Sbjct: 438 ISAGTCSVNADQAGNGSYLAATTANQSFTVNPPPSGVLTF-ATPTSASVLLGNTLANPAT 496 Query: 445 CLVFVCGVG 453 + G Sbjct: 497 STLMGGSYG 505 >gi|156048656|ref|XP_001590295.1| hypothetical protein SS1G_09060 [Sclerotinia sclerotiorum 1980] gi|154693456|gb|EDN93194.1| hypothetical protein SS1G_09060 [Sclerotinia sclerotiorum 1980 UF-70] Length = 932 Score = 38.4 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 38/289 (13%), Positives = 68/289 (23%), Gaps = 19/289 (6%) Query: 72 FSIPDGGYALLV---FGDKKLQIVVVRSSTKWSPALFGKTYKTPYT-FKDNKSLEYAVFG 127 F+ G LL F G T Y S G Sbjct: 458 FNSDIGNPYLLEYNVFSPS----FFGAIQYINLNTDDGATLVNNYGLAGGFPSYTLIFTG 513 Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPP--------WLGDGMISGVKSNAK 179 H + Y + FT+D + P LG + S Sbjct: 514 DK-YTSPPQHSGGLVDYYSNFGPNYFTYDLKPQISAPGGHILSTYPLGPTSNYAILSGTS 572 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 ++ A + S +++ P W + +I + + Sbjct: 573 MATPYVAGCFALLKSQFPSASISQILNLLQVTATPVNWVWD--STILSATAQQGAGLINA 630 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 D+ T N++ + + S+ + G Sbjct: 631 HDAIFAQSVISPGQIVLGDDSTHTVFGAANITIENTSGSSKTYTLSHVGAGYTDGQLSGQ 690 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348 S +A +F ++ S + P +NR +F G Sbjct: 691 DSNQIALYGTGVFPTPTVTLASGESKTVDFSITPPTGVVASNRPVFGGF 739 >gi|6681362|dbj|BAA88688.1| MEGF7 [Rattus norvegicus] Length = 1298 Score = 38.4 bits (87), Expect = 3.6, Method: Composition-based stats. Identities = 22/266 (8%), Positives = 57/266 (21%), Gaps = 10/266 (3%) Query: 87 KKLQIVVVRSS--TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLY 144 K + P+ T P F+ S A + + + + Sbjct: 91 GKNRCGDNNGGCTHLCLPSGQNYTCACPTGFRKINSHACAQSLDKFLLFARRMDIRRISF 150 Query: 145 IQDGD-----KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 + ++ + + V ++ T + + Sbjct: 151 DTEDLSDDVIPLADVRSAVALDWDSRDDHVYWTDVSTDTISRAKWDGTGQKVV---VDTS 207 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 G +I + W I + R + Sbjct: 208 LESPAGLAIDWVTNKLYWTDAGTDRIEVANTDGSMRTVLIWENLDRPRDIVVEPMGGYMY 267 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + + + ++ W + + + + A + Sbjct: 268 WTDWGASPKIERAGMDASNRQVIISSNLTWPNGLAIDYGSQRLYWADAGMKTIEFAGLDG 327 Query: 320 SWFMSAWGEQEGYPSHVTFHNNRLLF 345 S G Q +P +T + R+ + Sbjct: 328 SKRKVLIGSQLPHPFGLTLYGQRIYW 353 >gi|302681737|ref|XP_003030550.1| hypothetical protein SCHCODRAFT_235989 [Schizophyllum commune H4-8] gi|300104241|gb|EFI95647.1| hypothetical protein SCHCODRAFT_235989 [Schizophyllum commune H4-8] Length = 1175 Score = 38.4 bits (87), Expect = 3.6, Method: Composition-based stats. Identities = 30/284 (10%), Positives = 60/284 (21%), Gaps = 40/284 (14%) Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSG 245 DT + D G S ++ + G A R + Sbjct: 718 DTPGTSGNAAGAGTSGADTGASAMDTDTTTSDGPVSSATTGTAPAASGTTSRRTPEASTR 777 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 + + + ++T + + + S + Sbjct: 778 QLRSPPEVSYTATSSAPPYVTTPAFAVGYGATGSPYGSTSTTGYASTSTTGYASTSSA-- 835 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNR----------------------- 342 G + S A YP+ ++ R Sbjct: 836 ---------GYASTSSAGYASTSTAAYPAQPADYSQRPSTGYSTSSSTEYASRPSTGYTS 886 Query: 343 LLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEG 402 + + Y + + YD T A+ + F+ + I + Sbjct: 887 AGYPTDPTRPSTGYAAPSASTSYAPTSTPENTYDQTYAMAGVASSFTPAGIGQGYSAQYT 946 Query: 403 VLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + + S S S PPV G + Sbjct: 947 SPSSSPETRYAASSSP------VAHTTSPGVHATSPPVPAGPSV 984 >gi|256424202|ref|YP_003124855.1| hypothetical protein Cpin_5223 [Chitinophaga pinensis DSM 2588] gi|256039110|gb|ACU62654.1| hypothetical protein Cpin_5223 [Chitinophaga pinensis DSM 2588] Length = 1228 Score = 38.4 bits (87), Expect = 3.7, Method: Composition-based stats. Identities = 40/259 (15%), Positives = 71/259 (27%), Gaps = 11/259 (4%) Query: 85 GDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD-NKSLEYAVFGSTAVFVHKDHPPHHLL 143 G + V A G TP ++ Y + G D+ + Sbjct: 277 GAGGGRFSAVPGVGLTIDAANGDI--TPAGANPGTYTIRYTITG---TAPCPDYVTTTTV 331 Query: 144 YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLD 203 + + + I + S + + +TST IT Sbjct: 332 TVNSTPAATIAYPAICSSDGVTSVQITGANGGSFSSTTGLSLNTSTGAITPGTSTPGTYT 391 Query: 204 KGRSIRLGCHPPEWAKNTNYSIGAYIVAD----DKVYRSLTTGRSGDRFGYSKGATYVKD 259 +I ++ NT +I VA V ++T G Sbjct: 392 VTYTIPPSPPCAGFSTNTQVTITRAPVATISYQPAVLCNVTGGTPNPPVTPLVTGNTGGT 451 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 IT LN++ T + +GA P G ++ + T+ + + Sbjct: 452 FTITPANGLNINPATGTITPAGA-TPGVYTISYAITGTGGCALFSTSATVTVNSTPTATI 510 Query: 320 SWFMSAWGEQEGYPSHVTF 338 + S + P VTF Sbjct: 511 RYAGSPYCGSTNTPQTVTF 529 >gi|251799499|ref|YP_003014230.1| Fibronectin type III domain protein [Paenibacillus sp. JDR-2] gi|247547125|gb|ACT04144.1| Fibronectin type III domain protein [Paenibacillus sp. JDR-2] Length = 550 Score = 38.4 bits (87), Expect = 3.7, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 42/186 (22%) Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 L + D T + S + S S + T A+ + + Sbjct: 320 LSWTASTDNAGVTGYNVYRNGVLAGTASGTSYSDTGLSASTSYSYTVKAKDAAGNESAAS 379 Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261 + N + + + Y S ++ N Sbjct: 380 STVSATTLAAGSGGTTGGTYNVNGSTGTYIEAENYTSKNGTFVSAACSACSNGLNMETPN 439 Query: 262 ITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 + + N + T + G+ + + + S + L +W Sbjct: 440 GSGDSNANYIAYTINVTNGGSFYVHLLSSGVDSSSDSFTVALDSASGSQLTTTSNGTWAW 499 Query: 322 FMSAWG 327 + Sbjct: 500 KKPSSS 505 >gi|115666275|ref|XP_785445.2| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115975741|ref|XP_001177873.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 3342 Score = 38.4 bits (87), Expect = 3.7, Method: Composition-based stats. Identities = 43/489 (8%), Positives = 118/489 (24%), Gaps = 29/489 (5%) Query: 91 IVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDK 150 I + + ++ +T + A + T + P Sbjct: 1796 IYSITGGDPKNAFSINQSTGAIFTVGALDREDEATYTLTITATDQGTSPRSGTTTIRVTV 1855 Query: 151 ISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRL 210 ++ F + NA + A + D+ + Sbjct: 1856 TDLNDNDPVFGSMSYYKSIP-ESTAINATILTVVATDDDEGLNGDVYYTLDNTTIGLFSI 1914 Query: 211 GCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNL 270 E + ++ T + + +++ + Sbjct: 1915 DPEHGEITTTGKFDYEKETRY---TFQVTATDSGVFGPRSERVQVIIDISDVNDNAPVFK 1971 Query: 271 SSKTS-RESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQ 329 + + + + + D +Q + V+ + Sbjct: 1972 TIPIRANVTQDASSNTFVANVEADDKDSGVNGEVNYRFTQQSSSFAIDTVT---GVITTK 2028 Query: 330 EGYPSHVTFHNNRLLF-SGSKGDELS----VYLSSFGAFYDFSLDGEYGCYDPTKALTTA 384 P + +H + F GS + V++ + G+ Y A Sbjct: 2029 SLNPGTLFYHLEVMAFDLGSPSLSSNGIVEVWVGTSGSGGLQFGQQTYLVQPSEAADNGD 2088 Query: 385 VTDFSASTIHWMHPFGEGVLVGCDTSL---WLLSISLSKGLSIDFRRVSGSGVYACPPVS 441 V ++ + + V + + + + + + P + Sbjct: 2089 VVLSLSAFLPDGSSSNDIVYSLVSGNENGAFGIQVQAGGSAILVVADTTKLDYETQPNIR 2148 Query: 442 VGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWV-- 499 + + + + + N+ A R V+ E P+S ++V Sbjct: 2149 LVAEAMRTPENSSPMYGYATVQVELTDAND---NAPQFVQDRYQSRVW-EVPNSDIYVTQ 2204 Query: 500 --VLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557 + + + + S + FA I +++ A + + + +V Sbjct: 2205 VSATDADEGTNGAIYYEVTSGNTDNAFA-----IDHVTGIVTTAKSLDYEIEDSYVLTVV 2259 Query: 558 ALSAGEERS 566 A G + Sbjct: 2260 ARDGGSPQL 2268 >gi|218440548|ref|YP_002378877.1| hypothetical protein PCC7424_3623 [Cyanothece sp. PCC 7424] gi|218173276|gb|ACK72009.1| hypothetical protein PCC7424_3623 [Cyanothece sp. PCC 7424] Length = 514 Score = 38.4 bits (87), Expect = 3.8, Method: Composition-based stats. Identities = 45/389 (11%), Positives = 107/389 (27%), Gaps = 30/389 (7%) Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 L I D + + D F+ +L +G + +LS + + + Sbjct: 15 TLQISDNNILWTNQDPNTFITALYLYNGSQTIEIDRDELSTTLGLSGNNVVWKTPLGRNT 74 Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261 ++ + G + N +Y ++ D V S + G + + Y+ T NN Sbjct: 75 YNENLYLYNGSEIIQIDSNNHY--DWVRISGDNVVWSASDGTDNEIYLYNGSQTLQLTNN 132 Query: 262 ITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 +S + Y + + +G V + +S Sbjct: 133 DINDINPLISGNNI------VWSSYDANNNYEIFFYNGS--QVIQITNNNIGDFNPEISG 184 Query: 322 FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381 AW S V F+N + D G +S + + Sbjct: 185 NNIAWSGYVNGNSEVFFYNGSETIQLTNNDIDDYSPQISGNNIAWSTPNKEIYLYNGSQI 244 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVS 441 + + + + V G D + + + ++S + + Sbjct: 245 -IQLANNYNNDLSLKFSGDNLVWSGNDGND----NEIYFYNGSEVIQLSNNNIDDRVSQI 299 Query: 442 VGDCLVFVCGVGRRIKYISGSTE----------QGFRFNEITQLADHLFNQRILQLVYQE 491 G+ +++V G + + ++ +L+ + + Sbjct: 300 SGNTVLWVSDDGTDKNVYFYNGSQVIQLTNNNIDNYSDSDYPKLSGNYIV-----WAASD 354 Query: 492 EPHSIVWVVLEPKDNSFPRLLGCRFSAEG 520 + +++ + S + RF Sbjct: 355 GTDNEIYLADTREFASLNQAPVYRFYNSE 383 >gi|319641561|ref|ZP_07996249.1| hypothetical protein HMPREF9011_01847 [Bacteroides sp. 3_1_40A] gi|317386835|gb|EFV67726.1| hypothetical protein HMPREF9011_01847 [Bacteroides sp. 3_1_40A] Length = 561 Score = 38.4 bits (87), Expect = 3.9, Method: Composition-based stats. Identities = 23/226 (10%), Positives = 48/226 (21%), Gaps = 4/226 (1%) Query: 84 FGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLL 143 F + K + ++++T W + KD + T L+ Sbjct: 223 FTNGKTFVYKMKNATDWQAGGEYTYTVSLAAAKDPGYTIESNGSYTVYNADGLMNVAELV 282 Query: 144 YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMK----IF 199 D +I W G+ +T++ + Sbjct: 283 NGGKSDINITLDTDIDLTGKDWTPIGIDYDNSYKGTFDGGGHTIKGLTVTTNDQFVGLFG 342 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 G + + N G + + + V Sbjct: 343 YLNRAGTVKNVVMEGIQITSNHMLMSGNTGGVVGFSWGIIENCSVSGSVSGTNCVGGVVG 402 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 + + SS + + WG + G I Sbjct: 403 SQKAGSIIGCSSSAIVKGTRYVGGVAGEKWGTMTACYATGNVILEI 448 >gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514] gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514] Length = 1064 Score = 38.0 bits (86), Expect = 4.0, Method: Composition-based stats. Identities = 42/408 (10%), Positives = 87/408 (21%), Gaps = 30/408 (7%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT-PYTFKDNKSLEYAVFGSTAVFV 133 G ++ G+ ++ + G T + V Sbjct: 245 SSGNLYVVDTGNGTIRKITSSGVVTTFAGSAGNYGATNGIGANALFYAPQGITIDLFGCV 304 Query: 134 HKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT 193 + +H + D T + + + ++ T Sbjct: 305 YVADTGNHTIRKITSDGTVTTLAGLAGNYGSADSVNSSASFWNPQGITSDATGNLYIADT 364 Query: 194 SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS-------IGAYIVADDKVYRSLTTGRSGD 246 + I G P + + S + A VY + T ++ Sbjct: 365 GNNTIRTITPGGSVTTFAGLPSIGSADGLSSDARFRFPQAVAVDAATNVYVADTANQTIR 424 Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 + S + + +V N+ + G + D Sbjct: 425 KISPSGLVCTLAGSIGHPGSVNNIGTNALFSGPQGITVDGVGNIYVADTLNHIIRRITPD 484 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN--NRLLFSGSKGD------ELSVYLS 358 + T F V + + Y + + + + + + Sbjct: 485 GAATTFAGSAGVSGTANGTNTDAQFYAPQGLAVDGTGNVFVADTFNNLIRKITPGGAVTT 544 Query: 359 SFGAFYDFSLDGEYGCYDP---------TKALTTAVTDFSASTIHWMHPFGEGVLVGCDT 409 G F +F A V D+ TI + P G +V Sbjct: 545 LAGNFENFGSSDGTNSNARFYWPSGVAVDNAGNVFVADYMNHTIRELIPSGTNWIVNTVA 604 Query: 410 SLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK 457 L S+ + V L I+ Sbjct: 605 GLAGFWGSIDGTNTSA-----RFFQPRSLSVDASGALYVADSGNHAIR 647 >gi|94309559|ref|YP_582769.1| Outer membrane autotransporter barrel [Cupriavidus metallidurans CH34] gi|93353411|gb|ABF07500.1| hypothetical protein Rmet_0614 [Cupriavidus metallidurans CH34] Length = 1741 Score = 38.0 bits (86), Expect = 4.2, Method: Composition-based stats. Identities = 26/306 (8%), Positives = 65/306 (21%), Gaps = 8/306 (2%) Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 L + + +G + S + + D I Sbjct: 681 LTAGTLTGSAIGNMTLNQSSNSIAALGPISTGGDFALTTTRSLGQSGALSVGGDTTINAG 740 Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261 + T + + T G T N Sbjct: 741 TNAISLTNASNSFAGAVSLTGGTTIISSASALTFGNVNTDTLLATSLGPMNLGTGTVRGN 800 Query: 262 ITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 ++ T+ +++ S +G+ G I + + + Sbjct: 801 LSASTIDKAITQSGALSVAGSTTISAGTGAITLTDAGNSFQGPIAATGSSVALRAAGDLR 860 Query: 322 FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381 + G S L G+ +S + + G Sbjct: 861 VSALNNSTNGAVS--------LTAGGALTLPVSAINTGTSNLQLAANGGTLLANAALSGS 912 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVS 441 ++ ++ + + W+ ++ + + + A S Sbjct: 913 NVTISARDGIALYGPVTASGQLALSTSAGQWIFALGDVRAATTQLSSGTLRIGNATTTGS 972 Query: 442 VGDCLV 447 +G +V Sbjct: 973 IGGNVV 978 >gi|302760495|ref|XP_002963670.1| hypothetical protein SELMODRAFT_405011 [Selaginella moellendorffii] gi|300168938|gb|EFJ35541.1| hypothetical protein SELMODRAFT_405011 [Selaginella moellendorffii] Length = 403 Score = 38.0 bits (86), Expect = 4.3, Method: Composition-based stats. Identities = 16/104 (15%), Positives = 32/104 (30%), Gaps = 1/104 (0%) Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 T ++ F ++V SI LE + L AW ++ Sbjct: 140 TIISFQAFPDSSFRVVSGVNSRSITVSGLESHSHDKLELHMYYNCCGAAVVAAWENWVVD 199 Query: 533 -DKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLD 575 +L++ + + + S GE+ ++ LD Sbjct: 200 IGAFQILASLPILGLCSDESHFFFITRTSLGEKEYSLKSVSFLD 243 >gi|114571321|ref|YP_758001.1| outer membrane autotransporter [Maricaulis maris MCS10] gi|114341783|gb|ABI67063.1| outer membrane autotransporter barrel domain [Maricaulis maris MCS10] Length = 2886 Score = 38.0 bits (86), Expect = 4.8, Method: Composition-based stats. Identities = 35/257 (13%), Positives = 69/257 (26%), Gaps = 12/257 (4%) Query: 82 LVFGDKKLQIVVVRSSTKWSPALFGKTYKT-----PYTFKDNKSLEYAVFGSTAVFVHKD 136 G+ + S+T ++ + T + + + + + + Sbjct: 1274 FAVGNGAVSNFSATSATVYTATITPAADGTVTVDVAGGAAQDSAGNDSTAATQFSIENDE 1333 Query: 137 HPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDG---MISGVKSNAKLSISQADTSTARIT 193 P +L D +S F G G G + + TA IT Sbjct: 1334 TVPTVVLTTGSVDPVSGAFTITATFSEGVNGFGLGDFSVGNGGASNFAAMSVTVYTATIT 1393 Query: 194 SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKG 253 +D G + A T +SI D+ + + S D + Sbjct: 1394 PASDGSVTVDVGANAAQDGAGNGNAAATQFSIE----NDETLPTVALSTGSADPVSGTFT 1449 Query: 254 ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ 313 T ++T V + S S A + I + ++ VA Sbjct: 1450 ITATFSESVTGFAVGDFSVGNGSASDFAATSATVYTATITPAADGTVTVDVAGAVAQDAA 1509 Query: 314 AGVSVVSWFMSAWGEQE 330 + + S ++ Sbjct: 1510 GNDNSAATQFSIENDET 1526 >gi|254447526|ref|ZP_05060992.1| hyalin repeat protein [gamma proteobacterium HTCC5015] gi|198262869|gb|EDY87148.1| hyalin repeat protein [gamma proteobacterium HTCC5015] Length = 474 Score = 38.0 bits (86), Expect = 5.0, Method: Composition-based stats. Identities = 26/252 (10%), Positives = 65/252 (25%), Gaps = 20/252 (7%) Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261 D+G + + I + ++ Y N+ Sbjct: 114 EDEGSELWVTDGTEAGTFLLKDHITGANSGSPNQFTIYK-----NQLFYRAKNADDFPND 168 Query: 262 ITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 W T + T + SG + + + + + + + Sbjct: 169 TLWKTDGTKAGTTIAVNISGLDLYPDITVFQQQLVFSAKDDTSGSEVWISDGTTIGSQLL 228 Query: 322 FMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381 + G G+P++ T N L F S + + + + Sbjct: 229 KDTNAGSDHGFPANFTEFNGALFFGSSNNNSGRALW-----------KSDGTTAGTSIVV 277 Query: 382 TTAVTDFSASTIHWMHPFGEGVLV----GCDTSLWLLSISLSKGLSIDFRRVSGSGVYAC 437 + A+ + F + + G + ++ ++ +G+G Sbjct: 278 DLGNANTLANNPRDLTVFNQSLYFGAEDGTEGHELWITNGNPVATAVVDDIQTGTGSSEA 337 Query: 438 PPVSVGDCLVFV 449 +SV + +F Sbjct: 338 GSLSVFNGQLFF 349 >gi|113477401|ref|YP_723462.1| hypothetical protein Tery_3964 [Trichodesmium erythraeum IMS101] gi|110168449|gb|ABG52989.1| hypothetical protein Tery_3964 [Trichodesmium erythraeum IMS101] Length = 940 Score = 37.6 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 20/220 (9%), Positives = 51/220 (23%), Gaps = 6/220 (2%) Query: 84 FGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLL 143 FGD + + W+ + G + +PY+ + S S + Sbjct: 351 FGDGYVAKFDSNGNLVWAKQIGGSNWDSPYSITTDSSGN---VYSITTDSSGNVLVGGSF 407 Query: 144 YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS---ISQADTSTARITSDMKIFK 200 + D S + + ++ S + Sbjct: 408 RSNIDIDGDWNNDLTSNGDLDGYVAKFDSNGNLVWAKQLGGSNWDNVNSITTDSSGNVLV 467 Query: 201 PLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDN 260 +I + + ++ G D G Y+ Sbjct: 468 GGYFDGNIDIDDDGNNDFTSNGFTDGYVAKFDSNGNLVWAKQIGGSSDDYANSIATDSSG 527 Query: 261 NITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 N+ + + + + + + + G + +G Sbjct: 528 NVFVGGIFSANIDIDGDRNNDLTSNGFTDGYVAKFDSNGN 567 >gi|298485827|ref|ZP_07003905.1| Flagellar hook-length control protein fliK [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159651|gb|EFI00694.1| Flagellar hook-length control protein fliK [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 981 Score = 37.6 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 39/336 (11%), Positives = 81/336 (24%), Gaps = 3/336 (0%) Query: 94 VRSSTKWSPALFGKTYKTPYTFKDNKSLEYA--VFGSTAVFVHKDHPPHHLLYIQDGDKI 151 ++ S Y+ TA V D Sbjct: 258 DSTNLITLNNTGVADLAGNIGSGVTNSNNYSIDTIQPTATIVVADSALSVGETSLVTITF 317 Query: 152 SFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLG 211 S + + S+ ++ + T T+ I+S + G + G Sbjct: 318 SEAVSGFTNADLNIANGTLSAVSSSDGGITWTATLTPTSGISSASNSVTLNNGGVTDLAG 377 Query: 212 CHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 + NY+I V + V + + + V N + Sbjct: 378 NVGSGLTLSNNYTIDQTRPTASIVIADNALSAGETSLVTITFSEAVSGFDNSDLNVPNGT 437 Query: 272 SKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG 331 T + G + V+ IS+ T +++ Sbjct: 438 LSTVSSNDGGITWTATFTPNAN-VNASTGQISLNSAGVTDLAGNAGSGIISSASFTVDTT 496 Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 PS + L +G + + F + L G + +T + Sbjct: 497 RPSATILVADNALSAGETSLVTFTFSQAVSGFSNADLSVANGTLSAVSSSDGGITWTATF 556 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFR 427 T + ++ +T + S + G + Sbjct: 557 TPNANVTDASNLITLDNTGVTNASGNTGSGTTASNN 592 >gi|327193134|gb|EGE60044.1| 2',3'-cyclic-nucleotide 2'-phosphodiesterase protein [Rhizobium etli CNPAF512] Length = 662 Score = 37.6 bits (85), Expect = 5.4, Method: Composition-based stats. Identities = 25/222 (11%), Positives = 57/222 (25%), Gaps = 15/222 (6%) Query: 51 MPLMQEYRDCRLDP------RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPAL 104 Y D ++ + +++ + S+ ++ Sbjct: 442 RGGADYYTDVPAGDIAIKNVADLYLYP---NTVQA--VAITGAQVKNWLEMSAGMFNHID 496 Query: 105 FGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPP 164 G P D S + V + PP + + + + + F P Sbjct: 497 VGAKDA-PLLNADFPSYNFDVIDGVTYQIDLSQPPKYDSSGKAINPDTNRIQNLAFDGKP 555 Query: 165 WLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS 224 V +N + S I +D IF+ D R + + + N + Sbjct: 556 IDPAQKFVVVTNNYRAG---GGGSFPEIAADKVIFQAPDTNRDVIVRYVHEQGTINPSAD 612 Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + +G +F + + ++D Sbjct: 613 ANWTFRPLPGTTVTFESGPKAKQFLAAVKSVKIEDAGDGADG 654 >gi|330890515|gb|EGH23176.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv. mori str. 301020] Length = 1237 Score = 37.6 bits (85), Expect = 5.6, Method: Composition-based stats. Identities = 47/442 (10%), Positives = 102/442 (23%), Gaps = 27/442 (6%) Query: 57 YRDCRLDPRSNRVFSFSIPD-----GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT 111 D L + +F+ + + V + W+ L T Sbjct: 693 VADTALAAGETSLVTFTFSEVVTGFDNTDISVANGTLTAVSSSDGGKTWTATLTPTANLT 752 Query: 112 PYTFKDNKSLEYAV----FGSTAVFVHKDHPPHHLLYIQDGDKI--------------SF 153 T + + + + ++ +F Sbjct: 753 STTNQISLNRAGVQDLSGNAGSGTATSNNYAIDTSRPTATIVLADNSLSIGETSQVTITF 812 Query: 154 TFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD-TSTARITSDMKIFKPLDKGRSIRLGC 212 + F + + + A T T IT + + G + G Sbjct: 813 SEAVSGFTNADLTVVNGTLSTVTTSNNIVWTATFTPTNNITDSTNVITLDNTGVTDAAGN 872 Query: 213 HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSS 272 + NY+I + + + V +TV N + Sbjct: 873 TGSGTTTSNNYAIDTQRPTASILVADASLTAGETSLVTITFSEAVSGFTNADLTVPNGTL 932 Query: 273 KTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGY 332 T S+ G + + +V+ IS+ T + + Sbjct: 933 STV-TSSDGGITWTATYTPNNNVNDTTNLISLNNAGVTDLAGNAGSGTSNSGNFTIDTVR 991 Query: 333 PSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAST 392 PS + L +G + + F + L G + +T + T Sbjct: 992 PSATVVVADSTLSAGETSLVTITFSEAVTGFNNADLTIANGTLSAVSSSDGGITWTATLT 1051 Query: 393 IHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGV 452 ++ + + S + G + V + L Sbjct: 1052 PTANVTDTTNLITLNASGVANASGNAGTGTISSNNYAIDTQRPTASIVVADNALGIGETS 1111 Query: 453 GRRIKYISGSTEQGFRFNEITQ 474 I + GF +++ Sbjct: 1112 LVTITFSE--AVSGFTNADLSI 1131 >gi|219853189|ref|YP_002467621.1| PKD domain containing protein [Methanosphaerula palustris E1-9c] gi|219547448|gb|ACL17898.1| PKD domain containing protein [Methanosphaerula palustris E1-9c] Length = 930 Score = 37.6 bits (85), Expect = 5.6, Method: Composition-based stats. Identities = 26/260 (10%), Positives = 64/260 (24%), Gaps = 8/260 (3%) Query: 171 ISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV 230 +++ A T + ++ K G I N+ G Sbjct: 371 DGQFIYPYSIAVDSAGNVYVVDTGNNRVQKFTSTGTFITQWGGEGFGDGQFNFPGGITAD 430 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 + VY T +F + + + + N + + A Sbjct: 431 SAGNVYVVDTENDRVQKFTSTGEFITKWGGDGSGVGEFNYPYGIAVDRAGNVYVVDTGNN 490 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350 ++ + G I+ S + ++ + G V NNR S G Sbjct: 491 RVQIFTSTGTFIAQWGGS----GSRDGQFNYPGGIAVDSAGNVYVVDESNNRFQKFTSTG 546 Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS 410 + ++ + S +F+ + ++ W+ + Sbjct: 547 EFITKWGSEGLGDGEFTYPRDVAVDSGGNVYIVDESNSRIQKFSWVAQIMPLIPSFTA-- 604 Query: 411 LWLLSISLSKGLSIDFRRVS 430 + + + + Sbjct: 605 --VPTAGSAPLTVQFIDTTT 622 >gi|269955235|ref|YP_003325024.1| Fibronectin type III domain-containing protein [Xylanimonas cellulosilytica DSM 15894] gi|269303916|gb|ACZ29466.1| Fibronectin type III domain protein [Xylanimonas cellulosilytica DSM 15894] Length = 2039 Score = 37.6 bits (85), Expect = 5.7, Method: Composition-based stats. Identities = 19/220 (8%), Positives = 46/220 (20%), Gaps = 18/220 (8%) Query: 83 VFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF-KDNKSLEYAVFGSTAVFVHKDHPPHH 141 + + + S +T S +AV Sbjct: 1697 AISNYYVDVYR-DGSLVQENVDLKTATSHDFTGLTTTASYTFAVSA-------------- 1741 Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK-SNAKLSISQADTSTARITSDMKIFK 200 S + P + + ++ S AD + + + Sbjct: 1742 -KNKAGEGATSSRSNAAIPYGAPKAPTNVKATDNKGVPTVTWSAADGNGSPVIDYTVTAS 1800 Query: 201 PLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDN 260 + + T Y+ + +G + Sbjct: 1801 GGKTMTTTGTSVNFTGLTAGTTYTFTVTARNLGGTSSASAASGGVKAYGLPSAPSVTWTK 1860 Query: 261 NITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 +++ +S +G V+ + K + GR Sbjct: 1861 TTATDGYFTVNAPSSWNGDTGTVSWSLSGSETKSGTGTGR 1900 >gi|258515209|ref|YP_003191431.1| hypothetical protein Dtox_1972 [Desulfotomaculum acetoxidans DSM 771] gi|257778914|gb|ACV62808.1| hypothetical protein Dtox_1972 [Desulfotomaculum acetoxidans DSM 771] Length = 1502 Score = 37.6 bits (85), Expect = 5.7, Method: Composition-based stats. Identities = 22/260 (8%), Positives = 47/260 (18%), Gaps = 6/260 (2%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVH 134 + G L G + + T T + + T V Sbjct: 1018 NEDGSYYLPVGQGYIYYYGRNGYLTATGTFDVTESTTGITLPALTEHQQSDGKVTVSAVS 1077 Query: 135 KDHPPHHLLYIQDGDKISFTFDEI----KFLPPPWLGDGMISGVKSNAKLSISQADTSTA 190 + + + + +I Sbjct: 1078 LNSVLRDKQEVSYKAGEATDLASAGYVEYNNGGYTVLHALIDAFNQGNTKIPFTCARGNL 1137 Query: 191 RITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGY 250 + G + K + + + ++ Sbjct: 1138 TPDIAINGNTAEGAGWVCEVAGKELSGDKLASTLVKNGDRIVYYYNANFAGMQNAWFEET 1197 Query: 251 SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQT 310 + T +D +T + + SGA V +S P S Sbjct: 1198 NVTVTQGEDAELTLVGADVKNDGGGVAGISGA--KILVNSQNTGLSTGAGGSVTLPGSLI 1255 Query: 311 LFQAGVSVVSWFMSAWGEQE 330 V + + G Sbjct: 1256 DTPGQYIVTAVKENEDGNNT 1275 >gi|323302870|gb|EGA56674.1| Nup1p [Saccharomyces cerevisiae FostersB] Length = 1045 Score = 37.6 bits (85), Expect = 5.8, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 58/227 (25%), Gaps = 7/227 (3%) Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 SF P P LG + + +K + S T + Sbjct: 765 SNSPTSFFDGSASSTPIPVLGKPTDATGBTTSKSAFSFGTAXTNGTNASANSTSFSFNAP 824 Query: 207 SIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + G TN + + D+ S T +G FG+S T + Sbjct: 825 ATGNGTTTXSNTSGTNIAGTFNVGKPDQSIASGNTNGAGSAFGFSSSGTAATGAASNQSS 884 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 ++ + + + +K + + + F + + + Sbjct: 885 FNFGNNGAGGLNPFTSATSST-NANAGLFNKPPSTNAQNXNVPSAFNFTGNNSTPGGGSV 943 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373 G + T +F+GS SF F+ Sbjct: 944 FNMNGNTNANT------VFAGSNNQPHQSQTXSFNTNSSFTPSTVPN 984 >gi|222054134|ref|YP_002536496.1| YD repeat protein [Geobacter sp. FRC-32] gi|221563423|gb|ACM19395.1| YD repeat protein [Geobacter sp. FRC-32] Length = 1348 Score = 37.6 bits (85), Expect = 5.8, Method: Composition-based stats. Identities = 19/297 (6%), Positives = 58/297 (19%), Gaps = 9/297 (3%) Query: 92 VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKI 151 + + T T Y++ +L+ + + + L Sbjct: 742 YKYDDLGRVYQTISPDTNTTTYSYDPAGNLKTKTDAKGIIIAYTYDDANRLTRTSFPTDP 801 Query: 152 SFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLG 211 + T+ + + + + + T D + G Sbjct: 802 AITYSYDTCINGKGRVCTITDQSGTTTYEYTKKGQIAKETRTIDGIAYITQY--TYDMNG 859 Query: 212 CHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 + +Y S G + + +T+ L + Sbjct: 860 NTKTIIYPSGRVITYSYSNDKPTTVSSTYAGITTTIANNISYKPFGGMTALTYGNGLART 919 Query: 272 SKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG 331 + + + +G ++ + Sbjct: 920 ITYDNQYRISTMITGTLQNLTYGYDANGNITAITNTLDNT-KNKSYTYDSLDRLGSGTGP 978 Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDF 388 + + ++ G + + S ++ S T Sbjct: 979 WGTITWTYD------GVGNRQTQIDSSGTSSYSYQSGSNRLTGITGANPATFGYDTN 1029 >gi|254172674|ref|ZP_04879349.1| conserved hypothetical protein [Thermococcus sp. AM4] gi|214033603|gb|EEB74430.1| conserved hypothetical protein [Thermococcus sp. AM4] Length = 4292 Score = 37.6 bits (85), Expect = 6.3, Method: Composition-based stats. Identities = 28/300 (9%), Positives = 64/300 (21%), Gaps = 27/300 (9%) Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 H ++ L ++ +G + N + I Sbjct: 1005 NSDVGEGEHGLRAVVVNSNGSASQFWAWYVYPRPNLTITFVRPTPENGARLNVRKIIINV 1064 Query: 186 DTST--ARITSDMKIFKPLDKGRSIRLGC---HPPEWAKNTNYSIGAYIVADDKVYRSLT 240 +S +R+T + +G + + A + R++ Sbjct: 1065 TSSLDLSRVTLEWNGVNKSMEGSGRNWWALMENLTDGTYTFRVYGSAGGINGSTEERAVE 1124 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSS----KTSRESASGAVAPYYVWGDIKDVS 296 + F A + + + V + W + Sbjct: 1125 IDATAPEFLEYGQAEDEVIVGDKAEVFAKWTDAHLEGAVLVTNATLVDGEFTWTESPLQI 1184 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 DG S + ++G + P RL Sbjct: 1185 ADGWSNGTITTDENFAGKVFCWYIRARDSFGNENRTPQMCFRVEERLRILS--------- 1235 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 FS + + + ++T + + W + S + S Sbjct: 1236 ---------FSPEEREVELRENETASFSITLNRIANVSWAVNGTVVLNEETSESTYENSS 1286 >gi|254412475|ref|ZP_05026249.1| filamentous haemagglutinin family N-terminal domain protein [Microcoleus chthonoplastes PCC 7420] gi|196180785|gb|EDX75775.1| filamentous haemagglutinin family N-terminal domain protein [Microcoleus chthonoplastes PCC 7420] Length = 1737 Score = 37.6 bits (85), Expect = 6.3, Method: Composition-based stats. Identities = 47/441 (10%), Positives = 94/441 (21%), Gaps = 37/441 (8%) Query: 27 LSLHAQGVAKSRNLIPLRYGP-----LVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYAL 81 L G N + G L++ + + L+ Sbjct: 310 LGRVNGGYPSIINGLIQVTGGNSNLFLMNPSGIVFGANASLN------VPADFTATTATG 363 Query: 82 LVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHH 141 + F V + + T + AV Sbjct: 364 IGFDGGWFNAVGSTNYINLVGNPNAFEFATSQPGSIVNAGNLAVSEGQT----------- 412 Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 L + + T + G + +S ++ + + Sbjct: 413 LSLVGGNVINTGTMEATAGTITIAAVPGTSRLRLTQTGQVLSLEVSANNLNSITPLLLPE 472 Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261 L G + G E + S T S D G + Sbjct: 473 LLTGSNEETGLTVNEDNTAQTAAGTVIPQQPGTAIVSGTVDTSADSVGGNIDIFGTVIGL 532 Query: 262 ITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGV-SVVS 320 I + E G D + G +S+ ++ G V S Sbjct: 533 IDQAQINVSGDTGGGEIRVGGEYKGQGTVPTADTTVVGNQVSINADARVNGNGGRVIVWS 592 Query: 321 WFMSAWGEQ--------EGYPSHV-TFHNNRLLFSGSK---GDELSVYLSSFGAFYDFSL 368 + + G V T N L G + S ++ ++ Sbjct: 593 DNFTRFSGTITARGGTENGNGGFVETSGKNVLESIGGTVNTSAANGLPGSWLLDPWNVTI 652 Query: 369 --DGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDF 426 D G + +A + + + I G V + Sbjct: 653 TDDAPTGTFTGGIFTPSAESQINVNDIVNALNGGTDVTITTAGEEGNEGNQEGTITVNAA 712 Query: 427 RRVSGSGVYACPPVSVGDCLV 447 +S + + + ++ Sbjct: 713 LDISLNAGNTTLSLEADNDII 733 >gi|301100912|ref|XP_002899545.1| alpha-glucosidase, putative [Phytophthora infestans T30-4] gi|262103853|gb|EEY61905.1| alpha-glucosidase, putative [Phytophthora infestans T30-4] Length = 808 Score = 37.6 bits (85), Expect = 6.5, Method: Composition-based stats. Identities = 26/269 (9%), Positives = 61/269 (22%), Gaps = 15/269 (5%) Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTY----KTPYTFKDNKSLEYA 124 F F + VF D + ++ + + G + + D + + Sbjct: 83 WFQFDVSSATSLEFVFNDGVGVVWDNNNNANYKVSAAGTYSVVSKVSGFKTGDLPYIHF- 141 Query: 125 VFGSTAVFVHKDHP----PHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + + + + + + + I NA Sbjct: 142 -NAGSGWTTVPGYAMSSSTYAGKFSAANGWYQYDTSSTSSVEITFDDGNGIWDSNLNANY 200 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + T + + + T+ S A ++ + + Sbjct: 201 IRTSPGTYAFVNQNTATPTSSPSVKGYVNGPGY-----AVTSASEDAGVLTINLAVNAAP 255 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 T + T K + + + S E + D S Sbjct: 256 TSTPYGTDLSALVVTVTKTESDSVRVKIVDKSNKRWEVPKSLFTAGTLGTDSTAKSAATD 315 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQ 329 + +Q LF V S + + Sbjct: 316 PLYSFNYTQNLFTFKVVRKSDGYTLFDSS 344 >gi|327539886|gb|EGF26488.1| polymorphic outer membrane protein [Rhodopirellula baltica WH47] Length = 3495 Score = 37.6 bits (85), Expect = 6.6, Method: Composition-based stats. Identities = 20/260 (7%), Positives = 46/260 (17%), Gaps = 10/260 (3%) Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 VF D A D ++ + SL G+ Sbjct: 1013 VFDLDFDDQDRAYFSTYDSDYRVYRLGQLNYPETIPSNTQIDVVENDAATVSLSGVADGN 1072 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA--- 185 + L ++++ + + + S Sbjct: 1073 ETAASNGSFTVAQTLAAATDTTLTYSVSGTAKSGDDYSTLDGTVTIAAGTTSSTISVPVF 1132 Query: 186 ------DTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVAD-DKVYRS 238 T + +T + N ++G Sbjct: 1133 DDLIVEGTESVTVTLTGITNSSPGVSIETGANTASIDIVDNDTATVGIVGSGPFSFESAD 1192 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 T S + A + T ++ + + S Sbjct: 1193 GTFNPSLFQSTIVNDAFWQSHRFEVVGTSTTIADVGGYFRNTDPASATLFAAITALTSDS 1252 Query: 299 GRSISVAPQSQTLFQAGVSV 318 S + + + Sbjct: 1253 DYPDSNDLSTTDVVASTTFS 1272 >gi|260797338|ref|XP_002593660.1| hypothetical protein BRAFLDRAFT_131952 [Branchiostoma floridae] gi|229278887|gb|EEN49671.1| hypothetical protein BRAFLDRAFT_131952 [Branchiostoma floridae] Length = 3505 Score = 37.6 bits (85), Expect = 6.6, Method: Composition-based stats. Identities = 30/323 (9%), Positives = 68/323 (21%), Gaps = 23/323 (7%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 + FSAGE SP + +R G + N+ + D + Sbjct: 2797 IQARFSAGEGSPVTIVTRNSAGRFDDG--EDHNIRVT-------RAGDRFEISIDDDAKR 2847 Query: 67 NRVFS----FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + I + + + T + + + Sbjct: 2848 SGKLPDVDNKVISVNKLYIGGIPGNMERNFRNMAGTLSPFKGCIRDLVLNGNLINMGDMV 2907 Query: 123 YAVFGSTAVFVHKD-------HPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 V + T + P P + I + Sbjct: 2908 EFNKADIGRCVTPSELLQITTTVTMITTDVSGITPPVQTMSSVSMPPGPDMSTRQIGEMT 2967 Query: 176 SNAKLSISQADTSTARIT-SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 + S + T + + ++ + SI A K Sbjct: 2968 AGESESPRPITQPSTMQTKPMTTVQHSTESLTTMERTTSKMSTIEMDTTSIPAQKEPTTK 3027 Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASG--AVAPYYVWGDI 292 + + R + T T + ++++ + A + V Sbjct: 3028 GFSTTQRPPLTIRPTSPRPVTTQAMTTEQVPTTGQTTDRSTQGPTTTEMAESTTKVPSVP 3087 Query: 293 KDVSKDGRSISVAPQSQTLFQAG 315 + V ++ Sbjct: 3088 GTTTVTPAPPVVLTTAEVPTPTT 3110 >gi|290995070|ref|XP_002680154.1| predicted protein [Naegleria gruberi] gi|284093774|gb|EFC47410.1| predicted protein [Naegleria gruberi] Length = 636 Score = 37.2 bits (84), Expect = 6.8, Method: Composition-based stats. Identities = 32/374 (8%), Positives = 90/374 (24%), Gaps = 14/374 (3%) Query: 75 PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY-----AVFGST 129 + + +++ + Y + + L Y Sbjct: 56 SSDETYIADTNNHRIRKITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDE 115 Query: 130 AVFVHK-DHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 V + ++ + I+ + + + ++IS +D + Sbjct: 116 IYIVDRSNNRIRKITTSGIISTIAGN-----GTAGYSGDVATSAKLYYPSGIAISSSDET 170 Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRF 248 T++ +I K G + + S + + ++ Sbjct: 171 YIADTNNHRIRKITTSGIISTIAGNGTAGYSGDGSSAKSAQLYYPSGVAISSSDEIYIVD 230 Query: 249 GYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQS 308 + + + I N ++ S + +S A I S D I+ + Sbjct: 231 RSNNRIRKITTSGIISTIAGNGTAGYSGDGSSATSAQLNSPSGIAISSSDEIYIADMFNN 290 Query: 309 QT-LFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS--FGAFYD 365 + + + + G S T + + +Y++ Sbjct: 291 RIRKITTSGIISTIAGTGTSGYSGDGSSATSIQLYFPYGVAVSLSDEIYIADMFNNRIRK 350 Query: 366 FSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID 425 + G + T + I + + + + + I+ S +S Sbjct: 351 ITTSGIISTIAGGIGDGLSATTAYINAITFEFSSSGEIYIADTNNHRIRKITTSGIISTI 410 Query: 426 FRRVSGSGVYACPP 439 + Sbjct: 411 AGTGTSGYSGDGSS 424 >gi|167918846|ref|ZP_02505937.1| cable pili-associated 22 kDa adhesin protein [Burkholderia pseudomallei BCC215] Length = 2030 Score = 37.2 bits (84), Expect = 6.8, Method: Composition-based stats. Identities = 14/239 (5%), Positives = 44/239 (18%), Gaps = 6/239 (2%) Query: 86 DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYI 145 +++ V T S +T L + G + V P + Sbjct: 94 GAYVRLYDVTGGTTVSVGEAVADSSGNWTTTLTSPLSGSASGVSHSLVAVGVDPAGNTSM 153 Query: 146 QD------GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 D + + + + ++ +T + + Sbjct: 154 TSGPDVVVIDTSTPQPSAPALSTADEFNGNPSVTTNARPTFTGTSEAGASVTLTENGAVL 213 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 + + + S + + + A + Sbjct: 214 GVGTADSTGHWSIQTNSLVAGGHTITATAVDIAGNSNVSPSAAIAVAANVPTPPAPLLIT 273 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSV 318 + T + +R + ++ + Sbjct: 274 PDDTSPIDNTNNDDLTRVTTPHFTGSTTAGYNVTLFVDGVSVGQGVAGGNGSWTIQDGT 332 >gi|88602453|ref|YP_502631.1| hypothetical protein Mhun_1164 [Methanospirillum hungatei JF-1] gi|88187915|gb|ABD40912.1| PKD [Methanospirillum hungatei JF-1] Length = 1011 Score = 37.2 bits (84), Expect = 7.0, Method: Composition-based stats. Identities = 24/272 (8%), Positives = 67/272 (24%), Gaps = 6/272 (2%) Query: 56 EYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 + S+++ + G + F + + W G + + Sbjct: 412 HVTGLYANFTSDKLVGYQNTTGESIPVNFTSNSTDVFG-ATYYHWDFGNGGSSIQPNAKT 470 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 N Y V + ++ + ++ I + +F + K+ P Sbjct: 471 TYNSPGNYTVNFTVGNSCNQYNSTQKIITIIERPIANFDYSP-KYGTFPLQVQFTDLTTD 529 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 S + + D ++ +T + + G + + + + Sbjct: 530 SPDQYEWNFGD-GSSTVTDKNPVHTFNNPGTYLITQVVRNTTVYPIWTNTLTKNIILSEG 588 Query: 236 YR---SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI 292 + S R F IT + + + Y Sbjct: 589 FNVSFSTNKSRGVSPFTVQFTDLSQPSAWITNWSWNFGDNTPVSTQKNPIHTFYGANNYT 648 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMS 324 +++ + ++ + + FM Sbjct: 649 INLTVWNTTTGARGSAENTIEVVEPIYPDFMP 680 >gi|198424099|ref|XP_002122888.1| PREDICTED: similar to protein tyrosine phosphatase, receptor type, B [Ciona intestinalis] Length = 2362 Score = 37.2 bits (84), Expect = 7.1, Method: Composition-based stats. Identities = 20/250 (8%), Positives = 49/250 (19%), Gaps = 9/250 (3%) Query: 90 QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGD 149 +++ + + D ++A +T +F Sbjct: 74 KVINTTAGAAIVGNTEYTITVYAVSSTDATDFKFATNQTTTIFSAPVLTSVTGTNSTIAV 133 Query: 150 KISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIR 209 +S+T+D + A + + T + Sbjct: 134 DLSWTYDN---GGGANAVSEYLIKWDGGGSTGSPTAGSGSTTATISSLSANTEY---TFS 187 Query: 210 LGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLN 269 + +T+ A V S + N + Sbjct: 188 ITAVSATVRGDTSAPSSATTVFGAPTSFSTAGATTTSIDLTWTAPAVGGGKNNVLAYTIQ 247 Query: 270 LSSKTSRESASGAVAPYYVW---GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 + + + +S + ++ T QA Sbjct: 248 WTGGAGGSKETSGTTDTISSLSANTAYSFTVAAKSKAGTGEASTPLQAITLPSLPEQPTL 307 Query: 327 GEQEGYPSHV 336 P+ V Sbjct: 308 TRSTTNPTTV 317 >gi|194228056|ref|XP_001914937.1| PREDICTED: similar to Gene model 784, (NCBI) [Equus caballus] Length = 1407 Score = 37.2 bits (84), Expect = 7.3, Method: Composition-based stats. Identities = 29/252 (11%), Positives = 58/252 (23%), Gaps = 16/252 (6%) Query: 86 DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYI 145 + + K + + YT +T D+ PH Y Sbjct: 208 NSYMVNNTSLLVNKTNDFSSIPGIPSTYTVDYAPGTY--TVDNTLSTFTADNAPH--TYT 263 Query: 146 QDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205 D ++T D+ + + S + T D + Sbjct: 264 GDSTSSTYTVDD------------TSGAYTVDNAPRTNIVGNSLSTYTVDNVPCSYTVEN 311 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 + + + V + + + V D T+I Sbjct: 312 TLSTCTVNNTLSTHTVDSAPSTCTVDSAPATNTANNTLNIYTAHNTPNTYTVDDPPNTYI 371 Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325 S+ ++ + S + + D S + APQ+ T Sbjct: 372 ADNTPSNSSTDHAPSTSTTDTSLPPSTIDSVPSPSSTNYAPQTSTSDGTLTPSSIDGTPG 431 Query: 326 WGEQEGYPSHVT 337 + P T Sbjct: 432 SSISDSAPDTPT 443 >gi|159040394|ref|YP_001539647.1| hypothetical protein Sare_4907 [Salinispora arenicola CNS-205] gi|157919229|gb|ABW00657.1| hypothetical protein Sare_4907 [Salinispora arenicola CNS-205] Length = 825 Score = 37.2 bits (84), Expect = 7.8, Method: Composition-based stats. Identities = 30/306 (9%), Positives = 65/306 (21%), Gaps = 18/306 (5%) Query: 92 VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKI 151 + + + + + T + + D + Sbjct: 286 FRDAGGVRLAAVTDDVDGVSGW---QQLGVTGTAPAKTTTLTVRLYSRQSSTGTTMWDDV 342 Query: 152 SFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLG 211 S + P D ++ V S S T D + + G Sbjct: 343 SLQSSTDRAYDPTLAPDAVVLAVGDQRIESYSGVSRVMHPGTKDGDPAQAGVGAGVVLTG 402 Query: 212 CHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLS 271 W N S A T+ +G S + T N + Sbjct: 403 TAAGTWDANPRISGSVLREAPGYRMWYTTSSGTG--LATSVDGRVWSRDGRTTTVTANGN 460 Query: 272 SKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWG---- 327 R P A QS S ++ W Sbjct: 461 GGVVRNP---TWTPGGPQPQYFTSRSTSDFRYHALQSADGVSWTAPTDSIPINGWDVVNV 517 Query: 328 ----EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383 + + + + ++ + + + +V++S+ + ++ D Sbjct: 518 TWDPATQRFVAMLKYYP--VSYPSTPTGPRTVWVSTSADYKTWTAPQPAFAADHFDNELI 575 Query: 384 AVTDFS 389 Sbjct: 576 TDAGTQ 581 >gi|146340765|ref|YP_001205813.1| hypothetical protein BRADO3823 [Bradyrhizobium sp. ORS278] gi|146193571|emb|CAL77588.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 1094 Score = 37.2 bits (84), Expect = 7.8, Method: Composition-based stats. Identities = 27/355 (7%), Positives = 72/355 (20%), Gaps = 31/355 (8%) Query: 87 KKLQIVVVRSSTKWSPALFGKTYKTPY--------------TFKDNKSLEYAVFGSTAVF 132 ++ +++ + T +P T + + ++ Sbjct: 296 SAVRFGSTSAASYTVNSATQITATSPAGSGTVDVTVTTAGGTSATSAADQFTYIPLVTAI 355 Query: 133 VHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARI 192 P + +KF + S + A A T + Sbjct: 356 SPASGPTTGSTAVTITGNGFTGASAVKFGAANATSFTVNSATQITATSPSGAAGTIDVTV 415 Query: 193 TSD--------MKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 T+ F + + +T+ I T + Sbjct: 416 TTSGQTSPTSAADQFTYAAAPTVTSISPSSGPASGSTSVIITGTGFTAATAVSFGATAAT 475 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 + T + V + +++ Y I +S + Sbjct: 476 SYTVNSATQITAFAPAGTGTVDVRVTGVGGTSATSAADQFSYLGAPAITAISPATGPSAG 535 Query: 305 APQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS---------GSKGDELSV 355 +G V ++ + + + Sbjct: 536 GTSVTISGSGFAGTTGLGAVKFGAVNATSYTVNSASSITAIAPAGTGAVDVTVTNNAQTS 595 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS 410 +++ G F + + +T+ + G + D Sbjct: 596 AVTAAGRFSYVTTATQTSLASSRNPSEFRQPVTFTATVTAVSGTATGTVTFADGG 650 >gi|148262832|ref|YP_001229538.1| polymorphic outer membrane protein [Geobacter uraniireducens Rf4] gi|146396332|gb|ABQ24965.1| polymorphic outer membrane protein [Geobacter uraniireducens Rf4] Length = 2042 Score = 37.2 bits (84), Expect = 8.0, Method: Composition-based stats. Identities = 33/327 (10%), Positives = 68/327 (20%), Gaps = 23/327 (7%) Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL----S 181 T + I F P G + + A + Sbjct: 711 TAYTFTVTATNSAGTGSASAASNSVTPAAAQTITFNNPGAQNFGTSPTLTATATSSLTVT 770 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTT 241 + + T +T+ + +I ++ V Sbjct: 771 FTSSTTGVCTVTAGGALTFVTTGTCTINADQAGNGSFLAATTVSRSFTVIAVVPGAPTIG 830 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 + S T N IT ++S ++SGA +P V G + Sbjct: 831 IATAGDTQASVAFTAPVSNGGASITGYTVTSNPGGLTSSGASSPITVTGLTNGTAYTFTV 890 Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN-------------------R 342 + + + V+ P++ + R Sbjct: 891 TAHNSAGTGSASSASNSVTPNPGPTVVNVAVPANGIYKAGSNLDFTVTWDSAATVTGTPR 950 Query: 343 LLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEG 402 + + Y S G T +T + TI Sbjct: 951 IALLIGSAMVYATYQSGSGTASTLFRYTVLPGQTDTDGITVGALSLNGGTIQNSSGTDAT 1010 Query: 403 VLVGCDTSLWLLSISLSKGLSIDFRRV 429 + + S + + + Sbjct: 1011 LTLNSVASTVNVLVDTTAPTLSSIATS 1037 >gi|227827468|ref|YP_002829248.1| Fibronectin type III domain protein [Sulfolobus islandicus M.14.25] gi|229584683|ref|YP_002843185.1| Fibronectin type III domain protein [Sulfolobus islandicus M.16.27] gi|227459264|gb|ACP37950.1| Fibronectin type III domain protein [Sulfolobus islandicus M.14.25] gi|228019733|gb|ACP55140.1| Fibronectin type III domain protein [Sulfolobus islandicus M.16.27] Length = 725 Score = 37.2 bits (84), Expect = 8.4, Method: Composition-based stats. Identities = 36/386 (9%), Positives = 94/386 (24%), Gaps = 31/386 (8%) Query: 90 QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGD 149 ++ + ++ + +PY F N ++ +T PP + + + + Sbjct: 116 YVLKLNGNSWVVVSEMPLPAYSPYIFVYNNAIYVIGGENTTSPAGLYFPPSNAIRLFYPN 175 Query: 150 KISFTFDEI--KFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS 207 S+ S + + + S + + L+ Sbjct: 176 NDSWRIIGYMPVPTYGGGYVFNGTSLIIVSGYIGYSAYTNDILIYSPQNNNWTILNGVLP 235 Query: 208 IRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITV 267 + + + + + +Y + + G + Y G + Sbjct: 236 YWIHDSALAYYRGVLFIV------GGYIYTAGSGGVNNAILAYYNGNLQRVGYLPVPVYS 289 Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWG 327 + +G + DVS P + + W Sbjct: 290 AGYVQVGNMLYLAGGIGSSL-----SDVSALQLITFNFPPLPPKITSYSAGNESVTLGWN 344 Query: 328 EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFG------AFYDFSLDGEYGCYDPTKAL 381 + + N + F+ S + G +++ G P+ + Sbjct: 345 PVRLSSGYEIIYWNNMGFNSSINVGNVTSYTVTGLKDGITYYFEVLAYNSIGYSSPSSII 404 Query: 382 TTA-VTDFSASTIHWMHPFGEGVLV------GCD-----TSLWLLSISLSKGLSIDFRRV 429 T + + + + + V + ++ S S Sbjct: 405 TLTPASVPNPPQLVSVKYGNDNVTLNWLPPTFSGGYLLLGYYVIVKNENSMVSSHFVNST 464 Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRR 455 S + P V+ + V +G Sbjct: 465 SLTISNLTPNVTYNVFIYAVNKLGNS 490 >gi|323487552|ref|ZP_08092845.1| hypothetical protein HMPREF9474_04596 [Clostridium symbiosum WAL-14163] gi|323399153|gb|EGA91558.1| hypothetical protein HMPREF9474_04596 [Clostridium symbiosum WAL-14163] Length = 2180 Score = 37.2 bits (84), Expect = 8.5, Method: Composition-based stats. Identities = 26/251 (10%), Positives = 58/251 (23%), Gaps = 3/251 (1%) Query: 84 FGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVH--KDHPPHH 141 F + ++++ + A G +P + + V + + + + Sbjct: 1058 FENNEIKVRKKSYTVTVGTAANGTVSASPTSAAAGTEVTLTVNPDSGYQLEALTVYKTSN 1117 Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 FT NAK I S A+ T++ Sbjct: 1118 TSTTVTVSNNKFTMPSYNVTVSATFQKTADQTAVDNAKAIIEGGSYSVAQATANSVADVK 1177 Query: 202 LDKGRSIRLGCHPPEW-AKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDN 260 +I +I + + + G +G+ + Sbjct: 1178 TWLATTINSLSGMSGTNVTVQAGNITVSDFTAAQADTTGSGGSNGNFKFTVSLSKNGAAA 1237 Query: 261 NITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 T T + + SA G+ + P + S+ Sbjct: 1238 TTTSKTGTITKTPYNPSSAKEITGFTIPSGNTDINQTNHTIAVTMPAGTNVTSLTPSITV 1297 Query: 321 WFMSAWGEQEG 331 ++ G Sbjct: 1298 SDKASVSPASG 1308 >gi|296284681|ref|ZP_06862679.1| VCBS [Citromicrobium bathyomarinum JL354] Length = 1045 Score = 37.2 bits (84), Expect = 8.6, Method: Composition-based stats. Identities = 31/285 (10%), Positives = 67/285 (23%), Gaps = 7/285 (2%) Query: 166 LGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI 225 + + + + + A ++ + + G + S Sbjct: 481 IAFTVDLAAGAATAGNATYAISNIQNVLASPSSGYST---TVYGDGLSNAIGVDPVSSSG 537 Query: 226 GAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 +V + TG G+ T + + + SG Sbjct: 538 TGSMVFYGRGGNDTLTGGLGNDILDGGEGTDTAVFSGSRDAYAITQIADGQFEVSGPDGT 597 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFH--NNRL 343 + DG + + W + YP V + R Sbjct: 598 DTLTSIEHLQFADGTYVFGPTTGPVSLGYAGFGYAPEAGGWADNTTYPRGVADIDGDGRA 657 Query: 344 LFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF--GE 401 G L LS+ + + G A A + TI ++ + Sbjct: 658 DLIGFGSAGLFAALSNGDGTFGETFLAYNGFGASDAAGGWANDNLYPRTIADVNGDGLQD 717 Query: 402 GVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCL 446 + G L + + G ++ F + + + G Sbjct: 718 LIGFGSAGVSIALGQAPASGQAVAFGPATLAYAGFGASDAAGGWT 762 >gi|269104273|ref|ZP_06156969.1| hypothetical cytosolic protein [Photobacterium damselae subsp. damselae CIP 102761] gi|268160913|gb|EEZ39410.1| hypothetical cytosolic protein [Photobacterium damselae subsp. damselae CIP 102761] Length = 3902 Score = 37.2 bits (84), Expect = 8.7, Method: Composition-based stats. Identities = 22/278 (7%), Positives = 63/278 (22%), Gaps = 12/278 (4%) Query: 93 VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG-STAVFVHKDHPPHHLLYIQDGDKI 151 + +T T + + T+ P ++ + Sbjct: 630 TIDGNTLTVEGTMCNGAAIQETSYELQFYVITSGAGDTSTASTTAEPVGGWSFMTNIGNE 689 Query: 152 SFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLG 211 + + + + +I+ + S + + G Sbjct: 690 ---ITGLTYGEGTTYLGRLTNQTNGTFSGTITVSGISAGAQIGAISLSDSNSSGSFYAGQ 746 Query: 212 CHPPEWAKNTNYSIGAYIVADDKV-------YRSLTTGRSGDRFGYSKGATYVKDNNITW 264 + ++ Y A D YR+L + + Sbjct: 747 TSEFSATQAVTSTVADYGDAPDSGAGIGTGNYRTLLADNGPSHTSSTSLLIGTNATDEES 806 Query: 265 ITVLNLSSKTSRESASGAVAP-YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFM 323 + + ++ I+D + + + + W Sbjct: 807 DALGTGVTTADGDNNDATNDEDSVSNLTIEDTATTFSETIDVTNTTGSTAYLYAWIDWDD 866 Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFG 361 S + + + S+ T + ++ + S + S G Sbjct: 867 SGTFDVDEFVSNGTGTDEQIDIADSAISASIDWSSISG 904 >gi|294054434|ref|YP_003548092.1| hypothetical protein Caka_0900 [Coraliomargarita akajimensis DSM 45221] gi|293613767|gb|ADE53922.1| hypothetical protein Caka_0900 [Coraliomargarita akajimensis DSM 45221] Length = 776 Score = 36.8 bits (83), Expect = 9.2, Method: Composition-based stats. Identities = 30/281 (10%), Positives = 69/281 (24%), Gaps = 21/281 (7%) Query: 86 DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYI 145 ++ + + + L E G P + + Sbjct: 155 NQYIGFIDNAVADTNGELLVYIDDGEGNGNSSRTWYEGVAVGDPYSLPEPPPLPGGAVEV 214 Query: 146 QDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS--ISQADTSTARITSDMKIFKPLD 203 ++ DE +L G + + A ++ ++++ Sbjct: 215 APDGVWTWFNDERAIWHLGYLYAGYVRSDGHVGLSRFDPATATSTHVQLSTSSSQQVDDH 274 Query: 204 KGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNIT 263 SI ++ DD++ + + F T + Sbjct: 275 NNCSI-------------------TVLPDDRLLVVYSKHNANWSFFSRISTTTTPASLAD 315 Query: 264 WITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFM 323 W + S+ S A+ + ++ S G Sbjct: 316 WGSEQVTSTPASNTYANTYRLSGESNKIYNFHRSINFNPTITTSSDNGVTWGTPTHFIDT 375 Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFY 364 G YP + + H +R+ + G +V S + +Y Sbjct: 376 GNNGSVRPYPRYCSNHTDRIDLIYTDGHPRAVANSVYHMYY 416 >gi|219124937|ref|XP_002182749.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217406095|gb|EEC46036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 1567 Score = 36.8 bits (83), Expect = 9.6, Method: Composition-based stats. Identities = 26/285 (9%), Positives = 66/285 (23%), Gaps = 26/285 (9%) Query: 59 DCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN 118 + +N + +G + + +++ T W+ + + Sbjct: 1279 TIQSSAANNNWTAVIYGNGTFVAVAATGIGDRVMTSPYGTTWT--IRASAADNDWNGLTY 1336 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG--------DGM 170 + ST + P + + + + + ++ M Sbjct: 1337 GDGIFVAVASTGLGNRVMTSPDGIAWASRPSAADNNWTAVAYGNGIFVAVAASGIGNRIM 1396 Query: 171 ISGVKSNAKLSISQADTSTARITSDMKIFKP---LDKGRSIRLGCHPPEWAKNTNYSIGA 227 S + L + D +T F G + +W T+ + Sbjct: 1397 TSRDGTTWTLRGNAVDNEWRSVTYAEGTFVAVASTGIGNRVMTSPDGIQWTIQTSAADNW 1456 Query: 228 YIVA--DDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 + D + ++ +GDR S + + Sbjct: 1457 WSAVTYGDGTFVAVAATGTGDRVMTSPDGITWTTQTSAPDIDWRSVTYGDGIFVA----- 1511 Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE 330 + S R ++ Q + W +G Sbjct: 1512 ------VASTSIGNRVMTSPDGITWTTQGSANDNDWHSVTYGNTT 1550 >gi|295108261|emb|CBL22214.1| Bacterial surface proteins containing Ig-like domains [Ruminococcus obeum A2-162] Length = 815 Score = 36.8 bits (83), Expect = 9.8, Method: Composition-based stats. Identities = 16/179 (8%), Positives = 46/179 (25%), Gaps = 11/179 (6%) Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 + + D E D + + + + + + + G+ V + Sbjct: 52 FSDDSISIEDSDDVTEADTADDSILIDNSDSAEYSESDTDVFSAGDEVDAFTAADEVSVQ 111 Query: 416 ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGF---RFNEI 472 V S ++ + ++ G + + ++E N+I Sbjct: 112 ADEEAKTHSIKVTVVNSKGVVSGMYAMDNAIITKQDDGTYLVKMHQASENREYMALTNDI 171 Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 T H + + + + + + + P ++ AW Sbjct: 172 TAATQHRVDWYVADSNW--------YYTIPVANLTDPVYASFSYTKNVNKGAAWSNVQT 222 >gi|146317870|ref|YP_001197582.1| sugar ABC transporter periplasmic protein [Streptococcus suis 05ZYH33] gi|253751108|ref|YP_003024249.1| surface-anchored protein [Streptococcus suis SC84] gi|145688676|gb|ABP89182.1| ABC-type xylose transport system, periplasmic component [Streptococcus suis 05ZYH33] gi|251815397|emb|CAZ50970.1| putative surface-anchored protein [Streptococcus suis SC84] Length = 1238 Score = 36.8 bits (83), Expect = 9.8, Method: Composition-based stats. Identities = 26/243 (10%), Positives = 56/243 (23%), Gaps = 1/243 (0%) Query: 57 YRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 + + RV DG L F K + I + S + Sbjct: 205 IGEVKQWNT-FRVVFKENSDGSVYALEFTGKAVSIKKLSSIDAPNQTGEKYAETGHNLGS 263 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 + + V G T + P ++ + + + D +I Sbjct: 264 EEHRIRLVVRGDTVTVSDNEIPLLSYSSPENWEGATASIVFTPISNRSVSLDDIIIRQTR 323 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 + + + +T + + P E + Y + V Sbjct: 324 ALRSLLVVSRIDGQEVTDIQPGSIRGNTSQVFVGDSLPLEVIEKPGYQFIGFKDEFGNVV 383 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 T D A + + T + + W ++ + Sbjct: 384 DLSTFSVPNDESDLVIYADFQTAEVVNRETKTFYIDSIEGNDTNSGESETNAWKTLEQLR 443 Query: 297 KDG 299 K+ Sbjct: 444 KNT 446 >gi|331009026|gb|EGH89082.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 385 Score = 36.8 bits (83), Expect = 10.0, Method: Composition-based stats. Identities = 46/363 (12%), Positives = 91/363 (25%), Gaps = 2/363 (0%) Query: 98 TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDE 157 T + + + + TA V D+ S Sbjct: 17 TLDNTGFTNASGNAGSGVTSSNNYAIDTLRPTATIVVADNALAVGETSLVTITFSEAVSG 76 Query: 158 IKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEW 217 + + S+ ++ + T TA ITS + G + G Sbjct: 77 FTNADLNIANGTLSAVSSSDGGITWTATLTPTAGITSASNSVTLNNGGVTDLAGNAGSGL 136 Query: 218 AKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE 277 + NY+I V + V + + + V N + T Sbjct: 137 TLSNNYAIDQTRPTASIVIADNALSAGETSLVTITFSEAVSGFDNSDLNVPNGTLSTVNS 196 Query: 278 SASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVT 337 + G + V+ IS+ T +++ PS Sbjct: 197 NDGGITWTATFTPNAN-VNASTGQISLNSAGVTDLAGNAGSGIISSASFTVDTTRPSATI 255 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 + L +G + + F + L G + +T + T + Sbjct: 256 VVADNALSAGETTLVTFTFSQAVSGFSNADLSVANGTLSAVSSSDGGITWTATFTPNANV 315 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK 457 ++ +T + S S G + + + V D L+ + R Sbjct: 316 TDAGNLITLDNTGVTNASGSTGSGTTASNN-YTIDTQRPTATIVVTDSLLAIGETSRVTI 374 Query: 458 YIS 460 S Sbjct: 375 TFS 377 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.113 0.287 Lambda K H 0.267 0.0349 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,137,727,379 Number of Sequences: 14124377 Number of extensions: 120482630 Number of successful extensions: 268638 Number of sequences better than 10.0: 2124 Number of HSP's better than 10.0 without gapping: 392 Number of HSP's successfully gapped in prelim test: 1732 Number of HSP's that attempted gapping in prelim test: 265690 Number of HSP's gapped (non-prelim): 3431 length of query: 578 length of database: 4,842,793,630 effective HSP length: 145 effective length of query: 433 effective length of database: 2,794,758,965 effective search space: 1210130631845 effective search space used: 1210130631845 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 84 (37.2 bits)