BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781208|ref|YP_003065621.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] (578 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781208|ref|YP_003065621.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] gi|254040885|gb|ACT57681.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] gi|317120673|gb|ADV02496.1| hypothetical protein SC1_gp080 [Liberibacter phage SC1] gi|317120817|gb|ADV02638.1| hypothetical protein SC1_gp080 [Candidatus Liberibacter asiaticus] Length = 578 Score = 1194 bits (3088), Expect = 0.0, Method: Compositional matrix adjust. Identities = 578/578 (100%), Positives = 578/578 (100%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC Sbjct: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS Sbjct: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL Sbjct: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT Sbjct: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR Sbjct: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF Sbjct: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK Sbjct: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF Sbjct: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA Sbjct: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK Sbjct: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578 >gi|315122895|ref|YP_004063384.1| hypothetical protein CKC_05755 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496297|gb|ADR52896.1| hypothetical protein CKC_05755 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 588 Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust. Identities = 222/567 (39%), Positives = 337/567 (59%), Gaps = 23/567 (4%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +TK SF+ GE+SP+++QSR DL LH+QG+++ N+IPL+ G LV P + Y Sbjct: 1 MPKGAYTKRSFAGGEVSPQIMQSRSDLELHSQGLSQCFNMIPLQDGSLVRRPPLYRYEHI 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P+++R+ SF++ L +FG+KK+ V V T P F + Y TPY+F++ + Sbjct: 61 DLPPKASRILSFALGGDDAVLFIFGEKKMVYVEV---TGIKPPQFIRFYDTPYSFREAEQ 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L+ A G+ V VH H P+ + + + G F+++ F PPPWLG + G K +AKL Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGLREVGGKKHDAKL 173 Query: 181 SISQADTSTARIT--SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 ++ + T +IT S + IFK D GR +RLG P +W NT Y A++ KVYR Sbjct: 174 RVTLSATRKGKITVTSTLPIFKTKDVGRMLRLGWLPKDWTANTLYPENAFMQMYGKVYRC 233 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291 +T G SG F ++ TY++D +TW + ++ K++ + PYYVWG+ Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 I + + +++ V S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +VY S + F DFS D G D K+L+ A+TD + S I W P +G+++G DTSL Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412 Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471 W++ + +G ++ RR++G GVY PP+S+GD L+FV G GRRI+ I G++EQGF+F E Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 +TQ DHL + RI QL YQE+P+S++WV+ N+ LLGC A + +WH H + Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLGCSLHANSKEKGSWHVHKL 527 Query: 532 SDKHY-VLSAASFPNDNRGGTSLWMLV 557 + ++S +S ++G T++W+L+ Sbjct: 528 GGRGVKIMSLSSCLCLDQGETTVWLLL 554 >gi|315121933|ref|YP_004062422.1| hypothetical protein CKC_00915 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495335|gb|ADR51934.1| hypothetical protein CKC_00915 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 588 Score = 419 bits (1076), Expect = e-115, Method: Compositional matrix adjust. Identities = 223/568 (39%), Positives = 334/568 (58%), Gaps = 23/568 (4%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +TK SF+ GE+SP+++QSR DL LH+QG+++ N+IPL G LV P + Y Sbjct: 1 MPKGAYTKRSFAGGEVSPQIIQSRSDLELHSQGLSQCFNMIPLSDGSLVRRPPLHRYEHI 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P+++R+ SF++ L +FG+KK+ V V T P F + Y TPY+F++ + Sbjct: 61 DLPPKASRILSFALGGDEAVLFIFGEKKMVYVEV---TGIKPPQFIRFYGTPYSFREAEQ 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L+ A G+ V VH H P+ + + + G F+++ F PPPWLG + G K +AKL Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGRREVGGKKHDAKL 173 Query: 181 SISQADTSTARIT--SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 ++ + T +IT S + IFKP D GR + LG P +W NT Y A++ KVYR Sbjct: 174 RVTLSATRKGKITVTSTLPIFKPKDVGRMLCLGWLPKDWTANTLYPENAFMQMYGKVYRC 233 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291 +T G SG F ++ TY++D +TW + ++ K++ + PYYVWG+ Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 I + + +++ V S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +VY S + F DFS D G D K+L+ A+TD + S I W P +G+++G DTSL Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412 Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471 W++ + +G ++ RR++G GVY PP+S+GD L+FV G GRRI+ I G++EQGF+F E Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 +TQ DHL + RI QL YQE+P+S++WV+ N+ LL C A + +WHTH Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLSCSLHANSKEKGSWHTHKS 527 Query: 532 SDKHY-VLSAASFPNDNRGGTSLWMLVA 558 ++S +S ++G T++W LV+ Sbjct: 528 GGGWVKIMSLSSCLCLDQGETTIWFLVS 555 >gi|317120716|gb|ADV02538.1| hypothetical protein SC2_gp080 [Liberibacter phage SC2] gi|317120777|gb|ADV02598.1| hypothetical protein SC2_gp080 [Candidatus Liberibacter asiaticus] Length = 590 Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 167/573 (29%), Positives = 265/573 (46%), Gaps = 51/573 (8%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K+SF++GE+SP + QS +L ++ +A N IPLR G L+ P + Y + Sbjct: 8 KNSFASGEVSPFVHQSGSNLKIYQSCLAHCHNYIPLRTGALMRRPGTRIYHVFDDVDKPQ 67 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 R+FSF ++V G KL I R T + PY +D +E A Sbjct: 68 RLFSFVKDAYTAYIIVLGYLKLHIFERRMGGCSKVT----TIEVPYKKEDVDEIEVAQNI 123 Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187 T VH HPP L ++ D + F E+ F P L + I K + L +T Sbjct: 124 DTLWMVHPKHPPCQL-ELKGKD---WEFKEVLFKHVPPLKEQFIDDKKVSINLKTPFENT 179 Query: 188 STAR-----ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 T + + +D ++FK +D GR + LG P W +T Y +Y+V +D++ + + G Sbjct: 180 ETGKTGMVSVEADGEMFKEMDIGRELNLGFRPQRWIPDTWYLDNSYVVHNDRLLKCINKG 239 Query: 243 RS-GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVW--GDIKDVSKDG 299 +S + +S KD + W V ES G +W G IK K Sbjct: 240 KSQSTEWTFSDKEHQQKDGSCLWEKV---------ESTKGNARNLLIWVTGVIKRF-KTA 289 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 + + + + Q + W + WG++EGYPS +TF NRL+ SG K + +V+ S Sbjct: 290 KCVLLELKGAFPLQNDLPTKHWLLGEWGQKEGYPSCITFFGNRLVLSGGKHNPQTVHFSK 349 Query: 360 FGAFYDFSLDGEY-GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS- 417 F DF+ E G D T + + + I W+ G+LVG +++LWL++ + Sbjct: 350 LDDFTDFNQISEQGGNTDLTSSFSVLLGSDVRQGIQWLSHTDSGLLVGTESALWLITQTS 409 Query: 418 ----LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG-----STEQGFR 468 +SK ++ R + G A P+ VG VF+ GR + + G +T+ +R Sbjct: 410 QNEVVSKA-TVAIRSIGNFGSIAVSPILVGSHCVFIKDTGRDLISLVGNRSADNTKTEYR 468 Query: 469 FNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 F ++ A+H+ + + + V Q+ P+SI+WVVL RL+GC F + E AWHT Sbjct: 469 FRDLNLFAEHILTKGVWEAVLQQSPYSIIWVVLRDG-----RLVGCTFDPDNEV-CAWHT 522 Query: 529 HMI----SDKHYVLSAASFPNDNRGGTSLWMLV 557 H + + H + S ASF + G LW+LV Sbjct: 523 HDLGGFYTQIHSLTSCASFLD---GQDDLWLLV 552 >gi|227355852|ref|ZP_03840245.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164171|gb|EEI49068.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 820 Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 120/517 (23%), Positives = 218/517 (42%), Gaps = 62/517 (11%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 SFS GE++P L R DL+ ++ + K N I +YG + + P + + + + +R+ Sbjct: 9 SFSGGEIAPSLY-GRVDLAKYSTALRKCHNFIVRQYGGVENRPGTRFIAETKYQNKKSRL 67 Query: 70 FSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 F L FGD+ +++ V+ + + +F TPY D L+Y Sbjct: 68 IPFQFSTVQTYALEFGDRYIRVFKDGGQVLYADGEHKGEVF--ELATPYKEADLFDLKYT 125 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI-S 183 VH D+PP L D D E K +G + ++ + + + Sbjct: 126 QSADVMTIVHTDYPPMELQRY-DHDDWKLVSVETK--------NGPFEDINTDKAMKVYA 176 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYRSL 239 A T +TS IF G+ L P W + ++ AD YR+ Sbjct: 177 SASTGQITLTSTHDIFGSEQIGKQFYLEQRDIDAVPVWETDKTTNLNDQRRADSNYYRAN 236 Query: 240 TTGRSGD-RFGYSKGATYVK---DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 + G++G R +++G ++ D I W + S G V I+ V Sbjct: 237 SGGKTGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVK-------IETV 280 Query: 296 SKDGRS-----ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350 S+DG++ +S P S + + S W + W + +GYPS V ++ RL F+GS+ Sbjct: 281 SEDGKTATGKVLSYIP-SNAVGEDNASH-KWARAVWNDVDGYPSTVVYYQQRLFFAGSRA 338 Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT 409 +++ S G + DF G +P + + ++ ++ + H G LV + Sbjct: 339 YPQTIWASRSGDYKDF------GRNNPIQDDDRIIYTYAGRQVNEIRHLIDVGSLVALTS 392 Query: 410 -SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE- 464 + ++ +K L S +G PP+SV + +++ G ++ +S S + Sbjct: 393 GGEYQITGDQNKVLTPSSFSMSSQGANGSSDLPPISVANIALYIQEKGSAVRDLSYSFDV 452 Query: 465 QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVV 500 G++ ++T LA+HLF + RI+ + P+SI W + Sbjct: 453 DGYQGTDLTMLANHLFQRHRIVDWSFTTVPYSIAWCI 489 >gi|268589382|ref|ZP_06123603.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131] gi|291315409|gb|EFE55862.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131] Length = 818 Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 125/547 (22%), Positives = 231/547 (42%), Gaps = 76/547 (13%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 SFS GE++P L R DL+ ++ + K N I +YG + + P + + + R+ Sbjct: 9 SFSGGEIAPSLY-GRIDLAKYSTALRKCSNFIVRQYGGIENRPGTKFIAAAKYPNKKCRL 67 Query: 70 FSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 F L GDK ++++ V+ + ++ +F TPY D +L++ Sbjct: 68 IPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEYKGEIF--ELATPYKEADLFNLKFT 125 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNA--KLSI 182 VH D+PP L + D+ K +P +G + ++ KL + Sbjct: 126 QSADVMTIVHADYPPMELQ--------RYDHDDWKLVPVE-TRNGPFEDINTDKERKLYV 176 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYRS 238 S A T +++ IF G+ I + P W + +I A YR+ Sbjct: 177 S-ASTGDVTLSATHNIFGAELVGKQIYIEQQAIDAVPVWETDKTTNINDQRRAGANYYRA 235 Query: 239 LTTGRSGD-RFGYSKGATYVK---DNNITW--------ITVLNLSSKTSRESASGAVAPY 286 T G+SG R +++G ++ D I W I +N S T +A+G V Y Sbjct: 236 NTAGKSGTLRPSHTEGMSWDGWGGDAGIQWEYLHSGFGIVKIN-SVSTDGLTATGKVVLY 294 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346 S +V ++ T W S W + +GYPS V ++ RL F+ Sbjct: 295 I------------PSNAVGEENATY--------KWARSVWNDVDGYPSTVMYYQQRLFFA 334 Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLV 405 GS+ +++ S G + DF G +P + + ++ ++ + H G LV Sbjct: 335 GSRAYPQTIWASRSGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGSLV 388 Query: 406 GCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG 461 + + ++ +K L S F +G PP++V + +++ G ++ ++ Sbjct: 389 ALTSGGEYQITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAY 448 Query: 462 STE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAE 519 S + G++ ++T +A+HLF + +I+ + P+SI W + +D+ +LL + E Sbjct: 449 SFDVDGYQGTDLTIMANHLFQRHQIIDWAFSIVPYSIAWCI---RDDG--KLLSLTYLRE 503 Query: 520 GEGDFAW 526 + FAW Sbjct: 504 QQV-FAW 509 >gi|48697202|ref|YP_024932.1| hypothetical protein BcepC6B_gp12 [Burkholderia phage BcepC6B] gi|47779008|gb|AAT38371.1| gp12 [Burkholderia phage BcepC6B] Length = 768 Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 129/584 (22%), Positives = 228/584 (39%), Gaps = 62/584 (10%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 SF AGELSP LL +R DL+ + G N I GP + + + + + + Sbjct: 10 SFDAGELSP-LLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWL 68 Query: 70 FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD--NKSLEYAVFG 127 F + DG +L FGD ++ V R + A TPY D + +A+ Sbjct: 69 LPFIVADGIAYMLEFGDHYIRFFVNRGQLVNAGAPV--EIATPYALADLTTEDGTFAIRA 126 Query: 128 S----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI- 182 + T H +P LL +F+ + F+ P+ + V S+ + + Sbjct: 127 TQSADTMYLFHGGYPTQKLLRTS---ATTFSLQPVTFVGGPF------AAVNSDNNVRVH 177 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN--TNYSIGAYIV--ADDKVYRS 238 + A T + + +F+P D G L + K + IG + D+VY Sbjct: 178 ASAGTGAVTLVASASVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELRRVGDRVYLC 237 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY--------YVWG 290 G + + ++ T+ + W S T + GA Y + G Sbjct: 238 TAVGTATPQVTGTETPTHTSGSR--WDGTGQDESATDEYGSIGAEWEYQHSGYGTVLITG 295 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVS----WFMSAWGEQEGYPSHVTFHNNRLLFS 346 D G + P + V ++ W S + +G+P TF NRL Sbjct: 296 YTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGTFWRNRLCLM 355 Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406 + +SV + F F D + A+ + + + WM + +L+G Sbjct: 356 RDRWLAMSVS-ADFETFKTKDADQQTD----DSAIVQQLNARQLNKLAWMVE-SDSLLIG 409 Query: 407 CDTSLWLLSISLSK----GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK-YISG 461 W++ + + +++ R + G PV VG ++FV GR+++ + Sbjct: 410 MTGDEWVIGPANASQPVSAANLNAARRTSYGSKRIQPVQVGGTIMFVQKAGRKLRDFKYD 469 Query: 462 STEQGFRFNEITQLADHLFNQR------ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCR 515 + + ++T++ADH+ R I+ L +Q+EPHS+VW + +L+GC Sbjct: 470 FSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAA-----RADGQLIGCT 524 Query: 516 FSAE-GEGD-FAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557 + E G D + WH H ++ +V AS P + LW++V Sbjct: 525 YDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWVIV 567 >gi|212710810|ref|ZP_03318938.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM 30120] gi|212686507|gb|EEB46035.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM 30120] Length = 818 Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 121/538 (22%), Positives = 228/538 (42%), Gaps = 58/538 (10%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 SFS GE++P L R DL+ ++ + K N + +YG + + P + + + R+ Sbjct: 9 SFSGGEIAPSLY-GRIDLAKYSTALRKCENFLVRQYGGIENRPGTKFIAAAKYPNKKCRL 67 Query: 70 FSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 F L GDK ++++ V+ + + +F T TPY D +L++ Sbjct: 68 IPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEHKGEIFELT--TPYKEADLFNLKFT 125 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI-S 183 VH D+PP L + D+ K +P +G + + + + Sbjct: 126 QSADVMTIVHADYPPMELQ--------RYDHDDWKLVPVE-TRNGPFEDINVDKERKVYV 176 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYRSL 239 A T +T+ IF G+ I + P W + A YR+ Sbjct: 177 SASTGEVTLTATHNIFGAELVGKQIYIEQQAVDAVPVWETDKTTIKNDQRRAGSNYYRAN 236 Query: 240 TTGRSGD-RFGYSKGATYVK---DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 T+G+SG R +++G ++ D I W + S G V V D + Sbjct: 237 TSGKSGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVKINSVSTD--GL 285 Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 + G+ IS P S + ++ + W S W + +GYPS V ++ RL F+GS+ ++ Sbjct: 286 TATGKVISYIP-SNAVGESNATY-KWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYPQTI 343 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWL 413 + S G + DF G +P + + ++ ++ + H G LV + + Sbjct: 344 WASRSGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGSLVALTSGGEYQ 397 Query: 414 LSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 ++ +K L S F +G PP++V + +++ G ++ ++ S + G++ Sbjct: 398 ITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQG 457 Query: 470 NEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 ++T +A+HLF + +I+ + P+SI W + +D+ +LL + E + FAW Sbjct: 458 TDLTIMANHLFQRHQIIDWAFTIVPYSIAWCI---RDDG--KLLSLTYLREQQV-FAW 509 >gi|221213947|ref|ZP_03586920.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166124|gb|EED98597.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 766 Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 132/584 (22%), Positives = 221/584 (37%), Gaps = 64/584 (10%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 SF AGELSP LL +R DL+ +A G N I GP V + + + + Sbjct: 10 SFDAGELSP-LLGARVDLAKYANGCLLLENFIATVQGPAVRRGGKRYVSAIKDSGKQAWL 68 Query: 70 FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD--NKSLEYAVFG 127 F + DG +L FGD+ ++ V R A TPY D + +A+ Sbjct: 69 LPFIVSDGIAYMLEFGDQYIRFYVNRGQLVNDSAPV--EIATPYALADLVTEDGTFAIRA 126 Query: 128 S----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 + T H +P L +F + F+ P+ + V N + + Sbjct: 127 TQSADTMYLFHGAYPTQKLSRTS---ATTFELQPVTFVGGPF------ATVNDNNSIRVQ 177 Query: 184 QADTS-TARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIVADDKVYRS 238 + S +T++ +F+ D G + P WA + + D+ YR Sbjct: 178 ASGQSGDVTLTANADVFRASDVGTLFYVEQEQPTGIVPWAVHAESHVNDIRRVGDRTYRC 237 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSR-------ESASGAVAPYYVWGD 291 G + + + T + W + E A + G Sbjct: 238 TQIGLNAPQV--TGQETPIHTEGRRWDGDGRDPDGDTYGSIGVEWEYQHSGYATVLITGF 295 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGV---SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348 + + P + V W S + +G+P TF +NRL Sbjct: 296 VNARQVSATVTTNNPNDPCMIPKPVVDSGTYKWARSLFNSTDGFPQMGTFWSNRLCVMRD 355 Query: 349 KGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407 + +SV F +F + D + D A+ + + + WM + +LVG Sbjct: 356 RWIAMSVSAD----FENFKTKDADQQTDD--SAIVQQLNARRLNKLAWMVE-SDSLLVGM 408 Query: 408 DTSLWLL-----SISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK-YISG 461 W++ S++LS ++ RR + G PV VG ++FV GR+++ + Sbjct: 409 TGDEWVIGKSNASLALS-ATNMSARRRTSYGSKRLQPVEVGGTILFVQKAGRKLRDFKYD 467 Query: 462 STEQGFRFNEITQLADHLFNQR------ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCR 515 + + ++T++ADH+ R I+ L YQ+EPHSIVW + +L+GC Sbjct: 468 FSSDNYVSTDVTKIADHVTRGRSGTNSGIMSLCYQQEPHSIVWAA-----RADGQLIGCT 522 Query: 516 FSAE-GEGD-FAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557 + E G D + WH H + +V AS P + LWM+V Sbjct: 523 YDEEAGRSDVYGWHRHPDVNG-FVECVASMPAPDGASDDLWMIV 565 >gi|221201505|ref|ZP_03574544.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207939|ref|ZP_03580945.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2] gi|221172124|gb|EEE04565.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2] gi|221178773|gb|EEE11181.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 767 Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 137/597 (22%), Positives = 230/597 (38%), Gaps = 89/597 (14%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 SF AGELSP LL +R D++ + G N I GP V + + + + Sbjct: 10 SFDAGELSP-LLGARVDIAKYPNGCKVMENFIATVQGPAVRRGGKRFVAAVKDSSKQAWL 68 Query: 70 FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD--NKSLEYAVFG 127 F + DG +L FGD ++ V R + A TPY D + +A+ Sbjct: 69 LPFIVSDGIAYMLEFGDHYIRFYVDRG--QLVNAGGPVEIATPYALADLVTEDGTFAIRA 126 Query: 128 S----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 + T H +PP LL +F+ ++ F+ P+ GV A Sbjct: 127 TQSADTMYLFHGAYPPQKLLRTS---ATTFSLQQVTFVSGPFQTINSDEGVTVKAS---- 179 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRS 238 T +T+ +F D G L + P T G D+ Y S Sbjct: 180 -GQTGAVTLTATAPVFSQADVGALFYLEQNDNTSVLPWSVHGTILETGLVRRVGDRTYVS 238 Query: 239 LTTGRSGDRFGYSKGATYVK----DNNIT--------------------WITVLNLSSKT 274 G + + S+ T+ + D ++T + TVL ++S + Sbjct: 239 TAIGPTAPQVTGSETPTHTRGRRYDGDLTDLANDNYGTIGIEWEYQHSGYATVL-ITSVS 297 Query: 275 SRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPS 334 + A+G V + + + PQS + G W + + +GYP Sbjct: 298 DSQHATGTV-----------TTNNPTDPCIIPQS--IVDTGT--YKWAHALFNAADGYPQ 342 Query: 335 HVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTI 393 TF NRL + SV F +F S D + D A+ + + + Sbjct: 343 MGTFWRNRLWMMRDRWLVGSVSAD----FENFASKDADQQTDD--SAIVQQLNARQLNKL 396 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLSK----GLSIDFRRVSGSGVYACPPVSVGDCLVFV 449 WM + +++G W++ + + +++ R + G PV VG ++FV Sbjct: 397 AWMVE-SDSLIIGMTGDEWVIGPANASQPVSATNLNAARRTSYGSKRIQPVQVGGTIMFV 455 Query: 450 CGVGRRIK-YISGSTEQGFRFNEITQLADHLF------NQRILQLVYQEEPHSIVWVVLE 502 GR+++ + + F ++T+LADH+ N I+ L +Q+EPHSIVW Sbjct: 456 QKAGRKLRDFKYDFSSDNFVSTDVTKLADHITRGRSGTNNGIMSLCFQQEPHSIVWAA-- 513 Query: 503 PKDNSFPRLLGCRFSAE-GEGD-FAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557 + +L+GC + E G D + WH H ++ +V AS P + LW++V Sbjct: 514 ---RADGQLIGCTYDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWLIV 566 >gi|288959382|ref|YP_003449723.1| hypothetical protein AZL_025410 [Azospirillum sp. B510] gi|288911690|dbj|BAI73179.1| hypothetical protein AZL_025410 [Azospirillum sp. B510] Length = 665 Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 77/277 (27%), Positives = 122/277 (44%), Gaps = 20/277 (7%) Query: 288 VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG-YPSHVTFHNNRLLFS 346 VWG + ++ G SV + + + W + AWG G +P+ VTFH NRL F+ Sbjct: 202 VWGWCR-ITAFGSVTSVTATVEAAWGGTTATAFWRLGAWGATTGTWPTAVTFHENRLAFA 260 Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH-PFGEGVLV 405 + +V+LS G F +F E G A+T D + I W+ FG + Sbjct: 261 ALQ----TVWLSCSGDFDNFGPTTENGTVAADNAITLTAADDQVNVIRWLRSAFGVLIAG 316 Query: 406 GCDTSLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS 462 + + SL + L+ RV +G PV V LVF RR+ ++ Sbjct: 317 TSGGPFAIQASSLREALTPINATMPRVHVAGAADVQPVRVATNLVFPSRSRRRLHLLNAE 376 Query: 463 -TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGE 521 G+ ++ +A H+ + + YQ+EP S++W+VL+ D + L G + E + Sbjct: 377 FAAAGYSAPDLALVASHITRHAVKAMAYQQEPWSVMWLVLD--DGT---LAGVTYVPELD 431 Query: 522 GDFAWHTHMISDKHY-VLSAASFPNDNRGGTSLWMLV 557 AWH H + VLS A P +R LW++V Sbjct: 432 -ILAWHRHPLGGTAVKVLSVACIPAADR--DELWLVV 465 >gi|262043557|ref|ZP_06016670.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039091|gb|EEW40249.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 511 Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 117/492 (23%), Positives = 200/492 (40%), Gaps = 37/492 (7%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 +W + SFS GE++P L R D++ + + K N I +YG + + P Q + Sbjct: 4 SWIQPSFSGGEIAPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTQFIAAAKYPD 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 R R+ F L FG ++ V+ + TPYT D L++ Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHNYMR-VIKDGGLVLTTGDVIYELATPYTENDVFGLKFT 121 Query: 125 VFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L Y D +I +++ P+ + +K + Sbjct: 122 QSADVMTIVHPSYPPKELRRYAHDNWQIV----DVQTTNGPFEDINV-----DESKTVWA 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRS 238 A T T +TS IF G+ L P P W + + SI AD YR+ Sbjct: 173 SAPTGTITLTSSSAIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIEDIRRADSNYYRA 231 Query: 239 LTTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSK 297 T G++G R +++G + S G V V GD + Sbjct: 232 NTAGKTGTLRPSHTEGMAWDGWGGTG--DDDTGVQWEYLHSGFGIVRITAVAGDGLTATA 289 Query: 298 DGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 D +S P++ + A + W AW GYP+ V ++ RL F+ S +++ Sbjct: 290 D--VVSRIPEN--VVGADKASYKWARYAWNSVNGYPATVVYYQQRLYFAASPAYPQTIWA 345 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWLLS 415 S G + DF G +PT+ V ++ ++ + H G LV + ++++ Sbjct: 346 SRTGDYKDF------GKSNPTQDDDRIVYTYAGRQVNEIRHLIDVGSLVVLTSGGEFVVT 399 Query: 416 ISLSKGLSIDFRRVSGSGVYAC---PPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471 +K L+ +S G C PP++V + +F+ G ++ ++ S + GF+ N+ Sbjct: 400 GDQNKVLTPSAFSLSSQGSNGCSDVPPIAVSNIALFIQEKGSVVRDLAYSFDVDGFQGND 459 Query: 472 ITQLADHLFNQR 483 +T LA+HLF +R Sbjct: 460 LTILANHLFQKR 471 >gi|218886166|ref|YP_002435487.1| hypothetical protein DvMF_1065 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757120|gb|ACL08019.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 692 Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 72/250 (28%), Positives = 114/250 (45%), Gaps = 21/250 (8%) Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 YPS V F RL F+GS+ +++ S G + + + D A+T + + S Sbjct: 274 YPSSVQFWQQRLCFAGSRSHPQTIWASRTGCYENMDVSRPLQTDD---AVTVTIASETVS 330 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS-----IDFRRVSGSGVYACPPVSVGDCL 446 + WM P +LVG W LS S+ S ++F+ GS PP++VGD + Sbjct: 331 AVRWMMP-ARKLLVGTGGGEWTLSGQGSEPFSPLSCLLEFQSARGSA--ELPPLAVGDGV 387 Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVVLEPK 504 + V GR ++ S + G+ + T LA+H+ R I+ YQ+ PHS+VW ++ Sbjct: 388 LAVQRGGRAVRDFRYSLDVDGYSGADQTILAEHMLRGRNIVDWAYQQSPHSVVWCAMD-- 445 Query: 505 DNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA-SFPNDNRGGTSLWMLVALSA-G 562 D + + G AE + WH H L P+D GG LW++V G Sbjct: 446 DGT---MAGLTLIAEHQ-VAGWHRHDTGGAVEALCVVPGPPSDPAGGDELWLVVRRDVDG 501 Query: 563 EERSFTVRLN 572 +R + RL+ Sbjct: 502 VQRRYIERLD 511 Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust. Identities = 28/91 (30%), Positives = 48/91 (52%), Gaps = 1/91 (1%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M TT ++SF+AGELSP L+ +R D + +A G RN++ +GP P ++ C Sbjct: 1 MARTTLIQNSFNAGELSP-LMAARGDQARYASGCRVLRNMLLHPHGPAFRRPGLRFMGAC 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQI 91 + R+ F +G +L F ++L++ Sbjct: 60 VDETVPPRLVPFVFNEGQAYVLEFAPERLRV 90 >gi|301046400|ref|ZP_07193560.1| conserved domain protein [Escherichia coli MS 185-1] gi|300301626|gb|EFJ58011.1| conserved domain protein [Escherichia coli MS 185-1] Length = 821 Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W + + +A+G Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 A V IS P SQ + + S W AW GYP V ++ Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWNSINGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKALTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|89152436|ref|YP_512269.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10] gi|74055459|gb|AAZ95908.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10] Length = 823 Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + L++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESLTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W L+ +R +A Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITA-- 279 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + + IS P SQ + + S W AW GYP V ++ Sbjct: 280 ----------VNGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|187736306|ref|YP_001878418.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC BAA-835] gi|187426358|gb|ACD05637.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC BAA-835] Length = 822 Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 75/254 (29%), Positives = 112/254 (44%), Gaps = 18/254 (7%) Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380 W A+G + GYP V FH RL F G+ G +++ S F F+ D Sbjct: 413 WSFGAFGVRNGYPCTVEFHQGRLWFGGTPGQPQTLWASRVDDFSAFTPGIP---ADSPMI 469 Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID---FRRVSGSGVYAC 437 LT A + + I W+ G+++G W LS + S+GL+ F R SG G + Sbjct: 470 LTMAAS--QQNRISWIASL-RGLMIGTSEGEWRLSATNSEGLNASNAGFERHSGVGSASL 526 Query: 438 PPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSI 496 +SV + L+FV G +++ + S E G++ +++ L+DHL + I+ Q Sbjct: 527 DALSVENSLLFVQQGGMKVRELFYSLEADGYQTRDVSLLSDHLLGEGIVDWTVQRSTAFH 586 Query: 497 VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND-NRGGTSLWM 555 VW VL D S C + AWH H + + +LS AS N +W Sbjct: 587 VWCVL--GDGSAV----CMTLNREQNVVAWHAHRL-EHGRILSVASLRGSRNTPDEEVWF 639 Query: 556 LVALSAGEERSFTV 569 VA GEE TV Sbjct: 640 AVARGEGEEACITV 653 >gi|327252176|gb|EGE63848.1| phage protein [Escherichia coli STEC_7v] Length = 823 Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W + + +A+G Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 A V IS P SQ + + S W AW GYP V ++ Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|323156125|gb|EFZ42284.1| phage protein [Escherichia coli EPECa14] Length = 823 Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIHPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T++ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTANASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W + + +A+G Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 A V IS P SQ + + S W AW GYP V ++ Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|332344346|gb|AEE57680.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 823 Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W + + +A+G Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 A V IS P SQ + + S W AW GYP V ++ Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWDSINGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLAPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|294493191|gb|ADE91947.1| conserved hypothetical protein [Escherichia coli IHE3034] Length = 823 Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 120/526 (22%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W L+ +R +A Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITA-- 279 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + + IS P SQ + + S W AW GYP V ++ Sbjct: 280 ----------VNGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|300898435|ref|ZP_07116776.1| conserved domain protein [Escherichia coli MS 198-1] gi|300357902|gb|EFJ73772.1| conserved domain protein [Escherichia coli MS 198-1] Length = 823 Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 119/526 (22%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W L+ +R +A Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITA-- 279 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + + IS P SQ + + S W AW GYP V ++ Sbjct: 280 ----------VNGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ +++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGSDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|46580124|ref|YP_010932.1| hypothetical protein DVU1714 [Desulfovibrio vulgaris str. Hildenborough] gi|46449540|gb|AAS96191.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233883|gb|ADP86737.1| hypothetical protein Deval_1582 [Desulfovibrio vulgaris RCH1] Length = 697 Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 140/589 (23%), Positives = 223/589 (37%), Gaps = 109/589 (18%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMP---LMQEYRDCRLDP 64 + +F+ GE+SP LL +R D + G RN +PL GP+ P M ++ P Sbjct: 8 QQAFNGGEISP-LLTARADQIRYQTGALTMRNAVPLAQGPVTRRPGLRFMGAAKEQGAGP 66 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF----GKTYK--TPYTFKDN 118 R+ SF L FG +++ W A G+ Y+ +PY D Sbjct: 67 --VRLVSFVFSAAQSRALEFGPGYVRV--------WMDAGLVSKNGQPYEVASPYGAADI 116 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP----PPWLGDGMISGV 174 L +A ++HPP L D D + F F+P P L G + Sbjct: 117 AGLRFAQSADVIYIASRNHPPRKLSRHADDD---WRFITPTFMPTQAAPGALTLGTLGTT 173 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 + S T+ + T + + P +G W + + ++ + + + Sbjct: 174 PGPGNETYSYKVTAVSATTGEESLASP--EGTITTTAMSSTYWVRVSWAAVPGAV--EYR 229 Query: 235 VYRSLTTGRSGDRFGY----SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290 VY+ R FG+ G T+ D NI GA Sbjct: 230 VYK-----RRYGVFGFIGRAVGGDTFFDDRNI------------------GA-------- 258 Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350 D +D P+++ F +A GE YP V F RL F+GS Sbjct: 259 DTEDT---------VPEAKNPF-----------TAAGE---YPGLVFFWQQRLGFAGSDK 295 Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL-VGCDT 409 L+V+LS AF + + D +A + + W+ G+ L +G + Sbjct: 296 RPLTVWLSQSAAFENLAASRPPQDDDGIEA---TLAGQRQNRFVWIE--GDRTLCLGTEG 350 Query: 410 SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ- 465 W LS + S+ F+ G P V GD L++V G ++ + S E+ Sbjct: 351 GEWTLSGQEGGPVTPTSLQFQSHGVRGSEGVPAVRAGDSLLYVQRGGGVVREFTYSFERD 410 Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGD-F 524 G+ ++T L L +++ YQ+ PHSIVW VL+ D + L R E D Sbjct: 411 GYVAPDLTLLTGVLRGRKVRAWAYQQSPHSIVWCVLD--DGTLAALTFLR-----EHDVV 463 Query: 525 AWHTHMISDKHYVLSAASFPNDNRGGT-SLWMLVALS-AGEERSFTVRL 571 WH H ++ + GGT ++WMLV + G+ER + R+ Sbjct: 464 GWHRHDTDGVVEDVTVIPGGDATAGGTDTVWMLVRRTVGGQERRYVERM 512 >gi|242278913|ref|YP_002991042.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638] gi|242121807|gb|ACS79503.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638] Length = 698 Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 70/250 (28%), Positives = 115/250 (46%), Gaps = 33/250 (13%) Query: 304 VAPQSQTLFQAGVSVVSWFMS---------AWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354 V P+ Q + S V W M W ++G+PS VTF RL F+ S + + Sbjct: 246 VHPEVQPYKLSRTSHVDWKMELVAFSSPPQEWNSEKGFPSCVTFFEERLCFAASPSNPQT 305 Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414 +++S G++ DF++ D A T ++ + I WM + +++G W Sbjct: 306 IWMSKAGSYEDFAVSSPVVDDD---ACTYTLSADQVNAIRWMVS-AKKLIMGTSGGEWW- 360 Query: 415 SISLSKGLSID--------FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-Q 465 LS G S+D RR + G A PPV VG ++F+ GR I+ +S S E Sbjct: 361 ---LSGGSSLDSVTPNSVMVRRETTHGSAAIPPVVVGGVMLFLQREGRTIRELSYSFEAD 417 Query: 466 GFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 G+ ++T LA+HL + I + YQ+ P S++W+ +D+ ++G + E E Sbjct: 418 GYTAPDLTILAEHLTRSNSITEWAYQQSPDSVIWMT---RDDGV--MVGLTYQREHE-VV 471 Query: 525 AWHTHMISDK 534 +H H K Sbjct: 472 GFHRHTTDGK 481 >gi|30387391|ref|NP_848220.1| hypothetical protein epsilon15p12 [Enterobacteria phage epsilon15] gi|30266046|gb|AAO06075.1| 12 [Salmonella phage epsilon15] Length = 825 Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 116/504 (23%), Positives = 200/504 (39%), Gaps = 63/504 (12%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 +W + SF+ GE+ P L R D+S + + K N I +YG + + P + + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDN 118 R R+ F L FG +++ V+ S+ + A+ PY D Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHNYMRVIKDGAYVLTTSNVIYELAM-------PYADTDL 115 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 +++ VH +PP L Y D +I ++ P+ + VK Sbjct: 116 FRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIV----DVTTKNGPFEDINVDETVKVY 171 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVAD 232 A A T T +T+ IF G+ L P P W + +I AD Sbjct: 172 AS-----ASTGTITLTASSAIFGAEQVGKLFYLE-QPAVDSVPVWETSKTTAINDVRRAD 225 Query: 233 DKVYRSLTTGRSGD-RFGYSKGATY-------VKDNNITWITVLNLSSKTSRESASGAVA 284 YR+ T+G++G R +++G ++ D I W + S G Sbjct: 226 SNYYRANTSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYL---------HSGFGIAK 276 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344 V GD + D +S P SQ + A S W AW GYPS V ++ RL Sbjct: 277 ITAVAGDGLTATAD--VVSFIP-SQVVGSANASY-KWAKYAWNSVNGYPSTVVYYQQRLY 332 Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGV 403 F+ S +++ S G + DF G +P + + ++ ++ + H G Sbjct: 333 FAASTAYPQTIWASRTGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGN 386 Query: 404 LVGCDT-SLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI 459 LV + + +S +K L+ F +G PP++V + +F+ G ++ + Sbjct: 387 LVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDL 446 Query: 460 SGSTE-QGFRFNEITQLADHLFNQ 482 + S + G++ ++T LA+HLF + Sbjct: 447 AYSFDVDGYQGTDLTILANHLFQK 470 >gi|120601703|ref|YP_966103.1| hypothetical protein Dvul_0653 [Desulfovibrio vulgaris DP4] gi|120561932|gb|ABM27676.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 699 Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 130/574 (22%), Positives = 213/574 (37%), Gaps = 90/574 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T ++SF+AGELSP L+ +R D + + G A N++ +G P ++ + Sbjct: 1 MARATIVRNSFNAGELSP-LMAARVDQARYPNGCASLCNMLLHPHGGAWRRPGLR-FMGL 58 Query: 61 RLDPRSN-RVFSFSIPDGGYALLVFGDKKLQI---VVVRSSTKWSPALFGKTYKTPYTFK 116 DP R+ F + +L FG + L+I + P +TP+ + Sbjct: 59 AADPAGPVRLIPFVFSEAQAYVLEFGPRSLRIWHGGGLVLGGDGEPFRL----ETPWAGE 114 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP---PPWLGDGMISG 173 +L + V PP L D + ++ FLP PP +G+ Sbjct: 115 QLTALRWCQSADMLYLVSHAGPPRRLERHGHAD---WRLVDVSFLPGVSPP---EGLHCT 168 Query: 174 VKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADD 233 VK + + T+ R + + + P + P ++ + ++ V D Sbjct: 169 VKPAGSRTWTYVVTAVHRESGEESLPTPPLQVTG------PDALSQTASVTLAWTPVQDA 222 Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293 YR G +G+ A GA Y G Sbjct: 223 GEYRVYRAGGGASVYGFLGSA--------------------------GAGETYTDTGRTP 256 Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 D P+++ F GE + +PS F RL F+G++ Sbjct: 257 DFDAG------PPEARNPFS-------------GEGD-WPSCAVFWQQRLCFAGTRNGPQ 296 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 +++ S GA+ +FS+ D A+T + + S + W+ P +LVG W Sbjct: 297 TIWASRSGAYGNFSVSRPLRDDD---AVTVTIAADTVSAVRWLMP-ARRLLVGTGGGEWT 352 Query: 414 LSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 LS + LS R S G P+SVGD ++ + GR ++ S + G+ Sbjct: 353 LSGQGEQPFSPLSCSLERQSSRGSGDVQPLSVGDAVLALQRGGRVVREFRYSLDVDGYAG 412 Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 ++T LA+HL +RI+ +Q+ P VW V E L+ E E WH Sbjct: 413 TDLTILAEHLTRGRRIIDWAWQQSPSGTVWCVTEDGG-----LIAMTRIPEHE-VAGWHR 466 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSAG 562 H+ VLS + P G LW+ V G Sbjct: 467 HVTDGA--VLSVCTIPGT--AGDELWVAVRREGG 496 >gi|215487813|ref|YP_002330244.1| hypothetical protein E2348C_2746 [Escherichia coli O127:H6 str. E2348/69] gi|215265885|emb|CAS10294.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 825 Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 116/504 (23%), Positives = 199/504 (39%), Gaps = 63/504 (12%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 +W + SF+ GE+ P L R D+S + + K N I +YG + + P + + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDN 118 R R+ F L FG +++ V+ S+ + A+ PY D Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHNYMRVIKDGEYVLTTSNVIYELAM-------PYADTDL 115 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 +++ VH +PP L Y D +I ++ P+ + VK Sbjct: 116 FRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIV----DVTTKNGPFEDINVDDTVKVY 171 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVAD 232 A A T T +T+ IF G+ L P P W + +I AD Sbjct: 172 AS-----ASTGTITLTASSAIFGAEQVGKLFYLE-QPAVDSVPVWETSKTTAINDVRRAD 225 Query: 233 DKVYRSLTTGRSGD-RFGYSKGATY-------VKDNNITWITVLNLSSKTSRESASGAVA 284 YR+ T G++G R +++G ++ D I W + S G Sbjct: 226 SNYYRANTAGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYL---------HSGFGIAK 276 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344 V GD + D +S P SQ + A S W AW GYPS V ++ RL Sbjct: 277 ITAVSGDGLTATAD--VVSFIP-SQVVGSANASY-KWAKYAWNSVNGYPSTVVYYQQRLY 332 Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGV 403 F+ S +++ S G + DF G +P + + ++ ++ + H G Sbjct: 333 FAASTAYPQTIWASRTGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGN 386 Query: 404 LVGCDT-SLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI 459 LV + + +S +K L+ F +G PP++V + +F+ G ++ + Sbjct: 387 LVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDL 446 Query: 460 SGSTE-QGFRFNEITQLADHLFNQ 482 + S + G++ ++T LA+HLF + Sbjct: 447 AYSFDVDGYQGTDLTILANHLFQK 470 >gi|292670776|ref|ZP_06604202.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541] gi|292647397|gb|EFF65369.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541] Length = 762 Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 22/220 (10%) Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALT 382 +SAW ++GYP V+F +RL+F+GS+ + + S G +Y+F ++ D A+T Sbjct: 345 LSAWSAKKGYPQAVSFFEDRLVFAGSRAKPQTYWASQSGDYYNFWVNTPQQDSD---AIT 401 Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWL------LSISLSKGLSIDFRRVSGSGVYA 436 ++ + I + PFGE +++ + + + K ++R G+ Sbjct: 402 GTLSGGQMNGIRAIIPFGEMLMLTSGGEYKVGGGNETFTPTNQKAEPQEYR-----GINN 456 Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN-QRILQLVYQEEPH 494 PV +G +V+V G I+ ++ S + + ++++ LA HLF I+ L YQ+ P+ Sbjct: 457 LTPVVIGGRIVYVQHQGSVIRDLTYSYDVDKYTGDDVSLLAAHLFEGHTIVALAYQQTPN 516 Query: 495 SIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 ++VW V E D + LLG + E + +AWH H + K Sbjct: 517 TVVWCVRE--DGA---LLGMTYIKE-QDVYAWHKHTTAGK 550 Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust. Identities = 38/140 (27%), Positives = 59/140 (42%), Gaps = 14/140 (10%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K SF+ GEL+P L R DL + G + +N+I LRYG P + + R+ Sbjct: 10 KPSFAGGELTPALY-GRTDLQKYDVGASTLKNMIVLRYGGATRRPGFRHVAKTQGGKRA- 67 Query: 68 RVFSFSIPDGGYALLVFGDKKLQI-----VVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 R+ F +L F +++ +VV+ +P + T YT D ++ Sbjct: 68 RLIPFQYSTEQSYVLEFTAGCIRVFTKGGIVVKDD---APLVI----PTSYTEADLSDIK 120 Query: 123 YAVFGSTAVFVHKDHPPHHL 142 Y VH +HPP L Sbjct: 121 YTQSADVLFLVHVNHPPMTL 140 >gi|218700982|ref|YP_002408611.1| hypothetical protein ECIAI39_2672 [Escherichia coli IAI39] gi|218370968|emb|CAR18795.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] gi|323948677|gb|EGB44582.1| hypothetical protein ERKG_04900 [Escherichia coli H252] Length = 823 Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 120/521 (23%), Positives = 217/521 (41%), Gaps = 72/521 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ + IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASVSIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W L+ +R +A+ Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAAN 281 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 IS P SQ + + S W AW GYP V ++ Sbjct: 282 GTT------------ATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHS 495 + ++ S + G++ N++T LA+HLF + I+ + P+S Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYS 482 >gi|169795391|ref|YP_001713184.1| phage-like protein [Acinetobacter baumannii AYE] gi|169148318|emb|CAM86183.1| hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 697 Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 120/511 (23%), Positives = 199/511 (38%), Gaps = 74/511 (14%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K++ S+GELSP LL +R D+ +A G K N +PL G P ++R + + Sbjct: 12 KNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRP-GTKFRS--IFAGAL 67 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT--PY-TFKDNKSLEYA 124 R+ F LL+ G L++ ++P + Y+T PY T + + ++YA Sbjct: 68 RLIPFIANSENTYLLILGVSFLKV--------YNPRTYAVVYETVTPYNTAQKVREVQYA 119 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 FV D P LL D F P LG S +++S Sbjct: 120 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 171 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 + T ++ S P W+ Y G ++ + K +R+ + Sbjct: 172 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHNSKTWRA-----T 212 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK-----DVSKDG 299 D G AT + W V N ++ ++ G++ G +K D S+ Sbjct: 213 ADNKGVEPSATTPE-----WEEVTNEAANVFTPASVGSIVEIN-GGQVKITEYVDPSRVN 266 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 + V S A SW + A+ + GYP V F RL+F+ +K ++ Sbjct: 267 GEVLVKLTSDVQAIAK----SWVLKSIAFSAEAGYPKAVCFFKQRLVFANTKTSPNQMWF 322 Query: 358 SSF---GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414 S G F + + D + A + A + + I + G V + + Sbjct: 323 SRIGDDGNFLETTQDAD--------AFSIASSSAQSDNILHLSQRGGVVALTGGAEFLIN 374 Query: 415 SISLSKGLSIDFRRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 S S + GV A P VG+ L+FV G R++ +S E G E+ Sbjct: 375 SQGPLTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPEL 434 Query: 473 TQLADHLFNQR--ILQLVYQEEPHSIVWVVL 501 +Q+A H+ I +L +Q+ P+SIVW+V+ Sbjct: 435 SQIAPHIPENHAGIKELTFQQTPNSIVWIVM 465 >gi|324008552|gb|EGB77771.1| conserved domain protein [Escherichia coli MS 57-2] Length = 823 Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 120/526 (22%), Positives = 219/526 (41%), Gaps = 72/526 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W L+ +R +A+ Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAA- 280 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + IS P SQ + + S W AW GYP V ++ Sbjct: 281 -----------NGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + ++ S + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|331648168|ref|ZP_08349258.1| conserved hypothetical protein [Escherichia coli M605] gi|331043028|gb|EGI15168.1| conserved hypothetical protein [Escherichia coli M605] Length = 823 Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 120/521 (23%), Positives = 217/521 (41%), Gaps = 72/521 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W L+ +R +A+ Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAA- 280 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + IS P SQ + + S W AW GYP V ++ Sbjct: 281 -----------NGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHS 495 + ++ S + G++ N++T LA+HLF + I+ + P+S Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYS 482 >gi|298381710|ref|ZP_06991309.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279152|gb|EFI20666.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 823 Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 120/521 (23%), Positives = 217/521 (41%), Gaps = 72/521 (13%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281 AD YR++T G++G R +++G ++ I W L+ +R +A+ Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAA- 280 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341 + IS P SQ + + S W AW GYP V ++ Sbjct: 281 -----------NGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400 RL F+ S +++ S G + DF G +PT+ + ++ ++ + H Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381 Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 G LV + ++++ +K L S F +G PP++V + +FV G + Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHS 495 + ++ S + G++ N++T LA+HLF + I+ + P+S Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYS 482 >gi|117624704|ref|YP_853617.1| hypothetical protein APECO1_4049 [Escherichia coli APEC O1] gi|115513828|gb|ABJ01903.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 823 Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 116/519 (22%), Positives = 213/519 (41%), Gaps = 58/519 (11%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62 Query: 58 RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 R CRL P + + V ++++ G + V D L V+ S+ + A TPYT Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 D +++ VH +PP L Y D ++ + +G + Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163 Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 + +++ + A T T +T+ IF G+ L P P W + + SIG Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222 Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYV 288 AD YR++T G++G R +++G ++ + + Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDIGIEWEYLHSGFGIARITAANG 282 Query: 289 WGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348 +V IS P SQ + + S W AW GYP V ++ RL F+ S Sbjct: 283 TTATAEV------ISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQRLYFAAS 334 Query: 349 KGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGC 407 +++ S G + DF G +PT+ + ++ ++ + H G LV Sbjct: 335 TAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLIDVGSLVAL 388 Query: 408 DT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST 463 + ++++ +K L S F +G PP++V + +FV G ++ ++ S Sbjct: 389 TSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSF 448 Query: 464 E-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500 + G++ N++T LA+HLF + I+ + P+S + + Sbjct: 449 DVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487 >gi|304398395|ref|ZP_07380269.1| conserved hypothetical protein [Pantoea sp. aB] gi|304354261|gb|EFM18634.1| conserved hypothetical protein [Pantoea sp. aB] Length = 824 Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 114/520 (21%), Positives = 200/520 (38%), Gaps = 68/520 (13%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 SF+ GE+SP + R DL+ ++ + + RN I +YG L + P + + + R R+ Sbjct: 9 SFAGGEISPNVY-GRVDLAKYSIALRRCRNFIVRQYGGLENRPGTRFIAEAKYPDRKCRL 67 Query: 70 FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKT----YKTPYTFKDNKSLEYAV 125 F L FG +++ L G TPY D L+ Sbjct: 68 IPFQFSTVQTYALEFGHNYMRVY-----KDGGQVLDGNNQVYELATPYQEADLFELKITQ 122 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 HK + P L S+ E+ P+ + VK A S Q Sbjct: 123 SADVMTICHKAYAPRELRRF---GHASWELVEVVTKNGPFEDINIDPSVKVYA--SSYQG 177 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTT 241 + + + ++ IF G+ L P W + ++G A D Y +LT Sbjct: 178 NIT---LNANASIFGSEQVGKLFYLEQVNVDSTPVWETDKAVAVGMTRRAGDNYYVALTA 234 Query: 242 GRSGD-RFGYSKGATYV-------KDNNITW------ITVLNLSSKTSRESASGAVAPYY 287 G++G R +++GA + D I W + ++S +S + AV Y Sbjct: 235 GKTGTLRPSHTEGAAWDGWGSNGDNDTGIQWEYQHSGFGIARITSVSSDGYIAAAVVQTY 294 Query: 288 VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSG 347 + D +V P + W AW + GYP VT++ RL+F+ Sbjct: 295 MPND-----------AVGPTKASY--------KWAKFAWNQVNGYPGTVTYYQQRLIFAA 335 Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVG 406 S +++ S G + DF G P V ++ ++ + H G LV Sbjct: 336 SIKYPQTIWCSKTGDYKDF------GKTSPIADDDRIVYTYAGKQVNEIRHLIDVGSLVA 389 Query: 407 CDTSLWLLSIS-LSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS 462 + + +K L+ F G + P++V + +F+ G ++ ++ S Sbjct: 390 LTSGGQFQIVGDQNKTLTPTAFSFSSQGADGASSVAPITVSNIALFIQEKGSVVRDLAYS 449 Query: 463 TE-QGFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVV 500 + G++ +++T LA+HLFN R++ + P+S W V Sbjct: 450 FDVDGYQGSDLTVLANHLFNGYRLVDWTFSVVPYSAGWAV 489 >gi|294648405|ref|ZP_06725904.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825710|gb|EFF84414.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 706 Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 130/571 (22%), Positives = 217/571 (38%), Gaps = 68/571 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K++F++GELSP + R DL + G + N +P+ G L + Sbjct: 1 MAKINLIKNNFTSGELSPHIWM-RTDLQQYRNGTKEMLNFLPIIEGGLKRRGGTEA---L 56 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F I LL+F ++ ++ + + K+ TPYT +D K Sbjct: 57 AITAGAIRILPFIISHSTAYLLIFKPNQIDVLDINGTVV-------KSLSTPYTAQDIKE 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS-FTFDEIKFLPPPWLGDGMISGVKSNAK 179 + Y H HP L +++ + ++ +++D F PP + V++ A Sbjct: 110 ISYTQNRYQFYIAHSKHP---LAWLRASEDLTNWSYDPFDFYVPP------LEEVETPAL 160 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLG--CHPPEWAKNTNYSIGAYIVADDKVYR 237 S + T + D + + G CH N Y A + Sbjct: 161 PLKSNEKNAGKVATLTASPYNIYDNSKRYQAGEICH--HTINNVKYYFRALRITQGNTPS 218 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITV---LNLSSKTSRESASGAVAPYYVWGDI-K 293 T+G Y + T + T V + ++ R V+P V G+I Sbjct: 219 FGTSGPEASPDYYWETTTVTEAQAFTAADVDKFVFINEGIVR--IDTYVSPSTVTGEILV 276 Query: 294 DVSKDGRSISVA-PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 +S D +I+ A Q +F+ + GYP VT + RL+ +G+K Sbjct: 277 KLSTDIEAIANAWTLKQDIFEVSL--------------GYPRAVTMYQQRLVIAGTKTYP 322 Query: 353 LSVYLSSFGAFYDF---SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDT 409 V+LS G +F + DG+ +A +D + +H G V+ G Sbjct: 323 NYVWLSRVGDVTNFLPTTSDGD-------SFTVSASSDQLTNVLHLAQSRGICVMTGGSE 375 Query: 410 SLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK-YISGSTEQGFR 468 + S++ + S P+ VG L+FV RI+ + + Sbjct: 376 LVISSQNSMTPTNTSILEHTSFGSTENIKPIKVGSELIFVQRGAERIRTLLYDYSIDSLT 435 Query: 469 FNEITQLADHLFNQR--ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 NE+T LA H+ + ++VY EP SI+W VL +L + E + AW Sbjct: 436 SNELTVLASHIAKKSGGFKEMVYCAEPDSIIWFVL-----GNGKLASLTLNRE-QSVIAW 489 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557 TH I VLS S P+ G L+ LV Sbjct: 490 STHDIGGT--VLSLTSLPS-TTGADRLYFLV 517 >gi|332875218|ref|ZP_08443051.1| carbohydrate binding domain protein [Acinetobacter baumannii 6014059] gi|332736662|gb|EGJ67656.1| carbohydrate binding domain protein [Acinetobacter baumannii 6014059] Length = 692 Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 121/511 (23%), Positives = 196/511 (38%), Gaps = 74/511 (14%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K++ S+GELSP LL +R D+ +A G K N +PL G P ++R + + Sbjct: 7 KNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRP-GTKFRS--IFAGAL 62 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK--TPY-TFKDNKSLEYA 124 R+ F LL+ G L++ ++P + Y+ TPY T + + ++YA Sbjct: 63 RLIPFIANSENTYLLILGVSFLKV--------YNPRTYAVVYEAVTPYNTAQKVREVQYA 114 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 FV D P LL D F P LG S +++S Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 + T ++ S P W+ Y G ++ K +R+ Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHTSKTWRATI---- 208 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK-----DVSKDG 299 D G AT + W V N ++ S+ G++ G +K D S+ Sbjct: 209 -DNKGVEPSATTSE-----WEEVTNEAANVFTPSSVGSIVEIN-GGQVKITQYVDPSRVN 261 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 + V S A SW + A+ GYP V F RL+F+ +K ++ Sbjct: 262 GEVLVKLTSTVQAIAK----SWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWF 317 Query: 358 SSF---GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414 S G F + + D + A + A + + I + G V + + Sbjct: 318 SRIGDDGNFLETTQDAD--------AFSIASSSAQSDNILHLSQRGGVVALTGGAEFLIN 369 Query: 415 SISLSKGLSIDFRRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 S S + GV A P VG+ L+FV G R++ +S E G E+ Sbjct: 370 SQGPLTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPEL 429 Query: 473 TQLADHLFNQR--ILQLVYQEEPHSIVWVVL 501 +Q+A H+ I +L +Q+ P+SIVW+V+ Sbjct: 430 SQIAPHIPENHAGIKELTFQQTPNSIVWIVM 460 >gi|293609614|ref|ZP_06691916.1| predicted protein [Acinetobacter sp. SH024] gi|292828066|gb|EFF86429.1| predicted protein [Acinetobacter sp. SH024] Length = 692 Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 117/508 (23%), Positives = 195/508 (38%), Gaps = 68/508 (13%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K++ S+GELSP LL +R D+ +A G K N +PL G P ++R + + Sbjct: 7 KNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRP-GTKFRS--IFAGAL 62 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT--PY-TFKDNKSLEYA 124 R+ F LL+ G L++ ++P + Y+T PY T + + ++YA Sbjct: 63 RLIPFIANSENTYLLILGVSFLKV--------YNPRTYAVVYETVTPYNTAQKVREVQYA 114 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 FV D P LL D F P LG S +++S Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 + T ++ S P W+ Y G ++ K +R+ Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHSGKTWRATI---- 208 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 D G AT + W V N ++ S G++ + G +++ V Sbjct: 209 -DNKGVEPTATTSE-----WEEVTNEAANVFTPSNVGSIIE--INGGQVKITQYVDPSRV 260 Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + + V + SW + A+ GYP V F RL+F+ +K ++ S Sbjct: 261 NGEVLVKLTSAVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 320 Query: 361 ---GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 G F + + D + A + A + + I + G V + + S Sbjct: 321 GDDGNFLETTQDAD--------AFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQG 372 Query: 418 LSKGLSIDFRRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475 S + GV A P VG+ L+FV G R++ +S E G E++Q+ Sbjct: 373 PLTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLISPELSQI 432 Query: 476 ADHLFNQR--ILQLVYQEEPHSIVWVVL 501 A H+ I +L +Q+ P+SIVW+V+ Sbjct: 433 APHIPENHAGIKELTFQQTPNSIVWIVM 460 >gi|282848883|ref|ZP_06258273.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC 17745] gi|282581388|gb|EFB86781.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC 17745] Length = 772 Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 108/577 (18%), Positives = 222/577 (38%), Gaps = 78/577 (13%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 +F+ GE+SP + SR DL + + ++ N++ YG + Q + + R+ Sbjct: 11 AFTTGEVSPDV-SSRFDLEQYKSALLEAENVVIRPYGAVAKRQGSQYVGQVKYSDKPTRL 69 Query: 70 FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF-GKTYKTPYTFKDNKSLEYAVFGS 128 F F+ +L FGDK +++ W+ ++ G TP+T L + G Sbjct: 70 FEFTTNTNNSFMLEFGDKYIRV--------WNYGVYTGIEVTTPFTSDILFDLNCSQSGD 121 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 +P L D D + + K P+ D + + V S ++ +S Sbjct: 122 VMFICSGKYPIQTLSRYSDTD---WRLEAYKLTEQPY--DTINTDVNSTVTVTGDTIRSS 176 Query: 189 TARITSDM--------------------KIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAY 228 +DM + + +K RS G + N NY++ +Y Sbjct: 177 KDLFNADMVGMVMQLGYFVAAVHTKNTGTVVEKKEK-RSFMGGFNKWNEYNNINYNVESY 235 Query: 229 IVADDKVYRSLT----TGRSGDRFGYSKGATYV--------KDNNITWITVLNLSSKTSR 276 D ++ T TG + + G T+ D N+T + ++K Sbjct: 236 STDQDLAWKFTTHGTWTGTVKLQITTNNGTTWKDYRTYSSNNDYNVTDAGKIEPNAKLRI 295 Query: 277 ES--ASG------AVAPYYVWGDIK-DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWG 327 +S SG ++ PY WG ++ D +++ + + + S W M +WG Sbjct: 296 QSDIKSGECNVDLSILPYTTWGIVEFKEFVDSKTMKINILNGIVENEATS--KWKMGSWG 353 Query: 328 EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTD 387 GYP TF+ +R + + + + +++S G + +F ++ G ++T V + Sbjct: 354 RSNGYPKLCTFYQDRFVVAATNKNPNYIWMSRTGDYPNFGVEKVEGTITDDSSITLPVIN 413 Query: 388 FSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYACPPVSVGDCL 446 I + P + +++ W++S + + + + + G +C P +G+ Sbjct: 414 RKMYEIRHLVPANDLIILTSGNE-WIVSGDKTITPTNCNLKTQTQRGALSCEPQFIGNRC 472 Query: 447 VFVCGVGRRIKYISGSTE------QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500 VFV G ++ + S E Q T++ +L + Y ++P SI++ + Sbjct: 473 VFVQERGGTVRDMGYSYESDNYTGQDLTLFVKTRVRGYL----TITSAYAQDPDSIIYYI 528 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537 + + C + + W +H +++ Y+ Sbjct: 529 RNDGE------INCLTYIPEQKVYGW-SHFVTNGKYL 558 >gi|195541813|gb|ACF98016.1| hypothetical protein [uncultured bacterium 878] Length = 926 Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 66/293 (22%), Positives = 113/293 (38%), Gaps = 39/293 (13%) Query: 288 VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSG 347 WG K ++ ++SV + F + +W + + + GYPS VTF+ RL + G Sbjct: 356 TWGYAK-ITAYTSAVSVTADVLSNFGGTAASSAWRLGLYSQGGGYPSCVTFYEGRLFWGG 414 Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS---------TIHWMHP 398 V D S+ Y + P+ + D + + + WM Sbjct: 415 CPLAPTRV---------DGSMSSNYETFSPSSTASVVADDNAVAYPLDSGDVNNVLWMKD 465 Query: 399 FGEGVLVGCDTSLWLLSISLSKG----LSIDFRRVSGSGVY-ACPPVSVGDCLVFVCGVG 453 +G+LVG W++ + G ++ R + G Y PV G ++FV Sbjct: 466 DEKGLLVGTKGGEWVVRANTLNGALTPTNVKATRATTYGSYEGSQPVRTGKDIIFVQRKR 525 Query: 454 RRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512 R+++ ++ + E GF ++T L+ H+ QL +Q EP VW+ D P L Sbjct: 526 RKVRNLNYTYEIDGFNAGDLTILSGHIGRLEFGQLAFQSEPEGWVWMTR--GDGQLPVLT 583 Query: 513 GCRFSAEGEGDFAWHTHMISDKH--------YVLSAASFPNDNRGGTSLWMLV 557 R E W ++ V S S P+ N +W++V Sbjct: 584 YDR----DEQKIGWSRQIMGGYQDAARRRPPIVRSVCSIPDPNDARDEVWLIV 632 >gi|78357587|ref|YP_389036.1| hypothetical protein Dde_2545 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219992|gb|ABB39341.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 700 Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 62/231 (26%), Positives = 98/231 (42%), Gaps = 19/231 (8%) Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 +P V F+ RL F+G+ +++ S + ++ D A+T + + Sbjct: 279 WPGCVQFYQQRLCFAGTDEKPQTIWCSQSANYESMNISSPLRDDD---AVTVTIAADRVN 335 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVF 448 I WM P +LVG W LS S L+ RR + G P+ +G ++F Sbjct: 336 RIRWMMP-ARRLLVGTAGGEWQLSGSGDAPLTPVDAQLRRDTMHGSAGLMPLVIGQSILF 394 Query: 449 VCGVGRRIKYISGSTE-QGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDN 506 V GR ++ + E G+ ++T LA+HL +RI+ YQ+ P S+VW L Sbjct: 395 VQRDGRTVREFRYALESDGYDAGDLTILAEHLMRGRRIVSWCYQQSPASVVWCAL----- 449 Query: 507 SFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557 S L F E E WH H +V + + P D G +W+ V Sbjct: 450 SDGTLAAMTFLREHE-VVGWHRH--DTDGFVEAVTAIPGDE--GDEVWLSV 495 >gi|257139843|ref|ZP_05588105.1| hypothetical protein BthaA_11681 [Burkholderia thailandensis E264] Length = 489 Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 48/184 (26%), Positives = 83/184 (45%), Gaps = 16/184 (8%) Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF---SLDGEYGCYDPTKA 380 S W +GYP+ V+ RL +GS G + V+ S G + DF + DGE YD Sbjct: 84 SMWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDGEAFGYDMASD 143 Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP-- 438 +++ I GE V ++ + +++ V VY C Sbjct: 144 QVNQTVHLASAKILAALTQGEEFTVTGGSAGAITPTNIN---------VDSQSVYGCARA 194 Query: 439 -PVSVGDCLVFVCGVGRRIKYIS-GSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSI 496 PV VG+ +V+V G++++ ++ +R +T+LA H+ I+ + +Q EP + Sbjct: 195 RPVRVGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEPTPV 254 Query: 497 VWVV 500 VW+V Sbjct: 255 VWMV 258 >gi|332160974|ref|YP_004297551.1| hypothetical protein YE105_C1352 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665204|gb|ADZ41848.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862130|emb|CBX72294.1| hypothetical protein YEW_AK02310 [Yersinia enterocolitica W22703] Length = 657 Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 100/504 (19%), Positives = 195/504 (38%), Gaps = 98/504 (19%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K +F+AGE+SPRL+ R D++ +A G N + + +G ++ P + + + Sbjct: 7 KTNFTAGEISPRLM-GRVDIARYANGAKTVENAVCVIHGGVMRRPGSRFAAKAKFGDQKA 65 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 R+ + +L FG+ ++ ++ + +PYT SL Y Sbjct: 66 RLIPYVFNRSQAYVLEFGNGYVRFY--QNGAQIGAGSTPYEIASPYTSAMLSSLNYVQGA 123 Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187 T VH+D PP+ L D + P P+ Sbjct: 124 DTMFLVHQDVPPYRLQRKGQTDWV--------LEPAPF---------------------- 153 Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDR 247 I KP D+ R P +W K S+ ++ + +L+ SG Sbjct: 154 ----------IVKPFDEIRDT-----PEKWCKP---SVKEFV--GSAITLTLSDAESG-- 191 Query: 248 FGYSKGATYVKDNNITWITV----LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 G GA +V + +++ + +++ + TS A+G + R++ Sbjct: 192 -GALTGAGWVGADVGSYVRINSGLVHIQAVTSAAVATGVI----------------RTVL 234 Query: 304 VAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFG 361 A QS S +W + W + GYP T + RL+ +GS ++++S G Sbjct: 235 SAVQSS-------SPGAWTREDAVWSAEFGYPGAATLYQQRLVLAGSPKYPQTIWMSETG 287 Query: 362 AFYDFSL----DGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 + F L D + + V +T+ + GE + G S +I+ Sbjct: 288 IYLSFELGTDDDDAISFTVSSDQINPIVHLAQMNTLIALTSTGEFTITGGGES----AIT 343 Query: 418 LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQL 475 + +I + S G + PV VG ++F+ R++ ++ + + N+++ L Sbjct: 344 PT---NISVKNPSPYGCNSIKPVRVGTEIMFMQRANRKLFAVAYDPDSFVAYSANDLSVL 400 Query: 476 ADHLFNQRILQLVYQEEPHSIVWV 499 ++H+ + + YQ+EP + +W+ Sbjct: 401 SEHITLSGAVDMAYQQEPDAFIWM 424 >gi|220903983|ref|YP_002479295.1| hypothetical protein Ddes_0709 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868282|gb|ACL48617.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 689 Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 68/247 (27%), Positives = 116/247 (46%), Gaps = 22/247 (8%) Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 YP V FH R++ + + + + Y+S G F +F DP + L + S Sbjct: 278 YPGIVAFHQQRMVLAATPKNPQAFYMSRVGDFENFRKSRPLQDDDPVEYL---IASGSID 334 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLS----ISLSKG-LSIDFRRVSGSGVYACPPVSVGDCL 446 + W FG+ +L+G S + S S++ G +SI + GS A P+ +G+ + Sbjct: 335 AVTWAASFGD-LLIGTSGSEYKASGGDGASITAGNISITAQSYWGSAGLA--PIIIGNSI 391 Query: 447 VFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPK 504 + V G R++ + S E+ G+ N+++ +A HLF ILQ YQ+ P S +W V + Sbjct: 392 LHVQRHGSRVRDLFYSLEKDGYAGNDLSIMAPHLFEGHTILQWAYQQTPGSTIWCV---R 448 Query: 505 DNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEE 564 D+ LL + E + + W + + VLSAA+ + +G T + + G+ Sbjct: 449 DDGL--LLAFTYMKEHD-IWGWSRQITQGR--VLSAAAISGE-KGDTLMLVTERRIDGQP 502 Query: 565 RSFTVRL 571 R F RL Sbjct: 503 RIFLERL 509 >gi|83720451|ref|YP_441475.1| hypothetical protein BTH_I0919 [Burkholderia thailandensis E264] gi|83654276|gb|ABC38339.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 405 Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 82/182 (45%), Gaps = 16/182 (8%) Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF---SLDGEYGCYDPTKALT 382 W +GYP+ V+ RL +GS G + V+ S G + DF + DGE YD Sbjct: 2 WNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDGEAFGYDMASDQV 61 Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP---P 439 +++ I GE V ++ + +++ V VY C P Sbjct: 62 NQTVHLASAKILAALTQGEEFTVTGGSAGAITPTNIN---------VDSQSVYGCARARP 112 Query: 440 VSVGDCLVFVCGVGRRIKYIS-GSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVW 498 V VG+ +V+V G++++ ++ +R +T+LA H+ I+ + +Q EP +VW Sbjct: 113 VRVGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEPTPVVW 172 Query: 499 VV 500 +V Sbjct: 173 MV 174 >gi|309702804|emb|CBJ02135.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 807 Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 110/494 (22%), Positives = 195/494 (39%), Gaps = 72/494 (14%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYA 80 + R D++ + + K N I +YG + + P + + + R R+ F Sbjct: 1 MYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGEAKYPTRKCRLIPFQFSTVQTY 60 Query: 81 LLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVH 134 L FG +++ V+ S+ + A+ PY D +++ VH Sbjct: 61 ALEFGHNYMRVIKDGAYVLNSSNVIYELAM-------PYADTDLFRIKFTQSADVLTLVH 113 Query: 135 KDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT 193 +PP L Y D +I ++ P+ + VK A A T T +T Sbjct: 114 PAYPPKELRRYAHDNWQIV----DVTTKNGPFEDINVDETVKVYAS-----ASTGTITLT 164 Query: 194 SDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-R 247 + IF G+ L P P W + +I AD YR+ T+G++G R Sbjct: 165 ASSAIFGAEQVGKLFYLE-QPAIDSVPVWETSKTTAINDVRRADSNYYRANTSGKTGTLR 223 Query: 248 FGYSKGATYV-------KDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 +++G ++ D I W L+ +R +A VS DG Sbjct: 224 PSHTEGMSWDGWGGTGDSDTGIQW-EYLHSGFGIARITA---------------VSSDGL 267 Query: 301 S-----ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 + +S P SQ + A S W AW GYPS V ++ RL F+ S ++ Sbjct: 268 TATATVVSYIP-SQVVGSANGSY-KWARYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTI 325 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWL 413 + S G + DF G +P + + ++ ++ + H G LV + + Sbjct: 326 WASRTGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGNLVALTSGGEYT 379 Query: 414 LSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 +S +K L+ F +G PP++V + +F+ G ++ ++ S + G++ Sbjct: 380 ISGDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQG 439 Query: 470 NEITQLADHLFNQR 483 ++T LA+HLF +R Sbjct: 440 TDLTILANHLFQKR 453 >gi|254251749|ref|ZP_04945067.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158] gi|124894358|gb|EAY68238.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158] Length = 545 Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 48/179 (26%), Positives = 81/179 (45%), Gaps = 10/179 (5%) Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL---DGEYGCYDPTKALT 382 W +GYP V+ + RL +GS G V+ S+ G +YDF+ DG+ YD Sbjct: 142 WNPTDGYPCAVSLYQQRLYAAGSSGYPERVWASATGLYYDFTPGTDDGDGFSYDVASDQV 201 Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSV 442 + ++S I + GE + S+ +I+ R S G PV V Sbjct: 202 NQIMHLASSRILTVLTQGEEFTIDGG------SVGSITPTNINVRSQSIYGTARPRPVRV 255 Query: 443 GDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500 G+ L+F ++I+ ++ FR +T+LA H+ ++ + +Q EP +VW+V Sbjct: 256 GNELIFPQRAAKKIRSMAYDFNTDSFRSQNLTRLAAHITESGVVDIAFQAEPTPVVWMV 314 >gi|330007163|ref|ZP_08305905.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3] gi|328535510|gb|EGF61970.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3] Length = 825 Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 116/575 (20%), Positives = 210/575 (36%), Gaps = 92/575 (16%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 S + GE+SP L R DL + + + RN I + G + + P + + R +R+ Sbjct: 9 SLAGGEISPSLY-GRIDLEKYQTSLRRCRNFIVRQSGGIENRPGFRFLGSAKYADRYSRL 67 Query: 70 FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGK------TYKTPYTFKDNKSLEY 123 F L GD ++ WS TP+ L++ Sbjct: 68 IPFQFSVSQTYALELGDHYFRV--------WSNGALVTDGGSPVEVATPWPVSVISELKF 119 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI- 182 H D+PP + + D + G + ++ +++ Sbjct: 120 TQSADVMTVCHNDYPPLEIRRYGEADWRTAAVTTTS---------GPFQDLNTDDSVTVY 170 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIVADDKVYRS 238 + T + +T+ IFK G+ + + W + + +G + YR Sbjct: 171 ASGRTGSVTLTASSPIFKSQHVGKLFYMEQKAVDSVGRWETDKDIGVGDECRYQENFYRC 230 Query: 239 L---------------TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAV 283 + TTG S D +G N + W + S G Sbjct: 231 VDGGSNGTTGTVAPTHTTGDSWDGWGLGG------RNGVLWRYL---------HSGFGVC 275 Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS-------WFMSAWGEQEGYPSHV 336 + GD + D V P+ + VV W AW + +GYP V Sbjct: 276 RITAIAGDGLTATAD-----VVPRQDGEIELPAQVVGSTFATYKWAHYAWNDTDGYPGTV 330 Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDF-----SLDGEYGCYD-PTKALTTAVTDFSA 390 T++ RL+F GS+ +++ S G +++F +D + Y+ + L + Sbjct: 331 TYYQQRLIFGGSRAFPQTIWCSRTGDYHNFYRSNPKVDDDAITYNYAGRQLNKILHLLDV 390 Query: 391 STIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVC 450 + + GE + G +++ + G ++ + +GS A P++VG ++V Sbjct: 391 GQLIVLTSGGEFKVTGDSNG----NLTGTGGFAMSGQSFNGSSDLA--PINVGSVALYVQ 444 Query: 451 GVGRRIKYISGSTEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSF 508 G I+ + S +Q ++ +++T LA HLFN I +P S+ W S Sbjct: 445 QKGSIIRDLFYSFDQDSYQSSDLTLLASHLFNGYSIRDWALSVQPFSVAWCA-----RSD 499 Query: 509 PRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543 LLG + E + +AWH H +++ YV S S Sbjct: 500 GMLLGLTYLRE-QQVYAWHPHPMTNG-YVESICSI 532 >gi|212703338|ref|ZP_03311466.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098] gi|212673248|gb|EEB33731.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098] Length = 703 Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 51/174 (29%), Positives = 86/174 (49%), Gaps = 12/174 (6%) Query: 333 PSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAST 392 PS V FH R++ +G++ + YLS G F +F DP + L + S Sbjct: 291 PSVVAFHQQRMVLAGTRDSPQAFYLSRSGDFENFRKSRPLQDDDPVEYL---IASGSIDA 347 Query: 393 IHWMHPFGEGVLVGCDTSLWLLSISLSK----GLSIDFRRVSGSGVYACPPVSVGDCLVF 448 I W FG+ +L+G S + S + S ++I + GS A P+ +G+ ++ Sbjct: 348 IAWAASFGD-LLLGTSGSEYKASGNGSAITPGNITITAQSYWGSAGLA--PIIIGNAILH 404 Query: 449 VCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVV 500 V G ++ + S E+ G+ N+++ LA HLF R+ Q YQ+ P S++W+V Sbjct: 405 VQRHGAHVRDLFYSLEKDGYAGNDLSILAPHLFEGHRLRQWAYQQTPGSVLWIV 458 >gi|303327644|ref|ZP_07358084.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302862005|gb|EFL84939.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 681 Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 115/520 (22%), Positives = 196/520 (37%), Gaps = 119/520 (22%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMP----LMQEYRDCRLDPR 65 +F+ GE++P L +R DL +A + N +P +G P L C L P Sbjct: 7 NFTGGEVTP-TLSARYDLGRYANSLKIMENFLPNLHGDAYRRPGTYFLENLGEGCVLLP- 64 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125 FSF+ G L FG+K L+IV V + + ++PY D + YA Sbjct: 65 ----FSFNAEAGQNFALAFGEKSLRIVNVNGY------VVAEAMESPYALADVPEISYAQ 114 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF---------LPPPWLGDG------- 169 G HKD+ H ++ +++ + W G G Sbjct: 115 VGDVVYLAHKDYALHKVVRTGSAPAYAWSIGTVALNTSLAAPAAPTAAWQGGGGSYTLRY 174 Query: 170 MISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYI 229 +S V ++ K S+ A STA G +P +W + + + Sbjct: 175 KVSAVDADGKESLPSAVGSTAS-------------------GKYPTDWTEGNHCVLSWQA 215 Query: 230 V---ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 V A+ +YR + G G G ++G ++ N A A P Sbjct: 216 VEGAAEYNIYRE-SAGYYG-FIGIAQGTSFDDQNY----------------EADIADTPK 257 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346 W D + G +++ Q L S S++MS G+ E F +R L Sbjct: 258 EDWDPFADGNNPG-TVTFHQQRMVLAGTRNSPQSFYMSRTGDFE------NFRKSRPL-- 308 Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406 D + L+S ++DG I W FG+ +L+G Sbjct: 309 -QDDDPVEYQLAS------GTVDG----------------------IVWAASFGD-LLLG 338 Query: 407 CDTSLWLLS----ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS 462 ++ + + +K +I + GS A P+ +G+ ++ G R++ + S Sbjct: 339 TASAEYKATGDNGAITAKNCTITAQSYWGSAKIA--PIIIGNSVMHCQRHGSRVRDLYYS 396 Query: 463 TEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVV 500 E+ G+ N+++ LA HLF+ I Q +Q+ P S++W+V Sbjct: 397 LEKDGYAGNDLSVLAPHLFDGHTIRQWAFQQTPGSVLWLV 436 >gi|225157020|ref|ZP_03724959.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2] gi|224802748|gb|EEG20999.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2] Length = 773 Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 115/547 (21%), Positives = 203/547 (37%), Gaps = 80/547 (14%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 ++F+AGE +P+L R DL + + N+ + YG + +R Sbjct: 7 NNFTAGEWTPKL-DGRSDLQKYDAACRRLENMRVMPYGGARFRSAFGYVAKTKSAATPSR 65 Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 + F +L + L++ S +PAL + +PY +++Y Sbjct: 66 LMPFQFSTEQKFMLEWAHLALRVY----SAGAAPALL-QEIASPYPAAAVFAIQYRQIND 120 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 VH D+P L D D + + + + PP L + + + KLS+S D Sbjct: 121 VVYLVHPDYPVQRLARHADAD---WRLEAVDWAFPPMLDENV-----TETKLSLSAVDGV 172 Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI-----GAYIVADDKVYRSLTTGR 243 +T+ +F+P G L H E A +T+ S+ G + A V T Sbjct: 173 NVTMTASAALFQPGHVGSYWELR-HLKE-AASTSVSLATTSGGPFHSAAISVQGDWT-AN 229 Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE-SASG--------------------- 281 S +R+ + D TW TV ++++ R SASG Sbjct: 230 STERWYGTLSIERSLDGGTTWETVRKFTAESDRNISASGHQEELAQFRLKYQPTGDPFGA 289 Query: 282 ----AVAP--------------YYVWGDIKDVS-KDGRSISVAPQSQTLFQAGVSVVSWF 322 AP YV +K + D + V + A + W Sbjct: 290 GVWVGKAPTNYVKARAMLETTDAYVTALVKVTAYTDSTHVKVTVIDKAATVAATDI--WC 347 Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALT 382 SAW G+P + + RL+F G++ +++ S F +F +YG D Sbjct: 348 ESAWSPYRGFPRTIGLYEQRLIFGGTRHQPNTMWGSKTDDFENF----KYGEDDDAAVAY 403 Query: 383 TAVTDFSAS---TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSGVYA 436 T F+AS + W+ + + + + L+ I R S +G Sbjct: 404 T----FAASEQNNVQWVESLKRIQAATTAREFTVAAGNTDEPLTPSNIVVRSESANGAAH 459 Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQRILQLVYQEEPHS 495 PV V D +++V R++ ++ S E+ G+ ++T LA + + QL + +P Sbjct: 460 LQPVLVNDAILYVERQSRKVMEMAYSIEKDGYASVDLTLLAAPVTESGVKQLAFARQPDP 519 Query: 496 IVWVVLE 502 ++ V E Sbjct: 520 LLLAVTE 526 >gi|212703239|ref|ZP_03311367.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098] gi|212673505|gb|EEB33988.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098] Length = 694 Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 62/241 (25%), Positives = 102/241 (42%), Gaps = 28/241 (11%) Query: 328 EQEG-YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK---ALTT 383 E EG YPS V FH RL F+ S ++++LS G F + P K A+ Sbjct: 270 EGEGNYPSQVFFHQQRLGFAASNSRPITIWLSRSGEFESMAKS------TPPKDDDAIEV 323 Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS----ISLSKGLSIDFRRVSGSGVYACPP 439 + AS I W+ P + G + S W L ++L+ + + + G A Sbjct: 324 TLAATQASRIVWLQPDRSALAFGTEGSEWTLEPSEGVALTPATASFQLQTTNGGSDAVAA 383 Query: 440 VSVGDCLVFVC-GVGRRIKYISGSTEQGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIV 497 +SVG +++V G G ++ + + ++ LA H+ ++ +Q+EP++++ Sbjct: 384 LSVGGSVLYVQRGAGAIREFAYNYSADKYLGQDLNILARHMLRDVDVVAWSWQQEPYAVL 443 Query: 498 WVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS-DKHYVLSAASFPNDNRGGTSLWML 556 W VL S L G + E E WH H + D V P+D +W L Sbjct: 444 WSVL-----SDGTLAGLTYMKEQE-IVGWHRHTTAGDFVDVAGIPGTPDDQ-----VWFL 492 Query: 557 V 557 V Sbjct: 493 V 493 Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 38/156 (24%), Positives = 68/156 (43%), Gaps = 6/156 (3%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 T++ + GE+SP LL+ R D ++ G + RN +P+ G + P + D Sbjct: 6 TQNVLNGGEISP-LLRGRVDQPRYSTGAREMRNFVPMPQGGVTRRPGTRYLGTALGDGGR 64 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 F FS G +L FGD+ +++ + K +++P+ D +++ YA Sbjct: 65 LVPFVFSATQG--RMLEFGDRAMRVWLPDGRVVADEEGAPKIFESPFAAADLRAVRYAQS 122 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP 162 F H + P L D D + + E+ F+P Sbjct: 123 ADVIYFAHPGYAPRKLARHADDD---WRWSELTFMP 155 >gi|167041089|gb|ABZ05850.1| hypothetical protein ALOHA_HF400048F7ctg1g17 [uncultured marine microorganism HF4000_48F7] Length = 999 Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 61/277 (22%), Positives = 119/277 (42%), Gaps = 43/277 (15%) Query: 314 AGV-SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS----L 368 AGV + W + ++ GYP V + RL+F+G+ + +++ S F++FS L Sbjct: 428 AGVGATTEWQLGSFSGTTGYPRTVQLYQQRLVFAGTAEESQTIFFSKTADFFNFSATEPL 487 Query: 369 DGEYGCYDPT------------KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 + G D + A++ ++ + I W+ + + +G ++ L Sbjct: 488 GQQTGQRDSSGRSIVGEQIFEDAAISLTISSDTVDQIEWISE-DQRLTIGTSGGIYQLYG 546 Query: 417 SLSKGLSIDFR-RVSGSGVYACPPVS----VGDCLVFVCGVGRRIKYIS-GSTEQGFRFN 470 S F ++ +AC P + VG+ L++V GR+++ ++ + + Sbjct: 547 STDDLTLTPFNFSITKVSAWACDPTALPAKVGNNLLYVQNNGRKLRELAFDKVQDQYSAA 606 Query: 471 EITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 ++T ++ + ++ YQ++P+S++W + RL G + + AWH H Sbjct: 607 DLTLRSEDISESGLIATAYQDQPYSVLWCLRNDG-----RLAGLTY-VDLLQMRAWHRHT 660 Query: 531 ISDKHY---------VLSAASFPNDNRGG-TSLWMLV 557 I HY V S AS P RG L+M+V Sbjct: 661 IGGAHYDDTHGSQAKVESIASIP---RGTHDQLYMIV 694 >gi|220918520|ref|YP_002493824.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans 2CP-1] gi|219956374|gb|ACL66758.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans 2CP-1] Length = 825 Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 139/576 (24%), Positives = 211/576 (36%), Gaps = 117/576 (20%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVS---MPLMQEYRD----- 59 + SF+AGEL PRL R DL+ + G+ ++RN G ++ P ++E +D Sbjct: 8 QGSFAAGELGPRL-HGRHDLAKYQVGLRRARNFFLSPEGAALNRPGTPFVREAKDSAAGV 66 Query: 60 ---CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK--TPYT 114 RL P F FS G L FG ++ V +T P + Y+ TPY Sbjct: 67 DRGARLIP-----FIFSEDLGQAYELEFGQGYVRFHV-GGATIADPLNSAQPYELATPYL 120 Query: 115 FKDNKSLEYAVFGSTAVFVHKDHPPHHL--LYIQDGDKISFTFDEIKFLPPP----WLGD 168 D L+YA G K + P L L + + +FD +P P +LG Sbjct: 121 AADLPRLKYAQQGDVVTLTCKGYDPRELRRLAHDSWELVPLSFD----VPAPNGVVYLGV 176 Query: 169 GMISGVKSNAKLSISQADTSTARITSDMKIFK----PLDKGRSIRLGCHPPEWAKNTNYS 224 + V ++A Q I D + PL + R I +G W Y Sbjct: 177 EALENV-ADATHPARQWAWQVTEIWEDESGLQWETSPL-RVRKIAVGAGA-TWHTGFTYP 233 Query: 225 IGAYIVADDKVYRSLTTGRSGD-----RFGYSKGATY--------------VKDNNITWI 265 +GA + + ++S+ G G ATY V ++N Sbjct: 234 LGACVSYAGQFWQSVIADNRGHVPEAVMVGDPPAATYPYWTPVGAVPDPFAVYESNAPTD 293 Query: 266 TVLNLSSKTSRESASGA-----------------------------VAPYYVWGDIKDVS 296 VL +T + ASGA VA + GD D+S Sbjct: 294 VVL-FPDRTIKLWASGAWTGVDGSRLVGRRVYRGRGTVFGYVGEFEVAEFRDTGDTPDLS 352 Query: 297 KDGRSISVAPQSQ---TLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 PQ + T+F VV EQ PS VTFH R G+ Sbjct: 353 YS------PPQGRNPFTVFGPAGEVVRL------EQ---PSVVTFHAERRSLLGTAQRPA 397 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 +LS G +Y+F D A + + W +L+G + +W Sbjct: 398 HAFLSRTGDYYNFDRHTPALVDD---AFELELAGRLREEVRWAV-GAAALLIGTQSGVWA 453 Query: 414 LSIS----LSKGLSIDFRRVSGSGVYACP---PVSVGDCLVFVCGVGRRIK-YISGSTEQ 465 + L G + + S Y P P +VGD +++V G ++ + Q Sbjct: 454 IRPPSGEVLGPGKATAVPQSSAGSSYLDPLVVPSAVGDAVLYVRTKGSGVRDLVYDDGRQ 513 Query: 466 GFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVV 500 GF ++++ LA HLF I +QE+P S+ W+V Sbjct: 514 GFVGSDLSLLAKHLFTGYSIKAWTFQEDPWSVAWLV 549 >gi|320175038|gb|EFW50151.1| 12 [Shigella dysenteriae CDC 74-1112] Length = 799 Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 110/504 (21%), Positives = 204/504 (40%), Gaps = 71/504 (14%) Query: 27 LSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EYRDCRLDP-RSNRVFSFSIPDGG 78 ++ + + K N I +YG + + P + R CRL P + + V ++++ G Sbjct: 1 MAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQTYALEFGH 60 Query: 79 YALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHP 138 + V D L V+ S+ + A TPYT D +++ VH +P Sbjct: 61 QYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEADLFRIKFTQSADVLTLVHPAYP 111 Query: 139 PHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI-SQADTSTARITSDM 196 P L Y D ++ + +G + + +++ + A T T +T+ Sbjct: 112 PKELRRYAHDNWQLVDVVTK----------NGPFEDINIDESVTVYASASTGTITLTASA 161 Query: 197 KIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-RFGY 250 IF G+ L P P W + + SIG AD YR++T G++G R + Sbjct: 162 SIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIRRADSNYYRAVTAGKTGTLRPSH 220 Query: 251 SKGATYVKD-------NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 ++G ++ I W L+ +R +A+ IS Sbjct: 221 TEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAANGTT------------ATAEVIS 267 Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 P SQ + + S W W GYP V ++ RL F+ S +++ S G + Sbjct: 268 YIP-SQVVGEDNASY-KWAKYTWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDY 325 Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWLLSISLSKG 421 DF G +PT+ + ++ ++ + H G LV + ++++ +K Sbjct: 326 KDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLIDVGSLVALTSGGEYVITGDQNKV 379 Query: 422 L---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLAD 477 L S F +G PP++V + +FV G ++ ++ S + G++ N++T LA+ Sbjct: 380 LTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILAN 439 Query: 478 HLFNQR-ILQLVYQEEPHSIVWVV 500 HLF + I+ + P+S + + Sbjct: 440 HLFQKHSIVDWCFSIVPYSSAFCI 463 >gi|118590938|ref|ZP_01548338.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614] gi|118436460|gb|EAV43101.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614] Length = 810 Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 43/185 (23%), Positives = 81/185 (43%), Gaps = 13/185 (7%) Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380 W + AW G+P + +H NRL F+G+ + ++ S F +FS+ D A Sbjct: 386 WRLGAWSGTTGWPETIGWHKNRLAFAGTSEEPQKIWESQTEDFTNFSVSHVLKASD---A 442 Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKG----LSIDFRRVSGSGVYA 436 +T + + I W+ + ++VG ++ + + + ++D + + G Sbjct: 443 VTAGILSGQVNRIQWLVDDND-LIVGTTRAVRAVGKATDQDPYGPENVDQKPETNFGAND 501 Query: 437 CPPVSVGDCLVFVCGVG---RRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEP 493 P+ VG L++ G R + Y GS G ++++ HLF I YQ+ P Sbjct: 502 VSPIKVGSVLIYYGPYGTDMREMAYDFGS--DGRVSQAVSEVQSHLFQSGIAGACYQQYP 559 Query: 494 HSIVW 498 S++W Sbjct: 560 DSVIW 564 >gi|85059168|ref|YP_454870.1| hypothetical protein SG1190 [Sodalis glossinidius str. 'morsitans'] gi|84779688|dbj|BAE74465.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 662 Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 45/186 (24%), Positives = 84/186 (45%), Gaps = 15/186 (8%) Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL----DGEYGCYDPTK 379 S W + GYP VT + RL+ +GS +++ S GA+ F L D + Sbjct: 255 SVWTDNLGYPGAVTLYQQRLVLAGSPKYPQTIWWSETGAYLSFELGTKDDAAISFTLSSD 314 Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLV-GCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP 438 L V +T+ + GE + G D ++ +IS+ + S G Sbjct: 315 QLNPIVHLAQMNTLIALTYGGEFTITSGNDAAITPTNISV--------KNPSPYGCNRIR 366 Query: 439 PVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLADHLFNQRILQLVYQEEPHSI 496 P+ VG ++F+ GR++ ++ + + N++T LA+H+ + + YQ++P + Sbjct: 367 PLRVGTEILFIQRAGRKLYAVAYDPDSFVSYAANDLTVLAEHITAGGVRDMAYQQQPDGL 426 Query: 497 VWVVLE 502 +W+V E Sbjct: 427 IWLVRE 432 Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust. Identities = 50/197 (25%), Positives = 80/197 (40%), Gaps = 21/197 (10%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K +F+AGE+SPRL+ R D+ +A G +N + + G ++ P + + R Sbjct: 7 KTNFTAGEVSPRLM-GRVDIMRYANGAKAIQNGVVVVQGGVMRRPGTRFAAAAKYSDRPA 65 Query: 68 RVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSL 121 R+ + +L FGD L++ VV ++T + A +PY+ S+ Sbjct: 66 RLIPYVFNRSQAYVLEFGDGYLRVYQKGKPVVNANNTPYEIA-------SPYSADRLPSV 118 Query: 122 EYAVFGSTAVFVHKDHPPHHLL-------YIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 Y T VH P+ L ++ I FDEI+ P W V Sbjct: 119 NYVQGADTMFLVHPAVKPYRLQRRGQTDWVLEPAPFIVEPFDEIRETPKKWCRPSAKEFV 178 Query: 175 KSNAKLSISQADTSTAR 191 S L++S AD R Sbjct: 179 GSEVTLTLSDADPGENR 195 >gi|295096862|emb|CBK85952.1| hypothetical protein ENC_24250 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 662 Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 48/188 (25%), Positives = 84/188 (44%), Gaps = 23/188 (12%) Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383 S W + GYP VT + RL+ +GS +++ S G + F E G D T Sbjct: 255 SVWTNEFGYPGAVTLYQQRLVLAGSPKYPQTIWWSETGVYLSF----EIGTEDDDAISFT 310 Query: 384 AVTDFSASTIHW--MHPF------GEGVLV-GCDTSLWLLSISLSKGLSIDFRRVSGSGV 434 +D +H M+ GE + G D ++ +IS+ + S G Sbjct: 311 LSSDQLNPIVHLAQMNTLIALTYGGEFTITSGNDAAITPTNISV--------KNPSPYGC 362 Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLADHLFNQRILQLVYQEE 492 PV VG ++FV GR++ ++ + + N++T LA+H+ +L + YQ++ Sbjct: 363 NGIRPVRVGTEIMFVQRAGRKLYAVAYDPDSFVSYSANDMTVLAEHITAGGVLDMAYQQQ 422 Query: 493 PHSIVWVV 500 P + +W+V Sbjct: 423 PDAFIWMV 430 Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust. Identities = 48/192 (25%), Positives = 81/192 (42%), Gaps = 21/192 (10%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K +F+AGE+SPRL+ R D++ +A G N + + G +V P + + + + Sbjct: 7 KTNFTAGEVSPRLM-GRVDIARYANGAKIIENAVVVVQGGVVRRPGTRFAAATKHGDKKS 65 Query: 68 RVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSL 121 R+ + +L FGD ++I +V +T + A +PYT ++ Sbjct: 66 RLIPYVFNRSQAYMLEFGDGYMRIFQNGKQLVNEDNTPYEIA-------SPYTADMLPAV 118 Query: 122 EYAVFGSTAVFVHKDHPPHHLL-------YIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 Y T VH+ PH L ++ I FDE++ P W + V Sbjct: 119 NYVQGADTMFLVHQSVKPHRLQRRGQTDWVLEPAPFIVEPFDEVRDTPQKWCKPSVKEFV 178 Query: 175 KSNAKLSISQAD 186 S L++S AD Sbjct: 179 GSEITLTLSDAD 190 >gi|262043403|ref|ZP_06016528.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039229|gb|EEW40375.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 664 Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 45/184 (24%), Positives = 83/184 (45%), Gaps = 15/184 (8%) Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL----DGEYGCYDPTK 379 S W ++ GYP VT + RL+ +GS +++ S G + F L D + Sbjct: 257 SVWTDEFGYPGAVTLYQQRLVLAGSPRYPQTIWWSESGVYLSFELGTDDDDAISFTLSSD 316 Query: 380 ALTTAVTDFSASTIHWMHPFGE-GVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP 438 L V +T+ + GE + G D ++ +IS+ + S G Sbjct: 317 QLNPIVHLAQMNTLIALTYGGEFTITAGNDAAITPTNISV--------KNPSPYGCNGIR 368 Query: 439 PVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLADHLFNQRILQLVYQEEPHSI 496 PV VG ++FV GR++ ++ + + N++T LA+H+ ++ + YQ++P + Sbjct: 369 PVRVGTEIMFVQRSGRKLYAVAYDPDSYVAYSANDMTVLAEHITEGGVIDMAYQQQPDAF 428 Query: 497 VWVV 500 W+V Sbjct: 429 TWLV 432 Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 48/189 (25%), Positives = 79/189 (41%), Gaps = 21/189 (11%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K +F+AGE+SPRL+ R D+ +A G N + + G ++ P Q + + + Sbjct: 7 KTNFTAGEISPRLM-GRVDIDRYANGAKTLENSVVVVQGGVMRRPGSQFVAATKYGDKKS 65 Query: 68 RVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSL 121 R+ + +L FGD L+I +V +T + A +PYT S+ Sbjct: 66 RLIPYVFNRTQAYILEFGDGYLRIYQDGKQLVNDDNTPYEIA-------SPYTSDMLPSV 118 Query: 122 EYAVFGSTAVFVHKDHPPHHLL-------YIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 Y T VH+D P+ L ++ I FDE++ P W + V Sbjct: 119 NYVQGADTMFLVHQDVKPYRLQRRGQTDWVLEPAPFIVEPFDEVRDTPQKWCKPSVKEFV 178 Query: 175 KSNAKLSIS 183 S L++S Sbjct: 179 GSEITLTLS 187 >gi|303328570|ref|ZP_07359005.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861336|gb|EFL84275.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 696 Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 53/237 (22%), Positives = 101/237 (42%), Gaps = 19/237 (8%) Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 +PS V FH RL ++ + ++++LS G DF + A+ + A+ Sbjct: 277 WPSQVFFHQQRLGWAATANRPITIWLSRPG---DFEIMAASTPPKDDDAIEATLAATQAN 333 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFR-RVSGSGVYACPPVSVGDCLV 447 I W+ P + + G + S W LS L+ + F + + G A VSVG ++ Sbjct: 334 RIVWLQPDRQSLTFGTEGSEWTLSAGEGVALTPSNVSFEMQTANGGDNATQAVSVGGGVL 393 Query: 448 FVCGVGRRIKYIS-GSTEQGFRFNEITQLADHLFNQRILQL-VYQEEPHSIVWVVLEPKD 505 ++ G+ ++ + + + ++T LA H+ ++ +Q+EP++++W L Sbjct: 394 YLQRGGKAVRQFAYNYSADKYLGQDVTILARHILRDAVVTAWAFQQEPYAVLWCAL---- 449 Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAG 562 S L G + E + WH H + ++A D++ W LV G Sbjct: 450 -SDGTLAGLTYMPE-QDVMGWHRHDTDGRFEDVAAMPGTPDDQ----TWFLVRRGCG 500 Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust. Identities = 40/160 (25%), Positives = 66/160 (41%), Gaps = 16/160 (10%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPL-----MQEYRDCRL 62 ++ + GE++P L++ R D + G + RN +P+ G + P M RL Sbjct: 7 QNVLNGGEITP-LMRGRVDQPRYGTGAREMRNFVPMPQGGVTRRPGTRFLGMAHGDAARL 65 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 P F FS G +L FGDK L++ + K +++PY D L Sbjct: 66 IP-----FVFSATQG--RMLEFGDKTLRVWLPDGRLVADENGEPKVFESPYAVGDLHELR 118 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP 162 +A H+ + P L D D + + E+ F+P Sbjct: 119 FAQSADVVYLAHQGYAPRRLSRHADDD---WRWSELAFVP 155 >gi|262043657|ref|ZP_06016766.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038995|gb|EEW40157.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 758 Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 133/610 (21%), Positives = 220/610 (36%), Gaps = 110/610 (18%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K SF+AG LSP ++ + D A V +N IPL GP Q + S+ Sbjct: 8 KRSFNAGILSP-VMYGQVDFDKWASAVKYMKNFIPLPQGPARRRGGTQYAGSVK--NSSD 64 Query: 68 RV----FSFSIPDG-------GYALLVFGDKKL---QIVVVRSSTKWSPALFGKTYKTPY 113 RV F FS + GY F +L + ++ ST W + K Sbjct: 65 RVWLASFQFSTTEAFILEFGPGYIRFWFNHAQLLDDENNILEVSTPWGAGDLTRNGKFGL 124 Query: 114 TFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG-DGMIS 172 + + + + Y + ++P + L +++ E F P+ + S Sbjct: 125 SLQQSADVIYITC------TNGNYPVYKLTR---NTNTNWSLAEASFSGGPFADINSDKS 175 Query: 173 GVKSNAKLSISQAD-----------TSTARITSDMKIFKPLDKGRSIRLGC--------- 212 V + I D TS IT++ IF+ L G + Sbjct: 176 SVVYTDQFRIWSEDGNDLPDGTPTTTSLCNITANTDIFQALHVGCLFYIEASTDAVDDDT 235 Query: 213 ----HPPEWAKNTN--YSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266 + P WA T +S G + +D K Y + ++G+ TW Sbjct: 236 GHSGYIPAWAAGTTETFSTGVFCRSDGKYYEDMDGTKTGN-------------TQPTW-- 280 Query: 267 VLNLSSKTSRESASGAVAPYYV----WGDIK------DVSKDGRSISVAPQSQTLFQAGV 316 ++ R+ + G + + WG I+ S G+ ++ P S + Sbjct: 281 ----TAGAHRDGSGGDASLWRYSGGGWGIIEITAVNSATSATGKIVTELPPS--VRNTVG 334 Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376 + W + YP F RL+F+G + ++ S G +FS + Sbjct: 335 KTYKYAFGDWSDVLRYPQFAAFFRGRLVFAGRQ----KIWSSVAGDLQNFSPMTNGYEAE 390 Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW------LLSISLSKGLSIDFRRVS 430 ++ + D + T+ W+ + +G + L S+ + ++ Sbjct: 391 SDDSINDRIDD-TQDTMQWLVASAGKIFIGTAGYEFSYGEQSLTSVFGAGNTKVELNSTI 449 Query: 431 GSGVYACPPVSVGDCLVFVCGVGRRI---KYISGSTEQGFRFNEITQLADHLFNQRILQL 487 GS + D + FV GR++ Y SGS F LA HLF I+ L Sbjct: 450 GSNEVQAE--RLFDRVAFVQRAGRKVMIAAYDSGS--DSFSATNSCILAPHLFTSEIIAL 505 Query: 488 VYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDN 547 YQ+EP+ I+WV+LE +LLG + AE + WH H V S P+ + Sbjct: 506 AYQQEPNRILWVLLEEG-----KLLGLTYDAE-QNITGWHEHATGGA--VESIKVIPDID 557 Query: 548 RGGTSLWMLV 557 G LWM+V Sbjct: 558 GGRDELWMVV 567 >gi|290968641|ref|ZP_06560179.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781294|gb|EFD93884.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 1039 Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 44/190 (23%), Positives = 91/190 (47%), Gaps = 4/190 (2%) Query: 316 VSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCY 375 V V ++ S+W ++ GYP F +RL+F+G+K + S++ S G + +FS++ G Sbjct: 570 VPVDAFAFSSWNDRNGYPKLSCFFQDRLVFAGTKKEPYSLWFSRTGDYNNFSVEKAEGTV 629 Query: 376 DPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRV-SGSGV 434 A+ + + I + P + ++V + W++S + + +V + G Sbjct: 630 TEDSAIKLDLIVRNLYEIRHLVPSND-LIVLTSGNEWIISGDTAITPTKCTPKVQTMRGA 688 Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEE 492 C P +G+ L++V G I+ S + + +E+ A HL + +++ Y + Sbjct: 689 SNCKPWHIGNRLIYVQRDGGTIRDFGYSYDSDNYNGDELNLFASHLTKRHQMVSSAYCQN 748 Query: 493 PHSIVWVVLE 502 P+S ++ V E Sbjct: 749 PYSTLYFVRE 758 Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust. Identities = 39/187 (20%), Positives = 79/187 (42%), Gaps = 18/187 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N T++SF+ GE+SP + + R DL + + ++ N + YG + + Sbjct: 1 MQNVFITQNSFTTGEISPEVAE-RTDLEKYKSALLQAENAVVSPYGSVSRRTGSKYIGAI 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + + F LL G K +++ W + TP+ + K Sbjct: 60 KYADKEAVLVPFMDSSDRSYLLEVGYKYIRV--------WKDETMEQEIDTPFEYP--KE 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + G TA +P + LL+ + + F +P P+ D +IS +++ + + Sbjct: 110 LNFTQSGDTAFICSGRYPVYELLHGRYWELRKFD------IPKPYF-DDIISAIENVSDV 162 Query: 181 SISQADT 187 + +++DT Sbjct: 163 NYTESDT 169 >gi|298485990|ref|ZP_07004064.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159467|gb|EFI00514.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 716 Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust. Identities = 43/182 (23%), Positives = 88/182 (48%), Gaps = 12/182 (6%) Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383 S W + +GYPS T + RL+ +GS +++ S G + +F E G D A++ Sbjct: 311 SVWNDFDGYPSTGTLYEQRLVAAGSPNYPQTIWESRTGEYLNF----ELGTKD-DDAMSF 365 Query: 384 AVTDFSASTIHWMHPFGEGVLVGCD-TSLWLLSISLSKGLSIDFRRVSGSGVYACP---P 439 V+ + I MH LV + ++ + K ++ ++ VY C P Sbjct: 366 NVSSDQINPI--MHVGQVKALVTLTYGGEFTVTGGVEKPITPTNIQIKNQSVYGCNGVRP 423 Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQRILQLVYQEEPHSIVW 498 + +G+ L FV GR+++ ++ + + +++ L++H ++ + +Q+EP SI++ Sbjct: 424 IRIGNELYFVQRAGRKLRAMAYKYDSDSYGSPDMSVLSEHATKSGVVDMAFQQEPESILF 483 Query: 499 VV 500 +V Sbjct: 484 MV 485 >gi|291334457|gb|ADD94111.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] Length = 206 Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust. Identities = 31/126 (24%), Positives = 63/126 (50%), Gaps = 7/126 (5%) Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481 +I ++ S +G ++VG+ +F+ R+++ ++ + + G+ ++T LA+H+ Sbjct: 30 NILIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISE 89 Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 QL YQ+EP+ ++W V +L+G + E + AWH H+ S A Sbjct: 90 GGFKQLSYQQEPNQVIWGVRND-----GQLVGLTYQRE-QQVVAWHRHIFGGSAVCESVA 143 Query: 542 SFPNDN 547 + P D+ Sbjct: 144 TIPTDD 149 >gi|291334666|gb|ADD94313.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] Length = 189 Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats. Identities = 31/126 (24%), Positives = 63/126 (50%), Gaps = 7/126 (5%) Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481 +I ++ S +G ++VG+ +F+ R+++ ++ + + G+ ++T LA+H+ Sbjct: 31 NILIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISE 90 Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 QL YQ+EP+ ++W V +L+G + E + AWH H+ S A Sbjct: 91 GGFKQLSYQQEPNQVIWGVRNDG-----QLVGLTYQRE-QQVVAWHRHIFGGSAVCESVA 144 Query: 542 SFPNDN 547 + P D+ Sbjct: 145 TIPTDD 150 >gi|54302254|ref|YP_132247.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9] gi|46915675|emb|CAG22447.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9] Length = 919 Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust. Identities = 42/192 (21%), Positives = 78/192 (40%), Gaps = 10/192 (5%) Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376 S W + W GYP T+ RL + + +V+LS +F DFS D Sbjct: 408 STYKWAIEIWRNSTGYPRCGTYFQQRLSMANTISHPQTVWLSRTDSFNDFSKTRPILADD 467 Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID----FRRVSGS 432 ++ + + I + P +L+ LW L+ S + + + Sbjct: 468 ---SMRYDINSLQVNEIFNIVPL-NSLLLFTSGGLWSLAQDQQGAFSAESPPSVKMQNYE 523 Query: 433 GVYACPPVSVGDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLF-NQRILQLVYQ 490 G P+ G ++V R ++ I S + F ++T A HLF ++R+++ Y Sbjct: 524 GANKLRPIVAGSTAIYVQQGDRIVRDIQFSWSSDSFEGVDLTVRASHLFKHKRVVEWAYA 583 Query: 491 EEPHSIVWVVLE 502 + P ++WV+ + Sbjct: 584 KNPDKLIWVIFD 595 >gi|146276492|ref|YP_001166651.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC 17025] gi|145554733|gb|ABP69346.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC 17025] Length = 754 Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust. Identities = 47/195 (24%), Positives = 83/195 (42%), Gaps = 15/195 (7%) Query: 315 GVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG--EY 372 GV W AW ++ GYPS V + RL + + + +V+ S+ G F DF LDG + Sbjct: 308 GVPTYRWSEGAWSKRYGYPSTVEIYEQRLAAAATPSEPRTVWFSAVGDFQDF-LDGTEDD 366 Query: 373 GCYDPTKALTTAVTDF-----SASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFR 427 + T A +T+V A+ +H + GE +T ++ + F Sbjct: 367 QSFAYTVAGSTSVNRIINLQRGAAGLH-IFALGEEYSTRSETRSSVIGPK-----NAVFG 420 Query: 428 RVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEI-TQLADHLFNQRILQ 486 SG G P++ +F+ +R+ + S +Q + + ++ A H+ Q Sbjct: 421 LDSGVGSSTAKPITPSGNPIFISRDRKRVLEMVYSLDQDRPVSRVLSRTAQHVGGAGFEQ 480 Query: 487 LVYQEEPHSIVWVVL 501 +V+Q P W+ L Sbjct: 481 IVWQAAPEPTAWLRL 495 >gi|226940469|ref|YP_002795543.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9] gi|226715396|gb|ACO74534.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9] Length = 874 Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust. Identities = 58/271 (21%), Positives = 114/271 (42%), Gaps = 22/271 (8%) Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSK-DGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 + L+ S + A+ P + G I V+ +G S AP + G S ++ Sbjct: 399 VQLAVTDSGGGSGAALEPVIIDGAITAVNVINGGSGYFAPVVSVSYAGGGSGATFGQPVV 458 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK---ALTT 383 YP V++ R F+G+ +++++ G + G P + + Sbjct: 459 KSSGDYPGAVSYFEQRRCFAGTTRKPQNIWMTKSGT------ESNMGYSLPVRDDDRIAF 512 Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL---SIDFRRVSGSGVYACPPV 440 V+ A+TI + P + +L+ ++ W ++ S + SI R S G PV Sbjct: 513 RVSAREANTIRHIVPLAQ-LLLLTSSAEWRVTSVNSDAITPRSISVRPQSYIGASNVQPV 571 Query: 441 SVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVW 498 + + L++ G ++ ++ + + GF +++ A HLF+ I+ + + + P +VW Sbjct: 572 IINNTLIYASARGGHVRELAYNWQAGGFVTGDLSIRAPHLFDDFEIVDMAFGKSPQPVVW 631 Query: 499 VVLEPKDNSFPRLLGCRFSAEGEGDFAWHTH 529 V +S L+G + E + AWH H Sbjct: 632 FV-----SSSGCLIGLTYVPEQQVG-AWHWH 656 >gi|296532340|ref|ZP_06895077.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267336|gb|EFH13224.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 626 Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust. Identities = 40/175 (22%), Positives = 74/175 (42%), Gaps = 10/175 (5%) Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380 W +A+ G+P FH +RL+ GS+ ++LS G ++F L G +A Sbjct: 222 WDEAAFSAVRGWPVTACFHQDRLVLGGSRDLPNRLWLSRSGDLFNFDL----GSGLDDQA 277 Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSG---VYAC 437 + + + I + G + V + W+++ SI R + G Sbjct: 278 IEFGLLSDQVNAIRAVFS-GRHLQVFTSGAEWMVTGEPMTPASIQLHRQTRIGSPVARII 336 Query: 438 PPVSVGDCLVFVCGVGRRI-KYISGSTEQGFRFNEITQLADHLFNQRILQLVYQE 491 PPV V +FV G+ + +Y +Q ++ N++ +A HL Q + + Y + Sbjct: 337 PPVDVDGSTIFVARSGQAVHEYAYTDVQQAYQANDLALVARHLV-QTPVSMAYDQ 390 >gi|323699364|ref|ZP_08111276.1| hypothetical protein DND132_1955 [Desulfovibrio sp. ND132] gi|323459296|gb|EGB15161.1| hypothetical protein DND132_1955 [Desulfovibrio desulfuricans ND132] Length = 698 Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust. Identities = 49/198 (24%), Positives = 88/198 (44%), Gaps = 14/198 (7%) Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSGVYA 436 A+ ++ A+ I ++ P + +G W LS S S L+ + + G Sbjct: 330 AIEVTLSGRQANAIEFIVPR-RALWIGTAGGEWTLSASSSDPLTPSNVKAAQEGTGGASG 388 Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHS 495 P +VG ++V GR+I+ +S E + ++T L++H+ + QL Y +EP S Sbjct: 389 VRPEAVGFAALYVQRAGRKIREMSYRYESDAYVSKDLTLLSEHITEGGLTQLAYVQEPDS 448 Query: 496 IVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWM 555 I++ V + + L+ + + E A + +++D V AAS ND LW+ Sbjct: 449 ILYGV---RGDGI--LVALTYVPDQE--VAAWSRIVTDG-VVERAASVYNDAEKRDELWI 500 Query: 556 LVALSA-GEERSFTVRLN 572 V + GE R + L Sbjct: 501 TVLRTVNGETRRYVEYLE 518 Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust. Identities = 18/45 (40%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL 368 AWGE + YPS V F+ RL+ + ++ +++LS G F DF L Sbjct: 164 EAWGEND-YPSAVCFYEQRLVLAATRSRPATLWLSRTGEFSDFRL 207 >gi|325971691|ref|YP_004247882.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy] gi|324026929|gb|ADY13688.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy] Length = 551 Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust. Identities = 37/144 (25%), Positives = 65/144 (45%), Gaps = 12/144 (8%) Query: 14 GELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFS 73 GE+SP+L R DL ++ QG ++ + G + P ++ R F+ Sbjct: 11 GEISPKL-GGRLDLEMNTQGCEILKDFRNMLQGGITRRPPLKHVAQTV----RGRTIPFT 65 Query: 74 IPDGGYALLVFGDKKLQI----VVVRSSTKWSPALFGKTY-KTPYTFKDNKSLEYAVFGS 128 + G L+ +KKL++ V+ + + P+ G Y T Y D S++YA + Sbjct: 66 LSSGESFLVELSNKKLRVWRKGVLGFYTVTFLPS--GNDYLPTDYLEADVWSIQYAQYYD 123 Query: 129 TAVFVHKDHPPHHLLYIQDGDKIS 152 VHKD+ PH ++Y + + S Sbjct: 124 RLYLVHKDYQPHVVVYAAEAFQFS 147 >gi|144898783|emb|CAM75647.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 635 Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust. Identities = 40/165 (24%), Positives = 68/165 (41%), Gaps = 11/165 (6%) Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380 W A G+P V FH +RL+ GS+ ++LS ++F L G +A Sbjct: 230 WEEQALSAVRGWPVSVCFHQDRLVIGGSRDQPNRLWLSKSSDLFNFDL----GEALDDEA 285 Query: 381 LTTAVTDFSASTIHWMHPF-GEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV---YA 436 + A+ + I H F G + V + W++S SI R + G Sbjct: 286 IEFALLSDQVNAIR--HVFSGRHLQVFTSGAEWMVSGQPLTPSSIQLTRQTRVGSPIDRT 343 Query: 437 CPPVSVGDCLVFVCGVGRRIK-YISGSTEQGFRFNEITQLADHLF 480 PP V +FV G+ ++ ++ EQ ++ ++ LA H+ Sbjct: 344 VPPRDVDGATLFVSRNGKDLREFLFADVEQAYQSGDLAMLAKHVM 388 >gi|209966375|ref|YP_002299290.1| hypothetical protein RC1_3113 [Rhodospirillum centenum SW] gi|209959841|gb|ACJ00478.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 638 Score = 41.6 bits (96), Expect = 0.33, Method: Compositional matrix adjust. Identities = 37/150 (24%), Positives = 60/150 (40%), Gaps = 14/150 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K +F+ GELSP LL R DL + G RN++ L G + P Sbjct: 1 MTRLRSVKAAFTGGELSPDLL-GRGDLRSYETGALALRNVLILPTGGVTRRPGTAYLATL 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 P R+ +F+ LL F D++L++ ++ +TP+T Sbjct: 60 ---PGPGRLAAFAFDTEQAYLLAFTDRRLEVFRDGATE--------AVLETPWTAGQLAQ 108 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDK 150 L + + H D PP + ++ GD+ Sbjct: 109 LAWTQSADVLLVCHPDVPPRRI--VRSGDR 136 >gi|291334718|gb|ADD94364.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] Length = 135 Score = 41.6 bits (96), Expect = 0.36, Method: Composition-based stats. Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 6/82 (7%) Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525 G+ ++T LA+H+ QL YQ+EP+ ++W V + +L+G + E + A Sbjct: 21 GYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGV-----RNDGQLVGLTYQRE-QQVVA 74 Query: 526 WHTHMISDKHYVLSAASFPNDN 547 WH H+ S A+ P D+ Sbjct: 75 WHRHIFGGSAVCESVATIPTDD 96 >gi|83313369|ref|YP_423633.1| hypothetical protein amb4270 [Magnetospirillum magneticum AMB-1] gi|82948210|dbj|BAE53074.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 634 Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust. Identities = 43/172 (25%), Positives = 71/172 (41%), Gaps = 18/172 (10%) Query: 326 WGEQ-----EGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF----SLDGEYGCYD 376 W EQ G+P V FH RL GS+G ++LS ++F LD E + Sbjct: 229 WEEQSFSPLRGWPVSVCFHQGRLAIGGSRGLPNRLWLSKSMDLFNFDLGTGLDDEAIEFS 288 Query: 377 PTKALTTAVTD-FSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVY 435 A+ FS + E ++VG + L I L++ RV Sbjct: 289 LLSTQVDAIRAVFSGRHLQVFTSGAEWMVVG--SPLTPTKIQLNRQT-----RVGSPVDR 341 Query: 436 ACPPVSVGDCLVFVCGVGRRIK-YISGSTEQGFRFNEITQLADHLFNQRILQ 486 + PP V FV GR ++ ++ +Q ++ N+++ +A H+ N + Q Sbjct: 342 SVPPRDVDGATHFVSRSGRDLREFLFADVDQAYQANDLSMVAKHVMNTPVDQ 393 >gi|291334514|gb|ADD94167.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291336446|gb|ADD96001.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 153 Score = 41.2 bits (95), Expect = 0.42, Method: Composition-based stats. Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 6/82 (7%) Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525 G+ ++T LA+H+ QL YQ+EP+ ++W V +L+G + E + A Sbjct: 21 GYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG-----QLVGLTYQRE-QQVVA 74 Query: 526 WHTHMISDKHYVLSAASFPNDN 547 WH H+ S A+ P D+ Sbjct: 75 WHRHIFGGSAVCESVATIPTDD 96 >gi|167032763|ref|YP_001667994.1| hypothetical protein PputGB1_1755 [Pseudomonas putida GB-1] gi|166859251|gb|ABY97658.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 774 Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust. Identities = 39/174 (22%), Positives = 69/174 (39%), Gaps = 16/174 (9%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 + SFSAGE++P +R DL+ + + RN + L G + + + + Sbjct: 6 QPSFSAGEVAPATY-ARVDLARYYTALKTCRNFVVLPEGGAQNRSGTRFITEVKDSAART 64 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 R+ F +L FG+ ++ + VV T + A +PYT L+ Sbjct: 65 RLIPFQFSTEQTYILEFGNLYIRFISMGGQVVSGVTPYEIA-------SPYTTAQLPDLK 117 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 + VH DHPP L + ++T I F P G+++ ++ Sbjct: 118 FTQSADVMTIVHPDHPPRELSRLA---PTNWTLTAITFEPGIAAPTGLVATART 168 >gi|317152064|ref|YP_004120112.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2] gi|316942315|gb|ADU61366.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2] Length = 698 Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust. Identities = 31/125 (24%), Positives = 60/125 (48%), Gaps = 5/125 (4%) Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL---SIDFRRVSGSGVYA 436 A+ ++ A+ I ++ G+ + VG W L SL + SI + G A Sbjct: 330 AIEVTLSGRQANAIEFLVARGK-LWVGTAGGEWTLGGSLGDPVTPESIKASQEGSCGASA 388 Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHS 495 P +VG +++ GR+I+ ++ E + ++T L++H+ + Q+ Y +EP S Sbjct: 389 TRPEAVGFATLYIQRAGRKIREMAYRYESDAYVSRDLTILSEHITKPGLTQMAYVQEPDS 448 Query: 496 IVWVV 500 I++ V Sbjct: 449 ILYCV 453 >gi|288959323|ref|YP_003449664.1| hypothetical protein AZL_024820 [Azospirillum sp. B510] gi|288911631|dbj|BAI73120.1| hypothetical protein AZL_024820 [Azospirillum sp. B510] Length = 632 Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust. Identities = 38/141 (26%), Positives = 56/141 (39%), Gaps = 12/141 (8%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67 K +F+AGE+S RLL R DL + G RNL P + L P Sbjct: 9 KTNFTAGEVSRRLL-GRGDLKAYDNGALALRNLF---IDPTGGVTRRSGLAFTALAPGDG 64 Query: 68 RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 R+ +F LLVF D++ I V + ++ + + P+T + + Sbjct: 65 RLVAFERNSEQTYLLVFTDRR--IDVFQGGSRLA------SVAAPWTLTQLAQITWTQSA 116 Query: 128 STAVFVHKDHPPHHLLYIQDG 148 T + H D PP L DG Sbjct: 117 DTLLVCHPDLPPRKLTRGDDG 137 >gi|41179374|ref|NP_958682.1| Bbp13 [Bordetella phage BPP-1] gi|45569506|ref|NP_996575.1| hypothetical protein BMP-1p12 [Bordetella phage BMP-1] gi|45580757|ref|NP_996623.1| hypothetical protein BIP-1p12 [Bordetella phage BIP-1] gi|40950113|gb|AAR97679.1| Bbp13 [Bordetella phage BPP-1] Length = 681 Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust. Identities = 46/202 (22%), Positives = 87/202 (43%), Gaps = 13/202 (6%) Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 YP+ V++ R F+G+ +++++ G + ++ D + + V A+ Sbjct: 271 YPAAVSYFEQRRCFAGTTNKPQNIWMTRSGT--ESAMSYSLPVRDDDR-VAFRVAAREAN 327 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSIS--LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFV 449 I + P E +L+ + S++ +I R S G PV V + ++ Sbjct: 328 AIRHIVPLTELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGATDVQPVVVNNTTIYG 387 Query: 450 CGVGRRIKYISGS-TEQGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNS 507 G ++ ++ + GF +++ A HLF N IL + Y + P IVW + +S Sbjct: 388 AARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFI-----SS 442 Query: 508 FPRLLGCRFSAEGEGDFAWHTH 529 +LLG + E + AWH H Sbjct: 443 SGKLLGLTYVPEQQIG-AWHQH 463 >gi|291336965|gb|ADD96491.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C1587] Length = 474 Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust. Identities = 17/54 (31%), Positives = 26/54 (48%) Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 G + + +W +GYP TFH RL F G K +++ S F+DF+ Sbjct: 243 GGTFIDGGYEDSWSGSKGYPRTATFHEGRLYFGGVKSRPNTIFASRVARFFDFN 296 >gi|291336926|gb|ADD96454.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787] Length = 158 Score = 38.1 bits (87), Expect = 3.8, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 45/79 (56%), Gaps = 1/79 (1%) Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481 +I ++ S +G ++VG+ +F+ R+++ ++ + + G+ ++T LA+H+ Sbjct: 60 NILIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISE 119 Query: 482 QRILQLVYQEEPHSIVWVV 500 QL YQ+EP+ ++W V Sbjct: 120 GGFKQLSYQQEPNQVIWGV 138 >gi|187476936|ref|YP_784960.1| phage protein [Bordetella avium 197N] gi|115421522|emb|CAJ48031.1| phage protein [Bordetella avium 197N] Length = 681 Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust. Identities = 46/202 (22%), Positives = 83/202 (41%), Gaps = 13/202 (6%) Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 YP+ V++ R F+G+ +++++ G S D + V A+ Sbjct: 271 YPAAVSYFEQRRCFAGTINKPQNIWMTRSGTESAMSYSLPVRSDD---RVAFRVAAREAN 327 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSIS--LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFV 449 I + P E +L+ + S++ +I R S G PV V + ++ Sbjct: 328 AIRHIVPLTELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGATDVQPVVVNNTAIYG 387 Query: 450 CGVGRRIKYISGS-TEQGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNS 507 G ++ ++ + GF +++ HLF N IL + Y + P IVW + +S Sbjct: 388 AARGGHVRELAYNWQANGFVTGDLSLRCAHLFDNLNILDMAYAKAPQPIVWFI-----SS 442 Query: 508 FPRLLGCRFSAEGEGDFAWHTH 529 +LLG + E + AWH H Sbjct: 443 SGKLLGLTYVPEQQIG-AWHQH 463 Searching..................................................done Results from round 2 >gi|254781208|ref|YP_003065621.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] gi|254040885|gb|ACT57681.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62] gi|317120673|gb|ADV02496.1| hypothetical protein SC1_gp080 [Liberibacter phage SC1] gi|317120817|gb|ADV02638.1| hypothetical protein SC1_gp080 [Candidatus Liberibacter asiaticus] Length = 578 Score = 687 bits (1772), Expect = 0.0, Method: Composition-based stats. Identities = 578/578 (100%), Positives = 578/578 (100%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC Sbjct: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS Sbjct: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL Sbjct: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT Sbjct: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR Sbjct: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF Sbjct: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK Sbjct: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF Sbjct: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA Sbjct: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK Sbjct: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578 >gi|212710810|ref|ZP_03318938.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM 30120] gi|212686507|gb|EEB46035.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM 30120] Length = 818 Score = 549 bits (1414), Expect = e-154, Method: Composition-based stats. Identities = 117/585 (20%), Positives = 221/585 (37%), Gaps = 53/585 (9%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 + + SFS GE++P L R DL+ ++ + K N + +YG + + P + + Sbjct: 4 SIIQPSFSGGEIAPSLY-GRIDLAKYSTALRKCENFLVRQYGGIENRPGTKFIAAAKYPN 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKDNKSL 121 + R+ F L GDK ++++ ++ TPY D +L Sbjct: 63 KKCRLIPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEHKGEIFELTTPYKEADLFNL 122 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 ++ VH D+PP L D + ++ P+ + K Sbjct: 123 KFTQSADVMTIVHADYPPMELQRYDHDD---WKLVPVETRNGPFEDINVDKERKVYV--- 176 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYR 237 A T +T+ IF G+ I + P W + A YR Sbjct: 177 --SASTGEVTLTATHNIFGAELVGKQIYIEQQAVDAVPVWETDKTTIKNDQRRAGSNYYR 234 Query: 238 SLTTGRSGD-RFGYSKGATYVKDN---NITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293 + T+G+SG R +++G ++ I W + S G V V D Sbjct: 235 ANTSGKSGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVKINSVSTDG- 284 Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 ++ G+ IS P + + W S W + +GYPS V ++ RL F+GS+ Sbjct: 285 -LTATGKVISYIPSNAV--GESNATYKWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYPQ 341 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 +++ S G + DF + D + + I + G V + + Sbjct: 342 TIWASRSGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYQ 397 Query: 414 LSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 ++ +K S F +G PP++V + +++ G ++ ++ S + G++ Sbjct: 398 ITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQG 457 Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 ++T +A+HLF +I+ + P+SI W + + +LL + E + FAW Sbjct: 458 TDLTIMANHLFQRHQIIDWAFTIVPYSIAWCIRDDG-----KLLSLTYLRE-QQVFAWAP 511 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGE-ERSFTVRLN 572 + S S +++ +V G+ + RL+ Sbjct: 512 QDTDGQF--ESTCSI--SEGNEDAVYFIVCRKVGDGTVRYIERLS 552 >gi|268589382|ref|ZP_06123603.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131] gi|291315409|gb|EFE55862.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131] Length = 818 Score = 549 bits (1413), Expect = e-154, Method: Composition-based stats. Identities = 115/586 (19%), Positives = 224/586 (38%), Gaps = 55/586 (9%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 + + SFS GE++P L R DL+ ++ + K N I +YG + + P + + Sbjct: 4 SIIQPSFSGGEIAPSLY-GRIDLAKYSTALRKCSNFIVRQYGGIENRPGTKFIAAAKYPN 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF---GKTYKTPYTFKDNKSL 121 + R+ F L GDK ++++ ++ + TPY D +L Sbjct: 63 KKCRLIPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEYKGEIFELATPYKEADLFNL 122 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 ++ VH D+PP L D + ++ P+ + ++ + Sbjct: 123 KFTQSADVMTIVHADYPPMELQRYDHDD---WKLVPVETRNGPF------EDINTDKERK 173 Query: 182 IS-QADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVY 236 + A T +++ IF G+ I + P W + +I A Y Sbjct: 174 LYVSASTGDVTLSATHNIFGAELVGKQIYIEQQAIDAVPVWETDKTTNINDQRRAGANYY 233 Query: 237 RSLTTGRSGD-RFGYSKGATYVKDN---NITWITVLNLSSKTSRESASGAVAPYYVWGDI 292 R+ T G+SG R +++G ++ I W + S G V V D Sbjct: 234 RANTAGKSGTLRPSHTEGMSWDGWGGDAGIQWEYL---------HSGFGIVKINSVSTDG 284 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 ++ G+ + P + + W S W + +GYPS V ++ RL F+GS+ Sbjct: 285 --LTATGKVVLYIPSNAV--GEENATYKWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYP 340 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 +++ S G + DF + D + + I + G V + + Sbjct: 341 QTIWASRSGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEY 396 Query: 413 LLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFR 468 ++ +K S F +G PP++V + +++ G ++ ++ S + G++ Sbjct: 397 QITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQ 456 Query: 469 FNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527 ++T +A+HLF +I+ + P+SI W + + +LL + E + FAW Sbjct: 457 GTDLTIMANHLFQRHQIIDWAFSIVPYSIAWCIRDDG-----KLLSLTYLRE-QQVFAWA 510 Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLV-ALSAGEERSFTVRLN 572 + S S +++ +V G + RL+ Sbjct: 511 PQETDGQF--ESTCSV--SEGNEDAVYFIVCRKVGGGTVRYIERLS 552 >gi|227355852|ref|ZP_03840245.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164171|gb|EEI49068.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 820 Score = 541 bits (1394), Expect = e-152, Method: Composition-based stats. Identities = 123/584 (21%), Positives = 222/584 (38%), Gaps = 53/584 (9%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 + + SFS GE++P L R DL+ ++ + K N I +YG + + P + + + Sbjct: 4 SLIQPSFSGGEIAPSLY-GRVDLAKYSTALRKCHNFIVRQYGGVENRPGTRFIAETKYQN 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKDNKSL 121 + +R+ F L FGD+ +++ ++ TPY D L Sbjct: 63 KKSRLIPFQFSTVQTYALEFGDRYIRVFKDGGQVLYADGEHKGEVFELATPYKEADLFDL 122 Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181 +Y VH D+PP L D + ++ P+ +K A Sbjct: 123 KYTQSADVMTIVHTDYPPMELQRYDHDD---WKLVSVETKNGPFEDINTDKAMKVYA--- 176 Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYR 237 A T +TS IF G+ L P W + ++ AD YR Sbjct: 177 --SASTGQITLTSTHDIFGSEQIGKQFYLEQRDIDAVPVWETDKTTNLNDQRRADSNYYR 234 Query: 238 SLTTGRSGD-RFGYSKGATYVKDN---NITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293 + + G++G R +++G ++ I W + S G V V D K Sbjct: 235 ANSGGKTGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVKIETVSEDGK 285 Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 + G+ +S P + + W + W + +GYPS V ++ RL F+GS+ Sbjct: 286 --TATGKVLSYIPSNAV--GEDNASHKWARAVWNDVDGYPSTVVYYQQRLFFAGSRAYPQ 341 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 +++ S G + DF + D + + I + G V + + Sbjct: 342 TIWASRSGDYKDFGRNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYQ 397 Query: 414 LSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 ++ +K S +G PP+SV + +++ G ++ +S S + G++ Sbjct: 398 ITGDQNKVLTPSSFSMSSQGANGSSDLPPISVANIALYIQEKGSAVRDLSYSFDVDGYQG 457 Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 ++T LA+HLF RI+ + P+SI W + + +L + E + FAW Sbjct: 458 TDLTMLANHLFQRHRIVDWSFTTVPYSIAWCIRDDG-----LMLALTYLRE-QQVFAWAP 511 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRL 571 K S S S + +V + G++ + RL Sbjct: 512 QSTEGKF--ESTCSI--SEGNEDSAYFIVQRTVNGKQVRYVERL 551 >gi|30387391|ref|NP_848220.1| hypothetical protein epsilon15p12 [Enterobacteria phage epsilon15] gi|30266046|gb|AAO06075.1| 12 [Salmonella phage epsilon15] Length = 825 Score = 539 bits (1389), Expect = e-151, Method: Composition-based stats. Identities = 114/579 (19%), Positives = 216/579 (37%), Gaps = 41/579 (7%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 +W + SF+ GE+ P L R D+S + + K N I +YG + + P + + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 R R+ F L FG +++ + + + + PY D +++ Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHNYMRV-IKDGAYVLTTSNVIYELAMPYADTDLFRIKFT 121 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 VH +PP L ++ ++ P+ + VK A Sbjct: 122 QSADVLTLVHPAYPPKELRRYAHD---NWQIVDVTTKNGPFEDINVDETVKVYA-----S 173 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLT 240 A T T +T+ IF G+ L P W + +I AD YR+ T Sbjct: 174 ASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYYRANT 233 Query: 241 TGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 +G++G R +++G ++ S G V GD ++ Sbjct: 234 SGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWE--YLHSGFGIAKITAVAGDG--LTATA 289 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 +S P + + + W AW GYPS V ++ RL F+ S +++ S Sbjct: 290 DVVSFIPSQ--VVGSANASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASR 347 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419 G + DF + D + + I + G V + + +S + Sbjct: 348 TGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGNLVAL-TSGGEYTISGDQN 403 Query: 420 KGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475 K L+ F +G PP++V + +F+ G ++ ++ S + G++ ++T L Sbjct: 404 KVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTIL 463 Query: 476 ADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 A+HLF I+ + P+S + + + +LL + + + FAW + K Sbjct: 464 ANHLFQKHSIVDWSFCIVPYSSAFCIRDDG-----KLLVLTYLRD-QQVFAWAPQSSAGK 517 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 + S S +++ +V + G+ + RL+ Sbjct: 518 Y--ESTCSI--SEGSEDAVYFVVNRTINGQTVRYIERLS 552 >gi|215487813|ref|YP_002330244.1| hypothetical protein E2348C_2746 [Escherichia coli O127:H6 str. E2348/69] gi|215265885|emb|CAS10294.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 825 Score = 538 bits (1386), Expect = e-151, Method: Composition-based stats. Identities = 115/579 (19%), Positives = 213/579 (36%), Gaps = 41/579 (7%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 +W + SF+ GE+ P L R D+S + + K N I +YG + + P + + Sbjct: 4 SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 R R+ F L FG +++ + + + PY D +++ Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHNYMRV-IKDGEYVLTTSNVIYELAMPYADTDLFRIKFT 121 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 VH +PP L ++ ++ P+ + VK A Sbjct: 122 QSADVLTLVHPAYPPKELRRYAHD---NWQIVDVTTKNGPFEDINVDDTVKVYA-----S 173 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLT 240 A T T +T+ IF G+ L P W + +I AD YR+ T Sbjct: 174 ASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYYRANT 233 Query: 241 TGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 G++G R +++G ++ S G V GD ++ Sbjct: 234 AGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWE--YLHSGFGIAKITAVSGDG--LTATA 289 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 +S P + + + W AW GYPS V ++ RL F+ S +++ S Sbjct: 290 DVVSFIPSQ--VVGSANASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASR 347 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419 G + DF + D + + I + G V + + +S + Sbjct: 348 TGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGNLVAL-TSGGEYTISGDQN 403 Query: 420 KGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475 K L+ F +G PP++V + +F+ G ++ ++ S + G++ ++T L Sbjct: 404 KVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTIL 463 Query: 476 ADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 A+HLF I+ + P+S + + + +LL + + + FAW S K Sbjct: 464 ANHLFQKHSIVDWSFCIVPYSSAFCIRDDG-----KLLVLTYLRD-QQVFAWAPQSSSGK 517 Query: 535 HYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 + S S +++ +V G+ + RL+ Sbjct: 518 Y--ESTCSI--SEGSEDAVYFVVNRNINGQTVRYIERLS 552 >gi|315122895|ref|YP_004063384.1| hypothetical protein CKC_05755 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496297|gb|ADR52896.1| hypothetical protein CKC_05755 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 588 Score = 537 bits (1382), Expect = e-150, Method: Composition-based stats. Identities = 224/583 (38%), Positives = 339/583 (58%), Gaps = 25/583 (4%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +TK SF+ GE+SP+++QSR DL LH+QG+++ N+IPL+ G LV P + Y Sbjct: 1 MPKGAYTKRSFAGGEVSPQIMQSRSDLELHSQGLSQCFNMIPLQDGSLVRRPPLYRYEHI 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P+++R+ SF++ L +FG+KK+ V V T P F + Y TPY+F++ + Sbjct: 61 DLPPKASRILSFALGGDDAVLFIFGEKKMVYVEV---TGIKPPQFIRFYDTPYSFREAEQ 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L+ A G+ V VH H P+ + + + G F+++ F PPPWLG + G K +AKL Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGLREVGGKKHDAKL 173 Query: 181 SISQADT--STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 ++ + T +TS + IFK D GR +RLG P +W NT Y A++ KVYR Sbjct: 174 RVTLSATRKGKITVTSTLPIFKTKDVGRMLRLGWLPKDWTANTLYPENAFMQMYGKVYRC 233 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291 +T G SG F ++ TY++D +TW + ++ K++ + PYYVWG+ Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 I + + +++ V S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +VY S + F DFS D G D K+L+ A+TD + S I W P +G+++G DTSL Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412 Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471 W++ + +G ++ RR++G GVY PP+S+GD L+FV G GRRI+ I G++EQGF+F E Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 +TQ DHL + RI QL YQE+P+S++WV+ N+ LLGC A + +WH H + Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLGCSLHANSKEKGSWHVHKL 527 Query: 532 SDKHY-VLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571 + ++S +S ++G T++W+L+ G RL Sbjct: 528 GGRGVKIMSLSSCLCLDQGETTVWLLLRRMNEDGVSSIGLERL 570 >gi|89152436|ref|YP_512269.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10] gi|74055459|gb|AAZ95908.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10] Length = 823 Score = 529 bits (1362), Expect = e-148, Method: Composition-based stats. Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + + A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESLTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G V G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAVNG- 282 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|294493191|gb|ADE91947.1| conserved hypothetical protein [Escherichia coli IHE3034] Length = 823 Score = 529 bits (1361), Expect = e-148, Method: Composition-based stats. Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G V G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAVNG- 282 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ ++ + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVINRTVNGQTVRYIERLS 550 >gi|327252176|gb|EGE63848.1| phage protein [Escherichia coli STEC_7v] Length = 823 Score = 529 bits (1361), Expect = e-148, Method: Composition-based stats. Identities = 118/587 (20%), Positives = 218/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|300898435|ref|ZP_07116776.1| conserved domain protein [Escherichia coli MS 198-1] gi|300357902|gb|EFJ73772.1| conserved domain protein [Escherichia coli MS 198-1] Length = 823 Score = 528 bits (1360), Expect = e-148, Method: Composition-based stats. Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G V G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAVNG- 282 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + +++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGSDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|301046400|ref|ZP_07193560.1| conserved domain protein [Escherichia coli MS 185-1] gi|300301626|gb|EFJ58011.1| conserved domain protein [Escherichia coli MS 185-1] Length = 821 Score = 528 bits (1359), Expect = e-147, Method: Composition-based stats. Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSINGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K L+ F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKALTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|323156125|gb|EFZ42284.1| phage protein [Escherichia coli EPECa14] Length = 823 Score = 527 bits (1356), Expect = e-147, Method: Composition-based stats. Identities = 118/587 (20%), Positives = 218/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIHPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T++ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTANASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|332344346|gb|AEE57680.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 823 Score = 525 bits (1353), Expect = e-147, Method: Composition-based stats. Identities = 118/587 (20%), Positives = 218/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWDSINGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLAPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|315121933|ref|YP_004062422.1| hypothetical protein CKC_00915 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495335|gb|ADR51934.1| hypothetical protein CKC_00915 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 588 Score = 525 bits (1352), Expect = e-147, Method: Composition-based stats. Identities = 225/583 (38%), Positives = 337/583 (57%), Gaps = 25/583 (4%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +TK SF+ GE+SP+++QSR DL LH+QG+++ N+IPL G LV P + Y Sbjct: 1 MPKGAYTKRSFAGGEVSPQIIQSRSDLELHSQGLSQCFNMIPLSDGSLVRRPPLHRYEHI 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P+++R+ SF++ L +FG+KK+ V V T P F + Y TPY+F++ + Sbjct: 61 DLPPKASRILSFALGGDEAVLFIFGEKKMVYVEV---TGIKPPQFIRFYGTPYSFREAEQ 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L+ A G+ V VH H P+ + + + G F+++ F PPPWLG + G K +AKL Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGRREVGGKKHDAKL 173 Query: 181 SISQADT--STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 ++ + T +TS + IFKP D GR + LG P +W NT Y A++ KVYR Sbjct: 174 RVTLSATRKGKITVTSTLPIFKPKDVGRMLCLGWLPKDWTANTLYPENAFMQMYGKVYRC 233 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291 +T G SG F ++ TY++D +TW + ++ K++ + PYYVWG+ Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 I + + +++ V S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +VY S + F DFS D G D K+L+ A+TD + S I W P +G+++G DTSL Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412 Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471 W++ + +G ++ RR++G GVY PP+S+GD L+FV G GRRI+ I G++EQGF+F E Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 +TQ DHL + RI QL YQE+P+S++WV+ N+ LL C A + +WHTH Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLSCSLHANSKEKGSWHTHKS 527 Query: 532 SDK-HYVLSAASFPNDNRGGTSLWMLVALS--AGEERSFTVRL 571 ++S +S ++G T++W LV+ + G RL Sbjct: 528 GGGWVKIMSLSSCLCLDQGETTIWFLVSRTNEDGVSSIGLERL 570 >gi|331648168|ref|ZP_08349258.1| conserved hypothetical protein [Escherichia coli M605] gi|331043028|gb|EGI15168.1| conserved hypothetical protein [Escherichia coli M605] Length = 823 Score = 524 bits (1348), Expect = e-146, Method: Composition-based stats. Identities = 117/587 (19%), Positives = 217/587 (36%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|298381710|ref|ZP_06991309.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279152|gb|EFI20666.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 823 Score = 523 bits (1347), Expect = e-146, Method: Composition-based stats. Identities = 117/587 (19%), Positives = 217/587 (36%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|117624704|ref|YP_853617.1| hypothetical protein APECO1_4049 [Escherichia coli APEC O1] gi|115513828|gb|ABJ01903.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 823 Score = 523 bits (1346), Expect = e-146, Method: Composition-based stats. Identities = 114/580 (19%), Positives = 214/580 (36%), Gaps = 43/580 (7%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 T G++G R +++G ++ S G + Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSG--DDDIGIEWEYLHSGFGIARITAA----NGTTAT 286 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 IS P + + W AW GYP V ++ RL F+ S +++ S Sbjct: 287 AEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWAS 344 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418 G + DF D + + I + G V + ++++ Sbjct: 345 RTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYVITGDQ 400 Query: 419 SK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQ 474 +K S F +G PP++V + +FV G ++ ++ S + G++ N++T Sbjct: 401 NKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTI 460 Query: 475 LADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533 LA+HLF I+ + P+S + + + +LL + + + FAW + Sbjct: 461 LANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAWAPQSSTG 514 Query: 534 KHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 K+ S S +++ +V + G+ + RL+ Sbjct: 515 KY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|324008552|gb|EGB77771.1| conserved domain protein [Escherichia coli MS 57-2] Length = 823 Score = 523 bits (1346), Expect = e-146, Method: Composition-based stats. Identities = 117/587 (19%), Positives = 217/587 (36%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|218700982|ref|YP_002408611.1| hypothetical protein ECIAI39_2672 [Escherichia coli IAI39] gi|218370968|emb|CAR18795.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] gi|323948677|gb|EGB44582.1| hypothetical protein ERKG_04900 [Escherichia coli H252] Length = 823 Score = 522 bits (1345), Expect = e-146, Method: Composition-based stats. Identities = 117/587 (19%), Positives = 218/587 (37%), Gaps = 57/587 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SF+ GE+ P L R D++ + + K N I +YG + + P + + Sbjct: 3 ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG + +++ + + + + TPYT D +++ Sbjct: 62 NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ ++ P+ + V A Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +T+ + IF G+ L P W + + SIG AD YR++ Sbjct: 173 SASTGTITLTASVSIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232 Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 T G++G R +++G + D I W + S G Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 + IS P + + W AW GYP V ++ RL F+ S Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + DF D + + I + G V + Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393 Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467 ++++ +K S F +G PP++V + +FV G ++ ++ S + G+ Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453 Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + N++T LA+HLF I+ + P+S + + + +LL + + + FAW Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 + K+ S S +++ +V + G+ + RL+ Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550 >gi|309702804|emb|CBJ02135.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 807 Score = 517 bits (1332), Expect = e-144, Method: Composition-based stats. Identities = 106/563 (18%), Positives = 208/563 (36%), Gaps = 40/563 (7%) Query: 21 LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYA 80 + R D++ + + K N I +YG + + P + + + R R+ F Sbjct: 1 MYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGEAKYPTRKCRLIPFQFSTVQTY 60 Query: 81 LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140 L FG +++ + + + + PY D +++ VH +PP Sbjct: 61 ALEFGHNYMRV-IKDGAYVLNSSNVIYELAMPYADTDLFRIKFTQSADVLTLVHPAYPPK 119 Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFK 200 L ++ ++ P+ + VK A A T T +T+ IF Sbjct: 120 ELRRYAHD---NWQIVDVTTKNGPFEDINVDETVKVYA-----SASTGTITLTASSAIFG 171 Query: 201 PLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-RFGYSKGAT 255 G+ L P W + +I AD YR+ T+G++G R +++G + Sbjct: 172 AEQVGKLFYLEQPAIDSVPVWETSKTTAINDVRRADSNYYRANTSGKTGTLRPSHTEGMS 231 Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAG 315 + S G V D ++ +S P + + Sbjct: 232 WDGWGGTGDSDTGIQWE--YLHSGFGIARITAVSSDG--LTATATVVSYIPSQ--VVGSA 285 Query: 316 VSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCY 375 W AW GYPS V ++ RL F+ S +++ S G + DF + Sbjct: 286 NGSYKWARYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQDD 345 Query: 376 DPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGS 432 D + + I + G V + + +S +K L+ F + Sbjct: 346 DR---IIYTYAGRQVNEIRHLIDVGNLVAL-TSGGEYTISGDQNKVLTPSAFSFSSQGNN 401 Query: 433 GVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQ 490 G PP++V + +F+ G ++ ++ S + G++ ++T LA+HLF +R I+ + Sbjct: 402 GSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTILANHLFQKRSIVDWSFC 461 Query: 491 EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGG 550 P+S + + + +LL + + + FAW + K+ S S Sbjct: 462 IVPYSSAFCIRDDG-----KLLVLTYLRD-QQVFAWAPQSSTGKY--ESTCSI--SEGSE 511 Query: 551 TSLWMLVALS-AGEERSFTVRLN 572 +++ +V + G+ + + RL+ Sbjct: 512 DAVYFVVNRTINGQTKRYIERLS 534 >gi|262043557|ref|ZP_06016670.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039091|gb|EEW40249.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 511 Score = 516 bits (1328), Expect = e-144, Method: Composition-based stats. Identities = 113/534 (21%), Positives = 203/534 (38%), Gaps = 36/534 (6%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +W + SFS GE++P L R D++ + + K N I +YG + + P Q + Sbjct: 3 VSWIQPSFSGGEIAPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTQFIAAAKYP 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R R+ F L FG +++ + + TPYT D L++ Sbjct: 62 DRKCRLIPFQFSTVQTYALEFGHNYMRV-IKDGGLVLTTGDVIYELATPYTENDVFGLKF 120 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH +PP L ++ +++ P+ + + Sbjct: 121 TQSADVMTIVHPSYPPKELRRYAHD---NWQIVDVQTTNGPFEDINVDESKTV-----WA 172 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239 A T T +TS IF G+ L P W + + SI AD YR+ Sbjct: 173 SAPTGTITLTSSSAIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIEDIRRADSNYYRAN 232 Query: 240 TTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 T G++G R +++G + S G V V GD ++ Sbjct: 233 TAGKTGTLRPSHTEGMAWDGWGGT--GDDDTGVQWEYLHSGFGIVRITAVAGDG--LTAT 288 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 +S P++ + A + W AW GYP+ V ++ RL F+ S +++ S Sbjct: 289 ADVVSRIPEN--VVGADKASYKWARYAWNSVNGYPATVVYYQQRLYFAASPAYPQTIWAS 346 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418 G + DF D + + I + G ++V ++++ Sbjct: 347 RTGDYKDFGKSNPTQDDDR---IVYTYAGRQVNEIRHLIDVG-SLVVLTSGGEFVVTGDQ 402 Query: 419 SKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQ 474 +K L+ +G PP++V + +F+ G ++ ++ S + GF+ N++T Sbjct: 403 NKVLTPSAFSLSSQGSNGCSDVPPIAVSNIALFIQEKGSVVRDLAYSFDVDGFQGNDLTI 462 Query: 475 LADHLFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527 LA+HLF +R I+ + P S + V + +LL + + + FAW Sbjct: 463 LANHLFQKRSIVDWAFCIVPFSSAFCVRDDG-----KLLVLTYLRD-QQVFAWS 510 >gi|304398395|ref|ZP_07380269.1| conserved hypothetical protein [Pantoea sp. aB] gi|304354261|gb|EFM18634.1| conserved hypothetical protein [Pantoea sp. aB] Length = 824 Score = 508 bits (1307), Expect = e-141, Method: Composition-based stats. Identities = 111/579 (19%), Positives = 203/579 (35%), Gaps = 41/579 (7%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 + + SF+ GE+SP + R DL+ ++ + + RN I +YG L + P + + + Sbjct: 4 SLIQPSFAGGEISPNVY-GRVDLAKYSIALRRCRNFIVRQYGGLENRPGTRFIAEAKYPD 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 R R+ F L FG +++ TPY D L+ Sbjct: 63 RKCRLIPFQFSTVQTYALEFGHNYMRVY-KDGGQVLDGNNQVYELATPYQEADLFELKIT 121 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 HK + P L S+ E+ P+ + VK A Sbjct: 122 QSADVMTICHKAYAPRELRRFGHA---SWELVEVVTKNGPFEDINIDPSVKVYA-----S 173 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + + ++ IF G+ L P W + ++G A D Y +LT Sbjct: 174 SYQGNITLNANASIFGSEQVGKLFYLEQVNVDSTPVWETDKAVAVGMTRRAGDNYYVALT 233 Query: 241 TGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 G++G R +++GA + + + G I Sbjct: 234 AGKTGTLRPSHTEGAAWDGWGSNGDNDTGIQWEYQHSGFGIARITSVSSDGYI----AAA 289 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 + P + W AW + GYP VT++ RL+F+ S +++ S Sbjct: 290 VVQTYMPNDAV--GPTKASYKWAKFAWNQVNGYPGTVTYYQQRLIFAASIKYPQTIWCSK 347 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419 G + DF D + + I + G V + + + + Sbjct: 348 TGDYKDFGKTSPIADDDR---IVYTYAGKQVNEIRHLIDVGSLVAL-TSGGQFQIVGDQN 403 Query: 420 K---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475 K + F G + P++V + +F+ G ++ ++ S + G++ +++T L Sbjct: 404 KTLTPTAFSFSSQGADGASSVAPITVSNIALFIQEKGSVVRDLAYSFDVDGYQGSDLTVL 463 Query: 476 ADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 A+HLFN R++ + P+S W V S LL + E + FAW + Sbjct: 464 ANHLFNGYRLVDWTFSVVPYSAGWAVR-----SDGMLLCLTYLRE-QQVFAWAPQP--GE 515 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 S S +++ V + G + + RL+ Sbjct: 516 GKFESTCSI--SEGTEDAVYFSVQRTVNGASKRYIERLS 552 >gi|330007163|ref|ZP_08305905.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3] gi|328535510|gb|EGF61970.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3] Length = 825 Score = 504 bits (1296), Expect = e-140, Method: Composition-based stats. Identities = 113/585 (19%), Positives = 210/585 (35%), Gaps = 45/585 (7%) Query: 5 TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64 + + S + GE+SP L R DL + + + RN I + G + + P + + Sbjct: 4 SLVQPSLAGGEISPSLY-GRIDLEKYQTSLRRCRNFIVRQSGGIENRPGFRFLGSAKYAD 62 Query: 65 RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124 R +R+ F L GD ++ + TP+ L++ Sbjct: 63 RYSRLIPFQFSVSQTYALELGDHYFRVWSN--GALVTDGGSPVEVATPWPVSVISELKFT 120 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 H D+PP + + D + + P+ V A Sbjct: 121 QSADVMTVCHNDYPPLEIRRYGEAD---WRTAAVTTTSGPFQDLNTDDSVTVYA-----S 172 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIVADDKVYRSLT 240 T + +T+ IFK G+ + + W + + +G + YR + Sbjct: 173 GRTGSVTLTASSPIFKSQHVGKLFYMEQKAVDSVGRWETDKDIGVGDECRYQENFYRCVD 232 Query: 241 TGRSGDR----FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 G +G ++ G ++ VL S G + GD + Sbjct: 233 GGSNGTTGTVAPTHTTGDSWDGWGLGGRNGVL----WRYLHSGFGVCRITAIAGDGLTAT 288 Query: 297 KD--GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354 D R + + + W AW + +GYP VT++ RL+F GS+ + Sbjct: 289 ADVVPRQDGEIELPAQVVGSTFATYKWAHYAWNDTDGYPGTVTYYQQRLIFGGSRAFPQT 348 Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414 ++ S G +++F D A+T + I + G+ ++V + + Sbjct: 349 IWCSRTGDYHNFYRSNPKVDDD---AITYNYAGRQLNKILHLLDVGQ-LIVLTSGGEFKV 404 Query: 415 SISLSKGLS----IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469 + + L+ S +G P++VG ++V G I+ + S + ++ Sbjct: 405 TGDSNGNLTGTGGFAMSGQSFNGSSDLAPINVGSVALYVQQKGSIIRDLFYSFDQDSYQS 464 Query: 470 NEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 +++T LA HLFN I +P S+ W S LLG + E + +AWH Sbjct: 465 SDLTLLASHLFNGYSIRDWALSVQPFSVAWCAR-----SDGMLLGLTYLRE-QQVYAWHP 518 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 H +++ YV S S +++ L+ + G + RLN Sbjct: 519 HPMTNG-YVESICSI--SEGQEDAVYALIRRTVNGSTVRYIERLN 560 >gi|320175038|gb|EFW50151.1| 12 [Shigella dysenteriae CDC 74-1112] Length = 799 Score = 488 bits (1256), Expect = e-135, Method: Composition-based stats. Identities = 107/564 (18%), Positives = 203/564 (35%), Gaps = 56/564 (9%) Query: 27 LSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGD 86 ++ + + K N I +YG + + P + + R R+ F L FG Sbjct: 1 MAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQTYALEFGH 60 Query: 87 KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQ 146 + +++ + + + + TPYT D +++ VH +PP L Sbjct: 61 QYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKFTQSADVLTLVHPAYPPKELRRYA 119 Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206 ++ ++ P+ + V A A T T +T+ IF G+ Sbjct: 120 HD---NWQLVDVVTKNGPFEDINIDESVTVYA-----SASTGTITLTASASIFGAEQVGK 171 Query: 207 SIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-RFGYSKGA------- 254 L P W + + SIG AD YR++T G++G R +++G Sbjct: 172 LFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGG 231 Query: 255 TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQA 314 + D I W + S G + IS P + Sbjct: 232 SGDDDTGIEWEYL---------HSGFGIARITAA----NGTTATAEVISYIPSQ--VVGE 276 Query: 315 GVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGC 374 + W W GYP V ++ RL F+ S +++ S G + DF Sbjct: 277 DNASYKWAKYTWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNPTQD 336 Query: 375 YDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK---GLSIDFRRVSG 431 D + + I + G V + ++++ +K S F Sbjct: 337 DDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYVITGDQNKVLTPSSFAFSSQGS 392 Query: 432 SGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN-QRILQLVY 489 +G PP++V + +FV G ++ ++ S + G++ N++T LA+HLF I+ + Sbjct: 393 NGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCF 452 Query: 490 QEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRG 549 P+S + + + +LL + + + FAW + K+ S S Sbjct: 453 SIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAWAPQSSTGKY--ESTCSI--SEGN 502 Query: 550 GTSLWMLVALSA-GEERSFTVRLN 572 +++ +V + G+ + RL+ Sbjct: 503 EDAVYFVVNRTVNGQTVRYIERLS 526 >gi|48697202|ref|YP_024932.1| hypothetical protein BcepC6B_gp12 [Burkholderia phage BcepC6B] gi|47779008|gb|AAT38371.1| gp12 [Burkholderia phage BcepC6B] Length = 768 Score = 487 bits (1254), Expect = e-135, Method: Composition-based stats. Identities = 129/616 (20%), Positives = 222/616 (36%), Gaps = 67/616 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF AGELSP LL +R DL+ + G N I GP + + Sbjct: 1 MPKAAPQQVSFDAGELSP-LLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAAT 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + + + F + DG +L FGD ++ V R A TPY D Sbjct: 60 KDSTKQSWLLPFIVADGIAYMLEFGDHYIRFFVNRGQLV--NAGAPVEIATPYALADLTT 117 Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 ++ T H +P LL +F+ + F+ P+ + V Sbjct: 118 EDGTFAIRATQSADTMYLFHGGYPTQKLLRTS---ATTFSLQPVTFVGGPF------AAV 168 Query: 175 KSNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYI 229 S+ + + + A T + + +F+P D G L W + Sbjct: 169 NSDNNVRVHASAGTGAVTLVASASVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELR 228 Query: 230 VADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY--- 286 D+VY G + + ++ T+ + W S T + GA Y Sbjct: 229 RVGDRVYLCTAVGTATPQVTGTETPTHT--SGSRWDGTGQDESATDEYGSIGAEWEYQHS 286 Query: 287 -----YVWGDIKDVSKDGRSISVAPQSQTLFQAG----VSVVSWFMSAWGEQEGYPSHVT 337 + G D G + P + W S + +G+P T Sbjct: 287 GYGTVLITGYTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGT 346 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWM 396 F NRL + +S F F + D + D A+ + + + WM Sbjct: 347 FWRNRLCLMRDRWLA----MSVSADFETFKTKDADQQTDD--SAIVQQLNARQLNKLAWM 400 Query: 397 HPFGEGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGV 452 + +L+G W++ + + +++ R + G PV VG ++FV Sbjct: 401 VE-SDSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGSKRIQPVQVGGTIMFVQKA 459 Query: 453 GRRIKYISGST-EQGFRFNEITQLADHLFN------QRILQLVYQEEPHSIVWVVLEPKD 505 GR+++ + ++T++ADH+ I+ L +Q+EPHS+VW Sbjct: 460 GRKLRDFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAAR---- 515 Query: 506 NSFPRLLGCRFSAE--GEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-G 562 + +L+GC + E + WH H ++ +V AS P + LW++V G Sbjct: 516 -ADGQLIGCTYDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWVIVRRQVNG 573 Query: 563 EERSFTVRLN--LLDD 576 + + LN L DD Sbjct: 574 QTVRYVEYLNPALQDD 589 >gi|221213947|ref|ZP_03586920.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166124|gb|EED98597.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 766 Score = 484 bits (1246), Expect = e-134, Method: Composition-based stats. Identities = 129/613 (21%), Positives = 218/613 (35%), Gaps = 63/613 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF AGELSP LL +R DL+ +A G N I GP V + Sbjct: 1 MPKAAAQQVSFDAGELSP-LLGARVDLAKYANGCLLLENFIATVQGPAVRRGGKRYVSAI 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + + F + DG +L FGD+ ++ V R A TPY D Sbjct: 60 KDSGKQAWLLPFIVSDGIAYMLEFGDQYIRFYVNRGQLVNDSA--PVEIATPYALADLVT 117 Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 ++ T H +P L +F + F+ P+ + + Sbjct: 118 EDGTFAIRATQSADTMYLFHGAYPTQKLSRTS---ATTFELQPVTFVGGPFATVNDNNSI 174 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIV 230 + A + +T++ +F+ D G + P WA + + Sbjct: 175 RVQA-----SGQSGDVTLTANADVFRASDVGTLFYVEQEQPTGIVPWAVHAESHVNDIRR 229 Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS-------GAV 283 D+ YR G + + + + + W + S Sbjct: 230 VGDRTYRCTQIGLNAPQVTGQETPIHTE--GRRWDGDGRDPDGDTYGSIGVEWEYQHSGY 287 Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGV---SVVSWFMSAWGEQEGYPSHVTFHN 340 A + G + + P + V W S + +G+P TF + Sbjct: 288 ATVLITGFVNARQVSATVTTNNPNDPCMIPKPVVDSGTYKWARSLFNSTDGFPQMGTFWS 347 Query: 341 NRLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 NRL + + +S F +F + D + D A+ + + + WM Sbjct: 348 NRLCVMRDRW----IAMSVSADFENFKTKDADQQTDD--SAIVQQLNARRLNKLAWMVE- 400 Query: 400 GEGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 + +LVG W++ S + ++ RR + G PV VG ++FV GR+ Sbjct: 401 SDSLLVGMTGDEWVIGKSNASLALSATNMSARRRTSYGSKRLQPVEVGGTILFVQKAGRK 460 Query: 456 IKYISGST-EQGFRFNEITQLADHLFN------QRILQLVYQEEPHSIVWVVLEPKDNSF 508 ++ + ++T++ADH+ I+ L YQ+EPHSIVW + Sbjct: 461 LRDFKYDFSSDNYVSTDVTKIADHVTRGRSGTNSGIMSLCYQQEPHSIVWAAR-----AD 515 Query: 509 PRLLGCRFSAE--GEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEER 565 +L+GC + E + WH H + +V AS P + LWM+V G+ Sbjct: 516 GQLIGCTYDEEAGRSDVYGWHRHPDVNG-FVECVASMPAPDGASDDLWMIVRRQINGQSV 574 Query: 566 SFTVRLN--LLDD 576 + LN L DD Sbjct: 575 RYVEYLNQSLQDD 587 >gi|221201505|ref|ZP_03574544.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207939|ref|ZP_03580945.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2] gi|221172124|gb|EEE04565.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2] gi|221178773|gb|EEE11181.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 767 Score = 482 bits (1239), Expect = e-134, Method: Composition-based stats. Identities = 131/612 (21%), Positives = 217/612 (35%), Gaps = 60/612 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF AGELSP LL +R D++ + G N I GP V + Sbjct: 1 MPKAAAQQVSFDAGELSP-LLGARVDIAKYPNGCKVMENFIATVQGPAVRRGGKRFVAAV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + + F + DG +L FGD ++ V R + TPY D Sbjct: 60 KDSSKQAWLLPFIVSDGIAYMLEFGDHYIRFYVDRGQLVNAGG--PVEIATPYALADLVT 117 Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174 ++ T H +PP LL +F+ ++ F+ P+ GV Sbjct: 118 EDGTFAIRATQSADTMYLFHGAYPPQKLLRTS---ATTFSLQQVTFVSGPFQTINSDEGV 174 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229 A T +T+ +F D G L + P T G Sbjct: 175 TVKA-----SGQTGAVTLTATAPVFSQADVGALFYLEQNDNTSVLPWSVHGTILETGLVR 229 Query: 230 VADDKVYRSLTTGRSGDRFGYSKGATYVK----DNNITW-ITVLNLSSKTSRESASGAVA 284 D+ Y S G + + S+ T+ + D ++T + E A Sbjct: 230 RVGDRTYVSTAIGPTAPQVTGSETPTHTRGRRYDGDLTDLANDNYGTIGIEWEYQHSGYA 289 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAG---VSVVSWFMSAWGEQEGYPSHVTFHNN 341 + G + P + W + + +GYP TF N Sbjct: 290 TVLITSVSDSQHATGTVTTNNPTDPCIIPQSIVDTGTYKWAHALFNAADGYPQMGTFWRN 349 Query: 342 RLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400 RL + S F +F S D + D A+ + + + WM Sbjct: 350 RLWMMRDRWLV----GSVSADFENFASKDADQQTDD--SAIVQQLNARQLNKLAWMVE-S 402 Query: 401 EGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 + +++G W++ + + +++ R + G PV VG ++FV GR++ Sbjct: 403 DSLIIGMTGDEWVIGPANASQPVSATNLNAARRTSYGSKRIQPVQVGGTIMFVQKAGRKL 462 Query: 457 KYISGST-EQGFRFNEITQLADHLFNQ------RILQLVYQEEPHSIVWVVLEPKDNSFP 509 + F ++T+LADH+ I+ L +Q+EPHSIVW + Sbjct: 463 RDFKYDFSSDNFVSTDVTKLADHITRGRSGTNNGIMSLCFQQEPHSIVWAAR-----ADG 517 Query: 510 RLLGCRFSAE--GEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERS 566 +L+GC + E + WH H ++ +V AS P + LW++V G+ Sbjct: 518 QLIGCTYDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWLIVRRQINGQTVR 576 Query: 567 FTVRLN--LLDD 576 + LN L DD Sbjct: 577 YVEYLNPALQDD 588 >gi|317120716|gb|ADV02538.1| hypothetical protein SC2_gp080 [Liberibacter phage SC2] gi|317120777|gb|ADV02598.1| hypothetical protein SC2_gp080 [Candidatus Liberibacter asiaticus] Length = 590 Score = 478 bits (1230), Expect = e-132, Method: Composition-based stats. Identities = 160/591 (27%), Positives = 258/591 (43%), Gaps = 43/591 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K+SF++GE+SP + QS +L ++ +A N IPLR G L+ P + Y Sbjct: 1 MTKAIHFKNSFASGEVSPFVHQSGSNLKIYQSCLAHCHNYIPLRTGALMRRPGTRIYHVF 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R+FSF ++V G KL I R T + PY +D Sbjct: 61 DDVDKPQRLFSFVKDAYTAYIIVLGYLKLHIFERRMGGCSK----VTTIEVPYKKEDVDE 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +E A T VH HPP L + F E+ F P L + I K + L Sbjct: 117 IEVAQNIDTLWMVHPKHPPCQLELKGKD----WEFKEVLFKHVPPLKEQFIDDKKVSINL 172 Query: 181 SISQADTSTAR-----ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 +T T + + +D ++FK +D GR + LG P W +T Y +Y+V +D++ Sbjct: 173 KTPFENTETGKTGMVSVEADGEMFKEMDIGRELNLGFRPQRWIPDTWYLDNSYVVHNDRL 232 Query: 236 YRSLTTGRS-GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVW-GDIK 293 + + G+S + +S KD + W V ES G +W + Sbjct: 233 LKCINKGKSQSTEWTFSDKEHQQKDGSCLWEKV---------ESTKGNARNLLIWVTGVI 283 Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 K + + + + Q + W + WG++EGYPS +TF NRL+ SG K + Sbjct: 284 KRFKTAKCVLLELKGAFPLQNDLPTKHWLLGEWGQKEGYPSCITFFGNRLVLSGGKHNPQ 343 Query: 354 SVYLSSFGAFYDFSLDGEY-GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 +V+ S F DF+ E G D T + + + I W+ G+LVG +++LW Sbjct: 344 TVHFSKLDDFTDFNQISEQGGNTDLTSSFSVLLGSDVRQGIQWLSHTDSGLLVGTESALW 403 Query: 413 LLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI-----SGST 463 L++ + ++ R + G A P+ VG VF+ GR + + + +T Sbjct: 404 LITQTSQNEVVSKATVAIRSIGNFGSIAVSPILVGSHCVFIKDTGRDLISLVGNRSADNT 463 Query: 464 EQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGD 523 + +RF ++ A+H+ + + + V Q+ P+SI+WVVL RL+GC F + E Sbjct: 464 KTEYRFRDLNLFAEHILTKGVWEAVLQQSPYSIIWVVLRDG-----RLVGCTFDPDNE-V 517 Query: 524 FAWHTHMISD-KHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571 AWHTH + + S S + G LW+LV G + +L Sbjct: 518 CAWHTHDLGGFYTQIHSLTSCASFLDGQDDLWLLVERLDDTGRKTRSLEKL 568 >gi|120601703|ref|YP_966103.1| hypothetical protein Dvul_0653 [Desulfovibrio vulgaris DP4] gi|120561932|gb|ABM27676.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 699 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 126/578 (21%), Positives = 207/578 (35%), Gaps = 77/578 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T ++SF+AGELSP L+ +R D + + G A N++ +G P ++ Sbjct: 1 MARATIVRNSFNAGELSP-LMAARVDQARYPNGCASLCNMLLHPHGGAWRRPGLRFMGLA 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 R+ F + +L FG + L+I +TP+ + + Sbjct: 60 ADPAGPVRLIPFVFSEAQAYVLEFGPRSLRIWHGGGLVLGGDGE-PFRLETPWAGEQLTA 118 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V PP L D + ++ FLP +G+ VK Sbjct: 119 LRWCQSADMLYLVSHAGPPRRLERHGHAD---WRLVDVSFLPGVSPPEGLHCTVKPAGSR 175 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + + T+ R + + + P + P ++ + ++ V D YR Sbjct: 176 TWTYVVTAVHRESGEESLPTPPLQVT------GPDALSQTASVTLAWTPVQDAGEYRVYR 229 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 G +G+ A GA Y G D Sbjct: 230 AGGGASVYGFLGSA--------------------------GAGETYTDTGRTPDFDAG-- 261 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 P+++ F +PS F RL F+G++ +++ S Sbjct: 262 ----PPEARNPFSGEGD--------------WPSCAVFWQQRLCFAGTRNGPQTIWASRS 303 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 GA+ +FS+ D A+T + + S + W+ P +LVG W LS + Sbjct: 304 GAYGNFSVSRPLRDDD---AVTVTIAADTVSAVRWLMP-ARRLLVGTGGGEWTLSGQGEQ 359 Query: 421 ---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 LS R S G P+SVGD ++ + GR ++ S + G+ ++T LA Sbjct: 360 PFSPLSCSLERQSSRGSGDVQPLSVGDAVLALQRGGRVVREFRYSLDVDGYAGTDLTILA 419 Query: 477 DHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 +HL +RI+ +Q+ P VW V E L+ E E WH H+ Sbjct: 420 EHLTRGRRIIDWAWQQSPSGTVWCVTEDGG-----LIAMTRIPEHE-VAGWHRHVTDGA- 472 Query: 536 YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 VLS + P G LW+ V G R RL+ Sbjct: 473 -VLSVCTIPG--TAGDELWVAVRREGGGMVRCCIERLD 507 >gi|218886166|ref|YP_002435487.1| hypothetical protein DvMF_1065 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757120|gb|ACL08019.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 692 Score = 436 bits (1120), Expect = e-120, Method: Composition-based stats. Identities = 125/579 (21%), Positives = 206/579 (35%), Gaps = 75/579 (12%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M TT ++SF+AGELSP L+ +R D + +A G RN++ +GP P ++ C Sbjct: 1 MARTTLIQNSFNAGELSP-LMAARGDQARYASGCRVLRNMLLHPHGPAFRRPGLRFMGAC 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R+ F +G +L F ++L++ R PY + + Sbjct: 60 VDETVPPRLVPFVFNEGQAYVLEFAPERLRVW-WRGGLVLGEGGAPLVVPAPYAAEHLPT 118 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V P L D + + F P G+ S + Sbjct: 119 LRWCQSADVLYLVTPHAAPRKLERHGHAD---WRLVAVNFGPRVATPTGLRSTGAPSGTR 175 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 T+ + T + + L A+ + ++ V YR Sbjct: 176 QHRYVITAVSVDTGEESLPTAE-------LAVTAGTPAEGSAVNLAWTAVEGASEYRVYK 228 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 G +G A + Y G D ++ Sbjct: 229 AGGGASVYGLLGTAATGE--------------------------TYADTGRTPDFAEG-- 260 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 P+ + F+ + YPS V F RL F+GS+ +++ S Sbjct: 261 ----PPEHRNPFEG--------------TDDYPSSVQFWQQRLCFAGSRSHPQTIWASRT 302 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 G + + + D A+T + + S + WM P +LVG W LS S+ Sbjct: 303 GCYENMDVSRPLQTDD---AVTVTIASETVSAVRWMMP-ARKLLVGTGGGEWTLSGQGSE 358 Query: 421 ---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 LS S G PP++VGD ++ V GR ++ S + G+ + T LA Sbjct: 359 PFSPLSCLLEFQSARGSAELPPLAVGDGVLAVQRGGRAVRDFRYSLDVDGYSGADQTILA 418 Query: 477 DHLFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 +H+ R I+ YQ+ PHS+VW ++ + G AE + WH H Sbjct: 419 EHMLRGRNIVDWAYQQSPHSVVWCAMDDG-----TMAGLTLIAEHQ-VAGWHRHDTGGAV 472 Query: 536 YVLSAASFPNDN-RGGTSLWMLVALS-AGEERSFTVRLN 572 L P + GG LW++V G +R + RL+ Sbjct: 473 EALCVVPGPPSDPAGGDELWLVVRRDVDGVQRRYIERLD 511 >gi|282848883|ref|ZP_06258273.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC 17745] gi|282581388|gb|EFB86781.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC 17745] Length = 772 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 107/610 (17%), Positives = 225/610 (36%), Gaps = 70/610 (11%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 ++ +F+ GE+SP + SR DL + + ++ N++ YG + Q + + Sbjct: 7 ISQLAFTTGEVSPDV-SSRFDLEQYKSALLEAENVVIRPYGAVAKRQGSQYVGQVKYSDK 65 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125 R+F F+ +L FGDK +++ T G TP+T L + Sbjct: 66 PTRLFEFTTNTNNSFMLEFGDKYIRVWNYGVYT-------GIEVTTPFTSDILFDLNCSQ 118 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 G +P L D D + + K P+ + V S ++ Sbjct: 119 SGDVMFICSGKYPIQTLSRYSDTD---WRLEAYKLTEQPYDTIN--TDVNSTVTVTGDTI 173 Query: 186 DTSTARITSDM-------------------KIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226 +S +DM + RS G + N NY++ Sbjct: 174 RSSKDLFNADMVGMVMQLGYFVAAVHTKNTGTVVEKKEKRSFMGGFNKWNEYNNINYNVE 233 Query: 227 AYIVADDKVYRSLT----TGRSGDRFGYSKGATYVK--------DNNITWITVLNLSSKT 274 +Y D ++ T TG + + G T+ D N+T + ++K Sbjct: 234 SYSTDQDLAWKFTTHGTWTGTVKLQITTNNGTTWKDYRTYSSNNDYNVTDAGKIEPNAKL 293 Query: 275 SRES--------ASGAVAPYYVWGDIK-DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325 +S ++ PY WG ++ D +++ + + + S W M + Sbjct: 294 RIQSDIKSGECNVDLSILPYTTWGIVEFKEFVDSKTMKINILNGIVENEATS--KWKMGS 351 Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAV 385 WG GYP TF+ +R + + + + +++S G + +F ++ G ++T V Sbjct: 352 WGRSNGYPKLCTFYQDRFVVAATNKNPNYIWMSRTGDYPNFGVEKVEGTITDDSSITLPV 411 Query: 386 TDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYACPPVSVGD 444 + I + P + +++ + W++S + + + + + G +C P +G+ Sbjct: 412 INRKMYEIRHLVPAND-LIILTSGNEWIVSGDKTITPTNCNLKTQTQRGALSCEPQFIGN 470 Query: 445 CLVFVCGVGRRIKYISGSTE-QGFRFNEITQLAD-HLFNQRILQLVYQEEPHSIVWVVLE 502 VFV G ++ + S E + ++T + + Y ++P SI++ + Sbjct: 471 RCVFVQERGGTVRDMGYSYESDNYTGQDLTLFVKTRVRGYLTITSAYAQDPDSIIYYIRN 530 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-A 561 + + C + + W H +++ Y+ + SL+ L+ + Sbjct: 531 DGE------INCLTYIPEQKVYGWS-HFVTNGKYLYCESV---SEGEQDSLYTLIERTLQ 580 Query: 562 GEERSFTVRL 571 G++ R+ Sbjct: 581 GKKVKCIERM 590 >gi|262043657|ref|ZP_06016766.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038995|gb|EEW40157.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 758 Score = 414 bits (1063), Expect = e-113, Method: Composition-based stats. Identities = 118/614 (19%), Positives = 199/614 (32%), Gaps = 75/614 (12%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K SF+AG LSP ++ + D A V +N IPL GP Q Sbjct: 1 MSKIRPIKRSFNAGILSP-VMYGQVDFDKWASAVKYMKNFIPLPQGPARRRGGTQYAGSV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118 + + SF +L FG ++ + TP+ D Sbjct: 60 KNSSDRVWLASFQFSTTEAFILEFGPGYIRFWFNHAQL-LDDENNILEVSTPWGAGDLTR 118 Query: 119 ---KSLEYAVFGSTAVFVH--KDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG-DGMIS 172 L ++P + L +++ E F P+ + S Sbjct: 119 NGKFGLSLQQSADVIYITCTNGNYPVYKLTR---NTNTNWSLAEASFSGGPFADINSDKS 175 Query: 173 GVKSNAKLSISQAD-----------TSTARITSDMKIFKPLDKGRSIRLGC--------- 212 V + I D TS IT++ IF+ L G + Sbjct: 176 SVVYTDQFRIWSEDGNDLPDGTPTTTSLCNITANTDIFQALHVGCLFYIEASTDAVDDDT 235 Query: 213 ----HPPEWAKNT--NYSIGAYIVADDKVYRSLTTGRSG-DRFGYSKGATYVKDNNITWI 265 + P WA T +S G + +D K Y + ++G + ++ GA Sbjct: 236 GHSGYIPAWAAGTTETFSTGVFCRSDGKYYEDMDGTKTGNTQPTWTAGAHRDGSGG---- 291 Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325 + + G + S G+ ++ P S + + Sbjct: 292 ------DASLWRYSGGGWGIIEITAVNSATSATGKIVTELPPS--VRNTVGKTYKYAFGD 343 Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAV 385 W + YP F RL+F+G ++ S G +FS + ++ + Sbjct: 344 WSDVLRYPQFAAFFRGRLVFAGR----QKIWSSVAGDLQNFSPMTNGYEAESDDSINDRI 399 Query: 386 TDFSASTIHWMHPFGEGVLVGCDTSLW------LLSISLSKGLSIDFRRVSGSGVYACPP 439 D + T+ W+ + +G + L S+ + ++ G Sbjct: 400 -DDTQDTMQWLVASAGKIFIGTAGYEFSYGEQSLTSVFGAGNTKVELNSTI--GSNEVQA 456 Query: 440 VSVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVW 498 + D + FV GR++ + S F LA HLF I+ L YQ+EP+ I+W Sbjct: 457 ERLFDRVAFVQRAGRKVMIAAYDSGSDSFSATNSCILAPHLFTSEIIALAYQQEPNRILW 516 Query: 499 VVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558 V+LE + AE + WH H V S P+ + G LWM+V Sbjct: 517 VLLEEGKLLGL-----TYDAE-QNITGWHEHATGGA--VESIKVIPDIDGGRDELWMVVK 568 Query: 559 LS-AGEERSFTVRL 571 + G + + Sbjct: 569 RTINGATVRYLEYM 582 >gi|294648405|ref|ZP_06725904.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825710|gb|EFF84414.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 706 Score = 405 bits (1040), Expect = e-110, Method: Composition-based stats. Identities = 123/583 (21%), Positives = 203/583 (34%), Gaps = 56/583 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K++F++GELSP + R DL + G + N +P+ G L Sbjct: 1 MAKINLIKNNFTSGELSPHIWM-RTDLQQYRNGTKEMLNFLPIIEGGLKRRGGT---EAL 56 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F I LL+F ++ ++ + + K+ TPYT +D K Sbjct: 57 AITAGAIRILPFIISHSTAYLLIFKPNQIDVLDINGTVV-------KSLSTPYTAQDIKE 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + Y H HP L D ++++D F PP + V++ A Sbjct: 110 ISYTQNRYQFYIAHSKHPLAWLR--ASEDLTNWSYDPFDFYVPP------LEEVETPALP 161 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 S + T + D + + G N Y A + T Sbjct: 162 LKSNEKNAGKVATLTASPYNIYDNSKRYQAGEICHHTINNVKYYFRALRITQGNTPSFGT 221 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSS-KTSRESASGAVAPYYVWGDI-KDVSKD 298 +G Y + T + T V V+P V G+I +S D Sbjct: 222 SGPEASPDYYWETTTVTEAQAFTAADVDKFVFINEGIVRIDTYVSPSTVTGEILVKLSTD 281 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 +I+ +W + + GYP VT + RL+ +G+K V+ Sbjct: 282 IEAIAN---------------AWTLKQDIFEVSLGYPRAVTMYQQRLVIAGTKTYPNYVW 326 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 LS G +F + T + + + + + ++ + L + S Sbjct: 327 LSRVGDVTNFLP-----TTSDGDSFTVSASSDQLTNVLHLAQSRGICVMTGGSELVISSQ 381 Query: 417 SLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQ 474 + + + G P+ VG L+FV RI+ + NE+T Sbjct: 382 NSMTPTNTSILEHTSFGSTENIKPIKVGSELIFVQRGAERIRTLLYDYSIDSLTSNELTV 441 Query: 475 LADHLFN--QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 LA H+ ++VY EP SI+W VL +L + E + AW TH I Sbjct: 442 LASHIAKKSGGFKEMVYCAEPDSIIWFVL-----GNGKLASLTLNRE-QSVIAWSTHDIG 495 Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLD 575 VLS S P+ G L+ LV + + LLD Sbjct: 496 GT--VLSLTSLPS-TTGADRLYFLVNRNGTVQIEQMKEELLLD 535 >gi|298485990|ref|ZP_07004064.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159467|gb|EFI00514.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 716 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. Identities = 109/581 (18%), Positives = 206/581 (35%), Gaps = 44/581 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T + +F+AGELSPR+L R D++ + G N PL +G + Sbjct: 1 MAKLTLIQTNFTAGELSPRML-GRVDIARYQNGAKVIENAWPLVHGGVTRRNGTLFCAAA 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + R R+ + ++ FGD ++I G +PY + Sbjct: 60 KFPDRRARLVPYVFNTEQAYMIEFGDFYIRIYYPNG------GWTGVELASPYGQTMLAA 113 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 LEY T H P + L I ++ F+ P+ GM Sbjct: 114 LEYVQGADTMFLFHGRVPIYRLKRIS---NTEWSLAPAPFVTTPFEERGMDFAFAMAIT- 169 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAK-NTNYSIGAYIVADDKVYRSL 239 + A + + +T F D GR I G ++ S+ ++ S Sbjct: 170 --NPAAGAASTVTPGAPAFFISDVGREIWAGSGIARITAFGSSGSVSVLVINAF----SQ 223 Query: 240 TTGRSGDRFGYSKGA-TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 T + G + T + + L L + R G Sbjct: 224 TLYPTWSLKGSPQTTCTASAFSPVGATVTLTLGAAGWRPEDVGKFVKLNGGLFQISGFTS 283 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 ++ +S +W + S W + +GYPS T + RL+ +GS +++ Sbjct: 284 STVVNAVIRSIATSVVAAPAGAWSLEASVWNDFDGYPSTGTLYEQRLVAAGSPNYPQTIW 343 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCD-TSLWLLS 415 S G + +F L + A++ V+ + I MH LV + ++ Sbjct: 344 ESRTGEYLNFELGTK-----DDDAMSFNVSSDQINPI--MHVGQVKALVTLTYGGEFTVT 396 Query: 416 ISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471 + K +I + S G P+ +G+ L FV GR+++ ++ + + + Sbjct: 397 GGVEKPITPTNIQIKNQSVYGCNGVRPIRIGNELYFVQRAGRKLRAMAYKYDSDSYGSPD 456 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 ++ L++H ++ + +Q+EP SI+++V S + + + W + Sbjct: 457 MSVLSEHATKSGVVDMAFQQEPESILFMVR-----SDGVMATMTVDRD-QDVVGWARQVT 510 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRL 571 + S A P+ G +W +V + G+ + R Sbjct: 511 DGAY--ESVAVIPSAE--GDQVWAVVRRTVNGQNVRYLERF 547 >gi|46580124|ref|YP_010932.1| hypothetical protein DVU1714 [Desulfovibrio vulgaris str. Hildenborough] gi|46449540|gb|AAS96191.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233883|gb|ADP86737.1| hypothetical protein Deval_1582 [Desulfovibrio vulgaris RCH1] Length = 697 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. Identities = 126/582 (21%), Positives = 205/582 (35%), Gaps = 81/582 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + +F+ GE+SP LL +R D + G RN +PL GP+ P ++ Sbjct: 1 MGTIYPVQQAFNGGEISP-LLTARADQIRYQTGALTMRNAVPLAQGPVTRRPGLRFMGAA 59 Query: 61 RL-DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119 + R+ SF L FG +++ + + + +PY D Sbjct: 60 KEQGAGPVRLVSFVFSAAQSRALEFGPGYVRVWMDAGLVSKNGQPY--EVASPYGAADIA 117 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP----PPWLGDGMISGVK 175 L +A ++HPP L D D + F F+P P L G + Sbjct: 118 GLRFAQSADVIYIASRNHPPRKLSRHADDD---WRFITPTFMPTQAAPGALTLGTLGTTP 174 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 + S T+ + T + + P G W + + ++ + + +V Sbjct: 175 GPGNETYSYKVTAVSATTGEESLASPE--GTITTTAMSSTYWVRVSWAAVPGAV--EYRV 230 Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295 Y+ G G G T+ D NI GA V Sbjct: 231 YK-RRYGVFGFIGRAVGGDTFFDDRNI------------------GADTEDTV------- 264 Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 P+++ F A YP V F RL F+GS L+V Sbjct: 265 ----------PEAKNPFTAAGE--------------YPGLVFFWQQRLGFAGSDKRPLTV 300 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 +LS AF + + D +A + + W+ + +G + W LS Sbjct: 301 WLSQSAAFENLAASRPPQDDDGIEA---TLAGQRQNRFVWI-EGDRTLCLGTEGGEWTLS 356 Query: 416 ISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471 S+ F+ G P V GD L++V G ++ + S E G+ + Sbjct: 357 GQEGGPVTPTSLQFQSHGVRGSEGVPAVRAGDSLLYVQRGGGVVREFTYSFERDGYVAPD 416 Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531 +T L L +++ YQ+ PHSIVW VL+ L F E + WH H Sbjct: 417 LTLLTGVLRGRKVRAWAYQQSPHSIVWCVLDDG-----TLAALTFLREHD-VVGWHRHDT 470 Query: 532 SDKHYVLSAASFPNDNRGG-TSLWMLVALS-AGEERSFTVRL 571 ++ + GG ++WMLV + G+ER + R+ Sbjct: 471 DGVVEDVTVIPGGDATAGGTDTVWMLVRRTVGGQERRYVERM 512 >gi|292670776|ref|ZP_06604202.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541] gi|292647397|gb|EFF65369.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541] Length = 762 Score = 400 bits (1028), Expect = e-109, Method: Composition-based stats. Identities = 119/601 (19%), Positives = 211/601 (35%), Gaps = 65/601 (10%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 K SF+ GEL+P L R DL + G + +N+I LRYG P + + + Sbjct: 9 LKPSFAGGELTPALY-GRTDLQKYDVGASTLKNMIVLRYGGATRRPGFRHVAKTQ-GGKR 66 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 R+ F +L F +++ A T YT D ++Y Sbjct: 67 ARLIPFQYSTEQSYVLEFTAGCIRVFTKGGIVVKDDAPLVIP--TSYTEADLSDIKYTQS 124 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 VH +HPP L D + F+ + P+ G+K + Sbjct: 125 ADVLFLVHVNHPPMTLTRYGVTD---WKFERMDIAGGPFEDPNTKDGLKI-----GASGV 176 Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN--TNYSIGAYIVADDKVYRSLTTGRS 244 + + + F G IRLG K+ + V VY + Sbjct: 177 QGEITLKASVDYFTEDMVGSLIRLGHTMSGQLKSGIPTTPLVVRCVPSGTVYVESFGFWN 236 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY----------VWGD--- 291 G + + + T + G Y VW + Sbjct: 237 GSFIVEKHDKSTDTWIALQEQHANRTQNYTLNYTNKGDDIVEYRVRSEKFDTSVWSNENE 296 Query: 292 -----------------IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPS 334 + ++ + S A + + +SAW ++GYP Sbjct: 297 RQRGYVTIQTFAQDYYGVARITAVNSATSAAATVTRELADTEATNDFSLSAWSAKKGYPQ 356 Query: 335 HVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394 V+F +RL+F+GS+ + + S G +Y+F ++ D A+T ++ + I Sbjct: 357 AVSFFEDRLVFAGSRAKPQTYWASQSGDYYNFWVNTPQQDSD---AITGTLSGGQMNGIR 413 Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGV 452 + PFGE +++ + + + G+ PV +G +V+V Sbjct: 414 AIIPFGEMLML-TSGGEYKVGGGNETFTPTNQKAEPQEYRGINNLTPVVIGGRIVYVQHQ 472 Query: 453 GRRIKYISGSTE-QGFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPR 510 G I+ ++ S + + ++++ LA HLF I+ L YQ+ P+++VW V E Sbjct: 473 GSVIRDLTYSYDVDKYTGDDVSLLAAHLFEGHTIVALAYQQTPNTVVWCVREDG-----A 527 Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVR 570 LLG + E + +AWH H + K + D LW +V + + Sbjct: 528 LLGMTYIKE-QDVYAWHKHTTAGKFTD--VCTISGDR--EEELWAVVERDGAH---YVEQ 579 Query: 571 L 571 + Sbjct: 580 M 580 >gi|225157020|ref|ZP_03724959.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2] gi|224802748|gb|EEG20999.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2] Length = 773 Score = 398 bits (1023), Expect = e-108, Method: Composition-based stats. Identities = 115/617 (18%), Positives = 209/617 (33%), Gaps = 77/617 (12%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 ++F+AGE +P+L R DL + + N+ + YG + +R Sbjct: 7 NNFTAGEWTPKL-DGRSDLQKYDAACRRLENMRVMPYGGARFRSAFGYVAKTKSAATPSR 65 Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128 + F +L + L++ S +PAL + +PY +++Y Sbjct: 66 LMPFQFSTEQKFMLEWAHLALRVY----SAGAAPALL-QEIASPYPAAAVFAIQYRQIND 120 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 VH D+P L D D + + + + PP L + + KLS+S D Sbjct: 121 VVYLVHPDYPVQRLARHADAD---WRLEAVDWAFPPMLDENVTET-----KLSLSAVDGV 172 Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI---GAYIVADDKVYRSLTTGRSG 245 +T+ +F+P G L + + + + G + A V T S Sbjct: 173 NVTMTASAALFQPGHVGSYWELRHLKEAASTSVSLATTSGGPFHSAAISVQGDWTA-NST 231 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRE-SASGAVAP-------YYVWGD------ 291 +R+ + D TW TV ++++ R SASG Y GD Sbjct: 232 ERWYGTLSIERSLDGGTTWETVRKFTAESDRNISASGHQEELAQFRLKYQPTGDPFGAGV 291 Query: 292 -------------------------IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 + V+ S V + W SAW Sbjct: 292 WVGKAPTNYVKARAMLETTDAYVTALVKVTAYTDSTHVKVTVIDKAATVAATDIWCESAW 351 Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386 G+P + + RL+F G++ +++ S F +F D A+ Sbjct: 352 SPYRGFPRTIGLYEQRLIFGGTRHQPNTMWGSKTDDFENFK-----YGEDDDAAVAYTFA 406 Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVG 443 + + W+ + + + + L +I R S +G PV V Sbjct: 407 ASEQNNVQWVESLKRIQAATTAREFTVAAGNTDEPLTPSNIVVRSESANGAAHLQPVLVN 466 Query: 444 DCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLE 502 D +++V R++ ++ S E G+ ++T LA + + QL + +P ++ V E Sbjct: 467 DAILYVERQSRKVMEMAYSIEKDGYASVDLTLLAAPVTESGVKQLAFARQPDPLLLAVTE 526 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-A 561 + L + + AW + + S A+ +W +V + Sbjct: 527 NGN-----LAVLTYDRP-QDVTAWARWITNGAF--ESVATLQG--TPEDEIWAVVRRTIG 576 Query: 562 GEERSFTVRLNLLDDFK 578 G RL D K Sbjct: 577 GVPVRTIERLTPETDSK 593 >gi|262043403|ref|ZP_06016528.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039229|gb|EEW40375.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 664 Score = 391 bits (1003), Expect = e-106, Method: Composition-based stats. Identities = 107/583 (18%), Positives = 200/583 (34%), Gaps = 108/583 (18%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D+ +A G N + + G ++ P Q + Sbjct: 2 RANLIKTNFTAGEISPRLM-GRVDIDRYANGAKTLENSVVVVQGGVMRRPGSQFVAATKY 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + +R+ + +L FGD L+I +PYT S+ Sbjct: 61 GDKKSRLIPYVFNRTQAYILEFGDGYLRIYQDGKQLVNDD-NTPYEIASPYTSDMLPSVN 119 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDK-------ISFTFDEIKFLPPPWLGDGMISGVK 175 Y T VH+D P+ L D I FDE++ P W + V Sbjct: 120 YVQGADTMFLVHQDVKPYRLQRRGQTDWVLEPAPFIVEPFDEVRDTPQKWCKPSVKEFVG 179 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS--IGAYIVADD 233 S L ++ D D PP + + +G+Y+ + Sbjct: 180 SEITL----------TLSDDEPPEGSED----------PPPFTGDGWVPEDVGSYVRINS 219 Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293 ++ + S TS + A G + Sbjct: 220 --------------------------------GLVLIKSVTSAQVAVGTIRT-------- 239 Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGD 351 D+S S +W S W ++ GYP VT + RL+ +GS Sbjct: 240 DLSAT---------------QAASPGAWTREDSVWTDEFGYPGAVTLYQQRLVLAGSPRY 284 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411 +++ S G + F L D A++ ++ + I + + + Sbjct: 285 PQTIWWSESGVYLSFELGT-----DDDDAISFTLSSDQLNPIVHLAQMNTLIALTYGGEF 339 Query: 412 WLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GF 467 + + + + +I + S G PV VG ++FV GR++ ++ + + Sbjct: 340 TITAGNDAAITPTNISVKNPSPYGCNGIRPVRVGTEIMFVQRSGRKLYAVAYDPDSYVAY 399 Query: 468 RFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527 N++T LA+H+ ++ + YQ++P + W+V ++ + AW Sbjct: 400 SANDMTVLAEHITEGGVIDMAYQQQPDAFTWLVRNDG-----VMVTMAIDR-AQNVVAWS 453 Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTV 569 + S S A+ P+ ++ +V + G+ + Sbjct: 454 RQITSGAF--ESVATIPSAT--DDVVYAIVRRTVNGQTVRYVE 492 >gi|303328570|ref|ZP_07359005.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861336|gb|EFL84275.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 696 Score = 390 bits (1001), Expect = e-106, Method: Composition-based stats. Identities = 105/582 (18%), Positives = 193/582 (33%), Gaps = 88/582 (15%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 ++ + GE++P L++ R D + G + RN +P+ G + P + D + Sbjct: 6 IQNVLNGGEITP-LMRGRVDQPRYGTGAREMRNFVPMPQGGVTRRPGTRFLGMAHGD--A 62 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 R+ F +L FGDK L++ + K +++PY D L +A Sbjct: 63 ARLIPFVFSATQGRMLEFGDKTLRVWLPDGRLVADENGEPKVFESPYAVGDLHELRFAQS 122 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV------KSNAKL 180 H+ + P L D D + + E+ F+P D + V NA Sbjct: 123 ADVVYLAHQGYAPRRLSRHADDD---WRWSELAFVPAIAAPDNVSLQVIDRGYNGDNATR 179 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + A T+ T Sbjct: 180 VYTYAVTAVDEKTGQESGAGAE-----------------------------------VSI 204 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 T ++ + Y A + + V G + D + Sbjct: 205 TAKALNSVSYIIRAAWPAVEGAAYYRVYKKKYGV-----FGYIGRSDAECSFDDENIGAD 259 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + P+ + F + +PS V FH RL ++ + ++++LS Sbjct: 260 TEDTPPEHKNPFASEGD--------------WPSQVFFHQQRLGWAATANRPITIWLSRP 305 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS- 419 G F + D A+ + A+ I W+ P + + G + S W LS Sbjct: 306 GDFEIMAASTPPKDDD---AIEATLAATQANRIVWLQPDRQSLTFGTEGSEWTLSAGEGV 362 Query: 420 --KGLSIDFRRVS-GSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQL 475 ++ F + G A VSVG ++++ G+ ++ + + + ++T L Sbjct: 363 ALTPSNVSFEMQTANGGDNATQAVSVGGGVLYLQRGGKAVRQFAYNYSADKYLGQDVTIL 422 Query: 476 ADHLFNQRIL-QLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 A H+ ++ +Q+EP++++W L S L G + E + WH H + Sbjct: 423 ARHILRDAVVTAWAFQQEPYAVLWCAL-----SDGTLAGLTYMPE-QDVMGWHRHDTDGR 476 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576 A+ P W LV G RL+ D Sbjct: 477 F--EDVAAMPG--TPDDQTWFLVRRGCG---LCVERLDSFFD 511 >gi|78357587|ref|YP_389036.1| hypothetical protein Dde_2545 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219992|gb|ABB39341.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 700 Score = 389 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 114/591 (19%), Positives = 199/591 (33%), Gaps = 91/591 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRD- 59 M T T++SF+ GELSP LL SR D + G RN+ +G V P M+ Sbjct: 1 MSRITLTRNSFNGGELSP-LLSSRIDQQRYTAGCRTLRNMTVYPHGAAVRRPGMRHMGTG 59 Query: 60 ---CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116 + R+ F +L G+ +++ + +TP+ Sbjct: 60 LSLQPAGSAAVRLVPFVFSQEQAYVLELGEGVMRVWKDDGLVVSADGS-PVCVETPWKGD 118 Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176 +SL+Y V + P L D + ++F G+ + Sbjct: 119 ALQSLQYCQSADVMYLVCRQCAPRKLARHAHDD---WRITLLEFGAGLPAPQGLTAAAGG 175 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 A+ + T+ A + + + ++ + Sbjct: 176 AAEREYAYVVTAVAPDGGEESLPSEAVNVT------------AAASLNVRDMVR------ 217 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 +TW V + +S +G + Y+ Sbjct: 218 -------------------------LTWQPVEGAGAYCVYKSIAGGGSYGYI-------- 244 Query: 297 KDGRSISVAP-QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355 G++ V + + + + + + +P V F+ RL F+G+ ++ Sbjct: 245 --GKAAGVPAYEDRGAEPDFGQGPPEYRNPFDGEGRWPGCVQFYQQRLCFAGTDEKPQTI 302 Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415 + S + ++ D A+T + + I WM P +LVG W LS Sbjct: 303 WCSQSANYESMNISSPLRDDD---AVTVTIAADRVNRIRWMMP-ARRLLVGTAGGEWQLS 358 Query: 416 ISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471 S L+ RR + G P+ +G ++FV GR ++ + E G+ + Sbjct: 359 GSGDAPLTPVDAQLRRDTMHGSAGLMPLVIGQSILFVQRDGRTVREFRYALESDGYDAGD 418 Query: 472 ITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 +T LA+HL +RI+ YQ+ P S+VW L S L F E E WH H Sbjct: 419 LTILAEHLMRGRRIVSWCYQQSPASVVWCAL-----SDGTLAAMTFLREHE-VVGWHRHD 472 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVAL---------SAGEERSFTVRLN 572 +V + + P D G +W+ V + EE RL Sbjct: 473 TDG--FVEAVTAIPGDE--GDEVWLSVRRVRVLHDENGTRQEEVRSIERLE 519 >gi|169795391|ref|YP_001713184.1| phage-like protein [Acinetobacter baumannii AYE] gi|169148318|emb|CAM86183.1| hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 697 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. Identities = 124/565 (21%), Positives = 206/565 (36%), Gaps = 67/565 (11%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 K++ S+GELSP LL +R D+ +A G K N +PL G P + + Sbjct: 10 ILKNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRS---IFAG 65 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNKSLEYA 124 + R+ F LL+ G L++ R+ TPY T + + ++YA Sbjct: 66 ALRLIPFIANSENTYLLILGVSFLKVYNPRTYAV------VYETVTPYNTAQKVREVQYA 119 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 FV D P LL D F P LG S +++S Sbjct: 120 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 171 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 + T ++ S P W+ Y G ++ + K +R+ + Sbjct: 172 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHNSKTWRA-----T 212 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 D G AT + W V N ++ ++ G++ + G +++ V Sbjct: 213 ADNKGVEPSATTPE-----WEEVTNEAANVFTPASVGSIVE--INGGQVKITEYVDPSRV 265 Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + + V + SW + A+ + GYP V F RL+F+ +K ++ S Sbjct: 266 NGEVLVKLTSDVQAIAKSWVLKSIAFSAEAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 325 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 G +F A + A + + I + G V + + S Sbjct: 326 GDDGNF-----LETTQDADAFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQGPLT 380 Query: 421 GLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478 S + GV P VG+ L+FV G R++ +S E G E++Q+A H Sbjct: 381 PASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPELSQIAPH 440 Query: 479 LFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 + I +L +Q+ P+SIVW+V+ S L + AW H + Sbjct: 441 IPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ-- 492 Query: 537 VLSAASFPNDNRGGTSLWMLVALSA 561 VLS + P G +ML + Sbjct: 493 VLSICALP-TGLGEDQCFMLTNRNG 516 >gi|212703239|ref|ZP_03311367.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098] gi|212673505|gb|EEB33988.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098] Length = 694 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. Identities = 111/574 (19%), Positives = 199/574 (34%), Gaps = 82/574 (14%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 T++ + GE+SP LL+ R D ++ G + RN +P+ G + P + D Sbjct: 6 TQNVLNGGEISP-LLRGRVDQPRYSTGAREMRNFVPMPQGGVTRRPGTRYLGTALGDGG- 63 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 R+ F +L FGD+ +++ + K +++P+ D +++ YA Sbjct: 64 -RLVPFVFSATQGRMLEFGDRAMRVWLPDGRVVADEEGAPKIFESPFAAADLRAVRYAQS 122 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 F H + P L D D + + E+ F+P + + K ++S Sbjct: 123 ADVIYFAHPGYAPRKLARHADDD---WRWSELTFMPA----------IATPKKPALSTVG 169 Query: 187 T--STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 T + + DKG+ P E A + ++ + Sbjct: 170 TPEGDKKTDYTYCVTAIDDKGQ----ESSPSEPASISAQALNS----------------- 208 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 + ++ T V G Y I D + + Sbjct: 209 ---VDFHIRISWEAVEGATGYRVYKKKMGVFGYIGKGGADETY----IDDKNIGADTEDT 261 Query: 305 APQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFY 364 P+ + F+ + YPS V FH RL F+ S ++++LS G F Sbjct: 262 PPEYEDPFEGEGN--------------YPSQVFFHQQRLGFAASNSRPITIWLSRSGEFE 307 Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSI 424 + D A+ + AS I W+ P + G + S W L S L+ Sbjct: 308 SMAKSTPPKDDD---AIEVTLAATQASRIVWLQPDRSALAFGTEGSEWTLEPSEGVALTP 364 Query: 425 DFRR----VSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHL 479 + G A +SVG +++V I+ + + + ++ LA H+ Sbjct: 365 ATASFQLQTTNGGSDAVAALSVGGSVLYVQRGAGAIREFAYNYSADKYLGQDLNILARHM 424 Query: 480 FNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 ++ +Q+EP++++W VL S L G + E + WH H + Sbjct: 425 LRDVDVVAWSWQQEPYAVLWSVL-----SDGTLAGLTYMKE-QEIVGWHRHTTAGDFVD- 477 Query: 539 SAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 A P +W LV + F RL Sbjct: 478 -VAGIPG--TPDDQVWFLVRRGG---QVFVERLE 505 >gi|332875218|ref|ZP_08443051.1| carbohydrate binding domain protein [Acinetobacter baumannii 6014059] gi|332736662|gb|EGJ67656.1| carbohydrate binding domain protein [Acinetobacter baumannii 6014059] Length = 692 Score = 388 bits (996), Expect = e-105, Method: Composition-based stats. Identities = 125/565 (22%), Positives = 203/565 (35%), Gaps = 67/565 (11%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 K++ S+GELSP LL +R D+ +A G K N +PL G P + + Sbjct: 5 ILKNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRS---IFAG 60 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNKSLEYA 124 + R+ F LL+ G L++ R+ TPY T + + ++YA Sbjct: 61 ALRLIPFIANSENTYLLILGVSFLKVYNPRTYAV------VYEAVTPYNTAQKVREVQYA 114 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 FV D P LL D F P LG S +++S Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 + T ++ S P W+ Y G ++ K +R+ Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHTSKTWRATI---- 208 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 D G AT + W V N ++ S+ G++ + G +++ V Sbjct: 209 -DNKGVEPSATTSE-----WEEVTNEAANVFTPSSVGSIVE--INGGQVKITQYVDPSRV 260 Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + + V + SW + A+ GYP V F RL+F+ +K ++ S Sbjct: 261 NGEVLVKLTSTVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 320 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 G +F A + A + + I + G V + + S Sbjct: 321 GDDGNF-----LETTQDADAFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQGPLT 375 Query: 421 GLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478 S + GV P VG+ L+FV G R++ +S E G E++Q+A H Sbjct: 376 PASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPELSQIAPH 435 Query: 479 LFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 + I +L +Q+ P+SIVW+V+ S L + AW H + Sbjct: 436 IPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ-- 487 Query: 537 VLSAASFPNDNRGGTSLWMLVALSA 561 VLS + P G +ML + Sbjct: 488 VLSICALP-TGLGEDQCFMLTNRNG 511 >gi|293609614|ref|ZP_06691916.1| predicted protein [Acinetobacter sp. SH024] gi|292828066|gb|EFF86429.1| predicted protein [Acinetobacter sp. SH024] Length = 692 Score = 387 bits (994), Expect = e-105, Method: Composition-based stats. Identities = 125/565 (22%), Positives = 202/565 (35%), Gaps = 67/565 (11%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 K++ S+GELSP LL +R D+ +A G K N +PL G P + + Sbjct: 5 ILKNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRS---IFAG 60 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNKSLEYA 124 + R+ F LL+ G L++ R+ TPY T + + ++YA Sbjct: 61 ALRLIPFIANSENTYLLILGVSFLKVYNPRTYAV------VYETVTPYNTAQKVREVQYA 114 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 FV D P LL D F P LG S +++S Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244 + T ++ S P W+ Y G ++ K +R+ Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHSGKTWRATI---- 208 Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304 D G AT + W V N ++ S G++ + G +++ V Sbjct: 209 -DNKGVEPTATTSE-----WEEVTNEAANVFTPSNVGSIIE--INGGQVKITQYVDPSRV 260 Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + + V + SW + A+ GYP V F RL+F+ +K ++ S Sbjct: 261 NGEVLVKLTSAVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 320 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420 G +F A + A + + I + G V + + S Sbjct: 321 GDDGNF-----LETTQDADAFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQGPLT 375 Query: 421 GLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478 S + GV P VG+ L+FV G R++ +S E G E++Q+A H Sbjct: 376 PASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLISPELSQIAPH 435 Query: 479 LFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 + I +L +Q+ P+SIVW+V+ S L + AW H + Sbjct: 436 IPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ-- 487 Query: 537 VLSAASFPNDNRGGTSLWMLVALSA 561 VLS + P G +ML + Sbjct: 488 VLSICALP-TGLGEDQCFMLTIRNG 511 >gi|332160974|ref|YP_004297551.1| hypothetical protein YE105_C1352 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665204|gb|ADZ41848.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862130|emb|CBX72294.1| hypothetical protein YEW_AK02310 [Yersinia enterocolitica W22703] Length = 657 Score = 386 bits (991), Expect = e-105, Method: Composition-based stats. Identities = 105/581 (18%), Positives = 210/581 (36%), Gaps = 105/581 (18%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D++ +A G N + + +G ++ P + + Sbjct: 2 RANLIKTNFTAGEISPRLM-GRVDIARYANGAKTVENAVCVIHGGVMRRPGSRFAAKAKF 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + R+ + +L FG+ ++ + +PYT SL Sbjct: 61 GDQKARLIPYVFNRSQAYVLEFGNGYVRFYQN--GAQIGAGSTPYEIASPYTSAMLSSLN 118 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y T VH+D PP+ L D + P P+ Sbjct: 119 YVQGADTMFLVHQDVPPYRLQRKGQTDWV--------LEPAPF----------------- 153 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 I KP D+ R P +W K S+ ++ + +L+ Sbjct: 154 ---------------IVKPFDEIRDT-----PEKWCKP---SVKEFV--GSAITLTLSDA 188 Query: 243 RSGDRFGYSKGATYVKDNNITWI----TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 SG G GA +V + +++ ++++ + TS A+G + Sbjct: 189 ESG---GALTGAGWVGADVGSYVRINSGLVHIQAVTSAAVATGVI--------------- 230 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 R++ A QS S +W + W + GYP T + RL+ +GS +++ Sbjct: 231 -RTVLSAVQSS-------SPGAWTREDAVWSAEFGYPGAATLYQQRLVLAGSPKYPQTIW 282 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 +S G + F L D A++ V+ + I + + + + Sbjct: 283 MSETGIYLSFELGT-----DDDDAISFTVSSDQINPIVHLAQMNTLIALTSTGEFTITGG 337 Query: 417 SLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEI 472 S +I + S G + PV VG ++F+ R++ ++ + + N++ Sbjct: 338 GESAITPTNISVKNPSPYGCNSIKPVRVGTEIMFMQRANRKLFAVAYDPDSFVAYSANDL 397 Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 + L++H+ + + YQ+EP + +W+ + +L + AW + + Sbjct: 398 SVLSEHITLSGAVDMAYQQEPDAFIWMTR-----ADGQLAVATIDR-AQDVIAWSRQVTT 451 Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 + S + P +++LV G+ + + Sbjct: 452 GAY--ESVVTIPAST--NDVVYVLVKRVINGQIVRYVEVFD 488 >gi|295096862|emb|CBK85952.1| hypothetical protein ENC_24250 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 662 Score = 379 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 102/577 (17%), Positives = 197/577 (34%), Gaps = 92/577 (15%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D++ +A G N + + G +V P + + Sbjct: 2 RANLIKTNFTAGEVSPRLM-GRVDIARYANGAKIIENAVVVVQGGVVRRPGTRFAAATKH 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + +R+ + +L FGD ++I +PYT ++ Sbjct: 61 GDKKSRLIPYVFNRSQAYMLEFGDGYMRIFQNGKQLVNED-NTPYEIASPYTADMLPAVN 119 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y T VH+ PH L D + P P+ Sbjct: 120 YVQGADTMFLVHQSVKPHRLQRRGQTDWV--------LEPAPF----------------- 154 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 I +P D+ R P +W K S+ ++ ++ +T Sbjct: 155 ---------------IVEPFDEVRDT-----PQKWCKP---SVKEFVGSE------ITLT 185 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 S G ++ + W + + G V + + Sbjct: 186 LSDADPGDNETPPFTGAG---W---VAQDVGSYVRINEGLVLIKSIT--------SAQVA 231 Query: 303 SVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 +S S SW S W + GYP VT + RL+ +GS +++ S Sbjct: 232 VGTIRSDLSATQAASPGSWTREDSVWTNEFGYPGAVTLYQQRLVLAGSPKYPQTIWWSET 291 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS- 419 G + F + E A++ ++ + I + + + + S + + Sbjct: 292 GVYLSFEIGTE-----DDDAISFTLSSDQLNPIVHLAQMNTLIALTYGGEFTITSGNDAA 346 Query: 420 -KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLA 476 +I + S G PV VG ++FV GR++ ++ + + N++T LA Sbjct: 347 ITPTNISVKNPSPYGCNGIRPVRVGTEIMFVQRAGRKLYAVAYDPDSFVSYSANDMTVLA 406 Query: 477 DHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 +H+ +L + YQ++P + +W+V + + AW + + Sbjct: 407 EHITAGGVLDMAYQQQPDAFIWMVRADG------VAVTMAIDRAQDVIAWSRQVTAGAF- 459 Query: 537 VLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572 S A+ P+D ++ +V G+ + + Sbjct: 460 -ESVATIPSDT--DDVVYAIVRREINGQTVRYVEVFD 493 >gi|303327644|ref|ZP_07358084.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302862005|gb|EFL84939.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 681 Score = 376 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 119/588 (20%), Positives = 207/588 (35%), Gaps = 122/588 (20%) Query: 10 SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69 +F+ GE++P L +R DL +A + N +P +G P + + Sbjct: 7 NFTGGEVTPTL-SARYDLGRYANSLKIMENFLPNLHGDAYRRPGTYFLENL---GEGCVL 62 Query: 70 FSFSIPD--GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127 FS G L FG+K L+IV V ++PY D + YA G Sbjct: 63 LPFSFNAEAGQNFALAFGEKSLRIVNVNGYVVAEA------MESPYALADVPEISYAQVG 116 Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP---------PPWLGDG-------MI 171 HKD+ H ++ +++ + W G G + Sbjct: 117 DVVYLAHKDYALHKVVRTGSAPAYAWSIGTVALNTSLAAPAAPTAAWQGGGGSYTLRYKV 176 Query: 172 SGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV- 230 S V ++ K S+ A STA G +P +W + + + V Sbjct: 177 SAVDADGKESLPSAVGSTAS-------------------GKYPTDWTEGNHCVLSWQAVE 217 Query: 231 --ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYV 288 A+ +YR + G G G ++G ++ N A A P Sbjct: 218 GAAEYNIYRE-SAGYYG-FIGIAQGTSFDDQNY----------------EADIADTPKED 259 Query: 289 WGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348 W D + P VTFH R++ +G+ Sbjct: 260 WDPFADGNN-----------------------------------PGTVTFHQQRMVLAGT 284 Query: 349 KGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCD 408 + S Y+S G F +F DP + + + I W FG+ +L+G Sbjct: 285 RNSPQSFYMSRTGDFENFRKSRPLQDDDP---VEYQLASGTVDGIVWAASFGD-LLLGTA 340 Query: 409 TSLWLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-Q 465 ++ + + + S G P+ +G+ ++ G R++ + S E Sbjct: 341 SAEYKATGDNGAITAKNCTITAQSYWGSAKIAPIIIGNSVMHCQRHGSRVRDLYYSLEKD 400 Query: 466 GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524 G+ N+++ LA HLF+ I Q +Q+ P S++W+V + LL + E + + Sbjct: 401 GYAGNDLSVLAPHLFDGHTIRQWAFQQTPGSVLWLVRDDG-----VLLALTYMKE-QDIW 454 Query: 525 AWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 W + + V S A+ +N L ++V S G + + RL Sbjct: 455 GWSRQITDGR--VRSVAALSGENA--DELLLVVERSVDGARKYYLERL 498 >gi|85059168|ref|YP_454870.1| hypothetical protein SG1190 [Sodalis glossinidius str. 'morsitans'] gi|84779688|dbj|BAE74465.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 662 Score = 370 bits (949), Expect = e-100, Method: Composition-based stats. Identities = 100/577 (17%), Positives = 188/577 (32%), Gaps = 92/577 (15%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 K +F+AGE+SPRL+ R D+ +A G +N + + G ++ P + + Sbjct: 2 RANLIKTNFTAGEVSPRLM-GRVDIMRYANGAKAIQNGVVVVQGGVMRRPGTRFAAAAKY 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 R R+ + +L FGD L++ + + +PY+ S+ Sbjct: 61 SDRPARLIPYVFNRSQAYVLEFGDGYLRVY-QKGKPVVNANNTPYEIASPYSADRLPSVN 119 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 Y T VH P+ L D + P P+ Sbjct: 120 YVQGADTMFLVHPAVKPYRLQRRGQTDWV--------LEPAPF----------------- 154 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 I +P D+ R P +W + Sbjct: 155 ---------------IVEPFDEIRET-----PKKWCR----------------------- 171 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 S F S+ + D + + GA + + Sbjct: 172 PSAKEFVGSEVTLTLSDADPGENRNPPFTGAGWVAQDVGAYVRINGGLVLIQRIDSAQVA 231 Query: 303 SVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 +S + S SW S W + GYP VT + RL+ +GS +++ S Sbjct: 232 VGTLRSDLNAKQAASPGSWTREESVWTDNLGYPGAVTLYQQRLVLAGSPKYPQTIWWSET 291 Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS- 419 GA+ F L + A++ ++ + I + + + + S + + Sbjct: 292 GAYLSFELGTK-----DDAAISFTLSSDQLNPIVHLAQMNTLIALTYGGEFTITSGNDAA 346 Query: 420 -KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLA 476 +I + S G P+ VG ++F+ GR++ ++ + + N++T LA Sbjct: 347 ITPTNISVKNPSPYGCNRIRPLRVGTEILFIQRAGRKLYAVAYDPDSFVSYAANDLTVLA 406 Query: 477 DHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 +H+ + + YQ++P ++W+V E + + AW M Sbjct: 407 EHITAGGVRDMAYQQQPDGLIWLVREDGVAVTVTM------DRAQDVVAWSRQMTEGAF- 459 Query: 537 VLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 S S P++ L+ LV G + + Sbjct: 460 -ESVTSIPSER--DDVLYALVRRHINGHTVRYVEVFD 493 >gi|220918520|ref|YP_002493824.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans 2CP-1] gi|219956374|gb|ACL66758.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans 2CP-1] Length = 825 Score = 357 bits (915), Expect = 3e-96, Method: Composition-based stats. Identities = 131/635 (20%), Positives = 217/635 (34%), Gaps = 101/635 (15%) Query: 8 KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD---- 63 + SF+AGEL PRL R DL+ + G+ ++RN G ++ P R+ + Sbjct: 8 QGSFAAGELGPRLH-GRHDLAKYQVGLRRARNFFLSPEGAALNRPGTPFVREAKDSAAGV 66 Query: 64 PRSNRVFSFSIPD--GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK--TPYTFKDNK 119 R R+ F + G L FG ++ V +T P + Y+ TPY D Sbjct: 67 DRGARLIPFIFSEDLGQAYELEFGQGYVRFHV-GGATIADPLNSAQPYELATPYLAADLP 125 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLY--IQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 L+YA G K + P L + + +FD +LG + V ++ Sbjct: 126 RLKYAQQGDVVTLTCKGYDPRELRRLAHDSWELVPLSFDVPAPNGVVYLGVEALENV-AD 184 Query: 178 AKLSISQADTSTARITSDMKIFK---PLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 A Q I D + + R I +G W Y +GA + + Sbjct: 185 ATHPARQWAWQVTEIWEDESGLQWETSPLRVRKIAVGAGAT-WHTGFTYPLGACVSYAGQ 243 Query: 235 VYRSLTTGRSGDRF-----GYSKGATYVKDNNITWI-------------TVLNLSSKTSR 276 ++S+ G G ATY + + V+ +T + Sbjct: 244 FWQSVIADNRGHVPEAVMVGDPPAATYPYWTPVGAVPDPFAVYESNAPTDVVLFPDRTIK 303 Query: 277 ESASGA-----------------------------VAPYYVWGDIKDVSKDGRSISVAPQ 307 ASGA VA + GD D+S PQ Sbjct: 304 LWASGAWTGVDGSRLVGRRVYRGRGTVFGYVGEFEVAEFRDTGDTPDLSYS------PPQ 357 Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + F PS VTFH R G+ +LS G +Y+F Sbjct: 358 GRNPFTVFGPAGEVVRLE------QPSVVTFHAERRSLLGTAQRPAHAFLSRTGDYYNFD 411 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL---SISLSKGLSI 424 D A + + W +L+G + +W + S + Sbjct: 412 RHTPALVDD---AFELELAGRLREEVRWAV-GAAALLIGTQSGVWAIRPPSGEVLGPGKA 467 Query: 425 DFRRVSGSGVYACPPV----SVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADHL 479 S +G P+ +VGD +++V G ++ + QGF ++++ LA HL Sbjct: 468 TAVPQSSAGSSYLDPLVVPSAVGDAVLYVRTKGSGVRDLVYDDGRQGFVGSDLSLLAKHL 527 Query: 480 FNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 F I +QE+P S+ W+V S +LL + + + +AW H V Sbjct: 528 FTGYSIKAWTFQEDPWSVAWLVR-----SDGKLLSLTYVRD-QEVWAWAWHDTQG--IVE 579 Query: 539 SAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571 + P +++++V G + R+ Sbjct: 580 DVCAIP--EGTEDAVYLIVKRQIGDGTWHRYVERM 612 >gi|220903983|ref|YP_002479295.1| hypothetical protein Ddes_0709 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868282|gb|ACL48617.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 689 Score = 356 bits (913), Expect = 6e-96, Method: Composition-based stats. Identities = 122/584 (20%), Positives = 206/584 (35%), Gaps = 103/584 (17%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 ++F+ GE++P L +R DL+ + ++ N++P +G P + + + Sbjct: 8 NNFTGGEIAPTL-SARYDLARYRNCLSCMENMLPGLHGDTARRPGTRFVANL---DGHSV 63 Query: 69 VFSFSIP--DGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 + FS +LVFG L I + +TPY + + + YA Sbjct: 64 LIPFSFNALTSQNFVLVFGSHCLHIAGEQG------LENIPVIETPYAPGELQDISYAQV 117 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKIS--------FTFDEIKFLP---PPWLGDGMISGVK 175 G T H +HP H ++ + + ++ +++ P L SG Sbjct: 118 GDTVYLAHSNHPLHKVVRRDAPENRTQFEEAAYAWSLEKVALNASLAAPELPSVTFSGSA 177 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV---AD 232 + L A A S GR HP +W + + +I V + Sbjct: 178 GSYTLRYKVAAVDAAGRESLPSPAGQCANGR------HPSDWVQGNSAAISWAAVEGAVE 231 Query: 233 DKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI 292 +YR G G G S G + N A A P W Sbjct: 232 YNIYRE-EAGYFG-FIGVSGGLNFNDQNY----------------QADTADTPKEDWDPF 273 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 D + YP V FH R++ + + + Sbjct: 274 ADGN-----------------------------------YPGIVAFHQQRMVLAATPKNP 298 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 + Y+S G F +F DP + L + S + W FG+ +L+G S + Sbjct: 299 QAFYMSRVGDFENFRKSRPLQDDDPVEYL---IASGSIDAVTWAASFGD-LLIGTSGSEY 354 Query: 413 LLSISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFR 468 S +I S G P+ +G+ ++ V G R++ + S E G+ Sbjct: 355 KASGGDGASITAGNISITAQSYWGSAGLAPIIIGNSILHVQRHGSRVRDLFYSLEKDGYA 414 Query: 469 FNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527 N+++ +A HLF ILQ YQ+ P S +W V + LL + E + + W Sbjct: 415 GNDLSIMAPHLFEGHTILQWAYQQTPGSTIWCVRDDG-----LLLAFTYMKEHD-IWGWS 468 Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571 + + VLSAA+ +G T + + G+ R F RL Sbjct: 469 RQITQGR--VLSAAAISG-EKGDTLMLVTERRIDGQPRIFLERL 509 >gi|167032763|ref|YP_001667994.1| hypothetical protein PputGB1_1755 [Pseudomonas putida GB-1] gi|166859251|gb|ABY97658.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 774 Score = 355 bits (910), Expect = 1e-95, Method: Composition-based stats. Identities = 97/574 (16%), Positives = 194/574 (33%), Gaps = 79/574 (13%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 T + SFSAGE++P +R DL+ + + RN + L G + + + + Sbjct: 2 TEVIQPSFSAGEVAPATY-ARVDLARYYTALKTCRNFVVLPEGGAQNRSGTRFITEVKDS 60 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 R+ F +L FG+ ++ + + + +PYT L++ Sbjct: 61 AARTRLIPFQFSTEQTYILEFGNLYIRFISMGGQVVS--GVTPYEIASPYTTAQLPDLKF 118 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 VH DHPP L + ++T I F P G+++ ++ + Sbjct: 119 TQSADVMTIVHPDHPPRELSRLAP---TNWTLTAITFEPGIAAPTGLVATARTGGSGDTT 175 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGR 243 + ++T+ I WA NT Sbjct: 176 EYQ---YKVTAVSSI-----------SEGSVESWASNTATV------------------- 202 Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 + F GAT +K+S + + DI ++ Sbjct: 203 --NSFDDKPGATLAWTAVAGADHYNVYKNKSSGVFGFIGQSAGVTFNDINITPATDNTV- 259 Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 P F G + PS V ++ R+ F+ S+ + +V++S G F Sbjct: 260 --PIGYNPFADGNN---------------PSVVGYYQQRMAFAASRANPQTVWMSRTGDF 302 Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS---K 420 ++F D + + + I + E + + + ++ S Sbjct: 303 HNFGYSDPNKDDDG---IEFVIASRQVNQIRHLVSLRELLAM-TSGAEIAITGSSDSGIT 358 Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHL 479 ++ S G P + +++ G ++ ++ + GF+ +++ L+ HL Sbjct: 359 PANVSAVEQSYFGSSDVIPAIYANTALYIQARGGKLSTLAYNYVSDGFQPQDVSVLSSHL 418 Query: 480 FNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 I + P+ ++W+V LLG F + + + W H V Sbjct: 419 LRGFTIQDQAFALAPNGVLWLVRNDG-----MLLGFTFLPD-QQVYGWSWHDTDGA--VE 470 Query: 539 SAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 + AS P D+ +L+M+V + G + + R+ Sbjct: 471 AVASVPEDD--EDALYMIVRRTINGVTKRYIERM 502 >gi|212703338|ref|ZP_03311466.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098] gi|212673248|gb|EEB33731.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098] Length = 703 Score = 337 bits (865), Expect = 2e-90, Method: Composition-based stats. Identities = 126/591 (21%), Positives = 201/591 (34%), Gaps = 103/591 (17%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 H+F+ GE+SP +L +R DLS + V N++P +G + P Sbjct: 6 HNFTGGEVSP-ILAARYDLSRYGSSVQCMENMLPGLHGDVRRRPGTLFLGSLE---GEAV 61 Query: 69 VFSFSIPD--GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 + FS +LV L I + + + AL TPY + + A Sbjct: 62 LLPFSFNALAEQNFVLVLSGHSLCIADIHGFDRQTGALPRLP--TPYEARHLLEICAAQV 119 Query: 127 GSTAVFVHKDHPPHHLLYIQDGD-------------KISFTFDEIKFL---PPPWL-GDG 169 G T H +P H L+ D +T + + P P Sbjct: 120 GDTVYLAHTAYPLHKLVRSTYSDPEAPLPDNAIRSHGYRWTLEAVALNSSLPAPQAPDCT 179 Query: 170 MISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYI 229 + G + + ++ K + G G HP +W I Sbjct: 180 FVRGNNDDDAGLGYTLRYKIVAVDANGKQSLASEAGSC--DGKHPSDWVVGNRTDISWTA 237 Query: 230 V---ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 V + +YR G G G S G T+ +N A A P Sbjct: 238 VEGATEYNIYRE-EAGYYG-FIGVSSGTTFSDNNY----------------QADTADTPR 279 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346 W D + PS V FH R++ + Sbjct: 280 EDWDPFADGNN-----------------------------------PSVVAFHQQRMVLA 304 Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406 G++ + YLS G F +F DP + L + S I W FG+ +L+G Sbjct: 305 GTRDSPQAFYLSRSGDFENFRKSRPLQDDDPVEYL---IASGSIDAIAWAASFGD-LLLG 360 Query: 407 CDTSLWLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE 464 S + S + S +I S G P+ +G+ ++ V G ++ + S E Sbjct: 361 TSGSEYKASGNGSAITPGNITITAQSYWGSAGLAPIIIGNAILHVQRHGAHVRDLFYSLE 420 Query: 465 -QGFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEG 522 G+ N+++ LA HLF R+ Q YQ+ P S++W+V + LL + E + Sbjct: 421 KDGYAGNDLSILAPHLFEGHRLRQWAYQQTPGSVLWIVRDDG-----LLLALTYLKEHD- 474 Query: 523 DFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571 + W H + VLS S + L ++V + G R RL Sbjct: 475 IWGWSRHPTAG--EVLSVCSISGPD--SDELLLVVRRRDADGGSRYCLERL 521 >gi|41179374|ref|NP_958682.1| Bbp13 [Bordetella phage BPP-1] gi|45569506|ref|NP_996575.1| hypothetical protein BMP-1p12 [Bordetella phage BMP-1] gi|45580757|ref|NP_996623.1| hypothetical protein BIP-1p12 [Bordetella phage BIP-1] gi|40950113|gb|AAR97679.1| Bbp13 [Bordetella phage BPP-1] Length = 681 Score = 311 bits (796), Expect = 2e-82, Method: Composition-based stats. Identities = 100/577 (17%), Positives = 185/577 (32%), Gaps = 81/577 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N + SF GE+SP + R D + G+A RN + GP + R+ Sbjct: 1 MSNVRVLQRSFGGGEISPE-MFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F+ ++ G + + PY D + Sbjct: 60 KDSAKKVRLIPFTYSVTQTMVIELGAGYFRFHTNGGTLL--DGAVPYEIANPYAEADLFN 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + Y VH ++ P L + ++ I F P V + + Sbjct: 118 IHYVQSADVLTLVHPNYAPRELRRLG---ATNWQLATIAFTSP----------VATPTSV 164 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + + + T D YR + Sbjct: 165 TATSNNKGT-------------------------------------------DYTYRYVV 181 Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD-G 299 T + S T N + T SAS + Y V+ + + G Sbjct: 182 TALDAEGKTES---APSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGLYGYIG 238 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 ++ + + + + + YP+ V++ R F+G+ +++++ Sbjct: 239 QTTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTR 298 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419 G S D V A+ I + P E +L+ + S++ Sbjct: 299 SGTESAMSYSLPVRDDDRVA---FRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355 Query: 420 --KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 +I R S G PV V + ++ G ++ ++ + + GF +++ A Sbjct: 356 AVTPTTISVRPQSYVGATDVQPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRA 415 Query: 477 DHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 HLF+ IL + Y + P IVW + +S +LLG + E + AWH H Sbjct: 416 AHLFDNLDILDMAYAKAPQPIVWFI-----SSSGKLLGLTYVPE-QQIGAWHQHDTDG-- 467 Query: 536 YVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 S A L+ +V + G E + R+ Sbjct: 468 VFESCAVV--AEGNEDRLYAVVRRTIGGNEVRYVERM 502 >gi|303257570|ref|ZP_07343582.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] gi|302859540|gb|EFL82619.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] Length = 687 Score = 308 bits (788), Expect = 2e-81, Method: Composition-based stats. Identities = 99/590 (16%), Positives = 179/590 (30%), Gaps = 89/590 (15%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 T + SF+ GE+SP + R D + + G+ N + GP+ + P + R+ + Sbjct: 5 TKVLQRSFAGGEISPE-MFGRTDDTKYQTGLETCLNFLCRPQGPIENRPGFEFVREVKDS 63 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 + R+ F ++ G K + TP+ D LEY Sbjct: 64 SKKVRLIPFIFNAQQTFVIELGHKYARFH--SFGATLMNGNQPYEITTPWDEDDLFELEY 121 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF-----LPPPWLGDGMISGVKSNA 178 H+D+ P + + D + I F P + + Sbjct: 122 VQSNDIITVTHEDYAPTEIRRYSNTD---WRLATISFSSTLATPTNVTAVRETTTGNEDK 178 Query: 179 KLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 + + +D I S + C +A T I V+ YR Sbjct: 179 NADKYTFQYKVSCLNADKTIESEP----SAAVSCTANLYATGTTIKISCSAVSGASYYRF 234 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 Y I G + I D Sbjct: 235 -----------------YKNQGGI-----------------YGYLGDSETTSIIDDNIAP 260 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 I+ + YPS V + R F+G K D V + Sbjct: 261 KTDITPRRYDSVVSSGN----------------YPSAVGYFEQRRWFAGFKTDPQRVVAT 304 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418 G D + D + + + I + P +L+ + + + + Sbjct: 305 RSGTESDMTYSLPSKDDDR---INFRIAATEFNKILHISPLSHLILLTTGSEIRISPQNS 361 Query: 419 S--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQL 475 SI R S +G P+ + L+F ++ ++ + GF ++ Sbjct: 362 DAITPSSISARPQSYNGATTVRPLVYNNNLIFASARDGHVRELAYQYQAGGFVSGDLCLR 421 Query: 476 ADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534 + HLF+ + I Q+ P+ I+W V +S LLG + E + +WH H Sbjct: 422 SQHLFDFKTIKDATAQKAPYPIMWFV-----SSDGNLLGLTYIPE-QQVGSWHRHNTDG- 474 Query: 535 HYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL------NLLDDF 577 S + +L+ ++ + G ++ + R+ NL D F Sbjct: 475 -VFESCCAV--SEGVEDALYCVIRRTINGSQKRYVERMRTRNFKNLADAF 521 >gi|187476936|ref|YP_784960.1| phage protein [Bordetella avium 197N] gi|115421522|emb|CAJ48031.1| phage protein [Bordetella avium 197N] Length = 681 Score = 307 bits (785), Expect = 5e-81, Method: Composition-based stats. Identities = 97/577 (16%), Positives = 182/577 (31%), Gaps = 81/577 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N + SF GE+SP + R D + G+A RN + GP+ + R+ Sbjct: 1 MSNVRVLQRSFGGGEISPE-MFGRIDDVKYQSGLAICRNFVVKPQGPVENRAGFSFVREV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F+ ++ G + + PYT D S Sbjct: 60 KDSTKKVRLIPFTYSVTQTMVIELGAGYFRFHTDGGTLL--NGDTPYEIANPYTEADLFS 117 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK- 179 + Y VH ++ P L I D + I F+ + G+ + + Sbjct: 118 IHYVQSADVLTLVHPNYAPRELRRIGATD---WQLATIAFMSSVAMPTGVTATSNNKGTD 174 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + T+ Sbjct: 175 YTYRYVVTALDAEGKTESAPS--------------------------------------- 195 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + G + + GA + + + + + G + + D Sbjct: 196 SAGICANNLFTNGGANTIAWSAAS--GASRYNVYKEQGGLYGYIGQTTGTSLVDDNIAPD 253 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359 S++ P +F A YP+ V++ R F+G+ +++++ Sbjct: 254 LSVT-PPIYDAVFNAAGD--------------YPAAVSYFEQRRCFAGTINKPQNIWMTR 298 Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419 G S D V A+ I + P E +L+ + S++ Sbjct: 299 SGTESAMSYSLPVRSDDRVA---FRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355 Query: 420 --KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476 +I R S G PV V + ++ G ++ ++ + + GF +++ Sbjct: 356 AVTPTTISVRPQSYVGATDVQPVVVNNTAIYGAARGGHVRELAYNWQANGFVTGDLSLRC 415 Query: 477 DHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535 HLF+ IL + Y + P IVW + +S +LLG + E + AWH H Sbjct: 416 AHLFDNLNILDMAYAKAPQPIVWFI-----SSSGKLLGLTYVPE-QQIGAWHQHDTEG-- 467 Query: 536 YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRL 571 S A L+++V G+E + R+ Sbjct: 468 VFESCAVV--AEGNEDRLYVVVRRIIGGKEVRYIERM 502 >gi|119386474|ref|YP_917529.1| hypothetical protein Pden_3767 [Paracoccus denitrificans PD1222] gi|119377069|gb|ABL71833.1| conserved hypothetical protein [Paracoccus denitrificans PD1222] Length = 679 Score = 298 bits (763), Expect = 2e-78, Method: Composition-based stats. Identities = 96/573 (16%), Positives = 187/573 (32%), Gaps = 79/573 (13%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 + +F++G L P L R DL+ + + K RN+ +G + + P ++ + Sbjct: 3 AARIQPTFASGVLGPAL-WGRIDLARYDSALRKGRNVFVHAHGGVSNRPGLRFVCEVMDS 61 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123 +R+ F ++L+ G ++ V + + + T TP+T ++L+ Sbjct: 62 AHRHRLLPFVREADDASILIMGQNEMGFVKNGARLQSGGVDY--TIATPWTATQAQALDA 119 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183 H+ P ++ + D T + P + Sbjct: 120 VQSVDVIFAAHRQVAPRRIMRNGETDWSIATVPINPTVAAPTISSVTPRNSGDETYRYRV 179 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGR 243 A + + + SI + ++ T + + +VYR Sbjct: 180 TAVVGGVESFASAPLATTAAELLSIEGAWNDIAFSAVTGAT-------EYRVYRMRNGVP 232 Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 G++ G ++ DN + Sbjct: 233 GY--IGFTTGTSFRDDN-------------------------------------ISPDST 253 Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 V P Q S + YPS V+ + RL F S +V+LS G + Sbjct: 254 VTPPVQA-------------SLFDAAGKYPSVVSIYQQRLAFGASDAQPETVWLSRVGDY 300 Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGL 422 +F+ D + + + I M E ++ + L Sbjct: 301 LNFTRSQNMTSSDRAEFD---MAGEQLNRIRAMLQLRELLVFTSAGEFSVSGPDGGFDAL 357 Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481 + + G P+ D ++FV GR ++ + + E G+ N++ A H Sbjct: 358 NPIVTQHGYIGSATVKPLVADDTVLFVDRSGRGVRDLRYAYESDGYSGNDLAIFASHFLQ 417 Query: 482 -QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 +RI+ + P SI+WVVL+ +LL + E + +AW I V S Sbjct: 418 GRRIVGWAMAKNPWSIIWVVLD-----NGKLLALTYKREHQ-VWAWTEMDIDGA--VESV 469 Query: 541 ASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 A P + +++V G++R + R + Sbjct: 470 ACIP--EGASDATYLIVRRLIDGQQRRYVERFD 500 >gi|118590938|ref|ZP_01548338.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614] gi|118436460|gb|EAV43101.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614] Length = 810 Score = 298 bits (762), Expect = 2e-78, Method: Composition-based stats. Identities = 100/645 (15%), Positives = 202/645 (31%), Gaps = 101/645 (15%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 + +FS GEL P L+ R DL L +A+ RN + L+ G L + + + R Sbjct: 5 LQATFSRGELDPELIY-RSDLELFRSSLAECRNFLTLKRGGLRRRGGTKFIAELKDSSRQ 63 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 + F +G Y +L FG ++ TPY+ L++ Sbjct: 64 GWLIPFEFGNGQYYMLEFGHHIFRVFTSEGRVG------TVEVATPYSSGVLPRLKFVQS 117 Query: 127 GSTAVFVHKDHPPHHLLYIQ------------DGDKISFTFDEIKFLPPPWLGDGMISGV 174 T P L + DG + P Sbjct: 118 TDTLFIAGGGVAPQALKRLSELSWAIEPMSFRDGPYLDVNISPTNLKPAATGNAVPKMTS 177 Query: 175 KSNAKLSISQADT------------STARITSDMKIFKPLDKGRSIRLGCH--------- 213 + ++S ++ ++S + S+ + + Sbjct: 178 NTAPSGTVSASNGSASAWQLFNRSEGKTVLSSGATGWVQYQFPGSVVIDAYMLQAPNDNS 237 Query: 214 -----PPEWAKNTNYSIGAYIVAD---------DKVYRSLT----TGRSGDRFGYSKGAT 255 P +W + + + + D +R T + R +++G Sbjct: 238 QNDDMPWQWNIEASNNGSDWTILDTQDGQDTWSSNEWREYDFHNETAFTHYRLSFTQGGG 297 Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYV-----------------------WGDI 292 DN+ V + + A + W Sbjct: 298 SASDNSAIGQLVFHRAGNDQSPFTLTASGTGGINGGAGFQPSDVGRHIRFRGSDGFWRWF 357 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 + S+ + Q + W + AW G+P + +H NRL F+G+ + Sbjct: 358 RIHSRQSATSVKVQLFGQALQDTKAQSIWRLGAWSGTTGWPETIGWHKNRLAFAGTSEEP 417 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 ++ S F +FS+ D A+T + + I W+ + ++VG ++ Sbjct: 418 QKIWESQTEDFTNFSVSHVLKASD---AVTAGILSGQVNRIQWLVDDND-LIVGTTRAVR 473 Query: 413 LLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGF 467 + + ++D + + G P+ VG L++ G ++ ++ G Sbjct: 474 AVGKATDQDPYGPENVDQKPETNFGANDVSPIKVGSVLIYYGPYGTDMREMAYDFGSDGR 533 Query: 468 RFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527 ++++ HLF I YQ+ P S++W + + +G + + + Sbjct: 534 VSQAVSEVQSHLFQSGIAGACYQQYPDSVIW-----QWDQKGSGIGFTYER-QQQVYGMQ 587 Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571 H V A G ++WM+V + G+ R + + Sbjct: 588 RHDFGG--VVECMADLSGA--GADTVWMIVKRTIDGQTRRYIEIM 628 >gi|167041089|gb|ABZ05850.1| hypothetical protein ALOHA_HF400048F7ctg1g17 [uncultured marine microorganism HF4000_48F7] Length = 999 Score = 289 bits (739), Expect = 8e-76, Method: Composition-based stats. Identities = 111/707 (15%), Positives = 216/707 (30%), Gaps = 162/707 (22%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 + SF+ G++SPR +Q +L + +A N++ L G L P + Sbjct: 2 RIQALQSSFADGQISPR-MQGMVELESYKSSLATLENMVVLPQGSLTRRPGTFFAATTK- 59 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT----------P 112 R+ FS G +L FG+ ++ + + T Sbjct: 60 ANGQARLIPFSRGQGTSLVLEFGNLYIRFFANDGPVRTDDIAATYSQTTTTVTVTKSTHG 119 Query: 113 YTFKDNKSLEYA------------------------------------------------ 124 Y+ D L++ Sbjct: 120 YSASDEVYLDFTSGNGVDGFYTIATVADANTFTVTSTTSQTTSGNVNLSQRFEVTTTYTA 179 Query: 125 -VFGSTAVFVHKD-----HPPHHLLYIQDGDKISFTFDEIK--------FLPPPWLGDGM 170 A D HP H ++ S+ + P L DG Sbjct: 180 SQVNDIAFTQSADVLFLVHPDHVPARLERNATNSWALTNLLPSLISGTYTRPTTVLTDGP 239 Query: 171 ISGVK-SNAKLSISQADTSTARITSDMKIFKPLDKGRS-----------IRLGCHP---- 214 + ++ L+++ A S + + G L HP Sbjct: 240 FKAMNTTDTTLTVALAANSDFTTSFSNGSLSLEEVGTVSPSNVDVATNAFTLANHPLVNG 299 Query: 215 ---------------PEWAKNTNYSIGA-------YIVADDKVYRSLTTGRSGDRFGYSK 252 P + T+Y + + + +T + +K Sbjct: 300 QTVQFSSIPSGFASTPTLSATTDYFVVSATQNTFKLATSAGGTPVDITAAPTSADLTVNK 359 Query: 253 G---------ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303 T I T + + +AP G + V + ++ Sbjct: 360 SFVDKDVYIKVTASATTGINDDTGFQTTDVGRYIRLNTEIAPQIKHGYGEIVERTSTTV- 418 Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363 V Q +T + W + ++ GYP V + RL+F+G+ + +++ S F Sbjct: 419 VLVQLKTAIAGVGATTEWQLGSFSGTTGYPRTVQLYQQRLVFAGTAEESQTIFFSKTADF 478 Query: 364 YDFSLDGEYGCYD----------------PTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407 ++FS G A++ ++ + I W+ + + +G Sbjct: 479 FNFSATEPLGQQTGQRDSSGRSIVGEQIFEDAAISLTISSDTVDQIEWISE-DQRLTIGT 537 Query: 408 DTSLWLLSISLSKGLSIDFR-RVSGSGVYACPPVS----VGDCLVFVCGVGRRIKYIS-G 461 ++ L S F ++ +AC P + VG+ L++V GR+++ ++ Sbjct: 538 SGGIYQLYGSTDDLTLTPFNFSITKVSAWACDPTALPAKVGNNLLYVQNNGRKLRELAFD 597 Query: 462 STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGE 521 + + ++T ++ + ++ YQ++P+S++W + RL G + + Sbjct: 598 KVQDQYSAADLTLRSEDISESGLIATAYQDQPYSVLWCLRNDG-----RLAGLTYV-DLL 651 Query: 522 GDFAWHTHMISDKHY---------VLSAASFPNDNRGGTSLWMLVAL 559 AWH H I HY V S AS P L+M+V Sbjct: 652 QMRAWHRHTIGGAHYDDTHGSQAKVESIASIP--RGTHDQLYMIVKR 696 >gi|195541813|gb|ACF98016.1| hypothetical protein [uncultured bacterium 878] Length = 926 Score = 288 bits (736), Expect = 2e-75, Method: Composition-based stats. Identities = 90/518 (17%), Positives = 171/518 (33%), Gaps = 36/518 (6%) Query: 72 FSIPDGGYALLVFGDKK-LQIVVVRSSTKWSPALFGKTYKTP--YTFKDNKSLEYAVFGS 128 F + + + D+ I S + + Y P Y D +++A Sbjct: 144 FRVANRTASTFELNDQHGAPINGNGYSAFAAGGTAARVYTLPTTYQDADLAQMKFAQSAD 203 Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188 H ++ P L ++ +I F P+L V + S + Sbjct: 204 ILYIAHTEYVPRKLQRYGP---TNWVLSQIDFQDGPYLPVNGAQTVLTP---SAASGAGI 257 Query: 189 TARITSDMKIFKPLDKGRS-IRLGCHPPEWAKNTNYSIGAYI-VADDKVYRSLTTGRSGD 246 T + + I + G +R+ W I + + + T R Sbjct: 258 TISSATSVAITGAANNGAGAVRITSANHGWKTGDKIDITGIVGTTEANA--TWTVTRVNA 315 Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 G+T+ ++ T + WG K ++ ++SV Sbjct: 316 NTYDLNGSTFANAYASGGTAKPHIFESTDL-GRLIRIQHASTWGYAK-ITAYTSAVSVTA 373 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366 + F + +W + + + GYPS VTF+ RL + G V S + F Sbjct: 374 DVLSNFGGTAASSAWRLGLYSQGGGYPSCVTFYEGRLFWGGCPLAPTRVDGSMSSNYETF 433 Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL----SKGL 422 S A+ + + + WM +G+LVG W++ + Sbjct: 434 SPSSTASVVADDNAVAYPLDSGDVNNVLWMKDDEKGLLVGTKGGEWVVRANTLNGALTPT 493 Query: 423 SIDFRRVSGSGVY-ACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLF 480 ++ R + G Y PV G ++FV R+++ ++ + E GF ++T L+ H+ Sbjct: 494 NVKATRATTYGSYEGSQPVRTGKDIIFVQRKRRKVRNLNYTYEIDGFNAGDLTILSGHIG 553 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH----- 535 QL +Q EP VW+ +L + + + W ++ Sbjct: 554 RLEFGQLAFQSEPEGWVWMTR-----GDGQLPVLTYDRDEQKI-GWSRQIMGGYQDAARR 607 Query: 536 ---YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTV 569 V S S P+ N +W++V G+ + Sbjct: 608 RPPIVRSVCSIPDPNDARDEVWLIVQRMIDGKTERYVE 645 Score = 95.7 bits (236), Expect = 2e-17, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 35/96 (36%), Gaps = 1/96 (1%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 MV + ++F AGE +P + + R DLS + N +P GP P Sbjct: 1 MVRASPNFNAFDAGEFAP-ITEGRTDLSRYGFACRILENFMPRVVGPAARRPGTSFIAST 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRS 96 R + + F ++ FG+ ++ Sbjct: 60 RYPEKDALLVRFEYSTEQAYVMEFGNLYVRFYRNDG 95 >gi|323699364|ref|ZP_08111276.1| hypothetical protein DND132_1955 [Desulfovibrio sp. ND132] gi|323459296|gb|EGB15161.1| hypothetical protein DND132_1955 [Desulfovibrio desulfuricans ND132] Length = 698 Score = 284 bits (726), Expect = 3e-74, Method: Composition-based stats. Identities = 109/583 (18%), Positives = 189/583 (32%), Gaps = 76/583 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T +F+AGE+SPRL + R DLS + G N +G + + Sbjct: 1 MSIATPAITNFTAGEISPRL-EGRTDLSKYFNGCRTLLNFHVHPHGGTSRRAGFRFVAES 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVF-----GDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 + + F G +L F G ++++ A + + PYT Sbjct: 60 LGQAKPVLLIPFEYSAGQTYVLEFAEDAAGQGRMRVFSGHGLVLSDGAPYVRDI--PYTA 117 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL-GDGMISGV 174 + L+YA + + VH DHP ++ + D +T +E+ FL P G+ Sbjct: 118 DEFDELDYAQSAGSLILVHPDHPVREMVRVDHDD---WTLEEMTFLGQPEAWGENDYPSA 174 Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234 + + A T + T + R W +AD Sbjct: 175 VCFYEQRLVLAATRSRPATLWLSRTGEFSDFRLRTREVPLDGW--------RDLEIADAN 226 Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD 294 L G++GD G + ++R Y Sbjct: 227 G-DGLRDGKAGDNVLLLAGNGFE----ARDALKGQHPDGSTRYYRYKGTGNYATVNSNVT 281 Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354 ++ + ++ + +W F Sbjct: 282 LTFAAEPGANQLEAIWDEDGVLDDAAWD---------------------CFG-------- 312 Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414 G D E D A+ ++ A+ I ++ P + +G W L Sbjct: 313 -----VGDRTDGPAGAEPLEDD---AIEVTLSGRQANAIEFIVPR-RALWIGTAGGEWTL 363 Query: 415 SISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFN 470 S S S L ++ + G P +VG ++V GR+I+ +S E + Sbjct: 364 SASSSDPLTPSNVKAAQEGTGGASGVRPEAVGFAALYVQRAGRKIREMSYRYESDAYVSK 423 Query: 471 EITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 ++T L++H+ + QL Y +EP SI++ V L+ + + + AW + Sbjct: 424 DLTLLSEHITEGGLTQLAYVQEPDSILYGVR-----GDGILVALTYVPD-QEVAAWSRIV 477 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 V AAS ND LW+ V + GE R + L Sbjct: 478 TDG--VVERAASVYNDAEKRDELWITVLRTVNGETRRYVEYLE 518 >gi|317152064|ref|YP_004120112.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2] gi|316942315|gb|ADU61366.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2] Length = 698 Score = 284 bits (725), Expect = 4e-74, Method: Composition-based stats. Identities = 104/581 (17%), Positives = 185/581 (31%), Gaps = 72/581 (12%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M TT + +F+AGE+SPRL R DLS + G N +G + Sbjct: 1 MSITTPSLTNFTAGEISPRL-AGRIDLSRYFNGCRTLENFHVHPHGGATRRCGFRFVTQA 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGD---KKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 R+ + F +L FG+ + ++ V PY Sbjct: 60 LNPDRAGLLVPFESNADTAYVLEFGEDAAGQGRMRVFSGHGVVMAGDAPYALDVPYRADQ 119 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL-GDGMISGVKS 176 +L YA G + H HP L + + ++++F+ P +G V + Sbjct: 120 LDTLRYAQSGDELILAHPAHPVRRLTRLAHD---QWQLEDMEFIGCPETWTEGNHPSVVA 176 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 + + A T T + R W + D Sbjct: 177 FFEQRLVLAATPDKPGTLWFSRTGGIGDFRLRTREVPLDGW--------RDREITDSNS- 227 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296 L G++GD F G + K + + +T+R A G K V+ Sbjct: 228 DGLRDGKAGDTFLLLDGDGFEKLDGL----KGQHPDRTTRYYRYKGAANLTASGADKTVT 283 Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356 P+ + + W E P Sbjct: 284 -----FRHEPEGAQIEPIRDAEGELNNGFWECFE--PG---------------------- 314 Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416 D + A+ ++ A+ I ++ G+ + VG W L Sbjct: 315 --------DRTEAPAGEAPLDDDAIEVTLSGRQANAIEFLVARGK-LWVGTAGGEWTLGG 365 Query: 417 SLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472 SL ++ I + G A P +VG +++ GR+I+ ++ E + ++ Sbjct: 366 SLGDPVTPESIKASQEGSCGASATRPEAVGFATLYIQRAGRKIREMAYRYESDAYVSRDL 425 Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 T L++H+ + Q+ Y +EP SI++ V L+ + + + AW + Sbjct: 426 TILSEHITKPGLTQMAYVQEPDSILYCVR-----GDGALIALTYEPD-QEVAAWSRMLTD 479 Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572 V A+ N LW ++ + G ER + L Sbjct: 480 GA--VECVAAVYNQAGKRDVLWAVIRRTVNGLERRYVEFLE 518 >gi|146276492|ref|YP_001166651.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC 17025] gi|145554733|gb|ABP69346.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC 17025] Length = 754 Score = 280 bits (715), Expect = 5e-73, Method: Composition-based stats. Identities = 105/598 (17%), Positives = 182/598 (30%), Gaps = 66/598 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T+ + +FS+GEL P LL R D G+AK + +PL G + P Sbjct: 1 MTRTSPPQVAFSSGELDP-LLHRRFDYQRFQTGLAKCQGFLPLAQGGVTRAPGTIYRGRT 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 R + FS +L F ++++ R TP+ S Sbjct: 60 R-GDARCVLVPFSFAANDSCILEFTPGRMRVW--RYGALVMSGGAPYELVTPFDETSLSS 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V P L + ++T P+ + A Sbjct: 117 LSWVQSADVVYMVDGRQPMQRLARLALD---NWTIGAQALRKGPFRVQNTDEAITLTA-- 171 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNY------------- 223 A T +T+ F G ++L P W + Y Sbjct: 172 ---SAAKGTITLTASAAFFTADHVGSLMQLRPKDNTSVPAWTADEEYGSETWGGPLVGFE 228 Query: 224 ---SIGAYIVADDKVYRSLTTGRSG-DRFGYSKGATYVKDNNITWITVLNLSSKTSRESA 279 Y + ++G +++G V + W + + Sbjct: 229 TEPPADVLRRYGANTYLLVQGTKAGSTPPIHTEGDYMVDSDPTVWRFISDD--------- 279 Query: 280 SGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFH 339 V + + P GV W AW ++ GYPS V + Sbjct: 280 ---VGIVRITQILSPTQARAAVTRTIPTGCI----GVPTYRWSEGAWSKRYGYPSTVEIY 332 Query: 340 NNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399 RL + + + +V+ S+ G F DF G D T S + I + Sbjct: 333 EQRLAAAATPSEPRTVWFSAVGDFQDF----LDGTEDDQSFAYTVAGSTSVNRIINLQRG 388 Query: 400 GEGVLVGCDTSLWL----LSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 G+ + + S+ + F SG G P++ +F+ +R Sbjct: 389 AAGLHIFALGEEYSTRSETRSSVIGPKNAVFGLDSGVGSSTAKPITPSGNPIFISRDRKR 448 Query: 456 IKYISGSTEQGF-RFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGC 514 + + S +Q +++ A H+ Q+V+Q P W+ L L+ Sbjct: 449 VLEMVYSLDQDRPVSRVLSRTAQHVGGAGFEQIVWQAAPEPTAWLRL-----GTGELVAM 503 Query: 515 RFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV-ALSAGEERSFTVRL 571 + + E W ++ +V + A +P G L M V G+ L Sbjct: 504 VYDPDEE-VLGWAPVPVAGG-FVDALAVYPAAGGGSDILTMAVLREIDGQTVRMIEEL 559 >gi|242278913|ref|YP_002991042.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638] gi|242121807|gb|ACS79503.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638] Length = 698 Score = 265 bits (677), Expect = 1e-68, Method: Composition-based stats. Identities = 73/285 (25%), Positives = 123/285 (43%), Gaps = 28/285 (9%) Query: 303 SVAPQSQTLFQAGVSVVSWFMSA---------WGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 V P+ Q + S V W M W ++G+PS VTF RL F+ S + Sbjct: 245 LVHPEVQPYKLSRTSHVDWKMELVAFSSPPQEWNSEKGFPSCVTFFEERLCFAASPSNPQ 304 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 ++++S G++ DF++ D A T ++ + I WM + +++G W Sbjct: 305 TIWMSKAGSYEDFAVSSPVVDDD---ACTYTLSADQVNAIRWMVS-AKKLIMGTSGGEWW 360 Query: 414 LSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFR 468 LS S S+ RR + G A PPV VG ++F+ GR I+ +S S E G+ Sbjct: 361 LSGGSSLDSVTPNSVMVRRETTHGSAAIPPVVVGGVMLFLQREGRTIRELSYSFEADGYT 420 Query: 469 FNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527 ++T LA+HL I + YQ+ P S++W+ + ++G + E E +H Sbjct: 421 APDLTILAEHLTRSNSITEWAYQQSPDSVIWMTRDDG-----VMVGLTYQREHE-VVGFH 474 Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572 H K S + P + + ++ G R + R+ Sbjct: 475 RHTTDGKF--RSVCTVPGPTQEEVWV-VVEREVGGISRKYVERME 516 Score = 100 bits (248), Expect = 8e-19, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 36/120 (30%), Gaps = 4/120 (3%) Query: 53 LMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTP 112 E R + R+ F +L F D+ ++I ++P Sbjct: 167 GAVEAVQVREINPATRLIPFEFSTEQAYVLEFTDRNIRIF-KNGGIVVDDQGSPVEIQSP 225 Query: 113 YTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMIS 172 YT D + + VH + P+ L D + + + F PP + Sbjct: 226 YTETDLPGIRFTQSADVMYLVHPEVQPYKLSRTSHVD---WKMELVAFSSPPQEWNSEKG 282 Score = 74.1 bits (180), Expect = 6e-11, Method: Composition-based stats. Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 1/57 (1%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 + +FSAGELSPRL R DL+ ++ G+A+ N+ +G + R+ Sbjct: 3 VSLIMTNFSAGELSPRL-GGRVDLAKYSNGLAELENMFTHPHGGASRRTGFRFIREV 58 >gi|288959382|ref|YP_003449723.1| hypothetical protein AZL_025410 [Azospirillum sp. B510] gi|288911690|dbj|BAI73179.1| hypothetical protein AZL_025410 [Azospirillum sp. B510] Length = 665 Score = 263 bits (672), Expect = 5e-68, Method: Composition-based stats. Identities = 102/581 (17%), Positives = 185/581 (31%), Gaps = 109/581 (18%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T +++F+ GE+SPR+ + R DL V + N++ + GP P + Sbjct: 1 MSRATPAQYAFTGGEISPRI-KGRTDLERIRNAVEEMTNMVAVPEGPSERRPGTRFANST 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + S + F ++ + + + T+ Y+ D Sbjct: 60 K-GDASAVLIPFEFSTQQAYIIEATAGAFRFYRDGGQIVSGSSPYEVTHA--YSAADLPF 116 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 L + V HPP L ++ E P+L + S Sbjct: 117 LRWTQSADVLFLVCPGHPPRTLSRTGH---TAWNLAEWVMRDGPYL------DLNSGPTT 167 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240 + + +T+ +F D GR +RL W G + S+T Sbjct: 168 LTPSGTSGSVTLTASAALFAATDVGRLVRLRI-ANVW--------GWCRITAFGSVTSVT 218 Query: 241 TGRSGDRFGYSKGATYV----KDNNITWITVLNLSSKTSRESASGAVAP--YYVWGDIKD 294 G + A + TW T + +A V + + Sbjct: 219 ATVEAAWGGTTATAFWRLGAWGATTGTWPTAVTFHENRLAFAALQTVWLSCSGDFDNFGP 278 Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354 +++G + + T V+V+ W SA+G +L +G+ G + Sbjct: 279 TTENGTVAADNAITLTAADDQVNVIRWLRSAFG---------------VLIAGTSGGPFA 323 Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414 + S +ALT + + +H V + + Sbjct: 324 IQAS-----------------SLREALTPI--NATMPRVH----------VAGAADVQPV 354 Query: 415 SISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEIT 473 ++ + LVF RR+ ++ G+ ++ Sbjct: 355 RVATN--------------------------LVFPSRSRRRLHLLNAEFAAAGYSAPDLA 388 Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533 +A H+ + + YQ+EP S++W+VL+ L G + E + AWH H + Sbjct: 389 LVASHITRHAVKAMAYQQEPWSVMWLVLDDG-----TLAGVTYVPELD-ILAWHRHPLGG 442 Query: 534 KHY-VLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572 VLS A P +R LW++V AG R L Sbjct: 443 TAVKVLSVACIPAADR--DELWLVVERVVAGGIRRHVEILE 481 >gi|187736306|ref|YP_001878418.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC BAA-835] gi|187426358|gb|ACD05637.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC BAA-835] Length = 822 Score = 244 bits (622), Expect = 3e-62, Method: Composition-based stats. Identities = 69/258 (26%), Positives = 107/258 (41%), Gaps = 18/258 (6%) Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376 W A+G + GYP V FH RL F G+ G +++ S F F+ Sbjct: 409 DTNDWSFGAFGVRNGYPCTVEFHQGRLWFGGTPGQPQTLWASRVDDFSAFTPGIP----- 463 Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID---FRRVSGSG 433 + + + I W+ G+++G W LS + S+GL+ F R SG G Sbjct: 464 ADSPMILTMAASQQNRISWIASL-RGLMIGTSEGEWRLSATNSEGLNASNAGFERHSGVG 522 Query: 434 VYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEE 492 + +SV + L+FV G +++ + S E G++ +++ L+DHL + I+ Q Sbjct: 523 SASLDALSVENSLLFVQQGGMKVRELFYSLEADGYQTRDVSLLSDHLLGEGIVDWTVQRS 582 Query: 493 PHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND-NRGGT 551 VW VL C + AWH H + + +LS AS N Sbjct: 583 TAFHVWCVLGDGS------AVCMTLNREQNVVAWHAHRL-EHGRILSVASLRGSRNTPDE 635 Query: 552 SLWMLVALSAGEERSFTV 569 +W VA GEE TV Sbjct: 636 EVWFAVARGEGEEACITV 653 Score = 139 bits (349), Expect = 1e-30, Method: Composition-based stats. Identities = 60/381 (15%), Positives = 112/381 (29%), Gaps = 40/381 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF+AGEL+P L R DL ++G ++ N + +G L P + Sbjct: 1 MAKQVLQRLSFTAGELTPWL-AGRADLDPVSRGASRLINFLVSPFGGLRRRPGTRLVARA 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNK 119 R+ SF G +L G ++ + TP+ T + Sbjct: 60 GCREGMVRLVSFKYSTGVQFMLEVGRGYVRYF-KNGALLTDTEGGVLETLTPWKTDEQVS 118 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 +L V PP L D D + + ++F P+ +++ V+ + Sbjct: 119 NLRMQQLNDVIYCVEPSTPPMTLARYADDD---WRLEALEFSGIPYES-SLLNAVRLECR 174 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + + + + T+D +F P +G+ VA+ Sbjct: 175 M-VREGGVNRLLATADDDVFTPEMEGKEFL-----------RITRKYGETVAEGNQMPFY 222 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299 + + +++ + G P + + Sbjct: 223 HLTTLSRDLYKGETFSMNREDGWRQAYTCIRDFSRESDYQEGVDRPERYTAFFEKGADAS 282 Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS- 358 I V + +W + W GYP + NR V+ S Sbjct: 283 TRIYVNG-----AWTLETTGTWD-AEWEICRGYPDGSNYLPNR---------PELVWHSV 327 Query: 359 -----SFGAFYDFSLDGEYGC 374 G +F+L G Sbjct: 328 KSFQQREGFRNNFTLSGNEEE 348 >gi|290968641|ref|ZP_06560179.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781294|gb|EFD93884.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 1039 Score = 241 bits (614), Expect = 3e-61, Method: Composition-based stats. Identities = 80/465 (17%), Positives = 158/465 (33%), Gaps = 55/465 (11%) Query: 153 FT-FDEIKFLPPP------WLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205 ++ FD + P + + + + ++ITS IF K Sbjct: 368 WSDFDNVALFGNPTGACFIYFLAAEKEETPHPDSIEDTSLQITDSKITSSNSIFVQALKN 427 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 I++ + VY S S+ N W Sbjct: 428 TKIKIVQTQQSKSVEMTLGENEEEQTSGAVYVGEKWKISTSGIHNSRIVIERSLNGQQWH 487 Query: 266 TVLNLSSKTSRES-ASGAVAP-------YYVWGDIKDVSKDGRSISVAPQSQT------- 310 SK + SG+ G I KD ++SV + Sbjct: 488 EYRKYISKDDQNFMESGSEKEKCYLRVKAKTQGKINTERKDSDNLSVVLSALPFENEGII 547 Query: 311 -----------------LFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353 V V ++ S+W ++ GYP F +RL+F+G+K + Sbjct: 548 EITDIVSPKEIKYTAIEPVIPNVPVDAFAFSSWNDRNGYPKLSCFFQDRLVFAGTKKEPY 607 Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 S++ S G + +FS++ G A+ + + I + P + ++V + W+ Sbjct: 608 SLWFSRTGDYNNFSVEKAEGTVTEDSAIKLDLIVRNLYEIRHLVPSND-LIVLTSGNEWI 666 Query: 414 LSISLS-KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471 +S + + + G C P +G+ L++V G I+ S + + +E Sbjct: 667 ISGDTAITPTKCTPKVQTMRGASNCKPWHIGNRLIYVQRDGGTIRDFGYSYDSDNYNGDE 726 Query: 472 ITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530 + A HL +++ Y + P+S ++ V E + + C + + AW TH Sbjct: 727 LNLFASHLTKRHQMVSSAYCQNPYSTLYFVREDGE------IICLMLIKEQNVCAW-THW 779 Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEER--SFTVRLNL 573 + Y+ + G L+++V + E + + + +L Sbjct: 780 NTHGKYLDCCSVL---ENGKDYLYVIVERTNREAQIVRYLEKFDL 821 Score = 140 bits (352), Expect = 6e-31, Method: Composition-based stats. Identities = 49/323 (15%), Positives = 92/323 (28%), Gaps = 22/323 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M N T++SF+ GE+SP + R DL + + ++ N + YG + + Sbjct: 1 MQNVFITQNSFTTGEISPEV-AERTDLEKYKSALLQAENAVVSPYGSVSRRTGSKYIGAI 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + + F LL G K +++ W + TP+ + K Sbjct: 60 KYADKEAVLVPFMDSSDRSYLLEVGYKYIRV--------WKDETMEQEIDTPFEYP--KE 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG----DGMISGVKS 176 L + G TA +P + LL+ + + P + +S V Sbjct: 110 LNFTQSGDTAFICSGRYPVYELLH-----GRYWELRKFDIPKPYFDDIISAIENVSDVNY 164 Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 + + T T + G + + + Sbjct: 165 TESDTPVFSQTKAGDYTFTPTVSGLYKIVLFGGAGGKKGTIEHYAGSTKHDEAIYHYEYG 224 Query: 237 RSLTTGRSGDR-FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP-YYVWGDIKD 294 + G+ TY +R G V + G +D Sbjct: 225 VAGNEGQKKIVTVKLKAKTTYSIHVGKGGEDGDKHKKGIARGWEEGDVYNSFLNGGPGED 284 Query: 295 VSKDGRSISVAPQSQTLFQAGVS 317 + G S V ++ S Sbjct: 285 TTVKGNSDGVNIVAKGGATFTGS 307 >gi|260549511|ref|ZP_05823729.1| Bbp13 [Acinetobacter sp. RUH2624] gi|260407304|gb|EEX00779.1| Bbp13 [Acinetobacter sp. RUH2624] Length = 678 Score = 234 bits (597), Expect = 3e-59, Method: Composition-based stats. Identities = 87/548 (15%), Positives = 168/548 (30%), Gaps = 72/548 (13%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 ++SF+ G +SP + R D + + GVAK +N+ +G LV + Sbjct: 1 MQYSFNGGVISPD-MFGRIDQAKYQTGVAKCKNMYVELFGGLVYRAGFRYVHHYPKTLGK 59 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 R+ F + +L F + R + + PY + L YA Sbjct: 60 MRLIRFVFSEEQAVVLAFRAGAVNFFA-RGGMLLNNVGEPLEVELPYAEEHLMQLRYAQS 118 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 H D+PP ++ + S +S+ Sbjct: 119 ADVVTITHPDYPPRKIIRKGATEW-------------------------STEVVSVGYGL 153 Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAY-IVADDKVYRSLTTGRSG 245 T + + I +G H ++ +Y + A + +T Sbjct: 154 TPPQNVAATAHIEDKYKEG----GNMHDSYIERDYSYQVTAVDEQNESAASTKVTVKNDI 209 Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305 G T+ T + L S + + D + SI+ Sbjct: 210 TLAGNYNTITWDVVTGATRYNIFKLRSGLAS-----YIGETTETSFTDDNIETNGSIT-P 263 Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYD 365 P + F+ P+ V +H R ++ G + +S + Sbjct: 264 PLIRNPFEFN-----------------PTAVAYHGQRKVYGGGYQSPQWIRMSRTATDDN 306 Query: 366 FSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSI 424 F D ++ + + + + +LV ++W LS + S+ Sbjct: 307 FGYHIPTQDTD---SIQIRFAARDGNGVKHLITLND-LLVLTSGAMWKLSSDGAMTAASV 362 Query: 425 DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY---ISGSTEQGFRFNEITQLADHLFN 481 + + +G PV V VF + SG ++ +++ + LF+ Sbjct: 363 NMNKQYSTGANDVTPVEVDGAAVFASDQTGHVHEASLASGYNASYYQTLDLSIMCPQLFD 422 Query: 482 -QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 +I+ P +I++ V + LL + + +AW H K LS Sbjct: 423 GHKIIDCAAIRNPLNIIYFVRDDG-----VLLSLTYEP-QQQVWAWAEHHTDGKF--LSV 474 Query: 541 ASFPNDNR 548 A P +N+ Sbjct: 475 AEIPEENQ 482 >gi|260557972|ref|ZP_05830184.1| Bbp13 [Acinetobacter baumannii ATCC 19606] gi|260408482|gb|EEX01788.1| Bbp13 [Acinetobacter baumannii ATCC 19606] Length = 678 Score = 231 bits (589), Expect = 2e-58, Method: Composition-based stats. Identities = 80/560 (14%), Positives = 162/560 (28%), Gaps = 72/560 (12%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 ++SF+ G +SP + R D + + GVAK +NL +G +V + Sbjct: 1 MQYSFNGGVISPD-MFGRIDQAKYQTGVAKCKNLYVELFGGVVYRAGFRYVHHYPKTMGK 59 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 R+ F + +L + + PY L YA Sbjct: 60 MRLIRFVFSEEQAVVLAIRAGAINFFA-DGGMLLNENNEPLEVAVPYAEDHLMQLRYAQS 118 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 H ++PP ++ + I+ + + G G V + A + Sbjct: 119 ADVVTITHPNYPPRKIIRKSATEWIT-ELVTVGY------GVGTPQNVAATAHIEDKYKP 171 Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD 246 S + D + E A + + + Sbjct: 172 GG-----SMHDSYIERDYSYQVTAVDEQNESAASLKVVVQNDLTLAGNY----------- 215 Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 T+ + L S + + + S I Sbjct: 216 -----NTITWDAVTGANRYNIFKLRSGLASFIGETTETSFTDDNIETNGSITPPLIR--- 267 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366 E YP+ V +H R ++ G + +S +F Sbjct: 268 --------------------NPFEFYPTAVAYHGQRKVYGGGYKSPQWIRMSRTATDDNF 307 Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSID 425 D ++ + + + + +++ +LW +S + S++ Sbjct: 308 GYHIPTQDTD---SIQIRFAARDGNGVKHLVTMSDLLIL-TSGALWKMSADGAVTAASVN 363 Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI---SGSTEQGFRFNEITQLADHLFNQ 482 + +G PV V +F + I SG ++ +++ + LF+ Sbjct: 364 MNKQYSTGANDVTPVEVDGATIFSSDQTGHVHEISLASGYNASFYQTIDLSIMCPQLFDG 423 Query: 483 R-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 + I+ P +I++ V LL + + + +AW H + K LS A Sbjct: 424 QKIIDCALLRNPLNIIYFVR-----GDGVLLSLTYEPK-QQVWAWAEHHTNGKF--LSIA 475 Query: 542 SFPNDNRGGTSLWMLVALSA 561 P D++ L+ + Sbjct: 476 EIPEDDQSV--LYAFIERDG 493 >gi|254251749|ref|ZP_04945067.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158] gi|124894358|gb|EAY68238.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158] Length = 545 Score = 221 bits (562), Expect = 3e-55, Method: Composition-based stats. Identities = 56/265 (21%), Positives = 100/265 (37%), Gaps = 23/265 (8%) Query: 315 GVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 W + W +GYP V+ + RL +GS G V+ S+ G +YDF+ Sbjct: 129 TAPPDGWMLKTFMWNPTDGYPCAVSLYQQRLYAAGSSGYPERVWASATGLYYDFTPGT-- 186 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL---SISLSKGLSIDFRRV 429 D + V + I + + V + + S+ +I+ R Sbjct: 187 ---DDGDGFSYDVASDQVNQIMHLAS-SRILTVLTQGEEFTIDGGSVGSITPTNINVRSQ 242 Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLFNQRILQLV 488 S G PV VG+ L+F ++I+ ++ FR +T+LA H+ ++ + Sbjct: 243 SIYGTARPRPVRVGNELIFPQRAAKKIRSMAYDFNTDSFRSQNLTRLAAHITESGVVDIA 302 Query: 489 YQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNR 548 +Q EP +VW+V + L+ + + E + H S P + Sbjct: 303 FQAEPTPVVWMVR-----ADGVLISMTYDRD-ENVCGFARHTTDGAF--KSVCCIPGAD- 353 Query: 549 GGTSLWMLVALS-AGEERSFTVRLN 572 G L+ +V + G RL+ Sbjct: 354 -GDVLFAVVQRTINGNVVQNVERLD 377 >gi|257139843|ref|ZP_05588105.1| hypothetical protein BthaA_11681 [Burkholderia thailandensis E264] Length = 489 Score = 219 bits (557), Expect = 1e-54, Method: Composition-based stats. Identities = 56/320 (17%), Positives = 112/320 (35%), Gaps = 23/320 (7%) Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 + R G+ + I + + Sbjct: 18 GGTLGAVYEYGVGQAWRAQDVGSYVEINGGLVQLIAFESASRIFGVIKRELASTLTAPAS 77 Query: 320 SWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377 W + S W +GYP+ V+ RL +GS G + V+ S G + DF+ + G Sbjct: 78 GWALKSSMWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDG---- 133 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK---GLSIDFRRVSGSGV 434 +A + + + + + + ++ + +I+ S G Sbjct: 134 -EAFGYDMASDQVNQTVHLAS-AKILAALTQGEEFTVTGGSAGAITPTNINVDSQSVYGC 191 Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLFNQRILQLVYQEEP 493 PV VG+ +V+V G++++ ++ +R +T+LA H+ I+ + +Q EP Sbjct: 192 ARARPVRVGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEP 251 Query: 494 HSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSL 553 +VW+V + L+ + + E + H+ S P D G L Sbjct: 252 TPVVWMVR-----ADGVLVSMTYDRD-ENVCGFARHVTDGLF--KSVCCIPGDE--GDVL 301 Query: 554 WMLVALS-AGEERSFTVRLN 572 + +V + G + RL+ Sbjct: 302 FAVVQRTINGATVQYVERLD 321 >gi|83720451|ref|YP_441475.1| hypothetical protein BTH_I0919 [Burkholderia thailandensis E264] gi|83654276|gb|ABC38339.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 405 Score = 217 bits (551), Expect = 6e-54, Method: Composition-based stats. Identities = 51/253 (20%), Positives = 101/253 (39%), Gaps = 21/253 (8%) Query: 325 AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTA 384 W +GYP+ V+ RL +GS G + V+ S G + DF+ + G +A Sbjct: 1 MWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDG-----EAFGYD 55 Query: 385 VTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK---GLSIDFRRVSGSGVYACPPVS 441 + + + + + + ++ + +I+ S G PV Sbjct: 56 MASDQVNQTVHLAS-AKILAALTQGEEFTVTGGSAGAITPTNINVDSQSVYGCARARPVR 114 Query: 442 VGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500 VG+ +V+V G++++ ++ +R +T+LA H+ I+ + +Q EP +VW+V Sbjct: 115 VGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEPTPVVWMV 174 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS 560 + L+ + + E + H+ S P D G L+ +V + Sbjct: 175 R-----ADGVLVSMTYDRD-ENVCGFARHVTDGLF--KSVCCIPGDE--GDVLFAVVQRT 224 Query: 561 -AGEERSFTVRLN 572 G + RL+ Sbjct: 225 INGATVQYVERLD 237 >gi|42526655|ref|NP_971753.1| hypothetical protein TDE1145 [Treponema denticola ATCC 35405] gi|41816848|gb|AAS11634.1| hypothetical protein TDE_1145 [Treponema denticola ATCC 35405] Length = 647 Score = 193 bits (491), Expect = 5e-47, Method: Composition-based stats. Identities = 71/561 (12%), Positives = 163/561 (29%), Gaps = 83/561 (14%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 +F+ GE+S L R DL ++ V++ N ++ G + + + R Sbjct: 4 TNFAGGEVSKNLY-GRIDLPIYQNSVSRLENFDIMQTGGIKRRGGTERIGKLK---GYAR 59 Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK-TP----YTFKDNKSLEY 123 + F + + + G + ++I + + A F + TP Y D ++Y Sbjct: 60 LIPFIVNNTLSFIFEIGSEYIRIWKN--GSLLTLAGFPVEFSPTPDLPLYQKSDLSEIQY 117 Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK-LSI 182 A + H+ + P+ + + + + Sbjct: 118 AQTYDSLYLAHRHYKPYVIKWQGGDAFT-------------------FGSLNITGNAHKL 158 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242 + S + +F+ GR + + KV+ Sbjct: 159 PF--QGSDNYPSCVALFQ----GRLFF-----------ASTIREPQKIWASKVFEYENFT 201 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 + + + S+K ++S Sbjct: 202 YFDTVVSKT--------TQLKNPDLRVFSAKAVKDSDVLTELTKDFTDITNITDYYVSGH 253 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362 P+ + + A ++E + N Sbjct: 254 KGIPKDTKVLSVTSDSMKISKPATVDKEDIVLSIHLWRN-------ADSPQ------ADD 300 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL 422 + D + P A + I W+ P + +++G ++S W++S Sbjct: 301 YKDTEIINNV--TAPDHAFYFEIGSDKNDKIKWITPSKD-LIIGTESSEWVMS-DGVTAQ 356 Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLF- 480 I+ + S GV +G ++++ GR ++ + E ++ ++TQ A HL Sbjct: 357 RIEVQLQSRYGVADLQGSLIGRSVIYIGQGGRSLRDYAYDFQEHTYKSIDLTQAASHLLI 416 Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + Y P +++ LE + G AW ++ + + + Sbjct: 417 ESKAVDFDYTNSPVQKIYLSLEDGSACV-----LLYDKNT-GIAAWTKIVLGNGK-IKNI 469 Query: 541 ASFPNDNRGGTSLWMLVALSA 561 + P +G ++ V Sbjct: 470 VTVPGL-KGFDDVYFEVERKG 489 >gi|54302254|ref|YP_132247.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9] gi|46915675|emb|CAG22447.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9] Length = 919 Score = 185 bits (470), Expect = 2e-44, Method: Composition-based stats. Identities = 50/253 (19%), Positives = 93/253 (36%), Gaps = 20/253 (7%) Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373 S W + W GYP T+ RL + + +V+LS +F DFS Sbjct: 405 TSRSTYKWAIEIWRNSTGYPRCGTYFQQRLSMANTISHPQTVWLSRTDSFNDFSKTRPIL 464 Query: 374 CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSI----DFRRV 429 D + + + I + P +L LW L+ S + Sbjct: 465 ADDSMRYD---INSLQVNEIFNIVPLNSLLLF-TSGGLWSLAQDQQGAFSAESPPSVKMQ 520 Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLFNQ-RILQL 487 + G P+ G ++V R ++ I S + F ++T A HLF R+++ Sbjct: 521 NYEGANKLRPIVAGSTAIYVQQGDRIVRDIQFSWSSDSFEGVDLTVRASHLFKHKRVVEW 580 Query: 488 VYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDN 547 Y + P ++WV+ + + + E + + W H + K+ + AS + Sbjct: 581 AYAKNPDKLIWVIFDDGTAAT-----LTYMKE-QQIWGWCPHTTNGKY--KNVASV--EE 630 Query: 548 RGGTSLWMLVALS 560 +S++ +V Sbjct: 631 GSRSSIYFVVERI 643 Score = 165 bits (416), Expect = 3e-38, Method: Composition-based stats. Identities = 51/368 (13%), Positives = 100/368 (27%), Gaps = 27/368 (7%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 ++ S SAGELSP + R D + G+AK+ N +G + + P Sbjct: 5 LSQPSMSAGELSPE-MYGRVDTDHYRIGLAKAENFFVNYHGGISNRPGTT-LSYITARNE 62 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125 + F +L FG + +++ + + + TPY + L Y Sbjct: 63 VVALIPFQFSAFDSFMLEFGTEYMRV-MSKGKYITDNSGVKIQVVTPYLAGEILDLSYTQ 121 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 H++H + I + + + P+ + A + Sbjct: 122 SADVLTIFHRNHAIQQIKRYS---NIDWRVEPLINKLGPFESININESQFMYADKNGD-- 176 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPP----EWAKNTNYSIGAYIVADDKVYRSLTT 241 + S+ F G+ + L +W + + G Y Sbjct: 177 VGEQITLISNFDAFTSDLVGKMVYLDQEETGDISQWMQRYEVAEGDQTYNAGNYYICTKA 236 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD-IKDVSKDGR 300 + + V W +R++ G Y G + + Sbjct: 237 ELYNGKKAQTGDIAPVHSTGERWDGPGKFLPDDNRDANIGVRWAYLNSGYGVVKIISVTD 296 Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + + V W +P T R + L+ Sbjct: 297 ARHAICEVLVRLPDSVVGGERSKLTWN----FPGETT---QRTFSLATP--PLT-----S 342 Query: 361 GAFYDFSL 368 DF++ Sbjct: 343 NTMKDFTV 350 >gi|226940469|ref|YP_002795543.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9] gi|226715396|gb|ACO74534.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9] Length = 874 Score = 185 bits (469), Expect = 2e-44, Method: Composition-based stats. Identities = 68/381 (17%), Positives = 136/381 (35%), Gaps = 33/381 (8%) Query: 210 LGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLN 269 G A + + Y V K + G+ W + Sbjct: 329 TGSGARLSATVGSVACDGYSVTAIKTVTVIDGGKGYTSPSIVTVVKQDGRPITGWGPIHA 388 Query: 270 LSSKTSRE------------SASGAVAPYYVWGDIKDVSK-DGRSISVAPQSQTLFQAGV 316 S ++ + A+ P + G I V+ +G S AP + G Sbjct: 389 TYSVSTSPNTVQLAVTDSGGGSGAALEPVIIDGAITAVNVINGGSGYFAPVVSVSYAGGG 448 Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376 S ++ YP V++ R F+G+ +++++ G + D Sbjct: 449 SGATFGQPVVKSSGDYPGAVSYFEQRRCFAGTTRKPQNIWMTKSGTESNMGYSLPVRDDD 508 Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSG 433 + V+ A+TI + P + +L+ ++ W ++ S ++ I R S G Sbjct: 509 R---IAFRVSAREANTIRHIVPLAQLLLL-TSSAEWRVTSVNSDAITPRSISVRPQSYIG 564 Query: 434 VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQ-RILQLVYQE 491 PV + + L++ G ++ ++ + + GF +++ A HLF+ I+ + + + Sbjct: 565 ASNVQPVIINNTLIYASARGGHVRELAYNWQAGGFVTGDLSIRAPHLFDDFEIVDMAFGK 624 Query: 492 EPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGT 551 P +VW V +S L+G + E + AWH H S A+ Sbjct: 625 SPQPVVWFV-----SSSGCLIGLTYVPE-QQVGAWHWHDTDG--VFESCAAV--AEGAED 674 Query: 552 SLWMLVALSA-GEERSFTVRL 571 L+ ++ + G R + R+ Sbjct: 675 VLYCVIRRTVNGCSRRYVERM 695 Score = 166 bits (419), Expect = 1e-38, Method: Composition-based stats. Identities = 53/318 (16%), Positives = 94/318 (29%), Gaps = 19/318 (5%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + SF+ GE++P R D + + G+A RN + +GP ++ R+ Sbjct: 1 MATVKLLQRSFAGGEVTPEFF-GRIDDAKYQSGLAVCRNFVLAPHGPAMNRAGFAFVREV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSST-KWSPALFGKTYKTPYTFKDNK 119 + R+ F+ ++ G + ++ + PY + Sbjct: 60 KDSNLKVRLIPFTYSTTQTMVIELGAGYFRFHTQGATLMQPDAPDSPYEVSNPYREDELF 119 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP--PPWLGDGMISGVKSN 177 L Y VH +HPP L + ++ + P P + S Sbjct: 120 DLHYVQSADVMTLVHPNHPPQELRRLG---ATNWELKPVSLQPVIAPPENAAASTAGCSE 176 Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVAD---DK 234 AK T+ + + RS + +I A Sbjct: 177 AKYDYEYVVTAVMVDLVNESAASNVATVRS-------NVYETGCTNTISWSASAGAYRYN 229 Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD 294 VY+ G G + G + V DN ++ A + G Sbjct: 230 VYK--KEGGVYGYIGQTAGLSLVDDNISPDLSKTPPIYDNVFSVAGQIESVPVTAGGSFY 287 Query: 295 VSKDGRSISVAPQSQTLF 312 + G SV + LF Sbjct: 288 GTHTGIIQSVTVLNGVLF 305 >gi|291334666|gb|ADD94313.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] Length = 189 Score = 177 bits (448), Expect = 5e-42, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 78/175 (44%), Gaps = 13/175 (7%) Query: 401 EGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456 +++G + +S + +I ++ S +G ++VG+ +F+ R++ Sbjct: 5 RTLIIGTAGGEFAVSGGGTDIAITPTNILIKKQSNNGAANVDALAVGNATLFLQRARRKL 64 Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCR 515 + ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V +L+G Sbjct: 65 RELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG-----QLVGLT 119 Query: 516 FSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569 + E + AWH H+ S A+ P D+ W++ + G + + Sbjct: 120 YQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGSTKRYVE 172 >gi|291334457|gb|ADD94111.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] Length = 206 Score = 174 bits (440), Expect = 4e-41, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 78/176 (44%), Gaps = 13/176 (7%) Query: 400 GEGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455 V++G + +S + +I ++ S +G ++VG+ +F+ R+ Sbjct: 3 DGHVIIGTAGGEFAVSGGGTDIAITPTNILIKKQSNNGAANVDALAVGNATLFLQRARRK 62 Query: 456 IKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGC 514 ++ ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V +L+G Sbjct: 63 LRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG-----QLVGL 117 Query: 515 RFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569 + E + AWH H+ S A+ P D+ W++ + G + + Sbjct: 118 TYQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGSTKRYVE 171 >gi|291336928|gb|ADD96456.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787] Length = 138 Score = 173 bits (438), Expect = 8e-41, Method: Composition-based stats. Identities = 29/140 (20%), Positives = 49/140 (35%), Gaps = 3/140 (2%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +F+ GELSPRL R DL+ + G N+I +G Q + Sbjct: 1 MARVAVQLTNFTGGELSPRL-DGRNDLAKYPTGCKTLENMIVFPHGSAARRSGTQFVAEV 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F +L FG++ ++ +PY + Sbjct: 60 KDSSKETRLIPFEFSTTQTYMLEFGNQYIRFYKDNGQILS--GGSAYEISSPYLEAELFD 117 Query: 121 LEYAVFGSTAVFVHKDHPPH 140 ++YA H +HP Sbjct: 118 IKYAQSADVMYICHPNHPVK 137 >gi|291336965|gb|ADD96491.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C1587] Length = 474 Score = 147 bits (371), Expect = 4e-33, Method: Composition-based stats. Identities = 65/478 (13%), Positives = 138/478 (28%), Gaps = 83/478 (17%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + +F+ GE+ P LL++R D++ + + ++RN+I G + P Sbjct: 1 MSRAVSIQSNFTTGEVDP-LLRARIDINQYYNALEQARNVIVQPQGGIERRPG------- 52 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 F V ++ + L + T Sbjct: 53 ---------LQFIFE----------------VPSAANPQNGMKLVPFEFST--------- 78 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 +FVH + F + + + + Sbjct: 79 ---TQS-YMLLFVH---------------NRMYIFKDKELVTN----INSSGNDYLTTTI 115 Query: 181 SISQADTSTARITSDMKIFKPLDKG--RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238 + + T ++D I D + +R H + ++ + +S Sbjct: 116 TSTVLATMDHTQSADTLIVVQEDMAPKKIVRGAAHNTWTISDISFEF----IPKFNFTQS 171 Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298 TT G + + + E+ G + + S + Sbjct: 172 ETTINQTITPSAVDGNITITAGG---NVFASGNLNQYIEANDGMGR-ARITRFVSATSVE 227 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 + + G + + +W +GYP TFH RL F G K +++ S Sbjct: 228 AIVEIPFFNTTAIASGGTFIDGGYEDSWSGSKGYPRTATFHEGRLYFGGVKSRPNTIFAS 287 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL--LSI 416 F+DF+ G ++ ++ S + I M + + +L ++ Sbjct: 288 RVARFFDFNP----GEALDDDSIELTISTDSTNAITGMFSGRDLQIFTKGGEFFLPQSTL 343 Query: 417 SLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEI 472 ++ + G PV +F+ G+ ++ + E + N I Sbjct: 344 DPITPTNVVVNGATRRGSQEGIKPVGAESGTLFIQRAGKSLREFLFSDVELSYISNNI 401 >gi|296532340|ref|ZP_06895077.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267336|gb|EFH13224.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 626 Score = 147 bits (370), Expect = 5e-33, Method: Composition-based stats. Identities = 64/363 (17%), Positives = 123/363 (33%), Gaps = 45/363 (12%) Query: 218 AKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE 277 + NT++SI + + YR + G + S T L S+ + Sbjct: 133 SSNTSWSIAPWSFVREPFYRFASPGVTLAPSATSGSVT------------LTASAAAFQP 180 Query: 278 SASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVT 337 +G + + G V+ + S + + W +A+ G+P Sbjct: 181 GHAGVR--FRLGGKRVLVTAVASATSATASVEETLPGTAASADWDEAAFSAVRGWPVTAC 238 Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397 FH +RL+ GS+ ++LS G ++F + G +A+ + + I Sbjct: 239 FHQDRLVLGGSRDLPNRLWLSRSGDLFNF----DLGSGLDDQAIEFGLLSDQVNAIR-AV 293 Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV---YACPPVSVGDCLVFVCGVGR 454 G + V + W+++ SI R + G PPV V +FV G+ Sbjct: 294 FSGRHLQVFTSGAEWMVTGEPMTPASIQLHRQTRIGSPVARIIPPVDVDGSTIFVARSGQ 353 Query: 455 RIKYISG-STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLG 513 + + +Q ++ N++ +A HL + + Y + ++ V ++ L Sbjct: 354 AVHEYAYTDVQQAYQANDLALVARHLVQTPV-SMAYDQT-RRLLHVAMQGGW-----LAT 406 Query: 514 CRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573 E AW L+ ++W V + +RL Sbjct: 407 LTLYR-AEQVTAWTRQDTDGAFRALA--------EIDGTVWCAVERAGA------MRLER 451 Query: 574 LDD 576 DD Sbjct: 452 FDD 454 Score = 141 bits (355), Expect = 3e-31, Method: Composition-based stats. Identities = 44/211 (20%), Positives = 76/211 (36%), Gaps = 21/211 (9%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M TK SF+AGEL +LL R DL + G + RN+ G L P ++ + Sbjct: 1 MAAGRSTKTSFTAGELGDQLL-GRGDLRAYENGARRLRNVFIQPTGGLTRRPGLRHVAEL 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 P R+ +F L+V + L++ + + P+T + Sbjct: 60 ---PGPARLIAFEFNTEQTYLVVLTHQGLRVFLGDVQVA--------SLAGPWTAAMLDA 108 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + + T + +H D P + S++ F+ P+ S Sbjct: 109 IAWTQSADTLLLLHPDMVPQRVTRSS---NTSWSIAPWSFVREPFYRFA------SPGVT 159 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLG 211 A + + +T+ F+P G RLG Sbjct: 160 LAPSATSGSVTLTASAAAFQPGHAGVRFRLG 190 >gi|307946248|ref|ZP_07661583.1| hypothetical protein TRICHSKD4_4953 [Roseibium sp. TrichSKD4] gi|307769912|gb|EFO29138.1| hypothetical protein TRICHSKD4_4953 [Roseibium sp. TrichSKD4] Length = 681 Score = 146 bits (369), Expect = 8e-33, Method: Composition-based stats. Identities = 104/585 (17%), Positives = 173/585 (29%), Gaps = 82/585 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 + + +F+AGEL P LL R L + G N++ + G + + Sbjct: 2 VARPGRLQSAFTAGELDP-LLHERSQLKYFSTGADHMENVVSIPQGGFGLRGGLLDIGAV 60 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTP-YTFKDNK 119 P ++R+F F DG LVF K++ W + + P + Sbjct: 61 D--PAASRLFDFKASDGSAYDLVFAPGKME--------AWGNSGKLQDLAIPALSETMLP 110 Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179 L A T + +H D P + + +++ D + P G Sbjct: 111 GLNDAQQRDTMILLHADLQPQRIKHAGPQ---AWSADAVPLTGLPSYDYGA--------- 158 Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239 + S + R+ F LD L E SIG R Sbjct: 159 -TYSNGVAAVWRLE-----FVGLDANSIFTLTISQEE-----TVSIGYTTAMGTLASRVR 207 Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESA-SGAVAPYYVWGDIKDVSKD 298 T D + G + + + A SG V + + Sbjct: 208 TA--VQDLPNVAPGISVASAGGSKIAVTFSGENNAGDGWAVSGNVINKADAAILAAKT-- 263 Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 ++ VAP + G+P F+N RLL G KG + S Sbjct: 264 --TVGVAPGEPVI---------------SSVRGWPRCGAFYNQRLLLGGFKGLPNAWMFS 306 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418 G +++F D + + + V + + P + W+ L Sbjct: 307 LQGDYFNF--DERFSAANGPALIPMDVDGGEV--VEQIVPSRNLAIFTNGAEYWIAERGL 362 Query: 419 SKGLSIDFRRVSGSGVYACPPVSVGDCLV-FVCGVGRRIKYISG-STEQGFRFNEITQLA 476 S+ + + GV P+ + + FV G I E F +I+ L Sbjct: 363 SRTEPPNHVQAGERGVKNGVPIVANEGALNFVSSTGSVIGEFRYTDVEGNFVSRDISLLG 422 Query: 477 DHLFNQRILQLVYQEEPHSIV----WVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 HL + + S +VLE LL + A+ Sbjct: 423 SHLIID-VKDQAMRRAEKSTSGNLNGIVLEDGQARLATLL------REQDVTAFSRMTSD 475 Query: 533 DKHY-VLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576 H+ +S G + +V AG V LLD+ Sbjct: 476 SGHFKAVSV-------NGRNEMSWIVDRPAGRRLERLVTGYLLDE 513 >gi|83313369|ref|YP_423633.1| hypothetical protein amb4270 [Magnetospirillum magneticum AMB-1] gi|82948210|dbj|BAE53074.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 634 Score = 146 bits (367), Expect = 1e-32, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 73/206 (35%), Gaps = 15/206 (7%) Query: 6 WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65 +TK SF+AGE+ L R DL+L+A G RN++ G + P ++ R Sbjct: 7 FTKTSFTAGEVDVDL-AGRGDLALYANGAKSLRNVVVAPIGGVRRRPGLRHVAPAR---G 62 Query: 66 SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125 R+ +F LL D ++ I ++ +TP++ L + Sbjct: 63 PGRLIAFEFNTEQTYLLALSDHRMDI--------YADGAKVAELETPWSTAQVAQLSWTQ 114 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 T + VH D P + S+ + + + +A Sbjct: 115 SADTLLVVHPDVEPRKITRTGAN---SWVLETWSYYQEDGILYVPTHKFAKDAVTLTPSG 171 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLG 211 + T +T+ +F G R+G Sbjct: 172 TSGTITLTASEAVFDAAHAGCRFRVG 197 Score = 133 bits (334), Expect = 8e-29, Method: Composition-based stats. Identities = 65/380 (17%), Positives = 123/380 (32%), Gaps = 49/380 (12%) Query: 217 WAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSR 276 W ++ + + +V D R +T R+G + +Y +++ I ++ + Sbjct: 112 WTQSADTLL---VVHPDVEPRKIT--RTGANSWVLETWSYYQEDGILYVPTHKFAKDAVT 166 Query: 277 ESASGAVAP------------------YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSV 318 + SG + V G +S + + + + Sbjct: 167 LTPSGTSGTITLTASEAVFDAAHAGCRFRVGGKQVLISAVTSATQAQAEVKQTLGGTAAT 226 Query: 319 VSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPT 378 W ++ G+P V FH RL GS+G ++LS ++F + G Sbjct: 227 EDWEEQSFSPLRGWPVSVCFHQGRLAIGGSRGLPNRLWLSKSMDLFNF----DLGTGLDD 282 Query: 379 KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV---Y 435 +A+ ++ I G + V + W++ S I R + G Sbjct: 283 EAIEFSLLSTQVDAIR-AVFSGRHLQVFTSGAEWMVVGSPLTPTKIQLNRQTRVGSPVDR 341 Query: 436 ACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPH 494 + PP V FV GR ++ + +Q ++ N+++ +A H+ N + Y Sbjct: 342 SVPPRDVDGATHFVSRSGRDLREFLFADVDQAYQANDLSMVAKHVMNTPV-DQDYDAS-R 399 Query: 495 SIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLW 554 + VV+ + + E AW S A D Sbjct: 400 RLFHVVM-----ADGLMATLTVYR-AEKVTAWTVFETQGAF--RSVAVVDGDTH------ 445 Query: 555 MLVALSAGEE-RSFTVRLNL 573 +LV F LNL Sbjct: 446 VLVERGGSHVIECFDDTLNL 465 >gi|209966375|ref|YP_002299290.1| hypothetical protein RC1_3113 [Rhodospirillum centenum SW] gi|209959841|gb|ACJ00478.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 638 Score = 145 bits (365), Expect = 2e-32, Method: Composition-based stats. Identities = 46/215 (21%), Positives = 71/215 (33%), Gaps = 14/215 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K +F+ GELSP LL R DL + G RN++ L G + P Sbjct: 1 MTRLRSVKAAFTGGELSPDLL-GRGDLRSYETGALALRNVLILPTGGVTRRPGTAYLATL 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 P R+ +F+ LL F D++L++ +TP+T Sbjct: 60 ---PGPGRLAAFAFDTEQAYLLAFTDRRLEVF--------RDGATEAVLETPWTAGQLAQ 108 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKI--SFTFDEIKFLPPPWLGDGMISGVKSNA 178 L + + H D PP ++ D ++ F +K L A Sbjct: 109 LAWTQSADVLLVCHPDVPPRRIVRSGDRRWRCEAWRFSTVKTADGRALQRLPFHRFADAA 168 Query: 179 KLSISQADTSTARITSDMKIFKPLDKGRSIRLGCH 213 R+ + +F GR RL Sbjct: 169 VTLTPSGTRGRVRVRASAPVFDGAHAGRPFRLRRR 203 Score = 128 bits (321), Expect = 3e-27, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 79/252 (31%), Gaps = 23/252 (9%) Query: 312 FQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGE 371 + W A+ G+P FH +RL+ GS+ ++LS G +DF Sbjct: 224 VPDAEPSIDWDEPAFSPLRGWPVSACFHQDRLVIGGSRDLPNRLWLSRSGDLFDFDP--- 280 Query: 372 YGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSG 431 G + +A+ A+ + I + G + V + W ++ + R S Sbjct: 281 -GEGEDDEAIEFAILSDQVNAIRQVFS-GRHLQVFTTGAEWAVTGEPLTPKEVRLDRQSR 338 Query: 432 SGVY---ACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLV 488 G P V +F G +++ E + ++T A HL + Sbjct: 339 VGSGPGRQIPAREVDGATLFAGRDGAVREFLWTDLESSYSTTDLTLAAGHLCRAPVE--- 395 Query: 489 YQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNR 548 +P + + ++ + + E W V S A + Sbjct: 396 LDVDPGRRLLLAVQ----ADGGVAALTLDR-AEQVTGWTRLETDGA--VRSLAVVRGEVH 448 Query: 549 GGTSLWMLVALS 560 W++ Sbjct: 449 -----WLVERQG 455 >gi|291336926|gb|ADD96454.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787] Length = 158 Score = 144 bits (362), Expect = 5e-32, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 63/141 (44%), Gaps = 6/141 (4%) Query: 369 DGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS----KGLSI 424 D +G ++ + + I +M +++G + +S + +I Sbjct: 3 DNYHGTVADDDSIIYTIASNQVNAIRFMTAT-RTLIIGTAGGEFAVSGGGTDIAITPTNI 61 Query: 425 DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR 483 ++ S +G ++VG+ +F+ R+++ ++ + + G+ ++T LA+H+ Sbjct: 62 LIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGG 121 Query: 484 ILQLVYQEEPHSIVWVVLEPK 504 QL YQ+EP+ ++W V Sbjct: 122 FKQLSYQQEPNQVIWGVRNDG 142 >gi|144898783|emb|CAM75647.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 635 Score = 144 bits (362), Expect = 5e-32, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 74/211 (35%), Gaps = 15/211 (7%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 N T K +F+AGELS +L R DL+ + G + RN+ G + P ++ + Sbjct: 5 NITLAKTNFTAGELSLDML-GRGDLAAYGNGAKRLRNVFIAPIGGVSRRPGLR---HVDI 60 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 R+ +F LLV D L I ++ + TP+T + + Sbjct: 61 ARGKGRLIAFEFNTEQTYLLVLTDLHLDI--------YADGVAVAHVDTPWTEAQLQQIN 112 Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182 + T + VH + P L ++T F + ++ Sbjct: 113 WTQTADTLLIVHPEVAPRKLTRTAHS---AWTISNWMFHEADGVLFQPYHKFAADEVTLQ 169 Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCH 213 A + + +T+ F G +RL Sbjct: 170 PSATSGSITLTASAAFFVAGHVGTRLRLQQK 200 Score = 143 bits (359), Expect = 1e-31, Method: Composition-based stats. Identities = 49/286 (17%), Positives = 99/286 (34%), Gaps = 26/286 (9%) Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 +++ + + + + W A G+P V FH +RL+ GS+ Sbjct: 202 VEITAIASATQASATVKQNLVNTSAHKDWEEQALSAVRGWPVSVCFHQDRLVIGGSRDQP 261 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 ++LS ++F + G +A+ A+ + I + G + V + W Sbjct: 262 NRLWLSKSSDLFNF----DLGEALDDEAIEFALLSDQVNAIRHVFS-GRHLQVFTSGAEW 316 Query: 413 LLSISLSKGLSIDFRRVSGSGV---YACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFR 468 ++S SI R + G PP V +FV G+ ++ + EQ ++ Sbjct: 317 MVSGQPLTPSSIQLTRQTRVGSPIDRTVPPRDVDGATLFVSRNGKDLREFLFADVEQAYQ 376 Query: 469 FNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528 ++ LA H+ + Y + ++ V+ L E AW Sbjct: 377 SGDLAMLAKHVMLAPV-DQDY--DAGRRLFHVVM----GDGGLATVTVYR-SEKVTAWTG 428 Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEE-RSFTVRLNL 573 H+ + + ++ +++LV F L+L Sbjct: 429 HVTAGRFLAVAVV--------EGEVYVLVEREGIVSVECFDESLSL 466 >gi|288959323|ref|YP_003449664.1| hypothetical protein AZL_024820 [Azospirillum sp. B510] gi|288911631|dbj|BAI73120.1| hypothetical protein AZL_024820 [Azospirillum sp. B510] Length = 632 Score = 138 bits (346), Expect = 3e-30, Method: Composition-based stats. Identities = 59/325 (18%), Positives = 104/325 (32%), Gaps = 42/325 (12%) Query: 270 LSSKTSRESASGAVAPYYVWGDIKDVSKDGR----------------SISVAPQSQTLFQ 313 + T S +G + D +DG + V + Sbjct: 162 DPAVTVTPSGTGGAITVTASAPVFDPRQDGTRLRIRGKQLLVTGVVSATQVNATVKETLA 221 Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373 W A+ G+P FH +RL+ GS+ ++LS ++F + G Sbjct: 222 DTQPTPQWEEQAFSALRGWPVSAAFHQDRLVIGGSRDLPNRLWLSRSAQIWNF----DLG 277 Query: 374 CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSG 433 +A+ + + + G + V + ++++ S+ +R + G Sbjct: 278 EGLDDQAIEFGILSDQVNAVR-AVFSGRHLQVFTSGAEYMVTGDPLTPQSMQVKRQTRIG 336 Query: 434 V---YACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEITQLADHLFNQRILQLVY 489 A PP V +FV R I+ + TE ++ N++ LA HL Y Sbjct: 337 SPMDRAIPPRDVEGATLFVPRNRREIREFLFTDTEAAYQANDLALLARHLVASP-RDQDY 395 Query: 490 QEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRG 549 + +++V +E L E AW V S A+ Sbjct: 396 DQN-RRLLFVAMEDG-----TLGALTAYR-AEDVTAWTLLETDGA--VRSVAAV------ 440 Query: 550 GTSLWMLV-ALSAGEERSFTVRLNL 573 G ++ LV F LNL Sbjct: 441 GDEVYALVERRGFWTIERFDDGLNL 465 Score = 136 bits (342), Expect = 9e-30, Method: Composition-based stats. Identities = 46/211 (21%), Positives = 69/211 (32%), Gaps = 15/211 (7%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M K +F+AGE+S RLL R DL + G RNL G + + Sbjct: 2 MGRLHQVKTNFTAGEVSRRLL-GRGDLKAYDNGALALRNLFIDPTGGVTRRSGLAF---T 57 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L P R+ +F LLVF D+++ + + P+T Sbjct: 58 ALAPGDGRLVAFERNSEQTYLLVFTDRRIDVF--------QGGSRLASVAAPWTLTQLAQ 109 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180 + + T + H D PP L GD + E F L A Sbjct: 110 ITWTQSADTLLVCHPDLPPRKLTR---GDDGGWALAEWAFAVEGGLVRTPFHRFGDPAVT 166 Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLG 211 +T+ +F P G +R+ Sbjct: 167 VTPSGTGGAITVTASAPVFDPRQDGTRLRIR 197 >gi|291334718|gb|ADD94364.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] Length = 135 Score = 137 bits (344), Expect = 5e-30, Method: Composition-based stats. Identities = 28/125 (22%), Positives = 58/125 (46%), Gaps = 9/125 (7%) Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505 +F+ R+++ ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V Sbjct: 1 MFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG- 59 Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEE 564 +L+G + E + AWH H+ S A+ P D+ W++ + G Sbjct: 60 ----QLVGLTYQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGST 113 Query: 565 RSFTV 569 + + Sbjct: 114 KRYVE 118 >gi|291334514|gb|ADD94167.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291336446|gb|ADD96001.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 153 Score = 136 bits (343), Expect = 7e-30, Method: Composition-based stats. Identities = 28/125 (22%), Positives = 58/125 (46%), Gaps = 9/125 (7%) Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505 +F+ R+++ ++ + + G+ ++T LA+H+ QL YQ+EP+ ++W V Sbjct: 1 MFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG- 59 Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEE 564 +L+G + E + AWH H+ S A+ P D+ W++ + G Sbjct: 60 ----QLVGLTYQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGST 113 Query: 565 RSFTV 569 + + Sbjct: 114 KRYVE 118 >gi|83721618|ref|YP_441474.1| gp12 [Burkholderia thailandensis E264] gi|257139844|ref|ZP_05588106.1| gp12, putative [Burkholderia thailandensis E264] gi|83655443|gb|ABC39506.1| gp12, putative [Burkholderia thailandensis E264] Length = 188 Score = 136 bits (342), Expect = 1e-29, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 44/134 (32%), Gaps = 4/134 (2%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T + + +AGELSP L + DL +A GV N IP G ++ Sbjct: 1 MAKITTIQSNLNAGELSPPL-EGHIDLDRYANGVKTMLNAIPQIEGGARRRFGFRQVAAT 59 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 + + R+ F + GD + + TP++ Sbjct: 60 K-TTGATRLVPFVFSKSQAYFVELGDAYARFYTDSGQ--IQQSGVPIELATPWSASQLFE 116 Query: 121 LEYAVFGSTAVFVH 134 LEY T H Sbjct: 117 LEYTQNSDTMFIAH 130 >gi|77734533|emb|CAI59394.2| hypothetical protein pSG3.03 [Sodalis glossinidius] Length = 517 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 55/122 (45%), Gaps = 13/122 (10%) Query: 453 GRRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPR 510 ++ ++ S + GF+ N++T LA+H F ++L + P S+VW V Sbjct: 188 RSAVRDLAYSFDVDGFQGNDLTVLANHFFTGFQLLDWAFTITPLSVVWCVRNDG-----T 242 Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569 LLG + E + AWH H + K+ + S +L+ +V + G+ R + Sbjct: 243 LLGLTYLRE-QQVAAWHQHPAAGKY--EAVCSI--SEGTEDALYCVVNRTIQGQPRRYVE 297 Query: 570 RL 571 RL Sbjct: 298 RL 299 >gi|89886023|ref|YP_516220.1| hypothetical protein SGPHI_0042 [Sodalis phage phiSG1] gi|89191758|dbj|BAE80505.1| conserved hypothetical protein [Sodalis phage phiSG1] gi|125470053|gb|ABN42245.1| gp40 [Sodalis phage phiSG1] Length = 517 Score = 115 bits (287), Expect = 3e-23, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 55/122 (45%), Gaps = 13/122 (10%) Query: 453 GRRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPR 510 ++ ++ S + GF+ N++T LA+H F ++L + P S+VW V Sbjct: 188 RSAVRDLAYSFDVDGFQGNDLTVLANHFFTGFQLLDWAFTITPLSVVWCVRNDG-----T 242 Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569 LLG + E + AWH H + K+ + S +L+ +V + G+ R + Sbjct: 243 LLGLTYLRE-QQVAAWHQHPAAGKY--EAVCSI--SEGTEDALYCVVNRTIQGQPRRYVE 297 Query: 570 RL 571 RL Sbjct: 298 RL 299 >gi|48696643|ref|YP_024422.1| hypothetical protein VP2p15 [Vibrio phage VP2] gi|40950041|gb|AAR97632.1| hypothetical protein [Vibrio phage VP2] Length = 594 Score = 103 bits (257), Expect = 6e-20, Method: Composition-based stats. Identities = 64/346 (18%), Positives = 116/346 (33%), Gaps = 42/346 (12%) Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 G + A +V D V+ + R + Y GD + GR I V P Sbjct: 80 EVGNTNIAVWVNDV----RQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHP 135 Query: 307 QSQTLFQAGVSVVSWFM---------SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 Q + +W + W YP V NR+ + GS + Sbjct: 136 ALQPKRLYRDNNNAWQFVNMHTGAVPAEWSPSN-YPQTVGIFQNRVWYVGSPVHRTYFWA 194 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 + G D + DP + W+ + + +G + + L+ S Sbjct: 195 TRAGKLEDIAPSTANNPNDPISFVGIMEGTPC-----WIIASSDVLTIGTTINDYQLAAS 249 Query: 418 LS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEIT 473 + RR S G A + + ++F ++ ++ E + +E++ Sbjct: 250 TGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCSRNKSKVYAMNYVREQDNWIPDEMS 309 Query: 474 QLADHLFN-------QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 A HLF + ++ Y + +WVVLE ++ C F + AW Sbjct: 310 SQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLE-----NGQINYCCFDRTTDTK-AW 363 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS---AGEERSFTV 569 +S + AA+F D ++ V S G ++++TV Sbjct: 364 TQLELSGGKVIDIAAAFNPD---SDYAYVAVVRSKAINGVQKNYTV 406 Score = 56.8 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 48/329 (14%), Positives = 91/329 (27%), Gaps = 30/329 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +++ SF G ++PRL + + + + + + N + G L++ +E C+ Sbjct: 2 ADFSQTSFKGGVIAPRLQFNEYESA-YHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDG 60 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVR-----SSTKWSPALFGKTYKTPYTFKDN 118 ++ G+ + + V ++T +T Y Sbjct: 61 EVRLFRLPAVDAPSNDVIVEVGNTNIAVWVNDVRQVVANTPSEWRNTIDRIQTAY--DTI 118 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS-- 176 A + VH P L + + F +P W V Sbjct: 119 GDDAGAANTGRLIMVHPALQPKRLYR-DNNNAWQFVNMHTGAVPAEWSPSNYPQTVGIFQ 177 Query: 177 -------NAKLSISQADTSTARI--TSDMKIFKPLDKGRSIRLGCHPPEWAKNTN----- 222 + T ++ + P D + + P W ++ Sbjct: 178 NRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGTPCWIIASSDVLTI 237 Query: 223 -YSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASG 281 +I Y +A S+T + R +G V V+ S S+ A Sbjct: 238 GTTINDYQLAAS-TGVSVTAATAILRRSSVQGTAAV-QGIPAEEQVIFCSRNKSKVYAMN 295 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQT 310 V W I D P S Sbjct: 296 YVREQDNW--IPDEMSSQAQHLFTPISSA 322 >gi|259419134|ref|ZP_05743051.1| hypothetical protein SCH4B_4402 [Silicibacter sp. TrichCH4B] gi|259345356|gb|EEW57210.1| hypothetical protein SCH4B_4402 [Silicibacter sp. TrichCH4B] Length = 715 Score = 102 bits (254), Expect = 2e-19, Method: Composition-based stats. Identities = 96/531 (18%), Positives = 169/531 (31%), Gaps = 53/531 (9%) Query: 3 NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62 T + FS G++ P Q R D+ L A+ V + N + L G + M+ Sbjct: 5 KETIWQKDFSLGQVRPE-AQERDDIDLVARSVKEGLNCVVLSTGQMEGRSGMRFLNATAS 63 Query: 63 DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122 + +G L F L + ++ +++ + + + Sbjct: 64 SQGREV----DLGEGRVFDLHFVPSGLILYDSNNTVEYTGNITWTAAPKKWGIYTFDEIS 119 Query: 123 YAVFGS----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNA 178 + V + + + P L+ +DG S++F E+ F G S + N Sbjct: 120 FWVVADPDSSSILIGSQHFPIQALILNEDG---SWSFGEMAFATG-LAGAIHQSYWRYNE 175 Query: 179 KLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237 +SI A T +T+ I+ +G +IR +N +G + + Sbjct: 176 TVSIQPSARTGAITVTASEAIWTADHEGMAIR--------YQNREIILGTLVSS----TV 223 Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESA-SGAVAPYYVWGDIKDVS 296 Y + V + + ++ + +G+V Sbjct: 224 INAAVTEELPPTYDITVSSVSNYQVGEAVEHSVLGGQGIITGIAGSVITVMATSRYDGFD 283 Query: 297 KDGRSISVAPQSQTLFQAGVSVVS------WFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350 VAP + A + + W M GY + H +R+ G Sbjct: 284 TVASPKLVAPNAAQPISAVAAAATPAATVIWEMQMQSPVHGYAGYAVRHLSRVFLCDFPG 343 Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDT- 409 + S GA DF + E V S T+ +M + + + Sbjct: 344 APQAFAASVVGAINDFKMGSE-----DADGFVDTVGADSGGTLRFMASVEDLLFLTSKGI 398 Query: 410 -SLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY--ISGSTEQG 466 S S +I R S G + P++V D VFV VG+RI ++G Sbjct: 399 YSHQTRDGSAITPATIRPVRFSRVGCASVEPIAVDDGCVFVDAVGQRIYAATLAGDIYTK 458 Query: 467 FRFNEITQLADHLFNQRILQLVY-------QEEPHSIVWVVLEPKDNSFPR 510 +R +T L L I VY E S V+VV + + Sbjct: 459 WRAEPMTSLHPQL----IKDAVYLGATSSGSENAESFVYVVNSDGSVALGQ 505 >gi|325971691|ref|YP_004247882.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy] gi|324026929|gb|ADY13688.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy] Length = 551 Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats. Identities = 45/275 (16%), Positives = 87/275 (31%), Gaps = 60/275 (21%) Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSF--------GAFYDFSL--------------- 368 YPS V NRL FS + + ++S F F + Sbjct: 166 YPSVVGICQNRLWFSAAILKPYTTWVSRPPYDGSNNHHDFTTFDVIEVNTEVIKDPSTWP 225 Query: 369 -------DGEYGCYDPTK----------------ALTTAVTDFSASTIHWMHPFGEGVLV 405 D D +K A+ + TI W + + + Sbjct: 226 KTTNEQGDEMIDFSDSSKFVETVKEIEEVINAKCAMEIELASGRNDTIKW-VAGMDNIFI 284 Query: 406 GCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ 465 G + + W+ + +S G P ++ D + F+ G R++ ++ ++ Sbjct: 285 GTEANEWMCPFDID-PTKQSASMLSSYGSLPIQPQTLHDGIFFLQR-GNRLREMT-RSQN 341 Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525 G N+++ ADH+ I QL + P +++ +L L + G Sbjct: 342 GSISNDLSFTADHILFAGIRQLATLKNPDPMIFCLLNDG-----TLAVLCYDKNY-GMQG 395 Query: 526 WHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS 560 W + L+ P ++ G ++ V Sbjct: 396 WSRWSTQGEFMCLA----PYEDEDGQKMFAHVRRG 426 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 59/375 (15%), Positives = 122/375 (32%), Gaps = 39/375 (10%) Query: 9 HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68 +++ GE+SP+L R DL ++ QG ++ + G + P ++ R Sbjct: 6 NNWMYGEISPKL-GGRLDLEMNTQGCEILKDFRNMLQGGITRRPPLKHVAQTV----RGR 60 Query: 69 VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGK---TYKTPYTFKDNKSLEYAV 125 F++ G L+ +KKL++ ++ T Y D S++YA Sbjct: 61 TIPFTLSSGESFLVELSNKKLRVWRKGVLGFYTVTFLPSGNDYLPTDYLEADVWSIQYAQ 120 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 + VHKD+ PH ++Y + F P+ + + Sbjct: 121 YYDRLYLVHKDYQPHVVVYAAEA-----------FQFSPFTAETDAGKQLGKSTGYYPSV 169 Query: 186 DT-STARITSDMKIFKPLD--KGRSIRLGCHPPEWAKNTNY-SIGAYIVADDKVYRSLTT 241 R+ I KP R G + + + ++ D + T Sbjct: 170 VGICQNRLWFSAAILKPYTTWVSRPPYDGSNNHHDFTTFDVIEVNTEVIKDPSTWPKTTN 229 Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 + + +S + +V+ + + ++ +V G + Sbjct: 230 EQGDEMIDFSDSSKFVETVKEIEEVINAKCAMEIELASGRNDTIKWVAGMDNIFIGTEAN 289 Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH---VTFHNNRLLFSGSKGDELSVYLS 358 + P + S+ +S++G P F R G++ E++ S Sbjct: 290 EWMCPFDIDPTKQSASM----LSSYGSLPIQPQTLHDGIFFLQR----GNRLREMT--RS 339 Query: 359 SFGAFYD---FSLDG 370 G+ + F+ D Sbjct: 340 QNGSISNDLSFTADH 354 >gi|50282960|ref|YP_053016.1| hypothetical protein VP5_gp14 [Vibrio phage VP5] Length = 594 Score = 101 bits (252), Expect = 3e-19, Method: Composition-based stats. Identities = 64/346 (18%), Positives = 116/346 (33%), Gaps = 42/346 (12%) Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 G + A +V D V+ + R + Y GD + GR I V P Sbjct: 80 EVGNANIAVWVNDV----RQVVAATPSEWRNTLDRIQTAYDTIGDDLGAANTGRLIMVHP 135 Query: 307 QSQTLFQAGVSVVSWFM---------SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357 Q + +W + W YP V NR+ + GS + Sbjct: 136 ALQPKRLYRDNNNAWKFVNMHTGAVPAEWSSSN-YPQTVGIFQNRVWYVGSPVHRTYFWA 194 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 + G D + DP + W+ + + +G + + L+ S Sbjct: 195 TRAGKLEDIAPSTANNPNDPISFVGIMEGTPC-----WIIASSDVLTIGTTINDYQLAAS 249 Query: 418 LS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEIT 473 + RR S G A + + ++F ++ ++ E + +E++ Sbjct: 250 TGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCSRNKSKVYAMNYVREQDNWIPDEMS 309 Query: 474 QLADHLFN-------QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 A HLF + ++ Y + +WVVLE ++ C F + AW Sbjct: 310 SQAQHLFTPISSARGASVRRVAYISDAAKSLWVVLE-----NGKINYCCFDRTTDTK-AW 363 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS---AGEERSFTV 569 +S + AA+F D ++ V S G ++++TV Sbjct: 364 TQLELSGGKVIDIAAAFNPD---SDYAYVAVVRSKVVNGAQKNYTV 406 Score = 56.4 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 50/329 (15%), Positives = 93/329 (28%), Gaps = 30/329 (9%) Query: 4 TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63 +++ SF G ++PRL + + + + + + N + G L++ +E C+ Sbjct: 2 ADFSQTSFKGGVIAPRLQFNEYESA-YHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDG 60 Query: 64 PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVR-----SSTKWSPALFGKTYKTPYTFKDN 118 ++ G+ + + V ++T +T Y Sbjct: 61 EVRLFRLPAIDAPSNDIIVEVGNANIAVWVNDVRQVVAATPSEWRNTLDRIQTAY-DTIG 119 Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS-- 176 L A G + VH P L + + F +P W V Sbjct: 120 DDLGAANTG-RLIMVHPALQPKRLYR-DNNNAWKFVNMHTGAVPAEWSSSNYPQTVGIFQ 177 Query: 177 -------NAKLSISQADTSTARI--TSDMKIFKPLDKGRSIRLGCHPPEWAKNTN----- 222 + T ++ + P D + + P W ++ Sbjct: 178 NRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGTPCWIIASSDVLTI 237 Query: 223 -YSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASG 281 +I Y +A S+T + R +G V V+ S S+ A Sbjct: 238 GTTINDYQLAAS-TGVSVTAATAILRRSSVQGTAAV-QGIPAEEQVIFCSRNKSKVYAMN 295 Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQT 310 V W I D P S Sbjct: 296 YVREQDNW--IPDEMSSQAQHLFTPISSA 322 >gi|291334273|gb|ADD93936.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 229 Score = 101 bits (250), Expect = 5e-19, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 66/206 (32%), Gaps = 14/206 (6%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 K +F +GEL P L+ R D + +A G K +N+ G + Y P + Sbjct: 11 LKTTFQSGELDP-LMNLRSDTTAYANGAKKMQNVSLFSQGGFKRRNGTKRYASL---PGN 66 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT-PYTFKDNKSLEYAV 125 R+ F D + F + ++ I + + +T + P+T +++ Sbjct: 67 ARLVGFDFDDNEQYICAFSNNRVDIYYLSND------SLTQTITSCPWTTSILFEMQFTQ 120 Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 G T + H + +F+ F + + Sbjct: 121 AGDTMIITHPSMATQVITRTS---LTAFSRSNYTFDSDSENVYQPYYKFAGSGVTLSASG 177 Query: 186 DTSTARITSDMKIFKPLDKGRSIRLG 211 T + ITS F +++ Sbjct: 178 TTGSVTITSSADHFSSDYVNVYLKIE 203 >gi|291334458|gb|ADD94112.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] gi|291334665|gb|ADD94312.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291336445|gb|ADD96000.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 121 Score = 96.5 bits (238), Expect = 1e-17, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 7/111 (6%) Query: 81 LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140 +L FG++ ++ S + + +PY + ++YA H +HP Sbjct: 1 MLEFGNQYIRFYKDNGQILSSGSAY--EISSPYLEAELFDIKYAQSADVMYLCHPNHPVK 58 Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191 L S+T + F P++ I A + + T T Sbjct: 59 KLARTGH---TSWTLTSVDFQNGPFMDHN-IETTTITASHT-NAGQTGTLT 104 >gi|291334515|gb|ADD94168.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] Length = 99 Score = 93.8 bits (231), Expect = 6e-17, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 36/105 (34%), Gaps = 6/105 (5%) Query: 81 LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140 +L FG++ ++ S + + +PY + ++YA H +HP Sbjct: 1 MLEFGNQYIRFYKDNGQILSSGSAY--EISSPYLEAELFDIKYAQSADVMYLCHPNHPVK 58 Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185 L S+T + F P++ I A + + Sbjct: 59 KLARTGH---TSWTLTSVDFQNGPFMDHN-IETTTITASHTYCRW 99 >gi|13186158|emb|CAC33469.1| hypothetical protein [Legionella pneumophila] Length = 818 Score = 93.8 bits (231), Expect = 8e-17, Method: Composition-based stats. Identities = 84/541 (15%), Positives = 162/541 (29%), Gaps = 91/541 (16%) Query: 7 TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66 ++F+ GEL P L +R DL ++ +G K RN+I L G P Sbjct: 6 ISNTFNRGELDPTLF-ARDDLDIYDKGARKLRNMIALWTGAARIAPGTIYVD-------- 56 Query: 67 NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126 + + + L K + Y + Y + Sbjct: 57 ----------------------MMVDRENGNAVIQDPLMVKGFDFTYDAD--AEITYTII 92 Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186 + G I+F + D + + V S A L+ D Sbjct: 93 I-----------------RKSGTNIAFDI---------YYADALQTTVTSTAYLATQIQD 126 Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD 246 A + I + R ++ G W+ T + Y G + + Sbjct: 127 IHVAAAHDRVLILHENVQIRQLKRGASHSSWSLTT------FEPRVYPTYDFSVIGEATN 180 Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306 + T+ IT+ + S+ + G I V+ + + Sbjct: 181 YQSF----TFTLSATTGSITITSSSAVFTHNHVGGLFRSLGGTARITAVASTTSASATVL 236 Query: 307 QSQTLFQAGVSVVSWFMSAWGEQE---------GYPSHVTFHNNRLLFSGSKGDELSVYL 357 + T ++ S W G+P+ F+ NRL+ S + V L Sbjct: 237 DNFTGTSCAGNLSSLAEKLWNSDTTTAPVSANRGWPARGVFYLNRLILGRSLAVKNLVNL 296 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417 S+ G + +F + D A + ++ + + +L L+ S Sbjct: 297 STAGVYDNF----DDADLDGLVAFSVTFNGKGEQSVQSIVA-DDSILFTTANKLFAQSPL 351 Query: 418 LSKGLSID---FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQG-FRFNEIT 473 + ++I+ F S S + S+ + +FV ++ ST G + T Sbjct: 352 VESPITINNVYFAPQSQSPATSIEAASIDNQTLFVSSDRTKVMQAMYSTADGKYITLPAT 411 Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533 L++ + + + EP I + ++ LL + + W + Sbjct: 412 MLSNSIVDYINSNGTW--EPAGISTRLYLATQDNGTMLLYSTL--QTQNVAGWSLRTTTG 467 Query: 534 K 534 K Sbjct: 468 K 468 >gi|158425207|ref|YP_001526499.1| tail tubular protein B [Azorhizobium caulinodans ORS 571] gi|158332096|dbj|BAF89581.1| tail tubular protein B [Azorhizobium caulinodans ORS 571] Length = 785 Score = 78.4 bits (191), Expect = 3e-12, Method: Composition-based stats. Identities = 81/565 (14%), Positives = 162/565 (28%), Gaps = 75/565 (13%) Query: 47 PLVSMPLMQEYRD-CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF 105 L P + P + V + ++V + L++ + Sbjct: 41 GLTKRPPTRHVAKLINSLPENAHVHIINRDAAERYVVVAFNGDLRVYGFDGVERTVNFPH 100 Query: 106 GKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPW 165 GK Y + S F++KD ++ P Sbjct: 101 GK----GYLANTSASFGAVTVADYTFFLNKD--------------VTVAMSPETKAGRPP 142 Query: 166 LGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIR-LGCHPPEWAKNTNYS 224 G + K I + A + + + + L W Sbjct: 143 EGIVFVRQGNYACKYRIIVDGQAVAEKITSQTDPNDIQSSKIAQDLAAIINSWGSMVASV 202 Query: 225 IGA---YIVADDKVYRSLTT---GRSGDRFGYSKGATY------------VKDNNITWIT 266 IG+ AD + T G +G + T+ V+ + Sbjct: 203 IGSTIHIRRADSLGFSLTTEDSLGDTGLVCMTKQTQTFANLPARAVQGYQVEISGTPGNP 262 Query: 267 VLNLSSKTSRESASGAVAPYY-VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325 N + + + G + + + ++ D ++ + W A Sbjct: 263 YDNFWVEYDQAGSGGNNGVWREIAAPGRQIAFDPATMPHVLVREANGSFTFKQADWEKCA 322 Query: 326 WGEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376 G E P S + F+ NRL F SV S F++F + D Sbjct: 323 AGSDETTPRPSFVGQRISDIFFYRNRLGFISD----ESVIFSRSAKFFNFWRETATDLLD 378 Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKGLSIDFRRVSGS-GV 434 + + S + PF E +L+ D + ++L + + + +V+ Sbjct: 379 TDP-IDITTSHVKVSILRHAIPFNESLLLFSDQTQFMLGAGEVLTPSGVSLDQVTEFETS 437 Query: 435 YACPPVSVGDCLVFVCGVGR--RIKYISGSTEQGFRFNEITQLADHLFNQRILQLVY--Q 490 PV G + F G ++ + + N + +H+ I V+ Sbjct: 438 SRAKPVGAGQFVYFCTSRGEFTGVRE--YYIDGSTKTNNANDVTNHVPRY-IRGKVFKLC 494 Query: 491 EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF--AWHTHMISDKHYVLSAASFPNDNR 548 + + V L D L ++ G+ +W + +L+A Sbjct: 495 ASTNEDMLVALSDTDRD--TLYVYKYYNSGQEKVQSSWSRWKLQPGDVILNAEFI----- 547 Query: 549 GGTSLWMLVALSAGEERSFTVRLNL 573 ++LW++V + G + RLN+ Sbjct: 548 -ESTLWLIVRRADGV---YLDRLNI 568 >gi|46581000|ref|YP_011808.1| hypothetical protein DVU2596 [Desulfovibrio vulgaris str. Hildenborough] gi|46450421|gb|AAS97068.1| hypothetical protein DVU_2596 [Desulfovibrio vulgaris str. Hildenborough] Length = 259 Score = 75.7 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 27/77 (35%), Gaps = 11/77 (14%) Query: 497 VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWML 556 +W V E L+ E E WH H+ VLS + P G LW+ Sbjct: 1 MWCVTEDGG-----LIAMTRIPEHE-VAGWHRHVTDGA--VLSVCTIPG--TAGDELWVA 50 Query: 557 VAL-SAGEERSFTVRLN 572 V G R RL+ Sbjct: 51 VRREGGGMVRCCIERLD 67 >gi|297171931|gb|ADI22918.1| hypothetical protein [uncultured Rhizobium sp. HF0500_35F13] Length = 336 Score = 74.1 bits (180), Expect = 6e-11, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 39/95 (41%), Gaps = 13/95 (13%) Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK-----HYVLSAA 541 + YQEEP SI++ V E + L+ + + + AWH H+ S A Sbjct: 1 MAYQEEPLSIIYAVREDGE-----LVALTYQRD-QQVVAWHRHIFGGAFGTGNAVCESIA 54 Query: 542 SFPNDNRGGTSLWMLVALS-AGEERSFTVRLNLLD 575 P D +++++ + G + + LN D Sbjct: 55 VIPTDLD-EYEVYVIIKRTINGATKRYVEVLNTFD 88 >gi|317487276|ref|ZP_07946071.1| hypothetical protein HMPREF0179_03434 [Bilophila wadsworthia 3_1_6] gi|316921466|gb|EFV42757.1| hypothetical protein HMPREF0179_03434 [Bilophila wadsworthia 3_1_6] Length = 794 Score = 73.7 bits (179), Expect = 7e-11, Method: Composition-based stats. Identities = 78/566 (13%), Positives = 165/566 (29%), Gaps = 59/566 (10%) Query: 48 LVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGY--ALLVFGDKKLQIVVVRSSTKWSPALF 105 L P + R P +N + S I ++ + + + + K Sbjct: 43 LKRRPATRHLARIRDTPAANGIASHHINRDETEQYIVTADASGINVFDLEGNAKTVSVTG 102 Query: 106 GKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPW 165 N+ L + +++ L +S + Sbjct: 103 TGAAYLAAATAPNRDLRFLTINDYTFVLNRRVAVKTL------PDLSPKRQPEAIVFIKQ 156 Query: 166 LGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI 225 + N + A LD ++I ++ T+ S Sbjct: 157 ASYNTTYELILNGTTHAFTTEDGIAPADEPADKLSSLDICKAIADQIPKDAFSVQTSNST 216 Query: 226 GAYIVADD------------KVYRSLTTGRSGDRFGYSKG-------ATYVKDNNITWIT 266 D + S+ G+ RF + D + ++ Sbjct: 217 IWIRRHDGGDFTVKVQDSRSNTHTSVCKGKV-QRFSDLPTVAPRGFVTEIIGDASSSFDN 275 Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 + + A G+ D ++ A Q + W Sbjct: 276 YFCVFEPSDAGDAFGSGTWKETVKPGIPCKLDPATLPHALIRQADGTFTFGPLEWGERIC 335 Query: 327 GEQEG--YPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377 G+++ +PS V F+ NRL F + V +S G F++F L D Sbjct: 336 GDEDSAPFPSFVGRTLNGLFFYRNRLSFLSGEN----VVMSEVGEFFNFFLTTVTTLVDS 391 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGS-GVY 435 + A + +S +H F G+L+ D S ++L ++ + V+ Sbjct: 392 D-VVDVAASHTKSSILHHAVTFSGGLLLFSDQSQFVLEHDTVLSNATVSIKPVTEFEASM 450 Query: 436 ACPPVSVGDCLVFVCGVG--RRIKYISGSTEQGFRFNEITQLADHLFNQRILQLV-YQEE 492 PVS G + F G ++ + N+ + + H+ + + Sbjct: 451 KAAPVSSGKTVFFATDKGEWGGVRE-YITLPDNSDQNDASDITAHVPRYVRGNVSRLECS 509 Query: 493 PHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTS 552 + + +VL + + L ++ + AW + VLSAA T Sbjct: 510 TNEDMLLVLSEEMRTSLWLYKYFWNGSEKIQSAWSRWDMCG--EVLSAAIL------NTG 561 Query: 553 LWMLVALSAGEERSFTVRLNLLDDFK 578 +++++ G + ++++ +K Sbjct: 562 VYLIMQYGDGV---YLEKMDITPGYK 584 >gi|320158424|ref|YP_004190802.1| tail tubular protein B [Vibrio vulnificus MO6-24/O] gi|319933736|gb|ADV88599.1| tail tubular protein B [Vibrio vulnificus MO6-24/O] Length = 931 Score = 73.4 bits (178), Expect = 1e-10, Method: Composition-based stats. Identities = 58/361 (16%), Positives = 118/361 (32%), Gaps = 28/361 (7%) Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVA 284 +G +D Y S +++ Y N ++ ++ + S Sbjct: 423 VGKADSENDGYYVKWVDKTS----MWTESTAYGLANEFNPASMPHILRRHQDSSKVSVDN 478 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344 PY ++ ++ R++ + S M+ QE Y S + F RL Sbjct: 479 PYGIYFKLEQGVWSKRTVGDELSAPIPSFVSTQDESGAMT----QERYISAMAFFRGRLW 534 Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL 404 G S G ++F D A TIH P +G++ Sbjct: 535 LLGGD----YACGSVVGDKFNFFRSTALTVLDDDPIDGYTDLTGQAETIHAAIPSSDGLV 590 Query: 405 VGCDTSLWLL-SISLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGR--RIKYIS 460 V + +L+ S + + +F R++ C PV +GD + F + + Sbjct: 591 VFTERGQYLISSQGMMSPTTFEFTRIASYATDNRCDPVLIGDRISFATKTSEYTSVSEMY 650 Query: 461 GSTEQG-FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAE 519 + G + NE+T + +L+ ++ ++ + R+ F Sbjct: 651 VADTTGVRKANEVTSHCPTYIEGSVHRLLANATSNTEFLIMRGQGETLTGRMFIYDFLMN 710 Query: 520 GEGDF--AWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRLNLLD 575 G AW + V + + L++++ S ++R R++L+ Sbjct: 711 GNERVQSAWSQWTFNGAVVVDGVLT-------SSELYLVMVRATSDKDKRMTVERIDLVQ 763 Query: 576 D 576 D Sbjct: 764 D 764 >gi|26989008|ref|NP_744433.1| tail tubular protein B [Pseudomonas putida KT2440] gi|24983829|gb|AAN67897.1|AE016421_9 tail tubular protein B [Pseudomonas putida KT2440] Length = 781 Score = 73.0 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 73/579 (12%), Positives = 152/579 (26%), Gaps = 76/579 (13%) Query: 38 RNLIPLRYGPLVSMPLMQEYRDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRS 96 N I L+ P P S V + + + + L++ V Sbjct: 32 ENGISTVSEGLMKRPPTTHLARVTASPLESAFVHTINRDASERYQVAITNGGLRVFAVDG 91 Query: 97 STKWSPALFGKTYKTPYTFKDNKSLEYA--VFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154 + T Y + + ++ V+K Sbjct: 92 T----ERTVSFPDGTGYLAASDPASDFTAITVADYTFIVNKA-------------ITVAN 134 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214 + P +I G I T T D + + Sbjct: 135 RAAVSAPRGPEALISVIQGNYGRTYGVILNGVTVATYATPDGSDATKTSLASTDYIATEL 194 Query: 215 PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-----------------RFGYSKGATYV 257 ++ ++ + A +Y + T + D + + + Sbjct: 195 VAGIQSAGFT---CVRAGSCLYITSTADFTIDCYDGFNNNAMKAYKKVVQSFSTLPSNCT 251 Query: 258 KDNNITWITVLNLSSKTSRESASGAVAP--YYVW----GDIKDVSKDGRSISVAPQSQTL 311 + + + + V VW G + DG ++ Sbjct: 252 QAGGCLFEITGDPGDSSDDYYVYYDVGTDSTGVWRECVGPGVALGLDGSTMPHTLVRNAD 311 Query: 312 FQAGVSVVSWFMSAWGE--QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGA 362 +W G+ PS V F+ NRL F +V S G Sbjct: 312 GTFTFQAATWTDRVAGDADTNEDPSFVGRTINDVVFYRNRLGFLAD----EAVIFSESGK 367 Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKG 421 +++F D + + T + + F + +L+ D +L+ + Sbjct: 368 YWNFYRTTVTELLDSDP-IDVSSTYTKVAILKHAVSFNKQLLLFSDEVQFLIDNGDTLTP 426 Query: 422 LSIDFRRVSGSGVYAC-PPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480 +I + + A P SVG + F T+ N+ T +A H+ Sbjct: 427 KTISIKPSTEFVCNALTTPQSVGKNVYFASDRENWTAIREYFTDTNDVSNDSTDVASHVP 486 Query: 481 N---QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537 + ++ + VL D + + + + +W D + Sbjct: 487 QYIPSGVFKIASSSSED--MLCVLTTGDRHSIYVYKFYWDGDTKVQSSWSKWTFPDTDTI 544 Query: 538 LSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576 LSA + +++ + + G + +L + D Sbjct: 545 LSAEFL------DSEVFLAINRADG---LYFEKLTVATD 574 >gi|325272824|ref|ZP_08139161.1| tail tubular protein B [Pseudomonas sp. TJI-51] gi|324102029|gb|EGB99538.1| tail tubular protein B [Pseudomonas sp. TJI-51] Length = 781 Score = 70.7 bits (171), Expect = 7e-10, Method: Composition-based stats. Identities = 80/575 (13%), Positives = 154/575 (26%), Gaps = 68/575 (11%) Query: 38 RNLIPLRYGPLVSMPLMQEYRDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRS 96 N I L+ P P S V + + + + L++ V Sbjct: 32 ENGISTVSEGLMKRPPTTHLARVTASPLESAFVHTINRDSTERYQVAITNGGLRVFAVDG 91 Query: 97 STKWSPALFGKTYKTPYTFKDNKSLEYA--VFGSTAVFVHKD-------------HPPHH 141 S T Y + + ++ V+K P Sbjct: 92 S----ERTVSFPDGTSYLAASDPASDFTAITVADYTFIVNKAITVANRAAVSGTRGPEAL 147 Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 + IQ ++ + S A S T F Sbjct: 148 ISVIQGNYGRTYGVILNGVTVATYATP-DGSDATKTALASTDYIATELVAGIQSA-GFTC 205 Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS------GDRFGYSKGAT 255 + G + + + + A KV +S +T S G F + Sbjct: 206 VRAGSCLYITSTADFTIDCYDGFNNNAMKAYKKVVQSFSTLPSNCTQAGGCLFEITGDPG 265 Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAG 315 D+ + V S+ RE G + DG ++ Sbjct: 266 DSSDDYYVYYDVGTDSTGVWRECV----------GPGVALGLDGSTMPHTLVRNADGTFT 315 Query: 316 VSVVSWFMSAWGE--QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366 +W G+ PS V F+ NRL F +V S G +++F Sbjct: 316 FQAATWTDRVAGDADTNEDPSFVGRTINDVVFYRNRLGFLAD----EAVIFSESGKYWNF 371 Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKGLSID 425 D + + T + + F + +L+ D +L+ + +I Sbjct: 372 YRTTVTELLDSDP-IDVSSTYTKVAILKHAVSFNKQLLLFSDEVQFLIDNGDTLTPKTIS 430 Query: 426 FRRVSGSGVYAC-PPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFN--- 481 + + A P SVG + F T+ N+ T +A H+ Sbjct: 431 IKPSTEFVCNALTTPQSVGKNVYFASDRENWTAIREYFTDTNDVSNDSTDVASHVPQYIP 490 Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 + ++ + VL D + + + + +W D Sbjct: 491 SGVFKIASSSSED--MLCVLTTGDRHSIYVYKFYWDGDTKVQSSWSKWTFPDTD------ 542 Query: 542 SFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576 + N + +++ + + G + +L + D Sbjct: 543 TILNAEFLDSEVFLAINRADG---LYFEKLTVATD 574 >gi|9627472|ref|NP_042000.1| tail tubular protein B [Enterobacteria phage T7] gi|139659|sp|P03747|VTTB_BPT7 RecName: Full=Tail tubular protein B gi|15606|emb|CAA24430.1| unnamed protein product [Enterobacteria phage T7] gi|37956682|gb|AAP33952.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 68.0 bits (164), Expect = 5e-09, Method: Composition-based stats. Identities = 74/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 D + + Y VF +++ + + K G Y T Sbjct: 55 G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 L V+++ + + D + + G +I + Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170 Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAYI 229 D S + + + + + +R + +W N + + + Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVTAPSGQ 228 Query: 230 VADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 D + + + + S N + ++ +SK++ + A Sbjct: 229 QIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288 Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336 VW + + + + + P + G W W + +PS V Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346 Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 F NRL F + + LS +++F D + AV+ + Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449 + + PF E +L+ D + ++L+ S + S++ + V P +G + F Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461 Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500 + S + + +++ + A+ + + + + + V Sbjct: 462 SP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 L D S + + E +W + VL+ S +D Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560 >gi|265525004|gb|ACY75867.1| tail tubular protein B [Enterobacteria phage T7] Length = 794 Score = 68.0 bits (164), Expect = 5e-09, Method: Composition-based stats. Identities = 74/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 D + + Y VF +++ + + K G Y T Sbjct: 55 G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 L V+++ + + D + + G +I + Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170 Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAYI 229 D S + + + + + +R + +W N + + + Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVTAPSGQ 228 Query: 230 VADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 D + + + + S N + ++ +SK++ + A Sbjct: 229 QIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288 Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336 VW + + + + + P + G W W + +PS V Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346 Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 F NRL F + + LS +++F D + AV+ + Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449 + + PF E +L+ D + ++L+ S + S++ + V P +G + F Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461 Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500 + S + + +++ + A+ + + + + + V Sbjct: 462 SP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 L D S + + E +W + VL+ S +D Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560 >gi|37956840|gb|AAP34107.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 67.6 bits (163), Expect = 6e-09, Method: Composition-based stats. Identities = 76/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 D + + Y VF +++ + + K G Y T Sbjct: 55 G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 L V+++ + + D + + G +I + Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNSNQDGLINVRGGQYGRELIVHIN 170 Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS-IGAYIVADD 233 D S + + + + + +R + +W N I + Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVIAPSGQ 228 Query: 234 KVYRSLTTGRSGDR-------FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 ++ T D+ + S N + ++ +SK++ + A Sbjct: 229 QIDSFTTKDGYADQLINSVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288 Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336 VW + + + + + P + G W W + +PS V Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346 Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 F NRL F + + LS +++F D + AV+ + Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449 + + PF E +L+ D + ++L+ S + S++ + V P +G + F Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461 Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500 + S + + +++ + A+ + + + + + V Sbjct: 462 SS-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 L D S + + E +W + VL+ S +D Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560 >gi|37956893|gb|AAP34159.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 76/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 D + + Y VF +++ + + K G Y T Sbjct: 55 G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 L V+++ + + D + + G +I + Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNSNQDGLINVRGGQYGRELIVHIN 170 Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS-IGAYIVADD 233 D S + + + + + +R + +W N I + Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVIAPSGQ 228 Query: 234 KVYRSLTTGRSGDR-------FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 ++ T D+ + S N + ++ +SK++ + A Sbjct: 229 QIDSFTTKDGYADQLINSVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288 Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336 VW + + + + + P + G W W + +PS V Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346 Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 F NRL F + + LS +++F D + AV+ + Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449 + + PF E +L+ D + ++L+ S + S++ + V P +G + F Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461 Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500 + S + + +++ + A+ + + + + + V Sbjct: 462 SS-----RPSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 L D S + + E +W + VL+ S +D Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560 >gi|194100399|ref|YP_002003974.1| gp12 [Enterobacteria phage 13a] gi|193201446|gb|ACF15923.1| gp12 [Enterobacteria phage 13a] Length = 794 Score = 66.0 bits (159), Expect = 2e-08, Method: Composition-based stats. Identities = 73/587 (12%), Positives = 169/587 (28%), Gaps = 68/587 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTL 54 Query: 61 RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 + + Y VF +++ + + K G Y T Sbjct: 55 GY-NGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLAGNEKQVRYPNGSNYIN--TA 110 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 L V+++ + + D + + G +I + Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTNSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170 Query: 176 SN--AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAY 228 AK I +D + ++ + +W N + + + Sbjct: 171 GKDVAKYKIPDGSKPEHVNNTDAQWLAEELAN---QMRTNLSDWTVNVGQGFIHVTAPSG 227 Query: 229 IVADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285 D + + + + S N + ++ +SK++ + Sbjct: 228 QQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDDE 287 Query: 286 YYVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV- 336 VW + + + + + P + G W W + +PS V Sbjct: 288 RKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVG 345 Query: 337 ------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSA 390 F NRL F + + LS +++F D + AV+ Sbjct: 346 SSINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRI 400 Query: 391 STIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVF 448 + + + PF E +L+ D + ++L+ S + S++ + V P +G + F Sbjct: 401 AILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYF 460 Query: 449 VCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWV 499 + S + + +++ + A+ + + + + + Sbjct: 461 ASP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--S 513 Query: 500 VLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 VL D S + + E +W + VL+ S +D Sbjct: 514 VLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSINSD 560 >gi|37956735|gb|AAP34004.1| gene 12 [Enterobacteria phage T7] gi|37956785|gb|AAP34053.1| gene 12 [Enterobacteria phage T7] Length = 794 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 73/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + ++ N L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54 Query: 61 RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115 D + + Y VF +++ + + K G Y T Sbjct: 55 G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110 Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175 L V+++ + + D + + G +I + Sbjct: 111 NPRNDLRMVTVADYTFIVNRNIVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170 Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAYI 229 D S + + + + + +R + +W N + + + Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVTAPSGQ 228 Query: 230 VADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 D + + + + S N + ++ +SK++ + A Sbjct: 229 QIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288 Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336 VW + + + + + P + G W W + +PS V Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346 Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391 F NRL F + + LS +++F D + AV+ + Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401 Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449 + + PF E +L+ D + ++L+ S + S++ + V P +G + F Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461 Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500 + S + + +++ + A+ + + + + + V Sbjct: 462 SP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514 Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 L + S + + E +W + VL+ S +D Sbjct: 515 LSHGNPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560 >gi|254505331|ref|ZP_05117479.1| hypothetical protein SADFL11_PLAS29 [Labrenzia alexandrii DFL-11] gi|222436175|gb|EEE42857.1| hypothetical protein SADFL11_PLAS29 [Labrenzia alexandrii DFL-11] Length = 683 Score = 64.5 bits (155), Expect = 5e-08, Method: Composition-based stats. Identities = 55/326 (16%), Positives = 105/326 (32%), Gaps = 32/326 (9%) Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + GD + S GA+ + D + +++ + ++ + I Sbjct: 162 SATDGDVYRISNGASPLDDYYVKYVSADTEWVECAKPGEVIGFDAKTMPHQIVREEDGSF 221 Query: 301 SISVAPQSQTLFQAGVSVV--SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 S+S S SV S+ A+ + + F NRL F + + S Sbjct: 222 SVSRVEWSDRQVGDAESVKDPSFVGRAFKD-------IFFFKNRLGFVSDENT----FFS 270 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW-LLSIS 417 F++ D D + A + + + W+ PF + + D + + L S Sbjct: 271 QAADFFNLWPDQANVVGDSDP-VDIAASTTKVTILQWVVPFRRALFLSADLAQFELASSD 329 Query: 418 LSKGLSIDFRRVSGS-GVYACPPVSVGDCLVF-VCGVGRRIKYISGSTEQGFR--FNEIT 473 S+ + C P ++GD L F G+ + Y + ++T Sbjct: 330 FMTPTSVAVDLATSYEATNLCRPTTLGDELYFAAEKQGKTVIYEYFYDDDTLSNTAIDVT 389 Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF--AWHTHMI 531 + A+ I VY E +I +L D + R G+ AW Sbjct: 390 KHAE----GYIPGRVYLMEGSAIANTLLCVADGDSASMYTYRVFWNGQEKIQSAWSRWTF 445 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLV 557 + Y+ + ++LV Sbjct: 446 DNS-YIDGVKVI------NDTAYVLV 464 >gi|254503713|ref|ZP_05115864.1| hypothetical protein SADFL11_3752 [Labrenzia alexandrii DFL-11] gi|222439784|gb|EEE46463.1| hypothetical protein SADFL11_3752 [Labrenzia alexandrii DFL-11] Length = 634 Score = 64.1 bits (154), Expect = 7e-08, Method: Composition-based stats. Identities = 55/326 (16%), Positives = 105/326 (32%), Gaps = 32/326 (9%) Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300 + GD + S GA+ + D + +++ + ++ + I Sbjct: 113 SATDGDVYRISNGASPLDDYYVKYVSADTEWVECAKPGEVIGFDAKTMPHQIVREEDGSF 172 Query: 301 SISVAPQSQTLFQAGVSVV--SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358 S+S S SV S+ A+ + + F NRL F + + S Sbjct: 173 SVSRVEWSDRQVGDAESVKDPSFVGRAFKD-------IFFFKNRLGFVSDENT----FFS 221 Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW-LLSIS 417 F++ D D + A + + + W+ PF + + D + + L S Sbjct: 222 QAADFFNLWPDQANVVGDSDP-VDIAASTTKVTILQWVVPFRRALFLSADLAQFELASSD 280 Query: 418 LSKGLSIDFRRVSGS-GVYACPPVSVGDCLVF-VCGVGRRIKYISGSTEQGFR--FNEIT 473 S+ + C P ++GD L F G+ + Y + ++T Sbjct: 281 FMTPTSVAVDLATSYEATNLCRPTTLGDELYFAAEKQGKTVIYEYFYDDDTLSNTAIDVT 340 Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF--AWHTHMI 531 + A+ I VY E +I +L D + R G+ AW Sbjct: 341 KHAE----GYIPGRVYLMEGSAIANTLLCVADGDSASMYTYRVFWNGQEKIQSAWSRWTF 396 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLV 557 + Y+ + ++LV Sbjct: 397 DNS-YIDGVKVI------NDTAYVLV 415 >gi|30387490|ref|NP_848299.1| tail protein [Yersinia pestis phage phiA1122] gi|30314127|gb|AAP20535.1| tail protein [Yersinia pestis phage phiA1122] Length = 794 Score = 63.7 bits (153), Expect = 9e-08, Method: Composition-based stats. Identities = 47/307 (15%), Positives = 101/307 (32%), Gaps = 40/307 (13%) Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVSWFMS 324 ++ +SK++ + A VW + + + + + P + G W Sbjct: 268 KIVGDASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHALVRAADGNFDFKWL-- 325 Query: 325 AWG-------EQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG 370 W + +PS V F NRL F + + LS +++F Sbjct: 326 EWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPAS 381 Query: 371 EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRV 429 + + AV+ + + + PF E +L+ D + ++L+ S + S++ Sbjct: 382 IANLSNDDP-IDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSVELNLT 440 Query: 430 SGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL------ADHLFN- 481 + V P +G + F + S + + +++ + H+ N Sbjct: 441 TQFDVQDRARPYGIGRNVYFASP-----RSSYTSIHRYYAVQDVSSVKNSEDITSHVPNY 495 Query: 482 --QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLS 539 + + + VL D S + + E +W + VL+ Sbjct: 496 IPNGVFSICGSGTENFC--SVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLA 553 Query: 540 AASFPND 546 S +D Sbjct: 554 CQSISSD 560 >gi|323512066|gb|ADX87527.1| tail tubular protein B [Vibrio phage ICP3_2009_B] Length = 794 Score = 61.4 bits (147), Expect = 4e-07, Method: Composition-based stats. Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59 M + + + G + + D+ ++ QG + G L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53 Query: 60 CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115 + + + + F +++ + K A G +Y T + Sbjct: 54 LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111 Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167 K L ++++ P L + + + Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171 Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227 + + ++ + A D +++G ++ G ++K+ I + Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVLINS 231 Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287 V D L G D ++ Y +N I I V + + A Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288 Query: 288 VWGDIKDVSKDGRSIS-VAPQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336 VW + + P G W A G+ YPS + Sbjct: 289 VWTECPAPNIKADYNKDTMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348 Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394 F NRL F + V LS G +++F + D + AV+ S + Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403 Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452 + PF E +++ D + ++LS ++ + V P +G + FV Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462 Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503 + S + + ++TQ+ + H+ + ++ + + +L Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516 Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + E +W VL Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553 >gi|167565012|ref|ZP_02357928.1| tail tubular protein B [Burkholderia oklahomensis EO147] Length = 776 Score = 61.0 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 81/582 (13%), Positives = 162/582 (27%), Gaps = 75/582 (12%) Query: 39 NLIPLRY-GPLVSMPLMQEYRDCRLDPRSNR-VFSFSIPDGGYALLV----FGDKKLQIV 92 N +P G L + P + + F DG + + G +++ + Sbjct: 36 NFLPSVDIGGLADRVGTTCIANLAAAPYKSEGTYMFRTTDGQRWMFIRRADAGYPEIRNM 95 Query: 93 VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS 152 V + + F + Y L++ T + ++ D + K Sbjct: 96 VNGALAAVTCGPFVQNY-----INSASRLKFLSMSDTTLVLNPDVATRFVAPSAGITKTR 150 Query: 153 FTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST------------------ARITS 194 + I+ L + + S S A + A T I+ Sbjct: 151 -AYAVIRKLSSNYQTFYLNSDAGSAATVYDGSAGVKTREWVAQRLMEQCIAHMPGLTISR 209 Query: 195 DMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGA 254 + + I +W + I + A + + G S + Sbjct: 210 VANVVRISGPEAIINTLNGGNDWDETAFVLIKGRVSAASDLPAQMFPGESVMVDLENGAT 269 Query: 255 TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQA 314 +T+ N +T+ + + + G + + + Sbjct: 270 KSAYW--VTYDRTTNSYKETAWLDNFANAGNWDASTMPVRIHQTGVNSFEIQPVDWVPRK 327 Query: 315 GVSVVSWFMSAWGEQEGYPSH-VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373 S + + G P + RL FS + V S ++F D Sbjct: 328 VGDNDSNAPAPFN---GAPITDMALWKGRLWFSSASW----VVGSQPDDLFNFWQDSARE 380 Query: 374 CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGS 432 A D ++ + F + ++V + L S K + + Sbjct: 381 VVASDPVKVQAEAD--LGSVSHLAGFRDNLMVFLRGAQCSLDGSQPVKPDTAALGVATRY 438 Query: 433 GV-YACPPVSVGDCLVFV--CGVGRRIKYISGSTEQGFRFNEITQLADHLFN------QR 483 V ACPP VG+ +++ + EQ N L+ H+ +R Sbjct: 439 DVDAACPPSVVGNVMLYTGSQEGRSVLWE--YQFEQATENNYAEDLSKHIPRYCPGSVRR 496 Query: 484 ILQLVYQEEPHSIVWVVLEPK------------DNSFPRLLGCRFSAEGEGDFAWHTHMI 531 I+ + + +W L+ + F + WH + Sbjct: 497 IVGSA--QSGRTFLWSSLDAATLYVHSSYWQAQQRAQNAWNKLTF---AQMSNIWHHWVD 551 Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573 YVL S + + + V + GE R +RL++ Sbjct: 552 EGNLYVLGQTSV----GYLSLVAVPVDANLGEHREIDLRLDM 589 >gi|68299742|ref|YP_249591.1| Tail tubular protein B [Vibriophage VP4] gi|66473281|gb|AAY46290.1| tail tubular protein B [Vibriophage VP4] Length = 794 Score = 61.0 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 45/278 (16%), Positives = 85/278 (30%), Gaps = 36/278 (12%) Query: 287 YVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV---- 336 VW + + P G W A G+ YPS + Sbjct: 288 NVWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDETNPYPSFIGNSI 347 Query: 337 ---TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393 F NRL F + V LS G +++F + D + AV+ S + Sbjct: 348 NDIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISIL 402 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCG 451 + PF E +++ D + ++LS +I + V P +G + FV Sbjct: 403 KYAVPFSEELILWSDQAQFVLSSDGGLTPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSP 462 Query: 452 VGRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLE 502 + S + + ++TQ+ + H+ + ++ + + +L Sbjct: 463 -----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPYYVENGVFKMSGSSTENFLT--ILT 515 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + E +W VL Sbjct: 516 EGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553 >gi|326633075|ref|YP_004306686.1| predicted tail tubular protein B [Salmonella phage Vi06] gi|301170548|emb|CBV65236.1| predicted tail tubular protein B [Salmonella phage Vi06] Length = 795 Score = 61.0 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 42/318 (13%), Positives = 100/318 (31%), Gaps = 44/318 (13%) Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325 ++ +SK++ + VW + + + + + + A + Sbjct: 269 KIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGN-FELKRIE 327 Query: 326 WG-------EQEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGE 371 W + +PS V F NRL + + LS +++F Sbjct: 328 WSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGEN----IILSRTAKYFNFYPASI 383 Query: 372 YGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVS 430 D + AV+ + + + PF E +L+ D + ++L+ S + SI+ + Sbjct: 384 ATLSDDDP-IDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTT 442 Query: 431 GSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ----- 482 V P +G + F + S + + +++ + A+ + Sbjct: 443 QFDVQDRARPFGIGRNVYFASP-----RSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYI 497 Query: 483 --RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + + VL D S + + E +W VL+ Sbjct: 498 PNGVFDICGSSTENFCA--VLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLAC 555 Query: 541 ASFPNDNRGGTSLWMLVA 558 + +++++ Sbjct: 556 QCI------NSDMYVILR 567 >gi|325171313|ref|YP_004251284.1| tail tubular protein B [Vibrio phage ICP3] gi|323512019|gb|ADX87481.1| tail tubular protein B [Vibrio phage ICP3] Length = 794 Score = 61.0 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 44/278 (15%), Positives = 85/278 (30%), Gaps = 36/278 (12%) Query: 287 YVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV---- 336 VW + + P G W A G+ YPS + Sbjct: 288 NVWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSI 347 Query: 337 ---TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393 F NRL F + V LS G +++F + D + AV+ S + Sbjct: 348 NDIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISIL 402 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCG 451 + PF E +++ D + ++LS ++ + V P +G + FV Sbjct: 403 KYAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP 462 Query: 452 VGRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLE 502 + S + + ++TQ+ + H+ + ++ + + +L Sbjct: 463 -----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILT 515 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + E +W VL Sbjct: 516 EGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553 >gi|323512212|gb|ADX87670.1| tail tubular protein B [Vibrio phage ICP3_2007_A] Length = 794 Score = 61.0 bits (146), Expect = 6e-07, Method: Composition-based stats. Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59 M + + + G + + D+ ++ QG + G L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53 Query: 60 CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115 + + + + F +++ + K A G +Y T + Sbjct: 54 LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111 Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167 K L ++++ P L + + + Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171 Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227 + + ++ + A D +++G ++ G ++K+ I + Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINS 231 Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287 V D L G D ++ Y +N I I V + + A Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288 Query: 288 VWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336 VW + + P G W A G+ YPS + Sbjct: 289 VWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348 Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394 F NRL F + V LS G +++F + D + AV+ S + Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403 Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452 + PF E +++ D + ++LS ++ + V P +G + FV Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462 Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503 + S + + ++TQ+ + H+ + ++ + + +L Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516 Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + E +W VL Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553 >gi|323512164|gb|ADX87623.1| tail tubular protein B [Vibrio phage ICP3_2008_A] Length = 795 Score = 61.0 bits (146), Expect = 6e-07, Method: Composition-based stats. Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59 M + + + G + + D+ ++ QG + G L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53 Query: 60 CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115 + + + + F +++ + K A G +Y T + Sbjct: 54 LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111 Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167 K L ++++ P L + + + Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171 Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227 + + ++ + A D +++G ++ G ++K+ I + Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINS 231 Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287 V D L G D ++ Y +N I I V + + A Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288 Query: 288 VWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336 VW + + P G W A G+ YPS + Sbjct: 289 VWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348 Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394 F NRL F + V LS G +++F + D + AV+ S + Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403 Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452 + PF E +++ D + ++LS ++ + V P +G + FV Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462 Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503 + S + + ++TQ+ + H+ + ++ + + +L Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516 Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + E +W VL Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553 >gi|323512115|gb|ADX87575.1| tail tubular protein B [Vibrio phage ICP3_2009_A] Length = 794 Score = 61.0 bits (146), Expect = 6e-07, Method: Composition-based stats. Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59 M + + + G + + D+ ++ QG + G L P + Sbjct: 1 MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53 Query: 60 CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115 + + + + F +++ + K A G +Y T + Sbjct: 54 LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111 Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167 K L ++++ P L + + + Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171 Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227 + + ++ + A D +++G ++ G ++K+ I + Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINS 231 Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287 V D L G D ++ Y +N I I V + + A Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288 Query: 288 VWGDIKDVSKDGRSIS-VAPQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336 VW + + P G W A G+ YPS + Sbjct: 289 VWTECPAPNIKADYNKDTMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348 Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394 F NRL F + V LS G +++F + D + AV+ S + Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403 Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452 + PF E +++ D + ++LS ++ + V P +G + FV Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462 Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503 + S + + ++TQ+ + H+ + ++ + + +L Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516 Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540 + + E +W VL Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553 >gi|291334275|gb|ADD93938.1| hypothetical protein BTH_I0919 [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 323 Score = 59.9 bits (143), Expect = 1e-06, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 43/148 (29%), Gaps = 16/148 (10%) Query: 417 SLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEITQL 475 + + RR + G ++ +FV GR ++ + E + I+ L Sbjct: 11 NSLTPSNFTARRQTTHGCSHVNVKTLEGGALFVQKHGRAVRELLFTDLELSYSATNISLL 70 Query: 476 ADHLFNQRILQLVYQ---EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532 A HL + + Q E P S + G + E W + Sbjct: 71 ASHLVQTPVDMTILQGTAERPESYAIFINSDGT------AGVFHAVRAEKLAGWTEWKTT 124 Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALS 560 S + G+ L+ V Sbjct: 125 TGATFKSIEAV------GSRLFFTVYRD 146 >gi|281416199|ref|YP_003347934.1| tail tubular protein B [Vibrio phage N4] gi|237701506|gb|ACR16499.1| tail tubular protein B [Vibrio phage N4] Length = 794 Score = 58.7 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 44/276 (15%), Positives = 85/276 (30%), Gaps = 36/276 (13%) Query: 287 YVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV---- 336 VW + + P G W A G+ YPS + Sbjct: 288 NVWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSI 347 Query: 337 ---TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393 F NRL F + V LS G +++F + D + AV+ S + Sbjct: 348 NDIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISIL 402 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCG 451 + PF E +++ D + ++LS ++ + V P +G + FV Sbjct: 403 KYAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQARPFGIGRGVYFVSP 462 Query: 452 VGRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLE 502 + S + + ++TQ+ + H+ + ++ + + +L Sbjct: 463 -----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILT 515 Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538 + + E +W VL Sbjct: 516 EGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVL 551 >gi|313892508|ref|ZP_07826097.1| tail tubular protein B family protein [Dialister microaerophilus UPII 345-E] gi|313119087|gb|EFR42290.1| tail tubular protein B family protein [Dialister microaerophilus UPII 345-E] Length = 807 Score = 57.9 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 63/536 (11%), Positives = 152/536 (28%), Gaps = 84/536 (15%) Query: 26 DLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI-----PDGGYA 80 D+ + + + N L P +D + + + +++ + Sbjct: 19 DILRFPEQLEEQTNGFSTESSGLQKRPPTLFIKDLGVHTTTTQAKNYACHTVDRDEEEKY 78 Query: 81 LLVFGDKKLQIVVVRSS---TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH 137 +++F + + + ++ + + T + L+ V+ + Sbjct: 79 IMLFTGEDILVYDLKGKQYKVTYEDEKSKQYITT---ENPREELKMVTIADHTFVVNTEV 135 Query: 138 PPHHLLYIQDGDKISFT-------------------------FDEIKFLPPPWLGDGMIS 172 +D ++ + D + Sbjct: 136 VVK---MSEDKVPWKWSDHEALIHIQKGNYGREYSIKINGKKVAKYTTPDGGEASDIKYT 192 Query: 173 GVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVAD 232 + + T T D K + + + Y I ++ V+D Sbjct: 193 DTNYIRDILGNAIQTEEVLYT-DGKYHNQSSGWQVTYYNSAFKIYHPD--YYINSFEVSD 249 Query: 233 DKVYRSLTTGRSGDR-FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 ++ + + F + T + + + T + + VW + Sbjct: 250 GFNGEAMHAIKHAVQKFNHLPADAPD---GYTVKVIGDKHTGTDDYYVTFDGKEH-VWKE 305 Query: 292 IKDVSK----DGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGY--PSHV-------TF 338 + D ++ Q+ + +W G++E PS V Sbjct: 306 CAKPNISKGFDAETMPHILVRQSDGTFKLKKANWDERKAGDEESNEPPSFVDNTINDIFL 365 Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398 NRL F + + LS +F++F L D T + AV++ S S + Sbjct: 366 FRNRLGFLSGEN----IILSRSASFFNFWLASAVELQD-TDTIDLAVSNNSVSILEHAVL 420 Query: 399 FGEGVLVGCDTSLWLLS--ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVF-VCGV-GR 454 F E +L+ + + ++++ L+ + + S P+ G + F V Sbjct: 421 FNEELLLFSNNAQFIMTSEGILTPQKASVYFATSFPSATEVVPIKAGRRVYFPVKRALYS 480 Query: 455 RIKYISGSTEQGFRFNEITQLADHL--------------FNQRILQLVYQEEPHSI 496 I+ + E + + H+ N+ I+ + P S+ Sbjct: 481 GIRE-YYTLEDTRGSKDAQDITAHVPSLIPNGIHKLWECTNESIILVASNATPDSL 535 >gi|281416310|ref|YP_003347550.1| tail tubular protein B [Klebsiella phage KP32] gi|262410429|gb|ACY66694.1| tail tubular protein B [Klebsiella phage KP32] Length = 791 Score = 57.9 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 48/314 (15%), Positives = 94/314 (29%), Gaps = 36/314 (11%) Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 ++ +SKT+ + VW G + D ++ + W Sbjct: 266 KIVGDTSKTADQYYVKYDKSQKVWKETVGWNISIGLDYTTMPWTLVRAADGNFDLGYHDW 325 Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 G+++ P V F NRL F + + +S +++F Sbjct: 326 KDRRAGDEDTNPQPSFVNSTITDVFFFRNRLGFISGEN----IVMSRTSKYFEFYPPSVA 381 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431 Y L AV+ S + + F E +L+ D + ++LS + + + Sbjct: 382 -NYTDDDPLDVAVSHNRVSVLKYAVSFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQ 440 Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFR----FNEITQLADHLFNQRILQ 486 P +G + + + Q ++T H+ N I Sbjct: 441 FDVSDRARPYGIGRNIYYASPRSSFTSIMRYYAVQDVSSVKNAEDMT---AHVPNY-IPN 496 Query: 487 LVYQEEPHSI--VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFP 544 VY VL S + + E +W D V++A Sbjct: 497 GVYSINGSGTENFACVLTKGAPSKVFIYKFLYMDENIRQQSWSHWDFGDGVEVMAANCI- 555 Query: 545 NDNRGGTSLWMLVA 558 ++++ML+ Sbjct: 556 -----NSTMYMLMR 564 >gi|310005669|gb|ADP00057.1| tail tube B [Cyanophage 9515-10a] Length = 1000 Score = 57.6 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 77/236 (32%), Gaps = 18/236 (7%) Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVA 284 + + V + + D + A + W + S S Sbjct: 418 LPSMCKHGYIVQVANSENVDADNYYVKFLADNGSGGSGKWEETVR-PHNFSSGSDPMVKG 476 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE--QEGYPSH------- 335 V+ + + +T A + W G+ +PS Sbjct: 477 LDPATMPHALVNNRNGTFTFKKLDETTANADNTDNYWKYREVGDDETNPFPSFKGLEIQK 536 Query: 336 VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395 + FH NRL F + V +S G +++F + D + V+D + I+ Sbjct: 537 IFFHRNRLGFVAN----EQVVMSRPGDYFNFFVVSAITTSD-DNPIDITVSDIKPAFINH 591 Query: 396 MHPFGEGVLVGCDTSLWLL--SISLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVF 448 + P +GV++ D ++L + + +++S A PV +G ++F Sbjct: 592 VLPVQKGVMMFSDNGQFILFTESDIFSPKTARLKKISSYECDDALQPVDMGTSVMF 647 >gi|194473836|ref|YP_002048660.1| tail tubular protein B [Morganella phage MmP1] gi|194307057|gb|ACF42039.1| tail tubular protein B [Morganella phage MmP1] Length = 819 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 66/524 (12%), Positives = 150/524 (28%), Gaps = 70/524 (13%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M + + + G + + D+ + A N L P + + Sbjct: 1 MALVSQSTKNLKGG------ISQQPDILRYPDQGAAQVNGWSSETEGLQKRPPLVFVKQL 54 Query: 61 RLDP--RSNRVFSFS-IPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 S+ + + + L+ F +++ + K P +D Sbjct: 55 GGKNYLGSDPLVHYINRSEDEKYLVAFSGTGVKVFDMEGKEYTVHNNNAAYLKAPNPKQD 114 Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177 + + V+++ + G + D + + G + + Sbjct: 115 LRMVTV---ADYTFIVNRNITVKNRSEKSTGGTFNPKSDCLIAVRGSQYGRTIKVTINGV 171 Query: 178 AKLSIS----QADTSTARITSDMKI---FKPLDKGRSIRLGCHPP--------EWAKNTN 222 +++ + I++D I L G++ P E+ T Sbjct: 172 DRVNFTLHDGAEAWQGRTISTDKVIRYIVDQLTTGKTTEGQGSLPGLGHYGVFEYVTTTP 231 Query: 223 YSIGAYIV-ADDKVYRSLTTGRSGDRFGYSKG---------ATYVKDNNIT--------W 264 G + D VY G+ D + G YV+ + Sbjct: 232 LPSGWTVKGMDGFVYIKAPAGQQIDTITTTDGYSDQLVYPVTHYVQTTAKLPLNAPDNYY 291 Query: 265 ITVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 I V+ + T+ + VW G + ++ A ++ V + Sbjct: 292 IKVVGEAEGTADQYYLKFDKDARVWREAIGWNAILGFQKDTMPHALIRRSDGNFEVKALD 351 Query: 321 WFMSAWGEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGE 371 W G+ + P S V F NRL F + + +S G ++ Sbjct: 352 WSDKEAGDDDTNPDVSLVDRTISDVFFFRNRLGFVSGEN----IVMSRTGRYFKLYPASV 407 Query: 372 YGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVS 430 D + + + PF E +L+ + + ++L+ +++ + Sbjct: 408 AAISDDDPIDVAVSYNRVVD-LQFAVPFTEELLLWANGAQFILTAQGILSPKTVELNLST 466 Query: 431 GSGVY-ACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEIT 473 V+ PV +G + + + S + F +++ Sbjct: 467 QFSVHTGARPVGIGRNVYYASP-----RATFTSINRYFTVQDVS 505 >gi|326536942|ref|YP_004306349.1| tail tubular protein B [Pseudomonas phage phiIBB-PF7A] gi|318054518|gb|ADV35694.1| tail tubular protein B [Pseudomonas phage phiIBB-PF7A] Length = 807 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 52/337 (15%), Positives = 104/337 (30%), Gaps = 49/337 (14%) Query: 265 ITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ---AGVSVVSW 321 L + + S Y G + + I+ ++ A W Sbjct: 273 EGYLVEITGEATRSGDNYWVRYDGAGRVWKETVKPGIIAGLNRATMPRGLVRAADGQFDW 332 Query: 322 FMSAWG-------EQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367 + W E PS V F NRL F + V +S +++F Sbjct: 333 KVLDWNNRGCGDDETNPLPSFVGGTINDVFFFRNRLGFLSGEN----VIMSRSSRYFNFF 388 Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKGLSIDF 426 D L AV+ S + + PF E +L+ D + ++L S + +++ Sbjct: 389 PPSVAALSDDDP-LDIAVSHNRISILKYAVPFSEQLLLWSDQAQFVLSSQGILSPKTVEL 447 Query: 427 RRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEIT------QLADHL 479 + V P +G + F + S ++ + +++ ++ H+ Sbjct: 448 NLTTEFDVQDTARPFGIGRGVYF-----SAPRAAYTSLKRYYAVQDVSDVKNAEDVSAHV 502 Query: 480 F---NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536 R+ + + + +L L + AE +W Sbjct: 503 PSYIENRVFNIHGSGTENYVT--LLSDGAPGIVYLYKFLYMAEDIAQQSWSHWEFGQNVN 560 Query: 537 VLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573 +L AAS G+ +++L+ G R+ Sbjct: 561 ILGAASI------GSYMYLLMDRPEG---IVLERMEF 588 >gi|194100501|ref|YP_002003346.1| gp12 [Yersinia phage Yepe2] gi|193201234|gb|ACF15715.1| gp12 [Yersinia phage Yepe2] Length = 792 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 77/557 (13%), Positives = 158/557 (28%), Gaps = 79/557 (14%) Query: 47 PLVSMPLMQEYRDC----RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSP 102 L P + L + Y ++ G + Sbjct: 41 GLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLNGKEYDVKG 100 Query: 103 ALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD-------HPPHHLLYIQD-------- 147 L + P L V+++ P + L D Sbjct: 101 DLSYVKVENP-----RDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGG 155 Query: 148 --GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205 G ++FT + K GD +++A+ + + + KG Sbjct: 156 MYGRTLAFTINNTKIAYEIAHGDAPEHSKQTDAQW--------LVKKLAGLARLNVAFKG 207 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 + G +N I + D + + + S V+ N + Sbjct: 208 WTFTEGPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQ---SFSRLPVEAPNGYTV 264 Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 ++ +SKTS VW G +G ++ A Q + + W Sbjct: 265 KIVGDTSKTSDMFYVQYDNLKKVWKEVAGWGVQKGLNGDTMPHALVRQADGSFQMQALPW 324 Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 G+ + P+ V F NRL F + + +S ++ Sbjct: 325 AQRTCGDMDTNPTPSIVDQTINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 380 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431 D + AV+ S + + PF E +L+ D + ++LS S++ + Sbjct: 381 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 439 Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481 P VG + F + S + + +++ ++ H+ + Sbjct: 440 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIP 494 Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 + + + I VL S L + E +W + VL+ Sbjct: 495 NGVFSIRGSSTENFIA--VLSSNAPSRIFLYKFLYLNEEISQQSWSHWELGSNVTVLACD 552 Query: 542 SFPNDNRGGTSLWMLVA 558 S G+++++++ Sbjct: 553 SI------GSTMYLVLR 563 >gi|312436378|gb|ADQ83187.1| tail tubular protein B [Yersinia phage Yep-phi] Length = 792 Score = 55.2 bits (131), Expect = 3e-05, Method: Composition-based stats. Identities = 73/552 (13%), Positives = 159/552 (28%), Gaps = 69/552 (12%) Query: 47 PLVSMPLMQEYRDC----RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSP 102 L P + L + Y ++ G +++ + ++S Sbjct: 41 GLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVVFTGQG-VRVFDLDGK-EYSV 98 Query: 103 ALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD-------HPPHHLLYIQD-----GDK 150 K D + + V+++ P + L D Sbjct: 99 KGDLSYVKVGNPRDDLRMVTV---ADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGG 155 Query: 151 ISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRL 210 + + V ++K + +Q + + + KG + Sbjct: 156 MYGRTLAFTINNTKIAYEIAHGDVPEHSKQTDAQW---LVKKLAGLARLNVAFKGWTFTE 212 Query: 211 GCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNL 270 G +N I + D + + + S V+ N + ++ Sbjct: 213 GPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQ---SFSRLPVEAPNGYTVKIVGD 269 Query: 271 SSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 +SKTS VW G +G ++ A Q + + W Sbjct: 270 TSKTSDMFYVQYDNLKKVWKEVAGWGVQKGLNGDTMPHALVRQADGSFQMQALPWAQRTC 329 Query: 327 GEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377 G+ + P+ V F NRL F + + +S ++ D Sbjct: 330 GDMDTNPTPSIVDQTINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD 385 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSG-VY 435 + AV+ S + + PF E +L+ D + ++LS S++ + Sbjct: 386 DP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSD 444 Query: 436 ACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN---QRILQ 486 P VG + F + S + + +++ ++ H+ + + Sbjct: 445 RARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFS 499 Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 + + I VL S L + E +W + VL+ S Sbjct: 500 IRGSSTENFI--SVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSI--- 554 Query: 547 NRGGTSLWMLVA 558 G+++++++ Sbjct: 555 ---GSTMYLVLR 563 >gi|194100452|ref|YP_002003825.1| gp12 [Klebsiella phage K11] gi|193201391|gb|ACF15869.1| gp12 [Klebsiella phage K11] Length = 791 Score = 54.5 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 50/314 (15%), Positives = 95/314 (30%), Gaps = 36/314 (11%) Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 ++ +SKT+ + A VW G V + ++ + W Sbjct: 266 KIVGDTSKTADQYYVKYDASQKVWKETVGWNISVGLEYHTMPWTLVRAADGNFDLGYHEW 325 Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 G+ + P V F NRL F + + LS +++F Sbjct: 326 RDRRAGDDDTNPQPSFVNSTITDVFFFRNRLGFISGEN----IVLSRTSKYFEFYPPSVA 381 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431 Y L AV+ S + + F E +L+ D + ++LS + + + Sbjct: 382 -NYTDDDPLDVAVSHNRVSVLKYAVSFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQ 440 Query: 432 SGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFR----FNEITQLADHLFNQRILQ 486 V P +G + + + Q ++T H+ N I Sbjct: 441 FDVSDRARPYGIGRNIYYASPRSSFTSIMRYYAVQDVSSVKNAEDMT---AHVPNY-IPN 496 Query: 487 LVYQEEPHSI--VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFP 544 VY VL S + + E +W D V++A Sbjct: 497 GVYSINGSGTENFACVLTKGAPSKVFIYKFLYMDENIRQQSWSHWDFGDGVEVMAANCI- 555 Query: 545 NDNRGGTSLWMLVA 558 +++++L+ Sbjct: 556 -----NSTMYLLMR 564 >gi|119637778|ref|YP_919014.1| Tubular tail protein B [Yersinia phage Berlin] gi|119391809|emb|CAJ70682.1| hypothetical protein [Yersinia phage Berlin] Length = 792 Score = 54.1 bits (128), Expect = 7e-05, Method: Composition-based stats. Identities = 79/557 (14%), Positives = 159/557 (28%), Gaps = 79/557 (14%) Query: 47 PLVSMPLMQEYRDC----RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSP 102 L P + L + Y ++ G + Sbjct: 41 GLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLNGKEYDVKG 100 Query: 103 ALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD-------HPPHHLLYIQD-------- 147 L + P L V+++ P + L D Sbjct: 101 DLSYVKVENP-----RDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGG 155 Query: 148 --GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205 G ++FT + K GD +++A+ + + + KG Sbjct: 156 MYGRTLAFTINNTKIAYEIAHGDAPEHSKQTDAQW--------LVKKLAGLARLNVAFKG 207 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 + G +N I + D + + + S V+ N + Sbjct: 208 WTFTEGPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQ---SFSRLPVEAPNGYTV 264 Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 ++ +SKTS VW G +G ++ A Q + V+ W Sbjct: 265 KIVGDTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVRQADGSFQMQVLPW 324 Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 G+ + P+ V F NRL F + + +S ++ Sbjct: 325 TQRTCGDMDTNPTPSIVDQKINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 380 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431 D + AV+ S + + PF E +L+ D + ++LS S++ + Sbjct: 381 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 439 Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481 P VG + F + S + + +++ ++ H+ N Sbjct: 440 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPNYIP 494 Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 + + + I VL S L + E +W + VL+ Sbjct: 495 NGVFSIRGSSTENFI--SVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACD 552 Query: 542 SFPNDNRGGTSLWMLVA 558 S G+++++++ Sbjct: 553 SI------GSTMYLVLR 563 >gi|212671415|ref|YP_002308415.1| tubular tail protein B [Kluyvera phage Kvp1] gi|211997259|gb|ACJ14576.1| tubular tail protein B [Kluyvera phage Kvp1] Length = 793 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 75/552 (13%), Positives = 159/552 (28%), Gaps = 68/552 (12%) Query: 47 PLVSMP---LMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA 103 L P + D + V + +VF +++ + Sbjct: 41 GLQKRPPFIFTKTIGDAGFLGGAPLVHLINRDSIEQYYVVFTGSGVKVFDLNG------R 94 Query: 104 LFGKTYKTPYTFKDNK--SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDE---I 158 + T Y N L V++ ++ + D I Sbjct: 95 EYAVHGDTSYANCANPRDDLRMVTVADYTFVVNRS----KVVQANKDPIYTIREDGECLI 150 Query: 159 KFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWA 218 + I +A I+ + +D + G + W Sbjct: 151 NIRGGQYGRTFTIRLNGISASYKIADGANAPEVEQTDAQWLVKKMAQLLREGGANTWGWT 210 Query: 219 KNTNY-SIGAYIVADDKVYR-SLTTGRSGDRFGYSKGATYV------KDNNITWITVLNL 270 N I D+ +++ + G G + + N + ++ Sbjct: 211 VNEGAGYIHVVSRGDEPIWKVEVEDGYGGQLMSAVMHTSQSFSKLPAEAPNGYSVQIVGD 270 Query: 271 SSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326 +SKTS A VW G + ++ A Q+ + + W Sbjct: 271 TSKTSDAFYVQYDAARKVWKEVAGWGVQKGLNNGTMPHALIRQSDGSFKMEALPWDERKC 330 Query: 327 GEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377 G+ P + V F NRL F + + +S ++ D Sbjct: 331 GDMNTNPDPSIVDQKINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD 386 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVYA 436 + AV+ ST+ + PF E +L+ D + ++LS S S++ + V Sbjct: 387 DP-IDVAVSHNRISTLKYAVPFSEELLLWSDQAQFVLSASGILSPKSVELNLTTEFDVSD 445 Query: 437 -CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN---QRILQ 486 P +G + F + S + + +++ ++ H+ + + Sbjct: 446 KARPYGIGRGVYFASP-----RASYTSINRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFS 500 Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546 + + + VL S + + E +W + VL+ S Sbjct: 501 IRGSGTENFV--SVLSANAPSKIFMYKFLYLNEENVQQSWSHWELGSNVTVLACDSI--- 555 Query: 547 NRGGTSLWMLVA 558 G+++++L+ Sbjct: 556 ---GSTMYLLLR 564 >gi|326424995|ref|YP_004286217.1| virion structural protein [Pseudomonas phage phi15] gi|325048399|emb|CBZ42012.1| virion structural protein [Pseudomonas phage phi15] Length = 793 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 78/628 (12%), Positives = 168/628 (26%), Gaps = 95/628 (15%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEY--- 57 M ++ + + G + + D+ + A+ N L P + Sbjct: 1 MPLSSQSIKNLKGG------ISQQPDVLRYPNQGAQQINGWSSETKGLQKRPPLVFIKRL 54 Query: 58 RDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117 + V + L+F + L I + + + + T +D Sbjct: 55 AESGHFGTKPLVHLINRDAFEQYQLIFHNGALTIFDLAGNN-YPVSGSLSYIATANPRED 113 Query: 118 NKSLEYAVFGSTAV----------FVHKDHPPHH----LLYIQDGDKISFTFDEIKFLPP 163 + L A + H +P + + + Sbjct: 114 LRLLTVADYTFILNRTKTVEMSSELTHTGYPALNSRALVSCRGGQYGRTLRIRANGVELA 173 Query: 164 PWLGDGMISGVKSNAKLSISQADTSTARI---------TSDMKIFKPLDKGRSIRLGCHP 214 + ++ + ++ D T+ + G Sbjct: 174 SYELPDGLAENNTELSKEVAAMDAQAIVKELVKRVNAGTATHGFSAAEGPSHLVIYGNGQ 233 Query: 215 PEWAKNTNYSIGAYIVADDKVYRSLTT-----GRSGDRFGYSKGATYVKDNN-ITWITVL 268 P T +++ TT +G + A+ DN + + Sbjct: 234 PINNIYTEDGYADQLISGLIYQVQTTTKLPITAPAGYLVEITGEASRSGDNYWVRYDGAA 293 Query: 269 NLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE 328 + +T + + P ++ A Q ++W G+ Sbjct: 294 KVWKETVKPGIISGINP--------------GTMPHALIRQADGTFSFGPLTWAKRTAGD 339 Query: 329 --QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK 379 PS V F NRL F + + +S ++ D Sbjct: 340 DETNPMPSLVDNKLNDVFFFRNRLGFLSGEN----IIMSKTAKYFQLFPSSVAASADDDP 395 Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVY-AC 437 + AV+ S + + PF E +L+ D + + L+ S + + V A Sbjct: 396 -IDVAVSHSRISILKYAVPFSEQLLLWSDQAQFTLTSSGVLSAKTAQLDLTTEFDVLDAA 454 Query: 438 PPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN---QRILQLV 488 P +G + F R + +++ ++ H+ ++ + Sbjct: 455 RPYGLGRGVYFAAPRARFCSIKRY-----YAVADVSNVKNAEDVSGHVPTYIPNKVHNVN 509 Query: 489 YQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNR 548 + + VL D S + + E +W H K +LS S Sbjct: 510 GSGTENFV--SVLTDGDPSKVFIYKFLYQDENLAQQSWS-HWTFGKCKILSMFSI----- 561 Query: 549 GGTSLWMLVALSAGEERSFTVRLNLLDD 576 G+ + ++ + G RL +D Sbjct: 562 -GSYTYTIMDRAEGV---VLERLEFTND 585 >gi|310005781|gb|ADP00167.1| tail tube protein B [Cyanophage NATL2A-133] Length = 985 Score = 52.6 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 64/166 (38%), Gaps = 17/166 (10%) Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE--QEGYPSH-------VTFHNNRLLF 345 V+ + + +T A + W G+ +PS + FH NRL Sbjct: 467 VNNRNGTFTFKKLDETTANADSNDNYWKYREVGDDITNPFPSFKGLKISKIFFHRNRLGL 526 Query: 346 SGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV 405 + V +S G +++F + D + V+D + I+ + P +GV++ Sbjct: 527 IAN----EQVVMSRPGDYFNFQIVSAITTSD-DNPVDITVSDIKPAFINHVLPIQKGVMM 581 Query: 406 GCDTSLWLL--SISLSKGLSIDFRRVSGSGVY-ACPPVSVGDCLVF 448 D +LL + + +++S Y A P+ +G ++F Sbjct: 582 FSDNGQFLLFTESDIFSPKTARLKKLSSYETYPALDPIDMGTSVMF 627 Score = 40.6 bits (93), Expect = 0.75, Method: Composition-based stats. Identities = 27/325 (8%), Positives = 75/325 (23%), Gaps = 20/325 (6%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M +F G + + D + + N +P L+ P + + Sbjct: 1 MPAINQRIPNFLGG------VSQQPDTIKYPGQLRVCDNAVPDVTFGLMKRPPGEFVKTL 54 Query: 61 RLDPRSNRVFSFSIPDGGYALLVF-------GDKKLQIVVVRSSTKWSPALFGKTYKTPY 113 + L+ G K ++I + + + S Y Sbjct: 55 TNANADGYWYEILRDGDEKYLVQMTALSSYSGTKPIRIWNLLTGVEQSLTNSNGDSLFSY 114 Query: 114 TFKDNKSLEYA-VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMIS 172 + ++ YA + H D + + + + + ++ Sbjct: 115 MEQSGTTIPYATQTIQDYTIISNPHKTVTTTGTTDAPLANGNYAFARLDTIAYNTEYILY 174 Query: 173 GVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS-IRLGCHPPEWAKNTNYSIGAYIVA 231 + + T+ + D + G ++ + + ++ Sbjct: 175 TGSTAPAANKYYRVTALSVDKGTNDGNTWDDTNKDGRYAGLAQFSFSDSLCEDVEGHVTV 234 Query: 232 DDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291 + Y T G T +++ + Sbjct: 235 NAASYVDSNTAN-----YDGGGTAQSNFLGYTQNYKTRYTAQIVLKDGGLIKTGSESTAL 289 Query: 292 IKDVSKDGRSISVAPQSQTLFQAGV 316 + IS + + + + Sbjct: 290 SRHHDITIEGISYRVKVKAVEEVDT 314 >gi|148724484|ref|YP_001285450.1| tail tube B [Cyanophage Syn5] gi|145588129|gb|ABP87948.1| tail tube B [Synechococcus phage Syn5] Length = 905 Score = 52.6 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 58/396 (14%), Positives = 119/396 (30%), Gaps = 64/396 (16%) Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235 + L I Q + + + G I + ++++G V Sbjct: 297 TAGTLDIGQITAGLVNSVNLISNYSAQAVGNVIEIER-----TDGRDFNLG---VRGGAT 348 Query: 236 YRSLTTGR-SGDRFGYSKGATYVK--------DNNITWITVLNLSSKTSRESASGAVAPY 286 R++T + + + G + +N + + S SG+ Sbjct: 349 NRAMTAIKGTANSIVDLPGQCFDGFELKVINTENAESDDYYVVFRSAAEGIPGSGSWEET 408 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLF-----QAGVSVVSWFMSAWGE--QEGYPSHV--- 336 G + + ++ Q+ F ++ W G+ PS V Sbjct: 409 VAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFVGRG 468 Query: 337 ----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAST 392 F+NNRL F V +S G +++F + D + + + Sbjct: 469 ISDMFFYNNRLGFLSEDA----VIMSQPGDYFNFFVTSAITISDSDP-IDVTASSTKPAI 523 Query: 393 IHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVY----ACPPVSVGDCLVF 448 + +G+++ + S +LL S S +++ Y PVS G + F Sbjct: 524 LRAAIGAPKGLILFAENSQFLL-ASQEVVFSTATIKLTEISDYFYRSLAKPVSTGVSIAF 582 Query: 449 VCGVG--RRIKYISGSTEQGF-RFNEITQLADHLFNQRILQLVYQEEPHSIVWVV----- 500 V +I +S + + +IT++ + P + W V Sbjct: 583 VSEADTYSKIFEMSIDSVDNRPQVADITRIVP------------EYVPTGLTWSVSTPNN 630 Query: 501 --LEPKDNSFPRLLGCRFSA-EGEGDFAWHTHMISD 533 + DNS + F+ W ++ Sbjct: 631 SMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPG 666 >gi|291334274|gb|ADD93937.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 119 Score = 51.8 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 37/111 (33%), Gaps = 5/111 (4%) Query: 259 DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQ-----TLFQ 313 D + I+ N + + +G+ + D + G + + + + Sbjct: 6 DGSTIVISGANTVDTITASNINGSRTITVLNEDSYSFTAGGSANADNTDAGGGVSIFVTS 65 Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFY 364 W + GYP+ TFH+ RL F GS V+ S F Sbjct: 66 PNQPNSQWQEQTYSTIRGYPASATFHDGRLWFGGSSSLPDWVWASKVDEFL 116 >gi|311875239|emb|CBX44498.1| putative tail tubular protein B [Erwinia phage phiEa1H] gi|311875360|emb|CBX45101.1| putative tail tubular protein B protein [Erwinia phage phiEa100] Length = 806 Score = 51.8 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 68/595 (11%), Positives = 152/595 (25%), Gaps = 82/595 (13%) Query: 39 NLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI--PDGGYALLVFGDKKLQIVVVRS 96 N +P L + + +N + D ++ ++ ++ Sbjct: 32 NAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAEEYFVILQPGQVPVIFTVG 91 Query: 97 STKWSPALFGKTYKTPYTFKDNKS--LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154 P + T + + G +++ P ++ + Sbjct: 92 GL-ACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQ------ARGDVTPS 144 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214 D + + + N +++ T+ + K D R+ + Sbjct: 145 LDNKGLVYVAYANFSFTYQILINGQVAAEHK-------TASSEDVKNEDLVRTDYVAGKL 197 Query: 215 PEWAKNTNYSIGAYIVADD------------KVYRSLTTGRSGD-----RFGYSKGATYV 257 E + S + + D + G G R + T Sbjct: 198 LENFNSRTASFPGFSMYQDGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLP 257 Query: 258 KDNNITWITVLNLSS-------KTSRESASGAVAPYYVW-GDIKDVSKDGRSISVAPQSQ 309 + + + + ES G+ + + ++ + Sbjct: 258 NRAPVGYKVQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHVLVRE 317 Query: 310 TLFQAGVSVVSWFMSAWGE-------QEGYPSHVT-----------FHNNRLLFSGSKGD 351 +L G + ++ W + +PS + NRL+ Sbjct: 318 SLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLML----TS 373 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW-MHPFGEGVLVGCDTS 410 +V S F+DF D I W G+ VL D Sbjct: 374 GEAVVASRTSRFFDFFRYTVLATVDTDP-FDVFADIEEVYNIRWSAQMDGDVVLFTSDQQ 432 Query: 411 LWLLSISLSKGLSIDFRRVSGSG-VYACPPVSVGDCLVFV--CGVGRRIKYIS-GSTEQG 466 L S R V+ P GD ++F G I+ S Sbjct: 433 FTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDT 492 Query: 467 FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + T D ++L+L + ++ D + + + + AW Sbjct: 493 KKAQPATSHVDKYIRGKVLELSASSSFNRA--FIITSSDRNILYVYDWLYEGTEKVQNAW 550 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL---SAGEERSFTVRLNLLDDFK 578 H + + + L++++ S G + +++ D+ + Sbjct: 551 HKWSFPAGTVLHAVS------YSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELE 599 >gi|125999999|ref|YP_001039670.1| tail tubular protein B-like protein [Erwinia amylovora phage Era103] gi|121621855|gb|ABM63429.1| tail tubular protein B-like protein [Enterobacteria phage Era103] Length = 806 Score = 51.8 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 68/595 (11%), Positives = 152/595 (25%), Gaps = 82/595 (13%) Query: 39 NLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI--PDGGYALLVFGDKKLQIVVVRS 96 N +P L + + +N + D ++ ++ ++ Sbjct: 32 NAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAEEYFVILQPGQVPVIFTVG 91 Query: 97 STKWSPALFGKTYKTPYTFKDNKS--LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154 P + T + + G +++ P ++ + Sbjct: 92 GL-ACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQ------ARGDVTPS 144 Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214 D + + + N +++ T+ + K D R+ + Sbjct: 145 LDNKGLVYVAYANFSFTYQILINGQVAAEHK-------TASSEDVKNEDLVRTDYVAGKL 197 Query: 215 PEWAKNTNYSIGAYIVADD------------KVYRSLTTGRSGD-----RFGYSKGATYV 257 E + S + + D + G G R + T Sbjct: 198 LENFNSRTASFPGFSMYQDGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLP 257 Query: 258 KDNNITWITVLNLSS-------KTSRESASGAVAPYYVW-GDIKDVSKDGRSISVAPQSQ 309 + + + + ES G+ + + ++ + Sbjct: 258 NRAPVGYKVQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHVLVRE 317 Query: 310 TLFQAGVSVVSWFMSAWGE-------QEGYPSHVT-----------FHNNRLLFSGSKGD 351 +L G + ++ W + +PS + NRL+ Sbjct: 318 SLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLML----TS 373 Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW-MHPFGEGVLVGCDTS 410 +V S F+DF D I W G+ VL D Sbjct: 374 GEAVVASRTSRFFDFFRYTVLATVDTDP-FDVFADIEEVYNIRWSAQMDGDVVLFTSDQQ 432 Query: 411 LWLLSISLSKGLSIDFRRVSGSG-VYACPPVSVGDCLVFV--CGVGRRIKYIS-GSTEQG 466 L S R V+ P GD ++F G I+ S Sbjct: 433 FTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDT 492 Query: 467 FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 + T D ++L+L + ++ D + + + + AW Sbjct: 493 KKAQPATSHVDKYIRGKVLELSASSSFNRA--FIITSPDRNILYVYDWLYEGTEKVQNAW 550 Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL---SAGEERSFTVRLNLLDDFK 578 H + + + L++++ S G + +++ D+ + Sbjct: 551 HKWSFPAGTVLHAVS------YSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELE 599 >gi|194100345|ref|YP_002003775.1| gp12 [Enterobacteria phage EcoDS1] gi|193201340|gb|ACF15819.1| gp12 [Enterobacteria phage EcoDS1] Length = 785 Score = 51.0 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 67/434 (15%), Positives = 120/434 (27%), Gaps = 38/434 (8%) Query: 47 PLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGG-YALLVFGDKKLQIVVVRSSTKWSPALF 105 L P R +D SN F D +VF +Q+V + + + Sbjct: 41 GLQKRPPTVFKRRLNIDVGSNPKFHLINRDEQEQYYIVFNGSNIQVVDLSGN------QY 94 Query: 106 GKTYKTPYTFKDNK--SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPP 163 + + Y N + V++ I Sbjct: 95 SVSGEVDYVKSSNPRDDIRVVTVADYTFIVNRKVVVKGGSEKSHSGYNRKARALINLRGG 154 Query: 164 PW---LGDGMISGVKSNAKLSI-------SQADTSTARITSDMKIFKPLDKGRSIRLGCH 213 + L G+ GVK + KL + A + + + LG Sbjct: 155 QYGRTLKVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVAAYPTFTFDLGSG 214 Query: 214 PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273 + I + D + ++ + I N S+ Sbjct: 215 FLLITAPSGTDINSVETEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSAD 274 Query: 274 TSRESASGAVAPYYVW---GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE 330 + G + ++ QS F+ S S + Sbjct: 275 EYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVRQSDGSFEFKTLDWSKRGSGNDDTN 334 Query: 331 GYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383 PS V F+ NRL F + V +S +++ F D + Sbjct: 335 PMPSFVDATINDVFFYRNRLGFLSGEN----VIMSRSASYFAFFPKSAATLSDDDP-IDV 389 Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS---LSKGLSIDFRRVSGSGVYACPPV 440 AV+ S + + PF E +L+ D ++++ S +K + +D G A P Sbjct: 390 AVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTAKSIQLDVGSEFSLGDNA-RPF 448 Query: 441 SVGDCLVFVCGVGR 454 +VG + F G Sbjct: 449 AVGRSVFFSAPRGS 462 >gi|194100290|ref|YP_002003488.1| gp12 [Enterobacteria phage BA14] gi|193201285|gb|ACF15765.1| gp12 [Enterobacteria phage BA14] Length = 795 Score = 50.6 bits (119), Expect = 6e-04, Method: Composition-based stats. Identities = 49/317 (15%), Positives = 104/317 (32%), Gaps = 42/317 (13%) Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 ++ +SKTS + VW G +G ++ A Q+ + + W Sbjct: 268 KIVGDTSKTSDQFYVQYDNVKKVWKEVAGWGVQKGLNGGTMPHALVRQSDGSFQMQALPW 327 Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 G+ + P+ V F NRL F + + +S ++ Sbjct: 328 SQRTCGDMDTNPTPSIVDQTINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 383 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431 D + AV+ S + + PF E +L+ D + ++LS S++ + Sbjct: 384 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 442 Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481 P VG + F + S + + +++ ++ H+ + Sbjct: 443 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIP 497 Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 + + + I VL S L + E +W + VL+ Sbjct: 498 NGVFSIRGSGTENFI--SVLSANAPSKIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACD 555 Query: 542 SFPNDNRGGTSLWMLVA 558 S G+++++++ Sbjct: 556 SI------GSTMYLVLR 566 >gi|326536137|ref|YP_004300571.1| gp12 [Enterobacteria phage 285P] gi|256861526|gb|ACV32482.1| gp12 [Enterobacteria phage 285P] Length = 795 Score = 50.6 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 49/317 (15%), Positives = 104/317 (32%), Gaps = 42/317 (13%) Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 ++ +SKTS + VW G +G ++ A Q+ + + W Sbjct: 268 KIVGDTSKTSDQFYVQYDNVKKVWKEVAGWGVQKGLNGGTMPHALVRQSDGSFQMQALPW 327 Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 G+ + P+ V F NRL F + + +S ++ Sbjct: 328 SQRTCGDMDTNPTPSIVDQSINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 383 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431 D + AV+ S + + PF E +L+ D + ++LS S++ + Sbjct: 384 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 442 Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481 P VG + F + S + + +++ ++ H+ + Sbjct: 443 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIP 497 Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541 + + + I VL S L + E +W + VL+ Sbjct: 498 NGVFSIRGSGTENFI--SVLSANAPSKIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACD 555 Query: 542 SFPNDNRGGTSLWMLVA 558 S G+++++++ Sbjct: 556 SI------GSTMYLVLR 566 >gi|148747829|ref|YP_001285795.1| tail tubular protein B [Phormidium phage Pf-WMP3] gi|146230062|gb|ABQ12470.1| tail tubular protein B [Phormidium phage Pf-WMP3] Length = 1027 Score = 50.6 bits (119), Expect = 8e-04, Method: Composition-based stats. Identities = 53/367 (14%), Positives = 106/367 (28%), Gaps = 35/367 (9%) Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL---GDGMISGVKSNAKL-SIS 183 T + +P LLY ++TF GDG + V ++A L + Sbjct: 262 DTIQGTYGRYPM--LLYKTATFNDTYTFSNTGQPANADSYGWGDGSVYNVGASAYLNTSP 319 Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG- 242 T T + + + R L + A N + A Y S G Sbjct: 320 FFATFGDTRTPTPQPPETVHLLRQRELRFNYGNGATGANLRVTVDGTALSANYSSTVAGT 379 Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302 + G +++ + + + S + AV V D + G + Sbjct: 380 NRAYALYKADGTLCTSASDLAY-YIAFTGATPLGISPTAAVTITNV-----DRTYIGSAA 433 Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWG--EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360 + Q+ + G + + W +P T + +RL+ G D V S+ Sbjct: 434 T---QTDNAYVQGGYFKVYGLGLWANYGTGQFPRIATVYQSRLVLGGFTNDPTRVVFSAT 490 Query: 361 GA-------FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413 G + F + + D + + + + + + V + Sbjct: 491 GDTVEGGVKYNFFQVTDDLDGLDSDPFDLVVSSSQADDYVTGLVEWQSSLFVLTRRA--T 548 Query: 414 LSISLSKGLSIDFRRVSGSGVY--ACPP---VSVGDCLVFVCGVGRRIKYISGSTEQG-F 467 + RR P V + ++ G + ++ E G + Sbjct: 549 FRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDSG--VFNLTPRVEDGEY 606 Query: 468 RFNEITQ 474 + E + Sbjct: 607 QAIEKSI 613 >gi|18640503|ref|NP_570344.1| tail protein A [Synechococcus phage P60] gi|18478733|gb|AAL73282.1| tail protein A [Synechococcus phage P60] Length = 680 Score = 50.2 bits (118), Expect = 0.001, Method: Composition-based stats. Identities = 42/353 (11%), Positives = 93/353 (26%), Gaps = 46/353 (13%) Query: 18 PRLLQS---RKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74 P LL + D V ++RN+ + P + P+ + Sbjct: 9 PNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTGIPKRAKWIPIMR 68 Query: 75 PD-GGYALLVF-------GDKKLQIVVVRSSTKWSPALFGKTYK--TPYTFKDNKSLEYA 124 Y + ++ GD ++++ +++ + + + G + P D +++ Sbjct: 69 DAREHYYVAIYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSL 128 Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184 G + + P SF+ + G G V + S Q Sbjct: 129 TIGDYTFLSNPNVQP-------TTWSRSFSRRPEGLVTIGAAGYGTSYIVDFATEDSGQQ 181 Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEW----------AKNTNYSIGAYIVADDK 234 + + + K D W + + + Sbjct: 182 RRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLVDDGEEYGHN 241 Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKT--------------SRESAS 280 +T G+ V + W + ++ +S Sbjct: 242 YIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGG 301 Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQS--QTLFQAGVSVVSWFMSAWGEQEG 331 GA V G ++ G + + + + + + MSA G G Sbjct: 302 GASTSDIVTGLSAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSG 354 Score = 44.8 bits (104), Expect = 0.034, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 39/134 (29%), Gaps = 8/134 (5%) Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM 396 + NRL F V +S G +++F D + Sbjct: 482 FMYKNRLGFLTQDA----VIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAI 537 Query: 397 HPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSG-VYACPPVSVGDCLVFVCGVG- 453 +L G L S S + ++S PV G ++F +G Sbjct: 538 SSTSGAILFGNQAQFRLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMGT 597 Query: 454 -RRIKYISGSTEQG 466 + +S + +G Sbjct: 598 YSSVYELSTESAKG 611 >gi|325171208|ref|YP_004251180.1| hypothetical protein ViPhICP2p09 [Vibrio phage ICP2] gi|323512234|gb|ADX87691.1| conserved hypothetical protein [Vibrio phage ICP2] gi|323512306|gb|ADX87762.1| hypothetical protein TU12-16_00040 [Vibrio phage ICP2_2006_A] Length = 734 Score = 49.9 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 62/393 (15%), Positives = 117/393 (29%), Gaps = 49/393 (12%) Query: 110 KTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH---HLLYIQDGDKISFTFDEIKFLPPPWL 166 +T Y+ + FG F P L + E Sbjct: 79 QTHYSA--IPEILVVQFGDKLHFFDTSVDPLSNGKLFINNQEFLTTEGTTEDIISGASVE 136 Query: 167 GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226 G + + ++ +S+ D + IT+ KI R + W ++ Sbjct: 137 GIFVFATQDADP-ISLQIMDIQSDSITARTKIV----VDRKVLFLETRDVWGRSAPSKER 191 Query: 227 AYIVADDKVYRSLTTGRSGDRFGYSKGA--TYVKDNNITWITVLNLSSKTSRESASGAVA 284 ++ D +Y + G + + Y +I W L ++ + +A G Sbjct: 192 PKTLSSDYLYELINQGWDTKKINSTYATIGAYPSGYDIWW---LYKTTAGTDANAIGKFT 248 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344 P + KD + + + Q S V+ G PS + R+ Sbjct: 249 PSRM--------KDSTTTGIGQERQNTPAPRGSTVASLQVLAS---GKPSCIQTFAGRVF 297 Query: 345 FSGSKGDE-----------LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTD------ 387 ++G + V+ S + ++ Y DPT + +A+ D Sbjct: 298 YAGFQATPRKIDDVRPDFRNHVFFSQL-VKSNAEINKCYQFADPTSEVDSALVDTDGGFI 356 Query: 388 --FSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID---FRRVSGSGVYACPPVSV 442 +A I M G+ + + +WLLS + S +++ G + V Sbjct: 357 KINAARKIVAMEEVSSGLFIIAENGVWLLSGTSDGLFSATGYHVDKITDYGCVSPRSVVA 416 Query: 443 GDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL 475 VF I T +T+L Sbjct: 417 YGDTVFYWAEEGIIVLSPDQTTGKHSAQNLTEL 449 >gi|29366731|ref|NP_813776.1| tail tubular protein B [Pseudomonas phage gh-1] gi|29243590|gb|AAO73169.1|AF493143_30 tail tubular protein B [Pseudomonas phage gh-1] Length = 808 Score = 49.9 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 69/491 (14%), Positives = 137/491 (27%), Gaps = 57/491 (11%) Query: 26 DLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCR---LDPRSNRVFSFSIPDGGYALL 82 D+ + A N L P + + V + + Sbjct: 20 DILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDAQEQYFV 79 Query: 83 VFGDKKLQIVVVRSST----KWSPALFGKTYKTPYTFKDNKSLEYAVFGSTA-----VFV 133 F L + ++ + ++ +T + V +T Sbjct: 80 GFSGTGLAVWDLKGNNYTVRGYNGYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLT 139 Query: 134 HKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI------SQADT 187 H +P I + + + G + +K + A Sbjct: 140 HAAYPRLDGRAIINVRGGQYGRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGM 199 Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-NYSIGAYIVADDKVYRSLTTGRSGD 246 + +T I L + ++ LG + T I A A+D V + T D Sbjct: 200 NQVDMTDASWIAAELARQLTVSLGGSGWSFQAGTGWILINA--PANDNVRQIATKDGYAD 257 Query: 247 -----RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 + T + N L + S S Y G + + + Sbjct: 258 TLLSGFIYQVQTFTKLPANAPP--GYLVEITGESARSGDNYWVQYDASGKVWKETAKPKI 315 Query: 302 IS---VAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-------TFHNNRLL 344 I+ A L +A W W + PS V F NRL Sbjct: 316 IAGFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLG 375 Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL 404 F + V +S +++F D + A++ S + + PF E +L Sbjct: 376 FLSGEN----VVMSRTSKYFNFFPSSVATLSDDDP-IDVAISHNRISILKYAVPFSEQLL 430 Query: 405 VGCDTSLWLLSI-SLSKGLSIDFRRVSGSG-VYACPPVSVGDCLVFVCGVGRRIKYISGS 462 + D + ++LS ++ +I+ + P +G + F + S Sbjct: 431 LWSDQAQFVLSSKTILSSKTIELDLTTEFDVSDGARPYGIGRGVYF-----AAPRASFTS 485 Query: 463 TEQGFRFNEIT 473 ++ + +++ Sbjct: 486 LKRYYAIQDVS 496 >gi|9634037|ref|NP_052111.1| tail tubular protein B [Yersinia phage phiYeO3-12] gi|6599028|emb|CAB63632.1| tail tubular protein B [Yersinia phage phiYeO3-12] Length = 801 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 60/165 (36%), Gaps = 21/165 (12%) Query: 329 QEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381 YPS + F NRL F + + LS +++F Y + Sbjct: 344 TNPYPSFTGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFP-ASVSNYSDDDPI 398 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVYA-CPP 439 AV+ ST+ + PF E +L+ D + ++L+ S S++ + V P Sbjct: 399 DVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARP 458 Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ 482 VG + F + S + + +++ + A+ + Sbjct: 459 HGVGRNVYFASP-----RASFTSINRYYAVQDVSSVKNAEDMTAH 498 >gi|291335597|gb|ADD95206.1| tail tubular protein B [uncultured phage MedDCM-OCT-S04-C650] Length = 845 Score = 49.5 bits (116), Expect = 0.002, Method: Composition-based stats. Identities = 67/511 (13%), Positives = 125/511 (24%), Gaps = 74/511 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 M T T +F G + + D V + N P L+ P M+ Sbjct: 1 MPAITQTIPNFLGG------VSRQNDDKKLINQVTECVNGYPDPTYGLLKRPGMEHVNVL 54 Query: 61 RLDPR----------SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK 110 + + F + G + + + T + G Y Sbjct: 55 KKADGTAFSKTELADAAWFFI-DRDNAGSYIGAIKGTNIYVWTKEDGTFCTVNNTGTAYL 113 Query: 111 TP-----YTFKDNKSLEYAVFGSTAVFVH--KDHPPHHLLYIQDGDKISFTFDEIKFLPP 163 T Y F+ + + + + + ++ ++ D I + Sbjct: 114 TGTQQSDYHFRSVQDVTVITNKTVTTAMQATPAAAVKSVGTLKLN-SVTDGLDYIVTIQG 172 Query: 164 PWLGDGMISGVKSNAKLSISQADTST---------ARITSDMKIFKPLDKGRSIRLGCHP 214 S + L +D +T A I + G L + Sbjct: 173 IATSISAQSHTTFDDMLVYDSSDVNTNHHLVDAIKATIEAQHSASNADFDG-VWSLEAYT 231 Query: 215 PEWAKNTNYSIGAYIVADDKVYRSLTT---------------------GRSGDRFGYSKG 253 N A + + T G S + S Sbjct: 232 NSLVIKRNAGTNAVVTDYTAPTGAATAFTIEAKGGLGNAGIEVFQDSVGSSAELSVESFN 291 Query: 254 ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ 313 +VK N + G G D ++ ++ Sbjct: 292 GHHVKVRNTNSADDDYYLEFEAFNGTRGKGFWKEAKGVDVSPGLDAATMPFQLENVGATT 351 Query: 314 AGVSVVSWFMSAWGEQE--------GYPSHVTF-HNNRLLFSGSKGDELSVYLSSFGAFY 364 + W G+ GY TF +NNR ++L + Sbjct: 352 FNFKPIPWTARLVGDTNSNPDPSFIGYKITSTFFYNNRFGVLSEDN----IFLGVANDSF 407 Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS---LWLLSISLSKG 421 +F + D + V ++ + P +G+L+ ++ S + Sbjct: 408 NFFVKSALTQVDSDP-IDLNVASVRPVVLNDVLPSPQGLLLFSARQQFQVYSASATTMTP 466 Query: 422 LSIDFRRVSGSG-VYACPPVSVGDCLVFVCG 451 + R +S PV VG FV Sbjct: 467 KTTVIRSISNYEMSSDISPVDVGTTAAFVNR 497 >gi|189427235|ref|YP_001949785.1| gp12 [Salmonella phage phiSG-JL2] gi|189085888|gb|ACD75703.1| gp12 [Salmonella phage phiSG-JL2] Length = 801 Score = 49.1 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 56/154 (36%), Gaps = 19/154 (12%) Query: 329 QEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381 YPS + F NRL F + + LS +++F Y + Sbjct: 344 TNPYPSFTGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFP-ASVSNYSDDDPI 398 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVYA-CPP 439 AV+ ST+ + PF E +L+ D + ++L+ S S++ + V P Sbjct: 399 DVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARP 458 Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEIT 473 VG + F + S + + +++ Sbjct: 459 HGVGRNVYFASP-----RASFTSINRYYAVQDVS 487 >gi|17570828|ref|NP_523337.1| tail tubular protein B [Enterobacteria phage T3] gi|17384312|emb|CAC86300.1| tail tubular protein B [Enterobacteria phage T3] Length = 801 Score = 49.1 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 59/165 (35%), Gaps = 21/165 (12%) Query: 329 QEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381 YPS + F NRL F + + LS +++F Y + Sbjct: 344 TNPYPSFTGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFP-ASVSNYSDDDPI 398 Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI-SLSKGLSIDFRRVSGSGVYA-CPP 439 AV+ ST+ + PF E +L+ D + ++L+ + S+ + V P Sbjct: 399 DVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTASDILSSRSVGLNLTTQFDVQDRARP 458 Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ 482 VG + F + S + + +++ + A+ + Sbjct: 459 HGVGRNVYF-----SSPRASFTSINRYYAVQDVSSVKNAEDMTAH 498 >gi|310005866|gb|ADP00251.1| tail tube protein B [Cyanophage Syn26] Length = 977 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 50/390 (12%), Positives = 109/390 (27%), Gaps = 37/390 (9%) Query: 88 KLQIVVVRSSTKWSPALFGKTYKTPYTFK--DNKSLEYA----VFGSTAVFVHKDHPPHH 141 +I S ++ + T Y + L Y G D Sbjct: 280 YFRIRTTGQSVPFTTGAGNEQVTT-YQARYTTTFDLLYGGSGWQQGDYFYVWMDDGYYKV 338 Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201 ++ +I I+ P P+ + I+ + + DT Sbjct: 339 VIEAISTTQIQANLGLIRPNPTPFDTETTITASGILGDIRQAIIDTGNFTS------ANV 392 Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261 G + + P N + +S+ + GY + + Sbjct: 393 QQIGNGLYITR--PSGTFNATAPTSDLLKVMSSEVKSVDDLPDQCKHGYVVKVANSEAD- 449 Query: 262 ITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 + R G +++ D ++ + Q VS +W Sbjct: 450 -EDDYYVKFFGNNDR---DGDGVWEECAKPGRNIEFDKGTMPIQLVRQANGTFLVSQATW 505 Query: 322 FMSAWGE--QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372 + G+ PS V F NRL+F + V +S G F++F Sbjct: 506 ENAEVGDDLTNPNPSFVGKTVNQLVFFRNRLVFLSDEN----VIMSRPGEFFNFW-SKTA 560 Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS--KGLSIDFRRVS 430 + P + + + + ++ G+L+ ++L+ + V+ Sbjct: 561 TTFTPMDVIDLSCSSEYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKLNAVA 620 Query: 431 GSGVYA-CPPVSVGDCLVFVCGVGRRIKYI 459 P+++G + F+ + ++ Sbjct: 621 SYNFNEKTNPINLGTTVAFIDNANQFTRFF 650 >gi|224164141|ref|XP_002338648.1| predicted protein [Populus trichocarpa] gi|222873077|gb|EEF10208.1| predicted protein [Populus trichocarpa] Length = 350 Score = 48.3 bits (113), Expect = 0.004, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 22/67 (32%), Gaps = 6/67 (8%) Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEER 565 S LL + + +G W H S A P +++ +V + G Sbjct: 3 RSDGTLLSLTYVKD-QGVLGWARHTTDGTF--ESVAVIP--EGTEDAVYAVVKRTIGSRT 57 Query: 566 -SFTVRL 571 + ++ Sbjct: 58 VRYVEKI 64 >gi|77118200|ref|YP_338122.1| tail tube [Enterobacteria phage K1F] gi|72527944|gb|AAZ72996.1| tail tube [Enterobacteria phage K1F] gi|83308152|emb|CAJ29385.1| gp12 protein [Enterobacteria phage K1F] Length = 785 Score = 47.9 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 73/439 (16%), Positives = 122/439 (27%), Gaps = 48/439 (10%) Query: 47 PLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGG-YALLVFGDKKLQIVVVRSSTKWSPALF 105 L P R +D SN F D +VF +QIV + + ++S + Sbjct: 41 GLQKRPPTVFKRRLNIDVGSNPKFHLINRDEQEQYYIVFNGSNIQIVDLSGN-QYSVSGS 99 Query: 106 GKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPW 165 K+ D + V++ I + Sbjct: 100 VDYVKSSNPRDD---IRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQY 156 Query: 166 ---LGDGMISGVKS-------------NAKLSISQADTSTARITSDMKIFKPLDKGRSIR 209 L G+ GVK K+ + + D G Sbjct: 157 GRTLKVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTFDLGSGFL 216 Query: 210 LGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKG--ATYVKDNNITWITV 267 L P N+ + Y S G + N + Sbjct: 217 LITAPSGTDINSVETEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEY 276 Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW-FMSAW 326 + ++ V P V G D ++ A Q+ + W A Sbjct: 277 YVMYDSNTKTWKE-TVEPGVVTGF------DNTTMPHALVRQSDGSFEFKALDWSKRGAG 329 Query: 327 GE-QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPT 378 + PS V F+ NRL F + V +S +++ F D Sbjct: 330 NDDTNPMPSFVDATINDVFFYRNRLGFLSGEN----VIMSRSASYFAFFPKSVATLSDDD 385 Query: 379 KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS---LSKGLSIDFRRVSGSGVY 435 + AV+ S + + PF E +L+ D ++++ S SK + +D G Sbjct: 386 P-IDVAVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTSKSIQLDVGSEFALGDN 444 Query: 436 ACPPVSVGDCLVFVCGVGR 454 A P +VG + F G Sbjct: 445 A-RPFAVGRSVFFSAPRGS 462 >gi|315518952|dbj|BAJ51829.1| putative tail tubular protein B [Ralstonia phage RSB2] Length = 788 Score = 46.4 bits (108), Expect = 0.014, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 46/116 (39%), Gaps = 9/116 (7%) Query: 336 VTFHNNRL-LFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394 + F NRL + +G V LS+ G F+ F D + AV+ ST+H Sbjct: 351 IFFFRNRLGILAGEN-----VILSASGEFFKFWPKSVVTAADTDP-IDVAVSHNRVSTLH 404 Query: 395 WMHPFGEGVLVGCDTSLWLL-SISLSKGLSIDFRRVSGS-GVYACPPVSVGDCLVF 448 F E +L+ D + ++L S + ++ + PV+ G + F Sbjct: 405 HAVSFAEELLLWSDQTQFILKSDGILSTKTVKVDTATEFESAIDARPVAAGRGVYF 460 >gi|167841461|ref|ZP_02468145.1| tail tubular protein B [Burkholderia thailandensis MSMB43] Length = 853 Score = 46.0 bits (107), Expect = 0.016, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 59/177 (33%), Gaps = 23/177 (12%) Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE---GYP-------SH 335 + V G KD I P + V + S G+++ P S Sbjct: 356 FAVGGITKDGDTFA--IGSGPAQLNAYSTDFQVPKFAGSVCGDKDQTGAIPYFFGKRISL 413 Query: 336 VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG--CYDPTKALTTAVTDFSASTI 393 + +RL+ +V++S G +++F DP +A D + Sbjct: 414 LAMFQDRLVIVSD----GTVFMSRTGDYFNFFRKTMLSVHDDDPIQAYALGAADDVITR- 468 Query: 394 HWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG-SGVYACPPVSVGDCLVF 448 + + + + + + + ++ + +I V+ C PV G+ + + Sbjct: 469 --CVTYNKNLFLFGLRNQYTIPGNVAASPANITISPVAAERDAILCQPVVHGNIVFY 523 >gi|302339301|ref|YP_003804507.1| hypothetical protein Spirs_2810 [Spirochaeta smaragdinae DSM 11293] gi|301636486|gb|ADK81913.1| hypothetical protein Spirs_2810 [Spirochaeta smaragdinae DSM 11293] Length = 570 Score = 45.6 bits (106), Expect = 0.024, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 76/210 (36%), Gaps = 31/210 (14%) Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380 W ++ QE T + R++ +VY+S + DF D Sbjct: 153 WALAEKTSQE----ISTIYQARMIAVNRTW--GTVYMSVAYIYLDF---------DSDGH 197 Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS----LSKGLSIDFRRVSGSGVYA 436 L + W+ FG V +G D S W+L+ + +++SG G Sbjct: 198 LELIPDFYGFEHPRWIVAFGGDVYIGTDKSEWMLTSGYPYFTDDLGGLMMQKISGIGADL 257 Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSI 496 V G ++ + R ++ I S+ F+ + +L + N I+Q+ E S Sbjct: 258 --AVVFGSSII-LAKDRRLVR-IVYSSAGEFQSQSMAEL---IDNTDIIQIDV-IEYGSH 309 Query: 497 VWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526 ++V +D L C + G AW Sbjct: 310 RYLVFIDRDRRLWCLTEC----QNTGVAAW 335 Score = 44.5 bits (103), Expect = 0.051, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 41/141 (29%), Gaps = 20/141 (14%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYR-- 58 M F+ G +SPR + R D + V++ + L G + R Sbjct: 1 MSRQRILVTDFTRGIVSPR-MVPRIDQTK---AVSELTGFVVLPDGGIRRREGTIYARRG 56 Query: 59 ------DCRLDPRSNRVFSFSIPDGGYALLVFGD--KKLQIVVVRSSTKWSPALFGKTYK 110 DC P L D ++L + + + T S A Sbjct: 57 LGVLPTDCEAVPAFTTFDKRITGTETLHLAWINDAPRQLNVQNMTNRTIQSVASESLEAG 116 Query: 111 TPYTFK-----DNKSLEYAVF 126 P D +SL YA Sbjct: 117 KPLLDSGKFNNDLESL-YAQN 136 >gi|326434186|gb|EGD79756.1| hypothetical protein PTSG_10740 [Salpingoeca sp. ATCC 50818] Length = 1352 Score = 44.5 bits (103), Expect = 0.050, Method: Composition-based stats. Identities = 45/284 (15%), Positives = 91/284 (32%), Gaps = 40/284 (14%) Query: 216 EWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITW-------ITVL 268 W+ + + A V+R TG++ + + + +T + + Sbjct: 111 TWSHDGSRLFSADDEGQLCVWRISKTGKASLTYEHKADTGFTHCVAVTAGSEDTSMVFLG 170 Query: 269 NLSSKTSRESASGAVAP-YYVWGDIKDVSKDGRSISVAPQSQT-------LFQAGVSVVS 320 A+GA P + V I D+ D S S+ +Q + G Sbjct: 171 TFDKGVMLADATGACTPSFPVTDKIVDLVFDPESASLVVATQDMMVVHHHVAPDGAVKDK 230 Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380 + G+ S + F +L + + +++ V+ S G Y S+ E G + P Sbjct: 231 YEFKMSGKAF---SSLGFAGPGILIAATSENQIRVWCSEEGDSYSLSIAHERGEFTP-SD 286 Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPV 440 + V A+ I V + W KG + R PP+ Sbjct: 287 IIQTVAYNPANRIIAGTSRNGMVFM------WKF-----KGEAFTDRDSWSF----LPPI 331 Query: 441 SVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRI 484 + LV + + +++ ++ L +H+ + Sbjct: 332 ELRGSLVGAQWGPDGLLLVRNASDT------VSILREHIMRKHF 369 >gi|304404646|ref|ZP_07386307.1| Kelch repeat-containing protein [Paenibacillus curdlanolyticus YK9] gi|304346453|gb|EFM12286.1| Kelch repeat-containing protein [Paenibacillus curdlanolyticus YK9] Length = 697 Score = 43.3 bits (100), Expect = 0.099, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 72/244 (29%), Gaps = 26/244 (10%) Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301 G+ G + + Y + ++W V + R S++G +W D + Sbjct: 436 GKMWAYAGQYQNSVYSSSDGVSWTCVTREAPWAGRRSSAGVSFMGAIWLFGGDTVNGDAN 495 Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRL-LFSGSKGDELS---VYL 357 ++ W G + G NR+ +F G + V+ Sbjct: 496 DVWVSPDGVNWKCATPNAPW-----GPRNGL--CAVVFQNRMWVFGGRDHQGNTYNDVWA 548 Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV--GCDTSLWLLS 415 S GA + L + P A V I V + W Sbjct: 549 SDNGAH--WELITPQAGWSPRDAAAAVVYQNQIYMIGGSRSGSSLQEVWSTDNGRDWKPL 606 Query: 416 ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS--TEQGFRFNEIT 473 + + F V +G L + G +++ +S T+ G R+ +T Sbjct: 607 ANGNVPWLSRFDS---------KAVVLGTNLYLIGGTNSQVRGLSDMWVTQDGSRWEAVT 657 Query: 474 QLAD 477 Q A Sbjct: 658 QQAP 661 >gi|88705445|ref|ZP_01103156.1| ATP-dependent DNA helicase RecG [Congregibacter litoralis KT71] gi|88700535|gb|EAQ97643.1| ATP-dependent DNA helicase RecG [Congregibacter litoralis KT71] Length = 686 Score = 43.3 bits (100), Expect = 0.12, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 49/168 (29%), Gaps = 18/168 (10%) Query: 40 LIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTK 99 + G + + + F L GD+ L+ Sbjct: 67 ALVSVSGGGRRRS---LIVKLQDGTGTATLRFFHFSQAQKNALQQGDR-LRCFGTVRRGA 122 Query: 100 WSPALFGKTYK------------TP-YTFKD-NKSLEYAVFGSTAVFVHKDHPPHHLLYI 145 + Y+ TP Y + ++ A+ V HPP LL + Sbjct: 123 QQAEMIHPEYRRSIHISDNEESLTPIYPSTEGVSQGQWRKLSDQALSVLAKHPPEELLPV 182 Query: 146 QDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT 193 ++ D + PPP + + A+L ++ + + +++ Sbjct: 183 RENDYGLSSALRFLHRPPPEADQQALRDGRHPAQLRLALEELTAHQLS 230 >gi|253583142|ref|ZP_04860350.1| predicted protein [Fusobacterium varium ATCC 27725] gi|251835034|gb|EES63587.1| predicted protein [Fusobacterium varium ATCC 27725] Length = 654 Score = 42.9 bits (99), Expect = 0.13, Method: Composition-based stats. Identities = 58/456 (12%), Positives = 129/456 (28%), Gaps = 52/456 (11%) Query: 1 MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60 + +T + ++++ GE +L ++ D ++ + N++P G L + Sbjct: 10 ISSTNFLQNNWQMGEAGNKLAVNK-DSEMYMTTANRIVNMLPTELGGLEVLKEHTPRSIS 68 Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120 L ++ +I + + + R T F + Y F +N Sbjct: 69 GLPAGYDKPIIRAINTPFNF-------YICMCMDRIFTMNKSNQFL----SGYVFAEN-- 115 Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS--NA 178 + G + F + F + S N Sbjct: 116 -GFTRAGKLVLI--------------------DKFVLVTFPNGNRYDLEISSSGNIGLND 154 Query: 179 KLSISQADTSTARITSDMKIFKPLDK--GRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236 S S + + + I++ G + ++ + + I I D K+ Sbjct: 155 NFSASITNPLLHKSKVQVDIYQTRKIMIGTTEKIRPYKIRTTDLQEFLISGNIGQDGKLL 214 Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI---- 292 + R Y + + NI+ +T +G Y + Sbjct: 215 FKYNKEFNITRIYYPYQSDMINMENISGLTENEWFVIIHDVDTTGGGRFYMGNSPVDFTN 274 Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352 G + A S+++ + + W + G+ V NR++ S Sbjct: 275 PKTDVTGTYYTTAKVSRSVGNTSLLSYGIMIDLWNNKVGF-HTVAEFQNRMVVSNGT--- 330 Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412 ++ S G + F D + + + +V Sbjct: 331 -YIFFSKVGDYNYF---LNGELDDDAFFIKLGYVNGEQPIVKNFITGRGLWVVTNKGIFL 386 Query: 413 LLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVF 448 + ++ KG S+D R + V + + L + Sbjct: 387 ICYNNIVKGSSLDIRMIVADECGN-EAVDINNTLYY 421 >gi|196038454|ref|ZP_03105763.1| hypothetical protein BC059799_3729 [Bacillus cereus NVH0597-99] gi|228934985|ref|ZP_04097816.1| hypothetical protein bthur0009_34390 [Bacillus thuringiensis serovar andalousiensis BGSC 4AW1] gi|196030862|gb|EDX69460.1| hypothetical protein BC059799_3729 [Bacillus cereus NVH0597-99] gi|228824885|gb|EEM70686.1| hypothetical protein bthur0009_34390 [Bacillus thuringiensis serovar andalousiensis BGSC 4AW1] Length = 830 Score = 42.9 bits (99), Expect = 0.15, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 33/95 (34%), Gaps = 20/95 (21%) Query: 191 RITSDMKIFKPLDKGRSIR-----------------LGCHPPEWAKNTNYSIGAYIVADD 233 + S F +G + + W Y I +YI A+ Sbjct: 724 SLVSSEPTFGTYSRGELLYNDTPTVGGYIGWVCITAGTANGDFWIAEKEYQINSYINANG 783 Query: 234 KVYRSLTTGRSG-DRFGYSKGATYVKDNNITWITV 267 VY+S+ G SG ++ G + KD NI W V Sbjct: 784 NVYKSVGRGTSGKTAPSHTNGTS--KDGNIVWEYV 816 >gi|66047262|ref|YP_237103.1| insecticidal toxin protein, putative [Pseudomonas syringae pv. syringae B728a] gi|63257969|gb|AAY39065.1| insecticidal toxin protein, putative [Pseudomonas syringae pv. syringae B728a] Length = 1617 Score = 42.5 bits (98), Expect = 0.18, Method: Composition-based stats. Identities = 32/266 (12%), Positives = 75/266 (28%), Gaps = 5/266 (1%) Query: 167 GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226 G + ++++ + DT+ + + + F+ + I +A+ Y IG Sbjct: 133 KSGYFTQLENDINQNRINVDTAQEAVKAYLASFEEVANLTIINGYIDSDRFAQGKYYFIG 192 Query: 227 AYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 + +R++ + G ++G + W + + + P Sbjct: 193 TSRAENIYYWRTVDMNERAYQEG-TEGPKFDNPTPGAWSDWKRAEIGINANTLERTIRPV 251 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346 Y + D ++ + + P V N RL+F+ Sbjct: 252 YFNNRLFVAWVDLVHVTEQVAVTLPEGTVKPAADGSIPITPPADIAPLTVVTPNVRLVFN 311 Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406 S + S+ + D + KA+ D ++ I + E + + Sbjct: 312 ISYKKYDDSW-SAPHIYMD--VTTPNVVTRAGKAVNLE-NDLNSIAIFDVSASPESLFIA 367 Query: 407 CDTSLWLLSISLSKGLSIDFRRVSGS 432 L S + Sbjct: 368 MYAGETLAPGDTDGSTSTYAFLHTAF 393 >gi|330973553|gb|EGH73619.1| insecticidal toxin protein, putative [Pseudomonas syringae pv. aceris str. M302273PT] Length = 1189 Score = 42.5 bits (98), Expect = 0.21, Method: Composition-based stats. Identities = 32/266 (12%), Positives = 75/266 (28%), Gaps = 5/266 (1%) Query: 167 GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226 G + ++++ + DT+ + + + F+ + I +A+ Y IG Sbjct: 133 KSGYFTQLENDINQNRINVDTAQEAVKAYLASFEEVANLTIINGYIDSDRFAQGKYYFIG 192 Query: 227 AYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286 + +R++ + G ++G + W + + + P Sbjct: 193 TSRAENIYYWRTVDMNERAYQEG-TEGPKFDNPTPGAWSDWKRAEIGINANTLERTIRPV 251 Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346 Y + D ++ + + P V N RL+F+ Sbjct: 252 YFNNRLFVAWVDLVHVTEQVAVTLPEGTVKPAADGSIPITPPADIAPLTVVTPNVRLVFN 311 Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406 S + S+ + D + KA+ D ++ I + E + + Sbjct: 312 ISYKKYDDSW-SAPHIYMD--VTTPNVVTRAGKAVNLE-NDLNSIAIFDVSASPESLFIA 367 Query: 407 CDTSLWLLSISLSKGLSIDFRRVSGS 432 L S + Sbjct: 368 MYAGETLAPGDTDGSTSTYAFLHTAF 393 >gi|302308918|ref|NP_986066.2| AFR519Cp [Ashbya gossypii ATCC 10895] gi|299790857|gb|AAS53890.2| AFR519Cp [Ashbya gossypii ATCC 10895] Length = 821 Score = 41.0 bits (94), Expect = 0.59, Method: Composition-based stats. Identities = 36/306 (11%), Positives = 87/306 (28%), Gaps = 23/306 (7%) Query: 15 ELSPRLLQSRKDLSLHAQGVAKSR-NLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFS 73 ELS ++ ++ +L A+ + + R + I LR + P + R Sbjct: 3 ELSEQVERTLGNLEKKAEFLEEQRGHFIALRQRLVEYDP-EKYAAHAGDGGSGVR----- 56 Query: 74 IPDGGYALLVFGDKKL--QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAV 131 LVFG+ L ++ + + + + + LE A Sbjct: 57 -------GLVFGEVILSTRVYLSLGCEYYVEKQPAEAVA--WVEGRLRLLEDAQDQFRVQ 107 Query: 132 FVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191 H L + + + P + + + ++ + Sbjct: 108 IAHAKSTLRELAALDGAGGADWAAESSGEDGLPLMEIR--EELDEDGNVTSGAVRRAGGP 165 Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251 ++ + + + E + N + G+ A V R+ + D S Sbjct: 166 EAGAKRVDAAEEGLPLMEIREELDE---DGNVTGGSVRRAGGNVQRAGRAASASDAGHKS 222 Query: 252 KGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTL 311 + + + + + +G + +Y + ++ S+ V + Sbjct: 223 QDTGAARPDQAAEERLEQDLAPQQPAEDAGGLDEFYEVLEEMGITAPRESVDVGTPVEAA 282 Query: 312 FQAGVS 317 VS Sbjct: 283 ESGPVS 288 >gi|310005690|gb|ADP00077.1| tail tube protein B [Cyanophage NATL1A-7] Length = 1056 Score = 40.6 bits (93), Expect = 0.71, Method: Composition-based stats. Identities = 38/308 (12%), Positives = 91/308 (29%), Gaps = 34/308 (11%) Query: 165 WLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS 224 + +G+ + + S +T +T+ D+ + Sbjct: 427 FSAEGIAEDIDQTGTYARSS---NTITVTAASHGLSNGDQIILDITSGGATDGFYTIANV 483 Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVA 284 D +++ G + T + W V+ ++ + +A Sbjct: 484 TTNTFTVTDSASGTISAGETCSF-------TPARFGEGVWEEVVQPGKDIEIDNTTMPIA 536 Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE--QEGYPSHV------ 336 V ++ G Q+ + S W+ G+ PS + Sbjct: 537 LTRVLPGSFSINGGGS------QTYSNGAFRFSYPDWYKRDCGDDITNPEPSFIGQTIQK 590 Query: 337 -TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395 F NR+ +V LS FY+F + + + + ++ Sbjct: 591 MVFFRNRIALL----SAENVILSRVNDFYNFWNKTAMAISNADP-IDLQSSSTYPTKLYD 645 Query: 396 MHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYA----CPPVSVGDCLVFVCG 451 G+++ + +LLS L+ + ++S +A P+ +G + F+ Sbjct: 646 AVEQAGGLVIFSASEQFLLSSGAEALLTPETAKISYVSSHAFNPDTSPIELGTTIGFLNS 705 Query: 452 VGRRIKYI 459 + ++ Sbjct: 706 TAKNTRFF 713 >gi|312621233|ref|YP_004022846.1| glycoside hydrolase family 16 [Caldicellulosiruptor kronotskyensis 2002] gi|312201700|gb|ADQ45027.1| glycoside hydrolase family 16 [Caldicellulosiruptor kronotskyensis 2002] Length = 2435 Score = 40.2 bits (92), Expect = 0.93, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 81/282 (28%), Gaps = 24/282 (8%) Query: 72 FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAV 131 + + G + + V V S W+P + + T + + + G+ Sbjct: 1144 YGVSGGRSYSYIIAVSNSKFVRVDPSNAWNPLTASASDAS--TDAELFEIVFKADGNVGF 1201 Query: 132 --------FVHKD---HPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISG-----VK 175 V D P + LL ++ +P + + V Sbjct: 1202 ASKALNNNLVCADSWSTPDYKLLPRSSYSADPGGWETFTLVPQGDGTIAIKANNGGRFVT 1261 Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGR-SIRLGCHPPEWAKNTNYSIGAYIVADDK 234 I +A ++T + I P G+ S+ + + +VA Sbjct: 1262 VEPTTGILKATSATVGVNEKFIIVTPYAPGQPSVTIDEVLDNSVTFHWSVPSSSVVAGYN 1321 Query: 235 VYRSLTTG--RSGDRFGYSKGATYVKD--NNITWITVLNLSSKTSRESASGAVAPYYVWG 290 VYR+ T+G +Y T + + E+ S V + G Sbjct: 1322 VYRATTSGGPYIKLNKALLTTTSYTDTSMTANTTYYYIVAAVNARGETKSPEVMVKTLSG 1381 Query: 291 DIKDVSKDGRSISVAPQSQTL-FQAGVSVVSWFMSAWGEQEG 331 I + S S TL + A S+ + + G Sbjct: 1382 PIPAIPTGLDITSCTQNSITLNWNAAAGAQSYNIYRSTSRFG 1423 >gi|313122738|ref|YP_004044665.1| chitin-binding protein [Halogeometricum borinquense DSM 11551] gi|312296220|gb|ADQ69309.1| uncharacterized protein contain chitin-binding domain type 3 [Halogeometricum borinquense DSM 11551] Length = 562 Score = 39.5 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 42/146 (28%), Gaps = 19/146 (13%) Query: 193 TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSK 252 T+ ++ PP+W +T Y+ G +V + ++ + + + Sbjct: 11 TASALFTTIAGASATVAGAESPPKWDPDTTYTSGDRVVYEGYIWEA-------KWWTH-- 61 Query: 253 GATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLF 312 G K + W + G + I+ ++ + Sbjct: 62 GTEPQKKSGNPWKQIRE----------DGGGGGTELTAVIETNTETVTVGETVTLDASKS 111 Query: 313 QAGVSVVSWFMSAWGEQEGYPSHVTF 338 ++ W + G + V+F Sbjct: 112 TGDITSYEWTVGDRDPVTGVETTVSF 137 >gi|313674279|ref|YP_004052275.1| glycoside hydrolase family 16 [Marivirga tractuosa DSM 4126] gi|312940977|gb|ADR20167.1| glycoside hydrolase family 16 [Marivirga tractuosa DSM 4126] Length = 364 Score = 39.5 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 34/264 (12%), Positives = 78/264 (29%), Gaps = 20/264 (7%) Query: 140 HHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199 + + DK+ E+ + + + A++ IS + T I + I Sbjct: 58 YTFSFGDGSDKLRDDDGEVTYSYAESGDYTIEVNAHTTAEVFISSSQEVTITIQQNSDID 117 Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259 G + W + D G + +G ++ Y ++ Sbjct: 118 DEGYVSPMEYEGYNL-VWQDEFE---ADQLSDDYTF----EIGTGSNGWGNNESQYYREE 169 Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319 N L + +K + + IK+ +++ + G+ Sbjct: 170 NTRLEEGYLVIQAKKENFQGQEYTSSRIITEGIKEFKYG----RFDIRARMPYGQGIWPA 225 Query: 320 SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV-----YLSSFGAFYDFSLDGEYGC 374 W + + Q G+P + + G +G E +V + S+ G + +F Sbjct: 226 IWMLGSNFRQVGWPHCGEI--DIMEMIGGQGREATVHGTVHWQSNEG-YANFGHSKNLSD 282 Query: 375 YDPTKALTTAVTDFSASTIHWMHP 398 + ++I W+ Sbjct: 283 GTLADKFHVFSIIWDENSIQWLID 306 >gi|197935887|ref|YP_002213723.1| tail tuber protein B [Ralstonia phage RSB1] gi|197927050|dbj|BAG70392.1| tail tuber protein B [Ralstonia phage RSB1] Length = 861 Score = 39.5 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 65/207 (31%), Gaps = 19/207 (9%) Query: 254 ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ 313 TY T + E A+ V P V+ S + Q T Sbjct: 329 ETYYMRAVKTDTAAAHFGPVQWVEGAAQVVTPGQVFAIASITSTTLTLANSPAQLATAIG 388 Query: 314 AGVSVVSWFM-SAWGEQEGYP-------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYD 365 + V + + ++ P SH+ +R++ + + +S G +++ Sbjct: 389 SPVPGYAASVCGDMTDKGAVPYFFGRKVSHMAMFQDRMVIVSN----GVILMSRTGDYFN 444 Query: 366 -FSLDG-EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI-SLSKGL 422 F DP +A D S + + + + + + L S Sbjct: 445 WFRKSKLRVDDDDPVEAFALGSEDDIISQ---SSSYNKDLFLFGERGQYALPGRSAITPK 501 Query: 423 SIDFRRVSG-SGVYACPPVSVGDCLVF 448 +I +V+G P+ VG+ L + Sbjct: 502 TISITQVAGERDAMLARPIPVGNLLFY 528 >gi|319776426|ref|YP_004138914.1| tail fiber protein [Haemophilus influenzae F3047] gi|317451017|emb|CBY87248.1| probable tail fiber protein [Haemophilus influenzae F3047] Length = 747 Score = 39.1 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 69/198 (34%), Gaps = 19/198 (9%) Query: 198 IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYV 257 +FK LD+ + + PEW+ +Y+ G+ + D YR+L ++ + S +V Sbjct: 50 LFKRLDEKHTYLMQRGLPEWSATQDYTKGSCVQFDGVSYRALKNSKN-NSPNESDSQYWV 108 Query: 258 KDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317 + W L+ ++ + + D + + +++ + + + Sbjct: 109 R-----WGFALSEIARATLQQYGIVQLSSATNSDSETKAATSKAVK-TAYDKAVEAKTTA 162 Query: 318 VVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS-KGDELSVYLSSFGAFYDFSLDGEYGCYD 376 ++ G S NR++ + + + +Y S G + + G D Sbjct: 163 DGKVGLNGNESINGEKS----FENRIVAKRNIRISDNPIYASR-GDYLN------IGAND 211 Query: 377 PTKALTTAVTDFSASTIH 394 ++ T+ Sbjct: 212 GDCWFEYKSSNREIGTLR 229 >gi|254522602|ref|ZP_05134657.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] gi|219720193|gb|EED38718.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] Length = 1475 Score = 39.1 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 37/107 (34%), Gaps = 10/107 (9%) Query: 214 PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273 EWA T Y G ++ D +YR+L + G W + + +S Sbjct: 849 ADEWAAGTTYPAGDFVRHDGTLYRAL-----AENVDVEPGTAP-----AVWEAIGDYTSV 898 Query: 274 TSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 +A+ +++ +V++ ++ P + V S Sbjct: 899 GDALAAAISMSTKNASDIAAEVTRVDAVVAKLPADGGQAASTGQVSS 945 >gi|325089518|gb|EGC42828.1| conserved hypothetical protein [Ajellomyces capsulatus H88] Length = 1104 Score = 39.1 bits (89), Expect = 2.2, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 22/56 (39%) Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGAT 255 P+ + PPEW++NT Y +G + D VY + + + K Sbjct: 180 GPVTESTETTAAAGPPEWSENTAYKVGDQVSYDGHVYVCIQAHTTVIGWEPPKTPA 235 >gi|240279230|gb|EER42735.1| conserved hypothetical protein [Ajellomyces capsulatus H143] Length = 1104 Score = 39.1 bits (89), Expect = 2.2, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 22/56 (39%) Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGAT 255 P+ + PPEW++NT Y +G + D VY + + + K Sbjct: 180 GPVTESTETTAAAGPPEWSENTAYKVGDQVSYDGHVYVCIQAHTTVIGWEPPKTPA 235 >gi|225562313|gb|EEH10592.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR] Length = 1104 Score = 39.1 bits (89), Expect = 2.3, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 22/56 (39%) Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGAT 255 P+ + PPEW++NT Y +G + D VY + + + K Sbjct: 180 GPVTESTETTAAAGPPEWSENTAYKVGDQVSYDGHVYVCIQAHTTVIGWEPPKTPA 235 >gi|332828789|gb|EGK01481.1| hypothetical protein HMPREF9455_02314 [Dysgonomonas gadei ATCC BAA-286] Length = 623 Score = 38.7 bits (88), Expect = 2.8, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 59/207 (28%), Gaps = 34/207 (16%) Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287 YI A D +R++ + + + Y++ + + + + A Sbjct: 394 YIGASDNTFRAIDIKTGKLVWEFPEVKGYIETRPLIYKDKIFFGAWDETMYALDKHTGRL 453 Query: 288 VWGDIKD--------------VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYP 333 +W ++ + D + + T A W MS W +E Sbjct: 454 LWKWVEGRKGILYSPAAVWPVAAHDRVFFTAPDRVMTAVDANTGETIWRMSDWKVRE--- 510 Query: 334 SHVTFHNN--RL----------LFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD----P 377 + + RL +S + ++ ++ G YD + Sbjct: 511 -TIGLSEDKERLYSKTMQDSVVCYSATSAKPQQIWAANVGYGYDHAPSMPVEKDSVVFGS 569 Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVL 404 TK + + W H G ++ Sbjct: 570 TKNGIIFAIEGKTGKLLWKHKVGNSII 596 >gi|329123905|ref|ZP_08252457.1| phage tail fiber protein [Haemophilus aegyptius ATCC 11116] gi|327468100|gb|EGF13587.1| phage tail fiber protein [Haemophilus aegyptius ATCC 11116] Length = 240 Score = 38.7 bits (88), Expect = 3.1, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 69/198 (34%), Gaps = 19/198 (9%) Query: 198 IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYV 257 +FK LD+ + + PEW+ +Y+ G+ + D YR+L ++ + S +V Sbjct: 50 LFKRLDEKHTYLMQRGLPEWSATQDYTKGSCVQFDGVSYRALKNSKN-NSPNESDSQYWV 108 Query: 258 KDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317 + W L+ ++ + + D + + +++ + + + Sbjct: 109 R-----WGFALSEIARATLQQYGIVQLSSATNSDSETKAATSKAVK-TAYDKAVEAKTTA 162 Query: 318 VVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS-KGDELSVYLSSFGAFYDFSLDGEYGCYD 376 ++ G S NR++ + + + +Y S G + + G D Sbjct: 163 DGKVGLNGNESINGEKS----FENRIVAKRNIRISDNPIYASR-GDYLN------IGAND 211 Query: 377 PTKALTTAVTDFSASTIH 394 ++ T+ Sbjct: 212 GDCWFEYKSSNREIGTLR 229 >gi|12056574|gb|AAG47946.1|AF222787_1 cycloinulo-oligosaccharide fructanotransferase [Paenibacillus macerans] Length = 1333 Score = 38.3 bits (87), Expect = 3.3, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 78/256 (30%), Gaps = 27/256 (10%) Query: 86 DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYI 145 D K+++ + TP T + + K++ P L + Sbjct: 552 DGKIKLYLNGEEVASQATPVNVPI-TPSTES--------------LIIGKNNKPVELAGV 596 Query: 146 QDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA-DTSTARITSDMKIFKPLDK 204 + S DE+K +++G +S L A I D +F D+ Sbjct: 597 FSFNMFSGLLDEVKLHNKALTNQEILAGYESVKALHGGSIPKIPNADIDEDPSVFD-GDQ 655 Query: 205 GRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITW 264 R P W + I + K + G + +V D+ + W Sbjct: 656 HRPQYHAMPPQNWMNEAHAPI----YYNGKYHLFYQHNPQGPFWHQIHWGHWVSDDMVNW 711 Query: 265 ITV---LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321 V L + T + + + Y + + S++P +T + Sbjct: 712 ENVRPALAPEAGTLDPDGTWSGSAAYDRNGNPVLFYTAGNDSLSPNQRTGLATPADLSDP 771 Query: 322 FMSAWGEQEGYPSHVT 337 ++ W E YP VT Sbjct: 772 YLEKW---EKYPKPVT 784 >gi|254521915|ref|ZP_05133970.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] gi|219719506|gb|EED38031.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] Length = 1553 Score = 38.3 bits (87), Expect = 3.3, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 36/107 (33%), Gaps = 10/107 (9%) Query: 214 PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273 EW T Y G ++ D +YR+L + G W + + +S Sbjct: 950 ADEWVAGTTYPAGDFVRHDGTLYRAL-----AENVDVEPGTDP-----AVWEAIGDYTSV 999 Query: 274 TSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320 +A+ +++ +V++ ++ P + V S Sbjct: 1000 GDALAAAISMSTKNASDIAAEVTRVDAVVAKLPADGGQAASTGQVAS 1046 >gi|309271529|ref|XP_003085345.1| PREDICTED: hypothetical protein LOC100503043 [Mus musculus] Length = 2318 Score = 38.3 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 46/335 (13%), Positives = 88/335 (26%), Gaps = 44/335 (13%) Query: 107 KTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL 166 + P+ + + ++ D ++T E + W Sbjct: 1064 QAIAGPWAVSQVTDGSW-----------PAVQASGVSWVVDQATGTWTVAENQTGAVSWA 1112 Query: 167 GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWA-----KNT 221 G G I + + + T+ T K R W + Sbjct: 1113 GAGNIVSIGY---WTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQTSG 1169 Query: 222 NYSIGAYIVADDKVY-RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 +G+ + + + ++ + R G A +S ++ +S+S Sbjct: 1170 ETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSS 1229 Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN 340 G+ W ++++ + QS + G + +W G PS + Sbjct: 1230 GS------WAGTRNLAGERSWTGTGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSV 1283 Query: 341 NRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400 + +G + + LS G F LD G P A Sbjct: 1284 SEAQSAGRSWADSADQLS--GGFLVGPLDQANGESQPVSGELAASGVDQ----------- 1330 Query: 401 EGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVY 435 W S S G S R +G Sbjct: 1331 -----TSGGGCWTGSGDQSGGESRLGPRDQSNGES 1360 >gi|159037262|ref|YP_001536515.1| chitin-binding domain-containing protein [Salinispora arenicola CNS-205] gi|157916097|gb|ABV97524.1| chitin-binding domain 3 protein [Salinispora arenicola CNS-205] Length = 338 Score = 38.3 bits (87), Expect = 3.9, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 41/164 (25%), Gaps = 8/164 (4%) Query: 161 LPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN 220 P P + A + + + T TA TS + P W Sbjct: 181 SPTPTASPTSTASPTPTASPTSTASPTPTASPTSTASPTPTGTPSPTSTGTPAPESWQVG 240 Query: 221 TNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280 T Y IG + D YR+ + + + W V + Sbjct: 241 TTYQIGDEVTYDGVSYRARQAHTATPGWEPPRVPA-------LWTAVTPPPATGDPAPGD 293 Query: 281 G-AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFM 323 G AV Y GD A + ++ W Sbjct: 294 GWAVGIAYQIGDEVTYDGVSYLARQAHTATPGWEPPHVPSLWIR 337 >gi|256376322|ref|YP_003099982.1| glycoside hydrolase family 6 [Actinosynnema mirum DSM 43827] gi|255920625|gb|ACU36136.1| glycoside hydrolase family 6 [Actinosynnema mirum DSM 43827] Length = 605 Score = 37.9 bits (86), Expect = 4.8, Method: Composition-based stats. Identities = 42/327 (12%), Positives = 72/327 (22%), Gaps = 23/327 (7%) Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFK 200 G + F + N + Sbjct: 243 KQARTAPGGNLVFQVVIYNLPGRDCAALASNGELGPNDLPRYKTEYIDKIAGILARPAYA 302 Query: 201 PLDKGRSIRLGCHPPEWAK----NTNYSIGAYIVADDKV-----YRSLTTGRSGDRFGYS 251 L I + P T + A+ Y G G+ + Y Sbjct: 303 SLRIVAVIEIDSLPNLVTNVSPRPTQTPNCDTMKANQNYQNGVAYAVSKLGDIGNVYNYL 362 Query: 252 KGAT--YVKDNNITWITVLNLSSKTSRESASG--AVAPYYVWGDIKDVSKDGRSISVAPQ 307 ++ + +S S G V G I + + Sbjct: 363 DSGHHGWIGWGDPIPEYDNFHASAKMMASILGREGATKADVHGFITNTANYSALEEPFWT 422 Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSG-SKGDELSVYLSSFGAFYDF 366 + W + G T L+ +G G + + S G Sbjct: 423 VDDVVGGQAVKEKSKWVDWNDFNGELGFATAFRQELVANGFDAGVGMLIDTSRNGWGGSG 482 Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGV--------LVGCDTSLWLLSISL 418 + DP+ + + D +W + G G+ D +W+ Sbjct: 483 RPTAKSSSTDPSVYVDQSRIDKRIQKGNWCNQSGAGLGERPKAAPKPNIDAYVWIKPPGE 542 Query: 419 SKGLSIDFRRVSGSGVYA-CPPVSVGD 444 S G S G G C P G+ Sbjct: 543 SDGSSTQIPNNEGKGFDRMCDPTYGGN 569 >gi|281211601|gb|EFA85763.1| hypothetical protein PPL_00993 [Polysphondylium pallidum PN500] Length = 310 Score = 36.8 bits (83), Expect = 9.6, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 53/232 (22%), Gaps = 39/232 (16%) Query: 109 YKTP--YTFKDNKSLEYAVFGSTAVFVHKDHP-PHHLLYIQDGDKISFTFDE-------- 157 TP Y D+ + Y G T +H + H L Y + D I++T D Sbjct: 9 ITTPSYYGITDSPA--YIQLGGTVYCIHHGYENNHELWYTKSNDLITWTADAQFVDVQTT 66 Query: 158 ------------IKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205 F G ++ VK Sbjct: 67 FSPAAIVFNSIIYGFHNGSPNSSGDLNYVKVTGNSVTQDNPIHGLPEWKSSNSPSATVFN 126 Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265 + L H P + K+ + + + Y + + + Sbjct: 127 NLMYLAYHGPN--------------NNGKLLLASSPDGVASNWSYKEVPGITITGSPSMA 172 Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317 T R +A G D + +V A S Sbjct: 173 TFNGKIYIVFRNTALGNGVYVTSTSDTNTWTTPTLIPNVQVSGDPKLTATAS 224 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.118 0.326 Lambda K H 0.267 0.0367 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 9,905,769,435 Number of Sequences: 14124377 Number of extensions: 387266902 Number of successful extensions: 853322 Number of sequences better than 10.0: 245 Number of HSP's better than 10.0 without gapping: 123 Number of HSP's successfully gapped in prelim test: 208 Number of HSP's that attempted gapping in prelim test: 852110 Number of HSP's gapped (non-prelim): 503 length of query: 578 length of database: 4,842,793,630 effective HSP length: 145 effective length of query: 433 effective length of database: 2,794,758,965 effective search space: 1210130631845 effective search space used: 1210130631845 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 84 (37.1 bits)