BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254781208|ref|YP_003065621.1| hypothetical protein
CLIBASIA_05575 [Candidatus Liberibacter asiaticus str. psy62]
         (578 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done


Results from round 1


>gi|254781208|ref|YP_003065621.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040885|gb|ACT57681.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|317120673|gb|ADV02496.1| hypothetical protein SC1_gp080 [Liberibacter phage SC1]
 gi|317120817|gb|ADV02638.1| hypothetical protein SC1_gp080 [Candidatus Liberibacter asiaticus]
          Length = 578

 Score = 1194 bits (3088), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/578 (100%), Positives = 578/578 (100%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC
Sbjct: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS
Sbjct: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL
Sbjct: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
           SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT
Sbjct: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
           TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR
Sbjct: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300

Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
           SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF
Sbjct: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420
           GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK
Sbjct: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420

Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480
           GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF
Sbjct: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480

Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
           NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA
Sbjct: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540

Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578
           ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK
Sbjct: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578


>gi|315122895|ref|YP_004063384.1| hypothetical protein CKC_05755 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313496297|gb|ADR52896.1| hypothetical protein CKC_05755 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 588

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 222/567 (39%), Positives = 337/567 (59%), Gaps = 23/567 (4%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M    +TK SF+ GE+SP+++QSR DL LH+QG+++  N+IPL+ G LV  P +  Y   
Sbjct: 1   MPKGAYTKRSFAGGEVSPQIMQSRSDLELHSQGLSQCFNMIPLQDGSLVRRPPLYRYEHI 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            L P+++R+ SF++      L +FG+KK+  V V   T   P  F + Y TPY+F++ + 
Sbjct: 61  DLPPKASRILSFALGGDDAVLFIFGEKKMVYVEV---TGIKPPQFIRFYDTPYSFREAEQ 117

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L+ A  G+  V VH  H P+ + + + G      F+++ F PPPWLG   + G K +AKL
Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGLREVGGKKHDAKL 173

Query: 181 SISQADTSTARIT--SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238
            ++ + T   +IT  S + IFK  D GR +RLG  P +W  NT Y   A++    KVYR 
Sbjct: 174 RVTLSATRKGKITVTSTLPIFKTKDVGRMLRLGWLPKDWTANTLYPENAFMQMYGKVYRC 233

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291
           +T G SG  F  ++  TY++D  +TW  +       ++   K++  +      PYYVWG+
Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
           I + +   +++ V            S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D
Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +VY S +  F DFS D   G  D  K+L+ A+TD + S I W  P  +G+++G DTSL
Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412

Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471
           W++ +   +G ++  RR++G GVY  PP+S+GD L+FV G GRRI+ I G++EQGF+F E
Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472

Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531
           +TQ  DHL + RI QL YQE+P+S++WV+     N+   LLGC   A  +   +WH H +
Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLGCSLHANSKEKGSWHVHKL 527

Query: 532 SDKHY-VLSAASFPNDNRGGTSLWMLV 557
             +   ++S +S    ++G T++W+L+
Sbjct: 528 GGRGVKIMSLSSCLCLDQGETTVWLLL 554


>gi|315121933|ref|YP_004062422.1| hypothetical protein CKC_00915 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313495335|gb|ADR51934.1| hypothetical protein CKC_00915 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 588

 Score =  419 bits (1076), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 223/568 (39%), Positives = 334/568 (58%), Gaps = 23/568 (4%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M    +TK SF+ GE+SP+++QSR DL LH+QG+++  N+IPL  G LV  P +  Y   
Sbjct: 1   MPKGAYTKRSFAGGEVSPQIIQSRSDLELHSQGLSQCFNMIPLSDGSLVRRPPLHRYEHI 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            L P+++R+ SF++      L +FG+KK+  V V   T   P  F + Y TPY+F++ + 
Sbjct: 61  DLPPKASRILSFALGGDEAVLFIFGEKKMVYVEV---TGIKPPQFIRFYGTPYSFREAEQ 117

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L+ A  G+  V VH  H P+ + + + G      F+++ F PPPWLG   + G K +AKL
Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGRREVGGKKHDAKL 173

Query: 181 SISQADTSTARIT--SDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238
            ++ + T   +IT  S + IFKP D GR + LG  P +W  NT Y   A++    KVYR 
Sbjct: 174 RVTLSATRKGKITVTSTLPIFKPKDVGRMLCLGWLPKDWTANTLYPENAFMQMYGKVYRC 233

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291
           +T G SG  F  ++  TY++D  +TW  +       ++   K++  +      PYYVWG+
Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
           I + +   +++ V            S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D
Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +VY S +  F DFS D   G  D  K+L+ A+TD + S I W  P  +G+++G DTSL
Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412

Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471
           W++ +   +G ++  RR++G GVY  PP+S+GD L+FV G GRRI+ I G++EQGF+F E
Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472

Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531
           +TQ  DHL + RI QL YQE+P+S++WV+     N+   LL C   A  +   +WHTH  
Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLSCSLHANSKEKGSWHTHKS 527

Query: 532 SDKHY-VLSAASFPNDNRGGTSLWMLVA 558
                 ++S +S    ++G T++W LV+
Sbjct: 528 GGGWVKIMSLSSCLCLDQGETTIWFLVS 555


>gi|317120716|gb|ADV02538.1| hypothetical protein SC2_gp080 [Liberibacter phage SC2]
 gi|317120777|gb|ADV02598.1| hypothetical protein SC2_gp080 [Candidatus Liberibacter asiaticus]
          Length = 590

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 167/573 (29%), Positives = 265/573 (46%), Gaps = 51/573 (8%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K+SF++GE+SP + QS  +L ++   +A   N IPLR G L+  P  + Y       +  
Sbjct: 8   KNSFASGEVSPFVHQSGSNLKIYQSCLAHCHNYIPLRTGALMRRPGTRIYHVFDDVDKPQ 67

Query: 68  RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127
           R+FSF        ++V G  KL I   R            T + PY  +D   +E A   
Sbjct: 68  RLFSFVKDAYTAYIIVLGYLKLHIFERRMGGCSKVT----TIEVPYKKEDVDEIEVAQNI 123

Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187
            T   VH  HPP  L  ++  D   + F E+ F   P L +  I   K +  L     +T
Sbjct: 124 DTLWMVHPKHPPCQL-ELKGKD---WEFKEVLFKHVPPLKEQFIDDKKVSINLKTPFENT 179

Query: 188 STAR-----ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242
            T +     + +D ++FK +D GR + LG  P  W  +T Y   +Y+V +D++ + +  G
Sbjct: 180 ETGKTGMVSVEADGEMFKEMDIGRELNLGFRPQRWIPDTWYLDNSYVVHNDRLLKCINKG 239

Query: 243 RS-GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVW--GDIKDVSKDG 299
           +S    + +S      KD +  W  V         ES  G      +W  G IK   K  
Sbjct: 240 KSQSTEWTFSDKEHQQKDGSCLWEKV---------ESTKGNARNLLIWVTGVIKRF-KTA 289

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359
           + + +  +     Q  +    W +  WG++EGYPS +TF  NRL+ SG K +  +V+ S 
Sbjct: 290 KCVLLELKGAFPLQNDLPTKHWLLGEWGQKEGYPSCITFFGNRLVLSGGKHNPQTVHFSK 349

Query: 360 FGAFYDFSLDGEY-GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS- 417
              F DF+   E  G  D T + +  +       I W+     G+LVG +++LWL++ + 
Sbjct: 350 LDDFTDFNQISEQGGNTDLTSSFSVLLGSDVRQGIQWLSHTDSGLLVGTESALWLITQTS 409

Query: 418 ----LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG-----STEQGFR 468
               +SK  ++  R +   G  A  P+ VG   VF+   GR +  + G     +T+  +R
Sbjct: 410 QNEVVSKA-TVAIRSIGNFGSIAVSPILVGSHCVFIKDTGRDLISLVGNRSADNTKTEYR 468

Query: 469 FNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528
           F ++   A+H+  + + + V Q+ P+SI+WVVL        RL+GC F  + E   AWHT
Sbjct: 469 FRDLNLFAEHILTKGVWEAVLQQSPYSIIWVVLRDG-----RLVGCTFDPDNEV-CAWHT 522

Query: 529 HMI----SDKHYVLSAASFPNDNRGGTSLWMLV 557
           H +    +  H + S ASF +   G   LW+LV
Sbjct: 523 HDLGGFYTQIHSLTSCASFLD---GQDDLWLLV 552


>gi|227355852|ref|ZP_03840245.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906]
 gi|227164171|gb|EEI49068.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906]
          Length = 820

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 120/517 (23%), Positives = 218/517 (42%), Gaps = 62/517 (11%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           SFS GE++P L   R DL+ ++  + K  N I  +YG + + P  +   + +   + +R+
Sbjct: 9   SFSGGEIAPSLY-GRVDLAKYSTALRKCHNFIVRQYGGVENRPGTRFIAETKYQNKKSRL 67

Query: 70  FSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
             F         L FGD+ +++      V+ +  +    +F     TPY   D   L+Y 
Sbjct: 68  IPFQFSTVQTYALEFGDRYIRVFKDGGQVLYADGEHKGEVF--ELATPYKEADLFDLKYT 125

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI-S 183
                   VH D+PP  L    D D       E K        +G    + ++  + + +
Sbjct: 126 QSADVMTIVHTDYPPMELQRY-DHDDWKLVSVETK--------NGPFEDINTDKAMKVYA 176

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYRSL 239
            A T    +TS   IF     G+   L        P W  +   ++     AD   YR+ 
Sbjct: 177 SASTGQITLTSTHDIFGSEQIGKQFYLEQRDIDAVPVWETDKTTNLNDQRRADSNYYRAN 236

Query: 240 TTGRSGD-RFGYSKGATYVK---DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295
           + G++G  R  +++G ++     D  I W  +          S  G V        I+ V
Sbjct: 237 SGGKTGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVK-------IETV 280

Query: 296 SKDGRS-----ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350
           S+DG++     +S  P S  + +   S   W  + W + +GYPS V ++  RL F+GS+ 
Sbjct: 281 SEDGKTATGKVLSYIP-SNAVGEDNASH-KWARAVWNDVDGYPSTVVYYQQRLFFAGSRA 338

Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT 409
              +++ S  G + DF      G  +P +     +  ++   ++ + H    G LV   +
Sbjct: 339 YPQTIWASRSGDYKDF------GRNNPIQDDDRIIYTYAGRQVNEIRHLIDVGSLVALTS 392

Query: 410 -SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE- 464
              + ++   +K L   S        +G    PP+SV +  +++   G  ++ +S S + 
Sbjct: 393 GGEYQITGDQNKVLTPSSFSMSSQGANGSSDLPPISVANIALYIQEKGSAVRDLSYSFDV 452

Query: 465 QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVV 500
            G++  ++T LA+HLF + RI+   +   P+SI W +
Sbjct: 453 DGYQGTDLTMLANHLFQRHRIVDWSFTTVPYSIAWCI 489


>gi|268589382|ref|ZP_06123603.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131]
 gi|291315409|gb|EFE55862.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131]
          Length = 818

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 125/547 (22%), Positives = 231/547 (42%), Gaps = 76/547 (13%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           SFS GE++P L   R DL+ ++  + K  N I  +YG + + P  +     +   +  R+
Sbjct: 9   SFSGGEIAPSLY-GRIDLAKYSTALRKCSNFIVRQYGGIENRPGTKFIAAAKYPNKKCRL 67

Query: 70  FSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
             F         L  GDK ++++     V+ +  ++   +F     TPY   D  +L++ 
Sbjct: 68  IPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEYKGEIF--ELATPYKEADLFNLKFT 125

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNA--KLSI 182
                   VH D+PP  L          +  D+ K +P     +G    + ++   KL +
Sbjct: 126 QSADVMTIVHADYPPMELQ--------RYDHDDWKLVPVE-TRNGPFEDINTDKERKLYV 176

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYRS 238
           S A T    +++   IF     G+ I +        P W  +   +I     A    YR+
Sbjct: 177 S-ASTGDVTLSATHNIFGAELVGKQIYIEQQAIDAVPVWETDKTTNINDQRRAGANYYRA 235

Query: 239 LTTGRSGD-RFGYSKGATYVK---DNNITW--------ITVLNLSSKTSRESASGAVAPY 286
            T G+SG  R  +++G ++     D  I W        I  +N S  T   +A+G V  Y
Sbjct: 236 NTAGKSGTLRPSHTEGMSWDGWGGDAGIQWEYLHSGFGIVKIN-SVSTDGLTATGKVVLY 294

Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346
                         S +V  ++ T          W  S W + +GYPS V ++  RL F+
Sbjct: 295 I------------PSNAVGEENATY--------KWARSVWNDVDGYPSTVMYYQQRLFFA 334

Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLV 405
           GS+    +++ S  G + DF      G  +P +     +  ++   ++ + H    G LV
Sbjct: 335 GSRAYPQTIWASRSGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGSLV 388

Query: 406 GCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISG 461
              +   + ++   +K L   S  F     +G    PP++V +  +++   G  ++ ++ 
Sbjct: 389 ALTSGGEYQITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAY 448

Query: 462 STE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAE 519
           S +  G++  ++T +A+HLF + +I+   +   P+SI W +   +D+   +LL   +  E
Sbjct: 449 SFDVDGYQGTDLTIMANHLFQRHQIIDWAFSIVPYSIAWCI---RDDG--KLLSLTYLRE 503

Query: 520 GEGDFAW 526
            +  FAW
Sbjct: 504 QQV-FAW 509


>gi|48697202|ref|YP_024932.1| hypothetical protein BcepC6B_gp12 [Burkholderia phage BcepC6B]
 gi|47779008|gb|AAT38371.1| gp12 [Burkholderia phage BcepC6B]
          Length = 768

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 129/584 (22%), Positives = 228/584 (39%), Gaps = 62/584 (10%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           SF AGELSP LL +R DL+ +  G     N I    GP +     +     +   + + +
Sbjct: 10  SFDAGELSP-LLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWL 68

Query: 70  FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD--NKSLEYAVFG 127
             F + DG   +L FGD  ++  V R     + A       TPY   D   +   +A+  
Sbjct: 69  LPFIVADGIAYMLEFGDHYIRFFVNRGQLVNAGAPV--EIATPYALADLTTEDGTFAIRA 126

Query: 128 S----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI- 182
           +    T    H  +P   LL        +F+   + F+  P+      + V S+  + + 
Sbjct: 127 TQSADTMYLFHGGYPTQKLLRTS---ATTFSLQPVTFVGGPF------AAVNSDNNVRVH 177

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN--TNYSIGAYIV--ADDKVYRS 238
           + A T    + +   +F+P D G    L      + K    +  IG   +    D+VY  
Sbjct: 178 ASAGTGAVTLVASASVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELRRVGDRVYLC 237

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY--------YVWG 290
              G +  +   ++  T+   +   W       S T    + GA   Y         + G
Sbjct: 238 TAVGTATPQVTGTETPTHTSGSR--WDGTGQDESATDEYGSIGAEWEYQHSGYGTVLITG 295

Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVS----WFMSAWGEQEGYPSHVTFHNNRLLFS 346
              D    G   +  P    +    V  ++    W  S +   +G+P   TF  NRL   
Sbjct: 296 YTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGTFWRNRLCLM 355

Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406
             +   +SV  + F  F     D +        A+   +     + + WM    + +L+G
Sbjct: 356 RDRWLAMSVS-ADFETFKTKDADQQTD----DSAIVQQLNARQLNKLAWMVE-SDSLLIG 409

Query: 407 CDTSLWLLSISLSK----GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK-YISG 461
                W++  + +       +++  R +  G     PV VG  ++FV   GR+++ +   
Sbjct: 410 MTGDEWVIGPANASQPVSAANLNAARRTSYGSKRIQPVQVGGTIMFVQKAGRKLRDFKYD 469

Query: 462 STEQGFRFNEITQLADHLFNQR------ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCR 515
            +   +   ++T++ADH+   R      I+ L +Q+EPHS+VW        +  +L+GC 
Sbjct: 470 FSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAA-----RADGQLIGCT 524

Query: 516 FSAE-GEGD-FAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557
           +  E G  D + WH H  ++  +V   AS P  +     LW++V
Sbjct: 525 YDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWVIV 567


>gi|212710810|ref|ZP_03318938.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM
           30120]
 gi|212686507|gb|EEB46035.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM
           30120]
          Length = 818

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 121/538 (22%), Positives = 228/538 (42%), Gaps = 58/538 (10%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           SFS GE++P L   R DL+ ++  + K  N +  +YG + + P  +     +   +  R+
Sbjct: 9   SFSGGEIAPSLY-GRIDLAKYSTALRKCENFLVRQYGGIENRPGTKFIAAAKYPNKKCRL 67

Query: 70  FSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
             F         L  GDK ++++     V+ +  +    +F  T  TPY   D  +L++ 
Sbjct: 68  IPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEHKGEIFELT--TPYKEADLFNLKFT 125

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI-S 183
                   VH D+PP  L          +  D+ K +P     +G    +  + +  +  
Sbjct: 126 QSADVMTIVHADYPPMELQ--------RYDHDDWKLVPVE-TRNGPFEDINVDKERKVYV 176

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYRSL 239
            A T    +T+   IF     G+ I +        P W  +          A    YR+ 
Sbjct: 177 SASTGEVTLTATHNIFGAELVGKQIYIEQQAVDAVPVWETDKTTIKNDQRRAGSNYYRAN 236

Query: 240 TTGRSGD-RFGYSKGATYVK---DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295
           T+G+SG  R  +++G ++     D  I W  +          S  G V    V  D   +
Sbjct: 237 TSGKSGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVKINSVSTD--GL 285

Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355
           +  G+ IS  P S  + ++  +   W  S W + +GYPS V ++  RL F+GS+    ++
Sbjct: 286 TATGKVISYIP-SNAVGESNATY-KWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYPQTI 343

Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWL 413
           + S  G + DF      G  +P +     +  ++   ++ + H    G LV   +   + 
Sbjct: 344 WASRSGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGSLVALTSGGEYQ 397

Query: 414 LSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469
           ++   +K L   S  F     +G    PP++V +  +++   G  ++ ++ S +  G++ 
Sbjct: 398 ITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQG 457

Query: 470 NEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
            ++T +A+HLF + +I+   +   P+SI W +   +D+   +LL   +  E +  FAW
Sbjct: 458 TDLTIMANHLFQRHQIIDWAFTIVPYSIAWCI---RDDG--KLLSLTYLREQQV-FAW 509


>gi|221213947|ref|ZP_03586920.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
 gi|221166124|gb|EED98597.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
          Length = 766

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 132/584 (22%), Positives = 221/584 (37%), Gaps = 64/584 (10%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           SF AGELSP LL +R DL+ +A G     N I    GP V     +     +   +   +
Sbjct: 10  SFDAGELSP-LLGARVDLAKYANGCLLLENFIATVQGPAVRRGGKRYVSAIKDSGKQAWL 68

Query: 70  FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD--NKSLEYAVFG 127
             F + DG   +L FGD+ ++  V R       A       TPY   D   +   +A+  
Sbjct: 69  LPFIVSDGIAYMLEFGDQYIRFYVNRGQLVNDSAPV--EIATPYALADLVTEDGTFAIRA 126

Query: 128 S----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
           +    T    H  +P   L         +F    + F+  P+      + V  N  + + 
Sbjct: 127 TQSADTMYLFHGAYPTQKLSRTS---ATTFELQPVTFVGGPF------ATVNDNNSIRVQ 177

Query: 184 QADTS-TARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIVADDKVYRS 238
            +  S    +T++  +F+  D G    +    P     WA +    +       D+ YR 
Sbjct: 178 ASGQSGDVTLTANADVFRASDVGTLFYVEQEQPTGIVPWAVHAESHVNDIRRVGDRTYRC 237

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSR-------ESASGAVAPYYVWGD 291
              G +  +   +   T +      W          +        E      A   + G 
Sbjct: 238 TQIGLNAPQV--TGQETPIHTEGRRWDGDGRDPDGDTYGSIGVEWEYQHSGYATVLITGF 295

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGV---SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348
           +          +  P    +    V       W  S +   +G+P   TF +NRL     
Sbjct: 296 VNARQVSATVTTNNPNDPCMIPKPVVDSGTYKWARSLFNSTDGFPQMGTFWSNRLCVMRD 355

Query: 349 KGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407
           +   +SV       F +F + D +    D   A+   +     + + WM    + +LVG 
Sbjct: 356 RWIAMSVSAD----FENFKTKDADQQTDD--SAIVQQLNARRLNKLAWMVE-SDSLLVGM 408

Query: 408 DTSLWLL-----SISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK-YISG 461
               W++     S++LS   ++  RR +  G     PV VG  ++FV   GR+++ +   
Sbjct: 409 TGDEWVIGKSNASLALS-ATNMSARRRTSYGSKRLQPVEVGGTILFVQKAGRKLRDFKYD 467

Query: 462 STEQGFRFNEITQLADHLFNQR------ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCR 515
            +   +   ++T++ADH+   R      I+ L YQ+EPHSIVW        +  +L+GC 
Sbjct: 468 FSSDNYVSTDVTKIADHVTRGRSGTNSGIMSLCYQQEPHSIVWAA-----RADGQLIGCT 522

Query: 516 FSAE-GEGD-FAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557
           +  E G  D + WH H   +  +V   AS P  +     LWM+V
Sbjct: 523 YDEEAGRSDVYGWHRHPDVNG-FVECVASMPAPDGASDDLWMIV 565


>gi|221201505|ref|ZP_03574544.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
 gi|221207939|ref|ZP_03580945.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2]
 gi|221172124|gb|EEE04565.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2]
 gi|221178773|gb|EEE11181.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
          Length = 767

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 137/597 (22%), Positives = 230/597 (38%), Gaps = 89/597 (14%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           SF AGELSP LL +R D++ +  G     N I    GP V     +     +   +   +
Sbjct: 10  SFDAGELSP-LLGARVDIAKYPNGCKVMENFIATVQGPAVRRGGKRFVAAVKDSSKQAWL 68

Query: 70  FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD--NKSLEYAVFG 127
             F + DG   +L FGD  ++  V R   +   A       TPY   D   +   +A+  
Sbjct: 69  LPFIVSDGIAYMLEFGDHYIRFYVDRG--QLVNAGGPVEIATPYALADLVTEDGTFAIRA 126

Query: 128 S----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
           +    T    H  +PP  LL        +F+  ++ F+  P+       GV   A     
Sbjct: 127 TQSADTMYLFHGAYPPQKLLRTS---ATTFSLQQVTFVSGPFQTINSDEGVTVKAS---- 179

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRS 238
              T    +T+   +F   D G    L  +      P     T    G      D+ Y S
Sbjct: 180 -GQTGAVTLTATAPVFSQADVGALFYLEQNDNTSVLPWSVHGTILETGLVRRVGDRTYVS 238

Query: 239 LTTGRSGDRFGYSKGATYVK----DNNIT--------------------WITVLNLSSKT 274
              G +  +   S+  T+ +    D ++T                    + TVL ++S +
Sbjct: 239 TAIGPTAPQVTGSETPTHTRGRRYDGDLTDLANDNYGTIGIEWEYQHSGYATVL-ITSVS 297

Query: 275 SRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPS 334
             + A+G V            + +     + PQS  +   G     W  + +   +GYP 
Sbjct: 298 DSQHATGTV-----------TTNNPTDPCIIPQS--IVDTGT--YKWAHALFNAADGYPQ 342

Query: 335 HVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTI 393
             TF  NRL     +    SV       F +F S D +    D   A+   +     + +
Sbjct: 343 MGTFWRNRLWMMRDRWLVGSVSAD----FENFASKDADQQTDD--SAIVQQLNARQLNKL 396

Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLSK----GLSIDFRRVSGSGVYACPPVSVGDCLVFV 449
            WM    + +++G     W++  + +       +++  R +  G     PV VG  ++FV
Sbjct: 397 AWMVE-SDSLIIGMTGDEWVIGPANASQPVSATNLNAARRTSYGSKRIQPVQVGGTIMFV 455

Query: 450 CGVGRRIK-YISGSTEQGFRFNEITQLADHLF------NQRILQLVYQEEPHSIVWVVLE 502
              GR+++ +    +   F   ++T+LADH+       N  I+ L +Q+EPHSIVW    
Sbjct: 456 QKAGRKLRDFKYDFSSDNFVSTDVTKLADHITRGRSGTNNGIMSLCFQQEPHSIVWAA-- 513

Query: 503 PKDNSFPRLLGCRFSAE-GEGD-FAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557
               +  +L+GC +  E G  D + WH H  ++  +V   AS P  +     LW++V
Sbjct: 514 ---RADGQLIGCTYDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWLIV 566


>gi|288959382|ref|YP_003449723.1| hypothetical protein AZL_025410 [Azospirillum sp. B510]
 gi|288911690|dbj|BAI73179.1| hypothetical protein AZL_025410 [Azospirillum sp. B510]
          Length = 665

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 77/277 (27%), Positives = 122/277 (44%), Gaps = 20/277 (7%)

Query: 288 VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEG-YPSHVTFHNNRLLFS 346
           VWG  + ++  G   SV    +  +    +   W + AWG   G +P+ VTFH NRL F+
Sbjct: 202 VWGWCR-ITAFGSVTSVTATVEAAWGGTTATAFWRLGAWGATTGTWPTAVTFHENRLAFA 260

Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH-PFGEGVLV 405
             +    +V+LS  G F +F    E G      A+T    D   + I W+   FG  +  
Sbjct: 261 ALQ----TVWLSCSGDFDNFGPTTENGTVAADNAITLTAADDQVNVIRWLRSAFGVLIAG 316

Query: 406 GCDTSLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS 462
                  + + SL + L+       RV  +G     PV V   LVF     RR+  ++  
Sbjct: 317 TSGGPFAIQASSLREALTPINATMPRVHVAGAADVQPVRVATNLVFPSRSRRRLHLLNAE 376

Query: 463 -TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGE 521
               G+   ++  +A H+    +  + YQ+EP S++W+VL+  D +   L G  +  E +
Sbjct: 377 FAAAGYSAPDLALVASHITRHAVKAMAYQQEPWSVMWLVLD--DGT---LAGVTYVPELD 431

Query: 522 GDFAWHTHMISDKHY-VLSAASFPNDNRGGTSLWMLV 557
              AWH H +      VLS A  P  +R    LW++V
Sbjct: 432 -ILAWHRHPLGGTAVKVLSVACIPAADR--DELWLVV 465


>gi|262043557|ref|ZP_06016670.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039091|gb|EEW40249.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 511

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 117/492 (23%), Positives = 200/492 (40%), Gaps = 37/492 (7%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +W + SFS GE++P L   R D++ +   + K  N I  +YG + + P  Q     +   
Sbjct: 4   SWIQPSFSGGEIAPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTQFIAAAKYPD 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
           R  R+  F         L FG   ++ V+       +         TPYT  D   L++ 
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHNYMR-VIKDGGLVLTTGDVIYELATPYTENDVFGLKFT 121

Query: 125 VFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                   VH  +PP  L  Y  D  +I     +++    P+    +       +K   +
Sbjct: 122 QSADVMTIVHPSYPPKELRRYAHDNWQIV----DVQTTNGPFEDINV-----DESKTVWA 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRS 238
            A T T  +TS   IF     G+   L   P     P W  + + SI     AD   YR+
Sbjct: 173 SAPTGTITLTSSSAIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIEDIRRADSNYYRA 231

Query: 239 LTTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSK 297
            T G++G  R  +++G  +                     S  G V    V GD    + 
Sbjct: 232 NTAGKTGTLRPSHTEGMAWDGWGGTG--DDDTGVQWEYLHSGFGIVRITAVAGDGLTATA 289

Query: 298 DGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357
           D   +S  P++  +  A  +   W   AW    GYP+ V ++  RL F+ S     +++ 
Sbjct: 290 D--VVSRIPEN--VVGADKASYKWARYAWNSVNGYPATVVYYQQRLYFAASPAYPQTIWA 345

Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWLLS 415
           S  G + DF      G  +PT+     V  ++   ++ + H    G LV   +   ++++
Sbjct: 346 SRTGDYKDF------GKSNPTQDDDRIVYTYAGRQVNEIRHLIDVGSLVVLTSGGEFVVT 399

Query: 416 ISLSKGLSIDFRRVSGSGVYAC---PPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471
              +K L+     +S  G   C   PP++V +  +F+   G  ++ ++ S +  GF+ N+
Sbjct: 400 GDQNKVLTPSAFSLSSQGSNGCSDVPPIAVSNIALFIQEKGSVVRDLAYSFDVDGFQGND 459

Query: 472 ITQLADHLFNQR 483
           +T LA+HLF +R
Sbjct: 460 LTILANHLFQKR 471


>gi|218886166|ref|YP_002435487.1| hypothetical protein DvMF_1065 [Desulfovibrio vulgaris str.
           'Miyazaki F']
 gi|218757120|gb|ACL08019.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           'Miyazaki F']
          Length = 692

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 114/250 (45%), Gaps = 21/250 (8%)

Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
           YPS V F   RL F+GS+    +++ S  G + +  +       D   A+T  +   + S
Sbjct: 274 YPSSVQFWQQRLCFAGSRSHPQTIWASRTGCYENMDVSRPLQTDD---AVTVTIASETVS 330

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS-----IDFRRVSGSGVYACPPVSVGDCL 446
            + WM P    +LVG     W LS   S+  S     ++F+   GS     PP++VGD +
Sbjct: 331 AVRWMMP-ARKLLVGTGGGEWTLSGQGSEPFSPLSCLLEFQSARGSA--ELPPLAVGDGV 387

Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVVLEPK 504
           + V   GR ++    S +  G+   + T LA+H+   R I+   YQ+ PHS+VW  ++  
Sbjct: 388 LAVQRGGRAVRDFRYSLDVDGYSGADQTILAEHMLRGRNIVDWAYQQSPHSVVWCAMD-- 445

Query: 505 DNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA-SFPNDNRGGTSLWMLVALSA-G 562
           D +   + G    AE +    WH H        L      P+D  GG  LW++V     G
Sbjct: 446 DGT---MAGLTLIAEHQ-VAGWHRHDTGGAVEALCVVPGPPSDPAGGDELWLVVRRDVDG 501

Query: 563 EERSFTVRLN 572
            +R +  RL+
Sbjct: 502 VQRRYIERLD 511



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 48/91 (52%), Gaps = 1/91 (1%)

Query: 1  MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
          M  TT  ++SF+AGELSP L+ +R D + +A G    RN++   +GP    P ++    C
Sbjct: 1  MARTTLIQNSFNAGELSP-LMAARGDQARYASGCRVLRNMLLHPHGPAFRRPGLRFMGAC 59

Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQI 91
            +    R+  F   +G   +L F  ++L++
Sbjct: 60 VDETVPPRLVPFVFNEGQAYVLEFAPERLRV 90


>gi|301046400|ref|ZP_07193560.1| conserved domain protein [Escherichia coli MS 185-1]
 gi|300301626|gb|EFJ58011.1| conserved domain protein [Escherichia coli MS 185-1]
          Length = 821

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W  + +        +A+G
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
             A   V             IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWNSINGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKALTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|89152436|ref|YP_512269.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10]
 gi|74055459|gb|AAZ95908.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10]
          Length = 823

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  L++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESLTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W   L+     +R +A  
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITA-- 279

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
                     +   +     IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 280 ----------VNGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|187736306|ref|YP_001878418.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187426358|gb|ACD05637.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 822

 Score = 83.6 bits (205), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 75/254 (29%), Positives = 112/254 (44%), Gaps = 18/254 (7%)

Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380
           W   A+G + GYP  V FH  RL F G+ G   +++ S    F  F+        D    
Sbjct: 413 WSFGAFGVRNGYPCTVEFHQGRLWFGGTPGQPQTLWASRVDDFSAFTPGIP---ADSPMI 469

Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID---FRRVSGSGVYAC 437
           LT A +    + I W+     G+++G     W LS + S+GL+     F R SG G  + 
Sbjct: 470 LTMAAS--QQNRISWIASL-RGLMIGTSEGEWRLSATNSEGLNASNAGFERHSGVGSASL 526

Query: 438 PPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSI 496
             +SV + L+FV   G +++ +  S E  G++  +++ L+DHL  + I+    Q      
Sbjct: 527 DALSVENSLLFVQQGGMKVRELFYSLEADGYQTRDVSLLSDHLLGEGIVDWTVQRSTAFH 586

Query: 497 VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND-NRGGTSLWM 555
           VW VL   D S      C      +   AWH H + +   +LS AS     N     +W 
Sbjct: 587 VWCVL--GDGSAV----CMTLNREQNVVAWHAHRL-EHGRILSVASLRGSRNTPDEEVWF 639

Query: 556 LVALSAGEERSFTV 569
            VA   GEE   TV
Sbjct: 640 AVARGEGEEACITV 653


>gi|327252176|gb|EGE63848.1| phage protein [Escherichia coli STEC_7v]
          Length = 823

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W  + +        +A+G
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
             A   V             IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|323156125|gb|EFZ42284.1| phage protein [Escherichia coli EPECa14]
          Length = 823

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W   SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIHPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T++  IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTANASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W  + +        +A+G
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
             A   V             IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|332344346|gb|AEE57680.1| conserved hypothetical protein [Escherichia coli UMNK88]
          Length = 823

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 121/526 (23%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W  + +        +A+G
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARISAANG 282

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
             A   V             IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 283 TTATAEV-------------ISYIP-SQVVGEDNASY-KWAKYAWDSINGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLAPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|294493191|gb|ADE91947.1| conserved hypothetical protein [Escherichia coli IHE3034]
          Length = 823

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 120/526 (22%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W   L+     +R +A  
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITA-- 279

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
                     +   +     IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 280 ----------VNGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|300898435|ref|ZP_07116776.1| conserved domain protein [Escherichia coli MS 198-1]
 gi|300357902|gb|EFJ73772.1| conserved domain protein [Escherichia coli MS 198-1]
          Length = 823

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 119/526 (22%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W   L+     +R +A  
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITA-- 279

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
                     +   +     IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 280 ----------VNGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ +++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGSDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|46580124|ref|YP_010932.1| hypothetical protein DVU1714 [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|46449540|gb|AAS96191.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|311233883|gb|ADP86737.1| hypothetical protein Deval_1582 [Desulfovibrio vulgaris RCH1]
          Length = 697

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 140/589 (23%), Positives = 223/589 (37%), Gaps = 109/589 (18%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMP---LMQEYRDCRLDP 64
           + +F+ GE+SP LL +R D   +  G    RN +PL  GP+   P    M   ++    P
Sbjct: 8   QQAFNGGEISP-LLTARADQIRYQTGALTMRNAVPLAQGPVTRRPGLRFMGAAKEQGAGP 66

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF----GKTYK--TPYTFKDN 118
              R+ SF         L FG   +++        W  A      G+ Y+  +PY   D 
Sbjct: 67  --VRLVSFVFSAAQSRALEFGPGYVRV--------WMDAGLVSKNGQPYEVASPYGAADI 116

Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP----PPWLGDGMISGV 174
             L +A          ++HPP  L    D D   + F    F+P    P  L  G +   
Sbjct: 117 AGLRFAQSADVIYIASRNHPPRKLSRHADDD---WRFITPTFMPTQAAPGALTLGTLGTT 173

Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234
                 + S   T+ +  T +  +  P  +G           W + +  ++   +  + +
Sbjct: 174 PGPGNETYSYKVTAVSATTGEESLASP--EGTITTTAMSSTYWVRVSWAAVPGAV--EYR 229

Query: 235 VYRSLTTGRSGDRFGY----SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWG 290
           VY+     R    FG+      G T+  D NI                  GA        
Sbjct: 230 VYK-----RRYGVFGFIGRAVGGDTFFDDRNI------------------GA-------- 258

Query: 291 DIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350
           D +D           P+++  F           +A GE   YP  V F   RL F+GS  
Sbjct: 259 DTEDT---------VPEAKNPF-----------TAAGE---YPGLVFFWQQRLGFAGSDK 295

Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL-VGCDT 409
             L+V+LS   AF + +        D  +A    +     +   W+   G+  L +G + 
Sbjct: 296 RPLTVWLSQSAAFENLAASRPPQDDDGIEA---TLAGQRQNRFVWIE--GDRTLCLGTEG 350

Query: 410 SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ- 465
             W LS      +   S+ F+     G    P V  GD L++V   G  ++  + S E+ 
Sbjct: 351 GEWTLSGQEGGPVTPTSLQFQSHGVRGSEGVPAVRAGDSLLYVQRGGGVVREFTYSFERD 410

Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGD-F 524
           G+   ++T L   L  +++    YQ+ PHSIVW VL+  D +   L   R     E D  
Sbjct: 411 GYVAPDLTLLTGVLRGRKVRAWAYQQSPHSIVWCVLD--DGTLAALTFLR-----EHDVV 463

Query: 525 AWHTHMISDKHYVLSAASFPNDNRGGT-SLWMLVALS-AGEERSFTVRL 571
            WH H        ++     +   GGT ++WMLV  +  G+ER +  R+
Sbjct: 464 GWHRHDTDGVVEDVTVIPGGDATAGGTDTVWMLVRRTVGGQERRYVERM 512


>gi|242278913|ref|YP_002991042.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638]
 gi|242121807|gb|ACS79503.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638]
          Length = 698

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/250 (28%), Positives = 115/250 (46%), Gaps = 33/250 (13%)

Query: 304 VAPQSQTLFQAGVSVVSWFMS---------AWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354
           V P+ Q    +  S V W M           W  ++G+PS VTF   RL F+ S  +  +
Sbjct: 246 VHPEVQPYKLSRTSHVDWKMELVAFSSPPQEWNSEKGFPSCVTFFEERLCFAASPSNPQT 305

Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414
           +++S  G++ DF++       D   A T  ++    + I WM    + +++G     W  
Sbjct: 306 IWMSKAGSYEDFAVSSPVVDDD---ACTYTLSADQVNAIRWMVS-AKKLIMGTSGGEWW- 360

Query: 415 SISLSKGLSID--------FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-Q 465
              LS G S+D         RR +  G  A PPV VG  ++F+   GR I+ +S S E  
Sbjct: 361 ---LSGGSSLDSVTPNSVMVRRETTHGSAAIPPVVVGGVMLFLQREGRTIRELSYSFEAD 417

Query: 466 GFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524
           G+   ++T LA+HL  +  I +  YQ+ P S++W+    +D+    ++G  +  E E   
Sbjct: 418 GYTAPDLTILAEHLTRSNSITEWAYQQSPDSVIWMT---RDDGV--MVGLTYQREHE-VV 471

Query: 525 AWHTHMISDK 534
            +H H    K
Sbjct: 472 GFHRHTTDGK 481


>gi|30387391|ref|NP_848220.1| hypothetical protein epsilon15p12 [Enterobacteria phage epsilon15]
 gi|30266046|gb|AAO06075.1| 12 [Salmonella phage epsilon15]
          Length = 825

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 116/504 (23%), Positives = 200/504 (39%), Gaps = 63/504 (12%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +W + SF+ GE+ P L   R D+S +   + K  N I  +YG + + P  +     +   
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDN 118
           R  R+  F         L FG   +++      V+  S+  +  A+       PY   D 
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHNYMRVIKDGAYVLTTSNVIYELAM-------PYADTDL 115

Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177
             +++         VH  +PP  L  Y  D  +I     ++     P+    +   VK  
Sbjct: 116 FRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIV----DVTTKNGPFEDINVDETVKVY 171

Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVAD 232
           A      A T T  +T+   IF     G+   L   P     P W  +   +I     AD
Sbjct: 172 AS-----ASTGTITLTASSAIFGAEQVGKLFYLE-QPAVDSVPVWETSKTTAINDVRRAD 225

Query: 233 DKVYRSLTTGRSGD-RFGYSKGATY-------VKDNNITWITVLNLSSKTSRESASGAVA 284
              YR+ T+G++G  R  +++G ++         D  I W  +          S  G   
Sbjct: 226 SNYYRANTSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYL---------HSGFGIAK 276

Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344
              V GD    + D   +S  P SQ +  A  S   W   AW    GYPS V ++  RL 
Sbjct: 277 ITAVAGDGLTATAD--VVSFIP-SQVVGSANASY-KWAKYAWNSVNGYPSTVVYYQQRLY 332

Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGV 403
           F+ S     +++ S  G + DF      G  +P +     +  ++   ++ + H    G 
Sbjct: 333 FAASTAYPQTIWASRTGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGN 386

Query: 404 LVGCDT-SLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI 459
           LV   +   + +S   +K L+     F     +G    PP++V +  +F+   G  ++ +
Sbjct: 387 LVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDL 446

Query: 460 SGSTE-QGFRFNEITQLADHLFNQ 482
           + S +  G++  ++T LA+HLF +
Sbjct: 447 AYSFDVDGYQGTDLTILANHLFQK 470


>gi|120601703|ref|YP_966103.1| hypothetical protein Dvul_0653 [Desulfovibrio vulgaris DP4]
 gi|120561932|gb|ABM27676.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4]
          Length = 699

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 130/574 (22%), Positives = 213/574 (37%), Gaps = 90/574 (15%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   T  ++SF+AGELSP L+ +R D + +  G A   N++   +G     P ++ +   
Sbjct: 1   MARATIVRNSFNAGELSP-LMAARVDQARYPNGCASLCNMLLHPHGGAWRRPGLR-FMGL 58

Query: 61  RLDPRSN-RVFSFSIPDGGYALLVFGDKKLQI---VVVRSSTKWSPALFGKTYKTPYTFK 116
             DP    R+  F   +    +L FG + L+I     +       P       +TP+  +
Sbjct: 59  AADPAGPVRLIPFVFSEAQAYVLEFGPRSLRIWHGGGLVLGGDGEPFRL----ETPWAGE 114

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP---PPWLGDGMISG 173
              +L +         V    PP  L      D   +   ++ FLP   PP   +G+   
Sbjct: 115 QLTALRWCQSADMLYLVSHAGPPRRLERHGHAD---WRLVDVSFLPGVSPP---EGLHCT 168

Query: 174 VKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADD 233
           VK     + +   T+  R + +  +  P  +         P   ++  + ++    V D 
Sbjct: 169 VKPAGSRTWTYVVTAVHRESGEESLPTPPLQVTG------PDALSQTASVTLAWTPVQDA 222

Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293
             YR    G     +G+   A                          GA   Y   G   
Sbjct: 223 GEYRVYRAGGGASVYGFLGSA--------------------------GAGETYTDTGRTP 256

Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353
           D           P+++  F              GE + +PS   F   RL F+G++    
Sbjct: 257 DFDAG------PPEARNPFS-------------GEGD-WPSCAVFWQQRLCFAGTRNGPQ 296

Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413
           +++ S  GA+ +FS+       D   A+T  +   + S + W+ P    +LVG     W 
Sbjct: 297 TIWASRSGAYGNFSVSRPLRDDD---AVTVTIAADTVSAVRWLMP-ARRLLVGTGGGEWT 352

Query: 414 LSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469
           LS    +    LS    R S  G     P+SVGD ++ +   GR ++    S +  G+  
Sbjct: 353 LSGQGEQPFSPLSCSLERQSSRGSGDVQPLSVGDAVLALQRGGRVVREFRYSLDVDGYAG 412

Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528
            ++T LA+HL   +RI+   +Q+ P   VW V E        L+      E E    WH 
Sbjct: 413 TDLTILAEHLTRGRRIIDWAWQQSPSGTVWCVTEDGG-----LIAMTRIPEHE-VAGWHR 466

Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSAG 562
           H+      VLS  + P     G  LW+ V    G
Sbjct: 467 HVTDGA--VLSVCTIPGT--AGDELWVAVRREGG 496


>gi|215487813|ref|YP_002330244.1| hypothetical protein E2348C_2746 [Escherichia coli O127:H6 str.
           E2348/69]
 gi|215265885|emb|CAS10294.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
          Length = 825

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 116/504 (23%), Positives = 199/504 (39%), Gaps = 63/504 (12%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +W + SF+ GE+ P L   R D+S +   + K  N I  +YG + + P  +     +   
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDN 118
           R  R+  F         L FG   +++      V+  S+  +  A+       PY   D 
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHNYMRVIKDGEYVLTTSNVIYELAM-------PYADTDL 115

Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177
             +++         VH  +PP  L  Y  D  +I     ++     P+    +   VK  
Sbjct: 116 FRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIV----DVTTKNGPFEDINVDDTVKVY 171

Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVAD 232
           A      A T T  +T+   IF     G+   L   P     P W  +   +I     AD
Sbjct: 172 AS-----ASTGTITLTASSAIFGAEQVGKLFYLE-QPAVDSVPVWETSKTTAINDVRRAD 225

Query: 233 DKVYRSLTTGRSGD-RFGYSKGATY-------VKDNNITWITVLNLSSKTSRESASGAVA 284
              YR+ T G++G  R  +++G ++         D  I W  +          S  G   
Sbjct: 226 SNYYRANTAGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYL---------HSGFGIAK 276

Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344
              V GD    + D   +S  P SQ +  A  S   W   AW    GYPS V ++  RL 
Sbjct: 277 ITAVSGDGLTATAD--VVSFIP-SQVVGSANASY-KWAKYAWNSVNGYPSTVVYYQQRLY 332

Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGV 403
           F+ S     +++ S  G + DF      G  +P +     +  ++   ++ + H    G 
Sbjct: 333 FAASTAYPQTIWASRTGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGN 386

Query: 404 LVGCDT-SLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI 459
           LV   +   + +S   +K L+     F     +G    PP++V +  +F+   G  ++ +
Sbjct: 387 LVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDL 446

Query: 460 SGSTE-QGFRFNEITQLADHLFNQ 482
           + S +  G++  ++T LA+HLF +
Sbjct: 447 AYSFDVDGYQGTDLTILANHLFQK 470


>gi|292670776|ref|ZP_06604202.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541]
 gi|292647397|gb|EFF65369.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541]
          Length = 762

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 22/220 (10%)

Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALT 382
           +SAW  ++GYP  V+F  +RL+F+GS+    + + S  G +Y+F ++      D   A+T
Sbjct: 345 LSAWSAKKGYPQAVSFFEDRLVFAGSRAKPQTYWASQSGDYYNFWVNTPQQDSD---AIT 401

Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWL------LSISLSKGLSIDFRRVSGSGVYA 436
             ++    + I  + PFGE +++       +       + +  K    ++R     G+  
Sbjct: 402 GTLSGGQMNGIRAIIPFGEMLMLTSGGEYKVGGGNETFTPTNQKAEPQEYR-----GINN 456

Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN-QRILQLVYQEEPH 494
             PV +G  +V+V   G  I+ ++ S +   +  ++++ LA HLF    I+ L YQ+ P+
Sbjct: 457 LTPVVIGGRIVYVQHQGSVIRDLTYSYDVDKYTGDDVSLLAAHLFEGHTIVALAYQQTPN 516

Query: 495 SIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534
           ++VW V E  D +   LLG  +  E +  +AWH H  + K
Sbjct: 517 TVVWCVRE--DGA---LLGMTYIKE-QDVYAWHKHTTAGK 550



 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 59/140 (42%), Gaps = 14/140 (10%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K SF+ GEL+P L   R DL  +  G +  +N+I LRYG     P  +     +   R+ 
Sbjct: 10  KPSFAGGELTPALY-GRTDLQKYDVGASTLKNMIVLRYGGATRRPGFRHVAKTQGGKRA- 67

Query: 68  RVFSFSIPDGGYALLVFGDKKLQI-----VVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
           R+  F        +L F    +++     +VV+     +P +      T YT  D   ++
Sbjct: 68  RLIPFQYSTEQSYVLEFTAGCIRVFTKGGIVVKDD---APLVI----PTSYTEADLSDIK 120

Query: 123 YAVFGSTAVFVHKDHPPHHL 142
           Y         VH +HPP  L
Sbjct: 121 YTQSADVLFLVHVNHPPMTL 140


>gi|218700982|ref|YP_002408611.1| hypothetical protein ECIAI39_2672 [Escherichia coli IAI39]
 gi|218370968|emb|CAR18795.1| conserved hypothetical protein from phage origin [Escherichia coli
           IAI39]
 gi|323948677|gb|EGB44582.1| hypothetical protein ERKG_04900 [Escherichia coli H252]
          Length = 823

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 120/521 (23%), Positives = 217/521 (41%), Gaps = 72/521 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+ + IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASVSIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W   L+     +R +A+ 
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAAN 281

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
                               IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 282 GTT------------ATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHS 495
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYS 482


>gi|169795391|ref|YP_001713184.1| phage-like protein [Acinetobacter baumannii AYE]
 gi|169148318|emb|CAM86183.1| hypothetical protein; putative phage related protein [Acinetobacter
           baumannii AYE]
          Length = 697

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 120/511 (23%), Positives = 199/511 (38%), Gaps = 74/511 (14%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K++ S+GELSP LL +R D+  +A G  K  N +PL  G     P   ++R   +   + 
Sbjct: 12  KNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRP-GTKFRS--IFAGAL 67

Query: 68  RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT--PY-TFKDNKSLEYA 124
           R+  F        LL+ G   L++        ++P  +   Y+T  PY T +  + ++YA
Sbjct: 68  RLIPFIANSENTYLLILGVSFLKV--------YNPRTYAVVYETVTPYNTAQKVREVQYA 119

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                  FV  D P   LL   D     F        P   LG        S   +++S 
Sbjct: 120 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 171

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244
           + T   ++ S                    P W+    Y  G  ++ + K +R+     +
Sbjct: 172 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHNSKTWRA-----T 212

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK-----DVSKDG 299
            D  G    AT  +     W  V N ++     ++ G++      G +K     D S+  
Sbjct: 213 ADNKGVEPSATTPE-----WEEVTNEAANVFTPASVGSIVEIN-GGQVKITEYVDPSRVN 266

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357
             + V   S     A     SW +   A+  + GYP  V F   RL+F+ +K     ++ 
Sbjct: 267 GEVLVKLTSDVQAIAK----SWVLKSIAFSAEAGYPKAVCFFKQRLVFANTKTSPNQMWF 322

Query: 358 SSF---GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414
           S     G F + + D +        A + A +   +  I  +   G  V +       + 
Sbjct: 323 SRIGDDGNFLETTQDAD--------AFSIASSSAQSDNILHLSQRGGVVALTGGAEFLIN 374

Query: 415 SISLSKGLSIDFRRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472
           S       S      +  GV A   P  VG+ L+FV   G R++ +S   E  G    E+
Sbjct: 375 SQGPLTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPEL 434

Query: 473 TQLADHLFNQR--ILQLVYQEEPHSIVWVVL 501
           +Q+A H+      I +L +Q+ P+SIVW+V+
Sbjct: 435 SQIAPHIPENHAGIKELTFQQTPNSIVWIVM 465


>gi|324008552|gb|EGB77771.1| conserved domain protein [Escherichia coli MS 57-2]
          Length = 823

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 120/526 (22%), Positives = 219/526 (41%), Gaps = 72/526 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W   L+     +R +A+ 
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAA- 280

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
                         +     IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 281 -----------NGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|331648168|ref|ZP_08349258.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331043028|gb|EGI15168.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 823

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 120/521 (23%), Positives = 217/521 (41%), Gaps = 72/521 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W   L+     +R +A+ 
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAA- 280

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
                         +     IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 281 -----------NGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHS 495
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYS 482


>gi|298381710|ref|ZP_06991309.1| conserved hypothetical protein [Escherichia coli FVEC1302]
 gi|298279152|gb|EFI20666.1| conserved hypothetical protein [Escherichia coli FVEC1302]
          Length = 823

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 120/521 (23%), Positives = 217/521 (41%), Gaps = 72/521 (13%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKD-------NNITWITVLNLSSKTSRESASG 281
            AD   YR++T G++G  R  +++G ++            I W   L+     +R +A+ 
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAA- 280

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNN 341
                         +     IS  P SQ + +   S   W   AW    GYP  V ++  
Sbjct: 281 -----------NGTTATAEVISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQ 327

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFG 400
           RL F+ S     +++ S  G + DF      G  +PT+     +  ++   ++ + H   
Sbjct: 328 RLYFAASTAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLID 381

Query: 401 EGVLVGCDT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
            G LV   +   ++++   +K L   S  F     +G    PP++V +  +FV   G  +
Sbjct: 382 VGSLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVV 441

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHS 495
           + ++ S +  G++ N++T LA+HLF +  I+   +   P+S
Sbjct: 442 RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYS 482


>gi|117624704|ref|YP_853617.1| hypothetical protein APECO1_4049 [Escherichia coli APEC O1]
 gi|115513828|gb|ABJ01903.1| conserved hypothetical protein [Escherichia coli APEC O1]
          Length = 823

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 116/519 (22%), Positives = 213/519 (41%), Gaps = 58/519 (11%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EY 57
           +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +         
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN 62

Query: 58  RDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
           R CRL P + + V ++++  G   + V  D  L  V+  S+  +  A       TPYT  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHQYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEA 113

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
           D   +++         VH  +PP  L  Y  D  ++     +          +G    + 
Sbjct: 114 DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTK----------NGPFEDIN 163

Query: 176 SNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
            +  +++ + A T T  +T+   IF     G+   L   P     P W  + + SIG   
Sbjct: 164 IDESVTVYASASTGTITLTASASIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIR 222

Query: 230 VADDKVYRSLTTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYV 288
            AD   YR++T G++G  R  +++G ++          +               +     
Sbjct: 223 RADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDIGIEWEYLHSGFGIARITAANG 282

Query: 289 WGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348
                +V      IS  P SQ + +   S   W   AW    GYP  V ++  RL F+ S
Sbjct: 283 TTATAEV------ISYIP-SQVVGEDNASY-KWAKYAWNSVNGYPGTVVYYQQRLYFAAS 334

Query: 349 KGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGC 407
                +++ S  G + DF      G  +PT+     +  ++   ++ + H    G LV  
Sbjct: 335 TAFPQTIWASRTGDYKDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLIDVGSLVAL 388

Query: 408 DT-SLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST 463
            +   ++++   +K L   S  F     +G    PP++V +  +FV   G  ++ ++ S 
Sbjct: 389 TSGGEYVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSF 448

Query: 464 E-QGFRFNEITQLADHLFNQR-ILQLVYQEEPHSIVWVV 500
           +  G++ N++T LA+HLF +  I+   +   P+S  + +
Sbjct: 449 DVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI 487


>gi|304398395|ref|ZP_07380269.1| conserved hypothetical protein [Pantoea sp. aB]
 gi|304354261|gb|EFM18634.1| conserved hypothetical protein [Pantoea sp. aB]
          Length = 824

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 114/520 (21%), Positives = 200/520 (38%), Gaps = 68/520 (13%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           SF+ GE+SP +   R DL+ ++  + + RN I  +YG L + P  +   + +   R  R+
Sbjct: 9   SFAGGEISPNVY-GRVDLAKYSIALRRCRNFIVRQYGGLENRPGTRFIAEAKYPDRKCRL 67

Query: 70  FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKT----YKTPYTFKDNKSLEYAV 125
             F         L FG   +++            L G        TPY   D   L+   
Sbjct: 68  IPFQFSTVQTYALEFGHNYMRVY-----KDGGQVLDGNNQVYELATPYQEADLFELKITQ 122

Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185
                   HK + P  L         S+   E+     P+    +   VK  A  S  Q 
Sbjct: 123 SADVMTICHKAYAPRELRRF---GHASWELVEVVTKNGPFEDINIDPSVKVYA--SSYQG 177

Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTT 241
           + +   + ++  IF     G+   L        P W  +   ++G    A D  Y +LT 
Sbjct: 178 NIT---LNANASIFGSEQVGKLFYLEQVNVDSTPVWETDKAVAVGMTRRAGDNYYVALTA 234

Query: 242 GRSGD-RFGYSKGATYV-------KDNNITW------ITVLNLSSKTSRESASGAVAPYY 287
           G++G  R  +++GA +         D  I W        +  ++S +S    + AV   Y
Sbjct: 235 GKTGTLRPSHTEGAAWDGWGSNGDNDTGIQWEYQHSGFGIARITSVSSDGYIAAAVVQTY 294

Query: 288 VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSG 347
           +  D           +V P   +          W   AW +  GYP  VT++  RL+F+ 
Sbjct: 295 MPND-----------AVGPTKASY--------KWAKFAWNQVNGYPGTVTYYQQRLIFAA 335

Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVG 406
           S     +++ S  G + DF      G   P       V  ++   ++ + H    G LV 
Sbjct: 336 SIKYPQTIWCSKTGDYKDF------GKTSPIADDDRIVYTYAGKQVNEIRHLIDVGSLVA 389

Query: 407 CDTSLWLLSIS-LSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS 462
             +      +   +K L+     F      G  +  P++V +  +F+   G  ++ ++ S
Sbjct: 390 LTSGGQFQIVGDQNKTLTPTAFSFSSQGADGASSVAPITVSNIALFIQEKGSVVRDLAYS 449

Query: 463 TE-QGFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVV 500
            +  G++ +++T LA+HLFN  R++   +   P+S  W V
Sbjct: 450 FDVDGYQGSDLTVLANHLFNGYRLVDWTFSVVPYSAGWAV 489


>gi|294648405|ref|ZP_06725904.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
 gi|292825710|gb|EFF84414.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
          Length = 706

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 130/571 (22%), Positives = 217/571 (38%), Gaps = 68/571 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      K++F++GELSP +   R DL  +  G  +  N +P+  G L      +     
Sbjct: 1   MAKINLIKNNFTSGELSPHIWM-RTDLQQYRNGTKEMLNFLPIIEGGLKRRGGTEA---L 56

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            +   + R+  F I      LL+F   ++ ++ +  +         K+  TPYT +D K 
Sbjct: 57  AITAGAIRILPFIISHSTAYLLIFKPNQIDVLDINGTVV-------KSLSTPYTAQDIKE 109

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS-FTFDEIKFLPPPWLGDGMISGVKSNAK 179
           + Y          H  HP   L +++  + ++ +++D   F  PP      +  V++ A 
Sbjct: 110 ISYTQNRYQFYIAHSKHP---LAWLRASEDLTNWSYDPFDFYVPP------LEEVETPAL 160

Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLG--CHPPEWAKNTNYSIGAYIVADDKVYR 237
              S    +    T     +   D  +  + G  CH      N  Y   A  +       
Sbjct: 161 PLKSNEKNAGKVATLTASPYNIYDNSKRYQAGEICH--HTINNVKYYFRALRITQGNTPS 218

Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITV---LNLSSKTSRESASGAVAPYYVWGDI-K 293
             T+G       Y +  T  +    T   V   + ++    R      V+P  V G+I  
Sbjct: 219 FGTSGPEASPDYYWETTTVTEAQAFTAADVDKFVFINEGIVR--IDTYVSPSTVTGEILV 276

Query: 294 DVSKDGRSISVA-PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352
            +S D  +I+ A    Q +F+  +              GYP  VT +  RL+ +G+K   
Sbjct: 277 KLSTDIEAIANAWTLKQDIFEVSL--------------GYPRAVTMYQQRLVIAGTKTYP 322

Query: 353 LSVYLSSFGAFYDF---SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDT 409
             V+LS  G   +F   + DG+           +A +D   + +H     G  V+ G   
Sbjct: 323 NYVWLSRVGDVTNFLPTTSDGD-------SFTVSASSDQLTNVLHLAQSRGICVMTGGSE 375

Query: 410 SLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIK-YISGSTEQGFR 468
            +     S++   +      S        P+ VG  L+FV     RI+  +   +     
Sbjct: 376 LVISSQNSMTPTNTSILEHTSFGSTENIKPIKVGSELIFVQRGAERIRTLLYDYSIDSLT 435

Query: 469 FNEITQLADHLFNQR--ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
            NE+T LA H+  +     ++VY  EP SI+W VL        +L     + E +   AW
Sbjct: 436 SNELTVLASHIAKKSGGFKEMVYCAEPDSIIWFVL-----GNGKLASLTLNRE-QSVIAW 489

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557
            TH I     VLS  S P+   G   L+ LV
Sbjct: 490 STHDIGGT--VLSLTSLPS-TTGADRLYFLV 517


>gi|332875218|ref|ZP_08443051.1| carbohydrate binding domain protein [Acinetobacter baumannii
           6014059]
 gi|332736662|gb|EGJ67656.1| carbohydrate binding domain protein [Acinetobacter baumannii
           6014059]
          Length = 692

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 121/511 (23%), Positives = 196/511 (38%), Gaps = 74/511 (14%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K++ S+GELSP LL +R D+  +A G  K  N +PL  G     P   ++R   +   + 
Sbjct: 7   KNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRP-GTKFRS--IFAGAL 62

Query: 68  RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK--TPY-TFKDNKSLEYA 124
           R+  F        LL+ G   L++        ++P  +   Y+  TPY T +  + ++YA
Sbjct: 63  RLIPFIANSENTYLLILGVSFLKV--------YNPRTYAVVYEAVTPYNTAQKVREVQYA 114

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                  FV  D P   LL   D     F        P   LG        S   +++S 
Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244
           + T   ++ S                    P W+    Y  G  ++   K +R+      
Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHTSKTWRATI---- 208

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK-----DVSKDG 299
            D  G    AT  +     W  V N ++     S+ G++      G +K     D S+  
Sbjct: 209 -DNKGVEPSATTSE-----WEEVTNEAANVFTPSSVGSIVEIN-GGQVKITQYVDPSRVN 261

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357
             + V   S     A     SW +   A+    GYP  V F   RL+F+ +K     ++ 
Sbjct: 262 GEVLVKLTSTVQAIAK----SWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWF 317

Query: 358 SSF---GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414
           S     G F + + D +        A + A +   +  I  +   G  V +       + 
Sbjct: 318 SRIGDDGNFLETTQDAD--------AFSIASSSAQSDNILHLSQRGGVVALTGGAEFLIN 369

Query: 415 SISLSKGLSIDFRRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472
           S       S      +  GV A   P  VG+ L+FV   G R++ +S   E  G    E+
Sbjct: 370 SQGPLTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPEL 429

Query: 473 TQLADHLFNQR--ILQLVYQEEPHSIVWVVL 501
           +Q+A H+      I +L +Q+ P+SIVW+V+
Sbjct: 430 SQIAPHIPENHAGIKELTFQQTPNSIVWIVM 460


>gi|293609614|ref|ZP_06691916.1| predicted protein [Acinetobacter sp. SH024]
 gi|292828066|gb|EFF86429.1| predicted protein [Acinetobacter sp. SH024]
          Length = 692

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 117/508 (23%), Positives = 195/508 (38%), Gaps = 68/508 (13%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K++ S+GELSP LL +R D+  +A G  K  N +PL  G     P   ++R   +   + 
Sbjct: 7   KNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRP-GTKFRS--IFAGAL 62

Query: 68  RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT--PY-TFKDNKSLEYA 124
           R+  F        LL+ G   L++        ++P  +   Y+T  PY T +  + ++YA
Sbjct: 63  RLIPFIANSENTYLLILGVSFLKV--------YNPRTYAVVYETVTPYNTAQKVREVQYA 114

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                  FV  D P   LL   D     F        P   LG        S   +++S 
Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244
           + T   ++ S                    P W+    Y  G  ++   K +R+      
Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHSGKTWRATI---- 208

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304
            D  G    AT  +     W  V N ++     S  G++    + G    +++      V
Sbjct: 209 -DNKGVEPTATTSE-----WEEVTNEAANVFTPSNVGSIIE--INGGQVKITQYVDPSRV 260

Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
             +      + V  +  SW +   A+    GYP  V F   RL+F+ +K     ++ S  
Sbjct: 261 NGEVLVKLTSAVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 320

Query: 361 ---GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417
              G F + + D +        A + A +   +  I  +   G  V +       + S  
Sbjct: 321 GDDGNFLETTQDAD--------AFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQG 372

Query: 418 LSKGLSIDFRRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475
                S      +  GV A   P  VG+ L+FV   G R++ +S   E  G    E++Q+
Sbjct: 373 PLTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLISPELSQI 432

Query: 476 ADHLFNQR--ILQLVYQEEPHSIVWVVL 501
           A H+      I +L +Q+ P+SIVW+V+
Sbjct: 433 APHIPENHAGIKELTFQQTPNSIVWIVM 460


>gi|282848883|ref|ZP_06258273.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC
           17745]
 gi|282581388|gb|EFB86781.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC
           17745]
          Length = 772

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 108/577 (18%), Positives = 222/577 (38%), Gaps = 78/577 (13%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           +F+ GE+SP +  SR DL  +   + ++ N++   YG +      Q     +   +  R+
Sbjct: 11  AFTTGEVSPDV-SSRFDLEQYKSALLEAENVVIRPYGAVAKRQGSQYVGQVKYSDKPTRL 69

Query: 70  FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF-GKTYKTPYTFKDNKSLEYAVFGS 128
           F F+       +L FGDK +++        W+  ++ G    TP+T      L  +  G 
Sbjct: 70  FEFTTNTNNSFMLEFGDKYIRV--------WNYGVYTGIEVTTPFTSDILFDLNCSQSGD 121

Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188
                   +P   L    D D   +  +  K    P+  D + + V S   ++     +S
Sbjct: 122 VMFICSGKYPIQTLSRYSDTD---WRLEAYKLTEQPY--DTINTDVNSTVTVTGDTIRSS 176

Query: 189 TARITSDM--------------------KIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAY 228
                +DM                     + +  +K RS   G +      N NY++ +Y
Sbjct: 177 KDLFNADMVGMVMQLGYFVAAVHTKNTGTVVEKKEK-RSFMGGFNKWNEYNNINYNVESY 235

Query: 229 IVADDKVYRSLT----TGRSGDRFGYSKGATYV--------KDNNITWITVLNLSSKTSR 276
               D  ++  T    TG    +   + G T+          D N+T    +  ++K   
Sbjct: 236 STDQDLAWKFTTHGTWTGTVKLQITTNNGTTWKDYRTYSSNNDYNVTDAGKIEPNAKLRI 295

Query: 277 ES--ASG------AVAPYYVWGDIK-DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWG 327
           +S   SG      ++ PY  WG ++     D +++ +   +  +     S   W M +WG
Sbjct: 296 QSDIKSGECNVDLSILPYTTWGIVEFKEFVDSKTMKINILNGIVENEATS--KWKMGSWG 353

Query: 328 EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTD 387
              GYP   TF+ +R + + +  +   +++S  G + +F ++   G      ++T  V +
Sbjct: 354 RSNGYPKLCTFYQDRFVVAATNKNPNYIWMSRTGDYPNFGVEKVEGTITDDSSITLPVIN 413

Query: 388 FSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYACPPVSVGDCL 446
                I  + P  + +++      W++S   +    + + +  +  G  +C P  +G+  
Sbjct: 414 RKMYEIRHLVPANDLIILTSGNE-WIVSGDKTITPTNCNLKTQTQRGALSCEPQFIGNRC 472

Query: 447 VFVCGVGRRIKYISGSTE------QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500
           VFV   G  ++ +  S E      Q       T++  +L     +   Y ++P SI++ +
Sbjct: 473 VFVQERGGTVRDMGYSYESDNYTGQDLTLFVKTRVRGYL----TITSAYAQDPDSIIYYI 528

Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537
               +      + C      +  + W +H +++  Y+
Sbjct: 529 RNDGE------INCLTYIPEQKVYGW-SHFVTNGKYL 558


>gi|195541813|gb|ACF98016.1| hypothetical protein [uncultured bacterium 878]
          Length = 926

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 66/293 (22%), Positives = 113/293 (38%), Gaps = 39/293 (13%)

Query: 288 VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSG 347
            WG  K ++    ++SV     + F    +  +W +  + +  GYPS VTF+  RL + G
Sbjct: 356 TWGYAK-ITAYTSAVSVTADVLSNFGGTAASSAWRLGLYSQGGGYPSCVTFYEGRLFWGG 414

Query: 348 SKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS---------TIHWMHP 398
                  V         D S+   Y  + P+   +    D + +          + WM  
Sbjct: 415 CPLAPTRV---------DGSMSSNYETFSPSSTASVVADDNAVAYPLDSGDVNNVLWMKD 465

Query: 399 FGEGVLVGCDTSLWLLSISLSKG----LSIDFRRVSGSGVY-ACPPVSVGDCLVFVCGVG 453
             +G+LVG     W++  +   G     ++   R +  G Y    PV  G  ++FV    
Sbjct: 466 DEKGLLVGTKGGEWVVRANTLNGALTPTNVKATRATTYGSYEGSQPVRTGKDIIFVQRKR 525

Query: 454 RRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLL 512
           R+++ ++ + E  GF   ++T L+ H+      QL +Q EP   VW+     D   P L 
Sbjct: 526 RKVRNLNYTYEIDGFNAGDLTILSGHIGRLEFGQLAFQSEPEGWVWMTR--GDGQLPVLT 583

Query: 513 GCRFSAEGEGDFAWHTHMISDKH--------YVLSAASFPNDNRGGTSLWMLV 557
             R     E    W   ++             V S  S P+ N     +W++V
Sbjct: 584 YDR----DEQKIGWSRQIMGGYQDAARRRPPIVRSVCSIPDPNDARDEVWLIV 632


>gi|78357587|ref|YP_389036.1| hypothetical protein Dde_2545 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
 gi|78219992|gb|ABB39341.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
          Length = 700

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 62/231 (26%), Positives = 98/231 (42%), Gaps = 19/231 (8%)

Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
           +P  V F+  RL F+G+     +++ S    +   ++       D   A+T  +     +
Sbjct: 279 WPGCVQFYQQRLCFAGTDEKPQTIWCSQSANYESMNISSPLRDDD---AVTVTIAADRVN 335

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVF 448
            I WM P    +LVG     W LS S    L+      RR +  G     P+ +G  ++F
Sbjct: 336 RIRWMMP-ARRLLVGTAGGEWQLSGSGDAPLTPVDAQLRRDTMHGSAGLMPLVIGQSILF 394

Query: 449 VCGVGRRIKYISGSTE-QGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDN 506
           V   GR ++    + E  G+   ++T LA+HL   +RI+   YQ+ P S+VW  L     
Sbjct: 395 VQRDGRTVREFRYALESDGYDAGDLTILAEHLMRGRRIVSWCYQQSPASVVWCAL----- 449

Query: 507 SFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV 557
           S   L    F  E E    WH H      +V +  + P D   G  +W+ V
Sbjct: 450 SDGTLAAMTFLREHE-VVGWHRH--DTDGFVEAVTAIPGDE--GDEVWLSV 495


>gi|257139843|ref|ZP_05588105.1| hypothetical protein BthaA_11681 [Burkholderia thailandensis E264]
          Length = 489

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 83/184 (45%), Gaps = 16/184 (8%)

Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF---SLDGEYGCYDPTKA 380
           S W   +GYP+ V+    RL  +GS G  + V+ S  G + DF   + DGE   YD    
Sbjct: 84  SMWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDGEAFGYDMASD 143

Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP-- 438
                   +++ I      GE   V   ++  +   +++         V    VY C   
Sbjct: 144 QVNQTVHLASAKILAALTQGEEFTVTGGSAGAITPTNIN---------VDSQSVYGCARA 194

Query: 439 -PVSVGDCLVFVCGVGRRIKYIS-GSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSI 496
            PV VG+ +V+V   G++++ ++       +R   +T+LA H+    I+ + +Q EP  +
Sbjct: 195 RPVRVGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEPTPV 254

Query: 497 VWVV 500
           VW+V
Sbjct: 255 VWMV 258


>gi|332160974|ref|YP_004297551.1| hypothetical protein YE105_C1352 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|325665204|gb|ADZ41848.1| Hypothetical phage protein [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|330862130|emb|CBX72294.1| hypothetical protein YEW_AK02310 [Yersinia enterocolitica W22703]
          Length = 657

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 100/504 (19%), Positives = 195/504 (38%), Gaps = 98/504 (19%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K +F+AGE+SPRL+  R D++ +A G     N + + +G ++  P  +     +   +  
Sbjct: 7   KTNFTAGEISPRLM-GRVDIARYANGAKTVENAVCVIHGGVMRRPGSRFAAKAKFGDQKA 65

Query: 68  RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127
           R+  +        +L FG+  ++    ++  +           +PYT     SL Y    
Sbjct: 66  RLIPYVFNRSQAYVLEFGNGYVRFY--QNGAQIGAGSTPYEIASPYTSAMLSSLNYVQGA 123

Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADT 187
            T   VH+D PP+ L      D +          P P+                      
Sbjct: 124 DTMFLVHQDVPPYRLQRKGQTDWV--------LEPAPF---------------------- 153

Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDR 247
                     I KP D+ R       P +W K    S+  ++     +  +L+   SG  
Sbjct: 154 ----------IVKPFDEIRDT-----PEKWCKP---SVKEFV--GSAITLTLSDAESG-- 191

Query: 248 FGYSKGATYVKDNNITWITV----LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303
            G   GA +V  +  +++ +    +++ + TS   A+G +                R++ 
Sbjct: 192 -GALTGAGWVGADVGSYVRINSGLVHIQAVTSAAVATGVI----------------RTVL 234

Query: 304 VAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFG 361
            A QS        S  +W    + W  + GYP   T +  RL+ +GS     ++++S  G
Sbjct: 235 SAVQSS-------SPGAWTREDAVWSAEFGYPGAATLYQQRLVLAGSPKYPQTIWMSETG 287

Query: 362 AFYDFSL----DGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417
            +  F L    D        +  +   V     +T+  +   GE  + G   S    +I+
Sbjct: 288 IYLSFELGTDDDDAISFTVSSDQINPIVHLAQMNTLIALTSTGEFTITGGGES----AIT 343

Query: 418 LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQL 475
            +   +I  +  S  G  +  PV VG  ++F+    R++  ++   +    +  N+++ L
Sbjct: 344 PT---NISVKNPSPYGCNSIKPVRVGTEIMFMQRANRKLFAVAYDPDSFVAYSANDLSVL 400

Query: 476 ADHLFNQRILQLVYQEEPHSIVWV 499
           ++H+     + + YQ+EP + +W+
Sbjct: 401 SEHITLSGAVDMAYQQEPDAFIWM 424


>gi|220903983|ref|YP_002479295.1| hypothetical protein Ddes_0709 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
 gi|219868282|gb|ACL48617.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
          Length = 689

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 116/247 (46%), Gaps = 22/247 (8%)

Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
           YP  V FH  R++ + +  +  + Y+S  G F +F         DP + L   +   S  
Sbjct: 278 YPGIVAFHQQRMVLAATPKNPQAFYMSRVGDFENFRKSRPLQDDDPVEYL---IASGSID 334

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLS----ISLSKG-LSIDFRRVSGSGVYACPPVSVGDCL 446
            + W   FG+ +L+G   S +  S     S++ G +SI  +   GS   A  P+ +G+ +
Sbjct: 335 AVTWAASFGD-LLIGTSGSEYKASGGDGASITAGNISITAQSYWGSAGLA--PIIIGNSI 391

Query: 447 VFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPK 504
           + V   G R++ +  S E+ G+  N+++ +A HLF    ILQ  YQ+ P S +W V   +
Sbjct: 392 LHVQRHGSRVRDLFYSLEKDGYAGNDLSIMAPHLFEGHTILQWAYQQTPGSTIWCV---R 448

Query: 505 DNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEE 564
           D+    LL   +  E +  + W   +   +  VLSAA+   + +G T + +      G+ 
Sbjct: 449 DDGL--LLAFTYMKEHD-IWGWSRQITQGR--VLSAAAISGE-KGDTLMLVTERRIDGQP 502

Query: 565 RSFTVRL 571
           R F  RL
Sbjct: 503 RIFLERL 509


>gi|83720451|ref|YP_441475.1| hypothetical protein BTH_I0919 [Burkholderia thailandensis E264]
 gi|83654276|gb|ABC38339.1| conserved hypothetical protein [Burkholderia thailandensis E264]
          Length = 405

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/182 (25%), Positives = 82/182 (45%), Gaps = 16/182 (8%)

Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF---SLDGEYGCYDPTKALT 382
           W   +GYP+ V+    RL  +GS G  + V+ S  G + DF   + DGE   YD      
Sbjct: 2   WNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDGEAFGYDMASDQV 61

Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP---P 439
                 +++ I      GE   V   ++  +   +++         V    VY C    P
Sbjct: 62  NQTVHLASAKILAALTQGEEFTVTGGSAGAITPTNIN---------VDSQSVYGCARARP 112

Query: 440 VSVGDCLVFVCGVGRRIKYIS-GSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVW 498
           V VG+ +V+V   G++++ ++       +R   +T+LA H+    I+ + +Q EP  +VW
Sbjct: 113 VRVGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEPTPVVW 172

Query: 499 VV 500
           +V
Sbjct: 173 MV 174


>gi|309702804|emb|CBJ02135.1| hypothetical phage protein [Escherichia coli ETEC H10407]
          Length = 807

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 110/494 (22%), Positives = 195/494 (39%), Gaps = 72/494 (14%)

Query: 21  LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYA 80
           +  R D++ +   + K  N I  +YG + + P  +   + +   R  R+  F        
Sbjct: 1   MYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGEAKYPTRKCRLIPFQFSTVQTY 60

Query: 81  LLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVH 134
            L FG   +++      V+  S+  +  A+       PY   D   +++         VH
Sbjct: 61  ALEFGHNYMRVIKDGAYVLNSSNVIYELAM-------PYADTDLFRIKFTQSADVLTLVH 113

Query: 135 KDHPPHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT 193
             +PP  L  Y  D  +I     ++     P+    +   VK  A      A T T  +T
Sbjct: 114 PAYPPKELRRYAHDNWQIV----DVTTKNGPFEDINVDETVKVYAS-----ASTGTITLT 164

Query: 194 SDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-R 247
           +   IF     G+   L   P     P W  +   +I     AD   YR+ T+G++G  R
Sbjct: 165 ASSAIFGAEQVGKLFYLE-QPAIDSVPVWETSKTTAINDVRRADSNYYRANTSGKTGTLR 223

Query: 248 FGYSKGATYV-------KDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
             +++G ++         D  I W   L+     +R +A               VS DG 
Sbjct: 224 PSHTEGMSWDGWGGTGDSDTGIQW-EYLHSGFGIARITA---------------VSSDGL 267

Query: 301 S-----ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355
           +     +S  P SQ +  A  S   W   AW    GYPS V ++  RL F+ S     ++
Sbjct: 268 TATATVVSYIP-SQVVGSANGSY-KWARYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTI 325

Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWL 413
           + S  G + DF      G  +P +     +  ++   ++ + H    G LV   +   + 
Sbjct: 326 WASRTGDYKDF------GKNNPIQDDDRIIYTYAGRQVNEIRHLIDVGNLVALTSGGEYT 379

Query: 414 LSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469
           +S   +K L+     F     +G    PP++V +  +F+   G  ++ ++ S +  G++ 
Sbjct: 380 ISGDQNKVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQG 439

Query: 470 NEITQLADHLFNQR 483
            ++T LA+HLF +R
Sbjct: 440 TDLTILANHLFQKR 453


>gi|254251749|ref|ZP_04945067.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158]
 gi|124894358|gb|EAY68238.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158]
          Length = 545

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 81/179 (45%), Gaps = 10/179 (5%)

Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL---DGEYGCYDPTKALT 382
           W   +GYP  V+ +  RL  +GS G    V+ S+ G +YDF+    DG+   YD      
Sbjct: 142 WNPTDGYPCAVSLYQQRLYAAGSSGYPERVWASATGLYYDFTPGTDDGDGFSYDVASDQV 201

Query: 383 TAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSV 442
             +   ++S I  +   GE   +         S+      +I+ R  S  G     PV V
Sbjct: 202 NQIMHLASSRILTVLTQGEEFTIDGG------SVGSITPTNINVRSQSIYGTARPRPVRV 255

Query: 443 GDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500
           G+ L+F     ++I+ ++       FR   +T+LA H+    ++ + +Q EP  +VW+V
Sbjct: 256 GNELIFPQRAAKKIRSMAYDFNTDSFRSQNLTRLAAHITESGVVDIAFQAEPTPVVWMV 314


>gi|330007163|ref|ZP_08305905.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3]
 gi|328535510|gb|EGF61970.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3]
          Length = 825

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 116/575 (20%), Positives = 210/575 (36%), Gaps = 92/575 (16%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           S + GE+SP L   R DL  +   + + RN I  + G + + P  +     +   R +R+
Sbjct: 9   SLAGGEISPSLY-GRIDLEKYQTSLRRCRNFIVRQSGGIENRPGFRFLGSAKYADRYSRL 67

Query: 70  FSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGK------TYKTPYTFKDNKSLEY 123
             F         L  GD   ++        WS               TP+       L++
Sbjct: 68  IPFQFSVSQTYALELGDHYFRV--------WSNGALVTDGGSPVEVATPWPVSVISELKF 119

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI- 182
                     H D+PP  +    + D  +                G    + ++  +++ 
Sbjct: 120 TQSADVMTVCHNDYPPLEIRRYGEADWRTAAVTTTS---------GPFQDLNTDDSVTVY 170

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIVADDKVYRS 238
           +   T +  +T+   IFK    G+   +     +    W  + +  +G      +  YR 
Sbjct: 171 ASGRTGSVTLTASSPIFKSQHVGKLFYMEQKAVDSVGRWETDKDIGVGDECRYQENFYRC 230

Query: 239 L---------------TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAV 283
           +               TTG S D +G          N + W  +          S  G  
Sbjct: 231 VDGGSNGTTGTVAPTHTTGDSWDGWGLGG------RNGVLWRYL---------HSGFGVC 275

Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS-------WFMSAWGEQEGYPSHV 336
               + GD    + D     V P+     +    VV        W   AW + +GYP  V
Sbjct: 276 RITAIAGDGLTATAD-----VVPRQDGEIELPAQVVGSTFATYKWAHYAWNDTDGYPGTV 330

Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDF-----SLDGEYGCYD-PTKALTTAVTDFSA 390
           T++  RL+F GS+    +++ S  G +++F      +D +   Y+   + L   +     
Sbjct: 331 TYYQQRLIFGGSRAFPQTIWCSRTGDYHNFYRSNPKVDDDAITYNYAGRQLNKILHLLDV 390

Query: 391 STIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVC 450
             +  +   GE  + G        +++ + G ++  +  +GS   A  P++VG   ++V 
Sbjct: 391 GQLIVLTSGGEFKVTGDSNG----NLTGTGGFAMSGQSFNGSSDLA--PINVGSVALYVQ 444

Query: 451 GVGRRIKYISGSTEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSF 508
             G  I+ +  S +Q  ++ +++T LA HLFN   I       +P S+ W        S 
Sbjct: 445 QKGSIIRDLFYSFDQDSYQSSDLTLLASHLFNGYSIRDWALSVQPFSVAWCA-----RSD 499

Query: 509 PRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASF 543
             LLG  +  E +  +AWH H +++  YV S  S 
Sbjct: 500 GMLLGLTYLRE-QQVYAWHPHPMTNG-YVESICSI 532


>gi|212703338|ref|ZP_03311466.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098]
 gi|212673248|gb|EEB33731.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098]
          Length = 703

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 86/174 (49%), Gaps = 12/174 (6%)

Query: 333 PSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAST 392
           PS V FH  R++ +G++    + YLS  G F +F         DP + L   +   S   
Sbjct: 291 PSVVAFHQQRMVLAGTRDSPQAFYLSRSGDFENFRKSRPLQDDDPVEYL---IASGSIDA 347

Query: 393 IHWMHPFGEGVLVGCDTSLWLLSISLSK----GLSIDFRRVSGSGVYACPPVSVGDCLVF 448
           I W   FG+ +L+G   S +  S + S      ++I  +   GS   A  P+ +G+ ++ 
Sbjct: 348 IAWAASFGD-LLLGTSGSEYKASGNGSAITPGNITITAQSYWGSAGLA--PIIIGNAILH 404

Query: 449 VCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVV 500
           V   G  ++ +  S E+ G+  N+++ LA HLF   R+ Q  YQ+ P S++W+V
Sbjct: 405 VQRHGAHVRDLFYSLEKDGYAGNDLSILAPHLFEGHRLRQWAYQQTPGSVLWIV 458


>gi|303327644|ref|ZP_07358084.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302862005|gb|EFL84939.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 681

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 115/520 (22%), Positives = 196/520 (37%), Gaps = 119/520 (22%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMP----LMQEYRDCRLDPR 65
           +F+ GE++P  L +R DL  +A  +    N +P  +G     P    L      C L P 
Sbjct: 7   NFTGGEVTP-TLSARYDLGRYANSLKIMENFLPNLHGDAYRRPGTYFLENLGEGCVLLP- 64

Query: 66  SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125
               FSF+   G    L FG+K L+IV V         +  +  ++PY   D   + YA 
Sbjct: 65  ----FSFNAEAGQNFALAFGEKSLRIVNVNGY------VVAEAMESPYALADVPEISYAQ 114

Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF---------LPPPWLGDG------- 169
            G      HKD+  H ++        +++   +               W G G       
Sbjct: 115 VGDVVYLAHKDYALHKVVRTGSAPAYAWSIGTVALNTSLAAPAAPTAAWQGGGGSYTLRY 174

Query: 170 MISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYI 229
            +S V ++ K S+  A  STA                    G +P +W +  +  +    
Sbjct: 175 KVSAVDADGKESLPSAVGSTAS-------------------GKYPTDWTEGNHCVLSWQA 215

Query: 230 V---ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
           V   A+  +YR  + G  G   G ++G ++   N                  A  A  P 
Sbjct: 216 VEGAAEYNIYRE-SAGYYG-FIGIAQGTSFDDQNY----------------EADIADTPK 257

Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346
             W    D +  G +++   Q   L     S  S++MS  G+ E       F  +R L  
Sbjct: 258 EDWDPFADGNNPG-TVTFHQQRMVLAGTRNSPQSFYMSRTGDFE------NFRKSRPL-- 308

Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406
               D +   L+S       ++DG                      I W   FG+ +L+G
Sbjct: 309 -QDDDPVEYQLAS------GTVDG----------------------IVWAASFGD-LLLG 338

Query: 407 CDTSLWLLS----ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS 462
             ++ +  +       +K  +I  +   GS   A  P+ +G+ ++     G R++ +  S
Sbjct: 339 TASAEYKATGDNGAITAKNCTITAQSYWGSAKIA--PIIIGNSVMHCQRHGSRVRDLYYS 396

Query: 463 TEQ-GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVV 500
            E+ G+  N+++ LA HLF+   I Q  +Q+ P S++W+V
Sbjct: 397 LEKDGYAGNDLSVLAPHLFDGHTIRQWAFQQTPGSVLWLV 436


>gi|225157020|ref|ZP_03724959.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2]
 gi|224802748|gb|EEG20999.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2]
          Length = 773

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 115/547 (21%), Positives = 203/547 (37%), Gaps = 80/547 (14%)

Query: 9   HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68
           ++F+AGE +P+L   R DL  +     +  N+  + YG              +     +R
Sbjct: 7   NNFTAGEWTPKL-DGRSDLQKYDAACRRLENMRVMPYGGARFRSAFGYVAKTKSAATPSR 65

Query: 69  VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128
           +  F        +L +    L++     S   +PAL  +   +PY      +++Y     
Sbjct: 66  LMPFQFSTEQKFMLEWAHLALRVY----SAGAAPALL-QEIASPYPAAAVFAIQYRQIND 120

Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188
               VH D+P   L    D D   +  + + +  PP L + +     +  KLS+S  D  
Sbjct: 121 VVYLVHPDYPVQRLARHADAD---WRLEAVDWAFPPMLDENV-----TETKLSLSAVDGV 172

Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI-----GAYIVADDKVYRSLTTGR 243
              +T+   +F+P   G    L  H  E A +T+ S+     G +  A   V    T   
Sbjct: 173 NVTMTASAALFQPGHVGSYWELR-HLKE-AASTSVSLATTSGGPFHSAAISVQGDWT-AN 229

Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE-SASG--------------------- 281
           S +R+  +       D   TW TV   ++++ R  SASG                     
Sbjct: 230 STERWYGTLSIERSLDGGTTWETVRKFTAESDRNISASGHQEELAQFRLKYQPTGDPFGA 289

Query: 282 ----AVAP--------------YYVWGDIKDVS-KDGRSISVAPQSQTLFQAGVSVVSWF 322
                 AP               YV   +K  +  D   + V    +    A   +  W 
Sbjct: 290 GVWVGKAPTNYVKARAMLETTDAYVTALVKVTAYTDSTHVKVTVIDKAATVAATDI--WC 347

Query: 323 MSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALT 382
            SAW    G+P  +  +  RL+F G++    +++ S    F +F    +YG  D      
Sbjct: 348 ESAWSPYRGFPRTIGLYEQRLIFGGTRHQPNTMWGSKTDDFENF----KYGEDDDAAVAY 403

Query: 383 TAVTDFSAS---TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSGVYA 436
           T    F+AS    + W+                + + +  + L+   I  R  S +G   
Sbjct: 404 T----FAASEQNNVQWVESLKRIQAATTAREFTVAAGNTDEPLTPSNIVVRSESANGAAH 459

Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQRILQLVYQEEPHS 495
             PV V D +++V    R++  ++ S E+ G+   ++T LA  +    + QL +  +P  
Sbjct: 460 LQPVLVNDAILYVERQSRKVMEMAYSIEKDGYASVDLTLLAAPVTESGVKQLAFARQPDP 519

Query: 496 IVWVVLE 502
           ++  V E
Sbjct: 520 LLLAVTE 526


>gi|212703239|ref|ZP_03311367.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098]
 gi|212673505|gb|EEB33988.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098]
          Length = 694

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 62/241 (25%), Positives = 102/241 (42%), Gaps = 28/241 (11%)

Query: 328 EQEG-YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK---ALTT 383
           E EG YPS V FH  RL F+ S    ++++LS  G F   +         P K   A+  
Sbjct: 270 EGEGNYPSQVFFHQQRLGFAASNSRPITIWLSRSGEFESMAKS------TPPKDDDAIEV 323

Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS----ISLSKGLSIDFRRVSGSGVYACPP 439
            +    AS I W+ P    +  G + S W L     ++L+   +    + +  G  A   
Sbjct: 324 TLAATQASRIVWLQPDRSALAFGTEGSEWTLEPSEGVALTPATASFQLQTTNGGSDAVAA 383

Query: 440 VSVGDCLVFVC-GVGRRIKYISGSTEQGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIV 497
           +SVG  +++V  G G   ++    +   +   ++  LA H+     ++   +Q+EP++++
Sbjct: 384 LSVGGSVLYVQRGAGAIREFAYNYSADKYLGQDLNILARHMLRDVDVVAWSWQQEPYAVL 443

Query: 498 WVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS-DKHYVLSAASFPNDNRGGTSLWML 556
           W VL     S   L G  +  E E    WH H  + D   V      P+D      +W L
Sbjct: 444 WSVL-----SDGTLAGLTYMKEQE-IVGWHRHTTAGDFVDVAGIPGTPDDQ-----VWFL 492

Query: 557 V 557
           V
Sbjct: 493 V 493



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 68/156 (43%), Gaps = 6/156 (3%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
           T++  + GE+SP LL+ R D   ++ G  + RN +P+  G +   P  +       D   
Sbjct: 6   TQNVLNGGEISP-LLRGRVDQPRYSTGAREMRNFVPMPQGGVTRRPGTRYLGTALGDGGR 64

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
              F FS   G   +L FGD+ +++ +             K +++P+   D +++ YA  
Sbjct: 65  LVPFVFSATQG--RMLEFGDRAMRVWLPDGRVVADEEGAPKIFESPFAAADLRAVRYAQS 122

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP 162
                F H  + P  L    D D   + + E+ F+P
Sbjct: 123 ADVIYFAHPGYAPRKLARHADDD---WRWSELTFMP 155


>gi|167041089|gb|ABZ05850.1| hypothetical protein ALOHA_HF400048F7ctg1g17 [uncultured marine
           microorganism HF4000_48F7]
          Length = 999

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 61/277 (22%), Positives = 119/277 (42%), Gaps = 43/277 (15%)

Query: 314 AGV-SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS----L 368
           AGV +   W + ++    GYP  V  +  RL+F+G+  +  +++ S    F++FS    L
Sbjct: 428 AGVGATTEWQLGSFSGTTGYPRTVQLYQQRLVFAGTAEESQTIFFSKTADFFNFSATEPL 487

Query: 369 DGEYGCYDPT------------KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416
             + G  D +             A++  ++  +   I W+    + + +G    ++ L  
Sbjct: 488 GQQTGQRDSSGRSIVGEQIFEDAAISLTISSDTVDQIEWISE-DQRLTIGTSGGIYQLYG 546

Query: 417 SLSKGLSIDFR-RVSGSGVYACPPVS----VGDCLVFVCGVGRRIKYIS-GSTEQGFRFN 470
           S        F   ++    +AC P +    VG+ L++V   GR+++ ++    +  +   
Sbjct: 547 STDDLTLTPFNFSITKVSAWACDPTALPAKVGNNLLYVQNNGRKLRELAFDKVQDQYSAA 606

Query: 471 EITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530
           ++T  ++ +    ++   YQ++P+S++W +         RL G  +  +     AWH H 
Sbjct: 607 DLTLRSEDISESGLIATAYQDQPYSVLWCLRNDG-----RLAGLTY-VDLLQMRAWHRHT 660

Query: 531 ISDKHY---------VLSAASFPNDNRGG-TSLWMLV 557
           I   HY         V S AS P   RG    L+M+V
Sbjct: 661 IGGAHYDDTHGSQAKVESIASIP---RGTHDQLYMIV 694


>gi|220918520|ref|YP_002493824.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219956374|gb|ACL66758.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 825

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 139/576 (24%), Positives = 211/576 (36%), Gaps = 117/576 (20%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVS---MPLMQEYRD----- 59
           + SF+AGEL PRL   R DL+ +  G+ ++RN      G  ++    P ++E +D     
Sbjct: 8   QGSFAAGELGPRL-HGRHDLAKYQVGLRRARNFFLSPEGAALNRPGTPFVREAKDSAAGV 66

Query: 60  ---CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK--TPYT 114
               RL P     F FS   G    L FG   ++  V   +T   P    + Y+  TPY 
Sbjct: 67  DRGARLIP-----FIFSEDLGQAYELEFGQGYVRFHV-GGATIADPLNSAQPYELATPYL 120

Query: 115 FKDNKSLEYAVFGSTAVFVHKDHPPHHL--LYIQDGDKISFTFDEIKFLPPP----WLGD 168
             D   L+YA  G       K + P  L  L     + +  +FD    +P P    +LG 
Sbjct: 121 AADLPRLKYAQQGDVVTLTCKGYDPRELRRLAHDSWELVPLSFD----VPAPNGVVYLGV 176

Query: 169 GMISGVKSNAKLSISQADTSTARITSDMKIFK----PLDKGRSIRLGCHPPEWAKNTNYS 224
             +  V ++A     Q       I  D    +    PL + R I +G     W     Y 
Sbjct: 177 EALENV-ADATHPARQWAWQVTEIWEDESGLQWETSPL-RVRKIAVGAGA-TWHTGFTYP 233

Query: 225 IGAYIVADDKVYRSLTTGRSGD-----RFGYSKGATY--------------VKDNNITWI 265
           +GA +    + ++S+     G        G    ATY              V ++N    
Sbjct: 234 LGACVSYAGQFWQSVIADNRGHVPEAVMVGDPPAATYPYWTPVGAVPDPFAVYESNAPTD 293

Query: 266 TVLNLSSKTSRESASGA-----------------------------VAPYYVWGDIKDVS 296
            VL    +T +  ASGA                             VA +   GD  D+S
Sbjct: 294 VVL-FPDRTIKLWASGAWTGVDGSRLVGRRVYRGRGTVFGYVGEFEVAEFRDTGDTPDLS 352

Query: 297 KDGRSISVAPQSQ---TLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353
                    PQ +   T+F     VV        EQ   PS VTFH  R    G+     
Sbjct: 353 YS------PPQGRNPFTVFGPAGEVVRL------EQ---PSVVTFHAERRSLLGTAQRPA 397

Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413
             +LS  G +Y+F         D   A    +       + W       +L+G  + +W 
Sbjct: 398 HAFLSRTGDYYNFDRHTPALVDD---AFELELAGRLREEVRWAV-GAAALLIGTQSGVWA 453

Query: 414 LSIS----LSKGLSIDFRRVSGSGVYACP---PVSVGDCLVFVCGVGRRIK-YISGSTEQ 465
           +       L  G +    + S    Y  P   P +VGD +++V   G  ++  +     Q
Sbjct: 454 IRPPSGEVLGPGKATAVPQSSAGSSYLDPLVVPSAVGDAVLYVRTKGSGVRDLVYDDGRQ 513

Query: 466 GFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVV 500
           GF  ++++ LA HLF    I    +QE+P S+ W+V
Sbjct: 514 GFVGSDLSLLAKHLFTGYSIKAWTFQEDPWSVAWLV 549


>gi|320175038|gb|EFW50151.1| 12 [Shigella dysenteriae CDC 74-1112]
          Length = 799

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 110/504 (21%), Positives = 204/504 (40%), Gaps = 71/504 (14%)

Query: 27  LSLHAQGVAKSRNLIPLRYGPLVSMPLMQ-------EYRDCRLDP-RSNRVFSFSIPDGG 78
           ++ +   + K  N I  +YG + + P  +         R CRL P + + V ++++  G 
Sbjct: 1   MAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQTYALEFGH 60

Query: 79  YALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHP 138
             + V  D  L  V+  S+  +  A       TPYT  D   +++         VH  +P
Sbjct: 61  QYMRVIKDGAL--VLNSSNVIYEIA-------TPYTEADLFRIKFTQSADVLTLVHPAYP 111

Query: 139 PHHLL-YIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI-SQADTSTARITSDM 196
           P  L  Y  D  ++     +          +G    +  +  +++ + A T T  +T+  
Sbjct: 112 PKELRRYAHDNWQLVDVVTK----------NGPFEDINIDESVTVYASASTGTITLTASA 161

Query: 197 KIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-RFGY 250
            IF     G+   L   P     P W  + + SIG    AD   YR++T G++G  R  +
Sbjct: 162 SIFGAEQVGKLFYLE-QPAVDSVPVWETSKSTSIGDIRRADSNYYRAVTAGKTGTLRPSH 220

Query: 251 SKGATYVKD-------NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303
           ++G ++            I W   L+     +R +A+                     IS
Sbjct: 221 TEGTSWDGWGGSGDDDTGIEW-EYLHSGFGIARITAANGTT------------ATAEVIS 267

Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363
             P SQ + +   S   W    W    GYP  V ++  RL F+ S     +++ S  G +
Sbjct: 268 YIP-SQVVGEDNASY-KWAKYTWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDY 325

Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM-HPFGEGVLVGCDT-SLWLLSISLSKG 421
            DF      G  +PT+     +  ++   ++ + H    G LV   +   ++++   +K 
Sbjct: 326 KDF------GKSNPTQDDDRIIYTYAGRQVNEIRHLIDVGSLVALTSGGEYVITGDQNKV 379

Query: 422 L---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLAD 477
           L   S  F     +G    PP++V +  +FV   G  ++ ++ S +  G++ N++T LA+
Sbjct: 380 LTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILAN 439

Query: 478 HLFNQR-ILQLVYQEEPHSIVWVV 500
           HLF +  I+   +   P+S  + +
Sbjct: 440 HLFQKHSIVDWCFSIVPYSSAFCI 463


>gi|118590938|ref|ZP_01548338.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614]
 gi|118436460|gb|EAV43101.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614]
          Length = 810

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/185 (23%), Positives = 81/185 (43%), Gaps = 13/185 (7%)

Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380
           W + AW    G+P  + +H NRL F+G+  +   ++ S    F +FS+       D   A
Sbjct: 386 WRLGAWSGTTGWPETIGWHKNRLAFAGTSEEPQKIWESQTEDFTNFSVSHVLKASD---A 442

Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKG----LSIDFRRVSGSGVYA 436
           +T  +     + I W+    + ++VG   ++  +  +  +      ++D +  +  G   
Sbjct: 443 VTAGILSGQVNRIQWLVDDND-LIVGTTRAVRAVGKATDQDPYGPENVDQKPETNFGAND 501

Query: 437 CPPVSVGDCLVFVCGVG---RRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEP 493
             P+ VG  L++    G   R + Y  GS   G     ++++  HLF   I    YQ+ P
Sbjct: 502 VSPIKVGSVLIYYGPYGTDMREMAYDFGS--DGRVSQAVSEVQSHLFQSGIAGACYQQYP 559

Query: 494 HSIVW 498
            S++W
Sbjct: 560 DSVIW 564


>gi|85059168|ref|YP_454870.1| hypothetical protein SG1190 [Sodalis glossinidius str. 'morsitans']
 gi|84779688|dbj|BAE74465.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans']
          Length = 662

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 84/186 (45%), Gaps = 15/186 (8%)

Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL----DGEYGCYDPTK 379
           S W +  GYP  VT +  RL+ +GS     +++ S  GA+  F L    D        + 
Sbjct: 255 SVWTDNLGYPGAVTLYQQRLVLAGSPKYPQTIWWSETGAYLSFELGTKDDAAISFTLSSD 314

Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLV-GCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP 438
            L   V     +T+  +   GE  +  G D ++   +IS+        +  S  G     
Sbjct: 315 QLNPIVHLAQMNTLIALTYGGEFTITSGNDAAITPTNISV--------KNPSPYGCNRIR 366

Query: 439 PVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLADHLFNQRILQLVYQEEPHSI 496
           P+ VG  ++F+   GR++  ++   +    +  N++T LA+H+    +  + YQ++P  +
Sbjct: 367 PLRVGTEILFIQRAGRKLYAVAYDPDSFVSYAANDLTVLAEHITAGGVRDMAYQQQPDGL 426

Query: 497 VWVVLE 502
           +W+V E
Sbjct: 427 IWLVRE 432



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 50/197 (25%), Positives = 80/197 (40%), Gaps = 21/197 (10%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K +F+AGE+SPRL+  R D+  +A G    +N + +  G ++  P  +     +   R  
Sbjct: 7   KTNFTAGEVSPRLM-GRVDIMRYANGAKAIQNGVVVVQGGVMRRPGTRFAAAAKYSDRPA 65

Query: 68  RVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSL 121
           R+  +        +L FGD  L++      VV  ++T +  A       +PY+     S+
Sbjct: 66  RLIPYVFNRSQAYVLEFGDGYLRVYQKGKPVVNANNTPYEIA-------SPYSADRLPSV 118

Query: 122 EYAVFGSTAVFVHKDHPPHHLL-------YIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174
            Y     T   VH    P+ L         ++    I   FDEI+  P  W        V
Sbjct: 119 NYVQGADTMFLVHPAVKPYRLQRRGQTDWVLEPAPFIVEPFDEIRETPKKWCRPSAKEFV 178

Query: 175 KSNAKLSISQADTSTAR 191
            S   L++S AD    R
Sbjct: 179 GSEVTLTLSDADPGENR 195


>gi|295096862|emb|CBK85952.1| hypothetical protein ENC_24250 [Enterobacter cloacae subsp. cloacae
           NCTC 9394]
          Length = 662

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 84/188 (44%), Gaps = 23/188 (12%)

Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383
           S W  + GYP  VT +  RL+ +GS     +++ S  G +  F    E G  D      T
Sbjct: 255 SVWTNEFGYPGAVTLYQQRLVLAGSPKYPQTIWWSETGVYLSF----EIGTEDDDAISFT 310

Query: 384 AVTDFSASTIHW--MHPF------GEGVLV-GCDTSLWLLSISLSKGLSIDFRRVSGSGV 434
             +D     +H   M+        GE  +  G D ++   +IS+        +  S  G 
Sbjct: 311 LSSDQLNPIVHLAQMNTLIALTYGGEFTITSGNDAAITPTNISV--------KNPSPYGC 362

Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLADHLFNQRILQLVYQEE 492
               PV VG  ++FV   GR++  ++   +    +  N++T LA+H+    +L + YQ++
Sbjct: 363 NGIRPVRVGTEIMFVQRAGRKLYAVAYDPDSFVSYSANDMTVLAEHITAGGVLDMAYQQQ 422

Query: 493 PHSIVWVV 500
           P + +W+V
Sbjct: 423 PDAFIWMV 430



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 81/192 (42%), Gaps = 21/192 (10%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K +F+AGE+SPRL+  R D++ +A G     N + +  G +V  P  +     +   + +
Sbjct: 7   KTNFTAGEVSPRLM-GRVDIARYANGAKIIENAVVVVQGGVVRRPGTRFAAATKHGDKKS 65

Query: 68  RVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSL 121
           R+  +        +L FGD  ++I      +V   +T +  A       +PYT     ++
Sbjct: 66  RLIPYVFNRSQAYMLEFGDGYMRIFQNGKQLVNEDNTPYEIA-------SPYTADMLPAV 118

Query: 122 EYAVFGSTAVFVHKDHPPHHLL-------YIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174
            Y     T   VH+   PH L         ++    I   FDE++  P  W    +   V
Sbjct: 119 NYVQGADTMFLVHQSVKPHRLQRRGQTDWVLEPAPFIVEPFDEVRDTPQKWCKPSVKEFV 178

Query: 175 KSNAKLSISQAD 186
            S   L++S AD
Sbjct: 179 GSEITLTLSDAD 190


>gi|262043403|ref|ZP_06016528.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039229|gb|EEW40375.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 664

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 83/184 (45%), Gaps = 15/184 (8%)

Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL----DGEYGCYDPTK 379
           S W ++ GYP  VT +  RL+ +GS     +++ S  G +  F L    D        + 
Sbjct: 257 SVWTDEFGYPGAVTLYQQRLVLAGSPRYPQTIWWSESGVYLSFELGTDDDDAISFTLSSD 316

Query: 380 ALTTAVTDFSASTIHWMHPFGE-GVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACP 438
            L   V     +T+  +   GE  +  G D ++   +IS+        +  S  G     
Sbjct: 317 QLNPIVHLAQMNTLIALTYGGEFTITAGNDAAITPTNISV--------KNPSPYGCNGIR 368

Query: 439 PVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLADHLFNQRILQLVYQEEPHSI 496
           PV VG  ++FV   GR++  ++   +    +  N++T LA+H+    ++ + YQ++P + 
Sbjct: 369 PVRVGTEIMFVQRSGRKLYAVAYDPDSYVAYSANDMTVLAEHITEGGVIDMAYQQQPDAF 428

Query: 497 VWVV 500
            W+V
Sbjct: 429 TWLV 432



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 79/189 (41%), Gaps = 21/189 (11%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K +F+AGE+SPRL+  R D+  +A G     N + +  G ++  P  Q     +   + +
Sbjct: 7   KTNFTAGEISPRLM-GRVDIDRYANGAKTLENSVVVVQGGVMRRPGSQFVAATKYGDKKS 65

Query: 68  RVFSFSIPDGGYALLVFGDKKLQI------VVVRSSTKWSPALFGKTYKTPYTFKDNKSL 121
           R+  +        +L FGD  L+I      +V   +T +  A       +PYT     S+
Sbjct: 66  RLIPYVFNRTQAYILEFGDGYLRIYQDGKQLVNDDNTPYEIA-------SPYTSDMLPSV 118

Query: 122 EYAVFGSTAVFVHKDHPPHHLL-------YIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174
            Y     T   VH+D  P+ L         ++    I   FDE++  P  W    +   V
Sbjct: 119 NYVQGADTMFLVHQDVKPYRLQRRGQTDWVLEPAPFIVEPFDEVRDTPQKWCKPSVKEFV 178

Query: 175 KSNAKLSIS 183
            S   L++S
Sbjct: 179 GSEITLTLS 187


>gi|303328570|ref|ZP_07359005.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302861336|gb|EFL84275.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 696

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/237 (22%), Positives = 101/237 (42%), Gaps = 19/237 (8%)

Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
           +PS V FH  RL ++ +    ++++LS  G   DF +           A+   +    A+
Sbjct: 277 WPSQVFFHQQRLGWAATANRPITIWLSRPG---DFEIMAASTPPKDDDAIEATLAATQAN 333

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFR-RVSGSGVYACPPVSVGDCLV 447
            I W+ P  + +  G + S W LS      L+   + F  + +  G  A   VSVG  ++
Sbjct: 334 RIVWLQPDRQSLTFGTEGSEWTLSAGEGVALTPSNVSFEMQTANGGDNATQAVSVGGGVL 393

Query: 448 FVCGVGRRIKYIS-GSTEQGFRFNEITQLADHLFNQRILQL-VYQEEPHSIVWVVLEPKD 505
           ++   G+ ++  +   +   +   ++T LA H+    ++    +Q+EP++++W  L    
Sbjct: 394 YLQRGGKAVRQFAYNYSADKYLGQDVTILARHILRDAVVTAWAFQQEPYAVLWCAL---- 449

Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAG 562
            S   L G  +  E +    WH H    +   ++A     D++     W LV    G
Sbjct: 450 -SDGTLAGLTYMPE-QDVMGWHRHDTDGRFEDVAAMPGTPDDQ----TWFLVRRGCG 500



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 66/160 (41%), Gaps = 16/160 (10%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPL-----MQEYRDCRL 62
           ++  + GE++P L++ R D   +  G  + RN +P+  G +   P      M      RL
Sbjct: 7   QNVLNGGEITP-LMRGRVDQPRYGTGAREMRNFVPMPQGGVTRRPGTRFLGMAHGDAARL 65

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
            P     F FS   G   +L FGDK L++ +             K +++PY   D   L 
Sbjct: 66  IP-----FVFSATQG--RMLEFGDKTLRVWLPDGRLVADENGEPKVFESPYAVGDLHELR 118

Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP 162
           +A         H+ + P  L    D D   + + E+ F+P
Sbjct: 119 FAQSADVVYLAHQGYAPRRLSRHADDD---WRWSELAFVP 155


>gi|262043657|ref|ZP_06016766.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259038995|gb|EEW40157.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 758

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 133/610 (21%), Positives = 220/610 (36%), Gaps = 110/610 (18%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K SF+AG LSP ++  + D    A  V   +N IPL  GP       Q     +    S+
Sbjct: 8   KRSFNAGILSP-VMYGQVDFDKWASAVKYMKNFIPLPQGPARRRGGTQYAGSVK--NSSD 64

Query: 68  RV----FSFSIPDG-------GYALLVFGDKKL---QIVVVRSSTKWSPALFGKTYKTPY 113
           RV    F FS  +        GY    F   +L   +  ++  ST W      +  K   
Sbjct: 65  RVWLASFQFSTTEAFILEFGPGYIRFWFNHAQLLDDENNILEVSTPWGAGDLTRNGKFGL 124

Query: 114 TFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG-DGMIS 172
           + + +  + Y          + ++P + L         +++  E  F   P+   +   S
Sbjct: 125 SLQQSADVIYITC------TNGNYPVYKLTR---NTNTNWSLAEASFSGGPFADINSDKS 175

Query: 173 GVKSNAKLSISQAD-----------TSTARITSDMKIFKPLDKGRSIRLGC--------- 212
            V    +  I   D           TS   IT++  IF+ L  G    +           
Sbjct: 176 SVVYTDQFRIWSEDGNDLPDGTPTTTSLCNITANTDIFQALHVGCLFYIEASTDAVDDDT 235

Query: 213 ----HPPEWAKNTN--YSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWIT 266
               + P WA  T   +S G +  +D K Y  +   ++G+                TW  
Sbjct: 236 GHSGYIPAWAAGTTETFSTGVFCRSDGKYYEDMDGTKTGN-------------TQPTW-- 280

Query: 267 VLNLSSKTSRESASGAVAPYYV----WGDIK------DVSKDGRSISVAPQSQTLFQAGV 316
               ++   R+ + G  + +      WG I+        S  G+ ++  P S  +     
Sbjct: 281 ----TAGAHRDGSGGDASLWRYSGGGWGIIEITAVNSATSATGKIVTELPPS--VRNTVG 334

Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376
               +    W +   YP    F   RL+F+G +     ++ S  G   +FS        +
Sbjct: 335 KTYKYAFGDWSDVLRYPQFAAFFRGRLVFAGRQ----KIWSSVAGDLQNFSPMTNGYEAE 390

Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW------LLSISLSKGLSIDFRRVS 430
              ++   + D +  T+ W+      + +G     +      L S+  +    ++     
Sbjct: 391 SDDSINDRIDD-TQDTMQWLVASAGKIFIGTAGYEFSYGEQSLTSVFGAGNTKVELNSTI 449

Query: 431 GSGVYACPPVSVGDCLVFVCGVGRRI---KYISGSTEQGFRFNEITQLADHLFNQRILQL 487
           GS         + D + FV   GR++    Y SGS    F       LA HLF   I+ L
Sbjct: 450 GSNEVQAE--RLFDRVAFVQRAGRKVMIAAYDSGS--DSFSATNSCILAPHLFTSEIIAL 505

Query: 488 VYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDN 547
            YQ+EP+ I+WV+LE       +LLG  + AE +    WH H       V S    P+ +
Sbjct: 506 AYQQEPNRILWVLLEEG-----KLLGLTYDAE-QNITGWHEHATGGA--VESIKVIPDID 557

Query: 548 RGGTSLWMLV 557
            G   LWM+V
Sbjct: 558 GGRDELWMVV 567


>gi|290968641|ref|ZP_06560179.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
 gi|290781294|gb|EFD93884.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
          Length = 1039

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 44/190 (23%), Positives = 91/190 (47%), Gaps = 4/190 (2%)

Query: 316 VSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCY 375
           V V ++  S+W ++ GYP    F  +RL+F+G+K +  S++ S  G + +FS++   G  
Sbjct: 570 VPVDAFAFSSWNDRNGYPKLSCFFQDRLVFAGTKKEPYSLWFSRTGDYNNFSVEKAEGTV 629

Query: 376 DPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRV-SGSGV 434
               A+   +   +   I  + P  + ++V    + W++S   +   +    +V +  G 
Sbjct: 630 TEDSAIKLDLIVRNLYEIRHLVPSND-LIVLTSGNEWIISGDTAITPTKCTPKVQTMRGA 688

Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEE 492
             C P  +G+ L++V   G  I+    S +   +  +E+   A HL  + +++   Y + 
Sbjct: 689 SNCKPWHIGNRLIYVQRDGGTIRDFGYSYDSDNYNGDELNLFASHLTKRHQMVSSAYCQN 748

Query: 493 PHSIVWVVLE 502
           P+S ++ V E
Sbjct: 749 PYSTLYFVRE 758



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 39/187 (20%), Positives = 79/187 (42%), Gaps = 18/187 (9%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M N   T++SF+ GE+SP + + R DL  +   + ++ N +   YG +      +     
Sbjct: 1   MQNVFITQNSFTTGEISPEVAE-RTDLEKYKSALLQAENAVVSPYGSVSRRTGSKYIGAI 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +   +   +  F        LL  G K +++        W      +   TP+ +   K 
Sbjct: 60  KYADKEAVLVPFMDSSDRSYLLEVGYKYIRV--------WKDETMEQEIDTPFEYP--KE 109

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L +   G TA      +P + LL+ +  +   F       +P P+  D +IS +++ + +
Sbjct: 110 LNFTQSGDTAFICSGRYPVYELLHGRYWELRKFD------IPKPYF-DDIISAIENVSDV 162

Query: 181 SISQADT 187
           + +++DT
Sbjct: 163 NYTESDT 169


>gi|298485990|ref|ZP_07004064.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
           NCPPB 3335]
 gi|298159467|gb|EFI00514.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
           NCPPB 3335]
          Length = 716

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/182 (23%), Positives = 88/182 (48%), Gaps = 12/182 (6%)

Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383
           S W + +GYPS  T +  RL+ +GS     +++ S  G + +F    E G  D   A++ 
Sbjct: 311 SVWNDFDGYPSTGTLYEQRLVAAGSPNYPQTIWESRTGEYLNF----ELGTKD-DDAMSF 365

Query: 384 AVTDFSASTIHWMHPFGEGVLVGCD-TSLWLLSISLSKGLSIDFRRVSGSGVYACP---P 439
            V+    + I  MH      LV       + ++  + K ++    ++    VY C    P
Sbjct: 366 NVSSDQINPI--MHVGQVKALVTLTYGGEFTVTGGVEKPITPTNIQIKNQSVYGCNGVRP 423

Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQRILQLVYQEEPHSIVW 498
           + +G+ L FV   GR+++ ++   +   +   +++ L++H     ++ + +Q+EP SI++
Sbjct: 424 IRIGNELYFVQRAGRKLRAMAYKYDSDSYGSPDMSVLSEHATKSGVVDMAFQQEPESILF 483

Query: 499 VV 500
           +V
Sbjct: 484 MV 485


>gi|291334457|gb|ADD94111.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161]
          Length = 206

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/126 (24%), Positives = 63/126 (50%), Gaps = 7/126 (5%)

Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481
           +I  ++ S +G      ++VG+  +F+    R+++ ++ + +  G+   ++T LA+H+  
Sbjct: 30  NILIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISE 89

Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
               QL YQ+EP+ ++W V         +L+G  +  E +   AWH H+        S A
Sbjct: 90  GGFKQLSYQQEPNQVIWGVRND-----GQLVGLTYQRE-QQVVAWHRHIFGGSAVCESVA 143

Query: 542 SFPNDN 547
           + P D+
Sbjct: 144 TIPTDD 149


>gi|291334666|gb|ADD94313.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695]
          Length = 189

 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 31/126 (24%), Positives = 63/126 (50%), Gaps = 7/126 (5%)

Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481
           +I  ++ S +G      ++VG+  +F+    R+++ ++ + +  G+   ++T LA+H+  
Sbjct: 31  NILIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISE 90

Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
               QL YQ+EP+ ++W V         +L+G  +  E +   AWH H+        S A
Sbjct: 91  GGFKQLSYQQEPNQVIWGVRNDG-----QLVGLTYQRE-QQVVAWHRHIFGGSAVCESVA 144

Query: 542 SFPNDN 547
           + P D+
Sbjct: 145 TIPTDD 150


>gi|54302254|ref|YP_132247.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9]
 gi|46915675|emb|CAG22447.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9]
          Length = 919

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 42/192 (21%), Positives = 78/192 (40%), Gaps = 10/192 (5%)

Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376
           S   W +  W    GYP   T+   RL  + +     +V+LS   +F DFS        D
Sbjct: 408 STYKWAIEIWRNSTGYPRCGTYFQQRLSMANTISHPQTVWLSRTDSFNDFSKTRPILADD 467

Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID----FRRVSGS 432
              ++   +     + I  + P    +L+     LW L+       S +     +  +  
Sbjct: 468 ---SMRYDINSLQVNEIFNIVPL-NSLLLFTSGGLWSLAQDQQGAFSAESPPSVKMQNYE 523

Query: 433 GVYACPPVSVGDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLF-NQRILQLVYQ 490
           G     P+  G   ++V    R ++ I  S +   F   ++T  A HLF ++R+++  Y 
Sbjct: 524 GANKLRPIVAGSTAIYVQQGDRIVRDIQFSWSSDSFEGVDLTVRASHLFKHKRVVEWAYA 583

Query: 491 EEPHSIVWVVLE 502
           + P  ++WV+ +
Sbjct: 584 KNPDKLIWVIFD 595


>gi|146276492|ref|YP_001166651.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC
           17025]
 gi|145554733|gb|ABP69346.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC
           17025]
          Length = 754

 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 47/195 (24%), Positives = 83/195 (42%), Gaps = 15/195 (7%)

Query: 315 GVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG--EY 372
           GV    W   AW ++ GYPS V  +  RL  + +  +  +V+ S+ G F DF LDG  + 
Sbjct: 308 GVPTYRWSEGAWSKRYGYPSTVEIYEQRLAAAATPSEPRTVWFSAVGDFQDF-LDGTEDD 366

Query: 373 GCYDPTKALTTAVTDF-----SASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFR 427
             +  T A +T+V         A+ +H +   GE      +T   ++        +  F 
Sbjct: 367 QSFAYTVAGSTSVNRIINLQRGAAGLH-IFALGEEYSTRSETRSSVIGPK-----NAVFG 420

Query: 428 RVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEI-TQLADHLFNQRILQ 486
             SG G     P++     +F+    +R+  +  S +Q    + + ++ A H+      Q
Sbjct: 421 LDSGVGSSTAKPITPSGNPIFISRDRKRVLEMVYSLDQDRPVSRVLSRTAQHVGGAGFEQ 480

Query: 487 LVYQEEPHSIVWVVL 501
           +V+Q  P    W+ L
Sbjct: 481 IVWQAAPEPTAWLRL 495


>gi|226940469|ref|YP_002795543.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9]
 gi|226715396|gb|ACO74534.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9]
          Length = 874

 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 58/271 (21%), Positives = 114/271 (42%), Gaps = 22/271 (8%)

Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSK-DGRSISVAPQSQTLFQAGVSVVSWFMSAW 326
           + L+   S   +  A+ P  + G I  V+  +G S   AP     +  G S  ++     
Sbjct: 399 VQLAVTDSGGGSGAALEPVIIDGAITAVNVINGGSGYFAPVVSVSYAGGGSGATFGQPVV 458

Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK---ALTT 383
                YP  V++   R  F+G+     +++++  G       +   G   P +    +  
Sbjct: 459 KSSGDYPGAVSYFEQRRCFAGTTRKPQNIWMTKSGT------ESNMGYSLPVRDDDRIAF 512

Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL---SIDFRRVSGSGVYACPPV 440
            V+   A+TI  + P  + +L+   ++ W ++   S  +   SI  R  S  G     PV
Sbjct: 513 RVSAREANTIRHIVPLAQ-LLLLTSSAEWRVTSVNSDAITPRSISVRPQSYIGASNVQPV 571

Query: 441 SVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVW 498
            + + L++    G  ++ ++ + +  GF   +++  A HLF+   I+ + + + P  +VW
Sbjct: 572 IINNTLIYASARGGHVRELAYNWQAGGFVTGDLSIRAPHLFDDFEIVDMAFGKSPQPVVW 631

Query: 499 VVLEPKDNSFPRLLGCRFSAEGEGDFAWHTH 529
            V     +S   L+G  +  E +   AWH H
Sbjct: 632 FV-----SSSGCLIGLTYVPEQQVG-AWHWH 656


>gi|296532340|ref|ZP_06895077.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
 gi|296267336|gb|EFH13224.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
          Length = 626

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 40/175 (22%), Positives = 74/175 (42%), Gaps = 10/175 (5%)

Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380
           W  +A+    G+P    FH +RL+  GS+     ++LS  G  ++F L    G     +A
Sbjct: 222 WDEAAFSAVRGWPVTACFHQDRLVLGGSRDLPNRLWLSRSGDLFNFDL----GSGLDDQA 277

Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSG---VYAC 437
           +   +     + I  +   G  + V    + W+++       SI   R +  G       
Sbjct: 278 IEFGLLSDQVNAIRAVFS-GRHLQVFTSGAEWMVTGEPMTPASIQLHRQTRIGSPVARII 336

Query: 438 PPVSVGDCLVFVCGVGRRI-KYISGSTEQGFRFNEITQLADHLFNQRILQLVYQE 491
           PPV V    +FV   G+ + +Y     +Q ++ N++  +A HL  Q  + + Y +
Sbjct: 337 PPVDVDGSTIFVARSGQAVHEYAYTDVQQAYQANDLALVARHLV-QTPVSMAYDQ 390


>gi|323699364|ref|ZP_08111276.1| hypothetical protein DND132_1955 [Desulfovibrio sp. ND132]
 gi|323459296|gb|EGB15161.1| hypothetical protein DND132_1955 [Desulfovibrio desulfuricans
           ND132]
          Length = 698

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 49/198 (24%), Positives = 88/198 (44%), Gaps = 14/198 (7%)

Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSGVYA 436
           A+   ++   A+ I ++ P    + +G     W LS S S  L+   +   +    G   
Sbjct: 330 AIEVTLSGRQANAIEFIVPR-RALWIGTAGGEWTLSASSSDPLTPSNVKAAQEGTGGASG 388

Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHS 495
             P +VG   ++V   GR+I+ +S   E   +   ++T L++H+    + QL Y +EP S
Sbjct: 389 VRPEAVGFAALYVQRAGRKIREMSYRYESDAYVSKDLTLLSEHITEGGLTQLAYVQEPDS 448

Query: 496 IVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWM 555
           I++ V   + +    L+   +  + E   A  + +++D   V  AAS  ND      LW+
Sbjct: 449 ILYGV---RGDGI--LVALTYVPDQE--VAAWSRIVTDG-VVERAASVYNDAEKRDELWI 500

Query: 556 LVALSA-GEERSFTVRLN 572
            V  +  GE R +   L 
Sbjct: 501 TVLRTVNGETRRYVEYLE 518



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 324 SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSL 368
            AWGE + YPS V F+  RL+ + ++    +++LS  G F DF L
Sbjct: 164 EAWGEND-YPSAVCFYEQRLVLAATRSRPATLWLSRTGEFSDFRL 207


>gi|325971691|ref|YP_004247882.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy]
 gi|324026929|gb|ADY13688.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy]
          Length = 551

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 37/144 (25%), Positives = 65/144 (45%), Gaps = 12/144 (8%)

Query: 14  GELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFS 73
           GE+SP+L   R DL ++ QG    ++   +  G +   P ++            R   F+
Sbjct: 11  GEISPKL-GGRLDLEMNTQGCEILKDFRNMLQGGITRRPPLKHVAQTV----RGRTIPFT 65

Query: 74  IPDGGYALLVFGDKKLQI----VVVRSSTKWSPALFGKTY-KTPYTFKDNKSLEYAVFGS 128
           +  G   L+   +KKL++    V+   +  + P+  G  Y  T Y   D  S++YA +  
Sbjct: 66  LSSGESFLVELSNKKLRVWRKGVLGFYTVTFLPS--GNDYLPTDYLEADVWSIQYAQYYD 123

Query: 129 TAVFVHKDHPPHHLLYIQDGDKIS 152
               VHKD+ PH ++Y  +  + S
Sbjct: 124 RLYLVHKDYQPHVVVYAAEAFQFS 147


>gi|144898783|emb|CAM75647.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
           MSR-1]
          Length = 635

 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 68/165 (41%), Gaps = 11/165 (6%)

Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380
           W   A     G+P  V FH +RL+  GS+     ++LS     ++F L    G     +A
Sbjct: 230 WEEQALSAVRGWPVSVCFHQDRLVIGGSRDQPNRLWLSKSSDLFNFDL----GEALDDEA 285

Query: 381 LTTAVTDFSASTIHWMHPF-GEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV---YA 436
           +  A+     + I   H F G  + V    + W++S       SI   R +  G      
Sbjct: 286 IEFALLSDQVNAIR--HVFSGRHLQVFTSGAEWMVSGQPLTPSSIQLTRQTRVGSPIDRT 343

Query: 437 CPPVSVGDCLVFVCGVGRRIK-YISGSTEQGFRFNEITQLADHLF 480
            PP  V    +FV   G+ ++ ++    EQ ++  ++  LA H+ 
Sbjct: 344 VPPRDVDGATLFVSRNGKDLREFLFADVEQAYQSGDLAMLAKHVM 388


>gi|209966375|ref|YP_002299290.1| hypothetical protein RC1_3113 [Rhodospirillum centenum SW]
 gi|209959841|gb|ACJ00478.1| conserved hypothetical protein [Rhodospirillum centenum SW]
          Length = 638

 Score = 41.6 bits (96), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 60/150 (40%), Gaps = 14/150 (9%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      K +F+ GELSP LL  R DL  +  G    RN++ L  G +   P        
Sbjct: 1   MTRLRSVKAAFTGGELSPDLL-GRGDLRSYETGALALRNVLILPTGGVTRRPGTAYLATL 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
              P   R+ +F+       LL F D++L++    ++            +TP+T      
Sbjct: 60  ---PGPGRLAAFAFDTEQAYLLAFTDRRLEVFRDGATE--------AVLETPWTAGQLAQ 108

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDK 150
           L +       +  H D PP  +  ++ GD+
Sbjct: 109 LAWTQSADVLLVCHPDVPPRRI--VRSGDR 136


>gi|291334718|gb|ADD94364.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890]
          Length = 135

 Score = 41.6 bits (96), Expect = 0.36,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 6/82 (7%)

Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525
           G+   ++T LA+H+      QL YQ+EP+ ++W V      +  +L+G  +  E +   A
Sbjct: 21  GYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGV-----RNDGQLVGLTYQRE-QQVVA 74

Query: 526 WHTHMISDKHYVLSAASFPNDN 547
           WH H+        S A+ P D+
Sbjct: 75  WHRHIFGGSAVCESVATIPTDD 96


>gi|83313369|ref|YP_423633.1| hypothetical protein amb4270 [Magnetospirillum magneticum AMB-1]
 gi|82948210|dbj|BAE53074.1| hypothetical protein [Magnetospirillum magneticum AMB-1]
          Length = 634

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 71/172 (41%), Gaps = 18/172 (10%)

Query: 326 WGEQ-----EGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF----SLDGEYGCYD 376
           W EQ      G+P  V FH  RL   GS+G    ++LS     ++F     LD E   + 
Sbjct: 229 WEEQSFSPLRGWPVSVCFHQGRLAIGGSRGLPNRLWLSKSMDLFNFDLGTGLDDEAIEFS 288

Query: 377 PTKALTTAVTD-FSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVY 435
                  A+   FS   +       E ++VG  + L    I L++       RV      
Sbjct: 289 LLSTQVDAIRAVFSGRHLQVFTSGAEWMVVG--SPLTPTKIQLNRQT-----RVGSPVDR 341

Query: 436 ACPPVSVGDCLVFVCGVGRRIK-YISGSTEQGFRFNEITQLADHLFNQRILQ 486
           + PP  V     FV   GR ++ ++    +Q ++ N+++ +A H+ N  + Q
Sbjct: 342 SVPPRDVDGATHFVSRSGRDLREFLFADVDQAYQANDLSMVAKHVMNTPVDQ 393


>gi|291334514|gb|ADD94167.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201]
 gi|291336446|gb|ADD96001.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073]
          Length = 153

 Score = 41.2 bits (95), Expect = 0.42,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 6/82 (7%)

Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525
           G+   ++T LA+H+      QL YQ+EP+ ++W V         +L+G  +  E +   A
Sbjct: 21  GYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG-----QLVGLTYQRE-QQVVA 74

Query: 526 WHTHMISDKHYVLSAASFPNDN 547
           WH H+        S A+ P D+
Sbjct: 75  WHRHIFGGSAVCESVATIPTDD 96


>gi|167032763|ref|YP_001667994.1| hypothetical protein PputGB1_1755 [Pseudomonas putida GB-1]
 gi|166859251|gb|ABY97658.1| conserved hypothetical protein [Pseudomonas putida GB-1]
          Length = 774

 Score = 40.8 bits (94), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 39/174 (22%), Positives = 69/174 (39%), Gaps = 16/174 (9%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           + SFSAGE++P    +R DL+ +   +   RN + L  G   +    +   + +      
Sbjct: 6   QPSFSAGEVAPATY-ARVDLARYYTALKTCRNFVVLPEGGAQNRSGTRFITEVKDSAART 64

Query: 68  RVFSFSIPDGGYALLVFGDKKLQIV-----VVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
           R+  F        +L FG+  ++ +     VV   T +  A       +PYT      L+
Sbjct: 65  RLIPFQFSTEQTYILEFGNLYIRFISMGGQVVSGVTPYEIA-------SPYTTAQLPDLK 117

Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176
           +         VH DHPP  L  +      ++T   I F P      G+++  ++
Sbjct: 118 FTQSADVMTIVHPDHPPRELSRLA---PTNWTLTAITFEPGIAAPTGLVATART 168


>gi|317152064|ref|YP_004120112.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2]
 gi|316942315|gb|ADU61366.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2]
          Length = 698

 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 31/125 (24%), Positives = 60/125 (48%), Gaps = 5/125 (4%)

Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL---SIDFRRVSGSGVYA 436
           A+   ++   A+ I ++   G+ + VG     W L  SL   +   SI   +    G  A
Sbjct: 330 AIEVTLSGRQANAIEFLVARGK-LWVGTAGGEWTLGGSLGDPVTPESIKASQEGSCGASA 388

Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHS 495
             P +VG   +++   GR+I+ ++   E   +   ++T L++H+    + Q+ Y +EP S
Sbjct: 389 TRPEAVGFATLYIQRAGRKIREMAYRYESDAYVSRDLTILSEHITKPGLTQMAYVQEPDS 448

Query: 496 IVWVV 500
           I++ V
Sbjct: 449 ILYCV 453


>gi|288959323|ref|YP_003449664.1| hypothetical protein AZL_024820 [Azospirillum sp. B510]
 gi|288911631|dbj|BAI73120.1| hypothetical protein AZL_024820 [Azospirillum sp. B510]
          Length = 632

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 56/141 (39%), Gaps = 12/141 (8%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSN 67
           K +F+AGE+S RLL  R DL  +  G    RNL      P   +          L P   
Sbjct: 9   KTNFTAGEVSRRLL-GRGDLKAYDNGALALRNLF---IDPTGGVTRRSGLAFTALAPGDG 64

Query: 68  RVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127
           R+ +F        LLVF D++  I V +  ++ +      +   P+T      + +    
Sbjct: 65  RLVAFERNSEQTYLLVFTDRR--IDVFQGGSRLA------SVAAPWTLTQLAQITWTQSA 116

Query: 128 STAVFVHKDHPPHHLLYIQDG 148
            T +  H D PP  L    DG
Sbjct: 117 DTLLVCHPDLPPRKLTRGDDG 137


>gi|41179374|ref|NP_958682.1| Bbp13 [Bordetella phage BPP-1]
 gi|45569506|ref|NP_996575.1| hypothetical protein BMP-1p12 [Bordetella phage BMP-1]
 gi|45580757|ref|NP_996623.1| hypothetical protein BIP-1p12 [Bordetella phage BIP-1]
 gi|40950113|gb|AAR97679.1| Bbp13 [Bordetella phage BPP-1]
          Length = 681

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 87/202 (43%), Gaps = 13/202 (6%)

Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
           YP+ V++   R  F+G+     +++++  G   + ++       D  + +   V    A+
Sbjct: 271 YPAAVSYFEQRRCFAGTTNKPQNIWMTRSGT--ESAMSYSLPVRDDDR-VAFRVAAREAN 327

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSIS--LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFV 449
            I  + P  E +L+       + S++       +I  R  S  G     PV V +  ++ 
Sbjct: 328 AIRHIVPLTELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGATDVQPVVVNNTTIYG 387

Query: 450 CGVGRRIKYISGS-TEQGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNS 507
              G  ++ ++ +    GF   +++  A HLF N  IL + Y + P  IVW +     +S
Sbjct: 388 AARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFI-----SS 442

Query: 508 FPRLLGCRFSAEGEGDFAWHTH 529
             +LLG  +  E +   AWH H
Sbjct: 443 SGKLLGLTYVPEQQIG-AWHQH 463


>gi|291336965|gb|ADD96491.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C1587]
          Length = 474

 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 17/54 (31%), Positives = 26/54 (48%)

Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367
            G  +   +  +W   +GYP   TFH  RL F G K    +++ S    F+DF+
Sbjct: 243 GGTFIDGGYEDSWSGSKGYPRTATFHEGRLYFGGVKSRPNTIFASRVARFFDFN 296


>gi|291336926|gb|ADD96454.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787]
          Length = 158

 Score = 38.1 bits (87), Expect = 3.8,   Method: Composition-based stats.
 Identities = 20/79 (25%), Positives = 45/79 (56%), Gaps = 1/79 (1%)

Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481
           +I  ++ S +G      ++VG+  +F+    R+++ ++ + +  G+   ++T LA+H+  
Sbjct: 60  NILIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISE 119

Query: 482 QRILQLVYQEEPHSIVWVV 500
               QL YQ+EP+ ++W V
Sbjct: 120 GGFKQLSYQQEPNQVIWGV 138


>gi|187476936|ref|YP_784960.1| phage protein [Bordetella avium 197N]
 gi|115421522|emb|CAJ48031.1| phage protein [Bordetella avium 197N]
          Length = 681

 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 83/202 (41%), Gaps = 13/202 (6%)

Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
           YP+ V++   R  F+G+     +++++  G     S        D    +   V    A+
Sbjct: 271 YPAAVSYFEQRRCFAGTINKPQNIWMTRSGTESAMSYSLPVRSDD---RVAFRVAAREAN 327

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSIS--LSKGLSIDFRRVSGSGVYACPPVSVGDCLVFV 449
            I  + P  E +L+       + S++       +I  R  S  G     PV V +  ++ 
Sbjct: 328 AIRHIVPLTELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGATDVQPVVVNNTAIYG 387

Query: 450 CGVGRRIKYISGS-TEQGFRFNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNS 507
              G  ++ ++ +    GF   +++    HLF N  IL + Y + P  IVW +     +S
Sbjct: 388 AARGGHVRELAYNWQANGFVTGDLSLRCAHLFDNLNILDMAYAKAPQPIVWFI-----SS 442

Query: 508 FPRLLGCRFSAEGEGDFAWHTH 529
             +LLG  +  E +   AWH H
Sbjct: 443 SGKLLGLTYVPEQQIG-AWHQH 463


Searching..................................................done


Results from round 2




>gi|254781208|ref|YP_003065621.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040885|gb|ACT57681.1| hypothetical protein CLIBASIA_05575 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|317120673|gb|ADV02496.1| hypothetical protein SC1_gp080 [Liberibacter phage SC1]
 gi|317120817|gb|ADV02638.1| hypothetical protein SC1_gp080 [Candidatus Liberibacter asiaticus]
          Length = 578

 Score =  687 bits (1772), Expect = 0.0,   Method: Composition-based stats.
 Identities = 578/578 (100%), Positives = 578/578 (100%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC
Sbjct: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS
Sbjct: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL
Sbjct: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
           SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT
Sbjct: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
           TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR
Sbjct: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300

Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
           SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF
Sbjct: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420
           GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK
Sbjct: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420

Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480
           GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF
Sbjct: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480

Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
           NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA
Sbjct: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540

Query: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578
           ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK
Sbjct: 541 ASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDDFK 578


>gi|212710810|ref|ZP_03318938.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM
           30120]
 gi|212686507|gb|EEB46035.1| hypothetical protein PROVALCAL_01878 [Providencia alcalifaciens DSM
           30120]
          Length = 818

 Score =  549 bits (1414), Expect = e-154,   Method: Composition-based stats.
 Identities = 117/585 (20%), Positives = 221/585 (37%), Gaps = 53/585 (9%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +  + SFS GE++P L   R DL+ ++  + K  N +  +YG + + P  +     +   
Sbjct: 4   SIIQPSFSGGEIAPSLY-GRIDLAKYSTALRKCENFLVRQYGGIENRPGTKFIAAAKYPN 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKDNKSL 121
           +  R+  F         L  GDK ++++       ++            TPY   D  +L
Sbjct: 63  KKCRLIPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEHKGEIFELTTPYKEADLFNL 122

Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181
           ++         VH D+PP  L      D   +    ++    P+    +    K      
Sbjct: 123 KFTQSADVMTIVHADYPPMELQRYDHDD---WKLVPVETRNGPFEDINVDKERKVYV--- 176

Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYR 237
              A T    +T+   IF     G+ I +        P W  +          A    YR
Sbjct: 177 --SASTGEVTLTATHNIFGAELVGKQIYIEQQAVDAVPVWETDKTTIKNDQRRAGSNYYR 234

Query: 238 SLTTGRSGD-RFGYSKGATYVKDN---NITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293
           + T+G+SG  R  +++G ++        I W  +          S  G V    V  D  
Sbjct: 235 ANTSGKSGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVKINSVSTDG- 284

Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353
            ++  G+ IS  P +        +   W  S W + +GYPS V ++  RL F+GS+    
Sbjct: 285 -LTATGKVISYIPSNAV--GESNATYKWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYPQ 341

Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413
           +++ S  G + DF  +      D    +         + I  +   G  V +      + 
Sbjct: 342 TIWASRSGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYQ 397

Query: 414 LSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469
           ++   +K     S  F     +G    PP++V +  +++   G  ++ ++ S +  G++ 
Sbjct: 398 ITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQG 457

Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528
            ++T +A+HLF   +I+   +   P+SI W + +       +LL   +  E +  FAW  
Sbjct: 458 TDLTIMANHLFQRHQIIDWAFTIVPYSIAWCIRDDG-----KLLSLTYLRE-QQVFAWAP 511

Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGE-ERSFTVRLN 572
                +    S  S         +++ +V    G+    +  RL+
Sbjct: 512 QDTDGQF--ESTCSI--SEGNEDAVYFIVCRKVGDGTVRYIERLS 552


>gi|268589382|ref|ZP_06123603.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131]
 gi|291315409|gb|EFE55862.1| hypothetical protein PROVRETT_05514 [Providencia rettgeri DSM 1131]
          Length = 818

 Score =  549 bits (1413), Expect = e-154,   Method: Composition-based stats.
 Identities = 115/586 (19%), Positives = 224/586 (38%), Gaps = 55/586 (9%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +  + SFS GE++P L   R DL+ ++  + K  N I  +YG + + P  +     +   
Sbjct: 4   SIIQPSFSGGEIAPSLY-GRIDLAKYSTALRKCSNFIVRQYGGIENRPGTKFIAAAKYPN 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF---GKTYKTPYTFKDNKSL 121
           +  R+  F         L  GDK ++++       ++   +        TPY   D  +L
Sbjct: 63  KKCRLIPFQFSTVQTYALEMGDKYMRVIKDGGQVLYADGEYKGEIFELATPYKEADLFNL 122

Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181
           ++         VH D+PP  L      D   +    ++    P+        + ++ +  
Sbjct: 123 KFTQSADVMTIVHADYPPMELQRYDHDD---WKLVPVETRNGPF------EDINTDKERK 173

Query: 182 IS-QADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVY 236
           +   A T    +++   IF     G+ I +        P W  +   +I     A    Y
Sbjct: 174 LYVSASTGDVTLSATHNIFGAELVGKQIYIEQQAIDAVPVWETDKTTNINDQRRAGANYY 233

Query: 237 RSLTTGRSGD-RFGYSKGATYVKDN---NITWITVLNLSSKTSRESASGAVAPYYVWGDI 292
           R+ T G+SG  R  +++G ++        I W  +          S  G V    V  D 
Sbjct: 234 RANTAGKSGTLRPSHTEGMSWDGWGGDAGIQWEYL---------HSGFGIVKINSVSTDG 284

Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352
             ++  G+ +   P +        +   W  S W + +GYPS V ++  RL F+GS+   
Sbjct: 285 --LTATGKVVLYIPSNAV--GEENATYKWARSVWNDVDGYPSTVMYYQQRLFFAGSRAYP 340

Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412
            +++ S  G + DF  +      D    +         + I  +   G  V +      +
Sbjct: 341 QTIWASRSGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEY 396

Query: 413 LLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFR 468
            ++   +K     S  F     +G    PP++V +  +++   G  ++ ++ S +  G++
Sbjct: 397 QITGDQNKVLTPSSFSFSSQGANGCSDVPPIAVANIALYIQEKGSAVRDLAYSFDVDGYQ 456

Query: 469 FNEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527
             ++T +A+HLF   +I+   +   P+SI W + +       +LL   +  E +  FAW 
Sbjct: 457 GTDLTIMANHLFQRHQIIDWAFSIVPYSIAWCIRDDG-----KLLSLTYLRE-QQVFAWA 510

Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLV-ALSAGEERSFTVRLN 572
                 +    S  S         +++ +V     G    +  RL+
Sbjct: 511 PQETDGQF--ESTCSV--SEGNEDAVYFIVCRKVGGGTVRYIERLS 552


>gi|227355852|ref|ZP_03840245.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906]
 gi|227164171|gb|EEI49068.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906]
          Length = 820

 Score =  541 bits (1394), Expect = e-152,   Method: Composition-based stats.
 Identities = 123/584 (21%), Positives = 222/584 (38%), Gaps = 53/584 (9%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +  + SFS GE++P L   R DL+ ++  + K  N I  +YG + + P  +   + +   
Sbjct: 4   SLIQPSFSGGEIAPSLY-GRVDLAKYSTALRKCHNFIVRQYGGVENRPGTRFIAETKYQN 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA---LFGKTYKTPYTFKDNKSL 121
           + +R+  F         L FGD+ +++        ++            TPY   D   L
Sbjct: 63  KKSRLIPFQFSTVQTYALEFGDRYIRVFKDGGQVLYADGEHKGEVFELATPYKEADLFDL 122

Query: 122 EYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLS 181
           +Y         VH D+PP  L      D   +    ++    P+        +K  A   
Sbjct: 123 KYTQSADVMTIVHTDYPPMELQRYDHDD---WKLVSVETKNGPFEDINTDKAMKVYA--- 176

Query: 182 ISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNYSIGAYIVADDKVYR 237
              A T    +TS   IF     G+   L        P W  +   ++     AD   YR
Sbjct: 177 --SASTGQITLTSTHDIFGSEQIGKQFYLEQRDIDAVPVWETDKTTNLNDQRRADSNYYR 234

Query: 238 SLTTGRSGD-RFGYSKGATYVKDN---NITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293
           + + G++G  R  +++G ++        I W  +          S  G V    V  D K
Sbjct: 235 ANSGGKTGTLRPSHTEGMSWDGWGGDTGIQWEYL---------HSGFGIVKIETVSEDGK 285

Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353
             +  G+ +S  P +        +   W  + W + +GYPS V ++  RL F+GS+    
Sbjct: 286 --TATGKVLSYIPSNAV--GEDNASHKWARAVWNDVDGYPSTVVYYQQRLFFAGSRAYPQ 341

Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413
           +++ S  G + DF  +      D    +         + I  +   G  V +      + 
Sbjct: 342 TIWASRSGDYKDFGRNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYQ 397

Query: 414 LSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469
           ++   +K     S        +G    PP+SV +  +++   G  ++ +S S +  G++ 
Sbjct: 398 ITGDQNKVLTPSSFSMSSQGANGSSDLPPISVANIALYIQEKGSAVRDLSYSFDVDGYQG 457

Query: 470 NEITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528
            ++T LA+HLF   RI+   +   P+SI W + +        +L   +  E +  FAW  
Sbjct: 458 TDLTMLANHLFQRHRIVDWSFTTVPYSIAWCIRDDG-----LMLALTYLRE-QQVFAWAP 511

Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRL 571
                K    S  S         S + +V  +  G++  +  RL
Sbjct: 512 QSTEGKF--ESTCSI--SEGNEDSAYFIVQRTVNGKQVRYVERL 551


>gi|30387391|ref|NP_848220.1| hypothetical protein epsilon15p12 [Enterobacteria phage epsilon15]
 gi|30266046|gb|AAO06075.1| 12 [Salmonella phage epsilon15]
          Length = 825

 Score =  539 bits (1389), Expect = e-151,   Method: Composition-based stats.
 Identities = 114/579 (19%), Positives = 216/579 (37%), Gaps = 41/579 (7%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +W + SF+ GE+ P L   R D+S +   + K  N I  +YG + + P  +     +   
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
           R  R+  F         L FG   +++ +   +   + +        PY   D   +++ 
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHNYMRV-IKDGAYVLTTSNVIYELAMPYADTDLFRIKFT 121

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                   VH  +PP  L         ++   ++     P+    +   VK  A      
Sbjct: 122 QSADVLTLVHPAYPPKELRRYAHD---NWQIVDVTTKNGPFEDINVDETVKVYA-----S 173

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLT 240
           A T T  +T+   IF     G+   L        P W  +   +I     AD   YR+ T
Sbjct: 174 ASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYYRANT 233

Query: 241 TGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299
           +G++G  R  +++G ++                     S  G      V GD   ++   
Sbjct: 234 SGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWE--YLHSGFGIAKITAVAGDG--LTATA 289

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359
             +S  P    +  +  +   W   AW    GYPS V ++  RL F+ S     +++ S 
Sbjct: 290 DVVSFIPSQ--VVGSANASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASR 347

Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419
            G + DF  +      D    +         + I  +   G  V +      + +S   +
Sbjct: 348 TGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGNLVAL-TSGGEYTISGDQN 403

Query: 420 KGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475
           K L+     F     +G    PP++V +  +F+   G  ++ ++ S +  G++  ++T L
Sbjct: 404 KVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTIL 463

Query: 476 ADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534
           A+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW     + K
Sbjct: 464 ANHLFQKHSIVDWSFCIVPYSSAFCIRDDG-----KLLVLTYLRD-QQVFAWAPQSSAGK 517

Query: 535 HYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572
           +   S  S         +++ +V  +  G+   +  RL+
Sbjct: 518 Y--ESTCSI--SEGSEDAVYFVVNRTINGQTVRYIERLS 552


>gi|215487813|ref|YP_002330244.1| hypothetical protein E2348C_2746 [Escherichia coli O127:H6 str.
           E2348/69]
 gi|215265885|emb|CAS10294.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
          Length = 825

 Score =  538 bits (1386), Expect = e-151,   Method: Composition-based stats.
 Identities = 115/579 (19%), Positives = 213/579 (36%), Gaps = 41/579 (7%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +W + SF+ GE+ P L   R D+S +   + K  N I  +YG + + P  +     +   
Sbjct: 4   SWIQPSFAGGEIGPSLY-GRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
           R  R+  F         L FG   +++ +       + +        PY   D   +++ 
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHNYMRV-IKDGEYVLTTSNVIYELAMPYADTDLFRIKFT 121

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                   VH  +PP  L         ++   ++     P+    +   VK  A      
Sbjct: 122 QSADVLTLVHPAYPPKELRRYAHD---NWQIVDVTTKNGPFEDINVDDTVKVYA-----S 173

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLT 240
           A T T  +T+   IF     G+   L        P W  +   +I     AD   YR+ T
Sbjct: 174 ASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYYRANT 233

Query: 241 TGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299
            G++G  R  +++G ++                     S  G      V GD   ++   
Sbjct: 234 AGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWE--YLHSGFGIAKITAVSGDG--LTATA 289

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359
             +S  P    +  +  +   W   AW    GYPS V ++  RL F+ S     +++ S 
Sbjct: 290 DVVSFIPSQ--VVGSANASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASR 347

Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419
            G + DF  +      D    +         + I  +   G  V +      + +S   +
Sbjct: 348 TGDYKDFGKNNPIQDDDR---IIYTYAGRQVNEIRHLIDVGNLVAL-TSGGEYTISGDQN 403

Query: 420 KGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475
           K L+     F     +G    PP++V +  +F+   G  ++ ++ S +  G++  ++T L
Sbjct: 404 KVLTPSAFSFSSQGNNGSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTIL 463

Query: 476 ADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534
           A+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW     S K
Sbjct: 464 ANHLFQKHSIVDWSFCIVPYSSAFCIRDDG-----KLLVLTYLRD-QQVFAWAPQSSSGK 517

Query: 535 HYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572
           +   S  S         +++ +V     G+   +  RL+
Sbjct: 518 Y--ESTCSI--SEGSEDAVYFVVNRNINGQTVRYIERLS 552


>gi|315122895|ref|YP_004063384.1| hypothetical protein CKC_05755 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313496297|gb|ADR52896.1| hypothetical protein CKC_05755 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 588

 Score =  537 bits (1382), Expect = e-150,   Method: Composition-based stats.
 Identities = 224/583 (38%), Positives = 339/583 (58%), Gaps = 25/583 (4%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M    +TK SF+ GE+SP+++QSR DL LH+QG+++  N+IPL+ G LV  P +  Y   
Sbjct: 1   MPKGAYTKRSFAGGEVSPQIMQSRSDLELHSQGLSQCFNMIPLQDGSLVRRPPLYRYEHI 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            L P+++R+ SF++      L +FG+KK+  V V   T   P  F + Y TPY+F++ + 
Sbjct: 61  DLPPKASRILSFALGGDDAVLFIFGEKKMVYVEV---TGIKPPQFIRFYDTPYSFREAEQ 117

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L+ A  G+  V VH  H P+ + + + G      F+++ F PPPWLG   + G K +AKL
Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGLREVGGKKHDAKL 173

Query: 181 SISQADT--STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238
            ++ + T      +TS + IFK  D GR +RLG  P +W  NT Y   A++    KVYR 
Sbjct: 174 RVTLSATRKGKITVTSTLPIFKTKDVGRMLRLGWLPKDWTANTLYPENAFMQMYGKVYRC 233

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291
           +T G SG  F  ++  TY++D  +TW  +       ++   K++  +      PYYVWG+
Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
           I + +   +++ V            S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D
Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +VY S +  F DFS D   G  D  K+L+ A+TD + S I W  P  +G+++G DTSL
Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412

Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471
           W++ +   +G ++  RR++G GVY  PP+S+GD L+FV G GRRI+ I G++EQGF+F E
Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472

Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531
           +TQ  DHL + RI QL YQE+P+S++WV+     N+   LLGC   A  +   +WH H +
Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLGCSLHANSKEKGSWHVHKL 527

Query: 532 SDKHY-VLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571
             +   ++S +S    ++G T++W+L+      G       RL
Sbjct: 528 GGRGVKIMSLSSCLCLDQGETTVWLLLRRMNEDGVSSIGLERL 570


>gi|89152436|ref|YP_512269.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10]
 gi|74055459|gb|AAZ95908.1| hypothetical protein PhiV10p15 [Escherichia phage phiV10]
          Length = 823

 Score =  529 bits (1362), Expect = e-148,   Method: Composition-based stats.
 Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   +   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESLTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G      V G 
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAVNG- 282

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|294493191|gb|ADE91947.1| conserved hypothetical protein [Escherichia coli IHE3034]
          Length = 823

 Score =  529 bits (1361), Expect = e-148,   Method: Composition-based stats.
 Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G      V G 
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAVNG- 282

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ ++  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVINRTVNGQTVRYIERLS 550


>gi|327252176|gb|EGE63848.1| phage protein [Escherichia coli STEC_7v]
          Length = 823

 Score =  529 bits (1361), Expect = e-148,   Method: Composition-based stats.
 Identities = 118/587 (20%), Positives = 218/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G        G 
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|300898435|ref|ZP_07116776.1| conserved domain protein [Escherichia coli MS 198-1]
 gi|300357902|gb|EFJ73772.1| conserved domain protein [Escherichia coli MS 198-1]
          Length = 823

 Score =  528 bits (1360), Expect = e-148,   Method: Composition-based stats.
 Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G      V G 
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAVNG- 282

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + +++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGSDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|301046400|ref|ZP_07193560.1| conserved domain protein [Escherichia coli MS 185-1]
 gi|300301626|gb|EFJ58011.1| conserved domain protein [Escherichia coli MS 185-1]
          Length = 821

 Score =  528 bits (1359), Expect = e-147,   Method: Composition-based stats.
 Identities = 118/587 (20%), Positives = 219/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G        G 
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSINGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K L+     F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKALTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|323156125|gb|EFZ42284.1| phage protein [Escherichia coli EPECa14]
          Length = 823

 Score =  527 bits (1356), Expect = e-147,   Method: Composition-based stats.
 Identities = 118/587 (20%), Positives = 218/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W   SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIHPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T++  IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTANASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G        G 
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|332344346|gb|AEE57680.1| conserved hypothetical protein [Escherichia coli UMNK88]
          Length = 823

 Score =  525 bits (1353), Expect = e-147,   Method: Composition-based stats.
 Identities = 118/587 (20%), Positives = 218/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G        G 
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARISAANG- 282

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 283 ---TTATAEVISYIPSQ--VVGEDNASYKWAKYAWDSINGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLAPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|315121933|ref|YP_004062422.1| hypothetical protein CKC_00915 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313495335|gb|ADR51934.1| hypothetical protein CKC_00915 [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 588

 Score =  525 bits (1352), Expect = e-147,   Method: Composition-based stats.
 Identities = 225/583 (38%), Positives = 337/583 (57%), Gaps = 25/583 (4%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M    +TK SF+ GE+SP+++QSR DL LH+QG+++  N+IPL  G LV  P +  Y   
Sbjct: 1   MPKGAYTKRSFAGGEVSPQIIQSRSDLELHSQGLSQCFNMIPLSDGSLVRRPPLHRYEHI 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            L P+++R+ SF++      L +FG+KK+  V V   T   P  F + Y TPY+F++ + 
Sbjct: 61  DLPPKASRILSFALGGDEAVLFIFGEKKMVYVEV---TGIKPPQFIRFYGTPYSFREAEQ 117

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L+ A  G+  V VH  H P+ + + + G      F+++ F PPPWLG   + G K +AKL
Sbjct: 118 LDVARMGTLIVLVHPKHSPYKIEFTEAG----VIFEKMVFAPPPWLGRREVGGKKHDAKL 173

Query: 181 SISQADT--STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238
            ++ + T      +TS + IFKP D GR + LG  P +W  NT Y   A++    KVYR 
Sbjct: 174 RVTLSATRKGKITVTSTLPIFKPKDVGRMLCLGWLPKDWTANTLYPENAFMQMYGKVYRC 233

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITV-------LNLSSKTSRESASGAVAPYYVWGD 291
           +T G SG  F  ++  TY++D  +TW  +       ++   K++  +      PYYVWG+
Sbjct: 234 ITEGISGKEFEDNRRDTYIRDGGVTWKVIASSQALSVDKDGKSTLGTGGQYRTPYYVWGE 293

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
           I + +   +++ V            S + W MSAWGE+EGYPSHV+F+NNRL FSGSK D
Sbjct: 294 IVNCT-GAKTVEVMLHEGFCVTDSNSTLYWNMSAWGEREGYPSHVSFYNNRLCFSGSKFD 352

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +VY S +  F DFS D   G  D  K+L+ A+TD + S I W  P  +G+++G DTSL
Sbjct: 353 PQAVYFSGYNTFTDFSPDTIEGNVDYRKSLSVAITDDTMSAIRWFRPMEKGLVIGTDTSL 412

Query: 412 WLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNE 471
           W++ +   +G ++  RR++G GVY  PP+S+GD L+FV G GRRI+ I G++EQGF+F E
Sbjct: 413 WIVILDFERGFNLVSRRLAGIGVYEAPPLSIGDELIFVQGAGRRIQIIGGASEQGFQFLE 472

Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531
           +TQ  DHL + RI QL YQE+P+S++WV+     N+   LL C   A  +   +WHTH  
Sbjct: 473 LTQNVDHLLDYRIRQLAYQEDPYSLLWVL-----NNKGELLSCSLHANSKEKGSWHTHKS 527

Query: 532 SDK-HYVLSAASFPNDNRGGTSLWMLVALS--AGEERSFTVRL 571
                 ++S +S    ++G T++W LV+ +   G       RL
Sbjct: 528 GGGWVKIMSLSSCLCLDQGETTIWFLVSRTNEDGVSSIGLERL 570


>gi|331648168|ref|ZP_08349258.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331043028|gb|EGI15168.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 823

 Score =  524 bits (1348), Expect = e-146,   Method: Composition-based stats.
 Identities = 117/587 (19%), Positives = 217/587 (36%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G          
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|298381710|ref|ZP_06991309.1| conserved hypothetical protein [Escherichia coli FVEC1302]
 gi|298279152|gb|EFI20666.1| conserved hypothetical protein [Escherichia coli FVEC1302]
          Length = 823

 Score =  523 bits (1347), Expect = e-146,   Method: Composition-based stats.
 Identities = 117/587 (19%), Positives = 217/587 (36%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G          
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|117624704|ref|YP_853617.1| hypothetical protein APECO1_4049 [Escherichia coli APEC O1]
 gi|115513828|gb|ABJ01903.1| conserved hypothetical protein [Escherichia coli APEC O1]
          Length = 823

 Score =  523 bits (1346), Expect = e-146,   Method: Composition-based stats.
 Identities = 114/580 (19%), Positives = 214/580 (36%), Gaps = 43/580 (7%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298
           T G++G  R  +++G ++                     S  G              +  
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSG--DDDIGIEWEYLHSGFGIARITAA----NGTTAT 286

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358
              IS  P    +     +   W   AW    GYP  V ++  RL F+ S     +++ S
Sbjct: 287 AEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWAS 344

Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418
             G + DF         D    +         + I  +   G  V +      ++++   
Sbjct: 345 RTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYVITGDQ 400

Query: 419 SK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQ 474
           +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G++ N++T 
Sbjct: 401 NKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTI 460

Query: 475 LADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533
           LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW     + 
Sbjct: 461 LANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAWAPQSSTG 514

Query: 534 KHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
           K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 515 KY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|324008552|gb|EGB77771.1| conserved domain protein [Escherichia coli MS 57-2]
          Length = 823

 Score =  523 bits (1346), Expect = e-146,   Method: Composition-based stats.
 Identities = 117/587 (19%), Positives = 217/587 (36%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+   IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G          
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|218700982|ref|YP_002408611.1| hypothetical protein ECIAI39_2672 [Escherichia coli IAI39]
 gi|218370968|emb|CAR18795.1| conserved hypothetical protein from phage origin [Escherichia coli
           IAI39]
 gi|323948677|gb|EGB44582.1| hypothetical protein ERKG_04900 [Escherichia coli H252]
          Length = 823

 Score =  522 bits (1345), Expect = e-146,   Method: Composition-based stats.
 Identities = 117/587 (19%), Positives = 218/587 (37%), Gaps = 57/587 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SF+ GE+ P L   R D++ +   + K  N I  +YG + + P  +     +  
Sbjct: 3   ISWIQPSFAGGEIGPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG + +++ +   +   + +       TPYT  D   +++
Sbjct: 62  NRKCRLIPFQFSTVQTYALEFGHQYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   ++     P+    +   V   A     
Sbjct: 121 TQSADVLTLVHPAYPPKELRRYAHD---NWQLVDVVTKNGPFEDINIDESVTVYA----- 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +T+ + IF     G+   L        P W  + + SIG    AD   YR++
Sbjct: 173 SASTGTITLTASVSIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAV 232

Query: 240 TTGRSGD-RFGYSKGA-------TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           T G++G  R  +++G        +   D  I W  +          S  G          
Sbjct: 233 TAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYL---------HSGFGIARITAA--- 280

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
               +     IS  P    +     +   W   AW    GYP  V ++  RL F+ S   
Sbjct: 281 -NGTTATAEVISYIPSQ--VVGEDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAF 337

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G + DF         D    +         + I  +   G  V +      
Sbjct: 338 PQTIWASRTGDYKDFGKSNPTQDDDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGE 393

Query: 412 WLLSISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGF 467
           ++++   +K     S  F     +G    PP++V +  +FV   G  ++ ++ S +  G+
Sbjct: 394 YVITGDQNKVLTPSSFAFSSQGSNGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGY 453

Query: 468 RFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
           + N++T LA+HLF    I+   +   P+S  + + +       +LL   +  + +  FAW
Sbjct: 454 QGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAW 507

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                + K+   S  S         +++ +V  +  G+   +  RL+
Sbjct: 508 APQSSTGKY--ESTCSI--SEGNEDAVYFVVNRTVNGQTVRYIERLS 550


>gi|309702804|emb|CBJ02135.1| hypothetical phage protein [Escherichia coli ETEC H10407]
          Length = 807

 Score =  517 bits (1332), Expect = e-144,   Method: Composition-based stats.
 Identities = 106/563 (18%), Positives = 208/563 (36%), Gaps = 40/563 (7%)

Query: 21  LQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYA 80
           +  R D++ +   + K  N I  +YG + + P  +   + +   R  R+  F        
Sbjct: 1   MYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGEAKYPTRKCRLIPFQFSTVQTY 60

Query: 81  LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140
            L FG   +++ +   +   + +        PY   D   +++         VH  +PP 
Sbjct: 61  ALEFGHNYMRV-IKDGAYVLNSSNVIYELAMPYADTDLFRIKFTQSADVLTLVHPAYPPK 119

Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFK 200
            L         ++   ++     P+    +   VK  A      A T T  +T+   IF 
Sbjct: 120 ELRRYAHD---NWQIVDVTTKNGPFEDINVDETVKVYA-----SASTGTITLTASSAIFG 171

Query: 201 PLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-RFGYSKGAT 255
               G+   L        P W  +   +I     AD   YR+ T+G++G  R  +++G +
Sbjct: 172 AEQVGKLFYLEQPAIDSVPVWETSKTTAINDVRRADSNYYRANTSGKTGTLRPSHTEGMS 231

Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAG 315
           +                     S  G      V  D   ++     +S  P    +  + 
Sbjct: 232 WDGWGGTGDSDTGIQWE--YLHSGFGIARITAVSSDG--LTATATVVSYIPSQ--VVGSA 285

Query: 316 VSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCY 375
                W   AW    GYPS V ++  RL F+ S     +++ S  G + DF  +      
Sbjct: 286 NGSYKWARYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQDD 345

Query: 376 DPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGS 432
           D    +         + I  +   G  V +      + +S   +K L+     F     +
Sbjct: 346 DR---IIYTYAGRQVNEIRHLIDVGNLVAL-TSGGEYTISGDQNKVLTPSAFSFSSQGNN 401

Query: 433 GVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR-ILQLVYQ 490
           G    PP++V +  +F+   G  ++ ++ S +  G++  ++T LA+HLF +R I+   + 
Sbjct: 402 GSSNVPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTILANHLFQKRSIVDWSFC 461

Query: 491 EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGG 550
             P+S  + + +       +LL   +  + +  FAW     + K+   S  S        
Sbjct: 462 IVPYSSAFCIRDDG-----KLLVLTYLRD-QQVFAWAPQSSTGKY--ESTCSI--SEGSE 511

Query: 551 TSLWMLVALS-AGEERSFTVRLN 572
            +++ +V  +  G+ + +  RL+
Sbjct: 512 DAVYFVVNRTINGQTKRYIERLS 534


>gi|262043557|ref|ZP_06016670.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039091|gb|EEW40249.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 511

 Score =  516 bits (1328), Expect = e-144,   Method: Composition-based stats.
 Identities = 113/534 (21%), Positives = 203/534 (38%), Gaps = 36/534 (6%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
            +W + SFS GE++P L   R D++ +   + K  N I  +YG + + P  Q     +  
Sbjct: 3   VSWIQPSFSGGEIAPSLY-GRIDMAKYQVALRKCDNFIVRQYGGVENRPGTQFIAAAKYP 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            R  R+  F         L FG   +++ +       +         TPYT  D   L++
Sbjct: 62  DRKCRLIPFQFSTVQTYALEFGHNYMRV-IKDGGLVLTTGDVIYELATPYTENDVFGLKF 120

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH  +PP  L         ++   +++    P+    +            +
Sbjct: 121 TQSADVMTIVHPSYPPKELRRYAHD---NWQIVDVQTTNGPFEDINVDESKTV-----WA 172

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSL 239
            A T T  +TS   IF     G+   L        P W  + + SI     AD   YR+ 
Sbjct: 173 SAPTGTITLTSSSAIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIEDIRRADSNYYRAN 232

Query: 240 TTGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298
           T G++G  R  +++G  +                     S  G V    V GD   ++  
Sbjct: 233 TAGKTGTLRPSHTEGMAWDGWGGT--GDDDTGVQWEYLHSGFGIVRITAVAGDG--LTAT 288

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358
              +S  P++  +  A  +   W   AW    GYP+ V ++  RL F+ S     +++ S
Sbjct: 289 ADVVSRIPEN--VVGADKASYKWARYAWNSVNGYPATVVYYQQRLYFAASPAYPQTIWAS 346

Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418
             G + DF         D    +         + I  +   G  ++V      ++++   
Sbjct: 347 RTGDYKDFGKSNPTQDDDR---IVYTYAGRQVNEIRHLIDVG-SLVVLTSGGEFVVTGDQ 402

Query: 419 SKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQ 474
           +K L+           +G    PP++V +  +F+   G  ++ ++ S +  GF+ N++T 
Sbjct: 403 NKVLTPSAFSLSSQGSNGCSDVPPIAVSNIALFIQEKGSVVRDLAYSFDVDGFQGNDLTI 462

Query: 475 LADHLFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527
           LA+HLF +R I+   +   P S  + V +       +LL   +  + +  FAW 
Sbjct: 463 LANHLFQKRSIVDWAFCIVPFSSAFCVRDDG-----KLLVLTYLRD-QQVFAWS 510


>gi|304398395|ref|ZP_07380269.1| conserved hypothetical protein [Pantoea sp. aB]
 gi|304354261|gb|EFM18634.1| conserved hypothetical protein [Pantoea sp. aB]
          Length = 824

 Score =  508 bits (1307), Expect = e-141,   Method: Composition-based stats.
 Identities = 111/579 (19%), Positives = 203/579 (35%), Gaps = 41/579 (7%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +  + SF+ GE+SP +   R DL+ ++  + + RN I  +YG L + P  +   + +   
Sbjct: 4   SLIQPSFAGGEISPNVY-GRVDLAKYSIALRRCRNFIVRQYGGLENRPGTRFIAEAKYPD 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
           R  R+  F         L FG   +++                   TPY   D   L+  
Sbjct: 63  RKCRLIPFQFSTVQTYALEFGHNYMRVY-KDGGQVLDGNNQVYELATPYQEADLFELKIT 121

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                    HK + P  L         S+   E+     P+    +   VK  A      
Sbjct: 122 QSADVMTICHKAYAPRELRRFGHA---SWELVEVVTKNGPFEDINIDPSVKVYA-----S 173

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLT 240
           +      + ++  IF     G+   L        P W  +   ++G    A D  Y +LT
Sbjct: 174 SYQGNITLNANASIFGSEQVGKLFYLEQVNVDSTPVWETDKAVAVGMTRRAGDNYYVALT 233

Query: 241 TGRSGD-RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299
            G++G  R  +++GA +    +                     +      G I       
Sbjct: 234 AGKTGTLRPSHTEGAAWDGWGSNGDNDTGIQWEYQHSGFGIARITSVSSDGYI----AAA 289

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359
              +  P          +   W   AW +  GYP  VT++  RL+F+ S     +++ S 
Sbjct: 290 VVQTYMPNDAV--GPTKASYKWAKFAWNQVNGYPGTVTYYQQRLIFAASIKYPQTIWCSK 347

Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419
            G + DF         D    +         + I  +   G  V +      + +    +
Sbjct: 348 TGDYKDFGKTSPIADDDR---IVYTYAGKQVNEIRHLIDVGSLVAL-TSGGQFQIVGDQN 403

Query: 420 K---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQL 475
           K     +  F      G  +  P++V +  +F+   G  ++ ++ S +  G++ +++T L
Sbjct: 404 KTLTPTAFSFSSQGADGASSVAPITVSNIALFIQEKGSVVRDLAYSFDVDGYQGSDLTVL 463

Query: 476 ADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534
           A+HLFN  R++   +   P+S  W V      S   LL   +  E +  FAW       +
Sbjct: 464 ANHLFNGYRLVDWTFSVVPYSAGWAVR-----SDGMLLCLTYLRE-QQVFAWAPQP--GE 515

Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
               S  S         +++  V  +  G  + +  RL+
Sbjct: 516 GKFESTCSI--SEGTEDAVYFSVQRTVNGASKRYIERLS 552


>gi|330007163|ref|ZP_08305905.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3]
 gi|328535510|gb|EGF61970.1| hypothetical protein HMPREF9538_03594 [Klebsiella sp. MS 92-3]
          Length = 825

 Score =  504 bits (1296), Expect = e-140,   Method: Composition-based stats.
 Identities = 113/585 (19%), Positives = 210/585 (35%), Gaps = 45/585 (7%)

Query: 5   TWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDP 64
           +  + S + GE+SP L   R DL  +   + + RN I  + G + + P  +     +   
Sbjct: 4   SLVQPSLAGGEISPSLY-GRIDLEKYQTSLRRCRNFIVRQSGGIENRPGFRFLGSAKYAD 62

Query: 65  RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYA 124
           R +R+  F         L  GD   ++         +         TP+       L++ 
Sbjct: 63  RYSRLIPFQFSVSQTYALELGDHYFRVWSN--GALVTDGGSPVEVATPWPVSVISELKFT 120

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                    H D+PP  +    + D   +    +     P+        V   A      
Sbjct: 121 QSADVMTVCHNDYPPLEIRRYGEAD---WRTAAVTTTSGPFQDLNTDDSVTVYA-----S 172

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIVADDKVYRSLT 240
             T +  +T+   IFK    G+   +     +    W  + +  +G      +  YR + 
Sbjct: 173 GRTGSVTLTASSPIFKSQHVGKLFYMEQKAVDSVGRWETDKDIGVGDECRYQENFYRCVD 232

Query: 241 TGRSGDR----FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296
            G +G        ++ G ++          VL         S  G      + GD    +
Sbjct: 233 GGSNGTTGTVAPTHTTGDSWDGWGLGGRNGVL----WRYLHSGFGVCRITAIAGDGLTAT 288

Query: 297 KD--GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354
            D   R          +  +  +   W   AW + +GYP  VT++  RL+F GS+    +
Sbjct: 289 ADVVPRQDGEIELPAQVVGSTFATYKWAHYAWNDTDGYPGTVTYYQQRLIFGGSRAFPQT 348

Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414
           ++ S  G +++F         D   A+T        + I  +   G+ ++V      + +
Sbjct: 349 IWCSRTGDYHNFYRSNPKVDDD---AITYNYAGRQLNKILHLLDVGQ-LIVLTSGGEFKV 404

Query: 415 SISLSKGLS----IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRF 469
           +   +  L+          S +G     P++VG   ++V   G  I+ +  S +   ++ 
Sbjct: 405 TGDSNGNLTGTGGFAMSGQSFNGSSDLAPINVGSVALYVQQKGSIIRDLFYSFDQDSYQS 464

Query: 470 NEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528
           +++T LA HLFN   I       +P S+ W        S   LLG  +  E +  +AWH 
Sbjct: 465 SDLTLLASHLFNGYSIRDWALSVQPFSVAWCAR-----SDGMLLGLTYLRE-QQVYAWHP 518

Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
           H +++  YV S  S         +++ L+  +  G    +  RLN
Sbjct: 519 HPMTNG-YVESICSI--SEGQEDAVYALIRRTVNGSTVRYIERLN 560


>gi|320175038|gb|EFW50151.1| 12 [Shigella dysenteriae CDC 74-1112]
          Length = 799

 Score =  488 bits (1256), Expect = e-135,   Method: Composition-based stats.
 Identities = 107/564 (18%), Positives = 203/564 (35%), Gaps = 56/564 (9%)

Query: 27  LSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGD 86
           ++ +   + K  N I  +YG + + P  +     +   R  R+  F         L FG 
Sbjct: 1   MAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQTYALEFGH 60

Query: 87  KKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQ 146
           + +++ +   +   + +       TPYT  D   +++         VH  +PP  L    
Sbjct: 61  QYMRV-IKDGALVLNSSNVIYEIATPYTEADLFRIKFTQSADVLTLVHPAYPPKELRRYA 119

Query: 147 DGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGR 206
                ++   ++     P+    +   V   A      A T T  +T+   IF     G+
Sbjct: 120 HD---NWQLVDVVTKNGPFEDINIDESVTVYA-----SASTGTITLTASASIFGAEQVGK 171

Query: 207 SIRLGC----HPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-RFGYSKGA------- 254
              L        P W  + + SIG    AD   YR++T G++G  R  +++G        
Sbjct: 172 LFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGG 231

Query: 255 TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQA 314
           +   D  I W  +          S  G              +     IS  P    +   
Sbjct: 232 SGDDDTGIEWEYL---------HSGFGIARITAA----NGTTATAEVISYIPSQ--VVGE 276

Query: 315 GVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGC 374
             +   W    W    GYP  V ++  RL F+ S     +++ S  G + DF        
Sbjct: 277 DNASYKWAKYTWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNPTQD 336

Query: 375 YDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK---GLSIDFRRVSG 431
            D    +         + I  +   G  V +      ++++   +K     S  F     
Sbjct: 337 DDR---IIYTYAGRQVNEIRHLIDVGSLVAL-TSGGEYVITGDQNKVLTPSSFAFSSQGS 392

Query: 432 SGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN-QRILQLVY 489
           +G    PP++V +  +FV   G  ++ ++ S +  G++ N++T LA+HLF    I+   +
Sbjct: 393 NGSSNVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCF 452

Query: 490 QEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRG 549
              P+S  + + +       +LL   +  + +  FAW     + K+   S  S       
Sbjct: 453 SIVPYSSAFCIRDDG-----KLLVMTYLRD-QQVFAWAPQSSTGKY--ESTCSI--SEGN 502

Query: 550 GTSLWMLVALSA-GEERSFTVRLN 572
             +++ +V  +  G+   +  RL+
Sbjct: 503 EDAVYFVVNRTVNGQTVRYIERLS 526


>gi|48697202|ref|YP_024932.1| hypothetical protein BcepC6B_gp12 [Burkholderia phage BcepC6B]
 gi|47779008|gb|AAT38371.1| gp12 [Burkholderia phage BcepC6B]
          Length = 768

 Score =  487 bits (1254), Expect = e-135,   Method: Composition-based stats.
 Identities = 129/616 (20%), Positives = 222/616 (36%), Gaps = 67/616 (10%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      + SF AGELSP LL +R DL+ +  G     N I    GP +     +     
Sbjct: 1   MPKAAPQQVSFDAGELSP-LLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAAT 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118
           +   + + +  F + DG   +L FGD  ++  V R       A       TPY   D   
Sbjct: 60  KDSTKQSWLLPFIVADGIAYMLEFGDHYIRFFVNRGQLV--NAGAPVEIATPYALADLTT 117

Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174
                ++       T    H  +P   LL        +F+   + F+  P+      + V
Sbjct: 118 EDGTFAIRATQSADTMYLFHGGYPTQKLLRTS---ATTFSLQPVTFVGGPF------AAV 168

Query: 175 KSNAKLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYI 229
            S+  + + + A T    + +   +F+P D G    L          W  +         
Sbjct: 169 NSDNNVRVHASAGTGAVTLVASASVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELR 228

Query: 230 VADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY--- 286
              D+VY     G +  +   ++  T+   +   W       S T    + GA   Y   
Sbjct: 229 RVGDRVYLCTAVGTATPQVTGTETPTHT--SGSRWDGTGQDESATDEYGSIGAEWEYQHS 286

Query: 287 -----YVWGDIKDVSKDGRSISVAPQSQTLFQAG----VSVVSWFMSAWGEQEGYPSHVT 337
                 + G   D    G   +  P    +             W  S +   +G+P   T
Sbjct: 287 GYGTVLITGYTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGT 346

Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWM 396
           F  NRL     +       +S    F  F + D +    D   A+   +     + + WM
Sbjct: 347 FWRNRLCLMRDRWLA----MSVSADFETFKTKDADQQTDD--SAIVQQLNARQLNKLAWM 400

Query: 397 HPFGEGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGV 452
               + +L+G     W++  + +       +++  R +  G     PV VG  ++FV   
Sbjct: 401 VE-SDSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGSKRIQPVQVGGTIMFVQKA 459

Query: 453 GRRIKYISGST-EQGFRFNEITQLADHLFN------QRILQLVYQEEPHSIVWVVLEPKD 505
           GR+++          +   ++T++ADH+          I+ L +Q+EPHS+VW       
Sbjct: 460 GRKLRDFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAAR---- 515

Query: 506 NSFPRLLGCRFSAE--GEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-G 562
            +  +L+GC +  E      + WH H  ++  +V   AS P  +     LW++V     G
Sbjct: 516 -ADGQLIGCTYDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWVIVRRQVNG 573

Query: 563 EERSFTVRLN--LLDD 576
           +   +   LN  L DD
Sbjct: 574 QTVRYVEYLNPALQDD 589


>gi|221213947|ref|ZP_03586920.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
 gi|221166124|gb|EED98597.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
          Length = 766

 Score =  484 bits (1246), Expect = e-134,   Method: Composition-based stats.
 Identities = 129/613 (21%), Positives = 218/613 (35%), Gaps = 63/613 (10%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      + SF AGELSP LL +R DL+ +A G     N I    GP V     +     
Sbjct: 1   MPKAAAQQVSFDAGELSP-LLGARVDLAKYANGCLLLENFIATVQGPAVRRGGKRYVSAI 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118
           +   +   +  F + DG   +L FGD+ ++  V R       A       TPY   D   
Sbjct: 60  KDSGKQAWLLPFIVSDGIAYMLEFGDQYIRFYVNRGQLVNDSA--PVEIATPYALADLVT 117

Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174
                ++       T    H  +P   L         +F    + F+  P+      + +
Sbjct: 118 EDGTFAIRATQSADTMYLFHGAYPTQKLSRTS---ATTFELQPVTFVGGPFATVNDNNSI 174

Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPE----WAKNTNYSIGAYIV 230
           +  A        +    +T++  +F+  D G    +    P     WA +    +     
Sbjct: 175 RVQA-----SGQSGDVTLTANADVFRASDVGTLFYVEQEQPTGIVPWAVHAESHVNDIRR 229

Query: 231 ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS-------GAV 283
             D+ YR    G +  +    +   + +     W          +  S            
Sbjct: 230 VGDRTYRCTQIGLNAPQVTGQETPIHTE--GRRWDGDGRDPDGDTYGSIGVEWEYQHSGY 287

Query: 284 APYYVWGDIKDVSKDGRSISVAPQSQTLFQAGV---SVVSWFMSAWGEQEGYPSHVTFHN 340
           A   + G +          +  P    +    V       W  S +   +G+P   TF +
Sbjct: 288 ATVLITGFVNARQVSATVTTNNPNDPCMIPKPVVDSGTYKWARSLFNSTDGFPQMGTFWS 347

Query: 341 NRLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399
           NRL     +     + +S    F +F + D +    D   A+   +     + + WM   
Sbjct: 348 NRLCVMRDRW----IAMSVSADFENFKTKDADQQTDD--SAIVQQLNARRLNKLAWMVE- 400

Query: 400 GEGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455
            + +LVG     W++  S +       ++  RR +  G     PV VG  ++FV   GR+
Sbjct: 401 SDSLLVGMTGDEWVIGKSNASLALSATNMSARRRTSYGSKRLQPVEVGGTILFVQKAGRK 460

Query: 456 IKYISGST-EQGFRFNEITQLADHLFN------QRILQLVYQEEPHSIVWVVLEPKDNSF 508
           ++          +   ++T++ADH+          I+ L YQ+EPHSIVW        + 
Sbjct: 461 LRDFKYDFSSDNYVSTDVTKIADHVTRGRSGTNSGIMSLCYQQEPHSIVWAAR-----AD 515

Query: 509 PRLLGCRFSAE--GEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEER 565
            +L+GC +  E      + WH H   +  +V   AS P  +     LWM+V     G+  
Sbjct: 516 GQLIGCTYDEEAGRSDVYGWHRHPDVNG-FVECVASMPAPDGASDDLWMIVRRQINGQSV 574

Query: 566 SFTVRLN--LLDD 576
            +   LN  L DD
Sbjct: 575 RYVEYLNQSLQDD 587


>gi|221201505|ref|ZP_03574544.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
 gi|221207939|ref|ZP_03580945.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2]
 gi|221172124|gb|EEE04565.1| hypothetical protein BURMUCGD2_2474 [Burkholderia multivorans CGD2]
 gi|221178773|gb|EEE11181.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
          Length = 767

 Score =  482 bits (1239), Expect = e-134,   Method: Composition-based stats.
 Identities = 131/612 (21%), Positives = 217/612 (35%), Gaps = 60/612 (9%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      + SF AGELSP LL +R D++ +  G     N I    GP V     +     
Sbjct: 1   MPKAAAQQVSFDAGELSP-LLGARVDIAKYPNGCKVMENFIATVQGPAVRRGGKRFVAAV 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118
           +   +   +  F + DG   +L FGD  ++  V R     +         TPY   D   
Sbjct: 60  KDSSKQAWLLPFIVSDGIAYMLEFGDHYIRFYVDRGQLVNAGG--PVEIATPYALADLVT 117

Query: 119 ----KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV 174
                ++       T    H  +PP  LL        +F+  ++ F+  P+       GV
Sbjct: 118 EDGTFAIRATQSADTMYLFHGAYPPQKLLRTS---ATTFSLQQVTFVSGPFQTINSDEGV 174

Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP-----PEWAKNTNYSIGAYI 229
              A        T    +T+   +F   D G    L  +      P     T    G   
Sbjct: 175 TVKA-----SGQTGAVTLTATAPVFSQADVGALFYLEQNDNTSVLPWSVHGTILETGLVR 229

Query: 230 VADDKVYRSLTTGRSGDRFGYSKGATYVK----DNNITW-ITVLNLSSKTSRESASGAVA 284
              D+ Y S   G +  +   S+  T+ +    D ++T        +     E      A
Sbjct: 230 RVGDRTYVSTAIGPTAPQVTGSETPTHTRGRRYDGDLTDLANDNYGTIGIEWEYQHSGYA 289

Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAG---VSVVSWFMSAWGEQEGYPSHVTFHNN 341
              +          G   +  P    +            W  + +   +GYP   TF  N
Sbjct: 290 TVLITSVSDSQHATGTVTTNNPTDPCIIPQSIVDTGTYKWAHALFNAADGYPQMGTFWRN 349

Query: 342 RLLFSGSKGDELSVYLSSFGAFYDF-SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400
           RL     +        S    F +F S D +    D   A+   +     + + WM    
Sbjct: 350 RLWMMRDRWLV----GSVSADFENFASKDADQQTDD--SAIVQQLNARQLNKLAWMVE-S 402

Query: 401 EGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
           + +++G     W++  + +       +++  R +  G     PV VG  ++FV   GR++
Sbjct: 403 DSLIIGMTGDEWVIGPANASQPVSATNLNAARRTSYGSKRIQPVQVGGTIMFVQKAGRKL 462

Query: 457 KYISGST-EQGFRFNEITQLADHLFNQ------RILQLVYQEEPHSIVWVVLEPKDNSFP 509
           +          F   ++T+LADH+          I+ L +Q+EPHSIVW        +  
Sbjct: 463 RDFKYDFSSDNFVSTDVTKLADHITRGRSGTNNGIMSLCFQQEPHSIVWAAR-----ADG 517

Query: 510 RLLGCRFSAE--GEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERS 566
           +L+GC +  E      + WH H  ++  +V   AS P  +     LW++V     G+   
Sbjct: 518 QLIGCTYDEEAGRSDVYGWHRHPDANG-FVECVASMPAPDGASDDLWLIVRRQINGQTVR 576

Query: 567 FTVRLN--LLDD 576
           +   LN  L DD
Sbjct: 577 YVEYLNPALQDD 588


>gi|317120716|gb|ADV02538.1| hypothetical protein SC2_gp080 [Liberibacter phage SC2]
 gi|317120777|gb|ADV02598.1| hypothetical protein SC2_gp080 [Candidatus Liberibacter asiaticus]
          Length = 590

 Score =  478 bits (1230), Expect = e-132,   Method: Composition-based stats.
 Identities = 160/591 (27%), Positives = 258/591 (43%), Gaps = 43/591 (7%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      K+SF++GE+SP + QS  +L ++   +A   N IPLR G L+  P  + Y   
Sbjct: 1   MTKAIHFKNSFASGEVSPFVHQSGSNLKIYQSCLAHCHNYIPLRTGALMRRPGTRIYHVF 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
               +  R+FSF        ++V G  KL I   R            T + PY  +D   
Sbjct: 61  DDVDKPQRLFSFVKDAYTAYIIVLGYLKLHIFERRMGGCSK----VTTIEVPYKKEDVDE 116

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           +E A    T   VH  HPP  L          + F E+ F   P L +  I   K +  L
Sbjct: 117 IEVAQNIDTLWMVHPKHPPCQLELKGKD----WEFKEVLFKHVPPLKEQFIDDKKVSINL 172

Query: 181 SISQADTSTAR-----ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235
                +T T +     + +D ++FK +D GR + LG  P  W  +T Y   +Y+V +D++
Sbjct: 173 KTPFENTETGKTGMVSVEADGEMFKEMDIGRELNLGFRPQRWIPDTWYLDNSYVVHNDRL 232

Query: 236 YRSLTTGRS-GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVW-GDIK 293
            + +  G+S    + +S      KD +  W  V         ES  G      +W   + 
Sbjct: 233 LKCINKGKSQSTEWTFSDKEHQQKDGSCLWEKV---------ESTKGNARNLLIWVTGVI 283

Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353
              K  + + +  +     Q  +    W +  WG++EGYPS +TF  NRL+ SG K +  
Sbjct: 284 KRFKTAKCVLLELKGAFPLQNDLPTKHWLLGEWGQKEGYPSCITFFGNRLVLSGGKHNPQ 343

Query: 354 SVYLSSFGAFYDFSLDGEY-GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412
           +V+ S    F DF+   E  G  D T + +  +       I W+     G+LVG +++LW
Sbjct: 344 TVHFSKLDDFTDFNQISEQGGNTDLTSSFSVLLGSDVRQGIQWLSHTDSGLLVGTESALW 403

Query: 413 LLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI-----SGST 463
           L++ +         ++  R +   G  A  P+ VG   VF+   GR +  +     + +T
Sbjct: 404 LITQTSQNEVVSKATVAIRSIGNFGSIAVSPILVGSHCVFIKDTGRDLISLVGNRSADNT 463

Query: 464 EQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGD 523
           +  +RF ++   A+H+  + + + V Q+ P+SI+WVVL        RL+GC F  + E  
Sbjct: 464 KTEYRFRDLNLFAEHILTKGVWEAVLQQSPYSIIWVVLRDG-----RLVGCTFDPDNE-V 517

Query: 524 FAWHTHMISD-KHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571
            AWHTH +      + S  S  +   G   LW+LV      G +     +L
Sbjct: 518 CAWHTHDLGGFYTQIHSLTSCASFLDGQDDLWLLVERLDDTGRKTRSLEKL 568


>gi|120601703|ref|YP_966103.1| hypothetical protein Dvul_0653 [Desulfovibrio vulgaris DP4]
 gi|120561932|gb|ABM27676.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4]
          Length = 699

 Score =  453 bits (1165), Expect = e-125,   Method: Composition-based stats.
 Identities = 126/578 (21%), Positives = 207/578 (35%), Gaps = 77/578 (13%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   T  ++SF+AGELSP L+ +R D + +  G A   N++   +G     P ++     
Sbjct: 1   MARATIVRNSFNAGELSP-LMAARVDQARYPNGCASLCNMLLHPHGGAWRRPGLRFMGLA 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
                  R+  F   +    +L FG + L+I                  +TP+  +   +
Sbjct: 60  ADPAGPVRLIPFVFSEAQAYVLEFGPRSLRIWHGGGLVLGGDGE-PFRLETPWAGEQLTA 118

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L +         V    PP  L      D   +   ++ FLP     +G+   VK     
Sbjct: 119 LRWCQSADMLYLVSHAGPPRRLERHGHAD---WRLVDVSFLPGVSPPEGLHCTVKPAGSR 175

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
           + +   T+  R + +  +  P  +         P   ++  + ++    V D   YR   
Sbjct: 176 TWTYVVTAVHRESGEESLPTPPLQVT------GPDALSQTASVTLAWTPVQDAGEYRVYR 229

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
            G     +G+   A                          GA   Y   G   D      
Sbjct: 230 AGGGASVYGFLGSA--------------------------GAGETYTDTGRTPDFDAG-- 261

Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
                P+++  F                   +PS   F   RL F+G++    +++ S  
Sbjct: 262 ----PPEARNPFSGEGD--------------WPSCAVFWQQRLCFAGTRNGPQTIWASRS 303

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420
           GA+ +FS+       D   A+T  +   + S + W+ P    +LVG     W LS    +
Sbjct: 304 GAYGNFSVSRPLRDDD---AVTVTIAADTVSAVRWLMP-ARRLLVGTGGGEWTLSGQGEQ 359

Query: 421 ---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476
               LS    R S  G     P+SVGD ++ +   GR ++    S +  G+   ++T LA
Sbjct: 360 PFSPLSCSLERQSSRGSGDVQPLSVGDAVLALQRGGRVVREFRYSLDVDGYAGTDLTILA 419

Query: 477 DHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535
           +HL   +RI+   +Q+ P   VW V E        L+      E E    WH H+     
Sbjct: 420 EHLTRGRRIIDWAWQQSPSGTVWCVTEDGG-----LIAMTRIPEHE-VAGWHRHVTDGA- 472

Query: 536 YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572
            VLS  + P     G  LW+ V     G  R    RL+
Sbjct: 473 -VLSVCTIPG--TAGDELWVAVRREGGGMVRCCIERLD 507


>gi|218886166|ref|YP_002435487.1| hypothetical protein DvMF_1065 [Desulfovibrio vulgaris str.
           'Miyazaki F']
 gi|218757120|gb|ACL08019.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           'Miyazaki F']
          Length = 692

 Score =  436 bits (1120), Expect = e-120,   Method: Composition-based stats.
 Identities = 125/579 (21%), Positives = 206/579 (35%), Gaps = 75/579 (12%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M  TT  ++SF+AGELSP L+ +R D + +A G    RN++   +GP    P ++    C
Sbjct: 1   MARTTLIQNSFNAGELSP-LMAARGDQARYASGCRVLRNMLLHPHGPAFRRPGLRFMGAC 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
             +    R+  F   +G   +L F  ++L++   R                PY  +   +
Sbjct: 60  VDETVPPRLVPFVFNEGQAYVLEFAPERLRVW-WRGGLVLGEGGAPLVVPAPYAAEHLPT 118

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L +         V     P  L      D   +    + F P      G+ S    +   
Sbjct: 119 LRWCQSADVLYLVTPHAAPRKLERHGHAD---WRLVAVNFGPRVATPTGLRSTGAPSGTR 175

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
                 T+ +  T +  +           L       A+ +  ++    V     YR   
Sbjct: 176 QHRYVITAVSVDTGEESLPTAE-------LAVTAGTPAEGSAVNLAWTAVEGASEYRVYK 228

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
            G     +G    A   +                           Y   G   D ++   
Sbjct: 229 AGGGASVYGLLGTAATGE--------------------------TYADTGRTPDFAEG-- 260

Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
                P+ +  F+                + YPS V F   RL F+GS+    +++ S  
Sbjct: 261 ----PPEHRNPFEG--------------TDDYPSSVQFWQQRLCFAGSRSHPQTIWASRT 302

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420
           G + +  +       D   A+T  +   + S + WM P    +LVG     W LS   S+
Sbjct: 303 GCYENMDVSRPLQTDD---AVTVTIASETVSAVRWMMP-ARKLLVGTGGGEWTLSGQGSE 358

Query: 421 ---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476
               LS      S  G    PP++VGD ++ V   GR ++    S +  G+   + T LA
Sbjct: 359 PFSPLSCLLEFQSARGSAELPPLAVGDGVLAVQRGGRAVRDFRYSLDVDGYSGADQTILA 418

Query: 477 DHLFNQR-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535
           +H+   R I+   YQ+ PHS+VW  ++        + G    AE +    WH H      
Sbjct: 419 EHMLRGRNIVDWAYQQSPHSVVWCAMDDG-----TMAGLTLIAEHQ-VAGWHRHDTGGAV 472

Query: 536 YVLSAASFPNDN-RGGTSLWMLVALS-AGEERSFTVRLN 572
             L     P  +  GG  LW++V     G +R +  RL+
Sbjct: 473 EALCVVPGPPSDPAGGDELWLVVRRDVDGVQRRYIERLD 511


>gi|282848883|ref|ZP_06258273.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC
           17745]
 gi|282581388|gb|EFB86781.1| hypothetical protein HMPREF1035_1392 [Veillonella parvula ATCC
           17745]
          Length = 772

 Score =  415 bits (1067), Expect = e-114,   Method: Composition-based stats.
 Identities = 107/610 (17%), Positives = 225/610 (36%), Gaps = 70/610 (11%)

Query: 6   WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65
            ++ +F+ GE+SP +  SR DL  +   + ++ N++   YG +      Q     +   +
Sbjct: 7   ISQLAFTTGEVSPDV-SSRFDLEQYKSALLEAENVVIRPYGAVAKRQGSQYVGQVKYSDK 65

Query: 66  SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125
             R+F F+       +L FGDK +++      T       G    TP+T      L  + 
Sbjct: 66  PTRLFEFTTNTNNSFMLEFGDKYIRVWNYGVYT-------GIEVTTPFTSDILFDLNCSQ 118

Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185
            G         +P   L    D D   +  +  K    P+      + V S   ++    
Sbjct: 119 SGDVMFICSGKYPIQTLSRYSDTD---WRLEAYKLTEQPYDTIN--TDVNSTVTVTGDTI 173

Query: 186 DTSTARITSDM-------------------KIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226
            +S     +DM                          + RS   G +      N NY++ 
Sbjct: 174 RSSKDLFNADMVGMVMQLGYFVAAVHTKNTGTVVEKKEKRSFMGGFNKWNEYNNINYNVE 233

Query: 227 AYIVADDKVYRSLT----TGRSGDRFGYSKGATYVK--------DNNITWITVLNLSSKT 274
           +Y    D  ++  T    TG    +   + G T+          D N+T    +  ++K 
Sbjct: 234 SYSTDQDLAWKFTTHGTWTGTVKLQITTNNGTTWKDYRTYSSNNDYNVTDAGKIEPNAKL 293

Query: 275 SRES--------ASGAVAPYYVWGDIK-DVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325
             +S           ++ PY  WG ++     D +++ +   +  +     S   W M +
Sbjct: 294 RIQSDIKSGECNVDLSILPYTTWGIVEFKEFVDSKTMKINILNGIVENEATS--KWKMGS 351

Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAV 385
           WG   GYP   TF+ +R + + +  +   +++S  G + +F ++   G      ++T  V
Sbjct: 352 WGRSNGYPKLCTFYQDRFVVAATNKNPNYIWMSRTGDYPNFGVEKVEGTITDDSSITLPV 411

Query: 386 TDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYACPPVSVGD 444
            +     I  + P  + +++    + W++S   +    + + +  +  G  +C P  +G+
Sbjct: 412 INRKMYEIRHLVPAND-LIILTSGNEWIVSGDKTITPTNCNLKTQTQRGALSCEPQFIGN 470

Query: 445 CLVFVCGVGRRIKYISGSTE-QGFRFNEITQLAD-HLFNQRILQLVYQEEPHSIVWVVLE 502
             VFV   G  ++ +  S E   +   ++T      +     +   Y ++P SI++ +  
Sbjct: 471 RCVFVQERGGTVRDMGYSYESDNYTGQDLTLFVKTRVRGYLTITSAYAQDPDSIIYYIRN 530

Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-A 561
             +      + C      +  + W  H +++  Y+   +          SL+ L+  +  
Sbjct: 531 DGE------INCLTYIPEQKVYGWS-HFVTNGKYLYCESV---SEGEQDSLYTLIERTLQ 580

Query: 562 GEERSFTVRL 571
           G++     R+
Sbjct: 581 GKKVKCIERM 590


>gi|262043657|ref|ZP_06016766.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259038995|gb|EEW40157.1| hypothetical protein HMPREF0484_3785 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 758

 Score =  414 bits (1063), Expect = e-113,   Method: Composition-based stats.
 Identities = 118/614 (19%), Positives = 199/614 (32%), Gaps = 75/614 (12%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      K SF+AG LSP ++  + D    A  V   +N IPL  GP       Q     
Sbjct: 1   MSKIRPIKRSFNAGILSP-VMYGQVDFDKWASAVKYMKNFIPLPQGPARRRGGTQYAGSV 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDN-- 118
           +       + SF        +L FG   ++     +              TP+   D   
Sbjct: 60  KNSSDRVWLASFQFSTTEAFILEFGPGYIRFWFNHAQL-LDDENNILEVSTPWGAGDLTR 118

Query: 119 ---KSLEYAVFGSTAVFVH--KDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG-DGMIS 172
                L                ++P + L         +++  E  F   P+   +   S
Sbjct: 119 NGKFGLSLQQSADVIYITCTNGNYPVYKLTR---NTNTNWSLAEASFSGGPFADINSDKS 175

Query: 173 GVKSNAKLSISQAD-----------TSTARITSDMKIFKPLDKGRSIRLGC--------- 212
            V    +  I   D           TS   IT++  IF+ L  G    +           
Sbjct: 176 SVVYTDQFRIWSEDGNDLPDGTPTTTSLCNITANTDIFQALHVGCLFYIEASTDAVDDDT 235

Query: 213 ----HPPEWAKNT--NYSIGAYIVADDKVYRSLTTGRSG-DRFGYSKGATYVKDNNITWI 265
               + P WA  T   +S G +  +D K Y  +   ++G  +  ++ GA           
Sbjct: 236 GHSGYIPAWAAGTTETFSTGVFCRSDGKYYEDMDGTKTGNTQPTWTAGAHRDGSGG---- 291

Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325
                   +    + G      +       S  G+ ++  P S  +         +    
Sbjct: 292 ------DASLWRYSGGGWGIIEITAVNSATSATGKIVTELPPS--VRNTVGKTYKYAFGD 343

Query: 326 WGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAV 385
           W +   YP    F   RL+F+G       ++ S  G   +FS        +   ++   +
Sbjct: 344 WSDVLRYPQFAAFFRGRLVFAGR----QKIWSSVAGDLQNFSPMTNGYEAESDDSINDRI 399

Query: 386 TDFSASTIHWMHPFGEGVLVGCDTSLW------LLSISLSKGLSIDFRRVSGSGVYACPP 439
            D +  T+ W+      + +G     +      L S+  +    ++       G      
Sbjct: 400 -DDTQDTMQWLVASAGKIFIGTAGYEFSYGEQSLTSVFGAGNTKVELNSTI--GSNEVQA 456

Query: 440 VSVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVW 498
             + D + FV   GR++   +  S    F       LA HLF   I+ L YQ+EP+ I+W
Sbjct: 457 ERLFDRVAFVQRAGRKVMIAAYDSGSDSFSATNSCILAPHLFTSEIIALAYQQEPNRILW 516

Query: 499 VVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVA 558
           V+LE             + AE +    WH H       V S    P+ + G   LWM+V 
Sbjct: 517 VLLEEGKLLGL-----TYDAE-QNITGWHEHATGGA--VESIKVIPDIDGGRDELWMVVK 568

Query: 559 LS-AGEERSFTVRL 571
            +  G    +   +
Sbjct: 569 RTINGATVRYLEYM 582


>gi|294648405|ref|ZP_06725904.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
 gi|292825710|gb|EFF84414.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
          Length = 706

 Score =  405 bits (1040), Expect = e-110,   Method: Composition-based stats.
 Identities = 123/583 (21%), Positives = 203/583 (34%), Gaps = 56/583 (9%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      K++F++GELSP +   R DL  +  G  +  N +P+  G L            
Sbjct: 1   MAKINLIKNNFTSGELSPHIWM-RTDLQQYRNGTKEMLNFLPIIEGGLKRRGGT---EAL 56

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            +   + R+  F I      LL+F   ++ ++ +  +         K+  TPYT +D K 
Sbjct: 57  AITAGAIRILPFIISHSTAYLLIFKPNQIDVLDINGTVV-------KSLSTPYTAQDIKE 109

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           + Y          H  HP   L      D  ++++D   F  PP      +  V++ A  
Sbjct: 110 ISYTQNRYQFYIAHSKHPLAWLR--ASEDLTNWSYDPFDFYVPP------LEEVETPALP 161

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
             S    +    T     +   D  +  + G        N  Y   A  +         T
Sbjct: 162 LKSNEKNAGKVATLTASPYNIYDNSKRYQAGEICHHTINNVKYYFRALRITQGNTPSFGT 221

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSS-KTSRESASGAVAPYYVWGDI-KDVSKD 298
           +G       Y +  T  +    T   V                V+P  V G+I   +S D
Sbjct: 222 SGPEASPDYYWETTTVTEAQAFTAADVDKFVFINEGIVRIDTYVSPSTVTGEILVKLSTD 281

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356
             +I+                +W +    +    GYP  VT +  RL+ +G+K     V+
Sbjct: 282 IEAIAN---------------AWTLKQDIFEVSLGYPRAVTMYQQRLVIAGTKTYPNYVW 326

Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416
           LS  G   +F             + T + +    + +  +       ++   + L + S 
Sbjct: 327 LSRVGDVTNFLP-----TTSDGDSFTVSASSDQLTNVLHLAQSRGICVMTGGSELVISSQ 381

Query: 417 SLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQ 474
           +     +      +  G      P+ VG  L+FV     RI+ +           NE+T 
Sbjct: 382 NSMTPTNTSILEHTSFGSTENIKPIKVGSELIFVQRGAERIRTLLYDYSIDSLTSNELTV 441

Query: 475 LADHLFN--QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532
           LA H+        ++VY  EP SI+W VL        +L     + E +   AW TH I 
Sbjct: 442 LASHIAKKSGGFKEMVYCAEPDSIIWFVL-----GNGKLASLTLNRE-QSVIAWSTHDIG 495

Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLD 575
               VLS  S P+   G   L+ LV  +   +        LLD
Sbjct: 496 GT--VLSLTSLPS-TTGADRLYFLVNRNGTVQIEQMKEELLLD 535


>gi|298485990|ref|ZP_07004064.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
           NCPPB 3335]
 gi|298159467|gb|EFI00514.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
           NCPPB 3335]
          Length = 716

 Score =  402 bits (1033), Expect = e-110,   Method: Composition-based stats.
 Identities = 109/581 (18%), Positives = 206/581 (35%), Gaps = 44/581 (7%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   T  + +F+AGELSPR+L  R D++ +  G     N  PL +G +            
Sbjct: 1   MAKLTLIQTNFTAGELSPRML-GRVDIARYQNGAKVIENAWPLVHGGVTRRNGTLFCAAA 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +   R  R+  +        ++ FGD  ++I              G    +PY      +
Sbjct: 60  KFPDRRARLVPYVFNTEQAYMIEFGDFYIRIYYPNG------GWTGVELASPYGQTMLAA 113

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           LEY     T    H   P + L  I       ++     F+  P+   GM          
Sbjct: 114 LEYVQGADTMFLFHGRVPIYRLKRIS---NTEWSLAPAPFVTTPFEERGMDFAFAMAIT- 169

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAK-NTNYSIGAYIVADDKVYRSL 239
             + A  + + +T     F   D GR I  G          ++ S+   ++       S 
Sbjct: 170 --NPAAGAASTVTPGAPAFFISDVGREIWAGSGIARITAFGSSGSVSVLVINAF----SQ 223

Query: 240 TTGRSGDRFGYSKGA-TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298
           T   +    G  +   T    + +     L L +   R    G                 
Sbjct: 224 TLYPTWSLKGSPQTTCTASAFSPVGATVTLTLGAAGWRPEDVGKFVKLNGGLFQISGFTS 283

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356
              ++   +S           +W +  S W + +GYPS  T +  RL+ +GS     +++
Sbjct: 284 STVVNAVIRSIATSVVAAPAGAWSLEASVWNDFDGYPSTGTLYEQRLVAAGSPNYPQTIW 343

Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCD-TSLWLLS 415
            S  G + +F L  +        A++  V+    + I  MH      LV       + ++
Sbjct: 344 ESRTGEYLNFELGTK-----DDDAMSFNVSSDQINPI--MHVGQVKALVTLTYGGEFTVT 396

Query: 416 ISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471
             + K     +I  +  S  G     P+ +G+ L FV   GR+++ ++   +   +   +
Sbjct: 397 GGVEKPITPTNIQIKNQSVYGCNGVRPIRIGNELYFVQRAGRKLRAMAYKYDSDSYGSPD 456

Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531
           ++ L++H     ++ + +Q+EP SI+++V      S   +       + +    W   + 
Sbjct: 457 MSVLSEHATKSGVVDMAFQQEPESILFMVR-----SDGVMATMTVDRD-QDVVGWARQVT 510

Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRL 571
              +   S A  P+    G  +W +V  +  G+   +  R 
Sbjct: 511 DGAY--ESVAVIPSAE--GDQVWAVVRRTVNGQNVRYLERF 547


>gi|46580124|ref|YP_010932.1| hypothetical protein DVU1714 [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|46449540|gb|AAS96191.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|311233883|gb|ADP86737.1| hypothetical protein Deval_1582 [Desulfovibrio vulgaris RCH1]
          Length = 697

 Score =  402 bits (1033), Expect = e-110,   Method: Composition-based stats.
 Identities = 126/582 (21%), Positives = 205/582 (35%), Gaps = 81/582 (13%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      + +F+ GE+SP LL +R D   +  G    RN +PL  GP+   P ++     
Sbjct: 1   MGTIYPVQQAFNGGEISP-LLTARADQIRYQTGALTMRNAVPLAQGPVTRRPGLRFMGAA 59

Query: 61  RL-DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNK 119
           +       R+ SF         L FG   +++ +       +   +     +PY   D  
Sbjct: 60  KEQGAGPVRLVSFVFSAAQSRALEFGPGYVRVWMDAGLVSKNGQPY--EVASPYGAADIA 117

Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP----PPWLGDGMISGVK 175
            L +A          ++HPP  L    D D   + F    F+P    P  L  G +    
Sbjct: 118 GLRFAQSADVIYIASRNHPPRKLSRHADDD---WRFITPTFMPTQAAPGALTLGTLGTTP 174

Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235
                + S   T+ +  T +  +  P   G           W + +  ++   +  + +V
Sbjct: 175 GPGNETYSYKVTAVSATTGEESLASPE--GTITTTAMSSTYWVRVSWAAVPGAV--EYRV 230

Query: 236 YRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDV 295
           Y+    G  G       G T+  D NI                  GA     V       
Sbjct: 231 YK-RRYGVFGFIGRAVGGDTFFDDRNI------------------GADTEDTV------- 264

Query: 296 SKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355
                     P+++  F A                 YP  V F   RL F+GS    L+V
Sbjct: 265 ----------PEAKNPFTAAGE--------------YPGLVFFWQQRLGFAGSDKRPLTV 300

Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415
           +LS   AF + +        D  +A    +     +   W+      + +G +   W LS
Sbjct: 301 WLSQSAAFENLAASRPPQDDDGIEA---TLAGQRQNRFVWI-EGDRTLCLGTEGGEWTLS 356

Query: 416 ISLSK---GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471
                     S+ F+     G    P V  GD L++V   G  ++  + S E  G+   +
Sbjct: 357 GQEGGPVTPTSLQFQSHGVRGSEGVPAVRAGDSLLYVQRGGGVVREFTYSFERDGYVAPD 416

Query: 472 ITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMI 531
           +T L   L  +++    YQ+ PHSIVW VL+        L    F  E +    WH H  
Sbjct: 417 LTLLTGVLRGRKVRAWAYQQSPHSIVWCVLDDG-----TLAALTFLREHD-VVGWHRHDT 470

Query: 532 SDKHYVLSAASFPNDNRGG-TSLWMLVALS-AGEERSFTVRL 571
                 ++     +   GG  ++WMLV  +  G+ER +  R+
Sbjct: 471 DGVVEDVTVIPGGDATAGGTDTVWMLVRRTVGGQERRYVERM 512


>gi|292670776|ref|ZP_06604202.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541]
 gi|292647397|gb|EFF65369.1| hypothetical protein HMPREF7545_1740 [Selenomonas noxia ATCC 43541]
          Length = 762

 Score =  400 bits (1028), Expect = e-109,   Method: Composition-based stats.
 Identities = 119/601 (19%), Positives = 211/601 (35%), Gaps = 65/601 (10%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
            K SF+ GEL+P L   R DL  +  G +  +N+I LRYG     P  +     +   + 
Sbjct: 9   LKPSFAGGELTPALY-GRTDLQKYDVGASTLKNMIVLRYGGATRRPGFRHVAKTQ-GGKR 66

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
            R+  F        +L F    +++           A       T YT  D   ++Y   
Sbjct: 67  ARLIPFQYSTEQSYVLEFTAGCIRVFTKGGIVVKDDAPLVIP--TSYTEADLSDIKYTQS 124

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186
                 VH +HPP  L      D   + F+ +     P+       G+K       +   
Sbjct: 125 ADVLFLVHVNHPPMTLTRYGVTD---WKFERMDIAGGPFEDPNTKDGLKI-----GASGV 176

Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN--TNYSIGAYIVADDKVYRSLTTGRS 244
                + + +  F     G  IRLG       K+      +    V    VY       +
Sbjct: 177 QGEITLKASVDYFTEDMVGSLIRLGHTMSGQLKSGIPTTPLVVRCVPSGTVYVESFGFWN 236

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY----------VWGD--- 291
           G         +      +         + T   +  G     Y          VW +   
Sbjct: 237 GSFIVEKHDKSTDTWIALQEQHANRTQNYTLNYTNKGDDIVEYRVRSEKFDTSVWSNENE 296

Query: 292 -----------------IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPS 334
                            +  ++    + S A           +   + +SAW  ++GYP 
Sbjct: 297 RQRGYVTIQTFAQDYYGVARITAVNSATSAAATVTRELADTEATNDFSLSAWSAKKGYPQ 356

Query: 335 HVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394
            V+F  +RL+F+GS+    + + S  G +Y+F ++      D   A+T  ++    + I 
Sbjct: 357 AVSFFEDRLVFAGSRAKPQTYWASQSGDYYNFWVNTPQQDSD---AITGTLSGGQMNGIR 413

Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGV 452
            + PFGE +++      + +          +         G+    PV +G  +V+V   
Sbjct: 414 AIIPFGEMLML-TSGGEYKVGGGNETFTPTNQKAEPQEYRGINNLTPVVIGGRIVYVQHQ 472

Query: 453 GRRIKYISGSTE-QGFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPR 510
           G  I+ ++ S +   +  ++++ LA HLF    I+ L YQ+ P+++VW V E        
Sbjct: 473 GSVIRDLTYSYDVDKYTGDDVSLLAAHLFEGHTIVALAYQQTPNTVVWCVREDG-----A 527

Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVR 570
           LLG  +  E +  +AWH H  + K       +   D      LW +V         +  +
Sbjct: 528 LLGMTYIKE-QDVYAWHKHTTAGKFTD--VCTISGDR--EEELWAVVERDGAH---YVEQ 579

Query: 571 L 571
           +
Sbjct: 580 M 580


>gi|225157020|ref|ZP_03724959.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2]
 gi|224802748|gb|EEG20999.1| hypothetical protein ObacDRAFT_8085 [Opitutaceae bacterium TAV2]
          Length = 773

 Score =  398 bits (1023), Expect = e-108,   Method: Composition-based stats.
 Identities = 115/617 (18%), Positives = 209/617 (33%), Gaps = 77/617 (12%)

Query: 9   HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68
           ++F+AGE +P+L   R DL  +     +  N+  + YG              +     +R
Sbjct: 7   NNFTAGEWTPKL-DGRSDLQKYDAACRRLENMRVMPYGGARFRSAFGYVAKTKSAATPSR 65

Query: 69  VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGS 128
           +  F        +L +    L++     S   +PAL  +   +PY      +++Y     
Sbjct: 66  LMPFQFSTEQKFMLEWAHLALRVY----SAGAAPALL-QEIASPYPAAAVFAIQYRQIND 120

Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188
               VH D+P   L    D D   +  + + +  PP L + +        KLS+S  D  
Sbjct: 121 VVYLVHPDYPVQRLARHADAD---WRLEAVDWAFPPMLDENVTET-----KLSLSAVDGV 172

Query: 189 TARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI---GAYIVADDKVYRSLTTGRSG 245
              +T+   +F+P   G    L       + + + +    G +  A   V    T   S 
Sbjct: 173 NVTMTASAALFQPGHVGSYWELRHLKEAASTSVSLATTSGGPFHSAAISVQGDWTA-NST 231

Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRE-SASGAVAP-------YYVWGD------ 291
           +R+  +       D   TW TV   ++++ R  SASG           Y   GD      
Sbjct: 232 ERWYGTLSIERSLDGGTTWETVRKFTAESDRNISASGHQEELAQFRLKYQPTGDPFGAGV 291

Query: 292 -------------------------IKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326
                                    +  V+    S  V            +   W  SAW
Sbjct: 292 WVGKAPTNYVKARAMLETTDAYVTALVKVTAYTDSTHVKVTVIDKAATVAATDIWCESAW 351

Query: 327 GEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVT 386
               G+P  +  +  RL+F G++    +++ S    F +F         D   A+     
Sbjct: 352 SPYRGFPRTIGLYEQRLIFGGTRHQPNTMWGSKTDDFENFK-----YGEDDDAAVAYTFA 406

Query: 387 DFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL---SIDFRRVSGSGVYACPPVSVG 443
               + + W+                + + +  + L   +I  R  S +G     PV V 
Sbjct: 407 ASEQNNVQWVESLKRIQAATTAREFTVAAGNTDEPLTPSNIVVRSESANGAAHLQPVLVN 466

Query: 444 DCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLE 502
           D +++V    R++  ++ S E  G+   ++T LA  +    + QL +  +P  ++  V E
Sbjct: 467 DAILYVERQSRKVMEMAYSIEKDGYASVDLTLLAAPVTESGVKQLAFARQPDPLLLAVTE 526

Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-A 561
             +     L    +    +   AW   + +      S A+          +W +V  +  
Sbjct: 527 NGN-----LAVLTYDRP-QDVTAWARWITNGAF--ESVATLQG--TPEDEIWAVVRRTIG 576

Query: 562 GEERSFTVRLNLLDDFK 578
           G       RL    D K
Sbjct: 577 GVPVRTIERLTPETDSK 593


>gi|262043403|ref|ZP_06016528.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039229|gb|EEW40375.1| hypothetical protein HMPREF0484_3546 [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 664

 Score =  391 bits (1003), Expect = e-106,   Method: Composition-based stats.
 Identities = 107/583 (18%), Positives = 200/583 (34%), Gaps = 108/583 (18%)

Query: 3   NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62
                K +F+AGE+SPRL+  R D+  +A G     N + +  G ++  P  Q     + 
Sbjct: 2   RANLIKTNFTAGEISPRLM-GRVDIDRYANGAKTLENSVVVVQGGVMRRPGSQFVAATKY 60

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
             + +R+  +        +L FGD  L+I                   +PYT     S+ 
Sbjct: 61  GDKKSRLIPYVFNRTQAYILEFGDGYLRIYQDGKQLVNDD-NTPYEIASPYTSDMLPSVN 119

Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDK-------ISFTFDEIKFLPPPWLGDGMISGVK 175
           Y     T   VH+D  P+ L      D        I   FDE++  P  W    +   V 
Sbjct: 120 YVQGADTMFLVHQDVKPYRLQRRGQTDWVLEPAPFIVEPFDEVRDTPQKWCKPSVKEFVG 179

Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS--IGAYIVADD 233
           S   L           ++ D       D          PP +  +      +G+Y+  + 
Sbjct: 180 SEITL----------TLSDDEPPEGSED----------PPPFTGDGWVPEDVGSYVRINS 219

Query: 234 KVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIK 293
                                            ++ + S TS + A G +          
Sbjct: 220 --------------------------------GLVLIKSVTSAQVAVGTIRT-------- 239

Query: 294 DVSKDGRSISVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGD 351
           D+S                    S  +W    S W ++ GYP  VT +  RL+ +GS   
Sbjct: 240 DLSAT---------------QAASPGAWTREDSVWTDEFGYPGAVTLYQQRLVLAGSPRY 284

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSL 411
             +++ S  G +  F L       D   A++  ++    + I  +      + +      
Sbjct: 285 PQTIWWSESGVYLSFELGT-----DDDDAISFTLSSDQLNPIVHLAQMNTLIALTYGGEF 339

Query: 412 WLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GF 467
            + + + +     +I  +  S  G     PV VG  ++FV   GR++  ++   +    +
Sbjct: 340 TITAGNDAAITPTNISVKNPSPYGCNGIRPVRVGTEIMFVQRSGRKLYAVAYDPDSYVAY 399

Query: 468 RFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527
             N++T LA+H+    ++ + YQ++P +  W+V          ++        +   AW 
Sbjct: 400 SANDMTVLAEHITEGGVIDMAYQQQPDAFTWLVRNDG-----VMVTMAIDR-AQNVVAWS 453

Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTV 569
             + S      S A+ P+       ++ +V  +  G+   +  
Sbjct: 454 RQITSGAF--ESVATIPSAT--DDVVYAIVRRTVNGQTVRYVE 492


>gi|303328570|ref|ZP_07359005.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302861336|gb|EFL84275.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 696

 Score =  390 bits (1001), Expect = e-106,   Method: Composition-based stats.
 Identities = 105/582 (18%), Positives = 193/582 (33%), Gaps = 88/582 (15%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
            ++  + GE++P L++ R D   +  G  + RN +P+  G +   P  +       D  +
Sbjct: 6   IQNVLNGGEITP-LMRGRVDQPRYGTGAREMRNFVPMPQGGVTRRPGTRFLGMAHGD--A 62

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
            R+  F        +L FGDK L++ +             K +++PY   D   L +A  
Sbjct: 63  ARLIPFVFSATQGRMLEFGDKTLRVWLPDGRLVADENGEPKVFESPYAVGDLHELRFAQS 122

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGV------KSNAKL 180
                  H+ + P  L    D D   + + E+ F+P     D +   V        NA  
Sbjct: 123 ADVVYLAHQGYAPRRLSRHADDD---WRWSELAFVPAIAAPDNVSLQVIDRGYNGDNATR 179

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
             + A T+    T                                               
Sbjct: 180 VYTYAVTAVDEKTGQESGAGAE-----------------------------------VSI 204

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
           T ++ +   Y   A +       +  V             G +          D +    
Sbjct: 205 TAKALNSVSYIIRAAWPAVEGAAYYRVYKKKYGV-----FGYIGRSDAECSFDDENIGAD 259

Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
           +    P+ +  F +                 +PS V FH  RL ++ +    ++++LS  
Sbjct: 260 TEDTPPEHKNPFASEGD--------------WPSQVFFHQQRLGWAATANRPITIWLSRP 305

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS- 419
           G F   +        D   A+   +    A+ I W+ P  + +  G + S W LS     
Sbjct: 306 GDFEIMAASTPPKDDD---AIEATLAATQANRIVWLQPDRQSLTFGTEGSEWTLSAGEGV 362

Query: 420 --KGLSIDFRRVS-GSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQL 475
                ++ F   +   G  A   VSVG  ++++   G+ ++  + +     +   ++T L
Sbjct: 363 ALTPSNVSFEMQTANGGDNATQAVSVGGGVLYLQRGGKAVRQFAYNYSADKYLGQDVTIL 422

Query: 476 ADHLFNQRIL-QLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534
           A H+    ++    +Q+EP++++W  L     S   L G  +  E +    WH H    +
Sbjct: 423 ARHILRDAVVTAWAFQQEPYAVLWCAL-----SDGTLAGLTYMPE-QDVMGWHRHDTDGR 476

Query: 535 HYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576
                 A+ P         W LV    G       RL+   D
Sbjct: 477 F--EDVAAMPG--TPDDQTWFLVRRGCG---LCVERLDSFFD 511


>gi|78357587|ref|YP_389036.1| hypothetical protein Dde_2545 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
 gi|78219992|gb|ABB39341.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
          Length = 700

 Score =  389 bits (1000), Expect = e-106,   Method: Composition-based stats.
 Identities = 114/591 (19%), Positives = 199/591 (33%), Gaps = 91/591 (15%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRD- 59
           M   T T++SF+ GELSP LL SR D   +  G    RN+    +G  V  P M+     
Sbjct: 1   MSRITLTRNSFNGGELSP-LLSSRIDQQRYTAGCRTLRNMTVYPHGAAVRRPGMRHMGTG 59

Query: 60  ---CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFK 116
                    + R+  F        +L  G+  +++         +        +TP+   
Sbjct: 60  LSLQPAGSAAVRLVPFVFSQEQAYVLELGEGVMRVWKDDGLVVSADGS-PVCVETPWKGD 118

Query: 117 DNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS 176
             +SL+Y         V +   P  L      D   +    ++F        G+ +    
Sbjct: 119 ALQSLQYCQSADVMYLVCRQCAPRKLARHAHDD---WRITLLEFGAGLPAPQGLTAAAGG 175

Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236
            A+   +   T+ A    +  +                       + ++   +       
Sbjct: 176 AAEREYAYVVTAVAPDGGEESLPSEAVNVT------------AAASLNVRDMVR------ 217

Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296
                                    +TW  V    +    +S +G  +  Y+        
Sbjct: 218 -------------------------LTWQPVEGAGAYCVYKSIAGGGSYGYI-------- 244

Query: 297 KDGRSISVAP-QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV 355
             G++  V   + +            + + +  +  +P  V F+  RL F+G+     ++
Sbjct: 245 --GKAAGVPAYEDRGAEPDFGQGPPEYRNPFDGEGRWPGCVQFYQQRLCFAGTDEKPQTI 302

Query: 356 YLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLS 415
           + S    +   ++       D   A+T  +     + I WM P    +LVG     W LS
Sbjct: 303 WCSQSANYESMNISSPLRDDD---AVTVTIAADRVNRIRWMMP-ARRLLVGTAGGEWQLS 358

Query: 416 ISLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471
            S    L+      RR +  G     P+ +G  ++FV   GR ++    + E  G+   +
Sbjct: 359 GSGDAPLTPVDAQLRRDTMHGSAGLMPLVIGQSILFVQRDGRTVREFRYALESDGYDAGD 418

Query: 472 ITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530
           +T LA+HL   +RI+   YQ+ P S+VW  L     S   L    F  E E    WH H 
Sbjct: 419 LTILAEHLMRGRRIVSWCYQQSPASVVWCAL-----SDGTLAAMTFLREHE-VVGWHRHD 472

Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVAL---------SAGEERSFTVRLN 572
                +V +  + P D   G  +W+ V           +  EE     RL 
Sbjct: 473 TDG--FVEAVTAIPGDE--GDEVWLSVRRVRVLHDENGTRQEEVRSIERLE 519


>gi|169795391|ref|YP_001713184.1| phage-like protein [Acinetobacter baumannii AYE]
 gi|169148318|emb|CAM86183.1| hypothetical protein; putative phage related protein [Acinetobacter
           baumannii AYE]
          Length = 697

 Score =  389 bits (999), Expect = e-106,   Method: Composition-based stats.
 Identities = 124/565 (21%), Positives = 206/565 (36%), Gaps = 67/565 (11%)

Query: 6   WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65
             K++ S+GELSP LL +R D+  +A G  K  N +PL  G     P  +      +   
Sbjct: 10  ILKNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRS---IFAG 65

Query: 66  SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNKSLEYA 124
           + R+  F        LL+ G   L++   R+              TPY T +  + ++YA
Sbjct: 66  ALRLIPFIANSENTYLLILGVSFLKVYNPRTYAV------VYETVTPYNTAQKVREVQYA 119

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                  FV  D P   LL   D     F        P   LG        S   +++S 
Sbjct: 120 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 171

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244
           + T   ++ S                    P W+    Y  G  ++ + K +R+     +
Sbjct: 172 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHNSKTWRA-----T 212

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304
            D  G    AT  +     W  V N ++     ++ G++    + G    +++      V
Sbjct: 213 ADNKGVEPSATTPE-----WEEVTNEAANVFTPASVGSIVE--INGGQVKITEYVDPSRV 265

Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
             +      + V  +  SW +   A+  + GYP  V F   RL+F+ +K     ++ S  
Sbjct: 266 NGEVLVKLTSDVQAIAKSWVLKSIAFSAEAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 325

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420
           G   +F             A + A +   +  I  +   G  V +       + S     
Sbjct: 326 GDDGNF-----LETTQDADAFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQGPLT 380

Query: 421 GLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478
             S      +  GV     P  VG+ L+FV   G R++ +S   E  G    E++Q+A H
Sbjct: 381 PASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPELSQIAPH 440

Query: 479 LFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536
           +      I +L +Q+ P+SIVW+V+     S   L         +   AW  H    +  
Sbjct: 441 IPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ-- 492

Query: 537 VLSAASFPNDNRGGTSLWMLVALSA 561
           VLS  + P    G    +ML   + 
Sbjct: 493 VLSICALP-TGLGEDQCFMLTNRNG 516


>gi|212703239|ref|ZP_03311367.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098]
 gi|212673505|gb|EEB33988.1| hypothetical protein DESPIG_01281 [Desulfovibrio piger ATCC 29098]
          Length = 694

 Score =  389 bits (999), Expect = e-106,   Method: Composition-based stats.
 Identities = 111/574 (19%), Positives = 199/574 (34%), Gaps = 82/574 (14%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
           T++  + GE+SP LL+ R D   ++ G  + RN +P+  G +   P  +       D   
Sbjct: 6   TQNVLNGGEISP-LLRGRVDQPRYSTGAREMRNFVPMPQGGVTRRPGTRYLGTALGDGG- 63

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
            R+  F        +L FGD+ +++ +             K +++P+   D +++ YA  
Sbjct: 64  -RLVPFVFSATQGRMLEFGDRAMRVWLPDGRVVADEEGAPKIFESPFAAADLRAVRYAQS 122

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186
                F H  + P  L    D D   + + E+ F+P           + +  K ++S   
Sbjct: 123 ADVIYFAHPGYAPRKLARHADDD---WRWSELTFMPA----------IATPKKPALSTVG 169

Query: 187 T--STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244
           T     +      +    DKG+       P E A  +  ++ +                 
Sbjct: 170 TPEGDKKTDYTYCVTAIDDKGQ----ESSPSEPASISAQALNS----------------- 208

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304
                +    ++      T   V             G     Y    I D +    +   
Sbjct: 209 ---VDFHIRISWEAVEGATGYRVYKKKMGVFGYIGKGGADETY----IDDKNIGADTEDT 261

Query: 305 APQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFY 364
            P+ +  F+   +              YPS V FH  RL F+ S    ++++LS  G F 
Sbjct: 262 PPEYEDPFEGEGN--------------YPSQVFFHQQRLGFAASNSRPITIWLSRSGEFE 307

Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSI 424
             +        D   A+   +    AS I W+ P    +  G + S W L  S    L+ 
Sbjct: 308 SMAKSTPPKDDD---AIEVTLAATQASRIVWLQPDRSALAFGTEGSEWTLEPSEGVALTP 364

Query: 425 DFRR----VSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHL 479
                    +  G  A   +SVG  +++V      I+  + +     +   ++  LA H+
Sbjct: 365 ATASFQLQTTNGGSDAVAALSVGGSVLYVQRGAGAIREFAYNYSADKYLGQDLNILARHM 424

Query: 480 FNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538
                ++   +Q+EP++++W VL     S   L G  +  E +    WH H  +      
Sbjct: 425 LRDVDVVAWSWQQEPYAVLWSVL-----SDGTLAGLTYMKE-QEIVGWHRHTTAGDFVD- 477

Query: 539 SAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572
             A  P        +W LV       + F  RL 
Sbjct: 478 -VAGIPG--TPDDQVWFLVRRGG---QVFVERLE 505


>gi|332875218|ref|ZP_08443051.1| carbohydrate binding domain protein [Acinetobacter baumannii
           6014059]
 gi|332736662|gb|EGJ67656.1| carbohydrate binding domain protein [Acinetobacter baumannii
           6014059]
          Length = 692

 Score =  388 bits (996), Expect = e-105,   Method: Composition-based stats.
 Identities = 125/565 (22%), Positives = 203/565 (35%), Gaps = 67/565 (11%)

Query: 6   WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65
             K++ S+GELSP LL +R D+  +A G  K  N +PL  G     P  +      +   
Sbjct: 5   ILKNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRS---IFAG 60

Query: 66  SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNKSLEYA 124
           + R+  F        LL+ G   L++   R+              TPY T +  + ++YA
Sbjct: 61  ALRLIPFIANSENTYLLILGVSFLKVYNPRTYAV------VYEAVTPYNTAQKVREVQYA 114

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                  FV  D P   LL   D     F        P   LG        S   +++S 
Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244
           + T   ++ S                    P W+    Y  G  ++   K +R+      
Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHTSKTWRATI---- 208

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304
            D  G    AT  +     W  V N ++     S+ G++    + G    +++      V
Sbjct: 209 -DNKGVEPSATTSE-----WEEVTNEAANVFTPSSVGSIVE--INGGQVKITQYVDPSRV 260

Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
             +      + V  +  SW +   A+    GYP  V F   RL+F+ +K     ++ S  
Sbjct: 261 NGEVLVKLTSTVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 320

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420
           G   +F             A + A +   +  I  +   G  V +       + S     
Sbjct: 321 GDDGNF-----LETTQDADAFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQGPLT 375

Query: 421 GLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478
             S      +  GV     P  VG+ L+FV   G R++ +S   E  G    E++Q+A H
Sbjct: 376 PASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPELSQIAPH 435

Query: 479 LFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536
           +      I +L +Q+ P+SIVW+V+     S   L         +   AW  H    +  
Sbjct: 436 IPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ-- 487

Query: 537 VLSAASFPNDNRGGTSLWMLVALSA 561
           VLS  + P    G    +ML   + 
Sbjct: 488 VLSICALP-TGLGEDQCFMLTNRNG 511


>gi|293609614|ref|ZP_06691916.1| predicted protein [Acinetobacter sp. SH024]
 gi|292828066|gb|EFF86429.1| predicted protein [Acinetobacter sp. SH024]
          Length = 692

 Score =  387 bits (994), Expect = e-105,   Method: Composition-based stats.
 Identities = 125/565 (22%), Positives = 202/565 (35%), Gaps = 67/565 (11%)

Query: 6   WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65
             K++ S+GELSP LL +R D+  +A G  K  N +PL  G     P  +      +   
Sbjct: 5   ILKNNLSSGELSP-LLWTRTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRS---IFAG 60

Query: 66  SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNKSLEYA 124
           + R+  F        LL+ G   L++   R+              TPY T +  + ++YA
Sbjct: 61  ALRLIPFIANSENTYLLILGVSFLKVYNPRTYAV------VYETVTPYNTAQKVREVQYA 114

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
                  FV  D P   LL   D     F        P   LG        S   +++S 
Sbjct: 115 HTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELG--------STPNVALSP 166

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS 244
           + T   ++ S                    P W+    Y  G  ++   K +R+      
Sbjct: 167 SGTEVGKVIS--------------LTASSFPNWSNTETYLTGDRVIHSGKTWRATI---- 208

Query: 245 GDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISV 304
            D  G    AT  +     W  V N ++     S  G++    + G    +++      V
Sbjct: 209 -DNKGVEPTATTSE-----WEEVTNEAANVFTPSNVGSIIE--INGGQVKITQYVDPSRV 260

Query: 305 APQSQTLFQAGVSVV--SWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
             +      + V  +  SW +   A+    GYP  V F   RL+F+ +K     ++ S  
Sbjct: 261 NGEVLVKLTSAVQAIAKSWVLKSIAFSATAGYPKAVCFFKQRLVFANTKTSPNQMWFSRI 320

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK 420
           G   +F             A + A +   +  I  +   G  V +       + S     
Sbjct: 321 GDDGNF-----LETTQDADAFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQGPLT 375

Query: 421 GLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADH 478
             S      +  GV     P  VG+ L+FV   G R++ +S   E  G    E++Q+A H
Sbjct: 376 PASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLISPELSQIAPH 435

Query: 479 LFNQ--RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536
           +      I +L +Q+ P+SIVW+V+     S   L         +   AW  H    +  
Sbjct: 436 IPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITL------NRDQEMNAWSQHDFGGQ-- 487

Query: 537 VLSAASFPNDNRGGTSLWMLVALSA 561
           VLS  + P    G    +ML   + 
Sbjct: 488 VLSICALP-TGLGEDQCFMLTIRNG 511


>gi|332160974|ref|YP_004297551.1| hypothetical protein YE105_C1352 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|325665204|gb|ADZ41848.1| Hypothetical phage protein [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|330862130|emb|CBX72294.1| hypothetical protein YEW_AK02310 [Yersinia enterocolitica W22703]
          Length = 657

 Score =  386 bits (991), Expect = e-105,   Method: Composition-based stats.
 Identities = 105/581 (18%), Positives = 210/581 (36%), Gaps = 105/581 (18%)

Query: 3   NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62
                K +F+AGE+SPRL+  R D++ +A G     N + + +G ++  P  +     + 
Sbjct: 2   RANLIKTNFTAGEISPRLM-GRVDIARYANGAKTVENAVCVIHGGVMRRPGSRFAAKAKF 60

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
             +  R+  +        +L FG+  ++        +           +PYT     SL 
Sbjct: 61  GDQKARLIPYVFNRSQAYVLEFGNGYVRFYQN--GAQIGAGSTPYEIASPYTSAMLSSLN 118

Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182
           Y     T   VH+D PP+ L      D +          P P+                 
Sbjct: 119 YVQGADTMFLVHQDVPPYRLQRKGQTDWV--------LEPAPF----------------- 153

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242
                          I KP D+ R       P +W K    S+  ++     +  +L+  
Sbjct: 154 ---------------IVKPFDEIRDT-----PEKWCKP---SVKEFV--GSAITLTLSDA 188

Query: 243 RSGDRFGYSKGATYVKDNNITWI----TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298
            SG   G   GA +V  +  +++     ++++ + TS   A+G +               
Sbjct: 189 ESG---GALTGAGWVGADVGSYVRINSGLVHIQAVTSAAVATGVI--------------- 230

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356
            R++  A QS        S  +W    + W  + GYP   T +  RL+ +GS     +++
Sbjct: 231 -RTVLSAVQSS-------SPGAWTREDAVWSAEFGYPGAATLYQQRLVLAGSPKYPQTIW 282

Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416
           +S  G +  F L       D   A++  V+    + I  +      + +       +   
Sbjct: 283 MSETGIYLSFELGT-----DDDDAISFTVSSDQINPIVHLAQMNTLIALTSTGEFTITGG 337

Query: 417 SLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEI 472
             S     +I  +  S  G  +  PV VG  ++F+    R++  ++   +    +  N++
Sbjct: 338 GESAITPTNISVKNPSPYGCNSIKPVRVGTEIMFMQRANRKLFAVAYDPDSFVAYSANDL 397

Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532
           + L++H+     + + YQ+EP + +W+       +  +L         +   AW   + +
Sbjct: 398 SVLSEHITLSGAVDMAYQQEPDAFIWMTR-----ADGQLAVATIDR-AQDVIAWSRQVTT 451

Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572
             +   S  + P        +++LV     G+   +    +
Sbjct: 452 GAY--ESVVTIPAST--NDVVYVLVKRVINGQIVRYVEVFD 488


>gi|295096862|emb|CBK85952.1| hypothetical protein ENC_24250 [Enterobacter cloacae subsp. cloacae
           NCTC 9394]
          Length = 662

 Score =  379 bits (972), Expect = e-103,   Method: Composition-based stats.
 Identities = 102/577 (17%), Positives = 197/577 (34%), Gaps = 92/577 (15%)

Query: 3   NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62
                K +F+AGE+SPRL+  R D++ +A G     N + +  G +V  P  +     + 
Sbjct: 2   RANLIKTNFTAGEVSPRLM-GRVDIARYANGAKIIENAVVVVQGGVVRRPGTRFAAATKH 60

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
             + +R+  +        +L FGD  ++I                   +PYT     ++ 
Sbjct: 61  GDKKSRLIPYVFNRSQAYMLEFGDGYMRIFQNGKQLVNED-NTPYEIASPYTADMLPAVN 119

Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182
           Y     T   VH+   PH L      D +          P P+                 
Sbjct: 120 YVQGADTMFLVHQSVKPHRLQRRGQTDWV--------LEPAPF----------------- 154

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242
                          I +P D+ R       P +W K    S+  ++ ++      +T  
Sbjct: 155 ---------------IVEPFDEVRDT-----PQKWCKP---SVKEFVGSE------ITLT 185

Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302
            S    G ++   +       W   +     +      G V    +           +  
Sbjct: 186 LSDADPGDNETPPFTGAG---W---VAQDVGSYVRINEGLVLIKSIT--------SAQVA 231

Query: 303 SVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
               +S        S  SW    S W  + GYP  VT +  RL+ +GS     +++ S  
Sbjct: 232 VGTIRSDLSATQAASPGSWTREDSVWTNEFGYPGAVTLYQQRLVLAGSPKYPQTIWWSET 291

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS- 419
           G +  F +  E        A++  ++    + I  +      + +       + S + + 
Sbjct: 292 GVYLSFEIGTE-----DDDAISFTLSSDQLNPIVHLAQMNTLIALTYGGEFTITSGNDAA 346

Query: 420 -KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLA 476
               +I  +  S  G     PV VG  ++FV   GR++  ++   +    +  N++T LA
Sbjct: 347 ITPTNISVKNPSPYGCNGIRPVRVGTEIMFVQRAGRKLYAVAYDPDSFVSYSANDMTVLA 406

Query: 477 DHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536
           +H+    +L + YQ++P + +W+V          +         +   AW   + +    
Sbjct: 407 EHITAGGVLDMAYQQQPDAFIWMVRADG------VAVTMAIDRAQDVIAWSRQVTAGAF- 459

Query: 537 VLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRLN 572
             S A+ P+D      ++ +V     G+   +    +
Sbjct: 460 -ESVATIPSDT--DDVVYAIVRREINGQTVRYVEVFD 493


>gi|303327644|ref|ZP_07358084.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302862005|gb|EFL84939.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 681

 Score =  376 bits (964), Expect = e-102,   Method: Composition-based stats.
 Identities = 119/588 (20%), Positives = 207/588 (35%), Gaps = 122/588 (20%)

Query: 10  SFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRV 69
           +F+ GE++P L  +R DL  +A  +    N +P  +G     P      +         +
Sbjct: 7   NFTGGEVTPTL-SARYDLGRYANSLKIMENFLPNLHGDAYRRPGTYFLENL---GEGCVL 62

Query: 70  FSFSIPD--GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFG 127
             FS     G    L FG+K L+IV V               ++PY   D   + YA  G
Sbjct: 63  LPFSFNAEAGQNFALAFGEKSLRIVNVNGYVVAEA------MESPYALADVPEISYAQVG 116

Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP---------PPWLGDG-------MI 171
                 HKD+  H ++        +++   +               W G G        +
Sbjct: 117 DVVYLAHKDYALHKVVRTGSAPAYAWSIGTVALNTSLAAPAAPTAAWQGGGGSYTLRYKV 176

Query: 172 SGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV- 230
           S V ++ K S+  A  STA                    G +P +W +  +  +    V 
Sbjct: 177 SAVDADGKESLPSAVGSTAS-------------------GKYPTDWTEGNHCVLSWQAVE 217

Query: 231 --ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYV 288
             A+  +YR  + G  G   G ++G ++   N                  A  A  P   
Sbjct: 218 GAAEYNIYRE-SAGYYG-FIGIAQGTSFDDQNY----------------EADIADTPKED 259

Query: 289 WGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS 348
           W    D +                                    P  VTFH  R++ +G+
Sbjct: 260 WDPFADGNN-----------------------------------PGTVTFHQQRMVLAGT 284

Query: 349 KGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCD 408
           +    S Y+S  G F +F         DP   +   +   +   I W   FG+ +L+G  
Sbjct: 285 RNSPQSFYMSRTGDFENFRKSRPLQDDDP---VEYQLASGTVDGIVWAASFGD-LLLGTA 340

Query: 409 TSLWLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-Q 465
           ++ +  +         +      S  G     P+ +G+ ++     G R++ +  S E  
Sbjct: 341 SAEYKATGDNGAITAKNCTITAQSYWGSAKIAPIIIGNSVMHCQRHGSRVRDLYYSLEKD 400

Query: 466 GFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF 524
           G+  N+++ LA HLF+   I Q  +Q+ P S++W+V +        LL   +  E +  +
Sbjct: 401 GYAGNDLSVLAPHLFDGHTIRQWAFQQTPGSVLWLVRDDG-----VLLALTYMKE-QDIW 454

Query: 525 AWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571
            W   +   +  V S A+   +N     L ++V  S  G  + +  RL
Sbjct: 455 GWSRQITDGR--VRSVAALSGENA--DELLLVVERSVDGARKYYLERL 498


>gi|85059168|ref|YP_454870.1| hypothetical protein SG1190 [Sodalis glossinidius str. 'morsitans']
 gi|84779688|dbj|BAE74465.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans']
          Length = 662

 Score =  370 bits (949), Expect = e-100,   Method: Composition-based stats.
 Identities = 100/577 (17%), Positives = 188/577 (32%), Gaps = 92/577 (15%)

Query: 3   NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62
                K +F+AGE+SPRL+  R D+  +A G    +N + +  G ++  P  +     + 
Sbjct: 2   RANLIKTNFTAGEVSPRLM-GRVDIMRYANGAKAIQNGVVVVQGGVMRRPGTRFAAAAKY 60

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
             R  R+  +        +L FGD  L++   +     +         +PY+     S+ 
Sbjct: 61  SDRPARLIPYVFNRSQAYVLEFGDGYLRVY-QKGKPVVNANNTPYEIASPYSADRLPSVN 119

Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182
           Y     T   VH    P+ L      D +          P P+                 
Sbjct: 120 YVQGADTMFLVHPAVKPYRLQRRGQTDWV--------LEPAPF----------------- 154

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242
                          I +P D+ R       P +W +                       
Sbjct: 155 ---------------IVEPFDEIRET-----PKKWCR----------------------- 171

Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302
            S   F  S+    + D +         +         GA         +       +  
Sbjct: 172 PSAKEFVGSEVTLTLSDADPGENRNPPFTGAGWVAQDVGAYVRINGGLVLIQRIDSAQVA 231

Query: 303 SVAPQSQTLFQAGVSVVSWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
               +S    +   S  SW    S W +  GYP  VT +  RL+ +GS     +++ S  
Sbjct: 232 VGTLRSDLNAKQAASPGSWTREESVWTDNLGYPGAVTLYQQRLVLAGSPKYPQTIWWSET 291

Query: 361 GAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS- 419
           GA+  F L  +        A++  ++    + I  +      + +       + S + + 
Sbjct: 292 GAYLSFELGTK-----DDAAISFTLSSDQLNPIVHLAQMNTLIALTYGGEFTITSGNDAA 346

Query: 420 -KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ--GFRFNEITQLA 476
               +I  +  S  G     P+ VG  ++F+   GR++  ++   +    +  N++T LA
Sbjct: 347 ITPTNISVKNPSPYGCNRIRPLRVGTEILFIQRAGRKLYAVAYDPDSFVSYAANDLTVLA 406

Query: 477 DHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536
           +H+    +  + YQ++P  ++W+V E        +         +   AW   M      
Sbjct: 407 EHITAGGVRDMAYQQQPDGLIWLVREDGVAVTVTM------DRAQDVVAWSRQMTEGAF- 459

Query: 537 VLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572
             S  S P++      L+ LV     G    +    +
Sbjct: 460 -ESVTSIPSER--DDVLYALVRRHINGHTVRYVEVFD 493


>gi|220918520|ref|YP_002493824.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219956374|gb|ACL66758.1| hypothetical protein A2cp1_3428 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 825

 Score =  357 bits (915), Expect = 3e-96,   Method: Composition-based stats.
 Identities = 131/635 (20%), Positives = 217/635 (34%), Gaps = 101/635 (15%)

Query: 8   KHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD---- 63
           + SF+AGEL PRL   R DL+ +  G+ ++RN      G  ++ P     R+ +      
Sbjct: 8   QGSFAAGELGPRLH-GRHDLAKYQVGLRRARNFFLSPEGAALNRPGTPFVREAKDSAAGV 66

Query: 64  PRSNRVFSFSIPD--GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK--TPYTFKDNK 119
            R  R+  F   +  G    L FG   ++  V   +T   P    + Y+  TPY   D  
Sbjct: 67  DRGARLIPFIFSEDLGQAYELEFGQGYVRFHV-GGATIADPLNSAQPYELATPYLAADLP 125

Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLY--IQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177
            L+YA  G       K + P  L        + +  +FD        +LG   +  V ++
Sbjct: 126 RLKYAQQGDVVTLTCKGYDPRELRRLAHDSWELVPLSFDVPAPNGVVYLGVEALENV-AD 184

Query: 178 AKLSISQADTSTARITSDMKIFK---PLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234
           A     Q       I  D    +      + R I +G     W     Y +GA +    +
Sbjct: 185 ATHPARQWAWQVTEIWEDESGLQWETSPLRVRKIAVGAGAT-WHTGFTYPLGACVSYAGQ 243

Query: 235 VYRSLTTGRSGDRF-----GYSKGATYVKDNNITWI-------------TVLNLSSKTSR 276
            ++S+     G        G    ATY     +  +              V+    +T +
Sbjct: 244 FWQSVIADNRGHVPEAVMVGDPPAATYPYWTPVGAVPDPFAVYESNAPTDVVLFPDRTIK 303

Query: 277 ESASGA-----------------------------VAPYYVWGDIKDVSKDGRSISVAPQ 307
             ASGA                             VA +   GD  D+S         PQ
Sbjct: 304 LWASGAWTGVDGSRLVGRRVYRGRGTVFGYVGEFEVAEFRDTGDTPDLSYS------PPQ 357

Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367
            +  F                    PS VTFH  R    G+       +LS  G +Y+F 
Sbjct: 358 GRNPFTVFGPAGEVVRLE------QPSVVTFHAERRSLLGTAQRPAHAFLSRTGDYYNFD 411

Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL---SISLSKGLSI 424
                   D   A    +       + W       +L+G  + +W +   S  +      
Sbjct: 412 RHTPALVDD---AFELELAGRLREEVRWAV-GAAALLIGTQSGVWAIRPPSGEVLGPGKA 467

Query: 425 DFRRVSGSGVYACPPV----SVGDCLVFVCGVGRRIKYISG-STEQGFRFNEITQLADHL 479
                S +G     P+    +VGD +++V   G  ++ +      QGF  ++++ LA HL
Sbjct: 468 TAVPQSSAGSSYLDPLVVPSAVGDAVLYVRTKGSGVRDLVYDDGRQGFVGSDLSLLAKHL 527

Query: 480 FNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538
           F    I    +QE+P S+ W+V      S  +LL   +  + +  +AW  H       V 
Sbjct: 528 FTGYSIKAWTFQEDPWSVAWLVR-----SDGKLLSLTYVRD-QEVWAWAWHDTQG--IVE 579

Query: 539 SAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571
              + P       +++++V      G    +  R+
Sbjct: 580 DVCAIP--EGTEDAVYLIVKRQIGDGTWHRYVERM 612


>gi|220903983|ref|YP_002479295.1| hypothetical protein Ddes_0709 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
 gi|219868282|gb|ACL48617.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
          Length = 689

 Score =  356 bits (913), Expect = 6e-96,   Method: Composition-based stats.
 Identities = 122/584 (20%), Positives = 206/584 (35%), Gaps = 103/584 (17%)

Query: 9   HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68
           ++F+ GE++P L  +R DL+ +   ++   N++P  +G     P  +   +       + 
Sbjct: 8   NNFTGGEIAPTL-SARYDLARYRNCLSCMENMLPGLHGDTARRPGTRFVANL---DGHSV 63

Query: 69  VFSFSIP--DGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
           +  FS         +LVFG   L I   +              +TPY   + + + YA  
Sbjct: 64  LIPFSFNALTSQNFVLVFGSHCLHIAGEQG------LENIPVIETPYAPGELQDISYAQV 117

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKIS--------FTFDEIKFLP---PPWLGDGMISGVK 175
           G T    H +HP H ++     +  +        ++ +++        P L     SG  
Sbjct: 118 GDTVYLAHSNHPLHKVVRRDAPENRTQFEEAAYAWSLEKVALNASLAAPELPSVTFSGSA 177

Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIV---AD 232
            +  L    A    A   S          GR      HP +W +  + +I    V    +
Sbjct: 178 GSYTLRYKVAAVDAAGRESLPSPAGQCANGR------HPSDWVQGNSAAISWAAVEGAVE 231

Query: 233 DKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI 292
             +YR    G  G   G S G  +   N                  A  A  P   W   
Sbjct: 232 YNIYRE-EAGYFG-FIGVSGGLNFNDQNY----------------QADTADTPKEDWDPF 273

Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352
            D +                                   YP  V FH  R++ + +  + 
Sbjct: 274 ADGN-----------------------------------YPGIVAFHQQRMVLAATPKNP 298

Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412
            + Y+S  G F +F         DP + L   +   S   + W   FG+ +L+G   S +
Sbjct: 299 QAFYMSRVGDFENFRKSRPLQDDDPVEYL---IASGSIDAVTWAASFGD-LLIGTSGSEY 354

Query: 413 LLSISLS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFR 468
             S          +I     S  G     P+ +G+ ++ V   G R++ +  S E  G+ 
Sbjct: 355 KASGGDGASITAGNISITAQSYWGSAGLAPIIIGNSILHVQRHGSRVRDLFYSLEKDGYA 414

Query: 469 FNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527
            N+++ +A HLF    ILQ  YQ+ P S +W V +        LL   +  E +  + W 
Sbjct: 415 GNDLSIMAPHLFEGHTILQWAYQQTPGSTIWCVRDDG-----LLLAFTYMKEHD-IWGWS 468

Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRL 571
             +   +  VLSAA+     +G T + +      G+ R F  RL
Sbjct: 469 RQITQGR--VLSAAAISG-EKGDTLMLVTERRIDGQPRIFLERL 509


>gi|167032763|ref|YP_001667994.1| hypothetical protein PputGB1_1755 [Pseudomonas putida GB-1]
 gi|166859251|gb|ABY97658.1| conserved hypothetical protein [Pseudomonas putida GB-1]
          Length = 774

 Score =  355 bits (910), Expect = 1e-95,   Method: Composition-based stats.
 Identities = 97/574 (16%), Positives = 194/574 (33%), Gaps = 79/574 (13%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
           T   + SFSAGE++P    +R DL+ +   +   RN + L  G   +    +   + +  
Sbjct: 2   TEVIQPSFSAGEVAPATY-ARVDLARYYTALKTCRNFVVLPEGGAQNRSGTRFITEVKDS 60

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
               R+  F        +L FG+  ++ + +         +      +PYT      L++
Sbjct: 61  AARTRLIPFQFSTEQTYILEFGNLYIRFISMGGQVVS--GVTPYEIASPYTTAQLPDLKF 118

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                    VH DHPP  L  +      ++T   I F P      G+++  ++      +
Sbjct: 119 TQSADVMTIVHPDHPPRELSRLAP---TNWTLTAITFEPGIAAPTGLVATARTGGSGDTT 175

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGR 243
           +      ++T+   I                  WA NT                      
Sbjct: 176 EYQ---YKVTAVSSI-----------SEGSVESWASNTATV------------------- 202

Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303
             + F    GAT                +K+S        +    + DI        ++ 
Sbjct: 203 --NSFDDKPGATLAWTAVAGADHYNVYKNKSSGVFGFIGQSAGVTFNDINITPATDNTV- 259

Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363
             P     F  G +               PS V ++  R+ F+ S+ +  +V++S  G F
Sbjct: 260 --PIGYNPFADGNN---------------PSVVGYYQQRMAFAASRANPQTVWMSRTGDF 302

Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS---K 420
           ++F         D    +   +     + I  +    E + +    +   ++ S      
Sbjct: 303 HNFGYSDPNKDDDG---IEFVIASRQVNQIRHLVSLRELLAM-TSGAEIAITGSSDSGIT 358

Query: 421 GLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHL 479
             ++     S  G     P    +  +++   G ++  ++ +    GF+  +++ L+ HL
Sbjct: 359 PANVSAVEQSYFGSSDVIPAIYANTALYIQARGGKLSTLAYNYVSDGFQPQDVSVLSSHL 418

Query: 480 FNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538
                I    +   P+ ++W+V          LLG  F  + +  + W  H       V 
Sbjct: 419 LRGFTIQDQAFALAPNGVLWLVRNDG-----MLLGFTFLPD-QQVYGWSWHDTDGA--VE 470

Query: 539 SAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571
           + AS P D+    +L+M+V  +  G  + +  R+
Sbjct: 471 AVASVPEDD--EDALYMIVRRTINGVTKRYIERM 502


>gi|212703338|ref|ZP_03311466.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098]
 gi|212673248|gb|EEB33731.1| hypothetical protein DESPIG_01381 [Desulfovibrio piger ATCC 29098]
          Length = 703

 Score =  337 bits (865), Expect = 2e-90,   Method: Composition-based stats.
 Identities = 126/591 (21%), Positives = 201/591 (34%), Gaps = 103/591 (17%)

Query: 9   HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68
           H+F+ GE+SP +L +R DLS +   V    N++P  +G +   P                
Sbjct: 6   HNFTGGEVSP-ILAARYDLSRYGSSVQCMENMLPGLHGDVRRRPGTLFLGSLE---GEAV 61

Query: 69  VFSFSIPD--GGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
           +  FS         +LV     L I  +    + + AL      TPY  +    +  A  
Sbjct: 62  LLPFSFNALAEQNFVLVLSGHSLCIADIHGFDRQTGALPRLP--TPYEARHLLEICAAQV 119

Query: 127 GSTAVFVHKDHPPHHLLYIQDGD-------------KISFTFDEIKFL---PPPWL-GDG 169
           G T    H  +P H L+     D                +T + +      P P      
Sbjct: 120 GDTVYLAHTAYPLHKLVRSTYSDPEAPLPDNAIRSHGYRWTLEAVALNSSLPAPQAPDCT 179

Query: 170 MISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYI 229
            + G   +              + ++ K     + G     G HP +W       I    
Sbjct: 180 FVRGNNDDDAGLGYTLRYKIVAVDANGKQSLASEAGSC--DGKHPSDWVVGNRTDISWTA 237

Query: 230 V---ADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
           V    +  +YR    G  G   G S G T+  +N                  A  A  P 
Sbjct: 238 VEGATEYNIYRE-EAGYYG-FIGVSSGTTFSDNNY----------------QADTADTPR 279

Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346
             W    D +                                    PS V FH  R++ +
Sbjct: 280 EDWDPFADGNN-----------------------------------PSVVAFHQQRMVLA 304

Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406
           G++    + YLS  G F +F         DP + L   +   S   I W   FG+ +L+G
Sbjct: 305 GTRDSPQAFYLSRSGDFENFRKSRPLQDDDPVEYL---IASGSIDAIAWAASFGD-LLLG 360

Query: 407 CDTSLWLLSISLS--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE 464
              S +  S + S     +I     S  G     P+ +G+ ++ V   G  ++ +  S E
Sbjct: 361 TSGSEYKASGNGSAITPGNITITAQSYWGSAGLAPIIIGNAILHVQRHGAHVRDLFYSLE 420

Query: 465 -QGFRFNEITQLADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEG 522
             G+  N+++ LA HLF   R+ Q  YQ+ P S++W+V +        LL   +  E + 
Sbjct: 421 KDGYAGNDLSILAPHLFEGHRLRQWAYQQTPGSVLWIVRDDG-----LLLALTYLKEHD- 474

Query: 523 DFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRL 571
            + W  H  +    VLS  S    +     L ++V    + G  R    RL
Sbjct: 475 IWGWSRHPTAG--EVLSVCSISGPD--SDELLLVVRRRDADGGSRYCLERL 521


>gi|41179374|ref|NP_958682.1| Bbp13 [Bordetella phage BPP-1]
 gi|45569506|ref|NP_996575.1| hypothetical protein BMP-1p12 [Bordetella phage BMP-1]
 gi|45580757|ref|NP_996623.1| hypothetical protein BIP-1p12 [Bordetella phage BIP-1]
 gi|40950113|gb|AAR97679.1| Bbp13 [Bordetella phage BPP-1]
          Length = 681

 Score =  311 bits (796), Expect = 2e-82,   Method: Composition-based stats.
 Identities = 100/577 (17%), Positives = 185/577 (32%), Gaps = 81/577 (14%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M N    + SF  GE+SP  +  R D   +  G+A  RN +    GP  +       R+ 
Sbjct: 1   MSNVRVLQRSFGGGEISPE-MFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREV 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +   +  R+  F+       ++  G    +      +              PY   D  +
Sbjct: 60  KDSAKKVRLIPFTYSVTQTMVIELGAGYFRFHTNGGTLL--DGAVPYEIANPYAEADLFN 117

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           + Y         VH ++ P  L  +      ++    I F  P          V +   +
Sbjct: 118 IHYVQSADVLTLVHPNYAPRELRRLG---ATNWQLATIAFTSP----------VATPTSV 164

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
           + +  +  T                                           D  YR + 
Sbjct: 165 TATSNNKGT-------------------------------------------DYTYRYVV 181

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD-G 299
           T    +    S           T     N  + T   SAS   + Y V+ +   +    G
Sbjct: 182 TALDAEGKTES---APSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGLYGYIG 238

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359
           ++   +     +          + + +     YP+ V++   R  F+G+     +++++ 
Sbjct: 239 QTTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTR 298

Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419
            G     S        D        V    A+ I  + P  E +L+       + S++  
Sbjct: 299 SGTESAMSYSLPVRDDDRVA---FRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355

Query: 420 --KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476
                +I  R  S  G     PV V +  ++    G  ++ ++ + +  GF   +++  A
Sbjct: 356 AVTPTTISVRPQSYVGATDVQPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRA 415

Query: 477 DHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535
            HLF+   IL + Y + P  IVW +     +S  +LLG  +  E +   AWH H      
Sbjct: 416 AHLFDNLDILDMAYAKAPQPIVWFI-----SSSGKLLGLTYVPE-QQIGAWHQHDTDG-- 467

Query: 536 YVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571
              S A           L+ +V  +  G E  +  R+
Sbjct: 468 VFESCAVV--AEGNEDRLYAVVRRTIGGNEVRYVERM 502


>gi|303257570|ref|ZP_07343582.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47]
 gi|302859540|gb|EFL82619.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47]
          Length = 687

 Score =  308 bits (788), Expect = 2e-81,   Method: Composition-based stats.
 Identities = 99/590 (16%), Positives = 179/590 (30%), Gaps = 89/590 (15%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
           T   + SF+ GE+SP  +  R D + +  G+    N +    GP+ + P  +  R+ +  
Sbjct: 5   TKVLQRSFAGGEISPE-MFGRTDDTKYQTGLETCLNFLCRPQGPIENRPGFEFVREVKDS 63

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
            +  R+  F        ++  G K  +                    TP+   D   LEY
Sbjct: 64  SKKVRLIPFIFNAQQTFVIELGHKYARFH--SFGATLMNGNQPYEITTPWDEDDLFELEY 121

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKF-----LPPPWLGDGMISGVKSNA 178
                     H+D+ P  +    + D   +    I F      P         +    + 
Sbjct: 122 VQSNDIITVTHEDYAPTEIRRYSNTD---WRLATISFSSTLATPTNVTAVRETTTGNEDK 178

Query: 179 KLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238
                      + + +D  I        S  + C    +A  T   I    V+    YR 
Sbjct: 179 NADKYTFQYKVSCLNADKTIESEP----SAAVSCTANLYATGTTIKISCSAVSGASYYRF 234

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298
                            Y     I                  G +        I D    
Sbjct: 235 -----------------YKNQGGI-----------------YGYLGDSETTSIIDDNIAP 260

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358
              I+       +                    YPS V +   R  F+G K D   V  +
Sbjct: 261 KTDITPRRYDSVVSSGN----------------YPSAVGYFEQRRWFAGFKTDPQRVVAT 304

Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418
             G   D +        D    +   +     + I  + P    +L+   + + +   + 
Sbjct: 305 RSGTESDMTYSLPSKDDDR---INFRIAATEFNKILHISPLSHLILLTTGSEIRISPQNS 361

Query: 419 S--KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQL 475
                 SI  R  S +G     P+   + L+F       ++ ++   +  GF   ++   
Sbjct: 362 DAITPSSISARPQSYNGATTVRPLVYNNNLIFASARDGHVRELAYQYQAGGFVSGDLCLR 421

Query: 476 ADHLFN-QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK 534
           + HLF+ + I     Q+ P+ I+W V     +S   LLG  +  E +   +WH H     
Sbjct: 422 SQHLFDFKTIKDATAQKAPYPIMWFV-----SSDGNLLGLTYIPE-QQVGSWHRHNTDG- 474

Query: 535 HYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL------NLLDDF 577
               S  +         +L+ ++  +  G ++ +  R+      NL D F
Sbjct: 475 -VFESCCAV--SEGVEDALYCVIRRTINGSQKRYVERMRTRNFKNLADAF 521


>gi|187476936|ref|YP_784960.1| phage protein [Bordetella avium 197N]
 gi|115421522|emb|CAJ48031.1| phage protein [Bordetella avium 197N]
          Length = 681

 Score =  307 bits (785), Expect = 5e-81,   Method: Composition-based stats.
 Identities = 97/577 (16%), Positives = 182/577 (31%), Gaps = 81/577 (14%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M N    + SF  GE+SP  +  R D   +  G+A  RN +    GP+ +       R+ 
Sbjct: 1   MSNVRVLQRSFGGGEISPE-MFGRIDDVKYQSGLAICRNFVVKPQGPVENRAGFSFVREV 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +   +  R+  F+       ++  G    +      +              PYT  D  S
Sbjct: 60  KDSTKKVRLIPFTYSVTQTMVIELGAGYFRFHTDGGTLL--NGDTPYEIANPYTEADLFS 117

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK- 179
           + Y         VH ++ P  L  I   D   +    I F+    +  G+ +   +    
Sbjct: 118 IHYVQSADVLTLVHPNYAPRELRRIGATD---WQLATIAFMSSVAMPTGVTATSNNKGTD 174

Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239
            +     T+                                                   
Sbjct: 175 YTYRYVVTALDAEGKTESAPS--------------------------------------- 195

Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299
           + G   +    + GA  +  +  +       +    +    G +        + D     
Sbjct: 196 SAGICANNLFTNGGANTIAWSAAS--GASRYNVYKEQGGLYGYIGQTTGTSLVDDNIAPD 253

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSS 359
            S++  P    +F A                 YP+ V++   R  F+G+     +++++ 
Sbjct: 254 LSVT-PPIYDAVFNAAGD--------------YPAAVSYFEQRRCFAGTINKPQNIWMTR 298

Query: 360 FGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS 419
            G     S        D        V    A+ I  + P  E +L+       + S++  
Sbjct: 299 SGTESAMSYSLPVRSDDRVA---FRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355

Query: 420 --KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLA 476
                +I  R  S  G     PV V +  ++    G  ++ ++ + +  GF   +++   
Sbjct: 356 AVTPTTISVRPQSYVGATDVQPVVVNNTAIYGAARGGHVRELAYNWQANGFVTGDLSLRC 415

Query: 477 DHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH 535
            HLF+   IL + Y + P  IVW +     +S  +LLG  +  E +   AWH H      
Sbjct: 416 AHLFDNLNILDMAYAKAPQPIVWFI-----SSSGKLLGLTYVPE-QQIGAWHQHDTEG-- 467

Query: 536 YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRL 571
              S A           L+++V     G+E  +  R+
Sbjct: 468 VFESCAVV--AEGNEDRLYVVVRRIIGGKEVRYIERM 502


>gi|119386474|ref|YP_917529.1| hypothetical protein Pden_3767 [Paracoccus denitrificans PD1222]
 gi|119377069|gb|ABL71833.1| conserved hypothetical protein [Paracoccus denitrificans PD1222]
          Length = 679

 Score =  298 bits (763), Expect = 2e-78,   Method: Composition-based stats.
 Identities = 96/573 (16%), Positives = 187/573 (32%), Gaps = 79/573 (13%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
               + +F++G L P L   R DL+ +   + K RN+    +G + + P ++   +    
Sbjct: 3   AARIQPTFASGVLGPAL-WGRIDLARYDSALRKGRNVFVHAHGGVSNRPGLRFVCEVMDS 61

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEY 123
              +R+  F       ++L+ G  ++  V   +  +     +  T  TP+T    ++L+ 
Sbjct: 62  AHRHRLLPFVREADDASILIMGQNEMGFVKNGARLQSGGVDY--TIATPWTATQAQALDA 119

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSIS 183
                     H+   P  ++   + D    T      +  P +                 
Sbjct: 120 VQSVDVIFAAHRQVAPRRIMRNGETDWSIATVPINPTVAAPTISSVTPRNSGDETYRYRV 179

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGR 243
            A        +   +     +  SI    +   ++  T  +       + +VYR      
Sbjct: 180 TAVVGGVESFASAPLATTAAELLSIEGAWNDIAFSAVTGAT-------EYRVYRMRNGVP 232

Query: 244 SGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303
                G++ G ++  DN                                          +
Sbjct: 233 GY--IGFTTGTSFRDDN-------------------------------------ISPDST 253

Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363
           V P  Q              S +     YPS V+ +  RL F  S     +V+LS  G +
Sbjct: 254 VTPPVQA-------------SLFDAAGKYPSVVSIYQQRLAFGASDAQPETVWLSRVGDY 300

Query: 364 YDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGL 422
            +F+        D  +     +     + I  M    E ++        +         L
Sbjct: 301 LNFTRSQNMTSSDRAEFD---MAGEQLNRIRAMLQLRELLVFTSAGEFSVSGPDGGFDAL 357

Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFN 481
           +    +    G     P+   D ++FV   GR ++ +  + E  G+  N++   A H   
Sbjct: 358 NPIVTQHGYIGSATVKPLVADDTVLFVDRSGRGVRDLRYAYESDGYSGNDLAIFASHFLQ 417

Query: 482 -QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
            +RI+     + P SI+WVVL+       +LL   +  E +  +AW    I     V S 
Sbjct: 418 GRRIVGWAMAKNPWSIIWVVLD-----NGKLLALTYKREHQ-VWAWTEMDIDGA--VESV 469

Query: 541 ASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572
           A  P       + +++V     G++R +  R +
Sbjct: 470 ACIP--EGASDATYLIVRRLIDGQQRRYVERFD 500


>gi|118590938|ref|ZP_01548338.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614]
 gi|118436460|gb|EAV43101.1| hypothetical protein SIAM614_19796 [Stappia aggregata IAM 12614]
          Length = 810

 Score =  298 bits (762), Expect = 2e-78,   Method: Composition-based stats.
 Identities = 100/645 (15%), Positives = 202/645 (31%), Gaps = 101/645 (15%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
            + +FS GEL P L+  R DL L    +A+ RN + L+ G L      +   + +   R 
Sbjct: 5   LQATFSRGELDPELIY-RSDLELFRSSLAECRNFLTLKRGGLRRRGGTKFIAELKDSSRQ 63

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
             +  F   +G Y +L FG    ++                   TPY+      L++   
Sbjct: 64  GWLIPFEFGNGQYYMLEFGHHIFRVFTSEGRVG------TVEVATPYSSGVLPRLKFVQS 117

Query: 127 GSTAVFVHKDHPPHHLLYIQ------------DGDKISFTFDEIKFLPPPWLGDGMISGV 174
             T         P  L  +             DG  +          P            
Sbjct: 118 TDTLFIAGGGVAPQALKRLSELSWAIEPMSFRDGPYLDVNISPTNLKPAATGNAVPKMTS 177

Query: 175 KSNAKLSISQADT------------STARITSDMKIFKPLDKGRSIRLGCH--------- 213
            +    ++S ++                 ++S    +       S+ +  +         
Sbjct: 178 NTAPSGTVSASNGSASAWQLFNRSEGKTVLSSGATGWVQYQFPGSVVIDAYMLQAPNDNS 237

Query: 214 -----PPEWAKNTNYSIGAYIVAD---------DKVYRSLT----TGRSGDRFGYSKGAT 255
                P +W    + +   + + D            +R       T  +  R  +++G  
Sbjct: 238 QNDDMPWQWNIEASNNGSDWTILDTQDGQDTWSSNEWREYDFHNETAFTHYRLSFTQGGG 297

Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYV-----------------------WGDI 292
              DN+     V + +          A     +                       W   
Sbjct: 298 SASDNSAIGQLVFHRAGNDQSPFTLTASGTGGINGGAGFQPSDVGRHIRFRGSDGFWRWF 357

Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352
           +  S+   +           Q   +   W + AW    G+P  + +H NRL F+G+  + 
Sbjct: 358 RIHSRQSATSVKVQLFGQALQDTKAQSIWRLGAWSGTTGWPETIGWHKNRLAFAGTSEEP 417

Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412
             ++ S    F +FS+       D   A+T  +     + I W+    + ++VG   ++ 
Sbjct: 418 QKIWESQTEDFTNFSVSHVLKASD---AVTAGILSGQVNRIQWLVDDND-LIVGTTRAVR 473

Query: 413 LLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGF 467
            +  +         ++D +  +  G     P+ VG  L++    G  ++ ++      G 
Sbjct: 474 AVGKATDQDPYGPENVDQKPETNFGANDVSPIKVGSVLIYYGPYGTDMREMAYDFGSDGR 533

Query: 468 RFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527
               ++++  HLF   I    YQ+ P S++W     + +     +G  +    +  +   
Sbjct: 534 VSQAVSEVQSHLFQSGIAGACYQQYPDSVIW-----QWDQKGSGIGFTYER-QQQVYGMQ 587

Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTVRL 571
            H       V   A       G  ++WM+V  +  G+ R +   +
Sbjct: 588 RHDFGG--VVECMADLSGA--GADTVWMIVKRTIDGQTRRYIEIM 628


>gi|167041089|gb|ABZ05850.1| hypothetical protein ALOHA_HF400048F7ctg1g17 [uncultured marine
           microorganism HF4000_48F7]
          Length = 999

 Score =  289 bits (739), Expect = 8e-76,   Method: Composition-based stats.
 Identities = 111/707 (15%), Positives = 216/707 (30%), Gaps = 162/707 (22%)

Query: 3   NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62
                + SF+ G++SPR +Q   +L  +   +A   N++ L  G L   P        + 
Sbjct: 2   RIQALQSSFADGQISPR-MQGMVELESYKSSLATLENMVVLPQGSLTRRPGTFFAATTK- 59

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT----------P 112
                R+  FS   G   +L FG+  ++        +        +  T           
Sbjct: 60  ANGQARLIPFSRGQGTSLVLEFGNLYIRFFANDGPVRTDDIAATYSQTTTTVTVTKSTHG 119

Query: 113 YTFKDNKSLEYA------------------------------------------------ 124
           Y+  D   L++                                                 
Sbjct: 120 YSASDEVYLDFTSGNGVDGFYTIATVADANTFTVTSTTSQTTSGNVNLSQRFEVTTTYTA 179

Query: 125 -VFGSTAVFVHKD-----HPPHHLLYIQDGDKISFTFDEIK--------FLPPPWLGDGM 170
                 A     D     HP H    ++     S+    +           P   L DG 
Sbjct: 180 SQVNDIAFTQSADVLFLVHPDHVPARLERNATNSWALTNLLPSLISGTYTRPTTVLTDGP 239

Query: 171 ISGVK-SNAKLSISQADTSTARITSDMKIFKPLDKGRS-----------IRLGCHP---- 214
              +  ++  L+++ A  S    +         + G               L  HP    
Sbjct: 240 FKAMNTTDTTLTVALAANSDFTTSFSNGSLSLEEVGTVSPSNVDVATNAFTLANHPLVNG 299

Query: 215 ---------------PEWAKNTNYSIGA-------YIVADDKVYRSLTTGRSGDRFGYSK 252
                          P  +  T+Y + +          +       +T   +      +K
Sbjct: 300 QTVQFSSIPSGFASTPTLSATTDYFVVSATQNTFKLATSAGGTPVDITAAPTSADLTVNK 359

Query: 253 G---------ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSIS 303
                      T      I   T    +        +  +AP    G  + V +   ++ 
Sbjct: 360 SFVDKDVYIKVTASATTGINDDTGFQTTDVGRYIRLNTEIAPQIKHGYGEIVERTSTTV- 418

Query: 304 VAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAF 363
           V  Q +T      +   W + ++    GYP  V  +  RL+F+G+  +  +++ S    F
Sbjct: 419 VLVQLKTAIAGVGATTEWQLGSFSGTTGYPRTVQLYQQRLVFAGTAEESQTIFFSKTADF 478

Query: 364 YDFSLDGEYGCYD----------------PTKALTTAVTDFSASTIHWMHPFGEGVLVGC 407
           ++FS     G                      A++  ++  +   I W+    + + +G 
Sbjct: 479 FNFSATEPLGQQTGQRDSSGRSIVGEQIFEDAAISLTISSDTVDQIEWISE-DQRLTIGT 537

Query: 408 DTSLWLLSISLSKGLSIDFR-RVSGSGVYACPPVS----VGDCLVFVCGVGRRIKYIS-G 461
              ++ L  S        F   ++    +AC P +    VG+ L++V   GR+++ ++  
Sbjct: 538 SGGIYQLYGSTDDLTLTPFNFSITKVSAWACDPTALPAKVGNNLLYVQNNGRKLRELAFD 597

Query: 462 STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGE 521
             +  +   ++T  ++ +    ++   YQ++P+S++W +         RL G  +  +  
Sbjct: 598 KVQDQYSAADLTLRSEDISESGLIATAYQDQPYSVLWCLRNDG-----RLAGLTYV-DLL 651

Query: 522 GDFAWHTHMISDKHY---------VLSAASFPNDNRGGTSLWMLVAL 559
              AWH H I   HY         V S AS P        L+M+V  
Sbjct: 652 QMRAWHRHTIGGAHYDDTHGSQAKVESIASIP--RGTHDQLYMIVKR 696


>gi|195541813|gb|ACF98016.1| hypothetical protein [uncultured bacterium 878]
          Length = 926

 Score =  288 bits (736), Expect = 2e-75,   Method: Composition-based stats.
 Identities = 90/518 (17%), Positives = 171/518 (33%), Gaps = 36/518 (6%)

Query: 72  FSIPDGGYALLVFGDKK-LQIVVVRSSTKWSPALFGKTYKTP--YTFKDNKSLEYAVFGS 128
           F + +   +     D+    I     S   +     + Y  P  Y   D   +++A    
Sbjct: 144 FRVANRTASTFELNDQHGAPINGNGYSAFAAGGTAARVYTLPTTYQDADLAQMKFAQSAD 203

Query: 129 TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTS 188
                H ++ P  L         ++   +I F   P+L       V +    S +     
Sbjct: 204 ILYIAHTEYVPRKLQRYGP---TNWVLSQIDFQDGPYLPVNGAQTVLTP---SAASGAGI 257

Query: 189 TARITSDMKIFKPLDKGRS-IRLGCHPPEWAKNTNYSIGAYI-VADDKVYRSLTTGRSGD 246
           T    + + I    + G   +R+      W       I   +   +     + T  R   
Sbjct: 258 TISSATSVAITGAANNGAGAVRITSANHGWKTGDKIDITGIVGTTEANA--TWTVTRVNA 315

Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306
                 G+T+            ++   T        +     WG  K ++    ++SV  
Sbjct: 316 NTYDLNGSTFANAYASGGTAKPHIFESTDL-GRLIRIQHASTWGYAK-ITAYTSAVSVTA 373

Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366
              + F    +  +W +  + +  GYPS VTF+  RL + G       V  S    +  F
Sbjct: 374 DVLSNFGGTAASSAWRLGLYSQGGGYPSCVTFYEGRLFWGGCPLAPTRVDGSMSSNYETF 433

Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL----SKGL 422
           S            A+   +     + + WM    +G+LVG     W++  +         
Sbjct: 434 SPSSTASVVADDNAVAYPLDSGDVNNVLWMKDDEKGLLVGTKGGEWVVRANTLNGALTPT 493

Query: 423 SIDFRRVSGSGVY-ACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLF 480
           ++   R +  G Y    PV  G  ++FV    R+++ ++ + E  GF   ++T L+ H+ 
Sbjct: 494 NVKATRATTYGSYEGSQPVRTGKDIIFVQRKRRKVRNLNYTYEIDGFNAGDLTILSGHIG 553

Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKH----- 535
                QL +Q EP   VW+          +L    +  + +    W   ++         
Sbjct: 554 RLEFGQLAFQSEPEGWVWMTR-----GDGQLPVLTYDRDEQKI-GWSRQIMGGYQDAARR 607

Query: 536 ---YVLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTV 569
               V S  S P+ N     +W++V     G+   +  
Sbjct: 608 RPPIVRSVCSIPDPNDARDEVWLIVQRMIDGKTERYVE 645



 Score = 95.7 bits (236), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 20/96 (20%), Positives = 35/96 (36%), Gaps = 1/96 (1%)

Query: 1  MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
          MV  +   ++F AGE +P + + R DLS +        N +P   GP    P        
Sbjct: 1  MVRASPNFNAFDAGEFAP-ITEGRTDLSRYGFACRILENFMPRVVGPAARRPGTSFIAST 59

Query: 61 RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRS 96
          R   +   +  F        ++ FG+  ++      
Sbjct: 60 RYPEKDALLVRFEYSTEQAYVMEFGNLYVRFYRNDG 95


>gi|323699364|ref|ZP_08111276.1| hypothetical protein DND132_1955 [Desulfovibrio sp. ND132]
 gi|323459296|gb|EGB15161.1| hypothetical protein DND132_1955 [Desulfovibrio desulfuricans
           ND132]
          Length = 698

 Score =  284 bits (726), Expect = 3e-74,   Method: Composition-based stats.
 Identities = 109/583 (18%), Positives = 189/583 (32%), Gaps = 76/583 (13%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   T    +F+AGE+SPRL + R DLS +  G     N     +G        +   + 
Sbjct: 1   MSIATPAITNFTAGEISPRL-EGRTDLSKYFNGCRTLLNFHVHPHGGTSRRAGFRFVAES 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVF-----GDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115
               +   +  F    G   +L F     G  ++++           A + +    PYT 
Sbjct: 60  LGQAKPVLLIPFEYSAGQTYVLEFAEDAAGQGRMRVFSGHGLVLSDGAPYVRDI--PYTA 117

Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL-GDGMISGV 174
            +   L+YA    + + VH DHP   ++ +   D   +T +E+ FL  P   G+      
Sbjct: 118 DEFDELDYAQSAGSLILVHPDHPVREMVRVDHDD---WTLEEMTFLGQPEAWGENDYPSA 174

Query: 175 KSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDK 234
               +  +  A T +   T  +         R          W            +AD  
Sbjct: 175 VCFYEQRLVLAATRSRPATLWLSRTGEFSDFRLRTREVPLDGW--------RDLEIADAN 226

Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD 294
               L  G++GD      G  +                 ++R         Y        
Sbjct: 227 G-DGLRDGKAGDNVLLLAGNGFE----ARDALKGQHPDGSTRYYRYKGTGNYATVNSNVT 281

Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354
           ++      +   ++       +   +W                       F         
Sbjct: 282 LTFAAEPGANQLEAIWDEDGVLDDAAWD---------------------CFG-------- 312

Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414
                 G   D     E    D   A+   ++   A+ I ++ P    + +G     W L
Sbjct: 313 -----VGDRTDGPAGAEPLEDD---AIEVTLSGRQANAIEFIVPR-RALWIGTAGGEWTL 363

Query: 415 SISLSKGL---SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFN 470
           S S S  L   ++   +    G     P +VG   ++V   GR+I+ +S   E   +   
Sbjct: 364 SASSSDPLTPSNVKAAQEGTGGASGVRPEAVGFAALYVQRAGRKIREMSYRYESDAYVSK 423

Query: 471 EITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530
           ++T L++H+    + QL Y +EP SI++ V          L+   +  + +   AW   +
Sbjct: 424 DLTLLSEHITEGGLTQLAYVQEPDSILYGVR-----GDGILVALTYVPD-QEVAAWSRIV 477

Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
                 V  AAS  ND      LW+ V  +  GE R +   L 
Sbjct: 478 TDG--VVERAASVYNDAEKRDELWITVLRTVNGETRRYVEYLE 518


>gi|317152064|ref|YP_004120112.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2]
 gi|316942315|gb|ADU61366.1| hypothetical protein Daes_0341 [Desulfovibrio aespoeensis Aspo-2]
          Length = 698

 Score =  284 bits (725), Expect = 4e-74,   Method: Composition-based stats.
 Identities = 104/581 (17%), Positives = 185/581 (31%), Gaps = 72/581 (12%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M  TT +  +F+AGE+SPRL   R DLS +  G     N     +G        +     
Sbjct: 1   MSITTPSLTNFTAGEISPRL-AGRIDLSRYFNGCRTLENFHVHPHGGATRRCGFRFVTQA 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGD---KKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117
               R+  +  F        +L FG+    + ++ V                  PY    
Sbjct: 60  LNPDRAGLLVPFESNADTAYVLEFGEDAAGQGRMRVFSGHGVVMAGDAPYALDVPYRADQ 119

Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL-GDGMISGVKS 176
             +L YA  G   +  H  HP   L  +       +  ++++F+  P    +G    V +
Sbjct: 120 LDTLRYAQSGDELILAHPAHPVRRLTRLAHD---QWQLEDMEFIGCPETWTEGNHPSVVA 176

Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236
             +  +  A T     T        +   R          W            + D    
Sbjct: 177 FFEQRLVLAATPDKPGTLWFSRTGGIGDFRLRTREVPLDGW--------RDREITDSNS- 227

Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVS 296
             L  G++GD F    G  + K + +          +T+R       A     G  K V+
Sbjct: 228 DGLRDGKAGDTFLLLDGDGFEKLDGL----KGQHPDRTTRYYRYKGAANLTASGADKTVT 283

Query: 297 KDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVY 356
                    P+   +     +        W   E  P                       
Sbjct: 284 -----FRHEPEGAQIEPIRDAEGELNNGFWECFE--PG---------------------- 314

Query: 357 LSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI 416
                   D +            A+   ++   A+ I ++   G+ + VG     W L  
Sbjct: 315 --------DRTEAPAGEAPLDDDAIEVTLSGRQANAIEFLVARGK-LWVGTAGGEWTLGG 365

Query: 417 SLSKGLS---IDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEI 472
           SL   ++   I   +    G  A  P +VG   +++   GR+I+ ++   E   +   ++
Sbjct: 366 SLGDPVTPESIKASQEGSCGASATRPEAVGFATLYIQRAGRKIREMAYRYESDAYVSRDL 425

Query: 473 TQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532
           T L++H+    + Q+ Y +EP SI++ V          L+   +  + +   AW   +  
Sbjct: 426 TILSEHITKPGLTQMAYVQEPDSILYCVR-----GDGALIALTYEPD-QEVAAWSRMLTD 479

Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALSA-GEERSFTVRLN 572
               V   A+  N       LW ++  +  G ER +   L 
Sbjct: 480 GA--VECVAAVYNQAGKRDVLWAVIRRTVNGLERRYVEFLE 518


>gi|146276492|ref|YP_001166651.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC
           17025]
 gi|145554733|gb|ABP69346.1| hypothetical protein Rsph17025_0440 [Rhodobacter sphaeroides ATCC
           17025]
          Length = 754

 Score =  280 bits (715), Expect = 5e-73,   Method: Composition-based stats.
 Identities = 105/598 (17%), Positives = 182/598 (30%), Gaps = 66/598 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M  T+  + +FS+GEL P LL  R D      G+AK +  +PL  G +   P        
Sbjct: 1   MTRTSPPQVAFSSGELDP-LLHRRFDYQRFQTGLAKCQGFLPLAQGGVTRAPGTIYRGRT 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           R       +  FS       +L F   ++++   R               TP+      S
Sbjct: 60  R-GDARCVLVPFSFAANDSCILEFTPGRMRVW--RYGALVMSGGAPYELVTPFDETSLSS 116

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L +         V    P   L  +      ++T         P+        +   A  
Sbjct: 117 LSWVQSADVVYMVDGRQPMQRLARLALD---NWTIGAQALRKGPFRVQNTDEAITLTA-- 171

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHP----PEWAKNTNY------------- 223
               A   T  +T+    F     G  ++L        P W  +  Y             
Sbjct: 172 ---SAAKGTITLTASAAFFTADHVGSLMQLRPKDNTSVPAWTADEEYGSETWGGPLVGFE 228

Query: 224 ---SIGAYIVADDKVYRSLTTGRSG-DRFGYSKGATYVKDNNITWITVLNLSSKTSRESA 279
                          Y  +   ++G     +++G   V  +   W  + +          
Sbjct: 229 TEPPADVLRRYGANTYLLVQGTKAGSTPPIHTEGDYMVDSDPTVWRFISDD--------- 279

Query: 280 SGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFH 339
              V    +   +             P        GV    W   AW ++ GYPS V  +
Sbjct: 280 ---VGIVRITQILSPTQARAAVTRTIPTGCI----GVPTYRWSEGAWSKRYGYPSTVEIY 332

Query: 340 NNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPF 399
             RL  + +  +  +V+ S+ G F DF      G  D      T     S + I  +   
Sbjct: 333 EQRLAAAATPSEPRTVWFSAVGDFQDF----LDGTEDDQSFAYTVAGSTSVNRIINLQRG 388

Query: 400 GEGVLVGCDTSLWL----LSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455
             G+ +      +        S+    +  F   SG G     P++     +F+    +R
Sbjct: 389 AAGLHIFALGEEYSTRSETRSSVIGPKNAVFGLDSGVGSSTAKPITPSGNPIFISRDRKR 448

Query: 456 IKYISGSTEQGF-RFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGC 514
           +  +  S +Q       +++ A H+      Q+V+Q  P    W+ L         L+  
Sbjct: 449 VLEMVYSLDQDRPVSRVLSRTAQHVGGAGFEQIVWQAAPEPTAWLRL-----GTGELVAM 503

Query: 515 RFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLV-ALSAGEERSFTVRL 571
            +  + E    W    ++   +V + A +P    G   L M V     G+       L
Sbjct: 504 VYDPDEE-VLGWAPVPVAGG-FVDALAVYPAAGGGSDILTMAVLREIDGQTVRMIEEL 559


>gi|242278913|ref|YP_002991042.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638]
 gi|242121807|gb|ACS79503.1| hypothetical protein Desal_1441 [Desulfovibrio salexigens DSM 2638]
          Length = 698

 Score =  265 bits (677), Expect = 1e-68,   Method: Composition-based stats.
 Identities = 73/285 (25%), Positives = 123/285 (43%), Gaps = 28/285 (9%)

Query: 303 SVAPQSQTLFQAGVSVVSWFMSA---------WGEQEGYPSHVTFHNNRLLFSGSKGDEL 353
            V P+ Q    +  S V W M           W  ++G+PS VTF   RL F+ S  +  
Sbjct: 245 LVHPEVQPYKLSRTSHVDWKMELVAFSSPPQEWNSEKGFPSCVTFFEERLCFAASPSNPQ 304

Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413
           ++++S  G++ DF++       D   A T  ++    + I WM    + +++G     W 
Sbjct: 305 TIWMSKAGSYEDFAVSSPVVDDD---ACTYTLSADQVNAIRWMVS-AKKLIMGTSGGEWW 360

Query: 414 LSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFR 468
           LS   S       S+  RR +  G  A PPV VG  ++F+   GR I+ +S S E  G+ 
Sbjct: 361 LSGGSSLDSVTPNSVMVRRETTHGSAAIPPVVVGGVMLFLQREGRTIRELSYSFEADGYT 420

Query: 469 FNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWH 527
             ++T LA+HL     I +  YQ+ P S++W+  +        ++G  +  E E    +H
Sbjct: 421 APDLTILAEHLTRSNSITEWAYQQSPDSVIWMTRDDG-----VMVGLTYQREHE-VVGFH 474

Query: 528 THMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLN 572
            H    K    S  + P   +    + ++     G  R +  R+ 
Sbjct: 475 RHTTDGKF--RSVCTVPGPTQEEVWV-VVEREVGGISRKYVERME 516



 Score =  100 bits (248), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 20/120 (16%), Positives = 36/120 (30%), Gaps = 4/120 (3%)

Query: 53  LMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTP 112
              E    R    + R+  F        +L F D+ ++I                  ++P
Sbjct: 167 GAVEAVQVREINPATRLIPFEFSTEQAYVLEFTDRNIRIF-KNGGIVVDDQGSPVEIQSP 225

Query: 113 YTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMIS 172
           YT  D   + +         VH +  P+ L      D   +  + + F  PP   +    
Sbjct: 226 YTETDLPGIRFTQSADVMYLVHPEVQPYKLSRTSHVD---WKMELVAFSSPPQEWNSEKG 282



 Score = 74.1 bits (180), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 4  TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           +    +FSAGELSPRL   R DL+ ++ G+A+  N+    +G        +  R+ 
Sbjct: 3  VSLIMTNFSAGELSPRL-GGRVDLAKYSNGLAELENMFTHPHGGASRRTGFRFIREV 58


>gi|288959382|ref|YP_003449723.1| hypothetical protein AZL_025410 [Azospirillum sp. B510]
 gi|288911690|dbj|BAI73179.1| hypothetical protein AZL_025410 [Azospirillum sp. B510]
          Length = 665

 Score =  263 bits (672), Expect = 5e-68,   Method: Composition-based stats.
 Identities = 102/581 (17%), Positives = 185/581 (31%), Gaps = 109/581 (18%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   T  +++F+ GE+SPR+ + R DL      V +  N++ +  GP    P  +     
Sbjct: 1   MSRATPAQYAFTGGEISPRI-KGRTDLERIRNAVEEMTNMVAVPEGPSERRPGTRFANST 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +    S  +  F        ++       +            + +  T+   Y+  D   
Sbjct: 60  K-GDASAVLIPFEFSTQQAYIIEATAGAFRFYRDGGQIVSGSSPYEVTHA--YSAADLPF 116

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           L +         V   HPP  L         ++   E      P+L       + S    
Sbjct: 117 LRWTQSADVLFLVCPGHPPRTLSRTGH---TAWNLAEWVMRDGPYL------DLNSGPTT 167

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLT 240
                 + +  +T+   +F   D GR +RL      W        G   +       S+T
Sbjct: 168 LTPSGTSGSVTLTASAALFAATDVGRLVRLRI-ANVW--------GWCRITAFGSVTSVT 218

Query: 241 TGRSGDRFGYSKGATYV----KDNNITWITVLNLSSKTSRESASGAVAP--YYVWGDIKD 294
                   G +  A +          TW T +         +A   V       + +   
Sbjct: 219 ATVEAAWGGTTATAFWRLGAWGATTGTWPTAVTFHENRLAFAALQTVWLSCSGDFDNFGP 278

Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELS 354
            +++G   +    + T     V+V+ W  SA+G               +L +G+ G   +
Sbjct: 279 TTENGTVAADNAITLTAADDQVNVIRWLRSAFG---------------VLIAGTSGGPFA 323

Query: 355 VYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL 414
           +  S                    +ALT    + +   +H          V     +  +
Sbjct: 324 IQAS-----------------SLREALTPI--NATMPRVH----------VAGAADVQPV 354

Query: 415 SISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEIT 473
            ++ +                          LVF     RR+  ++      G+   ++ 
Sbjct: 355 RVATN--------------------------LVFPSRSRRRLHLLNAEFAAAGYSAPDLA 388

Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533
            +A H+    +  + YQ+EP S++W+VL+        L G  +  E +   AWH H +  
Sbjct: 389 LVASHITRHAVKAMAYQQEPWSVMWLVLDDG-----TLAGVTYVPELD-ILAWHRHPLGG 442

Query: 534 KHY-VLSAASFPNDNRGGTSLWMLVAL-SAGEERSFTVRLN 572
               VLS A  P  +R    LW++V    AG  R     L 
Sbjct: 443 TAVKVLSVACIPAADR--DELWLVVERVVAGGIRRHVEILE 481


>gi|187736306|ref|YP_001878418.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187426358|gb|ACD05637.1| hypothetical protein Amuc_1819 [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 822

 Score =  244 bits (622), Expect = 3e-62,   Method: Composition-based stats.
 Identities = 69/258 (26%), Positives = 107/258 (41%), Gaps = 18/258 (6%)

Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376
               W   A+G + GYP  V FH  RL F G+ G   +++ S    F  F+         
Sbjct: 409 DTNDWSFGAFGVRNGYPCTVEFHQGRLWFGGTPGQPQTLWASRVDDFSAFTPGIP----- 463

Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID---FRRVSGSG 433
               +   +     + I W+     G+++G     W LS + S+GL+     F R SG G
Sbjct: 464 ADSPMILTMAASQQNRISWIASL-RGLMIGTSEGEWRLSATNSEGLNASNAGFERHSGVG 522

Query: 434 VYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEE 492
             +   +SV + L+FV   G +++ +  S E  G++  +++ L+DHL  + I+    Q  
Sbjct: 523 SASLDALSVENSLLFVQQGGMKVRELFYSLEADGYQTRDVSLLSDHLLGEGIVDWTVQRS 582

Query: 493 PHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND-NRGGT 551
               VW VL            C      +   AWH H + +   +LS AS     N    
Sbjct: 583 TAFHVWCVLGDGS------AVCMTLNREQNVVAWHAHRL-EHGRILSVASLRGSRNTPDE 635

Query: 552 SLWMLVALSAGEERSFTV 569
            +W  VA   GEE   TV
Sbjct: 636 EVWFAVARGEGEEACITV 653



 Score =  139 bits (349), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 60/381 (15%), Positives = 112/381 (29%), Gaps = 40/381 (10%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      + SF+AGEL+P L   R DL   ++G ++  N +   +G L   P  +     
Sbjct: 1   MAKQVLQRLSFTAGELTPWL-AGRADLDPVSRGASRLINFLVSPFGGLRRRPGTRLVARA 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPY-TFKDNK 119
                  R+ SF    G   +L  G   ++      +             TP+ T +   
Sbjct: 60  GCREGMVRLVSFKYSTGVQFMLEVGRGYVRYF-KNGALLTDTEGGVLETLTPWKTDEQVS 118

Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179
           +L           V    PP  L    D D   +  + ++F   P+    +++ V+   +
Sbjct: 119 NLRMQQLNDVIYCVEPSTPPMTLARYADDD---WRLEALEFSGIPYES-SLLNAVRLECR 174

Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239
           + + +   +    T+D  +F P  +G+                       VA+       
Sbjct: 175 M-VREGGVNRLLATADDDVFTPEMEGKEFL-----------RITRKYGETVAEGNQMPFY 222

Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDG 299
                       +  +  +++                +   G   P       +  +   
Sbjct: 223 HLTTLSRDLYKGETFSMNREDGWRQAYTCIRDFSRESDYQEGVDRPERYTAFFEKGADAS 282

Query: 300 RSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS- 358
             I V            +  +W  + W    GYP    +  NR            V+ S 
Sbjct: 283 TRIYVNG-----AWTLETTGTWD-AEWEICRGYPDGSNYLPNR---------PELVWHSV 327

Query: 359 -----SFGAFYDFSLDGEYGC 374
                  G   +F+L G    
Sbjct: 328 KSFQQREGFRNNFTLSGNEEE 348


>gi|290968641|ref|ZP_06560179.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
 gi|290781294|gb|EFD93884.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
          Length = 1039

 Score =  241 bits (614), Expect = 3e-61,   Method: Composition-based stats.
 Identities = 80/465 (17%), Positives = 158/465 (33%), Gaps = 55/465 (11%)

Query: 153 FT-FDEIKFLPPP------WLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205
           ++ FD +     P      +              +  +    + ++ITS   IF    K 
Sbjct: 368 WSDFDNVALFGNPTGACFIYFLAAEKEETPHPDSIEDTSLQITDSKITSSNSIFVQALKN 427

Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265
             I++       +                VY       S      S+       N   W 
Sbjct: 428 TKIKIVQTQQSKSVEMTLGENEEEQTSGAVYVGEKWKISTSGIHNSRIVIERSLNGQQWH 487

Query: 266 TVLNLSSKTSRES-ASGAVAP-------YYVWGDIKDVSKDGRSISVAPQSQT------- 310
                 SK  +    SG+              G I    KD  ++SV   +         
Sbjct: 488 EYRKYISKDDQNFMESGSEKEKCYLRVKAKTQGKINTERKDSDNLSVVLSALPFENEGII 547

Query: 311 -----------------LFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDEL 353
                                 V V ++  S+W ++ GYP    F  +RL+F+G+K +  
Sbjct: 548 EITDIVSPKEIKYTAIEPVIPNVPVDAFAFSSWNDRNGYPKLSCFFQDRLVFAGTKKEPY 607

Query: 354 SVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413
           S++ S  G + +FS++   G      A+   +   +   I  + P  + ++V    + W+
Sbjct: 608 SLWFSRTGDYNNFSVEKAEGTVTEDSAIKLDLIVRNLYEIRHLVPSND-LIVLTSGNEWI 666

Query: 414 LSISLS-KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNE 471
           +S   +        +  +  G   C P  +G+ L++V   G  I+    S +   +  +E
Sbjct: 667 ISGDTAITPTKCTPKVQTMRGASNCKPWHIGNRLIYVQRDGGTIRDFGYSYDSDNYNGDE 726

Query: 472 ITQLADHLF-NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHM 530
           +   A HL    +++   Y + P+S ++ V E  +      + C    + +   AW TH 
Sbjct: 727 LNLFASHLTKRHQMVSSAYCQNPYSTLYFVREDGE------IICLMLIKEQNVCAW-THW 779

Query: 531 ISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEER--SFTVRLNL 573
            +   Y+   +       G   L+++V  +  E +   +  + +L
Sbjct: 780 NTHGKYLDCCSVL---ENGKDYLYVIVERTNREAQIVRYLEKFDL 821



 Score =  140 bits (352), Expect = 6e-31,   Method: Composition-based stats.
 Identities = 49/323 (15%), Positives = 92/323 (28%), Gaps = 22/323 (6%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M N   T++SF+ GE+SP +   R DL  +   + ++ N +   YG +      +     
Sbjct: 1   MQNVFITQNSFTTGEISPEV-AERTDLEKYKSALLQAENAVVSPYGSVSRRTGSKYIGAI 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +   +   +  F        LL  G K +++        W      +   TP+ +   K 
Sbjct: 60  KYADKEAVLVPFMDSSDRSYLLEVGYKYIRV--------WKDETMEQEIDTPFEYP--KE 109

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLG----DGMISGVKS 176
           L +   G TA      +P + LL+        +   +     P +         +S V  
Sbjct: 110 LNFTQSGDTAFICSGRYPVYELLH-----GRYWELRKFDIPKPYFDDIISAIENVSDVNY 164

Query: 177 NAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236
               +   + T     T    +            G            +     +   +  
Sbjct: 165 TESDTPVFSQTKAGDYTFTPTVSGLYKIVLFGGAGGKKGTIEHYAGSTKHDEAIYHYEYG 224

Query: 237 RSLTTGRSGDR-FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAP-YYVWGDIKD 294
            +   G+            TY                  +R    G V   +   G  +D
Sbjct: 225 VAGNEGQKKIVTVKLKAKTTYSIHVGKGGEDGDKHKKGIARGWEEGDVYNSFLNGGPGED 284

Query: 295 VSKDGRSISVAPQSQTLFQAGVS 317
            +  G S  V   ++       S
Sbjct: 285 TTVKGNSDGVNIVAKGGATFTGS 307


>gi|260549511|ref|ZP_05823729.1| Bbp13 [Acinetobacter sp. RUH2624]
 gi|260407304|gb|EEX00779.1| Bbp13 [Acinetobacter sp. RUH2624]
          Length = 678

 Score =  234 bits (597), Expect = 3e-59,   Method: Composition-based stats.
 Identities = 87/548 (15%), Positives = 168/548 (30%), Gaps = 72/548 (13%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
            ++SF+ G +SP  +  R D + +  GVAK +N+    +G LV     +           
Sbjct: 1   MQYSFNGGVISPD-MFGRIDQAKYQTGVAKCKNMYVELFGGLVYRAGFRYVHHYPKTLGK 59

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
            R+  F   +    +L F    +     R     +        + PY  +    L YA  
Sbjct: 60  MRLIRFVFSEEQAVVLAFRAGAVNFFA-RGGMLLNNVGEPLEVELPYAEEHLMQLRYAQS 118

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186
                  H D+PP  ++     +                          S   +S+    
Sbjct: 119 ADVVTITHPDYPPRKIIRKGATEW-------------------------STEVVSVGYGL 153

Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAY-IVADDKVYRSLTTGRSG 245
           T    + +   I     +G       H     ++ +Y + A     +      +T     
Sbjct: 154 TPPQNVAATAHIEDKYKEG----GNMHDSYIERDYSYQVTAVDEQNESAASTKVTVKNDI 209

Query: 246 DRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA 305
              G     T+      T   +  L S  +       +          D  +   SI+  
Sbjct: 210 TLAGNYNTITWDVVTGATRYNIFKLRSGLAS-----YIGETTETSFTDDNIETNGSIT-P 263

Query: 306 PQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYD 365
           P  +  F+                   P+ V +H  R ++ G       + +S      +
Sbjct: 264 PLIRNPFEFN-----------------PTAVAYHGQRKVYGGGYQSPQWIRMSRTATDDN 306

Query: 366 FSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSI 424
           F         D   ++         + +  +    + +LV    ++W LS   +    S+
Sbjct: 307 FGYHIPTQDTD---SIQIRFAARDGNGVKHLITLND-LLVLTSGAMWKLSSDGAMTAASV 362

Query: 425 DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY---ISGSTEQGFRFNEITQLADHLFN 481
           +  +   +G     PV V    VF       +      SG     ++  +++ +   LF+
Sbjct: 363 NMNKQYSTGANDVTPVEVDGAAVFASDQTGHVHEASLASGYNASYYQTLDLSIMCPQLFD 422

Query: 482 -QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
             +I+       P +I++ V +        LL   +    +  +AW  H    K   LS 
Sbjct: 423 GHKIIDCAAIRNPLNIIYFVRDDG-----VLLSLTYEP-QQQVWAWAEHHTDGKF--LSV 474

Query: 541 ASFPNDNR 548
           A  P +N+
Sbjct: 475 AEIPEENQ 482


>gi|260557972|ref|ZP_05830184.1| Bbp13 [Acinetobacter baumannii ATCC 19606]
 gi|260408482|gb|EEX01788.1| Bbp13 [Acinetobacter baumannii ATCC 19606]
          Length = 678

 Score =  231 bits (589), Expect = 2e-58,   Method: Composition-based stats.
 Identities = 80/560 (14%), Positives = 162/560 (28%), Gaps = 72/560 (12%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
            ++SF+ G +SP  +  R D + +  GVAK +NL    +G +V     +           
Sbjct: 1   MQYSFNGGVISPD-MFGRIDQAKYQTGVAKCKNLYVELFGGVVYRAGFRYVHHYPKTMGK 59

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
            R+  F   +    +L      +           +          PY       L YA  
Sbjct: 60  MRLIRFVFSEEQAVVLAIRAGAINFFA-DGGMLLNENNEPLEVAVPYAEDHLMQLRYAQS 118

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186
                  H ++PP  ++     + I+     + +      G G    V + A +      
Sbjct: 119 ADVVTITHPNYPPRKIIRKSATEWIT-ELVTVGY------GVGTPQNVAATAHIEDKYKP 171

Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD 246
                  S    +   D    +       E A +    +   +                 
Sbjct: 172 GG-----SMHDSYIERDYSYQVTAVDEQNESAASLKVVVQNDLTLAGNY----------- 215

Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306
                   T+          +  L S  +          +       + S     I    
Sbjct: 216 -----NTITWDAVTGANRYNIFKLRSGLASFIGETTETSFTDDNIETNGSITPPLIR--- 267

Query: 307 QSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366
                                  E YP+ V +H  R ++ G       + +S      +F
Sbjct: 268 --------------------NPFEFYPTAVAYHGQRKVYGGGYKSPQWIRMSRTATDDNF 307

Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSID 425
                    D   ++         + +  +    + +++    +LW +S   +    S++
Sbjct: 308 GYHIPTQDTD---SIQIRFAARDGNGVKHLVTMSDLLIL-TSGALWKMSADGAVTAASVN 363

Query: 426 FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYI---SGSTEQGFRFNEITQLADHLFNQ 482
             +   +G     PV V    +F       +  I   SG     ++  +++ +   LF+ 
Sbjct: 364 MNKQYSTGANDVTPVEVDGATIFSSDQTGHVHEISLASGYNASFYQTIDLSIMCPQLFDG 423

Query: 483 R-ILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
           + I+       P +I++ V          LL   +  + +  +AW  H  + K   LS A
Sbjct: 424 QKIIDCALLRNPLNIIYFVR-----GDGVLLSLTYEPK-QQVWAWAEHHTNGKF--LSIA 475

Query: 542 SFPNDNRGGTSLWMLVALSA 561
             P D++    L+  +    
Sbjct: 476 EIPEDDQSV--LYAFIERDG 493


>gi|254251749|ref|ZP_04945067.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158]
 gi|124894358|gb|EAY68238.1| hypothetical protein BDAG_00946 [Burkholderia dolosa AUO158]
          Length = 545

 Score =  221 bits (562), Expect = 3e-55,   Method: Composition-based stats.
 Identities = 56/265 (21%), Positives = 100/265 (37%), Gaps = 23/265 (8%)

Query: 315 GVSVVSWFMS--AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
                 W +    W   +GYP  V+ +  RL  +GS G    V+ S+ G +YDF+     
Sbjct: 129 TAPPDGWMLKTFMWNPTDGYPCAVSLYQQRLYAAGSSGYPERVWASATGLYYDFTPGT-- 186

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL---SISLSKGLSIDFRRV 429
              D     +  V     + I  +      + V      + +   S+      +I+ R  
Sbjct: 187 ---DDGDGFSYDVASDQVNQIMHLAS-SRILTVLTQGEEFTIDGGSVGSITPTNINVRSQ 242

Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLFNQRILQLV 488
           S  G     PV VG+ L+F     ++I+ ++       FR   +T+LA H+    ++ + 
Sbjct: 243 SIYGTARPRPVRVGNELIFPQRAAKKIRSMAYDFNTDSFRSQNLTRLAAHITESGVVDIA 302

Query: 489 YQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNR 548
           +Q EP  +VW+V      +   L+   +  + E    +  H         S    P  + 
Sbjct: 303 FQAEPTPVVWMVR-----ADGVLISMTYDRD-ENVCGFARHTTDGAF--KSVCCIPGAD- 353

Query: 549 GGTSLWMLVALS-AGEERSFTVRLN 572
            G  L+ +V  +  G       RL+
Sbjct: 354 -GDVLFAVVQRTINGNVVQNVERLD 377


>gi|257139843|ref|ZP_05588105.1| hypothetical protein BthaA_11681 [Burkholderia thailandensis E264]
          Length = 489

 Score =  219 bits (557), Expect = 1e-54,   Method: Composition-based stats.
 Identities = 56/320 (17%), Positives = 112/320 (35%), Gaps = 23/320 (7%)

Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319
                        +  R    G+              +    I    + +          
Sbjct: 18  GGTLGAVYEYGVGQAWRAQDVGSYVEINGGLVQLIAFESASRIFGVIKRELASTLTAPAS 77

Query: 320 SWFM--SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377
            W +  S W   +GYP+ V+    RL  +GS G  + V+ S  G + DF+   + G    
Sbjct: 78  GWALKSSMWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDG---- 133

Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK---GLSIDFRRVSGSGV 434
            +A    +     +    +    + +        + ++   +      +I+    S  G 
Sbjct: 134 -EAFGYDMASDQVNQTVHLAS-AKILAALTQGEEFTVTGGSAGAITPTNINVDSQSVYGC 191

Query: 435 YACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLFNQRILQLVYQEEP 493
               PV VG+ +V+V   G++++ ++       +R   +T+LA H+    I+ + +Q EP
Sbjct: 192 ARARPVRVGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEP 251

Query: 494 HSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSL 553
             +VW+V      +   L+   +  + E    +  H+        S    P D   G  L
Sbjct: 252 TPVVWMVR-----ADGVLVSMTYDRD-ENVCGFARHVTDGLF--KSVCCIPGDE--GDVL 301

Query: 554 WMLVALS-AGEERSFTVRLN 572
           + +V  +  G    +  RL+
Sbjct: 302 FAVVQRTINGATVQYVERLD 321


>gi|83720451|ref|YP_441475.1| hypothetical protein BTH_I0919 [Burkholderia thailandensis E264]
 gi|83654276|gb|ABC38339.1| conserved hypothetical protein [Burkholderia thailandensis E264]
          Length = 405

 Score =  217 bits (551), Expect = 6e-54,   Method: Composition-based stats.
 Identities = 51/253 (20%), Positives = 101/253 (39%), Gaps = 21/253 (8%)

Query: 325 AWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTA 384
            W   +GYP+ V+    RL  +GS G  + V+ S  G + DF+   + G     +A    
Sbjct: 1   MWNSIDGYPAAVSLFKQRLYAAGSTGYPMRVWASGIGLYLDFTPGTKDG-----EAFGYD 55

Query: 385 VTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSK---GLSIDFRRVSGSGVYACPPVS 441
           +     +    +    + +        + ++   +      +I+    S  G     PV 
Sbjct: 56  MASDQVNQTVHLAS-AKILAALTQGEEFTVTGGSAGAITPTNINVDSQSVYGCARARPVR 114

Query: 442 VGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVV 500
           VG+ +V+V   G++++ ++       +R   +T+LA H+    I+ + +Q EP  +VW+V
Sbjct: 115 VGNEIVYVQRAGKKVRAMTYDLNTDAYRSQNLTRLAAHVTESGIVDVAFQAEPTPVVWMV 174

Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS 560
                 +   L+   +  + E    +  H+        S    P D   G  L+ +V  +
Sbjct: 175 R-----ADGVLVSMTYDRD-ENVCGFARHVTDGLF--KSVCCIPGDE--GDVLFAVVQRT 224

Query: 561 -AGEERSFTVRLN 572
             G    +  RL+
Sbjct: 225 INGATVQYVERLD 237


>gi|42526655|ref|NP_971753.1| hypothetical protein TDE1145 [Treponema denticola ATCC 35405]
 gi|41816848|gb|AAS11634.1| hypothetical protein TDE_1145 [Treponema denticola ATCC 35405]
          Length = 647

 Score =  193 bits (491), Expect = 5e-47,   Method: Composition-based stats.
 Identities = 71/561 (12%), Positives = 163/561 (29%), Gaps = 83/561 (14%)

Query: 9   HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68
            +F+ GE+S  L   R DL ++   V++  N   ++ G +      +     +      R
Sbjct: 4   TNFAGGEVSKNLY-GRIDLPIYQNSVSRLENFDIMQTGGIKRRGGTERIGKLK---GYAR 59

Query: 69  VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK-TP----YTFKDNKSLEY 123
           +  F + +    +   G + ++I      +  + A F   +  TP    Y   D   ++Y
Sbjct: 60  LIPFIVNNTLSFIFEIGSEYIRIWKN--GSLLTLAGFPVEFSPTPDLPLYQKSDLSEIQY 117

Query: 124 AVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK-LSI 182
           A    +    H+ + P+ + +                             +        +
Sbjct: 118 AQTYDSLYLAHRHYKPYVIKWQGGDAFT-------------------FGSLNITGNAHKL 158

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG 242
                 +    S + +F+    GR               +       +   KV+      
Sbjct: 159 PF--QGSDNYPSCVALFQ----GRLFF-----------ASTIREPQKIWASKVFEYENFT 201

Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302
                   +          +    +   S+K  ++S                        
Sbjct: 202 YFDTVVSKT--------TQLKNPDLRVFSAKAVKDSDVLTELTKDFTDITNITDYYVSGH 253

Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGA 362
              P+   +       +     A  ++E     +    N                     
Sbjct: 254 KGIPKDTKVLSVTSDSMKISKPATVDKEDIVLSIHLWRN-------ADSPQ------ADD 300

Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGL 422
           + D  +        P  A    +       I W+ P  + +++G ++S W++S       
Sbjct: 301 YKDTEIINNV--TAPDHAFYFEIGSDKNDKIKWITPSKD-LIIGTESSEWVMS-DGVTAQ 356

Query: 423 SIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGST-EQGFRFNEITQLADHLF- 480
            I+ +  S  GV       +G  ++++   GR ++  +    E  ++  ++TQ A HL  
Sbjct: 357 RIEVQLQSRYGVADLQGSLIGRSVIYIGQGGRSLRDYAYDFQEHTYKSIDLTQAASHLLI 416

Query: 481 NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
             + +   Y   P   +++ LE             +     G  AW   ++ +   + + 
Sbjct: 417 ESKAVDFDYTNSPVQKIYLSLEDGSACV-----LLYDKNT-GIAAWTKIVLGNGK-IKNI 469

Query: 541 ASFPNDNRGGTSLWMLVALSA 561
            + P   +G   ++  V    
Sbjct: 470 VTVPGL-KGFDDVYFEVERKG 489


>gi|54302254|ref|YP_132247.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9]
 gi|46915675|emb|CAG22447.1| hypothetical protein PBPRB0574 [Photobacterium profundum SS9]
          Length = 919

 Score =  185 bits (470), Expect = 2e-44,   Method: Composition-based stats.
 Identities = 50/253 (19%), Positives = 93/253 (36%), Gaps = 20/253 (7%)

Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373
              S   W +  W    GYP   T+   RL  + +     +V+LS   +F DFS      
Sbjct: 405 TSRSTYKWAIEIWRNSTGYPRCGTYFQQRLSMANTISHPQTVWLSRTDSFNDFSKTRPIL 464

Query: 374 CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSI----DFRRV 429
             D  +     +     + I  + P    +L      LW L+       S       +  
Sbjct: 465 ADDSMRYD---INSLQVNEIFNIVPLNSLLLF-TSGGLWSLAQDQQGAFSAESPPSVKMQ 520

Query: 430 SGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS-TEQGFRFNEITQLADHLFNQ-RILQL 487
           +  G     P+  G   ++V    R ++ I  S +   F   ++T  A HLF   R+++ 
Sbjct: 521 NYEGANKLRPIVAGSTAIYVQQGDRIVRDIQFSWSSDSFEGVDLTVRASHLFKHKRVVEW 580

Query: 488 VYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDN 547
            Y + P  ++WV+ +    +        +  E +  + W  H  + K+   + AS   + 
Sbjct: 581 AYAKNPDKLIWVIFDDGTAAT-----LTYMKE-QQIWGWCPHTTNGKY--KNVASV--EE 630

Query: 548 RGGTSLWMLVALS 560
              +S++ +V   
Sbjct: 631 GSRSSIYFVVERI 643



 Score =  165 bits (416), Expect = 3e-38,   Method: Composition-based stats.
 Identities = 51/368 (13%), Positives = 100/368 (27%), Gaps = 27/368 (7%)

Query: 6   WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65
            ++ S SAGELSP  +  R D   +  G+AK+ N     +G + + P             
Sbjct: 5   LSQPSMSAGELSPE-MYGRVDTDHYRIGLAKAENFFVNYHGGISNRPGTT-LSYITARNE 62

Query: 66  SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125
              +  F        +L FG + +++ + +       +       TPY   +   L Y  
Sbjct: 63  VVALIPFQFSAFDSFMLEFGTEYMRV-MSKGKYITDNSGVKIQVVTPYLAGEILDLSYTQ 121

Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185
                   H++H    +        I +  + +     P+    +       A  +    
Sbjct: 122 SADVLTIFHRNHAIQQIKRYS---NIDWRVEPLINKLGPFESININESQFMYADKNGD-- 176

Query: 186 DTSTARITSDMKIFKPLDKGRSIRLGCHPP----EWAKNTNYSIGAYIVADDKVYRSLTT 241
                 + S+   F     G+ + L         +W +    + G         Y     
Sbjct: 177 VGEQITLISNFDAFTSDLVGKMVYLDQEETGDISQWMQRYEVAEGDQTYNAGNYYICTKA 236

Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD-IKDVSKDGR 300
                +   +     V      W          +R++  G    Y   G  +  +     
Sbjct: 237 ELYNGKKAQTGDIAPVHSTGERWDGPGKFLPDDNRDANIGVRWAYLNSGYGVVKIISVTD 296

Query: 301 SISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
           +     +        V         W     +P   T    R     +    L+      
Sbjct: 297 ARHAICEVLVRLPDSVVGGERSKLTWN----FPGETT---QRTFSLATP--PLT-----S 342

Query: 361 GAFYDFSL 368
               DF++
Sbjct: 343 NTMKDFTV 350


>gi|226940469|ref|YP_002795543.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9]
 gi|226715396|gb|ACO74534.1| hypothetical protein LHK_01546 [Laribacter hongkongensis HLHK9]
          Length = 874

 Score =  185 bits (469), Expect = 2e-44,   Method: Composition-based stats.
 Identities = 68/381 (17%), Positives = 136/381 (35%), Gaps = 33/381 (8%)

Query: 210 LGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLN 269
            G      A   + +   Y V   K    +  G+                    W  +  
Sbjct: 329 TGSGARLSATVGSVACDGYSVTAIKTVTVIDGGKGYTSPSIVTVVKQDGRPITGWGPIHA 388

Query: 270 LSSKTSRE------------SASGAVAPYYVWGDIKDVSK-DGRSISVAPQSQTLFQAGV 316
             S ++               +  A+ P  + G I  V+  +G S   AP     +  G 
Sbjct: 389 TYSVSTSPNTVQLAVTDSGGGSGAALEPVIIDGAITAVNVINGGSGYFAPVVSVSYAGGG 448

Query: 317 SVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376
           S  ++          YP  V++   R  F+G+     +++++  G   +          D
Sbjct: 449 SGATFGQPVVKSSGDYPGAVSYFEQRRCFAGTTRKPQNIWMTKSGTESNMGYSLPVRDDD 508

Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLS---IDFRRVSGSG 433
               +   V+   A+TI  + P  + +L+   ++ W ++   S  ++   I  R  S  G
Sbjct: 509 R---IAFRVSAREANTIRHIVPLAQLLLL-TSSAEWRVTSVNSDAITPRSISVRPQSYIG 564

Query: 434 VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ-GFRFNEITQLADHLFNQ-RILQLVYQE 491
                PV + + L++    G  ++ ++ + +  GF   +++  A HLF+   I+ + + +
Sbjct: 565 ASNVQPVIINNTLIYASARGGHVRELAYNWQAGGFVTGDLSIRAPHLFDDFEIVDMAFGK 624

Query: 492 EPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGT 551
            P  +VW V     +S   L+G  +  E +   AWH H         S A+         
Sbjct: 625 SPQPVVWFV-----SSSGCLIGLTYVPE-QQVGAWHWHDTDG--VFESCAAV--AEGAED 674

Query: 552 SLWMLVALSA-GEERSFTVRL 571
            L+ ++  +  G  R +  R+
Sbjct: 675 VLYCVIRRTVNGCSRRYVERM 695



 Score =  166 bits (419), Expect = 1e-38,   Method: Composition-based stats.
 Identities = 53/318 (16%), Positives = 94/318 (29%), Gaps = 19/318 (5%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      + SF+ GE++P     R D + +  G+A  RN +   +GP ++       R+ 
Sbjct: 1   MATVKLLQRSFAGGEVTPEFF-GRIDDAKYQSGLAVCRNFVLAPHGPAMNRAGFAFVREV 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSST-KWSPALFGKTYKTPYTFKDNK 119
           +      R+  F+       ++  G    +     ++  +            PY   +  
Sbjct: 60  KDSNLKVRLIPFTYSTTQTMVIELGAGYFRFHTQGATLMQPDAPDSPYEVSNPYREDELF 119

Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLP--PPWLGDGMISGVKSN 177
            L Y         VH +HPP  L  +      ++    +   P   P       +   S 
Sbjct: 120 DLHYVQSADVMTLVHPNHPPQELRRLG---ATNWELKPVSLQPVIAPPENAAASTAGCSE 176

Query: 178 AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVAD---DK 234
           AK       T+      +      +   RS         +      +I     A      
Sbjct: 177 AKYDYEYVVTAVMVDLVNESAASNVATVRS-------NVYETGCTNTISWSASAGAYRYN 229

Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKD 294
           VY+    G      G + G + V DN    ++            A    +     G    
Sbjct: 230 VYK--KEGGVYGYIGQTAGLSLVDDNISPDLSKTPPIYDNVFSVAGQIESVPVTAGGSFY 287

Query: 295 VSKDGRSISVAPQSQTLF 312
            +  G   SV   +  LF
Sbjct: 288 GTHTGIIQSVTVLNGVLF 305


>gi|291334666|gb|ADD94313.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695]
          Length = 189

 Score =  177 bits (448), Expect = 5e-42,   Method: Composition-based stats.
 Identities = 35/175 (20%), Positives = 78/175 (44%), Gaps = 13/175 (7%)

Query: 401 EGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRI 456
             +++G     + +S   +       +I  ++ S +G      ++VG+  +F+    R++
Sbjct: 5   RTLIIGTAGGEFAVSGGGTDIAITPTNILIKKQSNNGAANVDALAVGNATLFLQRARRKL 64

Query: 457 KYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCR 515
           + ++ + +  G+   ++T LA+H+      QL YQ+EP+ ++W V         +L+G  
Sbjct: 65  RELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG-----QLVGLT 119

Query: 516 FSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569
           +  E +   AWH H+        S A+ P D+      W++   +  G  + +  
Sbjct: 120 YQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGSTKRYVE 172


>gi|291334457|gb|ADD94111.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161]
          Length = 206

 Score =  174 bits (440), Expect = 4e-41,   Method: Composition-based stats.
 Identities = 36/176 (20%), Positives = 78/176 (44%), Gaps = 13/176 (7%)

Query: 400 GEGVLVGCDTSLWLLSISLS----KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRR 455
              V++G     + +S   +       +I  ++ S +G      ++VG+  +F+    R+
Sbjct: 3   DGHVIIGTAGGEFAVSGGGTDIAITPTNILIKKQSNNGAANVDALAVGNATLFLQRARRK 62

Query: 456 IKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGC 514
           ++ ++ + +  G+   ++T LA+H+      QL YQ+EP+ ++W V         +L+G 
Sbjct: 63  LRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG-----QLVGL 117

Query: 515 RFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569
            +  E +   AWH H+        S A+ P D+      W++   +  G  + +  
Sbjct: 118 TYQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGSTKRYVE 171


>gi|291336928|gb|ADD96456.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787]
          Length = 138

 Score =  173 bits (438), Expect = 8e-41,   Method: Composition-based stats.
 Identities = 29/140 (20%), Positives = 49/140 (35%), Gaps = 3/140 (2%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M        +F+ GELSPRL   R DL+ +  G     N+I   +G        Q   + 
Sbjct: 1   MARVAVQLTNFTGGELSPRL-DGRNDLAKYPTGCKTLENMIVFPHGSAARRSGTQFVAEV 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +   +  R+  F        +L FG++ ++                    +PY   +   
Sbjct: 60  KDSSKETRLIPFEFSTTQTYMLEFGNQYIRFYKDNGQILS--GGSAYEISSPYLEAELFD 117

Query: 121 LEYAVFGSTAVFVHKDHPPH 140
           ++YA         H +HP  
Sbjct: 118 IKYAQSADVMYICHPNHPVK 137


>gi|291336965|gb|ADD96491.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C1587]
          Length = 474

 Score =  147 bits (371), Expect = 4e-33,   Method: Composition-based stats.
 Identities = 65/478 (13%), Positives = 138/478 (28%), Gaps = 83/478 (17%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      + +F+ GE+ P LL++R D++ +   + ++RN+I    G +   P        
Sbjct: 1   MSRAVSIQSNFTTGEVDP-LLRARIDINQYYNALEQARNVIVQPQGGIERRPG------- 52

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
                      F                   V   ++ +    L    + T         
Sbjct: 53  ---------LQFIFE----------------VPSAANPQNGMKLVPFEFST--------- 78

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
                     +FVH                  + F + + +                  +
Sbjct: 79  ---TQS-YMLLFVH---------------NRMYIFKDKELVTN----INSSGNDYLTTTI 115

Query: 181 SISQADTSTARITSDMKIFKPLDKG--RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRS 238
           + +   T     ++D  I    D    + +R   H      + ++      +      +S
Sbjct: 116 TSTVLATMDHTQSADTLIVVQEDMAPKKIVRGAAHNTWTISDISFEF----IPKFNFTQS 171

Query: 239 LTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKD 298
            TT           G   +           + +     E+  G      +   +   S +
Sbjct: 172 ETTINQTITPSAVDGNITITAGG---NVFASGNLNQYIEANDGMGR-ARITRFVSATSVE 227

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358
                    +  +   G  +   +  +W   +GYP   TFH  RL F G K    +++ S
Sbjct: 228 AIVEIPFFNTTAIASGGTFIDGGYEDSWSGSKGYPRTATFHEGRLYFGGVKSRPNTIFAS 287

Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL--LSI 416
               F+DF+     G      ++   ++  S + I  M    +  +       +L   ++
Sbjct: 288 RVARFFDFNP----GEALDDDSIELTISTDSTNAITGMFSGRDLQIFTKGGEFFLPQSTL 343

Query: 417 SLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEI 472
                 ++     +  G      PV      +F+   G+ ++  +    E  +  N I
Sbjct: 344 DPITPTNVVVNGATRRGSQEGIKPVGAESGTLFIQRAGKSLREFLFSDVELSYISNNI 401


>gi|296532340|ref|ZP_06895077.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
 gi|296267336|gb|EFH13224.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
          Length = 626

 Score =  147 bits (370), Expect = 5e-33,   Method: Composition-based stats.
 Identities = 64/363 (17%), Positives = 123/363 (33%), Gaps = 45/363 (12%)

Query: 218 AKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRE 277
           + NT++SI  +    +  YR  + G +      S   T            L  S+   + 
Sbjct: 133 SSNTSWSIAPWSFVREPFYRFASPGVTLAPSATSGSVT------------LTASAAAFQP 180

Query: 278 SASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVT 337
             +G    + + G    V+    + S     +       +   W  +A+    G+P    
Sbjct: 181 GHAGVR--FRLGGKRVLVTAVASATSATASVEETLPGTAASADWDEAAFSAVRGWPVTAC 238

Query: 338 FHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMH 397
           FH +RL+  GS+     ++LS  G  ++F    + G     +A+   +     + I    
Sbjct: 239 FHQDRLVLGGSRDLPNRLWLSRSGDLFNF----DLGSGLDDQAIEFGLLSDQVNAIR-AV 293

Query: 398 PFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV---YACPPVSVGDCLVFVCGVGR 454
             G  + V    + W+++       SI   R +  G       PPV V    +FV   G+
Sbjct: 294 FSGRHLQVFTSGAEWMVTGEPMTPASIQLHRQTRIGSPVARIIPPVDVDGSTIFVARSGQ 353

Query: 455 RIKYISG-STEQGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLG 513
            +   +    +Q ++ N++  +A HL    +  + Y +    ++ V ++        L  
Sbjct: 354 AVHEYAYTDVQQAYQANDLALVARHLVQTPV-SMAYDQT-RRLLHVAMQGGW-----LAT 406

Query: 514 CRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573
                  E   AW           L+            ++W  V  +        +RL  
Sbjct: 407 LTLYR-AEQVTAWTRQDTDGAFRALA--------EIDGTVWCAVERAGA------MRLER 451

Query: 574 LDD 576
            DD
Sbjct: 452 FDD 454



 Score =  141 bits (355), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 44/211 (20%), Positives = 76/211 (36%), Gaps = 21/211 (9%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M     TK SF+AGEL  +LL  R DL  +  G  + RN+     G L   P ++   + 
Sbjct: 1   MAAGRSTKTSFTAGELGDQLL-GRGDLRAYENGARRLRNVFIQPTGGLTRRPGLRHVAEL 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
              P   R+ +F        L+V   + L++ +              +   P+T     +
Sbjct: 60  ---PGPARLIAFEFNTEQTYLVVLTHQGLRVFLGDVQVA--------SLAGPWTAAMLDA 108

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           + +     T + +H D  P  +         S++     F+  P+          S    
Sbjct: 109 IAWTQSADTLLLLHPDMVPQRVTRSS---NTSWSIAPWSFVREPFYRFA------SPGVT 159

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLG 211
               A + +  +T+    F+P   G   RLG
Sbjct: 160 LAPSATSGSVTLTASAAAFQPGHAGVRFRLG 190


>gi|307946248|ref|ZP_07661583.1| hypothetical protein TRICHSKD4_4953 [Roseibium sp. TrichSKD4]
 gi|307769912|gb|EFO29138.1| hypothetical protein TRICHSKD4_4953 [Roseibium sp. TrichSKD4]
          Length = 681

 Score =  146 bits (369), Expect = 8e-33,   Method: Composition-based stats.
 Identities = 104/585 (17%), Positives = 173/585 (29%), Gaps = 82/585 (14%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           +      + +F+AGEL P LL  R  L   + G     N++ +  G       + +    
Sbjct: 2   VARPGRLQSAFTAGELDP-LLHERSQLKYFSTGADHMENVVSIPQGGFGLRGGLLDIGAV 60

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTP-YTFKDNK 119
              P ++R+F F   DG    LVF   K++         W  +   +    P  +     
Sbjct: 61  D--PAASRLFDFKASDGSAYDLVFAPGKME--------AWGNSGKLQDLAIPALSETMLP 110

Query: 120 SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAK 179
            L  A    T + +H D  P  + +       +++ D +     P    G          
Sbjct: 111 GLNDAQQRDTMILLHADLQPQRIKHAGPQ---AWSADAVPLTGLPSYDYGA--------- 158

Query: 180 LSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSL 239
            + S    +  R+      F  LD      L     E       SIG          R  
Sbjct: 159 -TYSNGVAAVWRLE-----FVGLDANSIFTLTISQEE-----TVSIGYTTAMGTLASRVR 207

Query: 240 TTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESA-SGAVAPYYVWGDIKDVSKD 298
           T     D    + G +             +  +      A SG V        +   +  
Sbjct: 208 TA--VQDLPNVAPGISVASAGGSKIAVTFSGENNAGDGWAVSGNVINKADAAILAAKT-- 263

Query: 299 GRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358
             ++ VAP    +                   G+P    F+N RLL  G KG   +   S
Sbjct: 264 --TVGVAPGEPVI---------------SSVRGWPRCGAFYNQRLLLGGFKGLPNAWMFS 306

Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL 418
             G +++F  D  +   +    +   V       +  + P     +       W+    L
Sbjct: 307 LQGDYFNF--DERFSAANGPALIPMDVDGGEV--VEQIVPSRNLAIFTNGAEYWIAERGL 362

Query: 419 SKGLSIDFRRVSGSGVYACPPVSVGDCLV-FVCGVGRRIKYISG-STEQGFRFNEITQLA 476
           S+    +  +    GV    P+   +  + FV   G  I        E  F   +I+ L 
Sbjct: 363 SRTEPPNHVQAGERGVKNGVPIVANEGALNFVSSTGSVIGEFRYTDVEGNFVSRDISLLG 422

Query: 477 DHLFNQRILQLVYQEEPHSIV----WVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532
            HL    +     +    S       +VLE        LL        +   A+      
Sbjct: 423 SHLIID-VKDQAMRRAEKSTSGNLNGIVLEDGQARLATLL------REQDVTAFSRMTSD 475

Query: 533 DKHY-VLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576
             H+  +S         G   +  +V   AG      V   LLD+
Sbjct: 476 SGHFKAVSV-------NGRNEMSWIVDRPAGRRLERLVTGYLLDE 513


>gi|83313369|ref|YP_423633.1| hypothetical protein amb4270 [Magnetospirillum magneticum AMB-1]
 gi|82948210|dbj|BAE53074.1| hypothetical protein [Magnetospirillum magneticum AMB-1]
          Length = 634

 Score =  146 bits (367), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 41/206 (19%), Positives = 73/206 (35%), Gaps = 15/206 (7%)

Query: 6   WTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPR 65
           +TK SF+AGE+   L   R DL+L+A G    RN++    G +   P ++     R    
Sbjct: 7   FTKTSFTAGEVDVDL-AGRGDLALYANGAKSLRNVVVAPIGGVRRRPGLRHVAPAR---G 62

Query: 66  SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAV 125
             R+ +F        LL   D ++ I        ++        +TP++      L +  
Sbjct: 63  PGRLIAFEFNTEQTYLLALSDHRMDI--------YADGAKVAELETPWSTAQVAQLSWTQ 114

Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185
              T + VH D  P  +         S+  +   +     +          +A       
Sbjct: 115 SADTLLVVHPDVEPRKITRTGAN---SWVLETWSYYQEDGILYVPTHKFAKDAVTLTPSG 171

Query: 186 DTSTARITSDMKIFKPLDKGRSIRLG 211
            + T  +T+   +F     G   R+G
Sbjct: 172 TSGTITLTASEAVFDAAHAGCRFRVG 197



 Score =  133 bits (334), Expect = 8e-29,   Method: Composition-based stats.
 Identities = 65/380 (17%), Positives = 123/380 (32%), Gaps = 49/380 (12%)

Query: 217 WAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSR 276
           W ++ +  +   +V  D   R +T  R+G      +  +Y +++ I ++     +     
Sbjct: 112 WTQSADTLL---VVHPDVEPRKIT--RTGANSWVLETWSYYQEDGILYVPTHKFAKDAVT 166

Query: 277 ESASGAVAP------------------YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSV 318
            + SG                      + V G    +S    +     + +       + 
Sbjct: 167 LTPSGTSGTITLTASEAVFDAAHAGCRFRVGGKQVLISAVTSATQAQAEVKQTLGGTAAT 226

Query: 319 VSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPT 378
             W   ++    G+P  V FH  RL   GS+G    ++LS     ++F    + G     
Sbjct: 227 EDWEEQSFSPLRGWPVSVCFHQGRLAIGGSRGLPNRLWLSKSMDLFNF----DLGTGLDD 282

Query: 379 KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGV---Y 435
           +A+  ++       I      G  + V    + W++  S      I   R +  G     
Sbjct: 283 EAIEFSLLSTQVDAIR-AVFSGRHLQVFTSGAEWMVVGSPLTPTKIQLNRQTRVGSPVDR 341

Query: 436 ACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPH 494
           + PP  V     FV   GR ++  +    +Q ++ N+++ +A H+ N  +    Y     
Sbjct: 342 SVPPRDVDGATHFVSRSGRDLREFLFADVDQAYQANDLSMVAKHVMNTPV-DQDYDAS-R 399

Query: 495 SIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLW 554
            +  VV+     +   +         E   AW            S A    D        
Sbjct: 400 RLFHVVM-----ADGLMATLTVYR-AEKVTAWTVFETQGAF--RSVAVVDGDTH------ 445

Query: 555 MLVALSAGEE-RSFTVRLNL 573
           +LV          F   LNL
Sbjct: 446 VLVERGGSHVIECFDDTLNL 465


>gi|209966375|ref|YP_002299290.1| hypothetical protein RC1_3113 [Rhodospirillum centenum SW]
 gi|209959841|gb|ACJ00478.1| conserved hypothetical protein [Rhodospirillum centenum SW]
          Length = 638

 Score =  145 bits (365), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 46/215 (21%), Positives = 71/215 (33%), Gaps = 14/215 (6%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      K +F+ GELSP LL  R DL  +  G    RN++ L  G +   P        
Sbjct: 1   MTRLRSVKAAFTGGELSPDLL-GRGDLRSYETGALALRNVLILPTGGVTRRPGTAYLATL 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
              P   R+ +F+       LL F D++L++                  +TP+T      
Sbjct: 60  ---PGPGRLAAFAFDTEQAYLLAFTDRRLEVF--------RDGATEAVLETPWTAGQLAQ 108

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKI--SFTFDEIKFLPPPWLGDGMISGVKSNA 178
           L +       +  H D PP  ++   D      ++ F  +K      L           A
Sbjct: 109 LAWTQSADVLLVCHPDVPPRRIVRSGDRRWRCEAWRFSTVKTADGRALQRLPFHRFADAA 168

Query: 179 KLSISQADTSTARITSDMKIFKPLDKGRSIRLGCH 213
                       R+ +   +F     GR  RL   
Sbjct: 169 VTLTPSGTRGRVRVRASAPVFDGAHAGRPFRLRRR 203



 Score =  128 bits (321), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 41/252 (16%), Positives = 79/252 (31%), Gaps = 23/252 (9%)

Query: 312 FQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGE 371
                  + W   A+    G+P    FH +RL+  GS+     ++LS  G  +DF     
Sbjct: 224 VPDAEPSIDWDEPAFSPLRGWPVSACFHQDRLVIGGSRDLPNRLWLSRSGDLFDFDP--- 280

Query: 372 YGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSG 431
            G  +  +A+  A+     + I  +   G  + V    + W ++        +   R S 
Sbjct: 281 -GEGEDDEAIEFAILSDQVNAIRQVFS-GRHLQVFTTGAEWAVTGEPLTPKEVRLDRQSR 338

Query: 432 SGVY---ACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLV 488
            G       P   V    +F    G   +++    E  +   ++T  A HL    +    
Sbjct: 339 VGSGPGRQIPAREVDGATLFAGRDGAVREFLWTDLESSYSTTDLTLAAGHLCRAPVE--- 395

Query: 489 YQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNR 548
              +P   + + ++    +   +         E    W          V S A    +  
Sbjct: 396 LDVDPGRRLLLAVQ----ADGGVAALTLDR-AEQVTGWTRLETDGA--VRSLAVVRGEVH 448

Query: 549 GGTSLWMLVALS 560
                W++    
Sbjct: 449 -----WLVERQG 455


>gi|291336926|gb|ADD96454.1| hypothetical protein [uncultured organism MedDCM-OCT-S09-C787]
          Length = 158

 Score =  144 bits (362), Expect = 5e-32,   Method: Composition-based stats.
 Identities = 26/141 (18%), Positives = 63/141 (44%), Gaps = 6/141 (4%)

Query: 369 DGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS----KGLSI 424
           D  +G      ++   +     + I +M      +++G     + +S   +       +I
Sbjct: 3   DNYHGTVADDDSIIYTIASNQVNAIRFMTAT-RTLIIGTAGGEFAVSGGGTDIAITPTNI 61

Query: 425 DFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQR 483
             ++ S +G      ++VG+  +F+    R+++ ++ + +  G+   ++T LA+H+    
Sbjct: 62  LIKKQSNNGAANVDALAVGNATLFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGG 121

Query: 484 ILQLVYQEEPHSIVWVVLEPK 504
             QL YQ+EP+ ++W V    
Sbjct: 122 FKQLSYQQEPNQVIWGVRNDG 142


>gi|144898783|emb|CAM75647.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
           MSR-1]
          Length = 635

 Score =  144 bits (362), Expect = 5e-32,   Method: Composition-based stats.
 Identities = 42/211 (19%), Positives = 74/211 (35%), Gaps = 15/211 (7%)

Query: 3   NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62
           N T  K +F+AGELS  +L  R DL+ +  G  + RN+     G +   P ++      +
Sbjct: 5   NITLAKTNFTAGELSLDML-GRGDLAAYGNGAKRLRNVFIAPIGGVSRRPGLR---HVDI 60

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
                R+ +F        LLV  D  L I        ++  +      TP+T    + + 
Sbjct: 61  ARGKGRLIAFEFNTEQTYLLVLTDLHLDI--------YADGVAVAHVDTPWTEAQLQQIN 112

Query: 123 YAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI 182
           +     T + VH +  P  L         ++T     F     +         ++     
Sbjct: 113 WTQTADTLLIVHPEVAPRKLTRTAHS---AWTISNWMFHEADGVLFQPYHKFAADEVTLQ 169

Query: 183 SQADTSTARITSDMKIFKPLDKGRSIRLGCH 213
             A + +  +T+    F     G  +RL   
Sbjct: 170 PSATSGSITLTASAAFFVAGHVGTRLRLQQK 200



 Score =  143 bits (359), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 49/286 (17%), Positives = 99/286 (34%), Gaps = 26/286 (9%)

Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352
            +++    +   +   +       +   W   A     G+P  V FH +RL+  GS+   
Sbjct: 202 VEITAIASATQASATVKQNLVNTSAHKDWEEQALSAVRGWPVSVCFHQDRLVIGGSRDQP 261

Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412
             ++LS     ++F    + G     +A+  A+     + I  +   G  + V    + W
Sbjct: 262 NRLWLSKSSDLFNF----DLGEALDDEAIEFALLSDQVNAIRHVFS-GRHLQVFTSGAEW 316

Query: 413 LLSISLSKGLSIDFRRVSGSGV---YACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFR 468
           ++S       SI   R +  G       PP  V    +FV   G+ ++  +    EQ ++
Sbjct: 317 MVSGQPLTPSSIQLTRQTRVGSPIDRTVPPRDVDGATLFVSRNGKDLREFLFADVEQAYQ 376

Query: 469 FNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHT 528
             ++  LA H+    +    Y  +    ++ V+         L         E   AW  
Sbjct: 377 SGDLAMLAKHVMLAPV-DQDY--DAGRRLFHVVM----GDGGLATVTVYR-SEKVTAWTG 428

Query: 529 HMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEE-RSFTVRLNL 573
           H+ + +   ++             +++LV          F   L+L
Sbjct: 429 HVTAGRFLAVAVV--------EGEVYVLVEREGIVSVECFDESLSL 466


>gi|288959323|ref|YP_003449664.1| hypothetical protein AZL_024820 [Azospirillum sp. B510]
 gi|288911631|dbj|BAI73120.1| hypothetical protein AZL_024820 [Azospirillum sp. B510]
          Length = 632

 Score =  138 bits (346), Expect = 3e-30,   Method: Composition-based stats.
 Identities = 59/325 (18%), Positives = 104/325 (32%), Gaps = 42/325 (12%)

Query: 270 LSSKTSRESASGAVAPYYVWGDIKDVSKDGR----------------SISVAPQSQTLFQ 313
             + T   S +G          + D  +DG                 +  V    +    
Sbjct: 162 DPAVTVTPSGTGGAITVTASAPVFDPRQDGTRLRIRGKQLLVTGVVSATQVNATVKETLA 221

Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373
                  W   A+    G+P    FH +RL+  GS+     ++LS     ++F    + G
Sbjct: 222 DTQPTPQWEEQAFSALRGWPVSAAFHQDRLVIGGSRDLPNRLWLSRSAQIWNF----DLG 277

Query: 374 CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSG 433
                +A+   +     + +      G  + V    + ++++       S+  +R +  G
Sbjct: 278 EGLDDQAIEFGILSDQVNAVR-AVFSGRHLQVFTSGAEYMVTGDPLTPQSMQVKRQTRIG 336

Query: 434 V---YACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEITQLADHLFNQRILQLVY 489
                A PP  V    +FV    R I+  +   TE  ++ N++  LA HL         Y
Sbjct: 337 SPMDRAIPPRDVEGATLFVPRNRREIREFLFTDTEAAYQANDLALLARHLVASP-RDQDY 395

Query: 490 QEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRG 549
            +    +++V +E        L         E   AW          V S A+       
Sbjct: 396 DQN-RRLLFVAMEDG-----TLGALTAYR-AEDVTAWTLLETDGA--VRSVAAV------ 440

Query: 550 GTSLWMLV-ALSAGEERSFTVRLNL 573
           G  ++ LV          F   LNL
Sbjct: 441 GDEVYALVERRGFWTIERFDDGLNL 465



 Score =  136 bits (342), Expect = 9e-30,   Method: Composition-based stats.
 Identities = 46/211 (21%), Positives = 69/211 (32%), Gaps = 15/211 (7%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M      K +F+AGE+S RLL  R DL  +  G    RNL     G +     +      
Sbjct: 2   MGRLHQVKTNFTAGEVSRRLL-GRGDLKAYDNGALALRNLFIDPTGGVTRRSGLAF---T 57

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            L P   R+ +F        LLVF D+++ +                +   P+T      
Sbjct: 58  ALAPGDGRLVAFERNSEQTYLLVFTDRRIDVF--------QGGSRLASVAAPWTLTQLAQ 109

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKL 180
           + +     T +  H D PP  L     GD   +   E  F     L           A  
Sbjct: 110 ITWTQSADTLLVCHPDLPPRKLTR---GDDGGWALAEWAFAVEGGLVRTPFHRFGDPAVT 166

Query: 181 SISQADTSTARITSDMKIFKPLDKGRSIRLG 211
                      +T+   +F P   G  +R+ 
Sbjct: 167 VTPSGTGGAITVTASAPVFDPRQDGTRLRIR 197


>gi|291334718|gb|ADD94364.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890]
          Length = 135

 Score =  137 bits (344), Expect = 5e-30,   Method: Composition-based stats.
 Identities = 28/125 (22%), Positives = 58/125 (46%), Gaps = 9/125 (7%)

Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505
           +F+    R+++ ++ + +  G+   ++T LA+H+      QL YQ+EP+ ++W V     
Sbjct: 1   MFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG- 59

Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEE 564
               +L+G  +  E +   AWH H+        S A+ P D+      W++   +  G  
Sbjct: 60  ----QLVGLTYQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGST 113

Query: 565 RSFTV 569
           + +  
Sbjct: 114 KRYVE 118


>gi|291334514|gb|ADD94167.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201]
 gi|291336446|gb|ADD96001.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073]
          Length = 153

 Score =  136 bits (343), Expect = 7e-30,   Method: Composition-based stats.
 Identities = 28/125 (22%), Positives = 58/125 (46%), Gaps = 9/125 (7%)

Query: 447 VFVCGVGRRIKYISGSTE-QGFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKD 505
           +F+    R+++ ++ + +  G+   ++T LA+H+      QL YQ+EP+ ++W V     
Sbjct: 1   MFLQRARRKLRELAYNFDVDGYVAPDLTILAEHISEGGFKQLSYQQEPNQVIWGVRNDG- 59

Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEE 564
               +L+G  +  E +   AWH H+        S A+ P D+      W++   +  G  
Sbjct: 60  ----QLVGLTYQRE-QQVVAWHRHIFGGSAVCESVATIPTDDS-EYQTWVINKRTINGST 113

Query: 565 RSFTV 569
           + +  
Sbjct: 114 KRYVE 118


>gi|83721618|ref|YP_441474.1| gp12 [Burkholderia thailandensis E264]
 gi|257139844|ref|ZP_05588106.1| gp12, putative [Burkholderia thailandensis E264]
 gi|83655443|gb|ABC39506.1| gp12, putative [Burkholderia thailandensis E264]
          Length = 188

 Score =  136 bits (342), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 29/134 (21%), Positives = 44/134 (32%), Gaps = 4/134 (2%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   T  + + +AGELSP L +   DL  +A GV    N IP   G        ++    
Sbjct: 1   MAKITTIQSNLNAGELSPPL-EGHIDLDRYANGVKTMLNAIPQIEGGARRRFGFRQVAAT 59

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
           +    + R+  F         +  GD   +            +       TP++      
Sbjct: 60  K-TTGATRLVPFVFSKSQAYFVELGDAYARFYTDSGQ--IQQSGVPIELATPWSASQLFE 116

Query: 121 LEYAVFGSTAVFVH 134
           LEY     T    H
Sbjct: 117 LEYTQNSDTMFIAH 130


>gi|77734533|emb|CAI59394.2| hypothetical protein pSG3.03 [Sodalis glossinidius]
          Length = 517

 Score =  115 bits (287), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 31/122 (25%), Positives = 55/122 (45%), Gaps = 13/122 (10%)

Query: 453 GRRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPR 510
              ++ ++ S +  GF+ N++T LA+H F   ++L   +   P S+VW V          
Sbjct: 188 RSAVRDLAYSFDVDGFQGNDLTVLANHFFTGFQLLDWAFTITPLSVVWCVRNDG-----T 242

Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569
           LLG  +  E +   AWH H  + K+   +  S         +L+ +V  +  G+ R +  
Sbjct: 243 LLGLTYLRE-QQVAAWHQHPAAGKY--EAVCSI--SEGTEDALYCVVNRTIQGQPRRYVE 297

Query: 570 RL 571
           RL
Sbjct: 298 RL 299


>gi|89886023|ref|YP_516220.1| hypothetical protein SGPHI_0042 [Sodalis phage phiSG1]
 gi|89191758|dbj|BAE80505.1| conserved hypothetical protein [Sodalis phage phiSG1]
 gi|125470053|gb|ABN42245.1| gp40 [Sodalis phage phiSG1]
          Length = 517

 Score =  115 bits (287), Expect = 3e-23,   Method: Composition-based stats.
 Identities = 31/122 (25%), Positives = 55/122 (45%), Gaps = 13/122 (10%)

Query: 453 GRRIKYISGSTE-QGFRFNEITQLADHLFNQ-RILQLVYQEEPHSIVWVVLEPKDNSFPR 510
              ++ ++ S +  GF+ N++T LA+H F   ++L   +   P S+VW V          
Sbjct: 188 RSAVRDLAYSFDVDGFQGNDLTVLANHFFTGFQLLDWAFTITPLSVVWCVRNDG-----T 242

Query: 511 LLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS-AGEERSFTV 569
           LLG  +  E +   AWH H  + K+   +  S         +L+ +V  +  G+ R +  
Sbjct: 243 LLGLTYLRE-QQVAAWHQHPAAGKY--EAVCSI--SEGTEDALYCVVNRTIQGQPRRYVE 297

Query: 570 RL 571
           RL
Sbjct: 298 RL 299


>gi|48696643|ref|YP_024422.1| hypothetical protein VP2p15 [Vibrio phage VP2]
 gi|40950041|gb|AAR97632.1| hypothetical protein [Vibrio phage VP2]
          Length = 594

 Score =  103 bits (257), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 64/346 (18%), Positives = 116/346 (33%), Gaps = 42/346 (12%)

Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306
             G +  A +V D       V+  +    R +       Y   GD    +  GR I V P
Sbjct: 80  EVGNTNIAVWVNDV----RQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHP 135

Query: 307 QSQTLFQAGVSVVSWFM---------SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357
             Q       +  +W           + W     YP  V    NR+ + GS       + 
Sbjct: 136 ALQPKRLYRDNNNAWQFVNMHTGAVPAEWSPSN-YPQTVGIFQNRVWYVGSPVHRTYFWA 194

Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417
           +  G   D +        DP   +             W+    + + +G   + + L+ S
Sbjct: 195 TRAGKLEDIAPSTANNPNDPISFVGIMEGTPC-----WIIASSDVLTIGTTINDYQLAAS 249

Query: 418 LS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEIT 473
                   +   RR S  G  A   +   + ++F      ++  ++   E   +  +E++
Sbjct: 250 TGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCSRNKSKVYAMNYVREQDNWIPDEMS 309

Query: 474 QLADHLFN-------QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
             A HLF          + ++ Y  +    +WVVLE       ++  C F    +   AW
Sbjct: 310 SQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLE-----NGQINYCCFDRTTDTK-AW 363

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS---AGEERSFTV 569
               +S    +  AA+F  D       ++ V  S    G ++++TV
Sbjct: 364 TQLELSGGKVIDIAAAFNPD---SDYAYVAVVRSKAINGVQKNYTV 406



 Score = 56.8 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 48/329 (14%), Positives = 91/329 (27%), Gaps = 30/329 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
             +++ SF  G ++PRL  +  + + +   +  + N +    G L++    +E   C+  
Sbjct: 2   ADFSQTSFKGGVIAPRLQFNEYESA-YHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDG 60

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVR-----SSTKWSPALFGKTYKTPYTFKDN 118
                            ++  G+  + + V       ++T           +T Y     
Sbjct: 61  EVRLFRLPAVDAPSNDVIVEVGNTNIAVWVNDVRQVVANTPSEWRNTIDRIQTAY--DTI 118

Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS-- 176
                A      + VH    P  L    + +   F       +P  W        V    
Sbjct: 119 GDDAGAANTGRLIMVHPALQPKRLYR-DNNNAWQFVNMHTGAVPAEWSPSNYPQTVGIFQ 177

Query: 177 -------NAKLSISQADTSTARI--TSDMKIFKPLDKGRSIRLGCHPPEWAKNTN----- 222
                  +         T   ++   +      P D    + +    P W   ++     
Sbjct: 178 NRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGTPCWIIASSDVLTI 237

Query: 223 -YSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASG 281
             +I  Y +A      S+T   +  R    +G   V         V+  S   S+  A  
Sbjct: 238 GTTINDYQLAAS-TGVSVTAATAILRRSSVQGTAAV-QGIPAEEQVIFCSRNKSKVYAMN 295

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQT 310
            V     W  I D           P S  
Sbjct: 296 YVREQDNW--IPDEMSSQAQHLFTPISSA 322


>gi|259419134|ref|ZP_05743051.1| hypothetical protein SCH4B_4402 [Silicibacter sp. TrichCH4B]
 gi|259345356|gb|EEW57210.1| hypothetical protein SCH4B_4402 [Silicibacter sp. TrichCH4B]
          Length = 715

 Score =  102 bits (254), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 96/531 (18%), Positives = 169/531 (31%), Gaps = 53/531 (9%)

Query: 3   NTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRL 62
             T  +  FS G++ P   Q R D+ L A+ V +  N + L  G +     M+       
Sbjct: 5   KETIWQKDFSLGQVRPE-AQERDDIDLVARSVKEGLNCVVLSTGQMEGRSGMRFLNATAS 63

Query: 63  DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLE 122
                      + +G    L F    L +    ++ +++  +        +       + 
Sbjct: 64  SQGREV----DLGEGRVFDLHFVPSGLILYDSNNTVEYTGNITWTAAPKKWGIYTFDEIS 119

Query: 123 YAVFGS----TAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNA 178
           + V       + +   +  P   L+  +DG   S++F E+ F      G    S  + N 
Sbjct: 120 FWVVADPDSSSILIGSQHFPIQALILNEDG---SWSFGEMAFATG-LAGAIHQSYWRYNE 175

Query: 179 KLSI-SQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYR 237
            +SI   A T    +T+   I+    +G +IR         +N    +G  + +      
Sbjct: 176 TVSIQPSARTGAITVTASEAIWTADHEGMAIR--------YQNREIILGTLVSS----TV 223

Query: 238 SLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESA-SGAVAPYYVWGDIKDVS 296
                       Y    + V +  +      ++       +  +G+V             
Sbjct: 224 INAAVTEELPPTYDITVSSVSNYQVGEAVEHSVLGGQGIITGIAGSVITVMATSRYDGFD 283

Query: 297 KDGRSISVAPQSQTLFQAGVSVVS------WFMSAWGEQEGYPSHVTFHNNRLLFSGSKG 350
                  VAP +     A  +  +      W M       GY  +   H +R+      G
Sbjct: 284 TVASPKLVAPNAAQPISAVAAAATPAATVIWEMQMQSPVHGYAGYAVRHLSRVFLCDFPG 343

Query: 351 DELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDT- 409
              +   S  GA  DF +  E             V   S  T+ +M    + + +     
Sbjct: 344 APQAFAASVVGAINDFKMGSE-----DADGFVDTVGADSGGTLRFMASVEDLLFLTSKGI 398

Query: 410 -SLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY--ISGSTEQG 466
            S      S     +I   R S  G  +  P++V D  VFV  VG+RI    ++G     
Sbjct: 399 YSHQTRDGSAITPATIRPVRFSRVGCASVEPIAVDDGCVFVDAVGQRIYAATLAGDIYTK 458

Query: 467 FRFNEITQLADHLFNQRILQLVY-------QEEPHSIVWVVLEPKDNSFPR 510
           +R   +T L   L    I   VY        E   S V+VV      +  +
Sbjct: 459 WRAEPMTSLHPQL----IKDAVYLGATSSGSENAESFVYVVNSDGSVALGQ 505


>gi|325971691|ref|YP_004247882.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy]
 gi|324026929|gb|ADY13688.1| hypothetical protein SpiBuddy_1864 [Spirochaeta sp. Buddy]
          Length = 551

 Score =  102 bits (253), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 45/275 (16%), Positives = 87/275 (31%), Gaps = 60/275 (21%)

Query: 332 YPSHVTFHNNRLLFSGSKGDELSVYLSSF--------GAFYDFSL--------------- 368
           YPS V    NRL FS +     + ++S            F  F +               
Sbjct: 166 YPSVVGICQNRLWFSAAILKPYTTWVSRPPYDGSNNHHDFTTFDVIEVNTEVIKDPSTWP 225

Query: 369 -------DGEYGCYDPTK----------------ALTTAVTDFSASTIHWMHPFGEGVLV 405
                  D      D +K                A+   +      TI W     + + +
Sbjct: 226 KTTNEQGDEMIDFSDSSKFVETVKEIEEVINAKCAMEIELASGRNDTIKW-VAGMDNIFI 284

Query: 406 GCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQ 465
           G + + W+    +          +S  G     P ++ D + F+   G R++ ++  ++ 
Sbjct: 285 GTEANEWMCPFDID-PTKQSASMLSSYGSLPIQPQTLHDGIFFLQR-GNRLREMT-RSQN 341

Query: 466 GFRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFA 525
           G   N+++  ADH+    I QL   + P  +++ +L         L    +     G   
Sbjct: 342 GSISNDLSFTADHILFAGIRQLATLKNPDPMIFCLLNDG-----TLAVLCYDKNY-GMQG 395

Query: 526 WHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS 560
           W       +   L+    P ++  G  ++  V   
Sbjct: 396 WSRWSTQGEFMCLA----PYEDEDGQKMFAHVRRG 426



 Score =  101 bits (251), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 59/375 (15%), Positives = 122/375 (32%), Gaps = 39/375 (10%)

Query: 9   HSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNR 68
           +++  GE+SP+L   R DL ++ QG    ++   +  G +   P ++            R
Sbjct: 6   NNWMYGEISPKL-GGRLDLEMNTQGCEILKDFRNMLQGGITRRPPLKHVAQTV----RGR 60

Query: 69  VFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGK---TYKTPYTFKDNKSLEYAV 125
              F++  G   L+   +KKL++        ++            T Y   D  S++YA 
Sbjct: 61  TIPFTLSSGESFLVELSNKKLRVWRKGVLGFYTVTFLPSGNDYLPTDYLEADVWSIQYAQ 120

Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185
           +      VHKD+ PH ++Y  +            F   P+  +         +       
Sbjct: 121 YYDRLYLVHKDYQPHVVVYAAEA-----------FQFSPFTAETDAGKQLGKSTGYYPSV 169

Query: 186 DT-STARITSDMKIFKPLD--KGRSIRLGCHPPEWAKNTNY-SIGAYIVADDKVYRSLTT 241
                 R+     I KP      R    G +        +   +   ++ D   +   T 
Sbjct: 170 VGICQNRLWFSAAILKPYTTWVSRPPYDGSNNHHDFTTFDVIEVNTEVIKDPSTWPKTTN 229

Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301
            +  +   +S  + +V+        +    +     ++       +V G          +
Sbjct: 230 EQGDEMIDFSDSSKFVETVKEIEEVINAKCAMEIELASGRNDTIKWVAGMDNIFIGTEAN 289

Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSH---VTFHNNRLLFSGSKGDELSVYLS 358
             + P      +   S+    +S++G     P       F   R    G++  E++   S
Sbjct: 290 EWMCPFDIDPTKQSASM----LSSYGSLPIQPQTLHDGIFFLQR----GNRLREMT--RS 339

Query: 359 SFGAFYD---FSLDG 370
             G+  +   F+ D 
Sbjct: 340 QNGSISNDLSFTADH 354


>gi|50282960|ref|YP_053016.1| hypothetical protein VP5_gp14 [Vibrio phage VP5]
          Length = 594

 Score =  101 bits (252), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 64/346 (18%), Positives = 116/346 (33%), Gaps = 42/346 (12%)

Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306
             G +  A +V D       V+  +    R +       Y   GD    +  GR I V P
Sbjct: 80  EVGNANIAVWVNDV----RQVVAATPSEWRNTLDRIQTAYDTIGDDLGAANTGRLIMVHP 135

Query: 307 QSQTLFQAGVSVVSWFM---------SAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYL 357
             Q       +  +W           + W     YP  V    NR+ + GS       + 
Sbjct: 136 ALQPKRLYRDNNNAWKFVNMHTGAVPAEWSSSN-YPQTVGIFQNRVWYVGSPVHRTYFWA 194

Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417
           +  G   D +        DP   +             W+    + + +G   + + L+ S
Sbjct: 195 TRAGKLEDIAPSTANNPNDPISFVGIMEGTPC-----WIIASSDVLTIGTTINDYQLAAS 249

Query: 418 LS---KGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTE-QGFRFNEIT 473
                   +   RR S  G  A   +   + ++F      ++  ++   E   +  +E++
Sbjct: 250 TGVSVTAATAILRRSSVQGTAAVQGIPAEEQVIFCSRNKSKVYAMNYVREQDNWIPDEMS 309

Query: 474 QLADHLFN-------QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
             A HLF          + ++ Y  +    +WVVLE       ++  C F    +   AW
Sbjct: 310 SQAQHLFTPISSARGASVRRVAYISDAAKSLWVVLE-----NGKINYCCFDRTTDTK-AW 363

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVALS---AGEERSFTV 569
               +S    +  AA+F  D       ++ V  S    G ++++TV
Sbjct: 364 TQLELSGGKVIDIAAAFNPD---SDYAYVAVVRSKVVNGAQKNYTV 406



 Score = 56.4 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 50/329 (15%), Positives = 93/329 (28%), Gaps = 30/329 (9%)

Query: 4   TTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLD 63
             +++ SF  G ++PRL  +  + + +   +  + N +    G L++    +E   C+  
Sbjct: 2   ADFSQTSFKGGVIAPRLQFNEYESA-YHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDG 60

Query: 64  PRSNRVFSFSIPDGGYALLVFGDKKLQIVVVR-----SSTKWSPALFGKTYKTPYTFKDN 118
                            ++  G+  + + V       ++T           +T Y     
Sbjct: 61  EVRLFRLPAIDAPSNDIIVEVGNANIAVWVNDVRQVVAATPSEWRNTLDRIQTAY-DTIG 119

Query: 119 KSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS-- 176
             L  A  G   + VH    P  L    + +   F       +P  W        V    
Sbjct: 120 DDLGAANTG-RLIMVHPALQPKRLYR-DNNNAWKFVNMHTGAVPAEWSSSNYPQTVGIFQ 177

Query: 177 -------NAKLSISQADTSTARI--TSDMKIFKPLDKGRSIRLGCHPPEWAKNTN----- 222
                  +         T   ++   +      P D    + +    P W   ++     
Sbjct: 178 NRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIMEGTPCWIIASSDVLTI 237

Query: 223 -YSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASG 281
             +I  Y +A      S+T   +  R    +G   V         V+  S   S+  A  
Sbjct: 238 GTTINDYQLAAS-TGVSVTAATAILRRSSVQGTAAV-QGIPAEEQVIFCSRNKSKVYAMN 295

Query: 282 AVAPYYVWGDIKDVSKDGRSISVAPQSQT 310
            V     W  I D           P S  
Sbjct: 296 YVREQDNW--IPDEMSSQAQHLFTPISSA 322


>gi|291334273|gb|ADD93936.1| hypothetical protein [uncultured marine bacterium
           MedDCM-OCT-S08-C235]
          Length = 229

 Score =  101 bits (250), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 34/206 (16%), Positives = 66/206 (32%), Gaps = 14/206 (6%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
            K +F +GEL P L+  R D + +A G  K +N+     G        + Y      P +
Sbjct: 11  LKTTFQSGELDP-LMNLRSDTTAYANGAKKMQNVSLFSQGGFKRRNGTKRYASL---PGN 66

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKT-PYTFKDNKSLEYAV 125
            R+  F   D    +  F + ++ I  + +          +T  + P+T      +++  
Sbjct: 67  ARLVGFDFDDNEQYICAFSNNRVDIYYLSND------SLTQTITSCPWTTSILFEMQFTQ 120

Query: 126 FGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185
            G T +  H       +         +F+     F                +     +  
Sbjct: 121 AGDTMIITHPSMATQVITRTS---LTAFSRSNYTFDSDSENVYQPYYKFAGSGVTLSASG 177

Query: 186 DTSTARITSDMKIFKPLDKGRSIRLG 211
            T +  ITS    F        +++ 
Sbjct: 178 TTGSVTITSSADHFSSDYVNVYLKIE 203


>gi|291334458|gb|ADD94112.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161]
 gi|291334665|gb|ADD94312.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695]
 gi|291336445|gb|ADD96000.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073]
          Length = 121

 Score = 96.5 bits (238), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 7/111 (6%)

Query: 81  LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140
           +L FG++ ++          S + +     +PY   +   ++YA         H +HP  
Sbjct: 1   MLEFGNQYIRFYKDNGQILSSGSAY--EISSPYLEAELFDIKYAQSADVMYLCHPNHPVK 58

Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191
            L         S+T   + F   P++    I      A  + +   T T  
Sbjct: 59  KLARTGH---TSWTLTSVDFQNGPFMDHN-IETTTITASHT-NAGQTGTLT 104


>gi|291334515|gb|ADD94168.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201]
          Length = 99

 Score = 93.8 bits (231), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 18/105 (17%), Positives = 36/105 (34%), Gaps = 6/105 (5%)

Query: 81  LLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH 140
           +L FG++ ++          S + +     +PY   +   ++YA         H +HP  
Sbjct: 1   MLEFGNQYIRFYKDNGQILSSGSAY--EISSPYLEAELFDIKYAQSADVMYLCHPNHPVK 58

Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA 185
            L         S+T   + F   P++    I      A  +  + 
Sbjct: 59  KLARTGH---TSWTLTSVDFQNGPFMDHN-IETTTITASHTYCRW 99


>gi|13186158|emb|CAC33469.1| hypothetical protein [Legionella pneumophila]
          Length = 818

 Score = 93.8 bits (231), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 84/541 (15%), Positives = 162/541 (29%), Gaps = 91/541 (16%)

Query: 7   TKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRS 66
             ++F+ GEL P L  +R DL ++ +G  K RN+I L  G     P              
Sbjct: 6   ISNTFNRGELDPTLF-ARDDLDIYDKGARKLRNMIALWTGAARIAPGTIYVD-------- 56

Query: 67  NRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVF 126
                                 + +     +      L  K +   Y       + Y + 
Sbjct: 57  ----------------------MMVDRENGNAVIQDPLMVKGFDFTYDAD--AEITYTII 92

Query: 127 GSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQAD 186
                              + G  I+F           +  D + + V S A L+    D
Sbjct: 93  I-----------------RKSGTNIAFDI---------YYADALQTTVTSTAYLATQIQD 126

Query: 187 TSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD 246
              A     + I     + R ++ G     W+  T      +       Y     G + +
Sbjct: 127 IHVAAAHDRVLILHENVQIRQLKRGASHSSWSLTT------FEPRVYPTYDFSVIGEATN 180

Query: 247 RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAP 306
              +    T+        IT+ + S+  +     G          I  V+    + +   
Sbjct: 181 YQSF----TFTLSATTGSITITSSSAVFTHNHVGGLFRSLGGTARITAVASTTSASATVL 236

Query: 307 QSQTLFQAGVSVVSWFMSAWGEQE---------GYPSHVTFHNNRLLFSGSKGDELSVYL 357
            + T      ++ S     W             G+P+   F+ NRL+   S   +  V L
Sbjct: 237 DNFTGTSCAGNLSSLAEKLWNSDTTTAPVSANRGWPARGVFYLNRLILGRSLAVKNLVNL 296

Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS 417
           S+ G + +F    +    D   A +         ++  +    + +L      L+  S  
Sbjct: 297 STAGVYDNF----DDADLDGLVAFSVTFNGKGEQSVQSIVA-DDSILFTTANKLFAQSPL 351

Query: 418 LSKGLSID---FRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGSTEQG-FRFNEIT 473
           +   ++I+   F   S S   +    S+ +  +FV     ++     ST  G +     T
Sbjct: 352 VESPITINNVYFAPQSQSPATSIEAASIDNQTLFVSSDRTKVMQAMYSTADGKYITLPAT 411

Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISD 533
            L++ + +       +  EP  I   +     ++   LL      + +    W     + 
Sbjct: 412 MLSNSIVDYINSNGTW--EPAGISTRLYLATQDNGTMLLYSTL--QTQNVAGWSLRTTTG 467

Query: 534 K 534
           K
Sbjct: 468 K 468


>gi|158425207|ref|YP_001526499.1| tail tubular protein B [Azorhizobium caulinodans ORS 571]
 gi|158332096|dbj|BAF89581.1| tail tubular protein B [Azorhizobium caulinodans ORS 571]
          Length = 785

 Score = 78.4 bits (191), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 81/565 (14%), Positives = 162/565 (28%), Gaps = 75/565 (13%)

Query: 47  PLVSMPLMQEYRD-CRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALF 105
            L   P  +         P +  V   +       ++V  +  L++       +      
Sbjct: 41  GLTKRPPTRHVAKLINSLPENAHVHIINRDAAERYVVVAFNGDLRVYGFDGVERTVNFPH 100

Query: 106 GKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPW 165
           GK     Y    + S           F++KD              ++           P 
Sbjct: 101 GK----GYLANTSASFGAVTVADYTFFLNKD--------------VTVAMSPETKAGRPP 142

Query: 166 LGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIR-LGCHPPEWAKNTNYS 224
            G   +       K  I     + A   +       +   +  + L      W       
Sbjct: 143 EGIVFVRQGNYACKYRIIVDGQAVAEKITSQTDPNDIQSSKIAQDLAAIINSWGSMVASV 202

Query: 225 IGA---YIVADDKVYRSLTT---GRSGDRFGYSKGATY------------VKDNNITWIT 266
           IG+      AD   +   T    G +G      +  T+            V+ +      
Sbjct: 203 IGSTIHIRRADSLGFSLTTEDSLGDTGLVCMTKQTQTFANLPARAVQGYQVEISGTPGNP 262

Query: 267 VLNLSSKTSRESASGAVAPYY-VWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325
             N   +  +  + G    +  +    + ++ D  ++      +           W   A
Sbjct: 263 YDNFWVEYDQAGSGGNNGVWREIAAPGRQIAFDPATMPHVLVREANGSFTFKQADWEKCA 322

Query: 326 WGEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD 376
            G  E  P         S + F+ NRL F        SV  S    F++F  +      D
Sbjct: 323 AGSDETTPRPSFVGQRISDIFFYRNRLGFISD----ESVIFSRSAKFFNFWRETATDLLD 378

Query: 377 PTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKGLSIDFRRVSGS-GV 434
               +    +    S +    PF E +L+  D + ++L +  +     +   +V+     
Sbjct: 379 TDP-IDITTSHVKVSILRHAIPFNESLLLFSDQTQFMLGAGEVLTPSGVSLDQVTEFETS 437

Query: 435 YACPPVSVGDCLVFVCGVGR--RIKYISGSTEQGFRFNEITQLADHLFNQRILQLVY--Q 490
               PV  G  + F    G    ++      +   + N    + +H+    I   V+   
Sbjct: 438 SRAKPVGAGQFVYFCTSRGEFTGVRE--YYIDGSTKTNNANDVTNHVPRY-IRGKVFKLC 494

Query: 491 EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF--AWHTHMISDKHYVLSAASFPNDNR 548
              +  + V L   D     L   ++   G+     +W    +     +L+A        
Sbjct: 495 ASTNEDMLVALSDTDRD--TLYVYKYYNSGQEKVQSSWSRWKLQPGDVILNAEFI----- 547

Query: 549 GGTSLWMLVALSAGEERSFTVRLNL 573
             ++LW++V  + G    +  RLN+
Sbjct: 548 -ESTLWLIVRRADGV---YLDRLNI 568


>gi|46581000|ref|YP_011808.1| hypothetical protein DVU2596 [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|46450421|gb|AAS97068.1| hypothetical protein DVU_2596 [Desulfovibrio vulgaris str.
           Hildenborough]
          Length = 259

 Score = 75.7 bits (184), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 27/77 (35%), Gaps = 11/77 (14%)

Query: 497 VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWML 556
           +W V E        L+      E E    WH H+      VLS  + P     G  LW+ 
Sbjct: 1   MWCVTEDGG-----LIAMTRIPEHE-VAGWHRHVTDGA--VLSVCTIPG--TAGDELWVA 50

Query: 557 VAL-SAGEERSFTVRLN 572
           V     G  R    RL+
Sbjct: 51  VRREGGGMVRCCIERLD 67


>gi|297171931|gb|ADI22918.1| hypothetical protein [uncultured Rhizobium sp. HF0500_35F13]
          Length = 336

 Score = 74.1 bits (180), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 22/95 (23%), Positives = 39/95 (41%), Gaps = 13/95 (13%)

Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDK-----HYVLSAA 541
           + YQEEP SI++ V E  +     L+   +  + +   AWH H+             S A
Sbjct: 1   MAYQEEPLSIIYAVREDGE-----LVALTYQRD-QQVVAWHRHIFGGAFGTGNAVCESIA 54

Query: 542 SFPNDNRGGTSLWMLVALS-AGEERSFTVRLNLLD 575
             P D      +++++  +  G  + +   LN  D
Sbjct: 55  VIPTDLD-EYEVYVIIKRTINGATKRYVEVLNTFD 88


>gi|317487276|ref|ZP_07946071.1| hypothetical protein HMPREF0179_03434 [Bilophila wadsworthia 3_1_6]
 gi|316921466|gb|EFV42757.1| hypothetical protein HMPREF0179_03434 [Bilophila wadsworthia 3_1_6]
          Length = 794

 Score = 73.7 bits (179), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 78/566 (13%), Positives = 165/566 (29%), Gaps = 59/566 (10%)

Query: 48  LVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGY--ALLVFGDKKLQIVVVRSSTKWSPALF 105
           L   P  +     R  P +N + S  I        ++      + +  +  + K      
Sbjct: 43  LKRRPATRHLARIRDTPAANGIASHHINRDETEQYIVTADASGINVFDLEGNAKTVSVTG 102

Query: 106 GKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPW 165
                       N+ L +         +++      L        +S        +    
Sbjct: 103 TGAAYLAAATAPNRDLRFLTINDYTFVLNRRVAVKTL------PDLSPKRQPEAIVFIKQ 156

Query: 166 LGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSI 225
                   +  N        +   A           LD  ++I        ++  T+ S 
Sbjct: 157 ASYNTTYELILNGTTHAFTTEDGIAPADEPADKLSSLDICKAIADQIPKDAFSVQTSNST 216

Query: 226 GAYIVADD------------KVYRSLTTGRSGDRFGYSKG-------ATYVKDNNITWIT 266
                 D               + S+  G+   RF               + D + ++  
Sbjct: 217 IWIRRHDGGDFTVKVQDSRSNTHTSVCKGKV-QRFSDLPTVAPRGFVTEIIGDASSSFDN 275

Query: 267 VLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326
              +   +    A G+               D  ++  A   Q         + W     
Sbjct: 276 YFCVFEPSDAGDAFGSGTWKETVKPGIPCKLDPATLPHALIRQADGTFTFGPLEWGERIC 335

Query: 327 GEQEG--YPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377
           G+++   +PS V        F+ NRL F   +     V +S  G F++F L       D 
Sbjct: 336 GDEDSAPFPSFVGRTLNGLFFYRNRLSFLSGEN----VVMSEVGEFFNFFLTTVTTLVDS 391

Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGS-GVY 435
              +  A +   +S +H    F  G+L+  D S ++L         ++  + V+      
Sbjct: 392 D-VVDVAASHTKSSILHHAVTFSGGLLLFSDQSQFVLEHDTVLSNATVSIKPVTEFEASM 450

Query: 436 ACPPVSVGDCLVFVCGVG--RRIKYISGSTEQGFRFNEITQLADHLFNQRILQLV-YQEE 492
              PVS G  + F    G    ++    +       N+ + +  H+       +   +  
Sbjct: 451 KAAPVSSGKTVFFATDKGEWGGVRE-YITLPDNSDQNDASDITAHVPRYVRGNVSRLECS 509

Query: 493 PHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTS 552
            +  + +VL  +  +   L    ++   +   AW    +     VLSAA         T 
Sbjct: 510 TNEDMLLVLSEEMRTSLWLYKYFWNGSEKIQSAWSRWDMCG--EVLSAAIL------NTG 561

Query: 553 LWMLVALSAGEERSFTVRLNLLDDFK 578
           +++++    G    +  ++++   +K
Sbjct: 562 VYLIMQYGDGV---YLEKMDITPGYK 584


>gi|320158424|ref|YP_004190802.1| tail tubular protein B [Vibrio vulnificus MO6-24/O]
 gi|319933736|gb|ADV88599.1| tail tubular protein B [Vibrio vulnificus MO6-24/O]
          Length = 931

 Score = 73.4 bits (178), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 58/361 (16%), Positives = 118/361 (32%), Gaps = 28/361 (7%)

Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVA 284
           +G     +D  Y       S     +++   Y   N     ++ ++  +    S      
Sbjct: 423 VGKADSENDGYYVKWVDKTS----MWTESTAYGLANEFNPASMPHILRRHQDSSKVSVDN 478

Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344
           PY ++  ++      R++     +           S  M+    QE Y S + F   RL 
Sbjct: 479 PYGIYFKLEQGVWSKRTVGDELSAPIPSFVSTQDESGAMT----QERYISAMAFFRGRLW 534

Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL 404
             G          S  G  ++F         D             A TIH   P  +G++
Sbjct: 535 LLGGD----YACGSVVGDKFNFFRSTALTVLDDDPIDGYTDLTGQAETIHAAIPSSDGLV 590

Query: 405 VGCDTSLWLL-SISLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGVGR--RIKYIS 460
           V  +   +L+ S  +    + +F R++       C PV +GD + F         +  + 
Sbjct: 591 VFTERGQYLISSQGMMSPTTFEFTRIASYATDNRCDPVLIGDRISFATKTSEYTSVSEMY 650

Query: 461 GSTEQG-FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAE 519
            +   G  + NE+T          + +L+     ++   ++    +    R+    F   
Sbjct: 651 VADTTGVRKANEVTSHCPTYIEGSVHRLLANATSNTEFLIMRGQGETLTGRMFIYDFLMN 710

Query: 520 GEGDF--AWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL--SAGEERSFTVRLNLLD 575
           G      AW     +    V    +        + L++++    S  ++R    R++L+ 
Sbjct: 711 GNERVQSAWSQWTFNGAVVVDGVLT-------SSELYLVMVRATSDKDKRMTVERIDLVQ 763

Query: 576 D 576
           D
Sbjct: 764 D 764


>gi|26989008|ref|NP_744433.1| tail tubular protein B [Pseudomonas putida KT2440]
 gi|24983829|gb|AAN67897.1|AE016421_9 tail tubular protein B [Pseudomonas putida KT2440]
          Length = 781

 Score = 73.0 bits (177), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 73/579 (12%), Positives = 152/579 (26%), Gaps = 76/579 (13%)

Query: 38  RNLIPLRYGPLVSMPLMQEYRDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRS 96
            N I      L+  P           P  S  V + +        +   +  L++  V  
Sbjct: 32  ENGISTVSEGLMKRPPTTHLARVTASPLESAFVHTINRDASERYQVAITNGGLRVFAVDG 91

Query: 97  STKWSPALFGKTYKTPYTFKDNKSLEYA--VFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154
           +             T Y    + + ++           V+K                   
Sbjct: 92  T----ERTVSFPDGTGYLAASDPASDFTAITVADYTFIVNKA-------------ITVAN 134

Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214
              +     P     +I G        I    T     T D           +  +    
Sbjct: 135 RAAVSAPRGPEALISVIQGNYGRTYGVILNGVTVATYATPDGSDATKTSLASTDYIATEL 194

Query: 215 PEWAKNTNYSIGAYIVADDKVYRSLTTGRSGD-----------------RFGYSKGATYV 257
               ++  ++    + A   +Y + T   + D                 +   +  +   
Sbjct: 195 VAGIQSAGFT---CVRAGSCLYITSTADFTIDCYDGFNNNAMKAYKKVVQSFSTLPSNCT 251

Query: 258 KDNNITWITVLNLSSKTSRESASGAVAP--YYVW----GDIKDVSKDGRSISVAPQSQTL 311
           +     +    +    +        V      VW    G    +  DG ++         
Sbjct: 252 QAGGCLFEITGDPGDSSDDYYVYYDVGTDSTGVWRECVGPGVALGLDGSTMPHTLVRNAD 311

Query: 312 FQAGVSVVSWFMSAWGE--QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGA 362
                   +W     G+      PS V        F+ NRL F        +V  S  G 
Sbjct: 312 GTFTFQAATWTDRVAGDADTNEDPSFVGRTINDVVFYRNRLGFLAD----EAVIFSESGK 367

Query: 363 FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKG 421
           +++F         D    +  + T    + +     F + +L+  D   +L+ +      
Sbjct: 368 YWNFYRTTVTELLDSDP-IDVSSTYTKVAILKHAVSFNKQLLLFSDEVQFLIDNGDTLTP 426

Query: 422 LSIDFRRVSGSGVYAC-PPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLF 480
            +I  +  +     A   P SVG  + F              T+     N+ T +A H+ 
Sbjct: 427 KTISIKPSTEFVCNALTTPQSVGKNVYFASDRENWTAIREYFTDTNDVSNDSTDVASHVP 486

Query: 481 N---QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYV 537
                 + ++         +  VL   D     +    +  + +   +W      D   +
Sbjct: 487 QYIPSGVFKIASSSSED--MLCVLTTGDRHSIYVYKFYWDGDTKVQSSWSKWTFPDTDTI 544

Query: 538 LSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576
           LSA          + +++ +  + G    +  +L +  D
Sbjct: 545 LSAEFL------DSEVFLAINRADG---LYFEKLTVATD 574


>gi|325272824|ref|ZP_08139161.1| tail tubular protein B [Pseudomonas sp. TJI-51]
 gi|324102029|gb|EGB99538.1| tail tubular protein B [Pseudomonas sp. TJI-51]
          Length = 781

 Score = 70.7 bits (171), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 80/575 (13%), Positives = 154/575 (26%), Gaps = 68/575 (11%)

Query: 38  RNLIPLRYGPLVSMPLMQEYRDCRLDP-RSNRVFSFSIPDGGYALLVFGDKKLQIVVVRS 96
            N I      L+  P           P  S  V + +        +   +  L++  V  
Sbjct: 32  ENGISTVSEGLMKRPPTTHLARVTASPLESAFVHTINRDSTERYQVAITNGGLRVFAVDG 91

Query: 97  STKWSPALFGKTYKTPYTFKDNKSLEYA--VFGSTAVFVHKD-------------HPPHH 141
           S             T Y    + + ++           V+K               P   
Sbjct: 92  S----ERTVSFPDGTSYLAASDPASDFTAITVADYTFIVNKAITVANRAAVSGTRGPEAL 147

Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201
           +  IQ     ++           +      S     A  S     T           F  
Sbjct: 148 ISVIQGNYGRTYGVILNGVTVATYATP-DGSDATKTALASTDYIATELVAGIQSA-GFTC 205

Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRS------GDRFGYSKGAT 255
           +  G  + +           +      + A  KV +S +T  S      G  F  +    
Sbjct: 206 VRAGSCLYITSTADFTIDCYDGFNNNAMKAYKKVVQSFSTLPSNCTQAGGCLFEITGDPG 265

Query: 256 YVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAG 315
              D+   +  V   S+   RE            G    +  DG ++             
Sbjct: 266 DSSDDYYVYYDVGTDSTGVWRECV----------GPGVALGLDGSTMPHTLVRNADGTFT 315

Query: 316 VSVVSWFMSAWGE--QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDF 366
               +W     G+      PS V        F+ NRL F        +V  S  G +++F
Sbjct: 316 FQAATWTDRVAGDADTNEDPSFVGRTINDVVFYRNRLGFLAD----EAVIFSESGKYWNF 371

Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKGLSID 425
                    D    +  + T    + +     F + +L+  D   +L+ +       +I 
Sbjct: 372 YRTTVTELLDSDP-IDVSSTYTKVAILKHAVSFNKQLLLFSDEVQFLIDNGDTLTPKTIS 430

Query: 426 FRRVSGSGVYAC-PPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFN--- 481
            +  +     A   P SVG  + F              T+     N+ T +A H+     
Sbjct: 431 IKPSTEFVCNALTTPQSVGKNVYFASDRENWTAIREYFTDTNDVSNDSTDVASHVPQYIP 490

Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
             + ++         +  VL   D     +    +  + +   +W      D        
Sbjct: 491 SGVFKIASSSSED--MLCVLTTGDRHSIYVYKFYWDGDTKVQSSWSKWTFPDTD------ 542

Query: 542 SFPNDNRGGTSLWMLVALSAGEERSFTVRLNLLDD 576
           +  N     + +++ +  + G    +  +L +  D
Sbjct: 543 TILNAEFLDSEVFLAINRADG---LYFEKLTVATD 574


>gi|9627472|ref|NP_042000.1| tail tubular protein B [Enterobacteria phage T7]
 gi|139659|sp|P03747|VTTB_BPT7 RecName: Full=Tail tubular protein B
 gi|15606|emb|CAA24430.1| unnamed protein product [Enterobacteria phage T7]
 gi|37956682|gb|AAP33952.1| gene 12 [Enterobacteria phage T7]
          Length = 794

 Score = 68.0 bits (164), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 74/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   + +  +   G      +  + D+  +    ++  N        L   P +      
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54

Query: 61  RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115
             D  +     +           Y   VF    +++  +  + K      G  Y    T 
Sbjct: 55  G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110

Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
                L           V+++          +    +   D +  +     G  +I  + 
Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170

Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAYI 229
                     D S    + +    +   +  + +R   +  +W  N      + +  +  
Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVTAPSGQ 228

Query: 230 VADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
             D    +     +  +   +   S         N   + ++  +SK++ +      A  
Sbjct: 229 QIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288

Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336
            VW +    + + + +    P +      G     W    W        +   +PS V  
Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346

Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
                 F  NRL F   +     + LS    +++F         D    +  AV+    +
Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449
            + +  PF E +L+  D + ++L+ S +    S++    +   V     P  +G  + F 
Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461

Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500
                  +    S  + +   +++ +  A+ + +         +  +      +     V
Sbjct: 462 SP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514

Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           L   D S   +    +  E     +W      +   VL+  S  +D
Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560


>gi|265525004|gb|ACY75867.1| tail tubular protein B [Enterobacteria phage T7]
          Length = 794

 Score = 68.0 bits (164), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 74/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   + +  +   G      +  + D+  +    ++  N        L   P +      
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54

Query: 61  RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115
             D  +     +           Y   VF    +++  +  + K      G  Y    T 
Sbjct: 55  G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110

Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
                L           V+++          +    +   D +  +     G  +I  + 
Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170

Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAYI 229
                     D S    + +    +   +  + +R   +  +W  N      + +  +  
Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVTAPSGQ 228

Query: 230 VADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
             D    +     +  +   +   S         N   + ++  +SK++ +      A  
Sbjct: 229 QIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288

Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336
            VW +    + + + +    P +      G     W    W        +   +PS V  
Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346

Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
                 F  NRL F   +     + LS    +++F         D    +  AV+    +
Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449
            + +  PF E +L+  D + ++L+ S +    S++    +   V     P  +G  + F 
Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461

Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500
                  +    S  + +   +++ +  A+ + +         +  +      +     V
Sbjct: 462 SP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514

Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           L   D S   +    +  E     +W      +   VL+  S  +D
Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560


>gi|37956840|gb|AAP34107.1| gene 12 [Enterobacteria phage T7]
          Length = 794

 Score = 67.6 bits (163), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 76/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   + +  +   G      +  + D+  +    ++  N        L   P +      
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54

Query: 61  RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115
             D  +     +           Y   VF    +++  +  + K      G  Y    T 
Sbjct: 55  G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110

Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
                L           V+++          +    +   D +  +     G  +I  + 
Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNSNQDGLINVRGGQYGRELIVHIN 170

Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS-IGAYIVADD 233
                     D S    + +    +   +  + +R   +  +W  N     I     +  
Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVIAPSGQ 228

Query: 234 KVYRSLTTGRSGDR-------FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
           ++    T     D+       +  S         N   + ++  +SK++ +      A  
Sbjct: 229 QIDSFTTKDGYADQLINSVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288

Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336
            VW +    + + + +    P +      G     W    W        +   +PS V  
Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346

Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
                 F  NRL F   +     + LS    +++F         D    +  AV+    +
Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449
            + +  PF E +L+  D + ++L+ S +    S++    +   V     P  +G  + F 
Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461

Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500
                  +    S  + +   +++ +  A+ + +         +  +      +     V
Sbjct: 462 SS-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514

Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           L   D S   +    +  E     +W      +   VL+  S  +D
Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560


>gi|37956893|gb|AAP34159.1| gene 12 [Enterobacteria phage T7]
          Length = 794

 Score = 66.4 bits (160), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 76/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   + +  +   G      +  + D+  +    ++  N        L   P +      
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54

Query: 61  RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115
             D  +     +           Y   VF    +++  +  + K      G  Y    T 
Sbjct: 55  G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110

Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
                L           V+++          +    +   D +  +     G  +I  + 
Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNSNQDGLINVRGGQYGRELIVHIN 170

Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS-IGAYIVADD 233
                     D S    + +    +   +  + +R   +  +W  N     I     +  
Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVIAPSGQ 228

Query: 234 KVYRSLTTGRSGDR-------FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
           ++    T     D+       +  S         N   + ++  +SK++ +      A  
Sbjct: 229 QIDSFTTKDGYADQLINSVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288

Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336
            VW +    + + + +    P +      G     W    W        +   +PS V  
Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346

Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
                 F  NRL F   +     + LS    +++F         D    +  AV+    +
Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449
            + +  PF E +L+  D + ++L+ S +    S++    +   V     P  +G  + F 
Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461

Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500
                  +    S  + +   +++ +  A+ + +         +  +      +     V
Sbjct: 462 SS-----RPSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514

Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           L   D S   +    +  E     +W      +   VL+  S  +D
Sbjct: 515 LSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560


>gi|194100399|ref|YP_002003974.1| gp12 [Enterobacteria phage 13a]
 gi|193201446|gb|ACF15923.1| gp12 [Enterobacteria phage 13a]
          Length = 794

 Score = 66.0 bits (159), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 73/587 (12%), Positives = 169/587 (28%), Gaps = 68/587 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   + +  +   G      +  + D+  +    ++  N        L   P +   +  
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTL 54

Query: 61  RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115
                +     +           Y   VF    +++  +  + K      G  Y    T 
Sbjct: 55  GY-NGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLAGNEKQVRYPNGSNYIN--TA 110

Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
                L           V+++          +    +   D +  +     G  +I  + 
Sbjct: 111 NPRNDLRMVTVADYTFIVNRNVVAQKNTNSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170

Query: 176 SN--AKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAY 228
               AK  I           +D +           ++  +  +W  N      + +  + 
Sbjct: 171 GKDVAKYKIPDGSKPEHVNNTDAQWLAEELAN---QMRTNLSDWTVNVGQGFIHVTAPSG 227

Query: 229 IVADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAP 285
              D    +     +  +   +   S         N   + ++  +SK++ +        
Sbjct: 228 QQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDDE 287

Query: 286 YYVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV- 336
             VW +    + + + +    P +      G     W    W        +   +PS V 
Sbjct: 288 RKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVG 345

Query: 337 ------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSA 390
                  F  NRL F   +     + LS    +++F         D    +  AV+    
Sbjct: 346 SSINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRI 400

Query: 391 STIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVF 448
           + + +  PF E +L+  D + ++L+ S +    S++    +   V     P  +G  + F
Sbjct: 401 AILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYF 460

Query: 449 VCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWV 499
                   +    S  + +   +++ +  A+ + +         +  +      +     
Sbjct: 461 ASP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--S 513

Query: 500 VLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           VL   D S   +    +  E     +W      +   VL+  S  +D
Sbjct: 514 VLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSINSD 560


>gi|37956735|gb|AAP34004.1| gene 12 [Enterobacteria phage T7]
 gi|37956785|gb|AAP34053.1| gene 12 [Enterobacteria phage T7]
          Length = 794

 Score = 65.3 bits (157), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 73/586 (12%), Positives = 171/586 (29%), Gaps = 66/586 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   + +  +   G      +  + D+  +    ++  N        L   P +      
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTL 54

Query: 61  RLDPRSNRVFSFSI-----PDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTF 115
             D  +     +           Y   VF    +++  +  + K      G  Y    T 
Sbjct: 55  G-DNGALGQAPYIHLINRDEHEQYYA-VFTGSGIRVFDLSGNEKQVRYPNGSNYIK--TA 110

Query: 116 KDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVK 175
                L           V+++          +    +   D +  +     G  +I  + 
Sbjct: 111 NPRNDLRMVTVADYTFIVNRNIVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHIN 170

Query: 176 SNAKLSISQADTSTAR-ITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-----NYSIGAYI 229
                     D S    + +    +   +  + +R   +  +W  N      + +  +  
Sbjct: 171 GKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRT--NLSDWTVNVGQGFIHVTAPSGQ 228

Query: 230 VADDKVYRSLTTGRSGDRFGY---SKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
             D    +     +  +   +   S         N   + ++  +SK++ +      A  
Sbjct: 229 QIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER 288

Query: 287 YVWGDIKDVSKDGRSI-SVAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-- 336
            VW +    + + + +    P +      G     W    W        +   +PS V  
Sbjct: 289 KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWL--EWSPKSCGDVDTNPWPSFVGS 346

Query: 337 -----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAS 391
                 F  NRL F   +     + LS    +++F         D    +  AV+    +
Sbjct: 347 SINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPASIANLSDDDP-IDVAVSTNRIA 401

Query: 392 TIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGVYA-CPPVSVGDCLVFV 449
            + +  PF E +L+  D + ++L+ S +    S++    +   V     P  +G  + F 
Sbjct: 402 ILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFA 461

Query: 450 CGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ-------RILQLVYQEEPHSIVWVV 500
                  +    S  + +   +++ +  A+ + +         +  +      +     V
Sbjct: 462 SP-----RSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFC--SV 514

Query: 501 LEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           L   + S   +    +  E     +W      +   VL+  S  +D
Sbjct: 515 LSHGNPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSD 560


>gi|254505331|ref|ZP_05117479.1| hypothetical protein SADFL11_PLAS29 [Labrenzia alexandrii DFL-11]
 gi|222436175|gb|EEE42857.1| hypothetical protein SADFL11_PLAS29 [Labrenzia alexandrii DFL-11]
          Length = 683

 Score = 64.5 bits (155), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 55/326 (16%), Positives = 105/326 (32%), Gaps = 32/326 (9%)

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
           +   GD +  S GA+ + D  + +++      + ++           +   I        
Sbjct: 162 SATDGDVYRISNGASPLDDYYVKYVSADTEWVECAKPGEVIGFDAKTMPHQIVREEDGSF 221

Query: 301 SISVAPQSQTLFQAGVSVV--SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358
           S+S    S        SV   S+   A+ +       + F  NRL F   +      + S
Sbjct: 222 SVSRVEWSDRQVGDAESVKDPSFVGRAFKD-------IFFFKNRLGFVSDENT----FFS 270

Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW-LLSIS 417
               F++   D      D    +  A +    + + W+ PF   + +  D + + L S  
Sbjct: 271 QAADFFNLWPDQANVVGDSDP-VDIAASTTKVTILQWVVPFRRALFLSADLAQFELASSD 329

Query: 418 LSKGLSIDFRRVSGS-GVYACPPVSVGDCLVF-VCGVGRRIKYISGSTEQGFR--FNEIT 473
                S+     +       C P ++GD L F     G+ + Y     +        ++T
Sbjct: 330 FMTPTSVAVDLATSYEATNLCRPTTLGDELYFAAEKQGKTVIYEYFYDDDTLSNTAIDVT 389

Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF--AWHTHMI 531
           + A+      I   VY  E  +I   +L   D     +   R    G+     AW     
Sbjct: 390 KHAE----GYIPGRVYLMEGSAIANTLLCVADGDSASMYTYRVFWNGQEKIQSAWSRWTF 445

Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLV 557
            +  Y+              + ++LV
Sbjct: 446 DNS-YIDGVKVI------NDTAYVLV 464


>gi|254503713|ref|ZP_05115864.1| hypothetical protein SADFL11_3752 [Labrenzia alexandrii DFL-11]
 gi|222439784|gb|EEE46463.1| hypothetical protein SADFL11_3752 [Labrenzia alexandrii DFL-11]
          Length = 634

 Score = 64.1 bits (154), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 55/326 (16%), Positives = 105/326 (32%), Gaps = 32/326 (9%)

Query: 241 TGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGR 300
           +   GD +  S GA+ + D  + +++      + ++           +   I        
Sbjct: 113 SATDGDVYRISNGASPLDDYYVKYVSADTEWVECAKPGEVIGFDAKTMPHQIVREEDGSF 172

Query: 301 SISVAPQSQTLFQAGVSVV--SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLS 358
           S+S    S        SV   S+   A+ +       + F  NRL F   +      + S
Sbjct: 173 SVSRVEWSDRQVGDAESVKDPSFVGRAFKD-------IFFFKNRLGFVSDENT----FFS 221

Query: 359 SFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW-LLSIS 417
               F++   D      D    +  A +    + + W+ PF   + +  D + + L S  
Sbjct: 222 QAADFFNLWPDQANVVGDSDP-VDIAASTTKVTILQWVVPFRRALFLSADLAQFELASSD 280

Query: 418 LSKGLSIDFRRVSGS-GVYACPPVSVGDCLVF-VCGVGRRIKYISGSTEQGFR--FNEIT 473
                S+     +       C P ++GD L F     G+ + Y     +        ++T
Sbjct: 281 FMTPTSVAVDLATSYEATNLCRPTTLGDELYFAAEKQGKTVIYEYFYDDDTLSNTAIDVT 340

Query: 474 QLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDF--AWHTHMI 531
           + A+      I   VY  E  +I   +L   D     +   R    G+     AW     
Sbjct: 341 KHAE----GYIPGRVYLMEGSAIANTLLCVADGDSASMYTYRVFWNGQEKIQSAWSRWTF 396

Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLV 557
            +  Y+              + ++LV
Sbjct: 397 DNS-YIDGVKVI------NDTAYVLV 415


>gi|30387490|ref|NP_848299.1| tail protein [Yersinia pestis phage phiA1122]
 gi|30314127|gb|AAP20535.1| tail protein [Yersinia pestis phage phiA1122]
          Length = 794

 Score = 63.7 bits (153), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 47/307 (15%), Positives = 101/307 (32%), Gaps = 40/307 (13%)

Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVSWFMS 324
            ++  +SK++ +      A   VW +    + + + +    P +      G     W   
Sbjct: 268 KIVGDASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHALVRAADGNFDFKWL-- 325

Query: 325 AWG-------EQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDG 370
            W        +   +PS V        F  NRL F   +     + LS    +++F    
Sbjct: 326 EWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGEN----IILSRTAKYFNFYPAS 381

Query: 371 EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRV 429
                +    +  AV+    + + +  PF E +L+  D + ++L+ S +    S++    
Sbjct: 382 IANLSNDDP-IDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSVELNLT 440

Query: 430 SGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL------ADHLFN- 481
           +   V     P  +G  + F        +    S  + +   +++ +        H+ N 
Sbjct: 441 TQFDVQDRARPYGIGRNVYFASP-----RSSYTSIHRYYAVQDVSSVKNSEDITSHVPNY 495

Query: 482 --QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLS 539
               +  +      +     VL   D S   +    +  E     +W      +   VL+
Sbjct: 496 IPNGVFSICGSGTENFC--SVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLA 553

Query: 540 AASFPND 546
             S  +D
Sbjct: 554 CQSISSD 560


>gi|323512066|gb|ADX87527.1| tail tubular protein B [Vibrio phage ICP3_2009_B]
          Length = 794

 Score = 61.4 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59
           M   + +  +   G      +  + D+  ++ QG  +         G L   P     + 
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53

Query: 60  CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115
                   +       +  +     + F    +++  +     K   A  G +Y T  + 
Sbjct: 54  LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111

Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167
              K L           ++++          P  L        +     +        + 
Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171

Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227
             + +  ++     +  A         D      +++G ++  G     ++K+    I +
Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVLINS 231

Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287
             V D      L  G   D    ++   Y  +N I  I V    +    +      A   
Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288

Query: 288 VWGDIKDVSKDGRSIS-VAPQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336
           VW +    +          P        G        W   A G+     YPS +     
Sbjct: 289 VWTECPAPNIKADYNKDTMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348

Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394
              F  NRL F   +     V LS  G +++F  +      D    +  AV+    S + 
Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403

Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452
           +  PF E +++  D + ++LS        ++     +   V     P  +G  + FV   
Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462

Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503
               +    S  + +   ++TQ+      + H+       + ++      + +   +L  
Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516

Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
            +          +  E     +W          VL  
Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553


>gi|167565012|ref|ZP_02357928.1| tail tubular protein B [Burkholderia oklahomensis EO147]
          Length = 776

 Score = 61.0 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 81/582 (13%), Positives = 162/582 (27%), Gaps = 75/582 (12%)

Query: 39  NLIPLRY-GPLVSMPLMQEYRDCRLDPRSNR-VFSFSIPDGGYALLV----FGDKKLQIV 92
           N +P    G L          +    P  +   + F   DG   + +     G  +++ +
Sbjct: 36  NFLPSVDIGGLADRVGTTCIANLAAAPYKSEGTYMFRTTDGQRWMFIRRADAGYPEIRNM 95

Query: 93  VVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKIS 152
           V  +    +   F + Y           L++     T + ++ D     +       K  
Sbjct: 96  VNGALAAVTCGPFVQNY-----INSASRLKFLSMSDTTLVLNPDVATRFVAPSAGITKTR 150

Query: 153 FTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTST------------------ARITS 194
             +  I+ L   +    + S   S A +    A   T                    I+ 
Sbjct: 151 -AYAVIRKLSSNYQTFYLNSDAGSAATVYDGSAGVKTREWVAQRLMEQCIAHMPGLTISR 209

Query: 195 DMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGA 254
              + +       I       +W +     I   + A   +   +  G S      +   
Sbjct: 210 VANVVRISGPEAIINTLNGGNDWDETAFVLIKGRVSAASDLPAQMFPGESVMVDLENGAT 269

Query: 255 TYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQA 314
                  +T+    N   +T+          +        + + G +         + + 
Sbjct: 270 KSAYW--VTYDRTTNSYKETAWLDNFANAGNWDASTMPVRIHQTGVNSFEIQPVDWVPRK 327

Query: 315 GVSVVSWFMSAWGEQEGYPSH-VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG 373
                S   + +    G P   +     RL FS +      V  S     ++F  D    
Sbjct: 328 VGDNDSNAPAPFN---GAPITDMALWKGRLWFSSASW----VVGSQPDDLFNFWQDSARE 380

Query: 374 CYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGS 432
                     A  D    ++  +  F + ++V    +   L  S   K  +      +  
Sbjct: 381 VVASDPVKVQAEAD--LGSVSHLAGFRDNLMVFLRGAQCSLDGSQPVKPDTAALGVATRY 438

Query: 433 GV-YACPPVSVGDCLVFV--CGVGRRIKYISGSTEQGFRFNEITQLADHLFN------QR 483
            V  ACPP  VG+ +++         +       EQ    N    L+ H+        +R
Sbjct: 439 DVDAACPPSVVGNVMLYTGSQEGRSVLWE--YQFEQATENNYAEDLSKHIPRYCPGSVRR 496

Query: 484 ILQLVYQEEPHSIVWVVLEPK------------DNSFPRLLGCRFSAEGEGDFAWHTHMI 531
           I+     +   + +W  L+                +        F    +    WH  + 
Sbjct: 497 IVGSA--QSGRTFLWSSLDAATLYVHSSYWQAQQRAQNAWNKLTF---AQMSNIWHHWVD 551

Query: 532 SDKHYVLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573
               YVL   S        + + + V  + GE R   +RL++
Sbjct: 552 EGNLYVLGQTSV----GYLSLVAVPVDANLGEHREIDLRLDM 589


>gi|68299742|ref|YP_249591.1| Tail tubular protein B [Vibriophage VP4]
 gi|66473281|gb|AAY46290.1| tail tubular protein B [Vibriophage VP4]
          Length = 794

 Score = 61.0 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 45/278 (16%), Positives = 85/278 (30%), Gaps = 36/278 (12%)

Query: 287 YVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV---- 336
            VW +    +          P        G        W   A G+     YPS +    
Sbjct: 288 NVWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDETNPYPSFIGNSI 347

Query: 337 ---TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393
               F  NRL F   +     V LS  G +++F  +      D    +  AV+    S +
Sbjct: 348 NDIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISIL 402

Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCG 451
            +  PF E +++  D + ++LS        +I     +   V     P  +G  + FV  
Sbjct: 403 KYAVPFSEELILWSDQAQFVLSSDGGLTPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSP 462

Query: 452 VGRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLE 502
                +    S  + +   ++TQ+      + H+       + ++      + +   +L 
Sbjct: 463 -----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPYYVENGVFKMSGSSTENFLT--ILT 515

Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
             +          +  E     +W          VL  
Sbjct: 516 EGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553


>gi|326633075|ref|YP_004306686.1| predicted tail tubular protein B [Salmonella phage Vi06]
 gi|301170548|emb|CBV65236.1| predicted tail tubular protein B [Salmonella phage Vi06]
          Length = 795

 Score = 61.0 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 42/318 (13%), Positives = 100/318 (31%), Gaps = 44/318 (13%)

Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSA 325
            ++  +SK++ +          VW +    + + + +        +  A  +        
Sbjct: 269 KIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGN-FELKRIE 327

Query: 326 WG-------EQEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGE 371
           W        +   +PS        V F  NRL     +     + LS    +++F     
Sbjct: 328 WSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGEN----IILSRTAKYFNFYPASI 383

Query: 372 YGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVS 430
               D    +  AV+    + + +  PF E +L+  D + ++L+ S +    SI+    +
Sbjct: 384 ATLSDDDP-IDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTT 442

Query: 431 GSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ----- 482
              V     P  +G  + F        +    S  + +   +++ +  A+ +        
Sbjct: 443 QFDVQDRARPFGIGRNVYFASP-----RSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYI 497

Query: 483 --RILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
              +  +      +     VL   D S   +    +  E     +W          VL+ 
Sbjct: 498 PNGVFDICGSSTENFCA--VLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLAC 555

Query: 541 ASFPNDNRGGTSLWMLVA 558
                     + +++++ 
Sbjct: 556 QCI------NSDMYVILR 567


>gi|325171313|ref|YP_004251284.1| tail tubular protein B [Vibrio phage ICP3]
 gi|323512019|gb|ADX87481.1| tail tubular protein B [Vibrio phage ICP3]
          Length = 794

 Score = 61.0 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 44/278 (15%), Positives = 85/278 (30%), Gaps = 36/278 (12%)

Query: 287 YVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV---- 336
            VW +    +          P        G        W   A G+     YPS +    
Sbjct: 288 NVWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSI 347

Query: 337 ---TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393
               F  NRL F   +     V LS  G +++F  +      D    +  AV+    S +
Sbjct: 348 NDIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISIL 402

Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCG 451
            +  PF E +++  D + ++LS        ++     +   V     P  +G  + FV  
Sbjct: 403 KYAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP 462

Query: 452 VGRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLE 502
                +    S  + +   ++TQ+      + H+       + ++      + +   +L 
Sbjct: 463 -----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILT 515

Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
             +          +  E     +W          VL  
Sbjct: 516 EGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553


>gi|323512212|gb|ADX87670.1| tail tubular protein B [Vibrio phage ICP3_2007_A]
          Length = 794

 Score = 61.0 bits (146), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59
           M   + +  +   G      +  + D+  ++ QG  +         G L   P     + 
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53

Query: 60  CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115
                   +       +  +     + F    +++  +     K   A  G +Y T  + 
Sbjct: 54  LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111

Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167
              K L           ++++          P  L        +     +        + 
Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171

Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227
             + +  ++     +  A         D      +++G ++  G     ++K+    I +
Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINS 231

Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287
             V D      L  G   D    ++   Y  +N I  I V    +    +      A   
Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288

Query: 288 VWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336
           VW +    +          P        G        W   A G+     YPS +     
Sbjct: 289 VWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348

Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394
              F  NRL F   +     V LS  G +++F  +      D    +  AV+    S + 
Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403

Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452
           +  PF E +++  D + ++LS        ++     +   V     P  +G  + FV   
Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462

Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503
               +    S  + +   ++TQ+      + H+       + ++      + +   +L  
Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516

Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
            +          +  E     +W          VL  
Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553


>gi|323512164|gb|ADX87623.1| tail tubular protein B [Vibrio phage ICP3_2008_A]
          Length = 795

 Score = 61.0 bits (146), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59
           M   + +  +   G      +  + D+  ++ QG  +         G L   P     + 
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53

Query: 60  CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115
                   +       +  +     + F    +++  +     K   A  G +Y T  + 
Sbjct: 54  LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111

Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167
              K L           ++++          P  L        +     +        + 
Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171

Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227
             + +  ++     +  A         D      +++G ++  G     ++K+    I +
Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINS 231

Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287
             V D      L  G   D    ++   Y  +N I  I V    +    +      A   
Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288

Query: 288 VWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336
           VW +    +          P        G        W   A G+     YPS +     
Sbjct: 289 VWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348

Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394
              F  NRL F   +     V LS  G +++F  +      D    +  AV+    S + 
Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403

Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452
           +  PF E +++  D + ++LS        ++     +   V     P  +G  + FV   
Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462

Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503
               +    S  + +   ++TQ+      + H+       + ++      + +   +L  
Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516

Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
            +          +  E     +W          VL  
Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553


>gi|323512115|gb|ADX87575.1| tail tubular protein B [Vibrio phage ICP3_2009_A]
          Length = 794

 Score = 61.0 bits (146), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 79/577 (13%), Positives = 166/577 (28%), Gaps = 61/577 (10%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHA-QGVAKSRNLIPLRYGPLVSMPLMQEYRD 59
           M   + +  +   G      +  + D+  ++ QG  +         G L   P     + 
Sbjct: 1   MALISQSIKNLKGG------ISQQPDILRYSDQGSKQINGFSSEVEG-LQKRPPSVHVKR 53

Query: 60  CRL---DPRSNRVFSFSIPDGGYALLVFGDKKLQIVVV-RSSTKWSPALFGKTYKTPYTF 115
                   +       +  +     + F    +++  +     K   A  G +Y T  + 
Sbjct: 54  LTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVT--SS 111

Query: 116 KDNKSLEYAVFGSTAVFVHKDHP--------PHHLLYIQDGDKISFTFDEIKFLPPPWLG 167
              K L           ++++          P  L        +     +        + 
Sbjct: 112 NPRKDLRMVTVADYTFILNRNVSTAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRVKVN 171

Query: 168 DGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGA 227
             + +  ++     +  A         D      +++G ++  G     ++K+    I +
Sbjct: 172 GSVEASFETPLGDQVEHAKQIDIAYIIDQLAAGLINRGWAVTKGSGYFYFSKSGTVIINS 231

Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287
             V D      L  G   D    ++   Y  +N I  I V    +    +      A   
Sbjct: 232 LEVEDG-YNGQLAWGIINDVQKTTQLPVYAPNNYI--IRVSGDPTLNQDDYYVKFDASRN 288

Query: 288 VWGDIKDVSKDGRSIS-VAPQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV----- 336
           VW +    +          P        G        W   A G+     YPS +     
Sbjct: 289 VWTECPAPNIKADYNKDTMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSIN 348

Query: 337 --TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394
              F  NRL F   +     V LS  G +++F  +      D    +  AV+    S + 
Sbjct: 349 DIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISILK 403

Query: 395 WMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCGV 452
           +  PF E +++  D + ++LS        ++     +   V     P  +G  + FV   
Sbjct: 404 YAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQTRPFGIGRGVYFVSP- 462

Query: 453 GRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLEP 503
               +    S  + +   ++TQ+      + H+       + ++      + +   +L  
Sbjct: 463 ----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILTE 516

Query: 504 KDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSA 540
            +          +  E     +W          VL  
Sbjct: 517 GNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCC 553


>gi|291334275|gb|ADD93938.1| hypothetical protein BTH_I0919 [uncultured marine bacterium
           MedDCM-OCT-S08-C235]
          Length = 323

 Score = 59.9 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 24/148 (16%), Positives = 43/148 (29%), Gaps = 16/148 (10%)

Query: 417 SLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKY-ISGSTEQGFRFNEITQL 475
           +     +   RR +  G       ++    +FV   GR ++  +    E  +    I+ L
Sbjct: 11  NSLTPSNFTARRQTTHGCSHVNVKTLEGGALFVQKHGRAVRELLFTDLELSYSATNISLL 70

Query: 476 ADHLFNQRILQLVYQ---EEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMIS 532
           A HL    +   + Q   E P S    +            G   +   E    W     +
Sbjct: 71  ASHLVQTPVDMTILQGTAERPESYAIFINSDGT------AGVFHAVRAEKLAGWTEWKTT 124

Query: 533 DKHYVLSAASFPNDNRGGTSLWMLVALS 560
                 S  +       G+ L+  V   
Sbjct: 125 TGATFKSIEAV------GSRLFFTVYRD 146


>gi|281416199|ref|YP_003347934.1| tail tubular protein B [Vibrio phage N4]
 gi|237701506|gb|ACR16499.1| tail tubular protein B [Vibrio phage N4]
          Length = 794

 Score = 58.7 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 44/276 (15%), Positives = 85/276 (30%), Gaps = 36/276 (13%)

Query: 287 YVWGDIKDVSKDGRSISVA-PQSQTLFQAGVSVVS---WFMSAWGE--QEGYPSHV---- 336
            VW +    +          P        G        W   A G+     YPS +    
Sbjct: 288 NVWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDDTNPYPSFIGNSI 347

Query: 337 ---TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTI 393
               F  NRL F   +     V LS  G +++F  +      D    +  AV+    S +
Sbjct: 348 NDIFFFRNRLGFLSGEN----VILSGSGNYFNFFPESVAVLTDTDP-IDVAVSTNRISIL 402

Query: 394 HWMHPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSGV-YACPPVSVGDCLVFVCG 451
            +  PF E +++  D + ++LS        ++     +   V     P  +G  + FV  
Sbjct: 403 KYAVPFSEELILWSDQAQFVLSSDGGLTPTTVRLDLTTEFEVTEQARPFGIGRGVYFVSP 462

Query: 452 VGRRIKYISGSTEQGFRFNEITQL------ADHLF---NQRILQLVYQEEPHSIVWVVLE 502
                +    S  + +   ++TQ+      + H+       + ++      + +   +L 
Sbjct: 463 -----RAKFSSVRRFYAVQDVTQVKNAEDISAHVPSYVENGVFKMSGSSTENFLT--ILT 515

Query: 503 PKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVL 538
             +          +  E     +W          VL
Sbjct: 516 EGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVL 551


>gi|313892508|ref|ZP_07826097.1| tail tubular protein B family protein [Dialister microaerophilus
           UPII 345-E]
 gi|313119087|gb|EFR42290.1| tail tubular protein B family protein [Dialister microaerophilus
           UPII 345-E]
          Length = 807

 Score = 57.9 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 63/536 (11%), Positives = 152/536 (28%), Gaps = 84/536 (15%)

Query: 26  DLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI-----PDGGYA 80
           D+    + + +  N        L   P     +D  +   + +  +++       +    
Sbjct: 19  DILRFPEQLEEQTNGFSTESSGLQKRPPTLFIKDLGVHTTTTQAKNYACHTVDRDEEEKY 78

Query: 81  LLVFGDKKLQIVVVRSS---TKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDH 137
           +++F  + + +  ++       +      +   T       + L+          V+ + 
Sbjct: 79  IMLFTGEDILVYDLKGKQYKVTYEDEKSKQYITT---ENPREELKMVTIADHTFVVNTEV 135

Query: 138 PPHHLLYIQDGDKISFT-------------------------FDEIKFLPPPWLGDGMIS 172
                   +D     ++                           +          D   +
Sbjct: 136 VVK---MSEDKVPWKWSDHEALIHIQKGNYGREYSIKINGKKVAKYTTPDGGEASDIKYT 192

Query: 173 GVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVAD 232
                  +  +   T     T D K        +          +  +  Y I ++ V+D
Sbjct: 193 DTNYIRDILGNAIQTEEVLYT-DGKYHNQSSGWQVTYYNSAFKIYHPD--YYINSFEVSD 249

Query: 233 DKVYRSLTTGRSGDR-FGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
                ++   +   + F +            T   + +  + T     +     + VW +
Sbjct: 250 GFNGEAMHAIKHAVQKFNHLPADAPD---GYTVKVIGDKHTGTDDYYVTFDGKEH-VWKE 305

Query: 292 IKDVSK----DGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGY--PSHV-------TF 338
               +     D  ++      Q+     +   +W     G++E    PS V         
Sbjct: 306 CAKPNISKGFDAETMPHILVRQSDGTFKLKKANWDERKAGDEESNEPPSFVDNTINDIFL 365

Query: 339 HNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHP 398
             NRL F   +     + LS   +F++F L       D T  +  AV++ S S +     
Sbjct: 366 FRNRLGFLSGEN----IILSRSASFFNFWLASAVELQD-TDTIDLAVSNNSVSILEHAVL 420

Query: 399 FGEGVLVGCDTSLWLLS--ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVF-VCGV-GR 454
           F E +L+  + + ++++    L+   +  +   S        P+  G  + F V      
Sbjct: 421 FNEELLLFSNNAQFIMTSEGILTPQKASVYFATSFPSATEVVPIKAGRRVYFPVKRALYS 480

Query: 455 RIKYISGSTEQGFRFNEITQLADHL--------------FNQRILQLVYQEEPHSI 496
            I+    + E      +   +  H+               N+ I+ +     P S+
Sbjct: 481 GIRE-YYTLEDTRGSKDAQDITAHVPSLIPNGIHKLWECTNESIILVASNATPDSL 535


>gi|281416310|ref|YP_003347550.1| tail tubular protein B [Klebsiella phage KP32]
 gi|262410429|gb|ACY66694.1| tail tubular protein B [Klebsiella phage KP32]
          Length = 791

 Score = 57.9 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 48/314 (15%), Positives = 94/314 (29%), Gaps = 36/314 (11%)

Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
            ++  +SKT+ +          VW    G    +  D  ++             +    W
Sbjct: 266 KIVGDTSKTADQYYVKYDKSQKVWKETVGWNISIGLDYTTMPWTLVRAADGNFDLGYHDW 325

Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
                G+++  P           V F  NRL F   +     + +S    +++F      
Sbjct: 326 KDRRAGDEDTNPQPSFVNSTITDVFFFRNRLGFISGEN----IVMSRTSKYFEFYPPSVA 381

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431
             Y     L  AV+    S + +   F E +L+  D + ++LS +      +      + 
Sbjct: 382 -NYTDDDPLDVAVSHNRVSVLKYAVSFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQ 440

Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFR----FNEITQLADHLFNQRILQ 486
                   P  +G  + +          +     Q         ++T    H+ N  I  
Sbjct: 441 FDVSDRARPYGIGRNIYYASPRSSFTSIMRYYAVQDVSSVKNAEDMT---AHVPNY-IPN 496

Query: 487 LVYQEEPHSI--VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFP 544
            VY            VL     S   +    +  E     +W      D   V++A    
Sbjct: 497 GVYSINGSGTENFACVLTKGAPSKVFIYKFLYMDENIRQQSWSHWDFGDGVEVMAANCI- 555

Query: 545 NDNRGGTSLWMLVA 558
                 ++++ML+ 
Sbjct: 556 -----NSTMYMLMR 564


>gi|310005669|gb|ADP00057.1| tail tube B [Cyanophage 9515-10a]
          Length = 1000

 Score = 57.6 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 38/236 (16%), Positives = 77/236 (32%), Gaps = 18/236 (7%)

Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVA 284
           + +       V  + +     D +     A      +  W   +      S  S      
Sbjct: 418 LPSMCKHGYIVQVANSENVDADNYYVKFLADNGSGGSGKWEETVR-PHNFSSGSDPMVKG 476

Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE--QEGYPSH------- 335
                     V+    + +     +T   A  +   W     G+     +PS        
Sbjct: 477 LDPATMPHALVNNRNGTFTFKKLDETTANADNTDNYWKYREVGDDETNPFPSFKGLEIQK 536

Query: 336 VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395
           + FH NRL F  +      V +S  G +++F +       D    +   V+D   + I+ 
Sbjct: 537 IFFHRNRLGFVAN----EQVVMSRPGDYFNFFVVSAITTSD-DNPIDITVSDIKPAFINH 591

Query: 396 MHPFGEGVLVGCDTSLWLL--SISLSKGLSIDFRRVSGSGV-YACPPVSVGDCLVF 448
           + P  +GV++  D   ++L     +    +   +++S      A  PV +G  ++F
Sbjct: 592 VLPVQKGVMMFSDNGQFILFTESDIFSPKTARLKKISSYECDDALQPVDMGTSVMF 647


>gi|194473836|ref|YP_002048660.1| tail tubular protein B [Morganella phage MmP1]
 gi|194307057|gb|ACF42039.1| tail tubular protein B [Morganella phage MmP1]
          Length = 819

 Score = 55.6 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 66/524 (12%), Positives = 150/524 (28%), Gaps = 70/524 (13%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   + +  +   G      +  + D+  +    A   N        L   P +   +  
Sbjct: 1   MALVSQSTKNLKGG------ISQQPDILRYPDQGAAQVNGWSSETEGLQKRPPLVFVKQL 54

Query: 61  RLDP--RSNRVFSFS-IPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117
                  S+ +  +    +    L+ F    +++  +               K P   +D
Sbjct: 55  GGKNYLGSDPLVHYINRSEDEKYLVAFSGTGVKVFDMEGKEYTVHNNNAAYLKAPNPKQD 114

Query: 118 NKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSN 177
            + +           V+++    +      G   +   D +  +     G  +   +   
Sbjct: 115 LRMVTV---ADYTFIVNRNITVKNRSEKSTGGTFNPKSDCLIAVRGSQYGRTIKVTINGV 171

Query: 178 AKLSIS----QADTSTARITSDMKI---FKPLDKGRSIRLGCHPP--------EWAKNTN 222
            +++ +            I++D  I      L  G++       P        E+   T 
Sbjct: 172 DRVNFTLHDGAEAWQGRTISTDKVIRYIVDQLTTGKTTEGQGSLPGLGHYGVFEYVTTTP 231

Query: 223 YSIGAYIV-ADDKVYRSLTTGRSGDRFGYSKG---------ATYVKDNNIT--------W 264
              G  +   D  VY     G+  D    + G           YV+             +
Sbjct: 232 LPSGWTVKGMDGFVYIKAPAGQQIDTITTTDGYSDQLVYPVTHYVQTTAKLPLNAPDNYY 291

Query: 265 ITVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320
           I V+  +  T+ +          VW    G    +     ++  A   ++     V  + 
Sbjct: 292 IKVVGEAEGTADQYYLKFDKDARVWREAIGWNAILGFQKDTMPHALIRRSDGNFEVKALD 351

Query: 321 WFMSAWGEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGE 371
           W     G+ +  P         S V F  NRL F   +     + +S  G ++       
Sbjct: 352 WSDKEAGDDDTNPDVSLVDRTISDVFFFRNRLGFVSGEN----IVMSRTGRYFKLYPASV 407

Query: 372 YGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVS 430
               D          +     + +  PF E +L+  + + ++L+        +++    +
Sbjct: 408 AAISDDDPIDVAVSYNRVVD-LQFAVPFTEELLLWANGAQFILTAQGILSPKTVELNLST 466

Query: 431 GSGVY-ACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEIT 473
              V+    PV +G  + +        +    S  + F   +++
Sbjct: 467 QFSVHTGARPVGIGRNVYYASP-----RATFTSINRYFTVQDVS 505


>gi|326536942|ref|YP_004306349.1| tail tubular protein B [Pseudomonas phage phiIBB-PF7A]
 gi|318054518|gb|ADV35694.1| tail tubular protein B [Pseudomonas phage phiIBB-PF7A]
          Length = 807

 Score = 55.6 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 52/337 (15%), Positives = 104/337 (30%), Gaps = 49/337 (14%)

Query: 265 ITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ---AGVSVVSW 321
              L   +  +  S       Y   G +   +     I+   ++        A      W
Sbjct: 273 EGYLVEITGEATRSGDNYWVRYDGAGRVWKETVKPGIIAGLNRATMPRGLVRAADGQFDW 332

Query: 322 FMSAWG-------EQEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFS 367
            +  W        E    PS V        F  NRL F   +     V +S    +++F 
Sbjct: 333 KVLDWNNRGCGDDETNPLPSFVGGTINDVFFFRNRLGFLSGEN----VIMSRSSRYFNFF 388

Query: 368 LDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLL-SISLSKGLSIDF 426
                   D    L  AV+    S + +  PF E +L+  D + ++L S  +    +++ 
Sbjct: 389 PPSVAALSDDDP-LDIAVSHNRISILKYAVPFSEQLLLWSDQAQFVLSSQGILSPKTVEL 447

Query: 427 RRVSGSGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEIT------QLADHL 479
              +   V     P  +G  + F        +    S ++ +   +++       ++ H+
Sbjct: 448 NLTTEFDVQDTARPFGIGRGVYF-----SAPRAAYTSLKRYYAVQDVSDVKNAEDVSAHV 502

Query: 480 F---NQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHY 536
                 R+  +      + +   +L         L    + AE     +W          
Sbjct: 503 PSYIENRVFNIHGSGTENYVT--LLSDGAPGIVYLYKFLYMAEDIAQQSWSHWEFGQNVN 560

Query: 537 VLSAASFPNDNRGGTSLWMLVALSAGEERSFTVRLNL 573
           +L AAS       G+ +++L+    G       R+  
Sbjct: 561 ILGAASI------GSYMYLLMDRPEG---IVLERMEF 588


>gi|194100501|ref|YP_002003346.1| gp12 [Yersinia phage Yepe2]
 gi|193201234|gb|ACF15715.1| gp12 [Yersinia phage Yepe2]
          Length = 792

 Score = 55.6 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 77/557 (13%), Positives = 158/557 (28%), Gaps = 79/557 (14%)

Query: 47  PLVSMPLMQEYRDC----RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSP 102
            L   P     +       L  +             Y ++  G       +         
Sbjct: 41  GLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLNGKEYDVKG 100

Query: 103 ALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD-------HPPHHLLYIQD-------- 147
            L     + P        L           V+++        P + L    D        
Sbjct: 101 DLSYVKVENP-----RDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGG 155

Query: 148 --GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205
             G  ++FT +  K       GD      +++A+           +  + +       KG
Sbjct: 156 MYGRTLAFTINNTKIAYEIAHGDAPEHSKQTDAQW--------LVKKLAGLARLNVAFKG 207

Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265
            +   G         +N  I +    D    + +       +   S     V+  N   +
Sbjct: 208 WTFTEGPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQ---SFSRLPVEAPNGYTV 264

Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
            ++  +SKTS            VW    G       +G ++  A   Q      +  + W
Sbjct: 265 KIVGDTSKTSDMFYVQYDNLKKVWKEVAGWGVQKGLNGDTMPHALVRQADGSFQMQALPW 324

Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
                G+ +  P+          V F  NRL F   +     + +S    ++        
Sbjct: 325 AQRTCGDMDTNPTPSIVDQTINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 380

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431
              D    +  AV+    S + +  PF E +L+  D + ++LS        S++    + 
Sbjct: 381 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 439

Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481
                   P  VG  + F        +    S  + +   +++       ++ H+ +   
Sbjct: 440 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIP 494

Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
             +  +      + I   VL     S   L    +  E     +W    +     VL+  
Sbjct: 495 NGVFSIRGSSTENFIA--VLSSNAPSRIFLYKFLYLNEEISQQSWSHWELGSNVTVLACD 552

Query: 542 SFPNDNRGGTSLWMLVA 558
           S       G+++++++ 
Sbjct: 553 SI------GSTMYLVLR 563


>gi|312436378|gb|ADQ83187.1| tail tubular protein B [Yersinia phage Yep-phi]
          Length = 792

 Score = 55.2 bits (131), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 73/552 (13%), Positives = 159/552 (28%), Gaps = 69/552 (12%)

Query: 47  PLVSMPLMQEYRDC----RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSP 102
            L   P     +       L  +             Y ++  G   +++  +    ++S 
Sbjct: 41  GLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVVFTGQG-VRVFDLDGK-EYSV 98

Query: 103 ALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD-------HPPHHLLYIQD-----GDK 150
                  K      D + +           V+++        P + L    D        
Sbjct: 99  KGDLSYVKVGNPRDDLRMVTV---ADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGG 155

Query: 151 ISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRL 210
           +                +     V  ++K + +Q      +  + +       KG +   
Sbjct: 156 MYGRTLAFTINNTKIAYEIAHGDVPEHSKQTDAQW---LVKKLAGLARLNVAFKGWTFTE 212

Query: 211 GCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNL 270
           G         +N  I +    D    + +       +   S     V+  N   + ++  
Sbjct: 213 GPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQ---SFSRLPVEAPNGYTVKIVGD 269

Query: 271 SSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326
           +SKTS            VW    G       +G ++  A   Q      +  + W     
Sbjct: 270 TSKTSDMFYVQYDNLKKVWKEVAGWGVQKGLNGDTMPHALVRQADGSFQMQALPWAQRTC 329

Query: 327 GEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377
           G+ +  P+          V F  NRL F   +     + +S    ++           D 
Sbjct: 330 GDMDTNPTPSIVDQTINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD 385

Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSG-VY 435
              +  AV+    S + +  PF E +L+  D + ++LS        S++    +      
Sbjct: 386 DP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSD 444

Query: 436 ACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN---QRILQ 486
              P  VG  + F        +    S  + +   +++       ++ H+ +     +  
Sbjct: 445 RARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFS 499

Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           +      + I   VL     S   L    +  E     +W    +     VL+  S    
Sbjct: 500 IRGSSTENFI--SVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSI--- 554

Query: 547 NRGGTSLWMLVA 558
              G+++++++ 
Sbjct: 555 ---GSTMYLVLR 563


>gi|194100452|ref|YP_002003825.1| gp12 [Klebsiella phage K11]
 gi|193201391|gb|ACF15869.1| gp12 [Klebsiella phage K11]
          Length = 791

 Score = 54.5 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 50/314 (15%), Positives = 95/314 (30%), Gaps = 36/314 (11%)

Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
            ++  +SKT+ +      A   VW    G    V  +  ++             +    W
Sbjct: 266 KIVGDTSKTADQYYVKYDASQKVWKETVGWNISVGLEYHTMPWTLVRAADGNFDLGYHEW 325

Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
                G+ +  P           V F  NRL F   +     + LS    +++F      
Sbjct: 326 RDRRAGDDDTNPQPSFVNSTITDVFFFRNRLGFISGEN----IVLSRTSKYFEFYPPSVA 381

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431
             Y     L  AV+    S + +   F E +L+  D + ++LS +      +      + 
Sbjct: 382 -NYTDDDPLDVAVSHNRVSVLKYAVSFAEELLLWSDEAQFVLSANGVLSAKTAQLDLTTQ 440

Query: 432 SGVYA-CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFR----FNEITQLADHLFNQRILQ 486
             V     P  +G  + +          +     Q         ++T    H+ N  I  
Sbjct: 441 FDVSDRARPYGIGRNIYYASPRSSFTSIMRYYAVQDVSSVKNAEDMT---AHVPNY-IPN 496

Query: 487 LVYQEEPHSI--VWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFP 544
            VY            VL     S   +    +  E     +W      D   V++A    
Sbjct: 497 GVYSINGSGTENFACVLTKGAPSKVFIYKFLYMDENIRQQSWSHWDFGDGVEVMAANCI- 555

Query: 545 NDNRGGTSLWMLVA 558
                 +++++L+ 
Sbjct: 556 -----NSTMYLLMR 564


>gi|119637778|ref|YP_919014.1| Tubular tail protein B [Yersinia phage Berlin]
 gi|119391809|emb|CAJ70682.1| hypothetical protein [Yersinia phage Berlin]
          Length = 792

 Score = 54.1 bits (128), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 79/557 (14%), Positives = 159/557 (28%), Gaps = 79/557 (14%)

Query: 47  PLVSMPLMQEYRDC----RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSP 102
            L   P     +       L  +             Y ++  G       +         
Sbjct: 41  GLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLNGKEYDVKG 100

Query: 103 ALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKD-------HPPHHLLYIQD-------- 147
            L     + P        L           V+++        P + L    D        
Sbjct: 101 DLSYVKVENP-----RDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGG 155

Query: 148 --GDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205
             G  ++FT +  K       GD      +++A+           +  + +       KG
Sbjct: 156 MYGRTLAFTINNTKIAYEIAHGDAPEHSKQTDAQW--------LVKKLAGLARLNVAFKG 207

Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265
            +   G         +N  I +    D    + +       +   S     V+  N   +
Sbjct: 208 WTFTEGPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQ---SFSRLPVEAPNGYTV 264

Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
            ++  +SKTS            VW    G       +G ++  A   Q      + V+ W
Sbjct: 265 KIVGDTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVRQADGSFQMQVLPW 324

Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
                G+ +  P+          V F  NRL F   +     + +S    ++        
Sbjct: 325 TQRTCGDMDTNPTPSIVDQKINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 380

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431
              D    +  AV+    S + +  PF E +L+  D + ++LS        S++    + 
Sbjct: 381 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 439

Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481
                   P  VG  + F        +    S  + +   +++       ++ H+ N   
Sbjct: 440 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPNYIP 494

Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
             +  +      + I   VL     S   L    +  E     +W    +     VL+  
Sbjct: 495 NGVFSIRGSSTENFI--SVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACD 552

Query: 542 SFPNDNRGGTSLWMLVA 558
           S       G+++++++ 
Sbjct: 553 SI------GSTMYLVLR 563


>gi|212671415|ref|YP_002308415.1| tubular tail protein B [Kluyvera phage Kvp1]
 gi|211997259|gb|ACJ14576.1| tubular tail protein B [Kluyvera phage Kvp1]
          Length = 793

 Score = 52.9 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 75/552 (13%), Positives = 159/552 (28%), Gaps = 68/552 (12%)

Query: 47  PLVSMP---LMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPA 103
            L   P     +   D      +  V   +        +VF    +++  +         
Sbjct: 41  GLQKRPPFIFTKTIGDAGFLGGAPLVHLINRDSIEQYYVVFTGSGVKVFDLNG------R 94

Query: 104 LFGKTYKTPYTFKDNK--SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDE---I 158
            +     T Y    N    L           V++      ++        +   D    I
Sbjct: 95  EYAVHGDTSYANCANPRDDLRMVTVADYTFVVNRS----KVVQANKDPIYTIREDGECLI 150

Query: 159 KFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWA 218
                 +     I     +A   I+    +     +D +             G +   W 
Sbjct: 151 NIRGGQYGRTFTIRLNGISASYKIADGANAPEVEQTDAQWLVKKMAQLLREGGANTWGWT 210

Query: 219 KNTNY-SIGAYIVADDKVYR-SLTTGRSGDRFGYSKGATYV------KDNNITWITVLNL 270
            N     I      D+ +++  +  G  G         +        +  N   + ++  
Sbjct: 211 VNEGAGYIHVVSRGDEPIWKVEVEDGYGGQLMSAVMHTSQSFSKLPAEAPNGYSVQIVGD 270

Query: 271 SSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAW 326
           +SKTS        A   VW    G       +  ++  A   Q+     +  + W     
Sbjct: 271 TSKTSDAFYVQYDAARKVWKEVAGWGVQKGLNNGTMPHALIRQSDGSFKMEALPWDERKC 330

Query: 327 GEQEGYP---------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDP 377
           G+    P         + V F  NRL F   +     + +S    ++           D 
Sbjct: 331 GDMNTNPDPSIVDQKINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVANLSDD 386

Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVYA 436
              +  AV+    ST+ +  PF E +L+  D + ++LS S      S++    +   V  
Sbjct: 387 DP-IDVAVSHNRISTLKYAVPFSEELLLWSDQAQFVLSASGILSPKSVELNLTTEFDVSD 445

Query: 437 -CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN---QRILQ 486
              P  +G  + F        +    S  + +   +++       ++ H+ +     +  
Sbjct: 446 KARPYGIGRGVYFASP-----RASYTSINRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFS 500

Query: 487 LVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPND 546
           +      + +   VL     S   +    +  E     +W    +     VL+  S    
Sbjct: 501 IRGSGTENFV--SVLSANAPSKIFMYKFLYLNEENVQQSWSHWELGSNVTVLACDSI--- 555

Query: 547 NRGGTSLWMLVA 558
              G+++++L+ 
Sbjct: 556 ---GSTMYLLLR 564


>gi|326424995|ref|YP_004286217.1| virion structural protein [Pseudomonas phage phi15]
 gi|325048399|emb|CBZ42012.1| virion structural protein [Pseudomonas phage phi15]
          Length = 793

 Score = 52.9 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 78/628 (12%), Positives = 168/628 (26%), Gaps = 95/628 (15%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEY--- 57
           M  ++ +  +   G      +  + D+  +    A+  N        L   P +      
Sbjct: 1   MPLSSQSIKNLKGG------ISQQPDVLRYPNQGAQQINGWSSETKGLQKRPPLVFIKRL 54

Query: 58  RDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKD 117
            +         V   +        L+F +  L I  +  +  +  +       T    +D
Sbjct: 55  AESGHFGTKPLVHLINRDAFEQYQLIFHNGALTIFDLAGNN-YPVSGSLSYIATANPRED 113

Query: 118 NKSLEYAVFGSTAV----------FVHKDHPPHH----LLYIQDGDKISFTFDEIKFLPP 163
            + L  A +                 H  +P  +    +         +           
Sbjct: 114 LRLLTVADYTFILNRTKTVEMSSELTHTGYPALNSRALVSCRGGQYGRTLRIRANGVELA 173

Query: 164 PWLGDGMISGVKSNAKLSISQADTSTARI---------TSDMKIFKPLDKGRSIRLGCHP 214
            +     ++   +     ++  D               T+             +  G   
Sbjct: 174 SYELPDGLAENNTELSKEVAAMDAQAIVKELVKRVNAGTATHGFSAAEGPSHLVIYGNGQ 233

Query: 215 PEWAKNTNYSIGAYIVADDKVYRSLTT-----GRSGDRFGYSKGATYVKDNN-ITWITVL 268
           P     T       +++        TT       +G     +  A+   DN  + +    
Sbjct: 234 PINNIYTEDGYADQLISGLIYQVQTTTKLPITAPAGYLVEITGEASRSGDNYWVRYDGAA 293

Query: 269 NLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE 328
            +  +T +      + P               ++  A   Q         ++W     G+
Sbjct: 294 KVWKETVKPGIISGINP--------------GTMPHALIRQADGTFSFGPLTWAKRTAGD 339

Query: 329 --QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTK 379
                 PS V        F  NRL F   +     + +S    ++           D   
Sbjct: 340 DETNPMPSLVDNKLNDVFFFRNRLGFLSGEN----IIMSKTAKYFQLFPSSVAASADDDP 395

Query: 380 ALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVY-AC 437
            +  AV+    S + +  PF E +L+  D + + L+ S      +      +   V  A 
Sbjct: 396 -IDVAVSHSRISILKYAVPFSEQLLLWSDQAQFTLTSSGVLSAKTAQLDLTTEFDVLDAA 454

Query: 438 PPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN---QRILQLV 488
            P  +G  + F     R            +   +++       ++ H+      ++  + 
Sbjct: 455 RPYGLGRGVYFAAPRARFCSIKRY-----YAVADVSNVKNAEDVSGHVPTYIPNKVHNVN 509

Query: 489 YQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNR 548
                + +   VL   D S   +    +  E     +W  H    K  +LS  S      
Sbjct: 510 GSGTENFV--SVLTDGDPSKVFIYKFLYQDENLAQQSWS-HWTFGKCKILSMFSI----- 561

Query: 549 GGTSLWMLVALSAGEERSFTVRLNLLDD 576
            G+  + ++  + G       RL   +D
Sbjct: 562 -GSYTYTIMDRAEGV---VLERLEFTND 585


>gi|310005781|gb|ADP00167.1| tail tube protein B [Cyanophage NATL2A-133]
          Length = 985

 Score = 52.6 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 32/166 (19%), Positives = 64/166 (38%), Gaps = 17/166 (10%)

Query: 295 VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE--QEGYPSH-------VTFHNNRLLF 345
           V+    + +     +T   A  +   W     G+     +PS        + FH NRL  
Sbjct: 467 VNNRNGTFTFKKLDETTANADSNDNYWKYREVGDDITNPFPSFKGLKISKIFFHRNRLGL 526

Query: 346 SGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV 405
             +      V +S  G +++F +       D    +   V+D   + I+ + P  +GV++
Sbjct: 527 IAN----EQVVMSRPGDYFNFQIVSAITTSD-DNPVDITVSDIKPAFINHVLPIQKGVMM 581

Query: 406 GCDTSLWLL--SISLSKGLSIDFRRVSGSGVY-ACPPVSVGDCLVF 448
             D   +LL     +    +   +++S    Y A  P+ +G  ++F
Sbjct: 582 FSDNGQFLLFTESDIFSPKTARLKKLSSYETYPALDPIDMGTSVMF 627



 Score = 40.6 bits (93), Expect = 0.75,   Method: Composition-based stats.
 Identities = 27/325 (8%), Positives = 75/325 (23%), Gaps = 20/325 (6%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M        +F  G      +  + D   +   +    N +P     L+  P  +  +  
Sbjct: 1   MPAINQRIPNFLGG------VSQQPDTIKYPGQLRVCDNAVPDVTFGLMKRPPGEFVKTL 54

Query: 61  RLDPRSNRVFSFSIPDGGYALLVF-------GDKKLQIVVVRSSTKWSPALFGKTYKTPY 113
                    +          L+         G K ++I  + +  + S           Y
Sbjct: 55  TNANADGYWYEILRDGDEKYLVQMTALSSYSGTKPIRIWNLLTGVEQSLTNSNGDSLFSY 114

Query: 114 TFKDNKSLEYA-VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMIS 172
             +   ++ YA         +   H         D    +  +   +     +  + ++ 
Sbjct: 115 MEQSGTTIPYATQTIQDYTIISNPHKTVTTTGTTDAPLANGNYAFARLDTIAYNTEYILY 174

Query: 173 GVKSNAKLSISQADTSTARITSDMKIFKPLDKGRS-IRLGCHPPEWAKNTNYSIGAYIVA 231
              +    +     T+ +            D  +     G     ++ +    +  ++  
Sbjct: 175 TGSTAPAANKYYRVTALSVDKGTNDGNTWDDTNKDGRYAGLAQFSFSDSLCEDVEGHVTV 234

Query: 232 DDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGD 291
           +   Y    T           G         T       +++   +              
Sbjct: 235 NAASYVDSNTAN-----YDGGGTAQSNFLGYTQNYKTRYTAQIVLKDGGLIKTGSESTAL 289

Query: 292 IKDVSKDGRSISVAPQSQTLFQAGV 316
            +        IS   + + + +   
Sbjct: 290 SRHHDITIEGISYRVKVKAVEEVDT 314


>gi|148724484|ref|YP_001285450.1| tail tube B [Cyanophage Syn5]
 gi|145588129|gb|ABP87948.1| tail tube B [Synechococcus phage Syn5]
          Length = 905

 Score = 52.6 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 58/396 (14%), Positives = 119/396 (30%), Gaps = 64/396 (16%)

Query: 176 SNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKV 235
           +   L I Q         + +  +     G  I +           ++++G   V     
Sbjct: 297 TAGTLDIGQITAGLVNSVNLISNYSAQAVGNVIEIER-----TDGRDFNLG---VRGGAT 348

Query: 236 YRSLTTGR-SGDRFGYSKGATYVK--------DNNITWITVLNLSSKTSRESASGAVAPY 286
            R++T  + + +      G  +          +N  +    +   S       SG+    
Sbjct: 349 NRAMTAIKGTANSIVDLPGQCFDGFELKVINTENAESDDYYVVFRSAAEGIPGSGSWEET 408

Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLF-----QAGVSVVSWFMSAWGE--QEGYPSHV--- 336
              G  +  +      ++  Q+   F         ++  W     G+      PS V   
Sbjct: 409 VAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFVGRG 468

Query: 337 ----TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSAST 392
                F+NNRL F         V +S  G +++F +       D    +    +    + 
Sbjct: 469 ISDMFFYNNRLGFLSEDA----VIMSQPGDYFNFFVTSAITISDSDP-IDVTASSTKPAI 523

Query: 393 IHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVY----ACPPVSVGDCLVF 448
           +       +G+++  + S +LL  S     S    +++    Y       PVS G  + F
Sbjct: 524 LRAAIGAPKGLILFAENSQFLL-ASQEVVFSTATIKLTEISDYFYRSLAKPVSTGVSIAF 582

Query: 449 VCGVG--RRIKYISGSTEQGF-RFNEITQLADHLFNQRILQLVYQEEPHSIVWVV----- 500
           V       +I  +S  +     +  +IT++              +  P  + W V     
Sbjct: 583 VSEADTYSKIFEMSIDSVDNRPQVADITRIVP------------EYVPTGLTWSVSTPNN 630

Query: 501 --LEPKDNSFPRLLGCRFSA-EGEGDFAWHTHMISD 533
             +   DNS    +   F+         W   ++  
Sbjct: 631 SMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPG 666


>gi|291334274|gb|ADD93937.1| hypothetical protein [uncultured marine bacterium
           MedDCM-OCT-S08-C235]
          Length = 119

 Score = 51.8 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 21/111 (18%), Positives = 37/111 (33%), Gaps = 5/111 (4%)

Query: 259 DNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQ-----TLFQ 313
           D +   I+  N     +  + +G+     +  D    +  G + +    +       +  
Sbjct: 6   DGSTIVISGANTVDTITASNINGSRTITVLNEDSYSFTAGGSANADNTDAGGGVSIFVTS 65

Query: 314 AGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFY 364
                  W    +    GYP+  TFH+ RL F GS      V+ S    F 
Sbjct: 66  PNQPNSQWQEQTYSTIRGYPASATFHDGRLWFGGSSSLPDWVWASKVDEFL 116


>gi|311875239|emb|CBX44498.1| putative tail tubular protein B [Erwinia phage phiEa1H]
 gi|311875360|emb|CBX45101.1| putative tail tubular protein B protein [Erwinia phage phiEa100]
          Length = 806

 Score = 51.8 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 68/595 (11%), Positives = 152/595 (25%), Gaps = 82/595 (13%)

Query: 39  NLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI--PDGGYALLVFGDKKLQIVVVRS 96
           N +P     L +    +          +N +        D     ++    ++ ++    
Sbjct: 32  NAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAEEYFVILQPGQVPVIFTVG 91

Query: 97  STKWSPALFGKTYKTPYTFKDNKS--LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154
                P     +  T  +         +    G     +++  P            ++ +
Sbjct: 92  GL-ACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQ------ARGDVTPS 144

Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214
            D    +   +        +  N +++           T+  +  K  D  R+  +    
Sbjct: 145 LDNKGLVYVAYANFSFTYQILINGQVAAEHK-------TASSEDVKNEDLVRTDYVAGKL 197

Query: 215 PEWAKNTNYSIGAYIVADD------------KVYRSLTTGRSGD-----RFGYSKGATYV 257
            E   +   S   + +  D                +   G  G      R   +   T  
Sbjct: 198 LENFNSRTASFPGFSMYQDGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLP 257

Query: 258 KDNNITWITVLNLSS-------KTSRESASGAVAPYYVW-GDIKDVSKDGRSISVAPQSQ 309
               + +   +  +            ES  G+   +            +  ++      +
Sbjct: 258 NRAPVGYKVQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHVLVRE 317

Query: 310 TLFQAGVSVVSWFMSAWGE-------QEGYPSHVT-----------FHNNRLLFSGSKGD 351
           +L   G +  ++    W +          +PS +               NRL+       
Sbjct: 318 SLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLML----TS 373

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW-MHPFGEGVLVGCDTS 410
             +V  S    F+DF         D                I W     G+ VL   D  
Sbjct: 374 GEAVVASRTSRFFDFFRYTVLATVDTDP-FDVFADIEEVYNIRWSAQMDGDVVLFTSDQQ 432

Query: 411 LWLLSISLSKGLSIDFRRVSGSG-VYACPPVSVGDCLVFV--CGVGRRIKYIS-GSTEQG 466
             L         S   R V+         P   GD ++F    G    I+     S    
Sbjct: 433 FTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDT 492

Query: 467 FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
            +    T   D     ++L+L      +     ++   D +   +    +    +   AW
Sbjct: 493 KKAQPATSHVDKYIRGKVLELSASSSFNRA--FIITSSDRNILYVYDWLYEGTEKVQNAW 550

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL---SAGEERSFTVRLNLLDDFK 578
           H         + + +           L++++     S G    +   +++ D+ +
Sbjct: 551 HKWSFPAGTVLHAVS------YSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELE 599


>gi|125999999|ref|YP_001039670.1| tail tubular protein B-like protein [Erwinia amylovora phage
           Era103]
 gi|121621855|gb|ABM63429.1| tail tubular protein B-like protein [Enterobacteria phage Era103]
          Length = 806

 Score = 51.8 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 68/595 (11%), Positives = 152/595 (25%), Gaps = 82/595 (13%)

Query: 39  NLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI--PDGGYALLVFGDKKLQIVVVRS 96
           N +P     L +    +          +N +        D     ++    ++ ++    
Sbjct: 32  NAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAEEYFVILQPGQVPVIFTVG 91

Query: 97  STKWSPALFGKTYKTPYTFKDNKS--LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFT 154
                P     +  T  +         +    G     +++  P            ++ +
Sbjct: 92  GL-ACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQ------ARGDVTPS 144

Query: 155 FDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHP 214
            D    +   +        +  N +++           T+  +  K  D  R+  +    
Sbjct: 145 LDNKGLVYVAYANFSFTYQILINGQVAAEHK-------TASSEDVKNEDLVRTDYVAGKL 197

Query: 215 PEWAKNTNYSIGAYIVADD------------KVYRSLTTGRSGD-----RFGYSKGATYV 257
            E   +   S   + +  D                +   G  G      R   +   T  
Sbjct: 198 LENFNSRTASFPGFSMYQDGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLP 257

Query: 258 KDNNITWITVLNLSS-------KTSRESASGAVAPYYVW-GDIKDVSKDGRSISVAPQSQ 309
               + +   +  +            ES  G+   +            +  ++      +
Sbjct: 258 NRAPVGYKVQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHVLVRE 317

Query: 310 TLFQAGVSVVSWFMSAWGE-------QEGYPSHVT-----------FHNNRLLFSGSKGD 351
           +L   G +  ++    W +          +PS +               NRL+       
Sbjct: 318 SLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLML----TS 373

Query: 352 ELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW-MHPFGEGVLVGCDTS 410
             +V  S    F+DF         D                I W     G+ VL   D  
Sbjct: 374 GEAVVASRTSRFFDFFRYTVLATVDTDP-FDVFADIEEVYNIRWSAQMDGDVVLFTSDQQ 432

Query: 411 LWLLSISLSKGLSIDFRRVSGSG-VYACPPVSVGDCLVFV--CGVGRRIKYIS-GSTEQG 466
             L         S   R V+         P   GD ++F    G    I+     S    
Sbjct: 433 FTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDT 492

Query: 467 FRFNEITQLADHLFNQRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
            +    T   D     ++L+L      +     ++   D +   +    +    +   AW
Sbjct: 493 KKAQPATSHVDKYIRGKVLELSASSSFNRA--FIITSPDRNILYVYDWLYEGTEKVQNAW 550

Query: 527 HTHMISDKHYVLSAASFPNDNRGGTSLWMLVAL---SAGEERSFTVRLNLLDDFK 578
           H         + + +           L++++     S G    +   +++ D+ +
Sbjct: 551 HKWSFPAGTVLHAVS------YSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELE 599


>gi|194100345|ref|YP_002003775.1| gp12 [Enterobacteria phage EcoDS1]
 gi|193201340|gb|ACF15819.1| gp12 [Enterobacteria phage EcoDS1]
          Length = 785

 Score = 51.0 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 67/434 (15%), Positives = 120/434 (27%), Gaps = 38/434 (8%)

Query: 47  PLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGG-YALLVFGDKKLQIVVVRSSTKWSPALF 105
            L   P     R   +D  SN  F     D      +VF    +Q+V +  +       +
Sbjct: 41  GLQKRPPTVFKRRLNIDVGSNPKFHLINRDEQEQYYIVFNGSNIQVVDLSGN------QY 94

Query: 106 GKTYKTPYTFKDNK--SLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPP 163
             + +  Y    N    +           V++                      I     
Sbjct: 95  SVSGEVDYVKSSNPRDDIRVVTVADYTFIVNRKVVVKGGSEKSHSGYNRKARALINLRGG 154

Query: 164 PW---LGDGMISGVKSNAKLSI-------SQADTSTARITSDMKIFKPLDKGRSIRLGCH 213
            +   L  G+  GVK + KL              + A   +   +        +  LG  
Sbjct: 155 QYGRTLKVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVAAYPTFTFDLGSG 214

Query: 214 PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273
                  +   I +    D    + ++      +              I      N S+ 
Sbjct: 215 FLLITAPSGTDINSVETEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSAD 274

Query: 274 TSRESASGAVAPYYVW---GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE 330
                       +      G +          ++  QS   F+      S   S   +  
Sbjct: 275 EYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVRQSDGSFEFKTLDWSKRGSGNDDTN 334

Query: 331 GYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTT 383
             PS V        F+ NRL F   +     V +S   +++ F         D    +  
Sbjct: 335 PMPSFVDATINDVFFYRNRLGFLSGEN----VIMSRSASYFAFFPKSAATLSDDDP-IDV 389

Query: 384 AVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS---LSKGLSIDFRRVSGSGVYACPPV 440
           AV+    S + +  PF E +L+  D   ++++ S    +K + +D       G  A  P 
Sbjct: 390 AVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTAKSIQLDVGSEFSLGDNA-RPF 448

Query: 441 SVGDCLVFVCGVGR 454
           +VG  + F    G 
Sbjct: 449 AVGRSVFFSAPRGS 462


>gi|194100290|ref|YP_002003488.1| gp12 [Enterobacteria phage BA14]
 gi|193201285|gb|ACF15765.1| gp12 [Enterobacteria phage BA14]
          Length = 795

 Score = 50.6 bits (119), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 49/317 (15%), Positives = 104/317 (32%), Gaps = 42/317 (13%)

Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
            ++  +SKTS +          VW    G       +G ++  A   Q+     +  + W
Sbjct: 268 KIVGDTSKTSDQFYVQYDNVKKVWKEVAGWGVQKGLNGGTMPHALVRQSDGSFQMQALPW 327

Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
                G+ +  P+          V F  NRL F   +     + +S    ++        
Sbjct: 328 SQRTCGDMDTNPTPSIVDQTINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 383

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431
              D    +  AV+    S + +  PF E +L+  D + ++LS        S++    + 
Sbjct: 384 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 442

Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481
                   P  VG  + F        +    S  + +   +++       ++ H+ +   
Sbjct: 443 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIP 497

Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
             +  +      + I   VL     S   L    +  E     +W    +     VL+  
Sbjct: 498 NGVFSIRGSGTENFI--SVLSANAPSKIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACD 555

Query: 542 SFPNDNRGGTSLWMLVA 558
           S       G+++++++ 
Sbjct: 556 SI------GSTMYLVLR 566


>gi|326536137|ref|YP_004300571.1| gp12 [Enterobacteria phage 285P]
 gi|256861526|gb|ACV32482.1| gp12 [Enterobacteria phage 285P]
          Length = 795

 Score = 50.6 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 49/317 (15%), Positives = 104/317 (32%), Gaps = 42/317 (13%)

Query: 266 TVLNLSSKTSRESASGAVAPYYVW----GDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
            ++  +SKTS +          VW    G       +G ++  A   Q+     +  + W
Sbjct: 268 KIVGDTSKTSDQFYVQYDNVKKVWKEVAGWGVQKGLNGGTMPHALVRQSDGSFQMQALPW 327

Query: 322 FMSAWGEQEGYPSH---------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
                G+ +  P+          V F  NRL F   +     + +S    ++        
Sbjct: 328 SQRTCGDMDTNPTPSIVDQSINDVFFFRNRLGFLAGEN----IVMSRTSKYFSLFPASVA 383

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG 431
              D    +  AV+    S + +  PF E +L+  D + ++LS        S++    + 
Sbjct: 384 NLSDDDP-IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE 442

Query: 432 SG-VYACPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQ------LADHLFN--- 481
                   P  VG  + F        +    S  + +   +++       ++ H+ +   
Sbjct: 443 FDVSDRARPFGVGRGVYFASP-----RASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIP 497

Query: 482 QRILQLVYQEEPHSIVWVVLEPKDNSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAA 541
             +  +      + I   VL     S   L    +  E     +W    +     VL+  
Sbjct: 498 NGVFSIRGSGTENFI--SVLSANAPSKIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACD 555

Query: 542 SFPNDNRGGTSLWMLVA 558
           S       G+++++++ 
Sbjct: 556 SI------GSTMYLVLR 566


>gi|148747829|ref|YP_001285795.1| tail tubular protein B [Phormidium phage Pf-WMP3]
 gi|146230062|gb|ABQ12470.1| tail tubular protein B [Phormidium phage Pf-WMP3]
          Length = 1027

 Score = 50.6 bits (119), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 53/367 (14%), Positives = 106/367 (28%), Gaps = 35/367 (9%)

Query: 128 STAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL---GDGMISGVKSNAKL-SIS 183
            T    +  +P   LLY       ++TF              GDG +  V ++A L +  
Sbjct: 262 DTIQGTYGRYPM--LLYKTATFNDTYTFSNTGQPANADSYGWGDGSVYNVGASAYLNTSP 319

Query: 184 QADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTG- 242
              T     T   +  + +   R   L  +    A   N  +     A    Y S   G 
Sbjct: 320 FFATFGDTRTPTPQPPETVHLLRQRELRFNYGNGATGANLRVTVDGTALSANYSSTVAGT 379

Query: 243 RSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSI 302
                   + G      +++ +  +    +     S + AV    V     D +  G + 
Sbjct: 380 NRAYALYKADGTLCTSASDLAY-YIAFTGATPLGISPTAAVTITNV-----DRTYIGSAA 433

Query: 303 SVAPQSQTLFQAGVSVVSWFMSAWG--EQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSF 360
           +   Q+   +  G     + +  W       +P   T + +RL+  G   D   V  S+ 
Sbjct: 434 T---QTDNAYVQGGYFKVYGLGLWANYGTGQFPRIATVYQSRLVLGGFTNDPTRVVFSAT 490

Query: 361 GA-------FYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWL 413
           G        +  F +  +    D         +  +   +  +  +   + V    +   
Sbjct: 491 GDTVEGGVKYNFFQVTDDLDGLDSDPFDLVVSSSQADDYVTGLVEWQSSLFVLTRRA--T 548

Query: 414 LSISLSKGLSIDFRRVSGSGVY--ACPP---VSVGDCLVFVCGVGRRIKYISGSTEQG-F 467
              +         RR            P   V     + ++   G  +  ++   E G +
Sbjct: 549 FRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDSG--VFNLTPRVEDGEY 606

Query: 468 RFNEITQ 474
           +  E + 
Sbjct: 607 QAIEKSI 613


>gi|18640503|ref|NP_570344.1| tail protein A [Synechococcus phage P60]
 gi|18478733|gb|AAL73282.1| tail protein A [Synechococcus phage P60]
          Length = 680

 Score = 50.2 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 42/353 (11%), Positives = 93/353 (26%), Gaps = 46/353 (13%)

Query: 18  PRLLQS---RKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSI 74
           P LL     + D       V ++RN+        +  P  +        P+  +      
Sbjct: 9   PNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTGIPKRAKWIPIMR 68

Query: 75  PD-GGYALLVF-------GDKKLQIVVVRSSTKWSPALFGKTYK--TPYTFKDNKSLEYA 124
                Y + ++       GD ++++  +++  + + +  G   +   P    D +++   
Sbjct: 69  DAREHYYVAIYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSL 128

Query: 125 VFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQ 184
             G      + +  P            SF+      +     G G    V    + S  Q
Sbjct: 129 TIGDYTFLSNPNVQP-------TTWSRSFSRRPEGLVTIGAAGYGTSYIVDFATEDSGQQ 181

Query: 185 ADTSTARITSDMKIFKPLDKGRSIRLGCHPPEW----------AKNTNYSIGAYIVADDK 234
              +   + +     K  D             W           +   + +         
Sbjct: 182 RRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLVDDGEEYGHN 241

Query: 235 VYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKT--------------SRESAS 280
               +T    G+          V  +   W   +    ++                +S  
Sbjct: 242 YIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGG 301

Query: 281 GAVAPYYVWGDIKDVSKDGRSISVAPQS--QTLFQAGVSVVSWFMSAWGEQEG 331
           GA     V G    ++  G   + +  +  +  +        + MSA G   G
Sbjct: 302 GASTSDIVTGLSAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSG 354



 Score = 44.8 bits (104), Expect = 0.034,   Method: Composition-based stats.
 Identities = 22/134 (16%), Positives = 39/134 (29%), Gaps = 8/134 (5%)

Query: 337 TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWM 396
             + NRL F         V +S  G +++F         D                   +
Sbjct: 482 FMYKNRLGFLTQDA----VIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAI 537

Query: 397 HPFGEGVLVGCDTSLWLLSISLS-KGLSIDFRRVSGSG-VYACPPVSVGDCLVFVCGVG- 453
                 +L G      L S   S    +    ++S         PV  G  ++F   +G 
Sbjct: 538 SSTSGAILFGNQAQFRLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMGT 597

Query: 454 -RRIKYISGSTEQG 466
              +  +S  + +G
Sbjct: 598 YSSVYELSTESAKG 611


>gi|325171208|ref|YP_004251180.1| hypothetical protein ViPhICP2p09 [Vibrio phage ICP2]
 gi|323512234|gb|ADX87691.1| conserved hypothetical protein [Vibrio phage ICP2]
 gi|323512306|gb|ADX87762.1| hypothetical protein TU12-16_00040 [Vibrio phage ICP2_2006_A]
          Length = 734

 Score = 49.9 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 62/393 (15%), Positives = 117/393 (29%), Gaps = 49/393 (12%)

Query: 110 KTPYTFKDNKSLEYAVFGSTAVFVHKDHPPH---HLLYIQDGDKISFTFDEIKFLPPPWL 166
           +T Y+      +    FG    F      P     L         +    E         
Sbjct: 79  QTHYSA--IPEILVVQFGDKLHFFDTSVDPLSNGKLFINNQEFLTTEGTTEDIISGASVE 136

Query: 167 GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226
           G  + +   ++  +S+   D  +  IT+  KI       R +        W ++      
Sbjct: 137 GIFVFATQDADP-ISLQIMDIQSDSITARTKIV----VDRKVLFLETRDVWGRSAPSKER 191

Query: 227 AYIVADDKVYRSLTTGRSGDRFGYSKGA--TYVKDNNITWITVLNLSSKTSRESASGAVA 284
              ++ D +Y  +  G    +   +      Y    +I W   L  ++  +  +A G   
Sbjct: 192 PKTLSSDYLYELINQGWDTKKINSTYATIGAYPSGYDIWW---LYKTTAGTDANAIGKFT 248

Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLL 344
           P  +        KD  +  +  + Q       S V+          G PS +     R+ 
Sbjct: 249 PSRM--------KDSTTTGIGQERQNTPAPRGSTVASLQVLAS---GKPSCIQTFAGRVF 297

Query: 345 FSGSKGDE-----------LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTD------ 387
           ++G +                V+ S      +  ++  Y   DPT  + +A+ D      
Sbjct: 298 YAGFQATPRKIDDVRPDFRNHVFFSQL-VKSNAEINKCYQFADPTSEVDSALVDTDGGFI 356

Query: 388 --FSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSID---FRRVSGSGVYACPPVSV 442
              +A  I  M     G+ +  +  +WLLS +     S       +++  G  +   V  
Sbjct: 357 KINAARKIVAMEEVSSGLFIIAENGVWLLSGTSDGLFSATGYHVDKITDYGCVSPRSVVA 416

Query: 443 GDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL 475
               VF       I      T        +T+L
Sbjct: 417 YGDTVFYWAEEGIIVLSPDQTTGKHSAQNLTEL 449


>gi|29366731|ref|NP_813776.1| tail tubular protein B [Pseudomonas phage gh-1]
 gi|29243590|gb|AAO73169.1|AF493143_30 tail tubular protein B [Pseudomonas phage gh-1]
          Length = 808

 Score = 49.9 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 69/491 (14%), Positives = 137/491 (27%), Gaps = 57/491 (11%)

Query: 26  DLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDCR---LDPRSNRVFSFSIPDGGYALL 82
           D+   +   A   N        L   P     +  +          V   +        +
Sbjct: 20  DILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDAQEQYFV 79

Query: 83  VFGDKKLQIVVVRSST----KWSPALFGKTYKTPYTFKDNKSLEYAVFGSTA-----VFV 133
            F    L +  ++ +      ++        +T           + V  +T         
Sbjct: 80  GFSGTGLAVWDLKGNNYTVRGYNGYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLT 139

Query: 134 HKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSI------SQADT 187
           H  +P      I +     +       +     G    + +K     +         A  
Sbjct: 140 HAAYPRLDGRAIINVRGGQYGRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGM 199

Query: 188 STARITSDMKIFKPLDKGRSIRLGCHPPEWAKNT-NYSIGAYIVADDKVYRSLTTGRSGD 246
           +   +T    I   L +  ++ LG     +   T    I A   A+D V +  T     D
Sbjct: 200 NQVDMTDASWIAAELARQLTVSLGGSGWSFQAGTGWILINA--PANDNVRQIATKDGYAD 257

Query: 247 -----RFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301
                     +  T +  N       L   +  S  S       Y   G +   +   + 
Sbjct: 258 TLLSGFIYQVQTFTKLPANAPP--GYLVEITGESARSGDNYWVQYDASGKVWKETAKPKI 315

Query: 302 IS---VAPQSQTLFQAGVSVVSWFMSAWG-------EQEGYPSHV-------TFHNNRLL 344
           I+    A     L +A      W    W        +    PS V        F  NRL 
Sbjct: 316 IAGFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLG 375

Query: 345 FSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVL 404
           F   +     V +S    +++F         D    +  A++    S + +  PF E +L
Sbjct: 376 FLSGEN----VVMSRTSKYFNFFPSSVATLSDDDP-IDVAISHNRISILKYAVPFSEQLL 430

Query: 405 VGCDTSLWLLSI-SLSKGLSIDFRRVSGSG-VYACPPVSVGDCLVFVCGVGRRIKYISGS 462
           +  D + ++LS  ++    +I+    +         P  +G  + F        +    S
Sbjct: 431 LWSDQAQFVLSSKTILSSKTIELDLTTEFDVSDGARPYGIGRGVYF-----AAPRASFTS 485

Query: 463 TEQGFRFNEIT 473
            ++ +   +++
Sbjct: 486 LKRYYAIQDVS 496


>gi|9634037|ref|NP_052111.1| tail tubular protein B [Yersinia phage phiYeO3-12]
 gi|6599028|emb|CAB63632.1| tail tubular protein B [Yersinia phage phiYeO3-12]
          Length = 801

 Score = 49.5 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 31/165 (18%), Positives = 60/165 (36%), Gaps = 21/165 (12%)

Query: 329 QEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381
              YPS        + F  NRL F   +     + LS    +++F        Y     +
Sbjct: 344 TNPYPSFTGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFP-ASVSNYSDDDPI 398

Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVYA-CPP 439
             AV+    ST+ +  PF E +L+  D + ++L+ S      S++    +   V     P
Sbjct: 399 DVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARP 458

Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ 482
             VG  + F        +    S  + +   +++ +  A+ +   
Sbjct: 459 HGVGRNVYFASP-----RASFTSINRYYAVQDVSSVKNAEDMTAH 498


>gi|291335597|gb|ADD95206.1| tail tubular protein B [uncultured phage MedDCM-OCT-S04-C650]
          Length = 845

 Score = 49.5 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 67/511 (13%), Positives = 125/511 (24%), Gaps = 74/511 (14%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           M   T T  +F  G      +  + D       V +  N  P     L+  P M+     
Sbjct: 1   MPAITQTIPNFLGG------VSRQNDDKKLINQVTECVNGYPDPTYGLLKRPGMEHVNVL 54

Query: 61  RLDPR----------SNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYK 110
           +              +   F     + G  +       + +      T  +    G  Y 
Sbjct: 55  KKADGTAFSKTELADAAWFFI-DRDNAGSYIGAIKGTNIYVWTKEDGTFCTVNNTGTAYL 113

Query: 111 TP-----YTFKDNKSLEYAVFGSTAVFVH--KDHPPHHLLYIQDGDKISFTFDEIKFLPP 163
           T      Y F+  + +      +    +          +  ++    ++   D I  +  
Sbjct: 114 TGTQQSDYHFRSVQDVTVITNKTVTTAMQATPAAAVKSVGTLKLN-SVTDGLDYIVTIQG 172

Query: 164 PWLGDGMISGVKSNAKLSISQADTST---------ARITSDMKIFKPLDKGRSIRLGCHP 214
                   S    +  L    +D +T         A I +          G    L  + 
Sbjct: 173 IATSISAQSHTTFDDMLVYDSSDVNTNHHLVDAIKATIEAQHSASNADFDG-VWSLEAYT 231

Query: 215 PEWAKNTNYSIGAYIVADDKVYRSLTT---------------------GRSGDRFGYSKG 253
                  N    A +        + T                      G S +    S  
Sbjct: 232 NSLVIKRNAGTNAVVTDYTAPTGAATAFTIEAKGGLGNAGIEVFQDSVGSSAELSVESFN 291

Query: 254 ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ 313
             +VK  N             +     G        G       D  ++    ++     
Sbjct: 292 GHHVKVRNTNSADDDYYLEFEAFNGTRGKGFWKEAKGVDVSPGLDAATMPFQLENVGATT 351

Query: 314 AGVSVVSWFMSAWGEQE--------GYPSHVTF-HNNRLLFSGSKGDELSVYLSSFGAFY 364
                + W     G+          GY    TF +NNR            ++L      +
Sbjct: 352 FNFKPIPWTARLVGDTNSNPDPSFIGYKITSTFFYNNRFGVLSEDN----IFLGVANDSF 407

Query: 365 DFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTS---LWLLSISLSKG 421
           +F +       D    +   V       ++ + P  +G+L+        ++  S +    
Sbjct: 408 NFFVKSALTQVDSDP-IDLNVASVRPVVLNDVLPSPQGLLLFSARQQFQVYSASATTMTP 466

Query: 422 LSIDFRRVSGSG-VYACPPVSVGDCLVFVCG 451
            +   R +S         PV VG    FV  
Sbjct: 467 KTTVIRSISNYEMSSDISPVDVGTTAAFVNR 497


>gi|189427235|ref|YP_001949785.1| gp12 [Salmonella phage phiSG-JL2]
 gi|189085888|gb|ACD75703.1| gp12 [Salmonella phage phiSG-JL2]
          Length = 801

 Score = 49.1 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 30/154 (19%), Positives = 56/154 (36%), Gaps = 19/154 (12%)

Query: 329 QEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381
              YPS        + F  NRL F   +     + LS    +++F        Y     +
Sbjct: 344 TNPYPSFTGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFP-ASVSNYSDDDPI 398

Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSGSGVYA-CPP 439
             AV+    ST+ +  PF E +L+  D + ++L+ S      S++    +   V     P
Sbjct: 399 DVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARP 458

Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEIT 473
             VG  + F        +    S  + +   +++
Sbjct: 459 HGVGRNVYFASP-----RASFTSINRYYAVQDVS 487


>gi|17570828|ref|NP_523337.1| tail tubular protein B [Enterobacteria phage T3]
 gi|17384312|emb|CAC86300.1| tail tubular protein B [Enterobacteria phage T3]
          Length = 801

 Score = 49.1 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 30/165 (18%), Positives = 59/165 (35%), Gaps = 21/165 (12%)

Query: 329 QEGYPSH-------VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKAL 381
              YPS        + F  NRL F   +     + LS    +++F        Y     +
Sbjct: 344 TNPYPSFTGQTINDIFFFRNRLGFLSGEN----IILSRTSKYFNFFP-ASVSNYSDDDPI 398

Query: 382 TTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI-SLSKGLSIDFRRVSGSGVYA-CPP 439
             AV+    ST+ +  PF E +L+  D + ++L+   +    S+     +   V     P
Sbjct: 399 DVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTASDILSSRSVGLNLTTQFDVQDRARP 458

Query: 440 VSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQL--ADHLFNQ 482
             VG  + F        +    S  + +   +++ +  A+ +   
Sbjct: 459 HGVGRNVYF-----SSPRASFTSINRYYAVQDVSSVKNAEDMTAH 498


>gi|310005866|gb|ADP00251.1| tail tube protein B [Cyanophage Syn26]
          Length = 977

 Score = 48.3 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 50/390 (12%), Positives = 109/390 (27%), Gaps = 37/390 (9%)

Query: 88  KLQIVVVRSSTKWSPALFGKTYKTPYTFK--DNKSLEYA----VFGSTAVFVHKDHPPHH 141
             +I     S  ++     +   T Y  +      L Y       G        D     
Sbjct: 280 YFRIRTTGQSVPFTTGAGNEQVTT-YQARYTTTFDLLYGGSGWQQGDYFYVWMDDGYYKV 338

Query: 142 LLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKP 201
           ++      +I      I+  P P+  +  I+       +  +  DT              
Sbjct: 339 VIEAISTTQIQANLGLIRPNPTPFDTETTITASGILGDIRQAIIDTGNFTS------ANV 392

Query: 202 LDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNN 261
              G  + +    P    N        +       +S+       + GY       + + 
Sbjct: 393 QQIGNGLYITR--PSGTFNATAPTSDLLKVMSSEVKSVDDLPDQCKHGYVVKVANSEAD- 449

Query: 262 ITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
                 +       R    G           +++  D  ++ +    Q      VS  +W
Sbjct: 450 -EDDYYVKFFGNNDR---DGDGVWEECAKPGRNIEFDKGTMPIQLVRQANGTFLVSQATW 505

Query: 322 FMSAWGE--QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEY 372
             +  G+      PS V        F  NRL+F   +     V +S  G F++F      
Sbjct: 506 ENAEVGDDLTNPNPSFVGKTVNQLVFFRNRLVFLSDEN----VIMSRPGEFFNFW-SKTA 560

Query: 373 GCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLS--KGLSIDFRRVS 430
             + P   +  + +    + ++       G+L+      ++L+         +     V+
Sbjct: 561 TTFTPMDVIDLSCSSEYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKLNAVA 620

Query: 431 GSGVYA-CPPVSVGDCLVFVCGVGRRIKYI 459
                    P+++G  + F+    +  ++ 
Sbjct: 621 SYNFNEKTNPINLGTTVAFIDNANQFTRFF 650


>gi|224164141|ref|XP_002338648.1| predicted protein [Populus trichocarpa]
 gi|222873077|gb|EEF10208.1| predicted protein [Populus trichocarpa]
          Length = 350

 Score = 48.3 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 11/67 (16%), Positives = 22/67 (32%), Gaps = 6/67 (8%)

Query: 506 NSFPRLLGCRFSAEGEGDFAWHTHMISDKHYVLSAASFPNDNRGGTSLWMLVALSAGEER 565
            S   LL   +  + +G   W  H         S A  P       +++ +V  + G   
Sbjct: 3   RSDGTLLSLTYVKD-QGVLGWARHTTDGTF--ESVAVIP--EGTEDAVYAVVKRTIGSRT 57

Query: 566 -SFTVRL 571
             +  ++
Sbjct: 58  VRYVEKI 64


>gi|77118200|ref|YP_338122.1| tail tube [Enterobacteria phage K1F]
 gi|72527944|gb|AAZ72996.1| tail tube [Enterobacteria phage K1F]
 gi|83308152|emb|CAJ29385.1| gp12 protein [Enterobacteria phage K1F]
          Length = 785

 Score = 47.9 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 73/439 (16%), Positives = 122/439 (27%), Gaps = 48/439 (10%)

Query: 47  PLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGG-YALLVFGDKKLQIVVVRSSTKWSPALF 105
            L   P     R   +D  SN  F     D      +VF    +QIV +  + ++S +  
Sbjct: 41  GLQKRPPTVFKRRLNIDVGSNPKFHLINRDEQEQYYIVFNGSNIQIVDLSGN-QYSVSGS 99

Query: 106 GKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPW 165
               K+     D   +           V++                      I      +
Sbjct: 100 VDYVKSSNPRDD---IRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQY 156

Query: 166 ---LGDGMISGVKS-------------NAKLSISQADTSTARITSDMKIFKPLDKGRSIR 209
              L  G+  GVK                K+       +   +          D G    
Sbjct: 157 GRTLKVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTFDLGSGFL 216

Query: 210 LGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKG--ATYVKDNNITWITV 267
           L   P     N+  +   Y               S        G       + N +    
Sbjct: 217 LITAPSGTDINSVETEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEY 276

Query: 268 LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW-FMSAW 326
             +    ++      V P  V G       D  ++  A   Q+        + W    A 
Sbjct: 277 YVMYDSNTKTWKE-TVEPGVVTGF------DNTTMPHALVRQSDGSFEFKALDWSKRGAG 329

Query: 327 GE-QEGYPSHV-------TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPT 378
            +     PS V        F+ NRL F   +     V +S   +++ F         D  
Sbjct: 330 NDDTNPMPSFVDATINDVFFYRNRLGFLSGEN----VIMSRSASYFAFFPKSVATLSDDD 385

Query: 379 KALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS---LSKGLSIDFRRVSGSGVY 435
             +  AV+    S + +  PF E +L+  D   ++++ S    SK + +D       G  
Sbjct: 386 P-IDVAVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTSKSIQLDVGSEFALGDN 444

Query: 436 ACPPVSVGDCLVFVCGVGR 454
           A  P +VG  + F    G 
Sbjct: 445 A-RPFAVGRSVFFSAPRGS 462


>gi|315518952|dbj|BAJ51829.1| putative tail tubular protein B [Ralstonia phage RSB2]
          Length = 788

 Score = 46.4 bits (108), Expect = 0.014,   Method: Composition-based stats.
 Identities = 27/116 (23%), Positives = 46/116 (39%), Gaps = 9/116 (7%)

Query: 336 VTFHNNRL-LFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIH 394
           + F  NRL + +G       V LS+ G F+ F         D    +  AV+    ST+H
Sbjct: 351 IFFFRNRLGILAGEN-----VILSASGEFFKFWPKSVVTAADTDP-IDVAVSHNRVSTLH 404

Query: 395 WMHPFGEGVLVGCDTSLWLL-SISLSKGLSIDFRRVSGS-GVYACPPVSVGDCLVF 448
               F E +L+  D + ++L S  +    ++     +         PV+ G  + F
Sbjct: 405 HAVSFAEELLLWSDQTQFILKSDGILSTKTVKVDTATEFESAIDARPVAAGRGVYF 460


>gi|167841461|ref|ZP_02468145.1| tail tubular protein B [Burkholderia thailandensis MSMB43]
          Length = 853

 Score = 46.0 bits (107), Expect = 0.016,   Method: Composition-based stats.
 Identities = 27/177 (15%), Positives = 59/177 (33%), Gaps = 23/177 (12%)

Query: 286 YYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQE---GYP-------SH 335
           + V G  KD       I   P     +     V  +  S  G+++     P       S 
Sbjct: 356 FAVGGITKDGDTFA--IGSGPAQLNAYSTDFQVPKFAGSVCGDKDQTGAIPYFFGKRISL 413

Query: 336 VTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYG--CYDPTKALTTAVTDFSASTI 393
           +    +RL+         +V++S  G +++F           DP +A      D   +  
Sbjct: 414 LAMFQDRLVIVSD----GTVFMSRTGDYFNFFRKTMLSVHDDDPIQAYALGAADDVITR- 468

Query: 394 HWMHPFGEGVLVGCDTSLWLLSISL-SKGLSIDFRRVSG-SGVYACPPVSVGDCLVF 448
                + + + +    + + +  ++ +   +I    V+       C PV  G+ + +
Sbjct: 469 --CVTYNKNLFLFGLRNQYTIPGNVAASPANITISPVAAERDAILCQPVVHGNIVFY 523


>gi|302339301|ref|YP_003804507.1| hypothetical protein Spirs_2810 [Spirochaeta smaragdinae DSM 11293]
 gi|301636486|gb|ADK81913.1| hypothetical protein Spirs_2810 [Spirochaeta smaragdinae DSM 11293]
          Length = 570

 Score = 45.6 bits (106), Expect = 0.024,   Method: Composition-based stats.
 Identities = 43/210 (20%), Positives = 76/210 (36%), Gaps = 31/210 (14%)

Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380
           W ++    QE      T +  R++         +VY+S    + DF         D    
Sbjct: 153 WALAEKTSQE----ISTIYQARMIAVNRTW--GTVYMSVAYIYLDF---------DSDGH 197

Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSIS----LSKGLSIDFRRVSGSGVYA 436
           L      +      W+  FG  V +G D S W+L+            +  +++SG G   
Sbjct: 198 LELIPDFYGFEHPRWIVAFGGDVYIGTDKSEWMLTSGYPYFTDDLGGLMMQKISGIGADL 257

Query: 437 CPPVSVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRILQLVYQEEPHSI 496
              V  G  ++ +    R ++ I  S+   F+   + +L   + N  I+Q+    E  S 
Sbjct: 258 --AVVFGSSII-LAKDRRLVR-IVYSSAGEFQSQSMAEL---IDNTDIIQIDV-IEYGSH 309

Query: 497 VWVVLEPKDNSFPRLLGCRFSAEGEGDFAW 526
            ++V   +D     L  C    +  G  AW
Sbjct: 310 RYLVFIDRDRRLWCLTEC----QNTGVAAW 335



 Score = 44.5 bits (103), Expect = 0.051,   Method: Composition-based stats.
 Identities = 27/141 (19%), Positives = 41/141 (29%), Gaps = 20/141 (14%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYR-- 58
           M         F+ G +SPR +  R D +     V++    + L  G +         R  
Sbjct: 1   MSRQRILVTDFTRGIVSPR-MVPRIDQTK---AVSELTGFVVLPDGGIRRREGTIYARRG 56

Query: 59  ------DCRLDPRSNRVFSFSIPDGGYALLVFGD--KKLQIVVVRSSTKWSPALFGKTYK 110
                 DC   P                L    D  ++L +  + + T  S A       
Sbjct: 57  LGVLPTDCEAVPAFTTFDKRITGTETLHLAWINDAPRQLNVQNMTNRTIQSVASESLEAG 116

Query: 111 TPYTFK-----DNKSLEYAVF 126
            P         D +SL YA  
Sbjct: 117 KPLLDSGKFNNDLESL-YAQN 136


>gi|326434186|gb|EGD79756.1| hypothetical protein PTSG_10740 [Salpingoeca sp. ATCC 50818]
          Length = 1352

 Score = 44.5 bits (103), Expect = 0.050,   Method: Composition-based stats.
 Identities = 45/284 (15%), Positives = 91/284 (32%), Gaps = 40/284 (14%)

Query: 216 EWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITW-------ITVL 268
            W+ + +    A       V+R   TG++   + +     +     +T        + + 
Sbjct: 111 TWSHDGSRLFSADDEGQLCVWRISKTGKASLTYEHKADTGFTHCVAVTAGSEDTSMVFLG 170

Query: 269 NLSSKTSRESASGAVAP-YYVWGDIKDVSKDGRSISVAPQSQT-------LFQAGVSVVS 320
                     A+GA  P + V   I D+  D  S S+   +Q        +   G     
Sbjct: 171 TFDKGVMLADATGACTPSFPVTDKIVDLVFDPESASLVVATQDMMVVHHHVAPDGAVKDK 230

Query: 321 WFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKA 380
           +     G+     S + F    +L + +  +++ V+ S  G  Y  S+  E G + P   
Sbjct: 231 YEFKMSGKAF---SSLGFAGPGILIAATSENQIRVWCSEEGDSYSLSIAHERGEFTP-SD 286

Query: 381 LTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYACPPV 440
           +   V    A+ I         V +      W       KG +   R          PP+
Sbjct: 287 IIQTVAYNPANRIIAGTSRNGMVFM------WKF-----KGEAFTDRDSWSF----LPPI 331

Query: 441 SVGDCLVFVCGVGRRIKYISGSTEQGFRFNEITQLADHLFNQRI 484
            +   LV        +  +  +++       ++ L +H+  +  
Sbjct: 332 ELRGSLVGAQWGPDGLLLVRNASDT------VSILREHIMRKHF 369


>gi|304404646|ref|ZP_07386307.1| Kelch repeat-containing protein [Paenibacillus curdlanolyticus YK9]
 gi|304346453|gb|EFM12286.1| Kelch repeat-containing protein [Paenibacillus curdlanolyticus YK9]
          Length = 697

 Score = 43.3 bits (100), Expect = 0.099,   Method: Composition-based stats.
 Identities = 40/244 (16%), Positives = 72/244 (29%), Gaps = 26/244 (10%)

Query: 242 GRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRS 301
           G+     G  + + Y   + ++W  V   +    R S++G      +W    D      +
Sbjct: 436 GKMWAYAGQYQNSVYSSSDGVSWTCVTREAPWAGRRSSAGVSFMGAIWLFGGDTVNGDAN 495

Query: 302 ISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRL-LFSGSKGDELS---VYL 357
                     ++       W     G + G         NR+ +F G      +   V+ 
Sbjct: 496 DVWVSPDGVNWKCATPNAPW-----GPRNGL--CAVVFQNRMWVFGGRDHQGNTYNDVWA 548

Query: 358 SSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLV--GCDTSLWLLS 415
           S  GA   + L      + P  A    V       I           V    +   W   
Sbjct: 549 SDNGAH--WELITPQAGWSPRDAAAAVVYQNQIYMIGGSRSGSSLQEVWSTDNGRDWKPL 606

Query: 416 ISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVFVCGVGRRIKYISGS--TEQGFRFNEIT 473
            + +      F             V +G  L  + G   +++ +S    T+ G R+  +T
Sbjct: 607 ANGNVPWLSRFDS---------KAVVLGTNLYLIGGTNSQVRGLSDMWVTQDGSRWEAVT 657

Query: 474 QLAD 477
           Q A 
Sbjct: 658 QQAP 661


>gi|88705445|ref|ZP_01103156.1| ATP-dependent DNA helicase RecG [Congregibacter litoralis KT71]
 gi|88700535|gb|EAQ97643.1| ATP-dependent DNA helicase RecG [Congregibacter litoralis KT71]
          Length = 686

 Score = 43.3 bits (100), Expect = 0.12,   Method: Composition-based stats.
 Identities = 23/168 (13%), Positives = 49/168 (29%), Gaps = 18/168 (10%)

Query: 40  LIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTK 99
            +    G              +    +  +  F         L  GD+ L+         
Sbjct: 67  ALVSVSGGGRRRS---LIVKLQDGTGTATLRFFHFSQAQKNALQQGDR-LRCFGTVRRGA 122

Query: 100 WSPALFGKTYK------------TP-YTFKD-NKSLEYAVFGSTAVFVHKDHPPHHLLYI 145
               +    Y+            TP Y   +     ++      A+ V   HPP  LL +
Sbjct: 123 QQAEMIHPEYRRSIHISDNEESLTPIYPSTEGVSQGQWRKLSDQALSVLAKHPPEELLPV 182

Query: 146 QDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARIT 193
           ++ D    +       PPP      +   +  A+L ++  + +  +++
Sbjct: 183 RENDYGLSSALRFLHRPPPEADQQALRDGRHPAQLRLALEELTAHQLS 230


>gi|253583142|ref|ZP_04860350.1| predicted protein [Fusobacterium varium ATCC 27725]
 gi|251835034|gb|EES63587.1| predicted protein [Fusobacterium varium ATCC 27725]
          Length = 654

 Score = 42.9 bits (99), Expect = 0.13,   Method: Composition-based stats.
 Identities = 58/456 (12%), Positives = 129/456 (28%), Gaps = 52/456 (11%)

Query: 1   MVNTTWTKHSFSAGELSPRLLQSRKDLSLHAQGVAKSRNLIPLRYGPLVSMPLMQEYRDC 60
           + +T + ++++  GE   +L  ++ D  ++     +  N++P   G L  +         
Sbjct: 10  ISSTNFLQNNWQMGEAGNKLAVNK-DSEMYMTTANRIVNMLPTELGGLEVLKEHTPRSIS 68

Query: 61  RLDPRSNRVFSFSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKS 120
            L    ++    +I              + + + R  T      F     + Y F +N  
Sbjct: 69  GLPAGYDKPIIRAINTPFNF-------YICMCMDRIFTMNKSNQFL----SGYVFAEN-- 115

Query: 121 LEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKS--NA 178
             +   G   +                       F  + F         + S      N 
Sbjct: 116 -GFTRAGKLVLI--------------------DKFVLVTFPNGNRYDLEISSSGNIGLND 154

Query: 179 KLSISQADTSTARITSDMKIFKPLDK--GRSIRLGCHPPEWAKNTNYSIGAYIVADDKVY 236
             S S  +    +    + I++      G + ++  +         + I   I  D K+ 
Sbjct: 155 NFSASITNPLLHKSKVQVDIYQTRKIMIGTTEKIRPYKIRTTDLQEFLISGNIGQDGKLL 214

Query: 237 RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDI---- 292
                  +  R  Y   +  +   NI+ +T             +G    Y     +    
Sbjct: 215 FKYNKEFNITRIYYPYQSDMINMENISGLTENEWFVIIHDVDTTGGGRFYMGNSPVDFTN 274

Query: 293 KDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDE 352
                 G   + A  S+++    +      +  W  + G+   V    NR++ S      
Sbjct: 275 PKTDVTGTYYTTAKVSRSVGNTSLLSYGIMIDLWNNKVGF-HTVAEFQNRMVVSNGT--- 330

Query: 353 LSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLW 412
             ++ S  G +  F         D    +     +     +          +V       
Sbjct: 331 -YIFFSKVGDYNYF---LNGELDDDAFFIKLGYVNGEQPIVKNFITGRGLWVVTNKGIFL 386

Query: 413 LLSISLSKGLSIDFRRVSGSGVYACPPVSVGDCLVF 448
           +   ++ KG S+D R +          V + + L +
Sbjct: 387 ICYNNIVKGSSLDIRMIVADECGN-EAVDINNTLYY 421


>gi|196038454|ref|ZP_03105763.1| hypothetical protein BC059799_3729 [Bacillus cereus NVH0597-99]
 gi|228934985|ref|ZP_04097816.1| hypothetical protein bthur0009_34390 [Bacillus thuringiensis
           serovar andalousiensis BGSC 4AW1]
 gi|196030862|gb|EDX69460.1| hypothetical protein BC059799_3729 [Bacillus cereus NVH0597-99]
 gi|228824885|gb|EEM70686.1| hypothetical protein bthur0009_34390 [Bacillus thuringiensis
           serovar andalousiensis BGSC 4AW1]
          Length = 830

 Score = 42.9 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 22/95 (23%), Positives = 33/95 (34%), Gaps = 20/95 (21%)

Query: 191 RITSDMKIFKPLDKGRSIR-----------------LGCHPPEWAKNTNYSIGAYIVADD 233
            + S    F    +G  +                     +   W     Y I +YI A+ 
Sbjct: 724 SLVSSEPTFGTYSRGELLYNDTPTVGGYIGWVCITAGTANGDFWIAEKEYQINSYINANG 783

Query: 234 KVYRSLTTGRSG-DRFGYSKGATYVKDNNITWITV 267
            VY+S+  G SG     ++ G +  KD NI W  V
Sbjct: 784 NVYKSVGRGTSGKTAPSHTNGTS--KDGNIVWEYV 816


>gi|66047262|ref|YP_237103.1| insecticidal toxin protein, putative [Pseudomonas syringae pv.
           syringae B728a]
 gi|63257969|gb|AAY39065.1| insecticidal toxin protein, putative [Pseudomonas syringae pv.
           syringae B728a]
          Length = 1617

 Score = 42.5 bits (98), Expect = 0.18,   Method: Composition-based stats.
 Identities = 32/266 (12%), Positives = 75/266 (28%), Gaps = 5/266 (1%)

Query: 167 GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226
             G  + ++++   +    DT+   + + +  F+ +     I        +A+   Y IG
Sbjct: 133 KSGYFTQLENDINQNRINVDTAQEAVKAYLASFEEVANLTIINGYIDSDRFAQGKYYFIG 192

Query: 227 AYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
                +   +R++       + G ++G  +       W          +  +    + P 
Sbjct: 193 TSRAENIYYWRTVDMNERAYQEG-TEGPKFDNPTPGAWSDWKRAEIGINANTLERTIRPV 251

Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346
           Y    +     D   ++                   +      +  P  V   N RL+F+
Sbjct: 252 YFNNRLFVAWVDLVHVTEQVAVTLPEGTVKPAADGSIPITPPADIAPLTVVTPNVRLVFN 311

Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406
            S       + S+   + D  +          KA+     D ++  I  +    E + + 
Sbjct: 312 ISYKKYDDSW-SAPHIYMD--VTTPNVVTRAGKAVNLE-NDLNSIAIFDVSASPESLFIA 367

Query: 407 CDTSLWLLSISLSKGLSIDFRRVSGS 432
                 L         S      +  
Sbjct: 368 MYAGETLAPGDTDGSTSTYAFLHTAF 393


>gi|330973553|gb|EGH73619.1| insecticidal toxin protein, putative [Pseudomonas syringae pv.
           aceris str. M302273PT]
          Length = 1189

 Score = 42.5 bits (98), Expect = 0.21,   Method: Composition-based stats.
 Identities = 32/266 (12%), Positives = 75/266 (28%), Gaps = 5/266 (1%)

Query: 167 GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIG 226
             G  + ++++   +    DT+   + + +  F+ +     I        +A+   Y IG
Sbjct: 133 KSGYFTQLENDINQNRINVDTAQEAVKAYLASFEEVANLTIINGYIDSDRFAQGKYYFIG 192

Query: 227 AYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPY 286
                +   +R++       + G ++G  +       W          +  +    + P 
Sbjct: 193 TSRAENIYYWRTVDMNERAYQEG-TEGPKFDNPTPGAWSDWKRAEIGINANTLERTIRPV 251

Query: 287 YVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFS 346
           Y    +     D   ++                   +      +  P  V   N RL+F+
Sbjct: 252 YFNNRLFVAWVDLVHVTEQVAVTLPEGTVKPAADGSIPITPPADIAPLTVVTPNVRLVFN 311

Query: 347 GSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVG 406
            S       + S+   + D  +          KA+     D ++  I  +    E + + 
Sbjct: 312 ISYKKYDDSW-SAPHIYMD--VTTPNVVTRAGKAVNLE-NDLNSIAIFDVSASPESLFIA 367

Query: 407 CDTSLWLLSISLSKGLSIDFRRVSGS 432
                 L         S      +  
Sbjct: 368 MYAGETLAPGDTDGSTSTYAFLHTAF 393


>gi|302308918|ref|NP_986066.2| AFR519Cp [Ashbya gossypii ATCC 10895]
 gi|299790857|gb|AAS53890.2| AFR519Cp [Ashbya gossypii ATCC 10895]
          Length = 821

 Score = 41.0 bits (94), Expect = 0.59,   Method: Composition-based stats.
 Identities = 36/306 (11%), Positives = 87/306 (28%), Gaps = 23/306 (7%)

Query: 15  ELSPRLLQSRKDLSLHAQGVAKSR-NLIPLRYGPLVSMPLMQEYRDCRLDPRSNRVFSFS 73
           ELS ++ ++  +L   A+ + + R + I LR   +   P  +            R     
Sbjct: 3   ELSEQVERTLGNLEKKAEFLEEQRGHFIALRQRLVEYDP-EKYAAHAGDGGSGVR----- 56

Query: 74  IPDGGYALLVFGDKKL--QIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAV 131
                   LVFG+  L  ++ +      +      +     +     + LE A       
Sbjct: 57  -------GLVFGEVILSTRVYLSLGCEYYVEKQPAEAVA--WVEGRLRLLEDAQDQFRVQ 107

Query: 132 FVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTAR 191
             H       L  +       +  +       P +       +  +  ++      +   
Sbjct: 108 IAHAKSTLRELAALDGAGGADWAAESSGEDGLPLMEIR--EELDEDGNVTSGAVRRAGGP 165

Query: 192 ITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYS 251
                ++    +    + +     E   + N + G+   A   V R+     + D    S
Sbjct: 166 EAGAKRVDAAEEGLPLMEIREELDE---DGNVTGGSVRRAGGNVQRAGRAASASDAGHKS 222

Query: 252 KGATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTL 311
           +     + +      +    +       +G +  +Y   +   ++    S+ V    +  
Sbjct: 223 QDTGAARPDQAAEERLEQDLAPQQPAEDAGGLDEFYEVLEEMGITAPRESVDVGTPVEAA 282

Query: 312 FQAGVS 317
               VS
Sbjct: 283 ESGPVS 288


>gi|310005690|gb|ADP00077.1| tail tube protein B [Cyanophage NATL1A-7]
          Length = 1056

 Score = 40.6 bits (93), Expect = 0.71,   Method: Composition-based stats.
 Identities = 38/308 (12%), Positives = 91/308 (29%), Gaps = 34/308 (11%)

Query: 165 WLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYS 224
           +  +G+   +      + S    +T  +T+        D+           +        
Sbjct: 427 FSAEGIAEDIDQTGTYARSS---NTITVTAASHGLSNGDQIILDITSGGATDGFYTIANV 483

Query: 225 IGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVA 284
                   D    +++ G +          T  +     W  V+        ++ +  +A
Sbjct: 484 TTNTFTVTDSASGTISAGETCSF-------TPARFGEGVWEEVVQPGKDIEIDNTTMPIA 536

Query: 285 PYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGE--QEGYPSHV------ 336
              V      ++  G       Q+ +      S   W+    G+      PS +      
Sbjct: 537 LTRVLPGSFSINGGGS------QTYSNGAFRFSYPDWYKRDCGDDITNPEPSFIGQTIQK 590

Query: 337 -TFHNNRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHW 395
             F  NR+          +V LS    FY+F         +    +    +    + ++ 
Sbjct: 591 MVFFRNRIALL----SAENVILSRVNDFYNFWNKTAMAISNADP-IDLQSSSTYPTKLYD 645

Query: 396 MHPFGEGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVYA----CPPVSVGDCLVFVCG 451
                 G+++   +  +LLS      L+ +  ++S    +A      P+ +G  + F+  
Sbjct: 646 AVEQAGGLVIFSASEQFLLSSGAEALLTPETAKISYVSSHAFNPDTSPIELGTTIGFLNS 705

Query: 452 VGRRIKYI 459
             +  ++ 
Sbjct: 706 TAKNTRFF 713


>gi|312621233|ref|YP_004022846.1| glycoside hydrolase family 16 [Caldicellulosiruptor kronotskyensis
            2002]
 gi|312201700|gb|ADQ45027.1| glycoside hydrolase family 16 [Caldicellulosiruptor kronotskyensis
            2002]
          Length = 2435

 Score = 40.2 bits (92), Expect = 0.93,   Method: Composition-based stats.
 Identities = 43/282 (15%), Positives = 81/282 (28%), Gaps = 24/282 (8%)

Query: 72   FSIPDGGYALLVFGDKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAV 131
            + +  G     +      + V V  S  W+P     +  +  T  +   + +   G+   
Sbjct: 1144 YGVSGGRSYSYIIAVSNSKFVRVDPSNAWNPLTASASDAS--TDAELFEIVFKADGNVGF 1201

Query: 132  --------FVHKD---HPPHHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISG-----VK 175
                     V  D    P + LL           ++    +P       + +      V 
Sbjct: 1202 ASKALNNNLVCADSWSTPDYKLLPRSSYSADPGGWETFTLVPQGDGTIAIKANNGGRFVT 1261

Query: 176  SNAKLSISQADTSTARITSDMKIFKPLDKGR-SIRLGCHPPEWAKNTNYSIGAYIVADDK 234
                  I +A ++T  +     I  P   G+ S+ +                + +VA   
Sbjct: 1262 VEPTTGILKATSATVGVNEKFIIVTPYAPGQPSVTIDEVLDNSVTFHWSVPSSSVVAGYN 1321

Query: 235  VYRSLTTG--RSGDRFGYSKGATYVKD--NNITWITVLNLSSKTSRESASGAVAPYYVWG 290
            VYR+ T+G              +Y        T    +  +     E+ S  V    + G
Sbjct: 1322 VYRATTSGGPYIKLNKALLTTTSYTDTSMTANTTYYYIVAAVNARGETKSPEVMVKTLSG 1381

Query: 291  DIKDVSKDGRSISVAPQSQTL-FQAGVSVVSWFMSAWGEQEG 331
             I  +       S    S TL + A     S+ +     + G
Sbjct: 1382 PIPAIPTGLDITSCTQNSITLNWNAAAGAQSYNIYRSTSRFG 1423


>gi|313122738|ref|YP_004044665.1| chitin-binding protein [Halogeometricum borinquense DSM 11551]
 gi|312296220|gb|ADQ69309.1| uncharacterized protein contain chitin-binding domain type 3
           [Halogeometricum borinquense DSM 11551]
          Length = 562

 Score = 39.5 bits (90), Expect = 1.4,   Method: Composition-based stats.
 Identities = 17/146 (11%), Positives = 42/146 (28%), Gaps = 19/146 (13%)

Query: 193 TSDMKIFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSK 252
           T+            ++     PP+W  +T Y+ G  +V +  ++ +         + +  
Sbjct: 11  TASALFTTIAGASATVAGAESPPKWDPDTTYTSGDRVVYEGYIWEA-------KWWTH-- 61

Query: 253 GATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLF 312
           G    K +   W  +             G      +   I+  ++            +  
Sbjct: 62  GTEPQKKSGNPWKQIRE----------DGGGGGTELTAVIETNTETVTVGETVTLDASKS 111

Query: 313 QAGVSVVSWFMSAWGEQEGYPSHVTF 338
              ++   W +       G  + V+F
Sbjct: 112 TGDITSYEWTVGDRDPVTGVETTVSF 137


>gi|313674279|ref|YP_004052275.1| glycoside hydrolase family 16 [Marivirga tractuosa DSM 4126]
 gi|312940977|gb|ADR20167.1| glycoside hydrolase family 16 [Marivirga tractuosa DSM 4126]
          Length = 364

 Score = 39.5 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 34/264 (12%), Positives = 78/264 (29%), Gaps = 20/264 (7%)

Query: 140 HHLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIF 199
           +   +    DK+     E+ +         +     + A++ IS +   T  I  +  I 
Sbjct: 58  YTFSFGDGSDKLRDDDGEVTYSYAESGDYTIEVNAHTTAEVFISSSQEVTITIQQNSDID 117

Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKD 259
                      G +   W           +  D         G   + +G ++   Y ++
Sbjct: 118 DEGYVSPMEYEGYNL-VWQDEFE---ADQLSDDYTF----EIGTGSNGWGNNESQYYREE 169

Query: 260 NNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVV 319
           N       L + +K          +   +   IK+            +++  +  G+   
Sbjct: 170 NTRLEEGYLVIQAKKENFQGQEYTSSRIITEGIKEFKYG----RFDIRARMPYGQGIWPA 225

Query: 320 SWFMSAWGEQEGYPSHVTFHNNRLLFSGSKGDELSV-----YLSSFGAFYDFSLDGEYGC 374
            W + +   Q G+P       + +   G +G E +V     + S+ G + +F        
Sbjct: 226 IWMLGSNFRQVGWPHCGEI--DIMEMIGGQGREATVHGTVHWQSNEG-YANFGHSKNLSD 282

Query: 375 YDPTKALTTAVTDFSASTIHWMHP 398
                        +  ++I W+  
Sbjct: 283 GTLADKFHVFSIIWDENSIQWLID 306


>gi|197935887|ref|YP_002213723.1| tail tuber protein B [Ralstonia phage RSB1]
 gi|197927050|dbj|BAG70392.1| tail tuber protein B [Ralstonia phage RSB1]
          Length = 861

 Score = 39.5 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 33/207 (15%), Positives = 65/207 (31%), Gaps = 19/207 (9%)

Query: 254 ATYVKDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQ 313
            TY      T     +       E A+  V P  V+      S      +   Q  T   
Sbjct: 329 ETYYMRAVKTDTAAAHFGPVQWVEGAAQVVTPGQVFAIASITSTTLTLANSPAQLATAIG 388

Query: 314 AGVSVVSWFM-SAWGEQEGYP-------SHVTFHNNRLLFSGSKGDELSVYLSSFGAFYD 365
           + V   +  +     ++   P       SH+    +R++   +      + +S  G +++
Sbjct: 389 SPVPGYAASVCGDMTDKGAVPYFFGRKVSHMAMFQDRMVIVSN----GVILMSRTGDYFN 444

Query: 366 -FSLDG-EYGCYDPTKALTTAVTDFSASTIHWMHPFGEGVLVGCDTSLWLLSI-SLSKGL 422
            F          DP +A      D   S       + + + +  +   + L   S     
Sbjct: 445 WFRKSKLRVDDDDPVEAFALGSEDDIISQ---SSSYNKDLFLFGERGQYALPGRSAITPK 501

Query: 423 SIDFRRVSG-SGVYACPPVSVGDCLVF 448
           +I   +V+G        P+ VG+ L +
Sbjct: 502 TISITQVAGERDAMLARPIPVGNLLFY 528


>gi|319776426|ref|YP_004138914.1| tail fiber protein [Haemophilus influenzae F3047]
 gi|317451017|emb|CBY87248.1| probable tail fiber protein [Haemophilus influenzae F3047]
          Length = 747

 Score = 39.1 bits (89), Expect = 2.0,   Method: Composition-based stats.
 Identities = 28/198 (14%), Positives = 69/198 (34%), Gaps = 19/198 (9%)

Query: 198 IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYV 257
           +FK LD+  +  +    PEW+   +Y+ G+ +  D   YR+L   ++ +    S    +V
Sbjct: 50  LFKRLDEKHTYLMQRGLPEWSATQDYTKGSCVQFDGVSYRALKNSKN-NSPNESDSQYWV 108

Query: 258 KDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317
           +     W   L+  ++ + +             D +  +   +++      + +     +
Sbjct: 109 R-----WGFALSEIARATLQQYGIVQLSSATNSDSETKAATSKAVK-TAYDKAVEAKTTA 162

Query: 318 VVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS-KGDELSVYLSSFGAFYDFSLDGEYGCYD 376
                ++      G  S      NR++   + +  +  +Y S  G + +       G  D
Sbjct: 163 DGKVGLNGNESINGEKS----FENRIVAKRNIRISDNPIYASR-GDYLN------IGAND 211

Query: 377 PTKALTTAVTDFSASTIH 394
                    ++    T+ 
Sbjct: 212 GDCWFEYKSSNREIGTLR 229


>gi|254522602|ref|ZP_05134657.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
 gi|219720193|gb|EED38718.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
          Length = 1475

 Score = 39.1 bits (89), Expect = 2.0,   Method: Composition-based stats.
 Identities = 18/107 (16%), Positives = 37/107 (34%), Gaps = 10/107 (9%)

Query: 214 PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273
             EWA  T Y  G ++  D  +YR+L      +      G          W  + + +S 
Sbjct: 849 ADEWAAGTTYPAGDFVRHDGTLYRAL-----AENVDVEPGTAP-----AVWEAIGDYTSV 898

Query: 274 TSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320
               +A+ +++         +V++    ++  P       +   V S
Sbjct: 899 GDALAAAISMSTKNASDIAAEVTRVDAVVAKLPADGGQAASTGQVSS 945


>gi|325089518|gb|EGC42828.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 1104

 Score = 39.1 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 13/56 (23%), Positives = 22/56 (39%)

Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGAT 255
            P+ +         PPEW++NT Y +G  +  D  VY  +    +   +   K   
Sbjct: 180 GPVTESTETTAAAGPPEWSENTAYKVGDQVSYDGHVYVCIQAHTTVIGWEPPKTPA 235


>gi|240279230|gb|EER42735.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
          Length = 1104

 Score = 39.1 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 13/56 (23%), Positives = 22/56 (39%)

Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGAT 255
            P+ +         PPEW++NT Y +G  +  D  VY  +    +   +   K   
Sbjct: 180 GPVTESTETTAAAGPPEWSENTAYKVGDQVSYDGHVYVCIQAHTTVIGWEPPKTPA 235


>gi|225562313|gb|EEH10592.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1104

 Score = 39.1 bits (89), Expect = 2.3,   Method: Composition-based stats.
 Identities = 13/56 (23%), Positives = 22/56 (39%)

Query: 200 KPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGAT 255
            P+ +         PPEW++NT Y +G  +  D  VY  +    +   +   K   
Sbjct: 180 GPVTESTETTAAAGPPEWSENTAYKVGDQVSYDGHVYVCIQAHTTVIGWEPPKTPA 235


>gi|332828789|gb|EGK01481.1| hypothetical protein HMPREF9455_02314 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 623

 Score = 38.7 bits (88), Expect = 2.8,   Method: Composition-based stats.
 Identities = 27/207 (13%), Positives = 59/207 (28%), Gaps = 34/207 (16%)

Query: 228 YIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESASGAVAPYY 287
           YI A D  +R++        + + +   Y++   + +   +   +      A        
Sbjct: 394 YIGASDNTFRAIDIKTGKLVWEFPEVKGYIETRPLIYKDKIFFGAWDETMYALDKHTGRL 453

Query: 288 VWGDIKD--------------VSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYP 333
           +W  ++                + D    +   +  T   A      W MS W  +E   
Sbjct: 454 LWKWVEGRKGILYSPAAVWPVAAHDRVFFTAPDRVMTAVDANTGETIWRMSDWKVRE--- 510

Query: 334 SHVTFHNN--RL----------LFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYD----P 377
             +    +  RL           +S +      ++ ++ G  YD +              
Sbjct: 511 -TIGLSEDKERLYSKTMQDSVVCYSATSAKPQQIWAANVGYGYDHAPSMPVEKDSVVFGS 569

Query: 378 TKALTTAVTDFSASTIHWMHPFGEGVL 404
           TK       +     + W H  G  ++
Sbjct: 570 TKNGIIFAIEGKTGKLLWKHKVGNSII 596


>gi|329123905|ref|ZP_08252457.1| phage tail fiber protein [Haemophilus aegyptius ATCC 11116]
 gi|327468100|gb|EGF13587.1| phage tail fiber protein [Haemophilus aegyptius ATCC 11116]
          Length = 240

 Score = 38.7 bits (88), Expect = 3.1,   Method: Composition-based stats.
 Identities = 28/198 (14%), Positives = 69/198 (34%), Gaps = 19/198 (9%)

Query: 198 IFKPLDKGRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYV 257
           +FK LD+  +  +    PEW+   +Y+ G+ +  D   YR+L   ++ +    S    +V
Sbjct: 50  LFKRLDEKHTYLMQRGLPEWSATQDYTKGSCVQFDGVSYRALKNSKN-NSPNESDSQYWV 108

Query: 258 KDNNITWITVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317
           +     W   L+  ++ + +             D +  +   +++      + +     +
Sbjct: 109 R-----WGFALSEIARATLQQYGIVQLSSATNSDSETKAATSKAVK-TAYDKAVEAKTTA 162

Query: 318 VVSWFMSAWGEQEGYPSHVTFHNNRLLFSGS-KGDELSVYLSSFGAFYDFSLDGEYGCYD 376
                ++      G  S      NR++   + +  +  +Y S  G + +       G  D
Sbjct: 163 DGKVGLNGNESINGEKS----FENRIVAKRNIRISDNPIYASR-GDYLN------IGAND 211

Query: 377 PTKALTTAVTDFSASTIH 394
                    ++    T+ 
Sbjct: 212 GDCWFEYKSSNREIGTLR 229


>gi|12056574|gb|AAG47946.1|AF222787_1 cycloinulo-oligosaccharide fructanotransferase [Paenibacillus
           macerans]
          Length = 1333

 Score = 38.3 bits (87), Expect = 3.3,   Method: Composition-based stats.
 Identities = 42/256 (16%), Positives = 78/256 (30%), Gaps = 27/256 (10%)

Query: 86  DKKLQIVVVRSSTKWSPALFGKTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYI 145
           D K+++ +                 TP T                + + K++ P  L  +
Sbjct: 552 DGKIKLYLNGEEVASQATPVNVPI-TPSTES--------------LIIGKNNKPVELAGV 596

Query: 146 QDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQA-DTSTARITSDMKIFKPLDK 204
              +  S   DE+K          +++G +S   L          A I  D  +F   D+
Sbjct: 597 FSFNMFSGLLDEVKLHNKALTNQEILAGYESVKALHGGSIPKIPNADIDEDPSVFD-GDQ 655

Query: 205 GRSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITW 264
            R       P  W    +  I      + K +        G  +       +V D+ + W
Sbjct: 656 HRPQYHAMPPQNWMNEAHAPI----YYNGKYHLFYQHNPQGPFWHQIHWGHWVSDDMVNW 711

Query: 265 ITV---LNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSW 321
             V   L   + T     + + +  Y       +     + S++P  +T       +   
Sbjct: 712 ENVRPALAPEAGTLDPDGTWSGSAAYDRNGNPVLFYTAGNDSLSPNQRTGLATPADLSDP 771

Query: 322 FMSAWGEQEGYPSHVT 337
           ++  W   E YP  VT
Sbjct: 772 YLEKW---EKYPKPVT 784


>gi|254521915|ref|ZP_05133970.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
 gi|219719506|gb|EED38031.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
          Length = 1553

 Score = 38.3 bits (87), Expect = 3.3,   Method: Composition-based stats.
 Identities = 17/107 (15%), Positives = 36/107 (33%), Gaps = 10/107 (9%)

Query: 214  PPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSK 273
              EW   T Y  G ++  D  +YR+L      +      G          W  + + +S 
Sbjct: 950  ADEWVAGTTYPAGDFVRHDGTLYRAL-----AENVDVEPGTDP-----AVWEAIGDYTSV 999

Query: 274  TSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVS 320
                +A+ +++         +V++    ++  P       +   V S
Sbjct: 1000 GDALAAAISMSTKNASDIAAEVTRVDAVVAKLPADGGQAASTGQVAS 1046


>gi|309271529|ref|XP_003085345.1| PREDICTED: hypothetical protein LOC100503043 [Mus musculus]
          Length = 2318

 Score = 38.3 bits (87), Expect = 3.5,   Method: Composition-based stats.
 Identities = 46/335 (13%), Positives = 88/335 (26%), Gaps = 44/335 (13%)

Query: 107  KTYKTPYTFKDNKSLEYAVFGSTAVFVHKDHPPHHLLYIQDGDKISFTFDEIKFLPPPWL 166
            +    P+         +                  + ++ D    ++T  E +     W 
Sbjct: 1064 QAIAGPWAVSQVTDGSW-----------PAVQASGVSWVVDQATGTWTVAENQTGAVSWA 1112

Query: 167  GDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWA-----KNT 221
            G G I  +      + +   T+    T          K R          W       + 
Sbjct: 1113 GAGNIVSIGY---WTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQTSG 1169

Query: 222  NYSIGAYIVADDKVY-RSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280
               +G+   +  + +  ++    +  R G    A                +S ++ +S+S
Sbjct: 1170 ETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSS 1229

Query: 281  GAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHN 340
            G+      W   ++++ +        QS    + G    +    +W    G PS  +   
Sbjct: 1230 GS------WAGTRNLAGERSWTGTGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSV 1283

Query: 341  NRLLFSGSKGDELSVYLSSFGAFYDFSLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFG 400
            +    +G    + +  LS  G F    LD   G   P      A                
Sbjct: 1284 SEAQSAGRSWADSADQLS--GGFLVGPLDQANGESQPVSGELAASGVDQ----------- 1330

Query: 401  EGVLVGCDTSLWLLSISLSKGLSIDFRRVSGSGVY 435
                       W  S   S G S    R   +G  
Sbjct: 1331 -----TSGGGCWTGSGDQSGGESRLGPRDQSNGES 1360


>gi|159037262|ref|YP_001536515.1| chitin-binding domain-containing protein [Salinispora arenicola
           CNS-205]
 gi|157916097|gb|ABV97524.1| chitin-binding domain 3 protein [Salinispora arenicola CNS-205]
          Length = 338

 Score = 38.3 bits (87), Expect = 3.9,   Method: Composition-based stats.
 Identities = 27/164 (16%), Positives = 41/164 (25%), Gaps = 8/164 (4%)

Query: 161 LPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKGRSIRLGCHPPEWAKN 220
            P P       +     A  + + + T TA  TS            +      P  W   
Sbjct: 181 SPTPTASPTSTASPTPTASPTSTASPTPTASPTSTASPTPTGTPSPTSTGTPAPESWQVG 240

Query: 221 TNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWITVLNLSSKTSRESAS 280
           T Y IG  +  D   YR+     +   +   +           W  V    +        
Sbjct: 241 TTYQIGDEVTYDGVSYRARQAHTATPGWEPPRVPA-------LWTAVTPPPATGDPAPGD 293

Query: 281 G-AVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVSVVSWFM 323
           G AV   Y  GD             A  +   ++       W  
Sbjct: 294 GWAVGIAYQIGDEVTYDGVSYLARQAHTATPGWEPPHVPSLWIR 337


>gi|256376322|ref|YP_003099982.1| glycoside hydrolase family 6 [Actinosynnema mirum DSM 43827]
 gi|255920625|gb|ACU36136.1| glycoside hydrolase family 6 [Actinosynnema mirum DSM 43827]
          Length = 605

 Score = 37.9 bits (86), Expect = 4.8,   Method: Composition-based stats.
 Identities = 42/327 (12%), Positives = 72/327 (22%), Gaps = 23/327 (7%)

Query: 141 HLLYIQDGDKISFTFDEIKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFK 200
                  G  + F                    +  N                     + 
Sbjct: 243 KQARTAPGGNLVFQVVIYNLPGRDCAALASNGELGPNDLPRYKTEYIDKIAGILARPAYA 302

Query: 201 PLDKGRSIRLGCHPPEWAK----NTNYSIGAYIVADDKV-----YRSLTTGRSGDRFGYS 251
            L     I +   P          T       + A+        Y     G  G+ + Y 
Sbjct: 303 SLRIVAVIEIDSLPNLVTNVSPRPTQTPNCDTMKANQNYQNGVAYAVSKLGDIGNVYNYL 362

Query: 252 KGAT--YVKDNNITWITVLNLSSKTSRESASG--AVAPYYVWGDIKDVSKDGRSISVAPQ 307
                 ++   +         +S     S  G        V G I + +           
Sbjct: 363 DSGHHGWIGWGDPIPEYDNFHASAKMMASILGREGATKADVHGFITNTANYSALEEPFWT 422

Query: 308 SQTLFQAGVSVVSWFMSAWGEQEGYPSHVTFHNNRLLFSG-SKGDELSVYLSSFGAFYDF 366
              +              W +  G     T     L+ +G   G  + +  S  G     
Sbjct: 423 VDDVVGGQAVKEKSKWVDWNDFNGELGFATAFRQELVANGFDAGVGMLIDTSRNGWGGSG 482

Query: 367 SLDGEYGCYDPTKALTTAVTDFSASTIHWMHPFGEGV--------LVGCDTSLWLLSISL 418
               +    DP+  +  +  D      +W +  G G+            D  +W+     
Sbjct: 483 RPTAKSSSTDPSVYVDQSRIDKRIQKGNWCNQSGAGLGERPKAAPKPNIDAYVWIKPPGE 542

Query: 419 SKGLSIDFRRVSGSGVYA-CPPVSVGD 444
           S G S       G G    C P   G+
Sbjct: 543 SDGSSTQIPNNEGKGFDRMCDPTYGGN 569


>gi|281211601|gb|EFA85763.1| hypothetical protein PPL_00993 [Polysphondylium pallidum PN500]
          Length = 310

 Score = 36.8 bits (83), Expect = 9.6,   Method: Composition-based stats.
 Identities = 32/232 (13%), Positives = 53/232 (22%), Gaps = 39/232 (16%)

Query: 109 YKTP--YTFKDNKSLEYAVFGSTAVFVHKDHP-PHHLLYIQDGDKISFTFDE-------- 157
             TP  Y   D+ +  Y   G T   +H  +   H L Y +  D I++T D         
Sbjct: 9   ITTPSYYGITDSPA--YIQLGGTVYCIHHGYENNHELWYTKSNDLITWTADAQFVDVQTT 66

Query: 158 ------------IKFLPPPWLGDGMISGVKSNAKLSISQADTSTARITSDMKIFKPLDKG 205
                         F        G ++ VK                              
Sbjct: 67  FSPAAIVFNSIIYGFHNGSPNSSGDLNYVKVTGNSVTQDNPIHGLPEWKSSNSPSATVFN 126

Query: 206 RSIRLGCHPPEWAKNTNYSIGAYIVADDKVYRSLTTGRSGDRFGYSKGATYVKDNNITWI 265
             + L  H P                + K+  + +       + Y +        + +  
Sbjct: 127 NLMYLAYHGPN--------------NNGKLLLASSPDGVASNWSYKEVPGITITGSPSMA 172

Query: 266 TVLNLSSKTSRESASGAVAPYYVWGDIKDVSKDGRSISVAPQSQTLFQAGVS 317
           T         R +A G         D    +      +V         A  S
Sbjct: 173 TFNGKIYIVFRNTALGNGVYVTSTSDTNTWTTPTLIPNVQVSGDPKLTATAS 224


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.308    0.118    0.326 

Lambda     K      H
   0.267   0.0367    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,905,769,435
Number of Sequences: 14124377
Number of extensions: 387266902
Number of successful extensions: 853322
Number of sequences better than 10.0: 245
Number of HSP's better than 10.0 without gapping: 123
Number of HSP's successfully gapped in prelim test: 208
Number of HSP's that attempted gapping in prelim test: 852110
Number of HSP's gapped (non-prelim): 503
length of query: 578
length of database: 4,842,793,630
effective HSP length: 145
effective length of query: 433
effective length of database: 2,794,758,965
effective search space: 1210130631845
effective search space used: 1210130631845
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 84 (37.1 bits)
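
Note on the statistics footer: the bit values in parentheses and the effective lengths can be re-derived from the numbers printed above. The sketch below is a minimal illustration, not part of the BLAST output. It assumes the first Lambda/K/H block holds the ungapped parameters and the second the gapped ones (the usual order in this report layout), and it uses the standard Karlin-Altschul relations: bits = (Lambda * raw - ln K) / ln 2 for scores, bits = Lambda * raw / ln 2 for the X drop-offs, and E = search_space * 2^(-bits). All function and constant names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 578
DB_LETTERS = 4_842_793_630
DB_SEQS = 14_124_377
HSP_LEN = 145                                  # "effective HSP length"
LAMBDA_UNGAPPED, K_UNGAPPED = 0.308, 0.118     # assumed: first Lambda/K block
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0367        # assumed: second Lambda/K block


def bit_score(raw, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert a raw X drop-off value to bits (no K term)."""
    return lam * raw / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Edge-effect correction: shorten the query and every database sequence
# by the effective HSP length before forming the search space.
eff_query = QUERY_LEN - HSP_LEN
eff_db = DB_LETTERS - DB_SEQS * HSP_LEN
search_space = eff_query * eff_db

if __name__ == "__main__":
    print("effective length of query:   ", eff_query)
    print("effective length of database:", eff_db)
    print("effective search space:      ", search_space)
    print("S1 bits:", round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
    print("S2 bits:", round(bit_score(84, LAMBDA_GAPPED, K_GAPPED), 1))
    print("X1 bits:", round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))
    print("X2 bits:", round(dropoff_bits(38, LAMBDA_GAPPED), 1))
    print("X3 bits:", round(dropoff_bits(64, LAMBDA_GAPPED), 1))
    print("E for 39.5 bits:", expect(39.5, search_space))
```

The effective lengths and search space reproduce the footer exactly, and the S1, S2, X1, X2 and X3 conversions agree with the printed bit values to one decimal place. The E-value computed for 39.5 bits lands close to, but not exactly on, the Expect values reported for those hits; the hits were scored with composition-based statistics and the printed Lambda and K are rounded, so a small difference is expected.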