BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012468
         (463 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224125166|ref|XP_002319516.1| predicted protein [Populus trichocarpa]
 gi|222857892|gb|EEE95439.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  645 bits (1663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 305/424 (71%), Positives = 359/424 (84%), Gaps = 4/424 (0%)

Query: 1   MSEIKAETTHHHPLIISE-PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW 59
           M+EIKA+    H L I+E  D+S Q+    +   R+ASLDI+RGL VALMILVD AGG+W
Sbjct: 1   MAEIKADIALDHRLTIAEVTDISAQKPDPKI---RVASLDIYRGLTVALMILVDDAGGEW 57

Query: 60  PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
           P+I HAPWNGCNLADFVMPFFLFIVG+AI LA KRI  R  AV++VI RTLKLLFWGI+L
Sbjct: 58  PKIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFKRITSRHHAVRRVIVRTLKLLFWGIML 117

Query: 120 QGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR 179
           QGGFSHAPD+LTYGVD++ IR CG+LQRIA +YL+V+L+EIFTK  Q ++   G  SI++
Sbjct: 118 QGGFSHAPDKLTYGVDMKKIRWCGILQRIAFAYLVVALMEIFTKKKQTRELPPGWLSIYK 177

Query: 180 LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNA 239
           LY   WLM AC+LV+YLA++YGTYVP WQFT+ ++DSADYGKVF V C VR KL+PPCNA
Sbjct: 178 LYSSQWLMGACILVIYLAVIYGTYVPHWQFTVNDRDSADYGKVFTVECAVRGKLDPPCNA 237

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VG+IDR++LGINHMY HPAW+RS+ACT++SP+EGP R  APSWC APFEPEG+LSS+S++
Sbjct: 238 VGFIDREILGINHMYQHPAWKRSEACTENSPYEGPFRTSAPSWCKAPFEPEGILSSISAV 297

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           LSTIIGVHFGHV+++ +GH ARLK W+ MGFALLI GL LHFT+AIPLNKQLYT SYVCV
Sbjct: 298 LSTIIGVHFGHVLVYMRGHAARLKHWIVMGFALLILGLVLHFTHAIPLNKQLYTFSYVCV 357

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 419
           TSGAAALVFS+IYALVDIW  K  F PLAWIGMNAMLVYVMAAEGIFAGFINGWYY DPH
Sbjct: 358 TSGAAALVFSSIYALVDIWGWKCIFQPLAWIGMNAMLVYVMAAEGIFAGFINGWYYNDPH 417

Query: 420 NTLV 423
           NTL+
Sbjct: 418 NTLI 421


>gi|18421151|ref|NP_568500.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
           thaliana]
 gi|14334592|gb|AAK59474.1| unknown protein [Arabidopsis thaliana]
 gi|26983902|gb|AAN86203.1| unknown protein [Arabidopsis thaliana]
 gi|332006336|gb|AED93719.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
           thaliana]
          Length = 472

 Score =  635 bits (1639), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 307/425 (72%), Positives = 357/425 (84%), Gaps = 2/425 (0%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQR--LASLDIFRGLAVALMILVDHAGGD 58
           M+EIK E +H   L+  + D S    +  L   R  LASLDIFRGL VALMILVD AGGD
Sbjct: 1   MAEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGD 60

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
           WP I+HAPWNGCNLADFVMPFFLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+L
Sbjct: 61  WPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLL 120

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           LQGGFSHAPDELTYGVDV M+R CG+LQRIALSYL+V+LVEIFTKD  +++ S GRFSIF
Sbjct: 121 LQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIF 180

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
           + Y WHW++AA VLV+YLA LYGTYVPDW+F + +KDS  YGK+ +V+CGVR KLNPPCN
Sbjct: 181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
           AVGY+DR+VLGINHMYHHPAWRRSKACT DSP+EG +R+DAPSWC APFEPEG+LSS+S+
Sbjct: 241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           ILSTIIGVHFGH+I+H KGH ARLK W++ G  LL  GLTLHFT+ +PLNKQLY+ SY+C
Sbjct: 301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYIC 360

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDP 418
           VTSGAAALVFS++Y+LVDI   K+ FLPL WIGMNAMLVYVM AEGI A F NGWYY  P
Sbjct: 361 VTSGAAALVFSSLYSLVDILEWKHMFLPLKWIGMNAMLVYVMGAEGILAAFFNGWYYRHP 420

Query: 419 HNTLV 423
           HNTL+
Sbjct: 421 HNTLI 425


>gi|297812935|ref|XP_002874351.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320188|gb|EFH50610.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  625 bits (1613), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 291/407 (71%), Positives = 346/407 (85%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           ++  ++  +    QRLASLDIFRGL VALMILVD AGGDWP I+HAPWNGCNLADFVMPF
Sbjct: 3   EIKVERSLAGNNRQRLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMPF 62

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           FLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+LLQGGFSHAPDEL+YGVDV M+
Sbjct: 63  FLFIVGVSIALSLKRISNKFEACKKVCFRTCKLLFWGLLLQGGFSHAPDELSYGVDVTMM 122

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R CG+LQRIALSYL+V+L+EIFTKD+ +++ S GR SIF+ Y  HW++   VLV+YLA L
Sbjct: 123 RFCGILQRIALSYLVVALIEIFTKDLHEENLSTGRLSIFKSYYCHWIVGVSVLVIYLATL 182

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
           YGTYVPDW+F + +KDS  YGK+ +V+CGVR KLNPPCNAVGY+DR+VL INHMYHHPAW
Sbjct: 183 YGTYVPDWEFVVNDKDSILYGKIQSVSCGVRGKLNPPCNAVGYVDRQVLVINHMYHHPAW 242

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
           RRSKA T DSP+EG LR+DAPSWCHAPFEPEG+LSS+S+ILSTIIGVHFGH+IIH +GHL
Sbjct: 243 RRSKAFTDDSPYEGALRQDAPSWCHAPFEPEGILSSISAILSTIIGVHFGHIIIHLQGHL 302

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           ARLK W++ G   L  GLTLHFT+ +PLNKQLY+ SY+CVTSGAAALVFS++Y+LVD+  
Sbjct: 303 ARLKHWISTGLVFLTLGLTLHFTHLMPLNKQLYSFSYICVTSGAAALVFSSLYSLVDVLE 362

Query: 380 LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL 426
            K+ FLPL WIGMNAMLVYVM AEGI A F NGWYY  PHNTL+ ++
Sbjct: 363 WKHMFLPLKWIGMNAMLVYVMGAEGILAAFFNGWYYRHPHNTLITWI 409


>gi|359487632|ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Vitis vinifera]
          Length = 499

 Score =  625 bits (1611), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 313/445 (70%), Positives = 363/445 (81%), Gaps = 8/445 (1%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
           + H LIIS+     ++     KT+RLASLDIFRGL VALMILVD AGG+WP I HAPWNG
Sbjct: 41  NQHRLIISDSGFPPEERPQ--KTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNG 98

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           CNLADFVMPFFLFIVGVAIALALKRIPDR  A+KKV  RTLKLLFWG+LLQG F+  PD+
Sbjct: 99  CNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDK 158

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD++ IR CG+LQ IAL+YL+V+L+EI TK  Q KD S G+FSIF+LYCWHWLM A
Sbjct: 159 LTYGVDMKKIRWCGILQXIALAYLVVALLEITTKKAQAKDLSPGQFSIFKLYCWHWLMGA 218

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           CVL+VY+A+ YGTYVPDW FT+ ++DSADYGKV  V CG R KL+PPCN VGYIDR++LG
Sbjct: 219 CVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPCNVVGYIDREILG 278

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
           +NHMY HPAW RSKAC + SP +GP RKDAPSWC+APFEPEG+LSS+S+ILSTIIGVHFG
Sbjct: 279 MNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFG 338

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           HV++H KGH  RLK WV MGFALL+ G+TLHFT AIPLNKQLYT SYVCVTSGAAALVFS
Sbjct: 339 HVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYVCVTSGAAALVFS 398

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFII 429
             Y LVD+W +++  LPL WIGMNAMLVYVMAAEG+FA FINGWYYGDPHNTL+ ++   
Sbjct: 399 FFYILVDVWGMRFLCLPLEWIGMNAMLVYVMAAEGVFAKFINGWYYGDPHNTLINWIQQH 458

Query: 430 SYILHSFLWELRK---FLYVQFCNL 451
            +I    +W  RK    LYV F  +
Sbjct: 459 IFIQ---VWHSRKVGILLYVIFAEI 480


>gi|449454063|ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449490878|ref|XP_004158735.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 490

 Score =  615 bits (1587), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 313/454 (68%), Positives = 362/454 (79%), Gaps = 7/454 (1%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           M EIK ++T HHP  +   D SD      +K++RLASLDIFRGL VALMILVD AGG+WP
Sbjct: 22  MEEIKPDSTSHHPHRLISVD-SDALLPKPVKSKRLASLDIFRGLTVALMILVDDAGGEWP 80

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
            I HAPW GCNLADFVMPFFLFIVG+AIALALKRIP++  A++KV  RTLKLLFWG+LLQ
Sbjct: 81  MIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQ 140

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
           GG+SHAPD+LTYGVDVR IRL G+LQRIAL+YL+V+ VE+ ++  Q   Q    FSIF+ 
Sbjct: 141 GGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRKTQSNVQPFNHFSIFKS 200

Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
           Y W+WL+ AC+LVVY ALLYG YVPDWQFT+ + +S  YG+ F V CGVR  L+PPCNAV
Sbjct: 201 YFWNWLVGACILVVYFALLYGIYVPDWQFTVTDSESVYYGRNFTVACGVRGNLDPPCNAV 260

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GYIDRKVLGINH+Y HPAWRRS+ACT++SP+ G  R +APSWC APFEPEG+LSS+S+IL
Sbjct: 261 GYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAIL 320

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           STIIGVHFGHV+IH + H ARLKQWVTMGF LLI GL LHFT+AIPLNKQLYT SYVCVT
Sbjct: 321 STIIGVHFGHVLIHFQDHSARLKQWVTMGFTLLILGLVLHFTHAIPLNKQLYTFSYVCVT 380

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420
           SGAAALVFS  Y LVDIW L+  FLPL WIGMNAMLVYVMAA GIFAGFINGWYY DPHN
Sbjct: 381 SGAAALVFSVFYTLVDIWGLRPLFLPLEWIGMNAMLVYVMAAAGIFAGFINGWYYDDPHN 440

Query: 421 TLVCFLFIISYILHSFLWELRK---FLYVQFCNL 451
           TL+   + I   L   +W  +K    LYV F  +
Sbjct: 441 TLI---YWIKKHLFIGVWHSKKVGILLYVIFAEI 471


>gi|242059773|ref|XP_002459032.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
 gi|241931007|gb|EES04152.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
          Length = 481

 Score =  605 bits (1561), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 281/404 (69%), Positives = 338/404 (83%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D +D  EK+  +++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 31  DEADDNEKAPRRSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 90

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           FLFIVG+AI L+LKRIPDR  AV++V+ RTLKLLFWGILLQG +SHAPDELTYGVD++ +
Sbjct: 91  FLFIVGMAIPLSLKRIPDRGRAVRRVVIRTLKLLFWGILLQGRYSHAPDELTYGVDMKHV 150

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R  G+LQRIAL+YL+V+++EI TKD + +DQS   FSIFR+Y   W++A C+LV+YLAL+
Sbjct: 151 RWGGILQRIALAYLVVAVLEIVTKDAKIQDQSSSGFSIFRMYLSQWIVACCILVIYLALV 210

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
           YG YVPDW+F + N DS +YGKV  VTCG R  L+PPCNAVGYIDRKVLGINHMY  PAW
Sbjct: 211 YGIYVPDWEFRVRNVDSPNYGKVLTVTCGTRGILDPPCNAVGYIDRKVLGINHMYQKPAW 270

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
           RR +ACT DSP EG  R DAP+WC APFEPEG+LSS+S++LSTIIGVH+GHV++H K H 
Sbjct: 271 RRHRACTDDSPHEGHFRNDAPAWCVAPFEPEGILSSLSAVLSTIIGVHYGHVLVHMKSHT 330

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
            RL+QWVTMG  LL+ G+ LHF++AIPLNKQLYT SY+CVT+GAA +VFS +Y LVDI +
Sbjct: 331 DRLRQWVTMGICLLVLGIILHFSHAIPLNKQLYTFSYICVTAGAAGVVFSVLYFLVDIVS 390

Query: 380 LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           L+Y F PL WIGMNAMLVYVMAAEGIF GF+NGWYY   +NTLV
Sbjct: 391 LRYVFAPLQWIGMNAMLVYVMAAEGIFEGFLNGWYYEGTNNTLV 434


>gi|356572978|ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 464

 Score =  600 bits (1546), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 310/424 (73%), Positives = 356/424 (83%), Gaps = 8/424 (1%)

Query: 1   MSEIKAETTHHHPLIISEP-DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW 59
           M+EIK E    H L +SE   +SD   K+  KT+R+ASLDIFRGL VALMILVD AGG W
Sbjct: 1   MAEIKGE----HSLNVSEELPLSD---KNLPKTKRVASLDIFRGLTVALMILVDDAGGQW 53

Query: 60  PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
           P I HAPWNGCNLADFVMPFFLFIVG+AI LALKRIP+R  AVKKVI RTLKLLFWG+LL
Sbjct: 54  PMIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLL 113

Query: 120 QGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR 179
           QGGFSHAPD LTYGVD++ IR CG+LQRIAL+YL+V+LVEIF++  Q +D      SIF+
Sbjct: 114 QGGFSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSAQARDPEPTHLSIFK 173

Query: 180 LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNA 239
           LY WHWL+ AC+L VYLALLYG +VPDWQFT+ N DS   G    VTCGVR KL+PPCNA
Sbjct: 174 LYYWHWLVGACILAVYLALLYGIHVPDWQFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNA 233

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VGYIDR+V+GINHMY  PAWRRS+ACT++SP+EGP +K+APSWC+APFEPEG+LSS+S+I
Sbjct: 234 VGYIDREVIGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAI 293

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           LSTIIG+HFGHV+IH + H +RLK W+ +G ALL  GL LHFT+AIPLNKQLYTLSYVCV
Sbjct: 294 LSTIIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCV 353

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 419
           TSGAAAL+FSA Y +VDIW L + FLPL WIGMNAMLVYVMAAEGIFAGFINGWYYGDPH
Sbjct: 354 TSGAAALLFSAFYIMVDIWGLTFLFLPLKWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 413

Query: 420 NTLV 423
           NTLV
Sbjct: 414 NTLV 417


>gi|326493552|dbj|BAJ85237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326511587|dbj|BAJ91938.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 486

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 283/423 (66%), Positives = 346/423 (81%), Gaps = 6/423 (1%)

Query: 7   ETTHHHPLIISEPDVSDQQEKS-----HLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
           E    H    +  D+ D  EK         ++R+ASLD+FRGL VALMILVD AGG+WP 
Sbjct: 17  EDPDRHRTHEAADDLDDDGEKKASRPSSSSSRRVASLDVFRGLTVALMILVDGAGGEWPV 76

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG 121
           I HAPW+GCNLADFVMPFFLFIVG+AI L+LKRIPDR  AV++V+ RTLKLLFWGILLQG
Sbjct: 77  IGHAPWHGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGWAVRRVVIRTLKLLFWGILLQG 136

Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV-GRFSIFRL 180
           G+SHAPDELTYGVD++ IR CG+LQRIAL+YL+V+++EI TKD + +DQS  G FS+FRL
Sbjct: 137 GYSHAPDELTYGVDMKHIRWCGILQRIALAYLVVAVIEIATKDARVQDQSSSGFFSVFRL 196

Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
           Y   W++A C+L++YL+L+YG YVPDW+FT+ N DS +YGKV  VTCG R  L+PPCNAV
Sbjct: 197 YLSQWIVACCILLIYLSLVYGVYVPDWEFTVRNVDSPNYGKVLTVTCGTRGNLSPPCNAV 256

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GYIDRKVLGINH+Y  PAWRR + CT DSP EGP ++DAP+WC +PFEPEGLLSS S++L
Sbjct: 257 GYIDRKVLGINHLYQKPAWRRHRDCTDDSPHEGPFKRDAPAWCASPFEPEGLLSSFSAVL 316

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           STIIGVH+GHV++H K H+ RLKQWVTMG ALL+ G+ LHF++AIPLNKQLYTLSY+CVT
Sbjct: 317 STIIGVHYGHVLVHMKSHMDRLKQWVTMGVALLLLGIILHFSHAIPLNKQLYTLSYICVT 376

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420
           +GAA ++FS +Y LVD+ +L+Y F PL W+GMNAMLVYVMAA GIF GF+NGWYY  P+N
Sbjct: 377 AGAAGIIFSMLYFLVDVVHLRYVFAPLRWVGMNAMLVYVMAAAGIFEGFLNGWYYDGPNN 436

Query: 421 TLV 423
           TLV
Sbjct: 437 TLV 439


>gi|356504028|ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 465

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 310/425 (72%), Positives = 355/425 (83%), Gaps = 9/425 (2%)

Query: 1   MSEIKAETTHHHPLIISE--PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD 58
           M+EIK E    H L +S+  P+VSD   K+  KT+R+ASLDIFRGL VALMILVD AG  
Sbjct: 1   MAEIKGE----HSLNVSQELPEVSD---KNLPKTKRVASLDIFRGLTVALMILVDDAGEQ 53

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
           WP I HAPWNGCNLADFVMPFFLFIVG+AI LALKRIP+R  AVKKVI RTLKLLFWG+L
Sbjct: 54  WPMIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLL 113

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           LQGGFSHAPD LTYGVD++ IR CG+LQRIAL+YL+V+LVEIF++  Q +D      SIF
Sbjct: 114 LQGGFSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSTQARDPEPTHLSIF 173

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
            LY WHWL+ AC+LVVYLALLYG +VPDW FT+ N DS   G    VTCGVR KL+PPCN
Sbjct: 174 NLYYWHWLVGACILVVYLALLYGIHVPDWGFTVHNPDSIYNGTTLTVTCGVRGKLDPPCN 233

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
           AVGYIDR+VLGINHMY  PAWRRS+ACT++SP+EGP +K+APSWC+APFEPEG+LSS+S+
Sbjct: 234 AVGYIDREVLGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISA 293

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           ILSTIIG+HFGHV+IH + H +RLK W+ +G ALL  GL LHFT+AIPLNKQLYTLSYVC
Sbjct: 294 ILSTIIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVC 353

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDP 418
           VTSGAAAL+FSA Y  VDIW L + FLPL WIGMNAMLVYVMAAEGIFAGFINGWYYGDP
Sbjct: 354 VTSGAAALLFSAFYITVDIWGLTFLFLPLKWIGMNAMLVYVMAAEGIFAGFINGWYYGDP 413

Query: 419 HNTLV 423
           HNTL+
Sbjct: 414 HNTLI 418


>gi|296089693|emb|CBI39512.3| unnamed protein product [Vitis vinifera]
          Length = 481

 Score =  594 bits (1531), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 302/445 (67%), Positives = 348/445 (78%), Gaps = 26/445 (5%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
           + H LIIS+     ++     KT+RLASLDIFRGL VALMILVD AGG+WP I HAPWNG
Sbjct: 41  NQHRLIISDSGFPPEERPQ--KTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNG 98

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           CNLADFVMPFFLFIVGVAIALALKRIPDR  A+KKV  RTLKLLFWG+LLQG F+  PD+
Sbjct: 99  CNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDK 158

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD++ IR CG+LQ                   Q KD S G+FSIF+LYCWHWLM A
Sbjct: 159 LTYGVDMKKIRWCGILQ------------------AQAKDLSPGQFSIFKLYCWHWLMGA 200

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           CVL+VY+A+ YGTYVPDW FT+ ++DSADYGKV  V CG R KL+PPCN VGYIDR++LG
Sbjct: 201 CVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPCNVVGYIDREILG 260

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
           +NHMY HPAW RSKAC + SP +GP RKDAPSWC+APFEPEG+LSS+S+ILSTIIGVHFG
Sbjct: 261 MNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFG 320

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           HV++H KGH  RLK WV MGFALL+ G+TLHFT AIPLNKQLYT SYVCVTSGAAALVFS
Sbjct: 321 HVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYVCVTSGAAALVFS 380

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFII 429
             Y LVD+W +++  LPL WIGMNAMLVYVMAAEG+FA FINGWYYGDPHNTL+ ++   
Sbjct: 381 FFYILVDVWGMRFLCLPLEWIGMNAMLVYVMAAEGVFAKFINGWYYGDPHNTLINWIQQH 440

Query: 430 SYILHSFLWELRK---FLYVQFCNL 451
            +I    +W  RK    LYV F  +
Sbjct: 441 IFIQ---VWHSRKVGILLYVIFAEI 462


>gi|212723192|ref|NP_001131974.1| uncharacterized protein LOC100193372 [Zea mays]
 gi|194693076|gb|ACF80622.1| unknown [Zea mays]
 gi|413951397|gb|AFW84046.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
          Length = 484

 Score =  583 bits (1503), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 279/410 (68%), Positives = 340/410 (82%), Gaps = 6/410 (1%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D +D  EK+   ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 28  DEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 87

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           FLFIVG+A+ LALKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPDEL YGVD+R +
Sbjct: 88  FLFIVGMAVPLALKRIPDRGRAVRRVVVRTLKLLFWGILLQGGYSHAPDELAYGVDMRHV 147

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQ---SVGRFS-IFRLYCWHWLMAACVLV 193
           R  G+LQRIAL+YL+V+++E+ TKD     +DQ   S GRFS +FR+Y   W++A C+LV
Sbjct: 148 RWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRFSRVFRMYLSQWIVACCILV 207

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           VYL+L YG YVPDW+F + N DS DYGKV  V CG R  L+PPCNAVGYIDR+VLGINHM
Sbjct: 208 VYLSLAYGVYVPDWEFRVRNADSPDYGKVLTVRCGTRGALDPPCNAVGYIDRRVLGINHM 267

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  PAWRR +ACT DSP EGP R+DAP+WC APFEPEG+LSS+S++LST++GVH+GHV++
Sbjct: 268 YQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEGILSSLSAVLSTVVGVHYGHVLV 327

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H K H  RL+QWVTMG ALL+ G+ LHF++AIPLNKQLYT SY+CVT+GAA +VFSA+Y 
Sbjct: 328 HMKSHTDRLRQWVTMGVALLVLGIILHFSHAIPLNKQLYTFSYICVTAGAAGVVFSALYF 387

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           LVD+ +L+Y F PL W+GMNAMLVYVMAAEG+F GF+NGWYY   +NTLV
Sbjct: 388 LVDVVSLRYAFAPLRWVGMNAMLVYVMAAEGVFEGFLNGWYYESTNNTLV 437


>gi|357126662|ref|XP_003565006.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 485

 Score =  581 bits (1498), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 273/393 (69%), Positives = 332/393 (84%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           +++R+ASLD+FRGL VALMILVD AGG+WP I HAPW+GCNLADFVMPFFLFIVG+AI L
Sbjct: 46  RSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWDGCNLADFVMPFFLFIVGMAIPL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           +LKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPDEL YGVD++ IR CG+LQRIA 
Sbjct: 106 SLKRIPDRGRAVRRVVIRTLKLLFWGILLQGGYSHAPDELAYGVDMKHIRWCGILQRIAF 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YL+V+++EI TKD   +DQS   FSIFR+Y   W++A C+L++YL+L+YG YVPDW+F 
Sbjct: 166 AYLVVAVIEIATKDANIQDQSSSGFSIFRMYFSQWIVACCILLIYLSLVYGIYVPDWEFR 225

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           + N DS +YGKV  VTCG R KL+PPCNAVGYIDRKVLGINH+Y  PAWRR +ACT DSP
Sbjct: 226 VRNVDSPNYGKVLTVTCGTRGKLSPPCNAVGYIDRKVLGINHLYQKPAWRRHRACTDDSP 285

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
            EGP + DAP+WC +PFEPEGLLSS S++LSTIIGVH+GHV++H K H+ RLKQWVTMG 
Sbjct: 286 HEGPFKSDAPAWCASPFEPEGLLSSFSAVLSTIIGVHYGHVLVHMKSHMDRLKQWVTMGI 345

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
           ALL+ G+ LHF++AIPLNKQLYT SY+CVT+GAA +VFS +Y LVD+ +L Y F PL WI
Sbjct: 346 ALLLLGIILHFSHAIPLNKQLYTFSYICVTAGAAGIVFSMLYFLVDMVSLGYVFAPLRWI 405

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           GMNAMLVY+MAA G+F GF+NGWYY   +NTLV
Sbjct: 406 GMNAMLVYIMAAAGVFEGFLNGWYYDGTNNTLV 438


>gi|413951398|gb|AFW84047.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
          Length = 503

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 279/429 (65%), Positives = 340/429 (79%), Gaps = 25/429 (5%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D +D  EK+   ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 28  DEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 87

Query: 80  FLFIVGVAIALALK-------------------RIPDRADAVKKVIFRTLKLLFWGILLQ 120
           FLFIVG+A+ LALK                   RIPDR  AV++V+ RTLKLLFWGILLQ
Sbjct: 88  FLFIVGMAVPLALKVRRRRRSSRPSVVHAMHAHRIPDRGRAVRRVVVRTLKLLFWGILLQ 147

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQ---SVGRF 175
           GG+SHAPDEL YGVD+R +R  G+LQRIAL+YL+V+++E+ TKD     +DQ   S GRF
Sbjct: 148 GGYSHAPDELAYGVDMRHVRWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRF 207

Query: 176 S-IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
           S +FR+Y   W++A C+LVVYL+L YG YVPDW+F + N DS DYGKV  V CG R  L+
Sbjct: 208 SRVFRMYLSQWIVACCILVVYLSLAYGVYVPDWEFRVRNADSPDYGKVLTVRCGTRGALD 267

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           PPCNAVGYIDR+VLGINHMY  PAWRR +ACT DSP EGP R+DAP+WC APFEPEG+LS
Sbjct: 268 PPCNAVGYIDRRVLGINHMYQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEGILS 327

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
           S+S++LST++GVH+GHV++H K H  RL+QWVTMG ALL+ G+ LHF++AIPLNKQLYT 
Sbjct: 328 SLSAVLSTVVGVHYGHVLVHMKSHTDRLRQWVTMGVALLVLGIILHFSHAIPLNKQLYTF 387

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWY 414
           SY+CVT+GAA +VFSA+Y LVD+ +L+Y F PL W+GMNAMLVYVMAAEG+F GF+NGWY
Sbjct: 388 SYICVTAGAAGVVFSALYFLVDVVSLRYAFAPLRWVGMNAMLVYVMAAEGVFEGFLNGWY 447

Query: 415 YGDPHNTLV 423
           Y   +NTLV
Sbjct: 448 YESTNNTLV 456


>gi|222619812|gb|EEE55944.1| hypothetical protein OsJ_04649 [Oryza sativa Japonica Group]
          Length = 846

 Score =  568 bits (1464), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 273/392 (69%), Positives = 329/392 (83%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+
Sbjct: 408 SRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLS 467

Query: 92  LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
           LKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ +R CG+LQRIAL+
Sbjct: 468 LKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALA 527

Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
           YL+V+++EI TK+ + +DQS   FSIFR+Y   W++A C+LV+YL+L+YG YVPDW F +
Sbjct: 528 YLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRV 587

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
            +  + ++GK+  VTCG R KL+PPCNAVGYIDRKVLGINHMYH PAWRR K CT DSP 
Sbjct: 588 SDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPH 647

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
           EGP + D+P+WC+APFEPEGLLSS+S++LSTIIGVH+GHV++H K H  RLKQW  MG  
Sbjct: 648 EGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGIT 707

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIG 391
           LLI GLTLHF++AIPLNKQLYT SY+CVT+GAA +VF   Y LVDI NL YPF PL W G
Sbjct: 708 LLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVDILNLHYPFAPLKWTG 767

Query: 392 MNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           MNAMLVYVMAA GIF GF+NGWYY   +NTLV
Sbjct: 768 MNAMLVYVMAAAGIFEGFLNGWYYEGTNNTLV 799


>gi|224123608|ref|XP_002330163.1| predicted protein [Populus trichocarpa]
 gi|222871619|gb|EEF08750.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  567 bits (1461), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 286/423 (67%), Positives = 340/423 (80%), Gaps = 2/423 (0%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           M++IKA  ++   L+I+  D +          +R+ASLDIFRGL VALMILVD AGG+WP
Sbjct: 1   MADIKAYISYAKRLLIA--DGTHFSAPKPDPERRVASLDIFRGLTVALMILVDDAGGEWP 58

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
           ++ HAPW+G NLADFVMPFFLFIVG+AI L  K I  R  AVKK+I RTLKLLFWGI+LQ
Sbjct: 59  KMGHAPWHGSNLADFVMPFFLFIVGMAIPLTFKGITSRDHAVKKMIVRTLKLLFWGIMLQ 118

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
           GGFSHAPD+L+YGVD++ IR CG+LQRIA +YL+++L+EIFTK  Q KD   GR SIFRL
Sbjct: 119 GGFSHAPDKLSYGVDMKKIRWCGILQRIAFAYLVMALMEIFTKKDQTKDLPPGRLSIFRL 178

Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
           Y   WL+ AC+LVVYLA++YG YVP WQFT+ +++S+DYGKVF V C VR KL+P CNA+
Sbjct: 179 YGSQWLVGACILVVYLAVIYGMYVPHWQFTVNDEESSDYGKVFTVECAVRGKLDPACNAI 238

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
            YIDRK+LGINH+Y HPAW+RS+ACT+ S +E P +  AP+WC APFEP+G+LSS+SS+L
Sbjct: 239 AYIDRKILGINHLYQHPAWKRSEACTEASLYEAPFQTSAPTWCKAPFEPDGILSSISSVL 298

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           STI G HFGHV +H KG  ARLK W  MG ALLI GL LHFT+A+PLNKQLYT SYVCVT
Sbjct: 299 STITGAHFGHVHVHLKGDTARLKHWTVMGLALLILGLVLHFTHAMPLNKQLYTFSYVCVT 358

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420
           SGAAALVFSAIY LVD+W  K  FLP  WIGMNAMLVYVMAAEGIFAGFINGWYY DPHN
Sbjct: 359 SGAAALVFSAIYILVDMWGRKSMFLPFQWIGMNAMLVYVMAAEGIFAGFINGWYYNDPHN 418

Query: 421 TLV 423
           TL+
Sbjct: 419 TLI 421


>gi|357511851|ref|XP_003626214.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355501229|gb|AES82432.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 483

 Score =  562 bits (1448), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 293/439 (66%), Positives = 345/439 (78%), Gaps = 21/439 (4%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVA------------- 47
           M EI  E + H   ++SE +    +E    K +R+ASLDIFRGL VA             
Sbjct: 1   MEEIIGEHSVH---VVSEVEPVSAKELPK-KVKRVASLDIFRGLTVADGDLTVFVAVKYR 56

Query: 48  ---LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK 104
              LMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+LK+IP++  AVKK
Sbjct: 57  AKQLMILVDDAGGEWPAIGHAPWNGCNLADFVMPFFLFIVGMAIPLSLKKIPNKLLAVKK 116

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
           VI RTLKLLFWG+LLQGG+SHAPD L+YGVD++ IR CG+LQRIAL+YL+V+LVEI ++ 
Sbjct: 117 VIVRTLKLLFWGLLLQGGYSHAPDHLSYGVDMKHIRWCGILQRIALAYLVVALVEIISRS 176

Query: 165 VQDKDQ-SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF 223
            QD+D       SIF LY WHWL+AAC+LVVY+ LLYG +VPDWQFT+ N DS   G  F
Sbjct: 177 RQDRDDPEPTNLSIFTLYYWHWLVAACILVVYMPLLYGIHVPDWQFTVHNPDSIYNGTTF 236

Query: 224 NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC 283
            VTCGVR KL+PPCNAVGYIDR+VLGINH+Y  PA RRS+ACT   P+EGP +K AP+WC
Sbjct: 237 TVTCGVRGKLDPPCNAVGYIDREVLGINHVYKKPASRRSEACTVKPPYEGPFKKTAPAWC 296

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
           +APFEPEG+LSS+S+ILSTIIG+H+GHV+IH + HL+RLKQW+ +G ALL  G  LHF++
Sbjct: 297 YAPFEPEGILSSISAILSTIIGLHYGHVLIHLQDHLSRLKQWILLGLALLTLGFILHFSH 356

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAE 403
            IPLNKQLYTLSYVCVTSGAAALVFSA Y +VDIW LK  F+P  WIGMNAMLVYVMAAE
Sbjct: 357 VIPLNKQLYTLSYVCVTSGAAALVFSAFYVMVDIWGLKLLFIPFKWIGMNAMLVYVMAAE 416

Query: 404 GIFAGFINGWYYGDPHNTL 422
           GIFAGFINGWYY DP NTL
Sbjct: 417 GIFAGFINGWYYDDPRNTL 435


>gi|115442029|ref|NP_001045294.1| Os01g0931100 [Oryza sativa Japonica Group]
 gi|57899654|dbj|BAD87323.1| unknown protein [Oryza sativa Japonica Group]
 gi|57900117|dbj|BAD88179.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534825|dbj|BAF07208.1| Os01g0931100 [Oryza sativa Japonica Group]
 gi|215697092|dbj|BAG91086.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 488

 Score =  560 bits (1443), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 273/392 (69%), Positives = 329/392 (83%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+
Sbjct: 50  SRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLS 109

Query: 92  LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
           LKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ +R CG+LQRIAL+
Sbjct: 110 LKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALA 169

Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
           YL+V+++EI TK+ + +DQS   FSIFR+Y   W++A C+LV+YL+L+YG YVPDW F +
Sbjct: 170 YLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRV 229

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
            +  + ++GK+  VTCG R KL+PPCNAVGYIDRKVLGINHMYH PAWRR K CT DSP 
Sbjct: 230 SDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPH 289

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
           EGP + D+P+WC+APFEPEGLLSS+S++LSTIIGVH+GHV++H K H  RLKQW  MG  
Sbjct: 290 EGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGIT 349

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIG 391
           LLI GLTLHF++AIPLNKQLYT SY+CVT+GAA +VF   Y LVDI NL YPF PL W G
Sbjct: 350 LLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVDILNLHYPFAPLKWTG 409

Query: 392 MNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           MNAMLVYVMAA GIF GF+NGWYY   +NTLV
Sbjct: 410 MNAMLVYVMAAAGIFEGFLNGWYYEGTNNTLV 441


>gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa]
 gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  523 bits (1347), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 256/435 (58%), Positives = 317/435 (72%), Gaps = 11/435 (2%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
            D+Q     K++R+A+LD FRGL + LMILVD AGG +P I H+PWNGC LADFVMPFFL
Sbjct: 51  GDRQPVVKQKSKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFL 110

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           FIVGVAIALA KRIP R DAVKK+I RTLKLLFWG+LLQGG+SHAP +L YGVD+++IR 
Sbjct: 111 FIVGVAIALAFKRIPKRRDAVKKIILRTLKLLFWGVLLQGGYSHAPSDLAYGVDMKLIRW 170

Query: 142 CGVLQ-RIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
            G+LQ RIAL Y++V+L+E +  K+ Q  +     F+IF  Y W W+      V+Y+   
Sbjct: 171 FGILQQRIALVYMVVALIEALIPKNRQTIEPD--HFTIFTAYRWQWIAGFISFVIYMVTT 228

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
           +  YVPDW FT+   D     + + V CG+R  L P CNAVGY+DR+V GINH+Y +P W
Sbjct: 229 FALYVPDWSFTV---DEDHERRRYTVECGMRGHLGPACNAVGYVDREVWGINHLYQYPVW 285

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
            R KACT  SP  GP RKDAPSWC APFEPEGLLSS+S+ILS  IG+H+GHV+IH KGH 
Sbjct: 286 SRLKACTLSSPGSGPFRKDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHA 345

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
            RL+QWV+MG  LLI  + LHFT+AIP+NKQLY+ SYVC T+GAA +VFS  Y L+D+W 
Sbjct: 346 ERLRQWVSMGVILLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSGFYVLIDVWG 405

Query: 380 LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL--FIISYILHSFL 437
           L+ PFL L WIGMNAMLVYVMAA+GIF GFINGWYY  P NTLV ++   + + + HS  
Sbjct: 406 LRPPFLFLEWIGMNAMLVYVMAAQGIFEGFINGWYYKSPDNTLVYWIQEHVFNDVWHSE- 464

Query: 438 WELRKFLYVQFCNLS 452
             +   LYV F  ++
Sbjct: 465 -RVGTLLYVIFAQIA 478


>gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 508

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 244/402 (60%), Positives = 300/402 (74%), Gaps = 2/402 (0%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
             +Q     KT+R+A+LD FRGL + LMILVD AG  +P I H+PWNGC LADFVMPFFL
Sbjct: 62  EQEQPPVKQKTKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFL 121

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           FIVGVAIALALKRI     +VKK+I RTLKLLFWGI+LQGG+SHAPD+L YGV+++ IR 
Sbjct: 122 FIVGVAIALALKRISKIKHSVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRW 181

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
           CG+LQRIAL Y +V+L+E FT  ++    + G  SIF  Y W W       ++Y+   + 
Sbjct: 182 CGILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFAAYKWQWFGGFVAFLIYMITTFS 241

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            YVPDW F  ++  + D  K + V CG+R  L P CNAVG++DR+V G+NH+Y  P WRR
Sbjct: 242 LYVPDWSF--VDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRR 299

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
            KACT  SP  GP R DAPSWC APFEPEGLLSS+S+ILS  IG+H+GHV+IH KGH  R
Sbjct: 300 LKACTFSSPGSGPFRDDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSER 359

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           LKQWV+MGF LLI  + LHFT+A+P+NKQLY+ SYVC T+GAA +VFS  Y L+D+W L+
Sbjct: 360 LKQWVSMGFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAAGIVFSGFYILIDVWGLR 419

Query: 382 YPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
            PFL L WIGMNAMLV+VMAAEGIFA F+NGWYY DP ++LV
Sbjct: 420 TPFLFLEWIGMNAMLVFVMAAEGIFAAFVNGWYYEDPRSSLV 461


>gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis]
 gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis]
          Length = 519

 Score =  519 bits (1337), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 261/476 (54%), Positives = 326/476 (68%), Gaps = 26/476 (5%)

Query: 5   KAETTHHHPLIISEPDVSD-----QQEKSHL----------KTQRLASLDIFRGLAVALM 49
           K + TH    +I E +++      +QE   L          KT+R+A+LD FRGL V LM
Sbjct: 42  KLDKTHDGGGVIPEKELTSSTVLVEQEGEQLQQPEQLPVKQKTKRVATLDAFRGLTVVLM 101

Query: 50  ILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRT 109
           ILVD+AG  +  I H+PWNGC LADFVMPFFLFIVGVAIALALKRIP + DAVKK+  RT
Sbjct: 102 ILVDNAGESYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPRKRDAVKKISLRT 161

Query: 110 LKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD 169
           LKLLFWGILLQGG+SHAP +L+YGVD+++IR CG+LQRIAL Y+ V+L+E  T   +   
Sbjct: 162 LKLLFWGILLQGGYSHAPVDLSYGVDMKLIRWCGILQRIALVYMFVALIETLTIKERQTV 221

Query: 170 QSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV 229
                FSIF  Y W W+      ++Y+   Y  YVPDW FT  + +       + V CG+
Sbjct: 222 LQPNHFSIFTAYRWQWIGGFIAFLIYMITTYALYVPDWSFTAYDDNRPTR---YTVKCGM 278

Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
           R  L P CNAVGY+DR+V GINH+Y +P W R KACT  SP  GPLR DAPSWC APFEP
Sbjct: 279 RGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKACTFSSPATGPLRADAPSWCLAPFEP 338

Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNK 349
           EGLLS++S+ILS  IG+H+GHV+IH KGH  RLKQWV+MG  L +  + LHFT+AIP+NK
Sbjct: 339 EGLLSTISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGLGLFLIAIILHFTDAIPINK 398

Query: 350 QLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGF 409
           QLY+ SYVC T+GAA +VFS  Y L+D+  L+ PFL L WIGMNAMLVYVMAA+GIF GF
Sbjct: 399 QLYSFSYVCFTAGAAGIVFSGFYILIDVLGLRIPFLFLEWIGMNAMLVYVMAAQGIFEGF 458

Query: 410 INGWYYGDPHNTLVCFLFIISYILHSFLW---ELRKFLYVQFCNLSWKFQSLVDNI 462
           INGW+Y   +NTLV   + I   +   +W   +L   LYV F  ++  F ++V  I
Sbjct: 459 INGWFYKSNNNTLV---YWIQEHVFDKVWNSEKLGNLLYVIFAQIT--FWAVVSGI 509


>gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 419

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 247/406 (60%), Positives = 299/406 (73%), Gaps = 7/406 (1%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           M+LVD AGG +P I H+PWNGC LADFVMPFFLFIVGVAIALALKRIP    AVKK+I R
Sbjct: 1   MVLVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVKYAVKKIILR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
           TLKLLFWGILLQGG+SHAPD+L+YGVD+R IR CG+LQRIAL Y +V+L+E +T  ++  
Sbjct: 61  TLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCVVALIETYTTKLRPS 120

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG 228
               G  SIF  Y W WL      V+Y+  ++  YVPDW F   N D     K + V CG
Sbjct: 121 TLKPGHLSIFTAYRWQWLGGFVAFVIYMVTIFSLYVPDWSFVDYNSDKP---KRYTVECG 177

Query: 229 VRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFE 288
           +R  L P CNAVGY+DR+V G+NH+Y  P W R KACT  SP EGPLRK+AP+WC APFE
Sbjct: 178 MRGHLGPACNAVGYVDRQVWGVNHLYSQPVWTRLKACTLSSPAEGPLRKNAPAWCRAPFE 237

Query: 289 PEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLN 348
           PEG LSSV +ILS  IG+H+GHV+IH KGH  RLKQW++MGF LL  GL LHFT+AIP+N
Sbjct: 238 PEGFLSSVLAILSGTIGIHYGHVLIHFKGHFERLKQWLSMGFVLLTLGLILHFTDAIPIN 297

Query: 349 KQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAG 408
           KQLY+ SYVC T+GAA +VFS  Y L+D+W L+ PFL L WIGMNAMLV+VMAA+GIFA 
Sbjct: 298 KQLYSFSYVCFTAGAAGIVFSVFYLLIDVWGLRTPFLFLEWIGMNAMLVFVMAAQGIFAA 357

Query: 409 FINGWYYGDPHNTLVCFL--FIISYILHSFLWELRKFLYVQFCNLS 452
           F+NGWYY DP N+LV ++   + + + HS    L   LYV F  ++
Sbjct: 358 FVNGWYYKDPDNSLVYWIQNHVFTNVWHSE--RLGTLLYVIFAEIT 401


>gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Glycine max]
          Length = 509

 Score =  510 bits (1313), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 238/402 (59%), Positives = 295/402 (73%), Gaps = 2/402 (0%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
             +Q     KT+R+A+LD FRGL + LMILVD AG  +P I H+PWNGC LADFVMPFFL
Sbjct: 63  EQEQPVVKQKTKRIATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFL 122

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           FIVG+AIALALKRI     AVKK+I RTLKLLFWGI+LQGG+SHAPD+L YGV+++ IR 
Sbjct: 123 FIVGIAIALALKRIAKIKHAVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRW 182

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
           CG+LQRIAL Y +V+L+E FT  ++    + G  SIF  Y W W       ++Y+   + 
Sbjct: 183 CGILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFTAYKWQWFGGFVAFIIYMITTFT 242

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            YVP W F  ++  + D  K + V CG+R  L P CNAVG++DR+V G+NH+Y  P WRR
Sbjct: 243 LYVPHWSF--LDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRR 300

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
            K     SP  GP R DAPSWC +PFEPEGLLSS+S+ILS  IG+H+GH++IH KGH  R
Sbjct: 301 LKMTIDYSPASGPFRDDAPSWCRSPFEPEGLLSSISAILSGTIGIHYGHILIHFKGHSER 360

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           LKQWV MGF LLI  + LHFT+A+P+NKQLY+ SYVC T+GAA +VFS +Y LVD+W L+
Sbjct: 361 LKQWVLMGFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAAGIVFSGLYILVDVWGLR 420

Query: 382 YPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
            PFL L WIGMNAMLV+VMAAEGIFA F+NGWY  +P N+LV
Sbjct: 421 TPFLFLEWIGMNAMLVFVMAAEGIFAAFVNGWYCENPRNSLV 462


>gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Vitis vinifera]
 gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera]
          Length = 489

 Score =  510 bits (1313), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 249/443 (56%), Positives = 313/443 (70%), Gaps = 9/443 (2%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
            ++Q     K++R+A+LD FRGL + LMILVD AGG +  I H+PWNGC LADFVMPFFL
Sbjct: 44  GEEQPLIKQKSKRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFL 103

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           FIVGVA+ALALK+IP  + AVKK+  RTLKLLFWGILLQGG+SHAPD+L+YGVD++ IR 
Sbjct: 104 FIVGVAVALALKKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRW 163

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            G+LQRIA+ Y +V+L+E  T   +      G FSI   Y W W+      ++Y+   Y 
Sbjct: 164 FGILQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYA 223

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            YVPDW F I   D     K + V CG+R  L P CNAVGY+DR+V GINH+Y  P W R
Sbjct: 224 LYVPDWSFVI---DQDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTR 280

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
            KACT  SP  GP R+DAPSWC+APFEPEGLLS++S+ILS  IG+H+GHV+IH KGH  R
Sbjct: 281 LKACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAER 340

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           LKQWV+MG  LLI  + LHFT+AIP+NKQLY+ SYVC T+GAA +V SA Y ++D+W  +
Sbjct: 341 LKQWVSMGIVLLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVLSAFYLVIDVWGFR 400

Query: 382 YPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL--FIISYILHSFLWE 439
            PFL L WIGMNAMLV+VMAA+GIFA FINGWY+    N+LV ++   +   + HS    
Sbjct: 401 TPFLFLEWIGMNAMLVFVMAAQGIFAAFINGWYFESSDNSLVHWIQRHVFIDVWHSE--R 458

Query: 440 LRKFLYVQFCNLSWKFQSLVDNI 462
           L   LYV F  ++  F ++V  I
Sbjct: 459 LGTLLYVIFAEIT--FWAVVSGI 479


>gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera]
          Length = 511

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/424 (56%), Positives = 295/424 (69%), Gaps = 25/424 (5%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
            ++Q     K++R+A+LD FRGL + LMILVD AGG +  I H+PWNGC LADFVMPFFL
Sbjct: 44  GEEQPLIKQKSKRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFL 103

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR- 140
           FIVGVA+ALALK+IP  + AVKK+  RTLKLLFWGILLQGG+SHAPD+L+YGVD++ IR 
Sbjct: 104 FIVGVAVALALKKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRW 163

Query: 141 ---------------------LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR 179
                                L G LQRIA+ Y +V+L+E  T   +      G FSI  
Sbjct: 164 FGILQVFPLPLFTGKSIPSSSLSGFLQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILS 223

Query: 180 LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNA 239
            Y W W+      ++Y+   Y  YVPDW F I   D     K + V CG+R  L P CNA
Sbjct: 224 AYKWQWIGGFVAFLIYMITTYALYVPDWSFVI---DQDHEAKRYTVKCGMRGHLGPACNA 280

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VGY+DR+V GINH+Y  P W R KACT  SP  GP R+DAPSWC+APFEPEGLLS++S+I
Sbjct: 281 VGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAI 340

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           LS  IG+H+GHV+IH KGH  RLKQWV+MG  LLI  + LHFT+AIP+NKQLY+ SYVC 
Sbjct: 341 LSGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIVAIILHFTDAIPINKQLYSFSYVCF 400

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 419
           T+GAA +V SA Y ++D+W  + PFL L WIGMNAMLV+VMAA+GIFA FINGWY+    
Sbjct: 401 TAGAAGIVXSAFYLVIDVWGFRTPFLFLEWIGMNAMLVFVMAAQGIFAAFINGWYFESSD 460

Query: 420 NTLV 423
           N+LV
Sbjct: 461 NSLV 464


>gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
 gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
          Length = 512

 Score =  490 bits (1262), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 245/456 (53%), Positives = 308/456 (67%), Gaps = 20/456 (4%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + +  V+  +E    K++R+A+LD FRGL + LMILVD AGG +  I H+PWNGC LADF
Sbjct: 38  VEKERVAVAEEVPKKKSRRVAALDAFRGLTIVLMILVDDAGGAYERIDHSPWNGCTLADF 97

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           VMPFFLFIVGVAIA ALKR+P+  +AVK++  RTLK+LFWG+LLQGG+SHAPD+L+YGVD
Sbjct: 98  VMPFFLFIVGVAIAFALKRVPNMGNAVKRITIRTLKMLFWGVLLQGGYSHAPDDLSYGVD 157

Query: 136 VRMIRLCGVLQ--------------------RIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           ++ IR  G+LQ                    RIAL Y +V+L+E FT  V+      G +
Sbjct: 158 MKKIRWMGILQLYIYHGNNLDSFLFFTLGHQRIALVYFIVALIEAFTVKVRPTTVRSGPY 217

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
           +IF  + W WL      V+Y+   +  YVPDW +   N    + GK F V CGVRA L  
Sbjct: 218 AIFNAHRWQWLGGFIAFVIYMVTTFSLYVPDWSYVYHNDGDVNDGKQFTVKCGVRASLEQ 277

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
            CNAVGY+DR+V GINH+Y  P W RSK CT  SP  GPLR DAP WC APFEPEGLLSS
Sbjct: 278 ACNAVGYVDRQVWGINHLYTQPVWIRSKDCTSSSPNMGPLRADAPEWCLAPFEPEGLLSS 337

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLS 355
           +SS+LS  IG+H+GHV+IH K H  RLK W+  GF+LL+  + LHFTNAIP+NKQLY+ S
Sbjct: 338 ISSVLSGTIGIHYGHVLIHFKTHKERLKHWLVTGFSLLVLAIILHFTNAIPINKQLYSFS 397

Query: 356 YVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYY 415
           YVC T GAA +V SA Y L+D+W L+ PFL L WIGMNAMLV+V+ A+GI AGF+NGWYY
Sbjct: 398 YVCFTGGAAGIVLSAFYILIDVWGLRKPFLFLEWIGMNAMLVFVLGAQGILAGFVNGWYY 457

Query: 416 GDPHNTLVCFLFIISYILHSFLWELRKFLYVQFCNL 451
             P N LV ++    ++       L   LYV FC +
Sbjct: 458 ESPDNNLVNWIVKHVFVDVWHSQNLGTLLYVIFCEI 493


>gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 498

 Score =  489 bits (1260), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 241/402 (59%), Positives = 299/402 (74%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           +E    K+ R+A+LD FRGL + +MILVD AG  +  + H+PWNGC LADFVMPFFLFIV
Sbjct: 53  EEPQKKKSTRVAALDAFRGLTIVVMILVDDAGSSYERMDHSPWNGCTLADFVMPFFLFIV 112

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           GVAIA A+KR+P+   AVKKV  RTLK++FWG+LLQGG+SHAPD+L YGVD++MIR CG+
Sbjct: 113 GVAIAFAMKRVPNMGAAVKKVSVRTLKMIFWGLLLQGGYSHAPDDLAYGVDMKMIRWCGI 172

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL Y  V+L+E+FT  V+      G ++IF  Y W WL A  VLV+Y+   +  YV
Sbjct: 173 LQRIALVYFAVALIEVFTTKVRPTTVRSGPYAIFDAYRWQWLGAFIVLVIYMITTFSLYV 232

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           PDW F   N    + GK F V CGVR  L+P CNAVG+IDR+V GINH+Y  P W R+K 
Sbjct: 233 PDWSFVYHNDGDINDGKRFTVQCGVRGHLDPACNAVGFIDRQVWGINHLYSQPVWIRTKD 292

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
           CT  SP  G LR DAP+WC  PFEPEGLLSS+SSI+S  IG+H+GHV+IH K H  RL  
Sbjct: 293 CTFSSPETGKLRDDAPAWCLGPFEPEGLLSSISSIISGTIGIHYGHVLIHFKTHKERLTH 352

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           W++MGFALL+ G+ LHFTNAIP+NKQLY+ SY+C T GAA +V SA YAL+D+W L+ PF
Sbjct: 353 WLSMGFALLLLGILLHFTNAIPINKQLYSFSYICFTGGAAGIVLSAFYALIDVWGLRVPF 412

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL 426
           L L WIGMNAMLV+V+AA+GIFA F+NGWYY     TLV ++
Sbjct: 413 LFLEWIGMNAMLVFVLAAQGIFAAFMNGWYYESQDKTLVNWI 454


>gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 488

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 252/453 (55%), Positives = 314/453 (69%), Gaps = 20/453 (4%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
           K E     P I+ E  +         KT+R+A+LD FRGL + LMILVD AGG +  I H
Sbjct: 33  KEEEKEVAPTIVEEAQLRQ-------KTKRVATLDAFRGLTIVLMILVDDAGGAYSRIDH 85

Query: 65  APWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFS 124
           +PWNGC LADFVMPFFLFIVGVAIALA KRI      V K+  RT+KL+FWG++LQGG+S
Sbjct: 86  SPWNGCTLADFVMPFFLFIVGVAIALAFKRIGSIKQGVMKISLRTIKLVFWGLILQGGYS 145

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV---GRFSIFRLY 181
           HAPD+L YGVD++ IR CG+LQRIAL Y +V+++E FT     K + V   G FSIF  Y
Sbjct: 146 HAPDDLEYGVDMKHIRWCGILQRIALVYFVVAMIEAFT--TIGKPRVVLDHGHFSIFTAY 203

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
              W+      ++Y+   Y  YVP+W F+++  D   +   + V CGVR  L P CNAVG
Sbjct: 204 --RWIGGFAAFIIYIITTYALYVPNWSFSVLEDDQLLHH--YTVVCGVRGHLGPACNAVG 259

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           ++DR+V GINH+Y +P W R K CT  +P EGPLR DA SWC APFEPEGLLSSVS+ILS
Sbjct: 260 HVDRQVWGINHLYSYPVWIRHKDCTFSAPDEGPLRDDAASWCLAPFEPEGLLSSVSAILS 319

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
             IG+H+GHV++H K H  RLKQWV+MGF   I G+ LHFTNAIP+NKQLY+LSYVC T+
Sbjct: 320 GTIGIHYGHVLLHFKTHSQRLKQWVSMGFGFFIIGIILHFTNAIPINKQLYSLSYVCFTA 379

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNT 421
           GAA +VFS  Y L+D+W  + PFL L WIGMNAMLV+VMAA+GIFA FINGWYY DP N+
Sbjct: 380 GAAGIVFSGFYILIDVWGFRKPFLFLEWIGMNAMLVFVMAAQGIFAAFINGWYYKDPENS 439

Query: 422 LVCFL--FIISYILHSFLWELRKFLYVQFCNLS 452
           LV ++   I   + HS   +L   LYV F  ++
Sbjct: 440 LVHWIQKHIFINVWHSR--KLGTLLYVIFAEIA 470


>gi|302754694|ref|XP_002960771.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
 gi|300171710|gb|EFJ38310.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
          Length = 493

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 240/438 (54%), Positives = 310/438 (70%), Gaps = 16/438 (3%)

Query: 2   SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQ----------RLASLDIFRGLAVALMIL 51
            +++   T  H +   E D   ++E    + Q          R+A+LD+FRGL VALM+L
Sbjct: 9   EKLEGHDTKEHKISFHEEDRHVKKESLLNEDQAVAVPLKKPVRIATLDVFRGLTVALMVL 68

Query: 52  VDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           VD AGG+WP I+H+PWNGC LAD VMPFFLFIVGVAIALALKRIPD+  A +KV+ RTLK
Sbjct: 69  VDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIALALKRIPDQVAATQKVVIRTLK 128

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS 171
           LLFWG+LLQGGFSHAPD+L+YGVD+R IR CG+LQRIA  YL+V+LVEI T   +  +  
Sbjct: 129 LLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAFGYLIVALVEIATTKSRSLELP 188

Query: 172 VGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT------IINKDSADYGKVFNV 225
            G+F IF+LY WHW  A  V+++Y ++ YG YVPDW F       +++     +    NV
Sbjct: 189 KGQFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFIDSGHRFVVSLAKFVFSSQINV 248

Query: 226 TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHA 285
            CGVR  + P CNAVG+IDR +LGINH+Y  P W R+++C  DSP EG    +AP+WC A
Sbjct: 249 QCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQSCDLDSPAEGDPPANAPAWCKA 308

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PFEPEG+LSS+S+ILS IIG+H+GHV+IH KGH+ R+  W     ALL+    LHFT+AI
Sbjct: 309 PFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLHWTIPAAALLVLATILHFTHAI 368

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGI 405
           PLNKQLY+ SYVC T+GAA ++FS +Y ++DI+ +  P L   W+G+NAM V+VMAA GI
Sbjct: 369 PLNKQLYSFSYVCFTAGAAGMIFSLLYVVIDIFGINQPTLIFQWMGLNAMFVFVMAASGI 428

Query: 406 FAGFINGWYYGDPHNTLV 423
            A F+NGWY+ DP N LV
Sbjct: 429 AAAFVNGWYWHDPSNNLV 446


>gi|302804288|ref|XP_002983896.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
 gi|300148248|gb|EFJ14908.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
          Length = 493

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 235/399 (58%), Positives = 297/399 (74%), Gaps = 6/399 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           K  R+A+LD+FRGL VALM+LVD AGG+WP I+H+PWNGC LAD VMPFFLFIVGVAIAL
Sbjct: 48  KPVRIATLDVFRGLTVALMVLVDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIAL 107

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           ALKRIPD+  A +KV+ RTLKLLFWG+LLQGGFSHAPD+L+YGVD+R IR CG+LQRIA 
Sbjct: 108 ALKRIPDQVAATQKVVIRTLKLLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAF 167

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
            YL+V+LVEI T   +  +   G F IF+LY WHW  A  V+++Y ++ YG YVPDW F 
Sbjct: 168 GYLIVALVEIATTKSRSLELPKGHFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFI 227

Query: 211 ------IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +++     +    NV CGVR  + P CNAVG+IDR +LGINH+Y  P W R+++
Sbjct: 228 DSGHRFVVSLAKFVFSSQINVQCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQS 287

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
           C  DSP EG    +AP+WC APFEPEG+LSS+S+ILS IIG+H+GHV+IH KGH+ R+  
Sbjct: 288 CDLDSPAEGDPPANAPAWCKAPFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLH 347

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           W     ALL+    LHFT+AIPLNKQLY+ SYVC T+GAA ++FS +Y ++DI+ +  P 
Sbjct: 348 WTIPAAALLVLATILHFTHAIPLNKQLYSFSYVCFTAGAAGMIFSLLYVVIDIFGINQPT 407

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           L   W+G+NAM V+VMAA GI A F+NGWY+ DP N LV
Sbjct: 408 LIFQWMGLNAMFVFVMAASGIAAAFVNGWYWHDPSNNLV 446


>gi|357510831|ref|XP_003625704.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355500719|gb|AES81922.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 444

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 191/323 (59%), Positives = 239/323 (73%), Gaps = 4/323 (1%)

Query: 101 AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
            VKK+I RTLKLLFWGILLQGG+SHAPDEL YGV+++ IR CG+LQRIAL Y +V+L+E 
Sbjct: 79  TVKKIILRTLKLLFWGILLQGGYSHAPDELVYGVNMKFIRWCGILQRIALVYCIVALIET 138

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
           FT  ++    S GR +IF  Y   W       ++Y+   +  YVP+W F  ++  + D  
Sbjct: 139 FTTKLRPTTLSPGRIAIFTAY--KWFGGFMAFLIYMITTFALYVPNWSF--VDHVNNDEP 194

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
           K + V CG+R  L P CNAVGY+DR+  G+NH+Y  P WRR KACT  SP EGP R DAP
Sbjct: 195 KRYTVICGMRGHLGPACNAVGYVDRQTWGVNHLYSQPVWRRLKACTFSSPSEGPFRDDAP 254

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
           SWC APFEPEGLLSS+S+ILS  IG+H+GHV+IH K H  RLKQW +MGF LL+  + LH
Sbjct: 255 SWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKSHSERLKQWFSMGFVLLVVAIILH 314

Query: 341 FTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           FT+AIP+NKQLY+ SYVC T+GAA ++FS +Y L+D+W ++ PFL L WIGMNAMLVYV+
Sbjct: 315 FTDAIPINKQLYSFSYVCFTAGAAGIIFSILYILIDVWGIRTPFLFLEWIGMNAMLVYVL 374

Query: 401 AAEGIFAGFINGWYYGDPHNTLV 423
           AAEGIFA F+NGWYY DP  +LV
Sbjct: 375 AAEGIFAAFVNGWYYEDPEKSLV 397


>gi|242075654|ref|XP_002447763.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
 gi|241938946|gb|EES12091.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
          Length = 446

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/398 (50%), Positives = 275/398 (69%), Gaps = 6/398 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+AL
Sbjct: 53  RPQRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALAL 112

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PD+ DA KK + R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA+
Sbjct: 113 AYKRVPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAI 172

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL +L EI+ K  +D D     + + + Y +  L+ A V + Y+ LLYGTYVPDW++ 
Sbjct: 173 AYLLTALCEIWLKGDEDVDYG---YDLLKRYRYQLLVGAVVAITYMCLLYGTYVPDWEYQ 229

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
                S +  K F V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP
Sbjct: 230 TSGPGSIE--KSFFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSP 287

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F
Sbjct: 288 QNGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSF 347

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
           ++L+    + F   + +NK LY+LSY   T+GAA L+FS IY LVDI+  + P + + W+
Sbjct: 348 SMLVLAFAMDFFG-LHMNKPLYSLSYTLATAGAAGLLFSGIYTLVDIYGYRRPTVAMEWM 406

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           GM+A+++YV+ A  I   FI+G+Y+ +P N L+ F+ I
Sbjct: 407 GMHALMIYVLIACNILPIFIHGFYWKEPKNNLLKFIGI 444


>gi|226509496|ref|NP_001144452.1| uncharacterized protein LOC100277415 [Zea mays]
 gi|195642330|gb|ACG40633.1| hypothetical protein [Zea mays]
          Length = 441

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/393 (48%), Positives = 267/393 (67%), Gaps = 4/393 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 46  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 222

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K C+ D P
Sbjct: 223 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 282

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H  R+  W+   F
Sbjct: 283 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRITSWLVPSF 342

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
           ++L     + F   + +NK LYT+SY   T+GAA L+F+ IYALVD++  + P + + W+
Sbjct: 343 SMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAAGLLFAGIYALVDLYGFRRPTIAMEWM 401

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           G +A+++YV+ A  I   FI G+Y+ DP+N+L+
Sbjct: 402 GKHALMIYVLVACNILPMFIRGFYWRDPNNSLL 434


>gi|219885579|gb|ACL53164.1| unknown [Zea mays]
 gi|413937084|gb|AFW71635.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 482

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/393 (48%), Positives = 267/393 (67%), Gaps = 4/393 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 87  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 263

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K C+ D P
Sbjct: 264 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 323

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H  R+  W+   F
Sbjct: 324 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSF 383

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
           ++L     + F   + +NK LYT+SY   T+GAA L+F+ IYALVD++  + P + + W+
Sbjct: 384 SMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAAGLLFAGIYALVDLYGFRRPTIAMEWM 442

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           G +A+++YV+ A  I   FI G+Y+ DP+N+L+
Sbjct: 443 GKHALMIYVLVACNILPMFIRGFYWRDPNNSLL 475


>gi|413937082|gb|AFW71633.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 441

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/393 (48%), Positives = 267/393 (67%), Gaps = 4/393 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 46  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 222

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K C+ D P
Sbjct: 223 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 282

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H  R+  W+   F
Sbjct: 283 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSF 342

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
           ++L     + F   + +NK LYT+SY   T+GAA L+F+ IYALVD++  + P + + W+
Sbjct: 343 SMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAAGLLFAGIYALVDLYGFRRPTIAMEWM 401

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           G +A+++YV+ A  I   FI G+Y+ DP+N+L+
Sbjct: 402 GKHALMIYVLVACNILPMFIRGFYWRDPNNSLL 434


>gi|219886509|gb|ACL53629.1| unknown [Zea mays]
 gi|414587417|tpg|DAA37988.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
          Length = 438

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 197/396 (49%), Positives = 272/396 (68%), Gaps = 6/396 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 47  QRLVSLDVFRGITVLLMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 106

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA KK + R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA++Y
Sbjct: 107 KRVPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAY 166

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           LL +L EI+ K  +D D     + + + Y +   + A V + Y++LLYGTYV DW++   
Sbjct: 167 LLTALCEIWLKGDEDVDYG---YDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTS 223

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K F V CGVR   +P CNAVG IDR++LGI H+Y  P + RSK C+ DSP  
Sbjct: 224 GPGSIE--KSFFVKCGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQN 281

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++
Sbjct: 282 GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSM 341

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           L+    + F   + +NK LYTLSY   T+GAA L+F  IY LVDI+  + P + + W+GM
Sbjct: 342 LVLAFAMDFL-GLRMNKPLYTLSYTLATAGAAGLLFCGIYTLVDIYGYRRPTVAMEWMGM 400

Query: 393 NAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           +A+++YV+ A  I   FI+G+Y+ +P N L+ F+ I
Sbjct: 401 HALMIYVLIACNILPIFIHGFYWKEPQNNLLKFIGI 436


>gi|224033113|gb|ACN35632.1| unknown [Zea mays]
 gi|413918233|gb|AFW58165.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 444

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 196/396 (49%), Positives = 273/396 (68%), Gaps = 6/396 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 53  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           LL +L EI+ +  +D D     + + + Y +   + A V + Y++LLYGTYVPDW++   
Sbjct: 173 LLTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 229

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K   V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP  
Sbjct: 230 APGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQN 287

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++
Sbjct: 288 GPLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSM 347

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           L+    + F   + +NK LYTLSY   T+GAA L+FS IY LVDI+  + P + + W+GM
Sbjct: 348 LVLAFAMDFFG-LHMNKPLYTLSYTLGTAGAAGLLFSGIYTLVDIYGYRRPTVAMEWMGM 406

Query: 393 NAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           +A+++YV+ A  +   FI+G+Y+ +P N L+ F+ I
Sbjct: 407 HALMIYVLIACNVLPIFIHGFYWKEPKNNLLKFIGI 442


>gi|226494648|ref|NP_001146383.1| uncharacterized protein LOC100279961 [Zea mays]
 gi|219886923|gb|ACL53836.1| unknown [Zea mays]
 gi|413918231|gb|AFW58163.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 469

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 196/396 (49%), Positives = 273/396 (68%), Gaps = 6/396 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 78  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 137

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++Y
Sbjct: 138 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 197

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           LL +L EI+ +  +D D     + + + Y +   + A V + Y++LLYGTYVPDW++   
Sbjct: 198 LLTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 254

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K   V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP  
Sbjct: 255 APGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQN 312

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++
Sbjct: 313 GPLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSM 372

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           L+    + F   + +NK LYTLSY   T+GAA L+FS IY LVDI+  + P + + W+GM
Sbjct: 373 LVLAFAMDFFG-LHMNKPLYTLSYTLGTAGAAGLLFSGIYTLVDIYGYRRPTVAMEWMGM 431

Query: 393 NAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           +A+++YV+ A  +   FI+G+Y+ +P N L+ F+ I
Sbjct: 432 HALMIYVLIACNVLPIFIHGFYWKEPKNNLLKFIGI 467


>gi|326505544|dbj|BAJ95443.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 429

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 193/396 (48%), Positives = 269/396 (67%), Gaps = 6/396 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 38  QRLVSLDVFRGITVLLMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAY 97

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K   R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++Y
Sbjct: 98  KRVPDKLDATRKATLRALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAY 157

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+ +L +I+ K   D D  +    + + Y +  L    + + Y+ALLYGTYVPDW++ I 
Sbjct: 158 LVTALCQIWLKGDDDVDSGL---DLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRIS 214

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
                +  K F V CGVR    P CNAVG IDRK+LGI H+Y  P + RS+ C+ DSP  
Sbjct: 215 GPGFTE--KTFTVRCGVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQN 272

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F +
Sbjct: 273 GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGM 332

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           L+    + F   + +NK LYTLSY   T+GAA L+F+ IY LVD++  + P + + W+GM
Sbjct: 333 LVLAFAMDFFG-MHMNKPLYTLSYTLCTAGAAGLLFAGIYTLVDLYGYRRPTVAMEWMGM 391

Query: 393 NAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           +A++++V+ A  I   FI+G+Y+G+P+N L+ F+ I
Sbjct: 392 HALMIFVLIACNILPIFIHGFYWGEPNNNLLKFIGI 427


>gi|168035930|ref|XP_001770461.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678169|gb|EDQ64630.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 487

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 205/403 (50%), Positives = 277/403 (68%), Gaps = 12/403 (2%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           K+ RLASLD+FRGL++A+MILVD+AGG WP I+H+PW G  LADFVMPFFLFIVGVA+AL
Sbjct: 42  KSPRLASLDVFRGLSIAVMILVDNAGGVWPSINHSPWTGITLADFVMPFFLFIVGVALAL 101

Query: 91  ALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
             KRI  D+  A +K + RT KLL  G+++QGG+ H   + +YGVD+  IR CGVLQRIA
Sbjct: 102 TYKRITRDKKVASQKALGRTAKLLIVGLVIQGGYFHGLHDTSYGVDLERIRWCGVLQRIA 161

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L+Y++V+L EI+    + +D S   F+IF+ Y +HW +AA ++  YLALLYG YVPDW F
Sbjct: 162 LAYMVVALCEIWAPR-RRQDVSNDNFAIFKTYHFHWAVAAAIVATYLALLYGVYVPDWDF 220

Query: 210 ---TIINKDSADY------GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
              T++N  +         G +  V CGVR  + P CNAVGY+DR +LG++H+Y  P +R
Sbjct: 221 IPPTVLNSTALHVSVVRVNGSMSEVHCGVRGNIGPACNAVGYLDRTILGVSHLYQRPVFR 280

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
           R+ AC+ +SP  GPL   AP WC APF+PEGLLSS+S++ S  +G+HFGHV++H K H+A
Sbjct: 281 RTPACSVNSPDYGPLPSGAPDWCKAPFDPEGLLSSLSAVGSCFLGLHFGHVLVHRKEHIA 340

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNL 380
           RL  W+ M   LLI    L     +P NK LY++SY+  T GAA +VF+  Y LVD++  
Sbjct: 341 RLWDWMIMSLVLLIV-GLLLHLLGVPFNKPLYSVSYMLFTGGAAGVVFAGFYLLVDVYGW 399

Query: 381 KYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           + P   L W+G NA+L+YV+ AEG+F   + G Y+  P N LV
Sbjct: 400 RGPTFLLEWLGQNALLMYVLVAEGVFPAALQGIYWRKPENNLV 442


>gi|302796996|ref|XP_002980259.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
 gi|300151875|gb|EFJ18519.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
          Length = 401

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 187/376 (49%), Positives = 259/376 (68%), Gaps = 10/376 (2%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           +MILVD+AGG+WP I+H+PWNG  LAD VMPFFLFIVGVA+AL  K+IP + D+ +K I 
Sbjct: 1   MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R+LKL F G+ LQGG+ H  ++L+YGVD+ +IR CG+LQRIA  YL+V+L E++   VQ 
Sbjct: 61  RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYLVVALCEVWLPRVQG 120

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
                  F   + Y +HW+     L VYL+LLYG  VPDWQF + N  +        VTC
Sbjct: 121 S-----YFGFMQNYLFHWIFVVVTLTVYLSLLYGLKVPDWQFELPNNRNI----TMTVTC 171

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           G R+ L+PPCNAVGY+DR++LG+NH+   P + R+++C+ +SP  GPL  DAP WCHAPF
Sbjct: 172 GTRSNLDPPCNAVGYVDRQILGVNHLDQRPVFIRTESCSINSPDYGPLPADAPVWCHAPF 231

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEG+LSSVS+I++  IG+H+GH I+  K H  R+  ++   F LL  G  LH    I +
Sbjct: 232 DPEGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAFILLALGYVLHLL-GIKM 290

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           NK LY+ SY+C T+GAA  VF  +Y LVD+++++YP L L W+GMN++++Y +AA  +  
Sbjct: 291 NKPLYSFSYMCFTAGAAGAVFCLLYILVDVYDIRYPTLLLEWMGMNSLIIYTLAATDVLV 350

Query: 408 GFINGWYYGDPHNTLV 423
            FI G+Y+  P   LV
Sbjct: 351 DFIQGFYWKQPQKNLV 366


>gi|242065256|ref|XP_002453917.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
 gi|241933748|gb|EES06893.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
          Length = 439

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 189/393 (48%), Positives = 267/393 (67%), Gaps = 6/393 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           +  RL SLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADFVMPFFLFIVGV++ L
Sbjct: 46  RQPRLVSLDVFRGITVLLMIIVDDAGGFLPSLNHSPWDGVTIADFVMPFFLFIVGVSLTL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PD+ +A KK + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 106 AYKRVPDKLEATKKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 166 AYLLAAICEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYTILLYGIYVPDWEYK 222

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S +  K F+V CGVR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P
Sbjct: 223 ISGPGSTE--KSFSVKCGVRGDTGPACNAVGMVDRTILGIDHLYRRPVYARTKECSINYP 280

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F
Sbjct: 281 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLIPSF 340

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
           ++L     + F+  + +NK LYT+SY   T+GAA L+F+ IYALVD++  + P +P+ W+
Sbjct: 341 SMLALAFLMDFSG-MRMNKPLYTISYTLATAGAAGLLFAGIYALVDLYGFRKPTIPMEWM 399

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           G +A+++YV+ A  I   FI G+Y+ DP+N+L+
Sbjct: 400 GKHALMIYVLVACNILPMFIRGFYWRDPNNSLL 432


>gi|359473865|ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Vitis vinifera]
 gi|296085565|emb|CBI29297.3| unnamed protein product [Vitis vinifera]
          Length = 444

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 191/392 (48%), Positives = 269/392 (68%), Gaps = 4/392 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
             +RL SLD+FRGL VA+MILVD AGG  P I+H+PWNG  LADFVMPFFLFIVGV++AL
Sbjct: 49  SKRRLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 108

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A K +     A K  + R LKLL +G+ LQGG+ H  + LTYGVD+  IRL G+LQRIA+
Sbjct: 109 AYKNLSSGYLATKMAVVRALKLLVFGLFLQGGYFHGLNNLTYGVDIEQIRLAGILQRIAV 168

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +Y L ++ EI+ K   D +   G  S+ + Y + W +   + V Y +LLYG YVPDW+++
Sbjct: 169 AYFLAAVCEIWLKG--DSNVKSGS-SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYS 225

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I ++ S+   K+F V CGVR+   P CNAVG IDR VLGI H+Y  P + R K C+ +SP
Sbjct: 226 IPSETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSP 285

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  +AP+WC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H  R+  W+    
Sbjct: 286 DYGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSS 345

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
            LL+ G  L F   + +NK LYTLSY+CVT+GAA ++F+ IY +VD++  + P + + W+
Sbjct: 346 CLLVLGFALDFF-GMHVNKALYTLSYMCVTAGAAGILFAGIYLMVDMYGYRRPTIVMEWM 404

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTL 422
           GM+A+++Y++AA  I   F+ G+Y+  P N +
Sbjct: 405 GMHALMIYILAACNILPVFLQGFYWRRPQNNI 436


>gi|115485801|ref|NP_001068044.1| Os11g0543500 [Oryza sativa Japonica Group]
 gi|77551354|gb|ABA94151.1| expressed protein [Oryza sativa Japonica Group]
 gi|113645266|dbj|BAF28407.1| Os11g0543500 [Oryza sativa Japonica Group]
 gi|125577433|gb|EAZ18655.1| hypothetical protein OsJ_34172 [Oryza sativa Japonica Group]
 gi|215701389|dbj|BAG92813.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 448

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 194/406 (47%), Positives = 266/406 (65%), Gaps = 3/406 (0%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D +  +    QRL SLD+FRG+ VALMILVD  GG  P ISH+PW+G  LADFV PFFLF
Sbjct: 44  DGEAAATTTRQRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLF 103

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           IVGV++A A K++PD+  A KK + R +KL   G++LQGGF H   ELTYGVD+R IRL 
Sbjct: 104 IVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLM 163

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA++YL+V+L EI+ + V           +   Y     +   ++V YL +LYG 
Sbjct: 164 GVLQRIAIAYLVVALCEIWLRRVSSGGNIGSGSMLITRYHHQMFVGLVLVVTYLVILYGL 223

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           +VPDW++ + + DS    K F V CGV+    P CNAVG IDR VLGI H+Y HP + ++
Sbjct: 224 HVPDWEYEVTSPDSTV--KHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKT 281

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           + C+  SP  GPL  +APSWC APF+PEGLLSS+ +I++ +IG+  GHVI+H K H  R+
Sbjct: 282 EQCSMASPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERI 341

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
           K+W  +   LL  G +LH    + +NK LY+LSY CVT+G A L F AIY LVD+   K 
Sbjct: 342 KRWSILSLCLLTLGFSLHLF-GLHMNKSLYSLSYTCVTTGTAGLFFVAIYLLVDVKGYKR 400

Query: 383 PFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           P LP+ W+G +A++++V+ A  +    + G+Y+ +P N L+  + I
Sbjct: 401 PVLPMEWMGKHALMIFVLVACNVIPVLVQGFYWKEPSNNLLKLIGI 446


>gi|302759310|ref|XP_002963078.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
 gi|300169939|gb|EFJ36541.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
          Length = 401

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 184/376 (48%), Positives = 257/376 (68%), Gaps = 10/376 (2%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           +MILVD+AGG+WP I+H+PWNG  LAD VMPFFLFIVGVA+AL  K+IP + D+ +K I 
Sbjct: 1   MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R+LKL F G+ LQGG+ H  ++L+YGVD+ +IR CG+LQRIA  Y++V+L E++   VQ 
Sbjct: 61  RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYVIVALCEVWLPRVQG 120

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
                  F I + Y +HW+     L VYL+LLYG  VP WQF + N  +        VTC
Sbjct: 121 S-----YFGIMQNYLFHWIFVVVTLTVYLSLLYGLKVPHWQFELPNNRNI----TMTVTC 171

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           G R+ L+P CNAVGY+DR++LG+NH+   P + R+++C+ +SP  GPL  DAP WCHAPF
Sbjct: 172 GTRSNLDPACNAVGYVDRQILGVNHLDQQPVFIRTESCSINSPDYGPLPADAPVWCHAPF 231

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEG+LSSVS+I++  IG+H+GH I+  K H  R+  ++     LL  G  LH    I +
Sbjct: 232 DPEGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAVILLALGYVLHLL-GIKM 290

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           NK LY+ SY+C T+GAA  VF  +Y LVD+++++YP L L W+GMN++++Y +AA  +  
Sbjct: 291 NKPLYSFSYMCFTAGAAGAVFCLLYILVDVYDIRYPTLLLEWMGMNSLIIYTLAATDVLV 350

Query: 408 GFINGWYYGDPHNTLV 423
            FI G+Y+  P   LV
Sbjct: 351 VFIQGFYWKQPQKNLV 366


>gi|218185886|gb|EEC68313.1| hypothetical protein OsI_36402 [Oryza sativa Indica Group]
          Length = 450

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 196/406 (48%), Positives = 267/406 (65%), Gaps = 3/406 (0%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D +  +    QRL SLD+FRG+ VALMILVD  GG  P ISH+PW+G  LADFV PFFLF
Sbjct: 46  DGEAAATTTRQRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLF 105

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           IVGV++A A K++PD+  A KK + R +KL   G++LQGGF H   ELTYGVD+R IRL 
Sbjct: 106 IVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLM 165

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA++YL+V+L EI+ + V           +   Y     +   ++V YL +LYG 
Sbjct: 166 GVLQRIAIAYLVVALCEIWLRRVSSGGDIGSGSMLITRYHHQMFVGLVLVVTYLVILYGL 225

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           +VPDW++ + + DS    K F V CGV+    P CNAVG IDR VLGI H+Y HP + ++
Sbjct: 226 HVPDWEYEVTSLDSTV--KHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKT 283

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           + C+ DSP  GPL  +APSWC APF+PEGLLSS+ +I++ +IG+  GHVI+H K H  R+
Sbjct: 284 EQCSMDSPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERI 343

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
           K+W T+   LL  G +LH    + +NK LY+LSY CVT+G A L F AIY LVD+   K 
Sbjct: 344 KRWSTLSLCLLTLGFSLHLF-GLHMNKSLYSLSYTCVTAGTAGLFFVAIYLLVDVKGYKR 402

Query: 383 PFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           P  P+ W+G +A++++V+ A  I    + G+Y+ +P N L+  + I
Sbjct: 403 PVFPMEWMGKHALMIFVLVACNIVPVLVQGFYWKEPSNNLLKLIGI 448


>gi|212723180|ref|NP_001132467.1| uncharacterized protein LOC100193923 [Zea mays]
 gi|194694464|gb|ACF81316.1| unknown [Zea mays]
 gi|414587418|tpg|DAA37989.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
          Length = 391

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 188/382 (49%), Positives = 261/382 (68%), Gaps = 6/382 (1%)

Query: 47  ALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVI 106
           ALMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA KR+PD+ DA KK +
Sbjct: 14  ALMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAV 73

Query: 107 FRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
            R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA++YLL +L EI+ K  +
Sbjct: 74  LRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDE 133

Query: 167 DKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVT 226
           D D     + + + Y +   + A V + Y++LLYGTYV DW++      S +  K F V 
Sbjct: 134 DVDYG---YDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTSGPGSIE--KSFFVK 188

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           CGVR   +P CNAVG IDR++LGI H+Y  P + RSK C+ DSP  GPL  DAPSWC AP
Sbjct: 189 CGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAP 248

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++L+    + F   + 
Sbjct: 249 FDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSMLVLAFAMDFL-GLR 307

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIF 406
           +NK LYTLSY   T+GAA L+F  IY LVDI+  + P + + W+GM+A+++YV+ A  I 
Sbjct: 308 MNKPLYTLSYTLATAGAAGLLFCGIYTLVDIYGYRRPTVAMEWMGMHALMIYVLIACNIL 367

Query: 407 AGFINGWYYGDPHNTLVCFLFI 428
             FI+G+Y+ +P N L+ F+ I
Sbjct: 368 PIFIHGFYWKEPQNNLLKFIGI 389


>gi|195642128|gb|ACG40532.1| hypothetical protein [Zea mays]
          Length = 379

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/381 (49%), Positives = 261/381 (68%), Gaps = 6/381 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA KR+PD+ DA KK + 
Sbjct: 3   LMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAVL 62

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA++YLL +L EI+ K  +D
Sbjct: 63  RALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDED 122

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
            D     + + + Y +  L+ A V + Y++LLYGTYVPD ++      S +  K F V C
Sbjct: 123 VDYG---YDLLKRYRYQLLVGAVVAITYMSLLYGTYVPDCEYQTSGPGSIE--KSFFVKC 177

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR   +P CNAVG IDR++LGI H+Y  P + RSK C+ DSP  GPL  DAPSWC APF
Sbjct: 178 GVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAPF 237

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F++L+    + F   + +
Sbjct: 238 DPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSFSMLVLAFAMDFL-GLRM 296

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           NK LYTLSY   T+GAA L+F  IY LVDI+  + P + + W+GM+A+++YV+ A  I  
Sbjct: 297 NKPLYTLSYTLATAGAAGLLFCGIYTLVDIYGYRRPTVAMEWMGMHALMIYVLIACNILP 356

Query: 408 GFINGWYYGDPHNTLVCFLFI 428
            FI+G+Y+ +P N L+ F+ I
Sbjct: 357 IFIHGFYWKEPQNNLLKFIGI 377


>gi|116309454|emb|CAH66526.1| H0502B11.6 [Oryza sativa Indica Group]
 gi|218194797|gb|EEC77224.1| hypothetical protein OsI_15768 [Oryza sativa Indica Group]
          Length = 448

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 193/391 (49%), Positives = 267/391 (68%), Gaps = 6/391 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMILVD AG   P I+H+PW+G  LADFVMPFFLFIVGVA+ALA 
Sbjct: 57  QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 116

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+P++ +A +K I R LKL   G++LQGGF H    LT+G+D+  IRL G+LQRIA++Y
Sbjct: 117 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 176

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ +L EI+ K   D D     F + +   +   +   V++ Y+  LYGTYVPDW++ I 
Sbjct: 177 IVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 233

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K F V C VR    P CNAVG IDRK+LGI H+Y  P + RSK C+ +SP  
Sbjct: 234 VPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQN 291

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+ +W+   F++
Sbjct: 292 GPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSM 351

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           LI   +L F   + +NK LYT+SY   T+GAA L+F+ IYALVD++  + P   + W+G 
Sbjct: 352 LILAFSLDFF-GMHMNKPLYTVSYALATAGAAGLLFAGIYALVDMYGHRRPTAVMEWMGT 410

Query: 393 NAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           +A+++YV+ A  I   FI+G+Y+ +P N L+
Sbjct: 411 HALMIYVLIACNILPIFIHGFYWREPKNNLL 441


>gi|115458212|ref|NP_001052706.1| Os04g0404900 [Oryza sativa Japonica Group]
 gi|113564277|dbj|BAF14620.1| Os04g0404900 [Oryza sativa Japonica Group]
 gi|222628804|gb|EEE60936.1| hypothetical protein OsJ_14685 [Oryza sativa Japonica Group]
          Length = 447

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 193/391 (49%), Positives = 267/391 (68%), Gaps = 6/391 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMILVD AG   P I+H+PW+G  LADFVMPFFLFIVGVA+ALA 
Sbjct: 56  QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 115

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+P++ +A +K I R LKL   G++LQGGF H    LT+G+D+  IRL G+LQRIA++Y
Sbjct: 116 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 175

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ +L EI+ K   D D     F + +   +   +   V++ Y+  LYGTYVPDW++ I 
Sbjct: 176 IVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 232

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K F V C VR    P CNAVG IDRK+LGI H+Y  P + RSK C+ +SP  
Sbjct: 233 VPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQN 290

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+ +W+   F++
Sbjct: 291 GPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSM 350

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           LI   +L F   + +NK LYT+SY   T+GAA L+F+ IYALVD++  + P   + W+G 
Sbjct: 351 LILAFSLDFF-GMHMNKPLYTVSYALATAGAAGLLFAGIYALVDMYGHRRPTAVMEWMGT 409

Query: 393 NAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           +A+++YV+ A  I   FI+G+Y+ +P N L+
Sbjct: 410 HALMIYVLIACNILPIFIHGFYWREPKNNLL 440


>gi|224057870|ref|XP_002299365.1| predicted protein [Populus trichocarpa]
 gi|222846623|gb|EEE84170.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 189/390 (48%), Positives = 256/390 (65%), Gaps = 4/390 (1%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL SLD+FRGL VALMILVD AGG  P I+H+PWNG  LAD VMPFFLFIVGV++ L  K
Sbjct: 1   RLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFIVGVSLGLTYK 60

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
           ++  +A A +K I RTLKLL  G+ LQGGF H  ++LTYGVD+  IR  G+LQRIA+ YL
Sbjct: 61  KLSCKAVATRKAILRTLKLLIIGLFLQGGFLHGLNDLTYGVDMTQIRWMGILQRIAIGYL 120

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
           + ++ EI+ K     +      S+ R Y + W      + +YL+LLYG +VPDW++ I  
Sbjct: 121 VGAMCEIWLKG---GNHVTSGLSMLRKYQFQWAAVLMFVTIYLSLLYGLHVPDWEYQIPV 177

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
             SA   K+F V CGVR    P CNA G IDR +LGI H+Y  P + R+K C+ +SP  G
Sbjct: 178 AASASTPKIFPVKCGVRGHTGPACNAGGMIDRTILGIQHLYRKPIYARTKPCSINSPGYG 237

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
           PL  DAPSWC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H  R   W+      L
Sbjct: 238 PLPPDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRTLHWMVPSTCFL 297

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMN 393
           +    +     + +NK LYT SY+CVT+GAA +VF+ IY LVD+   ++P L L W+GM+
Sbjct: 298 VL-GLVLDLLGMHVNKALYTFSYMCVTAGAAGIVFTGIYLLVDVCGFRWPMLVLEWMGMH 356

Query: 394 AMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           A+L++ +A   I    + G+Y+  P N +V
Sbjct: 357 ALLIFTLATSNILPVVLQGFYWKQPGNNIV 386


>gi|326512130|dbj|BAJ96046.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 188/439 (42%), Positives = 272/439 (61%), Gaps = 13/439 (2%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + +P  +D +     K QR+ASLD+FRGL VA+MILVD AGG WP I+HAPW G  +ADF
Sbjct: 36  LPQPPGADAKPGQQ-KPQRVASLDVFRGLTVAMMILVDDAGGAWPGINHAPWLGVTVADF 94

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           VMP FLFI+GV+ AL  K+ P++    KK   R +KL   G++LQGG+ H   +LTYGVD
Sbjct: 95  VMPAFLFIIGVSAALVFKKTPNKIATSKKAACRAIKLFILGVILQGGYIHGRHKLTYGVD 154

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
           +  IR  GVLQRIA+ Y L ++ EI+  +    D  V   S  + Y   W+MA  +  +Y
Sbjct: 155 LDQIRWLGVLQRIAIGYFLAAISEIWLVNNTSVDSPV---SFVKKYFMEWIMAIIISALY 211

Query: 196 LALLYGTYVPDWQFTI------INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           + L++G YVP+W+F +       +  S D G    + CG+   L PPCNAVG++DR +LG
Sbjct: 212 IGLVFGLYVPNWEFKVQTSSSTFSNPSNDVG-FKTIQCGLTGSLGPPCNAVGFVDRVLLG 270

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
            +H+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS++ + +S  +G+HFG
Sbjct: 271 ESHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAAVSCFVGLHFG 330

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           HV+IH K H  R+  W+     L + G  L     +P +K LYT+SY+ +T G +  V  
Sbjct: 331 HVLIHCKTHSQRMMSWLLASTVLTVSGFLLQLL-GMPFSKPLYTVSYMLLTGGVSGFVLL 389

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV-CFLFI 428
            +Y +VD+ ++K P +   W+GMNA++VYV+AA  +F   I G+Y+  P N LV     +
Sbjct: 390 LLYCIVDVIHIKKPLILFQWVGMNALIVYVLAACELFPTLIQGFYWRSPENNLVDATESL 449

Query: 429 ISYILHSFLWELRKFLYVQ 447
           +  I HS  W    F+ V+
Sbjct: 450 LQAIFHSKRWGTLAFVLVE 468


>gi|255548527|ref|XP_002515320.1| conserved hypothetical protein [Ricinus communis]
 gi|223545800|gb|EEF47304.1| conserved hypothetical protein [Ricinus communis]
          Length = 460

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 185/410 (45%), Positives = 264/410 (64%), Gaps = 9/410 (2%)

Query: 19  PDVSDQQEKSHLK----TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           P  S   E+  L      QRL SLD+FRGL +ALMILVD AGG +P I+H+PW G  LAD
Sbjct: 32  PSSSSSDEREALPPPTPNQRLMSLDVFRGLTIALMILVDDAGGAFPSINHSPWFGVTLAD 91

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           FVMPFFLF VGV+I+L  K+I  ++ A KKV+ RT+KL   G+LLQGG+ H  + LTYG+
Sbjct: 92  FVMPFFLFGVGVSISLVFKKISSKSVATKKVMLRTIKLFLLGVLLQGGYFHGRNHLTYGI 151

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           DV  IR  GVLQRI++ YL  S+ EI+  +    D  +   +  + Y   W+++  +  +
Sbjct: 152 DVLKIRWLGVLQRISIGYLFASISEIWLVNHCIVDSPL---AFMKKYYAQWMVSLILCSL 208

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGK-VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           Y  LLY  +VP+W+F   + +   YG     V CGVR  L PPCNAVG IDR +LG +H+
Sbjct: 209 YTCLLYFLFVPNWEFEASSINLFGYGSGTQTVICGVRGSLEPPCNAVGLIDRFLLGEHHL 268

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  P +RR+K C+ +SP  GPL  ++P WC APF+PEG+LSS+ + ++ ++G+ FGHV++
Sbjct: 269 YQRPVYRRTKQCSVNSPDYGPLPPNSPPWCLAPFDPEGILSSLMAAVTCLLGLQFGHVLV 328

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H K H+ R+  W+   F+LL+ G  L     IP +K LYTLSY C+T+GA+ L+ + I+ 
Sbjct: 329 HLKDHMQRILVWLISSFSLLVTGFVLKLI-GIPFSKPLYTLSYTCITTGASGLLLTIIFY 387

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
            VD+ + +     L W+GMNA+++Y +AA  +F   + G+Y+  P N LV
Sbjct: 388 AVDVKHFRKAIAILQWMGMNALIIYALAACDLFPAALQGFYWQSPENNLV 437


>gi|224131042|ref|XP_002320987.1| predicted protein [Populus trichocarpa]
 gi|222861760|gb|EEE99302.1| predicted protein [Populus trichocarpa]
          Length = 481

 Score =  369 bits (947), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 203/458 (44%), Positives = 283/458 (61%), Gaps = 16/458 (3%)

Query: 1   MSEIKAETTHHHP--------LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILV 52
           + E + E   H+P          I+    +     S   TQRL SLD+FRGL VALMILV
Sbjct: 9   LDERQREPLLHNPRSLSNEEEEEITNTPSTSSSNASPPPTQRLLSLDVFRGLTVALMILV 68

Query: 53  DHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKL 112
           D AGG +P I+H+PW G  LADFVMPFFLF+VGV+I+L  K++  +  A KKVI RT+KL
Sbjct: 69  DDAGGAFPCINHSPWFGVTLADFVMPFFLFVVGVSISLVFKKVSSKPMATKKVIQRTIKL 128

Query: 113 LFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV 172
              G+LLQGG+ H    LTYGVDV  IR  GVLQRI++ YL  ++ EI+  D    D  +
Sbjct: 129 FLLGLLLQGGYFHGRHNLTYGVDVGKIRWMGVLQRISIGYLFAAMSEIWLVDSITVDSPM 188

Query: 173 GRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD--SADYGKVFNVTCGVR 230
              +  + Y   W++A      Y+ LLYG YVPDW+F + + +    ++G    V CGVR
Sbjct: 189 ---AFVKKYYIQWMVAFLFCTFYMCLLYGLYVPDWEFEVPSTNLFEHEFGTKI-VNCGVR 244

Query: 231 AKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPE 290
             L PPCNAVG IDR  LG +H+Y HP +RR+K C+ +SP  GPL  ++P WC APF+PE
Sbjct: 245 GSLEPPCNAVGLIDRFFLGEHHLYQHPVYRRTKHCSVNSPDYGPLPPNSPGWCLAPFDPE 304

Query: 291 GLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQ 350
           G+LSS+ + ++  +G+ FGH+++H KGH+ RL  W    F +LI G        +PL K 
Sbjct: 305 GILSSLMAAITCFLGLQFGHILVHFKGHMQRLCLWSVCSFIILITGYVFELL-GVPLCKP 363

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFI 410
           LYTLSY+C+T+GA+ L  + I+ +VD+ + + P + L W+GMNA+++Y +AA  +F   I
Sbjct: 364 LYTLSYMCITAGASGLALTIIFYIVDVKHFRKPTMILQWMGMNALIIYALAACDLFPAAI 423

Query: 411 NGWYYGDPHNTLVCFL-FIISYILHSFLWELRKFLYVQ 447
            G+Y+G P N LV     +   +LHS  W    F+ V+
Sbjct: 424 QGFYWGSPENNLVDDTESLFQVMLHSKKWGTLVFVIVE 461


>gi|125582342|gb|EAZ23273.1| hypothetical protein OsJ_06967 [Oryza sativa Japonica Group]
          Length = 423

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 175/379 (46%), Positives = 254/379 (67%), Gaps = 6/379 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMI+VD AG   P ++H+PW+G  +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK + 
Sbjct: 47  LMIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVL 106

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YLL ++ EI+ K   D
Sbjct: 107 RALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDD 166

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
            D  +    + R Y +  ++A  +  +Y  +L G YVPDW++ I    S +  K F+V C
Sbjct: 167 VDCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRC 221

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P  GPL  DAPSWC APF
Sbjct: 222 GVRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPF 281

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F++L    ++ F   I +
Sbjct: 282 DPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRM 340

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           NK LYT+SY   TSGAA L+F+ IY LVD++  +   +P+ W+G +A+++YV+ A  I  
Sbjct: 341 NKPLYTISYALATSGAAGLLFAGIYTLVDVYGFRKLTIPMEWMGKHALMIYVLVACNILP 400

Query: 408 GFINGWYYGDPHNTLVCFL 426
            FI+G+Y+ +P N L+ F+
Sbjct: 401 IFIHGFYWREPKNNLLKFI 419


>gi|242071239|ref|XP_002450896.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
 gi|241936739|gb|EES09884.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
          Length = 455

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 192/398 (48%), Positives = 263/398 (66%), Gaps = 4/398 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMILVD  GG  P ISH+PW+G  LADFV PFFLFIVGV++A 
Sbjct: 60  RGQRLASLDVFRGITVVLMILVDDVGGLVPAISHSPWDGVTLADFVFPFFLFIVGVSLAF 119

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+P++  A KK + R  KL   G+LLQGG+ H   +L+YGVD+  IRL G+LQRIA+
Sbjct: 120 AYKRVPNKTLATKKALIRASKLFLLGLLLQGGYFHTIHDLSYGVDLHKIRLMGILQRIAI 179

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +Y  V+L EI+ +     D   G + + R Y     +   + V Y  LLYG YVPDW++ 
Sbjct: 180 AYFAVALCEIWLRG-GASDNGAGGYVLIRRYRHQLFVGLVLTVTYTVLLYGMYVPDWEYV 238

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           + + D+    K F V CGVR    P CNAVG IDR VLGI H+Y HP + ++  C+ +SP
Sbjct: 239 VTSPDTTL--KNFMVKCGVRGDTGPGCNAVGMIDRCVLGIQHLYAHPVYLKTAQCSINSP 296

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAP+WC APF+PEGLLSS+ +I++ +IG+  GHVI+H K H  R+ +W     
Sbjct: 297 RNGPLPSDAPTWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKQHSKRIVRWSIPSL 356

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI 390
            LLI G++L     + +NK LY+LSY CVT+G+A L F+ IY LVD++  K PF P+ W+
Sbjct: 357 ILLILGVSLDLF-GMHMNKSLYSLSYTCVTTGSAGLFFAGIYLLVDVYFYKKPFFPMEWV 415

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           G +A++ +V+ A  I    I+G+Y+ +P N L+ F+ I
Sbjct: 416 GKHALMFFVLVACNIAPILIHGFYWREPQNNLLKFIGI 453


>gi|218190872|gb|EEC73299.1| hypothetical protein OsI_07466 [Oryza sativa Indica Group]
          Length = 454

 Score =  368 bits (945), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 175/379 (46%), Positives = 254/379 (67%), Gaps = 6/379 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMI+VD AG   P ++H+PW+G  +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK + 
Sbjct: 78  LMIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVL 137

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YLL ++ EI+ K   D
Sbjct: 138 RALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDD 197

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
            D  +    + R Y +  ++A  +  +Y  +L G YVPDW++ I    S +  K F+V C
Sbjct: 198 VDCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRC 252

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P  GPL  DAPSWC APF
Sbjct: 253 GVRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPF 312

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F++L    ++ F   I +
Sbjct: 313 DPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRM 371

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           NK LYT+SY   TSGAA L+F+ IY LVD++  +   +P+ W+G +A+++YV+ A  I  
Sbjct: 372 NKPLYTISYALATSGAAGLLFAGIYTLVDVYGFRKLTIPMEWMGKHALMIYVLVACNILP 431

Query: 408 GFINGWYYGDPHNTLVCFL 426
            FI+G+Y+ +P N L+ F+
Sbjct: 432 IFIHGFYWREPKNNLLKFI 450


>gi|115446433|ref|NP_001046996.1| Os02g0526000 [Oryza sativa Japonica Group]
 gi|49388281|dbj|BAD25399.1| unknown protein [Oryza sativa Japonica Group]
 gi|49388287|dbj|BAD25402.1| unknown protein [Oryza sativa Japonica Group]
 gi|113536527|dbj|BAF08910.1| Os02g0526000 [Oryza sativa Japonica Group]
          Length = 376

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 174/378 (46%), Positives = 253/378 (66%), Gaps = 6/378 (1%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           MI+VD AG   P ++H+PW+G  +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK + R
Sbjct: 1   MIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
            LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YLL ++ EI+ K   D 
Sbjct: 61  ALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDDV 120

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG 228
           D  +    + R Y +  ++A  +  +Y  +L G YVPDW++ I    S +  K F+V CG
Sbjct: 121 DCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRCG 175

Query: 229 VRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFE 288
           VR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P  GPL  DAPSWC APF+
Sbjct: 176 VRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFD 235

Query: 289 PEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLN 348
           PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F++L    ++ F   I +N
Sbjct: 236 PEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRMN 294

Query: 349 KQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAG 408
           K LYT+SY   TSGAA L+F+ IY LVD++  +   +P+ W+G +A+++YV+ A  I   
Sbjct: 295 KPLYTISYALATSGAAGLLFAGIYTLVDVYGFRKLTIPMEWMGKHALMIYVLVACNILPI 354

Query: 409 FINGWYYGDPHNTLVCFL 426
           FI+G+Y+ +P N L+ F+
Sbjct: 355 FIHGFYWREPKNNLLKFI 372


>gi|357134575|ref|XP_003568892.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 495

 Score =  364 bits (935), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 187/449 (41%), Positives = 268/449 (59%), Gaps = 12/449 (2%)

Query: 6   AETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHA 65
           A   H +P   S         K   K  R+ASLD+FRGL VA+MILVD AGG WP I+HA
Sbjct: 22  ASEIHPYPESPSPRQPPGTDAKPERKPHRVASLDVFRGLTVAMMILVDDAGGAWPGINHA 81

Query: 66  PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           PW G  +ADFVMP FLFI+GV+ AL  KR  ++    KK  +R  KL   G++LQGG+ H
Sbjct: 82  PWLGVTVADFVMPAFLFIIGVSAALVFKRTQNKIATSKKAAYRAFKLFILGVILQGGYIH 141

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
               LTYGVD+  IR  GVLQRIA+ Y L ++ EI+  +    D  V   S  + Y   W
Sbjct: 142 GRHNLTYGVDLDHIRWLGVLQRIAIGYFLAAMSEIWLVNNISVDSPV---SFVKKYFMEW 198

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTI------INKDSADYGKVFNVTCGVRAKLNPPCNA 239
           +MA  +  +Y++L++G YVP+W+F +       +  S + G    V CG+R  L PPCNA
Sbjct: 199 VMAIMISALYISLIFGLYVPNWEFKVQTSNLTFSNGSNEIG-FKTVQCGLRGSLGPPCNA 257

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VG++DR +LG NH+Y +P ++R+K C+ +SP  G L  +AP WC APF+PEGLLS++ + 
Sbjct: 258 VGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGALPPNAPDWCLAPFDPEGLLSTLMAA 317

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           +S  +G+HFGHV+IH + H  R+  W+     L   G  L  +  +P +K LYT+SY+ +
Sbjct: 318 VSCFVGLHFGHVLIHCQNHSQRMLSWLLASTVLTASGFLLQLS-GMPFSKPLYTVSYMLL 376

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 419
           T G +  +   +Y +VD+  +K P +   W+GMNA++VYV+AA  +F   + G+Y+  P 
Sbjct: 377 TGGVSGFLLLLLYYIVDVIQIKKPLILFQWMGMNALIVYVLAACELFPTLLQGFYWRSPE 436

Query: 420 NTLV-CFLFIISYILHSFLWELRKFLYVQ 447
           N LV     ++  I  S  W    F+ V+
Sbjct: 437 NNLVDATQSLLQIIFQSKRWGTLAFVLVE 465


>gi|356527477|ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 463

 Score =  364 bits (935), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 200/427 (46%), Positives = 283/427 (66%), Gaps = 9/427 (2%)

Query: 1   MSEIKAETT----HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG 56
           +S   A+TT    H + +I  +  ++  Q +   K+ RL SLD+FRGL VALMILVD AG
Sbjct: 35  VSPTIAQTTPLHLHINNIIEEQHIIARHQPQPQPKSPRLVSLDVFRGLTVALMILVDDAG 94

Query: 57  GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWG 116
           G  P ++H+PWNG  LAD+VMPFFLFIVGV++AL  K++    DA +K   R LKLL  G
Sbjct: 95  GLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALTYKKLSCGVDASRKASLRALKLLVLG 154

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           + LQGG+ H  ++LTYGVD++ IR  G+LQRI ++YL+ +L EI+ K     D      S
Sbjct: 155 LFLQGGYFHRVNDLTYGVDLKQIRWMGILQRIGVAYLVAALCEIWLK---SDDTVNSGPS 211

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
           + R Y + W +A  +  +YL LLYG YVPDW + I  + S++  K F+V CGVR    P 
Sbjct: 212 LLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTEPSSE-PKTFSVKCGVRGNTGPA 270

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
           CNAVG IDR +LGI+H+Y  P + R   C+ +SP  GPL  DAP+WC APF+PEGLLSSV
Sbjct: 271 CNAVGMIDRTILGIHHLYQRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLSSV 330

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
            +I++ +IG+H+GH+I+H K H  R+  W+     L++FGL L     + +NK LY+LSY
Sbjct: 331 MAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLVVFGLALDLF-GMHINKVLYSLSY 389

Query: 357 VCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYG 416
            CVT+GAA ++F  IY +VD+   +   L L W+GM+A+++Y++AA  +F  F+ G+Y+G
Sbjct: 390 TCVTAGAAGILFVGIYLMVDVCGCRRMTLVLEWMGMHALMIYILAACNVFPIFLQGFYWG 449

Query: 417 DPHNTLV 423
            PHN ++
Sbjct: 450 SPHNNIL 456


>gi|388508176|gb|AFK42154.1| unknown [Lotus japonicus]
          Length = 467

 Score =  364 bits (934), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 192/404 (47%), Positives = 268/404 (66%), Gaps = 5/404 (1%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
             K   ++QRL S+D+FRGL VALMILVD AGG  P ++H+PW+G  +ADFVMP FLFIV
Sbjct: 67  NHKPQSQSQRLVSIDVFRGLTVALMILVDDAGGLLPALNHSPWDGLTIADFVMPLFLFIV 126

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           G+++AL  K++     A +K I R LKLL  G+ LQGG+ H  ++LT+GVD++ IRL G+
Sbjct: 127 GLSLALTYKKLSCPVIATRKAILRALKLLALGLFLQGGYFHRINDLTFGVDMKQIRLMGI 186

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA++YLL +L EI+ K     D      S+ R Y + W +A  +   YL LLYG YV
Sbjct: 187 LQRIAIAYLLTALCEIWLK---CDDIVKSGSSLLRKYRYQWAVAFVLSGFYLCLLYGLYV 243

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           PDW++  I  DS+   K F+V CGV A   P CN VG IDRK+LGI H+Y  P + R   
Sbjct: 244 PDWEYQ-IPTDSSSVPKTFSVKCGVWADTGPACNVVGMIDRKILGIQHLYRRPIYARMPE 302

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
           C+ +SP  GPL  DAP+WC APF+PEGLLSSV +I++ +IG+H+GH+I+H K H  R+  
Sbjct: 303 CSINSPDYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHYKDHRVRIIH 362

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           W+     L++FG  LH    + +NK LY+ SY CVT+GAA ++  AIY +VD+       
Sbjct: 363 WMIPTSCLIVFGFALHLF-GMHVNKVLYSFSYTCVTAGAAGILLVAIYLMVDVCGYSRVT 421

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
             + W+G +A+++YV+AA  IF  F+ G+Y+G+PHN ++  + I
Sbjct: 422 KVMEWMGKHALMIYVLAACNIFPIFLQGFYWGNPHNNILKLIGI 465


>gi|357149263|ref|XP_003575052.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 432

 Score =  364 bits (934), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 190/420 (45%), Positives = 270/420 (64%), Gaps = 14/420 (3%)

Query: 17  SEPDVSDQQEK--------SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
           + PD+     K        S    QRL SLD+FRG+ V LMI+VD AGG  P ++H+PW+
Sbjct: 17  TTPDLESGASKASPLPTPVSPAARQRLVSLDVFRGITVLLMIIVDDAGGFLPALNHSPWD 76

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
           G  + DFVMPFFLFIVGV++ LA KR+P+R +A KK + R LKL   G++LQGGF H   
Sbjct: 77  GVTIGDFVMPFFLFIVGVSLTLAYKRVPERLEATKKAVLRALKLFCLGLVLQGGFFHGVR 136

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            LT+GVD+  IRL G+LQRIA++YL+ ++ EI+ K   + D+ +    + R Y +   + 
Sbjct: 137 SLTFGVDITEIRLMGILQRIAIAYLIAAICEIWLKGNDEVDRGL---DLLRRYRYQLFVG 193

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             + V+Y  LLYG YVPDW++ I    S +  K   V CGVR    P CNAVG +DR +L
Sbjct: 194 LLLSVMYTVLLYGIYVPDWEYQITGPGSTE--KSLLVKCGVRGDTGPGCNAVGMVDRTML 251

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           GI+H+Y  P + R+K C+ D P  GPL  DAPSWC APF+PEGLLSSV +I++ ++G+ F
Sbjct: 252 GIDHLYRRPVYARTKECSIDYPENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLMGLQF 311

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           GHVIIH + H  R+  W+   F++L     + F   + +NK LYT+SY   T+GAA   F
Sbjct: 312 GHVIIHFEKHKERIINWLIPSFSMLALAFLMDFI-GMRMNKPLYTISYTFATAGAAGFFF 370

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           + IY LVD++  + P +P+ W+G +A+++YV+ A  I   FI+G+Y+ +P N L+ F+ I
Sbjct: 371 AGIYTLVDMYGFRKPTIPMEWLGKHALMIYVLVACNILPMFIHGFYWKEPKNNLLKFIGI 430


>gi|38346153|emb|CAE02025.2| OSJNBb0118P14.13 [Oryza sativa Japonica Group]
          Length = 415

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 189/415 (45%), Positives = 272/415 (65%), Gaps = 13/415 (3%)

Query: 15  IISEPDVSDQQEKSHL------KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
           +++ PD  +Q++   +      + Q L  + +F    + LMILVD AG   P I+H+PW+
Sbjct: 1   MVAWPDNRNQRQGIQVSVVDVAEGQWLTCVHLFMP-EMPLMILVDDAGAFLPAINHSPWD 59

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
           G  LADFVMPFFLFIVGVA+ALA KR+P++ +A +K I R LKL   G++LQGGF H   
Sbjct: 60  GVTLADFVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVR 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            LT+G+D+  IRL G+LQRIA++Y++ +L EI+ K   D D     F + +   +   + 
Sbjct: 120 SLTFGIDMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIG 176

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             V++ Y+  LYGTYVPDW++ I    S +  K F V C VR    P CNAVG IDRK+L
Sbjct: 177 LIVMITYMGFLYGTYVPDWEYRISVPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKIL 234

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           GI H+Y  P + RSK C+ +SP  GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +
Sbjct: 235 GIQHLYCRPVYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQY 294

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           GHVI+H + H  R+ +W+   F++LI   +L F   + +NK LYT+SY   T+GAA L+F
Sbjct: 295 GHVIVHFQKHKERIMKWLIPSFSMLILAFSLDFF-GMHMNKPLYTVSYALATAGAAGLLF 353

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           + IYALVD++  + P   + W+G +A+++YV+ A  I   FI+G+Y+ +P N L+
Sbjct: 354 AGIYALVDMYGHRRPTAVMEWMGTHALMIYVLIACNILPIFIHGFYWREPKNNLL 408


>gi|356534906|ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 489

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 206/440 (46%), Positives = 281/440 (63%), Gaps = 11/440 (2%)

Query: 13  PLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNL 72
           PL  S P  +D    S L  QRL+SLD+FRGL VALMILVD+ G  +P ++H+PW G  L
Sbjct: 36  PLPQSNP--TDTSSLS-LPNQRLSSLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGVTL 92

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           ADFVMPFFLF+VGV+I L  K++  + +A KKVI RTLKL   G+LLQGG+ H   +LTY
Sbjct: 93  ADFVMPFFLFVVGVSIGLVFKKVSSKPNATKKVISRTLKLFLLGLLLQGGYFHGHGKLTY 152

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           GVD+  IR  GVLQRI++ Y   S+ EI+  +      S   F   R Y   W+ +  + 
Sbjct: 153 GVDLSKIRWLGVLQRISIGYFFASISEIWLVNHNILVDSPAGF--VRKYSIQWMFSILLC 210

Query: 193 VVYLALLYGTYVPDWQFT----IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            VYL LLYG YVP+W+F     + + DS+    + NV C VR  L PPCN VG+IDR +L
Sbjct: 211 SVYLCLLYGLYVPNWKFKHSNLLSSSDSSHLSIIQNVHCEVRGSLEPPCNVVGFIDRLIL 270

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +HMY  P + R+K C+ +SP  GPL  D+P WC APF+PEG+LSS+ + ++  +G+ +
Sbjct: 271 GEDHMYQRPVYIRTKECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQY 330

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           GH+I+H +GH  R+  W    F+LL+ G  L     +PL+K LYTLSY C+T+GA+ LV 
Sbjct: 331 GHIIVHLQGHKQRVLLWSVFSFSLLLIGYILEIL-GMPLSKALYTLSYTCITAGASGLVL 389

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV-CFLF 427
           +AIY +VDI +L+ P + L W+GMNA++VY +AA  IF   I G+Y+  P N LV     
Sbjct: 390 TAIYYIVDIEHLRKPTVLLQWMGMNALVVYALAACDIFPAVIQGFYWHSPENNLVDASEA 449

Query: 428 IISYILHSFLWELRKFLYVQ 447
           ++  I HS  W    F+ V+
Sbjct: 450 LMQIIFHSKRWGTLAFVIVE 469


>gi|449446789|ref|XP_004141153.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 494

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 189/420 (45%), Positives = 278/420 (66%), Gaps = 9/420 (2%)

Query: 16  ISEPDVSDQQE---KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNL 72
           I EP  S       +S  +  RL SLD+FRG+ VALMI+VD+AGG  P I+H+PW+G  L
Sbjct: 79  IDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWDGLTL 138

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           AD VMPFFLFIVGV++ALA K+IP R  A +K + RTLKLLF G+ LQGGF H  + LTY
Sbjct: 139 ADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTY 198

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           GVD++ IR  G+LQRIA++Y L +L EI+ K     D      ++ R Y    + A  + 
Sbjct: 199 GVDIQQIRWMGILQRIAIAYFLAALCEIWLK---GSDYVNSETALRRKYQLQLVAAVVLT 255

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYG--KVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           ++YLAL YG YVPDW++ + +  ++D    K+F+V CG R    P CNAVG IDRK+ GI
Sbjct: 256 MLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTGPACNAVGMIDRKIFGI 315

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y  P + R++ C+ ++P  GPL  DAPSWC APF+PEGLLS+V ++++ ++G+H+GH
Sbjct: 316 QHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGH 375

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +I+H K H  R+  W+     L++  + L F   + +NK LYT+SY+ VT+GAA L+F+ 
Sbjct: 376 IIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFL-GMHINKVLYTVSYMSVTAGAAGLLFTG 434

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFIIS 430
           IY +VD+++ +   + + W+G +A+++YV+AA  +    + G+Y G P N ++  + + S
Sbjct: 435 IYLMVDVYSWRRMNVVMEWMGKHALVIYVLAACNVLPVILQGFYLGQPQNNILRLIGVPS 494


>gi|356569086|ref|XP_003552737.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 461

 Score =  361 bits (926), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 201/420 (47%), Positives = 281/420 (66%), Gaps = 9/420 (2%)

Query: 6   AETT--HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
            +TT  H H +I  +  +S  Q +   K+ RL SLD+FRGL VALMILVD AGG  P ++
Sbjct: 42  GQTTPLHIHNIIEEQRIISRHQPQP--KSPRLVSLDVFRGLTVALMILVDDAGGLIPALN 99

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
           H+PWNG  LAD+VMPFFLFIVGV++AL+ K++    DA +K   R LKLL  G+ LQGG+
Sbjct: 100 HSPWNGLTLADYVMPFFLFIVGVSLALSYKKLSCGVDASRKASLRALKLLALGLFLQGGY 159

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
            H  ++LT+GVD++ IR  G+LQRIA++YL+V+L EI+ K     D      S+ R Y +
Sbjct: 160 FHRVNDLTFGVDIKQIRWMGILQRIAVAYLVVALCEIWLK---SDDTVNSGPSLLRKYRY 216

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
            W +A  +  +YL LLYG YVPDW + I  + SA+  K F+V CGVR    P CN VG I
Sbjct: 217 QWAVALILSFLYLCLLYGLYVPDWVYQIQTEPSAE-PKTFSVKCGVRGNTGPACNVVGMI 275

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR +LGI H+Y  P + R   C+ +SP  GPL  DAP+WC APF+PEGLLSSV +I++ +
Sbjct: 276 DRMILGIQHLYKRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCL 335

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           IG+H+GH+I+H K H  R+  W+     LL+FGL L     + +NK LY+LSY CVT+GA
Sbjct: 336 IGLHYGHIIVHFKDHRVRIIYWMIPTSCLLVFGLALDLF-GMHINKVLYSLSYTCVTAGA 394

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           A ++F  IY +VD+   +   L + W+GM+A+++Y++AA  +F  F+ G+Y+G PHN ++
Sbjct: 395 AGVLFVGIYLMVDVCGCRRMTLVMEWMGMHALMIYILAACNVFPIFLQGFYWGSPHNNIL 454


>gi|168007055|ref|XP_001756224.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692734|gb|EDQ79090.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 411

 Score =  361 bits (926), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 193/393 (49%), Positives = 256/393 (65%), Gaps = 20/393 (5%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRAD-AVKKVIF 107
           MILVD+AGG WP I+H+PW+G  LADFV+PFFLFIVGVA+AL  K+I +    A +K I 
Sbjct: 1   MILVDYAGGIWPAINHSPWDGVTLADFVLPFFLFIVGVALALTYKKIINEKQLASQKAIG 60

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT----- 162
           R+LKL+  G+ +QGG+ H     +YGVD+  IR CGVLQRIAL+Y++V+L EI+      
Sbjct: 61  RSLKLVIVGLFIQGGYFHGVHNTSYGVDLESIRWCGVLQRIALAYMVVALCEIWAPRGHY 120

Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
             +    +S  RF  FR       +AA ++ +YL LLYG YVPDW+F      SA    V
Sbjct: 121 DSMNVYIKSTRRFGTFRA------VAAAIVAIYLVLLYGVYVPDWEFV-----SAADSTV 169

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
           F V CGVR  + P CN VGY+DR +LG++H+Y    +RR+ AC+  SP  GPL   AP W
Sbjct: 170 FQVKCGVRGDVGPSCNVVGYLDRTLLGLSHLYQKAVYRRAPACSVLSPDYGPLPAGAPVW 229

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT 342
           C APF+PEGLLSS+S+I+S  +G+HFGHV++H K H ARLK WV M   LL+ G  LH  
Sbjct: 230 CKAPFDPEGLLSSMSAIVSCFLGLHFGHVLVHHKEHNARLKDWVLMSLTLLVTGALLHVL 289

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             +P NK LY++SY+  T GAA LVF+  Y LVD+   +   + L W+G +AM++YV+ A
Sbjct: 290 -GMPWNKPLYSVSYMLFTGGAAGLVFAGYYFLVDVHGWRSSTILLEWLGQHAMVIYVLVA 348

Query: 403 EGIFAGFINGWYYGDPHNTLV--CFLFIISYIL 433
           EG+F   + G Y G P N LV  C L  +S I+
Sbjct: 349 EGVFIAALQGLYVGSPENNLVRPCVLTTVSIII 381


>gi|186530230|ref|NP_199601.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332008203|gb|AED95586.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 440

 Score =  360 bits (924), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 192/394 (48%), Positives = 263/394 (66%), Gaps = 7/394 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K H  RL QW+   
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRS 340

Query: 330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAW 389
           F LL+          + LNK LYTLSY+CVTSGA+  + SAIY +VD++  K   L L W
Sbjct: 341 FCLLML-GLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVYGYKRASLVLEW 399

Query: 390 IGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           +G++A+ +YV+ A  +    I+G+Y+ +P N L+
Sbjct: 400 MGIHALPIYVLIACNLVFLIIHGFYWKNPINNLL 433


>gi|357442361|ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355480506|gb|AES61709.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 476

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 203/448 (45%), Positives = 278/448 (62%), Gaps = 16/448 (3%)

Query: 2   SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
           S I   T H + L    P VS       +  QRL SLD+FRGL VALMILVD  G  +P 
Sbjct: 23  SSILTLTVHENEL----PPVS-------VPNQRLVSLDVFRGLTVALMILVDDVGRAFPS 71

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG 121
           ++H+PW G  LADFVMPFFLF VGV+IAL  K++  + +A KK+I RT+KL   G+LLQG
Sbjct: 72  LNHSPWFGVTLADFVMPFFLFGVGVSIALVFKKVSSKQNATKKIISRTIKLFLLGLLLQG 131

Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
           G+ H    LTYG+D+  +R  GVLQRI++ Y L S+ EI+  +      S   F   R Y
Sbjct: 132 GYFHGRGNLTYGLDLTKLRWFGVLQRISIGYFLASMSEIWLVNGNILVDSPAAF--VRKY 189

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD-SADYGKVFNVTCGVRAKLNPPCNAV 240
              W+ +  +  VYL LLYG YVP+W+F   N         + NV C +R  L+PPCNAV
Sbjct: 190 SIQWIFSILLCSVYLCLLYGLYVPNWEFEHSNLLWPGRVSTIQNVHCDMRGSLDPPCNAV 249

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           G+IDR +LG +HMY  P +RR+K C+ +SP  GPL  D+P WC APF+PEG+LSS+ + +
Sbjct: 250 GFIDRLILGEDHMYQRPVYRRTKECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAI 309

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  +G+ FGH+++  + H  R+  W    F+LL+ G  L     IPL+K LYTLS++ +T
Sbjct: 310 TCFVGLQFGHILVIFQAHKQRVLLWSVFSFSLLVVGYVLEIL-GIPLSKALYTLSFMFIT 368

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420
           +GA+ LV +AIY +VDI  L+ P + L W+GMNA++VY +AA  IF   I G+Y+  P N
Sbjct: 369 AGASGLVLTAIYYIVDIKQLRKPTVLLQWMGMNALIVYALAACDIFPAVIQGFYWRSPEN 428

Query: 421 TLV-CFLFIISYILHSFLWELRKFLYVQ 447
            LV     +I  ILHS  W    F+ ++
Sbjct: 429 NLVDASEALIQNILHSEKWGTLAFVIIE 456


>gi|212724122|ref|NP_001131867.1| uncharacterized protein LOC100193245 [Zea mays]
 gi|194692766|gb|ACF80467.1| unknown [Zea mays]
 gi|413948803|gb|AFW81452.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
 gi|413948804|gb|AFW81453.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
          Length = 492

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 180/419 (42%), Positives = 263/419 (62%), Gaps = 14/419 (3%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
            H PL     D +  Q +   K +R+ASLD+FRG  VA+MILVD AGG WP I+HAPW G
Sbjct: 38  QHPPL-----DAAATQLEEQRKPERVASLDVFRGFTVAMMILVDDAGGAWPGINHAPWFG 92

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
             +ADFVMP FLFI+GV+ AL  K++ ++  A KK   R  KL   G++LQGG+ H   +
Sbjct: 93  VTVADFVMPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHK 152

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD+  IR  GVLQRIA+ Y + ++ EI+  +    D  V      + Y   W MA 
Sbjct: 153 LTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPV---PFVKKYFIEWFMAI 209

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
            + V+Y+AL++G YV +W+F I   +S     ++  +   + CGVR  L PPCNAVG +D
Sbjct: 210 AITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVD 269

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS++ ++++  +
Sbjct: 270 RVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFV 329

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+ FGHV+IH K H  R+  W+ +   +L     L     +P +K LYT++Y+ +T G +
Sbjct: 330 GLFFGHVLIHCKNHSQRMLIWL-LASVVLTISAYLVLLLGMPFSKPLYTVNYMLLTGGVS 388

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
             +   +Y +VD+ ++K PF+   W+GMNA++VYV+AA  +F   I G+Y+  P N LV
Sbjct: 389 GFLLLLLYYIVDVIHIKKPFVLFQWMGMNALIVYVLAACELFPTLIQGFYWRSPENNLV 447


>gi|195652797|gb|ACG45866.1| hypothetical protein [Zea mays]
          Length = 492

 Score =  357 bits (917), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 180/419 (42%), Positives = 261/419 (62%), Gaps = 14/419 (3%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
            H PL     D +  Q +   K +R+ASLD+FRG  VA+ ILVD AGG WP I+HAPW G
Sbjct: 38  QHPPL-----DAAATQLEEQRKPERVASLDVFRGFTVAMXILVDDAGGAWPGINHAPWFG 92

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
             +ADFVMP FLFI+GV+ AL  K++ ++  A KK   R  KL   G++LQGG+ H   +
Sbjct: 93  VTVADFVMPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHK 152

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD+  IR  GVLQRIA+ Y + ++ EI+  +    D  V      + Y   W MA 
Sbjct: 153 LTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPV---PFVKKYFIEWFMAI 209

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
            + V+Y+AL++G YV +W+F I   +S     ++  +   + CGVR  L PPCNAVG +D
Sbjct: 210 AITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVD 269

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS++ ++++  +
Sbjct: 270 RVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFV 329

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+ FGHV+IH K H  R+  W+     L I    L     +P +K LYT++Y+ +T G +
Sbjct: 330 GLFFGHVLIHCKNHSQRMLIWLLASVVLTI-SAYLVLLLGMPFSKPLYTVNYMLLTGGVS 388

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
             +   +Y +VD+ ++K PF+   W+GMNA++VYV+AA  +F   I G+Y+  P N LV
Sbjct: 389 GFLLLLLYYIVDVIHIKKPFVLFQWMGMNALIVYVLAACELFPTLIQGFYWRSPENNLV 447


>gi|255543288|ref|XP_002512707.1| conserved hypothetical protein [Ricinus communis]
 gi|223548668|gb|EEF50159.1| conserved hypothetical protein [Ricinus communis]
          Length = 426

 Score =  357 bits (916), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 176/342 (51%), Positives = 235/342 (68%), Gaps = 4/342 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL SLD+FRGL VALMILVD+AGG  P I+H+PWNG  LAD VMPFFLFIVGV++ L  
Sbjct: 50  HRLLSLDVFRGLTVALMILVDYAGGILPAINHSPWNGLTLADLVMPFFLFIVGVSLGLTY 109

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K++P +A A +K I RTLKLL  G  LQGG+ H  ++LTYGV+V  +RL G+LQRIA++Y
Sbjct: 110 KKLPCKAVATRKAILRTLKLLTLGFFLQGGYLHGLNDLTYGVNVEKLRLMGILQRIAIAY 169

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+ +L EI+ K     D      S+ R Y + W MA  ++  YL+L+YG YVPDW++ I 
Sbjct: 170 LVGALCEIWLKGDDHVDSCS---SLLRKYRFQWAMALVLISTYLSLIYGLYVPDWEYQIP 226

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
            + S+   K+F V CGVR    P CNAVG IDR  LGI H+Y  P + R+K C+ +SP  
Sbjct: 227 AEASSSPAKIFLVKCGVRGNTGPACNAVGLIDRTTLGIQHLYGKPVYARTKLCSINSPDY 286

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEG+LSSV ++++ +IG+H+GH+I+H K H  R+  W+     L
Sbjct: 287 GPLPADAPSWCQAPFDPEGILSSVMAVVTCLIGLHYGHIIVHFKDHRNRMLHWMIPSICL 346

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +  GL L F   + +NK LY+ SY+ VT+GAA ++F+ IY L
Sbjct: 347 IGLGLALDFL-GMHVNKALYSFSYMSVTAGAAGILFTGIYKL 387


>gi|297791891|ref|XP_002863830.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309665|gb|EFH40089.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 432

 Score =  355 bits (910), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 196/413 (47%), Positives = 265/413 (64%), Gaps = 14/413 (3%)

Query: 14  LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           L IS P +   +E       RL SLD+FRGL VALMILVD  G   P I+H+PW+G  LA
Sbjct: 24  LQISRPSLPPDKE-------RLVSLDVFRGLTVALMILVDDVGEILPSINHSPWDGVTLA 76

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           DFVMPFFLFIVGV++A A K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG
Sbjct: 77  DFVMPFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYG 136

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +DV  IR  G+LQRIA++YL+ +L EI+ K   +    +   S+ + Y +HW++A  +  
Sbjct: 137 IDVEKIRFMGILQRIAIAYLVAALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITT 193

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +YL+LLYG YV DW++ I  +D       F    V CGVR    P CNAVG +DR  LGI
Sbjct: 194 IYLSLLYGLYVSDWEYQISTEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGI 253

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y  P + R+K C+  SP  GPL  DAPSWC APF+PEGLLSS+ +I++ ++G+H+GH
Sbjct: 254 QHLYRKPVYARTKQCSISSPNNGPLPPDAPSWCQAPFDPEGLLSSLMAIVTCLVGLHYGH 313

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +IIH K H  RL QW+   F LL+          + LNK LYTLSY+CVTSGA+  + SA
Sbjct: 314 IIIHFKDHKKRLNQWILRSFCLLML-GLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSA 372

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           IY +VD++  K   L L W+G++A+ +YV+ A  +    I+G+Y+  P N L+
Sbjct: 373 IYLMVDVYGYKRASLVLEWMGIHALPIYVLIACNLVFLIIHGFYWKKPINNLL 425


>gi|115462187|ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group]
 gi|54291854|gb|AAV32222.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578244|dbj|BAF16607.1| Os05g0155700 [Oryza sativa Japonica Group]
 gi|215694847|dbj|BAG90038.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218196128|gb|EEC78555.1| hypothetical protein OsI_18526 [Oryza sativa Indica Group]
 gi|222630256|gb|EEE62388.1| hypothetical protein OsJ_17178 [Oryza sativa Japonica Group]
          Length = 491

 Score =  353 bits (905), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 186/462 (40%), Positives = 277/462 (59%), Gaps = 24/462 (5%)

Query: 6   AETTHHHPLIISEPDVSD------------QQEKSHLKTQRLASLDIFRGLAVALMILVD 53
           A+  H  PL+ S  D  +                +  K +R+ASLD+FRGL VA+MILVD
Sbjct: 14  ADAGHRRPLLASADDDDEIRPYPASSPSPQHPAGAERKPRRVASLDVFRGLTVAMMILVD 73

Query: 54  HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLL 113
            AGG WP ++H+PW G  +ADFVMP FLFI+GV+ AL  K+ P++  A KK   R +KL 
Sbjct: 74  DAGGAWPGMNHSPWLGVTVADFVMPAFLFIIGVSAALVFKKTPNKTVATKKAAIRAIKLF 133

Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
             G++LQGG+ H    LTYG+D+  IR  GVLQRIA+ Y L ++ EI+  +    D ++ 
Sbjct: 134 ILGVILQGGYIHGRHNLTYGIDLDHIRWLGVLQRIAIGYFLAAISEIWLVNNISVDSAI- 192

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-------ADYGKVFNVT 226
             S  + Y   W++A  +  +Y+ LL G YV +W+F +   +S        +   +  + 
Sbjct: 193 --SFVKKYFMEWIVAVMISALYVGLLLGLYVSNWEFKVQTSNSILTIPTPGNEIGMKMIQ 250

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           CGVR  L PPCNAVG++DR +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC AP
Sbjct: 251 CGVRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGPLPPNAPDWCLAP 310

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS++ + ++  +G+HFGHV++H K H  R+  W+     L + G  L     +P
Sbjct: 311 FDPEGLLSTLMAAVTCFVGLHFGHVLVHCKDHSPRMLLWLLASTVLTVSGFLLQLL-GMP 369

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIF 406
            +K LYT+SY+ +T G +  +   +Y +VD+ N+K PF+   W+GMNA++VYV+AA  IF
Sbjct: 370 FSKPLYTVSYMLLTGGVSGFLLLLLYYIVDVINIKKPFILFQWMGMNALIVYVLAACEIF 429

Query: 407 AGFINGWYYGDPHNTLVCFL-FIISYILHSFLWELRKFLYVQ 447
              + G+Y+  P N LV     ++  I HS  W    F+ ++
Sbjct: 430 PTLVQGFYWRSPENNLVDLTESLLQTIFHSKRWGTLAFVVLE 471


>gi|32487909|emb|CAE05368.1| OJ000315_02.13 [Oryza sativa Japonica Group]
          Length = 452

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 189/439 (43%), Positives = 272/439 (61%), Gaps = 37/439 (8%)

Query: 15  IISEPDVSDQQEKSHL------KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
           +++ PD  +Q++   +      + Q L  + +F    + LMILVD AG   P I+H+PW+
Sbjct: 1   MVAWPDNRNQRQGIQVSVVDVAEGQWLTCVHLFMP-EMPLMILVDDAGAFLPAINHSPWD 59

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
           G  LADFVMPFFLFIVGVA+ALA KR+P++ +A +K I R LKL   G++LQGGF H   
Sbjct: 60  GVTLADFVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVR 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            LT+G+D+  IRL G+LQRIA++Y++ +L EI+ K   D D     F + +   +   + 
Sbjct: 120 SLTFGIDMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIG 176

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             V++ Y+  LYGTYVPDW++ I    S +  K F V C VR    P CNAVG IDRK+L
Sbjct: 177 LIVMITYMGFLYGTYVPDWEYRISVPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKIL 234

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL--------------- 293
           GI H+Y  P + RSK C+ +SP  GPLR DAPSWC APF+PEGLL               
Sbjct: 235 GIQHLYCRPVYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLRLQQYNISFANFAKF 294

Query: 294 ---------SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA 344
                    SSV +I++ +IG+ +GHVI+H + H  R+ +W+   F++LI   +L F   
Sbjct: 295 SLFFLDSRISSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLILAFSLDFF-G 353

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEG 404
           + +NK LYT+SY   T+GAA L+F+ IYALVD++  + P   + W+G +A+++YV+ A  
Sbjct: 354 MHMNKPLYTVSYALATAGAAGLLFAGIYALVDMYGHRRPTAVMEWMGTHALMIYVLIACN 413

Query: 405 IFAGFINGWYYGDPHNTLV 423
           I   FI+G+Y+ +P N L+
Sbjct: 414 ILPIFIHGFYWREPKNNLI 432


>gi|224072443|ref|XP_002303734.1| predicted protein [Populus trichocarpa]
 gi|222841166|gb|EEE78713.1| predicted protein [Populus trichocarpa]
          Length = 381

 Score =  348 bits (893), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 180/381 (47%), Positives = 253/381 (66%), Gaps = 4/381 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMILVD AGG  P I+H+PWNG  LAD VMPFFLF+VGV++ L  K++P +A A +K I 
Sbjct: 3   LMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVSLGLTYKKLPSKAVATRKAIL 62

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKLL  G+ LQGGF H  ++LT+GVD+  IR  G+LQRIA+ YL+ ++ EI+ K   D
Sbjct: 63  RALKLLVIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQRIAIGYLIGAMCEIWLKG--D 120

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
              + G  S+ R Y   W     ++ +YL+LLYG YVPDW++ I    S+   K+F V C
Sbjct: 121 NHVASG-LSMLRKYQLQWGAVVVLVSLYLSLLYGLYVPDWEYEIPVAASSSSPKIFRVKC 179

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR      CNAVG IDR VLGI H+Y  P + R+KAC+ +SP  GPL  DAPSWC APF
Sbjct: 180 GVRGTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSPDYGPLPPDAPSWCQAPF 239

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ ++G+H+GH+I+H K H  R+  W+      ++ GL L  +  + +
Sbjct: 240 DPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPSTCFVVLGLVLDLS-GMHV 298

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           NK LYT SY+CVT+GAA +VF+ IY LVD+   + P L L W+GM+A++++++A   +  
Sbjct: 299 NKALYTFSYMCVTAGAAGIVFTGIYMLVDVCGFRRPTLVLEWMGMHALMIFILATSNVLP 358

Query: 408 GFINGWYYGDPHNTLVCFLFI 428
             + G+Y+  P N ++  + I
Sbjct: 359 VVMQGFYWKQPGNNILRLIGI 379


>gi|224080634|ref|XP_002306188.1| predicted protein [Populus trichocarpa]
 gi|222849152|gb|EEE86699.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score =  345 bits (885), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 179/420 (42%), Positives = 262/420 (62%), Gaps = 20/420 (4%)

Query: 11  HHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
           H PL+    D+ +Q   S  KT R+ASLD+FRGL V LM+LVD+ G   P I+H+PWNG 
Sbjct: 6   HKPLL----DIEEQPRTSK-KTPRVASLDVFRGLCVFLMMLVDYGGAIVPIIAHSPWNGL 60

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           +LADFVMPFFLFI GV++AL  KR+P+R +A +K + R ++L   G++LQGG+ H  + L
Sbjct: 61  HLADFVMPFFLFIAGVSLALVYKRVPNRIEATRKAVLRAVELFLLGVILQGGYFHGINFL 120

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
           TYGVD++ IR  G+LQRI++ Y+  +L EI+      +D      S  + Y WHW  A  
Sbjct: 121 TYGVDMKRIRWLGILQRISIGYIFAALCEIWLSCRSRRD-----VSFLKSYYWHWGAAFS 175

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVF---NVTCGVRAKLNPPCNAVGYI 243
           +  +YL LLYG YVPDWQF + N  S+    ++  V+    V C VR  L P CN+ G I
Sbjct: 176 LSAIYLGLLYGLYVPDWQFEMSNATSSVFPTNHSYVYMLTQVKCSVRGDLGPACNSAGMI 235

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR VLGI+H+Y  P +R  K C   +   G + + APSWCHAPF+PEG+LSS+++ ++ I
Sbjct: 236 DRYVLGIDHLYKKPVYRNLKECNMST--NGQVPESAPSWCHAPFDPEGVLSSITAAVACI 293

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           IG+ +GH + H + H  R++ W+    +LL+ GL L      P+NK LYT  Y+ +T  +
Sbjct: 294 IGLQYGHSLAHLQDHKQRMQNWILFSLSLLLVGLLLAVVGD-PVNKSLYTFGYMLITCAS 352

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           A + +SAIY LVD++  +     L W+G +++ ++V+    +    I G+Y+  P N L+
Sbjct: 353 AGITYSAIYLLVDVYGYRCLTFALEWMGKHSLSIFVLITSNLAVIAIQGFYWKAPENNLI 412


>gi|356516509|ref|XP_003526936.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 416

 Score =  338 bits (867), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 173/410 (42%), Positives = 252/410 (61%), Gaps = 12/410 (2%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           +P + +  E +  +  R+ASLD+FRGL+V LMI VD+A   +P I+HAPWNG +LADFVM
Sbjct: 5   QPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGIHLADFVM 64

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           PFFLFI G+++AL  KR P R  A  K   R L L   GILLQGG+ H    LT+GVD++
Sbjct: 65  PFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQ 124

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            IR  G+LQRI++ Y++ +L EI+    + K+         + Y W W +A  +L +Y  
Sbjct: 125 RIRWLGILQRISIGYIVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSG 179

Query: 198 LLYGTYVPDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           LLYG YVPDWQF +    S+      G ++ V C VR  L P CN+ G IDR +LG++H+
Sbjct: 180 LLYGLYVPDWQFDVSASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHL 239

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  P +R  K C   +  +G +   +PSWCHAPF+PEG+LSS+++ +S IIG+ +GHV+ 
Sbjct: 240 YRKPVYRNLKGCNMSA--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLA 297

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H + H  RL  W+    + L  GL L     IPLNK LYT+SY+ +TS A+ L F A+Y 
Sbjct: 298 HLQDHKGRLYNWMCFSLSFLALGLFLALI-GIPLNKSLYTVSYMLLTSAASGLTFIALYF 356

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           LVD+   +     L W+G +++ ++V+ +  +    + G+Y+  P N ++
Sbjct: 357 LVDVHGHRRLTALLEWMGKHSLSIFVIVSSNLAVIAVQGFYWTKPENNII 406


>gi|413920627|gb|AFW60559.1| hypothetical protein ZEAMMB73_831897 [Zea mays]
          Length = 343

 Score =  336 bits (861), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 171/308 (55%), Positives = 208/308 (67%), Gaps = 4/308 (1%)

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL Y  V+L+E  T  V+      G ++IF  Y W WL      VVY+   +  YVP
Sbjct: 19  QRIALVYFFVALIEALTVKVRPTTVRSGPYAIFDAYRWQWLGGLVAFVVYMVTTFSLYVP 78

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           DW F   N+   + GK F V CGVRA L   CNAVGY+DR+V GINH+Y  P W RSK C
Sbjct: 79  DWSFVYHNEGDVNDGKQFTVKCGVRASLEQACNAVGYVDRQVWGINHLYTQPVWIRSKDC 138

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
           T  SP  GPLR DAP+WC APFEPEGLLSS+SS+LS  IG+H+GHV+IH K H  RLK W
Sbjct: 139 TSSSPNMGPLRSDAPAWCLAPFEPEGLLSSISSVLSGTIGIHYGHVLIHFKTHKERLKHW 198

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
           +  GF+LL+ G+ LHFTNAIP+NKQLY+ SYVC T GAA +V SA Y L+D+W L+ PFL
Sbjct: 199 LLTGFSLLVLGIILHFTNAIPINKQLYSFSYVCFTGGAAGIVLSAFYILIDVWGLRTPFL 258

Query: 386 PLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLF--IISYILHSFLWELRKF 443
            L WIGMNAMLV+V+ A+GI A F+NGWYYG P NTLV ++   +   + HS    L   
Sbjct: 259 FLEWIGMNAMLVFVLGAQGILAAFVNGWYYGSPDNTLVNWIVKHVFVNVWHSQ--RLGTL 316

Query: 444 LYVQFCNL 451
           LYV FC +
Sbjct: 317 LYVMFCEI 324


>gi|147844298|emb|CAN82113.1| hypothetical protein VITISV_031338 [Vitis vinifera]
          Length = 401

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 169/344 (49%), Positives = 228/344 (66%), Gaps = 20/344 (5%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA+MILVD AGG  P I+H+PWNG  LADFVMPFFLFIVGV++ALA 
Sbjct: 51  RRLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAY 110

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +     A K              +  GG+ H  + LTYGVD+  IRL G+LQRIA++Y
Sbjct: 111 KNLSSGYLATK--------------MASGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAY 156

Query: 153 LLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
            L ++ EI+ K D   K  S    S+ + Y + W +   + V Y +LLYG YVPDW+++I
Sbjct: 157 FLAAVCEIWLKGDXNVKSGS----SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSI 212

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
            ++ S+   K+F V CGVR+   P CNAVG IDR VLGI H+Y  P + R K C+ +SP 
Sbjct: 213 PSETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPD 272

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
            GPL  +AP+WC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H  R+  W+     
Sbjct: 273 YGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSC 332

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
           LL+ G  L F   + +NK LYTLSY+CVT+GAA ++F+ IY +V
Sbjct: 333 LLVLGFALDFF-GMHVNKALYTLSYMCVTAGAAGILFAGIYLMV 375


>gi|224103167|ref|XP_002312951.1| predicted protein [Populus trichocarpa]
 gi|222849359|gb|EEE86906.1| predicted protein [Populus trichocarpa]
          Length = 419

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 169/417 (40%), Positives = 263/417 (63%), Gaps = 17/417 (4%)

Query: 11  HHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
           H PL+    D+ +Q   S  K  R ASLD+FRGL V LM+LVD+ G   P I+H+PWNG 
Sbjct: 6   HKPLL----DIEEQLHTSK-KPPRAASLDVFRGLCVFLMMLVDYGGAIIPIIAHSPWNGL 60

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           +LAD VMPFFLFI GV++AL  K++P+R +A  K + + +KL   G+++QGG+ H  + L
Sbjct: 61  HLADSVMPFFLFIAGVSLALVYKKVPNRIEATWKAVLKAIKLFLLGVVIQGGYFHGINSL 120

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
           TYGVD++ IR  G+LQ+I++ Y++ +L EI+      +       S  + Y WHW +A  
Sbjct: 121 TYGVDMKRIRWLGILQKISVGYIVAALCEIWLSCRTRRG-----VSFLKSYYWHWCVAFS 175

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +  +YL LLYG YVPDWQF + N  S+    ++  V+ V C +R  L P CN+ G IDR 
Sbjct: 176 LSAIYLGLLYGLYVPDWQFEMSNATSSVFPTNHSNVYMVKCSLRGDLGPACNSAGMIDRY 235

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +LGI+H+Y  P +R  K C   +  +G +  ++ SWCHAPF+PEG+LSS+++ ++ IIG+
Sbjct: 236 ILGIDHLYKKPVYRNLKECNMST--DGQVPDNSASWCHAPFDPEGVLSSLTAAVTCIIGL 293

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            +GH++ H + H  R++ W    F+LL+ GL L      P+NK LYT SY+ +TS +A +
Sbjct: 294 QYGHLLAHLQDHKGRMENWTLFSFSLLVVGLLLVVIGD-PVNKSLYTFSYMLITSASAGI 352

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
            +SA+Y LVD+++ +     L W+G +++ ++V+ +  +    I G+ +  P N ++
Sbjct: 353 TYSALYLLVDVYDYRCLTFVLEWMGKHSLSIFVLVSSNLAVITIQGFCWAAPENNMI 409


>gi|449528551|ref|XP_004171267.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 380

 Score =  335 bits (858), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 173/384 (45%), Positives = 258/384 (67%), Gaps = 6/384 (1%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           MI+VD+AGG  P I+H+PW+G  LAD VMPFFLFIVGV++ALA K+IP R  A +K + R
Sbjct: 1   MIVVDYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
           TLKLLF G+ LQGGF H  + LTYGVD++ IR  G+LQRIA++Y L +L EI+ K     
Sbjct: 61  TLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLK---GS 117

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG--KVFNVT 226
           D      ++ R Y    + A  + ++YLAL YG YVPDW++ + +  ++D    K+F+V 
Sbjct: 118 DYVNSETALRRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVK 177

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           CG R    P CNAVG IDRK+ GI H+Y  P + R++ C+ ++P  GPL  DAPSWC AP
Sbjct: 178 CGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAP 237

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS+V ++++ ++G+H+GH+I+H K H  R+  W+     L++  + L F   + 
Sbjct: 238 FDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFL-GMH 296

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIF 406
           +NK LYT+SY+ VT+GAA L+F+ IY +VD+++ +   + + W+G +A+++YV+AA  + 
Sbjct: 297 INKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYSWRRMNVVMEWMGKHALVIYVLAACNVL 356

Query: 407 AGFINGWYYGDPHNTLVCFLFIIS 430
              + G+Y G P N ++  + + S
Sbjct: 357 PVILQGFYLGQPQNNILRLIGVPS 380


>gi|449458622|ref|XP_004147046.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449489633|ref|XP_004158370.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 418

 Score =  333 bits (855), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 175/413 (42%), Positives = 249/413 (60%), Gaps = 15/413 (3%)

Query: 17  SEPDVSDQQE--KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           S P + +QQE   S  K  R+ SLD+FRGL+V +M+LVD+ G   P ISH+PW G +LAD
Sbjct: 4   SRPLLKNQQELPASSGKAPRVVSLDVFRGLSVFMMMLVDYGGSFLPIISHSPWIGLHLAD 63

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           FVMP+FLFI GV++AL  K +  +  A +    R L L   G+ LQGG+ H    LTYGV
Sbjct: 64  FVMPWFLFIAGVSVALVYKEVESKVAAARNAACRGLYLFLLGVFLQGGYFHGITSLTYGV 123

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           D+  IR  G+LQRI++ YL+ +L EI  T+  +++ Q    FS      WHW +   +L 
Sbjct: 124 DLESIRWLGILQRISIGYLIAALCEIWLTRCTREEAQHTKSFS------WHWCIIFFLLS 177

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGK---VFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +Y+ L YG YVPDW F I    S+       V+ V C +R  L P CN+ G IDR VLGI
Sbjct: 178 LYMGLSYGLYVPDWDFKISAPSSSLPLSGSYVYKVNCSLRGDLGPACNSAGMIDRYVLGI 237

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           +H+Y  P +R  K C   S   G   + +PSWC APFEPEGLLSS+++ ++ IIG+ +GH
Sbjct: 238 HHLYTKPVYRNLKECNISS--SGQFPETSPSWCRAPFEPEGLLSSLTATVACIIGLQYGH 295

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           ++   + H  R   W  + F +L FG+ L F   IP+NK LYT+SY+ +TS +A ++F A
Sbjct: 296 ILARAQDHKTRTNGWFLLSFKILAFGIFLVFI-GIPVNKSLYTVSYMLITSASAGIIFCA 354

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           +Y LVDI   +     L W+G +++ +YV+    I    + G+Y+  P+N +V
Sbjct: 355 LYILVDIHGYRRLTCALEWMGKHSLSIYVLVISNILVIGLQGFYWKSPNNNIV 407


>gi|357467537|ref|XP_003604053.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355493101|gb|AES74304.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 421

 Score =  332 bits (852), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 171/405 (42%), Positives = 246/405 (60%), Gaps = 14/405 (3%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
            S  +  R+AS+D+FRGL+V LMI VD+ G  +P ISHAPWNG +LADFVMPFFLF+VG+
Sbjct: 12  NSETQFPRVASVDVFRGLSVFLMIFVDYGGSIFPIISHAPWNGLHLADFVMPFFLFLVGI 71

Query: 87  AIALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ++AL  K    R    +  K + R+ +L   GILLQGG+ H     TYGVDV+ IR  GV
Sbjct: 72  SLALVYKNKRSRPTQSSTWKPLLRSFQLFILGILLQGGYFHGIHSFTYGVDVQTIRFFGV 131

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRI++ Y++ +L +I    +  K       S F+ Y  HW +AA +L ++  LLYG +V
Sbjct: 132 LQRISIGYIVAALCQICLPTLPSKHT-----SFFKTYYSHWFVAAILLAIHSGLLYGLHV 186

Query: 205 PDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           PDWQF      S+      G V+ V C VR  L P CN+ G IDR +LG++H+Y  P +R
Sbjct: 187 PDWQFDASLSTSSLPPIQAGNVYTVNCSVRGDLGPACNSAGMIDRYILGLDHLYKKPVFR 246

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
             K C   S   G +   +PSWCHAPF+PEG+LSS+++ +S IIG+ +GH++ + + H  
Sbjct: 247 NLKECNMSS--TGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHILANLEDHKG 304

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNL 380
           RL QW+    + L  G  L     IPLNK LYT+SY+ ++S A+ L F A+Y LVD++  
Sbjct: 305 RLNQWLGFSVSFLALGWFLALI-GIPLNKSLYTVSYMLLSSAASGLTFMALYILVDVYGY 363

Query: 381 KYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCF 425
           +     L W+G +++ ++V+ +  +    I G+Y   P   +V F
Sbjct: 364 RRLTSVLEWMGKHSLSIFVLVSSNLAVIAIQGFYLTKPEYNIVRF 408


>gi|62701854|gb|AAX92927.1| hypothetical protein LOC_Os11g14080 [Oryza sativa Japonica Group]
 gi|77549602|gb|ABA92399.1| D8Ertd354e protein, putative [Oryza sativa Japonica Group]
 gi|125576749|gb|EAZ17971.1| hypothetical protein OsJ_33516 [Oryza sativa Japonica Group]
          Length = 447

 Score =  330 bits (845), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 179/402 (44%), Positives = 234/402 (58%), Gaps = 49/402 (12%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           +E+   K++R+A+LD FRGL + LMILVD AGG +  + H+PWNGC LADFVMPFFLFIV
Sbjct: 51  EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           GVAIA ALKR+P    AVKK+  RTLK+LFWG+LLQGG+SHAPD+L+YGVD++ IR CG+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL Y +V+L+E FT  V+      G ++IF  Y W WL     L +Y+   +  YV
Sbjct: 171 LQRIALVYFVVALIEAFTTKVRPTTVRSGPYAIFHAYRWQWLGGFVALFIYMVTTFSLYV 230

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           PDW +   N    + GK F V   V    +  C   G++D     + ++     W  +  
Sbjct: 231 PDWSYVYHNDGDVNDGKQFTVLLAVFPD-HVQCGVRGHLDPACNAVGYV-DRVVWGINHL 288

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
            TQ            P W  + F    ++ SV                            
Sbjct: 289 YTQ------------PVWIRSKFN---IIDSVRD-------------------------N 308

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           W  +      F        AIP+NKQLY+LSYVC T+GAA +V SA Y L+D+W L+ PF
Sbjct: 309 WDPLWTCSRSF-------QAIPINKQLYSLSYVCFTAGAAGVVLSAFYILIDVWGLRTPF 361

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL 426
           L L WIGMNAMLV+V+AA+ IF  F+NGWYY  P NTLV ++
Sbjct: 362 LFLEWIGMNAMLVFVLAAQAIFPAFVNGWYYDSPGNTLVSWI 403


>gi|255581844|ref|XP_002531722.1| conserved hypothetical protein [Ricinus communis]
 gi|223528625|gb|EEF30642.1| conserved hypothetical protein [Ricinus communis]
          Length = 397

 Score =  324 bits (831), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 165/379 (43%), Positives = 239/379 (63%), Gaps = 11/379 (2%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           M+LVD+ G  +P I+H+PWNG +LADFVMPFFLFI GV++AL  K++  R DA  K + R
Sbjct: 1   MMLVDYGGSIFPIIAHSPWNGLHLADFVMPFFLFIAGVSLALVYKKVTKRIDATWKAMLR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
            +KL F G+ LQGG+ H  + LTYGVD+  IR  G+LQRI++ Y++ +L EI+   +  +
Sbjct: 61  AVKLFFLGVFLQGGYFHGINSLTYGVDIERIRWFGILQRISIGYIVAALCEIW---LSRR 117

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVFN 224
            QS      F+ Y WHW++A  +  VYL LLYG YVPDWQF + N  S+    +   V+ 
Sbjct: 118 TQSQREIGFFKNYYWHWVVAFSLSAVYLGLLYGLYVPDWQFEMSNAASSALPINGSNVYM 177

Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
           V C VR  L P CN+ G IDR VLG +H+Y  P  R  K C   +   G + + +PSWCH
Sbjct: 178 VKCSVRGDLGPACNSAGMIDRYVLGFDHLYTKPVHRNLKECNMTN---GQVSESSPSWCH 234

Query: 285 APFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA 344
           APF+PEGLLSS+++ ++ IIG+  GHV+ H + H  R++ W     +LL+ G  L F   
Sbjct: 235 APFDPEGLLSSLTAAITCIIGLQCGHVLAHIQEHKGRIESWSLFSASLLLLGSVLAFI-G 293

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEG 404
           IP+NK LYT+SY+ +TS  + + F A+Y LVD++  +    PL W+G +++ ++++    
Sbjct: 294 IPVNKSLYTISYMLITSALSGITFCALYLLVDVYGYRRVTFPLEWMGKHSLSIFILVTSN 353

Query: 405 IFAGFINGWYYGDPHNTLV 423
           I    I G+Y+ +P   LV
Sbjct: 354 ILIIAIQGFYWKNPEKNLV 372


>gi|413918232|gb|AFW58164.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 423

 Score =  321 bits (822), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 6/335 (1%)

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
           R+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++YL
Sbjct: 93  RVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYL 152

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
           L +L EI+ +  +D D     + + + Y +   + A V + Y++LLYGTYVPDW++    
Sbjct: 153 LTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTSA 209

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
             S +  K   V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP  G
Sbjct: 210 PGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNG 267

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
           PL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++L
Sbjct: 268 PLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSML 327

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMN 393
           +    + F   + +NK LYTLSY   T+GAA L+FS IY LVDI+  + P + + W+GM+
Sbjct: 328 VLAFAMDFF-GLHMNKPLYTLSYTLGTAGAAGLLFSGIYTLVDIYGYRRPTVAMEWMGMH 386

Query: 394 AMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
           A+++YV+ A  +   FI+G+Y+ +P N L+ F+ I
Sbjct: 387 ALMIYVLIACNVLPIFIHGFYWKEPKNNLLKFIGI 421


>gi|395146531|gb|AFN53685.1| putative aquaporin PIP2-8 [Linum usitatissimum]
          Length = 694

 Score =  318 bits (814), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 167/426 (39%), Positives = 252/426 (59%), Gaps = 43/426 (10%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHL-----KTQRLASLDIFRGLAVALMILVDHAGG 57
           E + + +   PL    PD S +++   L     K +RLASLD FRGL + LM+LVD+ G 
Sbjct: 13  EDEMQKSFRPPL--PPPDFSGREDGQLLMLYRKKNKRLASLDAFRGLCIFLMMLVDYGGH 70

Query: 58  DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
            +P I+H+ WNG +LADFVMPFFLFIVGV+IAL  K+ P+R +A +K + +++KL   GI
Sbjct: 71  VFPTIAHSAWNGIHLADFVMPFFLFIVGVSIALVYKKAPNRVEATRKALLKSVKLFLVGI 130

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
           LLQ                         QRI++ Y++ ++ EI+    +  D       I
Sbjct: 131 LLQE------------------------QRISIGYIVGAICEIWLSIRRKGD-----VGI 161

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
            + Y WHW+ A  ++ VY  L YG YVPDWQF++      D   VF V C V+  + P C
Sbjct: 162 IKSYYWHWIAALAIVAVYARLSYGLYVPDWQFSL----PGDQHHVFTVKCSVKGDVGPAC 217

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
           N+ G IDR VLG++H+Y  P ++  K C   S  + P  +DAPSWCHAPF+PEGLLSS++
Sbjct: 218 NSAGMIDRYVLGLSHLYAKPVYKNLKVCNMSSNKQVP--EDAPSWCHAPFDPEGLLSSLT 275

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
           + ++ IIG+ FGHV+ H + H  RL+ W       L+ GL L      P+NK LY++SY+
Sbjct: 276 AAVTCIIGLQFGHVLAHIQDHKGRLENWSGFSVFFLVLGLFL-VRLGFPINKPLYSISYM 334

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGD 417
            +TS +A + F+A+Y LVD++  ++  LPL W+G +++ ++++ +  +    I G+Y+  
Sbjct: 335 LITSASAGITFAALYLLVDVYGQRWLTLPLEWMGKHSLTIFMVVSSNLAVIAIQGFYWKS 394

Query: 418 PHNTLV 423
           P N ++
Sbjct: 395 PENNIM 400


>gi|334188248|ref|NP_001190487.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008209|gb|AED95592.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 435

 Score =  313 bits (802), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 177/383 (46%), Positives = 235/383 (61%), Gaps = 44/383 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-----TKG------- 317
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH     +KG       
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQVYNEPS 340

Query: 318 -------------------------HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
                                    H  RL QW+   F LL+          + LNK LY
Sbjct: 341 ISIRPFFFILSETYLLLYVINFLQDHKKRLNQWILRSFCLLML-GLALNLFGMHLNKPLY 399

Query: 353 TLSYVCVTSGAAALVFSAIYALV 375
           TLSY+CVTSGA+  + SAIY +V
Sbjct: 400 TLSYMCVTSGASGFLLSAIYLMV 422


>gi|326510109|dbj|BAJ87271.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 304

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 146/294 (49%), Positives = 196/294 (66%), Gaps = 5/294 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMI+VD AG   P ++H+PW G  +ADFVMPFFLFIVGVA+ALA KR+PD+ DA +K   
Sbjct: 8   LMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAYKRVPDKLDATRKATL 67

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YL+ +L +I+ K   D
Sbjct: 68  RALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAYLVTALCQIWLKGDDD 127

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
            D  +    + + Y +  L    + + Y+ALLYGTYVPDW++ I      +  K F V C
Sbjct: 128 VDSGL---DLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRISGPGFTE--KTFTVRC 182

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR    P CNAVG IDRK+LGI H+Y  P + RS+ C+ DSP  GPL  DAPSWC APF
Sbjct: 183 GVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQNGPLPPDAPSWCQAPF 242

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
           +PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F +L+    + F
Sbjct: 243 DPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGMLVLAFAMDF 296


>gi|238481501|ref|NP_001154765.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008206|gb|AED95589.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 435

 Score =  301 bits (771), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 173/399 (43%), Positives = 251/399 (62%), Gaps = 16/399 (4%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 38  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 97

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 98  KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 157

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 158 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 214

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 215 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 274

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW---- 325
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K + ++ + +    
Sbjct: 275 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQVYNEPS 334

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI-YALVDIWNLKYPF 384
           +++  +   F  ++ FT  +  + +  T        G    V   I   LVD++  K   
Sbjct: 335 ISIRRSQKAFE-SMDFTFFLSSDVRSRTEP----LWGLGIFVIRDIPNGLVDVYGYKRAS 389

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           L L W+G++A+ +YV+ A  +    I+G+Y+ +P N L+
Sbjct: 390 LVLEWMGIHALPIYVLIACNLVFLIIHGFYWKNPINNLL 428


>gi|186530235|ref|NP_001119392.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008204|gb|AED95587.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 359

 Score =  288 bits (737), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 148/291 (50%), Positives = 201/291 (69%), Gaps = 6/291 (2%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K +++
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKVNIS 331


>gi|125533951|gb|EAY80499.1| hypothetical protein OsI_35679 [Oryza sativa Indica Group]
          Length = 444

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 166/405 (40%), Positives = 221/405 (54%), Gaps = 58/405 (14%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           +E+   K++R+A+LD FRGL + LMILVD AGG +  + H+PWNGC LADFVMPFFLFIV
Sbjct: 51  EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           GVAIA ALKR+P    AVKK+  RTLK+LFWG+LLQGG+SHAPD+L+YGVD++ IR CG+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKD---QSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
           LQ + + +          +   D+    +S    S  RL           L +Y+   + 
Sbjct: 171 LQNLLVLFDNAEDSFGVLRGCSDRGIHHKSSAYDSAVRL------GGFVALFIYMVTTFS 224

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            YVPDW +   N    + GK F V   V    +  C   G++D     + ++     W  
Sbjct: 225 LYVPDWSYIYHNDGDVNDGKQFTVLLAVFPD-HVQCGVRGHLDPACNAVGYV-DRVVWGI 282

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
           +   TQ            P W  + F    ++ SV                         
Sbjct: 283 NHLYTQ------------PVWIRSKFN---IVDSVRD----------------------- 304

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
              W  +      F        AIP+NKQLY+LSYVC T+GAA +V SA Y L+D+W L+
Sbjct: 305 --NWDPLWTRSRSF-------QAIPINKQLYSLSYVCFTAGAAGVVLSAFYILIDVWGLR 355

Query: 382 YPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL 426
            PFL L WIGMNAMLV+V+AA+ IF  F+NGWYY  P NTLV ++
Sbjct: 356 TPFLFLEWIGMNAMLVFVLAAQAIFPAFVNGWYYDSPGNTLVSWI 400


>gi|326432441|gb|EGD78011.1| hypothetical protein PTSG_09649 [Salpingoeca sp. ATCC 50818]
          Length = 1087

 Score =  280 bits (717), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 160/434 (36%), Positives = 236/434 (54%), Gaps = 52/434 (11%)

Query: 24   QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
            Q  +  L  +RL+SLD+FRG  VALM+ VD  G  +P I H+PWNG  LADFVMPFF FI
Sbjct: 627  QPAQRSLPKERLSSLDVFRGFTVALMVFVDETGAAFPPIDHSPWNGVRLADFVMPFFDFI 686

Query: 84   VGVAIALALKR--------IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
            VGV++AL+ K+         P    A++K   R LKL   G+L QGG     D + Y  D
Sbjct: 687  VGVSLALSFKKFDLEDATTTPRVWPALRKATIRFLKLFILGMLTQGGI----DIMNY--D 740

Query: 136  VRMIRLCGVLQRIALSYLLVSLVEIFT------KDVQDKDQSVG----RFSIFRLYCWHW 185
            +  IR+ G+LQR+A+ Y  V+L+EIF       ++  + D   G       +   Y WHW
Sbjct: 741  LAHIRIMGILQRVAVCYYAVALMEIFLPRNKKYRNYNETDTVTGWAVDVLHMLWRYKWHW 800

Query: 186  LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
              AAC+   +  ++YG  VPD                F   CG R  L P CNA  YIDR
Sbjct: 801  FTAACLFATHTGIMYGVNVPD---------------AFGEECG-RGVLTPACNAATYIDR 844

Query: 246  KVLGINHMY----------HHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLS 294
             VL + HMY          +   ++R   C+  SP +    +DAP+WC H PF+PEGL+S
Sbjct: 845  NVLTVEHMYFPANGGDKSGNDVTFQRLPECSTCSPGKCVPPEDAPAWCLHGPFDPEGLVS 904

Query: 295  SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
            S+++I++T+IG+H+GHV+   +   AR+  W   G   L+ G  LHF+ A  +N  LY++
Sbjct: 905  SLNAIIATVIGIHYGHVLRRVQSPKARIVHWTAFGVVQLVIGFALHFSGAFVMNTDLYSI 964

Query: 355  SYVCVTSGAAALVFSAIYALVDIWNL-KYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            SY  VT+G   ++ +  Y +VD  ++ ++ +    ++GMNA+++Y+ A   I    +  +
Sbjct: 965  SYTLVTAGTGGVLLALFYVIVDRLHVGEWAWSGCRYMGMNAIVMYLCAEGDIIPYVLAAF 1024

Query: 414  YYGDPHNTLVCFLF 427
            Y+  P N L   L+
Sbjct: 1025 YWNKPENNLANILW 1038


>gi|238481505|ref|NP_001154767.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008208|gb|AED95591.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 340

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 150/337 (44%), Positives = 217/337 (64%), Gaps = 7/337 (2%)

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           ++   +P +  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA
Sbjct: 1   MSFAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIA 60

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           ++YL+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++
Sbjct: 61  IAYLVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEY 117

Query: 210 TIINKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
            I+ +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+
Sbjct: 118 QILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCS 177

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
            + P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K H  RL QW+
Sbjct: 178 INYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWI 237

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
              F LL+          + LNK LYTLSY+CVTSGA+  + SAIY +VD++  K   L 
Sbjct: 238 LRSFCLLML-GLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVYGYKRASLV 296

Query: 387 LAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
           L W+G++A+ +YV+ A  +    I+G+Y+ +P N L+
Sbjct: 297 LEWMGIHALPIYVLIACNLVFLIIHGFYWKNPINNLL 333


>gi|395146473|gb|AFN53630.1| putative aquaporin PIP2-8 [Linum usitatissimum]
          Length = 692

 Score =  264 bits (674), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 131/331 (39%), Positives = 204/331 (61%), Gaps = 14/331 (4%)

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K++P+R +A +K   +++KL   GILLQGGF H    LTYGVD+  IRL G+LQRI++ Y
Sbjct: 96  KKVPNRVEATRKAFLKSVKLFLVGILLQGGFFHGLHSLTYGVDIERIRLLGILQRISIGY 155

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ ++ EI+   V+ K    G   I + Y  HW+ A  ++VVY  L YG YVPDWQF + 
Sbjct: 156 IVGAICEIWL-SVRRK----GDVGIIKSYYSHWVAALAIVVVYARLSYGLYVPDWQFAL- 209

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
                D   V+ V C V+  + P CN+ G +DR VLG++H+Y  P ++  K C   S  +
Sbjct: 210 ---PQDQHHVYTVKCSVKGDVGPACNSAGMMDRYVLGLSHLYAKPVYKNLKICNMSSNKQ 266

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA- 331
            P  +DAPSWCHAPF+PEGLLSS+++ ++ IIG+ FGHV+ H + H  RL+ W   GF+ 
Sbjct: 267 VP--EDAPSWCHAPFDPEGLLSSLTAAVTCIIGLQFGHVLAHVQDHKGRLENW--SGFSV 322

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIG 391
             +           P+NK LY++SY+ +TS +A + F+A+Y LVD++  ++  LP  W+G
Sbjct: 323 FFLLLGLFLVLLGFPINKPLYSISYMLITSASAGITFAALYLLVDVYGQRWLSLPFEWMG 382

Query: 392 MNAMLVYVMAAEGIFAGFINGWYYGDPHNTL 422
            +++ ++++ +  +    I G+Y+  P N +
Sbjct: 383 KHSLTIFMVVSSNLAVIAIQGFYWKSPENNV 413


>gi|167522597|ref|XP_001745636.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775985|gb|EDQ89607.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1047

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 171/463 (36%), Positives = 245/463 (52%), Gaps = 70/463 (15%)

Query: 4   IKAETTHHHPLIISEPDVSDQQEKSHLK-----------TQRLASLDIFRGLAVALMILV 52
           ++   ++  PL+ +  D S+ Q KS++             +RL++LD++RGL +A+MILV
Sbjct: 566 VRPRDSNRTPLLPASTD-SNIQSKSNIDLATDPVAPKPPRERLSALDVYRGLTIAVMILV 624

Query: 53  DHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKV 105
           D  G  +P I HAPWNG +LAD V+P F FIVGV+IALA KR          R  A KK 
Sbjct: 625 DETGAAFPPIDHAPWNGLHLADTVVPSFDFIVGVSIALAFKRFDLEAGAQGQRWTAFKKA 684

Query: 106 IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
             R LK LF GI                 D+  IR+ G+LQR+A+ Y  V+L+EIF   +
Sbjct: 685 TDRFLK-LFGGITFM------------NYDLTNIRIFGILQRVAVCYFAVALMEIFLPRL 731

Query: 166 QD---------KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS 216
                       D       +F  Y WHW  AA +L V+ ++LYG  VPD          
Sbjct: 732 TGALPADNGTWADWMRRTQHLFWRYRWHWFSAALLLAVHTSILYGVDVPD---------- 781

Query: 217 ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY-----HHPA-----WRRSKACT 266
                 F   CG R +L P CNA  YIDR +L + HMY       PA     ++R   C+
Sbjct: 782 -----AFGERCG-RGQLTPACNAATYIDRLILTVPHMYFPENGGDPAHADVTFKRLPECS 835

Query: 267 QDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
             SP       DAP+WC H PF+PEGL+SS+++I++TIIGVH+GHV+   K  + R+ QW
Sbjct: 836 SCSPGLCVAPADAPAWCLHGPFDPEGLVSSLTAIVTTIIGVHYGHVLRQIKSPMERIFQW 895

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW-NLKYPF 384
            +     L+ GL LHF+  IPLN  LY++S+V VT G   L+    Y +VD     ++ +
Sbjct: 896 SSFALLQLLLGLILHFS-GIPLNINLYSVSFVLVTGGMTGLLLVLCYLIVDYRPTARWLW 954

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLF 427
           LP  W+G NA+++++ A   +    ++ +Y  DP  +L   L+
Sbjct: 955 LPFMWLGTNAIVIFLCAEGDVIDWVLSCFYLEDPDRSLANILW 997


>gi|413937083|gb|AFW71634.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 317

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 118/233 (50%), Positives = 158/233 (67%), Gaps = 3/233 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 87  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 263

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K
Sbjct: 264 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTK 316


>gi|255642425|gb|ACU21476.1| unknown [Glycine max]
          Length = 326

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 123/289 (42%), Positives = 173/289 (59%), Gaps = 12/289 (4%)

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           +R P R  A  K   R L L   GILLQGG+ H    LT+GVD++ IR  G+LQRI++ Y
Sbjct: 41  QRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGILQRISIGY 100

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ +L EI+    + K+         + Y W W +A  +L +Y  LLYG YVPDWQF + 
Sbjct: 101 IVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQFDVS 155

Query: 213 NKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              S+      G ++ V C VR  L P CN+ G IDR +LG++H+Y  P +R  K C   
Sbjct: 156 ASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHLYRKPVYRNLKGCNMS 215

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
           +  +G +   +PSWCHAPF+PEG+LSS+++ +S IIG+ +GHV+ H + H  RL  W+  
Sbjct: 216 A--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLAHLQDHKGRLYNWMCF 273

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
             + L  GL L     IPLNK LYT+SY+ +TS A+ L F A+Y LVD+
Sbjct: 274 SLSFLALGLFLALI-GIPLNKSLYTVSYMLLTSAASGLTFIALYFLVDV 321


>gi|186530239|ref|NP_001119393.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008205|gb|AED95588.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 292

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 119/231 (51%), Positives = 157/231 (67%), Gaps = 3/231 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
            +D       F V CGVR    P CNAVG +DR  LGI H+Y  P + R+K
Sbjct: 221 KEDQGSTLTTFLVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 271


>gi|238481503|ref|NP_001154766.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008207|gb|AED95590.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 295

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 119/234 (50%), Positives = 157/234 (67%), Gaps = 6/234 (2%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274


>gi|413947252|gb|AFW79901.1| hypothetical protein ZEAMMB73_198786 [Zea mays]
          Length = 505

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 101/229 (44%), Positives = 139/229 (60%), Gaps = 5/229 (2%)

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIA++YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPD
Sbjct: 279 RIAIAYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGIYVPD 335

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
           W++ I    S+   K F V CGVR    P CNAVG +DR +LGI+H+Y  P + R+K C+
Sbjct: 336 WEYQIAGPGSSSTKKSFFVKCGVRGDTRPACNAVGMVDRTILGIDHLYRRPVYVRTKECS 395

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
            D    GPL  DAPSWC APF+PEGLLS V +I++ +IG+ F HVIIH + H  R+  W+
Sbjct: 396 IDYLENGPLPPDAPSWCQAPFDPEGLLSFVMAIVTCLIGLQFRHVIIHFEKHRGRIASWL 455

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
              F++L     + F   + +NK LYT+SY  + +GAA L+F  IY LV
Sbjct: 456 VPSFSMLALAFVMDFV-GMRMNKPLYTMSYT-LAAGAAGLLFPGIYVLV 502


>gi|413953638|gb|AFW86287.1| hypothetical protein ZEAMMB73_717084 [Zea mays]
          Length = 357

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 119/364 (32%), Positives = 175/364 (48%), Gaps = 91/364 (25%)

Query: 97  DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
           ++  A KK   R  KL   G++LQGG+ H   +LTYGVD+  IR  GVLQRIA+ Y + +
Sbjct: 3   NKTAATKKAAIRASKLFILGVILQGGYIHGRHKLTYGVDLDHIRWLGVLQRIAIGYFVAA 62

Query: 157 LVEIFTKDVQDKDQ-------------SVGRF-------------------SIFRLYCWH 184
           + EI+  +    D              ++G F                      + Y   
Sbjct: 63  MSEIWLVNNNLVDSPVPFVKKYFIEWIAIGYFVAAMSEIWLVNNNLVDSPVPFVKKYFIE 122

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNA 239
           W MA  + V+Y+AL++G YV +W+F I   +S     ++  +   + CGVR  L PPCNA
Sbjct: 123 WFMAIAITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNA 182

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VG +DR +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS     
Sbjct: 183 VGLVDRVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLS----- 237

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
                                  K   T+ + LL                          
Sbjct: 238 -----------------------KPLYTVNYMLL-------------------------- 248

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 419
           T G +  +   +Y +VD+ ++K PF+   W+GMNA++VYV+AA  +F   I G+Y+  P 
Sbjct: 249 TGGVSGFLLLLLYYIVDVIHIKKPFVLFQWMGMNALIVYVLAACELFPTLIQGFYWRSPE 308

Query: 420 NTLV 423
           N LV
Sbjct: 309 NNLV 312


>gi|242007028|ref|XP_002424344.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507744|gb|EEB11606.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 497

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 123/393 (31%), Positives = 201/393 (51%), Gaps = 61/393 (15%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
               S +K  R+ SLD FRGLA+ +MI V++ GGD+    H+PWNG  +ADFV P+F++I
Sbjct: 99  NDRYSRIKNSRIKSLDAFRGLAILIMIFVNYNGGDYSVFKHSPWNGITIADFVFPWFIWI 158

Query: 84  VGVAIALALKRIPDRADAVKKVIFRTLKLLFW----GILLQGGFSHAPDELTYGVDVRMI 139
           +G +  L++     RA + K++ FR LK  F+    GI+L  G   +            +
Sbjct: 159 MGASTVLSIDNNFRRAQSKKEIFFRILKRSFYLIALGIVLNSGHRDSKG---------FL 209

Query: 140 RLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLYCW-HWLMAACVLVVYLA 197
           R+CGVLQRI L+Y +++ +EIF  K + ++      FS   +  W  WL+   ++ +++ 
Sbjct: 210 RVCGVLQRIGLTYFIIASLEIFALKSLLNEHFGPWNFSRNIIKIWIQWLVPILLVAIHVI 269

Query: 198 LLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           + +  +VP            +++    N T G          A GYIDR ++  NHMYH 
Sbjct: 270 ITFTLHVPGCPLGYTGPGGLSNHSAFRNCTGG----------AAGYIDRLIITDNHMYHR 319

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            ++               L+   PS    PF+PEGLL +++S+    +GV    ++I+ +
Sbjct: 320 GSF---------------LKIFKPS---VPFDPEGLLGTLTSVFCAFLGVQSARILINHE 361

Query: 317 GHLARLKQWVTMGFALLIFGLTLHF-------TNAIPLNKQLYTLSYVCVTSGAAALVFS 369
              +++K W+   F  ++ GL   F       +  IP+NK L++LSYV  TS  A L+ +
Sbjct: 362 NSFSKIKSWI---FWAIVMGLISGFLCNWSQNSGIIPINKNLWSLSYVLATSSIAFLILT 418

Query: 370 AIYALVD---IWNLKYPFLPLAWIGMNAMLVYV 399
             Y L+D   +WN      PL + GMNA+ +Y+
Sbjct: 419 TFYTLIDFLKVWNG----FPLIYPGMNAIALYL 447


>gi|384249073|gb|EIE22555.1| hypothetical protein COCSUDRAFT_42235 [Coccomyxa subellipsoidea
           C-169]
          Length = 395

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 128/387 (33%), Positives = 194/387 (50%), Gaps = 58/387 (14%)

Query: 46  VALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL--KRIPDRADAVK 103
           +ALM+ V+HAG + P ++HA W+G +LAD VMP FL +VGV++AL+L  +    R   ++
Sbjct: 1   MALMLFVNHAGHEVPWVAHAAWDGVHLADLVMPCFLLLVGVSVALSLGPRASGPRRPLLR 60

Query: 104 KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT- 162
           KV+ RT KL   G+L+QGG            D+  +R CGVLQRIAL + LVSLV ++  
Sbjct: 61  KVLARTGKLAGLGLLIQGGVGAGAFP---AWDLSRLRYCGVLQRIALCFALVSLVVLYLP 117

Query: 163 -------KDVQDK-DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINK 214
                  + + D+ D+S    + FR Y   W++   + V +          +W    +  
Sbjct: 118 QTPSPRLQSLLDRGDESASLMAPFRFYALWWILGTALFVAF----------NWMALFLRP 167

Query: 215 DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGP 274
                       C  R  L   CN   Y+D ++LG +H+Y  P+ RR+     + P E  
Sbjct: 168 PG----------CLARPALTADCNVAAYVDARLLGRSHLYPWPSCRRA-----NPPCE-- 210

Query: 275 LRKDAPSWCHAPFEPEGLLSSVSSIL-STIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
                        +PEGL +++S  L ST +G+ FG V++  +GH ARL+ W      L 
Sbjct: 211 -----------YLDPEGLFATLSGALASTFLGLWFGAVLLTLRGHRARLRSWAYASVLLT 259

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK---YPFLPLAWI 390
             GL LH T A+P NK LY+ S V +T+G++      +Y   ++   K       P  W+
Sbjct: 260 ELGLALHVTGAVPFNKNLYSASSVLLTAGSSGAFLGLVYLFTEVAPTKIFERVAAPFMWL 319

Query: 391 GMNAMLVYVMAAEGIFAGFINGWYYGD 417
           GMN++ VY  A + I    I   Y+GD
Sbjct: 320 GMNSIAVY--AGDEILEKAIPWIYWGD 344


>gi|255635187|gb|ACU17949.1| unknown [Glycine max]
          Length = 217

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 93/200 (46%), Positives = 127/200 (63%), Gaps = 5/200 (2%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           +P + +  E +  +  R+ASLD+FRGL+V LMI VD+A   +P I+HAPWNG +LADFVM
Sbjct: 5   QPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGTHLADFVM 64

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           PFFLFI G+++AL  KR P R  A  K   R L L   GILLQGG+ H    LT+GVD++
Sbjct: 65  PFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQ 124

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            IR  G+LQRI++ Y++ +L EI+    + K+         + Y W W +A  +L +Y  
Sbjct: 125 RIRWLGILQRISIGYIVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSG 179

Query: 198 LLYGTYVPDWQFTIINKDSA 217
           LLYG YVPDWQF +    S+
Sbjct: 180 LLYGLYVPDWQFDVSASTSS 199


>gi|391346547|ref|XP_003747534.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Metaseiulus occidentalis]
          Length = 564

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 124/408 (30%), Positives = 203/408 (49%), Gaps = 51/408 (12%)

Query: 6   AETTHHHPLIISEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEI 62
           AET+   PL  S      +  +  +      R+ SLD FRG  + LMI V++ GG     
Sbjct: 145 AETSSLGPLEASSSTAGSRPPEDGIGKAGKPRIKSLDAFRGFCLFLMIFVNYGGGGLWLF 204

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGIL 118
            H PW+G   AD + P+F++I+GV++A++L+ +  +   + ++ F    R++KL   G++
Sbjct: 205 EHIPWDGLTFADLLFPWFVWIMGVSMAISLRSMRRKCVPLSEIFFKILSRSVKLFLLGLI 264

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFS 176
           L        + +    D+  +R+ GVLQR A+SY +V+ + +F     D  ++    +  
Sbjct: 265 L--------NSMGKNNDISKLRIPGVLQRFAVSYFVVASMHMFFSRATDAAETAKWAKIR 316

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNP 235
              LY   W+M   ++ +++ L +   VPD     +      + G  FN T G       
Sbjct: 317 DVALYWQEWVMMISLVAIHVLLTFLLDVPDCPKGYLGPGGLHENGTHFNCTGG------- 369

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
              A GYIDR VLG NHMY HP   +    +Q                  PF+PEG+L  
Sbjct: 370 ---AAGYIDRVVLGPNHMYGHPTTEKIYETSQ------------------PFDPEGVLGC 408

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQL 351
           ++SI  T +G+  G +++       RL +W+  G  L L+ G+   F+     IP+NK L
Sbjct: 409 LTSIFLTFLGLQAGKILLTFNNPGRRLSRWICWGVLLGLLAGILCGFSKEDGWIPINKNL 468

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           ++LSYV  T+G A L+ S  Y ++D+  L +  +P  + GMN++LVYV
Sbjct: 469 WSLSYVLCTAGLAFLLLSVFYLIIDVLAL-WSAVPFIYPGMNSILVYV 515


>gi|356536971|ref|XP_003537005.1| PREDICTED: uncharacterized protein LOC100781855 [Glycine max]
          Length = 357

 Score =  177 bits (449), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 89/162 (54%), Positives = 109/162 (67%), Gaps = 2/162 (1%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHA--GGDWPEI 62
           K E  H         D  +  + + +  +  +   I       LM+L D A  GG +P I
Sbjct: 20  KGELKHEIERTNGNGDSIEHDKDARITQEGESVQQIVEQEQPLLMVLEDDADAGGAYPRI 79

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
            H+PWNGC LADFVMPFFLF+VGVAIALALKRIP    AVK +I RTLKLLFWGILLQGG
Sbjct: 80  DHSPWNGCTLADFVMPFFLFVVGVAIALALKRIPKVKYAVKNIILRTLKLLFWGILLQGG 139

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
           +SHAPD+L+YGVD+R IR CG+LQRIAL Y  V+L+E +T +
Sbjct: 140 YSHAPDDLSYGVDMRFIRWCGILQRIALVYCAVALIETYTTN 181



 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 124/182 (68%), Gaps = 4/182 (2%)

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           G L++ A  +C          + +S+ LS  IG+H+GHV+IH KGH  RLKQW+ MGF L
Sbjct: 160 GILQRIALVYCAVALIETYTTNCISASLSGTIGIHYGHVLIHFKGHSERLKQWLLMGFLL 219

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           L  GL LHFT AIP+NKQLY+ SYVC T+GAA +VFS  Y L++   +KYPFL L WIGM
Sbjct: 220 LTLGLMLHFTEAIPINKQLYSFSYVCFTAGAAGIVFSVFYLLLNGILIKYPFLFLEWIGM 279

Query: 393 NAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL--FIISYILHSFLWELRKFLYVQFCN 450
           NAMLV+VMAA+GIFA F+NGWYY DP NT+V ++   + + + HS    L   LYV F  
Sbjct: 280 NAMLVFVMAAQGIFAAFVNGWYYKDPDNTIVYWIQNHVFTNVWHSE--RLGTLLYVIFAE 337

Query: 451 LS 452
           ++
Sbjct: 338 IT 339


>gi|312381520|gb|EFR27253.1| hypothetical protein AND_06166 [Anopheles darlingi]
          Length = 782

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 124/387 (32%), Positives = 191/387 (49%), Gaps = 58/387 (14%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +++  +RL SLD  RG+A+ LMI V+  GG +  I HA WNG ++AD V P+FLFI+GV 
Sbjct: 388 ANIARKRLQSLDTLRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVC 447

Query: 88  IALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I ++L+    R   +   V  +  R++ L   G+ L             G ++  +R+ G
Sbjct: 448 IPISLRGQLARNVSKRQIVSSITTRSISLFLIGLCLNS---------MNGPNMANLRIFG 498

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQS---VGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           VLQR  ++Y +VSLV +F    Q   Q         I RL    W++   ++V+YLA++ 
Sbjct: 499 VLQRFGVAYFVVSLVHLFCHREQIASQHRFVRANVDIIRL-VRQWIIVGLLVVIYLAVIL 557

Query: 201 GTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
               P                V+ N T G+           GYIDR +LG++H+Y HP  
Sbjct: 558 LIPAPGCPRGYFGPGGKHLFNVYPNCTGGI----------TGYIDRVLLGMSHLYQHPTA 607

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
           R          ++G            PF+PEG  + + +IL   +G+  G  I+   GH 
Sbjct: 608 RYV--------YDG-----------QPFDPEGPFACLPTILQVFLGLQCGSTILSFTGHR 648

Query: 320 ARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
            RL+++     AL L+ G+   F+     +P+NK L++LSYV  T+  A L+    Y  +
Sbjct: 649 QRLQRFAVWSVALGLVAGVLCGFSKNDGWLPVNKNLWSLSYVLATASLAYLLLLICYYTI 708

Query: 376 D---IWNLKYPFLPLAWIGMNAMLVYV 399
           D   +WN  YPFL   + GMNA+L+YV
Sbjct: 709 DVKRVWN-GYPFL---YAGMNAILLYV 731


>gi|413918234|gb|AFW58166.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 202

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 80/137 (58%), Positives = 105/137 (76%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 53  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172

Query: 153 LLVSLVEIFTKDVQDKD 169
           LL +L EI+ +  +D D
Sbjct: 173 LLTALCEIWIRGDEDVD 189


>gi|270004236|gb|EFA00684.1| hypothetical protein TcasGA2_TC003561 [Tribolium castaneum]
          Length = 569

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 123/392 (31%), Positives = 194/392 (49%), Gaps = 57/392 (14%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           S+ +EK     +RL SLD FRG+++ +MI V++  G +P + HA WNG +LAD V P+F+
Sbjct: 167 SETKEKKPEGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFM 226

Query: 82  FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           +I+G  + ++L    K+     D    V+ R++KL   G+ L             G  + 
Sbjct: 227 WIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNA-----------GPYLE 275

Query: 138 MIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFRLYCWH-WLMAACVLVV 194
            +R+ GVLQR  + YL+V+ + +F   ++  +    +G+F    L  W  W++   +  V
Sbjct: 276 CMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFV 335

Query: 195 ---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              +L LL     P  +  +      + GK FN T G          A GYID  +LG N
Sbjct: 336 HCMFLFLLADEGCP--RGYLGPGGLHENGKHFNCTGG----------ATGYIDAVILG-N 382

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y  P  +     TQ                   F+PEG+L  ++SI+   IGV  G  
Sbjct: 383 HRYQKPTSKEIYLGTQ------------------AFDPEGILGCLTSIVHVFIGVQAGIT 424

Query: 312 IIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALV 367
           ++  K H ARL +W++    A ++ G    F+     IP+NK L+++S+V VTS  A L+
Sbjct: 425 LLVYKEHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSISFVLVTSCFAFLL 484

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S  Y L+D+ N  +   P  + GMNA+L+YV
Sbjct: 485 LSICYVLIDVKNW-WSGKPFLFAGMNAILLYV 515


>gi|91079154|ref|XP_966977.1| PREDICTED: similar to CG6903 CG6903-PA [Tribolium castaneum]
          Length = 533

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 123/392 (31%), Positives = 194/392 (49%), Gaps = 57/392 (14%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           S+ +EK     +RL SLD FRG+++ +MI V++  G +P + HA WNG +LAD V P+F+
Sbjct: 167 SETKEKKPEGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFM 226

Query: 82  FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           +I+G  + ++L    K+     D    V+ R++KL   G+ L             G  + 
Sbjct: 227 WIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNA-----------GPYLE 275

Query: 138 MIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFRLYCWH-WLMAACVLVV 194
            +R+ GVLQR  + YL+V+ + +F   ++  +    +G+F    L  W  W++   +  V
Sbjct: 276 CMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFV 335

Query: 195 ---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              +L LL     P  +  +      + GK FN T G          A GYID  +LG N
Sbjct: 336 HCMFLFLLADEGCP--RGYLGPGGLHENGKHFNCTGG----------ATGYIDAVILG-N 382

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y  P  +     TQ                   F+PEG+L  ++SI+   IGV  G  
Sbjct: 383 HRYQKPTSKEIYLGTQ------------------AFDPEGILGCLTSIVHVFIGVQAGIT 424

Query: 312 IIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALV 367
           ++  K H ARL +W++    A ++ G    F+     IP+NK L+++S+V VTS  A L+
Sbjct: 425 LLVYKEHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSISFVLVTSCFAFLL 484

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S  Y L+D+ N  +   P  + GMNA+L+YV
Sbjct: 485 LSICYVLIDVKNW-WSGKPFLFAGMNAILLYV 515


>gi|301608954|ref|XP_002934053.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Xenopus (Silurana) tropicalis]
          Length = 633

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 125/401 (31%), Positives = 197/401 (49%), Gaps = 53/401 (13%)

Query: 17  SEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           +  D+S Q+  S       QRL SLD FRGLA+ +M+ V++ GG +    H  WNG  +A
Sbjct: 217 NRADISSQETYSRAWNPSVQRLRSLDTFRGLALTIMVFVNYGGGGYWFFKHQSWNGLTVA 276

Query: 74  DFVMPFFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPD 128
           D V P+F+FI+G +I L+L  +      R + + KV++R+++L   G+ ++   +   P 
Sbjct: 277 DLVFPWFVFIMGTSIYLSLNSMLSKGSSRWNLLGKVLWRSVQLFLIGLFVINVNYCRGP- 335

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK-DQSVGRFSIFRLYC-W-H 184
                +    IR+ GVLQR++L+YL VS +E IF+K   D   QS   F +  +   W  
Sbjct: 336 -----LSFSEIRIMGVLQRLSLTYLAVSALELIFSKPTPDALTQSRTCFLLQDVLSHWPK 390

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYI 243
           W++   +  V+L L     VPD     +      D+GK  N T G          A GYI
Sbjct: 391 WIVILALEAVWLCLTLLLQVPDCPLGYLGPGGIGDFGKFPNCTGG----------AAGYI 440

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR +LG  H+Y HP        T +  ++  +          P++PEGLL +++ ++   
Sbjct: 441 DRMILGQGHIYQHP--------TSNVIYKSTM----------PYDPEGLLGTINCVVMAF 482

Query: 304 IGVHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
            G+  G +++  K      L R   W + MG    +          IP+NK L+++SY+ 
Sbjct: 483 FGLQAGIILVLYKNQHKYVLVRFFSWAIIMGVLSAVLTKCSTNEGIIPVNKNLWSISYIT 542

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
             S  A  +   IY LVD+  L +   P  + GMN++LVYV
Sbjct: 543 TLSCFAYFLLMLIYFLVDVKKL-WSGAPFYYPGMNSILVYV 582


>gi|443694948|gb|ELT95966.1| hypothetical protein CAPTEDRAFT_92095 [Capitella teleta]
          Length = 431

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 203/430 (47%), Gaps = 69/430 (16%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           +S  K ++T  +     E D         +K +RL SLD FRGL + LMI V++ GG + 
Sbjct: 7   ISSAKTDSTRRNS---EEKDEGKLITPKEVKKERLRSLDAFRGLNILLMIFVNYGGGGYW 63

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWG 116
             SHA WNG  + D + P+F+FI+G ++ L +    K+  D  + +  +I+R++KL   G
Sbjct: 64  YFSHAVWNGLYITDLIFPWFIFIMGTSLGLGISSLVKKEVDPVEGLWGIIWRSVKLFAVG 123

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           I+     S+         D+  IR+ GVLQR+A+ Y + ++V    + +Q   +S G  S
Sbjct: 124 IMYNTKSSN---------DLENIRMTGVLQRLAMVYFITAIVHYAGESLQCCMRSRGTVS 174

Query: 177 IFR-------LYCWHWLMAACVLVVYLALLYGTYVPDWQFTII-----NKDSADYGKVFN 224
            +R        Y   W+    ++ +Y    Y   VP  +   +     ++D A  G    
Sbjct: 175 RWRHILSDLAPYFGEWITMLVIIGIYCYFTYWFAVPGCEAGYVGPGGLHRDGAHAG---- 230

Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
             C   A L        YID KV  + H+Y  P  R       DS               
Sbjct: 231 --CTGGAAL--------YIDLKVYTMRHIYQWPDIR--TIYQTDS--------------- 263

Query: 285 APFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTL 339
             F+PEGLL +++SI    +G+  G +++  KGH  RL +W     +T G   L+    L
Sbjct: 264 -AFDPEGLLGTLTSIFLCFLGLQAGKILVCHKGHRERLVRWLIWAIITGGIGTLLCKAQL 322

Query: 340 HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
                +P+NK + ++S+V V++G   ++ S +Y L+D W L +   P  + GMN++++Y+
Sbjct: 323 E-DGWVPINKNIMSISFVLVSAGTGFIMLSVMYILIDSWKL-WNGQPFTYAGMNSIVLYM 380

Query: 400 MAAEGIFAGF 409
                IF G+
Sbjct: 381 --CHSIFQGY 388


>gi|332028000|gb|EGI68051.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
           echinatior]
          Length = 557

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 129/450 (28%), Positives = 200/450 (44%), Gaps = 56/450 (12%)

Query: 4   IKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
           I A  +  H +      V D+     L  +R+ ++D FRG +   MI V+   G +  + 
Sbjct: 137 ISAGRSLWHMITKCVTGVKDKSNNKKLAKRRVKAIDTFRGASTLFMIFVNDGSGSYTVLE 196

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
           H  W+G  L D V P F++I+GV I +AL     R  +  ++ +  LK  F   L+    
Sbjct: 197 HTIWDGMLLGDIVFPCFMWIMGVCIPIALSAQLKRGVSKLQISYSILKRSFLLFLIGVSL 256

Query: 124 SHAPDELTYGVD--VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD-QSVGRFSIFRL 180
           +      T G D  V  IR+ GVLQR  ++YL+VSLV +     Q K  ++     I R 
Sbjct: 257 N------TLGTDSQVENIRIFGVLQRFGVTYLVVSLVYLCFPSQQSKILRNTSPTWIMRK 310

Query: 181 Y------CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKL 233
                    HW +    ++V+ AL +   VP      +      + GK FN T G     
Sbjct: 311 MQDILSLLPHWFVMLIFVIVHCALTFCLPVPGCPTGYLGPGGMHEDGKYFNCTGG----- 365

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
                A GYID+ VL +NH+Y +P  +                         PF+PEG+L
Sbjct: 366 -----ATGYIDKTVLTLNHIYQYPTIKSVYG-------------------SGPFDPEGIL 401

Query: 294 SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
             +++I    +GVH G +++  K    R+ +W+         G   HFTN IP+NK L++
Sbjct: 402 GCLTAIFQVFLGVHAGTILMLYKDWKDRVMRWLLWAVFYACLGCAFHFTNTIPVNKNLWS 461

Query: 354 LSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
           LS+V VT+  +    S  Y L+D+  + +   P    GMNA+L+YV      +  F   W
Sbjct: 462 LSFVFVTTSFSLAFLSGCYLLIDVAQV-WRGGPFRIPGMNALLLYV-GHSVCYEIFPFHW 519

Query: 414 YYGDPHNTLVCFL---------FIISYILH 434
             G   +  +C +          II+YI+H
Sbjct: 520 RIGAMDSRALCLIESIWVVILWAIIAYIMH 549


>gi|443685781|gb|ELT89271.1| hypothetical protein CAPTEDRAFT_227545 [Capitella teleta]
          Length = 605

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 127/416 (30%), Positives = 197/416 (47%), Gaps = 56/416 (13%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWP 60
           E   +     P + +E    D +  +  KT  +RL SLD FRG+++ +MI V++ GG + 
Sbjct: 178 EDLGDPEESDPEMQTESATDDAETTAVNKTHKERLRSLDAFRGMSLTIMIFVNYGGGGYW 237

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
              H+ WNG  LAD V P+F +I+G A+AL+    ++R   +     K+I RT  L   G
Sbjct: 238 FFDHSYWNGLTLADLVFPWFTWIIGTALALSIQGQMRRGKTKFSIAAKIIRRTCVLFALG 297

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRF 175
           I+L  G    P      VDV+ +R+ GVLQR+A+SYL+V+L+  IF K   +KD    R 
Sbjct: 298 IVLGSGGGSEP------VDVQTLRIPGVLQRLAISYLVVALLHLIFAK--ANKDHQPSRL 349

Query: 176 SIFRLYCWHWLMAACVLVV---YLALLYGTYVPDWQFTIINKDSA-----DYGKVFNVTC 227
            + R    HW     VLV+   +L L +   + D + T            + GK  N T 
Sbjct: 350 DMVRDITDHWPQWGIVLVMVACHLGLTFLLPISDVEGTCPTGYLGPGGLHEGGKYENCTG 409

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           G          A   IDR      H+Y  P  +      +                  P 
Sbjct: 410 G----------AAAVIDRWFFSRQHVYQTPTCKEVYKTVE------------------PH 441

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA-- 344
           +PEG+L +++SI    +G+  G ++   K    R+++W+  G  L LI GL   F     
Sbjct: 442 DPEGILGTLTSIFLCFLGLQAGVILTTFKQKSPRMRRWIVWGIILGLIAGLLCGFKQDGG 501

Query: 345 -IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            IP+NK L++LS+V   +  A ++ +  Y L+D+  L +   P  + GMN++ VYV
Sbjct: 502 WIPVNKNLWSLSFVLGLASMAFVLLAVFYLLIDVHGL-WSGAPFLYPGMNSIAVYV 556


>gi|307178470|gb|EFN67159.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
           floridanus]
          Length = 512

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 115/392 (29%), Positives = 187/392 (47%), Gaps = 44/392 (11%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           +E + +    +   K +R+ ++D FRG+    MI V+   G +  + HA WNG  L D V
Sbjct: 105 TEEERTPNNNEKATKHRRVKAIDTFRGVCTLFMIFVNDGSGSYTTLGHATWNGMLLGDLV 164

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
            P F++I+GV + +AL     R  +  ++ F   K  F   L+      A + L     +
Sbjct: 165 FPCFMWIMGVCVPIALSAQLKRGLSKLEISFSIFKRSFLLFLI----GIALNTLGTNAQL 220

Query: 137 RMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRF-------SIFRLYCWHWLMA 188
             IR+ GVLQR  ++YL+VSL+ + FT       Q++ +         I  L   HW + 
Sbjct: 221 ENIRIFGVLQRFGITYLIVSLLYLCFTPQQPKVAQNLSQTWMTHKMQDILSLLP-HWCIM 279

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
             +++V+ A+ +   +P      +      + GK FN T G          A GYIDR +
Sbjct: 280 LTLVMVHCAVTFCLPIPGCPTGYLGPGGRHEDGKYFNCTGG----------ATGYIDRIL 329

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           L ++H+Y  P        T DS +              PF+PEG+L  ++SI    +GVH
Sbjct: 330 LTLSHIYQWP--------TIDSIYGS-----------GPFDPEGILGCLTSIFQVFLGVH 370

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G +++  KG   R+ +W+         G   HFTN IP+NK L++LS+V V++  A   
Sbjct: 371 TGVILMMYKGWKERIIRWLVWAVFYGCLGCIFHFTNIIPINKNLWSLSFVLVSTCFALAF 430

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S  Y L+D+  + +   P    GMNA++++V
Sbjct: 431 LSGCYLLIDVVRI-WRGGPFRIPGMNALMLFV 461


>gi|167538367|ref|XP_001750848.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770669|gb|EDQ84352.1| predicted protein [Monosiga brevicollis MX1]
          Length = 779

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 126/472 (26%), Positives = 208/472 (44%), Gaps = 88/472 (18%)

Query: 6   AETT----HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
            ETT     H  ++ S+  +   Q+     +QRL SLD FRG A+ +MI V+  GG +  
Sbjct: 309 GETTGLLLQHATVMPSDAGMHAIQDMKR-SSQRLRSLDSFRGFALTIMIFVNFNGGFYWF 367

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV---IFRTLKLLF-WGI 117
            +H+ WNG  +AD V P+F++I+G ++A+A   +  R      +   IFR + +LF +GI
Sbjct: 368 FNHSAWNGLTVADLVFPWFIWIMGTSMAIAFNSLLKRQTPTTTILYKIFRRMLILFAFGI 427

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR--- 174
            + G F           D+R  R+ GVLQR A+SYL+V+LV ++   ++    SV     
Sbjct: 428 FIIGNFH----------DLRNGRIPGVLQRFAVSYLVVALVMLYAPKMESWCASVSTSDS 477

Query: 175 ------------------------------FSIFRL-------YCWHWLMAACVLVVYLA 197
                                            F L       Y W W+    +++++  
Sbjct: 478 PTPALVRGIAKPGSGHQLDVAADIAEMKPWVRTFLLHTRDLTPYIWEWVAMFVIIIIHTC 537

Query: 198 LLYGTYVPDWQFTIINKDSA--DYGKVFNVTCGVRAKLNPPCN--AVGYIDRKVLGINHM 253
           + +   VP      I    A  +YG+       V  +    C   A GYIDR+V G  H+
Sbjct: 538 ITFLLPVPGCPTGYIGPGGALAEYGQFAPPEGEVCGESTFCCEGGASGYIDRQVFGWRHI 597

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  P                      P +   P++PEGLL S++SI+   +G+  G +I+
Sbjct: 598 YDQP-------------------TSQPIYETGPYDPEGLLGSLTSIVMCFLGLQSGKIIV 638

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           H K H  R + W+     L +    L         IP++K L++LS++ + +  A  + +
Sbjct: 639 HYKSHAQRSRHWLMWALVLGVIATGLCGASQNNGVIPVSKNLWSLSFIILLASFAFFLLT 698

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNT 421
             Y ++D+W   +   P  ++GMN++ +Y+   E   A F   W + D  N 
Sbjct: 699 VFYWVIDVWQF-WDGAPFRYVGMNSIFIYIF-HETFGANFPLSWAWMDGDNN 748


>gi|348529394|ref|XP_003452198.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oreochromis niloticus]
          Length = 600

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 123/402 (30%), Positives = 196/402 (48%), Gaps = 61/402 (15%)

Query: 19  PDVSDQQEKSHL-KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           P V+D      L  ++RL SLD FRG+++ +M+ V++ GG +    H  WNG  +AD V 
Sbjct: 188 PPVTDNILPPPLTSSKRLRSLDTFRGISLVIMVFVNYGGGRYWFFRHESWNGLTVADLVF 247

Query: 78  PFFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P+F+FI+G +IAL++  +      R   ++K ++R+L+L   G+L+       P+     
Sbjct: 248 PWFVFIMGTSIALSINALLRAGATRCSLLRKAVWRSLQLFIIGVLVIN-----PNYCQGA 302

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTK----DVQDKDQSVGRFSIFRLYCWHWLMAA 189
           +    +R+ GVLQR+A SYL+V+ +++       DV   D        F LY   WL   
Sbjct: 303 LAWENLRIPGVLQRLAWSYLVVACLDLLVARGQLDVITVDAWWSPAIDFLLYWPAWLCVI 362

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            + V++L L +   VPD     +      D G   N T G          A G+IDR +L
Sbjct: 363 LLEVLWLFLTFLLPVPDCPTGYLGPGGIGDMGLYVNCTGG----------AAGFIDRLLL 412

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  HMY +P+ R   A                     P++PEG+L S++SIL   +G+  
Sbjct: 413 GEKHMYQNPSSRVIYA------------------TRIPYDPEGVLGSINSILMAFLGLQA 454

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-----------IPLNKQLYTLSYV 357
           G +I+H +     +          LI+GL+L   +A           IP+NK L++LSYV
Sbjct: 455 GKIILHYRERPKSIMS------RFLIWGLSLGVISAFLTKCSTDRGFIPVNKNLWSLSYV 508

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
              +  A ++   +Y +VD+ N  +   P  + GMN++LVYV
Sbjct: 509 TTLACFAFVLLMLVYYIVDV-NKWWSGAPFYYPGMNSILVYV 549


>gi|242022263|ref|XP_002431560.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212516863|gb|EEB18822.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 607

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/394 (29%), Positives = 183/394 (46%), Gaps = 54/394 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D    Q  S     R+ S+D FRGLAV LMI V+   G +  + HA WNG  +ADFV P+
Sbjct: 185 DDRTTQASSKPARHRIKSIDTFRGLAVVLMIFVNDGAGHYWFLEHATWNGILVADFVFPW 244

Query: 80  FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+++G+ I ++    LKR   R   +  VI R + L   G+LL        + +  G D
Sbjct: 245 FLWVMGLCIPISIRTQLKRNVSRWKILGHVIKRGILLFGLGVLL--------NTVGIGSD 296

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEI-FT-KDVQDKDQSVGR-----FSIFRLYCWHWLMA 188
           +  IR+ GVLQR ++ YL+++++ + FT + + ++++  G      F    +    W++ 
Sbjct: 297 LETIRIPGVLQRFSIVYLIIAILGVCFTPRSISNENRFPGSSFRETFQDIIIIFPQWIVI 356

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             ++  Y   ++ + VP      +       G  FN   G            GY+D+ +L
Sbjct: 357 LSIVAAYCYFVFFSPVPGCPSGYLGPGGIQDGGRFNECTG---------GMTGYVDKVLL 407

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G+ H+Y +P                     +  +   PF+PEGLL  + SI     GV  
Sbjct: 408 GVEHIYKNPT-------------------SSKVYKSGPFDPEGLLGVMPSIFQAFFGVQA 448

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  +++     A+L +W T G    I  L L     +P+NK L++LSYV  T+ +A L+ 
Sbjct: 449 GATLLYHPEWKAKLIRWFTWGILNGILALLLSLPGIVPINKNLWSLSYVFTTTSSAFLIL 508

Query: 369 SAIYALVD---IWNLKYPFLPLAWIGMNAMLVYV 399
             IY   D    WN     +P    GMN  ++YV
Sbjct: 509 CVIYFFQDHLRFWN----GVPFKGPGMNPTILYV 538


>gi|449500329|ref|XP_004174928.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Taeniopygia guttata]
          Length = 789

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 132/425 (31%), Positives = 203/425 (47%), Gaps = 65/425 (15%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +IALAL
Sbjct: 399 QRLRSLDTFRGLSLVIMVFVNYGGGKYWFFKHVSWNGLTVADLVFPWFVFIMGTSIALAL 458

Query: 93  KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +      +   ++K+I+R+  L+  GI++       P+     +    +R+ GVLQR+
Sbjct: 459 GSMLRWGSSKWKVLRKIIWRSFVLILLGIIVVN-----PNYCLGPLSWDNLRIPGVLQRL 513

Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
             +YL+V+ +E +FT+     D+          Y   W+    +  ++L L +   VPD 
Sbjct: 514 GFTYLVVAALELLFTR----ADRRFPALQDILPYWPQWIFILVLETIWLCLTFLLPVPDC 569

Query: 208 QFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
               +      D+GK  N T G          A GYIDR +LG  HMY HP    S   T
Sbjct: 570 PRGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLILGEKHMYQHP----SSGVT 615

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARL 322
             S                P++PEG+L +++SI+   +G+  G + +  K H    ++R 
Sbjct: 616 YQSTM--------------PYDPEGILGTINSIVMAFLGLQAGKITLFYKDHPKQIMSRF 661

Query: 323 KQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--WN 379
             W + MG    I          IP+NK L+++SYV   S  A ++   IY LVD+  W 
Sbjct: 662 IIWGIVMGVISAILTKCSKEEGFIPVNKNLWSISYVTTMSCFAFVLLLLIYYLVDVKKWW 721

Query: 380 LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYGDPHN---------TLVCFLFII 429
              PFL   + GMN++LVY+     +FA +    W   D  +         T      II
Sbjct: 722 SGAPFL---YPGMNSILVYI--GHQVFANYFPFKWKMQDSQSHAEHLIQNLTATTLWVII 776

Query: 430 SYILH 434
           SYIL+
Sbjct: 777 SYILY 781


>gi|198434539|ref|XP_002120178.1| PREDICTED: similar to heparan-alpha-glucosaminide
           N-acetyltransferase [Ciona intestinalis]
          Length = 624

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 121/417 (29%), Positives = 198/417 (47%), Gaps = 56/417 (13%)

Query: 7   ETTHHHPLIISEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
           ET  H  L  +EP+   +   + L   K++R+ S+D FRGL + +M+ V+  GGD+    
Sbjct: 189 ETQIHEDLGNTEPNSVQEANPTPLVREKSERIKSIDTFRGLCLVVMVFVNFRGGDYWFFH 248

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFWGIL 118
           H+PW+G  +AD V P+F+FI+GV I L++     K +P+   A  K+I RT+ L   G+ 
Sbjct: 249 HSPWHGLTVADLVFPWFMFIMGVNITLSINSLITKNVPNSKIAY-KLIRRTVLLFGLGMF 307

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           +          + +       R+ GVLQR A++Y L  +++        + ++  + +  
Sbjct: 308 V----------VNHSTSWAAFRVPGVLQRFAIAYFLPFVLQWAFHLTPIEIETRAKTNEG 357

Query: 179 RLYCWH-----------WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY-GKVFNVT 226
            L  WH           WL+   +  ++L L +   +P      +     D  GK  N T
Sbjct: 358 ELKWWHWCKDVVPYWLQWLIVLAMEALWLFLTFLLPIPGCPTGYLGPGGLDNDGKYINET 417

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           C           A GYIDR + G  H+Y HP  +     T  S                P
Sbjct: 418 C--------VGGAAGYIDRVIFGEAHIYGHPTCKNVYYPTYTSD------------QRVP 457

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-- 344
           ++PEGLL S++S +  I+G   G + ++ K  L R  +W+   F L +  + L   +A  
Sbjct: 458 YDPEGLLGSINSCIIVILGCQAGKIFLYYKHPLDRAMRWILWCFFLGVISIILCKASANG 517

Query: 345 --IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
             IP+NK L+T ++V   +  A  +   IY LVD+  + +   PL ++GMN++LVYV
Sbjct: 518 GWIPVNKNLWTTTFVTTLACMAFFIIPVIYYLVDVKKV-WTGRPLDFVGMNSILVYV 573


>gi|417411833|gb|JAA52338.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
           [Desmodus rotundus]
          Length = 595

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 190/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRGLA+ LM+ V++ GG +    HA WNG  +ADFV P+F+FI+G +I L++ 
Sbjct: 199 RLRCVDTFRGLALILMVFVNYGGGQYWYFKHASWNGLTVADFVFPWFVFIMGSSIFLSMS 258

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +  R  +    + KV +R+  L+  G+++       P+     +    +RL GVLQR+ 
Sbjct: 259 SVLQRGCSKFRLLGKVAWRSFLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLG 313

Query: 150 LSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E+ F K V ++    GR S  +  +  W  WL    +  ++LAL +   VP
Sbjct: 314 VTYFVVAVLELLFAKPVPERGAWEGRCSSLQDIMSSWPQWLFILMLESIWLALTFFLPVP 373

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 374 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS------ 417

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 418 ------------STVLYHTRVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILI 465

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W      L+  GLT    N   IP+NK L+++SYV   S  A  +   +Y +VD+ 
Sbjct: 466 RFTAWCCF-LGLISVGLTKISENEGFIPVNKNLWSISYVTTLSSFAFFILLILYPIVDVK 524

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            + +   P  + GMN++LVYV
Sbjct: 525 GV-WTGTPFFYPGMNSILVYV 544


>gi|307201549|gb|EFN81312.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
           saltator]
          Length = 564

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 121/432 (28%), Positives = 199/432 (46%), Gaps = 56/432 (12%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
            ++        +R+ ++D FRG +   MI V+   G +  + H  WNG    D V P F+
Sbjct: 162 KEEATNKEPTKRRVKAIDAFRGASTLFMIFVNDGSGSYSVLGHTTWNGMLPGDLVFPCFM 221

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           +I+GV + +AL     R     ++ F  LK  F  +L   G S   + L     +  IR+
Sbjct: 222 WIMGVCVPIALSAQLRRGIPKLEIAFTVLKRSF--LLFLIGVSL--NTLGTNAQLEKIRV 277

Query: 142 CGVLQRIALSYLLVSLVEIFTK---DVQDKDQSVGRFSI----FRLYCWHWLMAACVLVV 194
            GVLQR  ++YL+VS++ +  +    +QD+D S  R +      ++   +W     +++V
Sbjct: 278 FGVLQRFGVTYLVVSVMYLCLEPSLQLQDQDSSRNRVTRVLRDMQVLLPYWSFMLILVMV 337

Query: 195 YLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           +  L +G  VP+     +      + G   N T G          A GYIDR VL INH+
Sbjct: 338 HCGLTFGLAVPNCPTGYLGPGGTHEDGYYMNCTGG----------AAGYIDRVVLTINHI 387

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           +  P                     A  +   PF+PEG+L  +++     +GVH G +++
Sbjct: 388 FAGPT-------------------IASVYGSGPFDPEGILGCLTATFQVYLGVHAGVILM 428

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             K    R+ +W++      + G  LHF N IP+NK L++LS+V V++  A    S  Y 
Sbjct: 429 MYKNWKERVVRWLSWAVLYGVLGCILHFCNVIPVNKNLWSLSFVFVSTSFALAFLSGCYL 488

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYV--MAAEGIFAGFINGWYYGDPHNTLVCFL----- 426
           L+D+  + +   P    GMNA+++YV  M    IF      W  G+  +  +CF+     
Sbjct: 489 LIDVVRV-WQGGPFRIAGMNALVLYVGHMMCYQIFPFH---WRIGEMDSRALCFIESIWV 544

Query: 427 ----FIISYILH 434
                II+YI+H
Sbjct: 545 VVLWTIIAYIMH 556


>gi|345781561|ref|XP_539948.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Canis
           lupus familiaris]
          Length = 638

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 122/393 (31%), Positives = 193/393 (49%), Gaps = 52/393 (13%)

Query: 24  QQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           QQE  H  +   RL S+D FRGLA+ LM+ V++ GG +    H+ WNG  +AD V P+F+
Sbjct: 230 QQEAWHPPSALPRLRSIDTFRGLALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFV 289

Query: 82  FIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G +I L++  +  R  +    + K+ +R+  L   G+++       P+     +   
Sbjct: 290 FIMGSSIFLSMTSMLQRGCSKFRLLGKIAWRSFLLFCIGVVIVN-----PNYCLGPLSWD 344

Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLV 193
            +R+ GVLQR+ ++Y +V+++E IF K V +   S  R    R  +  W  WL    +  
Sbjct: 345 KVRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASERRCFSLRDIILSWPQWLFILLLES 404

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++L L +   VP      +      D GK  N T G          A GYIDR +LG +H
Sbjct: 405 IWLGLTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDH 454

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP    S A    +                P++PEG+L ++SSI+   +G+  G ++
Sbjct: 455 IYQHP----SSAVLYHT--------------KVPYDPEGILGTISSIVMAFLGIQAGKIL 496

Query: 313 IH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           ++    TK  L R   W      L+   LT    N   IP+NK L+++SYV   S  A  
Sbjct: 497 LYYKDQTKDILIRFTAWCCF-LGLISVALTKISENEGFIPINKNLWSISYVTTLSSFAFF 555

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 556 ILLILYPIVDVKGL-WTGTPFFYPGMNSILVYV 587


>gi|443731781|gb|ELU16770.1| hypothetical protein CAPTEDRAFT_135912, partial [Capitella teleta]
          Length = 388

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 185/377 (49%), Gaps = 47/377 (12%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL--- 90
           RL SLD FRG+++ +MI V++ GG +    H+ WNG  LAD V P+F+FI+G ++AL   
Sbjct: 1   RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60

Query: 91  -ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            AL+R   R   + KV+ R + L   G+++      A       VD+R +R+ GVLQR+A
Sbjct: 61  GALRRGIPRFKLILKVLKRAMILFALGVMISNSKGKA-------VDLRTLRVPGVLQRLA 113

Query: 150 LSYLLVSLVEIFTKDVQDKDQ--SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           L+YL++ ++E       D  Q  S  R  +  L    W      + V+  L +   VP  
Sbjct: 114 LTYLVLGIMEAALAKSHDPHQWWSSVRDVVGNLG--QWAAVLMFVAVHCCLTFLLPVPGC 171

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
               +      +G  +    G          A  YIDR + G  HMY HP        T 
Sbjct: 172 PKGYLGPGGLQHGGAYENCTG---------GATAYIDRMIFGTEHMYGHP--------TC 214

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
             P++  +          P +PEG+L +++SI    +G+  G VI+  +G  +R+ +W+ 
Sbjct: 215 MIPYQTTV----------PLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSRWMC 264

Query: 328 MGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
                 L+ G    F+     IP+NK L++LSYV   +  A L+ S  +  VDI+ + + 
Sbjct: 265 WSLVTGLVAGCLCKFSAEDGFIPINKNLWSLSYVMALASMAFLLLSVCFLAVDIFRV-WS 323

Query: 384 FLPLAWIGMNAMLVYVM 400
             P  + GMN++++Y++
Sbjct: 324 GAPFIYPGMNSIVIYLL 340


>gi|157112232|ref|XP_001657450.1| hypothetical protein AaeL_AAEL000933 [Aedes aegypti]
 gi|108883723|gb|EAT47948.1| AAEL000933-PA [Aedes aegypti]
          Length = 569

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 120/403 (29%), Positives = 190/403 (47%), Gaps = 52/403 (12%)

Query: 8   TTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW 67
           +    PL  + P  S    +   +  RL SLD FRG+A+ LMI V+  GG +  I HA W
Sbjct: 157 SAREAPLAAASPSSSGHPVEP--RKTRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATW 214

Query: 68  NGCNLADFVMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
           NG ++AD V P+FLFI+GV I ++L+    R   R   +  V  R+ KL   G+ L    
Sbjct: 215 NGLHVADLVFPWFLFIMGVCIPISLRSQVSRNIPRKTILANVAVRSFKLFCIGLCLNS-- 272

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS-VGRFSIFRLYC 182
                    G  V  +RL GVLQR  ++Y +VS + ++      + Q  + R ++  L  
Sbjct: 273 -------INGPQVANLRLFGVLQRFGVAYFVVSAIHLYCYSESIEFQGRLARLNVDILRL 325

Query: 183 W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAV 240
           W HW++   ++ +YL +++    P                ++ N T G+           
Sbjct: 326 WKHWIIMGAIVFIYLLIMFLVAAPGCPSGYFGPGGKHLMAMYPNCTGGI----------T 375

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GY+DR +LG NH+Y HP  R                 DA +     F+PEG    + +IL
Sbjct: 376 GYLDRIILGNNHLYQHPTARYV--------------YDAQA-----FDPEGPFGCLPTIL 416

Query: 301 STIIGVHFGHVII-HTK--GHLARLKQWVTMGFALLIFGLTLHFTNA-IPLNKQLYTLSY 356
              +G+  G +I+ HT+    + R+  W T+   L          +  IP+NK L++LSY
Sbjct: 417 QVFLGLQCGVLILTHTEVMARIRRMAAWGTVLGLLGGILCGFSKNDGWIPVNKNLWSLSY 476

Query: 357 VCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           V VT+  A ++    + L+D+  L +   P  + GMNA+++YV
Sbjct: 477 VLVTASLAFVLLLICFVLIDVKRL-WTGNPFLYAGMNAIILYV 518


>gi|417411831|gb|JAA52337.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
           [Desmodus rotundus]
          Length = 595

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 117/381 (30%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRGLA+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 199 RLRCVDTFRGLALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMS 258

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +  R  +    + KV +R+  L+  G+++       P+     +    +RL GVLQR+ 
Sbjct: 259 SVLQRGCSKFRLLGKVAWRSFLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLG 313

Query: 150 LSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E+ F K V ++    GR S  +  +  W  WL    +  ++LAL +   VP
Sbjct: 314 VTYFVVAVLELLFAKPVPERGAWEGRCSSLQDIMSSWPQWLFILMLESIWLALTFFLPVP 373

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 374 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS------ 417

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 418 ------------STVLYHTRVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILI 465

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W      L+  GLT    N   IP+NK L+++SYV   S  A  +   +Y +VD+ 
Sbjct: 466 RFTAWCCF-LGLISVGLTKISENEGFIPVNKNLWSISYVTTLSSFAFFILLILYPIVDVK 524

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            + +   P  + GMN++LVYV
Sbjct: 525 GV-WTGTPFFYPGMNSILVYV 544


>gi|403303686|ref|XP_003942455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Saimiri boliviensis boliviensis]
          Length = 631

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 235 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSIFLSMT 294

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 295 SIMQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 349

Query: 150 LSYLLVSLVE-IFTKDVQD---KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +    ++S             WL+   +  ++L L +   VP
Sbjct: 350 VTYFVVAVLELLFAKPVPEHCASERSCLSLQDITSSWPQWLLILALEGLWLGLTFLLPVP 409

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG NH+Y HP    S A
Sbjct: 410 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDNHLYQHP----SSA 455

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SIL   +GV  G ++++    TK  L 
Sbjct: 456 VLYHT--------------EVAYDPEGILGTINSILMAFLGVQAGKILLYYKARTKDILI 501

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 502 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 560

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 561 GL-WTGTPFFYPGMNSILVYV 580


>gi|390367684|ref|XP_789038.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Strongylocentrotus purpuratus]
          Length = 624

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 193/398 (48%), Gaps = 54/398 (13%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++E D +  Q  S  K +RL SLD FRG+++ +MI V++ GG +   +H+ WNG  +AD 
Sbjct: 215 VAEADSNSIQRPSRDKPKRLKSLDAFRGMSLVIMIFVNYGGGQYSFFNHSIWNGLTVADL 274

Query: 76  VMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F++I+GV+I ++    ++    R     K+I R + L   GI+L G          
Sbjct: 275 VFPWFIWIMGVSITMSFYALVRHGVSRRVIFTKIIRRFVILFGLGIILDG---------- 324

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL------YCWHW 185
            G+D    R+ GVLQRIA SYL+V+ V +F    +D++  + R  ++R       Y + W
Sbjct: 325 -GIDFSTFRVPGVLQRIAFSYLVVATVHLFAVKHKDEEYRI-RHVVYRELRDLLDYWYEW 382

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           ++    L +++ L +   VP      +       G+         + +N    A  YID+
Sbjct: 383 IIMISFLALHICLTFFLPVPGCPTGYLGPGGPLVGE-------NESLVNCTGGAANYIDK 435

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +L  NH Y     R+    T                   P +PEG+L +++SI  T +G
Sbjct: 436 VILTYNHTYPRGTPRKIYQTT------------------VPHDPEGILGTLTSIFMTFLG 477

Query: 306 VHFG---HVIIHTKGHLARLKQW--VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  G   H+  + +  + R   W  VT   A  + G +      IP+NK L+++S++  T
Sbjct: 478 LQAGKIFHLFSYPRDRILRFLGWCVVTGVIAGALCGFSKE-DGIIPVNKNLWSVSFILAT 536

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
           +  A  + +  Y L+D+  + +  +P  ++GMN++ VY
Sbjct: 537 ASMAFFLLAIFYYLIDV-QIWWTGVPFYFVGMNSIAVY 573


>gi|443685179|gb|ELT88879.1| hypothetical protein CAPTEDRAFT_26311, partial [Capitella teleta]
          Length = 361

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 186/380 (48%), Gaps = 54/380 (14%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL--- 90
           RL SLD FRG+++ +MI V++ GG +    H+ WNG  LAD V P+F+FI+G ++AL   
Sbjct: 1   RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60

Query: 91  -ALKRIPDRADAVKKVIFRTLKLLFWGILL---QGGFSHAPDELTYGVDVRMIRLCGVLQ 146
            AL+R   R   + KV+ R + L   G+++   +G F           D+R +R+ GVLQ
Sbjct: 61  GALRRGIPRFKLILKVLKRAMILFALGVMISNSKGAF-----------DLRTLRVPGVLQ 109

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQ--SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           R+AL+YL++ ++E       D  Q  S+ R  +  L    W      + V+  L +   V
Sbjct: 110 RLALTYLVLGIMEAALAKSHDPHQWWSLVRDVVGNLG--QWAAVLMFVAVHCCLTFLLPV 167

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           P      +      +G  +    G          A  YIDR + G  HMY HP       
Sbjct: 168 PGCPKGYLGPGGLQHGGAYENCTG---------GATAYIDRMIFGTEHMYGHP------- 211

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
            T   P++             P +PEG+L +++SI    +G+  G VI+  +G  +R+ +
Sbjct: 212 -TCMIPYQ----------TTVPLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSR 260

Query: 325 WVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNL 380
           W+       L+ G    F+     IP+NK L++LSYV   +  A L+ S  +  VDI+ +
Sbjct: 261 WMCWSLVTGLVAGCLCKFSAEDGFIPINKNLWSLSYVMALASMAFLLLSVCFLAVDIFRV 320

Query: 381 KYPFLPLAWIGMNAMLVYVM 400
            +   P  + GMN++++Y++
Sbjct: 321 -WSGAPFIYPGMNSIVIYLL 339


>gi|395842491|ref|XP_003794051.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Otolemur garnettii]
          Length = 677

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/396 (31%), Positives = 199/396 (50%), Gaps = 53/396 (13%)

Query: 22  SDQQEKS-HLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           SD Q  + HL T   RL  +D FRG+++ LM+ V++ GG +    H+ WNG  +AD V P
Sbjct: 266 SDAQPATWHLSTHPPRLRCVDTFRGISLTLMVFVNYGGGKYWYFKHSSWNGLTVADLVFP 325

Query: 79  FFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           +F+FI+G ++ L+    L+R   +   ++K+ +R+  L+  GI++       P+     +
Sbjct: 326 WFVFIMGSSVFLSMTSVLQRGCSKGRLLRKIAWRSFLLICIGIIIVN-----PNYCLGPL 380

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFRLY-CW-HWLMAAC 190
               +R+ GVLQR+ ++Y +V+++E +F K V +   S  G FS+  +   W  WL+   
Sbjct: 381 SWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPENCASQRGCFSLGDVTSSWPQWLLILT 440

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           +  V+L L +   VP      +      D GK  N T G          A GYID  +LG
Sbjct: 441 LESVWLCLTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDHLLLG 490

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
            NH+YHHP    S A    +                 ++PEG+L +++SI+   +GV  G
Sbjct: 491 ENHLYHHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAG 532

Query: 310 HVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGA 363
            ++++    TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  
Sbjct: 533 KILLYYKDQTKDILMRFAGWCCI-LGLISVALTKVSENEGFIPVNKNLWSISYVTTLSCF 591

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 592 AFFILLVLYPVVDVKGL-WTGAPFFYPGMNSILVYV 626


>gi|194226375|ref|XP_001488696.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Equus caballus]
          Length = 663

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 122/403 (30%), Positives = 195/403 (48%), Gaps = 51/403 (12%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA- 91
           QRL  +D FRG+A+ +M+ V++ GG +    H+ WNG  +AD V P+F+FI+G +I L+ 
Sbjct: 266 QRLRCVDTFRGIALIIMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSM 325

Query: 92  ---LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
              L+R   +   + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+
Sbjct: 326 TSTLQRGCSKFRLLGKIAWRSFLLISLGIVVVN-----PNYCLGPLSWDKLRIPGVLQRL 380

Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYV 204
            ++Y +V+++E +F K V     S  R S  R  L  W  WL    +  ++L L +   V
Sbjct: 381 GVTYFVVAVLELLFAKPVPGSGASERRCSSLRDILSSWPQWLFILLLESIWLGLTFFLPV 440

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D G+  N T G          A GYIDR +LG +H+Y HP    S 
Sbjct: 441 PGCPTGYLGPGGIGDLGRYPNCTGG----------AAGYIDRLLLGEDHLYQHP----SS 486

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHL 319
           A    +                 ++PEG+L +++SI+   +GV  G ++++    TK  L
Sbjct: 487 AVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGRILLYYKDQTKAIL 532

Query: 320 ARLKQWVTMGFALLIFGLTLHFTN--AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
            R   W      L+   LT    N  +IP+NK L+++SYV   S  A  +   +Y  VD+
Sbjct: 533 LRFTAWSCF-LGLISVALTKVSENEGSIPINKNLWSISYVTTLSSFAFFILLVLYPTVDV 591

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420
             L +   P  + GMN++LVYV   E   + F   W  GD  +
Sbjct: 592 KGL-WTGTPFFYPGMNSLLVYV-GHEVFESYFPFQWKLGDDQS 632


>gi|194762450|ref|XP_001963347.1| GF20351 [Drosophila ananassae]
 gi|190629006|gb|EDV44423.1| GF20351 [Drosophila ananassae]
          Length = 576

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 188/386 (48%), Gaps = 59/386 (15%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYTWIDHAAWNGLHLADLVFPSFLWIMGVCIPL 241

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +     ++++R++KL   G+ L        +EL         R+ GVLQ
Sbjct: 242 SVKAQLSRGASKGRICLRILWRSIKLFAIGLCLNSMSGPGLEEL---------RIMGVLQ 292

Query: 147 RIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           R  +++L+V ++      +D     +S  R      L+     +   ++  YL L +G  
Sbjct: 293 RFGVAFLVVGVLHTLCSRRDPISPQRSWQRAVHDICLFSGELAVLLALVATYLGLTFGLR 352

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGINHMYHHPAWRR 261
           VP        K     G  F+         NP C   A GY+D KVLG  H+Y HP    
Sbjct: 353 VPG-----CPKGYLGPGGKFDYAS------NPNCIGGAAGYVDLKVLGNAHIYQHP---- 397

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
           +     DS               A F+PEG+   + S++  ++G   G  ++      +R
Sbjct: 398 TAKYVYDS---------------AAFDPEGIFGCILSVVQVLLGAFAGVTLLVHPTWQSR 442

Query: 322 LKQWVTMGFAL-LIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
           +++W+ +   L LI G    F+    AIP+NK L++LS+VCVT   A ++ S +Y  +D+
Sbjct: 443 IRRWLILAVVLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVTVSLALIILSLLYYFIDV 502

Query: 378 ---WNLK-YPFLPLAWIGMNAMLVYV 399
              WN   YPF      GMNA+++YV
Sbjct: 503 RQTWNWSGYPFTE---CGMNAIVMYV 525


>gi|332241088|ref|XP_003269721.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Nomascus leucogenys]
          Length = 654

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 258 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMA 317

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 318 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 372

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 373 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 432

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 433 GCPIGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 478

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 479 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 524

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 525 RFTAWCCI-LGLISVALTKVSENERFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 583

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 584 GL-WTGTPFFYPGMNSILVYV 603


>gi|74208071|dbj|BAE29143.1| unnamed protein product [Mus musculus]
          Length = 656

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 191/400 (47%), Gaps = 53/400 (13%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301

Query: 76  VMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R  +    + K+++R+  L+  G+++       P+   
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKFKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506

Query: 305 GVHFGHVIIH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 566

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 567 LSCFAFFILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 605


>gi|158294726|ref|XP_315774.3| AGAP005761-PA [Anopheles gambiae str. PEST]
 gi|157015699|gb|EAA10745.3| AGAP005761-PA [Anopheles gambiae str. PEST]
          Length = 581

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 119/380 (31%), Positives = 179/380 (47%), Gaps = 64/380 (16%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD FRG+A+ LMI V+  GG +  I HA WNG ++AD V P+FLFI+GV + ++L
Sbjct: 202 KRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVCVPISL 261

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           +   +R         R++KL   G+ L             G  +  +R+ GVLQR  ++Y
Sbjct: 262 RGQLNRN--------RSVKLFIIGLCLNS---------MNGPSMANLRIFGVLQRFGIAY 304

Query: 153 LLVSLVEIFTKDVQDKDQSVGRF-----SIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           L+VS V +   + Q + QS  R       I RL    WL+   + V+YL +++    P  
Sbjct: 305 LVVSTVHLLCHEQQVQVQSQNRLLRASEDIVRLKK-QWLVIGLLTVLYLVVMFFVPAPGC 363

Query: 208 QFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                          F N T G+           GYIDR +LGI H+Y HP  R      
Sbjct: 364 PSAYFGPGGKHLYNAFPNCTGGI----------TGYIDRALLGIAHLYQHPTARYV---- 409

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
               ++G            PF+PEG    + +IL   +G+  G  I+    H  R+ ++ 
Sbjct: 410 ----YDG-----------MPFDPEGPFGCLPTILQVFLGLQCGCTILAYTEHRQRMVRFA 454

Query: 327 TMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI---WN 379
           +    L L  G    FT     IP+NK L++LSYV  T+  A  +    Y  +D+   W+
Sbjct: 455 SWSLVLGLAAGALCGFTKNDGWIPINKNLWSLSYVLATASLAHALLLLCYYAIDVKRAWH 514

Query: 380 LKYPFLPLAWIGMNAMLVYV 399
            +    P  + GMNA+++YV
Sbjct: 515 GR----PFVYAGMNAIVLYV 530


>gi|26330552|dbj|BAC29006.1| unnamed protein product [Mus musculus]
 gi|74213594|dbj|BAE35603.1| unnamed protein product [Mus musculus]
 gi|74225342|dbj|BAE31601.1| unnamed protein product [Mus musculus]
          Length = 624

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 191/400 (47%), Gaps = 53/400 (13%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 210 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 269

Query: 76  VMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R  +    + K+++R+  L+  G+++       P+   
Sbjct: 270 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 324

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 325 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 382

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 383 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 432

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 433 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 474

Query: 305 GVHFGHVIIH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 475 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 534

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 535 LSCFAFFILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 573


>gi|432099917|gb|ELK28811.1| Heparan-alpha-glucosaminide N-acetyltransferase [Myotis davidii]
          Length = 586

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 117/380 (30%), Positives = 187/380 (49%), Gaps = 48/380 (12%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRGLA+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 190 RLRCVDTFRGLALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 249

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + KV +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 250 SILQRGCSKFRLLGKVAWRSFLLICIGIVIVN-----PNYCLGPLSWDKVRIPGVLQRLG 304

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S  R S  +     W  WL+   +  V+LAL +   VP
Sbjct: 305 VTYFVVAVLELLFAKPVPESCVSERRCSCLQDITSSWPQWLVILMLESVWLALTFFLPVP 364

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 365 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHIYQHPSSNVLYH 414

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
            T                    ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 415 TT------------------VAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 456

Query: 321 RLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           R    W  +G   ++          IP+NK L+++SYV   S  A  +   +Y +VD+  
Sbjct: 457 RFTAWWCFLGLISVVLTKVSENEGFIPVNKNLWSISYVTTLSSFAFFLLLILYPIVDVKG 516

Query: 380 LKYPFLPLAWIGMNAMLVYV 399
           L +   P  + GMN++LVYV
Sbjct: 517 L-WTGAPFYYPGMNSILVYV 535


>gi|297682811|ref|XP_002819101.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pongo
           abelii]
          Length = 645

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 119/381 (31%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 249 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 308

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 309 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 363

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 364 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 423

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 424 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 469

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +V+SI+   +GV  G ++++    TK  L 
Sbjct: 470 VLYHT--------------EVAYDPEGILGTVNSIVMAFLGVQAGKILLYYKARTKDILI 515

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 516 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 574

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 575 GL-WTGTPFFYPGMNSILVYV 594


>gi|115292433|ref|NP_084160.1| heparan-alpha-glucosaminide N-acetyltransferase [Mus musculus]
 gi|341940800|sp|Q3UDW8.2|HGNAT_MOUSE RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
           AltName: Full=Transmembrane protein 76
          Length = 656

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 191/400 (47%), Gaps = 53/400 (13%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301

Query: 76  VMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R  +    + K+++R+  L+  G+++       P+   
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506

Query: 305 GVHFGHVIIH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 566

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 567 LSCFAFFILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 605


>gi|23272280|gb|AAH24084.1| Hgsnat protein [Mus musculus]
 gi|148700869|gb|EDL32816.1| DNA segment, Chr 8, ERATO Doi 354, expressed [Mus musculus]
          Length = 624

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 191/400 (47%), Gaps = 53/400 (13%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 210 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 269

Query: 76  VMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R  +    + K+++R+  L+  G+++       P+   
Sbjct: 270 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 324

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 325 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 382

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 383 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 432

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 433 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 474

Query: 305 GVHFGHVIIH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 475 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 534

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 535 LSCFAFFILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 573


>gi|74198170|dbj|BAE35261.1| unnamed protein product [Mus musculus]
          Length = 624

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 115/392 (29%), Positives = 186/392 (47%), Gaps = 52/392 (13%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
           Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI
Sbjct: 218 QPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFI 277

Query: 84  VGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +G +I L++  I  R  +    + K+++R+  L+  G+++       P+     +    +
Sbjct: 278 MGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCLGPLSWDKV 332

Query: 140 RLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HWLMAACVLV 193
           R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  WL    +  
Sbjct: 333 RIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQWLTILTLES 390

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++LAL +   VP      +      D GK  + T G          A GYIDR +LG NH
Sbjct: 391 IWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRLLLGDNH 440

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP+                            ++PEG+L +++SI+   +GV  G ++
Sbjct: 441 LYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFLGVQAGKIL 482

Query: 313 IH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           ++    TK  L R   W   +G   ++          IP+NK L+++SYV   S  A  +
Sbjct: 483 VYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTTLSCFAFFI 542

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
              +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 543 LLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 573


>gi|51491261|emb|CAH18694.1| hypothetical protein [Homo sapiens]
          Length = 459

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 120/397 (30%), Positives = 193/397 (48%), Gaps = 50/397 (12%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           + DV     +      RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V 
Sbjct: 47  DGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVF 106

Query: 78  PFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI++       P+     
Sbjct: 107 PWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGP 161

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAA 189
           +    +R+ GVLQR+ ++Y +V+++E +F K V +   S       R     W  WL+  
Sbjct: 162 LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLIL 221

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            +  ++L L +   VP      +      D+GK  N T G          A GYIDR +L
Sbjct: 222 VLEGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLL 271

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +H+Y HP    S A    +                 ++PEG+L +++SI+   +GV  
Sbjct: 272 GDDHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQA 313

Query: 309 GHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSG 362
           G ++++    TK  L R   W  +   L+   LT    N   IP+NK L++LSYV   S 
Sbjct: 314 GKILLYYKARTKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSS 372

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 373 FAFFILLVLYPVVDVKGL-WTGTPFFYPGMNSILVYV 408


>gi|397505549|ref|XP_003823319.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
           1 [Pan paniscus]
          Length = 585

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 189 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 249 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 303

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 304 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 363

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 364 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 409

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 410 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 455

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 456 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 514

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 515 GL-WTGTPFFYPGMNSILVYV 534


>gi|150378452|ref|NP_689632.2| heparan-alpha-glucosaminide N-acetyltransferase precursor [Homo
           sapiens]
 gi|332826066|ref|XP_519741.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pan
           troglodytes]
 gi|194385774|dbj|BAG65262.1| unnamed protein product [Homo sapiens]
 gi|410222096|gb|JAA08267.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
 gi|410256018|gb|JAA15976.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
 gi|410299048|gb|JAA28124.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
          Length = 635

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 506 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 564

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 565 GL-WTGTPFFYPGMNSILVYV 584


>gi|124007195|sp|Q68CP4.2|HGNAT_HUMAN RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
           AltName: Full=Transmembrane protein 76
          Length = 663

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 326

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 381

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 382 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 441

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 442 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 487

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 488 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 533

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 534 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 592

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 593 GL-WTGTPFFYPGMNSILVYV 612


>gi|410332579|gb|JAA35236.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
          Length = 635

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 189/381 (49%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 506 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 564

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 565 GL-WTGTPFFYPGMNSILVYV 584


>gi|291409013|ref|XP_002720836.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Oryctolagus cuniculus]
          Length = 613

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 119/403 (29%), Positives = 194/403 (48%), Gaps = 54/403 (13%)

Query: 24  QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           Q E  HL     RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+
Sbjct: 205 QPETWHLSAAKHRLRCVDTFRGIALVLMVFVNYGGGRYWYFRHSSWNGLTVADLVFPWFV 264

Query: 82  FIVGVAIAL----ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G +I L    AL+R   +   + K+ +R+  L+  GI++       P+     +   
Sbjct: 265 FIMGSSIFLSMMSALQRGCSKLRLLGKIAWRSFLLIMIGIVIVN-----PNYCLGPLSWD 319

Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLV 193
            +R+ GVLQR+ ++Y +V+++E +F K V +        +  R     W  WL+   +  
Sbjct: 320 KVRIPGVLQRLGVTYFVVAVLELLFAKPVPENWVLESSCTCLRDVTSSWPQWLLILLLES 379

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++L L +   VP      +      D+GK  N T G          A GYIDR +LG +H
Sbjct: 380 IWLGLSFFLPVPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRVLLGDDH 429

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP+                            ++PEG+L +++SI++  +GV  G ++
Sbjct: 430 LYKHPS------------------STVLYHTEVAYDPEGILGTINSIVTAFLGVQAGKIL 471

Query: 313 I----HTKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           +     TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A  
Sbjct: 472 LFYKDQTKSILIRFTAWSCI-LGLISVALTKISENEGFIPINKNLWSISYVTTLSSFAFF 530

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGF 409
           +   +Y +VD+  L +   P  + GMN++LVYV     +F G+
Sbjct: 531 ILLVLYPIVDVKGL-WTGAPFFYPGMNSILVYV--GHKVFEGY 570


>gi|432907420|ref|XP_004077635.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oryzias latipes]
          Length = 482

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 191/389 (49%), Gaps = 52/389 (13%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           +    +  RL SLD FRG A+ +M+ V++ GG +    HAPWNG  +AD VMP+F+FI+G
Sbjct: 80  DSRGTRPARLLSLDTFRGFALTVMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFIMG 139

Query: 86  VAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
            ++ LA     +R   R   + K+ +RT+ L+  G       +++P +    +    +R+
Sbjct: 140 TSVVLAFSSMQRRGVGRRQLLGKITWRTVVLMLLGFCF---LNYSPRDGP--LSWSWLRI 194

Query: 142 CGVLQRIALSYLLVSLVEIF--TKDV-QDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
            GVLQR+A +Y ++SL++ F   K + + ++          L+   WL+   +  ++L +
Sbjct: 195 PGVLQRLAFTYFVLSLLQTFWGRKAIPESENHWWNPVQDVVLFWPQWLLIFLLETLWLCI 254

Query: 199 LYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
            +   VP+     +      D+G   N T G          A G IDR + G N MY +P
Sbjct: 255 TFLMPVPNCPTGYLGAGGIGDHGLYPNCTGG----------AAGSIDRWMFGDN-MYRYP 303

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK- 316
             ++     Q                  P++PEG+L +++SI+   +G+  G +I+  K 
Sbjct: 304 TCKKLYRTEQ------------------PYDPEGVLGTINSIVMGFLGMQAGKIIVFYKR 345

Query: 317 --GH-LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
             GH L R   W V +G +  I          IP+NK L++LSYV  T   + L+   +Y
Sbjct: 346 KSGHILWRYLTWAVILGISAAILSKCTRDGGFIPVNKNLWSLSYVTCTGALSFLLLGGMY 405

Query: 373 ALVDI--WNLKYPFLPLAWIGMNAMLVYV 399
            L+D+  W    PFL   + GMN++LVYV
Sbjct: 406 FLIDVRGWWGGQPFL---YPGMNSILVYV 431


>gi|395501613|ref|XP_003755186.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Sarcophilus harrisii]
          Length = 425

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/397 (29%), Positives = 186/397 (46%), Gaps = 52/397 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFR--GLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           D S    K  +   R   L +    GL++ LM+ V++ GG +    HAPWNG  +AD VM
Sbjct: 13  DASVFNNKGKIINFRPWELSLVSKHGLSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVM 72

Query: 78  PFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGIL-LQGGFSHAPDELTY 132
           P+F+FI+G ++ L    +  R       ++KV +RT  L+  G+L L  G +  P   ++
Sbjct: 73  PWFVFILGTSVGLTFHNMQKRGVKNIQLLRKVAWRTGVLIIIGVLFLNYGPADGPLSWSW 132

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR-FSIFR---LYCWHWLMA 188
                  RL GVLQR+  +Y  V+L++I       +   V   ++ FR   LY   W + 
Sbjct: 133 A------RLPGVLQRLGFTYFAVALMQIAFGVTDTQIYQVNLWWAPFRDVILYWKEWFII 186

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
             + +++L L +   VP      +      D GK FN T G          A  YID+ +
Sbjct: 187 ISLEILWLCLTFLLPVPGCPRGYLGPGGIGDEGKYFNCTGG----------AAAYIDKWI 236

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           LG NH+Y  P+ +     TQ                  PF+PEG+L +++SIL    G+ 
Sbjct: 237 LGENHLYQFPSCKELYKTTQ------------------PFDPEGILGTINSILMAFFGLQ 278

Query: 308 FGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            G +I+  +      L R   W + +G    I  +       IP+NK L++LS+V  TS 
Sbjct: 279 AGKIILMYRNKPRSILRRFLIWSILLGIISAILTMGTQNEGFIPVNKNLWSLSFVTTTSC 338

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            + ++   +Y + D+ N  +   P  + GMN++LVYV
Sbjct: 339 FSFILLGLLYYITDV-NGWWNGCPFIFPGMNSILVYV 374


>gi|91078976|ref|XP_974454.1| PREDICTED: similar to heparan-alpha-glucosaminide
           N-acetyltransferase [Tribolium castaneum]
 gi|270003686|gb|EFA00134.1| hypothetical protein TcasGA2_TC002950 [Tribolium castaneum]
          Length = 566

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 192/387 (49%), Gaps = 42/387 (10%)

Query: 19  PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           P V ++         R+ S+D+FRG  + +MI V++ GG +   SH+ WNG  +AD V P
Sbjct: 166 PLVIERTPSIRKHPHRIKSIDVFRGFCIMIMIFVNYGGGKYWFFSHSVWNGLTVADLVFP 225

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           +FL+++GV+ A++L+    RA   ++++   ++  F  ILL    +   +  T G     
Sbjct: 226 WFLWLMGVSFAVSLQAKLRRAVPRRQLVIGVMRRSFILILLGIIINSNQNLQTIG----S 281

Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
           +R  GVLQRI + Y +V ++E IFTK  + + +SV       +    WL    ++V++  
Sbjct: 282 LRFPGVLQRIGVCYFIVGMLEIIFTK--RSEVESVSCIYDVAVAWPQWLCVTVLVVIHTC 339

Query: 198 LLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           + +   VP      +      D G+ +N T GV           GYIDR+V G  HM+ +
Sbjct: 340 VTFLGDVPGCGRGYLGPGGLDDNGRFYNCTGGV----------AGYIDRQVFG-EHMHKN 388

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
           P  ++                         F+PEG+L +++S+L+   GV  G  +   +
Sbjct: 389 PVCKKLYE------------------IDVYFDPEGILGTLTSVLTVYFGVQAGRTLNTYQ 430

Query: 317 GHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIY 372
              A++ +WV  G  A L+ G    F      IPLNKQL++LS+  V SG A ++ + ++
Sbjct: 431 NVKAKVIRWVVWGSLAGLLGGALCEFKQNDGLIPLNKQLWSLSFALVLSGMAFIIQAFLF 490

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            LVDI   K+   P  + GMN++ +YV
Sbjct: 491 VLVDILR-KWGGRPFFYPGMNSLFLYV 516


>gi|355779672|gb|EHH64148.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
           fascicularis]
          Length = 596

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 185/380 (48%), Gaps = 48/380 (12%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 200 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 259

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 260 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 314

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 315 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 374

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 375 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 418

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 419 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 466

Query: 321 RLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           R   W   +G   ++          IP+NK L++LSYV   S  A  +   +Y +VD+  
Sbjct: 467 RFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKG 526

Query: 380 LKYPFLPLAWIGMNAMLVYV 399
           L +   P  + GMN++LVYV
Sbjct: 527 L-WTGTPFFYPGMNSILVYV 545


>gi|363733262|ref|XP_420455.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gallus
           gallus]
          Length = 581

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/377 (31%), Positives = 183/377 (48%), Gaps = 40/377 (10%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +I+L+L
Sbjct: 184 QRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSL 243

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
                   + +KV+++ L   F  ILL G     P+     +    +R+ GVLQR+ L+Y
Sbjct: 244 SSTLRWGSSKQKVLWKILWRSFLLILL-GVIVVNPNYCLGALSWENLRIPGVLQRLGLTY 302

Query: 153 LLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           L+V+ +E +FT+   D    + S         +   W+    + V++L L +   VP   
Sbjct: 303 LVVAALELLFTRTGADSGTLEMSCPALQDILPFWPQWIFILMLEVIWLCLTFLLPVPGCP 362

Query: 209 FTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
              +      D+G   N T G          A GYIDR VLG  H+Y HP+       T 
Sbjct: 363 RGYLGPGGIGDFGNYLNCTGG----------AAGYIDRLVLGEKHIYQHPSCNVLYQTT- 411

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARLK 323
                             P++PEG+L ++++IL   +G+  G +I+  K      ++R  
Sbjct: 412 -----------------VPYDPEGILGTINTILMAFLGLQAGKIILSYKDQHKQIMSRFF 454

Query: 324 QW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
            W V MG    I          IP+NK L++ SYV   S  A ++   +Y LVD+  L +
Sbjct: 455 IWSVVMGIISAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYYLVDVKRL-W 513

Query: 383 PFLPLAWIGMNAMLVYV 399
              P  + GMN++LVY+
Sbjct: 514 SGTPFFYPGMNSILVYI 530


>gi|281351504|gb|EFB27088.1| hypothetical protein PANDA_006846 [Ailuropoda melanoleuca]
          Length = 557

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 192/383 (50%), Gaps = 50/383 (13%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G ++ L+
Sbjct: 159 SPRLRCVDTFRGIALILMVFVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLS 218

Query: 92  LKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           +  +  R  +    + K+ +R+  L+  G+++       P+     +    +R+ GVLQR
Sbjct: 219 MTSVLQRGCSKFKLLGKIAWRSFLLICIGVVIVN-----PNYCLGPLSWDKVRIPGVLQR 273

Query: 148 IALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFR-LYCW-HWLMAACVLVVYLALLYGTY 203
           + ++Y +V+++E IF K V +   S  G FS+   ++ W  WL    +  ++L L +   
Sbjct: 274 LGVTYFVVAVLELIFAKPVPESCASERGCFSLRDIIFSWPQWLFILMLESIWLGLTFFLP 333

Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           VP      +      D+GK  N T G          A GYIDR +LG +H+Y HP    S
Sbjct: 334 VPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----S 379

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGH 318
            A    +                 ++PEG+L +++SI+   +G+  G ++++    TK  
Sbjct: 380 SAVLYHT--------------EVAYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDI 425

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L R   W      L+   LT    N   IP+NK L+++SYV   S  A  +   +Y +VD
Sbjct: 426 LIRFTAWCCF-LGLISVALTKVSENEGFIPINKNLWSISYVTTLSSFAFFILLILYPIVD 484

Query: 377 IWNLKYPFLPLAWIGMNAMLVYV 399
           +  L +   P  + GMN++LVYV
Sbjct: 485 VKGL-WTGTPFFYPGMNSILVYV 506


>gi|380789677|gb|AFE66714.1| heparan-alpha-glucosaminide N-acetyltransferase precursor [Macaca
           mulatta]
 gi|383410547|gb|AFH28487.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
 gi|384945386|gb|AFI36298.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
          Length = 635

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 185/380 (48%), Gaps = 48/380 (12%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 457

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 458 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 505

Query: 321 RLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           R   W   +G   ++          IP+NK L++LSYV   S  A  +   +Y +VD+  
Sbjct: 506 RFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKG 565

Query: 380 LKYPFLPLAWIGMNAMLVYV 399
           L +   P  + GMN++LVYV
Sbjct: 566 L-WTGTPFFYPGMNSILVYV 584


>gi|395507548|ref|XP_003758085.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Sarcophilus harrisii]
          Length = 634

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 119/394 (30%), Positives = 195/394 (49%), Gaps = 48/394 (12%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D+  +  +  L   RL SLD FRG+A+ +M+ V++ GG +    H  WNG  LAD V P+
Sbjct: 224 DLQVEAWRLTLPVYRLRSLDTFRGIALIIMVFVNYGGGKYWFFKHESWNGLTLADLVFPW 283

Query: 80  FLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F+FI+G +IAL+L  +  R       + K+++R+L L   GI +       P+     + 
Sbjct: 284 FVFIMGSSIALSLSSMLRRGCSKWKLLGKILWRSLLLCVIGIFIVN-----PNYCLGPLS 338

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACV 191
              +R+ GVLQR+ L+YL+V+++E +F K V +    ++S   F     Y   W+    +
Sbjct: 339 WDKLRIPGVLQRLGLTYLVVAVLELLFAKAVPENSAMERSCSSFQDIISYWPQWIFILML 398

Query: 192 LVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
              ++ + +   VP      +      D+GK  N T G          A GYIDR +LG 
Sbjct: 399 EAAWVCVTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGE 448

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           +H+Y HP+                   +       P++PEGLL +++SI+   +GV  G 
Sbjct: 449 DHIYQHPS------------------PNVLYHTKVPYDPEGLLGTINSIVMAFLGVQAGK 490

Query: 311 VIIHTKGHLARLKQWVTMGFALL--IFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAA 365
           +++  K    ++     +  A+L  I G+   F+     IP+NK L+++SYV   S  A 
Sbjct: 491 ILLFYKDQPKQIMLRFLLWSAMLGIISGVLTKFSQNEGFIPVNKNLWSISYVTTLSFFAF 550

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           L+   +Y LVD+  L +   P  + GMN++LVYV
Sbjct: 551 LLLLFMYFLVDVARL-WSGAPFFYPGMNSILVYV 583


>gi|426256612|ref|XP_004021932.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Ovis
           aries]
          Length = 641

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 131/448 (29%), Positives = 216/448 (48%), Gaps = 67/448 (14%)

Query: 17  SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           S    SD Q ++  ++    RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 225 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 284

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G +I L++  I  R  +    + K+++R+  L+  GI     F   P+ 
Sbjct: 285 DLVFPWFVFIMGASIFLSMASILQRGCSKLRLLGKIVWRSFLLICIGI-----FVVNPNY 339

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYC-W- 183
               +     R+ GVLQR+  +Y +V+++E +F K V +    ++S   FS+  +   W 
Sbjct: 340 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCALERSC--FSLLDITASWP 397

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
            WL    +  V+LAL +   VP      +       G  +    G          A GY+
Sbjct: 398 QWLFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYV 448

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR +LG  H+Y HP    S A    +                 ++PEG+L +++SI+   
Sbjct: 449 DRLLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAF 490

Query: 304 IGVHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYV 357
           +GV  G ++++    T+G L R   W  +   L+   LT    N   IP+NK L+++SYV
Sbjct: 491 LGVQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSVSYV 549

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYG 416
              S  A L+  A+Y +VD+  L +   P  + GMN++LVYV     +FA +    W  G
Sbjct: 550 TTLSSLAFLILLALYPVVDVKGL-WTGAPFFYPGMNSILVYV--GHEVFASYFPFQWKLG 606

Query: 417 DP--------HNTLVCFLFI-ISYILHS 435
           D          NT+   L++ I+Y+L+ 
Sbjct: 607 DQQSHKEHLVQNTVATALWVLIAYVLYK 634


>gi|355697915|gb|EHH28463.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
           mulatta]
          Length = 596

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 185/380 (48%), Gaps = 48/380 (12%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 200 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 259

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 260 SVLQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 314

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 315 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 374

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 375 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 418

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 419 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 466

Query: 321 RLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           R   W   +G   ++          IP+NK L++LSYV   S  A  +   +Y +VD+  
Sbjct: 467 RFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKG 526

Query: 380 LKYPFLPLAWIGMNAMLVYV 399
           L +   P  + GMN++LVYV
Sbjct: 527 L-WTGTPFFYPGMNSILVYV 545


>gi|195447210|ref|XP_002071113.1| GK25317 [Drosophila willistoni]
 gi|194167198|gb|EDW82099.1| GK25317 [Drosophila willistoni]
          Length = 537

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 187/394 (47%), Gaps = 58/394 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D   K+  + +RL SLD FRGLA+ LMI V+  GG +  I H  WNG +LAD V P 
Sbjct: 176 SIGDAAAKA-TQRKRLRSLDTFRGLAIVLMIFVNSGGGGYDSIDHVAWNGLHLADLVFPC 234

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +     ++I+R+ KL   G+ L             G  
Sbjct: 235 FLWIMGVCIPLSIKSQLGRGTSKIQICGRIIWRSFKLFAIGVCLNS---------INGPK 285

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
           +  +R+ GVLQR  +++L+V L+  + ++      Q   + SI+   ++   + +   ++
Sbjct: 286 LEQLRVMGVLQRFGVAFLVVGLLHTVCSRRDHISPQQAWQRSIYDICIFSGEFAVLLALI 345

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
             YL L YG  VP      +       GK  N         NP C   A GYID++VLG 
Sbjct: 346 ATYLGLTYGLKVPGCPRGYLGPG----GKSNNAA-------NPHCIGGAAGYIDQQVLGN 394

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y +P  +     T                    F+PEGL   + S++  ++G   G 
Sbjct: 395 AHIYQYPTAKYVYDAT-------------------AFDPEGLFGCLLSVVHVLLGAFAGV 435

Query: 311 VIIHTKGHLARLKQW-VTMGFALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAAL 366
            ++      +R+K+W +      L+ G    F+     IP+NK L++LS+V VT   A L
Sbjct: 436 TLLVHPTWQSRMKRWTILALLLGLMGGALCGFSKENGLIPVNKNLWSLSFVFVTVTVALL 495

Query: 367 VFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVY 398
           +   +Y ++D+  W   YPF      GMNA+++Y
Sbjct: 496 LLDFLYYIIDVRQWWSGYPFTE---CGMNAIIMY 526


>gi|431902215|gb|ELK08716.1| Heparan-alpha-glucosaminide N-acetyltransferase [Pteropus alecto]
          Length = 585

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 115/380 (30%), Positives = 184/380 (48%), Gaps = 48/380 (12%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 189 RLRCVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + KV +R+  L+  GI     F   P+     +    +R+ GVLQR+ 
Sbjct: 249 SILQRGCSKFRLLGKVTWRSFLLICIGI-----FIVNPNYCLGPLSWDKLRIPGVLQRLG 303

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +      R S  +  +  W  WL    +  ++LAL +   VP
Sbjct: 304 VTYFVVAVLELLFAKPVPESCTVERRCSSLQDIISSWPQWLFILMLESIWLALTFFLPVP 363

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 364 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 407

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 408 ------------STVLYHTKVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 455

Query: 321 RLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           R   W   +G   + F         IP+NK L+++SYV   S  A  +   +Y +VD+  
Sbjct: 456 RFTAWCCFLGLMSVAFTKISENEGFIPVNKNLWSISYVTTLSSFAFFILLILYPVVDVKG 515

Query: 380 LKYPFLPLAWIGMNAMLVYV 399
           L +   P  + GMN++LVYV
Sbjct: 516 L-WSGAPFFYPGMNSILVYV 534


>gi|195041852|ref|XP_001991329.1| GH12115 [Drosophila grimshawi]
 gi|193901087|gb|EDV99953.1| GH12115 [Drosophila grimshawi]
          Length = 573

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 123/395 (31%), Positives = 188/395 (47%), Gaps = 58/395 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D   K+  + +RL SLD FRGL + LMI V+  GG +  I HA WNG +LAD V P 
Sbjct: 171 SIGDAAAKA-TQRKRLRSLDTFRGLCIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPS 229

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +     ++++R  KL   G+ L             G  
Sbjct: 230 FLWIMGVCIPLSIKAQLARGTSKTRICLRIVWRACKLFAIGLCLNS---------VNGPQ 280

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
           +  +RL GVLQR  ++YLLV+++  + ++  Q   Q   + +I+   L+   + +   ++
Sbjct: 281 LEQLRLMGVLQRFGIAYLLVAILHTVCSRRDQLSPQRAWQRAIYDICLFSGEFAVLLALI 340

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
             YL L +G  VP      +       GK  N         +P C   A GYID  VLG 
Sbjct: 341 ATYLGLTFGLRVPGCPVGYLGPG----GKHNNAA-------HPNCIGGAAGYIDLLVLGN 389

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y HP  +     T                    F+PEG+   + S++ T++G   G 
Sbjct: 390 AHIYQHPTAKYVYDAT-------------------AFDPEGIFGCLLSVVQTLLGAFAGV 430

Query: 311 VIIHTKGHLARLKQWV---TMGFALLIFGLTLHFTNA-IPLNKQLYTLSYVCVTSGAAAL 366
            ++       RLK+W+   T+   L          N  IP+NK L++LS+V VT+  A +
Sbjct: 431 TLLVHSTWQGRLKRWLLVATVLGLLGGALCGFSKENGIIPINKNLWSLSFVFVTAALALV 490

Query: 367 VFSAIYALVDIWNL--KYPFLPLAWIGMNAMLVYV 399
           + S IY +VD+  L   YPF      GMNA+++YV
Sbjct: 491 LLSLIYYVVDVRQLWSGYPFKE---CGMNAIIMYV 522


>gi|195476975|ref|XP_002100049.1| GE16376 [Drosophila yakuba]
 gi|194187573|gb|EDX01157.1| GE16376 [Drosophila yakuba]
          Length = 576

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 187/385 (48%), Gaps = 57/385 (14%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPSFLWIMGVCIPL 241

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +A    ++++R++KL   G+ L             G ++  +R  GVLQ
Sbjct: 242 SVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MSGPNLEQLRFMGVLQ 292

Query: 147 RIALSYLLVSLVEIFTKDVQD-KDQSVGRFSIFRLYCWHWLMAACVLVV--YLALLYGTY 203
           R  ++YL+V ++       +    Q + + ++  +  +   +A  + +V  YL L YG  
Sbjct: 293 RFGVAYLVVGVLHTLCCRREPISPQRLWQRAVHDVCLFSGELAVLLALVATYLGLTYGLR 352

Query: 204 VPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           VP      +      DY    N   G          A GY+D +VLG  H+Y HP  +  
Sbjct: 353 VPGCPRGYLGPGGKHDYNAHPNCIGG----------AAGYVDLQVLGNAHIYQHPTAKYV 402

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
              T                    F+PEG+   + S++  ++G   G  ++      +R+
Sbjct: 403 YDST-------------------AFDPEGIFGCILSVVQVLLGAFAGVTLLVHPNWQSRI 443

Query: 323 KQWVTMGFAL-LIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI- 377
           ++W  +   L LI G    F+    AIP+NK L++LS+VCVT   A L+ S +Y  +D+ 
Sbjct: 444 RRWTFLAILLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVTVSLALLILSLLYYFIDVR 503

Query: 378 --WNLK-YPFLPLAWIGMNAMLVYV 399
             W+   YPF      GMNA+++YV
Sbjct: 504 ETWSWSGYPFTE---CGMNAIVMYV 525


>gi|345482764|ref|XP_001600799.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Nasonia vitripennis]
          Length = 569

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 124/444 (27%), Positives = 207/444 (46%), Gaps = 59/444 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D  + +       +R+ SLD  RG+++ LMI V++    +  + HA WNG  + D V P 
Sbjct: 173 DEDELEVGKKTAKRRVRSLDTVRGMSILLMIFVNNGAAGYALLEHATWNGLLVGDLVFPC 232

Query: 80  FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYG 133
           F++I+GV I L+    L R   R    + ++ R++ L   G+ L   GG +         
Sbjct: 233 FMWIMGVCIPLSISAQLSRGSSRLRLCRAIVKRSVYLFAIGLALNTLGGRNQ-------- 284

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
             +  IR+ GVLQR  L+YL+  +V        DK QS             W++A  +L 
Sbjct: 285 --LERIRIFGVLQRFGLAYLVAGIVYALAARPDDK-QSKRMLGDVVALIPQWIVALLILA 341

Query: 194 VYLALLYGTYVPDWQFTIINKDS--ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
            + A+++   VP      +      AD GK +N + G          A GY+D+ +LG++
Sbjct: 342 AHCAVVFLLPVPGCPRGYLGPGGRHAD-GKYWNCSGG----------ATGYVDKVLLGVD 390

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+Y  P        T +S +              PF+PEG+L S++SI    +G+  G +
Sbjct: 391 HIYQLP--------TANSVYG-----------SGPFDPEGVLGSLTSIFQVFLGIQAGQI 431

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           +       ARL +W+     L   G  LH+TN +P+NK L+++S+V VT+  +  + S  
Sbjct: 432 LRTYGSWKARLVRWLLWAVLLGAVGAALHYTNVVPVNKNLWSVSFVLVTTCFSLGLLSLC 491

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFIISY 431
           Y L+D+  + +   P    GMNA+++Y    + ++  F   W YG P N+        ++
Sbjct: 492 YLLIDVLGV-WDGGPFRVPGMNALVMYA-GHQILYDMFPFHWRYG-PMNSH-------TW 541

Query: 432 ILHSFLWELRKFLYVQFCNLSWKF 455
           +L   LW +  + YV +     KF
Sbjct: 542 LLAESLWCVGLWTYVAYAMHRKKF 565


>gi|301765942|ref|XP_002918389.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Ailuropoda melanoleuca]
          Length = 851

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 192/383 (50%), Gaps = 50/383 (13%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G ++ L+
Sbjct: 453 SPRLRCVDTFRGIALILMVFVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLS 512

Query: 92  LKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           +  +  R  +    + K+ +R+  L+  G+++       P+     +    +R+ GVLQR
Sbjct: 513 MTSVLQRGCSKFKLLGKIAWRSFLLICIGVVIVN-----PNYCLGPLSWDKVRIPGVLQR 567

Query: 148 IALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFR-LYCW-HWLMAACVLVVYLALLYGTY 203
           + ++Y +V+++E IF K V +   S  G FS+   ++ W  WL    +  ++L L +   
Sbjct: 568 LGVTYFVVAVLELIFAKPVPESCASERGCFSLRDIIFSWPQWLFILMLESIWLGLTFFLP 627

Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           VP      +      D+GK  N T G          A GYIDR +LG +H+Y HP    S
Sbjct: 628 VPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----S 673

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGH 318
            A    +                 ++PEG+L +++SI+   +G+  G ++++    TK  
Sbjct: 674 SAVLYHT--------------EVAYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDI 719

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L R   W      L+   LT    N   IP+NK L+++SYV   S  A  +   +Y +VD
Sbjct: 720 LIRFTAWCCF-LGLISVALTKVSENEGFIPINKNLWSISYVTTLSSFAFFILLILYPIVD 778

Query: 377 IWNLKYPFLPLAWIGMNAMLVYV 399
           +  L +   P  + GMN++LVYV
Sbjct: 779 VKGL-WTGTPFFYPGMNSILVYV 800


>gi|193664422|ref|XP_001945789.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 568

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 117/396 (29%), Positives = 195/396 (49%), Gaps = 53/396 (13%)

Query: 17  SEPDVSDQQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           +EP +   Q  + +K    R+ SLD FRG+++ LM+ V+  GG +    HAPWNG  LAD
Sbjct: 162 NEPVIIHPQIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLAD 221

Query: 75  FVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           F++P+F +++GV+IA++    L+    R     +VI R++ LL  G++L           
Sbjct: 222 FILPWFCWVMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVLN---------S 272

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR---LYCWHWLM 187
               ++R  R  GVLQR+AL Y + + +E      Q    +  R  + R        W +
Sbjct: 273 VNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFTNT-RLDVIRDIIESARQWFI 331

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
              ++ ++  + +   VP        K     G ++N +    +  N    A GYIDR V
Sbjct: 332 VIILVAIHTVITFFLPVPG-----CPKGYLGPGGLYNSS----SNTNCTGGAAGYIDRLV 382

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NHMY              SP         P +   PF+PEG+LS++++ L   +GVH
Sbjct: 383 FGENHMY------------PGSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 422

Query: 308 FGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGA 363
            G +I+  +    R+K+W+     L LI G   +F+     IP+NK L++LSY  +T  +
Sbjct: 423 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDGLIPINKNLFSLSYAFITGSS 482

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           A L+F  ++ +++ W L +   P   IG N++++Y+
Sbjct: 483 AFLIFIILFLIIEHWRL-WGGSPFNEIGQNSIMLYL 517


>gi|410956346|ref|XP_003984803.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Felis
           catus]
          Length = 629

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 116/381 (30%), Positives = 186/381 (48%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 233 RLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMT 292

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  G+     F   P+     +    +R+ GVLQR+ 
Sbjct: 293 SILQRGCSKLKLMGKIGWRSFLLICIGM-----FIVNPNYCLGPLSWDKVRIPGVLQRLG 347

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E IF K V +   S       R  ++ W  WL    +  ++L L +   VP
Sbjct: 348 VTYFVVAMLELIFAKPVPESCASERSCFSLRDIIFSWPQWLFILMLESIWLGLTFFLPVP 407

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 408 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----SSA 453

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 454 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 499

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W      L+   LT    N   IP+NK L+++SYV   S  A  +   +Y +VD+ 
Sbjct: 500 RFTAWCCF-LGLISIALTKISENEGFIPINKNLWSISYVTTLSSFAFFILLILYPIVDVK 558

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 559 GL-WTGTPFFYPGMNSILVYV 578


>gi|297491309|ref|XP_002698775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
           taurus]
 gi|296472360|tpg|DAA14475.1| TPA: Heparan-alpha-glucosaminide N-acetyltransferase-like [Bos
           taurus]
          Length = 723

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 125/422 (29%), Positives = 201/422 (47%), Gaps = 54/422 (12%)

Query: 17  SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           S    SD Q ++  ++    RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 307 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 366

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI     F   P  
Sbjct: 367 DLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGI-----FVVNPKY 421

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC-W-HW 185
               +     R+ GVLQR+  +Y +V+++E +F K V +   S    FS+  +   W  W
Sbjct: 422 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQW 481

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           L    +  V+LAL +   VP      +       G  +    G          A GY+DR
Sbjct: 482 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 532

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +LG  H+Y HP    S A    +                 ++PEG+L +++SI+   +G
Sbjct: 533 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 574

Query: 306 VHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCV 359
           V  G ++++    T+G L R   W  +   L+   LT    N   IP+NK L+++SYV  
Sbjct: 575 VQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSISYVTT 633

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYGDP 418
            S  A L+  A+Y +VD+  L +   P  + GMN++LVYV     +FA +    W  GD 
Sbjct: 634 LSSLAFLILLALYPVVDVKGL-WTGAPFFYPGMNSILVYV--GHEVFANYFPFQWKLGDQ 690

Query: 419 HN 420
            +
Sbjct: 691 QS 692


>gi|328696746|ref|XP_003240114.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 591

 Score =  144 bits (364), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 117/396 (29%), Positives = 195/396 (49%), Gaps = 53/396 (13%)

Query: 17  SEPDVSDQQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           +EP +   Q  + +K    R+ SLD FRG+++ LM+ V+  GG +    HAPWNG  LAD
Sbjct: 185 NEPVIIHPQIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLAD 244

Query: 75  FVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           F++P+F +++GV+IA++    L+    R     +VI R++ LL  G++L           
Sbjct: 245 FILPWFCWVMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVLN---------S 295

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR---LYCWHWLM 187
               ++R  R  GVLQR+AL Y + + +E      Q    +  R  + R        W +
Sbjct: 296 VNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFTNT-RLDVIRDIIESARQWFI 354

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
              ++ ++  + +   VP        K     G ++N +    +  N    A GYIDR V
Sbjct: 355 VIILVAIHTVITFFLPVPG-----CPKGYLGPGGLYNSS----SNTNCTGGAAGYIDRLV 405

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NHMY              SP         P +   PF+PEG+LS++++ L   +GVH
Sbjct: 406 FGENHMY------------PGSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 445

Query: 308 FGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGA 363
            G +I+  +    R+K+W+     L LI G   +F+     IP+NK L++LSY  +T  +
Sbjct: 446 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDGLIPINKNLFSLSYAFITGSS 505

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           A L+F  ++ +++ W L +   P   IG N++++Y+
Sbjct: 506 AFLIFIILFLIIEHWRL-WGGSPFNEIGQNSIMLYL 540


>gi|194679266|ref|XP_588978.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
           taurus]
          Length = 734

 Score =  144 bits (364), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 125/422 (29%), Positives = 201/422 (47%), Gaps = 54/422 (12%)

Query: 17  SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           S    SD Q ++  ++    RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 318 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 377

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI     F   P  
Sbjct: 378 DLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGI-----FVVNPKY 432

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC-W-HW 185
               +     R+ GVLQR+  +Y +V+++E +F K V +   S    FS+  +   W  W
Sbjct: 433 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQW 492

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           L    +  V+LAL +   VP      +       G  +    G          A GY+DR
Sbjct: 493 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 543

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +LG  H+Y HP    S A    +                 ++PEG+L +++SI+   +G
Sbjct: 544 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 585

Query: 306 VHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCV 359
           V  G ++++    T+G L R   W  +   L+   LT    N   IP+NK L+++SYV  
Sbjct: 586 VQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSISYVTT 644

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYGDP 418
            S  A L+  A+Y +VD+  L +   P  + GMN++LVYV     +FA +    W  GD 
Sbjct: 645 LSSLAFLILLALYPVVDVKGL-WTGAPFFYPGMNSILVYV--GHEVFANYFPFQWKLGDQ 701

Query: 419 HN 420
            +
Sbjct: 702 QS 703


>gi|24639786|ref|NP_572198.1| CG6903 [Drosophila melanogaster]
 gi|7290544|gb|AAF45996.1| CG6903 [Drosophila melanogaster]
 gi|21483396|gb|AAM52673.1| LD22376p [Drosophila melanogaster]
          Length = 576

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 115/401 (28%), Positives = 187/401 (46%), Gaps = 61/401 (15%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           +  D   +      + +RL SLD FRG+++ LMI V+  GG +  I HA WNG +LAD V
Sbjct: 168 AAADSIGEAATKATQRKRLRSLDTFRGISIVLMIFVNSGGGGYAWIEHAAWNGLHLADVV 227

Query: 77  MPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            P FL+I+GV I L++K    R   +A    ++++R++KL   G+ L             
Sbjct: 228 FPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MS 278

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAA 189
           G ++  +R+ GVLQR  ++YL+V+++      ++     +S  R      L+     +  
Sbjct: 279 GPNLEQLRIMGVLQRFGVAYLVVAILHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLL 338

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRK 246
            ++  YL L +G  VP      +      DY              +P C   A GY D +
Sbjct: 339 ALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPKCIGGAAGYADLQ 386

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           VLG  H+Y HP  +     T                    F+PEG+   + S++  ++G 
Sbjct: 387 VLGNAHIYQHPTAKYVYDST-------------------AFDPEGIFGCILSVVQVLLGA 427

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSG 362
             G  ++      +R+++W  +   L + G  L        AIP+NK L++LS+VCVT  
Sbjct: 428 FAGVTLLVHPNFQSRIRRWTLLAILLGLIGGALCGFSREGGAIPMNKNLWSLSFVCVTVS 487

Query: 363 AAALVFSAIYALVDI---WNLK-YPFLPLAWIGMNAMLVYV 399
            A L+ S +Y  +D+   W+   YPF      GMNA+++YV
Sbjct: 488 LALLILSLMYYFIDVRETWSWSGYPFTE---CGMNAIVMYV 525


>gi|126272886|ref|XP_001369919.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Monodelphis domestica]
          Length = 389

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 114/371 (30%), Positives = 177/371 (47%), Gaps = 50/371 (13%)

Query: 44  LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRA 99
           L++ LMI V++ GG +    HAPWNG  +AD VMP+F+FI+G ++ LA     ++   + 
Sbjct: 3   LSLTLMIFVNYGGGGYWFFEHAPWNGLTIADLVMPWFVFILGTSVGLAFHVMQRKGVKKF 62

Query: 100 DAVKKVIFRTLKLLFWGIL-LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              +KV +RT  L+  G L L  G    P   ++       RL GVLQR+  +Y +V+L+
Sbjct: 63  KLFRKVAWRTGVLIAIGALFLNYGPVDGPLSWSWA------RLPGVLQRLGFTYFIVALM 116

Query: 159 EIF--TKDVQDKDQSV--GRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINK 214
           +I     D+Q     V    F    LY   W++   +  ++L L +   VP      +  
Sbjct: 117 QIAFGVADMQKYQVGVWWAPFRDIVLYWQEWIIIIGLECIWLCLTFLLPVPGCPRGYLGP 176

Query: 215 DS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
               D GK FN T G          A  YID+ +LG NH+Y  P+ +     TQ      
Sbjct: 177 GGIGDEGKYFNCTGG----------AAAYIDKWILGENHLYRFPSCKELYKTTQ------ 220

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARLKQW-VTM 328
                       PF+PEG+L +++SI+    G+  G +I+  +      L R   W V +
Sbjct: 221 ------------PFDPEGILGTINSIIMAFFGLQAGKIILMYRSKPRSILKRFLIWSVLL 268

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLA 388
           G    I  L       IP+NK L++LS+V  TS  +  +   +Y ++D+    +   PL 
Sbjct: 269 GIISAILTLGTQNEGFIPVNKNLWSLSFVTTTSCFSFFLLGLLYYIIDM-KGWWSGCPLI 327

Query: 389 WIGMNAMLVYV 399
           + GMN++LVYV
Sbjct: 328 YPGMNSILVYV 338


>gi|149057830|gb|EDM09073.1| rCG43316 [Rattus norvegicus]
          Length = 626

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 115/395 (29%), Positives = 192/395 (48%), Gaps = 50/395 (12%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D   +  ++     RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+
Sbjct: 216 DCQPETRRASALPHRLRCVDTFRGVALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPW 275

Query: 80  FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F+FI+G +I L+    L+R   +   + K+ +R+  L+  GI++       P+     + 
Sbjct: 276 FVFIMGSSIFLSMTSLLQRGCSKIKLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLS 330

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDK---DQSVGRFSIFRLYCWHWLMAACV 191
              +R+ GVLQR+ ++Y +V+++E+F  K V D    ++S             WL+   +
Sbjct: 331 WDKVRIPGVLQRLGVTYFVVAMLELFFWKPVPDSCTLERSCLSLRDITSSWPQWLIILIL 390

Query: 192 LVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
             ++LAL +   VP      +      D GK  + T G          A GYIDR +LG 
Sbjct: 391 ESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRLLLGD 440

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           +H+Y HP    S A    +                 ++PEG+L +++SI+   +GV  G 
Sbjct: 441 SHLYQHP----SSAVLYHT--------------EVAYDPEGVLGTINSIVMAFLGVQAGK 482

Query: 311 VIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAA 364
           ++++    TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A
Sbjct: 483 ILLYYKDQTKAILIRFAAWCCI-LGLISIALTKMSANEGFIPINKNLWSISYVTTLSCFA 541

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
             +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 542 FSILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 575


>gi|354472121|ref|XP_003498289.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cricetulus griseus]
          Length = 782

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 117/393 (29%), Positives = 191/393 (48%), Gaps = 52/393 (13%)

Query: 24  QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           Q E  H      RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+
Sbjct: 374 QPETRHTSALPYRLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFV 433

Query: 82  FIVGVAIAL----ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G ++ L    AL R   +   + K+ +R+  L+  GI++       P+     +   
Sbjct: 434 FIMGSSVFLSMTSALHRGCSKFRLLGKITWRSFLLICIGIIVVN-----PNYCLGPLSWD 488

Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQ-SVGRFSIFRLYC-W-HWLMAACVLV 193
            +R+ GVLQR+ ++Y +V+++E IF+K V D+        S+  + C W  WL+   +  
Sbjct: 489 KVRIPGVLQRLGVTYFVVAVLELIFSKPVPDRCALERSYLSLRDITCSWPQWLVVLILES 548

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++LAL +   VP      +      D GK  + T G          A GYID  +LG NH
Sbjct: 549 IWLALTFFLPVPGCPTGYLGPGGIGDMGKYPHCTGG----------ASGYIDHLLLGDNH 598

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP+                            ++PEG+L +++SI+   +GV  G ++
Sbjct: 599 LYQHPS------------------STVLYHTQVAYDPEGILGTINSIVMAFLGVQAGKIL 640

Query: 313 IH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           ++    TK  L R   W  +   L+   LT    +   IP+NK L+++SYV   S  A  
Sbjct: 641 LYYKDQTKAILMRFTAWCCI-LGLISIALTKMSADEGFIPINKNLWSISYVSTLSCFAFF 699

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 700 ILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 731


>gi|195340719|ref|XP_002036960.1| GM12376 [Drosophila sechellia]
 gi|194131076|gb|EDW53119.1| GM12376 [Drosophila sechellia]
          Length = 576

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 116/401 (28%), Positives = 189/401 (47%), Gaps = 61/401 (15%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           +  D   +      + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V
Sbjct: 168 AAADSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIV 227

Query: 77  MPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            P FL+I+GV I L++K    R   +A    +++ R++KL   G+ L             
Sbjct: 228 FPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILVRSIKLFVIGLCLNS---------MS 278

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAA 189
           G ++  +R+ GVLQR  ++YL+V ++      ++     +S  R      L+     +  
Sbjct: 279 GPNLEQLRVMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLL 338

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRK 246
            ++  YL L +G  VP      +      DY              +P C   A GY D +
Sbjct: 339 ALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPKCIGGAAGYADLQ 386

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           VLG  H+Y HP  +     T                    F+PEG+   + S++  ++G 
Sbjct: 387 VLGNAHIYQHPTAKYVYDSTA-------------------FDPEGIFGCILSVVQVLLGA 427

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL-HFTN---AIPLNKQLYTLSYVCVTSG 362
             G  ++    + +R+++W  +   L + G  L  F++   AIP+NK L++LS+VCVT  
Sbjct: 428 FAGVTLLVHPNYQSRIRRWTLLAILLGLIGGALCGFSSEGGAIPVNKNLWSLSFVCVTVS 487

Query: 363 AAALVFSAIYALVDI---WNLK-YPFLPLAWIGMNAMLVYV 399
            A ++ S +Y  +D+   W+   YPF      GMNA+++YV
Sbjct: 488 LALVILSLMYYFIDVRETWSWSGYPFTE---CGMNAIVMYV 525


>gi|326427923|gb|EGD73493.1| heparan-alpha-glucosaminide N-acetyltransferase [Salpingoeca sp.
           ATCC 50818]
          Length = 788

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 116/437 (26%), Positives = 202/437 (46%), Gaps = 76/437 (17%)

Query: 4   IKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
           +  +     PL+ S   + + +        RL SLD FRG+A+ +MI V++ GGD+    
Sbjct: 336 LNTQKYTRDPLLSSTHAIGNPKRSK----TRLQSLDSFRGMALTIMIFVNYGGGDYNFFD 391

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILL 119
           H+ WNG  +AD V P+F++I+G ++A+    +  R   ++ +++    RTL L   G++ 
Sbjct: 392 HSVWNGLTVADLVFPWFIWIMGTSMAITFNSLFKRHTPLRTILYKVARRTLLLFGIGVIF 451

Query: 120 QGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF------------------ 161
                     +    D+R  R+ GVLQR A++YL+V+LV IF                  
Sbjct: 452 ----------INVVHDLRFARVPGVLQRFAIAYLVVALVIIFVPKAVSLLRNVDEVTPLI 501

Query: 162 ---TKDVQD--KDQSVGRFSIFR------LYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
              T  V++   D   G   + R       Y   W+    ++V++  + +   VP     
Sbjct: 502 RRLTPTVRNPASDLDPGGCGMLRHLPDVAPYVGEWIAIIVLVVIHTCITFLLPVPGCPTG 561

Query: 211 IINKDSA--DYGKVF--NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
            I    A  ++G+    N +C V         A G++DR +L   H+Y  P  + +    
Sbjct: 562 YIGPGGALAEFGQFAPANGSC-VNGTFCCEGGAAGHVDRWLLSWKHIYGSPTSQET---- 616

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                          +    ++PEG+L S++SIL   +G+  G +I+H K   AR  +W+
Sbjct: 617 ---------------YQTGAYDPEGILGSLTSILICYLGLQSGKIIVHYKAARARSVRWL 661

Query: 327 TMGF--ALLIFGLTLHFTN--AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
             G     +  GL     N   IP++K L++LS+V + SG   +  +A Y L+DIW + +
Sbjct: 662 AWGVLCCAIATGLCGGSKNDGVIPVSKNLWSLSFVLLMSGFGFISLTAFYWLIDIWRV-W 720

Query: 383 PFLPLAWIGMNAMLVYV 399
              P  ++G+N++ +YV
Sbjct: 721 DGAPFRYVGLNSIFIYV 737


>gi|326918494|ref|XP_003205523.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Meleagris gallopavo]
          Length = 532

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 113/385 (29%), Positives = 186/385 (48%), Gaps = 52/385 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +I+L+L
Sbjct: 131 QRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSL 190

Query: 93  KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +      +   + K+++R+  L   G+++       P+     +    +R+ GVLQR+
Sbjct: 191 SSMLRWGSSKQKVLGKILWRSFLLTLLGVIVVN-----PNYCLGALSWENLRIPGVLQRL 245

Query: 149 ALSYLLVSLVE-IFTKDVQ-------DKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
            L+YL+V+ +E +FT+ V         ++ S         +   W+    + V++L L +
Sbjct: 246 GLTYLVVAALELLFTRAVNISPSLHLMQEMSYPALQDVLPFWPQWIFILTLEVIWLCLTF 305

Query: 201 GTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
              VP      +      D+GK  N T G          A GYIDR VLG  H+Y HP+ 
Sbjct: 306 LLPVPGCPRGYLGPGGIGDFGKYANCTGG----------AAGYIDRLVLGEKHIYQHPSC 355

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH- 318
                 T                   P++PEG+L ++++IL   +G+  G +I+  K   
Sbjct: 356 NVLYQTT------------------VPYDPEGILGTINTILMAFLGLQAGKIILSYKDQH 397

Query: 319 ---LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
              ++R   W + MG    I          IP+NK L++ SYV   S  A ++   +Y L
Sbjct: 398 KQIMSRFLIWSLVMGIISAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYYL 457

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYV 399
           VD+  L +   P  + GMN++LVY+
Sbjct: 458 VDVKRL-WSGTPFFYPGMNSILVYI 481


>gi|432845830|ref|XP_004065874.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oryzias latipes]
          Length = 622

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 115/383 (30%), Positives = 190/383 (49%), Gaps = 52/383 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD FRG+A+ +M+ V++ GG +    H  WNG  +AD V P+F+F++G +IAL++
Sbjct: 225 KRLRSLDTFRGIALVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFVFVMGTSIALSI 284

Query: 93  KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +      R   ++K+++R+++L   G+     F   P+    G+    +R+ GVLQR+
Sbjct: 285 NSLLRAGLTRGSLLRKIVWRSIQLFLIGV-----FIINPNYCQGGLSWENLRIPGVLQRL 339

Query: 149 ALSYLLVSLVEIFTK----DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           A SYL+V+ +++       DV   D     F    LY   W++   + V++L+L +   V
Sbjct: 340 AFSYLVVASLDLMVARGHLDVLQTDAWWSPFLDVLLYWPAWVVVLLLEVLWLSLTFLLPV 399

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           PD     +      D G   N T G          A G++DR +LG  H+Y  P+  R  
Sbjct: 400 PDCPTGYLGPGGIGDMGLYANCTGG----------AAGFLDRWLLGEKHIYQTPS-SRVL 448

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK----GHL 319
             TQ                  P++PEG+L S++S+L   +G+  G +I+H +      +
Sbjct: 449 YLTQ-----------------IPYDPEGVLGSINSVLMAFLGLQAGKIILHYRDLPTSIM 491

Query: 320 ARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI- 377
           +R   W + MG    +          IP+NK L++LSYV   +  A ++   +Y  VD+ 
Sbjct: 492 SRFLIWGLFMGVISAVLTRCSTDQGFIPVNKNLWSLSYVTTLACFAFVLLVLVYYTVDVK 551

Query: 378 -WNLKYPFLPLAWIGMNAMLVYV 399
            W    PF    + GMN++LVYV
Sbjct: 552 KWWSGSPFY---YPGMNSILVYV 571


>gi|350412149|ref|XP_003489557.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus impatiens]
          Length = 571

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 193/383 (50%), Gaps = 55/383 (14%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + R+ S+D FRG+A+ LMI V++ GG +   +H+ W G ++AD ++P+F +I+G++I ++
Sbjct: 179 SSRIQSVDAFRGIAILLMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS 238

Query: 92  ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
               L+    R       + R+  L+  G++L    S + ++L         R  G+LQ 
Sbjct: 239 KRAELRLTTSRVKITLCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGILQL 289

Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGT 202
           +A+SY + +++E IF K   QD     GRF+IFR  L  W  WL+ A ++  +  + +  
Sbjct: 290 LAVSYFVCAILETIFMKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTTHTLITFFL 349

Query: 203 YVPDWQFTIINKDSADY--GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           ++P+            +  GK  N T G          A GYIDR + G NH Y      
Sbjct: 350 HMPNCPTGYFGPGGKYHYRGKYMNCTAG----------AAGYIDRLIFG-NHTYSK---- 394

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                 +DS +   LR D          PEGL++++S+I    +GVH G +++      A
Sbjct: 395 -----IKDSIYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQGNA 439

Query: 321 RLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           RL +W     F  +I G+  +F N    IP++K++ TLSYV   S  A L+++ +Y L+D
Sbjct: 440 RLIRWFLWAIFTGIIAGILCNFENEGGVIPVSKRMMTLSYVLTCSSFAFLLYAILYFLID 499

Query: 377 IWNLKYPFLPLAWIGMNAMLVYV 399
            +   +   P  + G+N + +YV
Sbjct: 500 -YKQFWSGAPFIYAGINPIFLYV 521


>gi|322790964|gb|EFZ15612.1| hypothetical protein SINV_04659 [Solenopsis invicta]
          Length = 581

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 117/380 (30%), Positives = 195/380 (51%), Gaps = 53/380 (13%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + E + S    ++   + R+ S+D FRG+A+ LMI VD+ GG +   +H+ WNG  +AD 
Sbjct: 154 LQEAETSTPIVRTSRSSTRIRSIDTFRGIALLLMIFVDNGGGKYVFFNHSAWNGLTVADL 213

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           V+P+F +I+G++I ++ +     +++  K+IFR L+     +LL    +    E      
Sbjct: 214 VLPWFAWIMGLSITISKRSELRVSNSRMKIIFRCLQRALVLVLLGLMLNSMSME-----S 268

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAAC 190
           ++ +R  GVLQ +A+SY + + +E IF K   QD     GRFSI R  L  W  WL+   
Sbjct: 269 LKHLRFPGVLQLLAVSYFVCATIETIFMKAHSQDDVLQFGRFSILRDILNNWAQWLIILA 328

Query: 191 VLVVYLALLYGTYVPDWQFTIINK--DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
           ++V ++ + +   VP+     +    + + YGK  N T G          A GYIDR V 
Sbjct: 329 IMVTHILITFLLPVPNCPTGYLGPGGNYSRYGKFPNCTGG----------AAGYIDRLVF 378

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +H+Y           TQ+ P  G +          P +PEG+++++S IL   +GVH 
Sbjct: 379 G-SHVYSK---------TQN-PVYGTI---------LPHDPEGIMNTMSIILVVYMGVHA 418

Query: 309 GHVII---HTKGHLARLKQWVTMGFALLIFGLTLHF---TNAIPLNKQLYTLSYVCVTSG 362
           G +++      G + R   W ++    LI GL  HF   +  IP++K++ +LS+V   S 
Sbjct: 419 GKILLLYYQCNGRVIRWLLWSSV--TGLIAGLLCHFDKESGVIPVSKKMMSLSFVLTVSC 476

Query: 363 AAALVFSAIYALVD---IWN 379
            A L+++ ++ LVD    WN
Sbjct: 477 FAFLLYAILHFLVDYKQCWN 496


>gi|313242995|emb|CBY39713.1| unnamed protein product [Oikopleura dioica]
          Length = 597

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 114/409 (27%), Positives = 192/409 (46%), Gaps = 70/409 (17%)

Query: 20  DVSDQQEKSHLKTQ-----------RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
           D +D+QE   ++             R  SLD  RGL++ +MI V++ GG++  + H  WN
Sbjct: 182 DQTDKQEDQEVQEDEPAPPAPAKKKRYKSLDTLRGLSLIIMIFVNYGGGEYWFMEHVAWN 241

Query: 69  GCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFS 124
           G  +AD VMP+FLF+ GV+I +AL    KR   + +   +++ R++KL+  G++  GG  
Sbjct: 242 GLTVADLVMPWFLFMSGVSIRIALQSRIKRGISKTEISYEILVRSVKLIGLGMITIGG-- 299

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFSIFRLYC 182
                          R  GVLQRI  SY +V+++ +   +  DK+     G F       
Sbjct: 300 --------NESWEYFRFPGVLQRIGFSYFVVAIIHLLVIEHPDKEPETNWGLFKEMSFNF 351

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVG 241
              L++  +L  ++ L Y   +P            ++ G+ ++   G          A G
Sbjct: 352 KEHLISWSILGAFICLTYLLPIPGCPTGYTGPGGLSENGEHYHCIGG----------AAG 401

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           YIDRK+LG  H+Y+ P          D P               PF+PEGLL +++SI  
Sbjct: 402 YIDRKLLGEKHIYNWP------TAYHDEP------------NGVPFDPEGLLGTLTSIFM 443

Query: 302 TIIGVHFGH----------VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQL 351
             +G+  G           +I+H  G LA +     M  A + FG        IP+NK L
Sbjct: 444 VYLGLQAGKCFDIFKTPKSIILHLLG-LAAVYGISGMLLATIGFGKIAE--AIIPINKNL 500

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           +++S+V V S  A  + + +Y ++D+ +  +   P  ++GMN++L+Y++
Sbjct: 501 WSVSFVFVLSSMAFTLLAFLYFIIDVRDW-WDGAPCYFVGMNSILIYLL 548


>gi|194888520|ref|XP_001976930.1| GG18736 [Drosophila erecta]
 gi|190648579|gb|EDV45857.1| GG18736 [Drosophila erecta]
          Length = 576

 Score =  141 bits (355), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 115/401 (28%), Positives = 187/401 (46%), Gaps = 61/401 (15%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           +  D   +      + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V
Sbjct: 168 AAADSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVV 227

Query: 77  MPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            P FL+I+GV I L++K    R   +A    ++++R++KL   G+ L             
Sbjct: 228 FPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MS 278

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAA 189
           G ++  +R  GVLQR  ++YL+V ++      ++     +S  R      L+     +  
Sbjct: 279 GPNLEQLRFMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLL 338

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRK 246
            ++  YL L +G  VP      +      DY              +P C   A GY D +
Sbjct: 339 ALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPHCIGGAAGYADLQ 386

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           VLG  H+Y HP  +     T                    F+PEG+   + S++  ++G 
Sbjct: 387 VLGNAHIYQHPTAKYVYDST-------------------AFDPEGVFGCILSVVQALLGA 427

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSG 362
             G  ++      +R+++W+ +   L + G  L        AIP+NK L++LS+VCVT  
Sbjct: 428 FAGVTLLVHPNWQSRMRRWMLLAILLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVTVS 487

Query: 363 AAALVFSAIYALVDI---WNLK-YPFLPLAWIGMNAMLVYV 399
            A ++ S +Y ++D+   W+   YPF      GMNA+++YV
Sbjct: 488 LALVILSLLYYIIDVRETWSWSGYPFTE---CGMNAIVMYV 525


>gi|260816362|ref|XP_002602940.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
 gi|229288254|gb|EEN58952.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
          Length = 512

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 121/394 (30%), Positives = 187/394 (47%), Gaps = 55/394 (13%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           S  Q  S    +RL SLD FRGL++A+M+ V++ GG +    HA WNG  +AD V P+F+
Sbjct: 109 SSTQPASQ-GIRRLRSLDTFRGLSLAVMVFVNYGGGGYWFFKHARWNGLTVADLVFPWFV 167

Query: 82  FIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G +IAL+ +R+      R   + KVI RT+ L   G+ +           T      
Sbjct: 168 FIMGTSIALSFRRLLKKGVSRLSLLWKVIQRTVILFLLGLFIINTKKGHNSWST------ 221

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKD--VQDKDQSVGRFSIFR--LYCW-HWLMAACVL 192
            +R+ GVLQR+AL+Y +V+L+E +     +        R +  R  +  W  WL    V+
Sbjct: 222 -LRIPGVLQRLALTYFIVALMESWKPRGYLSLYLLQTSRIAPIRDIVNSWGQWLFMIVVV 280

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
            ++L L++   VP+     +          +N T G          A GYIDR V   +H
Sbjct: 281 TLHLVLMFWLQVPNCPIGYLGPGGLSDIAHYNCTGG----------AAGYIDRAVFTDDH 330

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP        T  + +E             PFEPEGLL +++S L   +G+   ++ 
Sbjct: 331 IYQHP--------TPITVYE----------TEVPFEPEGLLGTLTSALLCFLGLQVKNMY 372

Query: 313 IH-TKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           ++    +     +   + F L   G  L         IPLNK L++LS+V V S  A ++
Sbjct: 373 MYIIMNYCVFTNRSPAICFCLGAIGGLLCNGRQNEGWIPLNKNLWSLSFVLVLSCFAFVL 432

Query: 368 FSAIYALVDI--WNLKYPFLPLAWIGMNAMLVYV 399
            S  Y +VD+  W    PF     +GMN++LVY+
Sbjct: 433 LSVCYIIVDVKQWWTGAPFYQ---VGMNSILVYI 463


>gi|195399031|ref|XP_002058124.1| GJ15666 [Drosophila virilis]
 gi|194150548|gb|EDW66232.1| GJ15666 [Drosophila virilis]
          Length = 572

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 127/394 (32%), Positives = 194/394 (49%), Gaps = 58/394 (14%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           + D  EK+  + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P F
Sbjct: 171 IGDAAEKA-TQRKRLRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADLVFPSF 229

Query: 81  LFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           L+I+GV I L++K    R   ++    ++++R+ KL   G+ L           T G  +
Sbjct: 230 LWIMGVCIPLSIKSQLGRGISKSRICGRIVWRSCKLFAIGLCLNS---------TNGPQL 280

Query: 137 RMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVLV 193
             +RL GVLQR  +++L+V L+  + ++  Q   Q   + +I+   L+     +   ++ 
Sbjct: 281 EQLRLMGVLQRFGIAFLVVGLLHTVCSRRDQLSPQRAWQRAIYDICLFSGELAVLLALIA 340

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGIN 251
            YL L +G  VP      +       GK  N         NP C   A GYIDR+VLG  
Sbjct: 341 AYLGLTFGLPVPGCPRGYLGPG----GKHNNAA-------NPNCIGGAAGYIDRQVLGNA 389

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+Y HP  +     T                    F+PEG+   + S++ T++G   G  
Sbjct: 390 HIYQHPTAKYVYDATA-------------------FDPEGIFGCLLSVVQTLLGAFAGVT 430

Query: 312 IIHTKGHLARLKQW-VTMGFALLIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALV 367
           ++      ARLK+W +      L+ G    FT     IP+NK L++LS+V VT   A ++
Sbjct: 431 LLVHATWQARLKRWLLGATLLGLLGGALCGFTREQGVIPVNKNLWSLSFVFVTVALALVL 490

Query: 368 FSAIYALVDIWNL--KYPFLPLAWIGMNAMLVYV 399
            S +Y +VD+  L   YPF      GMNA+++YV
Sbjct: 491 LSLLYYVVDVRQLWSGYPFTE---CGMNAIIMYV 521


>gi|428306334|ref|YP_007143159.1| heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
           epipsammum PCC 9333]
 gi|428247869|gb|AFZ13649.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
           epipsammum PCC 9333]
          Length = 371

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 111/375 (29%), Positives = 177/375 (47%), Gaps = 81/375 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV+ AG  D  +P ++HA WNG   AD V PFFLFI+GVA+A
Sbjct: 1   MRLTSLDVFRGMAIAGMILVNKAGVADQVYPALAHADWNGWTFADLVFPFFLFIIGVAMA 60

Query: 90  LALKRIPDRADAVKKVIF-----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            +  +  +  +   K ++     R+  L   G+LL G +++         D   IR+ GV
Sbjct: 61  FSFAKYTEGDNKPTKQLYLRILRRSAILFILGLLLNGFWNY---------DFSTIRVMGV 111

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRI+++YLL SL  +    +  K Q              W +AA +L+ Y  ++    V
Sbjct: 112 LQRISVAYLLASLAVL---TLPKKGQ--------------WALAAVLLIGYWLIMSFVPV 154

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           P +   ++ ++                      N   YIDR ++G  H+Y    +     
Sbjct: 155 PGYGAGVLTREG---------------------NFGAYIDRLIIGAAHLYKGDNY----- 188

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                              ++  +PEGL SS+ +++S +IG   G  +            
Sbjct: 189 -------------------NSLGDPEGLFSSLPAVVSVLIGYFTGEWLRKQPERSRTSIN 229

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
            +  G + L+ G   +F    P+NK+L+T SYV  T+G A ++ +A Y L+D+   +   
Sbjct: 230 MLIAGLSCLVVGEVWNFW--FPINKKLWTSSYVLFTAGIALILLAACYELIDVRKRREWG 287

Query: 385 LPLAWIGMNAMLVYV 399
            P   +GMNA+LV+V
Sbjct: 288 RPFEILGMNAILVFV 302


>gi|307178500|gb|EFN67189.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
           floridanus]
          Length = 466

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 201/398 (50%), Gaps = 54/398 (13%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + E + S+    ++  + R+ S+D FRG+A+ LMI V++ GG++   +H+ WNG  +AD 
Sbjct: 59  LQEAETSNPIIGTNRSSTRIRSVDTFRGIAILLMIFVNNRGGEYVFFNHSAWNGLTVADL 118

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELT 131
           V+P+F +I+G++I ++ +     +++  K+I R L+    L+  G++L    S++   L 
Sbjct: 119 VLPWFAWIMGLSITISKRSELRVSNSRTKIILRCLQRAFILILLGLMLNSIRSNSLQNL- 177

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWL 186
                   R  GVLQ +A+SY + + +E IF +   QD     GRF+  R  L  W  WL
Sbjct: 178 --------RFPGVLQLLAVSYFVCATIETIFMRMHSQDDLLQFGRFTFLRDILNNWAQWL 229

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           +   ++V +  + +   VP+     +     + +G   N T G          A GYIDR
Sbjct: 230 IILAIVVTHTLITFLLPVPNCPTGYLGPGGYSHFGNFPNCTGG----------AAGYIDR 279

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            V G +HMY+             +P  G +          P +PEG++++VS IL   +G
Sbjct: 280 LVFG-SHMYNK----------TKNPVYGTI---------LPHDPEGIMNTVSIILVVYLG 319

Query: 306 VHFGHVIIHTKGHLARLKQWVT-MGFALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTS 361
           VH G +++      AR+ +W+   G   +I GL  +F      IP++K++ +LS+V   S
Sbjct: 320 VHAGKILLLYYQCNARVVRWLLWSGVTGIIAGLLCNFDKEGGVIPVSKKMMSLSFVLTVS 379

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
             A L+F+ ++  VD +   +   P  + G+N + +Y+
Sbjct: 380 CFAFLLFTILHFFVD-YKQYWSGAPFIYAGLNPVTLYI 416


>gi|380028317|ref|XP_003697852.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis florea]
          Length = 555

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/443 (26%), Positives = 195/443 (44%), Gaps = 70/443 (15%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D+        +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P F++
Sbjct: 166 DETAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPCFIW 225

Query: 83  IVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           I+GV I +A+    KR+  R   +  ++ R++ +   G+ L        + ++ G  +  
Sbjct: 226 IMGVCIPIAMASQMKRMLPRHVILYGIVKRSILMFLIGLSL--------NTVSTGPQLET 277

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           IR+ GVLQR  ++YL+V+L+       + K   V     F L    W +   ++ V+  +
Sbjct: 278 IRVFGVLQRFGITYLIVALIYFCLMARKPKKTQV--MQDFLLLLPQWCVMLVIVAVHCVI 335

Query: 199 LYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
            +   VP      +      D  K F+   G          A GYIDR +L   H++H  
Sbjct: 336 TFCLKVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKEPHLHHSA 385

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
              +S                       P++PEG+L ++++     +G+H G +++  K 
Sbjct: 386 TVYKS----------------------GPYDPEGILGTLTTTFQVFLGLHAGIIMMTYKD 423

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD- 376
              R+ +W+         G  LHF+N IP+NK+L++LS+V VT+  +    SA Y LVD 
Sbjct: 424 WKERVIRWLAWAAFFSCIGCILHFSNIIPVNKKLWSLSFVFVTTSFSLAFLSACYLLVDV 483

Query: 377 --IWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFL-------- 426
             +WN      P    GMN +L+YV      +  F   W  G+  N  +           
Sbjct: 484 IKVWNGG----PFRIPGMNGLLLYV-GHMVCYQNFPFHWSIGNMDNRALRLCEAIWGPGL 538

Query: 427 -FIISYILHSFLWELRKFLYVQF 448
             II+YI+H      RK +Y+  
Sbjct: 539 WTIIAYIMH------RKRIYITL 555


>gi|126304129|ref|XP_001381943.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Monodelphis domestica]
          Length = 638

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 115/383 (30%), Positives = 187/383 (48%), Gaps = 52/383 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL SLD FRG+++ +MI V++ GG +    H  WNG  +AD V P+F+FI+G +IAL+L
Sbjct: 241 HRLRSLDTFRGISLIIMIFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGSSIALSL 300

Query: 93  KRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +  R       + K+++R+  L   G+L+       P+     +    +R+ GVLQR+
Sbjct: 301 SSMLRRGCSKWKLLGKILWRSFLLCVIGVLIMN-----PNYCLGPLSWDKLRIPGVLQRL 355

Query: 149 ALSYLLVSLVEI-FTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
            L+YL+V+++E+ F K V +    +     F     Y   W+    +  V++ + +   V
Sbjct: 356 GLTYLVVAVLELLFAKAVPENSTMESLCASFQDIISYWPQWIFILMLEAVWVCVTFLLPV 415

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D+GK  N T G          A GYIDR +LG +H+Y HP+     
Sbjct: 416 PGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS----- 460

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----L 319
                         +        ++PEGLL +++SI+   +GV  G +++  K      +
Sbjct: 461 -------------PNVLYHTKVAYDPEGLLGTINSIVMAFLGVQAGKILLFYKDQHKQIM 507

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            R   W  M    +I G+   F+     IP+NK L+++SYV   S  A L+   +Y LVD
Sbjct: 508 LRFLLWSAM--LAIISGVLTKFSQNEGFIPVNKNLWSISYVTTLSFFAFLLLLFMYFLVD 565

Query: 377 IWNLKYPFLPLAWIGMNAMLVYV 399
           +  L +   P  + GMN++LVYV
Sbjct: 566 VARL-WSGAPFFYPGMNSILVYV 587


>gi|340727662|ref|XP_003402158.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus terrestris]
          Length = 554

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 107/389 (27%), Positives = 180/389 (46%), Gaps = 55/389 (14%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           +D         +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P F+
Sbjct: 159 ADDGAMKQPAKRRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLFPCFI 218

Query: 82  FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           +I+GV I +A+    KR+  +   +  ++ R++ L   G+ L        + ++ G  + 
Sbjct: 219 WIMGVCIPIAMSSQMKRMTLKHQILYGIVKRSILLFLIGLSL--------NTVSTGGQLE 270

Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQS--VGRFSIFRLYCWHWLMAACVLVV 194
            IR+ GVLQR  ++YL+V+L+  +       K QS  +     F L    W +   ++VV
Sbjct: 271 TIRIFGVLQRFGITYLVVALLYFLLMSRRPSKIQSPMLREVQDFLLLLPQWCVMLVIVVV 330

Query: 195 YLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           + A+ +   VP      +      D  K F+   G          A GYIDR +L   H+
Sbjct: 331 HCAITFCLNVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKEAHL 380

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           ++     +S                       P++PEG+L ++++     +G+H G +++
Sbjct: 381 HYSATVYKS----------------------GPYDPEGILGTLTTAFQVFLGLHAGIIMM 418

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             K    R+ +W+         G  LHFTN IP+NK+L++LS+V VT+  +    SA Y 
Sbjct: 419 TYKDWKERVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTTSFSLAFLSACYL 478

Query: 374 LVD---IWNLKYPFLPLAWIGMNAMLVYV 399
           LVD   +WN      P    GMN +L+YV
Sbjct: 479 LVDVVKVWNGG----PFRIPGMNGLLLYV 503


>gi|383849627|ref|XP_003700446.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Megachile rotundata]
          Length = 552

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 111/406 (27%), Positives = 184/406 (45%), Gaps = 57/406 (14%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D   +  +K +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P F++
Sbjct: 159 DDTARQPVK-RRVKAIDTVRGASTLLMIFVNDGSGGYKTLGHATWNGLLPGDLLFPCFIW 217

Query: 83  IVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           I+GV I +AL    KR+  +   +  ++ R++ L   G+ L        + +  G  +  
Sbjct: 218 IMGVCIPIALGSQLKRMVPKHVILYGILKRSVLLFLIGVSL--------NTVGTGPQLES 269

Query: 139 IRLCGVLQRIALSYLLVSLVEIF---TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
           IR+ GVLQR  ++Y +V+++ +F    +  + +   +     F L    W +   ++  +
Sbjct: 270 IRIFGVLQRFGVTYFIVAVIYLFLISKRPTKVQSPMLRDVQDFLLLLPQWTVMLAIVAAH 329

Query: 196 LALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
             + +   VP      +      D  K F+   G          A GYID+ VL   H++
Sbjct: 330 CIITFCLPVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDKVVLKEQHLH 379

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
           H     +S                      APF+PEG+L  ++S     +G+H G +++ 
Sbjct: 380 HSMTVYKS----------------------APFDPEGILGCLTSTFHVFLGLHAGIIMMT 417

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
            K    R+ +W+         G  LHFTN IP+NK+L++LS+V VT+  +    SA Y L
Sbjct: 418 YKDWKERVIRWLAWAAFFSCIGCALHFTNVIPVNKKLWSLSFVFVTTSFSLAFLSACYLL 477

Query: 375 VD---IWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGD 417
           VD   +WN      P    GMNA+L+YV      +  F   W  GD
Sbjct: 478 VDVVKVWNGG----PFRIPGMNALLLYV-GHSVCYRNFPFHWSIGD 518


>gi|350423601|ref|XP_003493532.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus impatiens]
          Length = 565

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 108/393 (27%), Positives = 181/393 (46%), Gaps = 56/393 (14%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           +  V D   K   K +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + 
Sbjct: 167 KSQVDDGAMKQPAK-RRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLF 225

Query: 78  PFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P F++I+GV I +A+    KR+  +   +  ++ R++ L   G+ L        + ++ G
Sbjct: 226 PCFIWIMGVCIPIAMSSQMKRMTPKRQILYGIVKRSILLFLIGLSL--------NTVSTG 277

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD-QS--VGRFSIFRLYCWHWLMAAC 190
             +  IR+ GVLQR  ++Y +V+L+       + +  QS  +     F L    W +   
Sbjct: 278 GQLETIRIFGVLQRFGITYFVVALLYFLLMSRRPRKIQSPMLREVQDFLLLLPQWCVMLV 337

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           ++VV+  + +   VP      +      D  K F+   G          A GYIDR +L 
Sbjct: 338 IVVVHCVITFCLNVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRVILK 387

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H++H     +S                       P++PEG+L ++++     +G+H G
Sbjct: 388 EAHLHHSATVYKS----------------------GPYDPEGILGTLTAAFQVFLGLHAG 425

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
            +++  K    R+ +W+         G  LHFTN IP+NK+L++LS+V VT+  +    S
Sbjct: 426 IIMMTYKDWKERVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTTSFSLAFLS 485

Query: 370 AIYALVD---IWNLKYPFLPLAWIGMNAMLVYV 399
           A Y LVD   +WN      P    GMN +L+YV
Sbjct: 486 ACYLLVDVVKVWNGG----PFRIPGMNGLLLYV 514


>gi|328780782|ref|XP_396570.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis mellifera]
          Length = 569

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 178/391 (45%), Gaps = 55/391 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D         +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P 
Sbjct: 172 QLDDTTAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPC 231

Query: 80  FLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F++I+GV I +A+    KR+  +      ++ R++ +   G+ L        + ++ G  
Sbjct: 232 FIWIMGVCIPIAMAGQMKRMLPKHMIFYGIVKRSILMFLIGLSL--------NTVSTGPQ 283

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEI---FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           +  IR+ GVLQR  ++Y +V+L+ +     K  + +   +     F L    W +   ++
Sbjct: 284 LETIRIFGVLQRFGITYFIVALIYLCLMTRKPKKTQSPMLKEVQDFLLLLPQWCVMLVIV 343

Query: 193 VVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
            V+  + +   VP      +      D  K F+   G          A GYIDR +L  +
Sbjct: 344 AVHCFITFCLKVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKES 393

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H++H     +S                       P++PEG+L ++++     +G+H G +
Sbjct: 394 HLHHSATVYKS----------------------GPYDPEGILGTLTTTFQVFLGLHAGII 431

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    R+ +W+T        G  LHFTN IP+NK+L++LS+V VT+  +    SA 
Sbjct: 432 MMTYKDWKERVIRWLTWAAFFSCIGCILHFTNIIPVNKKLWSLSFVFVTTSFSLAFLSAC 491

Query: 372 YALVD---IWNLKYPFLPLAWIGMNAMLVYV 399
           Y LVD   +WN      P    GMN +L+YV
Sbjct: 492 YLLVDVIKVWNGG----PFRIPGMNGLLLYV 518


>gi|320168011|gb|EFW44910.1| hgsnat protein [Capsaspora owczarzaki ATCC 30864]
          Length = 800

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 117/420 (27%), Positives = 200/420 (47%), Gaps = 57/420 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           R+ SLD FRG+A+++M+ V++ GG +    H+ WNG  +AD V P+F+F++GV+++L+ +
Sbjct: 419 RVRSLDTFRGIALSIMLFVNYGGGGYWFFDHSTWNGLTVADLVFPWFIFMMGVSMSLSFE 478

Query: 94  RIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           ++  R         KVI R++ L   G+ L          +   +     R+ GVLQR A
Sbjct: 479 KLRRRGAPRGALFLKVIRRSMTLFALGLFL----------VCRQIIFATWRMPGVLQRFA 528

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           +SYL V+ + +F           G F     +   W++    + ++  + +   VP    
Sbjct: 529 VSYLFVAAIVMFVPIFATLP---GPFRDLTSHWLQWVVIGIFITIHTCITFLYDVPGCGT 585

Query: 210 TIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
             I      D+G+  N T G          A GYID +V G  H+Y  P  +        
Sbjct: 586 GYIGPGGIGDFGQYMNCTGG----------AAGYIDSQVFG-RHIYQAPTAQ-------- 626

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                        +    ++PEGLL  ++S++ T +G   G +++    H ARL++W   
Sbjct: 627 -----------AYYLTGAYDPEGLLGCLTSVVITFLGYQAGRILVTFSTHSARLRRWAAW 675

Query: 329 GFALLIFGLTLHFTNA----IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           G  L + GL L         IP+NK L++LS+V + +G   L+ ++ Y +VD+  L +  
Sbjct: 676 GVGLGLLGLILCKGTQNDGWIPINKNLWSLSFVLIMAGLGYLMLASCYIIVDVRKL-WDG 734

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFIISYILHSFLWELRKFL 444
            P  + GMN++ VY M +E +F  +    Y  +  NT    LF+  ++  + LW L  +L
Sbjct: 735 APFIYPGMNSIFVY-MGSE-LFFYYFPFTYANEVENTHGALLFM--HLTATALWILIAYL 790


>gi|328790778|ref|XP_623715.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis mellifera]
          Length = 572

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 117/402 (29%), Positives = 198/402 (49%), Gaps = 77/402 (19%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + R+ S+D FRG+A+ LMI V++ GG +   +H+ W G ++AD V+P+F +I+G+ I ++
Sbjct: 180 STRIHSVDTFRGIAILLMIFVNNGGGKYIFFNHSAWFGLSIADLVLPWFAWIMGLMITVS 239

Query: 92  ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
               L+    R       + R+  L+F G++L    S +  +L         R  GVLQ 
Sbjct: 240 KRTELRLTTSRIKITLYCLRRSAILIFLGLMLNSKDSESLHDL---------RFPGVLQL 290

Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLY-- 200
           + +SY + +++E IF K   QD     GRF++FR  L  W  WL+ A ++  +  + +  
Sbjct: 291 LGVSYFVCAILETIFMKPHSQDILHQFGRFAMFRDILESWPQWLIMAGIVTTHTLITFLL 350

Query: 201 -------GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                  G + P  ++          GK  N T G          A GYIDR + G NH 
Sbjct: 351 PISNCPKGYFGPGGEYHF-------RGKYINCTAG----------AAGYIDRLIFG-NHT 392

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y+H         T++  +   LR D          PEGL++++S+I    +GVH G +++
Sbjct: 393 YNH---------TENFLYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILL 433

Query: 314 HTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFS 369
                 +R+ +W     F  +I G+  +F      IP++K++ TLSYV + S  A L+++
Sbjct: 434 LYYQCNSRVIRWFLWTVFTGIIAGILCNFETQGGIIPISKRMMTLSYVLICSSFAFLLYA 493

Query: 370 AIYALVD---IWNLKYPFLPLAWIGMNAMLVYV--MAAEGIF 406
            +Y L+D    WN      P  + G+N + +YV  +  +G+F
Sbjct: 494 LLYVLIDYKQFWNGA----PFVYAGINPIFLYVGHILTKGLF 531


>gi|281209034|gb|EFA83209.1| hypothetical protein PPL_03999 [Polysphondylium pallidum PN500]
          Length = 1154

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 182/407 (44%), Gaps = 75/407 (18%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           E +  K  R+ SLD+FRGL++ +MI V++ GG +   +H+ WNG  +AD V P+F+FI+G
Sbjct: 217 ESNDPKKDRMKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFVFIMG 276

Query: 86  VAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
            A+ ++   +  R    K    K++ R++ L   G+ L             G D++  R+
Sbjct: 277 CAMPMSFNALESRGVPKKTIVIKLVRRSITLFALGMFLNN-----------GNDLQHWRI 325

Query: 142 CGVLQRIALSYLLVSLVEIFTK---------------------DVQDKDQS--VGRFSIF 178
            GVLQR  +SYL+  L+ +F                        +QD+ +S     F+  
Sbjct: 326 LGVLQRFGISYLVTGLIMMFVPVWRYRQLDDLSEEQQPLYGGGSIQDRIRSRYPRMFADI 385

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPC 237
             Y   W++A  +L V+  + +   VP      I        G+  N T G         
Sbjct: 386 LPYWIQWVVALMLLSVWFLVTFLLPVPGCPTGYIGPGGIGSQGQYANCTGG--------- 436

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
            A  Y+D K+ G NH+Y  P  +                     +    ++PEG L  ++
Sbjct: 437 -AARYVDLKIFGENHIYQTPTCQT-------------------IYNTGSYDPEGTLGYIT 476

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTLHFTNAIPLNKQLY 352
           SI    +GV  G  I+  K    RL +W     V  G A  + G++ +    +P+NK L+
Sbjct: 477 SIFMCFLGVQCGRTILAFKKASCRLIRWSIWGVVLCGIAAGLCGMSQN-NGWLPINKNLW 535

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           T S+V + SG    V S +Y  +D+  L +   P  ++GMN + +Y+
Sbjct: 536 TPSFVLLLSGFGFFVLSFMYIFIDLKKL-WNGAPFIYVGMNPITIYM 581


>gi|340371415|ref|XP_003384241.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Amphimedon queenslandica]
          Length = 743

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 126/438 (28%), Positives = 207/438 (47%), Gaps = 64/438 (14%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           +  D+ ++   S  K +RL SLD FRG+++ +MI V++ GG +   +H+ WNG  +AD V
Sbjct: 342 ATTDLLNEDPLSTRKKERLRSLDTFRGMSLIIMIFVNYGGGGYWFFNHSIWNGITVADLV 401

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
            P+F++I+GV+I  + K    + D+ K  +++ ++                  L  G  +
Sbjct: 402 FPWFVWIMGVSIVYSFK--GRKKDSFKLRLYQVVRRSVI-------LLGLGLFLNNGYRL 452

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLV 193
              R+ GVLQR A++Y +V++ E+    V +K +        R   + W  WL+   +  
Sbjct: 453 SHWRIPGVLQRFAIAYFVVAMTELLAPMVYNKYKLKWDVISVRDLTHNWVQWLVIVFLES 512

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++L + +    P      +     AD GK  N T G+           GYID  +L  NH
Sbjct: 513 LWLIITFSLKAPGCPRGYLGPGGRADGGKYSNCTGGI----------AGYIDSWILTDNH 562

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP  +                     +    ++PEG+L S++SI+    GV  G ++
Sbjct: 563 IYGHPTCK-------------------AIYHTGSYDPEGILGSINSIVMCFFGVQAGRIL 603

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNA------IPLNKQLYTLSYVCVTSGAAAL 366
           IH K   +R+ ++V  G  LL+ GL      A      IPLNK L++LS++ V +G   +
Sbjct: 604 IHHKQFGSRIVRFVVWG--LLMGGLGTILCEATLNKGVIPLNKNLWSLSFILVIAGLGYI 661

Query: 367 VFSAIYALVD---IWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWY-YGDPHNTL 422
           + +  Y ++D   IWN    F P    GMN++LVYV  +E +   F  GW    D H   
Sbjct: 662 LLALFYFIIDVIKIWNGAPFFYP----GMNSILVYV-GSELLEGTFPFGWKGMEDSH--- 713

Query: 423 VCFLFIISYILHSFLWEL 440
               F+IS ++   LW L
Sbjct: 714 --IEFLISNVISVSLWCL 729


>gi|345481194|ref|XP_001603332.2| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Nasonia vitripennis]
          Length = 570

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 113/407 (27%), Positives = 193/407 (47%), Gaps = 53/407 (13%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
           K   +H+    + E + + +         RL +LD FRG+AV LMI V++ GG++  ++H
Sbjct: 154 KHAESHNDIDRLQESESTPEMVAVSKTAMRLQALDAFRGIAVLLMIFVNNGGGEYVFLNH 213

Query: 65  APWNGCNLADFVMPFFLFIVGVAIALALK---RIP-DRADAVKKVIFRTLKLLFWGILLQ 120
           A WNG  +AD V+P+F + +G  I  +++   R+   R   +   + RT+ L+ +G+ + 
Sbjct: 214 AAWNGLTVADLVLPWFAWAMGFTIVNSVRVHLRVSVSRTRLIIMQLRRTVLLILFGLFIN 273

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR- 179
              +    EL         R  GVLQ +A++Y + S++E      Q   Q  GRF   + 
Sbjct: 274 SQHNSTLSEL---------RFPGVLQLLAVAYFICSVIETCLASPQRTFQ-FGRFVFLQD 323

Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPP 236
            L  W  W++   +++V+  + +  +VP      +       YG   N T G        
Sbjct: 324 ILERWTQWMVVLVIILVHTCITFFLHVPGCPRGYLGPGGYHHYGLNVNCTGG-------- 375

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
             A GYIDR + G  HMY              +P  GP           P +PEGL++++
Sbjct: 376 --AAGYIDRLIFG-QHMYQKTM----------NPVYGPT---------LPHDPEGLMNTI 413

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLY 352
           S++L   +GV  G + +      +R+ +W     A  L+ G+  +F+     +P+NK + 
Sbjct: 414 SAVLIVFMGVQAGRIFVTYYQANSRIIRWFAWFVATGLLAGILCNFSQEKGWVPVNKNMM 473

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           +LS+V  TS  A L+FS +Y L+D     +  +P  + G N +L+YV
Sbjct: 474 SLSFVLCTSSFAFLLFSILYYLIDHKKF-WSGVPFIYAGANPILLYV 519


>gi|348577435|ref|XP_003474490.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cavia porcellus]
          Length = 638

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 113/385 (29%), Positives = 188/385 (48%), Gaps = 54/385 (14%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA-- 91
           RL  LD FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G ++ L+  
Sbjct: 238 RLRCLDTFRGIALILMVFVNYGGGRYWYFRHSSWNGLTVADLVFPWFVFIMGSSVFLSVT 297

Query: 92  --LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
             L+R   +   + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 298 SVLQRGCSKLKLLGKIAWRSFLLICIGIVIVN-----PNYCLGPLSWDKVRIPGVLQRLG 352

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY---VP 205
           ++Y +V+++E +FTK V +   S  RF   R     W     +L++    L  T+   VP
Sbjct: 353 VTYFVVAVLELLFTKPVHENCVSDRRFPFLRDITCSWPQWLLILLLESLWLGLTFLLPVP 412

Query: 206 DWQFT-----IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
              +      +      D GK  N T G          A GYID  +LG +H+Y HP   
Sbjct: 413 GCPYVSEPGYLGPGGIGDLGKYVNCTGG----------AAGYIDHLLLGSDHLYQHP--- 459

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TK 316
            S A    +                 ++PEG+L +++SI+   +GV  G ++++    TK
Sbjct: 460 -SSAVLYHT--------------KVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTK 504

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
             L R   W  +   ++   LT    N   IP+NK L+++SYV   S  +  +   +Y +
Sbjct: 505 DILIRFTAWCCV-LGVISAALTKMSENEGFIPVNKNLWSISYVTTLSTFSFCILLVLYFI 563

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYV 399
           +D+  L +   P  + GMN++LVYV
Sbjct: 564 IDVRGL-WTGTPFFYPGMNSILVYV 587


>gi|449283383|gb|EMC90042.1| Heparan-alpha-glucosaminide N-acetyltransferase [Columba livia]
          Length = 560

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 115/399 (28%), Positives = 193/399 (48%), Gaps = 50/399 (12%)

Query: 17  SEPDVSDQQEK--SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           ++P  SD   +  S    QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD
Sbjct: 145 ADPISSDPAPQLWSSAPRQRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVAD 204

Query: 75  FVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            V P+F+FI+G     +++  L++   +   + K+++R+  L+  G+++       P+  
Sbjct: 205 LVFPWFVFIMGTSISLSLSSMLRQGSSKWKVLGKILWRSFLLILLGVIVVN-----PNYC 259

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR---LYCWHWL 186
              +    +R+ GVLQR+ L+YL+V+ +E +FT+   +           R    Y   W+
Sbjct: 260 LGPLSWENLRIPGVLQRLGLAYLVVAALELLFTRAGAESGTLETPCPALRDILPYWPQWV 319

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
               + V++L L +   VP      +      D+G   N T G          A GYIDR
Sbjct: 320 FVLMLEVLWLCLTFLLPVPGCPRGYLGPGGIGDFGNYANCTGG----------AAGYIDR 369

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +LG  H+Y HP+   S    Q +                P++PEG+L ++++I    +G
Sbjct: 370 LLLGDKHIYQHPS---SNVIYQTT---------------MPYDPEGILGTINTIFMAFLG 411

Query: 306 VHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  G +I+  K      ++R   W + MG    I          IP+NK L+++SYV  T
Sbjct: 412 LQAGKIILFYKDQHKRIMSRFFIWTIVMGIISAILTKCSKEEGFIPINKNLWSVSYVTTT 471

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           S  A ++   IY LVD+  L +   P  + GMN++LVY+
Sbjct: 472 SCFAFVLLLLIYYLVDVKRL-WSGAPFFYPGMNSILVYI 509


>gi|307209305|gb|EFN86390.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
           saltator]
          Length = 552

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 202/407 (49%), Gaps = 46/407 (11%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI 62
           ++  ++ H     + E + S+   ++     R+ S+D FRG+++ LMI V++ GG +   
Sbjct: 132 KLSPDSVHDDLDRLQEAESSNPVIRTSRVNTRIRSVDTFRGISILLMIFVNNGGGKYVFF 191

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
           +H+ WNG  +AD V+P+F +I+G++I ++ +     +++  K+I R L+  F  ILL   
Sbjct: 192 NHSVWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRGKIILRCLQRAFILILLGLM 251

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR- 179
            +    +      ++ +R  G+LQ +A+SY + + +E IF +   QD     GRF++ R 
Sbjct: 252 LNSIHTK-----SLKDLRFPGILQLLAVSYFVCATIETIFMRAHSQDDLLQFGRFTVLRD 306

Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD-YGKVFNVTCGVRAKLNPP 236
            L  W  W +   +   +  + +   V D     +       +GK  N T G        
Sbjct: 307 ILDSWAQWSIIVAIATTHTLITFLLPVLDCPKGYLGPGGYHLFGKNANCTGG-------- 358

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
             A GYIDR V G +HMY+             +P  G +          P++PEG+++++
Sbjct: 359 --AAGYIDRLVFG-SHMYNK----------THNPVYGTI---------LPYDPEGIMNTI 396

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVT-MGFALLIFGLTLHF---TNAIPLNKQLY 352
           S IL   +GVH G +++      AR+ +W+   G   LI G+  HF   +  IP+NK++ 
Sbjct: 397 SVILVVYMGVHAGKILLLYYQCNARIVRWLLWSGVTGLIAGILCHFDKESGVIPVNKRMM 456

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           +LS+V   S  A L+++ ++  VD +   +   P  + G+N + +Y+
Sbjct: 457 SLSFVLTVSCFAFLLYAILHFFVD-YKQYWSGAPFIYAGLNPITLYI 502


>gi|332027964|gb|EGI68015.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
           echinatior]
          Length = 569

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 115/416 (27%), Positives = 208/416 (50%), Gaps = 48/416 (11%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI 62
           ++  +  H     + E + ++   +++  + R+ S+D FRG+++ LMI V++ GG +   
Sbjct: 149 KLSPDNVHDDLDELQEAETANIMIRTNRSSIRIRSVDTFRGISILLMIFVNNGGGQYMFF 208

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
           +H+ WNG  +AD V+P+F +I+G++I ++ +     +++  K+I R L+     ILL   
Sbjct: 209 NHSAWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRGKIIVRCLQRTIILILLGLM 268

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKD-VQDKDQSVGRFSIFR- 179
            +       Y   +  +R  GVLQ +A+SY + + +E IF K   QD     GRF++ R 
Sbjct: 269 LNS-----IYAKSLDDLRFPGVLQLLAVSYFICATIETIFMKTHPQDDVLQFGRFTVLRD 323

Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPP 236
            L  W  WL+   ++  ++ + +   VP+     +       +G+  N T G        
Sbjct: 324 ILNNWAQWLIILAIMTTHILITFLLPVPNCPTGYLGPGGYHHFGEFANCTGG-------- 375

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
             A GYIDR V G +HMY           TQ+ P  G +          P +PEG+++++
Sbjct: 376 --AAGYIDRLVFG-SHMYSK---------TQN-PVYGTI---------LPHDPEGIMNTI 413

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTN---AIPLNKQLY 352
           S IL   +GVH G +++      A++ +W+   F   LI G+   F      IP++K++ 
Sbjct: 414 SIILVVYLGVHAGKILLLYYQCNAKVIRWLLWSFVTGLIAGILCDFDKEFGVIPVSKKMM 473

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV--MAAEGIF 406
           +LS+V   S  A L+++ +Y  VD +   +   P  + G+N + +YV  +  +GIF
Sbjct: 474 SLSFVLTVSCFAFLLYAILYVFVD-YKQYWSGAPFNYAGLNPITLYVGHIVTKGIF 528


>gi|426359530|ref|XP_004047024.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gorilla
           gorilla gorilla]
          Length = 635

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 185/381 (48%), Gaps = 50/381 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG      ++  F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGAEGCIEMIEMFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+ 
Sbjct: 506 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 564

Query: 379 NLKYPFLPLAWIGMNAMLVYV 399
            L +   P  + GMN++LVYV
Sbjct: 565 GL-WTGTPFFYPGMNSILVYV 584


>gi|17229379|ref|NP_485927.1| hypothetical protein all1887 [Nostoc sp. PCC 7120]
 gi|17130977|dbj|BAB73586.1| all1887 [Nostoc sp. PCC 7120]
          Length = 375

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 177/380 (46%), Gaps = 83/380 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA++
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMS 60

Query: 90  LALKRIPDRADAVKKV---IFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +          V   IFR   +LF  G+LL G ++       +  D+  IR+ GVL
Sbjct: 61  FSLSKYTQENKPTSVVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYL  SL  +   ++  K Q              W++A  +LV Y   +    VP
Sbjct: 117 QRISLSYLFASLTVL---NLPRKGQ--------------WILAGVLLVGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   YIDR ++  +H+Y    ++     
Sbjct: 160 DYGAGVLTREG---------------------NFGAYIDRLIIPKSHLYAGDGFKNLG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI----IHTKGHLAR 321
                                 +PEGL S++ +I+S + G   G  I    + T+  L  
Sbjct: 197 ----------------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQPVQTRTSLGL 234

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
                  G   LI G    +    P+NK+L+T SYV  TSG A L+ +A Y L+++  +K
Sbjct: 235 ----ALFGIGCLIVGWGWGWV--FPINKKLWTSSYVVFTSGWALLLLAACYELIEVRLIK 288

Query: 382 YPFLPLAWIGMNAMLVYVMA 401
               P   +G+NA+ ++V++
Sbjct: 289 RWGKPFEIMGLNAIALFVLS 308


>gi|321474731|gb|EFX85695.1| hypothetical protein DAPPUDRAFT_309035 [Daphnia pulex]
          Length = 588

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 122/426 (28%), Positives = 194/426 (45%), Gaps = 71/426 (16%)

Query: 19  PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           P V+D+      K+ RL SLD FRG+ + LMI V+   G +    HA WNG  LAD + P
Sbjct: 180 PAVADEITPKK-KSSRLKSLDTFRGITIVLMIFVNDGAGQYFIFQHATWNGLQLADVIFP 238

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF-----RTLKLLFWGILLQGGFSHAPDELTYG 133
           +F++I+GV + ++L+    R ++ K  IF     R+  L F GI+         + L   
Sbjct: 239 WFMWIMGVCMPISLRSSLRRKES-KLTIFAGILRRSCLLFFLGIM--------NNSLGGP 289

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFR--LYCW-HWLM- 187
           VD+  +R+ GVLQR A++YL V    +     D+     S     +F+  +  W  W++ 
Sbjct: 290 VDLGRLRVPGVLQRFAITYLAVGTAGLLLTPADLSAPHPSSKARKLFQDIVVLWPQWILF 349

Query: 188 -----AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
                A C +  +L +  G  V       ++ D+A  G                  A GY
Sbjct: 350 LLLVAAHCFITFFLPVEEGCPVGYLGPAGLHLDNAYPGHCIG-------------GAAGY 396

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           IDR +L + H+++ P                        +   P++PEG+L S+      
Sbjct: 397 IDRLMLSVQHIFNKPT-------------------TIGVYGSGPYDPEGILGSMLCTFQV 437

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA----IPLNKQLYTLSYVC 358
            +G   G  ++   G  +RL +W+       + G  L   +     IP+NK L++LS+V 
Sbjct: 438 FLGAQAGMTLLIFSGWKSRLIRWLAWSVLTGLIGALLCLASQNDGWIPVNKNLWSLSFVL 497

Query: 359 VTSGAAALVFSAIYALVDI---WNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYY 415
           VT+G A  +  A Y L+D+   WN   PFL   + GMN +L+Y +  +G ++ F   W  
Sbjct: 498 VTTGLAFFLLGACYWLIDVQEWWN-GAPFL---YPGMNGILMY-LGHQGAYSLFPWHWEN 552

Query: 416 GDPHNT 421
           G P NT
Sbjct: 553 G-PMNT 557


>gi|351712254|gb|EHB15173.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Heterocephalus glaber]
          Length = 537

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 104/382 (27%), Positives = 180/382 (47%), Gaps = 52/382 (13%)

Query: 17  SEPDVSDQQEKSHLKT---QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           ++P   D Q ++   +    RL  LD FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 180 ADPLTGDPQPEAQCASASGHRLRCLDTFRGIALVLMVFVNYGGGRYWYFKHSSWNGLTVA 239

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G ++ L++  +  R  +    + K+ +R+  L+  GI++       P+ 
Sbjct: 240 DLVFPWFVFIMGSSVFLSMTSVLQRGCSKFKLLGKIAWRSFLLICIGIVIVN-----PNY 294

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHW 185
               +    +R+ GVLQR+ ++Y +V+++E +F K + +    ++S             W
Sbjct: 295 CLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPIPENCVLERSCPSLRDITSSWSQW 354

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L+   +  ++L L +   VP      +      D GK  N T G          A  YID
Sbjct: 355 LLILLLEGIWLGLTFLLPVPGCPTGYLGPGGIGDLGKYANCTGG----------AARYID 404

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
             +LG +H+Y HP+                           P++PEG+L +++SI+   +
Sbjct: 405 HLLLGSDHLYQHPS------------------STVLYHTEVPYDPEGILGTINSIVMAFL 446

Query: 305 GVHFGHVIIHTKGH----LARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVC 358
           GV  G +++  KG     L R   W  +   L+   LT    N   IP+NK L+++SYV 
Sbjct: 447 GVQAGKILLCYKGQTKDILIRFTAWCCV-LGLISVALTKMSENKGFIPINKNLWSISYVT 505

Query: 359 VTSGAAALVFSAIYALVDIWNL 380
             S  A  +  A+Y +VD+  L
Sbjct: 506 TLSAFAFFILLALYLVVDVRGL 527


>gi|186683151|ref|YP_001866347.1| hypothetical protein Npun_R2871 [Nostoc punctiforme PCC 73102]
 gi|186465603|gb|ACC81404.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 375

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 109/375 (29%), Positives = 182/375 (48%), Gaps = 77/375 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAAMILVNMAGVADNIYPPLAHADWHGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +  +       + +R L+    L   G+LL G ++       +  D+  IR+ GVL
Sbjct: 61  FSLSKYTEDNKPTSAIYWRILRRAAILFALGLLLNGFWNQG----VWTFDLSSIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+++YLL SL+ +   ++  K Q              W++AA +L+ Y  ++    VP
Sbjct: 117 QRISITYLLASLIVL---NLPRKGQ--------------WILAAVILIGYWLMMMYLPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   YIDR ++   H+Y    ++     
Sbjct: 160 DYGAGVLTREG---------------------NLGAYIDRMIIPKAHLYKGDGFK----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                F G              +PEGL S++ +I+S + G +F    I ++   +R    
Sbjct: 194 -----FMG--------------DPEGLFSTIPAIVSVLAG-YFTGQWIRSQPVQSRTSIG 233

Query: 326 VTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           + + G   LI G    +T   P+NK+L+T SYV  +SG A ++ +A Y L+++   +   
Sbjct: 234 LGLFGVGCLIIGWAWGWT--FPINKKLWTSSYVVFSSGWALILLAACYELIEVRLNRRWS 291

Query: 385 LPLAWIGMNAMLVYV 399
            P   +G+NA+ ++V
Sbjct: 292 KPFEIMGLNAIALFV 306


>gi|298491757|ref|YP_003721934.1| hypothetical protein Aazo_3034 ['Nostoc azollae' 0708]
 gi|298233675|gb|ADI64811.1| Protein of unknown function DUF2261, transmembrane ['Nostoc
           azollae' 0708]
          Length = 376

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 174/370 (47%), Gaps = 66/370 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+  G     +  + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMVGVADHKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +L +        K V  R L+      LL    +   ++  +  D+  IR  GVLQRI+
Sbjct: 61  FSLSKYTADNKPTKAVYLRILRRAAILFLLGLLLNGFWNKGVWTFDLSSIRFMGVLQRIS 120

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           LSYL  SL+ +    V  K+Q              W++A  +L+ Y   +    VPD+  
Sbjct: 121 LSYLFASLIVL---KVPGKNQ--------------WVLAGVLLIGYWLTMMYVPVPDYGA 163

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            ++ ++                      N  G+IDR ++   H+Y               
Sbjct: 164 GVLTREG---------------------NFGGFIDRLIIPKAHLY--------------- 187

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
                 + D  ++     +PEGL S++ +I+S ++G +F  + I  + HL        + 
Sbjct: 188 ------KGDGFNYLG---DPEGLYSTIPAIVSVLVG-YFAGIRIKERKHLNSQTSMDFVL 237

Query: 330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAW 389
           F L    +++ +  A P+NK+L+T SYV  T+G A ++ +A Y L+++  +K    P   
Sbjct: 238 FGLCCLVVSIIWDVAFPINKKLWTSSYVVFTTGWALMLLAACYELIEVRLIKRWSKPFEI 297

Query: 390 IGMNAMLVYV 399
           +G+NA+ ++V
Sbjct: 298 MGLNAIALFV 307


>gi|328870644|gb|EGG19017.1| hypothetical protein DFA_02260 [Dictyostelium fasciculatum]
          Length = 759

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 118/431 (27%), Positives = 179/431 (41%), Gaps = 110/431 (25%)

Query: 1   MSEIKAETTHHHPLI-----ISEPDVSDQQEKSHLKT----------QRLASLDIFRGLA 45
           +  I    T   PL+     +S P  +D    +  KT          +R+ SLD  RGL 
Sbjct: 3   IENISHNHTEKSPLLNEQQHVSLPINNDDSTATITKTPSATPTTTQRKRVLSLDTVRGLT 62

Query: 46  VALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAV 102
           +  MILVD+ GG    WP +    WNG + AD + P FLFI G ++ALALK   +     
Sbjct: 63  IFGMILVDNQGGPQVIWPLL-ETEWNGLSTADLIFPSFLFICGFSVALALKSAKNDIKTW 121

Query: 103 KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
             +I RTL L F    L        + + +       R+ GVLQRI++ Y          
Sbjct: 122 YNIIRRTLLLFFIQAFL--------NLMAHKFVFDSFRVMGVLQRISICYFACCC----- 168

Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
                       F +  L      + AC   +YL+++YG  VP                 
Sbjct: 169 -----------SFLLLPLVGQRIFLVACA-AIYLSVMYGLDVPG---------------- 200

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
               CG R  L P CNA  YID  VLG N ++                            
Sbjct: 201 ----CG-RGVLTPSCNAGSYIDNSVLGANMIH---------------------------- 227

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQWVTMGFALLIFGL 337
              P +PEGLLS+ S+ ++T +G+  G +       H   HL  L +W+ +     + G+
Sbjct: 228 ---PNDPEGLLSTFSAFITTWMGLELGRIFTRFYRKHDYAHLNILIRWIGIAVVFGVTGI 284

Query: 338 TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--WN------LKYPFLPLAW 389
            L  T  +P+NK +++ S+  +T    +L+ S  Y L+D+  W+      +++   P  W
Sbjct: 285 ALGVTK-MPVNKLIWSFSFALITVACGSLLISVAYYLLDVVEWSPTVKRHIEFSIQPFMW 343

Query: 390 IGMNAMLVYVM 400
           IGMN + +Y +
Sbjct: 344 IGMNPISIYTL 354


>gi|195133238|ref|XP_002011046.1| GI16326 [Drosophila mojavensis]
 gi|193907021|gb|EDW05888.1| GI16326 [Drosophila mojavensis]
          Length = 570

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 120/393 (30%), Positives = 188/393 (47%), Gaps = 54/393 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D   K+  + QR+ SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P 
Sbjct: 168 SIGDAAAKA-TQRQRMRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPT 226

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +     ++++R  KL   G+ L           T G  
Sbjct: 227 FLWIMGVCIPLSIKAQLGRGISKPRICLRIVWRACKLFAIGLCLNS---------TNGPQ 277

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
           +  +RL GVLQR  ++YL+  ++  I ++      Q   + +I+   L+     +   ++
Sbjct: 278 LEQLRLMGVLQRFGIAYLVAGVLHTICSRRDYLSPQRAWQRAIYDICLFSGELAVLLALI 337

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
             YL L +G  VP      +       GK  N         +P C   A GY+DR +LG 
Sbjct: 338 AAYLGLTFGLRVPGCPRGYLGPG----GKHNNAA-------DPNCIGGAAGYVDRLILGN 386

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y HP                    DA +     F+PEG+   + SI+  ++G   G 
Sbjct: 387 AHIYQHPT--------------AKFVYDASA-----FDPEGVFGCLLSIVQAMLGCFAGV 427

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTL-HFTNA---IPLNKQLYTLSYVCVTSGAAAL 366
            ++      ARL++W+     L + G  L  F+     IP+NK L++LS+V VT   A +
Sbjct: 428 TLLVHVTWQARLRRWLLGATLLGVLGGALCGFSKEHGIIPVNKNLWSLSFVFVTVALALV 487

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           + S +Y +VD+  L +   P    GMNA+++YV
Sbjct: 488 LLSVLYYVVDVRQL-WSGSPFTECGMNAIIMYV 519


>gi|170027692|ref|XP_001841731.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167862301|gb|EDS25684.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 558

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/279 (34%), Positives = 138/279 (49%), Gaps = 30/279 (10%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
           ++ T     L  + P +S Q      KT RL SLD FRG+A+ LMI V+  GGD+  I H
Sbjct: 263 RSRTPSEPQLSPNSPTISVQATGVPQKT-RLRSLDTFRGIAIMLMIFVNSGGGDYWWIEH 321

Query: 65  APWNGCNLADFVMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQ 120
           A WNG ++AD V P+FLFI+GV I ++L+    R   R + +K V  R+LKL   G+ L 
Sbjct: 322 ATWNGLHVADLVFPWFLFIMGVCIPISLRSQLGRNVPRYEILKNVAVRSLKLFLIGLCLN 381

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ---SVGRFSI 177
                       G  V  +RL GVLQR  ++Y +VS + ++     D+ Q   +     I
Sbjct: 382 S---------INGPTVADLRLFGVLQRFGVAYFVVSAIHLYCYQENDQLQHPLARSHADI 432

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPP 236
            RL+  HW++   ++ VYL +++   VP+               ++ N T G+       
Sbjct: 433 LRLW-KHWVIVGTIVFVYLLVIFFVPVPNCPSGYFGPGGKHLMLLYPNCTGGI------- 484

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
               GYIDR+VLGI H+Y HP  R         P EGP 
Sbjct: 485 ---TGYIDRQVLGIRHLYQHPTARYMYDAMPFDP-EGPF 519


>gi|196012186|ref|XP_002115956.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
 gi|190581732|gb|EDV21808.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
          Length = 580

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 170/365 (46%), Gaps = 75/365 (20%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
            +E  H   QR+ ++D FRGL + +MI V+  GG +      PWNG   AD ++P+F+FI
Sbjct: 238 SEESIHPLAQRIYAVDAFRGLCITIMIFVNSGGGGYWYFRSTPWNGLTFADLILPWFIFI 297

Query: 84  VGVAIALAL--------KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           VG+ IAL+          R+P  + AV KV+ R++ L   G+ L             GV+
Sbjct: 298 VGICIALSFYNHRYITASRLPP-SSAVLKVLSRSVILFLIGLFLND-----------GVN 345

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WH-WLMAACVLV 193
           +   R+ G LQ++A+SY++VSL  ++    +  D      +I  + C W  W+    +L 
Sbjct: 346 LSTWRIPGNLQKVAISYIVVSLSVLYL--AKPPDTITNLRAIREIVCIWKIWIGMIGLLS 403

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           +YL+L++   VP            +D    +N T G          A GYIDR + G NH
Sbjct: 404 IYLSLIFALPVPGCPTGYFGPGGLSDDANHYNCTGG----------ATGYIDRFIFG-NH 452

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +  +P+ +                       H PF+ EG LS+++SIL+  +G+      
Sbjct: 453 LDANPSCKVLYR------------------THMPFDSEGCLSTLTSILTCFMGLQ----- 489

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           + T   L   KQ   MGF              IPLN+ L++LSY+ +  G A  V   +Y
Sbjct: 490 VATGVALCGGKQ--NMGF--------------IPLNRNLWSLSYITLLGGLAYFVLMMLY 533

Query: 373 ALVDI 377
            L+D+
Sbjct: 534 LLIDV 538


>gi|75909960|ref|YP_324256.1| hypothetical protein Ava_3756 [Anabaena variabilis ATCC 29413]
 gi|75703685|gb|ABA23361.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 375

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 171/374 (45%), Gaps = 75/374 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA++
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMS 60

Query: 90  LALKRIPDR---ADAVKKVIFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +         AV   IFR   +LF  G+LL G ++       +  D+  IR+ GVL
Sbjct: 61  FSLSKYTQENKPTSAVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYL  SL  +   ++  K Q              W++A  +LV Y   +    VP
Sbjct: 117 QRISLSYLFASLAVL---NLPRKGQ--------------WILAGVLLVGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   Y+DR ++   H+Y    ++     
Sbjct: 160 DYGAGVLTREG---------------------NFGAYVDRLIIPQAHLYAGDGFKNLG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                 +PEGL S++ +I+S + G   G  I             
Sbjct: 197 ----------------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQSVQTRTSVGL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
              G   LI G    +    P+NK+L+T SYV  TSG A L+ +A Y L+++  +K    
Sbjct: 235 ALFGIGCLIVGWGWGWV--FPINKKLWTSSYVVFTSGWALLLLAACYELIEVRLIKRWSK 292

Query: 386 PLAWIGMNAMLVYV 399
           P   +G+NA+ ++V
Sbjct: 293 PWEIMGLNAIALFV 306


>gi|66826507|ref|XP_646608.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
 gi|60474509|gb|EAL72446.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
          Length = 426

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 115/401 (28%), Positives = 174/401 (43%), Gaps = 93/401 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R+ SLD  RGL +  MILVD+  G+   WP ++   WNG + AD + P F+FI G +IA
Sbjct: 44  RRMGSLDAVRGLTIFGMILVDNQAGNDVIWP-LNETEWNGLSTADLIFPSFIFISGFSIA 102

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           LALK   +       +I RTL L F    +Q   +   D   +       R+ GVLQRIA
Sbjct: 103 LALKNSKNTTSTWYGIIRRTLLLFF----IQCFLNLMGDHFNFTT----FRIMGVLQRIA 154

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           + Y    L             S   F IF       L    V V Y++++Y   VP    
Sbjct: 155 ICYFFSCL-------------SFLCFPIFL----QRLFLLSVTVTYISIMYALNVPK--- 194

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
                            CG RA L   CNA  YID KV G+N M             ++S
Sbjct: 195 -----------------CG-RANLTQNCNAGAYIDSKVFGLNIM-------------KES 223

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQ 324
              GP   D          PEGL+S++SS ++  +G+ FG +       H  G+   + +
Sbjct: 224 NLNGPYYND----------PEGLISTMSSFITAWMGLEFGRIFTRFYKKHDFGNTDIIVR 273

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--WN--- 379
           W+ +    ++  ++L  T  +P NK++++ S+   T GA+  +    + L+D+  W    
Sbjct: 274 WILLVILFMVPAISLGAT-VMPFNKKIWSFSFALFTVGASGSLILIAFILIDVIDWESLK 332

Query: 380 -------LKYPFLPLAWIGMNAMLVYVMAA--EGIFAGFIN 411
                  +     P+ WIG N + +Y +    E I   +IN
Sbjct: 333 CEKVRKIIDLIIKPMKWIGQNPITIYSLMVFIEIILMYYIN 373


>gi|390344818|ref|XP_795043.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Strongylocentrotus purpuratus]
          Length = 680

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 122/425 (28%), Positives = 199/425 (46%), Gaps = 79/425 (18%)

Query: 5   KAETTHHHPLIISEPDVSDQ---------QEKSHLKTQRLASLDIFRGLAVALMILVDHA 55
           K+   +H+  I+S     DQ         + KS L   RL S+D FRGLA+  ++L    
Sbjct: 255 KSLPINHNGSILSNGSQDDQTPLTFPASDKPKSSL---RLRSVDTFRGLAITHLVLGASG 311

Query: 56  GGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLK 111
            G +   +HA W G  +ADF+ P+F+FI+G +I L+   +  +  +     KK++FR++ 
Sbjct: 312 DGHFWYSNHARWYGITVADFMFPWFVFIMGTSIHLSFNILLSKGLSYCAIFKKIVFRSIS 371

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ-DKDQ 170
           L   G+ +Q   SH         D+R +R+ GVLQR  ++Y +V+   + ++ +Q  + +
Sbjct: 372 LFIMGVCIQ---SHN--------DLRNLRIPGVLQRFGITYFIVASSYLLSRRLQARRAE 420

Query: 171 SVGR-FSIFRLYC--WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS---ADYGKVFN 224
             G+ + +FR         +AAC LVV+L L +   VP               + G++ N
Sbjct: 421 KTGKCYMMFRDITDYLELPLAACCLVVHLCLTFLLPVPGCPLGYQGPGGPLVGENGELTN 480

Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
            T G          A GYIDR      H+            T D  +   +R D      
Sbjct: 481 CTGG----------ASGYIDRTFFTEAHLI--------LVNTCDDVYRTIVRSD------ 516

Query: 285 APFEPEGLLSSVSSILSTIIGVHFG---HVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
               PEG+L + +SI   + G+  G   H+    +G L RL  W   G AL+     L  
Sbjct: 517 ----PEGILGTFTSIALCVFGLQSGKILHLFTTVRGRLVRLLLW---GLALISCSAVLCK 569

Query: 342 TNA----IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI---WNLKYPFLPLAWIGMNA 394
            +     IPLNK L+++S++ +T G A +V +  + L+D+   WN      PL + GMN+
Sbjct: 570 CSMADGWIPLNKNLWSVSFIALTGGTAFIVQALFHVLIDVTHFWNGA----PLFYAGMNS 625

Query: 395 MLVYV 399
           +L+Y+
Sbjct: 626 ILLYI 630


>gi|118378164|ref|XP_001022258.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila]
 gi|89304025|gb|EAS02013.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila
           SB210]
          Length = 827

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 167/385 (43%), Gaps = 85/385 (22%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMP 78
            D Q+ +    QRL  LDI+RGL +  MILVD+ G     WP +    WNG + AD V P
Sbjct: 450 KDIQQPAAAPKQRLECLDIYRGLTMVGMILVDNMGNSSVIWP-LDETEWNGLSTADCVFP 508

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
            FLFI G+AI LA+K   ++     +++ R +KL   G+ L    ++   +         
Sbjct: 509 SFLFISGMAITLAIKHNGNKKQQFFRILERFVKLFVIGVALNAACANYKQQF-------- 560

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
            R+ GVLQRIA+ Y + S   +F ++                +   +++    L++Y+  
Sbjct: 561 -RIMGVLQRIAICYFVTSTSYLFLQN----------------FAVQFVLNGVFLLIYIYF 603

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           +Y   VPD                    CG    + P CN   Y+D ++  +N+M     
Sbjct: 604 MYFFDVPD-------------------GCGAN-NVTPTCNFGRYLDMQIFTLNYMM---- 639

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
                                      P +PEGL +++ ++++T IG+ +G  +   K  
Sbjct: 640 --------------------------KPSDPEGLFTTLGALVTTFIGLCYGLALQEFKSQ 673

Query: 319 LARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
             RL   W  M   L+  G    F    P+NK++++ S+V +    +    +  + +VDI
Sbjct: 674 KKRLSCIWFVMSLVLVFIGGICCFL--TPINKKVWSPSFVFIVGSMSGAFLNLCFIVVDI 731

Query: 378 WN---LKYPFLPLAWIGMNAMLVYV 399
           +N   L      L W+G+N + V+V
Sbjct: 732 YNNLKLNKALEFLKWLGLNPLFVFV 756


>gi|121489785|emb|CAK18864.1| hypothetical protein [Phillyrea latifolia]
          Length = 129

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 64/131 (48%), Positives = 84/131 (64%), Gaps = 3/131 (2%)

Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
              L P CN+ G IDR VLGI+H+Y  P +R  K C   S   G + + APSWCHAPF+P
Sbjct: 1   NGDLGPACNSAGMIDRNVLGIDHLYAKPVYRNLKECNISS--HGQVPETAPSWCHAPFDP 58

Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNK 349
           EG+LSS+++ +S IIG+ +GH+++  + H  RL  W    FA L  GL L F   IPLNK
Sbjct: 59  EGILSSLTAAVSCIIGLQYGHILVRLQDHKERLCNWSIFSFAFLGLGLFLAFV-GIPLNK 117

Query: 350 QLYTLSYVCVT 360
            LYT+SY+ VT
Sbjct: 118 SLYTISYLLVT 128


>gi|270160204|ref|ZP_06188860.1| membrane protein, putative [Legionella longbeachae D-4968]
 gi|289165026|ref|YP_003455164.1| hypothetical protein LLO_1691 [Legionella longbeachae NSW150]
 gi|269988543|gb|EEZ94798.1| membrane protein, putative [Legionella longbeachae D-4968]
 gi|288858199|emb|CBJ12067.1| putative membrane protein [Legionella longbeachae NSW150]
          Length = 372

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 174/376 (46%), Gaps = 84/376 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           L+ +R+ SLD+FRGL +ALM+LV+  G    +P + H+ WNGC LAD V P FLFIVG+ 
Sbjct: 6   LQNERILSLDVFRGLTMALMVLVNSLGTRISYPILLHSEWNGCTLADLVFPSFLFIVGMT 65

Query: 88  IALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LKR      + +    +  RT+ L   GI L        +     VD+  IR+ G+
Sbjct: 66  TVISLKRHIKEESKTEIYYSIFKRTIILFLLGIFL--------NVFPKNVDISSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL YL+ + + + T           R  IF               ++L +L G   
Sbjct: 118 LQRIALCYLICAFIYLHTTI---------RAQIF---------------IFLGILLGY-- 151

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             W F             F++      +L    N VGYID+ +    H+           
Sbjct: 152 --WYFL----------ACFHLPVSGMNQLTITRNWVGYIDQLLFSPKHLLFR-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 F+PEGLLS++ SI +T+ G+  G++++     + + K+
Sbjct: 192 ---------------------NFDPEGLLSTIPSIATTLSGLIAGNLLL---AQIQKQKK 227

Query: 325 WVTMGFALLIFGLTLHFTN-AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
            + M  + L+F L     N + P+NK L+T S+V   SG + ++FS  Y ++DI      
Sbjct: 228 CILMVASGLVFLLLAWLWNYSFPINKNLWTSSFVLWCSGFSLIIFSFCYFIIDIKGYNKW 287

Query: 384 FLPLAWIGMNAMLVYV 399
            LP   +GMNA+ +++
Sbjct: 288 SLPFKILGMNALFIFI 303


>gi|156401294|ref|XP_001639226.1| predicted protein [Nematostella vectensis]
 gi|156226353|gb|EDO47163.1| predicted protein [Nematostella vectensis]
          Length = 387

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 96/375 (25%), Positives = 168/375 (44%), Gaps = 50/375 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL SLD FRG+++ +MI V+  GG +   +H+ WNG  +AD V P+F++I+GV++ L+ +
Sbjct: 1   RLKSLDTFRGISLTVMIFVNFGGGGYYFFAHSIWNGLTVADLVFPWFMWIMGVSMVLSFR 60

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
            +  +  +  ++I +  K     +L   G   + +   Y       R+ GVLQR A  Y 
Sbjct: 61  VLRRKQISTYRIIIKITKRTL--LLFALGLFTSNNLTNY-------RIPGVLQRFAACYF 111

Query: 154 LVSLVEIFTKDVQDKDQSVGRF-----SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           +V+++++      +  Q  G +      +  L+   WL+    L++Y+ + Y T +    
Sbjct: 112 VVAVIQVLAGPSVEDSQPRGSWWDGIRDVVSLWA-QWLLMFAFLIIYVVVTYATELHGCP 170

Query: 209 FTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                    +D    FN T G+ +                       H  +W   K   Q
Sbjct: 171 RGYTGPGGISDNSSAFNCTGGMAS-----------------------HVDSWLLGKHVYQ 207

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
              F+   R           +PEG++ +++SI    +GV  GH +     H  RL +W  
Sbjct: 208 RGTFKDMYRTTVAH------DPEGVMGTLTSIFIVFLGVQAGHTLFTFSHHRQRLVRWFV 261

Query: 328 MGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
               L +  + L         IP+NK L+++S+V  T   A L+ S  Y  +++W L + 
Sbjct: 262 WAVLLGVIAIGLSGGTQNDGVIPINKNLWSISFVLATGSMAFLLLSFCYVTIEVWEL-WN 320

Query: 384 FLPLAWIGMNAMLVY 398
             P  + GMN++LVY
Sbjct: 321 GAPFIYPGMNSILVY 335


>gi|423226736|ref|ZP_17213201.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392627009|gb|EIY21050.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 368

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 116/398 (29%), Positives = 179/398 (44%), Gaps = 104/398 (26%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S P +S   +K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D
Sbjct: 4   SHPPISTSPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSH 125
            V PFFLFI+GV+  +AL +   +A    V+KV+ RTL +L  G        +  G F  
Sbjct: 59  LVFPFFLFIMGVSTYIALSKFHFQASGSVVRKVLKRTLVILCIGWAIHWFHFICDGDF-- 116

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                        +RL GVL RIAL Y +VS V ++        + +G           W
Sbjct: 117 --------FPFAHLRLTGVLPRIALCYCVVSFVALYV-----NHKYIG-----------W 152

Query: 186 LMAACVLVVYLALLY--GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
           ++  C++  Y  LL     Y PD        D+                     N +  I
Sbjct: 153 II-GCLIAGYAVLLCIGNGYAPD--------DT---------------------NLLAII 182

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR VLG +H+YH                             +P +PEGL S++S+I  T+
Sbjct: 183 DRNVLGADHLYH----------------------------KSPIDPEGLTSTLSAIAHTL 214

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           IG   G +I+  +    +  +    GF L+  G  L  T A+PLNK++++ ++V VT G 
Sbjct: 215 IGFCCGKIILAKEALEQKTLKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGL 272

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           AA++ S +   +D+   K         G+N + +YV++
Sbjct: 273 AAMLQSVLIYFIDMKEKKNWCRFFEIFGVNPLFLYVLS 310


>gi|434403337|ref|YP_007146222.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
 gi|428257592|gb|AFZ23542.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
          Length = 375

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 173/378 (45%), Gaps = 83/378 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG  D  +P + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGMTIAAMILVNMAGVADEIYPLLDHAKWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +        K V  R L+    L   G+LL G ++       +  D+  IR  GVL
Sbjct: 61  FSLSKYTAANKPTKAVYLRILRRAAILFALGLLLNGFWNKG----VWTFDLSNIRFMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+L+YLL SL  +    +  K Q              W++A  +LV Y   +    VP
Sbjct: 117 QRISLTYLLASLAVL---QLPRKGQ--------------WILAVVLLVGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   +IDR ++   H+Y    +      
Sbjct: 160 DYGAGVLTREG---------------------NFGAFIDRLIIPKAHLYKGDGFNLLG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI----IHTKGHLAR 321
                                 +PEGL S++ ++++ + G   G  I    + ++  L  
Sbjct: 197 ----------------------DPEGLFSTIPAVVNVLAGYFAGEWIRSQPVKSRTSLGL 234

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
               V +G A L+ G    +    P+NK+++T SYV  TSG A L+ +A Y L+++  +K
Sbjct: 235 ----VLVGVACLVIGWGWGWI--FPINKKIWTSSYVVFTSGWALLLLAACYELIEVRLIK 288

Query: 382 YPFLPLAWIGMNAMLVYV 399
               P   +G+NA+ ++V
Sbjct: 289 RWSKPFEIMGLNAIALFV 306


>gi|383859754|ref|XP_003705357.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Megachile rotundata]
          Length = 572

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 117/408 (28%), Positives = 199/408 (48%), Gaps = 65/408 (15%)

Query: 17  SEPDVSDQQEKSHLKTQ----------RLASLDIFRGLAVALMILVDHAGGDWPEISHAP 66
           + PD  D+ +++   T           R+ S+D FRG+A+ LMI V++ GG +   +H+ 
Sbjct: 155 NAPDDLDRLQETESTTHPVIRTTRASTRIRSVDTFRGIAILLMIFVNNGGGKYVFFNHSA 214

Query: 67  WNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK-KVIFRTLKLLFWGILLQGGFSH 125
           W G  +AD V+P+F +I+G+ I ++ KR   R    + K++   ++     ILL    + 
Sbjct: 215 WYGLTVADLVLPWFAWIMGLTITIS-KRAELRVTVSRVKIMLHCIRRSLVLILLGLMLNS 273

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKD-VQDKDQSVGRFSIFR--LY 181
             +          +R  GVLQ + +SY + S++E IF K   QD     GRF+ FR  L 
Sbjct: 274 IKNN-----SFSDLRFPGVLQLLGVSYFVCSMLETIFMKPHSQDTLLQFGRFASFRDILD 328

Query: 182 CW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DY-GKVFNVTCGVRAKLNPPCN 238
            W  WL+ A ++  +  + +   VP+            +Y GK  N T G          
Sbjct: 329 SWPQWLVMAVIMTTHTLITFLLPVPNCPKGYFGPGGQYEYRGKYMNCTAG---------- 378

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
           A GYIDR + G NHMY  P         ++S +   LR          ++PEGL++++S+
Sbjct: 379 AAGYIDRLIFG-NHMYPKP---------KESIYGDILR----------YDPEGLMNTISA 418

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQLYTL 354
           I    +GVH G +++    + +R+ +W+       +I G   HF+     IP++K++ TL
Sbjct: 419 IFIVYLGVHAGKILLLYYQYNSRVIRWILWAVLTGIIAGNLCHFSTQGGVIPVSKRMMTL 478

Query: 355 SYVCVTSGAAALVFSAIYALVD---IWNLKYPFLPLAWIGMNAMLVYV 399
           S+    S  A L+++ +Y+L+D   +W+      P  + G N +L+YV
Sbjct: 479 SFDLTCSSFAFLLYAILYSLIDCKQVWSGA----PFIYAGTNPILLYV 522


>gi|294463099|gb|ADE77087.1| unknown [Picea sitchensis]
          Length = 218

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 63/140 (45%), Positives = 95/140 (67%), Gaps = 1/140 (0%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLSS+ ++++  IG+HFGH+++H KGH  R+ Q +     L+ FG+ L     +P
Sbjct: 44  FDPEGLLSSIMAVVTCFIGLHFGHILVHFKGHSERVLQCIIPSLGLIFFGIALGIF-GMP 102

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIF 406
           LNK LYT +Y+CVTSGAA L+F  IY LVD++  + P + L W+GMNA+ +YV+ A  +F
Sbjct: 103 LNKPLYTFNYMCVTSGAAGLLFVGIYLLVDLYGYRRPTMLLEWMGMNALTLYVLVASDLF 162

Query: 407 AGFINGWYYGDPHNTLVCFL 426
              I G+Y+  P N +V ++
Sbjct: 163 FIAIQGFYWRTPANNIVTYI 182


>gi|427710153|ref|YP_007052530.1| heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
           7107]
 gi|427362658|gb|AFY45380.1| Heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
           7107]
          Length = 387

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 181/394 (45%), Gaps = 81/394 (20%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPF 79
           +  E S   + RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V PF
Sbjct: 7   NVMENSSTPSTRLVSLDVFRGIAIASMILVNNPGSWDYIYPPLDHAEWHGCTPTDLVFPF 66

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGV-- 134
           FLFIVGVA+  +  +          V  R L+   +LF   L    F+   D L  G+  
Sbjct: 67  FLFIVGVAMPFSFAKYTPENRPTATVYQRILRRGLILFALGLFLALFTLTLDWLIKGITP 126

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +   +R+ GVLQRI+L+Y++ +L  +          ++ R  +       W++AA +L+ 
Sbjct: 127 NFSTLRIMGVLQRISLAYVIAALAVL----------NLSRRGL-------WILAAVILIG 169

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +    VP             +G            L P  N  GYIDR +LG  H+Y
Sbjct: 170 YWLAMQFIPVP------------GFGA---------GNLTPEGNLGGYIDRIILG-KHIY 207

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                           F+PEGL S++ ++++  +G +F    + 
Sbjct: 208 R----------------------------SGSFDPEGLFSTLPAVVTVFLG-YFTGDWLR 238

Query: 315 TKGHLARLK-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           T+   +R     V  G   LI G    F    P+NKQL+T SYV  T+G A L  +  Y 
Sbjct: 239 TQPIKSRTSINLVIFGLITLIIGQLWGFI--FPINKQLWTSSYVLFTAGWALLTLALCYE 296

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           L+++ + +   LPL  +G+NA+ ++V  A GIF 
Sbjct: 297 LIEVRHFRRWGLPLEVMGLNAIFLFV--ASGIFT 328


>gi|119583586|gb|EAW63182.1| hCG1993224, isoform CRA_a [Homo sapiens]
          Length = 382

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 177/366 (48%), Gaps = 50/366 (13%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKK 104
           M+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++  I  R  +    + K
Sbjct: 1   MVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 60

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK 163
           + +R+  L+  GI++       P+     +    +R+ GVLQR+ ++Y +V+++E +F K
Sbjct: 61  IAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAK 115

Query: 164 DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADY 219
            V +   S       R     W  WL+   +  ++L L +   VP      +      D+
Sbjct: 116 PVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPGCPTGYLGPGGIGDF 175

Query: 220 GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
           GK  N T G          A GYIDR +LG +H+Y HP    S A    +          
Sbjct: 176 GKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSAVLYHT---------- 211

Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLARLKQWVTMGFALLIF 335
                  ++PEG+L +++SI+   +GV  G ++++    TK  L R   W  +   L+  
Sbjct: 212 ----EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISV 266

Query: 336 GLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMN 393
            LT    N   IP+NK L++LSYV   S  A  +   +Y +VD+  L +   P  + GMN
Sbjct: 267 ALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKGL-WTGTPFFYPGMN 325

Query: 394 AMLVYV 399
           ++LVYV
Sbjct: 326 SILVYV 331


>gi|414077874|ref|YP_006997192.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
 gi|413971290|gb|AFW95379.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
          Length = 376

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 174/373 (46%), Gaps = 74/373 (19%)

Query: 34  RLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL+SLD+FRG+ +A MIL + AG     +  +SHA W+GC   D + P FLFIVGVA+  
Sbjct: 2   RLSSLDVFRGITIAAMILANMAGVADDVYRPLSHAQWHGCTPTDLIFPCFLFIVGVAMTF 61

Query: 91  ALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           +L +   +    K V  R L+    L   G++L G ++       +  D+  IRL GVLQ
Sbjct: 62  SLAKYTAQNKPTKAVYLRILRRTAILFILGLVLNGFWNQG----VWTFDLSSIRLMGVLQ 117

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIAL+YL  SL+ +    +  K Q              WL+A  +L+ Y   +    VPD
Sbjct: 118 RIALTYLFASLIVL---KLPRKSQ--------------WLVAGGLLIAYWLTMMYIPVPD 160

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
           +   ++ ++                      N   +IDR ++   H+Y    +       
Sbjct: 161 YGAGVLTREG---------------------NFGAFIDRLIIPKAHLYKGDGFN------ 193

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
               F G              +PEGL S++ +I+S + G   G  I   K   ++    +
Sbjct: 194 ----FLG--------------DPEGLFSTIPAIVSVLAGYFTGQWIKDKKQATSQTSMDL 235

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            + F L    + + +  + P+NK+++T SYV  T+G A ++ +A Y L+++  +K    P
Sbjct: 236 VL-FGLCCLVIAIIWDVSFPINKKIWTSSYVLFTTGWALMLLAACYELIEVRLIKRWSKP 294

Query: 387 LAWIGMNAMLVYV 399
              +G+NA+ ++V
Sbjct: 295 FEIMGLNAIALFV 307


>gi|348507459|ref|XP_003441273.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oreochromis niloticus]
          Length = 460

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 118/402 (29%), Positives = 181/402 (45%), Gaps = 74/402 (18%)

Query: 20  DVSDQQEKSH----LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + S   E +H     K  RL SLD FRG A+ +M+ V++ GG +    HAPWNG  +AD 
Sbjct: 60  EESHASETAHGTVKAKPTRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADL 119

Query: 76  VMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE-- 129
           VMP+F+F++G ++ LA     +R   R   ++K+ +RT+ LL  G       +++P +  
Sbjct: 120 VMPWFVFVIGTSVVLAFSSMQRRGVSRLQLLRKITWRTVVLLLLGFCF---LNYSPRDGP 176

Query: 130 ---LTYGVDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFRLYCWHW 185
              L    D R     G+       +LL S+    +   VQD            LY   W
Sbjct: 177 CSVLVLAEDPRSAAASGL-------HLLCSVSPYNWWNPVQD----------ILLYWPQW 219

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L+   +  ++L L +   VP+     +      D G   N T G          A GYID
Sbjct: 220 LIIILLETLWLCLTFLMPVPNCPTGYLGAGGIGDNGLYPNCTGG----------AAGYID 269

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R + G N MY +P  +     TQ                  PF+PEG+L +++SI+   +
Sbjct: 270 RWMFGDN-MYRYPTCKEMYRTTQ------------------PFDPEGVLGTINSIVIGFL 310

Query: 305 GVHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           G+  G ++I  K      L R   W V +G +  I          IP+NK L++LSYV  
Sbjct: 311 GMQAGKILIFYKKMNIHILCRFLVWTVILGISAAILSKCTRDGGFIPVNKNLWSLSYVMC 370

Query: 360 TSGAAALVFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVYV 399
               + L+   ++ +VDI  W    PF+   + GMN++ VYV
Sbjct: 371 MGCLSFLLLGGMFFVVDIKGWWGGQPFI---YPGMNSIFVYV 409


>gi|374580713|ref|ZP_09653807.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
           DSM 17734]
 gi|374416795|gb|EHQ89230.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
           DSM 17734]
          Length = 375

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 176/385 (45%), Gaps = 86/385 (22%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWP-EISHAPWNGCNLADFVMPFFL 81
            EK  L   R   +DIFRGL ++LM++  + G   + P ++ HA WNG  + DFV PFF+
Sbjct: 1   MEKGKL---RFDCIDIFRGLTISLMLICSNPGNITNIPAQLRHADWNGATIGDFVFPFFI 57

Query: 82  FIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           F +G+ + +A+ R  ++  +  ++I     R++ +   G++L G  +          D+ 
Sbjct: 58  FSMGIVVPIAINRRLEKGISQMRIIINVLNRSIVMFLLGLILNGFPTF---------DLA 108

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD-QSVGRFSIFRLYCWHWLMAACVLVVYL 196
           +IR+ GVLQRIA+ Y   +L+ +  K +  KD   +G  +         L+A  +L +Y 
Sbjct: 109 IIRVPGVLQRIAIVYFCSALIYLLFKSIVKKDLVQIGILT---------LIAVLLLAIYY 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
            LL G  VP  +                   G++  L      V YID K L   H+Y  
Sbjct: 160 WLLKGLQVPGIE-------------------GLKGGL------VSYIDLKYLK-GHLY-- 191

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
                                  P+     F+PEG+LS++ ++ S IIGV  G + +   
Sbjct: 192 ----------------------TPT-----FDPEGILSTIPALSSGIIGVVVGMIFLRRD 224

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
               ++  +V  G  L+IF     F    P NKQL++ S+V +TSG   LV +  Y L D
Sbjct: 225 SRFVKMTIFVCSGILLIIFA--EWFNAYFPYNKQLWSSSFVLLTSGFGILVLTIFYLLTD 282

Query: 377 IWNLKYPFLPLAWIGMNAMLVYVMA 401
           I  +     P   IG + + VY ++
Sbjct: 283 ILKIGRTLTPFKAIGASPIFVYFIS 307


>gi|282898832|ref|ZP_06306819.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281196359|gb|EFA71269.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 375

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 105/374 (28%), Positives = 171/374 (45%), Gaps = 75/374 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRGL +A+MI+ + AG     +P +SHAPWNGC   D V PFFLFIVGVA++
Sbjct: 1   MRLISLDVFRGLTIAMMIIANMAGVVPDVYPFLSHAPWNGCTPTDLVFPFFLFIVGVAMS 60

Query: 90  LALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +    +     V F    R + L   G+LL G ++          D++ +R+ GVL
Sbjct: 61  FSLSKYSLESKLDNLVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L+YL  SL+ +    + +K Q              W +A  +L+ Y   +    VP
Sbjct: 117 QRIGLAYLFASLIVL---KLPEKTQ--------------WALAGILLIFYWLTMMYIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   +IDR ++   H+Y    +      
Sbjct: 160 DYGAGMLTREG---------------------NFGAFIDRLIIAKPHLYAGDGFN----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                F G              +PEGL S++ +I++ + G   G  I  +  +       
Sbjct: 194 -----FRG--------------DPEGLFSTIPAIVNVLFGYFAGQWIRKSTINSHTSMDL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
           V  G   L+ G+   +    P+NK+L+T SYV  ++G   +  +A Y L+++  +K    
Sbjct: 235 VLWGLCSLVVGMI--WDGLFPINKKLWTSSYVLFSTGWGLVFLAACYDLIEVRKIKRWSK 292

Query: 386 PLAWIGMNAMLVYV 399
               IG+NA+ ++V
Sbjct: 293 GFEIIGLNAIALFV 306


>gi|427728534|ref|YP_007074771.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
 gi|427364453|gb|AFY47174.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
          Length = 380

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 171/375 (45%), Gaps = 77/375 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +  ++HA W+GC   D V PFFLFIVGVA+ 
Sbjct: 15  MRLTSLDVFRGITIAAMILVNMAGVADDVYLPLTHADWHGCTPTDLVFPFFLFIVGVAMT 74

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +          + +R L+    L   G+ L G ++       +  D   IR+ GVL
Sbjct: 75  FSLSKYTQDNKPTSAIYWRILRRAAILFILGLFLNGFWNQG----VWTFDFTSIRMMGVL 130

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-LALLYGTYV 204
           QRI+LSYLL SL+ +    +  K Q              WL+A  +L+ Y LA++Y   V
Sbjct: 131 QRISLSYLLASLIVL---KLPRKGQ--------------WLLAGVLLIGYWLAMMY-IPV 172

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           PD+   ++ ++                      N   Y+DR ++   H+Y    +     
Sbjct: 173 PDYGAGVLTREG---------------------NFGAYVDRLIIPKAHLYKGDGFN---- 207

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                 F G              +PEGL S++ +I+S + G   G  I            
Sbjct: 208 ------FMG--------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQPVQTRTSVG 247

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
               G   L+ G    +    P+NK+L+T SYV  TSG A L+ +A Y L+++  ++   
Sbjct: 248 LALFGIGCLMIGWAWGWV--FPINKKLWTSSYVVFTSGWALLLLAACYELIEVRLMRRWS 305

Query: 385 LPLAWIGMNAMLVYV 399
            P   +G+NA+ ++V
Sbjct: 306 KPFEIMGLNAIALFV 320


>gi|397505551|ref|XP_003823320.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
           2 [Pan paniscus]
          Length = 622

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 118/418 (28%), Positives = 189/418 (45%), Gaps = 87/418 (20%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 189 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 249 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 303

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 304 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 363

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 364 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 409

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF---------------- 308
               +                 ++PEG+L +++SI+   +GV                  
Sbjct: 410 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQCCPDWVTKQACLTEPLS 455

Query: 309 ---------------------GHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTN 343
                                G ++++    TK  L R   W  +   L+   LT    N
Sbjct: 456 PLWRILFGPCLEVRATEPAQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKVSEN 514

Query: 344 A--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
              IP+NK L++LSYV   S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 515 EGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKGL-WTGTPFFYPGMNSILVYV 571


>gi|297172331|gb|ADI23307.1| uncharacterized conserved protein [uncultured nuHF2 cluster
           bacterium HF0770_19K18]
          Length = 373

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 167/375 (44%), Gaps = 82/375 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           K+ RL SLD FRGL +A MI+V+  G +W  +     HA W+GC   D V PFFLFIVGV
Sbjct: 6   KSDRLLSLDAFRGLTIAFMIIVNTPG-NWSYVYGPLRHAEWHGCTPTDLVFPFFLFIVGV 64

Query: 87  AI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A+  + A       +D +KK+ +RT+ +  +G+LL     +A   +    D   +R+ GV
Sbjct: 65  AMRFSFAQHNYQPSSDLLKKIFWRTVTIFSFGLLL-----NAYPFIRQNWDWSSLRIMGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRI L+Y L +++ ++  + +                  W+    +L+ Y         
Sbjct: 120 LQRIGLAYGLAAILSLYLSEKK-----------------LWISCGIILIGY--------- 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             W   ++   S  +G                 N    ID  +LG NH+     WR +  
Sbjct: 154 --WLILLLFGGSDPFGL--------------SSNIARTIDIAILGENHL-----WRGTG- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                PF+PEGLLS++ +I++ +IG   G +I      ++ ++ 
Sbjct: 192 --------------------IPFDPEGLLSTIPAIVTVLIGFSIGQLIQENSNRISLVQT 231

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
            +  G  +   G    F    P+NKQL+T +YV  T G A+   +A   L+DI   K   
Sbjct: 232 ILIRGAGIAAVGWLWGFI--FPINKQLWTSTYVLYTGGLASFFLAAFIWLIDIRGYKKLS 289

Query: 385 LPLAWIGMNAMLVYV 399
            P    G N++ V++
Sbjct: 290 WPFMIFGTNSIFVFI 304


>gi|224537467|ref|ZP_03678006.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520905|gb|EEF90010.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 368

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 179/398 (44%), Gaps = 104/398 (26%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S P +S   +K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D
Sbjct: 4   SHPPISTSPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSH 125
            V PFFLFI+G++  +AL +   +A    V+K++ RTL +L  G        +  G F  
Sbjct: 59  LVFPFFLFIMGISTYIALGKFHFQASGSVVRKILKRTLVILCIGWAIHWFHFICDGDF-- 116

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                        +RL GVL RIAL Y +VS V ++        + +G           W
Sbjct: 117 --------FPFAHLRLTGVLPRIALCYCVVSFVALYV-----NHKYIG-----------W 152

Query: 186 LMAACVLVVYLALLY--GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
           ++  C++  Y  LL     Y PD        D+                     N +  I
Sbjct: 153 II-GCLIAGYAVLLCIGNGYAPD--------DT---------------------NLLAII 182

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR +LG +H+YH                             +P +PEGL S++S+I  T+
Sbjct: 183 DRNILGADHLYH----------------------------KSPIDPEGLTSTLSAIAHTL 214

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           IG   G +I+  +    +  +    GF L+  G  L  T A+PLNK++++ ++V VT G 
Sbjct: 215 IGFCCGKIILAKEALEQKTLKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGL 272

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           AA++ S +   +D+   K         G+N + +YV++
Sbjct: 273 AAMLQSVLIYFIDMKEKKNWCRFFEIFGVNPLFLYVLS 310


>gi|427709244|ref|YP_007051621.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
 gi|427361749|gb|AFY44471.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
          Length = 375

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 168/374 (44%), Gaps = 75/374 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAAMILVNMAGVADDVYPLLAHADWHGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKV---IFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +          V   IFR   +LF  G+LL   ++          D   IR+ GVL
Sbjct: 61  FSLSKYTADNKPTSTVYLRIFRRAAILFALGLLLNVFWNKGVGTF----DFSSIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYLL SL  +   ++  K Q              W++AA +L+ Y   +    VP
Sbjct: 117 QRISLSYLLASLAVL---NLPRKGQ--------------WILAAVLLIGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           ++   ++ ++                      N   Y DR ++   H+Y    ++     
Sbjct: 160 EYGAGVLTREG---------------------NFGAYFDRLIIPQTHLYAGDGFKSMG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                 +PEGL S++ +++S + G   G  I             
Sbjct: 197 ----------------------DPEGLFSTIPAVVSVLAGYFTGQWIRKQPVQTRTSVGL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
           +  G   LI G    +    P+NK+L+T SYV  TSG A L+ +A Y L+++  +K    
Sbjct: 235 ILFGMGCLIIGWAWGWV--FPINKKLWTSSYVIFTSGWALLLLAACYELIEVRLIKRWSK 292

Query: 386 PLAWIGMNAMLVYV 399
           P   +G+NA+ ++V
Sbjct: 293 PFEIMGLNAIALFV 306


>gi|344281343|ref|XP_003412439.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Loxodonta africana]
          Length = 782

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 113/405 (27%), Positives = 188/405 (46%), Gaps = 60/405 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D+  +  +      RL  +D FRGLA+ +M+ V++ GG +    HA WNG  +AD V P 
Sbjct: 362 DIQLEAWRPSAPPSRLRCVDTFRGLALIIMVFVNYGGGKYWYFKHASWNGLTVADLVFPC 421

Query: 80  FLFI--------------VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           FL I              + +++   L+R   +   + K+ +R+  L+  G+++      
Sbjct: 422 FLEILFGEDLLCTRDPLEIFLSMTSILQRGCSKFKLLGKIAWRSFLLICIGVVIVN---- 477

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC- 182
            P+     +    +R+ GVLQR+ ++Y +V+++E +F K V +   S    FS+  L   
Sbjct: 478 -PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPENCASERSCFSLRDLTAS 536

Query: 183 W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAV 240
           W  WL    +  ++L L +   VP      +      D+GK  N T G          A 
Sbjct: 537 WPQWLFILTLESIWLTLTFFLPVPGCPTGYLGPGGIGDWGKYPNCTGG----------AA 586

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GY+DR +LG  H+Y HP    S A    +                 ++PEG+L +++SI+
Sbjct: 587 GYMDRVLLGDEHLYQHP----SSAVLYHT--------------EMAYDPEGILGTINSIV 628

Query: 301 STIIGVHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTL 354
              +GV  G ++++    TK  + R   W  +   L    LT    N   IP+NK L+++
Sbjct: 629 MAFLGVQAGKILLYYKDQTKDIVIRFTAWCCI-LGLTSAALTKVSENEGFIPVNKNLWSI 687

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           SYV   S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 688 SYVTTLSSFAFFILLVLYPIVDVKGL-WTGTPFFYPGMNSILVYV 731


>gi|332709783|ref|ZP_08429740.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
 gi|332351381|gb|EGJ30964.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
          Length = 366

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 113/376 (30%), Positives = 168/376 (44%), Gaps = 82/376 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV++ G     +P + HA W+G    D V P FLFI GVA+A
Sbjct: 1   MRLTSLDVFRGIAMASMILVNNPGSWSYVYPPLLHAKWHGFTPTDLVFPAFLFIAGVAMA 60

Query: 90  LALKRIPDRADAVKK---VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
            +L +  +   +V +    I R   +LF   LL  GF       TY  D   IR+ GVLQ
Sbjct: 61  FSLVKYTNNNQSVSQGYWRIGRRCAILFALGLLLNGFP------TYNWDT--IRIMGVLQ 112

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RI+L+Y L ++  +   +++ +                W++   VL+ Y A +    VP 
Sbjct: 113 RISLAYFLSAVAVL---NLRRRGL--------------WVLTGIVLLGYWAAMSLVPVP- 154

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                      DYG            L P  N   YIDR VLG NH+Y            
Sbjct: 155 -----------DYGA---------GNLTPEGNFAAYIDRMVLGTNHLYK----------- 183

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                             A F+PEGL S+  ++++ + G   G  + H           V
Sbjct: 184 -----------------QAQFDPEGLFSTFPAVVTVLAGYFVGDWLRHQPIPSRTSLGLV 226

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
             G   L  G    F    P+NKQL+T SYV  ++G + L+ +A Y L+++   +    P
Sbjct: 227 LFGVGCLGLGWVWDFW--FPINKQLWTSSYVVFSAGWSMLLLAACYELIEVRGFRRWGWP 284

Query: 387 LAWIGMNAMLVYVMAA 402
           L  +G+NA+ ++V + 
Sbjct: 285 LEVMGLNAIFLFVASG 300


>gi|224496100|ref|NP_001139059.1| uncharacterized protein LOC565246 precursor [Danio rerio]
          Length = 582

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 111/414 (26%), Positives = 187/414 (45%), Gaps = 66/414 (15%)

Query: 2   SEIKAETTHH---HPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD 58
             +K    HH   + + + EP+   Q ++S  K+ RL SLD FRG ++ +M+ V++ GG 
Sbjct: 168 QRLKNRMCHHGSQNSMEMEEPNTEQQIDESKPKSSRLKSLDTFRGFSLTVMVFVNYGGGG 227

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLF 114
           +    HAPWNG  +AD VMP+F+FI+G ++ LA   +  +       ++KV +RT+ L+ 
Sbjct: 228 YWFFQHAPWNGLTVADLVMPWFVFIIGTSVMLAFTSMHRKGVSLLQLLRKVTWRTVVLML 287

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVG 173
            G      +S     L    D R     G+       +L  S  +  +   +QD      
Sbjct: 288 IGFCFM-NYSPRDGILVLAADTRSSPASGL-------HLFRSGTDHNWWNPIQD------ 333

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAK 232
                 LY   WL    +  ++L L +   VP+     +      D G   N T G    
Sbjct: 334 ----VILYWPEWLFIVLLETLWLCLTFLLPVPNCPTGYLGAGGVGDAGLYPNCTGG---- 385

Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
                 A  +ID+   G N M+ +P  +     T+                  PF+PEG+
Sbjct: 386 ------AAAHIDKWFFGDN-MFWYPTCKVLYRTTE------------------PFDPEGV 420

Query: 293 LSSVSSILSTIIGVHFGHVIIH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPL 347
           L +++SI+   +G+  G +++      KG LAR   W + +G +  I          IP+
Sbjct: 421 LGTINSIVMGFLGMQAGKILLFFRQMNKGILARFLVWALILGISAAILSKCTRDEGFIPV 480

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVYV 399
           NK L++LS+V   +  + L+ + +Y ++D+  W    PF+   + GMN++ VYV
Sbjct: 481 NKNLWSLSFVTCMACMSFLLLAVMYFIIDVKKWWGGQPFI---FPGMNSIFVYV 531


>gi|328869407|gb|EGG17785.1| transmembrane protein [Dictyostelium fasciculatum]
          Length = 651

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 106/430 (24%), Positives = 183/430 (42%), Gaps = 104/430 (24%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           E +  K  RL SLD+FRGL++ +MI V++ GG +   +H+ WNG  +AD V P+F+FI+G
Sbjct: 218 ESNQPKKDRLKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFIFIMG 277

Query: 86  VAI-----ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           +A+     A+ ++ +P R   + K++ R++ L   G+ +             G ++   R
Sbjct: 278 IAMPLSFNAMEIRGVPKRTIFI-KLVRRSVILFSLGLFINN-----------GNNLGHWR 325

Query: 141 LCGVLQRIALSYLLVSLVEIF--------------------------------------- 161
           + GVLQR  +SY +   + +F                                       
Sbjct: 326 ILGVLQRFGVSYFVTGCIMMFVPLYRPNGGGGGNSHHQYNRFDGTGNDREREPSESDPLF 385

Query: 162 -TKDVQD--KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-A 217
            +  +Q+  K  S    + F  +   WL A  +L V+  + +   VP      +      
Sbjct: 386 QSSSIQEKFKAHSASMLADFIPFWLQWLFALLILAVWFLVTFLLPVPGCPTGYLGPGGLG 445

Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
           D G+  N T G          A   +D  +   NH++  P  +                 
Sbjct: 446 DQGQHVNCTGG----------AAKIVDLHIFSNNHIFQTPTCQ----------------- 478

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
             P +    ++PEG L  ++S+    +GVH G  I+  K + +RL +W  +  ++L+ G+
Sbjct: 479 --PIYNTGAYDPEGTLGYLTSVFMCFLGVHAGRTIMTYKSNRSRLIRWTIL--SILLCGI 534

Query: 338 TLHFTNA------IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD---IWNLKYPFLPLA 388
                        IP+NK L+T S++ + SG    V +  Y +VD   IWN      PL 
Sbjct: 535 AAGLCGVSQNGGWIPINKNLWTPSFIFLLSGFGFFVLAIFYVVVDIKRIWNGA----PLV 590

Query: 389 WIGMNAMLVY 398
           ++GMN + +Y
Sbjct: 591 YVGMNPITIY 600


>gi|427716050|ref|YP_007064044.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
 gi|427348486|gb|AFY31210.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
          Length = 375

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 175/373 (46%), Gaps = 69/373 (18%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+  G     +P + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAAMILVNMVGVADDKYPLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +L +  +       V +R L+     + L    +   ++  +  D+  IRL GVLQRI+
Sbjct: 61  FSLSKYTEGNKPNSSVYWRILRRAAILLALGLLLNGFWNKGVWTFDLSSIRLMGVLQRIS 120

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           LSYL+ S+  +   ++  K Q              W++AA +L+ Y   +    VP    
Sbjct: 121 LSYLVASVTVL---NLPRKGQ--------------WILAAVLLIGYWLTMMYLPVPGHGA 163

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            ++ ++                      N   YIDR ++   H+Y    +          
Sbjct: 164 GVLTREG---------------------NLGAYIDRLIIPKAHLYKGDKFN--------- 193

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM- 328
            F G              +PEGL S++ +I+S + G +F  + I ++   +R    + + 
Sbjct: 194 -FMG--------------DPEGLFSTIPAIVSVLAG-YFAGLWIRSQPVRSRTSIGLALF 237

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLA 388
           G   LI G    +T   P+NK+L+T SYV  +SG A L+ +A Y L+++  ++    P  
Sbjct: 238 GIGCLIIGWAWGWT--FPINKKLWTSSYVVFSSGWALLLLAACYELIEVRLIRRWSKPFE 295

Query: 389 WIGMNAMLVYVMA 401
            +G+NA+ ++V +
Sbjct: 296 IMGLNAIALFVAS 308


>gi|392968994|ref|ZP_10334410.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
           BUZ 3]
 gi|387843356|emb|CCH56464.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
           BUZ 3]
          Length = 390

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 117/391 (29%), Positives = 172/391 (43%), Gaps = 87/391 (22%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVM 77
           S +  +S    QRL SLD FRGL VA MILV++AG DW      + HA WNG    D + 
Sbjct: 9   SVRSSESLTNPQRLLSLDAFRGLTVAGMILVNNAG-DWQYVYAPLKHAAWNGWTPTDLIF 67

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           PFFLFIVGV+I  AL    +  + + K++ R+  L   G+ L    +  P       D+ 
Sbjct: 68  PFFLFIVGVSITFALAGGQEHTNVIGKILKRSFTLFMLGLFL----AFFPK-----FDIT 118

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            +R+ GVLQRIAL YL  SL+ + T   Q                  WL+AA ++  +L 
Sbjct: 119 TVRIPGVLQRIALVYLACSLIYLRTTTRQQT----------------WLLAALLVGYWLV 162

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +           T++      Y           A L P  N   ++DR VL  +H+Y   
Sbjct: 163 M-----------TVVPVPGVGY-----------ANLEPTTNLAAWLDRTVLTTDHLY--- 197

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
             R +K                       ++PEGLLS++ +I + + GV  G  +     
Sbjct: 198 --RSTKV----------------------WDPEGLLSTIPAIGTGLAGVLVGTWLRRRDV 233

Query: 318 HLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             A    W+ T+G    + GL   +    P+NK L+T SYV + +G A L  +  Y L+D
Sbjct: 234 ADADKIAWLFTVGCLTTLGGLI--WDAWFPINKALWTSSYVLLAAGLAMLGLALCYWLID 291

Query: 377 IWNLKYPFL-----PLAWIGMNAMLVYVMAA 402
                 P       P    G+NA+ V+ ++ 
Sbjct: 292 AQPALRPTARRIAQPFVAFGVNAITVFFLSG 322


>gi|427385206|ref|ZP_18881711.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727374|gb|EKU90234.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
           12058]
          Length = 368

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/397 (27%), Positives = 174/397 (43%), Gaps = 102/397 (25%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S+P   + Q K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D
Sbjct: 4   SQPSTFNSQPK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYESLQHSAWNGLTLCD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSH 125
            V PFFLFI+G++ ++AL +   +A    V+K++ RTL +L  G        +  G FS 
Sbjct: 59  LVFPFFLFIMGISTSIALSKFHFQASGSVVRKILKRTLIILCIGWVIHWFDFICDGDFS- 117

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                        +RL GVL RIAL Y + S V ++        + +G           W
Sbjct: 118 ---------PFAHLRLTGVLPRIALCYCVASFVALYV-----NHKYIG-----------W 152

Query: 186 LMAACVLVVYLALLYGT-YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L+   +      L  G  Y PD                               N +  ID
Sbjct: 153 LIGILLAGYTFLLCIGNGYAPD-----------------------------STNLLAIID 183

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R VLG +H+YH                             +P +PEGL S+ S+I  T+I
Sbjct: 184 RNVLGADHLYH----------------------------KSPIDPEGLTSTFSAIAHTLI 215

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G   G +I+  K    +  +   +GF L+  G  L    A+PLNK++++ ++V VT G A
Sbjct: 216 GFCCGKLILAKKNLEQKTLKLFVVGFILMACGFCL--MEALPLNKRIWSPTFVLVTCGLA 273

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           A++ + +   +D+   K         G+N + +YV++
Sbjct: 274 AMLQALLIYFIDLKEKKNWCRFFEIFGVNPLFLYVLS 310


>gi|340727561|ref|XP_003402110.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus terrestris]
          Length = 571

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/383 (28%), Positives = 191/383 (49%), Gaps = 55/383 (14%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + R+ S+D FRG+ + LMI V++ GG +   +H+ W G ++AD ++P+F +I+G++I ++
Sbjct: 179 SSRIQSVDAFRGITILLMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS 238

Query: 92  ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
               L+    R       + R+  L+  G++L    S + ++L         R  GVLQ 
Sbjct: 239 KRAELRLTTSRVKITLCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGVLQL 289

Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGT 202
           +++SY + +++E IF K   QD     GRF+IFR  L  W  WL+ A ++  +  + +  
Sbjct: 290 LSVSYFVCAILETIFMKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTTHTLITFFL 349

Query: 203 YVPDWQFTIINKDSADY--GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           ++P+            +  GK  N T G          A GYIDR + G NH Y      
Sbjct: 350 HIPNCPTGYFGPGGKYHYRGKYMNCTAG----------AAGYIDRLIFG-NHTYSR---- 394

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                  +S +   LR D          PEGL++++S+I    +GVH G +++      A
Sbjct: 395 -----ITNSIYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQGNA 439

Query: 321 RLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           RL +W     F  +I G+  +F +    IP++K++ TLSYV   S  A L+++ +Y L+D
Sbjct: 440 RLIRWFLWAVFTGIIAGILCNFESEGGVIPVSKRMMTLSYVLTCSSFAFLLYALLYFLID 499

Query: 377 IWNLKYPFLPLAWIGMNAMLVYV 399
            +   +   P  + G+N + +YV
Sbjct: 500 -YKQFWSGAPFIYAGINPIFLYV 521


>gi|440684188|ref|YP_007158983.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
 gi|428681307|gb|AFZ60073.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
          Length = 376

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 170/375 (45%), Gaps = 76/375 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+  G     +  + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMVGVADNKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +        K V  R L+    L   G+LL G ++       +  D+  IRL GVL
Sbjct: 61  FSLSKYTADNKPTKAVYLRILRRAAILFILGLLLNGFWNKG----VWTFDLSSIRLMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYL  SL+ +    +  K Q +              +A  +L+ Y   +    VP
Sbjct: 117 QRISLSYLFASLIVL---KLPRKSQLI--------------LAGVLLIGYWLTMMYIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           ++   ++ ++                      N   +IDR ++    +Y    +      
Sbjct: 160 EYGAGVLTREG---------------------NFGAFIDRLIIPKAQLYKGDGFN----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK-Q 324
                F G              +PEGL S++ +I+S + G   G  I   K   ++    
Sbjct: 194 -----FMG--------------DPEGLYSTIPAIVSVLAGYFTGEWIKDKKQANSQTSMD 234

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
            V  G   L+ G+   +  A P+NK+L+T SYV  T+G A ++ +A Y L+++  +K   
Sbjct: 235 LVLFGLCCLVIGII--WDVAFPINKKLWTSSYVVFTTGWALMLLAACYELIEVRVIKRWS 292

Query: 385 LPLAWIGMNAMLVYV 399
            P   +G+NA+ ++V
Sbjct: 293 KPFEIMGLNAIALFV 307


>gi|189465173|ref|ZP_03013958.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
           17393]
 gi|189437447|gb|EDV06432.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
           17393]
          Length = 369

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/395 (27%), Positives = 181/395 (45%), Gaps = 100/395 (25%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
           +P +S   +K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D 
Sbjct: 6   QPPISGYPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDL 60

Query: 76  VMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSHA 126
           V PFFLFI+G++  +AL +   +A    ++K++ RTL +L  G        + +G F   
Sbjct: 61  VFPFFLFIMGISTYIALNKFHFQASGPVIRKILKRTLVILCIGWAIHWFHFICEGDF--- 117

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                    +  +RL GVL RIAL Y  VS V ++      K + +G           W+
Sbjct: 118 -------FPLAHLRLTGVLPRIALCYCAVSFVALYV-----KPKYIG-----------WM 154

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +    L++  A+L G         I N  + D                   N +  IDR 
Sbjct: 155 IG--FLIIGYAVLLG---------IGNGYTLD-----------------STNILAIIDRN 186

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           VLG +H+YH                             +P +PEGL S++++I  T+IG 
Sbjct: 187 VLGADHLYH----------------------------KSPIDPEGLTSTLAAIAHTLIGF 218

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
             G +I+  +    +  +    GF L+  G  L  T A+PLNK++++ ++V VT G AA+
Sbjct: 219 CCGRIILAKEALEQKTLKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGLAAM 276

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + + +   +D+   K         G+N + +YV++
Sbjct: 277 LQAVLIYFIDMKEKKNWCRFFEVFGVNPLFLYVLS 311


>gi|282896863|ref|ZP_06304869.1| conserved hypothetical protein [Raphidiopsis brookii D9]
 gi|281198272|gb|EFA73162.1| conserved hypothetical protein [Raphidiopsis brookii D9]
          Length = 375

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 172/374 (45%), Gaps = 75/374 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRGL +A+MI+ + AG     +P +SHA WNGC   D V PFFLFIVGVA++
Sbjct: 1   MRLISLDVFRGLTIAMMIIANMAGVAPDVYPFLSHALWNGCTPTDLVYPFFLFIVGVAMS 60

Query: 90  LALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +    +   K V F    R + L   G+LL G ++          D++ +R+ GVL
Sbjct: 61  FSLSKYSLESKLDKFVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+L+YL+ SL+ +      +K Q              W +A  +L+ Y   +    VP
Sbjct: 117 QRISLAYLVASLIVL---KFPEKTQ--------------WALAGILLIFYWLTMMYIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   +IDR ++   H+Y    +      
Sbjct: 160 DYGAGMLTREG---------------------NFGAFIDRLIIAKPHLYAGDGFN----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                F G              +PEGL S++ +I++ + G   G  +  +  +       
Sbjct: 194 -----FRG--------------DPEGLFSTIPAIVNVLFGYFAGQWMRKSTINSHTSMDL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
           V  G   L+ G+   +    P+NK+L+T SYV  ++G   +  +A Y L+++  +K    
Sbjct: 235 VLWGLCSLVVGMI--WDGLFPINKKLWTSSYVLFSTGWGLVFLAACYDLIEVRKIKRWSK 292

Query: 386 PLAWIGMNAMLVYV 399
               IG+NA+ ++V
Sbjct: 293 GFEIIGLNAIALFV 306


>gi|410900570|ref|XP_003963769.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Takifugu rubripes]
          Length = 581

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 114/384 (29%), Positives = 178/384 (46%), Gaps = 64/384 (16%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRG A+ +M+ V++ GG +    HAPWNG  +AD VMP+F+FI+G ++ L
Sbjct: 196 RPKRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADLVMPWFVFIIGTSVVL 255

Query: 91  ALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGF---SHAPDELTYGVDVRMIRLCG 143
           A + +      R   ++K+ +RT      G+LL  GF   +++P +    V V + +   
Sbjct: 256 AFRSMQRRRVRRLQLLRKITWRT------GVLLMLGFCFLNYSPRDGPCSVLV-LAQDSW 308

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
                 L  L       +   VQD            +Y   WL+   +  ++L + +   
Sbjct: 309 SPAASGLHLLHSITPHRWWSSVQD----------VVVYWPQWLIIILLETLWLCVTFLMP 358

Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           VPD     +      D+G   N T G          A GYIDR + G N MY +P  +  
Sbjct: 359 VPDCPTGYLGAGGIGDHGLYPNCTGG----------AAGYIDRWMFGDN-MYRYPTCKEM 407

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH---- 318
              TQ                  PF+PEG+L +V+SI+   +G+  G +++  +G     
Sbjct: 408 YQTTQ------------------PFDPEGVLGTVNSIVMGFLGMQAGKILLFYRGADVHI 449

Query: 319 LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
           L R   W V +G +  I          IP+NK L++LSYV      + L+ + IY + D+
Sbjct: 450 LCRFLLWAVILGISAAILTKCTRDEGFIPVNKNLWSLSYVTCMGCFSFLLLAGIYLVTDM 509

Query: 378 --WNLKYPFLPLAWIGMNAMLVYV 399
             W    PFL   + GMN++LVYV
Sbjct: 510 KGWWAGQPFL---YPGMNSILVYV 530


>gi|226225918|ref|YP_002760024.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
 gi|226089109|dbj|BAH37554.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
          Length = 401

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 119/413 (28%), Positives = 180/413 (43%), Gaps = 86/413 (20%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFL 81
                 K +RL SLD+FRG+ VA M+LV++ G     +P + HAPW+G    D + PFFL
Sbjct: 3   SPTGSFKAERLLSLDVFRGMTVAGMLLVNNPGTWSAIYPPLQHAPWHGWTPTDLIFPFFL 62

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQG--GFSHAPDELTYGVD 135
           FIVG+   L+L+    R D  + ++ + LK    +  +G+LL G   F+  P        
Sbjct: 63  FIVGITTELSLRARRARGDDEQAILRQILKRGALIFLFGLLLAGFPFFTWPPGLPGASFG 122

Query: 136 VRMI------RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
            R+I      R+ GVLQRI ++YL  +L+   T     + Q V              + A
Sbjct: 123 ERVIDRFEHWRIMGVLQRIGVAYLCGALL---TWRTTVRQQGV--------------ILA 165

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV-GYIDRKVL 248
            +L  Y AL+    VPD                       R  L+ P   +  ++DR VL
Sbjct: 166 ALLFGYWALMTLVPVPD------------------TGVAGRFVLDKPDQLLSAWLDRTVL 207

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G+NH++                        A +W     +PEGLLS++ +I + I G   
Sbjct: 208 GVNHLW----------------------SGAKTW-----DPEGLLSTIPAIGTMICGTFA 240

Query: 309 GHVIIHTKGHL-ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           G  I   +  L  RL     +G   ++ GL  H+    P+NK ++T SYV  T+G  A+ 
Sbjct: 241 GRWIARQELTLHERLVALFAVGALAMMVGLMWHWV--FPINKSIWTSSYVVFTAGTGAVT 298

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFING---WYYGD 417
            +    L+D   L+    P    G N ML ++    G+ A  I+    W  GD
Sbjct: 299 LATCMWLIDGIGLRRWTFPFVVYGTNPMLAFL--GSGLMARCISSLWTWETGD 349


>gi|345856403|ref|ZP_08808889.1| putative membrane protein [Desulfosporosinus sp. OT]
 gi|344330527|gb|EGW41819.1| putative membrane protein [Desulfosporosinus sp. OT]
          Length = 381

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 179/389 (46%), Gaps = 100/389 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
           +EK   K  RL  +D+FRG+AVA+M++V + G    ++P++ HA WNG  +AD   PFF+
Sbjct: 8   EEKG--KFGRLNCIDVFRGIAVAIMLIVTNPGNPLRNYPQLRHAAWNGYTVADLAFPFFM 65

Query: 82  FIVGVAIALAL-KRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
            I+G+ I  A+ KRI +    +     ++ R++ L   GILL G   +         D+ 
Sbjct: 66  LIMGMVIPYAVDKRIKEGKSNLSIFNHILIRSIGLFCIGILLNGFPVY---------DLS 116

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD--QSVGRFSIFRLYCWHWLMAACVLVVY 195
           +IR+ GVLQRIA++YL   ++E+  K    K   Q +   S          +A  ++ VY
Sbjct: 117 IIRIPGVLQRIAIAYLCTGIIELIVKATVKKSYLQIIVESS----------LALSIISVY 166

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
             LL     PD++                             N V  ID   L   H+Y 
Sbjct: 167 SVLLIKYSFPDYK-----------------------------NLVQTIDLYFLK-GHLY- 195

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                                   P W     +PEG+L++ SSI + I G   G+ I+  
Sbjct: 196 -----------------------TPDW-----DPEGILTTFSSIATAIFGSIAGN-ILFN 226

Query: 316 KGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAALVFSA 370
           + + AR K+++T    + I+G+      +I     P NK L++ SYV +T+G A L  S 
Sbjct: 227 RDNKAR-KKFIT----IFIYGVVTIIVASIIQRWFPYNKNLWSSSYVLITAGIAYLTISM 281

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           ++ ++D+   K  F PL  +G N + VYV
Sbjct: 282 LFLVIDVAGFKALFKPLMILGSNPIFVYV 310


>gi|256425421|ref|YP_003126074.1| hypothetical protein Cpin_6469 [Chitinophaga pinensis DSM 2588]
 gi|256040329|gb|ACU63873.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 358

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 130/422 (30%), Positives = 184/422 (43%), Gaps = 94/422 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QRL SLD FRGL VA MILV++ G     +P + H+ WNGC   D V PFFLF+VGV++ 
Sbjct: 3   QRLLSLDFFRGLTVAAMILVNNPGSWSYVYPPLEHSKWNGCTPTDLVFPFFLFMVGVSVT 62

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            AL     +AD           +    IL   G +     L    D   +R+ GVLQRI+
Sbjct: 63  FALSS--RKADVSGHTSLIIHIIRRAAILFAIGLAF---RLIPSFDFHNLRILGVLQRIS 117

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV--LVVYLALLYGTYVPDW 207
           + +L++SL+ +       K  +  R          WL   C+  LV+Y  L+    VP  
Sbjct: 118 IVFLVISLLYL-------KTGTKPRI---------WL---CISFLVIYWLLMTVVPVP-- 156

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                      YG          A L    N   +IDR VLG  H++             
Sbjct: 157 ----------GYGP---------ANLEAETNLAAWIDRTVLGEQHLW------------- 184

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV- 326
                    K A +W     +PEGLLS++ +I + ++G+  G  +       A    W+ 
Sbjct: 185 ---------KQARTW-----DPEGLLSTLPAISTGLLGIMTGDWLRRKDVADADKVSWLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
             GF  +I GL   +    P+NK L+T S+V  T G AA+  +  Y L+D+   K    P
Sbjct: 231 AAGFLSVIAGLI--WDGFFPINKSLWTSSFVLYTGGLAAMGLALSYWLIDVQQYKSITPP 288

Query: 387 LAWIGMNAMLVYVM--AAEGIFAGFINGWY--YG---DPHN-------TLVCFLFIISYI 432
               G NA+  YV+  A   IF G   G +  YG    P N       TLV  +FI  +I
Sbjct: 289 FVAFGRNAITAYVLSGAIPMIFKGMSGGMFRAYGAFLSPFNASLAAAITLVILMFIPVWI 348

Query: 433 LH 434
           ++
Sbjct: 349 MY 350


>gi|327275365|ref|XP_003222444.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Anolis carolinensis]
          Length = 632

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 111/384 (28%), Positives = 180/384 (46%), Gaps = 54/384 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV----AI 88
            RL SLD FRGLA+ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G     ++
Sbjct: 235 HRLRSLDTFRGLALIIMVFVNYGGGKYWFFKHQSWNGLTVADLVFPWFVFIMGTSISLSL 294

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +  L+R   +   + K+++R+L L   G+++       P+     +    +R+ GVLQR+
Sbjct: 295 SSMLRRGCSKWKLLGKILWRSLLLFLIGVIIVN-----PNYCLGPLSWENLRIPGVLQRL 349

Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYV 204
           + +Y +V+++E +F K V D           R  L  W  WL    +  V+L L +   V
Sbjct: 350 SCTYFVVAVLELLFAKPVPDNSTLEIPCPALRDILPYWPQWLFMMALETVWLCLTFLLNV 409

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D+G   N T G          A  YID  +LG  H+Y HP+   S 
Sbjct: 410 PGCPNGYLGPGGIGDFGNYPNCTGG----------AAAYIDHVLLGEKHIYQHPS---SN 456

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----L 319
              Q +                 F+PEG+L +++S++   +G+  G +++  K      +
Sbjct: 457 VLYQTT---------------VAFDPEGILGTINSVIMAFLGLQAGKILLFYKDQHRQIM 501

Query: 320 ARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD-- 376
            R   W V MG    +          IP+NK L+++SYV   S    ++   IY LVD  
Sbjct: 502 LRFCIWSVVMGVISAVLTECSKDEGFIPVNKNLWSISYVTTLSSFGFVLLLLIYYLVDVK 561

Query: 377 -IWNLKYPFLPLAWIGMNAMLVYV 399
            +W+    F P    GMN++L+YV
Sbjct: 562 KVWSGSPFFFP----GMNSILIYV 581


>gi|119512372|ref|ZP_01631456.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
 gi|119462961|gb|EAW43914.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
          Length = 369

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 103/369 (27%), Positives = 174/369 (47%), Gaps = 75/369 (20%)

Query: 34  RLASLDIFRGLAVALMILVDHAG--GD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL+SLD+FRG+ +A MILV+ AG  G+ +P ++HA W+GC   D V PFFLFIVGVA++ 
Sbjct: 2   RLSSLDVFRGITIAAMILVNMAGVAGEVYPPLAHADWHGCTPTDLVFPFFLFIVGVAMSF 61

Query: 91  ALKRIPDRADAVKKVIFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           +L +  ++       IFR   +LF  G+LL G ++       +  D+  IR+ GVLQRI+
Sbjct: 62  SLSKYTEKG---YSRIFRRAAILFALGLLLNGFWNQG----IWTFDLSKIRIMGVLQRIS 114

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L+YLL SL  +   ++  K Q              W++A  +L+ Y   +    VP++  
Sbjct: 115 LAYLLASLAVL---NLPRKGQ--------------WILAGVLLIGYWLTMMYVPVPEYGA 157

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            ++ ++                      N   YIDR ++   H+Y    ++         
Sbjct: 158 GVLTREG---------------------NFGAYIDRLIIPQVHLYAGDGYQNLG------ 190

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
                             +PEGL S++ ++++ + G   G  I +              G
Sbjct: 191 ------------------DPEGLFSTIPAVVNVLAGYFTGQWIRNQPVKTRTSIGLGLFG 232

Query: 330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAW 389
              LI G    +    P+NK+L+T SYV  ++G A L+ +A Y L+++  ++        
Sbjct: 233 IGCLIIGWAWGWI--FPINKKLWTSSYVVFSNGWALLLLAACYELIEVRQIRRWSKAFEI 290

Query: 390 IGMNAMLVY 398
           +G+NA+ ++
Sbjct: 291 MGLNAIALF 299


>gi|374263976|ref|ZP_09622521.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
 gi|363535543|gb|EHL28992.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
          Length = 372

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 168/371 (45%), Gaps = 75/371 (20%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           ++R+ SLD+FRGL +ALM+LV+  G    +P + HA WNGC LAD V P FLFIVGV   
Sbjct: 5   SKRILSLDVFRGLTMALMVLVNSQGSRSIYPILDHAAWNGCTLADLVFPAFLFIVGVTTV 64

Query: 90  LALKRIPDRADAVKKVIFRT-LKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           ++L R     +A +  I+++ LK      L     +  P    + +    +R+ G+LQRI
Sbjct: 65  VSLNRQVTTNEAARLDIYKSILKRSILLFLFGLFLNAFPFH--FDLSFANLRIYGILQRI 122

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+ Y + +L+ +   +   K Q +  + I   Y W+W+               T +P   
Sbjct: 123 AICYFICALIYL---NTTVKTQIILFWGILLGY-WYWI---------------TQIPVPG 163

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
           F+                     +L+   N V Y+D+ +                     
Sbjct: 164 FS-------------------GGQLSLANNWVAYVDKMIF-------------------- 184

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                     +P   H  F+PEGL+S++S++ +T+ G+  GH ++       +      +
Sbjct: 185 ----------SPVHLHKNFDPEGLISTISAVATTLAGLITGHFLLMQLSKKKKCLLMFLV 234

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLA 388
           G A L+ G    +    P+NK L+T S+V  T G + +VF+  + ++D+       LP  
Sbjct: 235 GMAFLVLGWAWGYY--FPINKNLWTSSFVLWTGGVSLIVFALCFYIIDVLGYSKWALPFK 292

Query: 389 WIGMNAMLVYV 399
             GMNA+ +++
Sbjct: 293 IFGMNALFIFI 303


>gi|260061394|ref|YP_003194474.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
           HTCC2501]
 gi|88785526|gb|EAR16695.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
           HTCC2501]
          Length = 382

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 115/396 (29%), Positives = 183/396 (46%), Gaps = 98/396 (24%)

Query: 22  SDQQEKSHLKTQ------RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCN 71
           +DQ+E+++ + Q      R+ S+DIFRGL +ALMILV+   G W  +     HA W+G  
Sbjct: 6   NDQRERTNPEKQTKAMKERIVSVDIFRGLTIALMILVN-TPGTWEAVYAPFRHAEWHGYT 64

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
             D V PFFLFIVG +I  A +     A   +K+I RTLKL+  GI L G F+  P    
Sbjct: 65  PTDLVFPFFLFIVGTSIVFAYRNKQPDAATHRKIIVRTLKLILLGIFL-GAFTVEPP--- 120

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
           +      IR  GVLQRI + +   +L+ + T                    +  L+A  V
Sbjct: 121 FFEPFSEIRFPGVLQRIGVVFFAAALLFLHTN-------------------YKTLLAITV 161

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           +++    ++  ++P      +  +     +V             P N   YID  V G +
Sbjct: 162 VILLGYWVWMAFIP------LGGEPPSLERV-------------PNNWANYIDVHVFG-S 201

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y                     + D        ++PEGLLS++ +I S ++G+  G V
Sbjct: 202 HTY---------------------KPD--------YDPEGLLSTLPAIASALLGIFTGRV 232

Query: 312 IIHTKGHLARLKQWVTMGFALL-----IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
           ++  +   A   QW+ +  A L     I+GL        P+NK L+T S+V VT+G A L
Sbjct: 233 LVSDR---ANKTQWMLLAGAALLAAGSIWGLVF------PVNKALWTSSFVLVTAGWANL 283

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           + + IY L D+  +++  +   + G NA+ VY +++
Sbjct: 284 LLALIYYLTDVKKMQFGSI-FRYAGANAITVYFLSS 318


>gi|321463338|gb|EFX74354.1| hypothetical protein DAPPUDRAFT_129175 [Daphnia pulex]
          Length = 409

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 174/368 (47%), Gaps = 47/368 (12%)

Query: 42  RGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA 101
           R L +  MI+V++ GG +    H+PWNG  +AD + P F++I+G +  L+L     RA +
Sbjct: 29  RRLTIVFMIIVNYGGGGYWFFEHSPWNGITIADVIFPCFVWILGASCVLSLNSQLRRALS 88

Query: 102 VKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
            +++++    R++ +L  G++L    ++         +++  R+ GVLQR++  YL+V+L
Sbjct: 89  KQRILYSTVRRSVAMLVIGLVLNSLSNN---------NIKTFRIPGVLQRMSFVYLIVAL 139

Query: 158 VEIFTKDVQDKDQSVGRFSIFRLYC-W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKD 215
           +E+   D +D  +      I  + C W  W++    +   L + +   VPD         
Sbjct: 140 IELTGFDPEDNQRYAWFAPIRDIVCSWRQWIIVTVFVSTQLLITFLLPVPDCPLGYTGAG 199

Query: 216 SADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
             +   ++    G  A+L         +D  + G +H+Y  P        T  + ++  L
Sbjct: 200 GLEKNGLYRNCTGGAARL---------VDVSLFGNDHIYQRP--------TPRAIYDATL 242

Query: 276 RKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LI 334
                      F+PEG L  ++ +L   +G     V++    +  R+ +W+       L 
Sbjct: 243 ----------AFDPEGALGGLTCVLCAYLGAEAAKVLLVFPANKQRIVRWMLWALVTGLS 292

Query: 335 FGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIG 391
            G+   F      IP+NK L++LSYV VTS  A ++ + +Y  +D+ +  +   P  + G
Sbjct: 293 GGILCDFKIDDGPIPINKNLWSLSYVLVTSSIAFILLTILYVFIDVLSW-WSGAPFRYAG 351

Query: 392 MNAMLVYV 399
           MNA+L+YV
Sbjct: 352 MNALLLYV 359


>gi|334121382|ref|ZP_08495452.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
 gi|333455096|gb|EGK83757.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
          Length = 406

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 117/417 (28%), Positives = 182/417 (43%), Gaps = 118/417 (28%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R  SLD+FRG+A+A MILV++ G     +P + HA W+GC   D + PFFLFIVG A++ 
Sbjct: 2   RFKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWHGCTPTDLIFPFFLFIVGCAMSF 61

Query: 91  ALKR-----------------IPDRADAVKKVIFRTLKLL-----FWGI----------- 117
           +L +                   +++++ K  +  +L LL     +W I           
Sbjct: 62  SLSKYTQNYPQTGIETSKITQTKEKSESAKNPLPSSLFLLPYSNIYWRIARRAAILFILG 121

Query: 118 LLQGGFSHAPDELTYGVDVR---MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
           LL    S A D L     V     IR+ GVLQRI L+Y + ++  I   ++  ++Q    
Sbjct: 122 LLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFISAIAII---NLSPRNQK--- 175

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                      L+A  VL+ Y A L                      VF V      +L 
Sbjct: 176 -----------LLAVAVLLGYWAAL---------------------TVFAVGGYTAGELT 203

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           P  N  GY+DR +LG  H+Y                               PF+PEGLLS
Sbjct: 204 PEGNLGGYVDRLILGSQHLYK----------------------------GGPFDPEGLLS 235

Query: 295 SVSSILSTIIGVHFGHVI----IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQ 350
           ++ ++++ +IG   G  +    I T+  +         G + ++ G    F    P+NKQ
Sbjct: 236 TLPAVVTVLIGYFTGEWLRVQPIKTRTSI----NLAICGLSCVVIGRLWGFL--FPINKQ 289

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
           L+T SYV  T+G A L+ +  Y  +++ + K+   P   +G+NA+ ++V  A GI A
Sbjct: 290 LWTSSYVVFTAGWALLLLATCYETIEVRDWKWG-RPCEIMGLNAIFLFV--ASGIVA 343


>gi|405978397|gb|EKC42791.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Crassostrea gigas]
          Length = 549

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 86/266 (32%), Positives = 132/266 (49%), Gaps = 32/266 (12%)

Query: 2   SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
           S+I  ++   H     + + S QQ  +H K +RL SLD FRGL++ +M+ V++ GG +  
Sbjct: 185 SDISEDSGTAH-----DRNNSPQQYSTHNKRERLKSLDTFRGLSLMIMVFVNYGGGGYWF 239

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGI 117
             H PWNG  +AD V P+F+FI+G A+  +    +KR   R   + KV+ R + L F GI
Sbjct: 240 FDHPPWNGITVADLVFPWFIFIMGTAMNYSFRGMMKRGTPRYRMLYKVLRRAILLFFIGI 299

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR-FS 176
           +L   +          V+++ IR+ GVLQR +L+YL++ L E+        ++  GR +S
Sbjct: 300 VLNTNWGP--------VNLKTIRIPGVLQRFSLTYLVLGLFEVCFSRYDTPEKYQGRCWS 351

Query: 177 IFR---LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAK 232
             R   L+   W +A  +L  Y+ L +   +       I      D  K +N T G    
Sbjct: 352 SLRDILLFLPQWFLALGILAAYVCLTFLLPIGPCPTGYIGPGGLHDSSKYYNCTAG---- 407

Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPA 258
                 A  YID  VLG NH+Y  P 
Sbjct: 408 ------AAAYIDIMVLGKNHIYGKPT 427


>gi|284041413|ref|YP_003391343.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
 gi|283820706|gb|ADB42544.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
          Length = 364

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 169/380 (44%), Gaps = 90/380 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +  RL SLD  RG  +A MI+ +  G +   +  + H+ WNG +  D + PFFLFIVGV+
Sbjct: 3   QPHRLISLDAMRGFTIAAMIVANFPGSEEFVYFTLRHSRWNGLSFTDLIAPFFLFIVGVS 62

Query: 88  IALALKRIPDRADA------VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           I LA  R   RAD       ++K++ R+LK+   G+ L    +  PD      D   +R 
Sbjct: 63  IVLAYAR--KRADGSPKGPLIQKIVLRSLKIFAVGMFL----NLLPD-----FDFATLRW 111

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            G L RIA+ +L+ +L+ + T   Q                  W+    +L  +LA+   
Sbjct: 112 TGTLHRIAIVFLVCALLFLTTSWRQQA----------------WIATLTLLAYWLAM--- 152

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
           T +P          + D G+V          L P  N   ++DR+ L             
Sbjct: 153 TQIP----------TPDVGRVV---------LEPGQNLAAWLDRRYL------------- 180

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                       P R    +W     +PEG+LS+  SI++ I+G+  G +++      A+
Sbjct: 181 ------------PGRMWQGTW-----DPEGILSTFPSIVTGILGMLAGRLMVSPASQTAK 223

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           +   +T G      G     T   P+N+ L+T S+V VT+G AALV  A+Y LVD+    
Sbjct: 224 VSYLMTAGVFTAALGYFWGLT--FPVNENLWTSSFVLVTAGFAALVLGAVYFLVDVLGHT 281

Query: 382 YPFLPLAWIGMNAMLVYVMA 401
               P    G NA+ VYV+A
Sbjct: 282 SGTKPGVVFGANAITVYVLA 301


>gi|260788586|ref|XP_002589330.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
 gi|229274507|gb|EEN45341.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
          Length = 382

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 184/388 (47%), Gaps = 58/388 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD FRG+ + +M  V++ GG +  + H+ WNG  +AD V P+F++I+G + AL+ 
Sbjct: 1   RRLKSLDTFRGMCLCIMAFVNYGGGGYWFLDHSVWNGITVADLVFPWFMWIMGTSTALSF 60

Query: 93  KRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           + +  +A        K++ RT+ L   G+ +     +APD      D   IR+ GVLQR 
Sbjct: 61  RGLQRKATPKLTIFGKIVRRTITLFLLGLFI----VNAPD------DWATIRIPGVLQRF 110

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+SY  VS + +   + +     V        Y   WL   C+L V+  L +   VP   
Sbjct: 111 AVSYFAVSTMMLLHMETEWYRDLVP-------YWKQWLFVLCLLAVHTCLTFLMPVPGCP 163

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              +           N T G          A GYID  +L  +H+Y             +
Sbjct: 164 TGYLGAGGLSDLDHTNCTGG----------AAGYIDNWLLTQDHIY-----------GDE 202

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
           +P    L +   +     ++PEG+L S++SI  T +G+  G +++  + H +RL +W+  
Sbjct: 203 TPKVRILYQILVN-----YDPEGVLGSLTSIFMTFLGLQAGKILLSYEDHGSRLVRWLLW 257

Query: 329 GFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--WNLKY 382
           G  L +  + L         +P+NK L+++++V   +  A ++ S  Y LVD+  W   +
Sbjct: 258 GIGLGLLAILLCEGRQNGGWVPINKNLWSVTFVLSLASMAFILLSVYYFLVDVRRWWTGF 317

Query: 383 PFLPLAWIGMNAMLVYVMAAEGIFAGFI 410
           PF      GMN++ VY+    G+F  ++
Sbjct: 318 PFFM---AGMNSISVYL--CHGVFQKYL 340


>gi|10177926|dbj|BAB11337.1| unnamed protein product [Arabidopsis thaliana]
          Length = 384

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 86/118 (72%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 144 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 203

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQ + +
Sbjct: 204 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQNLKV 261



 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 50/98 (51%), Positives = 69/98 (70%)

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
             V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + P  GPL  DAPSW
Sbjct: 259 LKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSW 318

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
           C APF+PEGLLSS+ + ++ ++G+H+GH+IIH K +++
Sbjct: 319 CQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKVNIS 356


>gi|223940501|ref|ZP_03632350.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223890825|gb|EEF57337.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 410

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 115/414 (27%), Positives = 181/414 (43%), Gaps = 91/414 (21%)

Query: 15  IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMI----LVDHAGGDWP---------E 61
           I+S P  S  Q  S   T+RL SLD  RG  +  ++    LV      WP         +
Sbjct: 4   ILSPPLQSKPQVTSPSTTKRLLSLDALRGFDMFWIVGGEELVHALYNAWPNGPLGIINSQ 63

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGIL 118
           + H  W G    D + P F+FIVGV++  +L +   +  +A A+K+V FR+L L  +G+L
Sbjct: 64  MDHKVWQGVAFYDLIFPLFVFIVGVSLVFSLTKAIEVNGKAAALKRVFFRSLLLYVFGLL 123

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           + GG S   D          IR  GVLQRIA+ Y   SLV                F  F
Sbjct: 124 IYGGISKGIDG---------IRWMGVLQRIAICYFSTSLV----------------FCFF 158

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV-------------FNV 225
           +L     + AA +L+ Y AL+  T+VP   F  +   SA   ++              + 
Sbjct: 159 KLRG-MIVAAAALLLTYWALM--TFVP---FPDVRPASASPQEITKHNGFTNVAQLNLSS 212

Query: 226 TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHA 285
           T  +  +  P  N   Y+D+K L        P ++                     W   
Sbjct: 213 TTMLHGQFIPGVNLANYVDQKYL--------PGYK---------------------W-DG 242

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
            ++PEGLLS++ +I++ ++GV F  +++       + K  +  G  +    L   +    
Sbjct: 243 TYDPEGLLSTLPAIVTCLLGV-FAGLLLRNPNVPDQKKVLLLAGAGIAGVALGFLWGLEF 301

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P+ K+L+T SYV V  G A +  +A Y +++IW  +    P  WIGMN + +Y+
Sbjct: 302 PVIKKLWTSSYVLVAGGYACIFLAAFYQVIEIWQWRRWCTPFVWIGMNPISIYL 355


>gi|284040246|ref|YP_003390176.1| hypothetical protein Slin_5410 [Spirosoma linguale DSM 74]
 gi|283819539|gb|ADB41377.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
           linguale DSM 74]
          Length = 385

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 117/393 (29%), Positives = 178/393 (45%), Gaps = 84/393 (21%)

Query: 16  ISEP--DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNG 69
           ++EP    SD   K  + T RL SLD FRGL VA MILV++ G DW  I     HAPW+G
Sbjct: 3   VNEPIQKASDYGLKP-VGTSRLLSLDFFRGLTVAAMILVNNPG-DWGHIYAPLEHAPWHG 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
               D + PFFLFIVGV+I  AL+    +   V K++ R++ L     LL    +  P  
Sbjct: 61  WTPTDLIFPFFLFIVGVSITFALEGGKSKKGVVGKIVKRSVTL----FLLGLFLNFFPK- 115

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                D+ ++R+ GVLQRIA+ YL+ SL+ + T   Q            +LY     +  
Sbjct: 116 ----FDITLVRIPGVLQRIAVVYLVCSLIFLKTNSRQ------------QLY-----ILV 154

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            VL+ Y  L+          T++      Y           A L P  N   + D  +L 
Sbjct: 155 IVLIGYWLLM----------TVVPVPGVGY-----------ANLEPATNLAAWFDYTILT 193

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y                      K A +W     +PEG+LS++ ++ + +IG+  G
Sbjct: 194 PAHVY----------------------KPAKTW-----DPEGVLSTLPAVGTGLIGMLVG 226

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
             +  ++  +A    W+     L   G  + +    P+NK L+T SYV +  G A L  +
Sbjct: 227 TWLRSSR-PVADKVAWLFATGCLATLG-GVMWDGFFPINKALWTSSYVLLAGGLAMLGLA 284

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             Y L+D+ N +   LP    G+NA+ V+ ++ 
Sbjct: 285 LCYWLIDVQNYRRGVLPFVAFGVNAITVFFLSG 317


>gi|428319268|ref|YP_007117150.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
           nigro-viridis PCC 7112]
 gi|428242948|gb|AFZ08734.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
           nigro-viridis PCC 7112]
          Length = 406

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 124/438 (28%), Positives = 187/438 (42%), Gaps = 118/438 (26%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+A+A MILV++ G     +P + HA WNGC   D V PFFLFIVG A++ 
Sbjct: 2   RLKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWNGCTPTDLVFPFFLFIVGCAMSF 61

Query: 91  ALKR----IPDRADAVKKVIFRTLKL------------------LFWGI----------- 117
           +L +     P       K+I +  KL                  ++W I           
Sbjct: 62  SLSKYIQNYPKTGIETSKIIQKNEKLESDKNPFPSSFFLLPASNIYWRIARRAAILFILG 121

Query: 118 LLQGGFSHAPDELTYGVDVR---MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
           LL    S A D L     V     IR+ GVLQRI L+Y + ++  +   ++  ++Q    
Sbjct: 122 LLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFIGAIAIL---NLSPRNQK--- 175

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                      L+AA VL+ Y   L                      VF V      +L 
Sbjct: 176 -----------LLAAAVLLGYWGAL---------------------TVFAVGGYTAGELT 203

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           P  N  GY+DR +LG  H+Y                               PF+PEGLLS
Sbjct: 204 PEGNLGGYVDRLILGSQHLYK----------------------------GGPFDPEGLLS 235

Query: 295 SVSSILSTIIGVHFGHVI----IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQ 350
           ++ ++++ +IG   G  +    I T+  +         G + ++ G    F    P+NKQ
Sbjct: 236 TLPAVVTVLIGYFTGEWLRVQPIKTRTSI----NLAICGLSCVVIGHLWGFL--FPINKQ 289

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFI 410
           L+T SYV  T+G A L+ +A Y  +++   K+   P   +G+NA+ ++V  A GI A  +
Sbjct: 290 LWTSSYVVFTAGWALLLLAACYETIEVRGWKWG-RPFEIMGVNAIFLFV--ASGIVARIL 346

Query: 411 NGWYYGDPHNTLVCFLFI 428
              + G   N    + +I
Sbjct: 347 LKTHIGTGANAPTTYTWI 364


>gi|395803976|ref|ZP_10483217.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
 gi|395433620|gb|EJF99572.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
          Length = 423

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 110/415 (26%), Positives = 172/415 (41%), Gaps = 118/415 (28%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL SLD+FRGL + LM +V++ G DW    P + HA WNGC   D V PFF+FI+GVA+
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPG-DWGNVYPPLLHAHWNGCTPTDLVFPFFIFIMGVAV 62

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGIL------------------------------ 118
            LA+           K++ R+L++   GI                               
Sbjct: 63  PLAMPEKKYDETTFNKILIRSLRMFCLGIFFNFFGKIQLFGLDGIPLLLVRLIITFAVGY 122

Query: 119 -LQGGFSHAPDE------------LTYG--VDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
            L G FS+                L YG   +   +RL GVLQRIA+ Y +VSL+ + T 
Sbjct: 123 ALMGNFSNKLKNIFAFSILAIYIILAYGGFENYADVRLPGVLQRIAIVYFVVSLLYLKT- 181

Query: 164 DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF 223
                 +    F+   L   +W +   V V  +                           
Sbjct: 182 -----SRKTQLFTGIVLLFGYWAIMTLVPVPGIG-------------------------- 210

Query: 224 NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC 283
                  A L    N   ++D  +L   HMYH                      +  +W 
Sbjct: 211 ------EANLERGTNLAAWVDSVLLK-GHMYH----------------------ETNTW- 240

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
               +PEG+LS++ SI++ IIG+  G +++     + + ++   +G +L+ FGL      
Sbjct: 241 ----DPEGILSTIPSIVNGIIGLFIGQILLLNITKIQKAQRMGMIGTSLIFFGLIWDL-- 294

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
             P+NK ++T SYV  T+G A +  + +Y ++DI   K  F      G+N M+V+
Sbjct: 295 VFPINKSIWTSSYVLYTTGLATVCLTVLYYIIDIAEYKKGFKLFVIWGVNPMIVF 349


>gi|385809567|ref|YP_005845963.1| heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
           album JCM 16511]
 gi|383801615|gb|AFH48695.1| Heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
           album JCM 16511]
          Length = 378

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 173/380 (45%), Gaps = 81/380 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           T+RL SLD+FRG+ +  MILV++ G     +P++ HA W+GC   D + PFFLFIVGVA+
Sbjct: 4   TERLVSLDVFRGITIMGMILVNNPGTWSAVYPQLLHAEWHGCTFTDLIFPFFLFIVGVAV 63

Query: 89  ALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYG--VDVRMIRLC 142
           + +L +   +  ++K     +I RT+ L   GI+L G     P  L +G       +R+ 
Sbjct: 64  SYSLTKRKAQGGSMKSLYLNIIRRTVILFLLGIILNG----FPFGLLFGHQFSWETLRIP 119

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA+ Y + + + + T     K Q             +W  AA +L++Y A++   
Sbjct: 120 GVLQRIAIVYFVAAFLFLTT---STKFQ-------------YWFTAA-ILILYAAVMSFI 162

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            VP   +                     A   P  N   +ID+ +LG +HM     W  +
Sbjct: 163 PVPGIGY---------------------ANFEPGKNLSAWIDQMILG-SHM-----WSGT 195

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           K                       ++PEG+LS++ +I S ++G+  G+ +        ++
Sbjct: 196 KL----------------------WDPEGILSTIPAIGSAMLGIFTGNWLRSENDQKEKV 233

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
                    L++ G    F    PLNK L+T SYV  T+G A       Y  +D+  + +
Sbjct: 234 VYLFIWANVLMLAGWIWSF--WFPLNKNLWTSSYVLWTAGLALHFLGFCYWFIDVKKITW 291

Query: 383 PFLPLAWIGMNAMLVYVMAA 402
              P    GMNA+ V+ ++ 
Sbjct: 292 WTKPFLVYGMNAITVFFLSG 311


>gi|387793162|ref|YP_006258227.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
 gi|379655995|gb|AFD09051.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
          Length = 393

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 111/399 (27%), Positives = 179/399 (44%), Gaps = 93/399 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            Q  +     RL SLD+FRG  VA MILV++ G  W  I     HA WNGC   D + PF
Sbjct: 1   MQASASEPKPRLLSLDVFRGATVAAMILVNNPG-SWSNIYAPLEHAKWNGCTPTDLIFPF 59

Query: 80  FLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILL---------QGGFSHA 126
           FLFIVG++IA AL     R  + + A+K +  R+LKL   G++L         + G    
Sbjct: 60  FLFIVGISIAYALSGKKSRPEEHSAAIKSITIRSLKLFGLGLILALFPIVYFDKFGEVDV 119

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
            D++   +    +R+ GVLQRI + + +  ++ I     + K +++          W   
Sbjct: 120 WDQIV--MRFSGVRIMGVLQRIGIVFFIAGIIFI-----KAKPKTI---------AWT-- 161

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV-RAKLNPPCNAVGYIDR 245
            A  +LV+Y  L+  T+VP                      GV  A L P  N   +IDR
Sbjct: 162 -AGSLLVIYYLLM--TFVP--------------------VPGVGYANLEPETNLGAWIDR 198

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +L  +H+     W++SK                 +W     +PEGLL ++ ++ + ++G
Sbjct: 199 LILTTDHL-----WKQSK-----------------TW-----DPEGLLGTIPAVATGLLG 231

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT--LHFTNAIPLNKQLYTLSYVCVTSGA 363
              G  +     +  +    +T  F    FG+   L +    P+NK L+T S+V  T+G 
Sbjct: 232 TLCGTWMKKPMDNALK----ITWLFVFAAFGIAAGLVWDLFFPINKSLWTSSFVLYTAGL 287

Query: 364 AALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            +  F+ +Y L+D+   K    P    G+NA+  + ++ 
Sbjct: 288 GSAFFAVLYWLIDVQGYKRYTTPFVAFGVNAITAFFLSG 326


>gi|428308802|ref|YP_007119779.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
 gi|428250414|gb|AFZ16373.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
          Length = 381

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 110/377 (29%), Positives = 168/377 (44%), Gaps = 82/377 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAG--GD---WPEISHAPWNGCNLADFVMPFFLFIVGV 86
           ++RL SLD+FRG+ +A MILV+  G  GD   +P + HA WNG    D V PFFLFIVG 
Sbjct: 9   SKRLTSLDVFRGITIAGMILVNMIGVAGDKNVYPPLLHADWNGFTPTDLVFPFFLFIVGA 68

Query: 87  AIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           A+A +  +             ++I R+L L   GILL G + +         +   IR+ 
Sbjct: 69  AMAFSFSKYKHGNKPTPTVYWRIIRRSLILFALGILLNGFWEY---------NWSSIRIM 119

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRI+L+YL+ SL+ +   +V  K Q              W +AA +L+ Y   +   
Sbjct: 120 GVLQRISLTYLIASLIVL---NVPRKGQ--------------WAIAAFLLIGYWFAMSLI 162

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            VP            DYG            L    N   Y DR ++   H+Y        
Sbjct: 163 PVP------------DYG---------MGNLTREGNFGAYFDRLIIPTAHLY-------- 193

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
               +   F G              +PEGL S++ +++S + G   G  +          
Sbjct: 194 ----KGDDFNG------------MGDPEGLFSTLPAVVSVLFGYLTGDWLRQQPIKSTTS 237

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
              V +G + L+ G    F    P+NK+L+T SYV  T+G A L+ +A Y  +++   + 
Sbjct: 238 MDLVLLGLSCLVIGQVWDF--WFPINKKLWTSSYVLFTTGWALLLLAACYESIEVRQRQR 295

Query: 383 PFLPLAWIGMNAMLVYV 399
              P   +G+NA+ ++V
Sbjct: 296 WAKPFEMMGLNAISIFV 312


>gi|354568330|ref|ZP_08987495.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
 gi|353540693|gb|EHC10166.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
          Length = 384

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 164/362 (45%), Gaps = 86/362 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG----GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            RL SLD+FRG+ +A MILV+ A       +P + HA W+GC   D V PFFLFIVGVA+
Sbjct: 1   MRLTSLDVFRGITIAGMILVNTASIAEPNVYPPLLHAEWHGCTPTDLVFPFFLFIVGVAM 60

Query: 89  ALALKRIPDRADAVKK-----------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           + +  +  D     +K           +I R   L   G+LL G ++       +  D  
Sbjct: 61  SFSFSKYTDSKLHGEKEKVFVSLPYWRIIRRAAILFVLGLLLNGFWNQG----VWTFDFN 116

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-L 196
            IR+ GVLQRI+L+YLL SLV     ++  K Q              W++A  +L+ Y L
Sbjct: 117 SIRVMGVLQRISLTYLLASLVVF---NIPRKGQ--------------WILAGVLLIGYWL 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           A++   YVP           + YG       GV   L    N   YIDR ++   H+Y  
Sbjct: 160 AMM---YVP----------VSGYG------AGV---LTRDGNLGAYIDRLIIPKAHLYKG 197

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
                      +  F G              +PEGL S++ +I+S + G   G  I   K
Sbjct: 198 ----------DNYNFMG--------------DPEGLFSTIPAIVSVLAGYFAGQWIRSQK 233

Query: 317 GHLARLK-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
              ++     V  G + L+ G       A P+NK+L+T SYV  T+G A L+ +  Y L+
Sbjct: 234 QINSKTSMDLVLFGLSCLVIGGIWDL--AFPINKKLWTSSYVVFTTGWALLLLAFCYELI 291

Query: 376 DI 377
           D+
Sbjct: 292 DV 293


>gi|125981811|ref|XP_001354909.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
 gi|54643221|gb|EAL31965.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
          Length = 574

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 183/380 (48%), Gaps = 48/380 (12%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPSFLWIMGVCIPL 240

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +A    ++++R++KL   G+ L             G  +  +RL GVLQ
Sbjct: 241 SVKAQLSRGNSKARICLRILWRSIKLFAIGLCLNS---------MSGPSLEQLRLMGVLQ 291

Query: 147 RIALSYLLVSLVEIF-TKDVQDKDQSVGRFSIFR--LYCWHWLMAACVLVVYLALLYGTY 203
           R  +++L+V ++    ++  Q + Q     +I+   L+     +   ++  YL L +G  
Sbjct: 292 RFGIAFLVVGILHTLCSRREQVQPQRAWHRAIYDVCLFSGELAVLLALIAAYLGLTFGLP 351

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP      +                + A  N    A GY+D +VLG  H+Y HP    + 
Sbjct: 352 VPGCPRGYLGPGGKH---------DLAAHPNCIGGAAGYVDLQVLGNAHIYQHP----TA 398

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG-HVIIHTKGHLARL 322
               DS               + F+PEG+   + S++  ++G   G  +++HT       
Sbjct: 399 KYVYDS---------------SAFDPEGVFGCLLSVVQVLLGAFAGLTLLVHTTWQTRLR 443

Query: 323 KQWVTMGFALLIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           +  +      L+ G    F+     IP+NK L++LS+V VT   A L+ + +Y ++D+ +
Sbjct: 444 RWLLLSLLLGLVGGALCGFSREGGVIPVNKNLWSLSFVFVTVSVALLLLALLYYIIDVRD 503

Query: 380 LKYPFLPLAWIGMNAMLVYV 399
             +   P +  GMNA+++YV
Sbjct: 504 GWWSGWPFSECGMNAIIMYV 523


>gi|305665862|ref|YP_003862149.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
 gi|88710633|gb|EAR02865.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
          Length = 362

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 170/376 (45%), Gaps = 86/376 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+DIFRGL + LMILV+  G  W  +     HA W+G    D V PFFLFIVG +I
Sbjct: 3   NRVISVDIFRGLTIVLMILVNTPG-TWSSVYTPFLHAEWHGYTPTDLVFPFFLFIVGTSI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           + A ++        KK+  R+LKL+  G+ L G F+ +     +  D   IR  GVLQRI
Sbjct: 62  SFAYQKKKASTQTYKKIAVRSLKLIGLGLFL-GAFTLS---FPFIKDFADIRFPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC-VLVVYLALLYGTYVPDW 207
            + +L  ++                   +F  + W  L+  C VL+V   LL G YVP  
Sbjct: 118 GVVFLFTAV-------------------LFVNFNWKTLLGICIVLLVGYWLLMG-YVP-- 155

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLN-PPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                               G+ +  +  P N   Y+D K+ G  H Y            
Sbjct: 156 ------------------VEGIESTFDRAPNNLANYLDVKIFG-THNY------------ 184

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                    + D        ++PEG LS++ SI S + GV  G ++   K +    K  V
Sbjct: 185 ---------KPD--------YDPEGFLSTLPSIASALTGVFTGLILTSKKDN----KTMV 223

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  +++  L   +    P+NK L++ S+V VTSG A +  + +Y + D+  +++  + 
Sbjct: 224 LVGLGVVMLALGYLWHTVFPINKALWSSSFVLVTSGWANIFLALVYYISDVRQIEFGSI- 282

Query: 387 LAWIGMNAMLVYVMAA 402
             + G NA+ VY +++
Sbjct: 283 FKYAGANAITVYFLSS 298


>gi|281209662|gb|EFA83830.1| hypothetical protein PPL_02898 [Polysphondylium pallidum PN500]
          Length = 409

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 112/399 (28%), Positives = 162/399 (40%), Gaps = 99/399 (24%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFV 76
           DV           +R+ SLD  RGL +  MILVD+ GG    WP +    WNG + AD +
Sbjct: 24  DVDKDTTSKPPPKKRMLSLDTARGLTIFGMILVDNQGGPEVIWP-LKETDWNGISTADLI 82

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
            P FLFI G +I+LALK   +       +I RT+ LLF   L     +H     T+    
Sbjct: 83  FPSFLFICGFSISLALKNAKNDRPTWINIIRRTI-LLFGIQLFLNLMAHKFVFSTF---- 137

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV-LVVY 195
              R+ GVLQRI+L Y                       S   L  W   +A  +   +Y
Sbjct: 138 ---RVMGVLQRISLCYCFSCC------------------SFMLLPKWAQRVALVISATIY 176

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
           L L+Y   VP                     CG R  +   CNA GYID  +L  N ++ 
Sbjct: 177 LCLMYAYPVPG--------------------CG-RGNITRSCNAAGYIDNLILRKNMIH- 214

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
                                         P +PEG +S+ S+ ++T +GV  G ++   
Sbjct: 215 ------------------------------PTDPEGFISTFSAFITTWMGVELGRILTTH 244

Query: 314 --HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
                G    L +W+++G    + GL L  TN I  NK +++ S+  +T    AL  SA+
Sbjct: 245 ARSADGWKDILIRWLSIGMVCAMIGLFLDATNVIQFNKIIWSFSFAMLTVACGALFLSAL 304

Query: 372 YALVDIWNLKYPFL----------PLAWIGMNAMLVYVM 400
           Y  +D+   K+P            P  WIG N + +Y +
Sbjct: 305 YYSMDV--AKWPETVRHYIEIAAQPFIWIGTNPITIYTL 341


>gi|393783262|ref|ZP_10371437.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669541|gb|EIY63029.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
           CL02T12C01]
          Length = 365

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 176/386 (45%), Gaps = 80/386 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QRL SLD+ RG+ VA MILV++AG     +  + HA W+G   AD V P F+F++G++  
Sbjct: 9   QRLLSLDVLRGITVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGISTY 68

Query: 90  LALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           ++L++   +    + K+I R   L+  GI ++    H+ +   +  D   +R+ GV+QR+
Sbjct: 69  ISLRKYNFQWQLTIGKIIKRAFLLILIGIAMKW-LIHSFETGIWN-DWEHMRILGVMQRL 126

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            + Y + +++ +F            RF                L + L LL G ++    
Sbjct: 127 GICYGITAVMALFIPH--------KRF----------------LPIALLLLIGYFILQLA 162

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                K                     P N +  +D  VLG +HMY            Q 
Sbjct: 163 GNGFEKS--------------------PDNIMAIVDSTVLGTSHMY-----------LQG 191

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
             F                EPEG+LS++ ++   +IG   GH++I+ K +  R++Q   M
Sbjct: 192 RQF---------------VEPEGILSTIPAVAQVMIGFVCGHMLINRKDNQERMQQLFFM 236

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLA 388
           G  LL  G  L +  A PLNK+L++ S+V VT G AAL  + +  ++D+   K       
Sbjct: 237 GTLLLFAGFLLSY--ACPLNKRLWSPSFVLVTCGIAALALAVLIEIIDVRKKKEWCTFFK 294

Query: 389 WIGMNAMLVYVMAAEGIFAGFINGWY 414
             G+N +L+YV AAE IF      W+
Sbjct: 295 VFGVNPLLLYV-AAE-IFGDLFRTWH 318


>gi|374309722|ref|YP_005056152.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358751732|gb|AEU35122.1| hypothetical protein AciX8_0773 [Granulicella mallensis MP5ACTX8]
          Length = 377

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 177/388 (45%), Gaps = 71/388 (18%)

Query: 46  VALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDR 98
           +A MILV   G     +P++ HA WNG    D + P FL I+GVA+  +    ++R  DR
Sbjct: 3   IAGMILVTDPGTYSAVYPQLMHAQWNGATATDMIFPSFLVIIGVAMTFSFASRIERGADR 62

Query: 99  ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              +  V+ R++ L+F G+L+ G     P+      ++  IR+ G+LQRIAL Y   SL+
Sbjct: 63  RQILWHVLTRSVLLIFLGLLVNG----FPEY-----NLHTIRIPGILQRIALCYFAGSLL 113

Query: 159 EIFT---KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD 215
            +     KD   + Q + R ++           A +LV+Y  LL G  VP +        
Sbjct: 114 YLAVSGKKDANTESQRLRRGTVIGAVL------AGLLVLYWVLLKGYPVPGFG------- 160

Query: 216 SADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
                           +L+   N   Y DRK+ G+ H++ +            +P  G  
Sbjct: 161 --------------SGRLDSLGNVAAYFDRKIFGVQHLWAYGL----------TPGYG-- 194

Query: 276 RKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIF 335
                      F+PEGLLS++ ++ + + GV  G  +   +    +       G AL++ 
Sbjct: 195 ---------VTFDPEGLLSTLPALATLLFGVLAGEWLRTNQARGRKALVLAVAGVALVLV 245

Query: 336 GLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAM 395
           GL L  +  +PLNK++ T ++   + G A L+F+  Y ++D+   +   +PL   G NA+
Sbjct: 246 GLAL--SPLLPLNKKILTSTFAIFSGGVALLLFAGFYFVLDVKRWRRGVMPLLVFGTNAI 303

Query: 396 LVYVMAAEGIFAGFINGWYYGDPHNTLV 423
             +V+++  I    ++ W+      TLV
Sbjct: 304 FAFVVSS--IITTLLDRWHLALGDGTLV 329


>gi|300865789|ref|ZP_07110543.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300336202|emb|CBN55698.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 376

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 113/384 (29%), Positives = 174/384 (45%), Gaps = 90/384 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+A+A MILV++ G     +P + HA WNGC   D + PFFLF VG A++ 
Sbjct: 2   RLTSLDVFRGIAIASMILVNNPGSWDYVYPPLDHAEWNGCTPTDLIFPFFLFAVGAAMSF 61

Query: 91  ALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVR---MIRLCGV 144
           +L +  +    +  V +R L+   LLF   LL   FS   D L  G  +     IR+ GV
Sbjct: 62  SLSKYTEENPPISTVYWRILRRATLLFLLGLLLNSFSIFLDVLLNGSPIENFGKIRILGV 121

Query: 145 LQRIALSYLL--VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           LQRI+L+Y L  ++++ + +++++                   ++AA +L+ Y   L   
Sbjct: 122 LQRISLAYFLAAIAILNLSSRNLR-------------------ILAATLLLGYWGALTLI 162

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            VP             YG            L P  N   YIDR +LG  H+Y        
Sbjct: 163 PVP------------GYGANL---------LTPEGNLGAYIDRLILGTQHLYR------- 194

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                                   ++PE LL ++ +I +T++  +F    +  +    R 
Sbjct: 195 ---------------------QGQYDPESLLGTLPAI-ATVLAGYFTTQWLRVQPIKTR- 231

Query: 323 KQW--VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--W 378
             W  V  G A L  G    F    P+NKQL+T SYV +T+G A L+ +  Y  +++  W
Sbjct: 232 TTWNLVIFGLASLTIGQLWGF--WFPINKQLWTSSYVLLTAGWAILLLAICYETIEVRRW 289

Query: 379 NLKYPFLPLAWIGMNAMLVYVMAA 402
               PF     +G+NA+ V+V + 
Sbjct: 290 QWGRPF---EIMGLNAIFVFVASG 310


>gi|333031144|ref|ZP_08459205.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332741741|gb|EGJ72223.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 363

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 97/375 (25%), Positives = 173/375 (46%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL SLD+ RG+ VA MILV++ G    ++  + HA W+G N AD V P F+F++G++ 
Sbjct: 8   SKRLLSLDVLRGITVAGMILVNNTGSCGYNYTALRHASWDGLNFADLVFPMFMFMMGIST 67

Query: 89  ALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L++   ++  A  K+  RT  L+  G+ ++   +     +  G+++  +RL GV+QR
Sbjct: 68  YISLRKYENNKKTAFYKIFKRTSLLIIIGLFMECIITW----IEVGLNLSTLRLMGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           + L Y                    G  ++  LY  H      +L + L++L G ++   
Sbjct: 124 LGLCY--------------------GITALLSLYVPH----KYLLKIALSVLLGYFIIQI 159

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
             +  +K +                     N +G +DR VLG+NH+Y            Q
Sbjct: 160 VGSGFDKSAE--------------------NVIGVVDRSVLGVNHIY-----------LQ 188

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
              F                +PEG+LS++ +I   +IG   G  I+  + H  ++     
Sbjct: 189 GKQF---------------VDPEGVLSTLPAIAQVMIGFFCGRKILEKREHKQQMLILYR 233

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
           +G   L  G    F+   P+NK+L++ ++V VTSG A +  S +   +DI   K+     
Sbjct: 234 LGSLFLFVGFV--FSYVCPINKRLWSPTFVLVTSGVACMALSLLIDTLDIKQKKHWSRFF 291

Query: 388 AWIGMNAMLVYVMAA 402
              G N +++YV+A+
Sbjct: 292 EVFGANPLILYVVAS 306


>gi|449664780|ref|XP_002169793.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Hydra magnipapillata]
          Length = 369

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 175/365 (47%), Gaps = 61/365 (16%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKK 104
           MI V++ GG +   SH+ WNG  +AD + P+F+FI+G +I +++    K++  R   V K
Sbjct: 1   MIFVNYGGGGYYFFSHSSWNGLTVADLLFPWFIFIMGSSIYISMHSLRKKLSKRKMTV-K 59

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
           +I+R+ KL                 L  G D+   RL GVLQR A+SY +V+LV ++   
Sbjct: 60  IIYRSFKL-----------LLLGLFLNNGFDLANWRLPGVLQRFAISYFVVALVFLWFDS 108

Query: 165 VQDKDQSVGRFSIFR--LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGK 221
             ++ ++    ++FR   + +  ++   +L +YL ++Y   VP             D+G 
Sbjct: 109 PNEESETNSWKNMFRDVWFPFQHIVMLLLLTIYLLIIYLLNVPGCPKGYFGPGGDGDHGA 168

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
               T G          A GY+DR V G+NH+Y +P  +    C                
Sbjct: 169 YEKCTGG----------ASGYVDRTVFGLNHIYKNPTCKSLYNCFT-------------- 204

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK---GHLARLKQWVTM--GFALLIFG 336
                ++PEGLL ++ SIL T +G+     ++  K   GH+ R   W  +    A+ + G
Sbjct: 205 -----YDPEGLLGTIPSILLTYLGLQAARTLLFYKSKNGHIIRWFIWSVLLGALAVGLCG 259

Query: 337 LTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI---WNLKYPFLPLAWIGMN 393
            TL+   AIP+NK L++LSY+      A L+    Y LVD+   WN      P  + GMN
Sbjct: 260 GTLN-DGAIPINKNLWSLSYIFAMGSTAYLLLLICYVLVDVLKWWNGA----PFYYAGMN 314

Query: 394 AMLVY 398
           ++L+Y
Sbjct: 315 SILLY 319


>gi|428300562|ref|YP_007138868.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
 gi|428237106|gb|AFZ02896.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
          Length = 402

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 175/400 (43%), Gaps = 102/400 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG----GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            RL SLD+FRG  +A MILV+         +P + HA W+GC LAD V PFFLFIVGVA+
Sbjct: 1   MRLTSLDVFRGATIAGMILVNMVSLAEPNVYPALLHADWHGCTLADLVFPFFLFIVGVAM 60

Query: 89  ALALKRIPD-------RADAVK--------------------KVIFRTLKLLFWGILLQG 121
           + +  +  D         DA+                     K IFR   +LF   L   
Sbjct: 61  SFSFAKYTDVIPKVEKEKDAIGALQQFLAKESSAAGGAKPPYKKIFRRGAILFALGLFLN 120

Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
            F ++ + L Y  D   +R+ GVLQRIAL+YL  SL+ +    +  K Q           
Sbjct: 121 LFWNSKN-LPY-FDFSTLRIMGVLQRIALTYLFASLIVL---KLPKKAQ----------- 164

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
              W++A  +LV Y  L+    +P++    I   + ++                      
Sbjct: 165 ---WIVAGVLLVGYWLLMMYVPIPEYGAGEIGTRTGNFA--------------------A 201

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           YIDR ++   H+Y    +                            +PEGL S++ +I+S
Sbjct: 202 YIDRFIIPKAHLYKGDGFNNFG------------------------DPEGLFSTIPAIVS 237

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFG---LTLHFTNAIPLNKQLYTLSYVC 358
            ++G +F    I ++     +K   ++G AL   G   +   F    P+NK+L+T SYV 
Sbjct: 238 -VLGGYFSGQWIRSQ----PVKTRTSLGLALAGIGCLIIAWLFALDFPINKKLWTSSYVI 292

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
            T+G A L+ +  Y L+++  ++    P   +GMNA+ ++
Sbjct: 293 FTTGWALLLLAGCYELIEVQKIRSWAKPFEVLGMNAIAIF 332


>gi|149280688|ref|ZP_01886799.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
 gi|149228553|gb|EDM33961.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
          Length = 385

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 112/377 (29%), Positives = 163/377 (43%), Gaps = 78/377 (20%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           RL SLD FRG  VA MILV++ G DW  I     HA W+GC   D V PFFLFIVGV+IA
Sbjct: 13  RLLSLDFFRGATVAAMILVNNPG-DWGHIYAPLEHADWHGCTPTDLVFPFFLFIVGVSIA 71

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMIRLCGVLQRI 148
            A+        +  K I + LK       L    S  P+  +  V+  + +R+ GVLQRI
Sbjct: 72  YAMGSKKTDPSSHGKTILKALKRTLILFGLGLFLSLFPNVFSNPVEAFQQVRIPGVLQRI 131

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+ + + S+  IF K  +         +IFR           +L  Y A++    VP   
Sbjct: 132 AVVFFICSI--IFLKSSER--------TIFRTMV-------IILAAYWAIMTFIPVPGTG 174

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
           F  + K++                     N   +IDR V    H+     W+ SK     
Sbjct: 175 FPNLEKET---------------------NLGAWIDRGVFTEAHL-----WKSSKT---- 204

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV-T 327
                             ++PEGLLS++ +I + + G+  G  +       A    W+ +
Sbjct: 205 ------------------WDPEGLLSTLPAIATGLFGILVGSYLKRKDIEPATKIAWLFS 246

Query: 328 MGFALLIFGL--TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
            G A    GL   L F    P+NKQL+T S+V  T G A  + S  Y ++D+        
Sbjct: 247 TGAAATALGLLWDLQF----PINKQLWTSSFVLYTGGLATTILSLSYWIIDVQQYNRFTK 302

Query: 386 PLAWIGMNAMLVYVMAA 402
           P    G+NA+ V+ ++ 
Sbjct: 303 PFVVYGVNAITVFFLSG 319


>gi|255038072|ref|YP_003088693.1| hypothetical protein Dfer_4326 [Dyadobacter fermentans DSM 18053]
 gi|254950828|gb|ACT95528.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 368

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 172/385 (44%), Gaps = 88/385 (22%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFL 81
            E   + + RL SLD  RG  +A MI+V+  G +   +P + H+ WNG    D + P FL
Sbjct: 1   MENPSVPSSRLLSLDAMRGFTIAAMIMVNFPGHEDYVFPTLRHSKWNGLTFTDLIAPTFL 60

Query: 82  FIVGVAIALAL--KRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FIVGV+I LA   KR+ +  ++   +K++ R+LK+   G+ L    +  PD      +  
Sbjct: 61  FIVGVSITLAYSKKRLSNAPKSGLYRKIVIRSLKIFAVGMFL----NMLPD-----FNFS 111

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            +R  G L RIA+ +L+ +++ + T   Q    +VG                 +LV+Y  
Sbjct: 112 DLRYTGTLHRIAIVFLVCAILFLNTSWKQQLGIAVG-----------------ILVLYWL 154

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
            L G   P              GKV          L P  N   ++D++ L         
Sbjct: 155 ALTGIPTP------------GIGKVM---------LEPGVNLAAWVDQQYL--------- 184

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                           P +    +W     +PEG+LS+  +I +TI G+  G +++    
Sbjct: 185 ----------------PGKMWQGNW-----DPEGILSTFPAIATTITGILAGRLMLLPFS 223

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
              +    +T GFA    G   +F N I P+N+ L+T S+V VTSG A+++F A+Y L+D
Sbjct: 224 PNEKSNFLLTAGFATAALG---YFWNLIFPVNENLWTSSFVLVTSGFASMLFGALYFLID 280

Query: 377 IWNLKYPFLPLAWIGMNAMLVYVMA 401
           I        P    G NA+  YV+A
Sbjct: 281 IRGRTAGIAPGVIFGANAIAAYVLA 305


>gi|431796483|ref|YP_007223387.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
 gi|430787248|gb|AGA77377.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
          Length = 369

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 111/381 (29%), Positives = 173/381 (45%), Gaps = 86/381 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           +  QR+ +LD+FRG+ +  MILV++ G  W  +     HA W+GC   D + PFFLFIVG
Sbjct: 1   MPKQRILALDVFRGITIFAMILVNNPG-SWSHVYAPLLHAKWHGCTPTDLIFPFFLFIVG 59

Query: 86  VAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           VAI L+    LK+   +   ++K + R LKL+                + Y  D+  +R 
Sbjct: 60  VAIELSLGGQLKKGTPKGFLLRKSLIRALKLIG--------LGLLLTAIPY-FDLAHLRF 110

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI L Y + +++ ++             FS   L   +WL   C+          
Sbjct: 111 PGVLQRIGLVYFISTVMYLYW------SPKALVFSSGILLIGYWL---CM---------- 151

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
           T++P                   V     A L P  N   +ID++VL   HM     W +
Sbjct: 152 TFIP-------------------VPGIGPANLEPGTNLAAWIDQQVL-TGHM-----WSQ 186

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
           +K                       ++PEGL S++ +I++ ++GV  G ++     H AR
Sbjct: 187 TKT----------------------WDPEGLFSTLPAIVTCLLGVACGKILTGNSSHKAR 224

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           L +W   G  L+  GL   ++   PLNK L+T S+V  T+G A L  +A Y ++D+   K
Sbjct: 225 LTKWGIAGVTLVFGGLA--WSLFFPLNKALWTSSFVLYTAGWAFLGLAACYWILDVKGWK 282

Query: 382 YPFLPLAWIGMNAMLVYVMAA 402
              LP    GMNA+ V+ ++ 
Sbjct: 283 KWSLPFVIYGMNAITVFFLSG 303


>gi|116621919|ref|YP_824075.1| hypothetical protein Acid_2804 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116225081|gb|ABJ83790.1| conserved hypothetical protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 367

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 124/434 (28%), Positives = 193/434 (44%), Gaps = 105/434 (24%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD FRG  +ALM+LV++AG     + ++ H+PW+G  + D V P FL+IVGVAI L
Sbjct: 12  RLVSLDAFRGATIALMVLVNNAGSGLDSYRQLEHSPWHGWTITDTVFPSFLWIVGVAITL 71

Query: 91  AL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           +L KR+ +   R+  + +++ R   L  +G+ +   F H         D+   R+ GVLQ
Sbjct: 72  SLGKRVAEGVPRSHLLPQILRRAAILFVFGLFVY-AFPH--------FDLGTQRILGVLQ 122

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIA+ YL  S++ +++  V+ +        I  L   +W+M   + V             
Sbjct: 123 RIAICYLAASVIFLYS-GVRGQI-----LWILGLLAAYWMMMTLIPV------------- 163

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                       YG           +L+   N   YID   LG  H YH           
Sbjct: 164 ----------PGYGP---------GRLDVEGNFAHYIDHLALG-RHNYH----------- 192

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                         +W     +PEGL+S++ +I + + GV  GH I+  +  LA    W+
Sbjct: 193 -----------STRTW-----DPEGLVSTLPAIATALFGVLAGH-ILRCRRTLAERTSWM 235

Query: 327 -TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
            T G  LL  GL    T  +P+NK+L+T S+    +G    VF+    L+D    + P  
Sbjct: 236 FTAGSLLLAAGLIC--TAWLPINKKLWTDSFCLFMAGLDFTVFAFFAWLIDGQGWRRPVK 293

Query: 386 PLAWIGMNAMLVYVMAAEGI-----FAGFINGWY------YGDPHN-------TLVCFLF 427
           PL  +GMN++ +Y M +EG+      AG     Y         P N         V  ++
Sbjct: 294 PLVVLGMNSIAIY-MVSEGVAEFLDAAGLQKPIYQRVFVPLASPANASLLYSLAFVAAMY 352

Query: 428 IISYILHSFLWELR 441
           ++++ LH   W LR
Sbjct: 353 LVAWFLHRRRWFLR 366


>gi|270339962|ref|ZP_06203500.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270333113|gb|EFA43899.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 389

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 118/426 (27%), Positives = 184/426 (43%), Gaps = 103/426 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           QRL SLD+ RGL V LMI V++  G   +  + H+ WNG  L D V PFFLF+VGV+  L
Sbjct: 18  QRLISLDVLRGLTVMLMIFVNNGAGTQIFSPLRHSRWNGMTLCDLVFPFFLFMVGVSTYL 77

Query: 91  ALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +L++      A  ++K+  RT  L   G+ +   F  A +     +D+  +R+ GV+QRI
Sbjct: 78  SLRKSNFAWSAKTLRKIARRTALLFLIGLTIN-WFDMACNG--SPLDLAHLRIMGVMQRI 134

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           AL Y   + V I +  V             RL+   WL+A  ++   L L+ G       
Sbjct: 135 ALCYGATAFVAILSSKVPQ-----------RLHLIPWLIAVLLIAYSLLLIIG------- 176

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                    DY              +   N +  +D  +LG +H+YH             
Sbjct: 177 ------GGYDY--------------SSATNLLAIVDTHILGYDHLYH------------- 203

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA---RLKQW 325
                           +P +PEGLLS++ +I  T+IG     + I  +G  +    ++ +
Sbjct: 204 ---------------RSPVDPEGLLSTLPAIAHTLIGFWVARLTIGKQGSHSTQNTVRMF 248

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
           +  G  L++ G+ L  T  +PLNK++++ SYV  T G A+     +  L++       FL
Sbjct: 249 LLAGAMLVVSGIIL--TLLLPLNKRIWSPSYVLTTCGLASFTQGLLVYLIEDTRQTASFL 306

Query: 386 P----LAW-----IGMNAMLVYV--------MAAEGI----FAGF----INGWYYGDPHN 420
                 AW      G N + +YV        + A GI    FAG      NG++    + 
Sbjct: 307 NKDKGRAWTFFLIFGSNPLFLYVASEVLAISLGAAGIKDAVFAGLQTIITNGYWASAAYA 366

Query: 421 TLVCFL 426
           TL   L
Sbjct: 367 TLFVLL 372


>gi|255534024|ref|YP_003094396.1| hypothetical protein Phep_4143 [Pedobacter heparinus DSM 2366]
 gi|255347008|gb|ACU06334.1| conserved hypothetical protein [Pedobacter heparinus DSM 2366]
          Length = 384

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 168/380 (44%), Gaps = 78/380 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           K  RL SLD FRG  VA MILV++ G DW  I     HA WNGC   D + PFFLFIVGV
Sbjct: 9   KPVRLLSLDFFRGATVAAMILVNNPG-DWGHIYAPLEHAEWNGCTPTDLIFPFFLFIVGV 67

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMIRLCGVL 145
           +IA A+        +  K I + LK       L    S  P   T  ++  + +R+ GVL
Sbjct: 68  SIAYAMGGKKADPSSHGKTIVKALKRASILFGLGLFLSLFPKVFTAPLEAFQQVRIPGVL 127

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIA+ +L+ ++  IF K+ +         +IF++          +L VY AL+    VP
Sbjct: 128 QRIAVVFLISAI--IFLKNTEK--------NIFKILL-------AILAVYWALMTFIPVP 170

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
              +  + K++                     N   ++DR +L   H++           
Sbjct: 171 GVGYANLEKET---------------------NLGAWLDRSILTEAHLW----------- 198

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                      K A +W     +PEG+LS++ +I + + G+  G  +       A    W
Sbjct: 199 -----------KSAKTW-----DPEGILSTLPAIATGLFGILVGVYLKRKDVDAATKISW 242

Query: 326 V-TMGFALLIFGL--TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
           +   G A +  GL   L F    P+NK L+T S+V  T G A ++ S  Y ++D+     
Sbjct: 243 LFCTGCAAVALGLLWDLQF----PINKSLWTSSFVLYTGGLATMILSLCYWIIDVQQYNR 298

Query: 383 PFLPLAWIGMNAMLVYVMAA 402
              P    G+NA+ V+ ++ 
Sbjct: 299 FTKPFVVYGVNAITVFFLSG 318


>gi|410664067|ref|YP_006916438.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
           21679]
 gi|409026424|gb|AFU98708.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
           21679]
          Length = 390

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 182/380 (47%), Gaps = 77/380 (20%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           RL S+D+ RGLA+A M+LV++ G  W  +    +HA W+G    D + P FL++VG++I 
Sbjct: 15  RLMSVDVLRGLAIAAMVLVNNPG-SWSHVYAPLAHAEWHGWTPTDVIFPLFLYVVGLSIV 73

Query: 90  LALK----RIPDRAD---AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           LA K     +P R+    A K         LF+    +  FS   D+L   +DVR++   
Sbjct: 74  LAQKGETFALPGRSTWLRAAKLFGLGLFLALFYFPFAKPEFSWWRDQL---LDVRIL--- 127

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIAL YL    +    +  Q                  WL+    LV ++ L Y  
Sbjct: 128 GVLQRIALVYLACCYLAWLCQKRQ------------------WLLWLATLV-FMWLAYAL 168

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                  +I   D  D G+++      R +L    +   ++D+ +LG  H+Y+  A    
Sbjct: 169 -----MLSIPYAD--DTGEIY------RGQLVFGNHFSAWLDQLLLGREHLYYQTA---- 211

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                  PF               F+PEGLL+++ +I S ++GV  G + +   GH +RL
Sbjct: 212 ------QPFA--------------FDPEGLLTTLPAISSGLLGVLAG-LQLKAAGHSSRL 250

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
           + W   G  +L+ G  LH     P+NK L+T S+V VT+G + L+ +++Y L D+   + 
Sbjct: 251 EIWFAGGVLMLVAGQLLH--PVCPINKALWTPSFVLVTAGVSQLLLASLYWLCDVRGHQR 308

Query: 383 PFLPLAWIGMNAMLVYVMAA 402
              PL   G+NA+ ++++A 
Sbjct: 309 WLSPLLVFGVNAIALFMLAG 328


>gi|332664355|ref|YP_004447143.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332333169|gb|AEE50270.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 438

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 112/416 (26%), Positives = 176/416 (42%), Gaps = 93/416 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD+ RGL +A MILV++ G DW  +     HA W+GC   D+V PFFLF+VGVA
Sbjct: 4   SNRLLSLDVMRGLTIAGMILVNNPG-DWGNVYGPLLHADWHGCTPTDWVFPFFLFMVGVA 62

Query: 88  IALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I LAL +  D  + +    +K+I R+L ++  G+ L    +H     T       +    
Sbjct: 63  IPLALGKRKDEGEDLRKIYRKIISRSLIIIGLGLFLT---AHPTFYFTDKTSPWYV---- 115

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH----------WLMAACVLV 193
               + L  +  ++V +FT++V ++ Q       F+   W           W   AC++V
Sbjct: 116 ----VHLIIMATAMVAVFTREVLNQKQ-------FQTETWQQRRKWVSYLAWSAIACMVV 164

Query: 194 ----------------------VYLA--LLYGTYVPDWQFTIINKDSADYGKVFN---VT 226
                                 VYLA   L+    P  Q          Y  +     V 
Sbjct: 165 LGIFYYDFSHMRFPGVLQRIGLVYLACGFLFLKASPRMQLLTGVGLLLLYWGLMTLVPVP 224

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
            G+   L    N   ++DR +   NH+     W   K                       
Sbjct: 225 GGIAPNLEAETNLGAWLDRAIFSTNHL-----WAAVKT---------------------- 257

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           ++PEGLLS++ +I + I G+  G  +   K    ++   + +G   L+F L L +    P
Sbjct: 258 WDPEGLLSTIPAIGTGIAGMLAGEWVRSEKSDYEKVSGLLAVG--ALLFALGLIWNEFFP 315

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           LNK+++T SYV   +G + L    IY LVDI   K    P    G+NA+  ++++ 
Sbjct: 316 LNKKIWTSSYVVYMAGISLLFLGTIYWLVDIKGWKGWIAPFQIYGVNALFAFLLSG 371


>gi|294777712|ref|ZP_06743163.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|319640295|ref|ZP_07995020.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
 gi|294448780|gb|EFG17329.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|317388070|gb|EFV68924.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
          Length = 372

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 104/393 (26%), Positives = 168/393 (42%), Gaps = 90/393 (22%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E D    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   KELDTETAQQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPTLTFILLVSYTV-------ILCMGNGYTCDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG   
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILMSIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + +   +DI N +         G+N + +YV++
Sbjct: 282 ATLMYYIDIRNKQKWCRFFIIFGVNPLFLYVLS 314


>gi|365877201|ref|ZP_09416706.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
 gi|442587874|ref|ZP_21006688.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
 gi|365755061|gb|EHM96995.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
 gi|442562373|gb|ELR79594.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
          Length = 400

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 106/395 (26%), Positives = 170/395 (43%), Gaps = 90/395 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +K+ R  SLD+FRG  VALMILV++ G     +P + HA W+GC   D V PFFLF VG 
Sbjct: 1   MKSARYYSLDVFRGATVALMILVNNPGTWSAIYPPLEHAKWHGCTPTDLVFPFFLFAVGN 60

Query: 87  AIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQ--GGFSHAPDELTY----GVDVRM 138
           A+   + +      +V  KKVI RTL +   G+ L     F    D L++      D   
Sbjct: 61  AMTFVIPKFQQHNSSVFWKKVIKRTLLIFGIGLFLNWCPFFQWDHDSLSFISWESSDENG 120

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIA++Y   S++  + K+              ++  W   ++  +LV+Y   
Sbjct: 121 VRIMGVLQRIAIAYFFASVIAYYFKE--------------KMVLW---ISGALLVIY--- 160

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
                   W  T+    +  Y         +   +  P      ID  +LGI H Y    
Sbjct: 161 --------WLLTLFLGGTDPY--------SLEGFIGVP------IDHSILGIAHEYKG-- 196

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-----I 313
                        EG            PF+PEGL S++ +I   + G   G+ I     I
Sbjct: 197 -------------EG-----------VPFDPEGLFSTIPAISQVLFGYLIGNYIQKKGNI 232

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAALVF 368
              G   +     +M   L I G+   F + +     P NK++++ SY  +T+G A +V 
Sbjct: 233 QWFGKSLKENSIYSMLSGLFILGIIALFISYVWQLDFPYNKKIWSSSYTLLTTGLAIMVL 292

Query: 369 SAIYALVDIWNLKYPFLPLAWI-GMNAMLVYVMAA 402
             +   ++I  ++   +    + G N + +YV++ 
Sbjct: 293 GVLIWFIEILEIRNGLMKFFDVFGKNPLFIYVISG 327


>gi|262406057|ref|ZP_06082607.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294648122|ref|ZP_06725665.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
 gi|294806856|ref|ZP_06765681.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510562|ref|ZP_08790129.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
 gi|229443274|gb|EEO49065.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
 gi|262356932|gb|EEZ06022.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636506|gb|EFF54981.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
 gi|294445885|gb|EFG14527.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 361

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 174/375 (46%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K                  +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHKK--------------FMPLAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG +D  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGIVDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G A L  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIATLSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FETFGANPLVIYVFS 303


>gi|345517324|ref|ZP_08796801.1| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
 gi|345457717|gb|EET14395.2| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
          Length = 365

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 104/392 (26%), Positives = 168/392 (42%), Gaps = 90/392 (22%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
           E D    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D 
Sbjct: 5   ELDTETAQQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCDL 63

Query: 76  VMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PDE 129
           V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D 
Sbjct: 64  VFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGDF 120

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           L +      +R+ GVLQRIAL Y ++S   +F                            
Sbjct: 121 LPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------- 151

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
                        ++P   F ++   +        + C          N +  IDR++ G
Sbjct: 152 ------------KFIPTLTFILLVSYTV-------ILCMGNGYTCDESNILSIIDRQLFG 192

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y                              +P +PEG +S++S+I  T IG   G
Sbjct: 193 EAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCG 224

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
             II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  +
Sbjct: 225 KWIIQSHQTENKVLRLFLTGFILMSIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSLA 282

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
            +   +DI N +         G+N + +YV++
Sbjct: 283 TLMYYIDIRNKQKWCRFFIIFGVNPLFLYVLS 314


>gi|146302702|ref|YP_001197293.1| hypothetical protein Fjoh_4975 [Flavobacterium johnsoniae UW101]
 gi|146157120|gb|ABQ07974.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 423

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 116/418 (27%), Positives = 175/418 (41%), Gaps = 124/418 (29%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL SLD+FRGL + LM +V++ G DW    P + HA W+GC   D V PFF+FI+GVA+
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPG-DWGNVYPPLLHAEWHGCTPTDLVFPFFIFIMGVAV 62

Query: 89  ALALKRIPDR---ADAVKKVIFRTLKLLFWGIL--------------------------- 118
            LA+   PD+   +    K++ R+L++L  GI                            
Sbjct: 63  PLAM---PDKFYDSTTFNKILVRSLRMLCLGIFFNFFGKIQLFGLEGIPLLIGRLAITIA 119

Query: 119 ----LQGGFSHAPDE------------LTY-GVDV-RMIRLCGVLQRIALSYLLVSLVEI 160
               L G FS                 L Y G++    +RL GVLQRIA+ Y +VSL  +
Sbjct: 120 VGYALMGSFSSKVKNILAFSILFIYLFLAYSGIEAYHDVRLPGVLQRIAIVYFVVSL--L 177

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
           + K  Q      G                 +L+ Y A++    VP               
Sbjct: 178 YLKTSQRTQIITG---------------IVLLLGYWAIMTLIPVPGIG------------ 210

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
                     A L    N   ++D  +L   HMY                      +   
Sbjct: 211 ---------EANLEKGTNLASWLDSVLLK-GHMY----------------------RGTI 238

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
           +W     +PEG+LS++ SI++ IIG+  G V+      + + ++    G  L+ FGL   
Sbjct: 239 TW-----DPEGILSTLPSIVNGIIGLLIGQVLQRDTTKILKAQKMGIAGTILIFFGLMWD 293

Query: 341 FTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
                P+NK L+T SYV  T+G A +  + +Y  +DI + K  F P    G+N M+V+
Sbjct: 294 L--VFPINKSLWTSSYVLYTTGLATVFLTILYYTIDIADYKKGFKPFLIWGVNPMIVF 349


>gi|158337501|ref|YP_001518676.1| hypothetical protein AM1_4380 [Acaryochloris marina MBIC11017]
 gi|158307742|gb|ABW29359.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 383

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 180/375 (48%), Gaps = 86/375 (22%)

Query: 38  LDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
           LD+FRG+A+A M+LV+ +G     +P++ HA W+G  LAD V PFFLF++G ++A ++ R
Sbjct: 13  LDVFRGIAIAGMLLVNKSGLVKDAYPQLQHADWHGWTLADLVFPFFLFVLGASMAFSMAR 72

Query: 95  -----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
                   +     K++ R++ L   G+ L G +S+         ++  +R+ G+LQRI+
Sbjct: 73  HTASLTQPKRRVYLKILRRSVVLFGLGLFLNGFWSY---------NLSTLRVMGILQRIS 123

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-LALLYGTYVPDWQ 208
           L+YL+ +LV +    +  K Q              W M   +LV Y LAL   +++P  +
Sbjct: 124 LTYLVSALVIL---KLPRKSQ--------------WGMTGLLLVGYWLAL---SFIPVPE 163

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
           F   N                   L    N   Y+DR ++G +H+Y    +         
Sbjct: 164 FGAGN-------------------LTRTGNFGAYVDRLIIGSSHLYVGDQF--------- 195

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK----Q 324
                          ++  +PEGL S++ +I + ++G +F    I  +G   ++K     
Sbjct: 196 ---------------NSMGDPEGLFSTLPAIATVLLG-YFAGDWIRKRGSGLKIKTSRQS 239

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
                + L+  GL L ++   P+NK+L+T SYV  T G A ++ +  Y L+++  ++   
Sbjct: 240 LALASYGLISTGLGLLWSIWFPINKKLWTSSYVLFTVGIALILLAVCYELIEVRRIRLWS 299

Query: 385 LPLAWIGMNAMLVYV 399
            P   +G+N+++V++
Sbjct: 300 KPFEVLGLNSIVVFM 314


>gi|372268395|ref|ZP_09504443.1| hypothetical protein AlS89_10850 [Alteromonas sp. S89]
          Length = 395

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 107/401 (26%), Positives = 184/401 (45%), Gaps = 77/401 (19%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL S+D+ RG+A+A M+LV++ G     +  ++HA W+G    D + P FLF+VGV++ L
Sbjct: 16  RLMSVDLLRGIAIAAMVLVNNPGSWSFVYAPMAHAQWHGWTPTDVIFPLFLFVVGVSMVL 75

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM--IRLCGVLQRI 148
           +  +  D          R LKL   G+ L   F +  D     ++ R+  IR  GVLQRI
Sbjct: 76  STGKRGDFPPVGWAQWSRALKLFALGLFLAIFFYNFRDASYNWIEDRLEGIRWMGVLQRI 135

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           AL Y+L                          Y   WL A  +LV   A +  + VP W 
Sbjct: 136 ALVYILCC------------------------YLVRWLPAKGLLV---AAILCSVVP-WT 167

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
             ++    +  G+VF      + +L    +   ++D+ +LG  H+Y+  A          
Sbjct: 168 LMLVVPYQSASGEVF------QGQLAFGNHFAAWLDQWLLGSAHVYYRDA---------- 211

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV----HFGHVIIHTKGHLARLKQ 324
            PF               F+PEG+L++ S+  + ++GV     +     + +  L   + 
Sbjct: 212 QPFA--------------FDPEGVLTTFSAASTCLLGVLAALAWKSADSNGEAQLRLCRN 257

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           W+  G  +++ G  +H  + +P+NK L++ S+V VT+G + L+ + +Y LVDI   +   
Sbjct: 258 WLVAGTLMVLVGQLMH--SIVPINKALWSPSFVLVTAGVSLLLMAGLYYLVDIRERRRAL 315

Query: 385 LPLAWIGMNAMLVYVMAAE--------GIFAGFINGWYYGD 417
            PL   G+NA+ ++++A           +  G + GW +G 
Sbjct: 316 APLLVFGVNAIALFMLAGVVGRILIMIPVGDGTLKGWLFGS 356


>gi|345513910|ref|ZP_08793425.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
 gi|229435722|gb|EEO45799.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
          Length = 372

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 103/393 (26%), Positives = 169/393 (43%), Gaps = 90/393 (22%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG  +
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSY 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + +   +DI N +         G+N + +YV++
Sbjct: 282 ATLMYYIDIRNKQKWCRFFIIFGVNPLFLYVLS 314


>gi|441501363|ref|ZP_20983482.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
           imtechensis AK7]
 gi|441434899|gb|ELR68324.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
           imtechensis AK7]
          Length = 368

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 174/380 (45%), Gaps = 87/380 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +RL SLD+FRG+ +A MI+V++ G     +P + HA W+GC L D V PFFLFIVGVA
Sbjct: 3   KNKRLLSLDVFRGITIAAMIVVNNPGSWAAVYPPLLHAGWHGCTLTDLVFPFFLFIVGVA 62

Query: 88  IALALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           + L+L R  +     K++IF  LK   +LF   L    F +         D+  +R+ GV
Sbjct: 63  VCLSLSRAVEDKGRHKQIIFTVLKRSVILFLIGLFLNAFPY--------FDLYHLRIPGV 114

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ + + +   ++ K        +G               + +L+VY  L      
Sbjct: 115 LQRIAVVFFICAF--LYLKTGWKVQVYIG---------------SAILMVYWLL------ 151

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
               F II    A  G            L    N   ++D ++L   HM     W  +K 
Sbjct: 152 ----FLIIPIPGAATG-----------SLESGANLAAWVDSQLL-TGHM-----WEVTK- 189

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                           +W     +PEG+LS++ +I++ IIGV  G  ++  +    ++  
Sbjct: 190 ----------------TW-----DPEGVLSTLPAIVTGIIGVLVGQWLMADRTEKEKVIY 228

Query: 325 WVTMGFALLIFGL--TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
                  L++ GL   L F    P+NK L+T SYV  T+G A    + +Y L+DI   + 
Sbjct: 229 LFVAANLLIVAGLFWDLFF----PINKSLWTSSYVLYTAGIAIHFLAFLYWLLDIKMQRS 284

Query: 383 PF-LPLAWIGMNAMLVYVMA 401
            F  P    G+NA+ VY+++
Sbjct: 285 KFWTPFKAFGINAIFVYMLS 304


>gi|423215264|ref|ZP_17201791.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392691832|gb|EIY85072.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 371

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 170/380 (44%), Gaps = 78/380 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +K++RL SLD+ RG+ +  MILV++ G     +  + HA WNG    D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 87  AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRLC 142
           +++ AL R         + K++ RT+ L   G+ L   FS     +  GV+     IR+ 
Sbjct: 61  SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSL----VCAGVEQPFSQIRIL 115

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQR+AL+Y   SL+ +               S+ R     W ++A +L+ Y+ LL   
Sbjct: 116 GVLQRLALAYFFGSLLIM---------------SVRRPANLAW-ISAIILIGYIVLL--- 156

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                            G  F ++           N +   DR + G  H+Y        
Sbjct: 157 ---------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY-------- 184

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                        R+  P      F+PEGLLS++  I   IIG   G+++        RL
Sbjct: 185 -------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRL 231

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
            Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L    +  L+DI   + 
Sbjct: 232 LQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLFLVFLTWLIDIRKKQK 289

Query: 383 PFLPLAWIGMNAMLVYVMAA 402
              P    G N + +YV+A 
Sbjct: 290 WAYPFHVFGTNPLFIYVVAG 309


>gi|399028182|ref|ZP_10729485.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
 gi|398074259|gb|EJL65410.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
          Length = 423

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 120/415 (28%), Positives = 191/415 (46%), Gaps = 74/415 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL SLD+FRGL + LM +V++ G DW    P + HA WNGC   D V PFF+FI+GVA+
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPG-DWGHVFPPLLHAKWNGCTPTDLVFPFFIFIMGVAV 62

Query: 89  ALALKRIPDRA---DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            LA+   PD+        K++ R+L++L  GI     F+       +G++   + +  ++
Sbjct: 63  PLAM---PDKIYDDTTFNKILVRSLRMLCLGIF----FNFFEKIQLFGLEGIPLLIGRLI 115

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW---------------HWLMAAC 190
             IA+ Y+L+        +   K +++  FSI  +Y +                 L    
Sbjct: 116 ITIAVGYVLMG-------NFSSKLKNIFAFSILIIYLFLAYSEIEAYQDVRLPGVLQRIA 168

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNV--TCGV-RAKLNPPCNAVGYIDRKV 247
           V+   ++LLY       Q          Y  V N+    G+  A L    N   ++D  +
Sbjct: 169 VVYFVVSLLYLKTSQKTQIITGVFLLLGYWAVMNLIPVPGIGEANLEKGTNLASWLDSIL 228

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           L   HMYH                      +  +W     +PEG+LS++ SI++ IIG+ 
Sbjct: 229 LK-GHMYH----------------------ETKTW-----DPEGILSTIPSIVNGIIGLL 260

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G ++      +   K+   +G AL+I GL  +     P+NK ++T SYV  T+G AA  
Sbjct: 261 IGQLLFLKIAKIEIAKKIALIGIALIITGLLWNI--VFPINKSIWTSSYVLYTTGLAATT 318

Query: 368 FSAIYALVDIWNLKYPF-LPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNT 421
            S +Y ++DI   K  F L L W G+N M+V+   A  I    +    + +PH T
Sbjct: 319 LSVLYFIIDIAEYKKGFKLFLIW-GVNPMIVFF--ASQIIPQALVMIQFENPHKT 370


>gi|163786877|ref|ZP_02181325.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
           ALC-1]
 gi|159878737|gb|EDP72793.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
           ALC-1]
          Length = 361

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 107/375 (28%), Positives = 164/375 (43%), Gaps = 83/375 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + R+ S+DI RGL +  MILV+  G     +  + HA W+G    D + PFFLFIVG++I
Sbjct: 2   SARIESVDILRGLTILAMILVNTPGTWGHVYTPLRHAEWHGLTPTDLIFPFFLFIVGISI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             A K  P+     KK+I R+LKL+  G+ L     H P    +  D    R+ GVLQRI
Sbjct: 62  YFAYKNKPNTKLTYKKIIIRSLKLIGLGLFLNLFLPHFP----FFNDFETHRIPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WHWLMAACVLVVYLALLYGTYVPDW 207
            L +L                      SI  L C W  L A  + ++    L   ++P  
Sbjct: 118 GLVFLFS--------------------SILYLNCSWKSLTAIGITIILGYWLCLGFIPFQ 157

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
             ++   D A                  P N   YID  +LG  HM     W+       
Sbjct: 158 DGSLPTFDRA------------------PNNWANYIDLNILG-EHM-----WKTD----- 188

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEGL+S++ +I + I G+  G ++      L  +K    
Sbjct: 189 -------------------YDPEGLISTIPAIATCISGILIGKLL----DGLTHIKYLFI 225

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
                LI G  L+     P NK +++ S+V VTSG A L+ + IY L DI  +++  +  
Sbjct: 226 TALGFLISGYILNI--WFPTNKAIWSSSFVLVTSGWATLILAVIYYLKDIRKIEFGTV-F 282

Query: 388 AWIGMNAMLVYVMAA 402
            ++GMNA+ +Y +++
Sbjct: 283 KYVGMNAITIYFLSS 297


>gi|223935576|ref|ZP_03627492.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223895584|gb|EEF62029.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 410

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 112/388 (28%), Positives = 173/388 (44%), Gaps = 88/388 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD+FRG  +A M+LV++ G  W  I     HA WNG    D + PFFL+IVGVA
Sbjct: 22  SARLMSLDVFRGATIASMMLVNNPG-SWDSIYRQLDHAEWNGWTFTDLIFPFFLWIVGVA 80

Query: 88  IALALKRIPD----RADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTYGVDV 136
           I L+ ++  D    R +    V+ R        L L F+  L+ G +        +  ++
Sbjct: 81  IPLSTQKRLDGGASRTNLWLHVVRRAAIIFGLGLFLAFFSFLINGSYGRLGGFGPWFNEI 140

Query: 137 -RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
              IR+ GVLQRIA+ YL+ S + + TK        +G      L   +W++  CV V  
Sbjct: 141 CGTIRIPGVLQRIAVCYLIASTIYLTTKLRGQIAWLIG------LLAAYWVLMKCVPV-- 192

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
                                  +G       GV   L P  N   Y+D  VLG  H +H
Sbjct: 193 ---------------------PGHG------AGV---LTPEGNFSAYVDGNVLG-RHTWH 221

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                                        AP++PEG++S++ +I + + G+  G +++  
Sbjct: 222 ----------------------------GAPWDPEGVISTIPAIATCLFGILTGQLLL-I 252

Query: 316 KGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           K  + +   WV   G  L++ G  ++    +P+NK L+T SY    +G A  VF+  Y L
Sbjct: 253 KRSVEQKTTWVFVSGILLILAGAVMNIW--LPINKNLWTSSYSVFMAGMAMNVFAVFYWL 310

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           VD+   +    P A  GMNA+ V+++A 
Sbjct: 311 VDVKGCQKWAKPFAIYGMNAITVFMLAG 338


>gi|237711644|ref|ZP_04542125.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|423230937|ref|ZP_17217341.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
           CL02T00C15]
 gi|423244648|ref|ZP_17225723.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
           CL02T12C06]
 gi|229454339|gb|EEO60060.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|392630057|gb|EIY24059.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
           CL02T00C15]
 gi|392641497|gb|EIY35273.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
           CL02T12C06]
          Length = 372

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 103/393 (26%), Positives = 169/393 (43%), Gaps = 90/393 (22%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG  +
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSY 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + +   +DI N +         G+N + +YV++
Sbjct: 282 ATLMYYIDIRNKQKWCRFFIIFGVNPLFLYVLS 314


>gi|299140549|ref|ZP_07033687.1| membrane protein [Prevotella oris C735]
 gi|298577515|gb|EFI49383.1| membrane protein [Prevotella oris C735]
          Length = 359

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 169/376 (44%), Gaps = 83/376 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ +RL SLD+ RG  V LMILV++  G   +  + H+ WNG    D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 88  IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             L+L++      R  A  K++ RT+ L F G+ +   F  A       +D+  +R+  V
Sbjct: 61  TYLSLEKTNFTWSRQVAF-KIVKRTVLLFFIGLFIN-WFDMAISG--NALDLSHLRIWAV 116

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QRIA+ Y  V                    SIF L C H      ++++  A       
Sbjct: 117 MQRIAICYFAV--------------------SIFALCCNHRHTIPAIVILLAA------- 149

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             +   +I  +   Y              +   N +  ID ++ GI H+YH         
Sbjct: 150 --YSLLLIWGNGYAY--------------DSQQNILAQIDIRLFGIEHLYH--------- 184

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                              ++P +PEG  SS+S+I  T+IG + G  +   K    ++ +
Sbjct: 185 -------------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLR 225

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  L+I G  + F   +PLNK++++ SYVC+T G AA+    +   +DI  +K   
Sbjct: 226 FLITGGFLVIIGYIVSF--GLPLNKRIWSPSYVCMTCGLAAVTQGLLMYCIDIKGIKTTR 283

Query: 385 LPLAWI-GMNAMLVYV 399
           L    + G N + +YV
Sbjct: 284 LTFFLVFGTNPLFLYV 299


>gi|54297581|ref|YP_123950.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
 gi|53751366|emb|CAH12784.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
          Length = 372

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 103/375 (27%), Positives = 161/375 (42%), Gaps = 82/375 (21%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI+V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG +S+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFVSTFTSIATTLSGVLAGSLLINPCNQFKKFYL 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
              +G   L+ G   + +   P+NK L+T SYV  TSG A LVF+  Y L+D   +K   
Sbjct: 231 LAGVGMLFLLLGWLWNMS--FPINKNLWTSSYVLWTSGLALLVFAFCYLLIDRLGVKKWS 288

Query: 385 LPLAWIGMNAMLVYV 399
           +     GMNA+  +V
Sbjct: 289 VFFKIFGMNALFAFV 303


>gi|338210835|ref|YP_004654884.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336304650|gb|AEI47752.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 363

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 168/374 (44%), Gaps = 84/374 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            RL SLD+FRG+ VA MILV++ G DW  +     HA WNGC   D + PFFLFIVGV++
Sbjct: 3   NRLLSLDVFRGMTVAAMILVNNPG-DWDHVYAPLLHAHWNGCTPTDLIFPFFLFIVGVSV 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           A A+ + P    ++ K+I R+  L      L    +  P       D   +R+ GVLQRI
Sbjct: 62  AFAMGKNP---PSLLKIIKRSAIL----FGLGLFLNLYPK-----FDFANVRIPGVLQRI 109

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           AL YL+ SL  IF K  + K Q +                  +L+ Y  L+         
Sbjct: 110 ALVYLVCSL--IFIKTTR-KTQVI--------------TTVLLLIAYWLLM--------- 143

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
            T++      Y           A L P  N   ++DR +L   H+     W+ +K     
Sbjct: 144 -TLVPVPGVGY-----------ANLEPTTNLGAWVDRGLLTTAHL-----WKSAKV---- 182

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                             ++PEG+ S++ +I + ++GV  G  +   K    ++      
Sbjct: 183 ------------------WDPEGMFSTIPAIGTGLLGVLTGQWLRSEKPVAEKMAWLFLS 224

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLA 388
           G AL++ GL   +    P+NK L+T S+V    G A +  +A Y L+D+   +    PL 
Sbjct: 225 GNALILGGLI--WNEFFPINKSLWTSSFVLYAGGWAMVGLAACYWLIDVQGYRRYTAPLV 282

Query: 389 WIGMNAMLVYVMAA 402
             G+NA+ V+ ++ 
Sbjct: 283 AFGVNAITVFFLSG 296


>gi|383115204|ref|ZP_09935962.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
 gi|313695379|gb|EFS32214.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
          Length = 361

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 178/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNERRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|281423205|ref|ZP_06254118.1| putative membrane protein [Prevotella oris F0302]
 gi|281402541|gb|EFB33372.1| putative membrane protein [Prevotella oris F0302]
          Length = 359

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 167/376 (44%), Gaps = 83/376 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ +RL SLD+ RG  V LMILV++  G   +  + H+ WNG    D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 88  IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             L+LK+      R  A  K++ RT+ L   G+ +   F  A       +D   +R+  V
Sbjct: 61  TYLSLKKTNFTWSRQVAF-KIVKRTVLLFLIGLFIN-WFDMAISG--NALDFSHLRIWAV 116

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QRIA+ Y  V                    SIF L C H      ++++  A       
Sbjct: 117 MQRIAICYFAV--------------------SIFALCCNHRHTIPAIVILLAA------- 149

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             +   +I  +   Y              +   N +  ID ++ GI H+YH         
Sbjct: 150 --YNLLLIWGNGYAY--------------DSQQNILAQIDIRLFGIEHLYH--------- 184

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                              ++P +PEG  SS+S+I  T+IG + G  +   K    ++ +
Sbjct: 185 -------------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLR 225

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  L+I G  + F   +PLNK++++ SYVC+T G AA+    +   +DI  +K   
Sbjct: 226 FLITGGFLVIIGYIVSF--GLPLNKRIWSPSYVCMTCGLAAVTQGLLMYCIDIKGIKTTR 283

Query: 385 LPLAWI-GMNAMLVYV 399
           L    + G N + +YV
Sbjct: 284 LTFFLVFGTNPLFLYV 299


>gi|423241434|ref|ZP_17222547.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
           CL03T12C01]
 gi|392641327|gb|EIY35104.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
           CL03T12C01]
          Length = 372

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 103/393 (26%), Positives = 168/393 (42%), Gaps = 90/393 (22%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG   
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + +   +DI N +         G+N + +YV++
Sbjct: 282 ATLMYYIDIRNKQKWCRFFIIFGVNPLFLYVLS 314


>gi|225012704|ref|ZP_03703139.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
 gi|225003237|gb|EEG41212.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
          Length = 366

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 108/380 (28%), Positives = 162/380 (42%), Gaps = 87/380 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGV 86
           K QRL +LDI RGL + LMI+V+  G  W E+     HA WNG    D+V P FLFIVGV
Sbjct: 4   KNQRLLALDILRGLTIILMIVVNDPG-SWSEVYAPFLHAEWNGLTPTDYVFPTFLFIVGV 62

Query: 87  AIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +I L+L +  +    R+   KKV++R LK+   GI L          L    +   IR  
Sbjct: 63  SIVLSLSKQLEAGKTRSQIAKKVLWRALKIYLVGIFLW---------LWPSFNFEGIRWV 113

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVL RIAL +L   L+ ++T         +G      +   +W+M A V           
Sbjct: 114 GVLPRIALVFLACGLIFLYTTKKTQWYLGIG------ILLGYWIMMAYV----------- 156

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                                            P   +G+ D  V         P   ++
Sbjct: 157 ---------------------------------PVPGIGFPDLSV---------P--EKN 172

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
            A   DS F  P R    +W     +PEG LS++ +I+S +IG+  G+V++  +    RL
Sbjct: 173 WAHYLDS-FLIPGRLWKYTW-----DPEGFLSTLPAIVSGLIGMWAGYVLMKKEELKTRL 226

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
            Q   +GF LL  G  + +    P NK L++ S+     G   L  +A     D+   ++
Sbjct: 227 NQLFFIGFILLFLGDAMQWV--FPFNKNLWSSSFTLFMGGIGMLSLAAFSYYFDVRQSRF 284

Query: 383 PFLPLAWIGMNAMLVYVMAA 402
            F      G+N++  Y M++
Sbjct: 285 KFEFAHVFGVNSIFAYSMSS 304


>gi|298479647|ref|ZP_06997847.1| membrane protein [Bacteroides sp. D22]
 gi|336403243|ref|ZP_08583960.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
 gi|295084924|emb|CBK66447.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
 gi|298274037|gb|EFI15598.1| membrane protein [Bacteroides sp. D22]
 gi|335946636|gb|EGN08437.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
          Length = 371

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 169/380 (44%), Gaps = 78/380 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +K++RL SLD+ RG+ +  MILV++ G     +  + HA WNG    D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 87  AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRLC 142
           +++ AL R         + K++ RT+ L   G+ L   FS     +  GV+     IR+ 
Sbjct: 61  SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSL----VCAGVEQPFSQIRIL 115

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQR+AL+Y   SL+ +               S+ R     W ++A +L+ Y+ LL   
Sbjct: 116 GVLQRLALAYFFGSLLIM---------------SVRRPANLAW-ISAIILIGYIVLL--- 156

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                            G  F ++           N +   DR + G  H+Y        
Sbjct: 157 ---------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY-------- 184

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                        R+  P      F+PEGLLS++  I   IIG   G+++        RL
Sbjct: 185 -------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRL 231

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
            Q   +G  LL  G  L +    PLNK++++ ++V VT G A+L    +  L+DI   + 
Sbjct: 232 LQISILGIVLLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLFLVLLTWLIDIRKKQK 289

Query: 383 PFLPLAWIGMNAMLVYVMAA 402
              P    G N + +YV+A 
Sbjct: 290 WAYPFHVFGTNPLFIYVVAG 309


>gi|262407085|ref|ZP_06083634.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294648023|ref|ZP_06725570.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
 gi|294809833|ref|ZP_06768513.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345512215|ref|ZP_08791750.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
 gi|229445856|gb|EEO51647.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
 gi|262355788|gb|EEZ04879.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636642|gb|EFF55113.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
 gi|294442971|gb|EFG11758.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 371

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 169/380 (44%), Gaps = 78/380 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +K++RL SLD+ RG+ +  MILV++ G     +  + HA WNG    D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 87  AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRLC 142
           +++ AL R         + K++ RT+ L   G+ L   FS     +  GV+     IR+ 
Sbjct: 61  SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSL----VCAGVEQPFSQIRIL 115

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQR+AL+Y   SL+ +               S+ R     W ++A +L+ Y+ LL   
Sbjct: 116 GVLQRLALAYFFGSLLIM---------------SVRRPANLAW-ISAIILIGYIVLL--- 156

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                            G  F ++           N +   DR + G  H+Y        
Sbjct: 157 ---------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY-------- 184

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                        R+  P      F+PEGLLS++  I   IIG   G+++        RL
Sbjct: 185 -------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRL 231

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
            Q   +G  LL  G  L +    PLNK++++ ++V VT G A+L    +  L+DI   + 
Sbjct: 232 LQISILGIVLLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLFLVFLTWLIDIRKKQK 289

Query: 383 PFLPLAWIGMNAMLVYVMAA 402
              P    G N + +YV+A 
Sbjct: 290 WAYPFHVFGTNPLFIYVVAG 309


>gi|113475212|ref|YP_721273.1| hypothetical protein Tery_1515 [Trichodesmium erythraeum IMS101]
 gi|110166260|gb|ABG50800.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 366

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 163/372 (43%), Gaps = 82/372 (22%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+ +A MI+V++ G     +P + HA W+GC   D + PFFLFI+GVA+  
Sbjct: 2   RLKSLDVFRGITIASMIIVNNPGSWNHVYPPLLHAKWHGCTPTDLIFPFFLFIMGVAMTF 61

Query: 91  ALKRIPDRADAVKKV---IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           +L +  D+   +  +   IFR   ++F   LL  GF +         ++  IR+ GVLQR
Sbjct: 62  SLSKYTDKNQPIPHIYQRIFRRCLIIFLFGLLLNGFPN--------YNLATIRVMGVLQR 113

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I+L YLL ++             ++   S  +LY     +A  +L+ Y   +    VP  
Sbjct: 114 ISLVYLLAAI-------------AILNLSRKQLYG----LATTLLIGYWIAMQLIPVP-- 154

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                      YG            L+P  N   YIDR +L   H+              
Sbjct: 155 ----------GYG---------LGNLSPEGNFAAYIDRLILTQQHL-------------- 181

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                         W    ++PEGL S++ +I++ +IG   G  + H   +       V 
Sbjct: 182 --------------WAGKQYDPEGLFSTLPAIVTVLIGYLTGEWLKHQSTNSRTTLNMVI 227

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
            G + L+ G         P+NK L+T SYV  T+G A L  +  Y  +++        P 
Sbjct: 228 SGLSCLVVGYLWGL--FFPINKALWTSSYVIFTAGWALLFLAICYQAMEVRGWHKWGHPF 285

Query: 388 AWIGMNAMLVYV 399
             +G+NA+ +++
Sbjct: 286 EVMGLNAIFIFI 297


>gi|418746616|ref|ZP_13302939.1| PF07786 family protein [Leptospira santarosai str. CBC379]
 gi|410792596|gb|EKR90528.1| PF07786 family protein [Leptospira santarosai str. CBC379]
          Length = 375

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 179/404 (44%), Gaps = 100/404 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            +++S     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A         
Sbjct: 60  FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y +V SL  +F        + +  FS+             +L++
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSV------------AILLI 150

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           +  +L    +P  +  +  +   D G                     +IDR + G  H+ 
Sbjct: 151 HTWILTQIALPG-ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               WR SK                       ++PEG LS V+S+++T+ GV  G ++  
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL-- 220

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAI 371
                 R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L    F  +
Sbjct: 221 ----FLRERRNKILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFLSIWFFEYL 276

Query: 372 YALV--DIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            +L+    WNLK  F P    G NA+LV+V    GI A  +N W
Sbjct: 277 SSLIISKGWNLKILFQPFLVFGKNAVLVFV--GSGILARTLNLW 318


>gi|375149929|ref|YP_005012370.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361063975|gb|AEW02967.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 392

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/396 (28%), Positives = 162/396 (40%), Gaps = 105/396 (26%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  SLD+FRG  VA MILV++ G  W  +     HAPW+GC   D V PFFLF VG A+
Sbjct: 3   KRFYSLDVFRGATVAFMILVNNPG-SWSNLYAPLEHAPWHGCTPTDLVFPFFLFAVGNAL 61

Query: 89  ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGG--FSHAPDELTY------GVDVRM 138
           A  + R+ +      +KKVI R+  +   G  L          D LT+      G +  +
Sbjct: 62  AFVMPRLQEAGTTAFLKKVITRSFLIFLIGFFLNWSPFIRWDNDHLTFKAWEYAGANGNL 121

Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             IR+ GVLQRIAL Y   SL+  F K                      +  A V    L
Sbjct: 122 IGIRILGVLQRIALCYFFASLIIYFFK----------------------IRGAFVSAFVL 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINH 252
            L Y      W   +   ++AD                 P +  GY    +D+ +LG +H
Sbjct: 160 LLGY------WVLCMFFGNAAD-----------------PYSLNGYFGLGVDKAILGESH 196

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MYH                EG             F+PEG+ S++++I+  I G   G  I
Sbjct: 197 MYHG---------------EG-----------VAFDPEGITSTLTAIVQVIFGYFVGFYI 230

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
                +   L      G  L+  G    +    P+NK+++T SYV  T+G A LV S   
Sbjct: 231 QQKGKNFEMLSHLFVAGCILIFTGYA--WDMMFPINKKIWTSSYVLYTTGLAILVLSLCI 288

Query: 373 ALVDIWNLKYPFLPLAW------IGMNAMLVYVMAA 402
            L++    K      AW       G NA+ ++ ++ 
Sbjct: 289 FLIEFKEAK-----GAWSRFFDVFGKNALFIFFLSG 319


>gi|344203119|ref|YP_004788262.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343955041|gb|AEM70840.1| hypothetical protein Murru_1800 [Muricauda ruestringensis DSM
           13258]
          Length = 375

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/384 (28%), Positives = 164/384 (42%), Gaps = 82/384 (21%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS---HAPWNGCNLADFVMPF 79
           +Q   S LK + L SLD+FRGL VALMI+V+  G      S   HA WNG  L D V P 
Sbjct: 6   NQPNMSKLKNRYL-SLDVFRGLDVALMIIVNSPGNGSTTFSPLLHADWNGFTLTDLVFPT 64

Query: 80  FLFIVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF+VG +++ ++K+          KKV+ RT  +   G L+         +L       
Sbjct: 65  FLFVVGNSMSFSMKKYESMGKPAFFKKVLKRTAIIFLLGFLMYWYPFFDDGQLK---PFS 121

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             R+ GVLQRIAL Y+  S++  F K               +   W   ++A  LV Y  
Sbjct: 122 ETRVFGVLQRIALCYMFASIILHFVKT--------------KTAIW---LSALFLVGYHL 164

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +L G                               L    NAV  +D  ++G NHMYH  
Sbjct: 165 ILIG----------------------------FGDLTLTGNAVLKLDEWLIGANHMYHG- 195

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                         EG             F+PEGLLS++ +I++ IIG   G  I +   
Sbjct: 196 --------------EG-----------IAFDPEGLLSTLPAIVNVIIGYLAGRFIQNNGQ 230

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
           +   + + +  GFAL+  GL       +P+NK+L+T S+V +T G      + +  ++D+
Sbjct: 231 NFETVAKLMMFGFALVFAGLAWDLV--LPINKKLWTSSFVLLTCGIDLFAIAILIYILDM 288

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMA 401
              K         G N + +Y+++
Sbjct: 289 KKAKSWSYFFEVFGKNTLFIYLLS 312


>gi|299144716|ref|ZP_07037784.1| putative membrane protein [Bacteroides sp. 3_1_23]
 gi|298515207|gb|EFI39088.1| putative membrane protein [Bacteroides sp. 3_1_23]
          Length = 361

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/375 (29%), Positives = 178/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|332983392|ref|YP_004464833.1| heparan-alpha-glucosaminide N-acetyltransferase [Mahella
           australiensis 50-1 BON]
 gi|332701070|gb|AEE98011.1| Heparan-alpha-glucosaminide N-acetyltransferase [Mahella
           australiensis 50-1 BON]
          Length = 368

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 169/375 (45%), Gaps = 84/375 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R+ S+D  RG+ +  MI +++ G      P + HAPWNG  LAD   P F+F++G+ I 
Sbjct: 4   KRIQSIDALRGICITAMIFMNNPGNSKYTSPLLLHAPWNGITLADLFFPCFIFVMGMVIP 63

Query: 90  LALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++  +   +     ++I   LK   +LF    L G F +A        D++ +R+ GVLQ
Sbjct: 64  VSFGKRMAKGQTKGQLIAHLLKRSAMLF----LIGLFLNAFPCF----DMQHVRILGVLQ 115

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIAL Y    L+ +F+  +                   ++++A +L+ Y  LL    VP 
Sbjct: 116 RIALVYFFSGLIFLFSSTMS-----------------MFIISAAILIGYYLLLRFVPVPG 158

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
           +   +  +                       N + YID K+L   H+Y            
Sbjct: 159 YGAGVFERTG---------------------NLIQYIDLKLLK-GHLY------------ 184

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                        P W     +PEGLLS++ +I S+++G+  G +++  K +  +L  ++
Sbjct: 185 ------------TPDW-----DPEGLLSTLPAIASSLLGILTGCLLVSDKKNTNKL--YI 225

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +  + L F  ++      PLNK L++ S+V  T+G A L+ S  Y L DI NL     P
Sbjct: 226 MLVCSALAFISSIITQKWFPLNKNLWSSSFVLFTTGFALLLLSVCYWLADINNLATLIKP 285

Query: 387 LAWIGMNAMLVYVMA 401
               G NA+LVY ++
Sbjct: 286 FIIFGSNAILVYTLS 300


>gi|330792857|ref|XP_003284503.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
 gi|325085533|gb|EGC38938.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
          Length = 373

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 162/382 (42%), Gaps = 95/382 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R++SLD+ RG+ +  MILVD+ GG    WP +    WNG + AD + P FLFI G ++A
Sbjct: 2   KRMSSLDVARGITIFGMILVDNQGGPDVIWP-LKETEWNGLSTADLIFPSFLFICGFSVA 60

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           LALK   +       +I RT  L F    L        + + +       R+ GVLQRIA
Sbjct: 61  LALKTAKNTRSTWYNIIRRTFLLFFIQCFL--------NLMAHHFVFSSFRVMGVLQRIA 112

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L Y L                S   F  F ++     +     V Y++++Y   VP    
Sbjct: 113 LCYFL----------------SCVSFLCFPVFLQRLFLLGTT-VTYISVMYALPVPG--- 152

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
                            CG +  L P CNA  Y+D KV G N ++               
Sbjct: 153 -----------------CG-KGVLTPTCNAGAYLDFKVFGPNMIH--------------- 179

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQ 324
                           P +PEGLLS++S+ ++T +G+ FG V       +   ++  + +
Sbjct: 180 ----------------PNDPEGLLSTLSAFITTWMGLEFGRVFTTYYRKYDYSNVDLIVR 223

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--W-NLK 381
           W+ M    ++  + L  T  +P NK +++ S+   T G    + +  Y L+D+  W N  
Sbjct: 224 WIVMIALFIVPAIGLGAT-VMPFNKLIWSFSFALFTVGTGGCLITVAYILIDVIEWGNKA 282

Query: 382 YPFL-----PLAWIGMNAMLVY 398
             F+     P  +IG N + +Y
Sbjct: 283 RRFIDLLSKPFIYIGTNPITIY 304


>gi|421110364|ref|ZP_15570862.1| PF07786 family protein [Leptospira santarosai str. JET]
 gi|410804289|gb|EKS10409.1| PF07786 family protein [Leptospira santarosai str. JET]
          Length = 375

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 116/404 (28%), Positives = 176/404 (43%), Gaps = 100/404 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            +++S     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A         
Sbjct: 60  FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y +V SL  +F        + +  FS+  L    W++    L  
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSVPILLIHTWILTQIAL-- 160

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
                     P  +  +  +   D G                     +IDR + G  H+ 
Sbjct: 161 ----------PG-ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               WR SK                       ++PEG LS V+S+++T+ GV  G ++  
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL-- 220

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAI 371
                 R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L    F  +
Sbjct: 221 ----FLRERRNKILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFLSIWFFEYL 276

Query: 372 YALV--DIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            +L+    WNLK  F P    G NA+LV+V    GI A  +N W
Sbjct: 277 SSLIISKGWNLKILFQPFLVFGKNAVLVFV--GSGILARTLNLW 318


>gi|293371912|ref|ZP_06618316.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|292633158|gb|EFF51735.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 361

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 178/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG +D  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGIVDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|160884063|ref|ZP_02065066.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
 gi|423291476|ref|ZP_17270324.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
           CL02T12C04]
 gi|156110405|gb|EDO12150.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
 gi|392663476|gb|EIY57026.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
           CL02T12C04]
          Length = 361

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 109/375 (29%), Positives = 177/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLAVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|347738959|ref|ZP_08870332.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
 gi|346917867|gb|EGY00078.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
          Length = 400

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 129/449 (28%), Positives = 205/449 (45%), Gaps = 98/449 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGV 86
            + RL SLD+ RGLAVA MILV   G DW +    + HAPW+G  LAD V P FLF VG+
Sbjct: 19  TSPRLPSLDVLRGLAVAGMILVVSPG-DWSKAYTPLKHAPWDGWTLADMVFPTFLFSVGL 77

Query: 87  AIALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           AIAL+  +I  +R  A  K+  R L L+  G++L        + L Y  D+  +RL G+L
Sbjct: 78  AIALSFTKIAQNRRAAGVKIARRALALIVLGLVL--------NALPY-FDLAHLRLPGIL 128

Query: 146 QRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           QRIAL Y+L +L+ + T +   D   +V + ++        +  A VL+ Y A+L    V
Sbjct: 129 QRIALCYVLATLLCLVTARTGADGSPTVNQRALL-------IAMAVVLLGYCAVLAWVPV 181

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAK-LNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P                      G+ A  L+P  +   +IDR V  + H++         
Sbjct: 182 P----------------------GIGAGHLDPGGSLAAWIDRGVFTVPHLW--------- 210

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                     P   DA       ++PEGLLS++ + ++ ++GV  G  +  ++  L  L 
Sbjct: 211 ----------PYGTDAAGAVV--YDPEGLLSTLPATVNVLVGVLAGTALKASRSRLNLLV 258

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
             V     L++ GL L     + +NK+++T S+   +SG +  V   +  L+D      P
Sbjct: 259 AAVM----LMMAGLALD--PVLVINKRIWTSSFALFSSGFSLAVLVVLSVLLDQGPAALP 312

Query: 384 ---FLPLAWIGMNAMLV--------------YVMAAEGIFA------GFINGWYYGDPH- 419
                P   +G NA+L               +++ A+G         G + G    DP+ 
Sbjct: 313 QTLSFPFRVLGGNAILAFTLSQILSAVGSVPFIVTAQGRMTPQAWGDGLV-GAAIADPYL 371

Query: 420 NTLVCFLFIISYILHSFLWELRKFLYVQF 448
            +  C L I+S ++ + L   R+ L+V+ 
Sbjct: 372 ASFACALGILSLVVLTLLPLHRRGLHVRL 400


>gi|374312698|ref|YP_005059128.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358754708|gb|AEU38098.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 383

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 172/380 (45%), Gaps = 85/380 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + RL S+D+ RGL VA MILV++ G +   +  ++H+ WNG    D V P FLFI+G+++
Sbjct: 15  SSRLLSIDVLRGLTVAFMILVNNNGNNDLAYRALNHSLWNGFTPTDLVFPTFLFIMGISM 74

Query: 89  ALALKRIPDRADAVKKVIF------RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
            L+      RA A    +       R   L+F+G+++ G      D L         R+ 
Sbjct: 75  VLSFS--AHRAKATSNTVMLTSIGRRFALLIFFGLVVNGFPYFHLDTL---------RIY 123

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA+ YLL +L+++ T  +  +         F     +W++   V V       G 
Sbjct: 124 GVLQRIAVCYLLAALLQLVTDRIAPRV-----VLFFAAVIGYWVLLRFVPVP------GH 172

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            +P   F +++ D                      N V ++DR      H++ H  + ++
Sbjct: 173 GIPGRDFPLLDHD---------------------INLVAWLDR------HIFPHRLFEKT 205

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           +                        +PEGLLS + +  STI+G+  G  I   +    +L
Sbjct: 206 R------------------------DPEGLLSDIPAFASTILGLLAGAWIKQARPAGQKL 241

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
                 G AL + GL   ++ + P+NK+L+T SYV    G + L+ +  Y +++I  L+ 
Sbjct: 242 MGLFGAGIALAVAGLL--WSQSFPINKKLWTSSYVLYAGGLSILLLAVAYYVIEIRQLRG 299

Query: 383 PFL-PLAWIGMNAMLVYVMA 401
            +  PL   G NA+ VYV++
Sbjct: 300 RWTYPLLVFGTNAITVYVIS 319


>gi|156401292|ref|XP_001639225.1| predicted protein [Nematostella vectensis]
 gi|156226352|gb|EDO47162.1| predicted protein [Nematostella vectensis]
          Length = 376

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 169/372 (45%), Gaps = 68/372 (18%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           MI V+  GG +    HA WNG  +AD V P+F++I+GV+I L+ K +  R     K+  +
Sbjct: 1   MIFVNFGGGGYYFFGHAAWNGLLVADLVFPWFIWIMGVSITLSFKSLKRRKVKKWKICLK 60

Query: 109 TLK--LLFWGI-LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
            ++  L+ +G+ L    F+          D+   R+ GVLQR A  Y++++L+++F    
Sbjct: 61  VIRRSLILFGLGLFTSNFN----------DLETYRIPGVLQRFAACYIVIALMQLFLGPS 110

Query: 166 QDKDQSV---------GRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD--WQFTIINK 214
           +++ Q +            SI++     WL    +L +Y+ + Y   +      +T    
Sbjct: 111 EEQTQVLYPKWWDPIRDVVSIWK----QWLAMLLLLAIYVTVTYAVKLDGCPRGYTGPGG 166

Query: 215 DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGP 274
               Y + FN T GV            YIDRK  G  H+Y  P        T    ++  
Sbjct: 167 IGRGYPEAFNCTGGV----------ANYIDRKFFG-KHIYQWP--------TVKQLYKTK 207

Query: 275 LRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLI 334
           L          P EPEG L +++SI    +GV  G ++   +    R+ +W+  G  L +
Sbjct: 208 L----------PHEPEGFLGTLTSIFLVFLGVQAGRILHTYRKSTERITRWLAWGVFLGL 257

Query: 335 FGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI---WNLKYPFLPL 387
            G+ L         +P+NK L+++S++ VT  ++  + +  Y   D    WN    F P 
Sbjct: 258 IGVGLCKASENEGVVPINKNLWSVSFILVTGSSSFFLLTFCYIFTDSLGWWNGAPFFYP- 316

Query: 388 AWIGMNAMLVYV 399
              GMN++L+YV
Sbjct: 317 ---GMNSILLYV 325


>gi|298480127|ref|ZP_06998326.1| membrane protein [Bacteroides sp. D22]
 gi|336404355|ref|ZP_08585053.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
 gi|295085510|emb|CBK67033.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
 gi|298273936|gb|EFI15498.1| membrane protein [Bacteroides sp. D22]
 gi|335943683|gb|EGN05522.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
          Length = 361

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 178/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYDFQCRPAITKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---AIPHK-----RF---------MPLAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGANHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|189502592|ref|YP_001958309.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
           5a2]
 gi|189498033|gb|ACE06580.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
           5a2]
          Length = 380

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 173/385 (44%), Gaps = 83/385 (21%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
            HLK QRL SLD FRGL VA MIL ++ G     +  + HA W+GC   D + PFFLFIV
Sbjct: 2   QHLK-QRLVSLDFFRGLTVAGMILANNPGSWGHIYAPLKHAEWHGCTPTDLIFPFFLFIV 60

Query: 85  GVAIALAL---KRIPDR-ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMI 139
           GV+IA A+   K +P+  +  + K + R L L   GI L    +  P   T  ++  + +
Sbjct: 61  GVSIAFAIGSKKELPETHSQLILKSVRRMLTLFCLGIFL----ALYPKIFTSPIEAFKTV 116

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQR A+ Y + ++  IF K                           +L + L LL
Sbjct: 117 RIPGVLQRTAIVYFISTI--IFLK----------------------FTPRTILKIMLGLL 152

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
              ++     T +      Y           A L    N   +IDR +L   H++     
Sbjct: 153 VSYWI---LMTFVPVPGIGY-----------ANLEQETNLAAWIDRNLLTEPHLW----- 193

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH- 318
                            K   +W     +PEG+L ++S+I + + G+  G +++  K   
Sbjct: 194 -----------------KAVRTW-----DPEGILGTISAIATGLSGILAG-ILLQRKDQS 230

Query: 319 -LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
              ++ +  + G   +I GL  +F    P+NK L+T S+V  TSG A ++ +  Y ++D+
Sbjct: 231 DTEKIARLFSSGTLAVITGLIWNFI--FPINKSLWTSSFVLYTSGLAYIILALCYWIIDV 288

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMAA 402
              K    P+   G+NA+ V+ ++ 
Sbjct: 289 KGYKRFTKPIVAYGVNAITVFFVSG 313


>gi|265753064|ref|ZP_06088633.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263236250|gb|EEZ21745.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 372

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 102/393 (25%), Positives = 167/393 (42%), Gaps = 90/393 (22%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG   
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   + +PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADVLPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + +   +DI N +         G+N + +YV++
Sbjct: 282 ATLMYYIDIRNKQKWCRFFIIFGVNPLFLYVLS 314


>gi|423214205|ref|ZP_17200733.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693150|gb|EIY86385.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 361

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 178/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLAVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II+ K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIINIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|255013110|ref|ZP_05285236.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
 gi|256838332|ref|ZP_05543842.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|410102572|ref|ZP_11297498.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
 gi|423333958|ref|ZP_17311739.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
           CL03T12C09]
 gi|256739251|gb|EEU52575.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|409226793|gb|EKN19699.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
           CL03T12C09]
 gi|409238644|gb|EKN31435.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
          Length = 368

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 104/378 (27%), Positives = 174/378 (46%), Gaps = 78/378 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 4   QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAP-DELTYGVDVRMIRLCGV 144
           +  +L++   +   ++V KV+ RT+ +   G+ L   F H   +  T   D + +R+ GV
Sbjct: 64  MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QR+AL+Y   SL+ +               +I   Y     +AA +L+ Y ALL  T+ 
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
            +     +++DS                       +  +DR + G +HMYH         
Sbjct: 163 ME-----MSEDS----------------------IIAIVDRTLFGTSHMYH--------- 186

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKGHLARLK 323
                        D        F+PEGLLS + SI   ++G + G VI    K +   ++
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIR 234

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
                G  +L  G  L +    P+NK++++ ++V VT G A+L  + +  ++DI   K  
Sbjct: 235 NIFIFGTIILFAGFLLSY--GCPINKKIWSSTFVLVTCGFASLFLALLIWIIDINGKKKW 292

Query: 384 FLPLAWIGMNAMLVYVMA 401
            L     G+N + +YV  
Sbjct: 293 TLFFESFGINPLYLYVQG 310


>gi|150009610|ref|YP_001304353.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
 gi|262383102|ref|ZP_06076239.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|298374002|ref|ZP_06983960.1| membrane protein [Bacteroides sp. 3_1_19]
 gi|149938034|gb|ABR44731.1| putative transmembrane protein [Parabacteroides distasonis ATCC
           8503]
 gi|262295980|gb|EEY83911.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|298268370|gb|EFI10025.1| membrane protein [Bacteroides sp. 3_1_19]
          Length = 368

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 104/378 (27%), Positives = 174/378 (46%), Gaps = 78/378 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 4   QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAP-DELTYGVDVRMIRLCGV 144
           +  +L++   +   ++V KV+ RT+ +   G+ L   F H   +  T   D + +R+ GV
Sbjct: 64  MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QR+AL+Y   SL+ +               +I   Y     +AA +L+ Y ALL  T+ 
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
            +     +++DS                       +  +DR + G +HMYH         
Sbjct: 163 ME-----MSEDS----------------------IIAIVDRTLFGTSHMYH--------- 186

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKGHLARLK 323
                        D        F+PEGLLS + SI   ++G + G VI    K +   ++
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIR 234

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
                G  +L  G  L +    P+NK++++ ++V VT G A+L  + +  ++DI   K  
Sbjct: 235 NIFIFGTIILFAGFLLSY--GCPINKKIWSSTFVLVTCGFASLFLALLIWIIDINGKKKW 292

Query: 384 FLPLAWIGMNAMLVYVMA 401
            L     G+N + +YV  
Sbjct: 293 TLFFESFGINPLYLYVQG 310


>gi|423293378|ref|ZP_17271505.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
           CL03T12C18]
 gi|392678321|gb|EIY71729.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
           CL03T12C18]
          Length = 361

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 177/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGMIDSAILGANHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|237719042|ref|ZP_04549523.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229451820|gb|EEO57611.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 361

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 177/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGMIDSAILGANHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|295132874|ref|YP_003583550.1| hypothetical protein ZPR_1009 [Zunongwangia profunda SM-A87]
 gi|294980889|gb|ADF51354.1| membrane protein [Zunongwangia profunda SM-A87]
          Length = 376

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 171/386 (44%), Gaps = 95/386 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++R  SLD+FRGL +ALMILV+  G     +P + HA W G  LAD V P FLF VG A+
Sbjct: 6   SERFLSLDVFRGLTIALMILVNTPGTGADLYPYLVHAQWFGFTLADLVFPSFLFAVGNAM 65

Query: 89  ALALKRIPDR--ADAVKKVIFRTLKLLFWGILL---------QGGFSHAPDELTYGVDVR 137
           + ++++  +   AD  KKV+ RT  +   G L+         +G    +P   T      
Sbjct: 66  SFSMRKFQEAAPADFWKKVLKRTAIIFLLGFLMYWFPFFRMNEGHLELSPFSET------ 119

Query: 138 MIRLCGVLQRIALSYLLVS-LVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             R+ GVLQRIAL Y   + LV  F+       +++G             + A +L+ Y 
Sbjct: 120 --RIMGVLQRIALCYFFGAVLVRYFSV------KTIG------------FICAAILLAYW 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
            +LYG   P  +                        L    NA    D  +LG  H+Y  
Sbjct: 160 GILYGFGEPGHE------------------------LEMATNAAAKFDYAILGEGHIY-- 193

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
                              +KDA      PF+PEG+LS++ SI++ + G +   V I  K
Sbjct: 194 -------------------KKDA-----IPFDPEGILSTLPSIVNVLAG-YLAGVFIRRK 228

Query: 317 GH-LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
           G     + + +  GF  L+F L   +    PL+K+L+T  +  +T G    + +A+   V
Sbjct: 229 GKSYETIAKLMLAGF--LVFALAEWWALIFPLSKKLWTSPFAMLTIGLNLSMLAALIFAV 286

Query: 376 DIWNLKYPFLPLAWIGMNAMLVYVMA 401
           ++ N+K+        G N +++Y+ +
Sbjct: 287 ELKNIKFGTNFFNVFGKNPLVIYLFS 312


>gi|422005552|ref|ZP_16352731.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417255773|gb|EKT85231.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 375

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 118/404 (29%), Positives = 178/404 (44%), Gaps = 100/404 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            +++S     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A         
Sbjct: 60  FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y +V SL  +F      K   V    I  ++ W        ++ 
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF----PGKKILVFLVPILLIHTW--------ILT 156

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
            +AL   + V           S + GK                +   +IDR + G  H+ 
Sbjct: 157 QIALPGESVV-----------SLEQGK----------------DIGAWIDRTIFGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               WR SK                       ++PEG LS V+S+++T+ GV  G ++  
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL-- 220

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAI 371
                 R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L    F  +
Sbjct: 221 ----FLRERRNKILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFLSIWFFEYL 276

Query: 372 YALV--DIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            +L+    WNLK  F P    G NA+LV+V    GI A  +N W
Sbjct: 277 SSLIISKGWNLKILFQPFLVFGKNAILVFV--GSGILARTLNLW 318


>gi|296222155|ref|XP_002757067.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Callithrix jacchus]
          Length = 516

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 139/283 (49%), Gaps = 42/283 (14%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G ++ L++ 
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSVFLSMT 326

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 381

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 382 VTYFVVAVLELLFAKPVPEHCTSERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 441

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 442 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 487

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
               +                 ++PEG+L +++SIL   +GV 
Sbjct: 488 VLYHT--------------EVAYDPEGILGTINSILMAFLGVQ 516


>gi|423289836|ref|ZP_17268686.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
           CL02T12C04]
 gi|392666578|gb|EIY60091.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
           CL02T12C04]
          Length = 371

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 108/389 (27%), Positives = 165/389 (42%), Gaps = 96/389 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILL----------QGGFSHAPDELTYG 133
           V+++ AL R         + K++ RT+ L   G+ L          +  FSH        
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWFSLVCTGVEQPFSH-------- 111

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                IR+ GVLQR+AL+Y   SL+ +  +   +               W   ++  +L 
Sbjct: 112 -----IRILGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILA 150

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y  LL                    G  F ++           N +   DR + G  H+
Sbjct: 151 GYSILL------------------ALGHGFELS---------EQNIIAVTDRTLFGEAHL 183

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y                     R+  P      F+PEGLLS++  I   IIG   G+++ 
Sbjct: 184 Y---------------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILR 222

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  RL Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  
Sbjct: 223 EKTEIHHRLLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTW 280

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           L+DI   +    P    G N + +Y++A 
Sbjct: 281 LIDIRKKQKWGYPFHVFGTNPLFIYIVAG 309


>gi|160883406|ref|ZP_02064409.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
 gi|156111126|gb|EDO12871.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
          Length = 371

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/389 (27%), Positives = 165/389 (42%), Gaps = 96/389 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILL----------QGGFSHAPDELTYG 133
           V+++ AL R         + K++ RT+ L   G+ L          +  FSH        
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWFSLVCTGVEQPFSH-------- 111

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                IR+ GVLQR+AL+Y   SL+ +  +   +               W   ++  +L 
Sbjct: 112 -----IRILGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILA 150

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y  LL                    G  F ++           N +   DR + G  H+
Sbjct: 151 GYSILL------------------ALGHGFELS---------EQNIIAVTDRTLFGEAHL 183

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y                     R+  P      F+PEGLLS++  I   IIG   G+++ 
Sbjct: 184 Y---------------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILR 222

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  RL Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  
Sbjct: 223 EKTEIHHRLLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTW 280

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           L+DI   +    P    G N + +Y++A 
Sbjct: 281 LIDIRKKQKWGYPFHVFGTNPLFIYIVAG 309


>gi|336416828|ref|ZP_08597160.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
           3_8_47FAA]
 gi|335937266|gb|EGM99170.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
           3_8_47FAA]
          Length = 371

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/389 (27%), Positives = 165/389 (42%), Gaps = 96/389 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILL----------QGGFSHAPDELTYG 133
           V+++ AL R         + K++ RT+ L   G+ L          +  FSH        
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWFSLVCTGVEQPFSH-------- 111

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                IR+ GVLQR+AL+Y   SL+ +  +   +               W   ++  +L 
Sbjct: 112 -----IRILGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILA 150

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y  LL                    G  F ++           N +   DR + G  H+
Sbjct: 151 GYSILL------------------ALGHGFELS---------EQNIIAVTDRTLFGEAHL 183

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y                     R+  P      F+PEGLLS++  I   IIG   G+++ 
Sbjct: 184 Y---------------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILR 222

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  RL Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  
Sbjct: 223 EKTEIHHRLLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTW 280

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           L+DI   +    P    G N + +Y++A 
Sbjct: 281 LIDIRKKQKWGYPFHVFGTNPLFIYIVAG 309


>gi|307610361|emb|CBW99930.1| hypothetical protein LPW_16871 [Legionella pneumophila 130b]
          Length = 372

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 111/430 (25%), Positives = 179/430 (41%), Gaps = 99/430 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R +       +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKEKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D   +K   
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLIDRLGVKKWS 288

Query: 385 LPLAWIGMNAMLVYVM-----------------AAEGIFAGFINGWYYGDPHNTLVCFLF 427
           +     GMNA+  +V                   ++     ++  +++G   N     L+
Sbjct: 289 VFFKIFGMNALFAFVFHVLLLKLQYVFKITTPDGSKMALISYLKDYFFGGFSNHNAALLY 348

Query: 428 IISYILHSFL 437
            IS++  +FL
Sbjct: 349 SISFLFLNFL 358


>gi|383111974|ref|ZP_09932776.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
 gi|423296747|ref|ZP_17274817.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
           CL03T12C18]
 gi|313696106|gb|EFS32941.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
 gi|392669124|gb|EIY62615.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
           CL03T12C18]
          Length = 371

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/389 (27%), Positives = 165/389 (42%), Gaps = 96/389 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILL----------QGGFSHAPDELTYG 133
           V+++ AL R         + K++ RT+ L   G+ L          +  FSH        
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWFSLVCTGVEQPFSH-------- 111

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                IR+ GVLQR+AL+Y   SL+ +  +   +               W   ++  +L 
Sbjct: 112 -----IRILGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILA 150

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y  LL                    G  F ++           N +   DR + G  H+
Sbjct: 151 GYSILL------------------ALGHGFELS---------EQNIIAVTDRTLFGEAHL 183

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y                     R+  P      F+PEGLLS++  I   IIG   G+++ 
Sbjct: 184 Y---------------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILR 222

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  RL Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  
Sbjct: 223 EKTEIHHRLLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTW 280

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           L+DI   +    P    G N + +Y++A 
Sbjct: 281 LIDIRKKQKWGYPFHVFGTNPLFIYIVAG 309


>gi|402878146|ref|XP_003902762.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Papio
           anubis]
          Length = 708

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/345 (27%), Positives = 161/345 (46%), Gaps = 48/345 (13%)

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
           G  +AD V P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI++     
Sbjct: 347 GLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN--- 403

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LY 181
             P+     +    +R+ GVLQR+ ++Y +V+++E+ F K V +   S       R    
Sbjct: 404 --PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITS 461

Query: 182 CW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNA 239
            W  WL+   +  ++L L +   VP      +      D+GK  N T G          A
Sbjct: 462 SWPQWLLILALEGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------A 511

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
            GYIDR +LG +H+Y HP+                            ++PEG+L +++SI
Sbjct: 512 AGYIDRLLLGDDHLYQHPS------------------STVLYHTEVAYDPEGILGTINSI 553

Query: 300 LSTIIGVHFGHVIIH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTL 354
           +   +GV  G ++++    TK  L R   W   +G   ++          IP+NK L++L
Sbjct: 554 VMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSL 613

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           SYV   S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 614 SYVTTLSSFAFFILLVLYPVVDVKGL-WTGTPFFYPGMNSILVYV 657


>gi|301307595|ref|ZP_07213552.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423337400|ref|ZP_17315144.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
           CL09T03C24]
 gi|300834269|gb|EFK64882.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409237229|gb|EKN30029.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
           CL09T03C24]
          Length = 368

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 174/378 (46%), Gaps = 78/378 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 4   QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAP-DELTYGVDVRMIRLCGV 144
           +  +L++   +   ++V KV+ RT+ +   G+ L   F H   +  T   D + +R+ GV
Sbjct: 64  MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QR+AL+Y   SL+ +               +I   Y     +AA +L+ Y ALL  T+ 
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
            +     +++DS                       +  +D+ + G +HMYH         
Sbjct: 163 ME-----MSEDS----------------------IIAIVDKALFGTSHMYH--------- 186

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKGHLARLK 323
                        D        F+PEGLLS + SI   ++G + G VI    K +   ++
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIR 234

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
                G  +L  G  L +    P+NK++++ ++V VT G A+L  + +  ++DI   K  
Sbjct: 235 NIFIFGTIILFAGFLLSY--GCPINKKIWSSTFVLVTCGFASLFLALLIWIIDINGKKKW 292

Query: 384 FLPLAWIGMNAMLVYVMA 401
            L     G+N + +YV  
Sbjct: 293 TLFFESFGINPLYLYVQG 310


>gi|373460170|ref|ZP_09551926.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
 gi|371956555|gb|EHO74341.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
          Length = 359

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 170/381 (44%), Gaps = 89/381 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ +RL SLD+ RG+ V LMILV++  G+  +  + H+ WNG    D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGMTVCLMILVNNGAGEHIYSTLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 88  IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG--VDVRMIRLC 142
             L+LK+     +R  A  K+  RT+ L   G+     F +  D L  G  +D   +R+ 
Sbjct: 61  TFLSLKQTNFAWNRQTAC-KIAKRTVLLFAIGL-----FINWFDLLLQGRALDFEHLRIW 114

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GV+QRIA+ Y     V +F   +  K                 L+A  ++   + L+ G 
Sbjct: 115 GVMQRIAICY---GAVSVFALSINHKRTLP-------------LIATLLIAYAMFLMLGN 158

Query: 203 -YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            Y  D Q                             N +  ID  + G  H+YH      
Sbjct: 159 GYAYDSQ----------------------------QNLIAQIDIHLFGQAHLYH------ 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                  +P +PEGL SS+ +I  T+IG + G ++   +    +
Sbjct: 185 ----------------------KSPVDPEGLASSLPAIAHTLIGFYCGRLMAMARTTEEK 222

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           + +++ +G  L++ G    F   +PLNK++++ SYVC+T G AA     +  ++D+  + 
Sbjct: 223 VLKFMLVGGVLVLIGYLASF--GLPLNKRIWSPSYVCITCGLAATCLGLLMYVIDMKGVS 280

Query: 382 YPFLPLAWI-GMNAMLVYVMA 401
              L    + G N + +YV++
Sbjct: 281 RSRLTFFLVFGTNPLFLYVVS 301


>gi|294895713|ref|XP_002775269.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239881343|gb|EER07085.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 323

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/392 (26%), Positives = 152/392 (38%), Gaps = 103/392 (26%)

Query: 41  FRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRAD 100
            RG+ +++M++VD  G   P I HAPWNG +LAD VMP F+FI                 
Sbjct: 1   MRGVVMSIMLIVDVCGKAVPSIGHAPWNGLHLADIVMPGFIFI----------------- 43

Query: 101 AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
                                      D LT G+D+   R  G+LQRIA+ Y    L+  
Sbjct: 44  ---------------------------DTLTVGLDLYTFRAPGILQRIAVCYAAAVLLAK 76

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
              D+   D   G               + VLVV L  +    V +W   ++        
Sbjct: 77  LVSDLSPNDTVKGALK----------NNSRVLVVGLLCI----VINWAIMLLGPQPKGC- 121

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
                    R  L P CN    IDR V G  HMY+ P W                     
Sbjct: 122 --------PRGSLTPQCNVASNIDRMVFGPEHMYN-PLW--------------------- 151

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
                  +PEGLLS++ S+ +  +G+  G  I     H   L + V  G  L + G+ L 
Sbjct: 152 -------DPEGLLSTLPSLATVALGLACGKFIQSRPSH-TELLRLVGCGLLLDLCGMGLG 203

Query: 341 FTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
               IP++K L+T SY C+ +G   + F  I +     N+     P  W+GMNA+  + +
Sbjct: 204 IV--IPVSKVLWTPSY-CLLTGGICVAFLGIVSSRVGGNVV--LAPFKWLGMNAISFFCL 258

Query: 401 A-AEGIFAGFINGWYYGDPHNTLVCFLFIISY 431
           +   G+F+  +   Y  DP    V ++    Y
Sbjct: 259 SDCSGLFSCLLGSIYVADPTTGNVLYVLTKGY 290


>gi|325299496|ref|YP_004259413.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
 gi|324319049|gb|ADY36940.1| hypothetical protein Bacsa_2393 [Bacteroides salanitronis DSM
           18170]
          Length = 356

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 171/376 (45%), Gaps = 87/376 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           QRL SLD+ RG+ V  MI+V++AGG++    + H+ WNG    D V PFFLFI+G++  +
Sbjct: 3   QRLLSLDVLRGITVFGMIVVNNAGGEYSYDSLRHSVWNGLTPCDLVFPFFLFIMGISTYI 62

Query: 91  ALKRI---PDRADAVKKVIFRTLK--LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           AL++    P  A  ++K++ RTL   L+ WGI     F+   D L +      IR+ GVL
Sbjct: 63  ALRKFQFQPSPA-VLRKIVRRTLLIFLIGWGIYWFE-FACEGDFLPFA----HIRILGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
            RIAL Y +VSL+ ++   V+ K  +             W+    +L   L L +G    
Sbjct: 117 PRIALCYGIVSLLALY---VRPKGLA-------------WIAGILLLGYALLLQWG---- 156

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                  N  + D                   N +   D KVLG  H+YH          
Sbjct: 157 -------NGYAMD-----------------STNILAIWDTKVLGYEHLYHK--------- 183

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                              +P +PEGL S++++I  TIIG   G +I+ T+    ++ + 
Sbjct: 184 -------------------SPVDPEGLTSTLAAIAHTIIGFLCGRLIMETQDLGQKIVKL 224

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
              GF L   G  L     +PLNK++++ SY   T G AA++ + +   +D    K    
Sbjct: 225 FVAGFLLAAAGFLL--VEWLPLNKRIWSPSYALATCGMAAMLQATLLYFIDAQGKKRWCR 282

Query: 386 PLAWIGMNAMLVYVMA 401
                G+N + +YV++
Sbjct: 283 IFEVFGVNPLFLYVLS 298


>gi|336412607|ref|ZP_08592960.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942653|gb|EGN04495.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
           3_8_47FAA]
          Length = 361

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 177/375 (47%), Gaps = 80/375 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RF---------MPLAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG +D  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGIVDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
            +G  LL  G  L +  A PLNK+L++ S+V +T G AAL  + +  ++D+   K  F  
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIAALSLALLLYIIDVKQNKKWFSF 288

Query: 387 LAWIGMNAMLVYVMA 401
               G N +++YV +
Sbjct: 289 FEAFGANPLVIYVFS 303


>gi|381188372|ref|ZP_09895934.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
           frigoris PS1]
 gi|379650160|gb|EIA08733.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
           frigoris PS1]
          Length = 430

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 114/425 (26%), Positives = 176/425 (41%), Gaps = 124/425 (29%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +  QR+ SLD+ RG+ + +M+LV++ G  W  +     HA WNGC   D V PFF+F++G
Sbjct: 1   MTKQRIISLDVLRGITIMMMVLVNNPG-SWDNVFAPLEHANWNGCTPTDLVFPFFIFVLG 59

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGI---------------------------- 117
            AI LA+        +  K++ R+L+++  G+                            
Sbjct: 60  AAIPLAILTKELNQQSFLKILTRSLRIISLGLFLGFYGKIEIFNLVGYPLLISKLIITGI 119

Query: 118 ---LLQGGFSHAPD-------------ELTYGVDV-RMIRLCGVLQRIALSYLLVSLVEI 160
              +L G F                      G+     +RL GVLQRI + Y   SLV +
Sbjct: 120 VAYILLGNFKQKIKFSLVLTLFFVFVFLAFSGITAYNEVRLPGVLQRIGIVYFFTSLVYL 179

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
            T   + K Q +              +   +LV Y A +    VPD+             
Sbjct: 180 KT---EIKGQII--------------IIGLLLVGYWATMTLIPVPDFG------------ 210

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
                     A LN   N  G+ID  +L  NH+     W  SK                 
Sbjct: 211 ---------PANLNKGTNLAGWIDNLLLK-NHL-----WSFSKT---------------- 239

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGL 337
                 ++PEG+LS++ +I S IIG+  G ++      LA+ ++ + M   G AL+I GL
Sbjct: 240 ------WDPEGILSTIPAIASGIIGLLVGQLL---NSSLAKKEKGLKMFGAGLALVISGL 290

Query: 338 TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLV 397
              +    PLNK L+T S+V  T+G A L  +A Y  +DI   K    P+   G+N M+V
Sbjct: 291 I--WNEFFPLNKSLWTSSFVLYTAGFATLFLAAFYYAIDIKGYKNWTKPILVWGVNPMIV 348

Query: 398 YVMAA 402
           + ++ 
Sbjct: 349 FFLSG 353


>gi|440791267|gb|ELR12512.1| transmembrane protein [Acanthamoeba castellanii str. Neff]
          Length = 825

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 101/435 (23%), Positives = 175/435 (40%), Gaps = 93/435 (21%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
            + D  +  + +   + R+ SLD  RGLA+A+MI V++ GG +   +H+ WNG  +AD V
Sbjct: 381 GQADRQNGPKPAPRVSSRVNSLDAVRGLAIAIMIFVNYGGGGYWFFNHSAWNGITVADLV 440

Query: 77  MPFFLFIVGVAIALALKRIPDRADA-------------VKKVIFRTLKLLFWGILLQGGF 123
            P+F++I+G ++A++   +  +                +  V+ R         ++    
Sbjct: 441 FPWFIWIMGTSMAISFTSLEKKLLGLFQNNGYEWETWRIPGVLMRFAVAYLVVGVVVLFV 500

Query: 124 SHAPDELTYGVDVRMIR-------LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR-- 174
              P  LTY +  R            G    I       S    F +D    D    R  
Sbjct: 501 PRWPWRLTYRIYRRFTSGGHHRHAADGAASPILERKRFASEAINFDEDTDKSDDGFSRNV 560

Query: 175 ------------------FSIFR--LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
                             +++F   L  W HWL+A  +L VY  + +   VP        
Sbjct: 561 FAAEDDESCENMMKDKWVYTLFGDILPFWPHWLVAFSLLFVYFMITFFLDVPG------- 613

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCN-----AVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                        CG R  L P  +     A GYID+K+   +H+Y+ P  +        
Sbjct: 614 -------------CG-RGYLGPDISTATGGAAGYIDKKIFTEDHIYNQPTCQ-------- 651

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW--- 325
                      P +    ++PEG L +++SI    +G+  G  ++  K H  R+ +W   
Sbjct: 652 -----------PLYLTGSYDPEGTLGNLTSIFMVFLGLQSGRTLMAWKDHKHRVVRWYIW 700

Query: 326 -VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
            + +GF  L            PLNK L++ S++  T+  A  + +  Y L+D++ + +  
Sbjct: 701 SIVLGFIALGLCEAKQNGGFFPLNKNLWSPSFILATAAMAFFLLATFYLLIDVFPI-WSG 759

Query: 385 LPLAWIGMNAMLVYV 399
            P  +IGMN +L+Y+
Sbjct: 760 SPFRFIGMNPILIYL 774


>gi|386819709|ref|ZP_10106925.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
 gi|386424815|gb|EIJ38645.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
          Length = 366

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 108/380 (28%), Positives = 168/380 (44%), Gaps = 83/380 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMI+V+  G     +  + HAPWNG  L D V P FLF+VG A++
Sbjct: 6   ERYLSLDVFRGLTLFLMIIVNTPGSWSFIYKPLHHAPWNGFTLTDLVFPTFLFVVGNAMS 65

Query: 90  LALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            +LK+  +  +   +KKV  RT  +   G LL   F    D       +   R+ GVLQR
Sbjct: 66  FSLKKFEEIGNTAFLKKVFKRTFLIFLIGFLLY-WFPFFKDGALK--PISETRIFGVLQR 122

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IAL Y   +L+  +      K +    FS+  L  +H ++          LL+G      
Sbjct: 123 IALCYCFAALILHYW-----KPKGALIFSVIALVGYHIIL----------LLFG------ 161

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                                    L    NA    D  ++G +HMY    +        
Sbjct: 162 ------------------------DLTMQGNAAIKADLWLIGSSHMYKGEGF-------- 189

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                             PF+PEG+LS++ +I++ I G +F  V +  KG        +T
Sbjct: 190 ------------------PFDPEGVLSTLPAIVNVIAG-YFAGVFLQQKGKTYEAIAKLT 230

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
           M   +LIF L L +   +P+NK+L+T SYV +T G   +V S +  +++I   +      
Sbjct: 231 MVGGVLIF-LALGWDLLLPINKKLWTSSYVLLTVGIDVMVLSILVFVIEILKKRSWTYFF 289

Query: 388 AWIGMNAMLVYVMAAEGIFA 407
              G N + +Y+++  G+FA
Sbjct: 290 EVFGKNPLFIYILS--GVFA 307


>gi|116789271|gb|ABK25182.1| unknown [Picea sitchensis]
          Length = 124

 Score =  108 bits (271), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 48/91 (52%), Positives = 66/91 (72%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
           +  TT    +I+ +   +   + ++    R+A+LD+FRGL +A+MILVD AGG WP+I+H
Sbjct: 11  QPSTTESSNVIVIQDGQTIPAKPTNETKTRVATLDVFRGLTIAVMILVDDAGGKWPQINH 70

Query: 65  APWNGCNLADFVMPFFLFIVGVAIALALKRI 95
           +PWNGC LADFVMPFFLFIVGVA+AL  K +
Sbjct: 71  SPWNGCTLADFVMPFFLFIVGVAVALTFKVV 101


>gi|54294550|ref|YP_126965.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
 gi|53754382|emb|CAH15866.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
          Length = 372

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 111/430 (25%), Positives = 178/430 (41%), Gaps = 99/430 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R +       +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKEKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+      H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSAPHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D   +K   
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLIDRLGVKKWS 288

Query: 385 LPLAWIGMNAMLVYVM-----------------AAEGIFAGFINGWYYGDPHNTLVCFLF 427
           +     GMNA+  +V                   ++     ++  +++G   N     L+
Sbjct: 289 VFFKIFGMNALFAFVFHVLLLKLQYVFKITTPDGSKMALISYLKDYFFGGFSNHNAALLY 348

Query: 428 IISYILHSFL 437
            IS++  +FL
Sbjct: 349 SISFLFLNFL 358


>gi|409990365|ref|ZP_11273749.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
           Paraca]
 gi|291567406|dbj|BAI89678.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938771|gb|EKN80051.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
           Paraca]
          Length = 378

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 165/385 (42%), Gaps = 89/385 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V P FL IVGVAIA 
Sbjct: 9   RLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAQWHGCTPTDVVFPSFLLIVGVAIAF 68

Query: 91  ALKRIPDR----ADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +L +         D V   ++    R   LLF   L+  GF +         D+  IR+ 
Sbjct: 69  SLSKFSPEHRLGGDGVPPSVYSRIGRRCLLLFLLGLILNGFPN--------YDLANIRIM 120

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA++Y L ++  +     Q                  WL++   L+ Y   +   
Sbjct: 121 GVLQRIAIAYGLSAIAILNLSRRQ-----------------LWLISIFTLIGYWLAMTMI 163

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            VP +                         L+P  N   +ID+ +LG +H+         
Sbjct: 164 PVPGYS---------------------PGNLSPEGNLGAFIDQTILGSHHL--------- 193

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                              W   P++PEGL S+  + ++ IIG   G  +     +   +
Sbjct: 194 -------------------WRGGPYDPEGLFSTAPATVTVIIGYLTGEWLKSQPRNSLTV 234

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
              V    + L+ G         P+NK L+T SYV VT+G   L+ +  Y ++++ N + 
Sbjct: 235 INLVMFALSSLVVGYLWGIW--FPINKALWTSSYVLVTAGWGLLLLAFCYGVIEVKNWRR 292

Query: 383 PFLPLAWIGMNAMLVYVMAAEGIFA 407
              P   +G+NA+ ++V  A G+ A
Sbjct: 293 WGKPFEIMGVNAIFLFV--ASGLLA 315


>gi|52841889|ref|YP_095688.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|378777523|ref|YP_005185961.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
 gi|52629000|gb|AAU27741.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|364508338|gb|AEW51862.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
          Length = 372

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 111/430 (25%), Positives = 179/430 (41%), Gaps = 99/430 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++ L               +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVVLF--------LLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I       +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLISPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D + +K   
Sbjct: 229 YLLAGIGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLIDRFGVKKWS 288

Query: 385 LPLAWIGMNAMLVYVM-----------------AAEGIFAGFINGWYYGDPHNTLVCFLF 427
           +     GMNA+  +V                   ++ +   ++  +++G   N     L+
Sbjct: 289 VFFKVFGMNALFAFVFHVLLLKLQYAFKITTPDGSKMVLISYLKDYFFGGFSNHNAALLY 348

Query: 428 IISYILHSFL 437
            I ++  +FL
Sbjct: 349 SICFLFLNFL 358


>gi|294674520|ref|YP_003575136.1| hypothetical protein PRU_1851 [Prevotella ruminicola 23]
 gi|294472648|gb|ADE82037.1| putative membrane protein [Prevotella ruminicola 23]
          Length = 357

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 169/375 (45%), Gaps = 83/375 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPE-ISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++++RL SLDI RG+ VA MILV++  G+  E + H+ WNG    D V PFFLFI+G++ 
Sbjct: 1   MESKRLLSLDILRGITVAGMILVNNGWGESFEMLRHSKWNGMTPCDLVFPFFLFIMGISC 60

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHA--PDELTYGVDVRMIRLCGV 144
            L+L +         +++++ RT+ L   G+ +   F HA   D L +G     +R+  V
Sbjct: 61  YLSLVKSEFKPTPQVIRRIVKRTVLLFAIGLFIN-WFDHAIEGDLLCFG----HLRIWAV 115

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QRIAL Y +VSL                    F L+C H                    
Sbjct: 116 MQRIALCYGIVSL--------------------FALFCNH-------------------- 135

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             +  ++I    A Y  +  +  G     N   N +   D K+ G +H+YH         
Sbjct: 136 -KYTLSVIGGLLAIYTAILILGNGYAEDAN--VNVLAQADLKLFGYDHIYH--------- 183

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                               +P +PEGL+ ++SS+   ++G + G +I   +    ++  
Sbjct: 184 -------------------KSPVDPEGLMGTISSVAHVLLGFYCGMLIRKRETVEQKVIA 224

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
              +G   +I G  L +   +PLNK++++ SYV +T G A+L+ + +  ++DI       
Sbjct: 225 LFVVGAVCVIGGYLLSY--GLPLNKRIWSPSYVLMTCGLASLMQALLMYVIDIQKKSGWT 282

Query: 385 LPLAWIGMNAMLVYV 399
                 G+NA+ +YV
Sbjct: 283 TFFHVFGVNALALYV 297


>gi|148359197|ref|YP_001250404.1| hypothetical protein LPC_1091 [Legionella pneumophila str. Corby]
 gi|296107241|ref|YP_003618941.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
           Alcoy]
 gi|148280970|gb|ABQ55058.1| conserved hypothetical protein [Legionella pneumophila str. Corby]
 gi|295649142|gb|ADG24989.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
           Alcoy]
          Length = 372

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 161/375 (42%), Gaps = 82/375 (21%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI+V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D   +K   
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLIDRLGVKKWS 288

Query: 385 LPLAWIGMNAMLVYV 399
           +     GMNA+  +V
Sbjct: 289 VFFKIFGMNALFAFV 303


>gi|397664114|ref|YP_006505652.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
 gi|395127525|emb|CCD05722.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
          Length = 372

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 160/375 (42%), Gaps = 82/375 (21%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D   +K   
Sbjct: 229 YLLAGGGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLIDSLGVKKWS 288

Query: 385 LPLAWIGMNAMLVYV 399
           +     GMNA+  +V
Sbjct: 289 VFFKIFGMNALFAFV 303


>gi|119491291|ref|ZP_01623345.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
 gi|119453455|gb|EAW34617.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
          Length = 371

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 170/378 (44%), Gaps = 83/378 (21%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+A+A MILV++ G     +P + HA W+G    D V P FLFIVGVA+  
Sbjct: 2   RLTSLDVFRGIAIASMILVNNPGSWNHVYPLLKHAEWHGYTPTDLVFPSFLFIVGVAMTF 61

Query: 91  AL-KRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ K +P+  +  +    K+  R L+      LL    +  P+      D+  IR+ GVL
Sbjct: 62  SMSKYLPENRNLEENISPKIYLRILRRCLILFLLGLLLNGYPN-----YDLANIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+L+Y L ++  +     Q    S+G                 +L+ Y  ++    VP
Sbjct: 117 QRISLAYGLSAITILHLSRKQIWGLSIG-----------------LLIGYAVVMQLIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           +                     GV   L P  N   Y+DR +LG +H+            
Sbjct: 160 N--------------------SGV-VNLTPEGNFAAYLDRLILGEHHLL----------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                ++PEGLLS++ ++++ +IG   G+ +     +       
Sbjct: 188 -----------------GGGKYDPEGLLSTLPAVVTVLIGYLTGNWLKKQPINSQTSLNL 230

Query: 326 VTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           V +G   +I G   H    I P+NK L+T SYV VT+G A ++ +A Y L+++   +   
Sbjct: 231 VIIGLCNIIVG---HLWGLIFPINKSLWTSSYVLVTAGWALVLLAACYELIEVRQQQKWG 287

Query: 385 LPLAWIGMNAMLVYVMAA 402
            P   +G+N++ ++V + 
Sbjct: 288 FPFEVMGLNSIFLFVASG 305


>gi|397667386|ref|YP_006508923.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
 gi|395130797|emb|CCD09044.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
          Length = 372

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 160/375 (42%), Gaps = 82/375 (21%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D   +K   
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLIDRLGVKKWS 288

Query: 385 LPLAWIGMNAMLVYV 399
           +     GMNA+  +V
Sbjct: 289 VFFKIFGMNALFAFV 303


>gi|375146803|ref|YP_005009244.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361060849|gb|AEV99840.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 376

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 171/389 (43%), Gaps = 97/389 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + +R  +LD+FRG+ +  MI+V+ +      +  + HA W+G    D V P FLF VG A
Sbjct: 3   QQKRFLALDVFRGMTICFMIIVNTSPDGSHTYAPLLHAQWHGFTPTDLVFPSFLFAVGNA 62

Query: 88  IALALKRIPDRADA--VKKVIFRTLKLLFWGILL-----------QGGFSHAPDELTYGV 134
           ++  + R  + +    + K++ RTL +   G L+            G ++  P E T   
Sbjct: 63  MSFVMPRWENASTGFVLGKILKRTLLIFILGYLMYWFPFVRMDKVTGVYAFYPFEKT--- 119

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
                R+ GVLQRIAL+Y   SL+  F K              FR      ++ A +L++
Sbjct: 120 -----RVFGVLQRIALAYCFASLMLYFLK--------------FRATI---IITAAILLL 157

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y  +LY              DSAD              L+   NAV  +D  +LG +H+Y
Sbjct: 158 YWPVLY-----------FFGDSAD-------------PLSLAGNAVLKLDLWLLGPDHLY 193

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
           H                EG            PF+PEG LS+  +I + + G   G  +  
Sbjct: 194 HG---------------EG-----------VPFDPEGFLSTFPAIANVVGGYWVGRFLQQ 227

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTN-AIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             G    L + +  GFALL+     HF N + P+NK+L+T S+V  T G   L+ +AI  
Sbjct: 228 KGGTYEALTKLMLAGFALLVLA---HFWNYSFPINKKLWTSSFVLHTVGIDCLIIAAIVY 284

Query: 374 LVDIWNLKYPFLPLAWI-GMNAMLVYVMA 401
           + DI   K  + P   + G N + +Y+++
Sbjct: 285 IADI-QQKTSWTPFFQVFGKNPLFIYLLS 312


>gi|311747386|ref|ZP_07721171.1| membrane protein [Algoriphagus sp. PR1]
 gi|126579104|gb|EAZ83268.1| membrane protein [Algoriphagus sp. PR1]
          Length = 381

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 176/388 (45%), Gaps = 82/388 (21%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           K+ L   R  +LD+ RGL +A MI+V+ AG DW  +    +HA W+G    D V P FLF
Sbjct: 6   KTDLLKNRYLALDVLRGLTIAFMIVVNSAG-DWSNLYAPLAHAKWHGFTPTDLVFPTFLF 64

Query: 83  IVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRM 138
           +VG A++ ++K++ +   +   KKV  RTL +   G LL    F    +   +  +++  
Sbjct: 65  VVGNAMSFSMKKLQEMPTSAFFKKVGKRTLLIFLIGWLLNAFPFYDISETGNFSLINITE 124

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +RL GVLQRIAL Y   +++ ++   V+                  W+ +   L+ Y  +
Sbjct: 125 VRLFGVLQRIALCYFFAAII-LYIGGVR----------------LGWIFSGIALLTYWGI 167

Query: 199 LYGTYVPDWQFTIINKDSAD-YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +Y           +  DS+D YG                 NA   +D  ++G++ MY   
Sbjct: 168 MY-----------VFGDSSDPYGLT--------------GNAAIKLDLSLIGVDRMYGG- 201

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                         EG            PF+PEGLLS++ SI++ I G   G ++     
Sbjct: 202 --------------EG-----------IPFDPEGLLSTLPSIVNVIAGYIIGKMVQKYGN 236

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
            L  +K+ +     L++  L   +    P+NK+++T SYV +T G   ++ + +  ++++
Sbjct: 237 TLESIKKLLIGAVVLIV--LAYIWDIVFPINKKIWTSSYVLLTVGIDMVLLALLVYIIEL 294

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMAAEGI 405
             +K         G N +++YV  A GI
Sbjct: 295 QKVKNWTYFFEVFGRNPLILYV--ASGI 320


>gi|256420508|ref|YP_003121161.1| hypothetical protein Cpin_1463 [Chitinophaga pinensis DSM 2588]
 gi|256035416|gb|ACU58960.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 374

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/342 (29%), Positives = 155/342 (45%), Gaps = 78/342 (22%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVG 85
           ++   QR   LD+FRGL V  MI+V+  G D  +  ++HA WNGC   D V P FLF VG
Sbjct: 2   TNTTPQRFLPLDVFRGLTVCFMIIVNTPGWDTSYYILNHAQWNGCTPTDMVFPSFLFAVG 61

Query: 86  VAIALALKRIP--DRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRL 141
            A++ ++++    +    + K+  RTL +   G L+       H    L + + +   R+
Sbjct: 62  NAMSFSMRKFQQLENTAVLSKIFRRTLLIFLLGFLMYWLPFVRHTESGLEF-IPLSDTRI 120

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRIAL Y   SL+                         H+L    V  V   LL G
Sbjct: 121 LGVLQRIALCYCFASLLI------------------------HYLPKKAVWAVSAVLLLG 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            +   + F     D AD             + +   NA  + D+ ++G +H+YH      
Sbjct: 157 YWAVMYAF----GDPAD-------------RYSLTGNAALHFDKLIMGDSHLYHG----- 194

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                     EG             F+PEGLLS++ +I++ I G + G + I  +G   +
Sbjct: 195 ----------EG-----------IAFDPEGLLSTLPAIVNVIAGYYTG-LFIQQEGKTGK 232

Query: 322 -LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            L++ + MG ALLI  + L ++ A P+NK+L+T SYV  T G
Sbjct: 233 GLRKLLQMG-ALLIL-VALVWSMAFPINKKLWTSSYVLYTVG 272


>gi|418709516|ref|ZP_13270303.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|421125723|ref|ZP_15585968.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421135286|ref|ZP_15595410.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410020544|gb|EKO87345.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410436829|gb|EKP85940.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|410770179|gb|EKR45405.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|456966468|gb|EMG08068.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 381

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 109/410 (26%), Positives = 171/410 (41%), Gaps = 110/410 (26%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
           ++ L   R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFLF 
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61

Query: 84  VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R+
Sbjct: 62  VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI   Y                                W++A+  L++    +  
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
           +++P                +  V   +  +L PP  ++ Y          IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+     W+ SK                       ++PEG  S +SSI ++++GV  G  
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS- 216

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV---- 367
           I+ +K +  + +     GF +L   + L +   +P+NK L+T SYV  T+G A L     
Sbjct: 217 ILSSKTNETKKQILSIFGFGILFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIGFF 276

Query: 368 -FSAIYALVDIWN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F  +      WN   L+  F P    G NA+LV+V    G+ A  +N W
Sbjct: 277 EFLNLLLQTKKWNRLRLETIFQPFLVFGKNAILVFV--GSGLLARILNLW 324


>gi|433652541|ref|YP_007296395.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
 gi|433303074|gb|AGB28889.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
          Length = 394

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 163/366 (44%), Gaps = 81/366 (22%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVG 85
           + + +QRL SLD+ RGL V LMI V++  G+  + ++ H+ WNG  L D V PFFLFI+G
Sbjct: 7   TRMSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMG 66

Query: 86  VAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           V+  L+L++      A   +K+  RTL L   G+ +   F  A     +  D+  +R+ G
Sbjct: 67  VSTYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMG 123

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           V+QRIAL Y   +L+ +                     C  WL                 
Sbjct: 124 VMQRIALCYGATALIAVG--------------------CQRWLH---------------- 147

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
             D++       +        +  G     +   N +  +D+ VLG  H+YH        
Sbjct: 148 --DFRAMPAIIAALLGAYGALLLMGQGYAYDAAINLLSRVDQAVLGHAHLYH-------- 197

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII----HTKGHL 319
                                +P +PEGL+S+++++  T+ G +  H  +       G  
Sbjct: 198 --------------------KSPVDPEGLVSTLAAVAHTLAGFYVAHWALGPGRDGAGPA 237

Query: 320 AR--LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
           AR  + ++   G  L + GL L  +  +PLNK++++ SYVC++ G AAL+ + +  LVD+
Sbjct: 238 ARRSMMRFAAAGTVLAVVGLAL--SPLLPLNKRVWSPSYVCLSCGLAALLQALLILLVDL 295

Query: 378 WNLKYP 383
           W    P
Sbjct: 296 WRRPAP 301


>gi|410926267|ref|XP_003976600.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Takifugu rubripes]
          Length = 497

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/236 (31%), Positives = 123/236 (52%), Gaps = 22/236 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
            ++RL SLD FRG+++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +IAL
Sbjct: 278 SSKRLQSLDTFRGISLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFMFIMGTSIAL 337

Query: 91  A----LKRIPDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           +    L+    R   + KV +R+L+L   G+ ++   +   P  L++G     +R+ GVL
Sbjct: 338 SVHALLRTGSTRLSLLGKVAWRSLQLFLIGLFIINPNYCQGP--LSWGT----LRIPGVL 391

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QR+AL+YL+V+ +++       +  +    +   LY   W+    +  V+L L +   VP
Sbjct: 392 QRLALAYLVVACLDLLVARAHLEIYTTVSSTDVLLYWPAWVCVLLLESVWLFLTFLLPVP 451

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           D     +      D G   N T G          A G+IDR +LG  H+Y +P+ R
Sbjct: 452 DCPTGYLGPGGIGDMGLFPNCTGG----------AAGFIDRWLLGEKHIYQNPSSR 497


>gi|333030942|ref|ZP_08459003.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332741539|gb|EGJ72021.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 385

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/395 (25%), Positives = 169/395 (42%), Gaps = 94/395 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           + ++RL SLDI RG  +  MILV++ G     +  + HA W+G    D + PFF+F++G+
Sbjct: 1   MSSKRLLSLDILRGGTIIGMILVNNPGSWEYIYSPLRHAEWHGLTPTDLIFPFFIFVMGI 60

Query: 87  AIALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM---- 138
           +++L+  +  +         +KVI R+ KL      L G F      L  G++ R+    
Sbjct: 61  SMSLSFSKFKNEEYNKTLFWEKVIKRSAKL-----FLLGLFLSWFSLLLEGINNRLEYES 115

Query: 139 ----------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
                     IR+ GV+QR+ALSYL+ S+  +     +                 H ++ 
Sbjct: 116 ISEILFPFGQIRILGVMQRLALSYLVGSVFVMLIPKAK-----------------HLVIT 158

Query: 189 ACVLVV-YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           + +L++ Y  LL                    G  F+ +           N +  +D  +
Sbjct: 159 SVILLIAYFILL------------------SLGNGFSFSSD---------NIIAIVDNSL 191

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NH+Y    W                    P      F+PEGLLS++  I+  I+G  
Sbjct: 192 FGENHVYLE--W-------------------LPDGERLRFDPEGLLSTIPCIVQVIMGYL 230

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G VI   K  L ++     +G  LL  GL L +    PLNK++++ ++  VTSG A L 
Sbjct: 231 CGEVIRKKKDLLNKMMDLAIIGIVLLFIGLLLSY--GCPLNKKIWSPTFELVTSGFAVLA 288

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            + +  ++D   LK    P    G N + +Y+ ++
Sbjct: 289 LTLLIWIIDYKGLKKWCNPFEAFGTNPLFIYIASS 323


>gi|398341237|ref|ZP_10525940.1| hypothetical protein LkirsB1_18883 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 383

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 175/402 (43%), Gaps = 90/402 (22%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
           + KS     R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFL
Sbjct: 2   ENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFL 61

Query: 82  FIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           F VG++I L++  K    ++     +  R++ L+  G+ L      +  EL         
Sbjct: 62  FAVGISIQLSVYSKNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL--------- 112

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQRI   Y +V+ + +                  R+    W+    +L+V+  +L
Sbjct: 113 RIPGVLQRIGFVYWIVASLHLILPK--------------RMILISWI---PILLVHTWVL 155

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
                P         +S  Y             L P  +   +IDR V G NH+     W
Sbjct: 156 IQIPAPG--------ESIVY-------------LEPGKDIGAWIDRNVFGENHL-----W 189

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
           + SK                       ++PEGL S +SSI ++++GV  G  I+ +K + 
Sbjct: 190 KFSKT----------------------WDPEGLFSGISSIATSLLGVFCGS-ILSSKTNE 226

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL----- 374
            + +     GF +L+  + L +   +P+NK L+T SYV  T+G A L       L     
Sbjct: 227 IKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLNFLLR 286

Query: 375 VDIWNL---KYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
           +  W+    +  F P    G NA+LV+V    G+ A  +N W
Sbjct: 287 IKKWDQLQSEIIFQPFLVFGKNAILVFV--GSGLLARTLNLW 326


>gi|434387287|ref|YP_007097898.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
 gi|428018277|gb|AFY94371.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
          Length = 377

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 172/378 (45%), Gaps = 77/378 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG-----GDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            RL SLD+FRGL +A MILV+ A        +  + HAPW+G  +AD V PFFL+I+GV+
Sbjct: 1   MRLTSLDVFRGLTMATMILVNMASLPNDDRKYAWLDHAPWHGYTIADLVFPFFLYIIGVS 60

Query: 88  IALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRL 141
           +A +L +     +P       +++ R+  L   G++L    +++   E  +  ++  +R+
Sbjct: 61  MAFSLAKYTSGDVPLSKQVYWQILRRSAILFGLGLILNNLVWNYNLTEPKFFANLDKLRI 120

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI +++   S+  +                     C  W++   +LV Y  +L  
Sbjct: 121 MGVLQRIGIAFFFASIAVLALAQ----------------RC-LWILTGGILVGYWLIL-- 161

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++P         D++D         GV ++     N   Y+DR ++   H++     + 
Sbjct: 162 AFIPA-------LDNSD---------GVFSQFG---NFGAYVDRLIITPAHLH-----KG 197

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
           SK  +                     +PEGL S++ +I S + G   G  +         
Sbjct: 198 SKNLS---------------------DPEGLFSTLPAISSILFGYLTGTWLKRQPVATRT 236

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
               +  G A ++ GL   + +  P+NK+L+T S+V  T+G   +  +A Y LVD+   +
Sbjct: 237 SASLLMYGLAAVVIGLV--WNSFFPINKKLWTSSFVLFTTGWGLISLAACYELVDVRKYR 294

Query: 382 YPFLPLAWIGMNAMLVYV 399
             F P   +G+NA+ +YV
Sbjct: 295 QWFKPFEVMGLNAIFIYV 312


>gi|340347656|ref|ZP_08670761.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
           dentalis DSM 3688]
 gi|339608850|gb|EGQ13733.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
           dentalis DSM 3688]
          Length = 386

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/364 (27%), Positives = 162/364 (44%), Gaps = 81/364 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + +QRL SLD+ RGL V LMI V++  G+  + ++ H+ WNG  L D V PFFLFI+GV+
Sbjct: 1   MSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMGVS 60

Query: 88  IALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
             L+L++      A   +K+  RTL L   G+ +   F  A     +  D+  +R+ GV+
Sbjct: 61  TYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMGVM 117

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL Y   +L+ +                     C  WL                   
Sbjct: 118 QRIALCYGATALIAVG--------------------CQRWLH------------------ 139

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D++       +        +  G     +   N +  +D+ VLG  H+YH          
Sbjct: 140 DFRAMPAIIAALLGAYGALLLMGQGYAYDAAINLLSRVDQAVLGHAHLYH---------- 189

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII----HTKGHLAR 321
                              +P +PEGL+S+++++  T+ G +  H  +       G  AR
Sbjct: 190 ------------------KSPVDPEGLVSTLAAVAHTLAGFYVAHWALGPGRDGAGPAAR 231

Query: 322 --LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
             + ++   G  L + GL L  +  +PLNK++++ SYVC++ G AAL+ + +  LVD+W 
Sbjct: 232 RSMMRFAAAGTVLAVVGLAL--SPLLPLNKRVWSPSYVCLSCGLAALLQALLILLVDLWR 289

Query: 380 LKYP 383
              P
Sbjct: 290 RPAP 293


>gi|24213473|ref|NP_710954.1| hypothetical protein LA_0773 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45658672|ref|YP_002758.1| hypothetical protein LIC12842 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386073105|ref|YP_005987422.1| hypothetical protein LIF_A0631 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417764272|ref|ZP_12412242.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|417786789|ref|ZP_12434477.1| putative membrane protein [Leptospira interrogans str. C10069]
 gi|418669621|ref|ZP_13231000.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418701883|ref|ZP_13262801.1| putative membrane protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|418702896|ref|ZP_13263788.1| putative membrane protein [Leptospira interrogans serovar
           Hebdomadis str. R499]
 gi|418717763|ref|ZP_13277304.1| putative membrane protein [Leptospira interrogans str. UI 08452]
 gi|418729566|ref|ZP_13288113.1| putative membrane protein [Leptospira interrogans str. UI 12758]
 gi|421083731|ref|ZP_15544602.1| putative membrane protein [Leptospira santarosai str. HAI1594]
 gi|421102101|ref|ZP_15562711.1| putative membrane protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421121310|ref|ZP_15581607.1| putative membrane protein [Leptospira interrogans str. Brem 329]
 gi|24194245|gb|AAN47972.1|AE011263_12 conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45601916|gb|AAS71395.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353456894|gb|AER01439.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400353508|gb|EJP05677.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|409950064|gb|EKO04595.1| putative membrane protein [Leptospira interrogans str. C10069]
 gi|410345744|gb|EKO96814.1| putative membrane protein [Leptospira interrogans str. Brem 329]
 gi|410368246|gb|EKP23624.1| putative membrane protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410433648|gb|EKP77988.1| putative membrane protein [Leptospira santarosai str. HAI1594]
 gi|410754552|gb|EKR16202.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410759015|gb|EKR25234.1| putative membrane protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410767440|gb|EKR38115.1| putative membrane protein [Leptospira interrogans serovar
           Hebdomadis str. R499]
 gi|410775744|gb|EKR55735.1| putative membrane protein [Leptospira interrogans str. UI 12758]
 gi|410786933|gb|EKR80669.1| putative membrane protein [Leptospira interrogans str. UI 08452]
 gi|456824782|gb|EMF73208.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. LT1962]
          Length = 381

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/410 (26%), Positives = 170/410 (41%), Gaps = 110/410 (26%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
           ++ L   R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFLF 
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61

Query: 84  VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R+
Sbjct: 62  VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI   Y                                W++A+  L++    +  
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
           +++P                +  V   +  +L PP  ++ Y          IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+     W+ SK                       ++PEG  S +SSI ++++GV  G  
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS- 216

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV---- 367
           I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L     
Sbjct: 217 ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIGFF 276

Query: 368 -FSAIYALVDIWN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F  +      WN   L+  F P    G NA+LV+V    G+ A  +N W
Sbjct: 277 EFLNLLLQTKKWNRLRLETIFQPFLVFGKNAILVFV--GSGLLARILNLW 324


>gi|146302547|ref|YP_001197138.1| hypothetical protein Fjoh_4820 [Flavobacterium johnsoniae UW101]
 gi|146156965|gb|ABQ07819.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 423

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 110/422 (26%), Positives = 174/422 (41%), Gaps = 118/422 (27%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  +RL SLD+FRG  + LM +V++ G     +P + HA W+GC   D V PFF+FI+G 
Sbjct: 1   MTKERLTSLDVFRGFTILLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-------------------------- 120
           AI  A+           K++ R+L++   G+ L                           
Sbjct: 61  AIPFAMPVKHFDGAVFNKILVRSLRIFCLGLFLSVFSRIHLFGLEGIPLLGVRLVIAFAV 120

Query: 121 -----GGFS-HAPDELTYGVDVRMI-------------RLCGVLQRIALSYLLVSLVEIF 161
                G FS      L  G+ + ++             R+ GVLQRIA+ Y   S++ + 
Sbjct: 121 AYALLGNFSMKVKTILAVGIFIILLSLAFSGLEHFEDTRIPGVLQRIAIVYFFASILYLK 180

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
           T                 L    W++A+ +++ YL +    +VP   F   N D      
Sbjct: 181 T----------------NLKTQLWVVASILVIYYLLM---AFVPVPGFGPANFDKG---- 217

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
                           N   ++D  VL   H+     W  SK                 +
Sbjct: 218 ---------------TNLAAWLDNLVLN-GHL-----WSVSK-----------------T 239

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
           W     +PEG+LS++ +I + I+G++ G ++         LK+    G  LLI GL  + 
Sbjct: 240 W-----DPEGILSTLPAIGTGILGMYIGQLLNLQTNRTEILKKTAVTGVILLIGGLLWNI 294

Query: 342 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNL-KYPFLPLAWIGMNAMLVYVM 400
               P+NK L+T SYV  T+G A L  S +Y ++DI    K+  L L W G+N M+V+  
Sbjct: 295 I--FPINKSLWTSSYVLYTAGIATLCLSLLYYIIDIQGYKKWAKLFLIW-GVNPMIVFFF 351

Query: 401 AA 402
           + 
Sbjct: 352 SG 353


>gi|417760159|ref|ZP_12408187.1| putative membrane protein [Leptospira interrogans str. 2002000624]
 gi|417775681|ref|ZP_12423532.1| putative membrane protein [Leptospira interrogans str. 2002000621]
 gi|418673844|ref|ZP_13235155.1| putative membrane protein [Leptospira interrogans str. 2002000623]
 gi|409944118|gb|EKN89707.1| putative membrane protein [Leptospira interrogans str. 2002000624]
 gi|410574555|gb|EKQ37586.1| putative membrane protein [Leptospira interrogans str. 2002000621]
 gi|410579122|gb|EKQ46972.1| putative membrane protein [Leptospira interrogans str. 2002000623]
          Length = 381

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/410 (26%), Positives = 170/410 (41%), Gaps = 110/410 (26%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
           ++ L   R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFLF 
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61

Query: 84  VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R+
Sbjct: 62  VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI   Y                                W++A+  L++    +  
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
           +++P                +  V   +  +L PP  ++ Y          IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+     W+ SK                       ++PEG  S +SSI ++++GV  G  
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS- 216

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV---- 367
           I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L     
Sbjct: 217 ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIGFF 276

Query: 368 -FSAIYALVDIWN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F  +      WN   L+  F P    G NA+LV+V    G+ A  +N W
Sbjct: 277 EFLNLLLQTKKWNQLRLETIFQPFLVFGKNAILVFV--GSGLLARILNLW 324


>gi|418690664|ref|ZP_13251772.1| putative membrane protein [Leptospira interrogans str. FPW2026]
 gi|418722429|ref|ZP_13281595.1| putative membrane protein [Leptospira interrogans str. UI 12621]
 gi|400360164|gb|EJP16144.1| putative membrane protein [Leptospira interrogans str. FPW2026]
 gi|409963797|gb|EKO27519.1| putative membrane protein [Leptospira interrogans str. UI 12621]
 gi|455790461|gb|EMF42326.1| putative membrane protein [Leptospira interrogans serovar Lora str.
           TE 1992]
          Length = 381

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/410 (26%), Positives = 170/410 (41%), Gaps = 110/410 (26%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
           ++ L   R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFLF 
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61

Query: 84  VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R+
Sbjct: 62  VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI   Y                                W++A+  L++    +  
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
           +++P                +  V   +  +L PP  ++ Y          IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+     W+ SK                       ++PEG  S +SSI ++++GV  G  
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS- 216

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV---- 367
           I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L     
Sbjct: 217 ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIGFF 276

Query: 368 -FSAIYALVDIWN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F  +      WN   L+  F P    G NA+LV+V    G+ A  +N W
Sbjct: 277 EFLNLLLQTKKWNRLRLETIFQPFLVFGKNAILVFV--GSGLLARILNLW 324


>gi|418752318|ref|ZP_13308585.1| putative membrane protein [Leptospira santarosai str. MOR084]
 gi|409967313|gb|EKO35143.1| putative membrane protein [Leptospira santarosai str. MOR084]
          Length = 363

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 171/391 (43%), Gaps = 100/391 (25%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PFFLF VG +I ++L
Sbjct: 2   SLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60

Query: 93  --KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             K   +R+D    +  R+  L+  G+ L   G +S A            +R+ GVLQRI
Sbjct: 61  YSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-----------LRIPGVLQRI 109

Query: 149 ALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
              Y +V SL  +F+       + +  FS+  L    W++    L            P  
Sbjct: 110 GFVYWVVASLCLVFS------GKKILVFSVPILLIHTWILTQIAL------------PG- 150

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
           +  +  +   D G                     +IDR + G  H+     WR SK    
Sbjct: 151 ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL-----WRFSKT--- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEG LS V+S+++T+ GV  G ++        R ++   
Sbjct: 183 -------------------WDPEGFLSGVASVVTTLFGVLCGFIL------FLRERRNKI 217

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALV--DIWNLKY 382
           +G  +L   + L +  ++P+NK L+T SY   T+G + L    F  + +L+    WNLK 
Sbjct: 218 LGLGILFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLSFLSIWFFEYLSSLIISKGWNLKI 277

Query: 383 PFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F P    G NA+LV+V    GI A  +N W
Sbjct: 278 LFQPFLVFGKNAVLVFV--GSGILARTLNLW 306


>gi|410613391|ref|ZP_11324450.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           psychrophila 170]
 gi|410167053|dbj|GAC38339.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           psychrophila 170]
          Length = 400

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 108/400 (27%), Positives = 179/400 (44%), Gaps = 74/400 (18%)

Query: 15  IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGC 70
           ++S      Q    H+   RL +LD+FRG+ +  MILV++ G  W  I    +HA W+G 
Sbjct: 1   MMSALKAHCQSIYQHVPANRLLALDVFRGMTITAMILVNNPG-SWQYIYSPLAHAKWHGW 59

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG-------GF 123
            L D + PFF+FIVGV+I+L+ +R  ++      +I   L  +F  +LL          F
Sbjct: 60  TLTDLIFPFFIFIVGVSISLSGQRQKEQGLGHGHIIHHALLRMFKLLLLGCFLALFYYNF 119

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           S A D   +   +  +R  GVLQRIAL Y+   L+ +F   +Q             + C 
Sbjct: 120 S-AADYDWFTQRLMQMRFMGVLQRIALVYMACVLLWLFLSRLQ------------LVIC- 165

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
              M A ++  +LA+ +  Y  D               + N   G+    N   N   ++
Sbjct: 166 ---MLAILVAYWLAMAFIPYHDD---------------LGNQYVGLLEYAN---NLSAWL 204

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           D  +    H+Y+  A           PF               F+PEG+LS++ +I S +
Sbjct: 205 DNYLFAKTHLYYSSA----------QPFA--------------FDPEGVLSTLPAIASGL 240

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            GV  G  +  +   +    +W+ + G   L+ G    + + +P+NK L+T SYV +TSG
Sbjct: 241 SGVLAGQWLSFSHHSMRHKAKWLAICGVVALLLGQM--WAHWLPINKALWTSSYVLLTSG 298

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            AAL+ + +  ++DI   +    P    G N++  ++ A 
Sbjct: 299 YAALILAGLMYVLDIKQWRLWAAPFVVFGANSIAFFMFAG 338


>gi|404487027|ref|ZP_11022214.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335523|gb|EJZ61992.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
           YIT 11860]
          Length = 364

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/383 (27%), Positives = 175/383 (45%), Gaps = 84/383 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL SLD+ RG+ V  MILV++AG     +  + HA W+G   AD V P F+FI+GV+I
Sbjct: 7   SKRLVSLDVLRGITVCGMILVNNAGACGYAYAPLKHAKWDGFTPADLVFPAFMFIMGVSI 66

Query: 89  ALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSH-APDELTYGVDVRMIRLCGVLQ 146
            L+L +   D   ++ +++ RT+ +   G+ L+   +  A  E      +  +R+ GVLQ
Sbjct: 67  YLSLNKSNFDWRISIARILRRTVLIFMSGVALKWILAFIATGEYN---TLENLRIMGVLQ 123

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           R+ + Y +V+L+ +  +                    H L    + V    LL G Y+  
Sbjct: 124 RLGICYGIVALLAVTVR--------------------HRLFPTIIAV----LLVGYYLLQ 159

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                       +G  F    G         N V  +D  VLG +HMY   A        
Sbjct: 160 L-----------FGNGFEKCAG---------NIVSMVDYAVLGKSHMYLGGA-------- 191

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                                +PEG+LS++ +I   +IG   G VI+  K   +++ +  
Sbjct: 192 ------------------QFVDPEGVLSTIPAIAQVMIGFLCGKVIVGEKEIRSQIVKLA 233

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA-IYALVDIWNLKYPFL 385
             G ++ + G    +  A PLNK+L++ S+V VT G  +L+F+  IY + D    ++ F 
Sbjct: 234 VWGTSMFVIGYLWSY--AAPLNKRLWSPSFVLVTCGITSLIFATLIYIIDDSKRTRWSF- 290

Query: 386 PLAWIGMNAMLVYVMA--AEGIF 406
           P   +G+N + +Y+ +  A G+F
Sbjct: 291 PFLVVGVNPLSIYIFSEIAGGLF 313


>gi|374385780|ref|ZP_09643283.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
           12061]
 gi|373225482|gb|EHP47816.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
           12061]
          Length = 382

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 174/391 (44%), Gaps = 89/391 (22%)

Query: 30  LKTQ-RLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           +KT+ RL +LD+FRG+ +A MILV+  G     +  + HA WNG    D V PFF+FI+G
Sbjct: 1   MKTENRLLALDVFRGITIAGMILVNDPGSWSAVYAPLCHASWNGLTPTDLVFPFFMFIMG 60

Query: 86  VAIALALKRIPD--RADAVKKVIFRTLKLLF-------WGILLQGGF-SHAPDELTYG-- 133
           +++  +L+R        AV K IFR   L+F       W  L  G F S    E T+   
Sbjct: 61  ISMYFSLRRYNSLFSRGAVAK-IFRRAVLIFLIGLGINWFALWFGTFMSMGNGEFTFWER 119

Query: 134 -----VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
                  V  IR+ GVLQR+AL+YL  +++ +  +          R+ +F         A
Sbjct: 120 FTQNIFPVADIRILGVLQRLALAYLGGAILCLGIRP---------RYQLFT--------A 162

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             +LV Y  +L                    G+ F         +    N +  +DR VL
Sbjct: 163 VMILVGYFVILV------------------VGEGF---------IRSEHNILSVVDRAVL 195

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G+ H+Y                        A S     F+PEGLLS++      + GV  
Sbjct: 196 GVRHLYG---------------------GGASSGAGMAFDPEGLLSTLPCFAHVLFGVCM 234

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G ++   K +  R++Q    G  LL  G    ++ A P+NK++++ +YV ++ GAA+L+F
Sbjct: 235 GRMLGEVKENEIRIRQLFIFGTILLFAGYL--WSYACPVNKRIWSPTYVLISCGAASLLF 292

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           + +   +D+   K         G+N + +YV
Sbjct: 293 ALLIYWIDVKGYKRGCRFFEVFGVNPLFIYV 323


>gi|410941669|ref|ZP_11373463.1| putative membrane protein [Leptospira noguchii str. 2006001870]
 gi|410783218|gb|EKR72215.1| putative membrane protein [Leptospira noguchii str. 2006001870]
          Length = 381

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 122/461 (26%), Positives = 195/461 (42%), Gaps = 127/461 (27%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
            ++KS+    R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 1   MEKKSN--QNRILSLDLFRGMTVAGMILVNNPGSWSFIYTPLKHAKWNGCTPTDLVFPFF 58

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ--GGFSHAPDELTYGV 134
           LF+VG +I  +L         + K+ F    R++ L+  G+ L   G +S +        
Sbjct: 59  LFVVGTSIPFSLYS--KNKIYISKIWFGICIRSITLILIGLFLNFFGEWSFSK------- 109

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
               +R+ G+LQRI   Y                                W++A+  L++
Sbjct: 110 ----LRIPGILQRIGFVY--------------------------------WVVASLYLML 133

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------ID 244
              ++  +++P                +  V   V  ++ PP  ++ Y          ID
Sbjct: 134 PKRIILISWIP----------------ILIVHTWVLIQIPPPGESIVYLEPGKDIGAWID 177

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R V G NH+     W+ SK                       ++PEG  S +SSI +T++
Sbjct: 178 RNVFGENHL-----WKFSKT----------------------WDPEGFFSGISSIATTLL 210

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           GV  G  I+ +K +  + +     GF +L+  + L +   +P+NK L+T SYV  T+G A
Sbjct: 211 GVFCGS-ILSSKTNETKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLA 269

Query: 365 ALV-----FSAIYALVDIWN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYG 416
            L      F  +      WN   L+  F P    G NA+LV+V    G+ A  +N W   
Sbjct: 270 FLSIGFFEFLNLLLRAKKWNQLKLETIFQPFLVFGKNAILVFV--GSGLIARTLNLWTIV 327

Query: 417 DPHNTLVC----FLFIISYILHSFLWELRKFLYVQFCNLSW 453
             +  L+     F   +++I +S    L  F+Y  F  L W
Sbjct: 328 LENGKLISIKTFFYSKLNFIGNS---HLESFIYAMFNLLFW 365


>gi|326800650|ref|YP_004318469.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326551414|gb|ADZ79799.1| hypothetical protein Sph21_3257 [Sphingobacterium sp. 21]
          Length = 396

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 117/394 (29%), Positives = 171/394 (43%), Gaps = 97/394 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QR  SLD+FRG  VALMILV++ G     +  + HAPW+GC   D V PFFLF VG A++
Sbjct: 3   QRYYSLDVFRGATVALMILVNNPGSWSYAFSPLKHAPWHGCTPTDLVFPFFLFAVGNAMS 62

Query: 90  LALKRIPDRADAV--KKVIFRTLKLLF------WGILLQGGFSHAPDELTYGVDV----R 137
             + R+  +A  V  KKV+ RT+ +        W   +Q  +S+      Y ++      
Sbjct: 63  FVIPRLRTQAGKVFWKKVLKRTILIFLIGLLLNWYPFVQ--WSNDTLLFKYWINPIKSDS 120

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            IR+ GVLQRIAL Y   S++  F      K ++V   SIF L   +WL+  C+L+    
Sbjct: 121 GIRILGVLQRIALCYCFASILVYFF-----KTKTVVLISIFILLS-YWLI--CILL---- 168

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
                            DS  Y        G +            ID  +L I HMY   
Sbjct: 169 ----------------GDSDPYS--LQGWFGTK------------IDVSILQIAHMYKG- 197

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI----- 312
                         EG            PFEPEG+ S+ ++++  +IG   G  I     
Sbjct: 198 --------------EG-----------VPFEPEGIASTFTAVIQVVIGFLVGQYIQTGTK 232

Query: 313 -IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
            I     + R    + +  ALL  G  + +  A P+NK+++T SYV  +SG A      +
Sbjct: 233 AIEGPMLIYRTVSTLMVTAALLTLG-GMTWGLAFPINKKIWTSSYVLYSSGLAITALGVL 291

Query: 372 YALVDI---WNLKYPFLPLAWIGMNAMLVYVMAA 402
              V+I    N    F  +   G N + ++VM+A
Sbjct: 292 IWFVEIKGHKNFVTKFFDV--FGKNPLFIFVMSA 323


>gi|196002389|ref|XP_002111062.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
 gi|190587013|gb|EDV27066.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
          Length = 431

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 142/317 (44%), Gaps = 66/317 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +R+ SLD++RGL   +M   D  GG +    H+ WNG  + D V P F+FI G +++++L
Sbjct: 161 RRIRSLDLYRGLCAIVMAFGDSGGGQYRFFKHSIWNGLTIVDVVFPGFIFISGFSLSISL 220

Query: 93  KRIPDRADAVKKV-----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            +   +    K +     I R+  L F G+L+ G              +   RL GVLQR
Sbjct: 221 VKRLYKMQTPKLILIVTTIRRSFYLFFLGLLING-----------PCQISNWRLLGVLQR 269

Query: 148 IALSYLLVSLVEI--------FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           I++++L+VS + +        FTKD   +++ + +        W  ++    L  Y+ L 
Sbjct: 270 ISVTFLVVSCLAVWLYPTIKSFTKDQVLQEKVLRKM-------WPIMVLIVGLHTYVTLT 322

Query: 200 YGTYVPDWQFTIINK-DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
               VPD           +D GK +N T G+           G+IDR V G NH+Y  P 
Sbjct: 323 AA--VPDCPVGYSGPGGKSDDGKYYNCTGGI----------AGFIDRFVFGSNHLYSRPT 370

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT--K 316
            +     +Q                  PF+PEG+L +++SI    +G+  G  I+H    
Sbjct: 371 CKLLYQSSQ------------------PFDPEGVLGTLTSIFLCFLGLQMG--ILHNIFS 410

Query: 317 GHLARLKQWVTMGFALL 333
            +L  ++ W+  G  L+
Sbjct: 411 NNLRIMRTWILFGLLLV 427


>gi|417770421|ref|ZP_12418329.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|418680131|ref|ZP_13241383.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|421117858|ref|ZP_15578212.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|400328139|gb|EJO80376.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|409947562|gb|EKN97558.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|410010535|gb|EKO68672.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|455668600|gb|EMF33807.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Fox 32256]
          Length = 381

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 110/413 (26%), Positives = 171/413 (41%), Gaps = 116/413 (28%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
           ++ L   R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFLF 
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61

Query: 84  VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R+
Sbjct: 62  VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI   Y                                W++A+  L++    +  
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
           +++P                +  V   +  +L PP  ++ Y          IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+     W+ SK                       ++PEG  S +SSI ++++GV  G  
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS- 216

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV---- 367
           I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L     
Sbjct: 217 ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIGFF 276

Query: 368 -FSAIYALVDIWN------LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F  +      WN      +  PFL     G NA+LV+V    G+FA  +N W
Sbjct: 277 EFLNLLLQTKKWNRLRLETISQPFLVF---GKNAILVFV--GSGLFARILNLW 324


>gi|209523049|ref|ZP_03271606.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|376001698|ref|ZP_09779557.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
 gi|423062475|ref|ZP_17051265.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
 gi|209496636|gb|EDZ96934.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|375329927|emb|CCE15310.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
 gi|406716383|gb|EKD11534.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
          Length = 378

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 168/386 (43%), Gaps = 89/386 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V P FL I+GVAIA
Sbjct: 8   MRLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAEWDGCTPTDVVFPSFLLIMGVAIA 67

Query: 90  LALK------RIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
            +L       R+P      +V   I R   LLF   L   GF H         D+  IR+
Sbjct: 68  FSLSKFAREHRLPGEKVPPSVYSRIGRRCLLLFLLGLFLNGFPH--------YDLANIRI 119

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRIA++Y L ++  +     Q                  WL++   L+ Y   +  
Sbjct: 120 MGVLQRIAIAYGLTAIAILNLSRRQ-----------------LWLISILTLIGYWVAMTI 162

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
             VP             YG            L+P  N   +ID+ +LG +H+        
Sbjct: 163 IPVPS------------YGP---------GNLSPEGNLGAFIDQTILGSHHL-------- 193

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                               W   P++PEGL S+  + ++ I+G   G  +     +   
Sbjct: 194 --------------------WRGGPYDPEGLFSTAPATVTVILGYLTGEWLKSQPRNSFT 233

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           +   V    + L+ G    +    P+NK L+T S+V VT+G   L+ +  Y ++++ N +
Sbjct: 234 VINLVMFALSSLVVGYL--WGVWFPINKALWTSSFVLVTAGWGLLLLAFCYGVIEVKNWR 291

Query: 382 YPFLPLAWIGMNAMLVYVMAAEGIFA 407
               PL  +G+NA+ ++V  A G+ A
Sbjct: 292 RWGKPLEIMGVNAIFLFV--ASGLLA 315


>gi|418694540|ref|ZP_13255577.1| putative membrane protein [Leptospira kirschneri str. H1]
 gi|409957715|gb|EKO16619.1| putative membrane protein [Leptospira kirschneri str. H1]
          Length = 383

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 112/412 (27%), Positives = 178/412 (43%), Gaps = 110/412 (26%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
           + KS     R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFL
Sbjct: 2   ENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFL 61

Query: 82  FIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           F VG++I L++  K    ++     +  R++ L+  G+ L      +  EL         
Sbjct: 62  FAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL--------- 112

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQRI   Y                                W++A+  L++   ++
Sbjct: 113 RIPGVLQRIGFVY--------------------------------WIVASLHLILPKRMI 140

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLG 249
             +++P                +  V   V  ++ PP  ++ Y          IDR V G
Sbjct: 141 LISWIP----------------ILLVHTWVLIQIPPPGESIVYLEPGKDIGAWIDRNVFG 184

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
            NH+     W+ SK                       ++PEGL S +SSI ++++GV  G
Sbjct: 185 ENHL-----WKFSKT----------------------WDPEGLFSGISSIATSLLGVFCG 217

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL--- 366
             I+ +K +  + +     GF +L+  + L +   +P+NK L+T SYV  T+G A L   
Sbjct: 218 S-ILSSKTNEIKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIG 276

Query: 367 VFSAIYALVDI--WNL---KYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F  +  L+ I  W+    +  F P    G NA+LV+V    G+ A  +N W
Sbjct: 277 CFELLNFLLRIKKWDRLQSETIFQPFLVFGKNAILVFV--GSGLLARTLNLW 326


>gi|421109691|ref|ZP_15570204.1| putative membrane protein [Leptospira kirschneri str. H2]
 gi|410005185|gb|EKO58983.1| putative membrane protein [Leptospira kirschneri str. H2]
          Length = 383

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 110/412 (26%), Positives = 175/412 (42%), Gaps = 110/412 (26%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
           + KS     R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFL
Sbjct: 2   ENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFL 61

Query: 82  FIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           F VG++I L++  K    ++     +  R++ L+  G+ L      +  EL         
Sbjct: 62  FAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL--------- 112

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQRI   Y                                W++A+  L++   ++
Sbjct: 113 RIPGVLQRIGFVY--------------------------------WIVASLHLILPKRMI 140

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLG 249
             +++P                +  V   V  ++ PP  ++ Y          IDR V G
Sbjct: 141 LISWIP----------------ILLVHTWVLIQIPPPGESIVYLEPGKDIGAWIDRNVFG 184

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
            NH+     W+ SK                       ++PEGL S +SSI ++++GV  G
Sbjct: 185 ENHL-----WKFSKT----------------------WDPEGLFSGISSIATSLLGVFCG 217

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
             I+ +K +  + +     GF +L+  + L +   +P+NK L+T SYV  T+G A L   
Sbjct: 218 S-ILSSKTNEIKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIG 276

Query: 370 AIYAL-----VDIWNL---KYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
               L     +  W+    +  F P    G NA+LV+V    G+ A  +N W
Sbjct: 277 CFELLNFLLRIKKWDRLQSETIFQPFLVFGKNAILVFV--GSGLLARTLNLW 326


>gi|359460787|ref|ZP_09249350.1| hypothetical protein ACCM5_18818 [Acaryochloris sp. CCMEE 5410]
          Length = 383

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 173/376 (46%), Gaps = 88/376 (23%)

Query: 38  LDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
           LD+FRG+A+A M+LV+ +G     +P++ HA W+G  LAD V PFFL ++G ++A ++ R
Sbjct: 13  LDVFRGIAIAGMLLVNKSGLVKEAYPQLLHADWHGWTLADLVFPFFLVVLGASMAFSMAR 72

Query: 95  ------IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
                  P RA  + K++ R+  L   G+ L G +S          +   +R+ G+LQRI
Sbjct: 73  HTASLTQPKRAVYL-KILRRSAVLFGLGLFLNGFWSF---------NFSTLRVMGILQRI 122

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-LALLYGTYVPDW 207
           +L+YL  + V +    +  K Q              W +   +LV Y LAL   +++P  
Sbjct: 123 SLTYLASAFVIL---KLPRKSQ--------------WGLTGLLLVGYWLAL---SFIPVP 162

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
           +F   N                   L    N   YIDR ++G +H+Y    +        
Sbjct: 163 EFGPGN-------------------LTRTGNFGAYIDRLIIGSSHLYVGDQFNSMG---- 199

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK---- 323
                               +PEGL S++ +I + ++G +F    I  +G   ++K    
Sbjct: 200 --------------------DPEGLFSTLPAIATVLLG-YFAGDWIRKRGSGLKIKTSRQ 238

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
                 + L   GL L ++   P+NK+L+T SYV  T G A ++ +  Y L+++  ++  
Sbjct: 239 SLALASYGLFSTGLGLLWSIWFPINKKLWTSSYVLFTVGIALILLAVCYELIEVRRIRLW 298

Query: 384 FLPLAWIGMNAMLVYV 399
             P   +G+N++ V++
Sbjct: 299 SKPFEVLGLNSIAVFM 314


>gi|237720190|ref|ZP_04550671.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|293371122|ref|ZP_06617659.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|229450742|gb|EEO56533.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|292633780|gb|EFF52332.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 371

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 165/389 (42%), Gaps = 96/389 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLF------WGILLQGG----FSHAPDELTYG 133
           V+++ AL R         + K++ RTL L        W  L+  G    FSH        
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTLILFLLGLFLSWFSLVCTGVEQPFSH-------- 111

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                IR+ GVLQR+AL+Y   SL+ +  +   +               W   ++  +L 
Sbjct: 112 -----IRILGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILA 150

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y  LL                    G  F ++           N +   DR + G  H+
Sbjct: 151 GYSTLL------------------ALGHGFELS---------EQNIIAVTDRTLFGEAHL 183

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y                     R+  P      F+PEGLLS++  I   IIG   G+++ 
Sbjct: 184 Y---------------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILR 222

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  RL Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  
Sbjct: 223 EKTEIHHRLLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTW 280

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           L+DI   +    P    G N + +Y++A 
Sbjct: 281 LIDIRKKQKWGYPFHVFGTNPLFIYIVAG 309


>gi|393785792|ref|ZP_10373938.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
           CL02T12C05]
 gi|392661411|gb|EIY54997.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
           CL02T12C05]
          Length = 361

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 172/386 (44%), Gaps = 80/386 (20%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             RL SLD+ RG  VA MILV++AG     +  + HA W+G   AD V P F+F++G++ 
Sbjct: 4   NNRLLSLDVLRGFTVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L++   +   A+ K+I R L L+  GI ++   + +  E     D   +RL GV+QR
Sbjct: 64  YISLRKYDFQWRLAIGKIIKRALLLILIGIAMKWIINSS--ETGIWTDWEHMRLLGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           + + Y   +++ +F            RF                  V L LL G ++   
Sbjct: 122 LGICYGATAIMALFIPH--------KRF----------------FPVALLLLAGYFILQL 157

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                       G  F            P N +  +D  VLG NHMY            Q
Sbjct: 158 -----------IGNGFE---------KSPDNIIAIVDSTVLGTNHMY-----------LQ 186

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
              F                EPEG+LS++ +I   +IG   G +II+ K +  R+++   
Sbjct: 187 GRQF---------------VEPEGILSTIPAIAQVMIGFVCGRMIINQKDNKERMQKLFF 231

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
           +G  +L  G    F+ A PLNK+L++ S+V +T G AAL  +A+  ++D+   K      
Sbjct: 232 LGTLMLFAGFL--FSYACPLNKRLWSPSFVLLTCGIAALALAALIEIIDVRQKKRWCTFF 289

Query: 388 AWIGMNAMLVYVMAAEGIFAGFINGW 413
              G+N +++YV A   IF   +  W
Sbjct: 290 NVFGVNPLVLYVFAE--IFGDLLRIW 313


>gi|395804141|ref|ZP_10483382.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
 gi|395433785|gb|EJF99737.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
          Length = 423

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 114/441 (25%), Positives = 180/441 (40%), Gaps = 120/441 (27%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  +RL SLD+FRG  + LM +V++ G     +P + HA W+GC   D V PFF+FI+G 
Sbjct: 1   MTKERLTSLDVFRGFTIFLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-------------------------- 120
           AI  A+           K++ R+L++   G+ L                           
Sbjct: 61  AIPFAMPVKHFDGSVFNKILVRSLRIFCLGLFLSVFSRIHLFGLEGIPLLGLRLIVAFAV 120

Query: 121 -----GGFS-HAPDELTYGVDVRMI-------------RLCGVLQRIALSYLLVSLVEIF 161
                G FS      L  GV + +I             R+ GVLQRIA+ Y   S++ + 
Sbjct: 121 AYALLGNFSMKVKTILAVGVFLILISLSFSGLEHFEDTRIPGVLQRIAIVYFFTSILYLK 180

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
           T     K Q +    +   Y   WL+ A             +VP   F   N D      
Sbjct: 181 T---NLKTQLLVLAGLLVGY---WLLMA-------------FVPVPGFGPANFDKG---- 217

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
                           N   +ID                       D+   G L   + +
Sbjct: 218 ---------------TNLAAWID-----------------------DTLLNGHLWASSKT 239

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
           W     +PEG+LS++ +I + I+G++ G ++  +   +  +K+    G AL+I GL  + 
Sbjct: 240 W-----DPEGILSTLPAIGTGILGMYIGQLLNLSVDKMEIVKKTAIAGTALVIGGLIWNI 294

Query: 342 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI-WNLKYPFLPLAWIGMNAMLVYVM 400
               P+NK L+T SYV  T+G A L  + +Y ++DI  + K+  L L W G+N M+V+  
Sbjct: 295 --FFPINKSLWTSSYVLYTAGIATLCLTLLYYIIDIKGHKKWTKLFLIW-GVNPMIVFFF 351

Query: 401 AAEGIFAGFINGWYYGDPHNT 421
           +  GI    ++     DP  T
Sbjct: 352 S--GIIPRVLSAIKVADPEKT 370


>gi|333378336|ref|ZP_08470067.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
           22836]
 gi|332883312|gb|EGK03595.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
           22836]
          Length = 387

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 97/404 (24%), Positives = 174/404 (43%), Gaps = 97/404 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD+ RG+ +A MILV++ G  W  I     HA WNG    D + PFF+FI+G++
Sbjct: 6   SSRLLSLDVLRGITIAGMILVNNPG-SWGHIYAPLRHAEWNGLTPTDLIFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILL-----QGGFSHAPDELTYGVDVRM-- 138
             ++L++         ++K++ RT  +   G+ L       G  HA      G   R+  
Sbjct: 65  TFISLRKFNFEFSVPTLRKILKRTFVIFLIGLGLSWLGVSFGTYHALAADNLGFLERLGR 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  +R+ GV+QR+AL+Y + SL+ IF K                     +++   +
Sbjct: 125 SVTNFEHLRILGVMQRLALTYGITSLIAIFIKHKYIP----------------YIIVVGL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           +  +L LL+G           N  + +   +  VT                 D+ +LG+N
Sbjct: 169 VGYFLLLLFG-----------NGFATEGYNILAVT-----------------DQSILGLN 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEG+LS++ ++   +IG + G +
Sbjct: 201 HMY----------------------------TEFGLDPEGILSTIPAVCHVLIGFYCGKI 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++ TK +  R+     +G  L   G  L +    P+NK++++ ++   T G  A   S +
Sbjct: 233 LMETKDNQQRMLHLFIIGAILTFSGFLLSY--GCPINKKIWSPTFELTTCGLGATFLSLL 290

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYY 415
             ++D+   K   +     G+N + +YV+A  G+ A   +G ++
Sbjct: 291 IWIIDVKGYKKWSVFFESFGVNPLFIYVLA--GVMATIADGIHF 332


>gi|410448043|ref|ZP_11302130.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
 gi|410018124|gb|EKO80169.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
 gi|456875246|gb|EMF90470.1| putative membrane protein [Leptospira santarosai str. ST188]
          Length = 363

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 116/391 (29%), Positives = 172/391 (43%), Gaps = 100/391 (25%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PFFLF VG +I ++L
Sbjct: 2   SLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60

Query: 93  --KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             K   +R+D    +  R+  L+  G+ L   G +S A            +R+ GVLQRI
Sbjct: 61  YSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-----------LRIPGVLQRI 109

Query: 149 ALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
              Y +V SL  +F      K   V    I  ++ W        ++  +AL   + V   
Sbjct: 110 GFVYWVVASLCLVF----PGKKILVFLVPILLIHTW--------ILTQIALPGESVV--- 154

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                   S + GK                +   +IDR + G  H+     WR SK    
Sbjct: 155 --------SLEQGK----------------DIGAWIDRTIFGEKHL-----WRFSKT--- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEG LS V+S+++T+ GV  G ++        R ++   
Sbjct: 183 -------------------WDPEGFLSGVASVVTTLFGVLCGFIL------FLRERRNKI 217

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALV--DIWNLKY 382
           +G  +L   + L +  ++P+NK L+T SY   T+G + L    F  + +L+    WNLK 
Sbjct: 218 LGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFLSIWFFEYLSSLIISKGWNLKI 277

Query: 383 PFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F P    G NA+LV+V    GI A  +N W
Sbjct: 278 LFQPFLVFGKNAVLVFV--GSGILARTLNLW 306


>gi|383934719|ref|ZP_09988159.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
           nanhaiensis E407-8]
 gi|383704254|dbj|GAB58250.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
           nanhaiensis E407-8]
          Length = 397

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 173/382 (45%), Gaps = 67/382 (17%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           + L T R+ +LD+ RGL +  MILV++ G     +  + HA W+G  + D + PFF+ IV
Sbjct: 14  AKLNTNRMLALDVLRGLTITAMILVNNPGSWNYVYSPLLHAQWHGWTITDLIFPFFIVIV 73

Query: 85  GVAIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD-ELTYGVD-VRMIR 140
           G+++ L+L++  + ++   +++ + R+ KL   G+LL   + +  D E  Y  D +  +R
Sbjct: 74  GMSLQLSLRQHSLNNKGPLIRQALLRSGKLFGLGLLLALFYYNFRDPEFNYVEDRLLTVR 133

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
             GVLQRI L YL   L+ ++                             ++ VYLA L+
Sbjct: 134 WLGVLQRIGLVYLATVLIVLYFGQRGRLL-----------------WLLGLMAVYLAGLW 176

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
                D Q           G  F      R  L    + V ++D+ VLG NH+Y+  A  
Sbjct: 177 WLPYQDAQ-----------GHEF------RGLLLFGNSFVAWLDQLVLGANHVYYRSA-- 217

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                   +PF               F+PEGL S++ +I S + GV     +   +    
Sbjct: 218 --------TPFA--------------FDPEGLWSTLPAIASCLTGVLMAQWLQSERSLAQ 255

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNL 380
           +++  +  G   +      HF+  +P+NK L+T S+V ++SG  A+  +A   L D+   
Sbjct: 256 KIRGLLLCGVVAVWLAELWHFS--LPVNKSLWTPSFVLLSSGYCAIALAACLWLCDVKGW 313

Query: 381 KYPFLPLAWIGMNAMLVYVMAA 402
           +    P    G NA+L Y+ +A
Sbjct: 314 RRWSAPFVVFGANAILFYLFSA 335


>gi|421093382|ref|ZP_15554106.1| putative membrane protein [Leptospira borgpetersenii str.
           200801926]
 gi|410363365|gb|EKP14394.1| putative membrane protein [Leptospira borgpetersenii str.
           200801926]
          Length = 383

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 171/403 (42%), Gaps = 90/403 (22%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
            ++KS     R+ SLD+FRG+ V  MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 60

Query: 81  LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           LF VG +I ++L  K   +R      +  R + L+  G+ L     +   E T+      
Sbjct: 61  LFAVGASIPISLYSKNGINRIRVWIGICIRGISLILLGLFL-----NFFGEWTF----SE 111

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRI   Y +V+ +                F IF            VLV  + +
Sbjct: 112 LRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIPI 147

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           L    V  W  T I                    L    +   +IDR++ G  H+     
Sbjct: 148 L---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRRIFGEKHL----- 188

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
           W+ SK                       ++PEG LS ++SI +++ GV  G ++   +G 
Sbjct: 189 WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREGR 226

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALV 375
             + +     G   L   + L +  ++P+NK L+T SY   T+G + L    F  + +L+
Sbjct: 227 -GKNRVLSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFLCIGFFEYLDSLI 285

Query: 376 DI--WN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            +  WN   LK  F P    G NA+LV+V    GI A  +N W
Sbjct: 286 LLKKWNGLDLKIFFQPFFVFGKNAILVFV--GSGILARTLNFW 326


>gi|338212268|ref|YP_004656323.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306089|gb|AEI49191.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 365

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 112/396 (28%), Positives = 167/396 (42%), Gaps = 92/396 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +T RL SLD  RG  +A M++V+  G +   +  + H  WNG +  D V P FLF+VGV+
Sbjct: 4   QTNRLVSLDALRGFTIAAMLMVNFPGSEEYVFFTLRHTKWNGLSFTDLVAPIFLFVVGVS 63

Query: 88  IALAL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I  A  KR  D     +  +K+I R+LK+   G+ L          L    D   IR  G
Sbjct: 64  IVFAYSKRKWDGRPTGELYRKIIIRSLKIFAVGMFLN---------LMPTFDFSDIRWTG 114

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
            L RIA  +L  +++ + T   Q                  W+ A  ++  +LAL   T 
Sbjct: 115 TLHRIAFVFLGCAVLYLNTNWKQQA----------------WVGAVILVAYWLAL---TL 155

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +P          +   GKV          L P  N V + D + L               
Sbjct: 156 IP----------TPGIGKVM---------LEPGVNLVAWFDTQFL--------------- 181

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                     P +    +W     +PE +LS+  SI+S I G+  G ++  T     ++ 
Sbjct: 182 ----------PGKMWQGTW-----DPESILSTFPSIVSGITGMLAGQLLQSTFTPNEKVN 226

Query: 324 QWVTMGFALLIFGLTLHF--TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
             +T G    +F   L +      P+N+ L+T S+V VTSG A L+  A+Y +VDI    
Sbjct: 227 YLMTAG----VFSAALGYFWGLGFPVNENLWTSSFVLVTSGFACLLLGALYFMVDILGKT 282

Query: 382 YPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGD 417
              LP    G NA+ VYV+    I A F  G  +G+
Sbjct: 283 KGTLPGIIFGANAIAVYVLG--DILALFFYGATFGE 316


>gi|242062186|ref|XP_002452382.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
 gi|241932213|gb|EES05358.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
          Length = 108

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/97 (48%), Positives = 69/97 (71%), Gaps = 1/97 (1%)

Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
           APSWC APF+PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+ + F++L     
Sbjct: 13  APSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLILSFSMLALAFL 72

Query: 339 LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
           + F+  + ++K LYT+SY   T+G A L+F+ IYALV
Sbjct: 73  MDFSG-MRMSKPLYTMSYTLATAGTAGLLFAGIYALV 108


>gi|300770061|ref|ZP_07079940.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762537|gb|EFK59354.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 404

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 161/381 (42%), Gaps = 106/381 (27%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           QR  SLD+FRG  VALMI+V++ G  W      + HA W+GC   D V PFFLF VG A+
Sbjct: 3   QRYYSLDVFRGATVALMIMVNNPG-SWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61

Query: 89  ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDEL-----TYGVD-VRM 138
           +  + R+ +   AV  +KV+ RT+ +   G+ +        A D L     +Y  D +R 
Sbjct: 62  SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDSMRG 121

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIAL+Y   S++  + ++              +   W   ++  +LVVY A+
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYFRE--------------KAIIW---ISTFILVVYWAV 164

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI----DRKVLGINHMY 254
                                       C        P +  G+     D ++LG+ H+Y
Sbjct: 165 ----------------------------CAFLGTPGDPYSLQGWFGTAYDIQILGVAHVY 196

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-- 312
                            EG            PF+PEGL+S++ +I+  ++G   G  I  
Sbjct: 197 KG---------------EG-----------VPFDPEGLMSTLPAIVQVVLGYLAGTYIKK 230

Query: 313 ------------IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
                          + H   L      GF L++  L   ++   P+NK+++T SYV  T
Sbjct: 231 QGQVDWLWKKVPASQEPHFKLLSGLFVTGFILVV--LAWVWSLGFPINKKIWTSSYVLYT 288

Query: 361 SGAAALVFSAIYALVDIWNLK 381
           +G   +    +   +++  +K
Sbjct: 289 TGLGVMTIGGMIWFIEVQGVK 309


>gi|410631394|ref|ZP_11342069.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
 gi|410148840|dbj|GAC18936.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
          Length = 366

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 164/384 (42%), Gaps = 89/384 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
           R+ ++D+ RGLA+ALM+LV++ G  W  +     HA W+G    D V PFFLF++G ++A
Sbjct: 8   RIEAIDVLRGLALALMLLVNNPG-SWSAVYAPFLHADWHGLTPTDLVFPFFLFVMGASMA 66

Query: 90  LALK-RIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
            +L+ +I         +  R+  L+F G LLQ   F  APD           R+ GVLQR
Sbjct: 67  CSLRGQIQASGLPWLSIFKRSFLLVFIGFLLQIIPFDQAPDTW---------RIMGVLQR 117

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I L +LLV+ +    K+                  W  L A   L+VY  LL        
Sbjct: 118 IGLCFLLVASMLAIIKE-----------------RWLLLSAVVTLIVYWLLL-------- 152

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
               ++   A Y              +   N+V + D  +LG  HM+             
Sbjct: 153 ----LSAGQAPY--------------SLENNSVRHFDMAILGSAHMWQGKG--------- 185

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                             PF+PEGLLS++ + ++ + G      ++  K    ++ Q + 
Sbjct: 186 -----------------LPFDPEGLLSTIGAAMTVLSGYLICVNVLQQKNQKQQILQLMI 228

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
           +G  LL  G      +  P+NK L+T SYV V+S  A L  + I  L  +  +      L
Sbjct: 229 VGAILLALGFVWSVWH--PINKSLWTSSYVLVSSAFACLSLAVIILLWRVPVVNTVLNGL 286

Query: 388 AWIGMNAMLVYVMAAEGIFAGFIN 411
              G N + +YV  A  +FA F+N
Sbjct: 287 KIYGSNPIFIYV--AAWVFAIFLN 308


>gi|333378010|ref|ZP_08469743.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
           22836]
 gi|332884030|gb|EGK04310.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
           22836]
          Length = 389

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/395 (24%), Positives = 176/395 (44%), Gaps = 93/395 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + RL SLDI RG+ +A MI+V++ G     +  + HA W+G    D V PFF+FI+G++ 
Sbjct: 6   SSRLLSLDILRGITIAGMIMVNNPGSWSYVYAPLGHAAWHGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQ------GGFSHAPDE--------LTY 132
            ++L++     +   + K++ RT+ +   G+ L         F+    E        +T 
Sbjct: 66  YISLRKFNFEFNKPTLFKILKRTVVIFLIGLGLGWLSLSFRTFNSLSGEDIGFFERFITA 125

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
             +   IR+ GV+QR+AL+Y   +L+ IF K                     +++   ++
Sbjct: 126 ITNFEHIRILGVMQRLALTYGATALIAIFVKHKYIP----------------YIIVVTLI 169

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             +L LL+G             +  D+ +                N +  +DR +LG +H
Sbjct: 170 GYFLLLLFG-------------NGFDFSE---------------DNIISVLDRAILGADH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY            +DS                  +PEGLLS++ +I   +IG   G ++
Sbjct: 202 MY------------KDSGLA--------------IDPEGLLSTIPAICHVLIGFCCGEIL 235

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           + TK +  R+++   +G  +   G  L +    P+NK++++ ++V  T G A+ + + + 
Sbjct: 236 LTTKDNNERIQRLFIIGAIMTFLGFLLSY--GCPINKKIWSPTFVLATCGLASTMLALLI 293

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
            ++DI   K         G+N + +YV  A GIF+
Sbjct: 294 WIIDIKGHKKWSAFFESFGVNPLFIYV--AAGIFS 326


>gi|212695334|ref|ZP_03303462.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
           17855]
 gi|212662113|gb|EEB22687.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
           17855]
          Length = 284

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 153/363 (42%), Gaps = 90/363 (24%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG   
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   + +PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADVLPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAI 371
           + +
Sbjct: 282 ATL 284


>gi|227538516|ref|ZP_03968565.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
 gi|227241435|gb|EEI91450.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
          Length = 404

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 161/381 (42%), Gaps = 106/381 (27%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           QR  SLD+FRG  VALMI+V++ G  W      + HA W+GC   D V PFFLF VG A+
Sbjct: 3   QRYYSLDVFRGATVALMIMVNNPG-SWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61

Query: 89  ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDEL-----TYGVD-VRM 138
           +  + R+ +   AV  +KV+ RT+ +   G+ +        A D L     +Y  D +R 
Sbjct: 62  SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDPMRG 121

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIAL+Y   S++  + ++              +   W   ++  +LVVY A+
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYFRE--------------KAIIW---ISTFILVVYWAV 164

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI----DRKVLGINHMY 254
                                       C        P +  G+     D ++LG+ H+Y
Sbjct: 165 ----------------------------CAFLGTPGDPYSLQGWFGTAYDIQILGVAHVY 196

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-- 312
                            EG            PF+PEGL+S++ +I+  ++G   G  I  
Sbjct: 197 KG---------------EG-----------VPFDPEGLMSTLPAIVQVVLGYLAGTYIKK 230

Query: 313 ------------IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
                          + H   L      GF L++  L   ++   P+NK+++T SYV  T
Sbjct: 231 QGEVDWLWKKVPASQEPHFKLLSGLFVTGFILVV--LAWVWSLGFPINKKIWTSSYVLYT 288

Query: 361 SGAAALVFSAIYALVDIWNLK 381
           +G   +    +   +++  +K
Sbjct: 289 TGLGIMTIGGMIWFIEVQGVK 309


>gi|373953861|ref|ZP_09613821.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373890461|gb|EHQ26358.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 379

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 160/375 (42%), Gaps = 78/375 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           QRL SLD+FRGL VA MILV++ G DW  I     H+ WNGC   D + PFFLFIVGV+I
Sbjct: 14  QRLLSLDVFRGLTVACMILVNNPG-DWAHIYSPLEHSAWNGCTPTDLIFPFFLFIVGVSI 72

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             ++           K++   LK       L    S  P       +   +R+ GVLQRI
Sbjct: 73  VYSMGTKKTDPAQHGKLVLTILKRSLILFCLALFLSLYPK-----FNFHTLRIPGVLQRI 127

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+ + +  +  IF K  + K Q +    +F L+          L+VY  L+         
Sbjct: 128 AVVFGICGI--IFLKT-ERKTQLI----LFWLF----------LIVYYLLM--------- 161

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
            T++      Y           A L P  N   +IDR V+G  H++              
Sbjct: 162 -TLVPVPGVGY-----------ANLQPETNLGAWIDRTVIGNVHLW-------------- 195

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV-T 327
                   K++ +W     +PEG+L ++ +  + + G+  G  +       +    W+  
Sbjct: 196 --------KESVTW-----DPEGILGTMPATSTGLFGILVGTWLKRKDVDESTKVAWLFC 242

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
            G   +I GL        P+NK L+T S+V    G A L  +  Y ++D+   K    P 
Sbjct: 243 TGIGAVILGLLWDL--FFPINKALWTSSFVLYAGGLATLGLTLFYWIIDVQGYKKFTKPF 300

Query: 388 AWIGMNAMLVYVMAA 402
              G+NA+ V+  ++
Sbjct: 301 VVYGVNAITVFCFSS 315


>gi|71278983|ref|YP_267171.1| hypothetical protein CPS_0413 [Colwellia psychrerythraea 34H]
 gi|71144723|gb|AAZ25196.1| putative membrane protein [Colwellia psychrerythraea 34H]
          Length = 358

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/379 (26%), Positives = 163/379 (43%), Gaps = 94/379 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            R  +LD FRG+ +ALMILV+  G  W  +     HA W+G    D V PFFLFI+G A+
Sbjct: 2   TRYLALDAFRGITIALMILVNTPG-TWSHVYAPLLHAEWDGATPTDLVFPFFLFIIGSAM 60

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
             + K+       +  +K+I R   + F G +L        + + + V+    R+ G+LQ
Sbjct: 61  FFSFKKSNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIMGILQ 112

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RI ++Y + + + +          ++ R  +F       + +A +L+ Y ALL       
Sbjct: 113 RIGIAYTVAACLVL----------TLNRTGVF-------IASAVILLAYWALL------- 148

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                             ++ G    L    N +  +D  V G NHMY            
Sbjct: 149 ------------------LSMG-EGALTIEGNIIRQLDLAVFGANHMY------------ 177

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                   +R  A       FEPEGLLS++ +I++ ++G      +   +   + + +  
Sbjct: 178 -------TMRGVA-------FEPEGLLSTIPAIVNMLLGFELTRYLTSIEDKRSSVIKLT 223

Query: 327 TMGFALLIFGLTLHFTN----AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
            +G      GL + F       +P+NK L+T SYV  ++G A L+ +A   L+DI     
Sbjct: 224 LIG------GLAVGFGALWGLVLPINKSLWTPSYVIYSTGFACLLLAAFIWLIDIMKQVK 277

Query: 383 PFLPLAWIGMNAMLVYVMA 401
              PL   G N + VYV++
Sbjct: 278 LAEPLLVYGTNPLFVYVLS 296


>gi|418737426|ref|ZP_13293823.1| putative membrane protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|410746620|gb|EKQ99526.1| putative membrane protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
          Length = 383

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 167/403 (41%), Gaps = 90/403 (22%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
            ++KS     R+ SLD+FRG+ V  MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 60

Query: 81  LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           LF VG +I ++L  K   +R      +  R + L+  G+ L         EL        
Sbjct: 61  LFAVGTSIPISLYSKNGINRIRVWIGICIRGISLILLGLFLNFFGEWTFSEL-------- 112

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
            R+ GVLQRI   Y +V+ +                F IF            VLV  + +
Sbjct: 113 -RIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIPI 147

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           L    V  W  T I                    L    +   +IDR + G  H+     
Sbjct: 148 L---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL----- 188

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
           W+ SK                       ++PEG LS ++SI +++ GV  G ++   +G 
Sbjct: 189 WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREGR 226

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALV 375
             + +     G   L   + L +  ++P+NK L+T SY   T+G + L    F  + +L+
Sbjct: 227 -GKNRVLSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFLCIGFFEYLDSLI 285

Query: 376 DI--WN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            +  WN   LK  F P    G NA+LV+V    GI A  +N W
Sbjct: 286 LLKKWNDLDLKIFFQPFFVFGKNAILVFV--GSGILARTLNFW 326


>gi|418719584|ref|ZP_13278783.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
 gi|410743627|gb|EKQ92369.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
          Length = 383

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 170/403 (42%), Gaps = 90/403 (22%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
            ++KS     R+ SLD+FRG+ V  MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 60

Query: 81  LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           LF VG +I ++L  K   +R      +  R + L+  G+ L     +   E T+      
Sbjct: 61  LFAVGASIPISLYSKNGINRIRVWIGICIRGISLILLGLFL-----NFFGEWTF----SE 111

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRI   Y +V+ +                F IF            VLV  + +
Sbjct: 112 LRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIPI 147

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           L    V  W  T I                    L    +   +IDR + G  H+     
Sbjct: 148 L---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL----- 188

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
           W+ SK                       ++PEG LS ++SI +++ GV  G ++   +G 
Sbjct: 189 WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREGR 226

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALV 375
             + +     G   L   + L +  ++P+NK L+T SY   T+G + L    F  + +L+
Sbjct: 227 -GKNRVLSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFLCIGFFEYLDSLI 285

Query: 376 DI--WN---LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            +  WN   LK  F P    G NA+LV+V    GI A  +N W
Sbjct: 286 LLKKWNGLDLKIFFQPFFVFGKNAILVFV--GSGILARTLNFW 326


>gi|410096828|ref|ZP_11291813.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225445|gb|EKN18364.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 367

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 165/384 (42%), Gaps = 90/384 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MI+V++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 3   QSGRLLSLDVMRGITIAGMIMVNNPGSWGYVYAPLRHASWNGLTPTDLVFPFFMFIMGVS 62

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQ-------GGFSHAPDELTYGVDVRM 138
           +  +L++   +   ++V KV+ RT+ +   G  L         GFS+  +          
Sbjct: 63  MFFSLRKYDFKLSRESVTKVLKRTVLIFLVGFALNLFGHLCYNGFSNFEN---------- 112

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GV+QR+AL+Y + SL+ +  K                        AA +L+ Y  L
Sbjct: 113 LRILGVMQRLALAYGIGSLIGLSVKHKYILQT-----------------AAGILLFYWIL 155

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           L  T                            ++     N +  +DR + G  HMYH   
Sbjct: 156 LAAT---------------------------GSQTLSENNIIAIVDRALFGNTHMYH--- 185

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKG 317
                    D   +G             F+PEGLLS + SI   ++G + G +I+   K 
Sbjct: 186 ---------DYLADGT---------RIAFDPEGLLSCLGSIGHVLLGFYVGKMILDCKKN 227

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
           +   ++     G  +L  G  L +    P+NK+L++ ++V  T G  +L  + +  ++DI
Sbjct: 228 NELIIRNLFIFGTIILFLGFLLSY--GCPINKKLWSSTFVLTTCGFGSLFLALLIWIIDI 285

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMA 401
              K   L     G+N + +YV  
Sbjct: 286 NGKKKWSLFFESFGINPLYLYVQG 309


>gi|408382946|ref|ZP_11180487.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
           formicicum DSM 3637]
 gi|407814484|gb|EKF85111.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
           formicicum DSM 3637]
          Length = 382

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 166/391 (42%), Gaps = 85/391 (21%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEI-SHAPWNGCNL 72
           ++    +   + + +K +R+ SLD+FRGLAVA MI V+        P I  HA WNG   
Sbjct: 5   VNTNSTNSTVKTNAVKKRRVISLDVFRGLAVAAMIFVNAMAFSEFTPGIFEHATWNGLTF 64

Query: 73  ADFVMPFFLFIVGVAIA--LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           AD V P FLFIVGV++A   A +    + D     +FR   L   G+ L           
Sbjct: 65  ADLVFPSFLFIVGVSMAYSFAARSKNSKRDLWGHFLFRVGALFTIGVALN---------- 114

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
            +  D  M+R+ GVLQ IAL+ L  S +  F                     W  L+A  
Sbjct: 115 WFTSDFSMVRIPGVLQLIALASLFASPMARFKPR------------------WILLVAGV 156

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +L+++  +L G   P                   +  G   K N   N   +ID +VL +
Sbjct: 157 LLLIHGFILLGVGAP------------------GIPAGTLEKGN---NIDDWIDIQVLTV 195

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NH                                A  +PEG+LS +++    ++G+  G 
Sbjct: 196 NHT-----------------------------IDANGDPEGILSIITATALVLLGLCVGR 226

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
            +   K +L  +   +  G   L+ GL L  +  +P+NKQL+T S++ V +G   L  + 
Sbjct: 227 TLQLRKHNLKTIGILLAGGAISLLLGLAL--SQILPINKQLWTSSFILVCAGIGTLFLTI 284

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           ++  +DI  L         +G+NA+++Y+++
Sbjct: 285 LFWYLDIKRLPNVLFWAIPMGLNALIIYILS 315


>gi|345320430|ref|XP_001516736.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Ornithorhynchus anatinus]
          Length = 448

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 80/284 (28%), Positives = 133/284 (46%), Gaps = 42/284 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRG ++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +I+L+L
Sbjct: 198 QRLRSLDTFRGFSLIIMVFVNYGGGKYWFFKHEGWNGLTVADLVFPWFVFIMGSSISLSL 257

Query: 93  KRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +  R  +    + K+++R+  L   G+L+       P+     +    +R+ GVLQR+
Sbjct: 258 SSMLRRGYSKWRLLWKILWRSFLLFLIGVLIVN-----PNYCLGPLSWDKLRIPGVLQRL 312

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL----YCWHWLMAACVLVVYLALLYGTYV 204
             +YL+V+ +E+       +  S+ R   F      Y   W+    +   +L L +   V
Sbjct: 313 GFTYLVVATLELLFAKAVPESNSLERTCSFLQEIISYWPQWIFILMLETAWLCLTFLLPV 372

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D+GK  N T G          A GYID  +LG NH+Y HP+     
Sbjct: 373 PGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDHLLLGENHIYQHPS----- 417

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
                         +        ++PEG+L +++SI+   +GV 
Sbjct: 418 -------------PNVLYHTKVAYDPEGILGTINSIVMAFLGVQ 448


>gi|428210738|ref|YP_007083882.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
           6304]
 gi|427999119|gb|AFY79962.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
           6304]
          Length = 398

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 108/395 (27%), Positives = 170/395 (43%), Gaps = 78/395 (19%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPF 79
           +   ++ L + RL SLD+FRG+A+A MILV++ G     +P + HAPW+G    D + P 
Sbjct: 9   NPSVQNLLNSMRLTSLDVFRGMAIASMILVNNPGSWQQVYPPLLHAPWHGFTPTDLIFPA 68

Query: 80  FLFIVGVAIALALKRIPDR-----ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG- 133
           FLFI GVA+A +  +  +      A +V   I R   +LF   L   G +     L  G 
Sbjct: 69  FLFISGVAMAFSFAKYTNSPNSPPAASVYFKILRRALILFGLGLFLNGSTLVLKTLLQGQ 128

Query: 134 -VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
            +D   +R+ GVLQRI+L+YL  +     T  +    + +G        C          
Sbjct: 129 PLDFGTLRIMGVLQRISLAYLFGA-----TAILNLSRRRLG------FLC---------- 167

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
              LA+L G +    Q  +      D       T             V Y+DR +L   H
Sbjct: 168 ---LAILLGYWFALTQIPVPGYGPGDLSAKGAGTL------------VAYLDRLILTPPH 212

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +              D  F                EPEGLLS++ S+++ ++G   G  +
Sbjct: 213 ILG------------DGSF----------------EPEGLLSTLPSVVTLLLGFFIGDWL 244

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
              K  +         G A++       +    P+NKQL+T SYV +++G + L+ +A Y
Sbjct: 245 --QKQPVTSRTSLQMAGVAVVTIVTGSLWGLVFPINKQLWTSSYVVLSAGWSLLLLAACY 302

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFA 407
            LV++   +    P   +G+NA+ V+V  A G  A
Sbjct: 303 ELVEVRQWRSWAFPFKVMGLNAIFVFV--ASGFLA 335


>gi|410638830|ref|ZP_11349383.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           lipolytica E3]
 gi|410141358|dbj|GAC16588.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           lipolytica E3]
          Length = 365

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 99/377 (26%), Positives = 163/377 (43%), Gaps = 88/377 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            R  SLDIFRG+ +A M+LV++ G     +P + HA W+G    D + PFFLFIVG A+ 
Sbjct: 2   NRQISLDIFRGITLAAMLLVNNPGSWSFVYPPLLHAKWHGLTPTDLIFPFFLFIVGAAMF 61

Query: 90  LALKRIPDRADAV-----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            ++ R   +A+       +K+  RT+ L   G LL        +   +  D +  R+ GV
Sbjct: 62  HSMGRYLPKANQALQVPWQKIAKRTIVLFAIGFLL--------NIFPFTGDPQNWRIMGV 113

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ Y + +++       Q                   L+AAC+      LL G ++
Sbjct: 114 LQRIAICYGIAAILICVLHQKQ-------------------LIAACI-----TLLIGYWL 149

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 ++N     YG                 N V  ID +VLG  H+Y          
Sbjct: 150 ------MLNLVENPYGL--------------ETNLVRLIDIEVLGSAHLYQG-------- 181

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 F+PEGLLS + ++++ + G     ++ + K    R+K 
Sbjct: 182 ------------------FGVAFDPEGLLSCIPAVVTVLAGFFTSKMLANAKTEQQRMK- 222

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
              + ++L+ F + + +  + P+NK L+T +YV  T+G A L  + I  L D+    + F
Sbjct: 223 -TLLLWSLVTFVVCIAWQFSFPVNKSLWTSTYVLATNGFAWLALAVIIYLHDVKKQTFGF 281

Query: 385 LPLAWIGMNAMLVYVMA 401
                 G N + +YV++
Sbjct: 282 EWAKIYGSNPLFIYVLS 298


>gi|212693969|ref|ZP_03302097.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
 gi|265751179|ref|ZP_06087242.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|423228550|ref|ZP_17214956.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
           CL02T00C15]
 gi|423243815|ref|ZP_17224891.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
           CL02T12C06]
 gi|212663501|gb|EEB24075.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
 gi|263238075|gb|EEZ23525.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|392635957|gb|EIY29845.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
           CL02T00C15]
 gi|392644181|gb|EIY37924.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
           CL02T12C06]
          Length = 363

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 165/376 (43%), Gaps = 84/376 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG +           F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D+   +    
Sbjct: 232 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNWCA 289

Query: 386 PLAWIGMNAMLVYVMA 401
                G+N + +YV A
Sbjct: 290 FFRSFGVNPLFIYVFA 305


>gi|345516841|ref|ZP_08796327.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
 gi|229437727|gb|EEO47804.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
          Length = 363

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 165/376 (43%), Gaps = 84/376 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG +           F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D+   +    
Sbjct: 232 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNWCA 289

Query: 386 PLAWIGMNAMLVYVMA 401
                G+N + +YV A
Sbjct: 290 FFRSFGVNPLFIYVFA 305


>gi|405957484|gb|EKC23691.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crassostrea gigas]
          Length = 1901

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 178/414 (42%), Gaps = 81/414 (19%)

Query: 22   SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
            +D+Q+ S  K +RL SLD FRG                                    F+
Sbjct: 1531 ADEQD-SRPKKERLKSLDTFRG------------------------------------FI 1553

Query: 82   FIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
            +I+G A+A +    L+R+  +     K+  R+  L   G+L+  G    P  LT+     
Sbjct: 1554 WIMGTAMAYSFTGMLRRVTPKHKIFWKIFKRSCILFVLGLLVNTGGCD-PTRLTH----- 1607

Query: 138  MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFSIFRLYCWHWLMAACVLVVY 195
             +R+ GVLQR A +YL+V+ + +F     D       G       +   W++    + V+
Sbjct: 1608 -LRIPGVLQRFAGTYLVVASIHMFFAKTVDVSMYTYWGFIRDIVDFWLEWILHIVFVTVH 1666

Query: 196  LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN--AVGYIDRKVLGINHM 253
            + + +   VP      I       G +        A +   C   A GYIDR+V G +H+
Sbjct: 1667 IIITFTLDVPGCGKGYIGP-----GGLHEAVNSTEASVYQNCTGGAAGYIDRQVFGDDHI 1721

Query: 254  YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
            Y  P  +             P+ K        P++PEGLL +++S+    +G+  G +++
Sbjct: 1722 YQSPTCK-------------PIYKTT-----VPYDPEGLLGTLNSVFMCYLGLQAGKILM 1763

Query: 314  HTKGHLARLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTSGAAALVFS 369
              K   AR+K+++  G F  LI G    F      IPLNK L++LS+V   +G A ++ +
Sbjct: 1764 TFKEPSARVKRFLIWGLFLCLIAGALCGFKKDGGTIPLNKNLWSLSFVLCMAGFAFVLLA 1823

Query: 370  AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLV 423
              Y  +D++ + +   P  + GMN +++Y M  E +   F   W     H +L+
Sbjct: 1824 FCYVTIDVYKV-WSGAPFYFPGMNPIVLY-MGHEVLHRHFPISWEVSQYHYSLL 1875


>gi|237710367|ref|ZP_04540848.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|229455829|gb|EEO61550.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
          Length = 366

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/376 (25%), Positives = 162/376 (43%), Gaps = 84/376 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 10  KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHASFNGLTPTDLVFPFFMFIMGISTY 69

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 70  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 126

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 127 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 163

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 164 --------------GDGFN---------QSETNVVARFDAWALGTSHMYHESG------- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 194 -------------------MAFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D+   +    
Sbjct: 235 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNWCA 292

Query: 386 PLAWIGMNAMLVYVMA 401
                G+N + +YV A
Sbjct: 293 FFRSFGVNPLFIYVFA 308


>gi|325286182|ref|YP_004261972.1| heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           lytica DSM 7489]
 gi|324321636|gb|ADY29101.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           lytica DSM 7489]
          Length = 361

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 168/374 (44%), Gaps = 82/374 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+DIFRG+ + LMILV++ G  W  +     HA W+G    D V PFFLFIVG++I
Sbjct: 3   NRVVSVDIFRGITIVLMILVNNPG-TWSSVYAPFLHADWHGYTPTDLVFPFFLFIVGISI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             A           +K++ R+LKL+  G+ L G F+ +     +  D   IR  GVLQRI
Sbjct: 62  VYAYHTKEVTGKTYRKIVIRSLKLIGLGLFL-GAFTLS---FPFFKDFNDIRFPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            L +   ++                   +F    W  L+A C  ++ +  L+  +VP   
Sbjct: 118 GLVFFFTAI-------------------LFIKLNWKALVAVCAAILIMYWLWMGFVP--- 155

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              IN  +  + +              P N   +ID KVLG +HM     W+        
Sbjct: 156 ---INGTAPTFDR-------------APNNWANFIDLKVLG-SHM-----WKTD------ 187

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                             ++PEG+LS++ +I ++++GV  G ++        ++   + +
Sbjct: 188 ------------------YDPEGVLSTLPAIATSLLGVFVGLLLKSAYKKKTQILLLLGV 229

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLA 388
                     L F    P+NK L++ S+V VT+G A ++ + IY   D+ N K+  +   
Sbjct: 230 SLLTAGHIWDLFF----PINKALWSSSFVLVTAGWATIILAVIYYFSDVKNKKFGGV-FK 284

Query: 389 WIGMNAMLVYVMAA 402
           + G NA+ V+ +++
Sbjct: 285 YAGANAITVFFLSS 298


>gi|343082900|ref|YP_004772195.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351434|gb|AEL23964.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 365

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 164/376 (43%), Gaps = 86/376 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD  RG  +A MILV++ G     +P + HA W+G  + DF+ PFF+F+VGV++A
Sbjct: 6   TRLISLDALRGFTIAAMILVNYPGSWSHVYPPLLHAEWHGMTMTDFIFPFFIFMVGVSVA 65

Query: 90  LALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            A  +  D    +A   KK+I R +KL   GI L    +  PD      D   +R+ GVL
Sbjct: 66  FAYSKRLDEGVPKAGMYKKIIIRAIKLFVLGIFL----NLIPD-----FDFSHMRIAGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI++ +L  S + +   +V  K Q+               +   +LV Y   +  T +P
Sbjct: 117 QRISIVFLASSFLFL---NVGWKKQAY--------------LGGGILVFYWLCM--TLIP 157

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                     + D G V          L P  N   +IDR  L                 
Sbjct: 158 ----------TPDEGVVL---------LEPGRNLAAWIDRMFL----------------- 181

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                   P +    +W     +PEG  +++ +I++ I+G+  G +++        LK  
Sbjct: 182 --------PGKMWEGTW-----DPEGFFTTLPAIVTGILGLLAGRILLSNFNE--NLKSN 226

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
             M   L++  + L +    P+NK L+T S+  +T GA  L   A Y LVDI   K    
Sbjct: 227 YLMSIGLVLVLIGLLWAQVFPINKHLWTSSFTLITGGAGFLGLGASYFLVDILGKKKGTS 286

Query: 386 PLAWIGMNAMLVYVMA 401
                G NA+ VYV+A
Sbjct: 287 VGIIFGANAISVYVLA 302


>gi|410721825|ref|ZP_11361152.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
           MBC34]
 gi|410598366|gb|EKQ52947.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
           MBC34]
          Length = 372

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 113/385 (29%), Positives = 169/385 (43%), Gaps = 101/385 (26%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG--GDWPEI-SHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG  +A MI+V+  G   D P +  HA W G NLAD V PFFLFIVGV++ 
Sbjct: 6   DRLVSLDVFRGFTIAGMIMVNILGLYPDTPSLLQHASWIGLNLADLVFPFFLFIVGVSMN 65

Query: 90  LALKRIPDRADAVK--KVIFRTLKLLFWGILLQGGFSHAPDELTYGV-DVRMIRLCGVLQ 146
            +      +    K  K +FR   L   G+ L  G         YGV D   IR+ G+LQ
Sbjct: 66  FSFASRSKQPSWKKWGKFLFRVAALYLIGVALVFGL------FFYGVPDFSTIRIPGILQ 119

Query: 147 RIALSYLLVS-LVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
            IALS L  + L  + T+                   W  L A+ +L V  A+L     P
Sbjct: 120 LIALSSLFAAPLARLRTR-------------------WIILAASVILFVQAAILLWVSAP 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                              V  G    L    N  G+ID +V    H+            
Sbjct: 161 ------------------GVPAG---SLEMSNNIAGWIDSQVFTPAHL------------ 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHT--KGHLARL 322
                    L K+     H  F+PEG+++ ++     +IG+  G  + +H   KG     
Sbjct: 188 ---------LDKE-----HV-FDPEGMMAVINGTAMVLIGLACGRTLRLHRNWKG----- 227

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD------ 376
            Q++ +G  L+   + L  +  +P+ KQL+T S++ V +G AA++ + +Y L+D      
Sbjct: 228 VQYLIIG-GLIALTIGLIISPVMPIIKQLWTSSFILVNAGLAAIILALLYGLMDILKRGK 286

Query: 377 IWNLKYPFLPLAWIGMNAMLVYVMA 401
           I N+  PF      G NA+L+Y+++
Sbjct: 287 ILNVGIPF------GRNALLIYILS 305


>gi|299140911|ref|ZP_07034049.1| membrane protein [Prevotella oris C735]
 gi|298577877|gb|EFI49745.1| membrane protein [Prevotella oris C735]
          Length = 370

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 172/379 (45%), Gaps = 77/379 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +  QRL SLD+ RGLA+A MILV++ G  W  I     H+ WNG    D V PFF+F +G
Sbjct: 1   MTQQRLISLDMLRGLAMAGMILVNNPG-SWSHIYVPLEHSVWNGLTPTDLVFPFFVFAMG 59

Query: 86  VAIALALKRIPD-RADAVKKVIFRTLKLLFWG-ILLQGGFSHAPDELTYGVDVRMIRLCG 143
           +A+  + K +   RA  ++KV+ R++ L   G +L   G      EL +      +R+ G
Sbjct: 60  MAMGFSTKNLTALRASYLRKVMKRSVLLFVIGLLLTLLGRWLNTGELCFS----QLRVMG 115

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQR++LSYL+V+L+    K V         F +  L   +W++          LL G  
Sbjct: 116 VLQRLSLSYLVVALIVRRVKGVPTMT-----FVVVALLSGYWVL----------LLLG-- 158

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
                          +G  F+             N V  +DR +LG +H+Y         
Sbjct: 159 ---------------HGFDFSAN-----------NIVAVVDRWLLGESHLYI-------- 184

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                        +  P      F+PEGLLS++  +   ++G     ++  ++    R+ 
Sbjct: 185 -------------ERLPDGTPIAFDPEGLLSTIPCVAQVLLGYICSRLLCTSQELPQRIL 231

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
           +   +G  LL+ GL L +    PLNK++++ ++V VTSG A L ++ +    D+      
Sbjct: 232 RLAVIGALLLLSGLLLSYM--CPLNKKIWSSTFVMVTSGYALLAWTVMAWFADLKKQSRW 289

Query: 384 FLPLAWIGMNAMLVYVMAA 402
             PL   G NA+ +YV + 
Sbjct: 290 AYPLVAFGSNALALYVFSG 308


>gi|160885454|ref|ZP_02066457.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
 gi|423290374|ref|ZP_17269223.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
           CL02T12C04]
 gi|156109076|gb|EDO10821.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
 gi|392665761|gb|EIY59284.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
           CL02T12C04]
          Length = 371

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 171/397 (43%), Gaps = 90/397 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ----GGFSHAPDELTYGVDVRMIRL 141
             ++LK+        A  K+I RT+ +   GI L       ++H P      +    IR+
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y   +L+ +  K                    H  +   ++V    LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++      +I  +   Y +                N +  +DR +LG  HMY       
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                QD+                  +PEGLLS++ SI   +IG   G +++  K    +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           L++   +G  L   G    F+   P NK++++ ++V VT G  + + + +  ++DI   K
Sbjct: 227 LERLFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLVTCGLGSSLLALLVWIIDIKGCK 284

Query: 382 YPFLPLAWIGMNAMLVYVMAAE-GIFAGFINGWYYGD 417
                    G+N + +YVMA    +  G I   Y G+
Sbjct: 285 KWSRFFESFGVNPLFIYVMAGVIAVLVGAITVTYQGE 321


>gi|336415339|ref|ZP_08595679.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940935|gb|EGN02797.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
           3_8_47FAA]
          Length = 371

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 171/397 (43%), Gaps = 90/397 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVIRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ----GGFSHAPDELTYGVDVRMIRL 141
             ++LK+        A  K+I RT+ +   GI L       ++H P      +    IR+
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y   +L+ +  K                    H  +   ++V    LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++      +I  +   Y +                N +  +DR +LG  HMY       
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                QD+                  +PEGLLS++ SI   +IG   G +++  K    +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           L++   +G  L   G    F+   P NK++++ ++V VT G  + + + +  ++DI   K
Sbjct: 227 LERLFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLVTCGLGSSLLALLVWIIDIKGCK 284

Query: 382 YPFLPLAWIGMNAMLVYVMAAE-GIFAGFINGWYYGD 417
                    G+N + +YVMA    +  G I   Y G+
Sbjct: 285 KWSRFFESFGVNPLFIYVMAGVIAVLVGAITVTYQGE 321


>gi|290999917|ref|XP_002682526.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
 gi|284096153|gb|EFC49782.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
          Length = 425

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/391 (25%), Positives = 159/391 (40%), Gaps = 119/391 (30%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           + +R+ +LDI RG+ + LMI+V++     +  + HA W G    D V PFFLF++G A A
Sbjct: 62  RKERMVALDIMRGMTIMLMIIVNNQPARAFIPLDHAEWFGFTPTDCVFPFFLFVMGYAAA 121

Query: 90  LALKR----------------------------IPDRADAVKKVIFRTLKLLF------- 114
           +   R                              D  D  +K    ++K ++       
Sbjct: 122 IVYSREWPSDVYLYPPSHVKLSIQSYFRELCGKKQDLMDENEKKEEESIKFMYLIPMRKS 181

Query: 115 -------WGILLQG-------GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
                  W  L +        GFS +   L +  +   +R+ GV QRIA+ Y +VSL+ +
Sbjct: 182 LYEFVSKWVKLFRRPILMFLIGFSFSV--LAHLFNFTHVRVMGVFQRIAICYFIVSLILV 239

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYVPDWQFTIINKDSA 217
                                 W ++    V++   +Y+ + +G YVP        +   
Sbjct: 240 MVP-------------------WTFVQILIVVLFQAIYITVTFGLYVP-------MEGEG 273

Query: 218 DYGKVFNVTCGVRAKL-NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLR 276
           D        CG R +L  P C A GYIDR +L  +H+Y            QDS       
Sbjct: 274 D-------GCGTRGELYEPRCTAEGYIDRLILSRDHIY-----------LQDS------- 308

Query: 277 KDAPSWCHAPFEPEGLLSSVSSILSTIIGV-HFGHVIIHTKGHLARLKQWVTMGFALLIF 335
                     ++PEG LSS+S++ +  +G+  F       K    RL  W  MG  +++ 
Sbjct: 309 ----------YDPEGFLSSLSAVTNAFVGILAFKVARAAGKDAHKRLNYWFIMGSLMILA 358

Query: 336 GLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            L + +   +P+ K+L+T S+  +TSG A L
Sbjct: 359 ALAIDYA-GLPIGKKLWTTSFALITSGIALL 388


>gi|404485250|ref|ZP_11020448.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
           YIT 11860]
 gi|404338685|gb|EJZ65130.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
           YIT 11860]
          Length = 390

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/390 (24%), Positives = 168/390 (43%), Gaps = 91/390 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLDI RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLSLDILRGITIAGMIMVNNPGSWGYIYAPLGHAEWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKRIPDR-ADAVK-KVIFRT-------LKLLFWGILLQGGFSHAPDELTYGV----- 134
            ++L++   R + AV  K+I RT       L + ++G+ ++       + L++       
Sbjct: 66  YMSLRKFDFRLSGAVAWKIIRRTIVIFAIGLAIAWFGLTMRTYHQLGEESLSFFERLGRS 125

Query: 135 --DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
             +   IR+ GV+ R+A+ Y + + + +  K                 Y  H  + +  L
Sbjct: 126 MWNFDHIRILGVMPRLAICYGVAAFIALIVK---------------HKYIPH--IVSVTL 168

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           + Y  +L                    GK F  +           N +  +DR +LG NH
Sbjct: 169 IAYFVILI------------------TGKGFEFS---------EDNIISVVDRAILGSNH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MYH                                +PEGLLS++ SI   ++G+  G +I
Sbjct: 202 MYHDNG--------------------------LALDPEGLLSTIPSICHVLVGIFCGGLI 235

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           + TK +  R++     G  L   GL L +    P++K++++ ++V  T G AA   + + 
Sbjct: 236 MRTKDNAVRMQNLFIAGTILTFAGLLLEY--GCPISKKIWSPTFVLTTCGLAASSLALLI 293

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            ++DI   K         G+N + +YV+ A
Sbjct: 294 WIIDIKGYKKWSRFFEAFGINPLFMYVLGA 323


>gi|427781073|gb|JAA55988.1| Putative heparan-alpha-glucosaminide n-acetyltransferase
           [Rhipicephalus pulchellus]
          Length = 337

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 86/349 (24%), Positives = 157/349 (44%), Gaps = 40/349 (11%)

Query: 84  VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           +GV++A+ ++ +  ++    ++  + +K      L+  G     + L+  VD+  +R+ G
Sbjct: 1   MGVSLAMTIRSLLRKSVTRGRIFLQIVK----RSLILFGLGIMTNTLSGDVDLNTLRIPG 56

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQR+A SYL+ + V +      +             Y   WL+A  +L ++LAL +   
Sbjct: 57  VLQRLAFSYLVAATVHLLFAKPHEGQLVWAPVRDVLAYWPEWLLAIPMLALHLALTFFLP 116

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP+     +          F    G          A G+IDR++ G +H+Y  P  R   
Sbjct: 117 VPNCPQGYLGPGGLHLNSSFENCTG---------GAAGFIDRRIFGNSHIYQTPDMRH-- 165

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
               D+              H P++PEG L  ++SI    +G+  G +++      AR+ 
Sbjct: 166 --VYDT--------------HLPYDPEGTLGCLTSIFLVFLGLQAGKILLTFPEWKARVI 209

Query: 324 QWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           +W   G    +I G+  +F+     IP+NK L+++S++  T+  A  +   +Y L+D+  
Sbjct: 210 RWCIWGLLCGIIAGVLCNFSKEEGWIPINKNLWSVSFILSTASTAFFLLVVLYYLIDVCG 269

Query: 380 LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCFLFI 428
             +   PL + GMN++ VYV   E +   F   W   + H    C+LF+
Sbjct: 270 W-WSGAPLIYPGMNSLAVYV-GHEILHGVFPWAWQCPESH---WCYLFM 313


>gi|410631381|ref|ZP_11342056.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
 gi|410148827|dbj|GAC18923.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
          Length = 330

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 160/356 (44%), Gaps = 74/356 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
            RL +LD+FRG+ +  MILV++ G  W  I     HA W+G  L D + PFF+FIVGV+I
Sbjct: 21  NRLLALDVFRGITITAMILVNNPG-SWQHIYGPMRHAQWHGWTLTDLIFPFFIFIVGVSI 79

Query: 89  ALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM--IRLC 142
            L+ +R+      R+  +K+ + RT KL+  G  L   +     E    V+ R+  IR  
Sbjct: 80  QLSGQRMLASGTSRSSIIKQALLRTFKLVLLGWFLALFYYDFGAEHYNWVEQRLLNIRFM 139

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA+ Y +  L+ +F                        L+ + V ++ L  L   
Sbjct: 140 GVLQRIAVVYFICVLMWLFLSK-------------------RGLLVSFVGILLLYWLALA 180

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            +P             Y    N   G+    N   N   ++D  +    H+Y+  A    
Sbjct: 181 AIP------------YYDNAGNSYSGLLEFGN---NLSAWLDSWLFAAPHLYYSSA---- 221

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                 +PF               F+PEG+LS++ ++ S I G+  G+++  +  +  R 
Sbjct: 222 ------TPFA--------------FDPEGILSTLPAVASGISGMLVGYLLTQSSLN-TRN 260

Query: 323 KQWV--TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           K  V   +G   ++ G   H     P+NK L+T SYV +TS  A LV +++  ++D
Sbjct: 261 KTIVLLVLGSIGVLLGELWH--GYFPINKALWTSSYVLLTSAYACLVLASLIFILD 314


>gi|359728547|ref|ZP_09267243.1| hypothetical protein Lwei2_17159 [Leptospira weilii str.
           2006001855]
          Length = 383

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 121/405 (29%), Positives = 173/405 (42%), Gaps = 94/405 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAKWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG +I ++L  K   DR+     +  R++ L+  G+LL      +  EL       
Sbjct: 60  FLFAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFAEL------- 112

Query: 138 MIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             R+ GVLQRI   Y +V SL  IF        + V  F I  L    W++         
Sbjct: 113 --RIPGVLQRIGFVYWVVASLYLIF------PGKKVLIFLIPVLLVHTWILTHI------ 158

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                   P  +  +  +   D G                     +IDR ++G  H+   
Sbjct: 159 -------APPGEAMVSLEQGKDIG--------------------AWIDRVIIGEKHL--- 188

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
             W+ SK                       ++PEGLLS V+SI +++ GV  G ++   +
Sbjct: 189 --WKFSKT----------------------WDPEGLLSGVASIATSLFGVLCGFILFLRE 224

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYA 373
           G   R +   T G   L   + L +  ++P+NK L+T SY   T+G A      F  +  
Sbjct: 225 GG-GRSRVLSTFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLAFFCIGFFEYLNW 283

Query: 374 LV-----DIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
           LV     +  NLK  F PL   G NA+LV+V    GI A  +N W
Sbjct: 284 LVSSKERNGLNLKIFFQPLLVFGKNAILVFV--GSGILARTLNLW 326


>gi|423239671|ref|ZP_17220787.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
           CL03T12C01]
 gi|392645711|gb|EIY39434.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
           CL03T12C01]
          Length = 363

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 164/376 (43%), Gaps = 84/376 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + H  +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHVAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG +           F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D+   +    
Sbjct: 232 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNWCA 289

Query: 386 PLAWIGMNAMLVYVMA 401
                G+N + +YV A
Sbjct: 290 FFRSFGVNPLFIYVFA 305


>gi|372223192|ref|ZP_09501613.1| hypothetical protein MzeaS_12804 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 364

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 170/376 (45%), Gaps = 84/376 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ ++DIFRG+ ++LM+LV+  G  W  +     HA W+G    D V PFFLFIVG +
Sbjct: 4   NKRIVAVDIFRGMTISLMVLVNTPG-TWSSVYSPFLHAQWHGYTPTDLVFPFFLFIVGTS 62

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           I  A K         KK+  R LKL+  G+ L G F+ +     +  D   IR  GVLQR
Sbjct: 63  IVFAYKNKKPSLKTYKKIGVRALKLIILGLFL-GAFTLS---FPFFKDFENIRFPGVLQR 118

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I + + + S                   ++F    W  L     + V L LL+G +    
Sbjct: 119 IGVVFFITS-------------------ALFLNLNWKKL-----IFVVLVLLFGYWA--- 151

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
             T+I  D          +   RA    P N   ++D ++ G +H Y             
Sbjct: 152 LMTLIPVDGQP-------STLTRA----PNNLANWLDVQLFG-SHNY------------- 186

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                   + D        ++PEG LS++ +I S ++G+  G +++  +    +      
Sbjct: 187 --------KPD--------YDPEGFLSTLPAIASALLGIFTGEILVGKQNK--KTPLLFG 228

Query: 328 MGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLP 386
           +G  LLI G   H  + + P+NK L+T S+V VT+G A +V + I+ L D   L +  + 
Sbjct: 229 LGGCLLIIG---HLWDIVFPINKALWTSSFVLVTAGWANVVLAIIHYLSDDRKLIFGSI- 284

Query: 387 LAWIGMNAMLVYVMAA 402
             ++G NA+ +Y +++
Sbjct: 285 FKYVGFNAITIYFLSS 300


>gi|338211253|ref|YP_004655306.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305072|gb|AEI48174.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 374

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 114/457 (24%), Positives = 197/457 (43%), Gaps = 118/457 (25%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAV-------ALMILVDHAGGDW-------PEISHAPWNG 69
            Q  +  +  RL SLD  RG  +        +  L+  A G W        ++SH  WNG
Sbjct: 2   SQPSTDTRPHRLLSLDALRGFDMFWITGGEEIFHLLAKATG-WTGAIIMAEQLSHPDWNG 60

Query: 70  CNLADFVMPFFLFIVGVA----IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
               D + P FLF+ GV+    + + L+R  DR   ++KVI R L L+  GI+   G   
Sbjct: 61  FRAYDLIFPLFLFLSGVSAPYSLGVRLERGDDRGKMLRKVIQRGLTLVLLGIIYNNGLQI 120

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD-VQDKDQSVGRFSIFRLYCWH 184
            P E         +R   VL RI L+ +   ++ ++T   VQ              Y W 
Sbjct: 121 KPLE--------DMRFPSVLGRIGLAGMFAQIIYLYTSTRVQ--------------YIW- 157

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
                     +++LL G     W F ++              CG    +   CN V Y+D
Sbjct: 158 ----------FVSLLLGY----WAFVMLVPVPG---------CGA-GLMTMECNPVSYLD 193

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R ++                        G L KD     H   +PEGL+S++ +I + ++
Sbjct: 194 RLII-----------------------PGHLHKD----IH---DPEGLVSTIPAIATGLL 223

Query: 305 GVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           G+  G+++   +   +R ++ + +   G   LI G    +    P+NK L+T S+V    
Sbjct: 224 GIFAGNLLRADERSTSRTQKVLVLFVAGILFLIIGKLWDYV--FPINKNLWTSSFVMTVG 281

Query: 362 GAAALVFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAGFIN 411
           G + ++ S  Y ++D+  WN +Y ++ LA IGMN++L+Y+          +E +F G + 
Sbjct: 282 GWSLILLSLFYWIIDVQGWN-RYAWI-LAVIGMNSILIYMAQHFIDFEHTSERLFRGIVK 339

Query: 412 GWYYGDPHNTLVCFL--FIISYILHSFLWELRKFLYV 446
             Y+ +P+  +   +   I+ +++  FL++ + FL V
Sbjct: 340 --YFSEPYQKVFYAIGYVIVEWLMLWFLYKKKVFLKV 374


>gi|421097001|ref|ZP_15557700.1| putative membrane protein [Leptospira borgpetersenii str.
           200901122]
 gi|410800246|gb|EKS02307.1| putative membrane protein [Leptospira borgpetersenii str.
           200901122]
          Length = 383

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 120/414 (28%), Positives = 178/414 (42%), Gaps = 112/414 (27%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS    +R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKERILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VGV+I ++L  K   +R+     +  R++ L+  G+LL   G +S A         
Sbjct: 60  FLFAVGVSIPISLYSKNGINRSKVWIGICIRSISLILLGLLLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y  V SL  IF      K   +    +  ++ W           
Sbjct: 112 ---LRVPGVLQRIGFVYWTVASLYLIF----PGKKVLIFLILVLLVHTW----------- 153

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
               +    +P  + T+  +   D G                     +IDR ++G  H+ 
Sbjct: 154 ----ILTHIIPPGESTVSLEQGKDIG--------------------AWIDRMIIGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               W+ SK                       ++PEGLLS ++SI +++ GV  G ++  
Sbjct: 189 ----WKFSKT----------------------WDPEGLLSGIASIATSLFGVLCGFILF- 221

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFT-------NAIPLNKQLYTLSYVCVTSGAAALV 367
                  L++ V     L IFGL   FT        ++P+NK L+T SY   T+G +   
Sbjct: 222 -------LREGVGKNRVLGIFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLSFFC 274

Query: 368 -----FSAIYALVDIW---NLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
                +  +  L+  W   NLK  F PL   G NA+LV+V    GI A  +N W
Sbjct: 275 IGFFEYLNLLILLKEWNRLNLKILFQPLLVFGKNAILVFV--GSGILARTLNLW 326


>gi|319952891|ref|YP_004164158.1| heparan-alpha-glucosaminide n-acetyltransferase [Cellulophaga
           algicola DSM 14237]
 gi|319421551|gb|ADV48660.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           algicola DSM 14237]
          Length = 363

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 174/374 (46%), Gaps = 82/374 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
           +R+ ++DIFRG+ + LMILV++ G  W  +     HA W+G    D V PFFLFIVG +I
Sbjct: 3   ERIVAVDIFRGMTIVLMILVNNPG-TWAAVYAPFLHADWHGYTPTDLVFPFFLFIVGTSI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             A       AD  KK++ R+LKL+  G+LL       P    +  D   IR  GVLQRI
Sbjct: 62  VFAYSTKKPTADTYKKIVSRSLKLIGLGLLLGAFTLVFP----FVKDFSEIRFPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            + + + S+                   +F  + W  L+   V ++    L   ++P   
Sbjct: 118 GVVFFITSI-------------------LFLNFNWKQLIGVTVFILIGYWLAMGFIP--- 155

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              +N  ++ + +              P N   Y+D  +LG +HM+              
Sbjct: 156 ---VNGIASTFDRA-------------PNNLANYVDLNILG-SHMW-------------- 184

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                   KD        ++PEGLLS++ +I S ++GV  G +++  + +  +    + +
Sbjct: 185 --------KD-------DYDPEGLLSTIPAIASCLLGVFTGKILLSKQQN--KTTIVLGL 227

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLA 388
           G  LLI G   HF    P+NK L++ S+V VT+G A L+ + IY   D+  +++  +   
Sbjct: 228 GLILLIVGHAWHFI--FPINKALWSSSFVLVTAGWANLILALIYYSTDVRKVQFGTI-FK 284

Query: 389 WIGMNAMLVYVMAA 402
           + G NA+ V+ +++
Sbjct: 285 YAGANAITVFFLSS 298


>gi|417780880|ref|ZP_12428636.1| PF07786 family protein [Leptospira weilii str. 2006001853]
 gi|410778851|gb|EKR63473.1| PF07786 family protein [Leptospira weilii str. 2006001853]
          Length = 383

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 120/405 (29%), Positives = 173/405 (42%), Gaps = 94/405 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAKWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG +I ++L  K   DR+     +  R++ L+  G+LL      +  EL       
Sbjct: 60  FLFAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFAEL------- 112

Query: 138 MIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             R+ GVLQRI   Y +V SL  IF        + V  F I  L    W++         
Sbjct: 113 --RIPGVLQRIGFVYWVVASLYLIF------PGKKVLIFLIPVLLVHTWILTHI------ 158

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                   P  +  +  +   D G                     +IDR ++G  H+   
Sbjct: 159 -------APPGEAMVSLEQGKDIG--------------------AWIDRVIIGEKHL--- 188

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
             W+ SK                       ++PEGLLS V+SI +++ GV  G ++   +
Sbjct: 189 --WKFSKT----------------------WDPEGLLSGVASIATSLFGVLCGFILFLRE 224

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYA 373
           G   R +   T G   L   + L +  ++P+NK L+T SY   T+G +      F  +  
Sbjct: 225 GG-GRSRVLSTFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLSFFCIGFFEYLNW 283

Query: 374 LV-----DIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
           LV     +  NLK  F PL   G NA+LV+V    GI A  +N W
Sbjct: 284 LVSSKERNGLNLKIFFQPLLVFGKNAILVFV--GSGILARTLNLW 326


>gi|153808903|ref|ZP_01961571.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
 gi|423220258|ref|ZP_17206753.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
           CL03T12C61]
 gi|149128236|gb|EDM19455.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
 gi|392623335|gb|EIY17438.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
           CL03T12C61]
          Length = 371

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 169/396 (42%), Gaps = 88/396 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
             +RL +LD+ RG+ +A MILV++ G  W  +     HA WNG    D + PFF+FI+G+
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPG-SWGYVYFPLKHAQWNGLTPTDLIFPFFMFIMGI 63

Query: 87  AIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLC 142
           +  ++L++        A  K+I RT+ +   GI +       +  D L +      IR+ 
Sbjct: 64  STYISLRKYNFTFSTPAALKIIKRTIVIFLIGIAINWFALLCYYHDPLPFA----QIRVL 119

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GV+QR+AL Y   +L+ +  K                     +L+ A ++  ++ L+ G 
Sbjct: 120 GVMQRLALCYGASALIALLIKHKYIP----------------YLIVALLVGYFILLITGN 163

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                            G  +N T           N +  +DR +LG  HMY        
Sbjct: 164 -----------------GFAYNET-----------NILSIVDRSILGDAHMY-------- 187

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
               QD+                  +PEGLLS++ SI   +IG   G +++  K    +L
Sbjct: 188 ----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIREKL 227

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
           ++   +G  L   G  L +    P NK++++ ++V VT G  +   + +  ++DI   K 
Sbjct: 228 ERLFLIGTILTFAGFLLSY--GCPFNKKIWSPTFVLVTCGLGSSFLALLVWIIDIKGYKK 285

Query: 383 PFLPLAWIGMNAMLVYVMA-AEGIFAGFINGWYYGD 417
                   G+N + +YV+A    I  G I   Y G+
Sbjct: 286 WSRFFESFGVNPLFIYVLADVLAILLGVITVTYQGE 321


>gi|390956359|ref|YP_006420116.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
 gi|390411277|gb|AFL86781.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
          Length = 404

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 170/405 (41%), Gaps = 94/405 (23%)

Query: 18  EPDVSDQQE----KSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCN 71
           +P+++   +    ++  KT+RL S+D+ RGL +A MILV++    G + E+ HA WNG  
Sbjct: 10  DPEIASTVDSDVVRTAPKTERLLSVDVLRGLTIAFMILVNNQPGPGAFFELQHAQWNGFT 69

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQG-GFSHA 126
           L D V P FLF+VG+++ L+      +  + K +   TL+    L  +GI++    F H 
Sbjct: 70  LTDLVFPTFLFLVGLSLVLSTAARLAKGASRKTLFLHTLRRSAVLALFGIVVNTFPFQH- 128

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                    +  IR  GVLQR AL YL+VS + +  K  +DK                  
Sbjct: 129 ---------LDRIRFYGVLQRTALCYLVVSGLCLLRKGWKDKAA---------------- 163

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +A   LVVY  L+    VP                 F          +P  N   ++DR 
Sbjct: 164 IAVACLVVYWVLMRFVPVPG----------------FGTPTHEIPINDPNGNLTAWLDRL 207

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +    H+Y                                 +PEGLLS++ +I + + GV
Sbjct: 208 IFAPQHLYQQVR-----------------------------DPEGLLSTLPAISTALYGV 238

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
             G  +  T+   A+       G A+ + G    +    PLNK+L+T S+     G + L
Sbjct: 239 LAGTWLRTTRSTTAKAVGLALGGVAMTVAGWLWSY--GFPLNKKLWTSSFSLWAGGLSLL 296

Query: 367 VFSAIYALVDI--WN--------LKYPFLPLAWIGMNAMLVYVMA 401
           + +    +VD+  W             ++PL   G N++L Y+++
Sbjct: 297 LLALAVYVVDVKRWGRDGVGANATPAAYMPLMVFGTNSILAYMVS 341


>gi|383110831|ref|ZP_09931649.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
 gi|313694406|gb|EFS31241.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
          Length = 371

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 101/397 (25%), Positives = 170/397 (42%), Gaps = 90/397 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ----GGFSHAPDELTYGVDVRMIRL 141
             ++LK+        A  K+I RT+ +   GI L       ++H P      +    IR+
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y   +L+ +  K                    H  +   ++V    LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++      +I  +   Y +                N +  +DR +LG  HMY       
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                QD+                  +PEGLLS++ SI   +IG   G +++  K    +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           L++   +G  L   G    F+   P NK++++ ++V  T G  + + + +  ++DI   K
Sbjct: 227 LERLFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLATCGLGSSLLALLVWIIDIKGCK 284

Query: 382 YPFLPLAWIGMNAMLVYVMAAE-GIFAGFINGWYYGD 417
                    G+N + +YVMA    +  G I   Y G+
Sbjct: 285 KWSRFFESFGVNPLFIYVMAGVIAVLVGAITVTYQGE 321


>gi|375149723|ref|YP_005012164.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361063769|gb|AEW02761.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 368

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 162/388 (41%), Gaps = 94/388 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  +R  SLD+FRGL + LMI+V+  G     +  + HA WNG  LAD V P FLF VG 
Sbjct: 2   IPNKRFISLDVFRGLIICLMIIVNTPGSHDTSFALLQHANWNGFTLADLVFPSFLFAVGN 61

Query: 87  AIALALK--RIPDRADAVKKVIFRTLKLL----------FWGILLQGGFSHAPDELTYGV 134
           A+  +++  +   +   + K+  RTL L           F+    QG         T   
Sbjct: 62  ALFFSMQKWKTMTQGQVLAKIGKRTLLLFLLGYLLYWFPFFETNTQGHIVFKSFAGT--- 118

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
                R+ GVLQRIAL Y + SL+  + K                      +++A +LV 
Sbjct: 119 -----RIMGVLQRIALCYGIASLLIYYLKPKGAL-----------------IVSAIILVA 156

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y  LL+      W          D G           KLN   NAV   D  +LG +HM 
Sbjct: 157 YPGLLF------WL--------GDPGN----------KLNMVGNAVTKFDLWLLGPDHMN 192

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
           H                              PFEPEG+LS++ +I + + G   G   I 
Sbjct: 193 HGEV--------------------------VPFEPEGILSTLPAITNVVAGYLVGWY-IQ 225

Query: 315 TKGHLAR-LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           T G   R L + +  G  L   GL  ++   IP+NK L+T S+V  ++G   L+ +AI  
Sbjct: 226 TAGKTKRMLLRLIATGAGLTFLGLCWNY--VIPINKNLWTSSFVVHSTGLDCLLLAAIIY 283

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + D  N+          G NA+ +Y+++
Sbjct: 284 IADFLNITRWTWFFEVFGKNALFIYLLS 311


>gi|319641808|ref|ZP_07996487.1| transmembrane protein [Bacteroides sp. 3_1_40A]
 gi|345518546|ref|ZP_08797993.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
 gi|254835931|gb|EET16240.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
 gi|317386564|gb|EFV67464.1| transmembrane protein [Bacteroides sp. 3_1_40A]
          Length = 363

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 97/385 (25%), Positives = 169/385 (43%), Gaps = 87/385 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGHVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+ L      A    T+ ++   +R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
           +A+ Y + SLV I  K                    H    A +LV   VY  LL     
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                          G  FN++           N V   D   LG +HMYH         
Sbjct: 159 -------------AMGDGFNLSA---------TNIVARFDVWALGTSHMYH--------- 187

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                  +G +           F+PEGLLS++ ++   ++G + G ++   K +  ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
              +G  L   G  L +    P+NK++++ ++V  T G A+   + +  ++DI   +   
Sbjct: 231 LFLVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLALLIWIIDIKGYQGWC 288

Query: 385 LPLAWIGMNAMLVYVMAA-EGIFAG 408
           +     G+N + +YV A   GI  G
Sbjct: 289 VFFRSFGVNPLFIYVFAEIMGILLG 313


>gi|421090259|ref|ZP_15551055.1| putative membrane protein [Leptospira kirschneri str. 200802841]
 gi|421129053|ref|ZP_15589263.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
 gi|410000994|gb|EKO51618.1| putative membrane protein [Leptospira kirschneri str. 200802841]
 gi|410359757|gb|EKP06816.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
          Length = 369

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 172/389 (44%), Gaps = 90/389 (23%)

Query: 38  LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-- 92
           +D+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFLF VG++I L++  
Sbjct: 1   MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K    ++     +  R++ L+  G+ L      +  EL         R+ GVLQRI   Y
Sbjct: 61  KNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL---------RIPGVLQRIGFVY 111

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
            +V+ + +                  R+    W+    +L+V+  +L     P       
Sbjct: 112 WIVASLHLILPK--------------RMILISWI---PILLVHTWVLIQIPAPG------ 148

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
             +S  Y             L P  +   +IDR V G NH+     W+ SK         
Sbjct: 149 --ESIVY-------------LEPGKDIGAWIDRNVFGENHL-----WKFSKT-------- 180

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
                         ++PEGL S +SSI ++++GV  G  I+ +K +  + +     GF +
Sbjct: 181 --------------WDPEGLFSGISSIATSLLGVFCGS-ILSSKTNEIKKQILSIFGFGI 225

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALVDI--WNL---KYPF 384
           L+  + L +   +P+NK L+T SYV  T+G A L    F  +  L+ I  W+    +  F
Sbjct: 226 LLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLNFLLRIKKWDQLQSEIIF 285

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            P    G NA+LV+V    G+ A  +N W
Sbjct: 286 QPFLVFGKNAILVFV--GSGLLARTLNLW 312


>gi|329851309|ref|ZP_08266066.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
 gi|328840155|gb|EGF89727.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
          Length = 384

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 162/380 (42%), Gaps = 75/380 (19%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGV 86
           +  R  +LDI RGL++  M+L +   G W E    + HA W G    D V P FLF +GV
Sbjct: 7   QGNRWLALDILRGLSIIFMLL-NLNPGSWSEQYGWVLHAKWEGATFIDMVAPVFLFCIGV 65

Query: 87  AIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           AI L+L+R  +  ++     K ++ R   L+  G+ L               D   +R+ 
Sbjct: 66  AIPLSLRRRIEAGESNGQLAKHILNRAGILVLLGLFLNA---------YPAFDWAHMRIP 116

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WHWLMAACVLVVYLALLYG 201
           GVLQRI + Y  V+L  +FT   +           FRL     W+    VL+ + ALL  
Sbjct: 117 GVLQRIGVCYGAVALFVLFTARREGG---------FRLNAKAGWIAWTFVLLSWTALLMF 167

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
             VP             +G           + +P  +   Y+DR VL  +HM+  P W  
Sbjct: 168 VPVP------------GFGA---------PRFDPVGSWPAYVDRLVLTTDHMF--PWW-- 202

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                   P +G +           F+P+GLLS+     + + G   GH      G  A 
Sbjct: 203 --------PVDGKVV----------FDPDGLLSTWPVCANVLFGALVGHA--RLTGITAP 242

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           + + +  G  L+   + LH T  IP+ K ++T ++   T G + +   A+  LV+ WN  
Sbjct: 243 ILKMLVAGGLLMAAAVGLHTT--IPIIKHIWTATFALFTIGFSLVSLGALTLLVERWNSA 300

Query: 382 YPFLPLAWIGMNAMLVYVMA 401
             F P    G N +L Y+++
Sbjct: 301 PAFYPAQVYGSNPLLAYMLS 320


>gi|150004749|ref|YP_001299493.1| transmembrane protein [Bacteroides vulgatus ATCC 8482]
 gi|294775179|ref|ZP_06740705.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|149933173|gb|ABR39871.1| putative transmembrane protein [Bacteroides vulgatus ATCC 8482]
 gi|294450991|gb|EFG19465.1| putative membrane protein [Bacteroides vulgatus PC510]
          Length = 363

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 97/385 (25%), Positives = 169/385 (43%), Gaps = 87/385 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+ L      A    T+ ++   +R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
           +A+ Y + SLV I  K                    H    A +LV   VY  LL     
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                          G  FN++           N V   D   LG +HMYH         
Sbjct: 159 -------------AMGDGFNLSA---------TNIVARFDVWALGTSHMYH--------- 187

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                  +G +           F+PEGLLS++ ++   ++G + G ++   K +  ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
              +G  L   G  L +    P+NK++++ ++V  T G A+   + +  ++DI   +   
Sbjct: 231 LFLVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLALLIWIIDIKGYQGWC 288

Query: 385 LPLAWIGMNAMLVYVMAA-EGIFAG 408
           +     G+N + +YV A   GI  G
Sbjct: 289 VFFRSFGVNPLFIYVFAEIMGILLG 313


>gi|346224087|ref|ZP_08845229.1| hypothetical protein AtheD1_02868 [Anaerophaga thermohalophila DSM
           12881]
          Length = 369

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 104/379 (27%), Positives = 164/379 (43%), Gaps = 87/379 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K++R  +LD+ RG+ +ALMI V+  G     +  + H+ W+GC   D V PFFLF+VGV+
Sbjct: 3   KSERYLALDVLRGMTIALMITVNTPGSWQYVYAPLRHSSWHGCTPTDLVFPFFLFVVGVS 62

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           +  +  +  +  +  +  ++  RTL +   G+ L    +  P  +T   D   +R+ GVL
Sbjct: 63  MFFSFAKYGNTLNKASFNRLGRRTLLIFAIGLFL----NSFPQWMT---DYSSLRIMGVL 115

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL+Y   SL+ +  K               R Y  +  +   +L+VY          
Sbjct: 116 QRIALAYGFASLIVLSMK---------------RKYIPY--LGLAILLVY---------- 148

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
            W           Y    NVT                 D  +LG  H+Y           
Sbjct: 149 -WGILAWFGGDDPYSLEGNVTIP--------------FDAAILGEQHLY----------T 183

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT-KGHL-ARLK 323
               PF+                PEGLLS++ ++++ ++G   G  I +T K  L ARL 
Sbjct: 184 GFGIPFD----------------PEGLLSTIPAVVTVLLGYLTGVFISNTEKAKLPARLA 227

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
            +   G  + I G    F    P+NK L+T SYV  T+G AALVF+ +  ++D+   K  
Sbjct: 228 LY---GVLVTIIGRLWGF--VFPINKPLWTSSYVLYTAGLAALVFALLIFIIDVKGYKKW 282

Query: 384 FLPLAWIGMNAMLVYVMAA 402
                  GMN + +Y ++ 
Sbjct: 283 TSFFVVFGMNPLFIYALSG 301


>gi|444731031|gb|ELW71398.1| Heparan-alpha-glucosaminide N-acetyltransferase [Tupaia chinensis]
          Length = 732

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 161/378 (42%), Gaps = 69/378 (18%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL  +D FRG+A+ LM+ V++ GG +    HA WN  +     +P  L  +GV   +  
Sbjct: 265 HRLRCVDTFRGIALILMVFVNYGGGKYWYFKHASWN-VSWDKVRIPGVLQRLGVTFFVV- 322

Query: 93  KRIPDRADAVKKVIFRTLKLLFWG-------------------ILLQGGFSHAPDELTYG 133
                   AV +++F     +++G                   I  +      P  L   
Sbjct: 323 --------AVLELLFAKPVCIYYGVFNFSVNDIYAAAGMFKIQIARENCVEEFPVNLYRD 374

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAA 189
           +    +R+ GVLQR+ +++ +V+++E+ F K V +   S       R     W  WL+  
Sbjct: 375 LSWDKVRIPGVLQRLGVTFFVVAVLELLFAKPVPENCASERSCLSLRDVTSSWPQWLVIL 434

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            +  ++L L +   VP      +      D+GK  N T G          A GYID  +L
Sbjct: 435 MLESIWLGLTFFLPVPGCPKGYLGPGGIGDFGKYPNCTGG----------AAGYIDHLLL 484

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +H+Y HP+                            ++PEG+L +++SI+   +GV  
Sbjct: 485 GADHLYKHPS------------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQA 526

Query: 309 GHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSG 362
           G ++++    TK  L R   W  +   L+  GLT    N   IP+NK L+++SYV   S 
Sbjct: 527 GKILLYYKDRTKDILIRFTAWCCI-LGLISIGLTKISENEGFIPVNKNLWSISYVTTLSS 585

Query: 363 AAALVFSAIYALVDIWNL 380
            A  +   +Y +VD+  L
Sbjct: 586 FAFFILLILYPVVDVRGL 603


>gi|333029673|ref|ZP_08457734.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332740270|gb|EGJ70752.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 387

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 95/392 (24%), Positives = 167/392 (42%), Gaps = 95/392 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           + QRL +LDI RG+ +A MILV++ G  W  I     HA WNG    D V PFF+FI+G+
Sbjct: 5   ENQRLLALDILRGITIAGMILVNNPG-SWGSIYAPLGHAEWNGLTPTDLVFPFFMFIMGI 63

Query: 87  AIALALK--RIPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAP------DELT 131
           +   +L+  +     +A  K++ RT       L + ++ + L+   S A       + L+
Sbjct: 64  STYFSLRKYKFEFSKEAALKILKRTIIIFAIGLGIAWFSLFLRTWNSLASADISIFERLS 123

Query: 132 YGVDV-RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
             + V   +R+ GV+ R+AL+Y   +++ +  K        VG                 
Sbjct: 124 QSIFVFENLRILGVMPRLALTYCATAIIALTIKHKYIPTLIVG----------------- 166

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +L+VY             F +   +  +Y +                N +  +D+ +LG 
Sbjct: 167 ILIVY------------TFILFLGNGFEYNE---------------TNILSIVDKAILGE 199

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NHMY                                 +PEGL+S++ +I   ++G   G 
Sbjct: 200 NHMYKDNG----------------------------IDPEGLVSTIPAIAHVLLGFFVGK 231

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +    K   ++++    MG  L   GL L +    P+NK++++ ++V  T G A+ + + 
Sbjct: 232 IFTEKKDIHSKVEFLFIMGSILTFVGLLLSY--GCPINKKIWSPTFVLTTCGLASTLLAL 289

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           +  ++DI   K   L     G+N + +YVM A
Sbjct: 290 LIWIIDIKGYKRWSLFFESFGVNPLFMYVMGA 321


>gi|423312333|ref|ZP_17290270.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
           CL09T03C04]
 gi|392688817|gb|EIY82101.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
           CL09T03C04]
          Length = 363

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 97/385 (25%), Positives = 169/385 (43%), Gaps = 87/385 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+ L      A    T+ ++   +R  GV+QR
Sbjct: 67  ISLRKYNFTYSHAILRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
           +A+ Y + SLV I  K                    H    A +LV   VY  LL     
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                          G  FN++           N V   D   LG +HMYH         
Sbjct: 159 -------------AMGDGFNLSV---------TNIVARFDVWALGTSHMYH--------- 187

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                  +G +           F+PEGLLS++ ++   ++G + G ++   K +  ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
              +G  L   G  L +    P+NK++++ ++V  T G A+   + +  ++DI   +   
Sbjct: 231 LFLVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLALLIWIIDIKGYQGWC 288

Query: 385 LPLAWIGMNAMLVYVMAA-EGIFAG 408
           +     G+N + +YV A   GI  G
Sbjct: 289 VFFRSFGVNPLFIYVFAEIMGILLG 313


>gi|311274235|ref|XP_003134250.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Sus scrofa]
          Length = 297

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/294 (31%), Positives = 136/294 (46%), Gaps = 46/294 (15%)

Query: 140 RLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFRLY----CW-HWLMAACVLV 193
           R+ GVLQR+ ++Y +V+++E+ F K V +   S    S F L      W  WL    +  
Sbjct: 6   RIPGVLQRLGVTYFVVAVLELLFAKPVPESCAS--ERSCFSLLDVTSSWPQWLFVLVLEG 63

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           V+LAL +   VP      +      D GK  N T G          A GYIDR +LG +H
Sbjct: 64  VWLALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDH 113

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP    S A    +                 ++PEG+L +++SIL   +GV  G ++
Sbjct: 114 LYQHP----SPAVLYHT--------------KVAYDPEGILGTINSILMAYLGVQAGKIL 155

Query: 313 IH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           ++    TKG L R   W      L+   LT    N   IP+NK L++ SYV   S +A L
Sbjct: 156 LYYKDRTKGILIRFAVWGCF-LGLISVALTKASENEGFIPVNKNLWSTSYVTTLSSSAFL 214

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN 420
           +   +Y +VD+  L +   P  + GMN++LVY M  E     F   W  GD  +
Sbjct: 215 ILLVLYPIVDVKGL-WTGTPFFYPGMNSILVY-MGHEVFANYFPFQWRLGDSQS 266


>gi|375110537|ref|ZP_09756759.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
 gi|374569481|gb|EHR40642.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
          Length = 394

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 117/436 (26%), Positives = 199/436 (45%), Gaps = 89/436 (20%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
           QQ  +     R+ +LD  RGLA+  MILV++ G     +P + HA W+G    D + P F
Sbjct: 7   QQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66

Query: 81  LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH--APDELTYGVDV 136
           L +VG+AI  +LA +++  +A+ +++   R LKL   G+ L   + +   P+       +
Sbjct: 67  LVMVGMAIPYSLAGRQLLPKAEQIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +R  GVLQRI + Y    L+ +++        + GR          WL   C+L   L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLLIVLYSG-------TRGRIL--------WLSGLCLLYFLL 171

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                 +VP        +D  +YG  F    G+    N   N   ++D ++LG NH+Y  
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHQLLGPNHVYFR 212

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            A          +PF               F+PEG+LS++ +I S + GV     ++ +K
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSK 247

Query: 317 GHLA-RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             LA +L+  +  G A +      H   A+P+NK L+T ++V ++SG  A++ +    L 
Sbjct: 248 AELAFKLRVLLLSGLAAVWIAELAH--PALPINKMLWTPTFVLLSSGFTAVILALFLWLT 305

Query: 376 DIWNLKYPFLPLAWIGMNAMLVYVMAAEG--------IFAGFINGWYY--------GDPH 419
           ++   +    PL   G+NA+L +++A           +    +  W Y        GD +
Sbjct: 306 EMKRYRLWTAPLLVFGVNAILFFMLAGVAARVLSMVPVAGTTLGNWLYRSAFQPLFGDYN 365

Query: 420 NTL---VCFLFIISYI 432
            +L   +CFL ++SYI
Sbjct: 366 GSLAYAICFL-LLSYI 380


>gi|330805524|ref|XP_003290731.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
 gi|325079117|gb|EGC32733.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
          Length = 644

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 86/145 (59%), Gaps = 15/145 (10%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           +S  + +++ K  RL SLD+FRG ++ +MI V++ GG +   +H+ WNG  +AD V P+F
Sbjct: 194 ISSVERENNKKKDRLKSLDVFRGFSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWF 253

Query: 81  LFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           +FI+G+A+ L+   +  R       V+K++ R+  L   G+ +             GV++
Sbjct: 254 VFIMGIAMPLSFNAMERRGTTKLVIVQKLVRRSAILFALGLFINN-----------GVNL 302

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIF 161
           +  R+ GVLQR A+SYL+V L+ +F
Sbjct: 303 QHWRILGVLQRFAISYLIVGLIMLF 327



 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 87/177 (49%), Gaps = 21/177 (11%)

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRS-----KACTQDSPFEGPLRKDAPSWCHAPFEP 289
           P C   GY+    +G N +Y +     +     K  T +  F+ P  +D   +  + ++P
Sbjct: 425 PGC-PKGYLGPGGIGDNSLYPNCTGGAARLIDMKIFTNNHIFQSPTCQDI--YKTSSYDP 481

Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTLHFTNA 344
           EG +  ++SI    IGV  G +I+  K + +RL +W     V  G A  + GL+ +    
Sbjct: 482 EGTVGYLTSIFICFIGVQAGRIILIYKSNRSRLIRWMVWSAVCCGIAAGLCGLSQN-DGV 540

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD---IWNLKYPFLPLAWIGMNAMLVY 398
           IP+NK L++ S+V + +G    V + ++ ++D   IWN      P  ++GMN + +Y
Sbjct: 541 IPINKNLWSPSFVFLMAGFGFFVLTIMFIVIDIKKIWNGS----PFIYVGMNPITIY 593


>gi|408673387|ref|YP_006873135.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387855011|gb|AFK03108.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 423

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 116/437 (26%), Positives = 178/437 (40%), Gaps = 120/437 (27%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           QRL S+D+FRG+ + LM +V++ G DW  I     HA W+GC   D V PFFLFIVG++ 
Sbjct: 3   QRLTSIDVFRGMTIMLMTIVNNPG-DWSHIYAPLEHAEWHGCTPTDLVFPFFLFIVGIST 61

Query: 89  ALA----------LKRIPDRA------------------DAVKKVIFRTLKLLFWGI--- 117
            L+           +RI  RA                    ++ V    ++L+  GI   
Sbjct: 62  VLSSPVKRFDSNTFERIITRALRIFLLGLFLNFFSKIHLGTLEGVPLMLIRLVLTGIATV 121

Query: 118 LLQGGFSHAPD-ELTYGVDVRMIRLC-------------GVLQRIALSYLLVSLVEIFTK 163
           LL G F          G+ V MI LC             GVLQRIA+ YL+VS++   T 
Sbjct: 122 LLLGDFDKKKQFYAAVGLFVFMISLCFSGIEDFASVRIPGVLQRIAMVYLIVSVLYATTN 181

Query: 164 DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF 223
                        +  L C         L+ Y AL+    VP       N  +A++ K  
Sbjct: 182 TTTQ--------IVVGLVC---------LLGYWALMALVPVP-------NVGTANFEK-- 215

Query: 224 NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC 283
                         N   +ID  +L   H+     W  SK                 +W 
Sbjct: 216 ------------GTNLAAWIDNLLLP-GHL-----WSVSK-----------------TW- 239

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
               +PEG+LS++ +I + + GV  G ++ +      +    ++ G   ++ G    + +
Sbjct: 240 ----DPEGILSTIPAIGTALAGVFTGKLLTNDFPKNKKAIYLLSAGVIGVMIGFL--WND 293

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAE 403
             P+NK L+T SYV   +G A LV   +Y ++D+   +    P    G+N M+V+  +  
Sbjct: 294 YFPINKALWTSSYVLYVAGWALLVLGVLYFIIDVLGFEKWTKPFVIFGVNPMVVFFFS-- 351

Query: 404 GIFAGFINGWYYGDPHN 420
           GI    +N      P N
Sbjct: 352 GIIPRALNMIKIAQPEN 368


>gi|340618131|ref|YP_004736584.1| hypothetical protein zobellia_2146 [Zobellia galactanivorans]
 gi|339732928|emb|CAZ96303.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 367

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 158/381 (41%), Gaps = 90/381 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+FRGL + LMI+V+  G DW      + HA W+G    D V P FLF VG A 
Sbjct: 2   KRFKALDVFRGLTICLMIIVNTPG-DWDMTFSPLLHAKWHGFTPTDLVFPSFLFAVGNAF 60

Query: 89  ALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGV 144
           A    +  D+  +D  KK+  RTL +   G  +     FS         V     R+ GV
Sbjct: 61  AFVKTKWADKPLSDIFKKIAKRTLIIFLLGYTMYWIPFFSWTETGDLAAVPFSETRILGV 120

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL--YGT 202
           LQRIAL Y + +++  F  + Q                   + +A +L+ Y  LL  +G 
Sbjct: 121 LQRIALCYFIGAIMIYFLTNRQ-----------------LIIASAVILLGYWGLLSAFGD 163

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           Y  +  F                   VR            IDR +LG +H+Y        
Sbjct: 164 YTLEGNF-------------------VRT-----------IDRMLLGDSHLYMGNG---- 189

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                                  PF+PEGLLS++ SI + + G   G  II       +L
Sbjct: 190 ----------------------IPFDPEGLLSTLPSICNVLGGYLVGKYIIDKGIDYEKL 227

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN--L 380
            + + +G  LL+       T   P+NK+L+T S+V +T G   +V S +   ++     +
Sbjct: 228 AKMLLVGAGLLVVAYLWDLT--FPVNKKLWTSSFVVLTVGLDIVVLSVLIYTIEFLKRPI 285

Query: 381 KYPFLPLAWIGMNAMLVYVMA 401
            Y F  +   G N + +Y+++
Sbjct: 286 NYNFFEI--FGKNPLFIYLLS 304


>gi|404404862|ref|ZP_10996446.1| transmembrane protein [Alistipes sp. JC136]
          Length = 382

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 105/424 (24%), Positives = 182/424 (42%), Gaps = 82/424 (19%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +K++RL SLD+ RG+ +A MILV++    G  +  + HA W+G    D + PFF+FI+GV
Sbjct: 1   MKSERLLSLDVMRGMTIAAMILVNNPAVWGKAYAPLQHAFWHGMTPTDLIYPFFVFIMGV 60

Query: 87  AIALALKRIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           +   +L +  + A  +A  +++ R+  +   G+LLQ        E++Y            
Sbjct: 61  SAFFSLSKRYEGAGREAFSRILRRSAVIFGVGLLLQ--------EISY------------ 100

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
                  Y   + +   T       ++V  F  FR+      +A   L    ALL     
Sbjct: 101 -----FGYGTANFLSGQTSADATWFETVFPFRTFRIMGVLQGLALVYLFGSAALL----C 151

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             ++  I+         +  +  G    L+   N +  +DR VLG +H+Y          
Sbjct: 152 LRFRHLIVAAGGLLILYLVLLQTGNGYSLSAD-NIIAVVDRAVLGESHLY---------- 200

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                      R+  P      FEPEGLLS++  I   ++G   G +++  +    R  +
Sbjct: 201 -----------REWLPDGSRLAFEPEGLLSTLPRIAQFLLGCAAGRILLANEDAPMRFGR 249

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI-----WN 379
               G AL   GL L + +  PLNK++++ S+   TSG A+L+   +  ++D+     W 
Sbjct: 250 LFAFGTALFFTGLLLQYGD--PLNKKIWSSSFALATSGFASLLLGLLCWVIDLHKQVRWT 307

Query: 380 LKYPFLPLAWIGMNAMLVYVMA---AEGIFAGFINGWYY--------GDPHNTLV---CF 425
             +        G+N + +YV A   +  + A  I  W+Y        GD   +LV   CF
Sbjct: 308 GFFRVF-----GVNPLFLYVAAWVLSVTLGALSIKSWFYTTLIDPLFGDASGSLVYSLCF 362

Query: 426 LFII 429
           + ++
Sbjct: 363 ILLV 366


>gi|355694569|gb|AER99714.1| heparan-alpha-glucosaminide N-acetyltransferase [Mustela putorius
           furo]
          Length = 296

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 84/272 (30%), Positives = 129/272 (47%), Gaps = 41/272 (15%)

Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
           +R+ GVLQR+ ++Y +V+++E IF K V +   S       R  ++ W  WL    +  +
Sbjct: 5   VRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASERSCFSLRDIIFSWPQWLFILMLESI 64

Query: 195 YLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           +LAL +   VP      +      D GK  N T G          A GYIDR +LG +H+
Sbjct: 65  WLALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHI 114

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y HP    S A    +                 ++PEG+L S++SI+   +GV  G +++
Sbjct: 115 YQHP----SSAVLYHT--------------QVAYDPEGILGSINSIVMAFLGVQAGKILL 156

Query: 314 H----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALV 367
           +    TK  L R   W      L+   LT    N   IP+NK L+++SYV   S  A  +
Sbjct: 157 YYKDQTKDILIRFTAWSCF-LGLISVALTKFSENEGFIPINKNLWSVSYVTTLSSFAFFI 215

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
              +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 216 LLILYPIVDVKGL-WTGTPFFYPGMNSILVYV 246


>gi|149176468|ref|ZP_01855081.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
 gi|148844581|gb|EDL58931.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
          Length = 518

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 102/444 (22%), Positives = 184/444 (41%), Gaps = 79/444 (17%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALM------------- 49
           E+ AE        ++E DVS +++K     QRL SLD +RG  +  M             
Sbjct: 43  EVSAEVEPAKA--VTEKDVSLKEKKKPETNQRLVSLDAYRGFVMLAMASGGLAIASVVRN 100

Query: 50  ---ILVDHAGGDWP------------EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
              +L  + G  W             ++SH  W G    D + P F+F+VGV++  ++++
Sbjct: 101 SPEVLDQYNGTQWESSWKTLWQTLSYQLSHVEWTGAGFWDLIQPSFMFMVGVSMPFSVRK 160

Query: 95  IPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
              + D+  K+    IFR + L+  G+ L   FS         V         VL +I L
Sbjct: 161 RRQKGDSTFKIWMHAIFRAILLVALGVFLSSQFSPERGFTYEDVPQTNFTFANVLCQIGL 220

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
            YL+V     F  +     Q +G  +I   Y W +          LA +  TY+ + Q  
Sbjct: 221 GYLVV----FFYVNRSFATQMIGVVTILGGY-WFFFYQYMPPEDELAAV-KTYLKEVQ-- 272

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
             +KD A++ +      G+ +  N   NA   +DR++L +   Y +P         +D P
Sbjct: 273 --HKDEAEWSQF----SGIGSAWNKHTNAAAAVDRQLLNMFPRYDNP---------KDDP 317

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
            +G        W +        L+ + SI + + G+  G ++I  +    ++K  +  G 
Sbjct: 318 DQGDTF-----WVNK--GGYQTLNFIPSIATMLFGLMAGQLLISNRLEKMKVKWLLQAG- 369

Query: 331 ALLIFGLTLHFTNAI-------------PLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
            L+ FG+++    +I             P+ K++++  +   ++G A    +  Y ++D+
Sbjct: 370 -LICFGVSMLLDTSIWPVNINNWEWHLVPIVKRIWSPGWAIFSAGWAFWFLAVFYWIIDV 428

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMA 401
              K    P   +GMN++ +Y MA
Sbjct: 429 KGYKKWAFPFVVVGMNSIAMYCMA 452


>gi|397171248|ref|ZP_10494657.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
 gi|396087147|gb|EJI84748.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
          Length = 394

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 116/436 (26%), Positives = 200/436 (45%), Gaps = 89/436 (20%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
           QQ  +     R+ +LD  RGLA+  MILV++ G     +P + HA W+G    D + P F
Sbjct: 7   QQILTQQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66

Query: 81  LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           L +VG+AI  +LA +++  +A+ +++   R LKL   G+ L   + +  D     +  ++
Sbjct: 67  LVMVGMAIPYSLAGRQLLPKAELIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126

Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +R  GVLQRI + Y    ++ +++        + GR          WL   C+L   L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLVIVLYSG-------TRGRIL--------WLSGLCLLYFLL 171

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                 +VP        +D  +YG  F    G+    N   N   ++D ++LG NH+Y  
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHQLLGPNHVYFR 212

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            A          +PF               F+PEG+LS++ +I S + GV     ++ ++
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSQ 247

Query: 317 GHLA-RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             LA +L+  +  G A +      H   A+P+NK L+T ++V ++SG  AL+ +    L 
Sbjct: 248 AELAFKLRVLLLSGLAAVWIAELAH--PALPINKMLWTPTFVLLSSGFTALMLALFLWLT 305

Query: 376 DIWNLKYPFLPLAWIGMNAMLVYVMAAEG--------IFAGFINGWYY--------GDPH 419
           ++   +    PL   G+NA+L +++A           +    +  W Y        GD +
Sbjct: 306 EMKRYRLWTAPLLVFGVNAILFFMLAGVAARVLSMVPVVGTTLGNWLYRSAFQPLFGDYN 365

Query: 420 NTL---VCFLFIISYI 432
            +L   +CFL ++SYI
Sbjct: 366 GSLAYAICFL-LLSYI 380


>gi|217974365|ref|YP_002359116.1| hypothetical protein Sbal223_3208 [Shewanella baltica OS223]
 gi|217499500|gb|ACK47693.1| conserved hypothetical protein [Shewanella baltica OS223]
          Length = 384

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 111/422 (26%), Positives = 173/422 (40%), Gaps = 112/422 (26%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWN 68
           ++  E   +   RL SLD  RG  +            L+IL   AG  W   ++ H+ W+
Sbjct: 6   TNAIEPVKVNKPRLVSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWH 65

Query: 69  GCNLADFVMPFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGF 123
           G    D + P F+F+ GVA+ L+ KR     I +R    +  I R   LL  GIL   G+
Sbjct: 66  GFRFYDLIFPLFIFLSGVALGLSPKRLDKLPISERLPVYRHGIKRLFLLLLLGILYNHGW 125

Query: 124 -SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
            + AP       D   IR   VL RIA ++   +L+                        
Sbjct: 126 GTGAP------ADPEKIRYASVLGRIAFAWFFAALL-----------------------V 156

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNA 239
           WH  +   ++V  L +L G                 YG +        G    L+P  + 
Sbjct: 157 WHTSLRTQIIVA-LGILLG-----------------YGAIQLWLPFPGGQAGVLSPTESI 198

Query: 240 VGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
             Y+D  +L G+++    P                              +PEGLLS++ +
Sbjct: 199 NAYVDSILLPGVSYQGRTP------------------------------DPEGLLSTIPA 228

Query: 299 ILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
           I++ + GV  GH I+  H KG  A++      G  LL FG  L     IP+NK+L+T S+
Sbjct: 229 IVNALTGVFVGHFIVKSHPKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSF 286

Query: 357 VCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAG 408
           V VTSG + ++ +  YALVD+   +        IG NA+++Y+ +        A+ +F G
Sbjct: 287 VLVTSGWSMILLAVFYALVDVLKWQKVAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGG 346

Query: 409 FI 410
            +
Sbjct: 347 LV 348


>gi|317478517|ref|ZP_07937676.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
 gi|316905331|gb|EFV27126.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
          Length = 394

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 96/396 (24%), Positives = 168/396 (42%), Gaps = 99/396 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +DR +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDRAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 201 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    LL+ G+ L F+        P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLLLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             + +  ++D+   K   +     G+N + +YV+  
Sbjct: 293 FLALLIWIIDVKGYKKWSMFFEAFGVNPLFMYVLGG 328


>gi|392555555|ref|ZP_10302692.1| hypothetical protein PundN2_08983 [Pseudoalteromonas undina NCIMB
           2128]
          Length = 359

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 94/375 (25%), Positives = 159/375 (42%), Gaps = 83/375 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            R  +LD  RGL +ALMILV+  G  W  +     HA W+GC   D + PFF+FI+G A+
Sbjct: 2   TRYKALDAMRGLTIALMILVNTPG-SWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAM 60

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             + K+    A A +      L+L+  G ++      A +   +  ++  +R+ GVLQRI
Sbjct: 61  FFSFKKTNSAASASQ-----VLRLVKRGAIIF-AIGLALNIYPFTTNIENLRILGVLQRI 114

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            ++Y+L S+  +F          + R  +  L       +  +LV Y  LL         
Sbjct: 115 GIAYILASICVLF----------LNRRGVLTL-------SVIILVAYWLLL--------- 148

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                           ++ G         N V  +D  VLG +H++              
Sbjct: 149 ----------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGKG---------- 182

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                             F+PEGL+S++ +++S + G     ++  T    A +K+   +
Sbjct: 183 ----------------LAFDPEGLISTLPAVVSVLFGFEVTRLLTSTSCQWASIKR---L 223

Query: 329 GFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
               ++  +    T+ + P+NK L+T S+V  TSG A +V +    L DI   +    PL
Sbjct: 224 LVIGVVAVVIGQLTSMVMPINKSLWTSSFVIYTSGIACIVLAFFVWLCDIVKPERLVNPL 283

Query: 388 AWIGMNAMLVYVMAA 402
              G N + +YV++A
Sbjct: 284 IVYGSNPLFIYVLSA 298


>gi|359438686|ref|ZP_09228688.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
 gi|359445329|ref|ZP_09235071.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
 gi|358026628|dbj|GAA64937.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
 gi|358040838|dbj|GAA71320.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
          Length = 359

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 94/375 (25%), Positives = 159/375 (42%), Gaps = 83/375 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            R  +LD  RGL +ALMILV+  G  W  +     HA W+GC   D + PFF+FI+G A+
Sbjct: 2   TRYKALDAMRGLTIALMILVNTPG-SWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAM 60

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             + K+    A A +      L+L+  G ++      A +   +  ++  +R+ GVLQRI
Sbjct: 61  FFSFKKTNSAASASQ-----VLRLVKRGAIIF-AIGLALNIYPFTTNIENLRILGVLQRI 114

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            ++Y+L S+  +F          + R  +  L       +  +LV Y  LL         
Sbjct: 115 GIAYILASICVLF----------LNRRGVLTL-------SVIILVAYWLLL--------- 148

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                           ++ G         N V  +D  VLG +H++              
Sbjct: 149 ----------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGKG---------- 182

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                             F+PEGL+S++ +++S + G     ++  T    A +K+   +
Sbjct: 183 ----------------LAFDPEGLISTLPAVVSVLFGFEVTRLLTSTSCQWASIKR---L 223

Query: 329 GFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
               ++  +    T+ + P+NK L+T S+V  TSG A +V +    L DI   +    PL
Sbjct: 224 LVIGVVAVVIGQLTSMVMPINKSLWTSSFVIYTSGIACIVLAFFVWLCDIVKPERLVNPL 283

Query: 388 AWIGMNAMLVYVMAA 402
              G N + +YV++A
Sbjct: 284 IVYGSNPLFIYVLSA 298


>gi|352080530|ref|ZP_08951469.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
 gi|351683811|gb|EHA66887.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
          Length = 353

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 147/356 (41%), Gaps = 100/356 (28%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RLASLD  RG  VA M+LV+  G DW  +     HA W+GC   D V PFFLF+VGV++
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPG-DWGHVYWPLEHAAWHGCTPTDLVFPFFLFVVGVSV 60

Query: 89  ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ALA L R+   A      +   +R L++L  G+ +            + +    +R  GV
Sbjct: 61  ALAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGV 113

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YGTY 203
           LQRIAL +  V+L  I TK                     W   A +L+ Y  LL  G  
Sbjct: 114 LQRIALCFAGVALFAIHTKPRT-----------------QWWAIAALLIGYWGLLRLGGS 156

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +  W       DSA +G+                    Y+     G  H           
Sbjct: 157 LEPWTNLASRVDSAVFGRFV------------------YLIDPASGRGH----------- 187

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                   +PEGLLS++ S+  T++G+  G  +        R +
Sbjct: 188 ------------------------DPEGLLSTLPSLAGTLLGLRMGCWL--------RRE 215

Query: 324 QWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           Q+ T+   G A L+ G    ++  +P NK L+T S+V  T+G A L   A + L+D
Sbjct: 216 QFRTLLLAGIACLLLGAL--WSPWLPFNKNLWTPSFVLWTTGWATLALLAFHVLID 269


>gi|297300348|ref|XP_001115683.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Macaca mulatta]
          Length = 547

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 83/294 (28%), Positives = 138/294 (46%), Gaps = 49/294 (16%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG-----CNLADFVMPFFLFIVGVAI 88
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG     C +  F M  F+FI+G +I
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGTLPMQCGICIFAM-MFVFIMGSSI 325

Query: 89  ALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            L++  I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GV
Sbjct: 326 FLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGV 380

Query: 145 LQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           LQR+ ++Y +V+++E +F K V +    ++S             WL+   +  ++L L +
Sbjct: 381 LQRLGVTYFVVAVLELLFAKPVPEHCALERSCLSLRDITSSWPQWLLILALEGLWLGLTF 440

Query: 201 GTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
              VP      +      D+GK  N T G          A GYIDR +LG +H+Y HP+ 
Sbjct: 441 LLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS- 489

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH-FGHVI 312
                                      ++PEG+L +++SI+   +GV  F H I
Sbjct: 490 -----------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQVFVHFI 526


>gi|66808259|ref|XP_637852.1| transmembrane protein [Dictyostelium discoideum AX4]
 gi|60466271|gb|EAL64333.1| transmembrane protein [Dictyostelium discoideum AX4]
          Length = 675

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 54/138 (39%), Positives = 86/138 (62%), Gaps = 11/138 (7%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           E+ + K  RL SLD+FRG ++ +MI V++ GG +   +H+ WNG  +AD V P+F+FI+G
Sbjct: 198 ERENRKKDRLRSLDVFRGFSITIMIFVNYGGGGYWFFNHSLWNGLTVADLVFPWFVFIMG 257

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGILL--QGGFSHAPDELTYGVDVRMIRLCG 143
           +A+ L+   +  R    K++IF+  KLL   I+L   G F      +  GVD++  R+ G
Sbjct: 258 IAMPLSFHAMEKRGTP-KRIIFQ--KLLRRSIILFALGLF------INNGVDLQQWRILG 308

Query: 144 VLQRIALSYLLVSLVEIF 161
           VLQR ++SYL+V  + +F
Sbjct: 309 VLQRFSISYLVVGSIMLF 326



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 61/249 (24%), Positives = 105/249 (42%), Gaps = 43/249 (17%)

Query: 159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-A 217
            I+   +Q +      FS    Y   W+ A  +   +  L++   VP      +     A
Sbjct: 410 NIYMSMIQHETVISKYFSDIAPYWIQWVFALIIFSGWFLLMFLVPVPGCPTGYLGAGGLA 469

Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
           D G+  + T G          A   ID K+    H++ +P       C +          
Sbjct: 470 DQGRYQHCTGG----------AARLIDLKIFTEAHIFQNPT------CLE--------VY 505

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFAL 332
             PS     ++PEG +  ++SI    IGV  G +I+  K + +RL +W     V  G A 
Sbjct: 506 KTPS-----YDPEGTVGYLTSIFLCFIGVQAGRIILTYKSNRSRLIRWMVWSVVLCGIAA 560

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD---IWNLKYPFLPLAW 389
            + GLT +    +P+NK L++ S++ + +G    V + ++ L+D   IWN      P  +
Sbjct: 561 GLCGLTQN-QGWLPVNKNLWSPSFILLMAGFGFFVLTVMFILIDIKKIWNGS----PFIY 615

Query: 390 IGMNAMLVY 398
           +GMN + +Y
Sbjct: 616 VGMNPITIY 624


>gi|418676277|ref|ZP_13237561.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|418684894|ref|ZP_13246077.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|418743212|ref|ZP_13299580.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
 gi|400323423|gb|EJO71273.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|410740642|gb|EKQ85357.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|410749503|gb|EKR06488.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
          Length = 369

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 106/389 (27%), Positives = 168/389 (43%), Gaps = 90/389 (23%)

Query: 38  LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-- 92
           +D+FRG+ VA MILV++ G     +  + HA WNGC   D V PFFLF VG++I L++  
Sbjct: 1   MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K    ++     +  R++ L+  G+ L      +  EL         R+ GVLQRI   Y
Sbjct: 61  KNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL---------RIPGVLQRIGFVY 111

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
            +V+ + +                  R+    W+    +L+V+  +L     P       
Sbjct: 112 WIVASLHLILPK--------------RMILISWI---PILLVHTWVLIQIPAPG------ 148

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
             +S  Y             L P  +   +IDR V G N +     W+ SK         
Sbjct: 149 --ESIVY-------------LEPGKDIGAWIDRNVFGENRL-----WKFSKT-------- 180

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
                         ++PEGL S +SSI ++++GV  G  I+ +K +  + +     GF +
Sbjct: 181 --------------WDPEGLFSGISSIATSLLGVFCGS-ILSSKTNEIKKQILSIFGFGI 225

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL-----VDIWNL---KYPF 384
           L+  + L +   +P+NK L+T SYV  T+G A L       L     +  W+    +  F
Sbjct: 226 LLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLNFLLRIKKWDQLQSEIIF 285

Query: 385 LPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            P    G NA+LV+V    G+ A  +N W
Sbjct: 286 QPFLVFGKNAILVFV--GSGLLARTLNLW 312


>gi|418023168|ref|ZP_12662153.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
 gi|353537051|gb|EHC06608.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
          Length = 384

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 105/405 (25%), Positives = 170/405 (41%), Gaps = 102/405 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G N  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFNFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +L+                        WH  +   ++V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G    L+P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +PEGLLS++ +I++ + GV  GH ++  
Sbjct: 216 TP------------------------------DPEGLLSTIPAIVNALTGVFVGHFLVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H KG  A++      G  LL FG  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYA 303

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
           LVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 304 LVDVLKWQKSAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGGLV 348


>gi|398348299|ref|ZP_10533002.1| hypothetical protein Lbro5_13954 [Leptospira broomii str. 5399]
          Length = 399

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 115/390 (29%), Positives = 173/390 (44%), Gaps = 94/390 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG----DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG  VA MILV++ G      WP + HA W GC   D V PFFLF VGV+I
Sbjct: 33  KRLLSIDALRGFTVAGMILVNNPGSWSAIYWP-LKHAKWFGCTPTDLVFPFFLFSVGVSI 91

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
                  P  +       F+ LK     IL+ G F H   E +    +  +R+ GVLQRI
Sbjct: 92  -------PFSSIGNGGTFFKILKRASILILI-GLFLHWFGEWS----IDQLRIPGVLQRI 139

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            L Y + ++                    +R   +H  +  C+ +++   +   +VP   
Sbjct: 140 GLVYFISAIA-------------------YRSSNFHARILICLSILFGYWILLEFVPP-- 178

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                 DS                L+P  +   ++DR V G NH+     W+ SK     
Sbjct: 179 ---PGSDSVS--------------LSPGKDWGAWLDRIVFGENHL-----WKSSKT---- 212

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH---TKGHLARLK-Q 324
                             ++PEGLLSS+S++ +T +G  FG V+     TK ++ +    
Sbjct: 213 ------------------WDPEGLLSSLSAVATTFLGFFFGEVLKKDSDTKKNIQKTAFN 254

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
           ++     +++ G   H     P+NK L+T S+V  T G AAL+ +    L  I ++K   
Sbjct: 255 FILAAIVIMVAGWIWH--QFFPMNKSLWTSSFVLWTGGLAALLLALFLLLESI-SIKSKD 311

Query: 385 LPLA-WI--GMNAMLVYVMAAEGIFAGFIN 411
           L LA WI  G NA+LV+   A GI+A  +N
Sbjct: 312 LLLAPWIPFGRNAILVFF--ASGIWARTLN 339


>gi|270295536|ref|ZP_06201737.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274783|gb|EFA20644.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 394

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 167/396 (42%), Gaps = 99/396 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +DR +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDRAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 201 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    L + G+ L F+        P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             + +  ++D+   K   +     G+N + +YV+  
Sbjct: 293 FLALLIWIIDVKGYKKWSMFFEAFGVNPLFMYVLGG 328


>gi|114048505|ref|YP_739055.1| hypothetical protein Shewmr7_3014 [Shewanella sp. MR-7]
 gi|113889947|gb|ABI43998.1| conserved hypothetical protein [Shewanella sp. MR-7]
          Length = 395

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 111/434 (25%), Positives = 174/434 (40%), Gaps = 106/434 (24%)

Query: 7   ETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHA 55
            TT    +  +  +  +   K      RL SLD  RG  +           AL+IL   A
Sbjct: 2   STTAPESVANTGVNAQNAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWA 61

Query: 56  GGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFR 108
           G  W   ++ H+ WNG    D + P F+F+ GVA+ L+ KR+      +R    +  I R
Sbjct: 62  GWQWGDTQMHHSEWNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKR 121

Query: 109 TLKLLFWGILLQGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
              LL  GIL   G+ + AP      VD   +R   VL RIA ++   +L+         
Sbjct: 122 LFLLLLLGILYNHGWGTGAP------VDPEKVRYASVLGRIAFAWFFAALL--------- 166

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
                          WH  +   VLV    L+    V  W                    
Sbjct: 167 --------------VWHTSLRTQVLVALGILVAYGAVQLW---------------LPFPG 197

Query: 228 GVRAKLNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           G   +L+P  +   Y+D  +L G+++    P                             
Sbjct: 198 GQAGELSPTESINAYVDSLLLPGVSYQGRTP----------------------------- 228

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNA 344
            +PEG+LS++ ++++ + GV  GH I+  H KG  A++      G   L  G  L     
Sbjct: 229 -DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLD--GV 285

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--- 401
           IP+NK+L+T S+V VTSG + L+ +  YA+VD+   +        IG NA+++Y+ +   
Sbjct: 286 IPVNKELWTSSFVLVTSGWSMLLLALFYAIVDVLKWQKLAFIFVVIGTNAIIIYLASSLV 345

Query: 402 -----AEGIFAGFI 410
                A+ +F G I
Sbjct: 346 DWKYIAQSVFGGVI 359


>gi|320105553|ref|YP_004181143.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
 gi|319924074|gb|ADV81149.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
          Length = 412

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 154/363 (42%), Gaps = 76/363 (20%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPF 79
           +D   ++  K  RL SLD+ RG+ +  MILV++  G+     + HA WNG    D V P 
Sbjct: 18  TDSAARTTHKPARLLSLDVLRGVTIGFMILVNNQTGEGAFFPLQHAKWNGFTPTDLVFPT 77

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGG-FSHAPDELTYGV 134
           FL +VG++  L+ +    R  A   +   TL+    L  +G+++    F H         
Sbjct: 78  FLLLVGLSTVLSTEARLARGVAKSTIFLHTLQRSAVLFLFGLIVNNAPFFH--------- 128

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            ++ +R+ GVL RIA+ Y +V  + +  +D++            R +      AAC LV 
Sbjct: 129 -LQTLRVYGVLPRIAVCYFIVGSLYLLVRDLKQ-----------RAFILAAAAAAC-LVG 175

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y AL+    +P                 F          +P  N V YIDR +   +H+Y
Sbjct: 176 YWALMRFIPIPG----------------FGTPTHEIPINDPDGNLVAYIDRHIFSASHLY 219

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                      T+D                    PEGLLS++ ++ + + G+  G  +  
Sbjct: 220 EK---------TRD--------------------PEGLLSTIPAVATALFGILAGIWLRT 250

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           ++  + + K     G + LI G   H   A P+NK+L+T S+     G + L+ +    L
Sbjct: 251 SRSTMQKAKGIEYAGISFLILGGAWHL--AFPINKKLWTSSFSLWAGGWSLLLLALFIYL 308

Query: 375 VDI 377
           +DI
Sbjct: 309 IDI 311


>gi|160887858|ref|ZP_02068861.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
 gi|156862688|gb|EDO56119.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
          Length = 394

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 167/396 (42%), Gaps = 99/396 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +DR +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDRAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 201 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    L + G+ L F+        P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             + +  ++D+   K   +     G+N + +YV+  
Sbjct: 293 FLALLIWIIDVKGYKKWSMFFEAFGVNPLFMYVLGG 328


>gi|393763917|ref|ZP_10352530.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
 gi|392605231|gb|EIW88129.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
          Length = 394

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 118/436 (27%), Positives = 197/436 (45%), Gaps = 89/436 (20%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
           QQ  +     R+ +LD  RGLA+  MILV++ G     +P + HA W+G    D + P F
Sbjct: 7   QQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66

Query: 81  LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           L +VG+AI  +LA +++  +A+ +++   R LKL   G+ L   + +  D     +  ++
Sbjct: 67  LVMVGMAIPYSLAGRQMLPKAELLRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126

Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +R  GVLQRI + Y    L+ +++        + GR          WL   C+L   L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLLIVLYSG-------TRGRVL--------WLSGLCLLYFLL 171

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                 +VP        +D  +YG  F    G+    N   N   ++D  VLG NH++  
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHHVLGPNHVFFR 212

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            A          +PF               F+PEG+LS++ +I S + GV     ++ +K
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSK 247

Query: 317 GHLA-RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             LA +L+     G A +      H   A+P+NK L+T ++V ++SG  AL+      L 
Sbjct: 248 AELAFKLRVLFLAGLAAIWVAELAH--PALPINKMLWTPTFVLLSSGFTALILGLFLWLT 305

Query: 376 DIWNLKYPFLPLAWIGMNAMLVYVMAAEG--------IFAGFINGWYY--------GDPH 419
           ++   +    PL   G+NA+L +++A           +    +  W Y        GD +
Sbjct: 306 EMKRYRLWTAPLLVFGVNAILFFMLAGVAARVLSMVPVAGTTLGNWLYRSAFQPLFGDYN 365

Query: 420 NTL---VCFLFIISYI 432
            +L   +CFL ++SYI
Sbjct: 366 GSLAYAICFL-LLSYI 380


>gi|301625227|ref|XP_002941812.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Xenopus (Silurana) tropicalis]
          Length = 370

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/155 (36%), Positives = 88/155 (56%), Gaps = 13/155 (8%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           SE +  +Q +    +++RL SLD FRG ++ +M+ V++ GG +    HAPWNG  +AD V
Sbjct: 190 SEDNCGEQSKVP--ESRRLYSLDTFRGFSLTIMVFVNYGGGGYWFFEHAPWNGLTVADLV 247

Query: 77  MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPDELT 131
           MP+F+FI+G ++ALA    LKR   R   + K+ +RT  L   G+  L  G +  P    
Sbjct: 248 MPWFVFIIGTSVALAFNAMLKRGLSRCQLLYKLTWRTCILFAIGVFFLNYGPADGP---- 303

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
             +  R  R+ GVLQR+  +Y +++L+      VQ
Sbjct: 304 --LSWRWARIPGVLQRLGFTYFVIALLHTCFHKVQ 336


>gi|315500593|ref|YP_004089395.1| hypothetical protein Astex_3616 [Asticcacaulis excentricus CB 48]
 gi|315418605|gb|ADU15244.1| protein of unknown function DUF1624 [Asticcacaulis excentricus CB
           48]
          Length = 372

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 166/377 (44%), Gaps = 74/377 (19%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDH--AGGD-WPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +   R  +LD+FRGL V +MI+V+   AG + + ++ HA W G  L D V P FLF +G 
Sbjct: 1   MSAARYTALDVFRGLTVCVMIVVNTSPAGAEPFAQLQHAQWFGFTLTDLVFPSFLFAIGN 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGV-DVRMIRLCGV 144
           ++  A ++     + + KV+ R+  +   G L+    F H   +  +   D+   R+ GV
Sbjct: 61  SMVFAFRKPLPHKEFLLKVLRRSALIFLLGYLMYWFPFVHQTTDGAWAFNDIGHTRIMGV 120

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL YL  SL          +  SV    I         ++A +L  Y  LLY  + 
Sbjct: 121 LQRIALCYLFASLAA--------RYLSVRGLVI---------LSALLLFGYWGLLY-AFT 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           P          +AD      +T  + AK          ID+ VLG++HMY   A      
Sbjct: 163 P----------AAD---ALTMTGNLGAK----------IDQFVLGLDHMYRGGA------ 193

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 +EPEGLLS++ +I++ + G   G +I+ ++    R   
Sbjct: 194 --------------------KGYEPEGLLSTLPAIVNVLAGYLCGRLILDSEDR--RRTV 231

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
            +  G  L +    L ++   P +K+L+T S+  +T G   L+ S I A VD+   +   
Sbjct: 232 MILSGAGLALVAAALVWSFGFPFSKRLWTSSFAVLTIGLDCLILSGIIAYVDLAERREGL 291

Query: 385 LPLAWIGMNAMLVYVMA 401
                 G N +++Y+ +
Sbjct: 292 GFFETFGRNPLVIYLFS 308


>gi|196233857|ref|ZP_03132695.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
 gi|196222051|gb|EDY16583.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
          Length = 437

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 124/457 (27%), Positives = 186/457 (40%), Gaps = 81/457 (17%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGL-------AVALMILVDH-------AGGDWPE 61
           I  P+V    E       RL SLD  RG        A A++  +D        AG  W +
Sbjct: 27  IPSPNVPVSPETGQ-PAGRLVSLDALRGFDMFWIVGAGAVIQSLDKMCRTPFTAGLAW-Q 84

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGIL 118
             H  W G +  D + P FLFI+G++I  +L +      +A  + +V  R++ L   G+L
Sbjct: 85  FKHVHWKGLHCYDVIFPLFLFIIGISIVFSLDKALATGGKAQVLTRVARRSVLLFALGVL 144

Query: 119 LQGGFSHA-PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
             GGF    P+          ++L GVL RIAL YL  +L+  F +  +           
Sbjct: 145 YYGGFMKPWPN----------VQLGGVLPRIALCYLAAALIYTFIRSTRGLLA------- 187

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
                     AA +L+ Y ALL     PD Q            K        R   + P 
Sbjct: 188 ---------AAAALLIGYWALLAFVPFPDLQLR----------KPVVEEIAERIGSDSPA 228

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
                +  +V G+   Y      R+ A   D  +  P RK         F  EGLLS++ 
Sbjct: 229 AIAAAVPERVHGLYEEY------RNLANYVDFLYM-PGRK-----AQFYFINEGLLSTIP 276

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
           SI  ++ G   G ++ + K    R   W V  G A ++ G    +   +PL K+++T S+
Sbjct: 277 SIALSLFGAVAGLLLKNQKVLPRRKIAWLVGAGVAFIVLGRV--WAIDLPLIKRIWTSSF 334

Query: 357 VCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAG 408
           + V +G +AL+ +  Y +VD+   +    P  WIG NA+ VYV A        AE    G
Sbjct: 335 ILVATGWSALMLALFYYIVDVKQWRKWCQPFIWIGCNALTVYVAAQVLNFDTMAERFAGG 394

Query: 409 FINGWYYGDPHNTLVCFLFIISYILHSFLWELRKFLY 445
            +  ++  D H    C   +I  I  S      +FLY
Sbjct: 395 DVQDFF--DTHIASGCGDLVIDLISLSIAIFFARFLY 429


>gi|322436289|ref|YP_004218501.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
 gi|321164016|gb|ADW69721.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
          Length = 389

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 108/407 (26%), Positives = 168/407 (41%), Gaps = 102/407 (25%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNL 72
           ++E  + D Q  S     RL S+D+ RGL +  MILV+ AG +   +  + HA WNG   
Sbjct: 1   MTEQALGDIQRPS-----RLLSIDLLRGLTIGFMILVNDAGSERDAYAPLQHAWWNGFTP 55

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            D V P FLF+VG+   L+L    DR   V ++       LFW +L +    +    L  
Sbjct: 56  TDLVFPTFLFLVGITTVLSLGSRMDR--NVPRMT------LFWSVLRRAVLIYVVGILAS 107

Query: 133 G---VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                 +  +R  GVL RIAL YL+V  + + +K  +DK                 ++AA
Sbjct: 108 TFPFTHLAGMRFVGVLPRIALCYLIVGSLLLISKSWKDKVV---------------ILAA 152

Query: 190 CVLVVYLALLY-----GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           C L+ Y ALL      G  VP     ++++D                      N   ++D
Sbjct: 153 C-LIGYWALLRFVPVPGYGVPTHDVPLLDRDG---------------------NLAAWLD 190

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +    H+Y           T+D                    PEGLLS++ ++ + ++
Sbjct: 191 RWMFAPQHLYER---------TRD--------------------PEGLLSTIPAVGTALL 221

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+     +     H  R K     G A +   L L +   +P+NK+++T SYV    G +
Sbjct: 222 GLL--TGLFLRSQHALRTKIMGIAGAATVSILLGLLWNITLPINKKMWTSSYVLFAGGLS 279

Query: 365 ALVFSAIYALVDIWNLKYP----------FLPLAWIGMNAMLVYVMA 401
            L+ +A   L+DI   +            F P    G NA+  YV+A
Sbjct: 280 MLLLAACMTLIDIPAERESKLQRSARSRFFTPFLVFGTNAIAAYVLA 326


>gi|152999681|ref|YP_001365362.1| hypothetical protein Shew185_1147 [Shewanella baltica OS185]
 gi|151364299|gb|ABS07299.1| conserved hypothetical protein [Shewanella baltica OS185]
          Length = 384

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 105/405 (25%), Positives = 169/405 (41%), Gaps = 102/405 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G    D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +L+                        WH  +   ++V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G    L+P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +PEGLLS++ +I++ + GV  GH I+  
Sbjct: 216 TP------------------------------DPEGLLSTIPAIVNALTGVFVGHFIVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H KG  A++      G  LL FG  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYA 303

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
           LVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 304 LVDVLKWQKSAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGGLV 348


>gi|16552925|dbj|BAB71412.1| unnamed protein product [Homo sapiens]
          Length = 367

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 82/272 (30%), Positives = 129/272 (47%), Gaps = 41/272 (15%)

Query: 139 IRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
           +R+ GVLQR+ ++Y +V+++E+ F K V +   S       R     W  WL+   +  +
Sbjct: 75  VRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGL 134

Query: 195 YLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           +L L +   VP      +      D+GK  N T G          A GYIDR +LG +H+
Sbjct: 135 WLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHL 184

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y HP    S A    +                 ++PEG+L +++SI+   +GV  G +++
Sbjct: 185 YQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILL 226

Query: 314 H----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALV 367
           +    TK  L R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +
Sbjct: 227 YYKARTKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFI 285

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
              +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 286 LLVLYPVVDVKGL-WTGTPFFYPGMNSILVYV 316


>gi|398343267|ref|ZP_10527970.1| hypothetical protein LinasL1_09415 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 399

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 112/401 (27%), Positives = 166/401 (41%), Gaps = 92/401 (22%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFV 76
           V  +       ++RL S+D  RG  VA MILV++ G  W  I     HA W GC   D V
Sbjct: 21  VKQELLNDSFASKRLLSIDALRGFTVAGMILVNNPG-SWSAIYSPLRHAKWFGCTPTDLV 79

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
            PFFLF VGV+I  +            K++ R   L+F G+ L      + D L      
Sbjct: 80  FPFFLFSVGVSIPFS---TIGNGGTFFKILKRASILIFIGLFLHWFGEWSMDRL------ 130

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
              R+ GVLQRI L Y + ++                R + FR      ++   +L  Y 
Sbjct: 131 ---RIPGVLQRIGLVYFISAIAY--------------RSASFRTRI---MICISILFGYW 170

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
            LL     P                           L+P  +   ++DR V G NH+   
Sbjct: 171 ILLEFAPPPG---------------------AGSPSLSPGKDWGAWLDRIVFGENHL--- 206

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-- 314
             W+ SK                       ++PEGLL S+S++ +T +G+ FG V+    
Sbjct: 207 --WKSSKT----------------------WDPEGLLGSLSAVATTFLGIFFGEVLKKDS 242

Query: 315 -TKGHLARLK-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
            TKG++ +    ++     L++ G   H     P+NK L+T S+V  T G+AAL+ S   
Sbjct: 243 DTKGNIQKTAFTFILAAIVLMLVGWIWH--QFFPMNKSLWTSSFVLWTGGSAALLLSLFL 300

Query: 373 ALVDIWNLKYPFLPLAWI--GMNAMLVYVMAAEGIFAGFIN 411
            L          L   WI  G NA+LV+ ++  GI+A  +N
Sbjct: 301 LLESSSMKSKDLLFSPWIPFGRNAILVFFVS--GIWARTLN 339


>gi|323447301|gb|EGB03229.1| hypothetical protein AURANDRAFT_68196 [Aureococcus anophagefferens]
          Length = 399

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/137 (41%), Positives = 79/137 (57%), Gaps = 12/137 (8%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
            + +S  +  R+ SLD+ RG AV LMI VD AG  +  + H+PW+G  +AD VMPFF+F+
Sbjct: 4   NEPESARRPPRVRSLDVVRGFAVLLMIFVDDAGSAYAVLDHSPWDGLTIADVVMPFFIFM 63

Query: 84  VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY--GVDVRMI 139
           VGV+ ALAL  KR       +  V+ R   L  W + +       PD  TY  G D+  +
Sbjct: 64  VGVSAALALGGKRT------LAPVLRRGATL--WVVGVAVQGGGLPDPTTYAWGYDLGTV 115

Query: 140 RLCGVLQRIALSYLLVS 156
           R CG+LQRIA  Y++ S
Sbjct: 116 RWCGILQRIAACYVVAS 132



 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 66/160 (41%), Gaps = 35/160 (21%)

Query: 282 WCHAP--FEPEGLLSSVSSILSTIIGVHFGHVIIHTKG-HLARLKQWVTMGFALLIFG-- 336
           WC AP   +PEG LS V ++ + +IG  F   +   +     R       G+A L+ G  
Sbjct: 167 WCAAPGVADPEGFLSGVLAVATAVIGALFARCLERCRAPRPGRDSDDRGGGYARLVDGDG 226

Query: 337 -----LTLHFTN---------------AIPLNKQLYTLSYVCVTSGAAALVFSAIYALV- 375
                L LH+                   P+NKQL+T SY   T+       +A  AL+ 
Sbjct: 227 DRRGALMLHWALASLALAAAALVAVAAGSPVNKQLWTPSYCLATAALCGFALTAAVALLG 286

Query: 376 DIWN---------LKYPFLPLAWIGMNAMLVYVMAAEGIF 406
           D+ +          +    PL   G NA+L++V+ A G+ 
Sbjct: 287 DLADGGDDAKFRAARALAEPLRRAGRNALLLFVLGASGVL 326


>gi|225010297|ref|ZP_03700769.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
 gi|225005776|gb|EEG43726.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
          Length = 363

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 113/420 (26%), Positives = 181/420 (43%), Gaps = 90/420 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ S+DIFRG  + LMILV+  G  W  +     HA W+G  L D V PFF+FIVGV+
Sbjct: 4   NKRVPSVDIFRGATLLLMILVNTPG-TWSAVYAPFLHASWHGYTLTDLVFPFFIFIVGVS 62

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           I+L+ K          K+  R+LKL+  G+ L G F+ +     +   V  IR  GVLQR
Sbjct: 63  ISLSYKDKKLNGPVFFKLTKRSLKLIGLGLFL-GAFTIS---FPFIKAVENIRFPGVLQR 118

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I L +   S++ ++      K  +     I  L  W W+                     
Sbjct: 119 IGLVFFFASIIYLW----GSKRSTALIIGIILLAYWLWM--------------------- 153

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
            F  ++  +  Y +  N             N   +ID KVLG +HM     W+       
Sbjct: 154 GFLPLDGVAPTYERAAN-------------NWANFIDFKVLG-SHM-----WKPD----- 189

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEG+LS++ +I + ++G   G V+          K  + 
Sbjct: 190 -------------------YDPEGILSTLPAIATALLGTLAGDVLRSNTYQ----KVSLL 226

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL 387
           +   L++ GL      + P+NK L++ S+V VT+G A LV   ++ L ++   ++  +  
Sbjct: 227 LISGLVLLGLGHLLDLSFPINKALWSSSFVMVTAGWANLVLGILFYLREVKKFRFGEV-F 285

Query: 388 AWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHN--TLVCFLFIISYILHSFLWELRKFLY 445
           + +G N + +Y  A+      FI   +Y  P N  ++  FLF   Y+      EL   LY
Sbjct: 286 SKVGANGIAIYFTAS------FITKLFYLIPVNGSSIHGFLFENIYLNKGLAPELSSLLY 339


>gi|116327439|ref|YP_797159.1| hypothetical protein LBL_0655 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|116120183|gb|ABJ78226.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
          Length = 369

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 115/396 (29%), Positives = 166/396 (41%), Gaps = 104/396 (26%)

Query: 38  LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-- 92
           +D+FRG+ V  MILV++ G     +  + HA WNGC   D V PFFLF VG +I ++L  
Sbjct: 1   MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYS 60

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K   +R      +  R + L     +L G F +   E T+      +R+ GVLQRI   Y
Sbjct: 61  KNGINRIRIWIGICIRGISL-----ILLGLFLNFFGEWTF----SELRIPGVLQRIGFVY 111

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
            +V+ +                F +F            VLV  + +L    V  W  T I
Sbjct: 112 WVVATL----------------FLVFP--------GKKVLVFLIPIL---LVHTWILTHI 144

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
                               L    +   +IDR + G  H+     W+ SK         
Sbjct: 145 APPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL-----WKFSKT-------- 180

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
                         ++PEG LS ++SI +++ GV  G ++   +G   R K  V     L
Sbjct: 181 --------------WDPEGFLSGIASIATSLFGVICGFILFRREG---RGKNRV-----L 218

Query: 333 LIFGLTLHFT-------NAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALVDI--WN- 379
            IFGL   FT        ++P+NK L+T SY   T+G A L    F  + +L+ +  WN 
Sbjct: 219 SIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGFAFLCIGFFEYLDSLILLKKWNG 278

Query: 380 --LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
             LK  F P    G NA+LV+V    GI A  +N W
Sbjct: 279 LDLKIFFQPFFVFGKNAILVFV--GSGILARTLNFW 312


>gi|389799428|ref|ZP_10202419.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter sp. 116-2]
 gi|388442725|gb|EIL98904.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter sp. 116-2]
          Length = 353

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 147/356 (41%), Gaps = 100/356 (28%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RLASLD  RG  VA M+LV+  G DW  +     HA W+GC   D V  FFLF+VGV++
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPG-DWGHVYWPLEHAAWHGCTPTDLVFSFFLFVVGVSV 60

Query: 89  ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ALA L R+   A      +   +R L++L  G+ +            + +    +R  GV
Sbjct: 61  ALAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGV 113

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YGTY 203
           LQRIAL +  V+L  I TK                     W   A +L+ Y  LL  G  
Sbjct: 114 LQRIALCFAGVALFAIHTKPRT-----------------QWWAIAALLIGYWGLLRLGGS 156

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +  W       DSA +G+                    Y+     G  H           
Sbjct: 157 LEPWTNLASRVDSAVFGRFV------------------YLIDPASGRGH----------- 187

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                   +PEGLLS++ S+  T++G+  G  +        R +
Sbjct: 188 ------------------------DPEGLLSTLPSLAGTLLGLRMGCWL--------RRE 215

Query: 324 QWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           Q+ T+   G A L+ G    ++  +P NK L+T S+V  T+G A L   A++ L+D
Sbjct: 216 QFRTLLLAGIACLLLGAL--WSPWLPFNKNLWTPSFVLWTTGWATLALLALHVLID 269


>gi|436833933|ref|YP_007319149.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
 gi|384065346|emb|CCG98556.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
          Length = 361

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 151/367 (41%), Gaps = 89/367 (24%)

Query: 49  MILVDHAGGDWP----EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-----IPDRA 99
           MILV++AG DW      + HAPWNG    D + PFFLFIVGV+I  AL +     + D  
Sbjct: 1   MILVNNAG-DWAHSYAPLKHAPWNGWTPTDLIFPFFLFIVGVSITFALSKRQTSLLEDEK 59

Query: 100 DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE 159
               K+I R + L   G  L          L    D   +R+ GVLQRI + Y + +LV 
Sbjct: 60  TQRLKIIRRGVTLFALGFFLN---------LFPRFDFANVRIMGVLQRIGIVYTVCALVF 110

Query: 160 IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY 219
           + T   Q  +                 +   +L+ Y  L+    VP   +          
Sbjct: 111 LRTSPRQQVN-----------------LILLILIGYFLLMTMVPVPGIGY---------- 143

Query: 220 GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
                      A L P  N   +IDR +L   H Y     R SK                
Sbjct: 144 -----------ANLEPETNLAAWIDRTILTPAHCY-----RSSKV--------------- 172

Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQ---WVTMGFALLIF 335
                  ++PEGLLS+V +I + ++G+  G  +  T+ G   R  Q   ++ +   L+ F
Sbjct: 173 -------WDPEGLLSTVPAIATGLLGLLAGRWLRSTRYGTTVRESQKALFLFLAGLLMAF 225

Query: 336 GLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAM 395
             TL F    P+NK L+T SYV +  G A    +  Y L+D+         L   G+NA+
Sbjct: 226 VGTL-FDTVFPINKALWTSSYVLLAGGLAMCGLAIFYYLIDVRRAFQLSGLLVAFGVNAI 284

Query: 396 LVYVMAA 402
            V+ ++ 
Sbjct: 285 TVFFLSG 291


>gi|113971267|ref|YP_735060.1| hypothetical protein Shewmr4_2932 [Shewanella sp. MR-4]
 gi|113885951|gb|ABI40003.1| conserved hypothetical protein [Shewanella sp. MR-4]
          Length = 395

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 109/416 (26%), Positives = 171/416 (41%), Gaps = 96/416 (23%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAP 66
           +  D   K      RL SLD  RG  +           AL+IL   AG  W   ++ H+ 
Sbjct: 15  NAQDAAAKKSQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHHSE 74

Query: 67  WNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSH 125
           WNG    D + P F+F+ GVA+ L+ KR+       +  ++R  +K LF  +LL   ++H
Sbjct: 75  WNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKRLFLLLLLGILYNH 134

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                    D   +R   VL RIA ++   +L+   T                R      
Sbjct: 135 GWGT-GVPADPEKVRYASVLGRIAFAWFFAALLVWHTS--------------LRTQV--- 176

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           L+A  +LV Y A+      P  Q                   GV   L+P  +   Y+D 
Sbjct: 177 LVALGILVAYGAMQLWLPFPGGQ------------------AGV---LSPTESINAYVDS 215

Query: 246 KVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
            +L G+++    P                              +PEG+LS++ ++++ + 
Sbjct: 216 LLLPGVSYQGRTP------------------------------DPEGVLSTLPAVVNALA 245

Query: 305 GVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
           GV  GH I+  H KG  A++      G   L  G  L     IP+NK+L+T S+V VTSG
Sbjct: 246 GVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLD--GVIPVNKELWTSSFVLVTSG 303

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
            + L+ +  YALVD+   +        IG NA+++Y+ +        A+ +F G I
Sbjct: 304 WSMLLLALFYALVDVLKWQKLTFIFVVIGTNAIIIYLASSLVDWKYIAQSVFGGVI 359


>gi|445498183|ref|ZP_21465038.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
 gi|444788178|gb|ELX09726.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
          Length = 370

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 155/376 (41%), Gaps = 81/376 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QR  ++D+ RGL VALMI+V+  G     +    HA W+G  L D V P F+F+VG A++
Sbjct: 6   QRSQAIDVLRGLTVALMIMVNMPGTPATTYAPFLHAEWHGLTLTDLVFPTFMFVVGTALS 65

Query: 90  LALKRIPDRADA--VKKVIFRTLKLLFWGILL--QGGFSHAPDELTYGVDVRMIRLCGVL 145
             L++     +A  +KK+  RT  +   G L+     FS     LT  + +   R+ GVL
Sbjct: 66  FTLEKYEGMGEAAVLKKIFTRTALIFLCGFLMYWYPFFSTDGGSLTV-LPLSGTRIFGVL 124

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL Y   SL+  + +   +K   V               A   L+ Y  ++YG    
Sbjct: 125 QRIALGYCAGSLILHYWR---EKGALV--------------FAVLALLGYWTVMYG---- 163

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                       DY                  NA   +D  VLG  HMYH          
Sbjct: 164 ----------FGDY--------------TLAGNAQRKLDLLVLGEAHMYHG--------- 190

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG             F+PEG+LS++ SI++ + G   G ++         + + 
Sbjct: 191 ------EG-----------IAFDPEGILSTLPSIVNVLAGYFAGRLVRRLGASYETVAKL 233

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
              G  L +  L L +++  PLNK+L+T SY  +T        S +  ++D+   +    
Sbjct: 234 AMSGAVLTV--LALCWSSVFPLNKKLWTSSYTLITIAIDLFTLSLLLYVIDMLGKRGWTY 291

Query: 386 PLAWIGMNAMLVYVMA 401
                G N + +Y+ +
Sbjct: 292 FFEVFGRNTLFIYLFS 307


>gi|294627662|ref|ZP_06706244.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598014|gb|EFF42169.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 388

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 141/335 (42%), Gaps = 75/335 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +          +                  +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                     RKD        F+PEGLL ++S+ ++ + G   G  +      +A  +  
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRQGKTVASTRSL 249

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  G  L++  L L +  A PL+K+L++ S+V  T
Sbjct: 250 LLAGAGLVV--LALLWAPAWPLSKKLWSGSFVACT 282


>gi|117921549|ref|YP_870741.1| hypothetical protein Shewana3_3111 [Shewanella sp. ANA-3]
 gi|117613881|gb|ABK49335.1| conserved hypothetical protein [Shewanella sp. ANA-3]
          Length = 395

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 106/429 (24%), Positives = 171/429 (39%), Gaps = 96/429 (22%)

Query: 7   ETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHA 55
            TT    +  +  +  +   K      RL SLD  RG  +           AL++L   A
Sbjct: 2   STTAPESITNTGVNAQEAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLMLTGWA 61

Query: 56  GGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKL 112
           G  W   ++ H+ WNG    D + P F+F+ GVA+ L+ KR+       +  ++R  +K 
Sbjct: 62  GWQWGDTQMHHSEWNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMQERMPVYRHGIKR 121

Query: 113 LFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV 172
           LF  +LL   ++H         D   +R   VL RIA ++   +L+              
Sbjct: 122 LFLLLLLGILYNHGWGT-GAPADPEKVRYASVLGRIAFAWFFAALL-------------- 166

Query: 173 GRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAK 232
                     WH  +   VLV    L+    V  W                    G    
Sbjct: 167 ---------VWHTSLRTQVLVALGILVAYGAVQLW---------------LPFPGGQAGV 202

Query: 233 LNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEG 291
           L+P  +   Y+D  +L G+++    P                              +PEG
Sbjct: 203 LSPTESINAYVDSLLLPGVSYQGRTP------------------------------DPEG 232

Query: 292 LLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNK 349
           +LS++ ++++ + GV  GH I+  H KG  A++      G   L  G  L     IP+NK
Sbjct: 233 VLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLG--GVIPVNK 290

Query: 350 QLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA-------- 401
           +L+T S+V VTSG + L+ +  YALVD+   +        IG NA+++Y+ +        
Sbjct: 291 ELWTSSFVLVTSGWSMLLLALFYALVDVLKWQKLAFIFVVIGTNAIIIYLASSLVDWKYI 350

Query: 402 AEGIFAGFI 410
           A+ +F G I
Sbjct: 351 AQSVFGGVI 359


>gi|345880604|ref|ZP_08832150.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
 gi|343922516|gb|EGV33216.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
          Length = 383

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 166/390 (42%), Gaps = 93/390 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           KT R+ ++DI RG+ +A MILV++ GG   +  + HA W G    D V PFF+FI+G+  
Sbjct: 5   KTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWFGLTPTDLVFPFFMFIMGITT 64

Query: 89  ALALKRIPDRAD--AVKKVIFRTLKLLFWGILL-------QGGFSHAPDELTYGVDV--- 136
            L+L++          KK+I R + L   GI +       +G F+     L +   V   
Sbjct: 65  YLSLRKYDFEWSWPCAKKIIKRGMLLYVIGIAISWLMMFCRGLFNEDYAALPFFSHVFAA 124

Query: 137 ----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 IRL GV  R+A  Y+  S+V +  K          RF         WL+AA V 
Sbjct: 125 ANVFDHIRLVGVFPRLAFCYVFASVVALSVKH---------RFI-------PWLIAA-VF 167

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           + Y A+L           + N  + D   + NV                 ID  +LG  H
Sbjct: 168 IGYFAVL----------CLGNGFAHDASNICNV-----------------IDEAILGRQH 200

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y                     + D P       +PEGLLSS+ ++   +IG   G V+
Sbjct: 201 LY---------------------KWDIP-------DPEGLLSSLPALGHVLIGFCVGRVV 232

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +       ++++    G  L I G  L +    P++K+L+T ++  VT G A+   + + 
Sbjct: 233 MSATSLNDKIEKLFIYGAVLTILGFLLSY--GCPISKKLWTPTFALVTCGLASTTLALLS 290

Query: 373 ALVDIWNLKYPFLPLAWI-GMNAMLVYVMA 401
            ++D   +K   +    + G+N + +YV A
Sbjct: 291 WVIDKQGVKSHAISFFRVFGVNPLALYVWA 320


>gi|384417772|ref|YP_005627132.1| membrane protein [Xanthomonas oryzae pv. oryzicola BLS256]
 gi|353460685|gb|AEQ94964.1| membrane protein, putative [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 388

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 98/355 (27%), Positives = 149/355 (41%), Gaps = 81/355 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   ++  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSKRAALIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV--LVVYLALLYGTY 203
           QRI L YL  +L+  +                        +  ACV  L+ Y ALLY   
Sbjct: 138 QRIGLCYLAAALLVRYLPPRS-------------------IAPACVALLLGYWALLY--- 175

Query: 204 VPDWQFTIINKDSADYGKVFNV-TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                  +  +  A+  K  N  TC               +D  + G  H+Y        
Sbjct: 176 -------VFGQPGAELSKTGNAGTC---------------LDLWLYGREHLY-------- 205

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                        RKD        F+PEGLL ++S+ ++ + G   G  +       A  
Sbjct: 206 -------------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTTAST 246

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
           +  +  G  +++  L L +  A PL+K+L++ S+V  T G   L    +  L+++
Sbjct: 247 RSLLLAGVGMVL--LALLWAPAWPLSKKLWSGSFVACTVGLDLLALGVLVYLLEL 299


>gi|223936398|ref|ZP_03628310.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223894916|gb|EEF61365.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 427

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 114/439 (25%), Positives = 190/439 (43%), Gaps = 109/439 (24%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRG-----------LAVALM 49
           M+E+ A      P+I + P    +   +    QRL S+D  RG           L  AL 
Sbjct: 1   MTELTA------PVIEASPA---KAASTASIPQRLMSVDALRGFDMFWIIGADSLVYALH 51

Query: 50  ILVDHAGGDWP--EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPD---RADAVKK 104
            L  +   D+   ++ H  W G +  D + P F+FI+GV++  +L +      RA+AVK+
Sbjct: 52  RLSQNRVTDFLGLQLDHCDWAGFHFYDLIFPLFVFIMGVSVVFSLTKAIQQLGRAEAVKR 111

Query: 105 VIFRTLKLLFWGILLQGGFSHA-PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
           V  R+  L    ++  GG   A PD          IRL GVL RIAL Y +  L+  F K
Sbjct: 112 VFRRSALLFVVALIYSGGVRSAWPD----------IRLLGVLNRIALCYFVGGLIFCFFK 161

Query: 164 DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP----------------DW 207
                     R  +         +AA +L+ Y +++  T+VP                D 
Sbjct: 162 P---------RAMV--------AIAAALLIGYWSIM--TFVPIRDIRMAHYKEKHELVDN 202

Query: 208 QFTIINKDS--ADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
               I +D+  +D  K+F N T  V AK +   N   ++D K LG               
Sbjct: 203 DVDKIMQDTGVSDPAKIFYNTTNWVTAKYDMGYNVANHLDFKYLG--------------- 247

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK- 323
                      RK    W     +PEGLLS++ ++ +T +      +++ +  +  R K 
Sbjct: 248 ----------GRKYDTYW-----DPEGLLSTIPAV-ATCLLGILAGLLLRSTNYCDRWKV 291

Query: 324 -QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
              +++G A +I G    ++   P+ K+++T S+V V  G +A++    Y +VD+W  + 
Sbjct: 292 IYLLSLGAAGVILGFL--WSIQFPVVKKIWTSSFVLVAGGFSAILLGIFYQVVDVWKYQK 349

Query: 383 PFLPLAWIGMNAMLVYVMA 401
              P  W+GMN++ +Y+ +
Sbjct: 350 WCQPFVWMGMNSITIYLTS 368


>gi|393725858|ref|ZP_10345785.1| hypothetical protein SPAM2_19574 [Sphingomonas sp. PAMC 26605]
          Length = 400

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 158/348 (45%), Gaps = 73/348 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  RL +LD+ RGLAVA MILV   G DW     ++ HA W+G  LAD V P FLF VG+
Sbjct: 4   RLPRLEALDVLRGLAVAGMILVVSPG-DWSMAYAQLQHAAWHGATLADMVFPTFLFSVGM 62

Query: 87  AIALALKRIPDRADAVKKVIF------RTLKLLFWGILLQGGF-----SHAPDELTYGVD 135
           A+ L+  R+   AD  ++ +F      R++ L+  G++++  +     + AP     G+ 
Sbjct: 63  ALGLSFPRL--MADTAQRRLFWMRLIRRSITLVVLGLVVEATYVWTISAGAPYPGHGGLS 120

Query: 136 VRMIRLCGVLQRIALSYLL-VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ G+LQRI L YLL  +L+ + ++ + D   ++    +  L+C      A +L+ 
Sbjct: 121 --YVRIPGILQRIGLCYLLGGALIVVTSRTIADGRIAIAPQRV--LFC-----IAAILIG 171

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y ALL    VP +   ++  D                      +   ++DR +  + H+ 
Sbjct: 172 YWALLRFVPVPGFGVGLLTPDG---------------------SLPAFVDRTLFTVPHL- 209

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               W    A  Q     GP          A ++PEGLLS++ +  + + G         
Sbjct: 210 ----WPLGSATGQ-----GP----------ATYDPEGLLSTLPATANLLFGA-LAAWAWR 249

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
                A L   +  G  L+I GL L       +NK+L+T S+   +SG
Sbjct: 250 QNSDRATLHVAIA-GTMLIIAGLALD--PVFEINKRLWTSSFALFSSG 294


>gi|329851960|ref|ZP_08266641.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
 gi|328839809|gb|EGF89382.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
          Length = 369

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 161/378 (42%), Gaps = 78/378 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAG-GDWP--EISHAPWNGCNLADFVMPFFLFIVGV 86
           +  QR  SLD+FRGL VA MI+V+ +G G  P  ++SHA W G  LAD V P FLF VG 
Sbjct: 1   MAGQRFTSLDVFRGLTVAFMIVVNTSGPGAAPFAQLSHATWFGLTLADLVFPAFLFAVGN 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVL 145
           A++    +       + KV+ R   L   G L+    F HA  +      V   R+ GVL
Sbjct: 61  AMSFGDPKSGPTGRYLGKVVKRAAILFLLGYLMYWFPFVHATADGWALNPVEHTRIPGVL 120

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL +L  ++   +     D  + +G  ++  L  W  LM               + P
Sbjct: 121 QRIALCFLAAAIAVRWL----DVPKLIGLSAVLLLGYWGALM--------------VFGP 162

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
             +                       +L P  N    IDR V GINHMY      + K  
Sbjct: 163 PGE-----------------------QLTPLGNIGALIDRAVFGINHMYA-----KGKG- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                ++PEGL S++ +I++ + G   G   I ++  L  +   
Sbjct: 194 ---------------------YDPEGLFSTLPAIVNVLAGYLAGRY-IRSQPDLRTVVIR 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY--P 383
           + +  A LI    L ++   PL+K+L+T S+  +  G    + + + A V++   K+  P
Sbjct: 232 LAVA-AGLIVAAALAWSLTFPLSKRLWTSSFALINIGIDLGLLAGLIAYVELARQKFGVP 290

Query: 384 FLPLAWIGMNAMLVYVMA 401
           F  +   G N + +Y+ +
Sbjct: 291 FCEV--FGRNPLAIYLFS 306


>gi|336312505|ref|ZP_08567454.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
 gi|335864011|gb|EGM69129.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
          Length = 384

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 105/420 (25%), Positives = 169/420 (40%), Gaps = 110/420 (26%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNG 69
           ++     +   RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G
Sbjct: 7   NETATIKVTKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHG 66

Query: 70  CNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF- 123
               D + P F+F+ GVA+ L+ KR+      +R    +  I R L LL  GIL   G+ 
Sbjct: 67  FRFYDLIFPLFIFLSGVALGLSPKRLDKLPLSERLPVYRHGIKRLLLLLLLGILYNHGWG 126

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           + AP       D   +R   VL RIA ++   +L+                        W
Sbjct: 127 TGAP------ADPEKVRYASVLGRIAFAWFFAALL-----------------------VW 157

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAV 240
           H  +   ++V  L +L G                 YG +      + G    L+P  +  
Sbjct: 158 HTSLRTQIIVA-LGILLG-----------------YGAIQLWLPFSGGQAGVLSPTESIN 199

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
            YID  +L                                S+     +PEGLLS++ +++
Sbjct: 200 AYIDSILL-----------------------------PGVSYQGRTLDPEGLLSTIPAVV 230

Query: 301 STIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           + + GV  GH I+  H +G  A++      G   L  G  L     IP+NK+L+T S+V 
Sbjct: 231 NALAGVFVGHFIVKSHPQGEWAKVGLLAAAGGVCLALGWLLD--GVIPVNKELWTSSFVL 288

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
           VTSG + ++ +  YALVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 289 VTSGWSMVLLAVFYALVDVLKWQKAAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGGLV 348


>gi|423304873|ref|ZP_17282872.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
           CL03T00C23]
 gi|423310012|ref|ZP_17287996.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
           CL03T12C37]
 gi|392682836|gb|EIY76175.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
           CL03T00C23]
 gi|392683302|gb|EIY76639.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
           CL03T12C37]
          Length = 394

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 94/396 (23%), Positives = 166/396 (41%), Gaps = 99/396 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +D  +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDHAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 201 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    L + G+ L F+        P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             + +  ++D+   K   +     G+N + +YV+  
Sbjct: 293 FLALLIWIIDVKGYKKWSMFFEAFGVNPLFMYVLGG 328


>gi|295690502|ref|YP_003594195.1| hypothetical protein Cseg_3137 [Caulobacter segnis ATCC 21756]
 gi|295432405|gb|ADG11577.1| Protein of unknown function DUF2261, transmembrane [Caulobacter
           segnis ATCC 21756]
          Length = 372

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 146/349 (41%), Gaps = 74/349 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             R  SLD+FRGL V LMI+V+ AG     + ++ HAPW G   AD V P FLF VG ++
Sbjct: 5   AARFLSLDVFRGLTVCLMIVVNTAGPGAKAYTQLVHAPWFGFTAADAVFPSFLFAVGCSM 64

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
           A A  R     + + KV+ R   +   G L+    F    D     +     R+ GVLQR
Sbjct: 65  AFAFSRPIPTNEFLAKVLRRAALIFLLGFLMYWFPFVKKIDGHWALIPFADTRVMGVLQR 124

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IAL Y+L +                        +   WL    ++ +   LL G     W
Sbjct: 125 IALCYMLAA------------------------FAVRWLSPRLIVALSAVLLLGY----W 156

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
              +   D A             A L+   NA  ++D  ++G NH+Y             
Sbjct: 157 AILMTLGDPA-------------APLSKLGNAGTHLDLFLIGQNHLY------------- 190

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                   RKD        F+PEGLL ++ S ++ + G      +    G  + + +   
Sbjct: 191 --------RKD------GGFDPEGLLGTLPSTVNVLAGYLAARFLKENPGSQSAMARMAI 236

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            G  L++ GL   ++   P+ K+L+T S+V +T G   ++ +A+  L++
Sbjct: 237 AGVVLILAGLA--WSPLFPIAKKLWTGSFVLLTVGIDLVLLAALTKLLE 283


>gi|404486905|ref|ZP_11022093.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335959|gb|EJZ62425.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
           YIT 11860]
          Length = 373

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 102/421 (24%), Positives = 168/421 (39%), Gaps = 105/421 (24%)

Query: 31  KTQRLASLDIFRG-----------LAVALMILVDHAGGD--WPEISHAPWNGCNLADFVM 77
           K  RL SLD  RG           L V L  L   A GD     + H PW+G    D + 
Sbjct: 5   KNTRLLSLDTLRGFDMLFIMGFAPLVVTLNALHPTAVGDVIAGHMRHVPWDGFTQHDMIF 64

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFS-HAPDELTY 132
           P FLFI G++   +L +      + K +  R  +    L+  G L  G    + PD    
Sbjct: 65  PLFLFIAGISFPFSLAKQRGSGSSDKHIYLRVFRRGVTLVLLGFLYNGFLQLNFPD---- 120

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 +RL  VL RI L+++  + + +  K      +SV    I  +   +WL+ A   
Sbjct: 121 ------VRLASVLGRIGLAWMFGAFIYMSLK------KSVQYGLIVFILVGYWLLLA--- 165

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
                     +VP         D+A             + L+   N VGYIDR  L    
Sbjct: 166 ----------FVPA-------PDAAG-----------ASPLSIEGNLVGYIDRHCLPGKL 197

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y +                              F+PEGLLS++ +I++ ++G++ G ++
Sbjct: 198 IYGN------------------------------FDPEGLLSTLPAIVTALLGIYAGEIV 227

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
             T+      K  +  G  +++  + L +    P+NK L++ S+ C   G + L+F+  Y
Sbjct: 228 RSTRLGSGERKSLLLSGIGVVLVAIGLVWNTVFPINKMLWSSSFTCFVGGLSFLLFALFY 287

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAA--------EGIFAGFIN--GWYYGDPHNTL 422
            +VD+   K   L    IG+N++ +Y+           + +F G     G Y G+P   +
Sbjct: 288 YIVDVKGWKSWTLFFRVIGLNSITIYLAQQVVGFSHMNKFLFGGMSQWVGEYAGEPAGAV 347

Query: 423 V 423
           V
Sbjct: 348 V 348


>gi|294667090|ref|ZP_06732315.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292603100|gb|EFF46526.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 388

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 141/335 (42%), Gaps = 75/335 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +          +                  +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                     RKD        F+PEGLL ++S+ ++ + G   G  +      +A  +  
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTVASTRSL 249

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  G  L++  L L +  A PL+K+L++ S+V  T
Sbjct: 250 LLAGAGLVV--LALLWAPAWPLSKKLWSGSFVACT 282


>gi|375253854|ref|YP_005013021.1| hypothetical protein BFO_0041 [Tannerella forsythia ATCC 43037]
 gi|363406758|gb|AEW20444.1| putative membrane protein [Tannerella forsythia ATCC 43037]
          Length = 390

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 102/408 (25%), Positives = 173/408 (42%), Gaps = 94/408 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            ++RL +LDI RG+ +A MI+V++ G     +  + HA W+G    D V PFF+FI+G++
Sbjct: 8   SSRRLLALDILRGITIAGMIMVNNPGSWSFVYAPLGHAAWHGLTPTDLVFPFFMFIMGIS 67

Query: 88  IALALKR--IPDRADAVKKVIFRT--------------LKLLFWGILLQGGFSHAPDELT 131
             ++LK+        A++K+I RT              L    W  L  GG S       
Sbjct: 68  TYISLKKYDFTFSYSAMRKIIRRTAVIFAIGLGLAWLGLTCRTWHGLADGGLSFGARLWQ 127

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
              +   +R+ GV+QR+ALSY   +L+ +  +  +                        +
Sbjct: 128 SVSNFGHLRILGVMQRLALSYGATALIALAIRHHR------------------------I 163

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
             + +ALL G       +T++    A  G  +N T           N +  +DR VLG+N
Sbjct: 164 PYLIVALLGG-------YTVLLL--AGNGLAYNET-----------NILSIVDRAVLGVN 203

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y                      KD         EPEGLLS++ +I   +IG   G  
Sbjct: 204 HTY----------------------KD------MGIEPEGLLSTLPAIAHVLIGFCCGRA 235

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++       ++ +   +G A+   G  L +    P+NK++++ ++V +T G A+ + + +
Sbjct: 236 MLGVTEVRDKMLRLFLIGTAMAFAGWLLSY--GCPINKKIWSPTFVLITCGMASGLLALL 293

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAAE-GIFAGFINGWYYGDP 418
             ++D+             G+N + +YV  A   I  G I   Y GDP
Sbjct: 294 IWIIDVKRHTKWCRFFEAFGVNPLFMYVAGAVFSILLGSIYLTYNGDP 341


>gi|373948546|ref|ZP_09608507.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS183]
 gi|386325609|ref|YP_006021726.1| hypothetical protein [Shewanella baltica BA175]
 gi|333819754|gb|AEG12420.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica BA175]
 gi|373885146|gb|EHQ14038.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS183]
          Length = 384

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 103/405 (25%), Positives = 168/405 (41%), Gaps = 102/405 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G    D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +L+                        WH  +   ++V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G    L+P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAIQLWLPFPGGQAGVLSPTESINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +PEGLLS++ +I++ + GV  GH I+  
Sbjct: 216 TP------------------------------DPEGLLSTIPAIVNALAGVFVGHFIVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H KG  A++      G   L+ G  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPKGEWAKVGVLAAAGGIFLVLGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYA 303

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
           LVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 304 LVDVLKWQKAAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGGLV 348


>gi|374374997|ref|ZP_09632655.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
 gi|373231837|gb|EHP51632.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
          Length = 395

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 145/377 (38%), Gaps = 108/377 (28%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  SLD+FRG  V LMILV++ G  W  I     HAPW+G    D V PFFLF VG A++
Sbjct: 4   RYRSLDVFRGATVCLMILVNNPG-SWAHIYAPLDHAPWHGLTPTDLVFPFFLFAVGNAMS 62

Query: 90  LALKRIPDRADAV--KKVIFRTLKLLFWGILL--------QGGFSHAPDELTYGVDVRMI 139
             + R+ +   A   KK+  RTL +   GI L         G    A   +T       I
Sbjct: 63  FVIPRLQEAGPAEFWKKITKRTLIIFGIGIFLNWSPFVRWNGDTLQAVTWVTDPAKNIGI 122

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQRIA  Y   S++  + K                         A  L + L L 
Sbjct: 123 RIFGVLQRIAFCYFFASIIVYYLKP----------------------KTAYFLSLVLLLA 160

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINHMYH 255
           Y      W   I+  + AD                 P +  G+    ID+ +L I HMY 
Sbjct: 161 Y------WGLCILG-NPAD-----------------PYSLKGWFGTNIDKAILHIPHMYK 196

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                           EG            PF+PEG  SS+ +I+  + G   G  I ++
Sbjct: 197 G---------------EG-----------VPFDPEGFASSLGAIVQIVFGYFVGMYIKNS 230

Query: 316 KGHLAR---------------LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
              + +               L      G  LL+ G    +    P+NK+++T SY   T
Sbjct: 231 SAQIPKDLTDKQDPRNPMFKMLTVLFVAGVGLLVTGFC--WDMVFPINKKIWTSSYTVYT 288

Query: 361 SGAAALVFSAIYALVDI 377
           +G A +    +   ++I
Sbjct: 289 TGLAIITLCVMIFFIEI 305


>gi|29349027|ref|NP_812530.1| hypothetical protein BT_3619 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298386734|ref|ZP_06996289.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383124379|ref|ZP_09945043.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
 gi|29340934|gb|AAO78724.1| putative transmembrane protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251839125|gb|EES67209.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
 gi|298260408|gb|EFI03277.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 372

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 94/380 (24%), Positives = 163/380 (42%), Gaps = 89/380 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV+  G     +  + HA W G    D V PFF+FI+G++
Sbjct: 6   SNKRLLALDVMRGITIAGMILVNTPGSWQHAYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGI----LLQGGFSHAPDELTYGVDVRMIRL 141
             ++L++        A  K++ RT+ +   GI    L    F H P        +  IR+
Sbjct: 66  TYISLRKYNFTFSVPAGLKILKRTVIIFLIGIGISWLSILCFQHDP------FPIDQIRI 119

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y + ++V +  K                 + +   + A +L+ Y A+L  
Sbjct: 120 LGVMQRLALGYGVTAIVALLMK-----------------HKYIPYLIAVLLISYFAIL-- 160

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                         +   G V++ T           N +  +DR VLG  H+Y       
Sbjct: 161 --------------ALGNGYVYDET-----------NILSIVDRAVLGQAHIYGGQI--- 192

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                     +PEGLLS++S+I   +IG   G +++  K    +
Sbjct: 193 -------------------------LDPEGLLSTISAIAHVLIGFCAGKLLMEVKDIHEK 227

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           L++   +G  L   G  L + +  P+ K++++ S+V VT G  +   + +  ++DI   K
Sbjct: 228 LERLFLIGTILTFAGFLLSYGS--PICKKVWSPSFVLVTCGLGSSFLALLVWIIDIKGYK 285

Query: 382 YPFLPLAWIGMNAMLVYVMA 401
                    G+N + +YV+A
Sbjct: 286 NWSRFFESFGVNPLFIYVLA 305


>gi|302759308|ref|XP_002963077.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
 gi|300169938|gb|EFJ36540.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
          Length = 293

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 43/106 (40%), Positives = 66/106 (62%), Gaps = 1/106 (0%)

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           AR+  ++     LL  G  LH    I +NK LY+ SY+C T+GAA  VF  +Y LVD+++
Sbjct: 8   ARIINFIVPAVILLALGYVLHLL-GIKMNKPLYSFSYMCFTAGAAGAVFCLLYILVDVYD 66

Query: 380 LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLVCF 425
           ++YP L L W+GMN++++Y +AA  +   F+ G+Y+  P   LV F
Sbjct: 67  IRYPTLLLEWMGMNSLIIYTLAATDVLVVFVQGFYWKQPQKNLVTF 112


>gi|393782159|ref|ZP_10370348.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
           CL02T12C01]
 gi|392674193|gb|EIY67642.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
           CL02T12C01]
          Length = 387

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/390 (23%), Positives = 165/390 (42%), Gaps = 93/390 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL +LDI RG+ +A MI+V++ G     +  + HA WNG    D V PFF+FI+G++ 
Sbjct: 6   SKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAQWNGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTYGVDVRM- 138
            ++L++        A  K++ RT       L L ++ +  +   S + +E+++   +   
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGLAWFSMFCRTWNSLSAEEISFFSRLGQS 125

Query: 139 ------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 IR+ GV+QR+AL Y   ++V +  K                     +L+A  ++
Sbjct: 126 IWTFDHIRILGVMQRLALCYGATAIVALTMKHKHIP----------------YLIATLLI 169

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             ++ L+ G                  G  +N T           N +  +DR VLG  H
Sbjct: 170 GYFILLVTGN-----------------GFEYNST-----------NILSVVDRAVLGEAH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY                                 +PEGLLS++ +I   +IG   G ++
Sbjct: 202 MYKDNG----------------------------IDPEGLLSTIPAIAHVLIGFCVGKLL 233

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K    +L +   +G  L   G  L +    P+NK++++ ++  VT G  +   + + 
Sbjct: 234 MEVKDINEKLGRLFLIGTILTFLGFLLSY--GCPINKKIWSPTFAIVTCGLGSSFLALLI 291

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            ++D+   K         G+N + +YVMA 
Sbjct: 292 WIIDVKGYKSWSRFFESFGVNPLFIYVMAG 321


>gi|47213040|emb|CAF93449.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 297

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 136/280 (48%), Gaps = 55/280 (19%)

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKD--------VQDKDQSVGRFSIFRLYCWHWLMAAC 190
           +R+ GVLQR+AL+YL+V+ +++            +QD   S G   +      +W    C
Sbjct: 6   LRIPGVLQRLALAYLVVACLDLLVARRFSCVFCVLQDAWWSQGIDILL-----YWPAWVC 60

Query: 191 VLVVYLALLYGTY---VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           VL++    L+ T+   VPD     +      D G   N T G          A G+IDR 
Sbjct: 61  VLLLESVWLFITFLLPVPDCPTGYLGPGGIGDMGLYPNCTGG----------AAGFIDRW 110

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +LG  H+Y +P+ +   A                   H P++PEG+L S++SIL   +G+
Sbjct: 111 LLGEKHIYQNPSSQGIYAT------------------HLPYDPEGILGSINSILIAFLGL 152

Query: 307 HFGHVIIHTKG-HLARLKQWVTMGFALLIFG--LTLHFTNA--IPLNKQLYTLSYVCVTS 361
             G +I+H +  H   + +++  GF L I    LT   TN   IP+NK L++LSYV   +
Sbjct: 153 QAGKIILHHRDLHQGVISRFLIWGFLLGIISAVLTNCSTNQGLIPINKNLWSLSYVTTLA 212

Query: 362 GAAALVFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVYV 399
             A ++ + IY  VD+  W    PFL   + G+N++LVYV
Sbjct: 213 CFAYVLLALIYYTVDVKKWWSGRPFL---YPGLNSILVYV 249


>gi|160874301|ref|YP_001553617.1| hypothetical protein Sbal195_1181 [Shewanella baltica OS195]
 gi|378707545|ref|YP_005272439.1| hypothetical protein [Shewanella baltica OS678]
 gi|160859823|gb|ABX48357.1| conserved hypothetical protein [Shewanella baltica OS195]
 gi|315266534|gb|ADT93387.1| hypothetical protein Sbal678_1209 [Shewanella baltica OS678]
          Length = 384

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 169/405 (41%), Gaps = 102/405 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G +  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+       +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +L+                        WH  +   ++V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G  + L+P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAMQLWLPFPGGQASVLSPTESINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +PEGLLS++ +I++ + GV  GH I+  
Sbjct: 216 TP------------------------------DPEGLLSTIPAIVNALAGVFVGHFIVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H KG  A++      G   L FG  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPKGEWAKVGLLAAAGCVCLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYA 303

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
           LVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 304 LVDVLKWQKAAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGGLV 348


>gi|409199197|ref|ZP_11227860.1| hypothetical protein MsalJ2_19286 [Marinilabilia salmonicolor JCM
           21150]
          Length = 369

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 166/381 (43%), Gaps = 91/381 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           ++QR  +LD+ RG+ +ALMI V+  G  W  I     HA W+GC   D V PFFLF+ GV
Sbjct: 3   QSQRYLALDVLRGMTIALMITVNTPG-SWQYIYAPLRHASWHGCTPTDLVFPFFLFVAGV 61

Query: 87  AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLC 142
           ++  +  +      ++++K++  RTL +   G+ L     +SH         D   +R+ 
Sbjct: 62  SMFFSFGKYGGALNSESLKRLGRRTLLIFVIGLFLNSFPQWSH---------DFSTLRIM 112

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIAL+Y + SL+ +               S  R Y     +   +L++Y  +L   
Sbjct: 113 GVLQRIALAYGIGSLIVL---------------SAPRKYI--PFIGGGILLIYWGIL--- 152

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
               W                    G     +   NAV   D+ +LG  H+Y        
Sbjct: 153 ---AW-------------------FGGAEPYSLEGNAVIPFDKAILGEQHLY-------- 182

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                             +    PF+PEGLLS+V +I++ ++G   G +I +T+    R 
Sbjct: 183 ------------------TGFGIPFDPEGLLSTVPAIVTVLLGYLTGVIIKNTE----RA 220

Query: 323 KQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           K    +    LI  +   F   I P+NK L+T SYV  T+G AAL  + +  ++DI   K
Sbjct: 221 KVPGRLALYGLITTVIGRFWGVIFPINKPLWTSSYVLYTAGLAALFLALLVFIIDIKGYK 280

Query: 382 YPFLPLAWIGMNAMLVYVMAA 402
                    G+N + +Y ++ 
Sbjct: 281 KWTSFFVVFGVNPLFIYALSG 301


>gi|393786264|ref|ZP_10374400.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
           CL02T12C05]
 gi|392659893|gb|EIY53510.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
           CL02T12C05]
          Length = 387

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 89/390 (22%), Positives = 164/390 (42%), Gaps = 93/390 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL +LDI RG+ +A MI+V++ G     +  + HA WNG    D V PFF+FI+G++ 
Sbjct: 6   SKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAKWNGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTY----GVD 135
            ++L++        A  K++ RT       L + ++ +  +   S + +E+++    G  
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSGEEISFLSRLGQS 125

Query: 136 V---RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           V     IR+ GV+QR+AL Y   +++ +  K                             
Sbjct: 126 VWTFDHIRILGVMQRLALCYGATAIIALTMKH---------------------------- 157

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             Y+  L  T +  +   +I  +  +Y                  N +  +DR VLG  H
Sbjct: 158 -KYIPYLIVTLLAGYFILLITGNGFEYND---------------TNILSVVDRAVLGEAH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY                                 +PEGLLS++ +I   +IG   G ++
Sbjct: 202 MYKDNG----------------------------IDPEGLLSTIPAIAHVLIGFCVGKLL 233

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K    +L++   +G  L   G  L +    P+NK++++ ++  VT G  +   + + 
Sbjct: 234 MEVKDINEKLERLFLIGTILTFLGFLLSY--GCPINKKIWSPTFAIVTCGLGSSFLALLI 291

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            ++D+   K         G+N + +YV+AA
Sbjct: 292 WIIDVKGYKSWSRFFESFGVNPLFIYVLAA 321


>gi|386392672|ref|ZP_10077453.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
 gi|385733550|gb|EIG53748.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
          Length = 370

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 160/379 (42%), Gaps = 87/379 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +  RLAS+D  RGLAVA MIL ++ G     + E+ HA W+G   ADF+ P FLF+VGV 
Sbjct: 5   RKTRLASVDGLRGLAVAGMILANNPGERGHVYRELQHAVWDGWTAADFIFPLFLFLVGVC 64

Query: 88  IALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           +ALA+ R   R        ++V+ R + L   G+L         +     V    +R+ G
Sbjct: 65  VALAVDRDTVRTGEAHRFWRRVLTRAIILFLLGLL---------ENAYLRVSFENLRIPG 115

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQRIA+ YL  + + +       +  + G  S+  +           L+ Y  LL G  
Sbjct: 116 VLQRIAVVYLATAWLHV-------RCGNRGIVSVILV----------TLLGYWLLLAGVP 158

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP      +++D                      N  G+ID+ +LG NH++ +       
Sbjct: 159 VPGLGHPSLSRD---------------------VNWEGWIDQLLLG-NHIWKY------- 189

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                  ++PEG+LS+  +I   ++GV  G  +      +    
Sbjct: 190 --------------------ETTWDPEGVLSTFPAIALGLVGVLCGRWLRLGGLGVG--- 226

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
               +   L +  L L +    P+NK L T S+V    GA  ++ +  Y L+D+      
Sbjct: 227 --RGLAVGLAMLLLGLLWNAWFPVNKSLCTSSFVLFVGGAGVMMLAGCYWLLDMRGNAAW 284

Query: 384 FLPLAWIGMNAMLVYVMAA 402
             P   +G NA+ VYV A+
Sbjct: 285 AGPFVILGTNALAVYVAAS 303


>gi|291516094|emb|CBK65304.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 331

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/363 (25%), Positives = 153/363 (42%), Gaps = 84/363 (23%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK------KVIFRTLKLLF 114
           ++ H  WNG    D + P FLFI GV+   +L +   RA  +       KVI R + L+ 
Sbjct: 8   QMGHVSWNGLTQHDTIFPLFLFIAGVSFPFSLSK--QRASGISERRILFKVIRRGMTLIV 65

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
            G++  G F           D   +R+  VL RI L+++  SL+ ++ K V+ +      
Sbjct: 66  LGMIYNGLFRF---------DFASLRVASVLGRIGLAWMFASLLYMYCK-VRTRA----- 110

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                      + AA VL+ Y  L+Y    PD        D  D              L+
Sbjct: 111 -----------VFAAVVLIGYSLLMYLVVAPD------APDGTD-------------PLS 140

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
              N  G++DR+ L        P    + AC                     F+PEGLLS
Sbjct: 141 VAGNIAGWVDRQWL--------PG---TFAC-------------------GSFDPEGLLS 170

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-IPLNKQLYT 353
           ++ +I+S + G+  G  ++  +  L+  +  + M  A +   +     N  IP+NK+L++
Sbjct: 171 TLPAIVSALFGMFTGEFLLRKRSSLSGEQIALCMALAAVAITVIGIIWNCWIPINKKLWS 230

Query: 354 LSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            S+ CV +G +  +F+  Y L+D+   K        IG+N++ +Y+      F G  N +
Sbjct: 231 SSFTCVVTGYSLGLFALFYYLIDVRGWKRWTFFFRVIGLNSITIYLAQRIVGFGGIANFF 290

Query: 414 YYG 416
           + G
Sbjct: 291 FGG 293


>gi|24375008|ref|NP_719051.1| N-acetylglucosamine locus membrane protein of unknown function
           DUF1624 NagX [Shewanella oneidensis MR-1]
 gi|24349746|gb|AAN56495.1| N-acetylglucosamine locus membrane protein of unknown function
           DUF1624 NagX [Shewanella oneidensis MR-1]
          Length = 395

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 119/484 (24%), Positives = 191/484 (39%), Gaps = 127/484 (26%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALM 49
           MS    E   +  +       ++ Q K      RL SLD  RG  +           AL+
Sbjct: 1   MSTTAPELAANVSINAQVATANNSQPKP-----RLMSLDALRGFDMFWILGGEALFGALL 55

Query: 50  ILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI---------PDR 98
           I    AG  W   ++ H+ W+G  L D + P F+F+ GVA+ L+ KR+         P  
Sbjct: 56  IFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLFIFLSGVALGLSPKRLDKLPLHERLPVY 115

Query: 99  ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              VK++    L  + +     G  + AP      VD   IR   VL RIA ++   +L+
Sbjct: 116 RHGVKRLFLLLLLGILYN---HGWGTGAP------VDPDKIRYASVLGRIAFAWFFAALL 166

Query: 159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD 218
                                   WH  +   VLV  + +L G                 
Sbjct: 167 -----------------------VWHTSLRTQVLVA-VGILVG----------------- 185

Query: 219 YGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGP 274
           YG +        G    L+P  +   Y+D  +L G+++    P                 
Sbjct: 186 YGAMQLWLPFPGGQAGVLSPTVSINAYVDSLLLPGVSYQGRMP----------------- 228

Query: 275 LRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFAL 332
                        +PEG+LS++ ++++ + GV  GH I+  H KG  A++      G   
Sbjct: 229 -------------DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLGAAGGVC 275

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
           L  G  L     IP+NK+L+T S+V VTSG + L+ +  YALVD+   +        IG 
Sbjct: 276 LALGWLLD--AVIPVNKELWTSSFVLVTSGWSMLLLALFYALVDVLKWQKLVFVFVVIGT 333

Query: 393 NAMLVYVMA--------AEGIFAGFINGW-YYGDPHNTLVCFLFIISYILHSFLWELRKF 443
           NA+++Y+ +        A+ +F G I     Y  P   +V  L +   +L+   W  R+ 
Sbjct: 334 NAIIIYLASSLVDWKYIAQSVFGGVIAVLPEYAQPLGAVVSLLNVQWLVLY---WMYRRK 390

Query: 444 LYVQ 447
           ++V+
Sbjct: 391 IFVR 394


>gi|90022681|ref|YP_528508.1| hypothetical protein Sde_3039 [Saccharophagus degradans 2-40]
 gi|89952281|gb|ABD82296.1| conserved hypothetical protein [Saccharophagus degradans 2-40]
          Length = 363

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/383 (26%), Positives = 159/383 (41%), Gaps = 91/383 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVG 85
           + TQR  +LD+ RG  +A+MILV+   GDW      + HA W+G  + DFV PFFLFI+G
Sbjct: 1   MATQRYLALDVMRGATLAMMILVN-TPGDWGFVYAPLLHADWHGVTITDFVFPFFLFIIG 59

Query: 86  VAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLC 142
            A+    +     A A+  KK+I RT  L   G+LL    F+ A  EL         R+ 
Sbjct: 60  SALFFTSRSSGQLAPAIKAKKIIKRTALLFTIGLLLHAFPFTTALSEL---------RIL 110

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIAL+Y + + + ++    Q                   LMAA           G 
Sbjct: 111 GVLQRIALAYGIAAFI-VWLPTTQR------------------LMAAL----------GI 141

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            V  W   I+   S                     N V +ID  +LG  H++        
Sbjct: 142 LVAYWLVFILTDSSYHLAD----------------NIVRHIDITILGAEHLWQGKG---- 181

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG---HL 319
                                   F+PEGLLS++ + ++ + G     +++       + 
Sbjct: 182 ----------------------LAFDPEGLLSTLPAAVNILAGFEATRLLVSQPAGEPNN 219

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN 379
           A  +Q+    +A+    + L +   +P+NK L+T S+V +TSG   LV   +  L     
Sbjct: 220 ATSRQFKLALYAMCSITIALIWHRWMPINKSLWTSSFVLLTSGVGVLVLLLLVRLEPYRA 279

Query: 380 LKYPFLPLAWIGMNAMLVYVMAA 402
               +   A  G N + +YV+++
Sbjct: 280 TAAIYRAFAIYGQNPLFIYVLSS 302


>gi|431798742|ref|YP_007225646.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
 gi|430789507|gb|AGA79636.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
          Length = 363

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/133 (42%), Positives = 76/133 (57%), Gaps = 14/133 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +RL SLD+ RG+ +A MILV+  G     +P + HA WNG    DF+ PFFLFIVGV+
Sbjct: 4   KNKRLISLDVLRGMTIAAMILVNFPGSWEHVFPPLHHAQWNGITPTDFIFPFFLFIVGVS 63

Query: 88  IALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           I +A   K   D+    KK+ FR  K+   G+LL       P+      D   IR+ GVL
Sbjct: 64  IVMAYAGKMEMDKTIVYKKLFFRGAKIFALGVLL----GMIPE-----FDFSAIRVAGVL 114

Query: 146 QRIALSYLLVSLV 158
           QRIAL ++  +L+
Sbjct: 115 QRIALVFVACTLM 127



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 58/114 (50%), Gaps = 2/114 (1%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEG+ S++ +I + I+G+  G ++      + +    + +G  L ++GL   +    P+
Sbjct: 189 DPEGVFSTLPAIATGILGMLAGQLLKSQLKEVEKANNLMVIGLVLTLWGL--FWAWFFPI 246

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           NK L+T S+V VT G A     A Y  +D+        P    G NA+ VYV+A
Sbjct: 247 NKNLWTSSFVLVTGGTAFSFLGAFYYWIDVKGNSQGTTPWVIFGSNAITVYVLA 300


>gi|388258355|ref|ZP_10135531.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
 gi|387937867|gb|EIK44422.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
          Length = 362

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 113/436 (25%), Positives = 177/436 (40%), Gaps = 110/436 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QR  +LD+ RGL +ALMILV+  G     +  + HA W+G    DFV PFF+FIVG ++ 
Sbjct: 4   QRFQALDVMRGLTLALMILVNTPGSWSFVYGPLLHADWHGATATDFVFPFFMFIVGSSMY 63

Query: 90  LALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            A++ +   A A   +K++ R + L   G+LL    S  P    +  ++   R+ GVLQR
Sbjct: 64  FAMRGLRQLAPAAQAQKILRRVVLLFVIGVLL----SAYP----FTNNIENWRVMGVLQR 115

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IA++Y   + + ++           GR           +M+A +L+ Y  LL        
Sbjct: 116 IAIAYGFAAFIILYFG-------FTGRV----------VMSAILLLGYWGLL-------- 150

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                N  +  Y              +   N V   D  VLG NH++             
Sbjct: 151 -----NIAADPY--------------SLEHNLVRQFDLAVLGANHLWQGKG--------- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              F+PEG+LS+V SI++ IIG     V++ ++     L Q   
Sbjct: 183 -----------------LAFDPEGILSTVPSIVNVIIGFEATRVLLASEDKAKALSQLFV 225

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG---------------AAALVFSAIY 372
               L+   L  +     P+NK L+T S+V +T G               AAA     +Y
Sbjct: 226 AALLLIGLALVWNLL--FPINKSLWTSSFVVLTCGVAILLLLLLVKLEQSAAASAVKPVY 283

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEG------IFAGFINGWYYGDPHNTLVCFL 426
              +I   K P       G+ A  +Y++   G      ++A F     + DP+   + F 
Sbjct: 284 HFFEIIG-KNPLFIYVLSGLLATTLYLIPVGGESAYAALYAAFCT---FADPYMASLLFA 339

Query: 427 FIISYILHSFLWELRK 442
            ++  IL    W L K
Sbjct: 340 LLMVLILWCVAWVLHK 355


>gi|452822118|gb|EME29140.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 2
           [Galdieria sulphuraria]
          Length = 351

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 148/346 (42%), Gaps = 87/346 (25%)

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKVIFRTLKLLFWG 116
           H  W   ++AD + PFFLF+VG +I  A +++P       ++  A++ V  RT+KL   G
Sbjct: 14  HESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIKLFLVG 73

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           +LL    S        G     +R  G+LQRIA+ Y  V+ + +F       +  V +++
Sbjct: 74  VLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFLFLFV------NSRVIQYA 119

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                     + + + +++ +LLYG  VP+                    C +  +L   
Sbjct: 120 ----------LVSVLFLLHTSLLYGLIVPN--------------------CLISERLTRA 149

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
           C+A  Y+D  +LG  H+Y                             H  ++PEG+LS++
Sbjct: 150 CSAQSYLDTMILGGKHLYF----------------------------HLEYDPEGILSTL 181

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL--HFTNAIPLNKQLYTL 354
            + ++T  G+    +    +    R+     +G + +   + L   F +++P++K L+T 
Sbjct: 182 MATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLVDCFPDSVPISKPLWTA 241

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFL--PLAWIGMNAMLVY 398
           S++ +T G +    S       +W    P L  P  W+G N+  ++
Sbjct: 242 SFLFLTVGCSFWCLS----FCGLWAKVTPRLVQPCLWVGRNSFFLF 283


>gi|452822119|gb|EME29141.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 1
           [Galdieria sulphuraria]
          Length = 356

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 148/346 (42%), Gaps = 87/346 (25%)

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKVIFRTLKLLFWG 116
           H  W   ++AD + PFFLF+VG +I  A +++P       ++  A++ V  RT+KL   G
Sbjct: 19  HESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIKLFLVG 78

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           +LL    S        G     +R  G+LQRIA+ Y  V+ + +F       +  V +++
Sbjct: 79  VLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFLFLFV------NSRVIQYA 124

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                     + + + +++ +LLYG  VP+                    C +  +L   
Sbjct: 125 ----------LVSVLFLLHTSLLYGLIVPN--------------------CLISERLTRA 154

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
           C+A  Y+D  +LG  H+Y                             H  ++PEG+LS++
Sbjct: 155 CSAQSYLDTMILGGKHLYF----------------------------HLEYDPEGILSTL 186

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL--HFTNAIPLNKQLYTL 354
            + ++T  G+    +    +    R+     +G + +   + L   F +++P++K L+T 
Sbjct: 187 MATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLVDCFPDSVPISKPLWTA 246

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFL--PLAWIGMNAMLVY 398
           S++ +T G +    S       +W    P L  P  W+G N+  ++
Sbjct: 247 SFLFLTVGCSFWCLS----FCGLWAKVTPRLVQPCLWVGRNSFFLF 288


>gi|282877735|ref|ZP_06286550.1| putative membrane protein [Prevotella buccalis ATCC 35310]
 gi|281300307|gb|EFA92661.1| putative membrane protein [Prevotella buccalis ATCC 35310]
          Length = 403

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 108/431 (25%), Positives = 180/431 (41%), Gaps = 110/431 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           +++R+ ++DI RG+ +A MILV++ G  W  I     HA WNG    D V PFF+F++G+
Sbjct: 8   QSKRILAIDILRGITIAGMILVNNPG-SWAHIFAPLEHAEWNGMTPTDLVFPFFMFVMGM 66

Query: 87  AIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV- 136
            I +++++         V K+I RTL L   GI + G FS          ++ T G  + 
Sbjct: 67  CIFISMQKYQFACNRQTVYKIIRRTLLLYLVGIFV-GWFSRFCYRWAFPLEDATLGQQIW 125

Query: 137 ------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IRL GVL R+A+ Y + +L+ I  +          R+               
Sbjct: 126 HTVWSFDTIRLSGVLARLAICYGITALLAITVRH---------RY--------------- 161

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +L + + LL G       +TI+            + CG     +   N +  +DR VL  
Sbjct: 162 LLSIVITLLIG-------YTIL------------LFCGNGFAYD-ETNILSIVDRAVLTD 201

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMYH                                +PEGLLS+  +I  T+IG   G 
Sbjct: 202 AHMYHDNG----------------------------IDPEGLLSTFPAIAHTLIGFLIGK 233

Query: 311 VIIHTKGHLARLKQ-------WVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVC 358
           +    +  +    Q        +     L I G  L F+  +     PLNK++++ ++V 
Sbjct: 234 LAFSKQKTVTATTQNDPKTGLILHNIVPLFIAGTILTFSGLLLAYGCPLNKKIWSPTFVL 293

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWY--YG 416
            + G A+ + + +  L+D+   K         GMN + +YVM+  G+ A     +   +G
Sbjct: 294 TSCGLASTLLALLIWLIDVKGYKRWCRFFEVFGMNPLFLYVMS--GVIAILFGSFQFPFG 351

Query: 417 DPHNTLVCFLF 427
           D   ++  FL+
Sbjct: 352 DEPTSITGFLY 362


>gi|195565141|ref|XP_002106164.1| GD16714 [Drosophila simulans]
 gi|194203536|gb|EDX17112.1| GD16714 [Drosophila simulans]
          Length = 318

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 74/127 (58%), Gaps = 13/127 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIVFPSFLWIMGVCIPL 241

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +A    +++ R++KL   G+ L             G ++  +R+ GVLQ
Sbjct: 242 SVKSQLSRGSSKARICLRILVRSIKLFVIGLCLNS---------MSGPNLEQLRVMGVLQ 292

Query: 147 RIALSYL 153
           R  ++YL
Sbjct: 293 RFGVAYL 299


>gi|380693406|ref|ZP_09858265.1| hypothetical protein BfaeM_05407 [Bacteroides faecis MAJ27]
          Length = 371

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 96/400 (24%), Positives = 168/400 (42%), Gaps = 90/400 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV+  G     +  + HA W G    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNTPGSWQHTYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGI----LLQGGFSHAPDELTYGVDVRMIRL 141
             ++L++        A  K++ RT+ +   GI    L    F H P        +  IR+
Sbjct: 65  TYISLRKYDFTFSIPAGLKILKRTVIIFLIGIGISWLSILCFQHDP------FPIDQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y + +L  +  K                 + +   +   +L+ Y  +L  
Sbjct: 119 LGVMQRLALGYGITALAALLIK-----------------HKYIPYLITVLLIGYFMIL-- 159

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                         +   G V++ T           N +  +DR VLG  H+Y       
Sbjct: 160 --------------AVGNGYVYDET-----------NVLSIVDRAVLGQAHIYG------ 188

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                  A  +PEGLLS++S++   +IG   G +++  K    +
Sbjct: 189 ----------------------GAILDPEGLLSTISAVAHVMIGFCAGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           L++   +G  L   G  L + +  P+ K++++ S+V +T G  +   + +  ++DI   K
Sbjct: 227 LERLFLIGTILTFAGFLLSYGS--PICKKIWSPSFVLITCGMGSSFLALLVWIIDIKGYK 284

Query: 382 YPFLPLAWIGMNAMLVYVMA-AEGIFAGFINGWYYGDPHN 420
                    G+N + +YV+A    I    I   Y G+P +
Sbjct: 285 GWSRFFESFGVNPLFIYVLADILAILFAMIPMTYAGEPTS 324


>gi|16124796|ref|NP_419360.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
 gi|221233512|ref|YP_002515948.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
 gi|13421730|gb|AAK22528.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
 gi|220962684|gb|ACL94040.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
          Length = 372

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 142/349 (40%), Gaps = 74/349 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             R  SLD+FRGL V LMI+V+ AG     + ++ HAPW G   AD V P FLF VG ++
Sbjct: 5   AARFLSLDVFRGLTVFLMIVVNTAGPGAKAYSQLVHAPWFGFTAADAVFPSFLFAVGCSM 64

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
           A A  +     D   KV+ R   +   G L+    F    D     +     R+ GVLQR
Sbjct: 65  AFAFSKPIPLNDFTVKVLRRAALIFLLGFLMYWFPFVRKVDGDWALIPFSDTRVMGVLQR 124

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IAL YLL +                        +   WL    ++ +   LL G     W
Sbjct: 125 IALCYLLAA------------------------FAVRWLSPRLIVALCAVLLLGY----W 156

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
              +   D A             A L+   NA   +D  ++G NH+Y             
Sbjct: 157 AILMAFGDPA-------------APLSKLGNAGTRLDLLLIGQNHLY------------- 190

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                   RKD        F+PEGLL ++ S ++ + G      +    G    + +   
Sbjct: 191 --------RKD------GGFDPEGLLGTLPSTVNVLAGYLAARFLKENPGSSQAMGRMAI 236

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            G  L++ GL   ++   P+ K+L+T S+V +T G   ++ + +  L++
Sbjct: 237 AGLVLILAGLV--WSPLFPIAKKLWTSSFVLLTVGIDLILLAGLAKLLE 283


>gi|386312853|ref|YP_006009018.1| N-acetylglucosamine related transporter, NagX [Shewanella
           putrefaciens 200]
 gi|319425478|gb|ADV53552.1| N-acetylglucosamine related transporter, NagX [Shewanella
           putrefaciens 200]
          Length = 384

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 106/410 (25%), Positives = 169/410 (41%), Gaps = 112/410 (27%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G +  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYGV 134
           +F+ GVA+ L+ KR+      DR    +  I R   LL  GIL   G+ + AP       
Sbjct: 78  IFLSGVALGLSPKRLDKLPMKDRLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           D   +R   VL RIA ++   +L+                        WH  +   V+V 
Sbjct: 132 DPEKVRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQVIVA 168

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
            L +L G                 YG +        G    L+P  +   Y+D  +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPSGQAGVLSPTQSINAYVDSILLPGV 210

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           ++    P                              +PEGLLS++ ++++ + GV  G+
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAVVNALAGVFVGY 240

Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
            I+  H +G   ++    T G A L  G  L     IP+NK+L+T S+V VTSG + ++ 
Sbjct: 241 FIVKSHPQGEWVKVGLLATAGGAWLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMILL 298

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
           +  YALVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 299 ALFYALVDVLKWQKAAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGGLV 348


>gi|392390355|ref|YP_006426958.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
           DSM 15997]
 gi|390521433|gb|AFL97164.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
           DSM 15997]
          Length = 390

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 151/371 (40%), Gaps = 96/371 (25%)

Query: 34  RLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R  SLD+FRG  VALMILV++ G     +  ++HA W GC   D V PFFLF VG A+A 
Sbjct: 4   RYYSLDVFRGATVALMILVNNPGSWSAMFKPLTHAEWAGCTPTDLVFPFFLFAVGNAMAF 63

Query: 91  ALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQR 147
            + R+      V  +KV+ RT  +   G+LL    F    D +        +R+ GVLQR
Sbjct: 64  VIPRMQKAGSQVFWRKVLKRTFLIFIIGLLLNWFPFVQWKDGILTFKHWENVRILGVLQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IA +Y   +++  + K+ +                   +++  +L+VY  L       D 
Sbjct: 124 IAFAYFFAAIIAYYFKEKKVL-----------------IISFLLLIVYWLLALLLGGAD- 165

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINHMYHHPAWRRSK 263
                                       P +  G+    +D  +LG +HMYH        
Sbjct: 166 ----------------------------PYSMQGFWGTRVDLAILGESHMYHG------- 190

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                   EG            PF+PEG + ++SS    ++G   G +I+  +G +  L 
Sbjct: 191 --------EG-----------VPFDPEGFVGAISSTAQVLLGYLAGKIIM-AQGEVNWLF 230

Query: 324 QWVTMGFALLIFGLTLHFTNA-------------IPLNKQLYTLSYVCVTSGAAALVFSA 370
                   L    L++ F +A              P+ K++++ +YV  T+G A +  S 
Sbjct: 231 VRAPKTSELHYKVLSMLFVSAGILLVVAYVWQLDFPIIKKIWSSTYVLYTTGLAIITISI 290

Query: 371 IYALVDIWNLK 381
           +   +++   K
Sbjct: 291 MIWFIEVLKAK 301


>gi|392548092|ref|ZP_10295229.1| hypothetical protein PrubA2_17028 [Pseudoalteromonas rubra ATCC
           29570]
          Length = 373

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 108/415 (26%), Positives = 167/415 (40%), Gaps = 114/415 (27%)

Query: 31  KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS--HAPWNGCNLADFVM 77
             +RLASLD  RG+ +           AL +L    G    E    H+ W+G    D + 
Sbjct: 4   NKKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKIFEAQTLHSAWHGFTFYDLIF 63

Query: 78  PFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           P F+F+ GVA+ L  KRI      +R     K I R   L  +G+L   G+         
Sbjct: 64  PLFIFLSGVAMGLRPKRIDHLPMAERKPIYIKAIKRLGLLCLFGVLYNHGWGTGIPA--- 120

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
             D   IR   VL RIA+++   +++             VG                 +L
Sbjct: 121 --DFGEIRYASVLGRIAIAWFFCAMLVWHCSLKTTALTGVG-----------------IL 161

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GIN 251
           + Y  LL   ++P                   V  G   +L P  +   ++D+ +L GI 
Sbjct: 162 LAYWLLL--CFIP-------------------VPGGSAGELTPAGSWNAWVDQALLPGIT 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           +                               + P +PEG+LSS  +I++ I GV  G +
Sbjct: 201 YQ------------------------------NRPVDPEGILSSFPAIVNAIAGVFAGQL 230

Query: 312 IIHTKGHLARLKQWVTMG--FALLIFGLTLHFTNAI--PLNKQLYTLSYVCVTSGAAALV 367
           I  +     +L QW   G  FA  I  L L +   +  P+NK+L+T S+V VT G +A+ 
Sbjct: 231 IAQSD----KLGQWQVAGRLFAAGIVSLALGWLWDLQFPVNKELWTSSFVLVTVGWSAIF 286

Query: 368 FSAIYALVDIWN---LKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFIN 411
            +  + LVDI N   L YPF+    IG N++++Y+ +        +  +F G IN
Sbjct: 287 LAVFFTLVDILNGQKLAYPFVI---IGANSIIIYLASSLVDWAFISRSVFGGIIN 338


>gi|313203961|ref|YP_004042618.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
 gi|312443277|gb|ADQ79633.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
          Length = 382

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 99/387 (25%), Positives = 158/387 (40%), Gaps = 89/387 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            R  +LDIFRG+ V  MI+V+ +G      WP + HA WNG    D V P FLF VG A+
Sbjct: 13  SRFTALDIFRGMTVCFMIIVNTSGNGATTYWP-LMHADWNGFTPTDLVFPSFLFAVGNAL 71

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGV 144
             A+KR     ++D + K+  RT  +   G L+     F    +       +   R+ GV
Sbjct: 72  GFAMKRWDTMKQSDVLLKIFKRTALIFLIGYLMYWFPFFRLNAESHLILSPISQTRIMGV 131

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL Y + +L+  +    +                  W+    +L  ++ LL     
Sbjct: 132 LQRIALCYGITALLVYYLGTKRTI----------------WVGVVSLLAYWVLLL----- 170

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                    +  A++ K                NAV  +D  +LG +H+Y    +     
Sbjct: 171 ------AFGEAGAEFSKT--------------GNAVLRLDIWLLGTHHLYGGEGF----- 205

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LA 320
                                PF+PEG+LS++ ++ + I G   G  +   KG     LA
Sbjct: 206 ---------------------PFDPEGVLSTLPALFNVIAGFAVGRYLQQQKGKSYESLA 244

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNL 380
           +L         + +  L   + + +P+NK+L+T SY  +T G   L+ S I    D    
Sbjct: 245 KLLL-----VGIGLLVLAYCWNSWMPINKKLWTSSYAVLTVGLDCLLLSVIIYFTDFLGK 299

Query: 381 KYP--FLPLAWIGMNAMLVYVMAAEGI 405
                F  +A  G N + +Y+M+  G+
Sbjct: 300 TKGSHFFIIA--GKNPLFIYLMSELGV 324


>gi|329963071|ref|ZP_08300851.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
           12057]
 gi|328529112|gb|EGF56042.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
           12057]
          Length = 381

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 98/391 (25%), Positives = 154/391 (39%), Gaps = 96/391 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             QRL +LD+ RGL +A MILV+        +  + HA WNG    D + PFFLF++GV+
Sbjct: 3   NNQRLVALDVMRGLTIAGMILVNTPETWSYVYAPLQHARWNGLTPTDVIFPFFLFMMGVS 62

Query: 88  IALALKRIPDRADA--VKKVIFRTLKLLFWGILL---------------QGGFSHAPDEL 130
           + ++LK+      +  + K+I R+L L   G  +               Q GF   P + 
Sbjct: 63  MYISLKKCSFHLSSHLLMKIIRRSLILFLIGTAIYALATFLGTLRDACRQPGFEGNPWKE 122

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
            +   +   R+ GVLQR+ + Y                    G  SI  L C H      
Sbjct: 123 AFA-SLPGTRIPGVLQRLGVCY--------------------GIGSIIVLTCRH------ 155

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
                       Y+P     I+    A Y   F +       ++ P N +  +DR + G 
Sbjct: 156 -----------RYIPHLAGGIL----AGY---FLILLFGNGFVHSPENILSVVDRTLFGD 197

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           N +                                  +PEG LS++ SI   +IG   G 
Sbjct: 198 NMIN-----------------------------DGGIDPEGALSTLPSIAQVLIGFCIGK 228

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           + I T     +L +    G  +LI G    F+   PLNK+++T S+V VT G A L+   
Sbjct: 229 ICIETPDMREKLNKIFLYGSLMLIVGWL--FSYGCPLNKRVWTPSFVLVTCGFACLLLGI 286

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           +   +D+  +          G+N +  YV++
Sbjct: 287 LIWYIDLRKVYKQTWTFEVFGVNPLFCYVLS 317


>gi|373953356|ref|ZP_09613316.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373889956|gb|EHQ25853.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 404

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 152/360 (42%), Gaps = 92/360 (25%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF-----RTLKLLFW 115
           ++ H+PWNG    D + P F+FI G+++  +  R   ++ +  K I+     RT+ L+  
Sbjct: 79  QLHHSPWNGFTFYDLIFPLFIFIAGISMPFSYNRQVAQSPSSNKQIYVRLIKRTVLLILL 138

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           G ++ G    A  + T        R   VL RIAL+    +++ + +             
Sbjct: 139 GTVVNGALHFAGYQQT--------RFASVLGRIALACFFAAVIYLNSS------------ 178

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
                  W  +  A +L+ Y  L+    VP             +G       GV   L P
Sbjct: 179 -----LRWQIIWFAVILLGYWLLMALVPVPG------------HG------AGV---LTP 212

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
             N   +ID+  L                        G L +         ++PEGLLS+
Sbjct: 213 GANLSAWIDQHFL-----------------------PGKLHRKV-------YDPEGLLST 242

Query: 296 VSSILSTIIGVHFGHVIIHTKGH-LARLKQWVTM---GFALLIFGLTLHFTNAIPLNKQL 351
           + +I + ++G+  GH +    G  L+ LK+   M   G +L++  L  +   A P+NK +
Sbjct: 243 IPAIATAMMGIFTGHFLQWEPGERLSPLKKIGIMAAAGISLILIALIWNM--AFPINKNM 300

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN 411
           +T S+     G + L+F+  Y ++D+   K    P+ WIG N++L+Y MAA     GFIN
Sbjct: 301 WTSSFTLYAGGWSLLLFTLFYGIIDVAGYKKWCQPMVWIGTNSILIY-MAAH----GFIN 355


>gi|320106288|ref|YP_004181878.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
 gi|319924809|gb|ADV81884.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
          Length = 394

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 108/398 (27%), Positives = 173/398 (43%), Gaps = 74/398 (18%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLF 82
           E     TQR+ S+D+ RGL VA MILV+  G     +  + HAPWNG    D V P FLF
Sbjct: 3   ETKAAPTQRILSVDVLRGLTVAFMILVNDPGDGHVAYAPLDHAPWNGWTPTDMVFPTFLF 62

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +VG +I  ++     R D+  ++  + ++   + + +       P +  YG   RM RL 
Sbjct: 63  LVGCSIVFSITSRLKRGDSKSRIALQVIRRTIYLLAINYAIRLIP-QFHYG---RM-RLF 117

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVL RIA+ YL+ +L+ ++ +  +                W  +    +LV Y AL+   
Sbjct: 118 GVLPRIAICYLIAALLFLWLQRAR----------------WIAVAVVTLLVGYWALMRFV 161

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            +P     + +    D    FN             N   YIDR   G N           
Sbjct: 162 PIPGAGMPVHDFPLMDQ---FN-------------NLPSYIDR---GFNDF--------- 193

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
              TQ     G L +          +PEG+LS++ ++ + ++GV  G ++   + +   +
Sbjct: 194 ---TQRFLHTGSLYEKTR-------DPEGILSTLPAVGTALLGVLNGKLL---RSNTPPV 240

Query: 323 K-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--WN 379
           K   + +G  +L   L   +    P NK L+T SYV + +G AAL+    + L D+  W 
Sbjct: 241 KVSAILLGGGVLSIALGYLWNPWFPFNKNLWTSSYVLLAAGIAALLLGISFFLFDVKKWQ 300

Query: 380 LKYPFLP-LAW----IGMNAMLVYVMAA-EGIFAGFIN 411
                +  LAW     G NA++ YV  +  G   G+I+
Sbjct: 301 HTSKMIRILAWPCIVFGSNAIVAYVTPSIYGKILGYIH 338


>gi|262381451|ref|ZP_06074589.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296628|gb|EEY84558.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 372

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 108/418 (25%), Positives = 161/418 (38%), Gaps = 120/418 (28%)

Query: 26  EKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPWN 68
           E+   + QRL SLD  RG           L VAL  L  +      AG    ++ H  WN
Sbjct: 2   ERQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEWN 57

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  S
Sbjct: 58  GLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLLS 117

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
              D L         R   VL RI L ++  +L                   +F  + W 
Sbjct: 118 FEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGWK 149

Query: 185 WL--MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
               + A +LV Y   +    VPD                     G         N VGY
Sbjct: 150 ARAGITALILVGYWLAMAFVPVPD--------------------AGGAGPFTLEGNLVGY 189

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILS 301
           IDR  L                               P   H   F+PEGL S+V +I +
Sbjct: 190 IDRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIAT 218

Query: 302 TIIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
            ++G+  G  I   K  L   K+    V  G  LLI GL   ++   P+NK+L+T S+VC
Sbjct: 219 AMLGMFTGEWIKLRKEGLTDRKKVLCLVGAGAVLLIVGLL--WSLVFPINKKLWTSSFVC 276

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAG 408
           V    +A +F+  + ++D+   +   L    IGMN++ +Y+          +E IF G
Sbjct: 277 VVGAYSAWMFALFFYIIDVLGWRKWTLFFTVIGMNSITIYLAQRFIRFSYTSEAIFGG 334


>gi|336316712|ref|ZP_08571601.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
 gi|335878877|gb|EGM76787.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
          Length = 363

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 161/376 (42%), Gaps = 84/376 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           + + R  +LD  RGLA+ALMILV+  G  W  +     HAPW+G   AD V P FLF+VG
Sbjct: 1   MSSPRFYALDALRGLAIALMILVNTPG-SWQHVYTPLLHAPWDGFTFADIVFPTFLFVVG 59

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            A+  +LK       ++ +V  R LKL+  G+LL        + + + VD+  +RL GVL
Sbjct: 60  AAMFYSLKTAVLSRQSLWRVSSRALKLIGIGVLL--------NYVPFTVDLAELRLPGVL 111

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L+Y L +L+ +  K               R Y   ++  A VL+ +LAL+ G    
Sbjct: 112 QRIGLAYWLAALLILTVK---------------RSYL-PFVAVALVLLYWLALVLG---- 151

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                 G     +   N V   D  + G  H+Y           
Sbjct: 152 ----------------------GGELPYSLEHNLVRQWDLAIFGAAHLYQG--------- 180

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                F+PEG LS++  +++ +IG      +   + H A     
Sbjct: 181 -----------------FGVAFDPEGFLSTLPCVVAVLIGFGSASFLQGKQQHQALRL-- 221

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
             +   L++  L L +    P+NK L++ SY+ +++G   L+ +A+   +DI        
Sbjct: 222 -LLLSGLVLVVLALLWHLLWPINKALWSGSYLALSNGLILLLLAALVWCIDIKGWARMAE 280

Query: 386 PLAWIGMNAMLVYVMA 401
           PL   G N + +Y+++
Sbjct: 281 PLKVYGTNPLFIYILS 296


>gi|410099160|ref|ZP_11294132.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409219182|gb|EKN12145.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 371

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 116/453 (25%), Positives = 183/453 (40%), Gaps = 121/453 (26%)

Query: 27  KSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGD--WPEISHAPWN-GCNL 72
           +   ++QRL SLD  RG           L VAL  LV     +    ++SHA W  G   
Sbjct: 2   EKQKQSQRLLSLDALRGFDMFFIMGGGSLFVALATLVPTPFFESIAAQMSHAKWGAGFTF 61

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
            D + P FLFI G++   +L++  +R    A   KK+I R + L+  G +  G       
Sbjct: 62  EDIIFPLFLFIAGISFPFSLEKQRERGMSEAAIYKKIIRRGITLVVLGFVYNG------- 114

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
                ++    R   VL RI L ++  +L+ + T+ +                   W++A
Sbjct: 115 --LLNLNFETQRYASVLARIGLGWMFGALIFVNTRTITRV----------------WIVA 156

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
           A +L+ Y  LL+    PD         +  + +  N+ C              YIDR +L
Sbjct: 157 A-ILIGYWLLLF-IPAPD------GNGAELFTREGNLAC--------------YIDRLLL 194

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVH 307
                                          P   H   ++PEG+LS++ +I + ++G+ 
Sbjct: 195 -------------------------------PGRLHGGNYDPEGILSTLPAIGTALLGMF 223

Query: 308 FGHVIIHTKGHLARLKQWVTM---GFALLIFGL--TLHFTNAIPLNKQLYTLSYVCVTSG 362
            G  +   +  L   K+ V M   G  LL+ GL   L F    P+NK L+T S+VC   G
Sbjct: 224 TGEFVKLRREGLTETKKVVYMLAVGGCLLVIGLLWGLFF----PINKYLWTSSFVCTVGG 279

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV--------MAAEGIFAGFINGWY 414
            +A++F+  Y +VD+   +   L    IG N++ +Y+          A  +F GFI    
Sbjct: 280 ISAILFAVFYYIVDVKECRGWTLFFTVIGTNSITIYLAQVFINFTFTANAVFGGFI---- 335

Query: 415 YGDPHNTLVCFLFIISYILHSFLWELRKFLYVQ 447
            G    T    L  I+YI  +  W    FLY Q
Sbjct: 336 -GLFPETAQPLLGAIAYI--AVCWGFLYFLYRQ 365


>gi|255013328|ref|ZP_05285454.1| hypothetical protein B2_05430 [Bacteroides sp. 2_1_7]
 gi|410103821|ref|ZP_11298742.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
 gi|409236550|gb|EKN29357.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
          Length = 372

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 103/417 (24%), Positives = 163/417 (39%), Gaps = 118/417 (28%)

Query: 26  EKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPWN 68
           EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  WN
Sbjct: 2   EKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEWN 57

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  S
Sbjct: 58  GLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLLS 117

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
              D L         R   VL RI L ++  +L                   +F  + W 
Sbjct: 118 FEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGWK 149

Query: 185 WLMAACVLVV---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
                 VL++   +LA+ +   VPD                     G         N VG
Sbjct: 150 VRAGITVLILVGYWLAMAF-VPVPD--------------------VGGAGPFTLEGNLVG 188

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSIL 300
           YIDR  L                               P   H   F+PEGL S+V +I 
Sbjct: 189 YIDRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIA 217

Query: 301 STIIGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           + ++G+  G  I +  +G   R K+   +G   ++  + L ++   P+NK+L+T S+VCV
Sbjct: 218 TAMLGMFTGEWIKLRKEGLTDRKKELCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVCV 277

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAG 408
               +  +F+  + ++D+   +   L    IGMN++ +Y+          +E IF G
Sbjct: 278 VGAYSVWMFALFFYIIDVLGWRKWTLFFTVIGMNSITIYLAQRFIRFSYTSEAIFGG 334


>gi|383643230|ref|ZP_09955636.1| hypothetical protein SeloA3_10334 [Sphingomonas elodea ATCC 31461]
          Length = 382

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 166/380 (43%), Gaps = 90/380 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R  +LD+FRG  + LMILV+ +G     +P++ HA W G  LAD V P FLF +G A++ 
Sbjct: 17  RFLALDVFRGATIFLMILVNTSGPGAEPYPQLVHAKWIGFTLADLVFPTFLFAMGNAMSF 76

Query: 91  ALKRI----PDRADAVKK--VIFRTLKLLFW-GILLQG--GFSHAPDELTYGVDVRMIRL 141
           A ++     P  A   ++  +IF    L++W   + QG  G++  P  LT        R+
Sbjct: 77  AFRKPVATGPFLARLFRRGAIIFVLGYLMYWFPFVEQGPDGWALKPFALT--------RV 128

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL Y+L  L+                          WL    +L+  +A+L G
Sbjct: 129 PGVLQRLALCYVLAGLMI------------------------RWLKPRQLLLAGIAMLLG 164

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                W   ++   +   G  F+    +  +          ID  +LG  H+Y       
Sbjct: 165 Y----WTILLVFSPA---GMAFDKYANIGTQ----------IDLWLLGPGHLY------- 200

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         +KD      A F+PEGLL ++ + ++ I G   G  I+        
Sbjct: 201 --------------KKD------AGFDPEGLLGTLPATVNVIAGYLAGLAIVQGGDLRRT 240

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           + +   +G AL++ GL   ++   P+ K+L+T SYV +T G   ++ + +  L++I   K
Sbjct: 241 VGRMALVGAALVLAGLA--WSPWFPIAKKLWTGSYVLLTVGIDLVLLAGVIGLIEIAGFK 298

Query: 382 YPFLPLAWIGMNAMLVYVMA 401
                   +G N + +Y+ +
Sbjct: 299 RGTRFFTILGHNPLAIYLFS 318


>gi|116748970|ref|YP_845657.1| hypothetical protein Sfum_1534 [Syntrophobacter fumaroxidans MPOB]
 gi|116698034|gb|ABK17222.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB]
          Length = 374

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 99/395 (25%), Positives = 162/395 (41%), Gaps = 82/395 (20%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
            S     RLASLD FRG  +A MILV+  G     + ++ HA WNG   AD + P FLF+
Sbjct: 2   NSPTTNTRLASLDAFRGAVIAGMILVNSPGRWVYTYSQLKHAQWNGWTFADTIFPAFLFV 61

Query: 84  VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           VGV++  +  R  +  +   +++ +  +      LL    +   D   +G +   +R+ G
Sbjct: 62  VGVSMVFSFSRRRECEEPAWRLVLQVFRRTSLIFLLGLLLNVMLD--FHGSN---LRIPG 116

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQRIA  Y + SL+ + T               FR                     G  
Sbjct: 117 VLQRIAACYFVASLIVLGTG--------------FR---------------------GQA 141

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +       +     ++  V  +  GV   L P  N   Y+D  +L   HM+ H       
Sbjct: 142 IWALGLLALYWLLMEFYPVPGIGAGV---LEPGRNFASYVDSLLLD-GHMWSH------- 190

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                               +  ++PEG++S++ ++ ST+ GV  GH +  T    A+ K
Sbjct: 191 --------------------YRTWDPEGIISTIPAVSSTLFGVLTGHFLRST--FSAKAK 228

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
               +G    +  L    +  +P+NK ++T SY    +G +    +  Y L+D+ + K  
Sbjct: 229 TAGMLGAGAALLALGRFCSIWLPINKNIWTSSYSIFMTGLSLAGLAVFYWLIDVKDRKRW 288

Query: 384 FLPLAWIGMNAMLVYVM------AAEGIFAGFING 412
            +P    G NA+  Y++      AA GI   F +G
Sbjct: 289 AIPFEIFGTNAITAYMLSMFLLIAARGIDWTFSDG 323


>gi|298376668|ref|ZP_06986623.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
 gi|298266546|gb|EFI08204.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
          Length = 372

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 108/418 (25%), Positives = 161/418 (38%), Gaps = 120/418 (28%)

Query: 26  EKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPWN 68
           EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  WN
Sbjct: 2   EKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEWN 57

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  S
Sbjct: 58  GLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLLS 117

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
              D L         R   VL RI L ++  +L                   +F  + W 
Sbjct: 118 FEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGWK 149

Query: 185 WL--MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
               + A +LV Y  ++    VPD                     G         N VGY
Sbjct: 150 ARAGITALILVGYWLVMAFVPVPD--------------------AGGAGPFTLEGNLVGY 189

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILS 301
           IDR  L                               P   H   F+PEGL S+V +I +
Sbjct: 190 IDRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIAT 218

Query: 302 TIIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
            ++G+  G  I   K  L   K+    V  G  LLI GL   ++   P+NK+L+T S+VC
Sbjct: 219 AMLGMFTGEWIKLGKEGLTDRKKVLCLVGAGAVLLIVGLL--WSLVFPVNKKLWTSSFVC 276

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAG 408
           V    +  +F+  + ++D+   +   L    IGMN++ +Y+          +E IF G
Sbjct: 277 VVGAYSVWMFALFFYIIDVLGWRKWTLFFTVIGMNSITIYLAQRFIRFSYTSEAIFGG 334


>gi|322436067|ref|YP_004218279.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
 gi|321163794|gb|ADW69499.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
          Length = 391

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/385 (26%), Positives = 161/385 (41%), Gaps = 87/385 (22%)

Query: 34  RLASLDIFRGLAVALMILVDH---AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R+ S+D+ RGL +ALMILV+    AG  +P++ HA WNG   AD V P FLF+ G ++  
Sbjct: 16  RVLSIDVLRGLTIALMILVNDPGDAGCVYPQLQHAEWNGYTAADLVFPNFLFLGGASLVF 75

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           +L+    R  DR +  + +  R + L+   ++L          +     +R IR+ GVL 
Sbjct: 76  SLQGRIERGADRWELARGLGRRGVNLIALKLVL---------AMLPSFRLRRIRIFGVLF 126

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           R A+  +   L+ + T ++      VG                       A+L G Y   
Sbjct: 127 RTAVCSVAGGLILLGTLEIPMLVGIVG-----------------------AMLTGYY--- 160

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                       +G++ N        L+P  N    +DRKV    H+ H           
Sbjct: 161 ------GALRISFGRM-NAPL-----LDPENNLAAALDRKV---AHLLH----------- 194

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH---TKGHLARLK 323
                 G L   A    +   +PEGLLSSV ++ +T++G     V+ H   T+G     K
Sbjct: 195 ------GELHTGA--LYNVTHDPEGLLSSVPAVGTTLLGAVAALVMRHPRLTQGQ----K 242

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
             V     +   G+   +  + P+NK L+T SYV V  G + L    IY  +D+      
Sbjct: 243 VGVLAASGVGALGVGSVWGRSFPVNKNLWTSSYVLVAGGWSLLALGGIYWCLDVRRPSAR 302

Query: 384 FL----PLAWIGMNAMLVYVMAAEG 404
            L    P    G NA++ Y ++  G
Sbjct: 303 TLRAIRPAQIFGANALVAYAVSVAG 327


>gi|423223641|ref|ZP_17210110.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638266|gb|EIY32113.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 395

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 97/405 (23%), Positives = 168/405 (41%), Gaps = 102/405 (25%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMP 78
           + Q K++   +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V P
Sbjct: 2   NSQTKTN---KRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFP 57

Query: 79  FFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDE 129
           FF+FI+G++  ++LK+   +    A  K++ RT+ +   G+ + G FS        + + 
Sbjct: 58  FFMFIMGISTYISLKKYNFKFSHAAALKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEG 116

Query: 130 LTYGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
           L++G  +         IR+ GV+QR+AL Y   +++ +  K                   
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP-------------- 162

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
               + A +LV Y  LL                           CG     N   N +  
Sbjct: 163 ---YLIATLLVGYFILL--------------------------MCGNGFAYN-ETNILSI 192

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           +DR +L   HMY                      KD         +PEGLLS++ SI   
Sbjct: 193 VDRAILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHV 224

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYV 357
           ++G   G +++ +    +R     +    L + G  L FT        P+NK++++ ++V
Sbjct: 225 LLGFCVGRMMLDSNKAESREALLNSHLIKLFLVGAILTFTGFLLSYGCPINKKIWSPTFV 284

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             T G A+   + +  ++D+   K         G+N + +YV+  
Sbjct: 285 LTTCGLASSFLALLIWIIDVKGYKKWCTFFEAFGVNPLFMYVLGG 329


>gi|374311063|ref|YP_005057493.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358753073|gb|AEU36463.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 435

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 155/368 (42%), Gaps = 83/368 (22%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP----EISHAPWNGCNLADFVMP 78
            Q E++  K  R+ S+D+ RG+ +ALMILV+  G DW     ++ HA WNG  L D V P
Sbjct: 38  SQTERTVSKPGRVLSVDVLRGITIALMILVNDPG-DWDHIFGQLDHAAWNGWTLTDMVFP 96

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT-YGVDV- 136
            FLF++G +I  +L+    R +               G L    F+ A   L  Y V   
Sbjct: 97  AFLFLMGASIIFSLQARIARGNCK-------------GTLAGHIFARAGKILALYWVLAF 143

Query: 137 --RM---IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
             RM   IR  GVL RIAL YLL SLV + T+ V+     V                A +
Sbjct: 144 FPRMHWTIRWFGVLPRIALCYLLASLVLLATRRVRVLIAIV----------------AFL 187

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           LV Y  LL    VP     + +    D                   N   +IDR V   +
Sbjct: 188 LVGYWVLLRWVPVPGLGTPMRDIPFMDQN----------------ANLASWIDRGVSSWS 231

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
             + H                G L +          +PEGLLS++ ++ +T++G   G  
Sbjct: 232 LRWLH---------------TGTLYRKTR-------DPEGLLSTLPAVATTLLGALAGMW 269

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLH---FTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           +I+ +  +  +++ + +G A +          ++   P+NK L+T S+V + +G  AL  
Sbjct: 270 MINGQRVVNGMRR-MRIGLAAMGVAGVAAGVLWSRWFPINKNLWTSSFVLLMAGWTALAL 328

Query: 369 SAIYALVD 376
           +    L+D
Sbjct: 329 AGCSWLID 336


>gi|404256028|ref|ZP_10959996.1| hypothetical protein SPAM266_22806 [Sphingomonas sp. PAMC 26621]
          Length = 392

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 169/376 (44%), Gaps = 63/376 (16%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +LD+ RGLAVA MILV   G DW +    + HA WNG  LAD V P FLF VG+A+ L+ 
Sbjct: 2   ALDVLRGLAVAGMILVVSPG-DWGQAYVQLQHANWNGATLADMVFPTFLFSVGIALGLSF 60

Query: 93  KRIPDRADAVK----KVIFRTLKLLFWGILLQGGFS---HAPDELTYGVDVRMIRLCGVL 145
            R  + A   +    +++ RT  L+  G+L++  +     A      G  +  IR+ G+L
Sbjct: 61  PRRLETAGDRRLFWTRLLRRTALLILLGLLVEATYVWTIAAGAPYPGGPGLAHIRIPGIL 120

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L Y L  ++ + T   +D D   G   I  L      +  C++V+ +         
Sbjct: 121 QRIGLCYGLAGILLLATNR-RDPD---GMIRINPLA-----IVGCIVVILIGY------- 164

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
            W   I       +  V     GV   L P  N  G++DR +    H++         + 
Sbjct: 165 -WLLLI-------FVPVPGFGAGV---LTPAGNLPGFVDRTLFTEPHLWPL------GSA 207

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
           T   P              A ++PEGLLS++ +  + + G+     +   + +  R   +
Sbjct: 208 TAARP--------------ATYDPEGLLSTLPATANVLFGILSAWAL---RRYPDRALGY 250

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 385
           + +  A L+F   L     + +NK+++T S+  ++ G +AL  +A+  ++          
Sbjct: 251 IAV-VAALLFSAGLALDPLLVINKRIWTSSFAVLSGGVSALALTALMVVLRSRGAALMLT 309

Query: 386 PLAWIGMNAMLVYVMA 401
           P   +G NA+L ++++
Sbjct: 310 PFQVLGGNAVLAFLIS 325


>gi|301309930|ref|ZP_07215869.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423340409|ref|ZP_17318148.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831504|gb|EFK62135.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409227844|gb|EKN20740.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
           CL09T03C24]
          Length = 372

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 107/416 (25%), Positives = 160/416 (38%), Gaps = 116/416 (27%)

Query: 26  EKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPWN 68
           EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  WN
Sbjct: 2   EKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEWN 57

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  S
Sbjct: 58  GLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLLS 117

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
              D L         R   VL RI L ++  +L                   +F  + W 
Sbjct: 118 FEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGWK 149

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
                 VL++    L   +VP         D+   G                 N VGYID
Sbjct: 150 VRAGITVLILVGYWLAMAFVP-------VPDAGGAGP-----------FTLEGNLVGYID 191

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTI 303
           R  L                               P   H   F+PEGL S+V +I + +
Sbjct: 192 RLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATAM 220

Query: 304 IGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +G+  G  I   K  L   K+    V  G  LLI GL   ++   P+NK+L+T S+VCV 
Sbjct: 221 LGMFTGEWIKLRKEGLTDRKKVLCLVGAGAVLLIVGLL--WSLVFPINKKLWTSSFVCVV 278

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAG 408
              +  +F+  + ++D+   +   L    IGMN++ +Y+          +E IF G
Sbjct: 279 GAYSVWMFALFFYIIDVLGWRKWTLFFTVIGMNSITIYLAQRFIRFSYTSEAIFGG 334


>gi|126173329|ref|YP_001049478.1| hypothetical protein Sbal_1087 [Shewanella baltica OS155]
 gi|386340088|ref|YP_006036454.1| hypothetical protein [Shewanella baltica OS117]
 gi|125996534|gb|ABN60609.1| conserved hypothetical protein [Shewanella baltica OS155]
 gi|334862489|gb|AEH12960.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS117]
          Length = 387

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 105/409 (25%), Positives = 169/409 (41%), Gaps = 107/409 (26%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAG------GDWPEISHAPWNGCNLADFV 76
           RL SLD  RG  +            L+IL   AG      GD  ++ H+ W+G +  D +
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWAGWQWGD-EQMHHSQWHGFHFYDLI 76

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVD 135
            P F+F+ GVA+ L+ KR+       +  ++R  +K LF  +LL   ++H         D
Sbjct: 77  FPLFIFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPAD 135

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
              IR   VL RIA ++   +L+                        WH  +   ++V  
Sbjct: 136 PEKIRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA- 171

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GIN 251
           L +L G                 YG +        G    L+P  +   Y+D  +L G++
Sbjct: 172 LGILLG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSILLPGVS 214

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           +    P                              +PEGLLS++ +I++ + GV  GH 
Sbjct: 215 YQGRTP------------------------------DPEGLLSTIPAIVNALAGVFVGHF 244

Query: 312 II--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           I+  H KG  A++      G   L FG  L     IP+NK+L+T S+V VTSG + ++ +
Sbjct: 245 IVKSHPKGEWAKVGLLAAAGCVCLTFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLA 302

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
             YALVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 303 LFYALVDVLKWQKAAFFFVVIGTNAIIIYLASSLVDWKYIAQSVFGGLV 351


>gi|239907232|ref|YP_002953973.1| hypothetical protein DMR_25960 [Desulfovibrio magneticus RS-1]
 gi|239797098|dbj|BAH76087.1| hypothetical membrane protein [Desulfovibrio magneticus RS-1]
          Length = 371

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 157/379 (41%), Gaps = 92/379 (24%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL+S+D  RGLA+A MI+V++ G     +P++ HA W+G  LAD V P FLF+VGV +AL
Sbjct: 8   RLSSVDTLRGLAIAAMIVVNNPGDRRFVYPQLLHAQWHGLTLADVVFPLFLFLVGVCVAL 67

Query: 91  ALKRIPD-------RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           A+   PD       RA   +K++ R   L   G+        + DEL         RL G
Sbjct: 68  AID--PDKPRDAEARARLWRKILPRAAVLFALGLGENAYLRLSFDEL---------RLPG 116

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQRIA+ YL  + ++     +  +  +V              + A  L+ Y  LL    
Sbjct: 117 VLQRIAVVYLAAAWLQ---SRLSSRALAV--------------VGAVTLLGYWLLLAAVP 159

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP      +  +                      N  G++D+ VLG +    H +W    
Sbjct: 160 VPGHGHPSLAMEP---------------------NLQGWLDQLVLGRHIWKFHTSW---- 194

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                   +PEG+LS+  +I   +IGV  G  +        R  
Sbjct: 195 ------------------------DPEGILSTFPAIALGLIGVLAGRWLRRGGDRPGRAG 230

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYP 383
                   LL+  + L +    PLNK L T S+V +T G    + +  +A++D       
Sbjct: 231 L-----LGLLLIAIGLAWDAVFPLNKSLCTSSFVLLTGGLGLAMLAVAHAVLDGRPPAAW 285

Query: 384 FLPLAWIGMNAMLVYVMAA 402
             PL  +G N + +YV A+
Sbjct: 286 ARPLGILGRNPLFLYVTAS 304


>gi|357624248|gb|EHJ75102.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
           plexippus]
          Length = 340

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 83/307 (27%), Positives = 133/307 (43%), Gaps = 40/307 (13%)

Query: 98  RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
           R +A+ +V  R+L L   GI L           +   +   +R  GVLQR+A  YL+V  
Sbjct: 19  RVNALGQVARRSLLLSLIGICLG----------SVNTNWSYVRFPGVLQRLAAMYLIVGS 68

Query: 158 VEI-FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS 216
           +E  F +  Q+       F         WL    ++ + L +      P           
Sbjct: 69  LECAFMRTSQNIIPGRSLFRDIAAGWQQWLATVLMVAIQLCITLTVAAPGCPV-----GY 123

Query: 217 ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLR 276
           +  G +     G  +  N      GYIDR +LG NH+Y H              F+   R
Sbjct: 124 SGPGGLHRTATGDFSLQNCTGGIAGYIDRLILGPNHLYQH------------GTFKSIYR 171

Query: 277 KDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV--TMGFALLI 334
              P       +PEG+L  +S +L    G H   +++      AR+ +WV  ++ F + +
Sbjct: 172 TQLPH------DPEGILGILSGVLVVQAGAHAARIMLVYNHARARIMRWVFWSVMFGV-V 224

Query: 335 FGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGM 392
            GL   F++   IP+NK L+++SY  VTS  A  + + +Y +VD+ N K+   PL + G 
Sbjct: 225 GGLLCKFSDGGYIPVNKNLWSVSYCLVTSSMAFFIQAILYFVVDLKN-KWGGRPLYYAGQ 283

Query: 393 NAMLVYV 399
           NA+ +YV
Sbjct: 284 NALFLYV 290


>gi|312131163|ref|YP_003998503.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
           17132]
 gi|311907709|gb|ADQ18150.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
           17132]
          Length = 413

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 103/398 (25%), Positives = 159/398 (39%), Gaps = 127/398 (31%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           L  QR+ SLD+FRGL + LMI V++ G DW  +     HA W+G    D V PFF+F +G
Sbjct: 3   LAKQRIVSLDVFRGLTMILMITVNNPG-DWSNVYAPLLHAEWHGWTPTDLVFPFFVFAMG 61

Query: 86  VAIALALK-----------RIPDR---------------------ADAVKKVIFRTLKLL 113
           +A+  ++K           +I  R                     A  +  ++FR +   
Sbjct: 62  MALPFSMKPGSGLSKDDFLKILARSARLIALGLFLNFFSKIEFGNAQGITLLLFRLMITG 121

Query: 114 FWGILLQGGFSHAPDELTYGVDV--------------RMIRLCGVLQRIALSYLLVSLVE 159
           F G LL G F       T    +                +R+ GVLQR+   Y   +++ 
Sbjct: 122 FVGFLLMGNFPTKIKLYTALALLGLMLALAYSGLPHFAQVRIPGVLQRLGTVYFFAAILY 181

Query: 160 I-FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD 218
           + F+  VQ                  W +   VLV+Y  LL    VP    T   K    
Sbjct: 182 LAFSLRVQ------------------WGIGLSVLVIYWLLLAYIPVPGSGVTGFEKGE-- 221

Query: 219 YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKD 278
                              N   +ID  VLG +H+     W  SK               
Sbjct: 222 -------------------NLPAWIDSIVLG-DHV-----WSSSK--------------- 241

Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
                  P++PEG+LS++ +I+S ++G   G  +   K      K+ +  G  LLI GL 
Sbjct: 242 -------PWDPEGVLSTLPAIISCLLGAWAGVFLREDK------KKLLLTGVILLICGLA 288

Query: 339 LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             ++   P+NK L+T S+V +T+G  +++ S +  +VD
Sbjct: 289 --WSTFFPINKALWTSSFVLLTAGLGSIIVSLLGFVVD 324


>gi|408369302|ref|ZP_11167083.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
 gi|407745048|gb|EKF56614.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
          Length = 345

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 149/360 (41%), Gaps = 83/360 (23%)

Query: 49  MILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA--V 102
           MI+V+  G  W  +    SHA W+G  L D V P FLF+VG A++ ++++    + A  +
Sbjct: 1   MIIVNTPG-SWGSVYRPLSHASWHGFTLTDLVFPTFLFVVGNAMSFSMRKFEQTSQAAFL 59

Query: 103 KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
           KKVI RT  +   G LL         +L    D R+    GVLQRIAL Y   SLV    
Sbjct: 60  KKVIKRTFVIFAIGFLLSWFPFFRDGQLKPLEDARIF---GVLQRIALCYFFASLVI--- 113

Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
                                H+      LV  +  L G       + +I     DY   
Sbjct: 114 ---------------------HYFKIKGALVFSMVALLG-------YHLIMYTMGDY--- 142

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
                          NA   +D  +LG NH+Y                 EG         
Sbjct: 143 -----------TLEGNAALKLDLWLLGPNHLYQG---------------EG--------- 167

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT 342
              PF+PEGLLS++ + ++ I G +F  + +   G   +    + +G A L+F L L + 
Sbjct: 168 --IPFDPEGLLSTLPATVNVIFG-YFAGLFLQQSGKNFKTIALLMIGGATLVF-LALGWD 223

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
              P+NK+++T SYV +T G   +V + +   ++I   K        +G N + +Y+++ 
Sbjct: 224 LLFPINKKIWTSSYVLLTVGIDVMVLAFLVYGIEILQKKKWTYFFEVLGKNPLFIYILSG 283


>gi|189464405|ref|ZP_03013190.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
           17393]
 gi|189438195|gb|EDV07180.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
           17393]
          Length = 395

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 95/405 (23%), Positives = 170/405 (41%), Gaps = 102/405 (25%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMP 78
           + Q K++   +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V P
Sbjct: 2   NSQTKTN---KRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFP 57

Query: 79  FFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDE 129
           FF+FI+G++  ++LK+        A  K++ RT+ +   G+ + G FS        + + 
Sbjct: 58  FFMFIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEG 116

Query: 130 LTYGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
           L++G  +         IR+ GV+QR+AL Y   +++ +  K                   
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP-------------- 162

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
             +L+A  ++  ++ L+ G                  G V+N T           N +  
Sbjct: 163 --YLIATLLVGYFILLMCGN-----------------GFVYNET-----------NILSI 192

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           +DR +L   HMY                      KD         +PEGLLS++ SI   
Sbjct: 193 VDRAILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHV 224

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYV 357
           ++G   G +++ +     R     +    L + G  L F         P+NK++++ ++V
Sbjct: 225 LLGFCVGRMMLDSNKTENRETLLNSHLIKLFLIGTILTFAGFLLSYGCPINKKIWSPTFV 284

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             T G A+   + +  ++D+   K         G+N + +YV+  
Sbjct: 285 LTTCGLASSFLALLIWIIDVKGYKKWCTFFEAFGVNPLFMYVLGG 329


>gi|340786861|ref|YP_004752326.1| protein involved in N-acetyl-D-glucosamine utilization [Collimonas
           fungivorans Ter331]
 gi|340552128|gb|AEK61503.1| protein involved in N-Acetyl-D-glucosamine utilization [Collimonas
           fungivorans Ter331]
          Length = 373

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 104/377 (27%), Positives = 161/377 (42%), Gaps = 99/377 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           T+RLAS+D  RG  VA M+LV+  G DW  +     H+ W+GC   D V PFFLF+VGV+
Sbjct: 22  TRRLASVDALRGCTVAAMLLVNDPG-DWSHVYAPLEHSAWHGCTPTDLVFPFFLFVVGVS 80

Query: 88  IALALK-RIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM-IRLC 142
            AL ++ R+   A+     +  + R L+++  G+L+        + L + +   + +RL 
Sbjct: 81  TALGIEPRLAQGANPSTLARAALIRALRIVALGLLI--------NLLAWFIMPGVHLRLP 132

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YG 201
           GVLQRI L +   +L  I+T+                     W +   +L+ Y  LL  G
Sbjct: 133 GVLQRIGLCFAATALCSIYTRPRT-----------------QWGLIVAILLGYWGLLTLG 175

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
             +  W       DSA +G                   V  ID     ++   H P    
Sbjct: 176 GSLEPWLNLASRSDSALFGHF-----------------VYQID----AVSGRGHDP---- 210

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                       EGLL ++ S+ ++++G+  G  +        R
Sbjct: 211 ----------------------------EGLLGTLPSLATSLLGLCAGRWLREN-----R 237

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLK 381
           LK  +    A L+ G TL ++ A+PLNK L+T S+V   +G A L     + LVD  N  
Sbjct: 238 LKPLLFAAVAALVLG-TL-WSLALPLNKNLWTSSFVLWCAGWATLALLLFHWLVDQRN-- 293

Query: 382 YPFLPLAWIGMNAMLVY 398
           +P L     G+NA+  Y
Sbjct: 294 WPALGRR-FGLNAVAAY 309


>gi|388456506|ref|ZP_10138801.1| hypothetical protein FdumT_08017 [Fluoribacter dumoffii Tex-KL]
          Length = 372

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/138 (37%), Positives = 82/138 (59%), Gaps = 13/138 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +T+R+ SLD+FRGL +ALM+LV+  G   ++  + H  WNGC+LAD V P FLFIVG+  
Sbjct: 7   ETKRILSLDVFRGLTMALMVLVNSLGSRENYKILMHVEWNGCSLADLVFPAFLFIVGITT 66

Query: 89  ALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            ++L+R      +A   + ++ RTL L    +      +  P +    +D+  IR+ G+L
Sbjct: 67  VISLQRHLNDESKAQLYRSILTRTLLL----MFFGLFLNIFPKQ----IDLSTIRIYGIL 118

Query: 146 QRIALSYLLVSLVEIFTK 163
           QRIA  YL+ S++ + T 
Sbjct: 119 QRIAWCYLICSILYLHTS 136



 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 63/114 (55%), Gaps = 4/114 (3%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNAI 345
           F+PEG LS++ +I +T+ G+  G  ++     L +    +T+G   L + G   H+    
Sbjct: 193 FDPEGFLSTIPAIATTLSGLLVGQYLLTPGSKLKKSFTLITIGILCLFVAGFWNHY---F 249

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P+NK L+T ++V  +SG + +VF   + ++D+       LP   +GMNA+ +++
Sbjct: 250 PINKNLWTSTFVLWSSGFSLIVFGLCFFVIDVLGYTKWSLPFKILGMNALFIFI 303


>gi|146292182|ref|YP_001182606.1| hypothetical protein Sputcn32_1079 [Shewanella putrefaciens CN-32]
 gi|145563872|gb|ABP74807.1| conserved hypothetical protein [Shewanella putrefaciens CN-32]
          Length = 384

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 100/405 (24%), Positives = 168/405 (41%), Gaps = 102/405 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G +  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +L+                        WH  +   V+V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQVIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G     +P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAMQLWLPFPSGQAGVFSPTQSINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +P+GLLS++ ++++ + GV  G+ I+  
Sbjct: 216 TP------------------------------DPQGLLSTIPAVVNALAGVFVGYFIVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H +G   ++    T G A L  G  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPQGEWVKVGLLATAGGAWLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMILLALFYA 303

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFI 410
           LVD+   +        IG NA+++Y+ +        A+ +F G +
Sbjct: 304 LVDVLKWQKVAFFFVVIGTNAIIIYLASSIVDWKYIAQSVFGGLV 348


>gi|150007980|ref|YP_001302723.1| hypothetical protein BDI_1342 [Parabacteroides distasonis ATCC
           8503]
 gi|256840846|ref|ZP_05546354.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|423331513|ref|ZP_17309297.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
           CL03T12C09]
 gi|149936404|gb|ABR43101.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
 gi|256738118|gb|EEU51444.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|409230083|gb|EKN22951.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
           CL03T12C09]
          Length = 372

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 102/414 (24%), Positives = 160/414 (38%), Gaps = 112/414 (27%)

Query: 26  EKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPWN 68
           EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  WN
Sbjct: 2   EKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEWN 57

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  S
Sbjct: 58  GLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLLS 117

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
              D L         R   VL RI L ++  +L                   +F  + W 
Sbjct: 118 FEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGWK 149

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
                 VL++    L   +VP         D+   G                 N VGYID
Sbjct: 150 VRAGITVLILVGYWLAMAFVP-------VPDAGGAGP-----------FTLEGNLVGYID 191

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTI 303
           R  L                               P   H   F+PEGL S+V +I + +
Sbjct: 192 RLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATAM 220

Query: 304 IGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
           +G+  G  I +  +G   R K    +G   ++  + L ++   P+NK+L+T S+VCV   
Sbjct: 221 LGMFTGEWIKLRKEGLTDRNKVLCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVCVVGA 280

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAG 408
            +  +F+  + ++D+   +   L    IGMN++ +Y+          +E IF G
Sbjct: 281 YSVWMFALFFYIIDVLGWRKWTLFFTVIGMNSITIYLAQRFIRFSYTSEAIFGG 334


>gi|399088486|ref|ZP_10753550.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
 gi|398030770|gb|EJL24173.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
          Length = 398

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 108/384 (28%), Positives = 162/384 (42%), Gaps = 64/384 (16%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLF 82
           + +     R+ +LD+ RGLAVA MILV   G     +  + HA W G  LAD V P FLF
Sbjct: 2   DSAKAGGGRIVALDVLRGLAVAGMILVTSPGAWAHAYAPLKHAAWQGWTLADLVFPTFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
            VGVAI L++ R+     A   +  +  +     ILL    +  P       D+  +R+ 
Sbjct: 62  CVGVAIGLSVPRLRIGEGASAALWIKVARRTALLILLGLVLNALPR-----FDLAHLRIP 116

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL-YCWHWLMAACVLVVYLALLYG 201
           GVLQRI L Y L S + I     +   Q        RL      L    +LV Y ALL  
Sbjct: 117 GVLQRIGLCYALASAICILPARAEADGQ-------LRLNVGGVVLAVVGLLVGYWALLTF 169

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
           T VP   F +   DS      F                   IDR V  I H++    W  
Sbjct: 170 TPVPG--FGVDRWDSQGALPAF-------------------IDRAVFTIPHLW---PWGT 205

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT-KGHLA 320
           ++                       ++PEGLLS+  + ++ ++G      +  T  G   
Sbjct: 206 TEGVGVT------------------YDPEGLLSTFPATVNVLLGAVAAAFLARTGDGRQG 247

Query: 321 RLK---QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
           R +     + +G AL++ GL L     +P+NK+L+T S+   +SGA+      +  ++  
Sbjct: 248 RGRVLAALLALGAALIVAGLALD--PIVPVNKRLWTPSFALFSSGASLAALVVLQVVLQA 305

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMA 401
              +    PL  +G NA+L +VM+
Sbjct: 306 RAAQLAAWPLTVLGGNAILAFVMS 329


>gi|427384458|ref|ZP_18880963.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727719|gb|EKU90578.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
           12058]
          Length = 395

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 96/402 (23%), Positives = 168/402 (41%), Gaps = 100/402 (24%)

Query: 27  KSHLKT-QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFL 81
            S +KT +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+
Sbjct: 2   SSQMKTNKRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFPFFM 60

Query: 82  FIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTY 132
           FI+G++  ++LK+        A  K++ RT+ +   G+ + G FS        P E + +
Sbjct: 61  FIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSKFCYYWTNPSEGIGF 119

Query: 133 GVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
           G  +         IR+ GV+QR+AL Y   +++ +  K                     +
Sbjct: 120 GAQLWESVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP----------------Y 163

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           L+A  ++  ++ L+ G                  G  +N T           N +  +DR
Sbjct: 164 LIATLLIGYFILLICGN-----------------GFAYNET-----------NVLSIVDR 195

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +L   HMY                      KD         +PEGLLS++ SI   ++G
Sbjct: 196 AILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHVLLG 227

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVT 360
              G +++      +R +   +    L + G  L F         P+NK++++ ++V  T
Sbjct: 228 FCVGRMMLDGNKAQSREELLNSHLIKLFLIGTILTFAGFLLSYGCPINKKIWSPTFVLTT 287

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            G A+   + +  ++D+   K         G+N + +YV+  
Sbjct: 288 CGLASSFLALLIWIIDVKGYKKWSTFFEAFGINPLFMYVLGG 329


>gi|343087500|ref|YP_004776795.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342356034|gb|AEL28564.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 386

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 99/403 (24%), Positives = 163/403 (40%), Gaps = 106/403 (26%)

Query: 33  QRLASLDIFRGL-------AVALMILVDHAGG----DW--PEISHAPWNGCNLADFVMPF 79
           +RL S+D  RG        A A ++L+    G    DW   +  H  WNG +  DF+ P 
Sbjct: 25  KRLLSIDALRGFDMLLIAGAGAFLVLLKGKTGIPAIDWIAGQFYHPAWNGFSFYDFIFPL 84

Query: 80  FLFIVGVAIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FLFI GV++  +L +  +    +    KK   R L L+  GIL    + ++P  +    +
Sbjct: 85  FLFIAGVSLTFSLNKGRNLGMSKPTLYKKTFSRMLVLILLGIL----YKNSPVPI---FE 137

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
              IR   VL RI ++  + +LV +              F  ++       +A  +LV+Y
Sbjct: 138 PSQIRYGSVLGRIGIATFVTTLVYL-------------NFDFYKRLG----IAMAILVLY 180

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
            A L+   VP             YG            L+   N VG+ DR  +       
Sbjct: 181 YAALFLIPVPG------------YGA---------GDLSIEGNLVGWFDRTFM------- 212

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                            G L+++        ++  GLL+ + ++  TI G   G ++   
Sbjct: 213 ----------------PGILKQEI-------YDELGLLTQIPALCLTIFGTLAGEILTKA 249

Query: 316 KGHLARLKQWVTMGFALLIFGLT--LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                ++KQ    G   L  GL   LHF    P+NK L++ S++ +TSG A L     Y 
Sbjct: 250 WLDTKKIKQLAIAGVISLTLGLIWDLHF----PINKHLWSSSFILLTSGMAFLTLLLFYV 305

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYV--------MAAEGIFAG 408
           ++D+W ++        IG+N++ +Y           +E +FAG
Sbjct: 306 VIDVWKIRKWAFFFQVIGLNSLTIYFAFSFINFRFTSEKLFAG 348


>gi|374373358|ref|ZP_09631018.1| membrane protein [Niabella soli DSM 19437]
 gi|373234331|gb|EHP54124.1| membrane protein [Niabella soli DSM 19437]
          Length = 375

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 97/384 (25%), Positives = 150/384 (39%), Gaps = 93/384 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           K  R  +LDIFRG+ +  MI+V+  G + +PE+ HA WNG  L D V P FLF VG AIA
Sbjct: 8   KPGRFLALDIFRGMTICFMIIVNTGGPNPFPELRHAQWNGFTLTDLVFPSFLFAVGNAIA 67

Query: 90  LALKRIPDRA--DAVKKVIFRTLKLLFWGILL----------QGGFSHAPDELTYGVDVR 137
            +  +   ++  + + K+I RT  L   G L+          Q      P   T      
Sbjct: 68  FSKSKWDQQSNKEVLTKIIKRTCLLFLIGYLMYWLPFVKIDAQNNIRPFPIGET------ 121

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             R+ GVLQRIAL Y + +L+  F                          A  +++  + 
Sbjct: 122 --RIFGVLQRIALCYGIGALIIRFAS------------------------ARTIIITSIC 155

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           LL G +        I     DY     V      KL          D  +   +H+Y   
Sbjct: 156 LLLGYW-------FIMMAFGDY----TVNGAAETKL----------DILLFTRDHLY--- 191

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                         + P R   P    + F      + V+ ++  + G+H    I     
Sbjct: 192 -------------IKDPARAFDPEGFLSTFP-----AIVNVLIGYLAGIH----IRKNPK 229

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
               L +    GF L+  G   +   A P+NK+L+T S+VC+T+G   L+ + I   +D 
Sbjct: 230 SYEMLARLAVAGFLLVALGYLWNL--AFPVNKKLWTSSFVCLTTGLDCLIIATILYFIDF 287

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMA 401
              +         G NA+ +Y+ +
Sbjct: 288 KEKRTGVFFFEVFGKNALFIYLFS 311


>gi|410663435|ref|YP_006915806.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
           21679]
 gi|409025792|gb|AFU98076.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
           21679]
          Length = 356

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 158/376 (42%), Gaps = 91/376 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
           QR  +LD  RGL +ALMI+V+  G  W  +     HA W G    D V PFFLFIVG ++
Sbjct: 3   QRYIALDALRGLTLALMIVVNTPG-SWAHVYGPLLHADWMGWTFTDLVFPFFLFIVGASL 61

Query: 89  ALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
             + K +    RAD ++K+I R+L L+              +   + V +  +RL GVLQ
Sbjct: 62  YFSQKGMASLTRADQLRKIIRRSLLLIV--------LGVLLEYYPFIVSLHELRLPGVLQ 113

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RI L++ + +L+ +F              + F L C        +L  Y  LL+ ++ P 
Sbjct: 114 RIGLAFGVAALLVVFVP------------ARFNLVC-----IGAILASYQVLLWLSHEP- 155

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                    S ++                  N V  +D  V G +H++            
Sbjct: 156 --------YSLEH------------------NLVRQVDLWVFGESHLWAGKG-------- 181

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                               F+PEGLLS+  S+ + + G      +    G   R  Q+ 
Sbjct: 182 ------------------LAFDPEGLLSTWPSVATVLAGFETARWL--RSGRQLRYLQFG 221

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF-L 385
             G   ++  +T  +  A+P+NK L+T  +V +T+G A     A+  L++ W L      
Sbjct: 222 LWGAGGVVLLMT--YALALPINKSLWTPGFVLLTAGLACWTL-ALMLLMEQWRLGAAIQR 278

Query: 386 PLAWIGMNAMLVYVMA 401
           PL  +G N + +YV++
Sbjct: 279 PLVSLGQNPLFIYVLS 294


>gi|329956032|ref|ZP_08296803.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
           12056]
 gi|328524791|gb|EGF51845.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
           12056]
          Length = 396

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 160/396 (40%), Gaps = 98/396 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 8   NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66

Query: 88  IALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV--- 136
             ++LK+   + + AV   I +   L+F   +  G FS       +P E +++G  +   
Sbjct: 67  TYISLKKYNFEFSRAVGMKILKRTILIFLIGMAIGWFSKFCYYWTSPTEGISFGAQLWES 126

Query: 137 ----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 IR+ GV+QR+AL Y   +++ +  K                       + A +L
Sbjct: 127 VWTFDRIRILGVMQRLALCYGATAIIALTVKHRNIP-----------------YLIATLL 169

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             Y  LL                           CG     N   N +  IDR +L   H
Sbjct: 170 TGYFILL--------------------------VCGNGFAYN-DTNILSVIDRTILTPAH 202

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY                                 +PEGLLS++ SI   ++G   G ++
Sbjct: 203 MYKDNG----------------------------IDPEGLLSTIPSIAHVLLGFCVGRMM 234

Query: 313 IHT-KGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSGAAAL 366
           +   K +  R     +    L + G  L F+        P+NK++++ ++V  T G A+ 
Sbjct: 235 LENGKANEDRESMLNSHLIKLFLAGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLASS 294

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             + +  ++D+   K   L     G+N + +YV+  
Sbjct: 295 FLALLIWIIDVKGYKKWSLFFESFGVNPLFMYVLGG 330


>gi|265765098|ref|ZP_06093373.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|263254482|gb|EEZ25916.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
          Length = 387

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 90/392 (22%), Positives = 160/392 (40%), Gaps = 97/392 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +RL +LD+ RG+ +A MI+V++ G  W  I     HA W G    D V PFF+FI+G++
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPG-SWSYIYAPLGHAAWIGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM----- 138
             ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+       
Sbjct: 65  TYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLY 123

Query: 139 --------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IR+ GV+QR+AL Y   +++ +  K                           
Sbjct: 124 ESVWTFGHIRILGVMQRLALCYGATAIIALIMKH-------------------------- 157

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
               Y+  L    +  +   +IN +  +Y                  N +  +DR VLG 
Sbjct: 158 ---KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGE 199

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMY                      KD         +PEGLLS++ SI   +IG   G 
Sbjct: 200 AHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGK 231

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + 
Sbjct: 232 LLMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           +  ++D+             G+N + +YVM A
Sbjct: 290 LVWIIDVRGYTRWSRFFESFGVNPLFIYVMGA 321


>gi|329964617|ref|ZP_08301671.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
           12057]
 gi|328525017|gb|EGF52069.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
           12057]
          Length = 396

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 100/409 (24%), Positives = 169/409 (41%), Gaps = 113/409 (27%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
            S   ++R+ +LDI RG+ +A MI+V++ G  W  I    +HA W G    D V PFF+F
Sbjct: 2   NSQKTSKRILALDILRGITIAGMIMVNNPG-SWAHIYAPLAHAQWIGLTPTDLVFPFFMF 60

Query: 83  IVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYG 133
           I+G++  ++LK+        A  K++ RT+ +   G+ + G FS        APD L +G
Sbjct: 61  IMGISTYISLKKYNFEFSHAAALKILKRTVIIFLIGMAI-GWFSRFCYYWSSAPDNLGFG 119

Query: 134 VDV--------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
            ++        RM R+ GV+QR+AL Y   S++ +  K                      
Sbjct: 120 ENLWASVWTFDRM-RILGVMQRLALCYGATSIIALTMKHKNIP----------------- 161

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            + A +L+ Y  LL G                  G  +N T           N +  +DR
Sbjct: 162 YLIATLLIGYFILLLG----------------GNGFAYNET-----------NILSIVDR 194

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            VL   HMY                                 +PEG+LS++ +I   ++G
Sbjct: 195 AVLTPAHMYKDNG----------------------------IDPEGILSTIPAIAHVLLG 226

Query: 306 VHFGHVII----HTKG--------HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
              G +++      KG        HL +L      G  L   G  L +    P++K++++
Sbjct: 227 FCVGRMMLGDSQSAKGDRTNVLDSHLIKL---FLAGTILTFAGFLLSY--GCPISKKIWS 281

Query: 354 LSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            ++V VT G A+   + +  ++D+   K   +     G+N + +YV+  
Sbjct: 282 PTFVLVTCGLASSFLALLIWIIDVKGYKRWSMFFESFGVNPLFMYVLGG 330


>gi|60679957|ref|YP_210101.1| hypothetical protein BF0369 [Bacteroides fragilis NCTC 9343]
 gi|336407897|ref|ZP_08588393.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
 gi|423248371|ref|ZP_17229387.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
           CL03T00C08]
 gi|423253319|ref|ZP_17234250.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
           CL03T12C07]
 gi|423269643|ref|ZP_17248615.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
           CL05T00C42]
 gi|423272798|ref|ZP_17251745.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
           CL05T12C13]
 gi|60491391|emb|CAH06139.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
 gi|335944976|gb|EGN06793.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
 gi|392657219|gb|EIY50856.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
           CL03T12C07]
 gi|392659584|gb|EIY53202.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
           CL03T00C08]
 gi|392700489|gb|EIY93651.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
           CL05T00C42]
 gi|392708362|gb|EIZ01469.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
           CL05T12C13]
          Length = 387

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 88/391 (22%), Positives = 160/391 (40%), Gaps = 95/391 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                            
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              Y+  L    +  +   +IN +  +Y                  N +  +DR VLG  
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLALL 290

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             ++D+             G+N + +YVM A
Sbjct: 291 VWIIDVRGYTRWSRFFESFGVNPLFIYVMGA 321


>gi|359686994|ref|ZP_09256995.1| hypothetical protein LlicsVM_01380 [Leptospira licerasiae serovar
           Varillal str. MMD0835]
 gi|418756670|ref|ZP_13312858.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
 gi|384116341|gb|EIE02598.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
          Length = 391

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 103/394 (26%), Positives = 165/394 (41%), Gaps = 104/394 (26%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG----DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R+ S+D+ RGL VA MILV++ G      WP + HA W+GC   D V PFFLF VG +I 
Sbjct: 25  RILSIDLLRGLTVAGMILVNNPGTWSNMYWP-LKHAKWDGCTPTDLVFPFFLFAVGASIP 83

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQR 147
            +   + +      K++ R++ L+F G+ L   G +S +            +R  GVLQR
Sbjct: 84  FS---VSNGIQEFPKILKRSVILIFLGLFLNFFGEWSFSN-----------LRFPGVLQR 129

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I  +Y   ++        ++K+          L   +W +   +             P  
Sbjct: 130 IGFAYFFSAIA------YREKNLKFRIILFLTLLISYWYLQEFI------------PPPG 171

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                 K+  D+G                     ++DR+V G  H+     W+  K    
Sbjct: 172 AAEPSMKEGKDWG--------------------AWLDREVFGQAHL-----WKFGKV--- 203

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHTKGHLARLK--- 323
                              ++PEGLL+S +SI S   G+  G  + +H +   + L    
Sbjct: 204 -------------------WDPEGLLTSFTSIASVFCGIFAGEFLKVHLEKKESPLSISG 244

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN---- 379
           + V   FA+L+ G         P+NK L+T +Y   T+G A L  S ++ +++ +N    
Sbjct: 245 KIVLGAFAVLVVGGVWGIY--YPINKSLWTGTYSLWTAGWALLAVS-LFLILEKYNRFGF 301

Query: 380 --LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN 411
             L+   LP    G NA+LV+     GIFA  +N
Sbjct: 302 GALQGFLLPF---GKNALLVFF--GSGIFARSLN 330


>gi|325954677|ref|YP_004238337.1| hypothetical protein [Weeksella virosa DSM 16922]
 gi|323437295|gb|ADX67759.1| hypothetical protein Weevi_1050 [Weeksella virosa DSM 16922]
          Length = 402

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 101/407 (24%), Positives = 168/407 (41%), Gaps = 112/407 (27%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +KT R  SLD+FRG  +ALMILV++ G     +  + HA W+GC   D V PFFLF VG 
Sbjct: 1   MKTTRYYSLDVFRGATIALMILVNNPGSWSYMFSPLQHASWHGCTPTDLVFPFFLFAVGN 60

Query: 87  AIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGV----DVRM 138
           A++  +  +  +A  V  KK+I RT+ +   G+ +          +EL +      +   
Sbjct: 61  AMSFGMSHLKLQASNVFWKKIIKRTILIFAIGLFINWWPFLKWENNELVFRAWRESEENG 120

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIA++    S +  +      +D+ V + SI             +L+ Y AL
Sbjct: 121 VRIMGVLQRIAIANFFASTLAYYY-----RDRVVLKISI------------LILLFYWAL 163

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV----LGINHMY 254
            +                           GV      P +  G+I  K+    +G+ HMY
Sbjct: 164 TF------------------------FLGGVD-----PYSLEGFIGTKIDVHLIGLAHMY 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                            EG            PF+PEGL S++ +I   ++G   G V I 
Sbjct: 195 KG---------------EG-----------VPFDPEGLYSTIPAISQILLGYLVG-VYIQ 227

Query: 315 TKGHLARLKQWVTMG-----------FALLIFGLTLHFTNAI--PLNKQLYTLSYVCVTS 361
            +G +    + +              F L  F L + +   +  P NK++++ SYV  T+
Sbjct: 228 KQGDIRWFSRSLPASNLPIYRMLSGLFVLGAFALIMGYIWQLDFPYNKKIWSSSYVIHTT 287

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWI------GMNAMLVYVMAA 402
             A      +    ++  + +P    AW+      G N + ++V++ 
Sbjct: 288 ALAIFTIGCMVWFAEVLQM-HP----AWMRFFDVFGKNPLFIFVLSG 329


>gi|53711719|ref|YP_097711.1| hypothetical protein BF0428 [Bacteroides fragilis YCH46]
 gi|52214584|dbj|BAD47177.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
          Length = 387

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 87/391 (22%), Positives = 160/391 (40%), Gaps = 95/391 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                            
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              Y+  L    +  +   +IN +  +Y                  N +  +DR VLG  
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  +T G A+   + +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIITCGLASSFLALL 290

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             ++D+             G+N + +YVM A
Sbjct: 291 VWIIDVRGYTRWSRFFESFGVNPLFIYVMGA 321


>gi|325103749|ref|YP_004273403.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324972597|gb|ADY51581.1| hypothetical protein Pedsa_1010 [Pedobacter saltans DSM 12145]
          Length = 374

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/141 (35%), Positives = 78/141 (55%), Gaps = 6/141 (4%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
               + +   R  SLD+ RG  VA MI+V+  G     +  + HAPW+G  + D V P F
Sbjct: 1   MANANSIPKPRYLSLDVLRGATVAFMIIVNTPGSWSYVYAPLKHAPWHGFTVTDLVFPTF 60

Query: 81  LFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVR 137
           LF+VG A++  + ++ ++ ++  +KKV  RTLK+   G+ L    F    D++    D  
Sbjct: 61  LFVVGNAMSFGMGKLKEQGNSAFLKKVFSRTLKIFLIGLFLNMFPFVKWVDDVLVMKDFT 120

Query: 138 MIRLCGVLQRIALSYLLVSLV 158
            IR+ GVLQRIA+ Y + SL+
Sbjct: 121 EIRIWGVLQRIAVCYCIASLL 141



 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 72/134 (53%), Gaps = 4/134 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEG+LS++ ++++ I+G +F  + I  KG+  +   +  +G  +++      +    
Sbjct: 198 PFDPEGVLSTIPAVVNVILG-YFAGLFIQKKGN-NKSTAFNLIGTGVILLLAASAWNLVF 255

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGI 405
           P+NK ++T SYV  T G   ++ +A+  ++++W++K         G N + +Y  A  GI
Sbjct: 256 PINKPIWTSSYVLYTVGWDLILLAALILIIEVWHIKKWTYFFEVFGKNPLFIY--ALSGI 313

Query: 406 FAGFINGWYYGDPH 419
           F   ++  Y G  +
Sbjct: 314 FVKLMHTIYIGGEN 327


>gi|423259248|ref|ZP_17240171.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
           CL07T00C01]
 gi|423263781|ref|ZP_17242784.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
           CL07T12C05]
 gi|387776828|gb|EIK38928.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
           CL07T00C01]
 gi|392706047|gb|EIY99170.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
           CL07T12C05]
          Length = 387

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 160/392 (40%), Gaps = 97/392 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +RL +LD+ RG+ +A MI+V++ G  W  +     HA W G    D V PFF+FI+G++
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPG-SWSYVYVPLGHAAWIGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM----- 138
             ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+       
Sbjct: 65  TYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLY 123

Query: 139 --------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IR+ GV+QR+AL Y   +++ +  K                           
Sbjct: 124 ESVWTFGHIRILGVMQRLALCYGATAIIALIMKH-------------------------- 157

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
               Y+  L    +  +   +IN +  +Y                  N +  +DR VLG 
Sbjct: 158 ---KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGE 199

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMY                      KD         +PEGLLS++ SI   +IG   G 
Sbjct: 200 AHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGK 231

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + 
Sbjct: 232 LLMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           +  ++D+             G+N + +YVM A
Sbjct: 290 LVWIIDVRGYTRWSRFFESFGVNPLFIYVMGA 321


>gi|440747989|ref|ZP_20927244.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436483731|gb|ELP39771.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 382

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 77/134 (57%), Gaps = 9/134 (6%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+ RGL +ALMI+V+  G DW  +     HA W+G  + D V P FLF+VG A+
Sbjct: 13  ERYLALDVLRGLTIALMIVVNTPG-DWSNVFSPLLHADWHGFTITDLVFPTFLFVVGNAM 71

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLF---WGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           + ++K++   +  A  K +F+   L+F   WG+     F      +   +D   +RL GV
Sbjct: 72  SFSMKKMEKMSQGAFLKKVFKRAALIFLIGWGLNAFPFFETNETGVVSMIDWSAVRLLGV 131

Query: 145 LQRIALSYLLVSLV 158
           LQRIAL YL+ SLV
Sbjct: 132 LQRIALCYLIASLV 145



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 77/170 (45%), Gaps = 25/170 (14%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEG+LS+  S+++ I G   G  I     +   +K  +  G  L+  G ++ +  A 
Sbjct: 206 PFDPEGVLSTWPSVVNVIAGFLAGKFIQQIGNNKRTVKALLLAG--LIAVGFSVIWELAF 263

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA---- 401
           P+NK+++T SYV +T G   +V   +  ++++        PL   G N +++Y+ A    
Sbjct: 264 PINKKIWTSSYVLLTVGLDLIVLGFLVLIIEVQKFNKWTYPLEVFGRNPLILYIFAWLVI 323

Query: 402 -------------AEGIFAGFINGWYYGDPHNTLVCFLFIISYILHSFLW 438
                           I++G    W      +    FLF I+Y+  S +W
Sbjct: 324 DIMYAIPVGENSLKGAIYSGLFTSWL----GSKTASFLFAIAYM--SLIW 367


>gi|319900285|ref|YP_004160013.1| hypothetical protein Bache_0400 [Bacteroides helcogenes P 36-108]
 gi|319415316|gb|ADV42427.1| putative transmembrane protein [Bacteroides helcogenes P 36-108]
          Length = 396

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 96/397 (24%), Positives = 168/397 (42%), Gaps = 105/397 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G +W  I     HA W G    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGITIAGMIMVNNPG-NWGHIYAPLEHAEWIGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV- 136
             ++LK+      R+ A+K  I +   ++F   L  G F+        AP EL++G ++ 
Sbjct: 66  TYISLKKYDFEFSRSAALK--ILKRTAIIFLIGLAIGWFARLCYYWAAAPGELSFGENLW 123

Query: 137 -------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                  RM R+ GV+QR+AL Y   S++ +  K                     +L+A 
Sbjct: 124 ASVWTFDRM-RILGVMQRLALCYGATSIIALTMKHRHIP----------------YLIAG 166

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            ++  ++ L+ G                  G  +N T           N +  +DR VL 
Sbjct: 167 LLISYFILLMCGN-----------------GFAYNET-----------NILSVVDRAVLT 198

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             HMY                                 +PEGLLS++ SI   ++G   G
Sbjct: 199 PAHMYKDNG----------------------------IDPEGLLSTIPSIAHVLLGFCVG 230

Query: 310 HVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSG 362
            +++         R+    +    LL+ G  L F         P++K++++ ++V VT G
Sbjct: 231 RMMLGDDRNAKADRMAVLDSHLIKLLLTGAILTFAGFLLSYGCPISKKIWSPTFVLVTCG 290

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            A+   + +  +VD+   +   +     G+N + +YV
Sbjct: 291 MASSFLALLIWIVDVKGYRKWSVFFESFGVNPLFMYV 327


>gi|218131911|ref|ZP_03460715.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
 gi|217986214|gb|EEC52553.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
          Length = 396

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/397 (23%), Positives = 166/397 (41%), Gaps = 100/397 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 8   NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS       +P E +++G  +  
Sbjct: 67  TYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLWE 125

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                     +L+A  +
Sbjct: 126 SVWTFDRIRILGVMQRLALCYGATAIIALTMKHKNIP----------------YLIATLL 169

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ LL G                  G  +N T           N +  +DR +L   
Sbjct: 170 TGYFILLLCGN-----------------GFAYNDT-----------NILSIVDRTILTPA 201

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 202 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRM 233

Query: 312 IIH-TKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSGAAA 365
           ++   K +  R     +    LL+ G  L F+        P+NK++++ ++V  T G A+
Sbjct: 234 MLEGGKANEDRESMLNSHLIKLLLVGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLAS 293

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
              + +  ++D+   K   L     G+N + +YV+  
Sbjct: 294 SFLALLIWIIDVKGYKKWSLFFESFGVNPLFMYVLGG 330


>gi|429738942|ref|ZP_19272716.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
           F0055]
 gi|429158431|gb|EKY00988.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
           F0055]
          Length = 400

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 97/403 (24%), Positives = 171/403 (42%), Gaps = 108/403 (26%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           K+ R+ ++DI RG+ +A MILV++ G +W  I     HA WNG    D V PFF+F++G+
Sbjct: 7   KSSRILAIDILRGITIAGMILVNNPG-NWGRIFAPFEHAEWNGMTPTDLVFPFFMFVMGM 65

Query: 87  AIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------AP-DELTYGVDV- 136
            I +A+++     +   V K+  R + +   G+ + G F+       +P +E ++G  + 
Sbjct: 66  CIYIAMRKFDFTCNKSTVYKITKRMVLIYLVGLGI-GWFAKFCFRWASPLEEASFGEQLW 124

Query: 137 ------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IRL GVL R+A+ Y + +L+ +    V+ K+                 +   
Sbjct: 125 YMVWPFDSIRLTGVLARLAICYGITALLAV---TVKHKNLP--------------YIIVT 167

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +LV Y  +L                 A  G  ++ T           N +   DR VL  
Sbjct: 168 LLVGYFIIL----------------MAGNGFAYDET-----------NILSIADRAVLTD 200

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMYH                                +PEGLLS++ SI  T++G   G 
Sbjct: 201 VHMYHDNG----------------------------IDPEGLLSTLPSIAHTLLGFMVGS 232

Query: 311 VIIHTKGH------------LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           ++  T               L+++     +G  L+  G  L +    PLNK++++ +YV 
Sbjct: 233 LLFKTTDEHSEHTDVRTGIILSKVVPLFVVGTILIFSGFLLSY--GCPLNKKVWSPTYVL 290

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           VT G A+ + + +  L+D+   +         G+N + ++V++
Sbjct: 291 VTCGLASTLLALLIWLIDVKGYRRWSKFFEVFGVNPLFLFVLS 333


>gi|389809458|ref|ZP_10205319.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter thiooxydans LCS2]
 gi|388441722|gb|EIL97972.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter thiooxydans LCS2]
          Length = 353

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/139 (39%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RLASLD  RG  VA M+LV+  G DW  +    +HA WNGC   D V PFFLF+VGV++
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPG-DWSHVYWPLAHAAWNGCTPTDLVFPFFLFVVGVSV 60

Query: 89  ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ALA L R+   A      +  ++R L++L  G+ +            + +    +R  GV
Sbjct: 61  ALAILPRLEQGASPSALTRAAMWRALRILALGVAIN-------LLAAWWLPQAHLRFPGV 113

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIAL +  V+L  ++TK
Sbjct: 114 LQRIALCFAGVALFAVYTK 132



 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 16/115 (13%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ S+ ST++G+  G         L R K    +        L   ++  +P 
Sbjct: 188 DPEGLLSTLPSLASTLLGLRAGC-------WLRREKLRTLLLAGAACLLLGALWSWWLPF 240

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAW---IGMNAMLVYV 399
           NK L+T S+V  TSG   L   A + L+D     +P    AW    G+NA+  Y 
Sbjct: 241 NKNLWTPSFVLWTSGWTTLALLAFHMLID--RRGWP----AWGRRFGINAIAAYA 289


>gi|389793498|ref|ZP_10196662.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter fulvus Jip2]
 gi|388434056|gb|EIL91012.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter fulvus Jip2]
          Length = 354

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 93/354 (26%), Positives = 144/354 (40%), Gaps = 94/354 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++RL SLD  RG  VA M+LV+  G DW  I     HAPW+GC   D V PFFLF+VGV+
Sbjct: 2   SKRLPSLDALRGCTVAAMLLVNDPG-DWGHIYAPLEHAPWHGCTPTDLVFPFFLFVVGVS 60

Query: 88  IALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
            ALAL    ++  A    VK  ++R L++L  G+ +            + +    +R  G
Sbjct: 61  SALALLPRLEQGVAPGALVKAALWRALRILALGVAIN-------LLAAWLLPHAHLRFPG 113

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY-GT 202
           VLQRI + +  V+L  + T+                     W+    +L+ Y  LL  G 
Sbjct: 114 VLQRIGICFAAVALFAVHTRPRTQ-----------------WIAIGGILLGYWGLLLAGG 156

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            V  W   +   DS  +G                     Y+     G  H          
Sbjct: 157 SVAPWVNIVSRTDSVVFGHFV------------------YLIDPATGRGH---------- 188

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                                    +PEGLL+++ S+ +T++G+  G  +   +     +
Sbjct: 189 -------------------------DPEGLLATLPSLATTLLGLRVGCWLRRREIRRLLI 223

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
                    +L+  L   ++  +PLNK L+T S+V  T+G A L   A + L+D
Sbjct: 224 AG-------VLLLLLGALWSLWLPLNKNLWTSSFVLWTAGWATLALLAFHWLID 270


>gi|380025576|ref|XP_003696546.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis florea]
          Length = 298

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/290 (27%), Positives = 137/290 (47%), Gaps = 67/290 (23%)

Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
           +R  GVLQ + +SY + +++E IF K         GRF++FR  L  W  WL+ A ++  
Sbjct: 13  LRFPGVLQLLGVSYFVCAILETIFMK----PHSQFGRFAMFRDILESWPQWLIMAGIVTT 68

Query: 195 YLALLY---------GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           +  + +         G + P  ++          GK  N T G          A GYIDR
Sbjct: 69  HTLITFLLPISNCPKGYFGPGGEYHF-------RGKYMNCTAG----------AAGYIDR 111

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            + G NH Y+H         T++  +   LR D          PEGL++++S+I    +G
Sbjct: 112 LIFG-NHTYNH---------TENFLYGQILRYD----------PEGLMNTISAIFIVYLG 151

Query: 306 VHFGHVIIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTS 361
           VH G +++      +R+ +W     F  ++ G+  +F      IP++K++ TLSYV + S
Sbjct: 152 VHAGKILLLYYQCNSRVIRWFLWTIFTGIVAGILCNFETQGGIIPISKRMMTLSYVLICS 211

Query: 362 GAAALVFSAIYALVD---IWNLKYPFLPLAWIGMNAMLVYV--MAAEGIF 406
             A L+++ +Y L+D    WN      P  + G+N + +YV  +  +G+F
Sbjct: 212 SFAFLLYALLYVLIDYKQFWNGA----PFVYAGINPIFLYVGHILTKGLF 257


>gi|255093765|ref|ZP_05323243.1| hypothetical protein CdifC_14056 [Clostridium difficile CIP 107932]
          Length = 505

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 81/154 (52%), Gaps = 16/154 (10%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFV 76
            +S+    S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF 
Sbjct: 125 KISNNVVDSKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFA 184

Query: 77  MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            PFF+  +GV I ++    LK        +  +  R++ L+ +G  L   +   P     
Sbjct: 185 FPFFVISLGVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP----- 237

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
             D+  +R+ GVLQR+ L Y + SLV +  K + 
Sbjct: 238 --DLNSVRILGVLQRMGLVYFVTSLVYLLLKKLN 269



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 323 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 380

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 381 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 434


>gi|255315516|ref|ZP_05357099.1| hypothetical protein CdifQCD-7_14229, partial [Clostridium
           difficile QCD-76w55]
          Length = 381

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 81/154 (52%), Gaps = 16/154 (10%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFV 76
            +S+    S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF 
Sbjct: 1   KISNNVVDSKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFA 60

Query: 77  MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            PFF+  +GV I ++    LK        +  +  R++ L+ +G  L   +   P     
Sbjct: 61  FPFFVISLGVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP----- 113

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
             D+  +R+ GVLQR+ L Y + SLV +  K + 
Sbjct: 114 --DLNSVRILGVLQRMGLVYFVTSLVYLLLKKLN 145



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 199 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 256

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 257 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 310


>gi|395213375|ref|ZP_10400182.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
 gi|394456744|gb|EJF11001.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
          Length = 391

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 9/149 (6%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNL 72
           + P ++D         +R  SLD+ RGL +ALM++V++ G  W  I     HA W+G  +
Sbjct: 6   TAPPLTDAGLLRPQTYERYLSLDVLRGLTIALMVVVNNPG-SWGSIYAPFKHAAWHGFTV 64

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQ--GGFSHAPD 128
            D V P FLF+VG A++ ++++   + D+V  +KV+ RT  +   G+ L         P+
Sbjct: 65  TDLVFPSFLFVVGNAMSFSMRKFETQPDSVFLRKVLKRTALIFLIGLFLNLFPFVMRNPE 124

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
                 D   +R+ GVLQRIAL Y + SL
Sbjct: 125 GAIVMKDFTAVRIMGVLQRIALCYFIASL 153


>gi|313147781|ref|ZP_07809974.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423280992|ref|ZP_17259903.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
           610]
 gi|313136548|gb|EFR53908.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404583442|gb|EKA88121.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
           610]
          Length = 387

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/395 (22%), Positives = 166/395 (42%), Gaps = 95/395 (24%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           +    +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+
Sbjct: 2   NQTANKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIM 61

Query: 85  GVAIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM-- 138
           G++  ++L++        A  K++ RT+ +   G+ +     F H  + L+ G D+    
Sbjct: 62  GISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFS 120

Query: 139 -----------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
                      IR+ GV+QR+AL Y   +++ +  K                     +L+
Sbjct: 121 RLGESVWTFGHIRILGVMQRLALCYGATAIIALIMKHKYIP----------------YLI 164

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           AA ++  ++ L+ G                  G  +N T           N +  +DR V
Sbjct: 165 AALLIGYFIILITGN-----------------GFEYNST-----------NILAVVDRAV 196

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           LG  HMY                      KD         +PEG+LS++ SI   +IG  
Sbjct: 197 LGEAHMY----------------------KD------NGIDPEGVLSTIPSIAHVLIGFC 228

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G +++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+  
Sbjct: 229 VGKLLMEVKDINEKIERLFLVGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSF 286

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            + +  ++D+             G+N + +YV  A
Sbjct: 287 LALLVWIIDVKGYTRWSRFFESFGVNPLFIYVTGA 321


>gi|424666001|ref|ZP_18103037.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
           616]
 gi|404574254|gb|EKA79005.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
           616]
          Length = 387

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/391 (23%), Positives = 165/391 (42%), Gaps = 95/391 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F H  + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSRLGE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                     +L+AA +
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKHKYIP----------------YLIAALL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           +  ++ L+ G                  G  +N T           N +  +DR VLG  
Sbjct: 169 IGYFIILITGN-----------------GFEYNST-----------NILAVVDRAVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEG+LS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGVLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + +
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLALL 290

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             ++D+             G+N + +YV  A
Sbjct: 291 VWIIDVKGYTRWSRFFESFGVNPLFIYVTGA 321


>gi|116331948|ref|YP_801666.1| hypothetical protein LBJ_2457 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
 gi|116125637|gb|ABJ76908.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
          Length = 363

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 111/390 (28%), Positives = 160/390 (41%), Gaps = 104/390 (26%)

Query: 44  LAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL--KRIPDR 98
           + V  MILV++ G     +  + HA WNGC   D V PFFLF VG +I ++L  K   +R
Sbjct: 1   MTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYSKNGINR 60

Query: 99  ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
                 +  R + L+     L G F +   E T+      +R+ GVLQRI   Y +V+ +
Sbjct: 61  IRIWIGICIRGISLI-----LLGLFLNFFGEWTF----SELRIPGVLQRIGFVYWVVATL 111

Query: 159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD 218
                           F +F            VLV  + +L    V  W  T I      
Sbjct: 112 ----------------FLVFP--------GKKVLVFLIPILL---VHTWILTHIAPPGES 144

Query: 219 YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKD 278
                         L    +   +IDR + G  H+     W+ SK               
Sbjct: 145 -----------MVSLEQGKDIGAWIDRTIFGEKHL-----WKFSKT-------------- 174

Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
                   ++PEG LS ++SI +++ GV  G ++   +G   R K  V     L IFGL 
Sbjct: 175 --------WDPEGFLSGIASIATSLFGVICGFILFRREG---RGKNRV-----LSIFGLG 218

Query: 339 LHFT-------NAIPLNKQLYTLSYVCVTSGAAAL---VFSAIYALVDI--WN---LKYP 383
             FT        ++P+NK L+T SY   T+G A L    F  + +L+ +  WN   LK  
Sbjct: 219 FLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGFAFLCIGFFEYLDSLILLKKWNGLDLKIF 278

Query: 384 FLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
           F P    G NA+LV+V    GI A  +N W
Sbjct: 279 FQPFFVFGKNAILVFV--GSGILARTLNFW 306


>gi|317474486|ref|ZP_07933760.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
           1_2_48FAA]
 gi|316909167|gb|EFV30847.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
           1_2_48FAA]
          Length = 396

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 93/397 (23%), Positives = 165/397 (41%), Gaps = 100/397 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 8   NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS       +P E +++G  +  
Sbjct: 67  TYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLWE 125

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                     +L+A  +
Sbjct: 126 SVWTFDRIRILGVMQRLALCYGATAIIALTMKHKNIP----------------YLIATLL 169

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ LL G                  G  +N T           N +  +DR +L   
Sbjct: 170 TGYFILLLCGN-----------------GFAYNDT-----------NILSIVDRTILTPA 201

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 202 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRM 233

Query: 312 IIH-TKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVCVTSGAAA 365
           ++   K +  R     +    L + G  L F+        P+NK++++ ++V  T G A+
Sbjct: 234 MLEGGKANEDRESMLNSHLIKLFLVGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLAS 293

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
              + +  ++D+   K   L     G+N + +YV+  
Sbjct: 294 SFLALLIWIIDVKGYKKWSLFFESFGVNPLFMYVLGG 330


>gi|374310943|ref|YP_005057373.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358752953|gb|AEU36343.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 385

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 79/143 (55%), Gaps = 18/143 (12%)

Query: 29  HLKTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIV 84
            L ++R+ S+D+ RG  +A MILV+ A G+WP     + HA WNGC   D V P FLF+ 
Sbjct: 12  ELTSKRIPSVDVLRGFTLAAMILVN-AAGEWPHAYWPLKHAQWNGCTPTDLVFPTFLFLT 70

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           G ++  + +    R    +++   TLK    L F G+LL        + L Y   +  +R
Sbjct: 71  GTSLVFSFRSRLARGVGKRELFLHTLKRSVILFFIGVLL--------NALPY-FHIGTLR 121

Query: 141 LCGVLQRIALSYLLVSLVEIFTK 163
           + GVLQRIAL YL VS++ ++ +
Sbjct: 122 IYGVLQRIALCYLCVSVLYLWNR 144


>gi|223936396|ref|ZP_03628308.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223894914|gb|EEF61363.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 383

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 110/466 (23%), Positives = 189/466 (40%), Gaps = 122/466 (26%)

Query: 13  PLIISE-PDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPEI----- 62
           P   SE P +S+Q   +    Q    R+ S+D  RG  +  ++  D     + +I     
Sbjct: 4   PTSTSEAPALSNQAGSTATLNQKANTRIISIDALRGFDMFWIMGGDQLVRSFQKIDDSAP 63

Query: 63  --------SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPD---RADAVKKVIFRTLK 111
                    H  W G +  D + P F+F+ GV+I  ++ R+ +   R  AVK++ FR++ 
Sbjct: 64  THALANQMEHCEWAGFHFYDLIFPLFVFLAGVSIVFSITRLIEHSGRVAAVKRIAFRSVI 123

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS 171
           L  +GI   GG S+           + I L GVL RIA++Y   +L+  F +        
Sbjct: 124 LFLFGIFYMGGVSNG---------FKNIYLAGVLHRIAVAYFFAALLFCFFRPKA----- 169

Query: 172 VGRFSIFRLYCWHWLMAACV--LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV 229
                         L+A C+  LV Y ALL    VP      +   S D GK        
Sbjct: 170 --------------LIAICIGLLVGYWALLTFVPVPG-----VGAASYDQGK-------- 202

Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
                   N   Y+D+  L                      FEG                
Sbjct: 203 --------NLAYYLDQHYL------------------PGQKFEGT--------------- 221

Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLN 348
             LLS++ ++ + ++G+  G ++ +      +   W +  G   L+ GL   ++   P+ 
Sbjct: 222 --LLSTMPAVANCLLGIFAGLLLTNKTVDDQKKVYWLLGSGITSLVIGLI--WSIQFPII 277

Query: 349 KQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA------- 401
           K L+T +YV +  G +A++    Y +++IW  +    P  W+GMNA+ +Y++A       
Sbjct: 278 KLLWTSTYVLLACGYSAILLGLFYQIIEIWKFQKWAQPFIWLGMNAITIYLVANIVNFRR 337

Query: 402 -AEGIFAGFINGWYYGDPHNTL--VCFLFIISYILHSFLWELRKFL 444
             E    G +   + G+ H+    V  L ++ +++H FL+  + FL
Sbjct: 338 HGERFVGGNVKN-FLGNYHDLALSVVVLILVFWVVH-FLYRRKVFL 381


>gi|392308231|ref|ZP_10270765.1| hypothetical protein PcitN1_06167 [Pseudoalteromonas citrea NCIMB
           1889]
          Length = 375

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 112/452 (24%), Positives = 185/452 (40%), Gaps = 122/452 (26%)

Query: 33  QRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADFVM 77
           +RLASLD  RG+ +           AL +L    G  W        H+ W+G    D + 
Sbjct: 8   KRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTVHSAWHGFTFYDLIF 65

Query: 78  PFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           P F+F+ GVA+ L+ KRI      +R     K + R   L   G+L   G+         
Sbjct: 66  PLFIFLSGVAMGLSPKRIDHLPFSERRGYYNKALKRLFLLSALGVLYNHGWGTGIP---- 121

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
            V +  IR   VL RIA+++    L+   T       +++G             +   VL
Sbjct: 122 -VALGEIRYASVLGRIAIAWFFCMLLVWHT-----SLRTLG------------FIGGGVL 163

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GIN 251
           + Y  LL   ++P                   V  G    L+   +   ++D  +L GI+
Sbjct: 164 IGYWLLL--CFIP-------------------VPGGTAGDLSAAGSWNAWVDAMLLPGIS 202

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           +              Q+ P +                PEG+LSS+ +I++ I GV  G +
Sbjct: 203 Y--------------QNRPVD----------------PEGVLSSLPAIVNAIAGVFAGRL 232

Query: 312 IIHTKGHLARLKQWVTMG----FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           I   +     + QW T+     F  +  GL   +    P+NK+L+T S+V VT G +A++
Sbjct: 233 IAQAQ----HVGQWHTVARLFVFGFVALGLGWLWDIHFPVNKELWTSSFVLVTVGWSAIL 288

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFING----WYY 415
            +A YALVD+ N +        IG N++++Y+ +        +  +F G I+G    W  
Sbjct: 289 LAAFYALVDVLNGQRFAYLFVIIGANSIIIYLASSLVNWTFVSRSVFGGIISGSAEHW-- 346

Query: 416 GDPHNTLVCFLFIISYILHSFLWELRKFLYVQ 447
             P   +V  L +   +LH   W  ++ ++V 
Sbjct: 347 -QPLMGVVALLAVQLIVLH---WMYKRKIFVS 374


>gi|423282312|ref|ZP_17261197.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
           615]
 gi|404581880|gb|EKA86575.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
           615]
          Length = 387

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 88/394 (22%), Positives = 160/394 (40%), Gaps = 101/394 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ-----------------GGFSHAPDE 129
            ++L++        A  K++ RT+ +   G+ +                    FS   + 
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSSEDISFFSRLYES 125

Query: 130 L-TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
           + T+G     IR+ GV+QR+AL Y   +++ +  K                         
Sbjct: 126 IWTFG----HIRILGVMQRLALCYGATAIIALIMKH------------------------ 157

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                 Y+  L    +  +   +IN +  +Y                  N +  +DR VL
Sbjct: 158 -----KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVL 197

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  HMY                      KD         +PEGLLS++ SI   +IG   
Sbjct: 198 GEAHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCV 229

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G +++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   
Sbjct: 230 GKLLMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFL 287

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           + +  ++D+             G+N + +YVM A
Sbjct: 288 ALLVWIIDVRGYTRWSRFFESFGVNPLFIYVMGA 321


>gi|302796998|ref|XP_002980260.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
 gi|300151876|gb|EFJ18520.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
          Length = 312

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 54/80 (67%)

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             I +NK LY+ SY+C T+GAA  VF  +Y LVD+++++YP L L W+GMN++++Y +AA
Sbjct: 38  QGIKMNKPLYSFSYMCFTAGAAGAVFCLLYILVDVYDIRYPTLLLEWMGMNSLIIYTLAA 97

Query: 403 EGIFAGFINGWYYGDPHNTL 422
             +   F+ G+Y+  P   L
Sbjct: 98  TDVLVVFVQGFYWKQPQKNL 117


>gi|325922207|ref|ZP_08183994.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
 gi|325547326|gb|EGD18393.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
          Length = 390

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 99/379 (26%), Positives = 156/379 (41%), Gaps = 86/379 (22%)

Query: 6   AETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEI 62
           +E T     I + P ++ ++E       R  SLD+FRGL + LMILV+ AG     + ++
Sbjct: 2   SEQTPAAAAITASPTLTPKRE-------RFLSLDVFRGLTIFLMILVNTAGPGAQAYAQL 54

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-- 120
           +HA W G  LAD V P FLF VG A++ AL         + +V  R   +   G+L+   
Sbjct: 55  THAAWFGFTLADLVFPSFLFAVGSAMSFALAADTPHRPFLGRVGKRAALIFLCGVLMYWF 114

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
             F   P        +  +RL GVLQRI L YLL +L+  +                   
Sbjct: 115 PFFHLQPGGGWAFTAIDQLRLTGVLQRIGLCYLLAALLVRYLPP---------------- 158

Query: 181 YCWHWLMAACV--LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
                +  ACV  L+ Y A+LY          +  +  A+  K  N   G R        
Sbjct: 159 ---RGIAPACVALLLGYWAVLY----------VFGQPGAELSKTGN--AGTR-------- 195

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
               +D  + G  H+Y                     RKD        F+PEGLL ++ +
Sbjct: 196 ----LDLWLYGRAHLY---------------------RKD------NGFDPEGLLGTLPA 224

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
            ++ + G   G   +  +G  A   +   +   + +  L L +  A PL+K+L++ S+V 
Sbjct: 225 TVNVLAGYLTGR-FLQRRGKTAAATR-TLLLAGVGLVLLALLWNPAWPLSKKLWSGSFVA 282

Query: 359 VTSGAAALVFSAIYALVDI 377
            T G   L    +  L+++
Sbjct: 283 CTVGLDLLALGVLVYLLEL 301


>gi|375356811|ref|YP_005109583.1| putative transmembrane protein [Bacteroides fragilis 638R]
 gi|383116724|ref|ZP_09937472.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
 gi|251947990|gb|EES88272.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
 gi|301161492|emb|CBW21032.1| putative transmembrane protein [Bacteroides fragilis 638R]
          Length = 387

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 87/391 (22%), Positives = 159/391 (40%), Gaps = 95/391 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                            
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              Y+  L    +  +   +IN +  +Y                  N +  +D  VLG  
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDHTVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLALL 290

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             ++D+             G+N + +YVM A
Sbjct: 291 VWIIDVRGYTRWSRFFESFGVNPLFIYVMGA 321


>gi|410028220|ref|ZP_11278056.1| hypothetical protein MaAK2_03415 [Marinilabilia sp. AK2]
          Length = 382

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 161/378 (42%), Gaps = 78/378 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+ RGL +ALM++V+  G  W  +     HA W+G  + D + P FLF+VG A+
Sbjct: 13  ERYLALDVLRGLTIALMVVVNTPG-SWSHMYAPFMHADWHGFTITDLIFPTFLFVVGNAM 71

Query: 89  ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGV 144
           + ++KR+     +  +KKV  RTL +   G LL    F +   E  Y  ++   +RL GV
Sbjct: 72  SFSMKRMESMGQSLFLKKVFKRTLLIFLIGWLLNAFPFFNYNAETGYSMINWSEVRLLGV 131

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL Y+L +L+  +           G+   F       + +   L+ Y A++Y    
Sbjct: 132 LQRIALCYMLAALILYY----------FGKKGAF-------IYSFIALLGYWAIMY---- 170

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                     D  D   +               NA   +D  ++G  H+Y          
Sbjct: 171 -------FFGDGEDPYSLIG-------------NAALKLDLWLIGAKHLY---------- 200

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
             +  PF+       P    + F      S V+ I   ++G      I  +  +   +K+
Sbjct: 201 MGEGIPFD-------PEGVLSTFP-----SVVNVIAGFLVG----KFIQESGNNTGTVKK 244

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPF 384
            V  G  LL+    L +    P+NK+++T  YV +T      +   +  ++++W  +   
Sbjct: 245 LVIWGIILLV--ACLVWDMVFPINKKIWTSPYVLLTISLDLFLIGLLMLVIEVWQKRNWT 302

Query: 385 LPLAWIGMNAMLVYVMAA 402
            P    G N +++YV++ 
Sbjct: 303 YPFEVFGRNPLILYVLSG 320


>gi|91794054|ref|YP_563705.1| hypothetical protein Sden_2703 [Shewanella denitrificans OS217]
 gi|91716056|gb|ABE55982.1| conserved hypothetical protein [Shewanella denitrificans OS217]
          Length = 400

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 109/414 (26%), Positives = 170/414 (41%), Gaps = 98/414 (23%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCN 71
           Q  + L   RL SLD  RG           L  AL  L   AG +    ++ H+ W+G  
Sbjct: 25  QTSTSLNKPRLKSLDALRGFDMFWIIGGEGLFAALFTLTGWAGWNIASRQMQHSQWHGFT 84

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDEL 130
           L D + P F+F+ GVA+ L+ KR+  +A AV   +++   K L   I L   ++H     
Sbjct: 85  LYDLIFPLFIFLSGVALGLSPKRLDQQAFAVALPLYQHACKRLILLIALGILYNHGWGT- 143

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
               D+  IR   VL RI  ++   +++   T+      Q +   SI  LY         
Sbjct: 144 GIPADLDKIRYSSVLARIGFAWFFAAMLVWHTR---LSIQVIVSVSIIGLYT-------- 192

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
                LA LY                        V  G   +     +   Y+D  +   
Sbjct: 193 -----LAQLY----------------------LPVPGGQAGQFTLDASINTYVDGLL--- 222

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
                     R     QD P                 +PEG+LS+V ++++ ++GV  G 
Sbjct: 223 ----------RPGIAYQDRP----------------LDPEGILSTVPAVINAMVGVFAGQ 256

Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
            II  H++G  A++   +  G  LL+    L     IP+NK L+T S+V VTSG + L  
Sbjct: 257 FIIRAHSRGDWAKVGVLIACGVLLLVLAWLLE--PMIPVNKDLWTTSFVLVTSGWSLLFL 314

Query: 369 SAIYALVDI--WNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFING 412
           S  Y ++D+  W  K+ F+ +  IG NA++VY+ +        +  +FAG I  
Sbjct: 315 SLFYVIIDVLKWQ-KWTFVFVV-IGTNAIIVYLGSSLIDWHYISRSLFAGVIEA 366


>gi|225875032|ref|YP_002756491.1| hypothetical protein ACP_3497 [Acidobacterium capsulatum ATCC
           51196]
 gi|225792728|gb|ACO32818.1| putative membrane protein [Acidobacterium capsulatum ATCC 51196]
          Length = 378

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 96/392 (24%), Positives = 167/392 (42%), Gaps = 92/392 (23%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMP 78
            Q + + +  +R+ S+D+ RG+ +A MILV++ G +    W  + HA WNG    D V P
Sbjct: 2   KQHQDTVVNAKRMVSIDLLRGITIAFMILVNNNGDEAHAFW-ALKHAQWNGFTPTDLVFP 60

Query: 79  FFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
            F+F+VG+++  +    L+R   R       + R++ L   G+++  GF +      +G 
Sbjct: 61  TFIFVVGISLVFSTEARLRRGQSRLLIAAHALRRSVILFLLGLVVN-GFPY----FHFGT 115

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
               +R+ GVLQRIA+ YL  SL+ + ++ V  +                 L+    LV 
Sbjct: 116 ----LRIYGVLQRIAICYLFGSLLYLLSRRVWLQA----------------LLFTTALVG 155

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y AL+    VP   + +  +D                 L+P  N V ++DR +L      
Sbjct: 156 YWALMRWVPVPG--YGLPGRDIPF--------------LDPNANLVAWLDRLLL------ 193

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
             P   R  A T+D                    PEGLLS++ ++ + ++G       + 
Sbjct: 194 --PG--RLYAGTRD--------------------PEGLLSTIPAMGTLLLG-------MM 222

Query: 315 TKGHL-----ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           T G L      R K  + +  A +       +    P+NK+++T SYV    G + L F+
Sbjct: 223 TAGWLRSAAAPRRKLMLLLAAAGIALTAGALWGLEFPINKRVWTSSYVLYAGGWSLLAFA 282

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
             + + ++   +         GMN +  Y+ A
Sbjct: 283 LCFWMTEVRKHRNGLYLWLAFGMNPITAYMFA 314


>gi|427400072|ref|ZP_18891310.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
 gi|425720812|gb|EKU83727.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
          Length = 380

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 163/391 (41%), Gaps = 106/391 (27%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           RL SLD FRG  +A M+LV++ G DW     +++HA W+G    D + PFFLFI GVA+A
Sbjct: 7   RLTSLDAFRGFTIAAMVLVNNPG-DWGHLHAQLAHAAWHGWTFTDTIFPFFLFIGGVAMA 65

Query: 90  LALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           L+L R+      +   + K+  R   +   G LL          L    D   +R+ GVL
Sbjct: 66  LSLGRLAAAGAHKPQLLLKLAKRAALIFLIGFLL---------NLIPRFDFDSVRIPGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL  LL                                  A  LVVYL     T+  
Sbjct: 117 QRIALCTLL----------------------------------AAPLVVYL-----TWRG 137

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRA-KLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +         +     G+ A  L P  +A  +IDR ++   H++       ++A
Sbjct: 138 QALAVFLLLALYSVLMLLVPVPGIGAGNLEPGRDAGAWIDRALMD-GHLW-------AQA 189

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
            T D   EG L    P+ C                 S + GV  G +++     L R++Q
Sbjct: 190 KTWDP--EG-LVSTLPAVC-----------------SLLFGVLAGRLLLSA---LPRVEQ 226

Query: 325 --WVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD----- 376
             W+ + G A L  G TL     +P+NK L+T S+  + SG A L F A Y L+D     
Sbjct: 227 VVWLMLSGLACLALGSTLD-AVLMPINKSLWTPSFCLLMSGWALLAFGASYWLLDAAPST 285

Query: 377 -----IWNLKYPFLPLAWIGMNAMLVYVMAA 402
                +  L  PF+     GMNA+ ++ ++ 
Sbjct: 286 ALRERMQRLSTPFVI---YGMNALFIFALSG 313


>gi|440804580|gb|ELR25457.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acanthamoeba
           castellanii str. Neff]
          Length = 446

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 103/411 (25%), Positives = 164/411 (39%), Gaps = 109/411 (26%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG-------------DWPEISHAPWN 68
           +D    S  K  RL SLD+FRG+ +  MILVD+ G                P     P N
Sbjct: 47  TDASCPSAPKKPRLQSLDVFRGVTMLGMILVDNQGNFDHVVRPLDESIVRHPAPPRPPTN 106

Query: 69  G---CNLADFVMPFFLFIVGVAIALALKRIPDRADAVK---KVIFRTLKLLFWGILLQGG 122
                + AD    +  F V +A+     +IPDR   +K   +V+ R   L   G+LL   
Sbjct: 107 ARSWVDPADHCAQWDGFAVALAMNGFWDKIPDRRGKIKAWARVLQRIGTLFVVGLLLNAF 166

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
            S+  D+  +       R+ G   RIAL Y  V+++ + T  +  +              
Sbjct: 167 GSNPWDKWPHW----HFRIMGC--RIALCYGTVTVLFLATSTIVQR-------------- 206

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
              ++  C   +Y+ L+YG  VP                     CG R  L P CNA G+
Sbjct: 207 ---VVMLCFTAIYVGLMYGLDVP--------------------KCG-RGNLTPGCNAGGF 242

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           IDR + G                                W   P +PEGLLS++++ L+ 
Sbjct: 243 IDRSIFG-------------------------------DWMIRPNDPEGLLSTLTATLTC 271

Query: 303 IIGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
            +G+ FG ++  +    L  + +WV +   L+   L L     +P+NK+++++ +  +  
Sbjct: 272 YLGLEFGRILHKYRANQLELVCRWVMLALGLIGLALFLWLW--MPINKKMWSVPFALMMG 329

Query: 362 GAAALVFSAIYALVDI----WN--------LKYPFLPLAWIGMNAMLVYVM 400
           G   LV    Y LVD+    W               PL W+GMN + ++V+
Sbjct: 330 GIGGLVIFICYYLVDMVLASWQEDSAWKKACNAAIQPLIWMGMNPLAIFVL 380


>gi|381169858|ref|ZP_09879020.1| conserved hypothetical protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689628|emb|CCG35507.1| conserved hypothetical protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 388

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 118/287 (41%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNAPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+                          +L    +  V LALL G    
Sbjct: 138 QRIGLCYLAAALLV------------------------RYLPPRGIAPVCLALLLGY--- 170

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
            W F  +                  A+L+   NA   +D  + G +H+Y           
Sbjct: 171 -WAFLYVFGQPG-------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|404450663|ref|ZP_11015643.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
 gi|403763718|gb|EJZ24662.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
          Length = 381

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 52/146 (35%), Positives = 80/146 (54%), Gaps = 10/146 (6%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFV 76
           +S  Q       +R  +LD+ RGL +ALMI+V+  G  W  +     HAPW+G  + D V
Sbjct: 1   MSTLQPSGVPLKERYLALDVLRGLTIALMIVVNTPG-SWSHMYGPFMHAPWHGFTITDLV 59

Query: 77  MPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLF--WGILLQGGFSHAPDELTY 132
            P FLF+VG A++ ++K++        ++KV+ R+  +    WG+     F    + L  
Sbjct: 60  FPTFLFVVGNAMSFSMKKLEKMGQGLFLRKVLKRSFLIFIIGWGLNAFPFFDQTENGLAM 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
            ++   +RL GVLQRIAL YL+ SLV
Sbjct: 120 -INWGEVRLLGVLQRIALCYLIASLV 144



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 64/117 (54%), Gaps = 2/117 (1%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEGLLS+  ++++ I G   G  I     +   +K  +  G  L++  + L +    
Sbjct: 205 PFDPEGLLSTFPAVVNVIAGYFAGKFIQQMGNNTKTVKYLLVAG--LILIAVCLAWDPFF 262

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           P+NK+L+T SYV +T G   L+ +A+  L+++W  +    P    G N +++YV++ 
Sbjct: 263 PINKKLWTSSYVLLTIGLDLLLIAAMILLIEVWQQRSWTYPFEVFGRNPLILYVLSG 319


>gi|167764222|ref|ZP_02436349.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
           43183]
 gi|167698338|gb|EDS14917.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
           43183]
          Length = 396

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 97/404 (24%), Positives = 165/404 (40%), Gaps = 103/404 (25%)

Query: 27  KSHLKT-QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFL 81
            S +KT +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+
Sbjct: 2   SSTVKTNKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFM 60

Query: 82  FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSH------APDE-LT 131
           FI+G++  ++LK+      RA  + K++ RT+ +   G+ + G FS       +P E + 
Sbjct: 61  FIMGISTYISLKKYNFEFSRAAGM-KILKRTILIFLIGMGI-GWFSRFCYYWTSPTEGIG 118

Query: 132 YGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
           +G  +         IR+ GV+QR+AL Y   +++ +  K                     
Sbjct: 119 FGAQLWEAAWTFDRIRILGVMQRLALCYGATAIIALTMKHRNIP---------------- 162

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
             + A +L  Y  LL                           CG     N   N +  +D
Sbjct: 163 -YLIATLLTGYFILL--------------------------VCGNGFAYN-DTNILSVVD 194

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +L   HMY                                 +PEGLLS++ +I   ++
Sbjct: 195 RAILTPAHMYKDNG----------------------------IDPEGLLSTIPAIAHVLL 226

Query: 305 GVHFGHVIIH-TKGHLARLKQWVTMGFALLIFGLTLHFTN-----AIPLNKQLYTLSYVC 358
           G   G +++   K   +R     +    L + G  L F         P+NK++++ ++V 
Sbjct: 227 GFCVGRMMLEGGKADESRESMLNSHLIKLFLAGTILTFAGFLLSYGCPINKKIWSPTFVL 286

Query: 359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            T G A+   + +  ++D+   K   L     G+N + +YV+  
Sbjct: 287 ATCGLASSFLALLIWIIDVKGYKKWSLFFESFGVNPLFMYVLGG 330


>gi|384362003|ref|YP_006199855.1| hypothetical protein CDBI1_13575 [Clostridium difficile BI1]
          Length = 485

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 113 SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 172

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 173 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 223

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 224 ILGVLQRMGLVYFVTSLVYLLLKKLN 249



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 303 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 360

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 361 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 414


>gi|255651295|ref|ZP_05398197.1| hypothetical protein CdifQCD_14003 [Clostridium difficile
           QCD-37x79]
          Length = 461

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 89  SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 148

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 149 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 199

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 200 ILGVLQRMGLVYFVTSLVYLLLKKLN 225



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 279 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 336

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 337 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 390


>gi|255518179|ref|ZP_05385855.1| hypothetical protein CdifQCD-_13768 [Clostridium difficile
           QCD-97b34]
          Length = 469

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 97  SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 156

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 157 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 207

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 208 ILGVLQRMGLVYFVTSLVYLLLKKLN 233



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 287 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 344

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 345 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 398


>gi|254976379|ref|ZP_05272851.1| hypothetical protein CdifQC_13741 [Clostridium difficile QCD-66c26]
          Length = 459

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 87  SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 146

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 147 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 197

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 198 ILGVLQRMGLVYFVTSLVYLLLKKLN 223



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 277 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 334

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 335 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 388


>gi|329851798|ref|ZP_08266479.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
 gi|328839647|gb|EGF89220.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
          Length = 398

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 164/384 (42%), Gaps = 86/384 (22%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  +LDI RGL +  M+L ++AG DW  I     HA W+G  L D V P F+  VG+++ 
Sbjct: 27  RFEALDILRGLFIIGMLLANNAG-DWSHIYTPLDHAEWHGFTLTDMVFPGFMTCVGLSMT 85

Query: 90  LALKR----IPDRADAVKKVIFRTLKL--------LFWGILLQGGFSHAPDELTYGVDVR 137
           L+L R    +  +A     ++  +L+         LF  +L Q  F H            
Sbjct: 86  LSLGRRQKTLNSQAGGKAALLVHSLRRAAILVGIGLFLNLLPQFDFEH------------ 133

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             RL GVLQRI + Y + S + +       +   +       L+   +L+A  +L+ Y+ 
Sbjct: 134 -WRLPGVLQRIGICYAIASGLVVLHSHQNQQGGLILHSRALALWGVGFLVAYTLLLKYVP 192

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +  G     W                       A  + P     ++D +VLG+NH+    
Sbjct: 193 VPDGAGANQWD----------------------AIHSWPA----WVDMQVLGVNHV---- 222

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            W  +K                       ++PEGLLSSV +  + + G+  G + I+T+ 
Sbjct: 223 -WSGAKT----------------------YDPEGLLSSVPATSNILFGILMG-LYINTRT 258

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
                     +G  L++  L L   + +P+ K+L+T S+V ++ G A  V + +  ++D 
Sbjct: 259 PRNAWGGVAIIGVLLML--LALVLDSYVPIIKKLWTPSFVLLSCGFAFTVLAVLMVVMDR 316

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMA 401
              K   +P+   G NA+LVYV A
Sbjct: 317 LGFKRWAVPIKLFGTNAILVYVFA 340


>gi|255656770|ref|ZP_05402179.1| hypothetical protein CdifQCD-2_14006 [Clostridium difficile
           QCD-23m63]
          Length = 481

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 109 SKLMNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 168

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 169 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVR 219

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 220 ILGVLQRMGLVYFVTSLVYLLLKKLN 245



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 299 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 356

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 357 PFNKRLWSSSFVLLMAGSYGVLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 410


>gi|390989491|ref|ZP_10259788.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
 gi|372555760|emb|CCF66763.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
          Length = 388

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/287 (28%), Positives = 117/287 (40%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +                         +   +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPQRGIAP-----------------VCLALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|288929890|ref|ZP_06423732.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
 gi|288328709|gb|EFC67298.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
          Length = 399

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 99/409 (24%), Positives = 172/409 (42%), Gaps = 109/409 (26%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
            EK+   T R+ S+DI RGL +A MI V++ G     +  + HA WNG    D V PFF+
Sbjct: 1   MEKNK-TTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFM 59

Query: 82  FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFS------HAPDE--- 129
            ++G+ I +A+++     +RA   K  I + + L++   L  G F+      ++P E   
Sbjct: 60  CVMGMCIYIAMRKFDFACNRATVYK--IVKRMVLIYLVGLAIGWFAKFCYRWNSPQEGAD 117

Query: 130 ----LTYGV-DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
               L Y V     IRL GVL R+A+ Y + +L+ I    V+ K                
Sbjct: 118 FFSQLWYMVWSFDKIRLTGVLARLAICYGITALLAI---TVRHKHLP------------- 161

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           +++   +L  ++ L+ G                  G  ++ T           N +   D
Sbjct: 162 YIIVGLLLTYFVILMAGN-----------------GFAYDET-----------NILSIAD 193

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R VL   HMYH                                +PEGLLS++ SI  T++
Sbjct: 194 RAVLTDAHMYHDNG----------------------------IDPEGLLSTLPSIAHTLL 225

Query: 305 GVHFGHVII------------HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           G   G ++              T   L ++     +G +LL  G  L +    P+NK+++
Sbjct: 226 GFIIGSLLFRKADVGEQQLDARTNITLTKVVPLFVVGTSLLFAGYLLSY--GCPINKKVW 283

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + ++V VT G A+++ + +  ++D+   K         G+N + ++V++
Sbjct: 284 SPTFVLVTCGLASMLLALLTWIIDVKGKKSWSKFFEVFGVNPLFLFVLS 332


>gi|399028715|ref|ZP_10729871.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
 gi|398073551|gb|EJL64721.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
          Length = 382

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/404 (22%), Positives = 159/404 (39%), Gaps = 105/404 (25%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLADFV 76
           +   RL SLD  RG  +  ++  +H           P       ++ H  W G    D +
Sbjct: 5   ITNGRLVSLDALRGFVMFWIMSGEHIIHALAKAAPIPVFVWMSSQLHHTEWEGITFYDMI 64

Query: 77  MPFFLFIVGVAIALALKR--------IPDRADAVKK------VIFRTLKLLFWGILLQGG 122
            P FLF+ GV++  + ++         P    A +K      ++ RT  L+F G ++ G 
Sbjct: 65  FPIFLFVAGVSMPYSFEKKMSIAGVNTPMELPAKEKKKIYLSMLKRTCILIFLGFIVNGL 124

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
                 + T        R   VL RI L++    ++ +   +   K Q +          
Sbjct: 125 LRFDGYDQT--------RFASVLGRIGLAWFFAGIIYL---NFNLKKQII---------- 163

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
             W +   +LV Y   +    VPD+   ++ K+ +                       GY
Sbjct: 164 --WFIG--ILVGYYLAMKLIPVPDFGAGVLTKEGS---------------------LEGY 198

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           IDR  L        P    SK                       ++PEGL S++ ++ + 
Sbjct: 199 IDRMFL--------PGRLHSKV----------------------YDPEGLFSTIPAVATA 228

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           ++G+  G  +     H +  K+ + M   A+++ G+ L +    P+NK L++ S+VC   
Sbjct: 229 LLGMFLGTFLKIKANHFSTNKKILIMVASAIVLIGIGLIWNYDFPINKHLWSSSFVCFVG 288

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGI 405
           G + L F+  Y ++D+        PL  IG N++L+Y MA+EG+
Sbjct: 289 GCSILFFTFFYLIIDVLGFHKWAFPLILIGSNSILIY-MASEGL 331


>gi|406662851|ref|ZP_11070935.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
 gi|405553158|gb|EKB48438.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
          Length = 382

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/134 (36%), Positives = 77/134 (57%), Gaps = 9/134 (6%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+ RGL +ALM++V+  G  W  +     HA W+G  + D + P FLF+VG A+
Sbjct: 13  ERYLALDVLRGLTIALMVVVNTPG-SWSHMYAPFMHADWHGFTITDLIFPTFLFVVGNAM 71

Query: 89  ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLCGV 144
           + ++K++      V  KKV  RTL +   G LL      ++ P+     ++   +RL GV
Sbjct: 72  SFSMKKLESMGQQVFLKKVFKRTLLIFLIGWLLNAFPFVNYNPESGYSMINWSEVRLLGV 131

Query: 145 LQRIALSYLLVSLV 158
           LQRIAL Y+L +L+
Sbjct: 132 LQRIALCYMLAALI 145



 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 61/117 (52%), Gaps = 2/117 (1%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEG+LS+  S+++ I G   G  I  +  +   +K+ V  G  LL+    L +  A 
Sbjct: 206 PFDPEGILSTFPSVVNVIAGFLVGKFIQDSGNNTGTVKKMVIWGIILLV--ACLIWDMAF 263

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           P+NK+++T  YV +T      +   +  ++++W  +    P    G N +++YV++ 
Sbjct: 264 PINKKIWTSPYVLLTISLDLFLIGFLMLVIEVWQKRNWTYPFEVFGRNPLILYVLSG 320


>gi|323343595|ref|ZP_08083822.1| transmembrane protein [Prevotella oralis ATCC 33269]
 gi|323095414|gb|EFZ37988.1| transmembrane protein [Prevotella oralis ATCC 33269]
          Length = 384

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 147/361 (40%), Gaps = 98/361 (27%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVM 77
           D + QQ+K      R+ ++DI RG+ +A MILV++ G D  +  + HA W G    D V 
Sbjct: 2   DTATQQKK------RILAVDILRGMTIAGMILVNNPGTDTVYAPLEHAEWIGLTPTDLVF 55

Query: 78  PFFLFIVGVAIALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-----PDEL 130
           PFF+FI+G+   L+LK+   +   +  +K+  R L L   G+ +   F        PD  
Sbjct: 56  PFFMFIMGITTYLSLKKFEFKWSVECGRKIAKRALLLWLIGLAISWLFMFCRGLLDPDMS 115

Query: 131 TYGVDVRM---------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
           +     R+         +RL GVL R+ + Y L ++V +  K                  
Sbjct: 116 SMPFGSRLWASVNTFDQLRLLGVLPRLGICYGLAAVVALSVKHKYIP------------- 162

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
              WL+A   +  Y+ L                           TC   A  +   N + 
Sbjct: 163 ---WLIAIIFIGYYILL--------------------------ETCNGYA--HDASNILA 191

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
            +D  VLG  H+Y                     R ++P       +PEGLLS+  ++  
Sbjct: 192 IVDDAVLGHGHVY---------------------RWESP-------DPEGLLSTFPALAH 223

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
            +IG   G  ++  +    ++++   +G  L   G  L +  A P++K+L+T ++  VT 
Sbjct: 224 VLIGFCVGRTVMEMQNLNDKIERLFLIGALLTFAGFLLSY--ACPISKKLWTPTFAMVTC 281

Query: 362 G 362
           G
Sbjct: 282 G 282


>gi|255307823|ref|ZP_05351994.1| hypothetical protein CdifA_14636 [Clostridium difficile ATCC 43255]
          Length = 483

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 111 SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 170

Query: 85  GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I +++        +   +I     R++ L+ +G  L   +   P       D+  +R
Sbjct: 171 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 221

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 222 ILGVLQRMGLVYFVTSLVYLLLKKLN 247



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 301 FEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FNQYF 358

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 359 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 412


>gi|255101955|ref|ZP_05330932.1| hypothetical protein CdifQCD-6_14161 [Clostridium difficile
           QCD-63q42]
          Length = 469

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 97  SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 156

Query: 85  GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I +++        +   +I     R++ L+ +G  L   +   P       D+  +R
Sbjct: 157 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 207

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 208 ILGVLQRMGLVYFVTSLVYLLLKKLN 233



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 287 FEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FNQYF 344

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 345 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 398


>gi|423081105|ref|ZP_17069717.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
           002-P50-2011]
 gi|423085023|ref|ZP_17073481.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
           050-P50-2011]
 gi|357550878|gb|EHJ32683.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
           050-P50-2011]
 gi|357551414|gb|EHJ33204.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
           002-P50-2011]
          Length = 427

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 55  SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 114

Query: 85  GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I +++        +   +I     R++ L+ +G  L   +   P       D+  +R
Sbjct: 115 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 165

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 166 ILGVLQRMGLVYFVTSLVYLLLKKLN 191



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 245 FEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKILVMSIILLIGAFI--FNQYF 302

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 303 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 356


>gi|418515336|ref|ZP_13081517.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|418520970|ref|ZP_13087016.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702946|gb|EKQ61443.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410708055|gb|EKQ66504.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 388

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/287 (28%), Positives = 117/287 (40%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +                         +   +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAP-----------------VCLALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|284041428|ref|YP_003391358.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
 gi|283820721|gb|ADB42559.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
          Length = 381

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 85/346 (24%), Positives = 141/346 (40%), Gaps = 87/346 (25%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
           + SH  WNG    D + P F+F+ GV+   +    L +  D+A   +K+I R L L+  G
Sbjct: 60  QFSHPAWNGFRAYDLIFPLFMFMAGVSTPFSVGSRLDQGTDKAKIARKIISRGLILVVLG 119

Query: 117 ILLQGG-FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           I+   G F+   +++         R   VL RI L+ +   L+ ++ +            
Sbjct: 120 IIYNNGLFNRVFEDM---------RFPSVLGRIGLAGMFAQLIYLYFRPRAQ-------- 162

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
                Y W       +L+ Y AL+    VP                     CG    L  
Sbjct: 163 -----YIWF----VGLLLGYWALMMLVPVPG--------------------CGA-GVLTM 192

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
            CN   +IDR ++   H+Y                             H   +PEGL S+
Sbjct: 193 ECNLASFIDRMLVP-GHLYKT--------------------------IH---DPEGLFST 222

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF--ALLIFGLTLHFTNAIPLNKQLYT 353
           + +I +T++G+ F    + T G     K  + +G   A ++ G    F    P+NK L+T
Sbjct: 223 LPAIDNTLLGI-FAGTFLRTHGRTGNQKTALLLGAGAAFVLLGWLWDFV--FPINKNLWT 279

Query: 354 LSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            S+V VT G + L+ +  Y ++D+  +K        IGMN++L+Y+
Sbjct: 280 SSFVLVTGGLSLLLLAVFYWVIDVKGIKRWTFFFTVIGMNSILIYL 325


>gi|294949094|ref|XP_002786049.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
 gi|239900157|gb|EER17845.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
          Length = 277

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/306 (24%), Positives = 108/306 (35%), Gaps = 99/306 (32%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           R+ ++D+ RG +   + +VD  G   P I HAPWNG +LAD VMP F+FI          
Sbjct: 31  RIVAVDVMRGRSS--VQIVDVCGKTVPSIGHAPWNGLHLADIVMPGFIFI---------- 78

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
                                             D LT G+D+   R  G+LQRIA+ Y 
Sbjct: 79  ----------------------------------DTLTLGLDLYTFRAPGILQRIAVCYA 104

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
              L+     D+   D   G            L+  C+++            +W   ++ 
Sbjct: 105 AAVLLRKLVSDLSPNDTVKGALKNNSRVLLMGLL--CIII------------NWAIMLLG 150

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
                           R  L P CN    IDR V G  HMY  P W              
Sbjct: 151 PQPEGCS---------RGSLTPQCNVASNIDRMVFGPEHMY-SPLW-------------- 186

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
                         +PEGLLS++ ++ +  +G+  G  I     H   L + V  G  L 
Sbjct: 187 --------------DPEGLLSTLPTLATVALGLACGKFIQSRPSH-TELLRLVGCGLLLA 231

Query: 334 IFGLTL 339
           + G+ L
Sbjct: 232 LSGMAL 237


>gi|192359631|ref|YP_001981658.1| hypothetical protein CJA_1162 [Cellvibrio japonicus Ueda107]
 gi|190685796|gb|ACE83474.1| putative membrane protein [Cellvibrio japonicus Ueda107]
          Length = 399

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 78/142 (54%), Gaps = 15/142 (10%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLF 82
           + ++  QR  +LD+ RGL +ALMILV+   G W  +     HA W+G    DFV PFFLF
Sbjct: 31  EVYMVKQRFLALDVMRGLTLALMILVN-TPGSWSHVYGPLLHADWHGVTPTDFVFPFFLF 89

Query: 83  IVGVAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           IVG A+  +++ +     +  ++KV  R L L   GILL            +  DV   R
Sbjct: 90  IVGSAMYFSVRGLAQLSLSQQLRKVGRRVLLLFVMGILLAA--------YPFTADVHDWR 141

Query: 141 LCGVLQRIALSYLLVSLVEIFT 162
           + GVLQRIAL+Y + +L+ ++ 
Sbjct: 142 IMGVLQRIALAYGVAALIVLYA 163


>gi|371776142|ref|ZP_09482464.1| hypothetical protein AnHS1_01923 [Anaerophaga sp. HS1]
          Length = 369

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/157 (35%), Positives = 85/157 (54%), Gaps = 17/157 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           KT+R  +LD+ RG+ +ALMI V++ G  W  I     H+ W+GC   D V PFFLF+VGV
Sbjct: 3   KTERYLALDVLRGMTIALMITVNNPG-SWKYIYAPLRHSSWHGCTPTDLVFPFFLFVVGV 61

Query: 87  AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ++  +  +  +    ++ K++  RTL +   G+ L    +  P    +  D   +R+ GV
Sbjct: 62  SMFFSFSKYGNTLNKESFKRLGRRTLLIFAIGLFL----NSFPQ---WDRDYSTLRIMGV 114

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
           LQRIAL+Y   SL+ +    V  K   +  FSI  LY
Sbjct: 115 LQRIALAYGFGSLIVL---SVPRKYIPLLGFSILLLY 148



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 60/117 (51%), Gaps = 3/117 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEGLLS++ ++++ ++G +   V I T   +    Q    G  + I G         
Sbjct: 188 PFDPEGLLSTIPAVVTVLLG-YLTGVFIKTTEKVKIPGQLALYGLIVAIAGRLWGL--VF 244

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           P+NK L+T SYV  T+G AA+ F+ +  ++DI   K         GMN + +Y ++ 
Sbjct: 245 PINKPLWTGSYVLYTAGLAAMAFALLIFIIDIKGYKKWTSFFVVFGMNPLFIYALSG 301


>gi|296452402|ref|ZP_06894103.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP08]
 gi|296258732|gb|EFH05626.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP08]
          Length = 481

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/150 (32%), Positives = 78/150 (52%), Gaps = 16/150 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S     R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 109 SKFVNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 168

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 169 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVR 219

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQ 170
           + GVLQR+ L Y + SLV +  K +  +  
Sbjct: 220 ILGVLQRMGLVYFVTSLVYLLLKKLNVRSS 249



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 299 FEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 356

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 357 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 410


>gi|339021122|ref|ZP_08645235.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
 gi|338751776|dbj|GAA08539.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
          Length = 377

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 109/428 (25%), Positives = 167/428 (39%), Gaps = 98/428 (22%)

Query: 42  RGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKR 94
           RG  +  M++V++ G     W  + HA WNGC  AD V PFFLF++G  I  A    L+ 
Sbjct: 2   RGATIVFMVIVNNPGDWNRVWSPLDHAAWNGCTPADLVFPFFLFLMGCVIPFAFDRRLRE 61

Query: 95  IPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
              R+  V  + +R L L+   +LL    F H          V  +R  GVL RIAL Y+
Sbjct: 62  GAQRSQLVSHIAWRGLALVGLKLLLSLYPFFH----------VTHLRFFGVLTRIALCYV 111

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
               + + ++        +G                 +L+ Y A+LY   VP   +    
Sbjct: 112 AAVSLYLCSRKTGFLVSVIG----------------LILLAYWAILYALPVPGLGWP--- 152

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
                 GK F       A L+   N   ++DR+       + H      K          
Sbjct: 153 ------GKDF-------AFLDLNRNMAAWLDRQFSAWCQTWLHTGILYEK---------- 189

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ-WVTMGFAL 332
                  +W     +PEGLLS++ +I +T+ GV  G V        AR    ++  G A 
Sbjct: 190 -------TW-----DPEGLLSTLPAIATTLSGVLAGQVFRRKDISPARRPMLFMAAGGAS 237

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWI-- 390
           ++ GL   +    PLNK L+T S+  V+SGAA    +    + D+  L+     +  I  
Sbjct: 238 ILVGLL--WGQVFPLNKSLWTSSFTLVSSGAALCSLAVCDGIFDVLRLQNKNTIIQAIAT 295

Query: 391 -----GMNAMLVY----------------VMAAEGIFAGFINGWYYGDPHNTLVCFLFII 429
                GMNA+  +                V  +  +  G       G P + L  FLF  
Sbjct: 296 FCQMFGMNAVFAFLFSGFLAKTLLVLPSPVAGSTSLEGGIYQALRAGYPSSNLTSFLFSC 355

Query: 430 SYILHSFL 437
            +++  FL
Sbjct: 356 CFLILVFL 363


>gi|289667572|ref|ZP_06488647.1| hypothetical protein XcampmN_03447, partial [Xanthomonas campestris
           pv. musacearum NCPPB 4381]
          Length = 298

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 5/133 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 22  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R + ++  G+L+     F   PD       V  +RL GVL
Sbjct: 82  MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141

Query: 146 QRIALSYLLVSLV 158
           QRI L YL  +L+
Sbjct: 142 QRIGLCYLAAALL 154


>gi|423089801|ref|ZP_17078150.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
           difficile 70-100-2010]
 gi|357557565|gb|EHJ39099.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
           difficile 70-100-2010]
          Length = 391

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 77/146 (52%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 19  SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 78

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 79  GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 129

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SL  +  K + 
Sbjct: 130 ILGVLQRMGLVYFVTSLAYLLLKKLN 155



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 209 FEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FNQYF 266

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 267 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 320


>gi|119774084|ref|YP_926824.1| hypothetical protein Sama_0947 [Shewanella amazonensis SB2B]
 gi|119766584|gb|ABL99154.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
          Length = 378

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 96/401 (23%), Positives = 162/401 (40%), Gaps = 98/401 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRG------LAVALMILVDHAGGDWP-------EISHAPWNGC 70
           Q  ++     RL SLD  RG      L    + +   A   W        E+ H+ W+G 
Sbjct: 2   QATQTKAAKPRLMSLDALRGFDMFWILGGEKLFIALFALTGWSFWQLADAEMHHSEWHGF 61

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDE 129
              D + P F+F+ GVA+ L+ KR+   A A +  I+R  +K LF  + L   ++H    
Sbjct: 62  TFYDLIFPLFIFLSGVALGLSPKRLDKLAPAERNPIYRHAVKRLFLLLALGVLYNHG--- 118

Query: 130 LTYGVDVRM----IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
             +G  +      +R   VL RIA ++   +L+                        WH 
Sbjct: 119 --WGTGIPAHSDEVRYASVLGRIAFAWFFAALL-----------------------VWHT 153

Query: 186 LMAACVLVVYLALLYG-TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
            +   +    LA+L+G   +  W                 V  G    L P  +   ++D
Sbjct: 154 SLRTQIATA-LAILFGYAAIQLW---------------LPVPGGQAGVLTPSGSINAWVD 197

Query: 245 RKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
              L GI +  H P                             ++PEG+LS++ +I++ +
Sbjct: 198 THFLPGITYQ-HRP-----------------------------YDPEGILSTLPAIVNAL 227

Query: 304 IGVHFGHVIIH--TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           +GV  G  I+    +G  A+       G   L+ G +L   + +P+NK L+T S+V VT+
Sbjct: 228 MGVFVGRFIVKPDARGDWAKAGILTGAGGLSLVLGWSLD--SVLPVNKDLWTSSFVLVTT 285

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           G   L  +  Y LVD+   K    P   IG+N++++Y+ ++
Sbjct: 286 GWNLLFLALFYVLVDVLGAKRLAFPFVVIGVNSIIIYLASS 326


>gi|296877751|ref|ZP_06901777.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP07]
 gi|296431202|gb|EFH17023.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP07]
          Length = 370

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 16/146 (10%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +GV I
Sbjct: 2   NSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 61

Query: 89  ALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R+ GV
Sbjct: 62  PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVRILGV 112

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQ 170
           LQR+ L Y + SLV +  K +  +  
Sbjct: 113 LQRMGLVYFVTSLVYLLLKKLNVRSS 138



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 188 FEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 245

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 246 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 299


>gi|289663929|ref|ZP_06485510.1| hypothetical protein XcampvN_12875 [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 392

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 5/133 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 22  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R + ++  G+L+     F   PD       V  +RL GVL
Sbjct: 82  MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141

Query: 146 QRIALSYLLVSLV 158
           QRI L YL  +L+
Sbjct: 142 QRIGLCYLAAALL 154


>gi|319785830|ref|YP_004145305.1| transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
 gi|317464342|gb|ADV26074.1| putative transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
          Length = 357

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 78/141 (55%), Gaps = 16/141 (11%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGV 86
           + +RLAS+D  RGL VA M+LV++ G DW  +     HA W+GC  AD V PFFL IVGV
Sbjct: 5   RFRRLASVDALRGLTVAAMLLVNNPG-DWGHVYAPLLHADWHGCTPADLVFPFFLAIVGV 63

Query: 87  AIALA-LKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +IAL  + RI    DRA  ++ V  R L++L  G+LL        D+  Y       R  
Sbjct: 64  SIALGVVPRIEAGADRAGLMRTVAVRPLRILAVGLLLHLLAWWWLDQPHY-------RPW 116

Query: 143 GVLQRIALSYLLVSLVEIFTK 163
           GVLQRI L +L      ++ +
Sbjct: 117 GVLQRIGLCFLGAGAAALYLR 137



 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 82/175 (46%), Gaps = 25/175 (14%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGL S++ ++ +T++G+  G  + H +  L RL        A+L   L L  T  +P 
Sbjct: 192 DPEGLPSTLGALATTLLGLRAGDWLRHDQ--LRRLAG-----AAVLALLLGLACTPWMPW 244

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNA--------MLVYV 399
           NK L+T SYV  + G   L  +  + LVD+    +P L  A  G+NA        ++VY 
Sbjct: 245 NKNLWTPSYVLWSGGWVLLALAVAHVLVDL--RGWPALGRA-FGVNAIAAYAGSTLMVYA 301

Query: 400 MAA----EGIFAGFINGWYYGDPHNTLVCFLFIISYILHSF---LWELRKFLYVQ 447
           +A     E ++ G   GW        L    F ++++   +   LW  R+  Y++
Sbjct: 302 LAGLDWWEPVYRGAFAGWMTPRLGPYLPSLAFALAFVACWWALVLWMDRRGWYLK 356


>gi|424670170|ref|ZP_18107195.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401070628|gb|EJP79142.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 355

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRAQDAAARPALARGVLERALRILMAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|260910302|ref|ZP_05916976.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260635554|gb|EEX53570.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 399

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 99/407 (24%), Positives = 170/407 (41%), Gaps = 105/407 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
            EK+   T R+ S+DI RGL +A MI V++ G     +  + HA WNG    D V PFF+
Sbjct: 1   MEKNK-TTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFM 59

Query: 82  FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFS------HAPDE--- 129
            ++G+ I +A+ +     +RA   K  I + + L++   L  G F+      + P E   
Sbjct: 60  CVMGMCIYIAMSKFNFACNRATVYK--ILKRMVLIYLVGLAIGWFAKFCYRWNNPQEGAD 117

Query: 130 ----LTYGV-DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
               L Y V     IRL GVL R+A+ Y + +L+ I    V+ K                
Sbjct: 118 FFSQLWYMVWSFDKIRLTGVLARLAVCYGITALLAI---TVRHKHLP------------- 161

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           +++   +L  ++ L+ G                  G  ++ T           N +  +D
Sbjct: 162 YIVGGLLLAYFVILMAGN-----------------GFAYDET-----------NILSIVD 193

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R VL   HMYH                                +PEGLLS++ SI  T++
Sbjct: 194 RAVLTDAHMYHDNG----------------------------IDPEGLLSTLPSIAHTLL 225

Query: 305 GVHFGHVIIH-----TKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTL 354
           G   G ++        +   AR    +T    L + G +L F   +     P+NK++++ 
Sbjct: 226 GFIIGGMLFRKADAGVQQLDARANITLTKVVPLFVVGTSLLFAGYLLSYGCPINKKVWSP 285

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           ++V VT G A+++ +    ++D+   K         G+N + ++V++
Sbjct: 286 TFVLVTCGLASMLLALFTWIIDVKGKKQWSKFFEVFGVNPLFLFVLS 332


>gi|326801867|ref|YP_004319686.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326552631|gb|ADZ81016.1| hypothetical protein Sph21_4499 [Sphingobacterium sp. 21]
          Length = 376

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 136/335 (40%), Gaps = 79/335 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  +LD+FRG+ +  MI+V+  G      WP ++HA W+G    D V P FLF VG A++
Sbjct: 7   RFTALDVFRGMTICFMIIVNSPGSGATPYWP-LNHATWHGFTPTDLVFPSFLFAVGNALS 65

Query: 90  LALKRIPD-RADAVKKVIFRTLKLLF-WGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
            + ++     +  V   IF+   L+F  G L+     F            +   R+ GVL
Sbjct: 66  FSERKFQYLSSKQVLLTIFKRAALIFLLGFLMYWFPFFKITEQHEIISFPLHETRVFGVL 125

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL YL  +L   + +               R Y   WL  A +L+ ++ LL      
Sbjct: 126 QRIALCYLFTALAVYYVR---------------RKYL-VWLAIALLLIYWVILL------ 163

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                 G  A  +   NA+  +D  +LG +H+YH          
Sbjct: 164 --------------------IFGTDAPYSLEGNAIFKLDLWLLGESHLYHSHG------- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                F+PEGLLS++ +I + I G   G  +    G +  L + 
Sbjct: 197 -------------------IIFDPEGLLSTIPAITNAIAGYLVGKYLQEKGGTVQSLGKL 237

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           + +G   ++  L   +    P+NK++++ S+  +T
Sbjct: 238 LIIGAIGVLIALV--WNQVFPINKKIWSSSFALLT 270


>gi|242062184|ref|XP_002452381.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
 gi|241932212|gb|EES05357.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
          Length = 96

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           L G+LQRIA++YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLY
Sbjct: 1   LMGILQRIAIAYLLAAICEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYTILLY 57

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
           G YVPDW++ I    S +  K F+V CGVR      CNAVG
Sbjct: 58  GIYVPDWEYKISGPGSTE--KSFSVKCGVRGDTGLACNAVG 96


>gi|408673239|ref|YP_006872987.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387854863|gb|AFK02960.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 370

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 95/410 (23%), Positives = 159/410 (38%), Gaps = 108/410 (26%)

Query: 27  KSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLA 73
            S   TQRL SLD  RG  +           AL     H    W   ++SH  WNG    
Sbjct: 2   SSSSSTQRLYSLDALRGFDMFWIMGGEDFFHALSEATHHPAAIWIATQLSHVAWNGFRFY 61

Query: 74  DFVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D + P FLFI GV    ++   +++  D+   ++++I R L L+  G++   G       
Sbjct: 62  DLIFPLFLFISGVSTPYSVGREIEKGIDKQAILRRIIKRGLILVLLGVIYNNGLQIK--- 118

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                ++  IR   VL RI L+Y+   ++ ++                   Y W      
Sbjct: 119 -----ELSQIRFPSVLGRIGLAYMFACIIYVYASQRWQ-------------YVW------ 154

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG-VRAKLNPPCNAVGYIDRKVL 248
                +  LL G Y       +I K        FN   G V   L    N   ++DR ++
Sbjct: 155 -----FSGLLIGYY-------LIMK--------FNAAPGFVAGDLTIQGNFASWVDRMLV 194

Query: 249 -GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G  H+  H                               +PEGL S++ +I + ++G+ 
Sbjct: 195 PGRLHLKIH-------------------------------DPEGLFSTIPAIGTGLLGIF 223

Query: 308 FGHVI--IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
            G+ +  + TK    +      +G   LI G +  +    P+NK L+T S++    G + 
Sbjct: 224 AGNYLKNVTTKTPTQKAATLAILGLVCLIIGGS--WGIVFPINKNLWTSSFMLYAGGCSL 281

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYY 415
           L+ S  Y ++D+   +        IG+N++L+Y+       +G +  WYY
Sbjct: 282 LLLSLFYFIIDVLQYQRWAFFFTIIGLNSILIYM-------SGLVINWYY 324


>gi|456734835|gb|EMF59605.1| N-acetylglucosamine transporter NagX [Stenotrophomonas maltophilia
           EPM1]
          Length = 355

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|190575857|ref|YP_001973702.1| transmembrane protein [Stenotrophomonas maltophilia K279a]
 gi|190013779|emb|CAQ47415.1| putative transmembrane protein [Stenotrophomonas maltophilia K279a]
          Length = 355

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|126700401|ref|YP_001089298.1| membrane protein [Clostridium difficile 630]
 gi|115251838|emb|CAJ69673.1| putative membrane protein [Clostridium difficile 630]
          Length = 370

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 76/141 (53%), Gaps = 16/141 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +GV I 
Sbjct: 3   SRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTIP 62

Query: 90  LA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R+ GVL
Sbjct: 63  ISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRILGVL 113

Query: 146 QRIALSYLLVSLVEIFTKDVQ 166
           QR+ L Y + SLV +  K + 
Sbjct: 114 QRMGLVYFVTSLVYLLLKKLN 134



 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 188 FEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKILVMSIILLIGAFI--FNQYF 245

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 246 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 299


>gi|146300862|ref|YP_001195453.1| hypothetical protein Fjoh_3117 [Flavobacterium johnsoniae UW101]
 gi|146155280|gb|ABQ06134.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 380

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/402 (24%), Positives = 157/402 (39%), Gaps = 112/402 (27%)

Query: 34  RLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLADFVMPFF 80
           RL SLD+ RG  +  ++  +H           P       ++ HA WNG    D + P F
Sbjct: 8   RLISLDVLRGFVMFWIMSGEHIIHALAKAAPIPIFIWMSSQLHHAEWNGITFYDMIFPVF 67

Query: 81  LFIVGVAIALA---------LKRIPDRADAVKKVIF-----RTLKLLFWGILLQGGFSHA 126
           LF+ GV++  +         +K   D   A K+ I+     RT  LL  G ++ G     
Sbjct: 68  LFVAGVSMPFSFEKKMKLAGVKEPKDLPKAEKRKIYLSMLRRTCILLVLGFVVNGLLRFD 127

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
             + T        R   VL RI L++    ++ +   +   K Q +  F I         
Sbjct: 128 GFDQT--------RFASVLGRIGLAWFFAGIIYL---NFDFKKQLICFFGI--------- 167

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
                L+ Y A +    VP++   ++ K+ +                       GYIDR 
Sbjct: 168 -----LIGYYAAMKWIPVPNFGAGVLTKEGS---------------------LEGYIDRL 201

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIG 305
            L                               P   H+  ++PEG+ S++ +I + ++G
Sbjct: 202 FL-------------------------------PGRLHSTVYDPEGIFSTIPAIATALLG 230

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFA---LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
           V  G   +  K   +   + + M  A   L+I GL        P+NK L+T S+VC   G
Sbjct: 231 VFIG-TFLKAKCPFSINIKLLLMALAAVVLIIAGLIWDIN--FPINKHLWTSSFVCFVGG 287

Query: 363 AAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEG 404
            + L F   YA++D+   +    PL  IG N++L+Y+ AAEG
Sbjct: 288 FSILFFVFFYAIIDLLGFQKWAFPLVLIGSNSILIYI-AAEG 328


>gi|84625357|ref|YP_452729.1| hypothetical protein XOO_3700 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188575197|ref|YP_001912126.1| hypothetical protein PXO_04319 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|84369297|dbj|BAE70455.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188519649|gb|ACD57594.1| membrane protein, putative [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 388

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 71/133 (53%), Gaps = 5/133 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +   G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNMPHLQFLGRVSKRAALIALCGVLMYWFPFFHLQPDGGWAFTTVDQVRLTGVL 137

Query: 146 QRIALSYLLVSLV 158
           QRI L YL  +L+
Sbjct: 138 QRIGLCYLAAALL 150


>gi|311031971|ref|ZP_07710061.1| hypothetical protein Bm3-1_15792 [Bacillus sp. m3-13]
          Length = 370

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 90/387 (23%), Positives = 154/387 (39%), Gaps = 96/387 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCNLADFVMPFFL 81
            +  +H+  +R  S+D+ RG+ V + + V     G ++  + HA W G  + D V P FL
Sbjct: 1   METNNHITKKRYRSIDVTRGIVVLVSVFVSALPGGAEYDFLRHAYWYGLTITDLVFPAFL 60

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
            + G+ +A+  ++     D ++    RT  L+ +G+L            ++  D+  +R 
Sbjct: 61  TVYGIGLAIVYRKGVRWKDLLR----RTFLLVLYGLLFN-------LIASWSFDLSTLRF 109

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV--VYLALL 199
            GVLQ  A++ L V ++    K                   W  ++A  +++   YL++L
Sbjct: 110 TGVLQLFAITGLGVVVLSYLAKG------------------WKSMLALGMVIATAYLSIL 151

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTC--GVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
                                 + +V C  GV  +    CN  G +D  V G  HMY   
Sbjct: 152 V---------------------ISSVGCEGGVPQR---DCNPSGVVDVLVFGEKHMYAQG 187

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                                        F+PEG+LS  S++ +   G   G V+   K 
Sbjct: 188 --------------------------EKGFDPEGILSIFSALSNVAFGFAVGLVLNGRKQ 221

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
            L R+      G ++ +  L   F N I LNK+L+T S+  + SG   L+ + ++ L+D 
Sbjct: 222 ILQRV-----FGISIGLISLAFIFNNFIELNKRLWTPSFAILASGLTLLLLAILFYLIDT 276

Query: 378 WNLKY------PFLPLAWIGMNAMLVY 398
              K       P   L   G N+ L+Y
Sbjct: 277 RERKQGKLTGIPLWYLEAFGRNSFLIY 303


>gi|442611023|ref|ZP_21025729.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
           luteoviolacea B = ATCC 29581]
 gi|441746951|emb|CCQ11791.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
           luteoviolacea B = ATCC 29581]
          Length = 373

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 101/391 (25%), Positives = 158/391 (40%), Gaps = 96/391 (24%)

Query: 33  QRLASLDIFRGLAV-----------ALMILVDHAGGDWPE--ISHAPWNGCNLADFVMPF 79
           +RLASLD  RG  +           AL +L    G    E    H+ W+G    D + P 
Sbjct: 6   KRLASLDALRGFDMMWILGGQGIFAALFVLTGWTGWRTFEAHTVHSDWHGFTFYDLIFPL 65

Query: 80  FLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           F+F+ GVA+ L+ KRI      +R    +K + R + L   GIL   G+           
Sbjct: 66  FIFLSGVAMGLSPKRIDHLPMSERTPIYRKALKRFVLLCLLGILYNHGWGTGIPA----- 120

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           D   IR   VL RIA ++L+ +L+         K  +    SI   Y W WL        
Sbjct: 121 DFSEIRYSSVLGRIAFAWLICALL---VWHFSLKQVAYIGLSILITY-WIWL-------- 168

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GINHM 253
                   Y+P     +   D+ D              +    NA  +ID+  L G+ + 
Sbjct: 169 -------CYIP-----VPGGDAGDL------------SIGGSWNA--WIDQNFLPGVRY- 201

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                        QD P                 +PEG+LSS+ ++++ I G+  G +I 
Sbjct: 202 -------------QDRP----------------VDPEGILSSLPAVVNAIAGLFAGQLIK 232

Query: 314 HT--KGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
               KG    +    + G  +L   L   +    P+NK+L+T S+  VT G +A++ +  
Sbjct: 233 RAPEKGEWKCVALLFSGG--VLFIALGWLWDLVFPVNKELWTSSFTLVTIGWSAILLAVF 290

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           Y LVDI   +    P   IG N++++YV ++
Sbjct: 291 YVLVDILPGQKAAYPFVIIGANSIIIYVASS 321


>gi|218260820|ref|ZP_03475939.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
           DSM 18315]
 gi|423344001|ref|ZP_17321714.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
           CL02T12C29]
 gi|218224343|gb|EEC96993.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
           DSM 18315]
 gi|409213863|gb|EKN06876.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 371

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 99/398 (24%), Positives = 163/398 (40%), Gaps = 108/398 (27%)

Query: 27  KSHLKTQRLASLDIFRG-----------LAVALMILVDH----AGGDWPEISHAPWNGCN 71
           +   +++RL SLD  RG           L VAL  L  +      GD  ++ H  W+G  
Sbjct: 2   EKQKQSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNPFFQVIGD--QMHHVKWDGLT 59

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
             D + P FLFI G++   +L++  ++    AD  +K+I R L L+  G +  G  +   
Sbjct: 60  HHDTIFPLFLFIAGISFPFSLEKQREQGKTDADIYRKIIRRGLTLVVLGFVYNGLLNF-- 117

Query: 128 DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
                  D    R   VL RI L+++  +L+ + T+ +                   W+ 
Sbjct: 118 -------DFEHQRYASVLGRIGLAWMFGALIFVNTRTITRV----------------WIT 154

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +LV Y  LL     PD           +   VF +            + VGY+DR +
Sbjct: 155 VA-ILVGYWLLLAFVPAPD----------GNGAGVFTM----------EGSLVGYVDRLL 193

Query: 248 L-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           L G  H+  H                               +PEG+LS+V ++ + ++G+
Sbjct: 194 LPGRLHLTVH-------------------------------DPEGILSTVPAVATALLGM 222

Query: 307 HFGHVIIHTKGHLARLKQ---WVTMGFALLIFGL--TLHFTNAIPLNKQLYTLSYVCVTS 361
             G  I   +  L   K+    V  G  LL  GL  +L F    P+NK L+T S+VCV  
Sbjct: 223 FTGEFIKMQREGLTDKKKVGGLVIAGAVLLAVGLLWSLFF----PINKNLWTSSFVCVVG 278

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
             +  +F+  + +VD+   +   L    IG+N++ +Y+
Sbjct: 279 AYSVWMFALFFYVVDVLEFRKWTLFFKVIGVNSITIYL 316


>gi|21241481|ref|NP_641063.1| hypothetical protein XAC0710 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21106824|gb|AAM35599.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 388

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/287 (27%), Positives = 116/287 (40%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL  VL
Sbjct: 78  MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTCVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +                         +   +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAP-----------------VCLALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|157960931|ref|YP_001500965.1| hypothetical protein Spea_1103 [Shewanella pealeana ATCC 700345]
 gi|157845931|gb|ABV86430.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345]
          Length = 394

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 99/429 (23%), Positives = 171/429 (39%), Gaps = 105/429 (24%)

Query: 15  IISEPDVSDQQEKSHLKTQ-----RLASLDIFRGLAV-----------ALMILVDHAGGD 58
           II +P VS     +   ++     RL SLD  RG  +           AL++L   AG +
Sbjct: 4   IIDKPRVSIASVAAESVSKPAAKPRLKSLDALRGFDMFWILGGEAIFAALLLLTGWAGFN 63

Query: 59  W--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLK 111
           W   ++ H+ W+G    D + P F+F+ GVA+ L+ KR+     P R    +  I R L 
Sbjct: 64  WFDSQMHHSTWHGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPQRMPLYQHAIKRLLL 123

Query: 112 LLFWGILLQGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ 170
           LL +G++   G+ + AP  L        IR   VL RIA ++   +              
Sbjct: 124 LLLFGVIYNHGWGTGAPFAL------GDIRYASVLGRIAFAWFFCA-------------- 163

Query: 171 SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVR 230
                    L  WH  +   ++   L L+    +  + F  +  +S  + +  ++   V 
Sbjct: 164 ---------LLVWHTSLRTQIVTAILVLVGYAVLQTFPFMTLG-NSGAFSQTGSINAAVD 213

Query: 231 AKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPE 290
           + L P             GI +                               +A  +PE
Sbjct: 214 SLLLP-------------GITYQ------------------------------NAAVDPE 230

Query: 291 GLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQ 350
           G+LS+V ++++ + GV  GH I+  +      K  +     L +  L    +   P+NK 
Sbjct: 231 GILSTVPAVVNGLFGVFVGHFIVKPQVKGEWFKVAILALSGLGLLVLGWAVSPWNPVNKT 290

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AA 402
           L+T S+  VTSG + L  +  Y ++D+  ++        IG N++++Y+         AA
Sbjct: 291 LWTSSFTLVTSGWSILFLALFYTIIDVLKVQKWAFFFTVIGCNSIVIYIATSIVNWKYAA 350

Query: 403 EGIFAGFIN 411
             +F G I+
Sbjct: 351 TSLFGGVIS 359


>gi|390946357|ref|YP_006410117.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
 gi|390422926|gb|AFL77432.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
          Length = 369

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/386 (23%), Positives = 156/386 (40%), Gaps = 93/386 (24%)

Query: 32  TQRLASLDIFRGLAVALMI----LVDHAGGDWP---------EISHAPWNGCNLADFVMP 78
            +RL SLD  RG+ +  ++    LV      WP         ++ HA WNG  + D + P
Sbjct: 4   NRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDTIFP 63

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
            FLFI GVA   +L +   R    K+++ R  +    L   G++  G F          +
Sbjct: 64  LFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE---------L 114

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +   +R+  VL RI L+++  +L+ ++   V+ +                  +A  +L+ 
Sbjct: 115 NFSSLRIASVLGRIGLAWMFAALLCVYC-SVRTRIA----------------VAGIILIG 157

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y  LL     PD          AD              L+      G+IDR+ L  + +Y
Sbjct: 158 YSLLLGLVVAPD------APVGAD-------------PLSVEGCLAGWIDRQYLPGHILY 198

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                           F+PEG+LS++ +++S + G+  G  ++ 
Sbjct: 199 ------------------------------GAFDPEGILSTLPAVVSALFGMFTGEFLLD 228

Query: 315 -TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             +G     K +     AL I    L +   +P+NK L++ S+ CV SG +  + +  Y 
Sbjct: 229 GRRGLSGSWKAFYMAVAALAITTAGLCWNLIMPVNKNLWSSSFTCVVSGYSLGMTALFYY 288

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYV 399
           L+D+   K        IG+N++ +Y+
Sbjct: 289 LIDVCGYKRWTFVFRVIGLNSITIYM 314


>gi|194367192|ref|YP_002029802.1| hypothetical protein Smal_3420 [Stenotrophomonas maltophilia
           R551-3]
 gi|194349996|gb|ACF53119.1| conserved hypothetical protein [Stenotrophomonas maltophilia
           R551-3]
          Length = 355

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRALDVALRPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|255037955|ref|YP_003088576.1| hypothetical protein Dfer_4208 [Dyadobacter fermentans DSM 18053]
 gi|254950711|gb|ACT95411.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 371

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/413 (22%), Positives = 162/413 (39%), Gaps = 106/413 (25%)

Query: 27  KSHLKTQRLASLDIFRGLAV--------ALMILVDHAGGDWP-----EISHAPWNGCNLA 73
           K    +QRL SLD  RG  +           +L    G  W      + +H  WNG    
Sbjct: 2   KDSAPSQRLLSLDTLRGFDMFWISGGEEIFHVLAKVTGWSWAIVLAHQFTHPDWNGFRAY 61

Query: 74  DFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
           D + P FLF+ GV+   +L     K +P  +  V+KVI R + L+F GI+   G      
Sbjct: 62  DLIFPTFLFMAGVSTPFSLGSRLEKGVPP-SQLVRKVIQRGIILVFLGIIYNNG------ 114

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
              +  +   +R   VL RI L+ +   ++ ++                   Y   W+  
Sbjct: 115 --IFETEWSQMRYPSVLARIGLAGMFAQIIYLYFG-----------------YRARWIWF 155

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             +L+ Y   +    VP                     CG    L   CN   Y D  ++
Sbjct: 156 GGLLIGYYLFMMFYPVPG--------------------CGA-GLLTIDCNPASYFDSLII 194

Query: 249 -GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G  H+  H                               +PEGL+S++ +I + ++G+ 
Sbjct: 195 PGRLHLTIH-------------------------------DPEGLVSTIPAIATGLMGIF 223

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLT-LHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            G ++  +   L++  + + + FA ++  L  L +    P+NK L+T S+V    G + L
Sbjct: 224 AGELLRTSHEVLSQKSKVIYLVFAGVVSLLVCLVWDYFFPINKNLWTSSFVLCAGGFSTL 283

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFIN 411
           + +  Y +VD+ N +   L    IGMN++++Y++         A  +F G ++
Sbjct: 284 LLALFYWIVDVLNYRKWTLFFVVIGMNSIVIYMVGRFIDFGYTARALFGGILS 336


>gi|334366956|ref|ZP_08515871.1| putative membrane protein [Alistipes sp. HGB5]
 gi|313156833|gb|EFR56273.1| putative membrane protein [Alistipes sp. HGB5]
          Length = 370

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/386 (23%), Positives = 156/386 (40%), Gaps = 93/386 (24%)

Query: 32  TQRLASLDIFRGLAVALMI----LVDHAGGDWP---------EISHAPWNGCNLADFVMP 78
            +RL SLD  RG+ +  ++    LV      WP         ++ HA WNG  + D + P
Sbjct: 5   NRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDTIFP 64

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
            FLFI GVA   +L +   R    K+++ R  +    L   G++  G F          +
Sbjct: 65  LFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE---------L 115

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +   +R+  VL RI L+++  +L+ ++   V+ +                  +A  +L+ 
Sbjct: 116 NFSSLRIASVLGRIGLAWMFAALLCVYC-SVRTRIA----------------VAGIILIG 158

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y  LL     PD          AD              L+      G+IDR+ L  + +Y
Sbjct: 159 YSLLLGLVVAPD------APVGAD-------------PLSVEGCLAGWIDRQYLPGHILY 199

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                           F+PEG+LS++ +++S + G+  G  ++ 
Sbjct: 200 ------------------------------GAFDPEGILSTLPAVVSALFGMFTGEFLLD 229

Query: 315 -TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             +G     K +     AL I    L +   +P+NK L++ S+ CV SG +  + +  Y 
Sbjct: 230 GRRGLSGSWKAFYMAVAALAITTAGLCWNLIMPVNKNLWSSSFTCVVSGYSLGMTALFYY 289

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYV 399
           L+D+   K        IG+N++ +Y+
Sbjct: 290 LIDVCGYKRWTFVFRVIGLNSITIYM 315


>gi|404404699|ref|ZP_10996283.1| hypothetical protein AJC13_04673 [Alistipes sp. JC136]
          Length = 376

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 162/396 (40%), Gaps = 97/396 (24%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCN 71
           Q       QRL SLD  RG           L VAL  L   +  DW   ++ HA W+G  
Sbjct: 4   QPTKPTAPQRLLSLDALRGFDMFFIMGFAGLVVALCKLRPGSFADWMSAQMGHAAWDGFF 63

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVK------KVIFRTLKLLFWGILLQGGFSH 125
             D + P FLFI G++   +L +   R   V+      KVI R L L+  G++  G F+ 
Sbjct: 64  HHDTIFPLFLFIAGISFPFSLAK--QREKGVRERSIYTKVIRRGLTLVALGLVYNGLFN- 120

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                   +D   +RL  VL RI L+++  +++ I             RF I        
Sbjct: 121 --------LDFATLRLPSVLGRIGLAWMFAAMLFI-------------RFGIRTRIA--- 156

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            +AA +LV Y  LL        QF            V          L    N VGYIDR
Sbjct: 157 -LAAVILVGYGLLL--------QF------------VAAPDAAGAGPLTEAGNIVGYIDR 195

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            ++   H+Y +                              F+PEGLLS++ +I++ ++G
Sbjct: 196 TIMP-AHLYGNRG----------------------------FDPEGLLSTLPAIVTAMLG 226

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAA 364
           +  G  +  ++  ++  ++ + M     +  +     + + P+NK+L+T ++V      +
Sbjct: 227 MFTGEFVRRSEEQISGSRKALRMAAGAALLLVLALCLDPLQPVNKKLWTPAFVFAAGAYS 286

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
             +F+  Y ++D+   +        IG+N++ +Y++
Sbjct: 287 LGMFALFYYIIDVRQWRRWSYFFQVIGLNSITIYMV 322


>gi|374384982|ref|ZP_09642493.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
           12061]
 gi|373227040|gb|EHP49361.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
           12061]
          Length = 382

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/396 (23%), Positives = 162/396 (40%), Gaps = 84/396 (21%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWPE-----ISHAPWNG 69
           D  + +H   +RL SLD  RG  +  ++        L   A  +W        +H  W G
Sbjct: 2   DTMKSTHSAAKRLESLDALRGFDLFFLVALGPLMNSLARAADAEWFNNWMGIFNHVSWEG 61

Query: 70  CNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSH 125
            +  D +MP FLF+ G+++  AL R   +PD+   ++++  R L L  +G++ QG     
Sbjct: 62  FSPWDLIMPLFLFMSGISMPFALARYKSMPDKRPLLRRLGKRILLLWIFGMICQGNLLGL 121

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
            PD        ++      LQ IA  YL+ +L+ +FT+    + Q +             
Sbjct: 122 NPD--------KIYLYSNTLQAIAAGYLITALLFLFTR---RRTQLI------------- 157

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
             A  +L++Y A +        QF  ++     YG              P  N   +ID+
Sbjct: 158 -TAVLLLLIYWAAM--------QFIQVD----GYGG---------GNYTPQGNLAEWIDK 195

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            VLG           R  A   D       +     W H  +    +LSS++  ++ + G
Sbjct: 196 VVLGR---------FRDTAQLVDG------KVVVAEWYHYTW----ILSSLNFGVTVLTG 236

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
           +  G++         +LK +  +G  ++  G   +F   +P+ K ++T S V V+SG   
Sbjct: 237 LFAGYIAKDKIEEKRKLKLYFGIGATMVTIGWLWNFQ--MPVIKTIWTSSMVLVSSGYCF 294

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+    Y  +D    +     L   GMN++L Y++A
Sbjct: 295 LLMGLFYYWIDYKGHRSGITWLKVYGMNSILAYMLA 330


>gi|157374353|ref|YP_001472953.1| hypothetical protein Ssed_1214 [Shewanella sediminis HAW-EB3]
 gi|157316727|gb|ABV35825.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
          Length = 378

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 79/141 (56%), Gaps = 12/141 (8%)

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLT 338
           ++ + P +PEG+LS++ ++ + + GV  GH II  H KG   ++   +  G A L  G  
Sbjct: 205 TYQNRPLDPEGILSTIPAVANALAGVFVGHFIIKPHPKGEWFKVVYMLVAGAAFLGLGWL 264

Query: 339 LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
           L F   +P+NK+L+T S+  VT G + ++ +  YA+VD+   +    P   IG NA+++Y
Sbjct: 265 LDFI--VPVNKELWTSSFTLVTIGWSLILLTVFYAIVDLLKWQTLAFPFVVIGCNAIIIY 322

Query: 399 VMA--------AEGIFAGFIN 411
           + +        A+G+F G IN
Sbjct: 323 LASSLIDWKYIAQGLFGGIIN 343



 Score = 45.8 bits (107), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 73/156 (46%), Gaps = 29/156 (18%)

Query: 24  QQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGC 70
           +  ++ +  +RL SLD  RG           L   L+      G  W   ++ H+ W+G 
Sbjct: 2   EVAQAKVSKRRLMSLDALRGFDMFWILGGEVLFAGLLAWTGWQGWQWFDTQMHHSEWHGF 61

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV------IFRTLKLLFWGILLQGGFS 124
              D + P F+F+ GVA+ L+ KR+ D+    K++      + R L LLF+GIL   G+ 
Sbjct: 62  TFYDLIFPLFIFLSGVALGLSPKRL-DKLPIAKRMPLYIHAVKRLLLLLFFGILYNHGWG 120

Query: 125 HAPDELTYGVDVRM--IRLCGVLQRIALSYLLVSLV 158
                   GV V +  +R   VL RIA ++   +++
Sbjct: 121 T-------GVPVVLDEVRYASVLGRIAFAWFFAAIL 149


>gi|224025514|ref|ZP_03643880.1| hypothetical protein BACCOPRO_02254, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224018750|gb|EEF76748.1| hypothetical protein BACCOPRO_02254 [Bacteroides coprophilus DSM
           18228]
          Length = 298

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 80/136 (58%), Gaps = 11/136 (8%)

Query: 33  QRLASLDIFRGLAVALMILVD-HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           +RL S+DIFRG+ +  MILV+  AGG +  + H P  G  +AD V P F+FI+G ++ L+
Sbjct: 17  KRLLSIDIFRGITIFFMILVNTQAGGSFDFLIHIPGYGWRIADLVYPSFIFIMGASMYLS 76

Query: 92  LKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           +++  +    D  K +  RT+ +   GI+    F+  P    +  ++  +R+ GVLQRIA
Sbjct: 77  MRKYVEAPPTDLYKHIFRRTVLIFLMGII----FNWIP----FDQNLLDVRILGVLQRIA 128

Query: 150 LSYLLVSLVEIFTKDV 165
           + YL+ SL+ I  + +
Sbjct: 129 IVYLICSLLVIKVRSI 144



 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 65/114 (57%), Gaps = 4/114 (3%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA-RLKQWVTMGFALLIFGLTLHFTNAIP 346
           +PEGLLSS+ S+++ IIG     ++  T+  L  RL +  T+  +++     LH+T  +P
Sbjct: 188 DPEGLLSSIPSVVNAIIGYVSARIL--TENELKERLIKMSTIAISMIALAYILHWT-ILP 244

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           + K  ++ S+  +TSG + L +  ++ + D+W  K   +    +G N+++ Y++
Sbjct: 245 IYKTYWSSSFGLLTSGISLLAWIVVHLICDVWGKKDWGIAFDVLGKNSIVCYLL 298


>gi|154492358|ref|ZP_02031984.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
           43184]
 gi|423722056|ref|ZP_17696232.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
           CL09T00C40]
 gi|154087583|gb|EDN86628.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
           43184]
 gi|409242758|gb|EKN35518.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
           CL09T00C40]
          Length = 370

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 95/417 (22%), Positives = 168/417 (40%), Gaps = 112/417 (26%)

Query: 27  KSHLKTQRLASLDIFRG-----------LAVALMILVDHA-----GGDWPEISHAPWNGC 70
           +   +++RL SLD  RG           L VAL  L  ++     GG   ++ H  W+G 
Sbjct: 2   EKQKQSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGL 58

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHA 126
              D + P FLFI G++   +L++  ++    +D  K+++ R + L+  G +  G     
Sbjct: 59  THHDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF- 117

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                   D   +R   VL RI L ++  +L+                F  FR     W+
Sbjct: 118 --------DFANLRCASVLARIGLGWMFAALL----------------FVHFRTSVRAWI 153

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
            A  +LV Y   +    VP           A+ G                 N VGY+DR 
Sbjct: 154 -AGTILVGYWVWIAFIPVP----------GAEAGP-----------FTLEGNWVGYVDRL 191

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF-EPEGLLSSVSSILSTIIG 305
           +L                               P   H  F +PEGLLS++  I++ ++G
Sbjct: 192 LL-------------------------------PGRLHQGFFDPEGLLSTLPGIVTAMLG 220

Query: 306 VHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           +  G  I +  +G   + K    +   +L+  + + ++   P+NK+L++ S+VCV    +
Sbjct: 221 MFTGEFIKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVFPINKKLWSSSFVCVVGAYS 280

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAGFINGW 413
             +F+  Y +VD+   +        IG+N++ +Y+          ++G+F G + GW
Sbjct: 281 VWMFALFYYIVDVLGWRKWTFFFQVIGVNSITIYLAQKFIDFSYTSQGLFGGLV-GW 336


>gi|338212226|ref|YP_004656281.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306047|gb|AEI49149.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 369

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 99/387 (25%), Positives = 163/387 (42%), Gaps = 100/387 (25%)

Query: 33  QRLASLDIFRGLAVALMI-----LVDHAGGDW---------PEISHAPWNGCNLADFVMP 78
            RLASLD  RG  + LMI      +   GG            +  H  W+G    DF+ P
Sbjct: 8   SRLASLDALRGFDM-LMISGGGAFLSLMGGKTDSALLNAVAAQFHHPDWDGFTFYDFIFP 66

Query: 79  FFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
            FLF+ GV+++++LK      IP +   ++KV  R L L F G+L +    +AP ++   
Sbjct: 67  LFLFMAGVSLSISLKNGIAKGIP-QYKLMEKVFKRMLILFFLGLLDK----NAPIDI--- 118

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +D   IR   VL RI ++  LV+++ +   +   K Q +  F+I  LY      AA +L+
Sbjct: 119 LDPAHIRYGTVLGRIGIATFLVAILYL---NTGWKTQLIVAFTILGLYY-----AALMLI 170

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                                   DYG            L+   N VG+IDR  +     
Sbjct: 171 ---------------------SVGDYGG---------GNLSFEGNLVGWIDRAFM----- 195

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
              P   R K  T D                       + + +S+   TI G   G +++
Sbjct: 196 ---PG--RLKQTTYDE--------------------LAMTTQLSATCLTIFGSLAGKILL 230

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  +L +   MG   +  GL   +    P+NK L++ S++ +T+G A+L+ +  Y 
Sbjct: 231 DKTTANIKLIRLAGMGVIGIAAGLA--WATVFPINKHLWSSSFILLTAGMASLLVALFYG 288

Query: 374 LVDIWNL-KYPFLPLAWIGMNAMLVYV 399
           ++D+    K+ F     IGMN++++Y+
Sbjct: 289 IMDVLKFTKWAFF-FKVIGMNSLVIYL 314


>gi|423345097|ref|ZP_17322786.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
           CL03T12C32]
 gi|409222883|gb|EKN15820.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
           CL03T12C32]
          Length = 370

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 95/417 (22%), Positives = 168/417 (40%), Gaps = 112/417 (26%)

Query: 27  KSHLKTQRLASLDIFRG-----------LAVALMILVDHA-----GGDWPEISHAPWNGC 70
           +   +++RL SLD  RG           L VAL  L  ++     GG   ++ H  W+G 
Sbjct: 2   EKQKQSRRLLSLDALRGVDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGL 58

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHA 126
              D + P FLFI G++   +L++  ++    +D  K+++ R + L+  G +  G     
Sbjct: 59  THHDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF- 117

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                   D   +R   VL RI L ++  +L+                F  FR     W+
Sbjct: 118 --------DFANLRCASVLARIGLGWMFAALL----------------FVHFRTSVRAWI 153

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
            A  +LV Y   +    VP           A+ G                 N VGY+DR 
Sbjct: 154 -AGVILVGYWVWIAFIPVP----------GAEAGP-----------FTLEGNWVGYVDRL 191

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF-EPEGLLSSVSSILSTIIG 305
           +L                               P   H  F +PEGLLS++  I++ ++G
Sbjct: 192 LL-------------------------------PGRLHQGFFDPEGLLSTLPGIVTAMLG 220

Query: 306 VHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           +  G  I +  +G   + K    +   +L+  + + ++   P+NK+L++ S+VCV    +
Sbjct: 221 MFTGEFIKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVFPINKKLWSSSFVCVVGAYS 280

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGIFAGFINGW 413
             +F+  Y +VD+   +        IG+N++ +Y+          ++G+F G + GW
Sbjct: 281 VWMFALFYYIVDVLGWRKWTFFFQVIGVNSITIYLAQRFIDFSYTSQGLFGGLV-GW 336


>gi|291514403|emb|CBK63613.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 376

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 91/394 (23%), Positives = 158/394 (40%), Gaps = 93/394 (23%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCN 71
           Q      +QRL SLD  RG           L  AL  L      DW   ++ HA WNG  
Sbjct: 4   QPTQPAASQRLLSLDALRGFDMLFIMGFAGLVTALCKLCPGEFSDWMTAQMGHADWNGFF 63

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAP 127
             D + P FLFI G++   +L +  ++  + +    KVI R L L+  G +  G F    
Sbjct: 64  HHDTIFPLFLFIAGISFPFSLAKQREKGMSERSIYLKVIRRGLTLVALGFVYSGLFK--- 120

Query: 128 DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
                 +D   +RL  VL RI L+++  +L+ +   +   + ++V              +
Sbjct: 121 ------LDFATLRLPSVLGRIGLAWMFAALLFV---NFNVRTRAV--------------I 157

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           AA +L+ Y  LL     PD                     G    L    N VGY+DR V
Sbjct: 158 AAAILLGYGLLLQFVAAPD--------------------AGGAGPLTLEGNIVGYVDRIV 197

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
                M  H    R                         F+PEGLLS++ +I++ ++G+ 
Sbjct: 198 -----MPSHLLGGRG------------------------FDPEGLLSTLPAIVTAMLGMF 228

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAAL 366
            G  +  ++   +  ++ + M    +         + + P+NK+L+T ++V      +  
Sbjct: 229 TGEFVRRSEERTSGSRKTLWMAAGAVALLALALCLDPLQPINKKLWTPAFVFAAGAYSLG 288

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           +F+  Y ++D+   +        IG+N++ +Y++
Sbjct: 289 MFALFYYIIDVCQWRRWTYFFKVIGVNSITIYMV 322


>gi|445497063|ref|ZP_21463918.1| putative transmembrane protein [Janthinobacterium sp. HH01]
 gi|444787058|gb|ELX08606.1| putative transmembrane protein [Janthinobacterium sp. HH01]
          Length = 353

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 75/143 (52%), Gaps = 16/143 (11%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R+ S+D  RGL VA M+LV+ AG DW    P + HA W+GC   DF+ P F+ IVG++I 
Sbjct: 2   RINSIDAVRGLTVAAMLLVNDAG-DWSHVYPWLEHAEWHGCTPPDFIFPIFMLIVGISIN 60

Query: 90  LALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           LAL    D   A     + V+ R ++++  G+ L     H    L   ++ R  RL GVL
Sbjct: 61  LALSPRLDAGAATAPLARSVLLRAVRIVLLGLAL-----HVVAMLL--LNGRGFRLFGVL 113

Query: 146 QRIALSYLLVSLVEIFTKDVQDK 168
           QR  + +    L+ I  +  + +
Sbjct: 114 QRTGICFAAAGLLAIHVRGARAQ 136



 Score = 42.4 bits (98), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 59/111 (53%), Gaps = 10/111 (9%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEG+L ++ S+ + I+G+  G  +        + ++    G A ++ G    ++ A+P 
Sbjct: 188 DPEGILGTLPSLATVILGLRAGDWL-----RSGQTRKLALAGVAAMLAGGI--WSLALPF 240

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
           NKQL+T S+V  T G   L  +  + L+D+    +P L  A +G+NA+  Y
Sbjct: 241 NKQLWTSSFVLWTGGFGMLAIALAHQLIDV--RAWPPLGRA-MGVNAIAAY 288


>gi|171914858|ref|ZP_02930328.1| hypothetical protein VspiD_26815 [Verrucomicrobium spinosum DSM
           4136]
          Length = 379

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 89/355 (25%), Positives = 145/355 (40%), Gaps = 80/355 (22%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGI 117
           ++ H  W G    D + P FLF+VGV+I L++ R+     R+ A+ +++ R+  L   G+
Sbjct: 29  QLQHVEWEGFRFYDAIFPLFLFLVGVSIVLSVDRMVARVGRSRALARIVRRSALLFAVGV 88

Query: 118 LLQGGFSHA-PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
              GG +   PD          ++L GVL RIAL YL+ + + +    +  K   +    
Sbjct: 89  FYYGGIARPWPD----------VQLSGVLPRIALCYLVAATLYVL---LPRKGIVIA--- 132

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                      AAC+L  +  +L   +VP       N D                 +  P
Sbjct: 133 ----------TAACLLGYWALML---FVP-----FPNVD-----------------IKTP 157

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
             A   ++ K            +  + + T+ +  EG         C  P     L  S 
Sbjct: 158 AGARKQVEAKT-------REELFAGAASTTKGTFKEGLNLAHYVDACWLPGRKRNLYYSN 210

Query: 297 SSILSTI-------IGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTLHFTNA 344
             +LSTI        GV  G V+ H +    R  +W     V  G A ++ GL       
Sbjct: 211 EGLLSTIPAVASTLFGVLAGWVLTHGR----RSGRWKVGWLVGSGLAGVVIGLLWGLE-- 264

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            P+ K+L+T S+  V +G +A++    Y +VD+W  +    P  WIG NA++VY+
Sbjct: 265 FPVIKRLWTSSFCMVAAGFSAVLLGLFYLVVDLWRWQRWCAPFLWIGGNALVVYI 319


>gi|410463501|ref|ZP_11317013.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
           Maddingley MBC34]
 gi|409983383|gb|EKO39760.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
           Maddingley MBC34]
          Length = 371

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 75/136 (55%), Gaps = 17/136 (12%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           T RL S+D  RGLA+A MI+V++ G     +P++ HA W+G  LAD V P FLF+VGV +
Sbjct: 6   TSRLLSVDALRGLAIAAMIVVNNPGDRRFIYPQLLHAHWHGLTLADVVFPLFLFLVGVCV 65

Query: 89  ALAL-----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           ALA+     +    RA   +K++ R   L   G+        + DEL         R+ G
Sbjct: 66  ALAIDLDKARDAKGRARLWRKILPRAAVLFALGLGETAYLRLSFDEL---------RIPG 116

Query: 144 VLQRIALSYLLVSLVE 159
           VLQRIA+ YL  + ++
Sbjct: 117 VLQRIAVVYLAAAWLQ 132


>gi|150007979|ref|YP_001302722.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
 gi|423331514|ref|ZP_17309298.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
           CL03T12C09]
 gi|149936403|gb|ABR43100.1| putative transmembrane protein [Parabacteroides distasonis ATCC
           8503]
 gi|409230084|gb|EKN22952.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
           CL03T12C09]
          Length = 378

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 104/434 (23%), Positives = 180/434 (41%), Gaps = 97/434 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHA------GGDWP-------EISHAPWNGCNLADFVM 77
           K  RL SLD+ RG  +  +++++          D P         SH  W G +  D VM
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDLVM 64

Query: 78  PFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYG 133
           P FLF+ GV++  +L R   +PD+    +++  R L L  +G++ QG   +  PD     
Sbjct: 65  PLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD----- 119

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
              R+      LQ IA+ YL+ SL+ ++ +  VQ     +G            + AA +L
Sbjct: 120 ---RVYLYSNTLQSIAMGYLIASLLFLYVRIRVQ-----IG------------IAAALLL 159

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           + + A+ +         T+ N     Y               P  N   +IDR VLG   
Sbjct: 160 IFWGAMEF--------ITVGNYGGGSY--------------TPDSNLAEWIDRTVLG--- 194

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                   R +        +G   ++        +    +LSS++  ++ + G+  G+ I
Sbjct: 195 --------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-I 237

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y
Sbjct: 238 LKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFY 297

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFINGWYYGDPHNTLV- 423
             +D    +     L   GMN++L Y++          E +F G     Y G+ +  L+ 
Sbjct: 298 YWIDYKGHRKYTTWLKVYGMNSILAYMLTNVVSFRCIGESLFFG--TQQYLGNYYPVLIA 355

Query: 424 -CFLFIISYILHSF 436
            C + +I  +L++F
Sbjct: 356 MCNVSVIYVLLYAF 369


>gi|295136516|ref|YP_003587192.1| hypothetical protein ZPR_4697 [Zunongwangia profunda SM-A87]
 gi|294984531|gb|ADF54996.1| conserved hypothetical protein [Zunongwangia profunda SM-A87]
          Length = 371

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/144 (36%), Positives = 77/144 (53%), Gaps = 12/144 (8%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +   R  SLDI RG+ VALMILV++ G  W  I     HA W+G  L D V P FLF+VG
Sbjct: 1   MARSRYLSLDILRGMTVALMILVNNPG-SWATIYAPFKHAAWHGFTLTDLVFPTFLFVVG 59

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAP-----DELTYGVDVRMIR 140
            A++ + K++   + +  + + +T K      L+  G S+ P     D      ++  IR
Sbjct: 60  NAMSFSFKKM--NSWSTPEFLTKTFKRAAIIFLIGLGLSYYPFVRRTDGEFILKNILDIR 117

Query: 141 LCGVLQRIALSYLLVSLVEIFTKD 164
           + GVLQRIA+ YLL ++   F K 
Sbjct: 118 IMGVLQRIAVCYLLAAIAIRFLKK 141



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 61/117 (52%), Gaps = 2/117 (1%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEGLLS + ++++ I G   G  I  +      + Q V  G A++   L L +   +
Sbjct: 195 PFDPEGLLSCLPAVVNVIFGYLAGRFIQQSVNKKKLVIQLVIAGLAMI--SLALVWDVIL 252

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           P+NK ++T SYV +++G   ++  A+  +++I   K         G N + ++V++ 
Sbjct: 253 PVNKPIWTSSYVILSTGWDFIILGALIGILEIAEFKSWSRFFEPFGKNPLFIFVLSG 309


>gi|224537871|ref|ZP_03678410.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|423227284|ref|ZP_17213748.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|224520557|gb|EEF89662.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|392624424|gb|EIY18516.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 373

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 70/127 (55%), Gaps = 12/127 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
             RL +LD+ RG+ +A MILV++ G  W  +     HA W+G    D V PFF+FI+GVA
Sbjct: 5   NNRLLALDVIRGITIAGMILVNNPG-SWQSVYAPLQHARWHGLTPTDLVYPFFMFIMGVA 63

Query: 88  IALALKRIPDRADAVK-KVIFRTLKLLFWGILLQGGFSHAPDELTYGV-DVRMIRLCGVL 145
           I  +L++       V  K+I RT+ L   GI L         +L YG      +R+ GV+
Sbjct: 64  IHFSLRKFDKLNTTVSLKIIRRTVALFAVGIALD-----CFSKLCYGTFSWEHLRILGVM 118

Query: 146 QRIALSY 152
           QR+AL+Y
Sbjct: 119 QRLALAY 125



 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 74/164 (45%), Gaps = 24/164 (14%)

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
           N +  ID K+LG  H+    A   S A                      FEPEGLLS++ 
Sbjct: 167 NLIAVIDVKLLGAGHLIKETAEGGSFA----------------------FEPEGLLSTIP 204

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
                + G   G +I     +  R++Q    G  LL  G  L + +  P+NK+L+T SY 
Sbjct: 205 CWAHVLFGTFVGSLITGIAENKERIRQIALFGTVLLFAGFLLQYLD--PINKKLWTASYT 262

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
            +TSG A+L+ + +  ++D+   K         G+N + +YV+A
Sbjct: 263 LITSGTASLLLALLIDIIDVRQKKRWCRFFEAFGVNPLFMYVVA 306


>gi|343083133|ref|YP_004772428.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351667|gb|AEL24197.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 381

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 66/126 (52%), Gaps = 6/126 (4%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            R  SLD+ RGL +ALM++V+  G     +  ++HA W+G  L D V P FLF+VG A++
Sbjct: 12  TRYQSLDVLRGLTLALMVIVNTPGDGSTSFGPLTHADWHGLTLTDLVFPSFLFVVGNAMS 71

Query: 90  LALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQ 146
            +L +   +       KV  RT  +   G+LL    F    D      D   IR+ GVLQ
Sbjct: 72  FSLGKFKLKGGKAYFSKVFKRTALIFIIGLLLTAFPFFRVNDSGVVPYDFTSIRILGVLQ 131

Query: 147 RIALSY 152
           RIAL Y
Sbjct: 132 RIALCY 137



 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 28/116 (24%), Positives = 63/116 (54%), Gaps = 2/116 (1%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS+  ++++ ++G   G + I  +G       W+ M FA+++  +   +    P
Sbjct: 206 FDPEGLLSTFPAMVNVLLGYWVG-LQIQKRGGDIETVLWLAM-FAVILLVVGYLWDYGFP 263

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           +NK+++T S+  VT G + L  + +  ++++ ++K         G N + +Y+++ 
Sbjct: 264 INKKIWTSSFTLVTVGYSTLTLALLMFILEVRSIKGWAYFFEVFGKNPLALYILSG 319


>gi|288800484|ref|ZP_06405942.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
           F0039]
 gi|288332697|gb|EFC71177.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
           F0039]
          Length = 409

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 99/422 (23%), Positives = 176/422 (41%), Gaps = 96/422 (22%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE--------------- 61
           S  ++ ++    H  + RL SLD+ RG  +AL++L       W E               
Sbjct: 25  SSTEIFNKATAPH--SGRLLSLDLLRGADLALLVLFQPIIYQWVEASEPTPGSFGEMVFG 82

Query: 62  -ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPD-RADAVKKVIFRTLK--LLFW-- 115
            I+H PW G    D +MP F+F+ G+ I  ++ +    +  A K  ++R LK  ++ W  
Sbjct: 83  QITHVPWEGFCFWDIIMPLFMFMSGITIPFSMGKYQQGKVKADKGFLWRLLKRFVVLWVL 142

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
           G++ QG      + L +  D R+I L    LQ IA+ Y++V+L+ ++T     + Q    
Sbjct: 143 GMIAQG------NLLLF--DPRLIHLYSNTLQSIAVGYVMVALLFVYTS---WRTQLA-- 189

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                      ++AAC LV Y+A+                  A +G++            
Sbjct: 190 -----------VVAAC-LVGYVAVF-----------------AIWGQM---------DFT 211

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
              N    IDR+VLG         WR           +G L K    +    +    +LS
Sbjct: 212 IDANICEEIDRQVLG--------HWR-----------DGVLWKGDNWYWDDSYHYTWILS 252

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
           S++ +++  +G   G ++      L +++  V  G AL+     LH    +P+ K +++ 
Sbjct: 253 SLNFVVTVYLGCVAGMILRMKLQDLKKVRILVFSGVALIALSFLLH--PVVPIIKHIWST 310

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWY 414
           S   ++ G   L+ +  Y  VDI       + L + G N+++ Y++     F+   N  +
Sbjct: 311 SMTFLSGGICLLLIALAYYWVDIKGKTKGIMWLRFYGANSLVAYMVGDHVSFSSITNSIF 370

Query: 415 YG 416
           YG
Sbjct: 371 YG 372


>gi|334364999|ref|ZP_08513969.1| putative membrane protein [Alistipes sp. HGB5]
 gi|313158791|gb|EFR58176.1| putative membrane protein [Alistipes sp. HGB5]
          Length = 383

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/386 (23%), Positives = 156/386 (40%), Gaps = 94/386 (24%)

Query: 31  KTQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVM 77
           +++RL SLD  RG  +  ++    LV    G WP          +SH  W+G    D + 
Sbjct: 20  QSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAWDGFAHHDTIF 79

Query: 78  PFFLFIVGVAI--ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P FLFI GV+   ++A +R    ++     K++ R L L+  G++  G F          
Sbjct: 80  PLFLFIAGVSFPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK--------- 130

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +D   +R+  VL RI L++   S+  +   +   K ++               +A  VL 
Sbjct: 131 LDFENLRIASVLGRIGLAW---SIAAVLYLNFGVKTRAA--------------IAVAVLA 173

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y AL                       V          L    N  GYIDR+ L     
Sbjct: 174 GYGAL--------------------SALVAAPDAAGAGPLTFEGNLAGYIDRQFL----- 208

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                              G L        +  F+PEGLLS+V ++++ ++G+  G  + 
Sbjct: 209 ------------------PGKL-------IYGSFDPEGLLSTVPAVVTAMLGMFTGEFVR 243

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
            +     R   W+    A L+    L F+  +P+NK+L++ ++VCV    +  +F+  Y 
Sbjct: 244 RSDIRGGRKTLWMAAAAAALLA-AGLAFSGVLPVNKKLWSSTFVCVVGAYSLGMFALFYY 302

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYV 399
           L+D+   +   L    +G+N++ +Y+
Sbjct: 303 LIDVRGWRRWTLFFRVVGLNSITIYL 328


>gi|423345098|ref|ZP_17322787.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
           CL03T12C32]
 gi|409222884|gb|EKN15821.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
           CL03T12C32]
          Length = 376

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 95/390 (24%), Positives = 163/390 (41%), Gaps = 92/390 (23%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG---DWP-------EISHAPWNGCNLADFVMPF 79
           QRL SLD+ RG  +  ++ ++   H+ G   D P         SH  W G +  D VMP 
Sbjct: 6   QRLESLDVLRGFDLFCLVALEGVLHSLGRAIDAPWYNDFLWGFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    ++++ R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKAMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            R+      LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+ 
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAMLFLHVRLSVQ-----IGT-------------AVALLLA 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSP--FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                 R  A  ++    F G  R    +W         +LSS++  ++ + GV  GH++
Sbjct: 195 -----FRDAAVVENGQIVFAGSYRY---TW---------ILSSLNFGVTVLTGVFAGHIL 237

Query: 313 IHTKGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
              K  + R  +W   +G   ++  L   +   +P+ K+++T S V V+SG   L+    
Sbjct: 238 ---KSGMDRKHKWQWLLGIGAIMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGLF 294

Query: 372 YALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           Y  +D    +     L   GMN+++ Y++A
Sbjct: 295 YYWIDYKGHRKNLTWLKVYGMNSIVAYMLA 324


>gi|182412825|ref|YP_001817891.1| hypothetical protein Oter_1003 [Opitutus terrae PB90-1]
 gi|177840039|gb|ACB74291.1| conserved hypothetical protein [Opitutus terrae PB90-1]
          Length = 411

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 115/428 (26%), Positives = 181/428 (42%), Gaps = 113/428 (26%)

Query: 34  RLASLDIFRG-------------LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           RL S+D  RG             LA+  M L         ++ H  W G    D + P F
Sbjct: 11  RLVSVDALRGFDMFWILGADALVLALGAMSLSPTLRALAGQLEHKDWAGFAFYDLIFPLF 70

Query: 81  LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDV 136
           +FIVGV+   +L  +     RA AVK+++ RTL LL +GI   GG +H  PD        
Sbjct: 71  VFIVGVSTVFSLTSLVAREGRAAAVKRILRRTLLLLAFGIFYNGGLAHQWPD-------- 122

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +RL GVLQRIAL+Y    L+ +F K               R+      + A +LV Y 
Sbjct: 123 --VRLVGVLQRIALAYAAAGLLFVFFKP--------------RVLA---AVTALLLVGYW 163

Query: 197 ALLYGTYVP--DWQF---TIIN------KDS------ADYGKVFN-VTCGVRAKLNPPCN 238
           ALL  T++P  D Q     +++      +D+      A   ++F+  T  V  +  P  N
Sbjct: 164 ALL--TFIPIRDVQLDRAALVSHLPDAPRDAQGFPAEAQVRQLFDQTTARVTGRFEPGLN 221

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
              ++D   L      H+                              ++PEGLLS++ +
Sbjct: 222 LTNHLDYVYLPGARYDHY------------------------------YDPEGLLSTLPA 251

Query: 299 ILSTIIGVHFGHVIIHTK--GH---LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
           I + ++G+  G ++  T   GH   +      V    A  ++G+        P+ K+L+T
Sbjct: 252 IATCLLGIFAGLLLRRTDIGGHDKVVTLALAGVAALAAGWLWGIQF------PIIKKLWT 305

Query: 354 LSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM--------AAEGI 405
            SYV V  G + L+ +A Y ++D+   +    P  WIGMN + +Y++        AA  +
Sbjct: 306 SSYVLVAGGWSLLLLAAFYYVIDVRQWRRWCQPFVWIGMNPITLYLLSTIVGFREAAARL 365

Query: 406 FAGFINGW 413
             G I+ W
Sbjct: 366 VGGDISEW 373


>gi|390946391|ref|YP_006410151.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
 gi|390422960|gb|AFL77466.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
          Length = 366

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 90/386 (23%), Positives = 155/386 (40%), Gaps = 94/386 (24%)

Query: 31  KTQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVM 77
           +++RL SLD  RG  +  ++    LV    G WP          +SH  W+G    D + 
Sbjct: 3   QSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAWDGFAHHDTIF 62

Query: 78  PFFLFIVGVAI--ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P FLFI GV+   ++A +R    ++     K++ R L L+  G++  G F          
Sbjct: 63  PLFLFIAGVSFPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK--------- 113

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +D   +R+  VL RI L++   S+  +   +   K ++               +A  VL 
Sbjct: 114 LDFENLRIASVLGRIGLAW---SIAAVLYLNFGVKTRAA--------------IAVAVLA 156

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y AL                       V          L    N  GYIDR+ L     
Sbjct: 157 GYGAL--------------------SALVAAPDAAGAGPLTFEGNLAGYIDRQFL----- 191

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                              G L        +  F+PEGLLS+V ++++ ++G+  G  + 
Sbjct: 192 ------------------PGKL-------IYGSFDPEGLLSTVPAVVTAMLGMFTGEFVR 226

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  R   W+    A L+    L F+  +P+NK+L++ ++VCV    +  +F+  Y 
Sbjct: 227 RGDIRGGRKTLWMAAAAAALLA-AGLAFSGVLPVNKKLWSSTFVCVVGAYSLGMFALFYY 285

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYV 399
           L+D+   +   L    +G+N++ +Y+
Sbjct: 286 LIDVRGWRRWTLFFRVVGLNSITIYL 311


>gi|223937685|ref|ZP_03629587.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223893657|gb|EEF60116.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 413

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 100/428 (23%), Positives = 159/428 (37%), Gaps = 98/428 (22%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           M+  +A+ T   P   S P  +  +     K  RL SLD +RG  + LM         W 
Sbjct: 1   MNPEEAQATLASPTQESRPARTVPE-----KATRLISLDAYRGFVMLLM--ASEGFNMWR 53

Query: 61  ----------------EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK 104
                           +  H  W GC L D + P F+F+VGVA+  +L     +  +   
Sbjct: 54  MAEQNPNSSFWQFLKYQTEHVDWRGCALWDLIQPSFMFMVGVAMPFSLASRRAKGQSFNT 113

Query: 105 VI----FRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
           ++    +R++ L+F GI L+    H     TY           VL +I L Y  + L+  
Sbjct: 114 MLGHTLWRSIALVFIGIFLRSVGRHQ----TY------FTFEDVLTQIGLGYTFLFLLAW 163

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL--LYGTYVPDWQFTIINKDSAD 218
               VQ                  +  A  +LV Y A   LY   V D+          D
Sbjct: 164 TKLRVQ------------------FTAAMLILVGYWAAFALYPLPVNDF----------D 195

Query: 219 YGKV-----FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
           Y KV     ++   G  A  +   N    +D+  L +                   P E 
Sbjct: 196 YQKVGIPANWHHLTGFAAHWDKNTNLAAAVDQWFLNL------------------FPREH 237

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
           P   +   +    F P        S+ + I G+  G  +       ++++  V  G A L
Sbjct: 238 PFVFNGGGYLTLSFVP--------SLATMIFGLLAGQFMREQSTQSSKVRLLVGAGIACL 289

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMN 393
             G  L  T   P  K+++T S+V  ++G   ++ +  Y ++D    K    PL  +GMN
Sbjct: 290 ALGAVLDMTGICPSVKRIWTPSWVIFSTGWTCILLATFYGIIDWQGYKRWAFPLIVVGMN 349

Query: 394 AMLVYVMA 401
           ++ +YVMA
Sbjct: 350 SIAMYVMA 357


>gi|256840847|ref|ZP_05546355.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|298376669|ref|ZP_06986624.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
 gi|256738119|gb|EEU51445.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|298266547|gb|EFI08205.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
          Length = 378

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 111/434 (25%), Positives = 185/434 (42%), Gaps = 97/434 (22%)

Query: 31  KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
           K  RL SLD+ RG     L V  M+L  + HA  D P         SH  W G +  D V
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63

Query: 77  MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
           MP FLF+ GV++  +L R   +PD+    +++  R L L  +G++ QG   +  PD    
Sbjct: 64  MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
               R+      LQ IA+ YL+ SL+ +    V+ + Q +G             +AA +L
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG-------------IAASLL 158

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           +++    +GT     +F  +     +YG              P  N   +IDR VLG   
Sbjct: 159 LIF----WGTM----EFITV----GNYGG---------GSYTPDSNLAEWIDRTVLG--- 194

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                   R +        +G   ++        +    +LSS++  ++ + G+  G+ I
Sbjct: 195 --------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-I 237

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y
Sbjct: 238 LKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFY 297

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFINGWYYGDPHNTLV- 423
             +D    +     L   GMN++L Y++          E +F G     Y G+ +  L+ 
Sbjct: 298 YWIDYKGHRKYTTWLKVYGMNSILAYMLTNVVSFRCIGESLFFG--TQQYLGNYYPVLIA 355

Query: 424 -CFLFIISYILHSF 436
            C + +I  +L++F
Sbjct: 356 MCNVSVIYVLLYAF 369


>gi|428299602|ref|YP_007137908.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
 gi|428236146|gb|AFZ01936.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
          Length = 104

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 49/81 (60%), Gaps = 3/81 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            + RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V PFFLFIVG+A
Sbjct: 10  NSNRLVSLDVFRGIAIASMILVNNPGSWDSIYPPLEHAEWHGCTPTDLVFPFFLFIVGMA 69

Query: 88  IALALKRIPDRADAVKKVIFR 108
           +  +  +         +V +R
Sbjct: 70  MPFSFAKYTKENRPTARVYWR 90


>gi|408821750|ref|ZP_11206640.1| hypothetical protein PgenN_01470 [Pseudomonas geniculata N1]
          Length = 355

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 75/139 (53%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++      +  R    + V+ R L++L          +     + + +D    R+ GV
Sbjct: 66  AFSVAPRALDVSARPALARGVLERALRILL-------AGALLHLLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|301309931|ref|ZP_07215870.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423340410|ref|ZP_17318149.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831505|gb|EFK62136.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409227845|gb|EKN20741.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
           CL09T03C24]
          Length = 378

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 109/434 (25%), Positives = 183/434 (42%), Gaps = 97/434 (22%)

Query: 31  KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
           K  RL SLD+ RG     L V  M+L  + HA  D P         SH  W G +  D V
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63

Query: 77  MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
           MP FLF+ GV++  +L R   +PD+    +++  R L L  +G++ QG   +  PD    
Sbjct: 64  MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
               R+      LQ IA+ YL+ SL+ +    V+ + Q +G             +AA +L
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG-------------IAASLL 158

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           +++   +        +F  +     +YG              P  N   +IDR VLG   
Sbjct: 159 LIFWGAM--------EFITV----GNYGG---------GSYTPDSNLAEWIDRTVLG--- 194

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                   R +        +G   ++        +    +LSS++  ++ + G+  G+ I
Sbjct: 195 --------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-I 237

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y
Sbjct: 238 LKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFY 297

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFINGWYYGDPHNTLV- 423
             +D    +     L   GMN++L Y++          E +F G     Y G+ +  L+ 
Sbjct: 298 YWIDYKGHRKYTTWLKVYGMNSILAYMLTNVVSFRCIGESLFFG--TQQYLGNYYPVLIA 355

Query: 424 -CFLFIISYILHSF 436
            C + +I  +L++F
Sbjct: 356 MCNVSVIYVLLYAF 369


>gi|224088693|ref|XP_002335085.1| predicted protein [Populus trichocarpa]
 gi|222832788|gb|EEE71265.1| predicted protein [Populus trichocarpa]
          Length = 139

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 30/31 (96%), Positives = 30/31 (96%)

Query: 392 MNAMLVYVMAAEGIFAGFINGWYYGDPHNTL 422
           MNAMLVYVMAAEGIFAGFINGWYY DPHNTL
Sbjct: 1   MNAMLVYVMAAEGIFAGFINGWYYNDPHNTL 31


>gi|344208862|ref|YP_004794003.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
 gi|343780224|gb|AEM52777.1| Protein of unknown function DUF2261, transmembrane
           [Stenotrophomonas maltophilia JV3]
          Length = 360

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 78/147 (53%), Gaps = 20/147 (13%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           K  +  +RLAS+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF
Sbjct: 6   KGSMPPRRLASIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLF 64

Query: 83  IVGVAIALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           +VGV++A ++   P   DA  +      V+ R L++L          +     + + +  
Sbjct: 65  LVGVSMAFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHT 115

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTK 163
              R+ GVLQRIA+   LV ++ ++ +
Sbjct: 116 HHFRIWGVLQRIAVCAALVGVLAVYAR 142



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/115 (36%), Positives = 63/115 (54%), Gaps = 10/115 (8%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ ++ ST++G+  G ++    G  A L     +G A  + GL L     +PL
Sbjct: 198 DPEGLLSTLGALASTVLGLIAGGLL--RNGRSAALAG---LGVATAVLGLLL--ATVLPL 250

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           NKQL+T SYV  T G AAL     + L+D     +P L     G+NA+  Y+ A+
Sbjct: 251 NKQLWTPSYVLWTGGLAALALWLGHVLID--QKGWPALGRR-FGVNAITAYLGAS 302


>gi|255013329|ref|ZP_05285455.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
 gi|410103820|ref|ZP_11298741.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
 gi|409236549|gb|EKN29356.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
          Length = 378

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 104/433 (24%), Positives = 183/433 (42%), Gaps = 95/433 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHA------GGDWP-------EISHAPWNGCNLADFVM 77
           K  RL SLD+ RG  +  +++++          D P         SH  W G +  D VM
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDLVM 64

Query: 78  PFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYG 133
           P FLF+ GV++  +L R   +PD+    +++  R + L  +G++ QG   +  PD     
Sbjct: 65  PLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQGNLLALDPD----- 119

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
              R+      LQ IA+ YL+ SL+ +    V+ + Q +G             +AA +L+
Sbjct: 120 ---RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG-------------IAASLLL 159

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           ++    +GT     +F  +     +YG              P  N   +IDR VL     
Sbjct: 160 IF----WGTM----EFITV----GNYGG---------GSYTPDSNLAEWIDRTVL----- 193

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                 RR +        +G   ++        +    +LSS++  ++ + G+  G+ I+
Sbjct: 194 ------RRFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-IL 238

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y 
Sbjct: 239 KNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFYY 298

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFINGWYYGDPHNTLV-- 423
            +D    +     L   GMN++L Y++          E +F G     Y G+ +  L+  
Sbjct: 299 WIDYKGHRKYTTWLKVYGMNSILAYMLTNVVSFRCIGESLFFG--TQQYLGNYYPVLIAM 356

Query: 424 CFLFIISYILHSF 436
           C + +I  +L++F
Sbjct: 357 CNVSVIYVLLYAF 369


>gi|262381452|ref|ZP_06074590.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296629|gb|EEY84559.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 378

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 103/433 (23%), Positives = 180/433 (41%), Gaps = 95/433 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHA------GGDWP-------EISHAPWNGCNLADFVM 77
           K  RL SLD+ RG  +  +++++          D P         SH  W G +  D VM
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDLVM 64

Query: 78  PFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYG 133
           P FLF+ GV++  +L R   +PD+    +++  R + L  +G++ QG   +  PD     
Sbjct: 65  PLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQGNLLALDPD----- 119

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
              R+      LQ IA+ YL+ SL+ +    V+ + Q +G            + AA +L+
Sbjct: 120 ---RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG------------IAAALLLI 160

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            + A+ +         T+ N     Y               P  N   +IDR VLG    
Sbjct: 161 FWGAMEF--------ITVGNYGGGSY--------------TPDSNLAEWIDRTVLG---- 194

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                  R +        +G   ++        +    +LSS++  ++ + G+  G+ I+
Sbjct: 195 -------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-IL 238

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y 
Sbjct: 239 KNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFYY 298

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA--------AEGIFAGFINGWYYGDPHNTLV-- 423
            +D    +     L   GMN++L Y++          E +F G     Y G+ +  L+  
Sbjct: 299 WIDYKGHRKYTTWLKVYGMNSILAYMLTNVVSFRCIGESLFFG--TQQYLGNYYPVLIAM 356

Query: 424 CFLFIISYILHSF 436
           C + +I  +L++F
Sbjct: 357 CNVSVIYVLLYAF 369


>gi|127512051|ref|YP_001093248.1| hypothetical protein Shew_1118 [Shewanella loihica PV-4]
 gi|126637346|gb|ABO22989.1| conserved hypothetical protein [Shewanella loihica PV-4]
          Length = 387

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 112/469 (23%), Positives = 191/469 (40%), Gaps = 123/469 (26%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW----PEI 62
           EP  +D   K   K  RL SLD  RG           L  AL++     G  W     ++
Sbjct: 2   EPK-TDTHPKPAAKP-RLMSLDALRGFDMFWILGGEALFAALLVWTGWQG--WRIADAQM 57

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGI 117
            H+ W+G    D + P F+F+ GVA+ L+ KR+     P+R    +  I R + LL +G+
Sbjct: 58  HHSQWHGFTFYDLIFPLFIFLSGVALGLSPKRLDSLPWPERLPLYRHAIKRLMLLLLFGV 117

Query: 118 LLQGGFS----HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
           L   G+      A DE         +R   VL RIA ++   +L+ ++  D++ +     
Sbjct: 118 LYNHGWGTGMPMAADE---------VRYASVLGRIAFAWFFAALL-VWHTDLKTQI---- 163

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                       L+A  +L+ Y A+     VP                      GV    
Sbjct: 164 ------------LVAIGILLGYAAMQLWLPVPG---------------------GVAGDF 190

Query: 234 NPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
           +   +   Y+D K L GI++        + +A                       +PEG+
Sbjct: 191 SISGSINAYVDGKFLPGISY--------QGRAT----------------------DPEGI 220

Query: 293 LSSVSSILSTIIGVHFGHVIIHT-------KGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           LS++ +I++ + GV  G  I+ +       KG  A++   +  G   L  G  ++    I
Sbjct: 221 LSTLPAIVNALAGVFTGRFIVKSHPKWGGVKGEWAKVGILLLAGLLSLGLGWLMN--PYI 278

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA---- 401
           P+NK L+T S+V VT G + ++ +  YALVD+  L         IG NA+++Y+ +    
Sbjct: 279 PVNKDLWTSSFVMVTLGWSLILLAIFYALVDVLKLHRLAFGFVVIGCNAIIIYLASSLVK 338

Query: 402 ----AEGIFAGFINGWYYGDPHNTLVCFLFIISYILHSFLWELRKFLYV 446
               A  +F G I             C + ++ ++L  ++++   F+ V
Sbjct: 339 WDYLASSLFGGAIQALPIEMQPLASACAMLLVQWLLLYWMYKRGIFIKV 387


>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 1105

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 71/131 (54%), Gaps = 5/131 (3%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMILV+   AG D + ++ H PW G   AD V P FLF VG A++
Sbjct: 737 ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 796

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQR 147
            AL R       ++++  R+  +   G L+       H  D     + +   R+ GVLQR
Sbjct: 797 FALDRGQPLGAFLRRIGKRSALIFLLGFLMYWFPFVHHGADGSWSFIAIDQTRVPGVLQR 856

Query: 148 IALSYLLVSLV 158
           IAL Y L +L+
Sbjct: 857 IALCYALGALL 867


>gi|291514624|emb|CBK63834.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 352

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 78/344 (22%), Positives = 140/344 (40%), Gaps = 80/344 (23%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWG 116
           ++ HA WNG  + D + P FLFI GVA   +L +   R    K+++ R  +    L   G
Sbjct: 29  QMQHAAWNGLTIQDTIFPLFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLG 88

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           ++  G F          ++   +R+  VL RI L+++  +L+ ++   V+ +        
Sbjct: 89  MVYNGLFE---------LNFSSLRIASVLGRIGLAWMFAALLCVYC-SVRTRIA------ 132

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                     +A  +L+ Y  LL     PD          AD              L+  
Sbjct: 133 ----------VAGIILIGYSLLLGLVVAPD------APVGAD-------------PLSVE 163

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
               G+IDR+ L  + +Y                                F+PEG+LS++
Sbjct: 164 GCLAGWIDRQYLPGHILY------------------------------GAFDPEGILSTL 193

Query: 297 SSILSTIIGVHFGHVIIH-TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLS 355
            +++S + G+  G  ++   +G     K +     AL I    L +   +P+NK L++ S
Sbjct: 194 PAVVSALFGMFTGEFLLDGRRGLSGSWKAFYMAVAALAITTAGLCWNLIMPVNKNLWSSS 253

Query: 356 YVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           + CV SG +  + +  Y L+D+   K        IG+N++ +Y+
Sbjct: 254 FTCVVSGYSLGMTALFYYLIDVCGYKRWTFVFRVIGLNSITIYM 297


>gi|224027055|ref|ZP_03645421.1| hypothetical protein BACCOPRO_03816, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224020291|gb|EEF78289.1| hypothetical protein BACCOPRO_03816 [Bacteroides coprophilus DSM
           18228]
          Length = 373

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/399 (22%), Positives = 157/399 (39%), Gaps = 109/399 (27%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP----------------EISHAPWN 68
           + K  +K+QRL SLD+ RG     M  +   G  +                 ++ H PW+
Sbjct: 5   RMKKEVKSQRLQSLDVLRGFD---MFFIMGGGALFAGLATCCPIPFFQAIARQMEHVPWH 61

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++        A   +KVI R L L+  G +  G   
Sbjct: 62  GVAFEDMIFPLFLFIAGISFPYSLEKQKACGMSSAAIYRKVIRRGLVLVLLGCIYNG--- 118

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
                     D   +R   VL RI LS++  +L+ +   +V+ + + +G  ++  L+  +
Sbjct: 119 ------LLDFDFAHLRYASVLGRIGLSWMFAALLFL---NVRTRVR-MGVVAL--LFIGY 166

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           W + AC                                F    G     +   + VGY+D
Sbjct: 167 WALLAC--------------------------------FPAPDGNGDPFSMEGSLVGYVD 194

Query: 245 RKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           R  L G  ++  H                               +PEGLL  + ++ +  
Sbjct: 195 RMFLPGQLYLGIH-------------------------------DPEGLLGIIPAVGTAS 223

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +G+  G  I   +  L  L++ V +   G  L++ G         P+NK L+T S+ C+ 
Sbjct: 224 LGMLTGEWI--KREGLPELRKVVLLACAGVMLVVVGWIWDLV--FPINKNLWTSSFACLV 279

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            G + L+F+  Y L+D+ +     L    IGMN++ +Y+
Sbjct: 280 GGISMLLFALFYYLIDVRHCHRWTLFFRVIGMNSITIYL 318


>gi|395761203|ref|ZP_10441872.1| hypothetical protein JPAM2_05565 [Janthinobacterium lividum PAMC
           25724]
          Length = 373

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 73/137 (53%), Gaps = 9/137 (6%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           + +QR  +LD+ RGL VALMI+V+  G DW  +     HA W+G  L D V P FLF+VG
Sbjct: 3   IASQRYLALDVLRGLTVALMIVVNTPG-DWGSVYAPFLHAEWHGFTLTDLVFPSFLFVVG 61

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRL 141
            A+A  L +  + A    + K+  R+  +   G LL    F    D   +    +   R+
Sbjct: 62  NALAFVLGKYENLAHGAVLAKLCKRSALIFLLGFLLYWFPFFKIDDAGQFAWSSLSQTRI 121

Query: 142 CGVLQRIALSYLLVSLV 158
            GVLQRIA+ YL  +L+
Sbjct: 122 PGVLQRIAVCYLAAALI 138


>gi|294139796|ref|YP_003555774.1| hypothetical protein SVI_1025 [Shewanella violacea DSS12]
 gi|293326265|dbj|BAJ00996.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 378

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 77/136 (56%), Gaps = 12/136 (8%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
           P +PEG+LS++ ++ + + GV  GH I+  H +G  A++   +  G  +L  G  ++   
Sbjct: 210 PLDPEGILSTIPAVANAMAGVFVGHFIVKEHKQGEWAKVVCLLLSGAFILALGWWVNLI- 268

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA-- 401
            IP+NK L+T S+V VT+G + ++ +  YALVD+   +    P   IG NA+++Y+ +  
Sbjct: 269 -IPVNKDLWTSSFVLVTTGWSIILLAIFYALVDVLKWQKIAFPFVVIGCNAIIIYLASSL 327

Query: 402 ------AEGIFAGFIN 411
                 A+ +F G +N
Sbjct: 328 VNWKYTAQSLFGGVVN 343



 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 66/145 (45%), Gaps = 25/145 (17%)

Query: 33  QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPF 79
           +RL SLD  RG           L   L+      G  W   ++ H+ W+G    D + P 
Sbjct: 11  RRLMSLDALRGFDMFWILGGEALFAGLLAWSSWQGWQWADAQMHHSQWHGFTFYDLIFPL 70

Query: 80  FLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYG 133
           F+F+ GVA+ L+ KR     I  R    K  + R   LLF+G+L   G+ + AP      
Sbjct: 71  FIFLSGVALGLSPKRLDKLPIAQRMPLYKHSVKRLFLLLFFGVLYNHGWGTGAP------ 124

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLV 158
           V +  +R   VL RIA ++   +++
Sbjct: 125 VAIDEVRYASVLGRIAFAWFFAAML 149


>gi|403236334|ref|ZP_10914920.1| hypothetical protein B1040_11244 [Bacillus sp. 10403023]
          Length = 373

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 92/387 (23%), Positives = 157/387 (40%), Gaps = 93/387 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAG-GDWPEISHAPWNGCNLADFVMPFFLF 82
            +  ++    R+ SLD+ RG+ V   + +     G +   +HA W G  L DF++P F+ 
Sbjct: 1   MKNTNNSARSRIHSLDMARGIIVVFSVFLSSLPYGSYDFATHASWYGLTLVDFILPCFIT 60

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           + GV +A+A +    +    K+ I RT+KL+ +G+L           + +  D+  +R  
Sbjct: 61  VFGVGMAIAYQ----KGVKWKRFISRTIKLILFGLLFN-------IIVAWSFDLSTLRFT 109

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQ  AL  +   L+  F K  +    S+    IF +Y          +++Y+      
Sbjct: 110 GVLQMYALLGIGTVLITRFIK--RPITVSLVGLLIFSIY--------GAILLYMGQTCEG 159

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            +P                             P CN    +D  V G +H+Y        
Sbjct: 160 SLPQ----------------------------PGCNPSWLVDPVVFGESHIY-------- 183

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                             S     F+PEG+ +  S++ + ++G   G +II  K   A  
Sbjct: 184 ------------------SLGERGFDPEGIPAIFSALGNVLLGFAAGRIIILYKEKGA-- 223

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKY 382
             W  +   +L+  L       +P  K+L+T S+  VT+GA +L+ + ++ + D    KY
Sbjct: 224 -GWRLLLHGVLLLALAFLVEAFLPFGKRLWTPSFGLVTAGATSLLLALLHIIFD---RKY 279

Query: 383 --PFL-----PLAWI----GMNAMLVY 398
             PF+      L WI    G NA L+Y
Sbjct: 280 TSPFVQPVSNSLIWILDSFGRNAFLIY 306


>gi|297567057|ref|YP_003686029.1| hypothetical protein [Meiothermus silvanus DSM 9946]
 gi|296851506|gb|ADH64521.1| Protein of unknown function DUF2261, transmembrane [Meiothermus
           silvanus DSM 9946]
          Length = 377

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/153 (35%), Positives = 80/153 (52%), Gaps = 27/153 (17%)

Query: 18  EPDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPE---ISHAPWN-G 69
            P   DQQ ++   ++    RL SLD+FRGL + LM+LV++   D      ++HAPW  G
Sbjct: 7   NPPTQDQQTETPFPSRKTAMRLGSLDVFRGLTILLMLLVNNVALDANTPYLLTHAPWKGG 66

Query: 70  CNLADFVMPFFLFIVGVAIALAL-----KRIPD-RADAVKKVIFRTLKLLFWGILLQGGF 123
             LAD V P+FL  VGVAI  A      K +P  R D   K+I R++ L   G+L+    
Sbjct: 67  VYLADLVFPWFLLAVGVAIPFAAASFRKKNLPSWRYDL--KIIQRSIVLFGLGLLIVSSI 124

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
           +  P    + +D        VLQ IA++YL+ +
Sbjct: 125 ARRP---VFALD--------VLQLIAMAYLVAA 146


>gi|217072276|gb|ACJ84498.1| unknown [Medicago truncatula]
          Length = 132

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/100 (42%), Positives = 59/100 (59%), Gaps = 2/100 (2%)

Query: 349 KQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAG 408
           KQ Y    V +T+GA+ LV +AIY +VDI  L+ P + L W+GMNA++ Y +AA  IF  
Sbjct: 14  KQWYQ-RRVVITAGASGLVLTAIYYIVDIKQLRKPTVLLQWMGMNALIGYALAACDIFPA 72

Query: 409 FINGWYYGDPHNTLV-CFLFIISYILHSFLWELRKFLYVQ 447
            I G+Y+  P N LV     +I  ILHS  W    F+ ++
Sbjct: 73  VIQGFYWRSPENNLVDASEALIQNILHSEKWGTLAFVIIE 112


>gi|395804714|ref|ZP_10483949.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
 gi|395433102|gb|EJF99060.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
          Length = 380

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/411 (22%), Positives = 159/411 (38%), Gaps = 118/411 (28%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLAD 74
           S+    RL SLD  RG  +  ++  +H           P       ++ HA WNG    D
Sbjct: 2   SNPTNGRLISLDALRGFVMFWIMSGEHIIHALAKAAPIPIFLWMSSQLHHAEWNGITFYD 61

Query: 75  FVMPFFLFIVGVAIALALKR--------IPDRADAVKK------VIFRTLKLLFWGILLQ 120
            + P FLF+ GV++  + ++         P    + +K      ++ RT+ L+  G ++ 
Sbjct: 62  MIFPVFLFVAGVSMPYSFEKKMNLAGVSTPQELPSKEKRKIYLSMLRRTIILVVLGFVVN 121

Query: 121 G-----GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           G     GF H              R   VL RI +++    ++ +   +   K Q +   
Sbjct: 122 GLLRFDGFDHT-------------RFASVLGRIGIAWFFAGMIYL---NFDFKKQFI--- 162

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
                    W +   +L+ Y A +    VP++   ++ K+ +                  
Sbjct: 163 ---------WFLG--ILMGYYAAMKWIPVPEFGAGVLTKEGS------------------ 193

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLS 294
                GYIDR  L                               P   H+  ++PEG+ S
Sbjct: 194 ---LEGYIDRLFL-------------------------------PGRLHSTVYDPEGIFS 219

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF-ALLIFGLTLHFTNAIPLNKQLYT 353
           ++ +I + ++GV F    +  K   +   + + M   A+L+    L +    P+NK L+T
Sbjct: 220 TLPAISTALLGV-FTGTFLKAKNQFSINAKLILMALTAVLLIIAGLIWDIDFPINKHLWT 278

Query: 354 LSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEG 404
            S+VC   G + L F   Y ++D+        PL  IG N++L+Y+ AAEG
Sbjct: 279 SSFVCFVGGFSILFFIFFYLIIDLSGFHKWAFPLILIGSNSILIYI-AAEG 328


>gi|410099161|ref|ZP_11294133.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409219183|gb|EKN12146.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 377

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 110/445 (24%), Positives = 176/445 (39%), Gaps = 102/445 (22%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWPE-----ISHAPWNGCN 71
            EK+  K  RL SLD  RG  +  ++        L   A   W        +H  W G +
Sbjct: 1   MEKTTYK--RLESLDALRGFDLFFLVALGPLAHSLARAADVGWLNDCMWAFNHVQWEGFS 58

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK---LLFW--GILLQGGF-SH 125
             D +MP FLF+ G ++  AL R    +D  KK +FR L    LL W  G++ QG     
Sbjct: 59  PWDLIMPLFLFMSGASMPFALSRFKGVSD--KKTLFRRLGKRILLLWIFGMMCQGNLLGF 116

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
            PD        R+      LQ IA  YL+ +++ ++T     +   +G            
Sbjct: 117 DPD--------RIYLYSNTLQSIAAGYLITAVLFLYT----SRRTQIG------------ 152

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            +A  +L+VY A +        QF  I      YG              P  N   +IDR
Sbjct: 153 -VAVALLLVYWAAM--------QFITI----GSYGG---------GNYTPEGNLAEWIDR 190

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            VLG         +R             P       W H  +    +LS+++  ++ + G
Sbjct: 191 TVLG--------RFRDGAKVVDGEVVFAP-------WYHYTW----ILSTLNFGVTVLTG 231

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALL----IFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           +  G++       + +LK +   G A++    ++GL +      P+ K ++T S V V+S
Sbjct: 232 LFAGYIAKDKTSDIHKLKWYFGAGAAMVAAGWLWGLQM------PVIKTIWTSSMVLVSS 285

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNT 421
           G   L+    Y L+D    K     L   GMN+++ Y++A    F+      +YG     
Sbjct: 286 GYCFLLMGLFYYLIDYKGYKKNITWLKVYGMNSIVAYMLANVINFSCLGQSLFYGMEQYL 345

Query: 422 LVCFLFIIS----YILHSFLWELRK 442
              + F+IS     +++  LW L K
Sbjct: 346 GNYYSFLISLSNVLVIYVILWLLYK 370


>gi|357628855|gb|EHJ78009.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
           plexippus]
          Length = 275

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 100/204 (49%), Gaps = 38/204 (18%)

Query: 2   SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
           +E+ +ET     ++ +E  V     +S     RL SLDIFRG+A+ALM           +
Sbjct: 59  NELGSETR----VLTTEASVPRSPTRS-----RLRSLDIFRGIAIALM--------QANK 101

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALK-----RIPDRADAVKKVIFRTLKLLFWG 116
            SHA WNG  +AD V P+F F +G A+ L+L       +P R +A+ +V  R+L L   G
Sbjct: 102 FSHAVWNGLTVADLVFPWFAFTMGEAMVLSLNARLRTSLP-RVNALGQVARRSLLLSLIG 160

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRF 175
           I L           +   +   +R  GVLQR+A  YL+V  +E  F +  Q+     GR 
Sbjct: 161 ICLG----------SVNTNWSYVRFPGVLQRLAAMYLIVGSLECAFMRTSQNIIP--GR- 207

Query: 176 SIFRLYCWHWLM-AACVLVVYLAL 198
           S+FR     W    A VL+V + L
Sbjct: 208 SLFRDIAAGWQQWLATVLMVAIQL 231


>gi|430745463|ref|YP_007204592.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
           18658]
 gi|430017183|gb|AGA28897.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
           18658]
          Length = 391

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 96/402 (23%), Positives = 163/402 (40%), Gaps = 92/402 (22%)

Query: 14  LIISEPDVSDQQEKSHLK-TQRLASLDIFRG-----------LAVALMILVDHAGGD--W 59
           L+ +E  + D    +  K ++RL S+D  RG           LA AL    D + G    
Sbjct: 7   LLTAETPLMDSDSIAAPKPSERLLSIDALRGFDMLWIIGGERLAKALARWSDSSAGKVVQ 66

Query: 60  PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI--PDRADAVKKVIFRTLKLLFWGI 117
            ++ HA W+G  L D + P FLF+VG  +  +L ++    R    +++  RTL L   G+
Sbjct: 67  EQLEHAEWHGFRLNDLIFPLFLFLVGTVLPFSLGKLQGQGRGAEYRRIARRTLLLFALGL 126

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
           L  G             D   +R+ GVLQRIAL Y + +L+ ++        + V     
Sbjct: 127 LCNG---------VLKFDWANLRVAGVLQRIALCYGIAALISLWF-----SRRGVA---- 168

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
                   ++   +LV Y AL+     P          + DY              +   
Sbjct: 169 --------ILLVLILVGYWALMANVGAP-------GHTAGDY--------------SISG 199

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
           N  G+IDR+ L    M  +                           +   + EGLL+++ 
Sbjct: 200 NLAGWIDRQFLPGKIMKSY---------------------------YGYGDNEGLLTTIP 232

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
           ++ + ++GV  GH +   +G   ++   V  G   LI G+   +    P+NK L+T  +V
Sbjct: 233 AVGTALLGVLAGHWLRSQRGPWQKVAGLVAAGVLSLIVGVA--WGERFPINKILWTSPFV 290

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            V  G + L+ +  YA++D+   +        IG NA+ ++V
Sbjct: 291 LVAGGLSLLLLALFYAVIDVLRFRRWAFFFVVIGANAITIFV 332


>gi|406980095|gb|EKE01754.1| hypothetical protein ACD_21C00059G0003 [uncultured bacterium]
          Length = 295

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 116/265 (43%), Gaps = 70/265 (26%)

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           IR+ GVLQRIA+ Y + SL+ +++K                   W       +LV Y  L
Sbjct: 28  IRIPGVLQRIAVCYFIASLIVLYSK-----------------VAWQIAWTVLLLVGYWVL 70

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           +    VP +   I                     L P  N   Y+D K+L      H   
Sbjct: 71  IKFVPVPGFGAGI---------------------LEPTGNLAWYLDVKLL------HGHT 103

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
           W        ++P  G             F+PEG+LS++ +I + + GV  G ++   K  
Sbjct: 104 WV-------NAPAPG-------------FDPEGILSTLPAIATMLFGVLTGQLL---KSS 140

Query: 319 LARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
             +   W+ + G AL+  GL +  ++ +P+NK L+T SY   TSG A++VF   Y L+D+
Sbjct: 141 FTQKTVWMLIFGGALIFLGLVM--SHWLPINKNLWTSSYAVFTSGMASVVFGCCYWLIDV 198

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMAA 402
              +  F PL   G++A+ ++V++ 
Sbjct: 199 KKHQKWFKPLQIYGLSALTIFVISG 223


>gi|423722057|ref|ZP_17696233.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
           CL09T00C40]
 gi|409242759|gb|EKN35519.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
           CL09T00C40]
          Length = 376

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/388 (23%), Positives = 159/388 (40%), Gaps = 88/388 (22%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG---DWPE-------ISHAPWNGCNLADFVMPF 79
           QRL SLD+ RG  +  ++ ++   H  G   D P         SH  W G +  D VMP 
Sbjct: 6   QRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    ++++ R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            R+      LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+ 
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAMLFLHVRLSVQ-----IGT-------------AVALLLA 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R  A  ++                  +    +LSS++  ++ + GV  GH++  
Sbjct: 195 -----FRDAAVVENGQIV----------FAESYRYTWILSSLNFGVTVLTGVFAGHIL-- 237

Query: 315 TKGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
            K  + R  +W   +G  +++  L   +   +P+ K+++T S V V+SG   L+    Y 
Sbjct: 238 -KSGMDRKHKWQWLLGIGVIMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGLFYY 296

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
            +D    +     L   GMN+++ Y++A
Sbjct: 297 WIDYKGHRKNLTWLKVYGMNSIVAYMLA 324


>gi|332666399|ref|YP_004449187.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332335213|gb|AEE52314.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 369

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 97/402 (24%), Positives = 157/402 (39%), Gaps = 105/402 (26%)

Query: 33  QRLASLDIFRGL---------AVALMILVDHAGGDWP----EISHAPWNGCNLADFVMPF 79
           QRL SLD  RG          AV   +        W     ++SH  W+G  L D + P 
Sbjct: 8   QRLYSLDALRGFDMFWIMGAEAVVHSLATATGSSVWEAAAHQLSHPDWHGFRLYDLIFPL 67

Query: 80  FLFIVGVAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FLF+ GVA   ++ R  +     +    +VI R L L+  GI+   G    P        
Sbjct: 68  FLFLAGVATPYSVGRDLENGKPKQQLLLRVIRRGLVLVLLGIIYNNGLVLKP-------- 119

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
           +  IR   VL RI        L  +F   +    + +G+   F           C L++ 
Sbjct: 120 LAEIRFPSVLGRI-------GLAYMFANIIYLYTKQLGQIIWF-----------CALLIG 161

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV-RAKLNPPCNAVGYIDRKVLGINHMY 254
             LL                       FN   G     L+   N   Y+DR ++   H+Y
Sbjct: 162 YWLLLR---------------------FNAAPGFPMGDLSMEGNFASYLDRLIIP-GHLY 199

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                        H   +PEGL S + +I + ++G++ G+++ H
Sbjct: 200 LD--------------------------IH---DPEGLTSCIPAIGTALLGIYAGNLLKH 230

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             G +   K+ + +    ++  +     N + P+NK L+T S+V  T G + L+ SA Y 
Sbjct: 231 --GTMTPPKKALILAIMGVVALVLAQLWNLVFPINKNLWTSSFVLQTGGCSLLLLSAFYY 288

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYY 415
           ++D+   +      A IGMN++L+Y+        GFI+ WY+
Sbjct: 289 VIDVLGYRRWAFFFAVIGMNSILIYLS------DGFID-WYF 323


>gi|320333679|ref|YP_004170390.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
 gi|319754968|gb|ADV66725.1| hypothetical protein Deima_1072 [Deinococcus maricopensis DSM
           21211]
          Length = 376

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 73/215 (33%), Positives = 105/215 (48%), Gaps = 40/215 (18%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDW---PEISHAPW-NGCNLADFVMPFFLFIVGV 86
           +  RLA+LD +RGL V LM+LV++   DW    E+ HAPW  G  LAD V P+FLF  G 
Sbjct: 24  RGARLAALDAWRGLTVLLMLLVNNVALDWRTPKELMHAPWGGGATLADLVFPWFLFCAGT 83

Query: 87  AIALAL---KRIPDRADA-VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           A+  +L   +R   R  A V+K++ RT+ L   G++L    +H    LT+G+        
Sbjct: 84  ALPFSLASARRAGVRGWALVRKLLTRTVLLYLVGVVLVSAVAH---RLTFGL-------- 132

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQ IAL+ LL                 VG   +         +AA +LV Y A+L  T
Sbjct: 133 GVLQLIALASLL---------GAAGAQLRVGARMV---------LAAALLVGYAAVLLLT 174

Query: 203 YVPDWQFTII--NKDSADY-GKVFNVTCGVRAKLN 234
            VP     ++   +++  Y  + F    GVR  L+
Sbjct: 175 PVPGVGAGVLEETRNAVQYLNQTFLAPLGVRGLLS 209


>gi|359686399|ref|ZP_09256400.1| hypothetical protein Lsan2_17893 [Leptospira santarosai str.
           2000030832]
          Length = 329

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 146/349 (41%), Gaps = 95/349 (27%)

Query: 75  FVMPFFLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDEL 130
           F +PFFLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A    
Sbjct: 9   FGVPFFLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE--- 65

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                   +R+ GVLQRI   Y +V SL  +F        + +  FS+  L    W++  
Sbjct: 66  --------LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSVPILLIHTWILTQ 111

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
             L            P  +  +  +   D G                     +IDR + G
Sbjct: 112 IAL------------PG-ESVVSLEQGKDIG--------------------AWIDRTIFG 138

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+     WR SK                       ++PEG LS V+S+++T+ GV  G
Sbjct: 139 EKHL-----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCG 171

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL--- 366
            ++        R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L   
Sbjct: 172 FIL------FLRERKNKILGLGILFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLSFLSIW 225

Query: 367 VFSAIYALV--DIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGW 413
            F  + +L+    WNLK  F P    G NA+LV+V    GI A  +N W
Sbjct: 226 FFEYLSSLIISKGWNLKILFQPFLVFGKNAVLVFV--GSGILARTLNLW 272


>gi|255036257|ref|YP_003086878.1| hypothetical protein Dfer_2495 [Dyadobacter fermentans DSM 18053]
 gi|254949013|gb|ACT93713.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 379

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 88/355 (24%), Positives = 139/355 (39%), Gaps = 61/355 (17%)

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
           SH PW GC+L D + P F F+VGVA+  ++     +  +V  +   T++     ILL   
Sbjct: 32  SHVPWVGCSLHDLIQPSFSFLVGVALPYSMASRASKDQSVATMWAHTIRRSLILILLGIF 91

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
                 E T             L +I L Y ++  + + ++  Q +D  V          
Sbjct: 92  LRSMHSEQT------NFTFEDTLTQIGLGYPILFALGLASEKTQ-RDALV---------- 134

Query: 183 WHWLMAACVLVVYLALLYGTYVP----DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
             W     +L+ Y  +     +P    DW  T    D     K      G  A  N   N
Sbjct: 135 --WGALGIILIGYAGVFAAYPLPGPGFDWSQTGTTADWEHNLK------GFAAHWNKNTN 186

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEG--LLSSV 296
           A    DR  L +                ++ PFE              F   G   LS +
Sbjct: 187 AAWAFDRWFLNL--------------FPREKPFE--------------FNGGGYSTLSFI 218

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
            ++ + I+G+  G  +         LK++      L +  L  HFT   P+ K+++T ++
Sbjct: 219 PTLGTMILGLIAGKWLKTAVSSTWLLKRYAITAGVLFLLALAFHFTGLNPIVKRIWTPAW 278

Query: 357 VCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN 411
              + G A L+ +A Y +VD+   K  F PL  IG N++  YV+ A+G F GFI 
Sbjct: 279 TLFSGGWAFLLLAAFYFVVDVKGQKSWFFPLIVIGTNSIAAYVI-ADG-FGGFIR 331


>gi|198277541|ref|ZP_03210072.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
 gi|198270039|gb|EDY94309.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
          Length = 338

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 72/117 (61%), Gaps = 7/117 (5%)

Query: 49  MILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR--IPDRADAVKK 104
           MILV++AGG   +  + H+ WNG    D V PFFLF+VG++  ++L++      ++ ++K
Sbjct: 1   MILVNNAGGPVSYAPLRHSVWNGLTPCDLVFPFFLFMVGISTYISLRKFNFGPTSEVIRK 60

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
           ++ RT  ++  G+ +   F +A +   + +D   +R+ GVLQRI L Y +VSL+ I+
Sbjct: 61  IVRRTFLIILIGLAID-WFGYACNGNFFPIDT--LRIPGVLQRIGLCYGIVSLMVIY 114



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 69/124 (55%), Gaps = 2/124 (1%)

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
           +A  +  +P +PEGL  ++S++  T+IG   G +++       R+ + +T    +LI G 
Sbjct: 159 EAHLYKKSPIDPEGLAGTLSAVAHTLIGFMCGRLLLEKISVNKRIVKLITAAVMMLIIGY 218

Query: 338 TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLV 397
            L     +P+NK++++ ++V VT G  +L+ + +  ++D+ N+   +      GMN + +
Sbjct: 219 VLSI--WMPINKRVWSTTFVLVTCGWGSLLLALLMYVIDVKNINKGWTFFLVFGMNPLFL 276

Query: 398 YVMA 401
           YV++
Sbjct: 277 YVLS 280


>gi|313145390|ref|ZP_07807583.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423279948|ref|ZP_17258861.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
           610]
 gi|424661980|ref|ZP_18099017.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
           616]
 gi|313134157|gb|EFR51517.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404578291|gb|EKA83026.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
           616]
 gi|404584284|gb|EKA88949.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
           610]
          Length = 377

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 100/406 (24%), Positives = 163/406 (40%), Gaps = 87/406 (21%)

Query: 33  QRLASLDIFRGLAVALMILVD-------------HAGGDWPEISHAPWNGCNLADFVMPF 79
           QRL SLD  RGL +  ++ +              H  G    + H  W G +  D +MP 
Sbjct: 7   QRLESLDALRGLDLFFLVALGPLLRTLVRAIDSPHLDGVNWCLRHVDWIGFSPWDLIMPL 66

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLK--LLFW--GILLQGG-FSHAPDELTYGV 134
           FLF+ G++I  AL R    AD   K+I+R  K  LL W  G++ QG   S  PD L    
Sbjct: 67  FLFMSGISIPFALSRFKGEADK-SKLIYRLCKRVLLLWIFGMMCQGNLLSFDPDHLYLYT 125

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +         LQ IA  Y+  +L+ ++T     K Q V              +   +L++
Sbjct: 126 N--------TLQSIATGYIAAALLFLYTG---RKTQIV--------------LCVALLLI 160

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y A +        +F  I+     YG              P  N   +IDR VLG     
Sbjct: 161 YWAAM--------KFISID----GYGG---------GNYTPEGNLAEWIDRTVLG----- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R +        +G   ++        +    +LSS++ I++ + G+  G++   
Sbjct: 195 ------RFR--------DGASVENGTIIFAEGYYYTWILSSLNFIVTVMTGLFAGYIAKD 240

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
                 +L+ +  +G  +++ G T       P+ K ++T S V V+SG   L+    Y  
Sbjct: 241 ATEGKHKLRLYFGIGAGMVLAGWTWGLV--FPVIKTIWTSSMVLVSSGYCFLLMGLFYYW 298

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYG-DPH 419
           +D    +     L   GMN++L Y++     F    N  +YG +P+
Sbjct: 299 IDYKQHRSHLTLLKVYGMNSILAYLLDCIFDFRCIGNSLFYGLEPY 344


>gi|300123407|emb|CBK24680.2| unnamed protein product [Blastocystis hominis]
          Length = 349

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 75/163 (46%), Gaps = 24/163 (14%)

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           YID K+LG  HMYH  A                            FE EG+L  +++I  
Sbjct: 157 YIDLKLLGEKHMYHGSAVVYVYGGNH-------------------FECEGILGMMNAIFL 197

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYV 357
           T +G     V    K    +L  ++ +G  LL+F   L     +   +P+NK  +  S++
Sbjct: 198 TYLGTWIPWVFRTVKKQKNQLLVYLGIGAGLLLFSGILCGFKQYDGYMPINKNKWNTSFI 257

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
            +TSG   L F  IY LVD+W +   F P   +GMN++L+YV+
Sbjct: 258 AITSGTGFLAFGLIYLLVDVWKIWSGF-PYRALGMNSLLIYVI 299


>gi|154492357|ref|ZP_02031983.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
           43184]
 gi|154087582|gb|EDN86627.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
           43184]
          Length = 376

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 157/387 (40%), Gaps = 86/387 (22%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG---DWPE-------ISHAPWNGCNLADFVMPF 79
           QRL SLD+ RG  +  ++ ++   H  G   D P         SH  W+G +  D VMP 
Sbjct: 6   QRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWDGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    ++++ R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
            R+      LQ IA+ YL+ +++ +             R S+             V+ + 
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAMLFLHV-----------RLSV---------QIGTVVALL 157

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
           LA     Y    QF  ++     YG              P  N   ++DR VLG      
Sbjct: 158 LA-----YWGAMQFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR----- 194

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                R  A  ++                  +    +LSS++  ++ + GV  GH++   
Sbjct: 195 ----FRDAAVVENGQIV----------FAESYRYTWILSSLNFGVTVLTGVFAGHIL--- 237

Query: 316 KGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           K  + R  +W   +G   ++  L   +   +P+ K+++T S V V+SG   L+    Y  
Sbjct: 238 KSGMDRKHKWQWLLGIGAIMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGLFYYW 297

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           +D    +     L   GMN+++ Y++A
Sbjct: 298 IDYKGHRKNLTWLKVYGMNSIVAYMLA 324


>gi|313225183|emb|CBY20977.1| unnamed protein product [Oikopleura dioica]
          Length = 335

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 90/369 (24%), Positives = 152/369 (41%), Gaps = 75/369 (20%)

Query: 42  RGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA 101
           RG+A+ +MI V++ GG +    HA W G  +AD  MP+F+F++GV++  +   +      
Sbjct: 2   RGIAIGIMIFVNYGGGGYWFFDHAVWFGLTVADLAMPWFMFMMGVSLTFSFNSM------ 55

Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
           VKKV+  +  L+                 T+G         GVLQR A+ Y +VS +++ 
Sbjct: 56  VKKVLRLSYNLV---------------NPTFGT------FPGVLQRFAICYAVVSPLQLA 94

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY-G 220
            +    +                WL+   +  ++L L +G          +        G
Sbjct: 95  QQSTSLR----------------WLVILGLEGLWLGLTFGMAEAGCPAGYLGPGGLHRDG 138

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
           +  N + G        C     I++K         HP +        D  + G L+   P
Sbjct: 139 QFRNCSGGADKGCLIYCRE---INQKAHIYGDSTSHPVFW------HDEYWFGDLQAYDP 189

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
                    EG+L S++SIL   +G+  G +    +    R  +    G  L   G  L 
Sbjct: 190 ---------EGILGSINSILIVFLGLQAGRIFNFYETFQQRAIRLSVWGTVLTAVGGALT 240

Query: 341 FTNA------IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD---IWNLKYPFLPLAWIG 391
             N       IP+ K L+TLS+V V +G   L+   +Y L+D   +W+      P  ++G
Sbjct: 241 GLNQFQEGSNIPIAKNLWTLSFVLVMAGWGFLLLLVLYILIDHKKVWDGA----PFYFVG 296

Query: 392 MNAMLVYVM 400
           MN++LVY++
Sbjct: 297 MNSILVYLL 305


>gi|333382729|ref|ZP_08474395.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828330|gb|EGK01039.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 389

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 79/151 (52%), Gaps = 19/151 (12%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + RL SLD+ RG+ +A MI+V+++G     +  + H  W+G    D V PFF+FI+G++ 
Sbjct: 6   SGRLLSLDVLRGITIAGMIMVNNSGSGEYTYAPLKHVAWDGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQG-GFSH------APDELTYG------ 133
            ++L++     +   + K++ RT+ +   G+ L   G S        PD L +       
Sbjct: 66  YISLRKFNFEFNTPTLLKILKRTIVIFLIGLGLSWLGLSFRTYHMLEPDNLGFWERFFRA 125

Query: 134 -VDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
             D   +R  GV+QR+AL+Y   S++ I  K
Sbjct: 126 ITDFGHLRTLGVMQRLALTYGAASIIAITVK 156



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/163 (28%), Positives = 80/163 (49%), Gaps = 28/163 (17%)

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           V  +D+ +LGINHMY            +DS                  +PEGLLS++ +I
Sbjct: 189 VSIVDQAILGINHMY------------KDSGLA--------------IDPEGLLSTIPAI 222

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
              +IG   G +I++TK +  R+ Q   +G  L   G  L +    P+NK+++T ++V  
Sbjct: 223 AHVLIGFCCGALIMNTKDNDKRISQLFIVGTILTFAGFLLSY--GCPINKKIWTPTFVLA 280

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           T G A+L+ + +  ++DI   +   +     G+N + +YVMA 
Sbjct: 281 TCGLASLLLALLIWIIDIKGHRKWSVFFESFGVNPLFIYVMAG 323


>gi|338211620|ref|YP_004655673.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305439|gb|AEI48541.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 393

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 90/394 (22%), Positives = 154/394 (39%), Gaps = 90/394 (22%)

Query: 34  RLASLDIFRGLAVALMIL----VDHAGGDWPEIS----------HAPWNGCNLADFVMPF 79
           R++S+D +RG  + LM+       H     P+ S          H  W GC+L D + P 
Sbjct: 8   RISSVDAYRGFVMFLMMAEVLEFGHISKALPDSSFWAFLAYNQDHVEWVGCSLHDLIQPS 67

Query: 80  FLFIVGVA----IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F F+VGVA    IA  + +  +        + R+L L+F GI L+   S   ++  +  +
Sbjct: 68  FSFLVGVALPYSIASRMAKGQNFGSMFGHTVQRSLILIFLGIFLR---SMHREQTNFTFE 124

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
                    L +I L Y ++  +   T   Q                   L+A   ++V 
Sbjct: 125 -------DTLTQIGLGYPVLFWLGFKTFRTQ-------------------LIALSTILVG 158

Query: 196 LALLYGTY-VPDWQFTIINKDSADYGKVFNVTC-------GVRAKLNPPCNAVGYIDRKV 247
             L +  Y +P   F        DY  V  VT        G  A  N   NA        
Sbjct: 159 YWLFFAAYPLPGADF--------DYASV-GVTADWEHNLQGFAAHWNKNTNA-------- 201

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
                     AW   +      P E P  ++   +    F P        ++ + ++G+ 
Sbjct: 202 ----------AWSFDQWFMNLFPREKPFIRNGGGYSTLSFIP--------TLGTMVLGLM 243

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G  +      + ++K++  +G  LL+ G  L++    P  K+++T ++V  + G   L+
Sbjct: 244 AGQWLKSDTAPMEKVKRFAALGVGLLVVGSLLNWLGICPNVKRIWTPTWVLFSGGWCFLL 303

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
            +A Y LVDI  L+  F  L  IG N++  Y+++
Sbjct: 304 LAAFYWLVDIQGLRRAFFWLIVIGTNSIAAYIIS 337


>gi|254524630|ref|ZP_05136685.1| putative heparan-alpha-glucosaminide N-acetyltransferase
           (transmembrane protein 76) [Stenotrophomonas sp. SKA14]
 gi|219722221|gb|EED40746.1| putative heparan-alpha-glucosaminide N-acetyltransferase
           (transmembrane protein 76) [Stenotrophomonas sp. SKA14]
          Length = 355

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 75/141 (53%), Gaps = 20/141 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           A ++   P   DA  +      V+ R L++L          +     + + +     R+ 
Sbjct: 66  AFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHTHHFRIW 116

Query: 143 GVLQRIALSYLLVSLVEIFTK 163
           GVLQRIA+   LV ++ ++ +
Sbjct: 117 GVLQRIAVCAALVGVLAVYAR 137



 Score = 46.2 bits (108), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 61/115 (53%), Gaps = 10/115 (8%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ ++ ST++G+      +   G  A L     +G A  + GL L     +PL
Sbjct: 193 DPEGLLSTLGALASTVLGLLA--GGLLRNGRSAALGG---LGVATAVLGLLL--ATVLPL 245

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           NKQL+T SYV  T G AAL     + L+D     +P L     G+NA+  Y+ A+
Sbjct: 246 NKQLWTPSYVLWTGGLAALALWLGHVLID--RKGWPALGRR-FGVNAITAYLGAS 297


>gi|260790699|ref|XP_002590379.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
 gi|229275571|gb|EEN46390.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
          Length = 347

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 130/327 (39%), Gaps = 92/327 (28%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           S P+ S+Q        +RL SLD FRG                                 
Sbjct: 40  STPE-SEQGLTEKKARERLRSLDTFRG--------------------------------- 65

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVK---KVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
              F+FI+G ++AL+ + +  R    +   +VI R+ KL   G  L  G  H  +     
Sbjct: 66  ---FVFIMGTSMALSFRGMRKRTSTRRVVFRVITRSAKLFLVGFFLNAG--HGRN----- 115

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH--------- 184
            D+  +R+ GVLQR++++YL+   +E F    +       R +   L   H         
Sbjct: 116 -DLGTVRVPGVLQRLSIAYLVSGFIECFVGKERKSSDERSRLTNPTLQKIHNALRDIVDN 174

Query: 185 ---WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
              WL+   +LV++L + +   VP      +       G            LN    A G
Sbjct: 175 WAAWLLHLLILVIHLIITFLLPVPGCPTGYLGPGGPLLGDGVEY-------LNCTGGAAG 227

Query: 242 YIDRKVLGINHMYHHPAWR---RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
           YIDR +LG +HMY  P  R   ++K                       F+PEG+L S+++
Sbjct: 228 YIDRLILG-SHMYQTPTVRVFYKTKVA---------------------FDPEGILGSLTT 265

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQW 325
           I +  +G+  G ++++ K H +R+ +W
Sbjct: 266 IFNCFLGLQAGKILVYYKEHSSRIIRW 292


>gi|406834451|ref|ZP_11094045.1| hypothetical protein SpalD1_22506 [Schlesneria paludicola DSM
           18645]
          Length = 358

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 143/350 (40%), Gaps = 92/350 (26%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-------IPDRADAVKKVIFRTLKLL 113
           ++ H  W+G +  D + P FLF+VGV +  +L +       +P+R+    ++I RTL L+
Sbjct: 26  QLEHVKWDGFHFYDLIFPLFLFLVGVVLPFSLTKYQTAGELVPNRSGVYARIIRRTLLLI 85

Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
             G++  G            +D    R  GVLQRI + Y   +L  +    +  + Q+  
Sbjct: 86  ALGLIGNG---------ILQLDFTNFRWPGVLQRIGICYFFAALAVL---HLGIRSQA-- 131

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                       L+A  ++  Y ALL     P +Q              F++T       
Sbjct: 132 ------------LLAIGIIAGYWALLRFVPAPGFQ-------------PFDLTM------ 160

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF-EPEGL 292
               + VGY+DR ++                               P   H    + EG+
Sbjct: 161 --EGSLVGYVDRLLI-------------------------------PGKLHYKLGDNEGV 187

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           LS++ +I + ++GV  GH +  +     +           L  G    F+   PLNK L+
Sbjct: 188 LSTIPAIGTALLGVLTGHWLRSSYTRFHKFLGLCAGAIVCLAAGYLWSFS--FPLNKILW 245

Query: 353 TLSYVCVTSGAAALVFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVYVM 400
           T S+V VT G +  + +  Y L+D+  W  ++ F  +  IGMNA+ +YVM
Sbjct: 246 TSSFVLVTGGWSLTLLAVFYLLIDVIGWR-RWAFFFIV-IGMNAITIYVM 293


>gi|343082821|ref|YP_004772116.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351355|gb|AEL23885.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 367

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 158/397 (39%), Gaps = 106/397 (26%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLA 73
           +  +K  R+ S+D  RG  +  +I  D      H GG  P       + SH  W G    
Sbjct: 3   EQAIKPNRILSIDALRGFDMLFIIFADRFFALLHKGGQTPFTGFLANQFSHPDWFGSTFY 62

Query: 74  DFVMPFFLFIVGVAIALAL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D +MP FLF+VG  I  +L KR+ +   +A   KK+  R L L F G ++QG        
Sbjct: 63  DIIMPLFLFMVGAVIPFSLSKRMQENTGKAQIYKKLFKRVLILFFLGWIVQGNL------ 116

Query: 130 LTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
               +D+   ++    LQ IA+ Y    L  I+           GR+ +F         A
Sbjct: 117 --LALDINTFKIFSNTLQAIAVGYFFSCLAFIYL-------SRNGRYIMF---------A 158

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
           AC L++Y  +L    VP                  NV    ++ + P  N   Y D  VL
Sbjct: 159 AC-LIIYAMIL---TVP------------------NVPGVGQSVILPDKNYALYFDHLVL 196

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G     +   W                                +L+    I +T+ G+  
Sbjct: 197 GRFDDGYQYTW--------------------------------VLTGFGFIATTLSGLFA 224

Query: 309 GHVIIHTKGHLARLKQWVTMGFALL-IFGLTLHFTNAI--PLNKQLYTLSYVCVTSGAAA 365
           G +I   K  L R K  V +   L+ I GL L     I  P+ K+++T S+V  +SG   
Sbjct: 225 GELI---KSTLPRKK--VALYLLLVGIAGLALGMIWGIWHPIVKKIWTSSFVLASSGVCF 279

Query: 366 LVFSAIYALVDI-WNLKYPFLPLAWIGMNAMLVYVMA 401
           ++ +  Y ++D+    K+ F+    IGMNA+  YV++
Sbjct: 280 ILLAIFYWIIDVKGKYKWAFM-FKVIGMNAITAYVLS 315


>gi|315498708|ref|YP_004087512.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
 gi|315416720|gb|ADU13361.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
          Length = 378

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 73/152 (48%), Gaps = 33/152 (21%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW--------PEISHAPW 67
           +SEP  +         TQRL SLD+ RGL V  MILV+   G +        P + HA W
Sbjct: 1   MSEPKTA---------TQRLPSLDVLRGLTVIGMILVNATAGMYYGLQAKVFPLLLHAHW 51

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFRTLKLLFWGILLQ-- 120
            G  +AD V P FL +VG++I +AL R       D A A +K+  R L+L   G LL   
Sbjct: 52  EGLKIADVVFPAFLTMVGLSIPMALNRAKMTTGLDVAQA-RKIGGRVLRLFLIGWLLSNL 110

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           G  +H         D    R  GVLQRI L Y
Sbjct: 111 GWLAH--------FDGEPWRFWGVLQRIGLVY 134



 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 61/120 (50%), Gaps = 10/120 (8%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           ++PEGLL ++ ++   ++G+  G  ++      A        G  LLI G    F    P
Sbjct: 206 YDPEGLLGTLPALAQALLGMAAGEFLMQNCRRSAL--TLAGAGVVLLILGAGWGFV--FP 261

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN--LKYPF-LPLAW---IGMNAMLVYVM 400
           + K +++ S+V VT+G   L  + ++A +D  +  L+ P  LP+ +    G+NA+  YV+
Sbjct: 262 IIKDIWSSSFVLVTTGITLLALAPLHACLDNRDTPLRGPLTLPITFASAFGLNAIAAYVL 321


>gi|212557932|gb|ACJ30386.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
          Length = 387

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 91/174 (52%), Gaps = 14/174 (8%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF 341
           +A  +PEG+LS++ ++++ ++GV  GH I+  H KG   +L   V     L +  L    
Sbjct: 217 NAAVDPEGILSTIPAVVNGLLGVFVGHFIVKTHAKGEWYKLA--VMCLLGLGLLALGWLI 274

Query: 342 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM- 400
           +  IP+NK L+T S+V VTSG + L+ +  Y ++D+  L     P   IG NA+++Y+  
Sbjct: 275 SPIIPVNKTLWTSSFVLVTSGWSVLLLTLFYGVIDVMKLSRWAFPFIVIGCNAIVIYLAT 334

Query: 401 -------AAEGIFAGFINGWYYGDPHNTLVCFLFIISYILHSFLWELRKFLYVQ 447
                   A+ +F G IN  ++  P  +L+  + ++        W  ++ ++++
Sbjct: 335 SIVNWKYTADSLFGGVIN--WFPVPAQSLMAVIGLLLVQWLVLYWMYKRSIFIK 386



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 28/160 (17%)

Query: 20  DVSDQQEKSHLKTQ-RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHA 65
           +   Q E    K + RL SLD  RG  +           AL++L    G  W   ++ H+
Sbjct: 6   NTQSQTEHGPKKNKVRLKSLDALRGFDMFWILGGEAIFAALIVLTGWGGLHWLDKQMHHS 65

Query: 66  PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV------IFRTLKLLFWGILL 119
            W+G    D + P F+F+ GVA+ L+ KR+ D+   V+++      + R L LL  G++ 
Sbjct: 66  AWHGFTFYDLIFPLFIFLSGVALGLSPKRL-DKLPMVQRMPLYQHAVKRLLLLLLLGVIY 124

Query: 120 QGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
             G+ + AP  L        IR   VL RIA ++   +L+
Sbjct: 125 NHGWGTGAPMALGD------IRYASVLGRIAFAWFFCALL 158


>gi|170725675|ref|YP_001759701.1| hypothetical protein Swoo_1314 [Shewanella woodyi ATCC 51908]
 gi|169811022|gb|ACA85606.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908]
          Length = 378

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 75/135 (55%), Gaps = 14/135 (10%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFA-LLIFGLTLHFTNA 344
           +PEG+LS++ +I++ + GV  GH I+  H KG   ++   VTM  A  L+ G        
Sbjct: 212 DPEGILSTLPAIVNAMAGVFVGHFIVKEHAKGEWYKV---VTMLIAGALVLGCGWLLNLV 268

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA--- 401
           IP+NK L+T S+V VT+G + ++ +  YA VD+   +    P   IG NA+++Y+ +   
Sbjct: 269 IPVNKDLWTSSFVLVTTGWSMILLAVFYAAVDVLKWQKAAFPFVVIGCNAIIIYLASSLI 328

Query: 402 -----AEGIFAGFIN 411
                A+ +F G ++
Sbjct: 329 DWKYTAQSLFGGLVS 343



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 72/154 (46%), Gaps = 27/154 (17%)

Query: 26  EKSHLKT--QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGC 70
           E +  KT  +RL SLD  RG           L   L++     G  W   ++ H+ W+G 
Sbjct: 2   EATQAKTPKRRLMSLDALRGFDMFWILGGEALFAGLLLWTGWHGWQWADAQMHHSQWHGF 61

Query: 71  NLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-S 124
              D + P F+F+ GVA+ L+ KR+       R    K  + R L LLF+G+L   G+ +
Sbjct: 62  TFYDLIFPLFIFLSGVALGLSPKRLDKLPMAQRMPLYKHSVKRLLLLLFFGVLYNHGWGT 121

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
            AP      V +  +R   VL RIA ++   +++
Sbjct: 122 GAP------VAIDEVRYASVLGRIAFAWFFAAML 149


>gi|58583544|ref|YP_202560.1| hypothetical protein XOO3921 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58428138|gb|AAW77175.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 362

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/124 (38%), Positives = 65/124 (52%), Gaps = 5/124 (4%)

Query: 40  IFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP 96
           +FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A++ AL    
Sbjct: 1   MFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSAMSFALATNM 60

Query: 97  DRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLL 154
                + +V  R   +   G+L+     F   PD       V  +RL GVLQRI L YL 
Sbjct: 61  PHLQFLGRVSKRAALIALCGVLMYWFPFFHLQPDGGWAFTTVDQVRLTGVLQRIGLCYLA 120

Query: 155 VSLV 158
            +L+
Sbjct: 121 AALL 124


>gi|423344000|ref|ZP_17321713.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409213862|gb|EKN06875.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 376

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 90/387 (23%), Positives = 157/387 (40%), Gaps = 86/387 (22%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG-----DWPE-----ISHAPWNGCNLADFVMPF 79
           +RL SLD+ RG  +  ++ ++   H  G      W        SH  W G +  D VMP 
Sbjct: 6   KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    +++I R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            R+      LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+V
Sbjct: 119 -RIYLYSNTLQAIAMGYLIAAMLFLHVRLSVQ-----IGT-------------AVALLLV 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R  A  ++                  +    +LSS++  ++ + G+  G  I+ 
Sbjct: 195 -----FRDAAVVENGQVV----------FAESYRYTWILSSLNFGVTVLTGLFAGQ-ILK 238

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +     R  QW+ +G  + +  L   +   +P+ K+++T S V V+SG   L+    Y  
Sbjct: 239 SAMDQKRKWQWL-LGIGVAMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGVFYYW 297

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           +D    +     L   GMN+++ Y++A
Sbjct: 298 IDYKGHRKNLTWLKVYGMNSIVAYMLA 324


>gi|265767324|ref|ZP_06094990.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|263252629|gb|EEZ24141.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
          Length = 375

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 88/396 (22%), Positives = 156/396 (39%), Gaps = 93/396 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +LV+Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLVIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+  A Y  +D          L   GMN++  Y++ 
Sbjct: 288 LLMGAFYYRIDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|53715734|ref|YP_101726.1| hypothetical protein BF4455 [Bacteroides fragilis YCH46]
 gi|423271955|ref|ZP_17250924.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
           CL05T00C42]
 gi|423276040|ref|ZP_17254983.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
           CL05T12C13]
 gi|52218599|dbj|BAD51192.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
 gi|392696310|gb|EIY89506.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
           CL05T00C42]
 gi|392699545|gb|EIY92721.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
           CL05T12C13]
          Length = 375

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 88/396 (22%), Positives = 156/396 (39%), Gaps = 93/396 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +LV+Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLVIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+  A Y  +D          L   GMN++  Y++ 
Sbjct: 288 LLMGAFYYRIDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|345881756|ref|ZP_08833266.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
 gi|343918415|gb|EGV29178.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
          Length = 380

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 87/410 (21%), Positives = 163/410 (39%), Gaps = 99/410 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH-----------AGGD-----WPEISHAPWNGCNLAD 74
           + QRL SLDI RG  +A+++L+             A G        ++SH PW G    D
Sbjct: 9   QPQRLLSLDILRGADLAMLVLIQPILFRALKTAHPAEGTIGHFIMGQLSHLPWEGFCFWD 68

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK--LLFW--GILLQGGFSHAPDEL 130
            +MP F+F+ G+ I  A+ R    A    +  +R +K  ++ W  G+++QG         
Sbjct: 69  IIMPLFMFMSGITIPFAMSRYKRGARIDGQFYWRIIKRFVVLWVLGMVVQGNL------- 121

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
               D+R + L    LQ IA+ Y+ V+ + +F      + Q V                +
Sbjct: 122 -LAFDLRQLHLFSNTLQSIAVGYVAVAFLFVFCS---LRTQIVA--------------VS 163

Query: 190 CVLVVYLAL--LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
              + Y+A+  L+G +     FTI                          N    IDR V
Sbjct: 164 LSFIAYIAIFALWGHF----DFTI------------------------DANICEAIDRAV 195

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGV 306
           LG                     F   ++    +W   P +    ++SS++ I++  +G 
Sbjct: 196 LG--------------------RFRDGVQWQGDTWHWDPTYHYTWIMSSLNFIVTVYLGT 235

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
             G+++  T+  + + +  +  G A++   L +H    IP+ K +++ S      G   +
Sbjct: 236 LAGYLLKSTRTAMQKFRWLMWGGVAMIAVSLLMH--PWIPIIKHIWSSSMTLFAGGICFV 293

Query: 367 VFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYG 416
           +    Y ++D+   +  ++ L + GMN++  Y +     F    +  ++G
Sbjct: 294 LMGLFYYIIDVKGCRRGWMWLRYYGMNSLAAYFIGDYMNFKSITDSLFFG 343


>gi|254446502|ref|ZP_05059978.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
           DG1235]
 gi|198260810|gb|EDY85118.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
           DG1235]
          Length = 394

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 146/356 (41%), Gaps = 87/356 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           +K +RL +LD  RG  +  MI+V+  G  W  +     HA W+G    D V PFFLF VG
Sbjct: 1   MKRERLLALDALRGFTIIGMIIVNSPG-SWSHVYSPLLHASWHGVTPTDLVFPFFLFFVG 59

Query: 86  VAIALA------LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           V+IALA       KR  +R    +K+ +R  K+   G+ L        +E+         
Sbjct: 60  VSIALAYSGKRGTKR--ERVGKYRKIFWRVAKIFALGLFLNLWPYFYFEEM--------- 108

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQRIAL + + +++ + T+  Q                W   + A +L+ Y ALL
Sbjct: 109 RVAGVLQRIALVFGVCAILFLNTRWKQQ--------------LW---VGASILLGYWALL 151

Query: 200 YGTYVPDWQFTIINKDSAD-------YGKVFNVTCGVRAK------LNPPCNAVGYIDRK 246
               VP      +N  + +       YG    V+   R +        P  N   ++DR 
Sbjct: 152 VWVPVP---LDEVNAGALETGIVERSYGTEVAVSVEARGETSIAGNFEPGVNIAAWVDRV 208

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +L          W R+                        ++PEGLLS+V ++ + I G+
Sbjct: 209 LL------PGGMWERT------------------------WDPEGLLSTVPAVATGIFGM 238

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
             G +I+       R+     +G   L+ G    +    P NK L++ S+V    G
Sbjct: 239 LVGALILGVGDPYRRVSWVFFVGVVALLIGSAWSWV--FPYNKNLWSSSFVLYAGG 292


>gi|430747657|ref|YP_007206786.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
           18658]
 gi|430019377|gb|AGA31091.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
           18658]
          Length = 418

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 84/404 (20%), Positives = 152/404 (37%), Gaps = 76/404 (18%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMIL----VDHAGGDWPEIS-------- 63
           ++ P   +        ++RLAS+D FRG  + L++     +      +P+          
Sbjct: 15  VNAPKPPESSGSGSAPSRRLASIDAFRGFVMFLLLAEWLKLPQVAKSFPKSELWALLSRH 74

Query: 64  --HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGI 117
             H  W GC+L D + P F F+VGVA+  ++     R  +  ++     +R L L+  GI
Sbjct: 75  QQHVEWVGCSLHDLIQPSFSFLVGVALPFSIASRLARGQSTTRMAGHAFWRALVLVLLGI 134

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
            L+          + G D         L +I L Y  + L+ +  +  +D+         
Sbjct: 135 FLR----------SMGKDRTNFTFEDTLTQIGLGYGFLFLLGL--RPARDQ--------- 173

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD-SADYGKVFNVTCGVRAKLNPP 236
                  W+    +LV Y         P  +F   + D   D+    +   G  +  N  
Sbjct: 174 -------WIALVVILVGYWGAFALYPAPGTEFDYTSVDVPKDWAHNLS---GFASHWNKN 223

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
            NA    D   L +                   P + P   +   +          LS +
Sbjct: 224 SNAAWAFDTWFLNL------------------FPRKAPFIDNRGGYAT--------LSFI 257

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
            ++ + I G+  G+V+   +    +L      G   L  G  L   +  P+ K+++T S+
Sbjct: 258 PTLATMIFGLIAGNVLKDDRKAWMKLGWLTAAGILGLFLGAALGELDFCPVVKRIWTPSW 317

Query: 357 VCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
              ++G   L+ +A YA+VD+  L +   PL  IG N++  Y +
Sbjct: 318 TLFSAGWCFLILAAFYAVVDMAGLWWLTYPLIIIGTNSIAAYCL 361


>gi|340617673|ref|YP_004736126.1| hypothetical protein zobellia_1684 [Zobellia galactanivorans]
 gi|339732470|emb|CAZ95738.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 346

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 142/367 (38%), Gaps = 101/367 (27%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGI 117
           ++ H PWNG    D + PFF+FIVGVA+  +L++     D+    K ++ R   L  +G 
Sbjct: 30  QLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGDKKGVTKHILRRCFLLFAFGA 89

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ---SVGR 174
           LL   +SHA           +  L  VL ++A + L+   V     ++ +K Q   SVG 
Sbjct: 90  LLHCVYSHA----------LVWELWNVLVQLAFTILIAYAV----MNLSNKTQIGISVG- 134

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                            L+V   +LY  Y P+  +    ++   +G              
Sbjct: 135 -----------------LLVLTEVLYRAYNPEAPYV---QNHESFG-------------- 160

Query: 235 PPCNAVGYIDRKVLG-INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
                  Y+D  V+G +N  Y                           W    F P    
Sbjct: 161 ------AYVDMLVMGQVNDGY---------------------------WVFVNFIP---- 183

Query: 294 SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
               +   TI GV  G V++       ++K ++  G  L++ G  + F    P+ K++ T
Sbjct: 184 ----TAAHTIWGVICGKVLLSHIPSTQKIKPFLIWGAVLVVLGFAMDFVGITPIVKRIAT 239

Query: 354 LSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL-AWIGMNAMLVYVMA---AEGIFAGF 409
            S+   + G A L  S  Y L+DI   +  +L + + +G N++ +Y+ A       F GF
Sbjct: 240 SSFTLASGGIAILTLSLFYWLIDIKGYQNNWLKIFSVVGTNSIFIYLFAETVGAQWFRGF 299

Query: 410 INGWYYG 416
              W  G
Sbjct: 300 GTIWTEG 306


>gi|88859970|ref|ZP_01134609.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
 gi|88817964|gb|EAR27780.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
          Length = 378

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 73/136 (53%), Gaps = 16/136 (11%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM----GFALLIFGLTLHFTN 343
           +PEGLLS+V ++++ ++GV  G +I        +  +W T+    G  L    L   +  
Sbjct: 212 DPEGLLSNVPAVVNALMGVFAGRLIAKAN----QFGEWKTVSYLFGAGLTSLALGWAWNM 267

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA-- 401
             P+NK L+T S+V VT G + +  ++ YA+VD+ N +  F P   IG N++++Y+ +  
Sbjct: 268 VFPVNKDLWTSSFVLVTVGWSLIFLASFYAVVDLLNTQKYFYPFIIIGANSIVIYLASSL 327

Query: 402 ------AEGIFAGFIN 411
                 A+ +F G +N
Sbjct: 328 VQWEYVAQSVFGGLVN 343



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 70/150 (46%), Gaps = 27/150 (18%)

Query: 25  QEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDWP----EISHAPWNG 69
             ++  + +RLASLD  RG+ +           AL IL    G  W     +  H+ W+G
Sbjct: 3   NTETKPQKRRLASLDALRGMDMFWILGGEKIFAALFILTGWTG--WQVAHGQTLHSNWHG 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF-RTLKLLF----WGILLQGGFS 124
               D + P F+F+ GVA+ L+ KRI       ++V + + LK LF    +G+L   G+ 
Sbjct: 61  FTFYDLIFPLFIFLAGVAMGLSPKRIDHLPFQERRVYYAKALKRLFLLAGFGVLYNHGWG 120

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLL 154
                     ++  IR   VL RIA+++ +
Sbjct: 121 TGIP-----FNLEEIRYASVLGRIAIAWFV 145


>gi|386719962|ref|YP_006186288.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
           maltophilia D457]
 gi|384079524|emb|CCH14124.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
           maltophilia D457]
          Length = 352

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 75/144 (52%), Gaps = 20/144 (13%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +  +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VG
Sbjct: 1   MPPRRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVG 59

Query: 86  VAIALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           V++A ++   P   DA  +      V+ R L++L          +     + + +     
Sbjct: 60  VSMAFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHTHHF 110

Query: 140 RLCGVLQRIALSYLLVSLVEIFTK 163
           R+ GVLQRIA+    V ++ ++ +
Sbjct: 111 RIWGVLQRIAVCAASVGVLAVYAR 134



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 60/115 (52%), Gaps = 10/115 (8%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ ++ ST++G+  G ++ + +          T    LL+          +PL
Sbjct: 190 DPEGLLSTLGALASTVLGLIAGGLLRNGRAAALAGLGVATAVLGLLL-------ATVLPL 242

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           NKQL+T SYV  T G AAL     + L+D    ++P L     G+NA+  Y+ A+
Sbjct: 243 NKQLWTPSYVLWTGGLAALALWLGHVLID--QKRWPPLGRR-FGVNAITAYLGAS 294


>gi|224064476|ref|XP_002301495.1| predicted protein [Populus trichocarpa]
 gi|222843221|gb|EEE80768.1| predicted protein [Populus trichocarpa]
          Length = 136

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 30/65 (46%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 199 LYGTYVPDWQFTIINKDSADY-GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           LYG Y PDW+F + +     Y      V CGV+  L PPCNA G IDR  LG + +Y HP
Sbjct: 68  LYGLYDPDWEFEVPSTHLFGYKSGTKTVNCGVKGSLEPPCNAAGLIDRFFLGEHPLYQHP 127

Query: 258 AWRRS 262
            +RR+
Sbjct: 128 VYRRT 132



 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 37/45 (82%)

Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
          MILVD AGG +P I+H+PW G  L+DFVMPFFLF+VG++I+L  K
Sbjct: 1  MILVDDAGGAFPCINHSPWFGVTLSDFVMPFFLFVVGLSISLVFK 45


>gi|336411649|ref|ZP_08592112.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
 gi|335941083|gb|EGN02943.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
          Length = 375

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 87/396 (21%), Positives = 156/396 (39%), Gaps = 93/396 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +L++Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLLIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGLSLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+  A Y  +D          L   GMN++  Y++ 
Sbjct: 288 LLMGAFYYRIDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|218260819|ref|ZP_03475938.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
           DSM 18315]
 gi|218224342|gb|EEC96992.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
           DSM 18315]
          Length = 376

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 158/387 (40%), Gaps = 86/387 (22%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG-----DWPE-----ISHAPWNGCNLADFVMPF 79
           +RL SLD+ RG  +  ++ ++   H  G      W        SH  W G +  D VMP 
Sbjct: 6   KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           F+F+ GV++  AL R   +PD+    +++I R   L  +G++ QG           G+D 
Sbjct: 66  FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNL--------LGLDP 117

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
             I L    LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+V
Sbjct: 118 GRIYLYSNTLQAIAMGYLIAAMLFLHVRLSVQ-----IGT-------------AVALLLV 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R  A  ++                  +    +LSS++  ++ + G+  G  I+ 
Sbjct: 195 -----FRDAAVVENGQVV----------FAESYRYTWILSSLNFGVTVLTGLFAGQ-ILK 238

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +     R  QW+ +G  + +  L   +   +P+ K+++T S V V+SG   L+    Y  
Sbjct: 239 SAMDQKRKWQWL-LGIGVAMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGVFYYW 297

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           +D    +     L   GMN+++ Y++A
Sbjct: 298 IDYKGHRKNLTWLKVYGMNSIVAYMLA 324


>gi|423282787|ref|ZP_17261672.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
           615]
 gi|404581658|gb|EKA86354.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
           615]
          Length = 375

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 88/396 (22%), Positives = 156/396 (39%), Gaps = 93/396 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSPPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +LV+Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLVIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+  A Y  +D          L   GMN++  Y++ 
Sbjct: 288 LLMGAFYYRIDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|383119755|ref|ZP_09940493.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
 gi|423252290|ref|ZP_17233284.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
           CL03T00C08]
 gi|423252861|ref|ZP_17233792.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
           CL03T12C07]
 gi|251944624|gb|EES85099.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
 gi|392647563|gb|EIY41262.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
           CL03T00C08]
 gi|392659230|gb|EIY52856.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
           CL03T12C07]
          Length = 375

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 87/396 (21%), Positives = 156/396 (39%), Gaps = 93/396 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +L++Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLLIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+  A Y  +D          L   GMN++  Y++ 
Sbjct: 288 LLMGAFYYRIDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|375360501|ref|YP_005113273.1| putative transmembrane protein [Bacteroides fragilis 638R]
 gi|301165182|emb|CBW24752.1| putative transmembrane protein [Bacteroides fragilis 638R]
          Length = 373

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 88/394 (22%), Positives = 156/394 (39%), Gaps = 91/394 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALA-LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
             D VMP FLF+ G ++  +  K  PD+    +K+I R + L  +G+++QG         
Sbjct: 62  FWDLVMPLFLFMTGASMPFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL------- 114

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
             G+D + + L    LQ IA  YL+ +++++                 FR   W  ++ A
Sbjct: 115 -LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIVTA 156

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            +L++Y   +  T++ D                            P  N    +DR VLG
Sbjct: 157 LLLLIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLVLG 188

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGVHF 308
                H                +G    +  SW   A +    + SS++   + ++G   
Sbjct: 189 -----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGAFA 229

Query: 309 GHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   L+
Sbjct: 230 GKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCFLL 287

Query: 368 FSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
             A Y  +D          L   GMN++  Y++ 
Sbjct: 288 MGAFYYRIDYKGHSRGLNWLKIYGMNSITAYILG 321


>gi|440747820|ref|ZP_20927075.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436483562|gb|ELP39602.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 372

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 86/387 (22%), Positives = 152/387 (39%), Gaps = 95/387 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVD--------HAGGDWPEI-----SHAPWNGCNLADFVMP 78
           ++RL S+D  RG  + ++   D             W ++      H  W G    DF+ P
Sbjct: 9   SKRLVSIDALRGFDMLMICGADAFFRSLEGKTSFAWVDVLARQFEHPEWIGFTFYDFIFP 68

Query: 79  FFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
            FLF+ GV+I  +L +       + +  KK + RTL L+  G+L +    +AP       
Sbjct: 69  LFLFVAGVSIPFSLGKSLAENVSKREIYKKALSRTLLLIGLGMLDK----NAPFPF---F 121

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           D   IRL  VL RI ++      V +F        Q +G             +   VL+ 
Sbjct: 122 DWEQIRLGSVLGRIGIA----GFVTVFLFLNFPARQRLG-------------IVGLVLIA 164

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y A ++   VP +                         L+   N  G+IDR  L      
Sbjct: 165 YYAAVFLIPVPGFG---------------------AGNLSFEGNLAGWIDRTFL------ 197

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                              P R          F+  G+L++  +I  TI+G   G ++  
Sbjct: 198 -------------------PGRL-----LQGSFDELGILTTFPAICLTILGAQAGEIL-- 231

Query: 315 TKGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
               L+  ++ V T+ F ++  GL L +    P+ K+++T S++ + +G A +  +  Y 
Sbjct: 232 RNAQLSEQQKVVRTLLFGVVCIGLALIWHLHFPIFKRMWTSSFILLNAGMAFVALAGFYW 291

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           L+D+ + +        +GMN++ +Y++
Sbjct: 292 LIDMLHFRKWSFFFVVVGMNSLTIYMI 318


>gi|60683670|ref|YP_213814.1| hypothetical protein BF4252 [Bacteroides fragilis NCTC 9343]
 gi|423259842|ref|ZP_17240765.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
           CL07T00C01]
 gi|423267497|ref|ZP_17246478.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
           CL07T12C05]
 gi|60495104|emb|CAH09923.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
 gi|387775880|gb|EIK37984.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
           CL07T00C01]
 gi|392696971|gb|EIY90158.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
           CL07T12C05]
          Length = 375

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 87/396 (21%), Positives = 155/396 (39%), Gaps = 93/396 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S     RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPAPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +L++Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLLIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+  A Y  +D          L   GMN++  Y++ 
Sbjct: 288 LLMGAFYYRIDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|345322030|ref|XP_003430524.1| PREDICTED: hypothetical protein LOC100681967 [Ornithorhynchus
           anatinus]
          Length = 530

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 51/78 (65%), Gaps = 4/78 (5%)

Query: 44  LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRA 99
           L++ LM+ V++ GG +    HAPWNG  +AD VMP+F+FI+G ++ALA     +R  +R 
Sbjct: 130 LSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFILGTSVALAFYAMRRRGVNRV 189

Query: 100 DAVKKVIFRTLKLLFWGI 117
             ++K+ +RT  L+  G+
Sbjct: 190 QLLRKLTWRTAVLMIIGL 207


>gi|409203840|ref|ZP_11232043.1| hypothetical protein PflaJ_21058 [Pseudoalteromonas flavipulchra
           JG1]
          Length = 377

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 69/122 (56%)

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
           S+ + P +PEG+LSS+ +I++ I GV  G  I + +         +  G  +L   L   
Sbjct: 204 SYQNRPVDPEGVLSSLPAIVNAIAGVFAGRAIANAQTQGEWKIVGILAGSGVLALALGWL 263

Query: 341 FTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           +    P+NK+L+T S+V VT G +A++ +  YA+VD+ + +    P   IG N++++Y+ 
Sbjct: 264 WDMQFPVNKELWTSSFVLVTVGWSAILLAVFYAIVDVLSFQRWAYPFVIIGANSIIIYLA 323

Query: 401 AA 402
           ++
Sbjct: 324 SS 325



 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 67/148 (45%), Gaps = 27/148 (18%)

Query: 31  KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADF 75
           K +RLASLD  RG+ +           AL +L    G  W        H+PW+G    D 
Sbjct: 6   KPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTLHSPWHGFTFYDL 63

Query: 76  VMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           + P F+F+ GVA+ L+ KRI      +R     K + R L L  +G+L   G+       
Sbjct: 64  IFPLFIFLSGVAMGLSPKRIDHLPFNERKSFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              +D   +R   VL RIA ++   +L+
Sbjct: 122 ---MDPDGVRYASVLGRIAFAWFFCALL 146


>gi|322785719|gb|EFZ12357.1| hypothetical protein SINV_16151 [Solenopsis invicta]
          Length = 111

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 42/73 (57%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           +V D+      K  R+ ++D FRG++   MI V+   G +  + HA WNG  L D V P 
Sbjct: 33  NVKDESSNKEPKKNRVKAIDTFRGISTLFMIFVNDGSGSYTVLEHATWNGLLLGDLVFPC 92

Query: 80  FLFIVGVAIALAL 92
           F++I+GV + +AL
Sbjct: 93  FIWIMGVCVPIAL 105


>gi|189463416|ref|ZP_03012201.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM
          17136]
 gi|189429845|gb|EDU98829.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM
          17136]
          Length = 82

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
          K QRL SLD+ RG+ +A MI+V++ G     +  ++HA WNG    D V PFF+FI+G++
Sbjct: 3  KAQRLISLDVLRGITIAGMIIVNNPGSWKHVYTPLTHAVWNGLTPTDLVFPFFMFIMGIS 62

Query: 88 IALALKR 94
            ++LK+
Sbjct: 63 TYISLKK 69


>gi|305665830|ref|YP_003862117.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
 gi|88710601|gb|EAR02833.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
          Length = 346

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 79/348 (22%), Positives = 133/348 (38%), Gaps = 96/348 (27%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGI 117
           ++ H PWNG    D + PFF+FIVGVA+  +L++      R  A + ++ R   L  +G 
Sbjct: 30  QLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGSRKSATRHILKRCFLLFAFGA 89

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ---SVGR 174
           LL   +SHA           +  L  VL ++A +     L+  F   + +K Q   S+G 
Sbjct: 90  LLHCVYSHA----------LVWELWNVLVQLAFTI----LIAYFIMSLSNKAQIGISIG- 134

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                            L++   +LY TY P   +    +    +G              
Sbjct: 135 -----------------LLLLTEILYRTYNPSAPYV---QGHESFGS------------- 161

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
                  +ID  V+G                          + ++  W    F P     
Sbjct: 162 -------FIDMLVMG--------------------------KTNSGYWVVVNFLPTA--- 185

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
                  TI GV  G +++ +     +LK  V  G  LLI G ++   N  P+ K++ T 
Sbjct: 186 -----AHTIWGVVCGKLLLSSATDQNKLKPIVIWGVVLLILGYSMDLLNITPIVKRIATT 240

Query: 355 SYVCVTSGAAALVFSAIYALVDIWNLKYPFLPL-AWIGMNAMLVYVMA 401
           S+   + G   L  +  Y  +DI   K  +L + + +G N++ +Y+ A
Sbjct: 241 SFTMASGGWCLLTLALFYWWIDIKGNKPNWLKIFSVVGTNSIFIYLFA 288


>gi|383753678|ref|YP_005432581.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
           lactilytica TAM6421]
 gi|381365730|dbj|BAL82558.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
           lactilytica TAM6421]
          Length = 384

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 67/134 (50%), Gaps = 12/134 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVD---HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RLA++DIFRGLA+A+M+LV+   +    WP + HAPW G  +AD   P F+FI+GV+
Sbjct: 7   NKRRLAAIDIFRGLAIAIMLLVNALPNFEQAWPLLVHAPWAGLTIADLAFPGFVFIMGVS 66

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLF---------WGILLQGGFSHAPDELTYGVDVRM 138
            +L   +        K  I     LL          + ++LQ  F   P        V  
Sbjct: 67  ASLWFPKHEQDGSGEKFCIILKRSLLLILLGFFLCQFPLVLQHVFQPEPGGSLIKDIVEH 126

Query: 139 IRLCGVLQRIALSY 152
            R+ GVLQR+ L Y
Sbjct: 127 GRIPGVLQRLGLVY 140


>gi|167623085|ref|YP_001673379.1| hypothetical protein Shal_1151 [Shewanella halifaxensis HAW-EB4]
 gi|167353107|gb|ABZ75720.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4]
          Length = 398

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 78/142 (54%), Gaps = 12/142 (8%)

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLT 338
           ++ +A  +PEG+LS++ ++++ + GV  GH I+  H KG   +L      G AL+  G  
Sbjct: 225 TYQNAAVDPEGILSTIPAVVNGLFGVFVGHFIVKPHVKGEWFKLAVLAASGVALVALGWL 284

Query: 339 LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
           +  +  IP+NK L+T S+  V+SG + L  +  YA++D+  ++        IG N++++Y
Sbjct: 285 I--SPLIPVNKTLWTSSFTLVSSGWSILFLALFYAVIDVVKVQKWAFFFTVIGCNSIVIY 342

Query: 399 VMA--------AEGIFAGFING 412
           + +        A+ +F   I+G
Sbjct: 343 IASSIVNWKYTAQSLFGQLISG 364



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 74/171 (43%), Gaps = 32/171 (18%)

Query: 15  IISEPDVSDQQEKSHLKTQ---------RLASLDIFRGLAV-----------ALMILVDH 54
           II++P  S  +    L+TQ         RL SLD  RG  +           AL++L   
Sbjct: 4   IITKPQSSLIESHLKLQTQSIAKSEAKPRLKSLDALRGFDMFWILGGEAIFAALLVLTGW 63

Query: 55  AGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAV-----KKVIF 107
           AG  W   ++ H+ WNG    D + P F+F+ GVA+ L+ KR+            K  I 
Sbjct: 64  AGFKWFDGQMHHSVWNGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPPRLPLYKHAIK 123

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
           R   LL +G++   G+            V  IR   VL RIA ++   +L+
Sbjct: 124 RLFLLLLFGVIYNHGWGTGAS-----FAVGDIRYASVLGRIAFAWFFCALL 169


>gi|404485011|ref|ZP_11020215.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
           YIT 11860]
 gi|404340016|gb|EJZ66447.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
           YIT 11860]
          Length = 440

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/145 (32%), Positives = 71/145 (48%), Gaps = 25/145 (17%)

Query: 33  QRLASLDIFRGLAVALMIL-----------VDHA--GGDWPEISHAPWNGCNLADFVMPF 79
           QRLASLDI RG  + L++            VD +       +  H  W G    D VMP 
Sbjct: 75  QRLASLDILRGFDLFLLVFLQPVLVSLGACVDSSVMNAVLYQFDHEVWEGFRFWDLVMPL 134

Query: 80  FLFIVGVAIALAL---KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           FLF+ GV++  +    +R+  R    KKV+ R + L   G+++QG           G+D+
Sbjct: 135 FLFMTGVSMPFSFSKYERVESRRFIYKKVLRRFVILFLLGMVVQGNL--------LGLDL 186

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
           + IRL    LQ IA  YL+ +L+++
Sbjct: 187 KYIRLYSNTLQAIAAGYLIAALIQL 211


>gi|392402534|ref|YP_006439146.1| Protein of unknown function DUF2261, transmembrane [Turneriella
           parva DSM 21527]
 gi|390610488|gb|AFM11640.1| Protein of unknown function DUF2261, transmembrane [Turneriella
           parva DSM 21527]
          Length = 396

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 163/398 (40%), Gaps = 99/398 (24%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           SLD+ RGL +ALMI+V++ G DW      + HA W+G   AD V P FLF+ G A AL +
Sbjct: 5   SLDLLRGLTIALMIIVNNPG-DWKAMFAVLRHAEWHGFLGADIVFPLFLFVAGYAAALKI 63

Query: 93  KRI--------PDRADAVK-----------KVIFRTLKLLFWGILLQG-GFSHAPD-ELT 131
            R+        P  A A+             ++ R   L   G+ L        PD E +
Sbjct: 64  DRLYGPTTAGGPHCASALTLEERELPAYYLPLMRRAAILFLIGLFLNAWPLGLLPDTEFS 123

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
           +G     +R+ GVLQRIA+  L                  VG   + R+     L++  +
Sbjct: 124 FG----HLRVLGVLQRIAICVL------------------VGGVLLRRVETVRGLVSVLI 161

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           L ++L   +   VP  +F         +G+ F +            N    +D  VL   
Sbjct: 162 L-IFLLYEFCMRVPLVKFA-----DVFFGRSFELQ----------NNFARLVDISVLPEA 205

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG---VHF 308
            +Y                    +RK A       F+PEGLL+S+++I++ + G      
Sbjct: 206 MLYK-------------------VRKIA-------FDPEGLLTSLTAIMTFLFGGLAFRL 239

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
               I T  H  R       G A +   L     N  P+NK L+T+ +V +T+ AA  + 
Sbjct: 240 ALPCIATTQHKFRNFLQSRWGLASICAVLGFALANFEPINKNLWTIPFVLLTTAAATALL 299

Query: 369 SAI----YALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            A+    +     W  ++   P A +G N +L+YV++ 
Sbjct: 300 LALEKFDFGKRRFW--RHLAQPFADMGKNPLLIYVLSG 335


>gi|392544017|ref|ZP_10291154.1| hypothetical protein PpisJ2_19642 [Pseudoalteromonas piscicida JCM
           20779]
          Length = 377

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 72/126 (57%), Gaps = 8/126 (6%)

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF----ALLIFG 336
           S+ + P +PEG+LSS+ +I++ I GV  G  I + +       +W T+G      +L+  
Sbjct: 204 SYQNRPVDPEGVLSSLPAIVNAIAGVFAGRAIANAQTQ----GEWKTVGILAGSGVLVLA 259

Query: 337 LTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAML 396
           L   +    P+NK+L+T S+V VT G +A++ +  YA+VD+ + +        IG N+++
Sbjct: 260 LGWLWDMQFPVNKELWTSSFVLVTVGWSAILLAVFYAIVDVLSFQRWAYSFVIIGANSII 319

Query: 397 VYVMAA 402
           +Y+ ++
Sbjct: 320 IYLASS 325



 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 63/148 (42%), Gaps = 27/148 (18%)

Query: 31  KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADF 75
           K +RLASLD  RG+ +           AL +L    G  W        H+PW+G    D 
Sbjct: 6   KPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTLHSPWHGFTFYDL 63

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF-----WGILLQGGFSHAPDEL 130
           + P F+F+ GVA+ L+ KRI       +K  +            +G+L   G+       
Sbjct: 64  IFPLFIFLSGVAMGLSPKRIDHLPFNERKPFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              +D   IR   VL RIA ++   +L+
Sbjct: 122 ---MDPDGIRYASVLGRIAFAWFFCALL 146


>gi|29348589|ref|NP_812092.1| hypothetical protein BT_3180 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340494|gb|AAO78286.1| putative transmembrane protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 376

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 78/397 (19%), Positives = 158/397 (39%), Gaps = 92/397 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGC 70
             + S   T RLASLDI RG  + L++             + P       +  H  W G 
Sbjct: 1   MNKLSEKNTTRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHA 126
              D VMP FLF+ G ++  +L +    + +     ++++ R   L  +G+++QG     
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSLSKYVGMSGSYWLVYRRILRRVFLLFIFGMIVQGNL--- 117

Query: 127 PDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                 G+D   I L    LQ IA+ YL+ +++++                    + + W
Sbjct: 118 -----LGLDSSHIYLYSNTLQSIAVGYLIAAVIQLH-------------------FSFRW 153

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            +   +L++++  +  T++ D                            P  N    +DR
Sbjct: 154 QIGITLLLLFIYWIPMTFLGD--------------------------FTPAGNFAEQVDR 187

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTII 304
            VLG           R +        +G    +  +W  +P +    + SS++  ++ ++
Sbjct: 188 CVLG-----------RFR--------DGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVML 228

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G   G ++   K +  ++ Q +++   +L+ GL + ++  +P+ K+L+T S   ++ G  
Sbjct: 229 GAFAGKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGGYC 287

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
            L+ +  Y  +D          L   GMN++  Y++ 
Sbjct: 288 FLLMALFYYWIDYKGHSRGLNWLKVYGMNSITAYLLG 324


>gi|406832166|ref|ZP_11091760.1| hypothetical protein SpalD1_11017 [Schlesneria paludicola DSM
           18645]
          Length = 413

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 98/415 (23%), Positives = 166/415 (40%), Gaps = 85/415 (20%)

Query: 14  LIISEPDVSDQQEKSHLK-------TQRLASLDIFRGLAVALMI-----LVDHAGGDWPE 61
           +I++ P+ S+ +  + L+         RL S+D +RG  + LM+     L D A    PE
Sbjct: 1   MIVTIPNKSEIEGPATLELPAGGAAPSRLVSVDAYRGWVMLLMMAEVLRLRDVAKAL-PE 59

Query: 62  I----------SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK----VIF 107
                      SH  W GC L D + P F F+VGVA+ L+L+R       + +      +
Sbjct: 60  SRLWAFLAQQQSHVTWVGCVLHDMIQPSFSFLVGVALPLSLRRRSLSGQPLWQRTAHAAW 119

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R+L L+  G+ L+          T+    R       L +I L YL + L+ + ++  Q 
Sbjct: 120 RSLVLILLGVFLRS---------THSTQTRWT-FEDTLSQIGLGYLFLYLLSLCSRRAQ- 168

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY-VPDWQFTIINKDSADYGKVFNVT 226
                            W     +LV Y  L +  Y +P   F        D+ K     
Sbjct: 169 -----------------WAALISILVGYW-LFFALYPLPGADF--------DWAK----- 197

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
             ++   N P +  G+     L  N     PAW          P + P   +   +    
Sbjct: 198 --IKGDPNSPSHLSGFAAHWNLNTN-----PAWAFDTWFLNLFPRQAPFTHNGGGYSTLN 250

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F P        ++ + I+G+  G VI+  +    +L   V+ G   L  GL L      P
Sbjct: 251 FIP--------TLATMILGLLAGGVIVSDRSTRFKLIWLVSAGLIGLASGLILDAAGLCP 302

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           + K+++T S+V  + G   L+ +A YA+VD+   +        +GMN++  Y+++
Sbjct: 303 IVKKIWTPSWVLFSGGWCFLILAAWYAVVDVAKWQSWAFVFVVVGMNSIAAYLIS 357


>gi|456985619|gb|EMG21386.1| putative membrane protein [Leptospira interrogans serovar
           Copenhageni str. LT2050]
          Length = 296

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/189 (29%), Positives = 84/189 (44%), Gaps = 38/189 (20%)

Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
           L P  +   +IDR V G NH+     W+ SK                       ++PEG 
Sbjct: 68  LEPGKDIGAWIDRNVFGENHL-----WKFSKT----------------------WDPEGF 100

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
            S +SSI ++++GV  G  I+ +K +  + +     GF  L   + L +   +P+NK L+
Sbjct: 101 FSGISSITTSLLGVFCGS-ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLW 159

Query: 353 TLSYVCVTSGAAALV-----FSAIYALVDIWN---LKYPFLPLAWIGMNAMLVYVMAAEG 404
           T SYV  T+G A L      F  +      WN   L+  F P    G NA+LV+V    G
Sbjct: 160 TGSYVIYTAGLAFLSIGFFEFLNLLLQTKKWNRLRLETIFQPFLVFGKNAILVFV--GSG 217

Query: 405 IFAGFINGW 413
           + A  +N W
Sbjct: 218 LLARILNLW 226


>gi|380693009|ref|ZP_09857868.1| hypothetical protein BfaeM_03398 [Bacteroides faecis MAJ27]
          Length = 376

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 85/396 (21%), Positives = 160/396 (40%), Gaps = 92/396 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGCN 71
            +KS   T RLASLDI RG  + L++             + P       +  H  W G  
Sbjct: 2   SKKSENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF----WGILLQGGFSHAP 127
             D VMP FLF+ G ++  +L +    + + + V  R LK +F    +G+++QG      
Sbjct: 62  FWDLVMPLFLFMTGASMPFSLSKYIGTSGSYRPVYRRILKRVFLLFVFGMIVQGNL---- 117

Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                G+D + I L    LQ IA+ YL+ +++++                    + + W 
Sbjct: 118 ----LGLDGKHIYLYSNTLQSIAVGYLIAAVIQLH-------------------FSFKWQ 154

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +   +L++++  +  T++ D                            P  N    +DR 
Sbjct: 155 IGITLLLLFVYWIPMTFLGD--------------------------FTPAGNFAEQVDRW 188

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIG 305
           VLG           R +        +G    +  +W  +P +    + SS++  ++ ++G
Sbjct: 189 VLG-----------RFR--------DGVYWNEDGTWNFSPYYNYTWIWSSLTFGVTVMLG 229

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
              G ++   K    R+ Q +++   LLI GL + ++  +P+ K+L+T S   ++ G   
Sbjct: 230 AFAGKIMKEGKADRKRVVQVLSVVGVLLI-GLAMLWSLQMPVIKRLWTGSMTLLSGGYCF 288

Query: 366 LVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           L+ +  Y  +D          L   GMN++  Y++ 
Sbjct: 289 LLMALFYYWIDYKGHSRGLNWLKIYGMNSITAYLLG 324


>gi|254784997|ref|YP_003072425.1| hypothetical protein TERTU_0813 [Teredinibacter turnerae T7901]
 gi|237684955|gb|ACR12219.1| putative membrane protein [Teredinibacter turnerae T7901]
          Length = 354

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 9/124 (7%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +R  +LD  RG+ +A+MILV+  G     +P + HA W+G    DFV PFFLFIVG A+
Sbjct: 2   NERSLALDALRGITLAMMILVNTPGSWSHVYPPLLHANWHGVTPTDFVFPFFLFIVGCAL 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +     +R +    +    LK +F   +L            Y       RL GVLQRI
Sbjct: 62  FFS-----NRKNHQLDIYTHALK-IFRRTVLLLLAGLGLHAYLYSGTFAEFRLPGVLQRI 115

Query: 149 ALSY 152
           AL+Y
Sbjct: 116 ALAY 119



 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 62/116 (53%), Gaps = 5/116 (4%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS++ +I++ + G     +I+         +Q V +  A L  G+ L     +P
Sbjct: 183 FDPEGLLSTLPAIVTVLSGYEATRIIVERTT-----QQKVLVIIAALAIGMALLLHPWVP 237

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           +NK L+T SYV +TSG A LV  A+  L     ++  +   A  G N +L+Y++A 
Sbjct: 238 INKYLWTSSYVLLTSGVAVLVLVALMQLESFRPVRPAYRAFAVYGENPLLIYILAG 293


>gi|380512476|ref|ZP_09855883.1| hypothetical protein XsacN4_14717 [Xanthomonas sacchari NCPPB 4393]
          Length = 384

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/126 (38%), Positives = 66/126 (52%), Gaps = 15/126 (11%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMILV+   AG D + ++ H PW G   AD V P FLF VG A++
Sbjct: 16  ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 75

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILL-------QGGFSHAPDELTYGVDVRMIRLC 142
            AL R       +++V  R+  +   G L+       QG   H    LT    +   R+ 
Sbjct: 76  FALDRGQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGHW--SLT---AIDQTRVP 130

Query: 143 GVLQRI 148
           GVLQRI
Sbjct: 131 GVLQRI 136



 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLL ++ + ++ I G   G + +   G  AR  +W+ +    L+  L L +    P
Sbjct: 207 FDPEGLLGTLPATVNVIAGYLTG-LYVRRVGKQARTVRWLWLAGGALVL-LALAWQPWFP 264

Query: 347 LNKQLYTLSYVCVT 360
           L K+L+T S+V +T
Sbjct: 265 LAKKLWTGSFVLLT 278


>gi|269978070|ref|ZP_06185020.1| putative membrane protein [Mobiluncus mulieris 28-1]
 gi|269933579|gb|EEZ90163.1| putative membrane protein [Mobiluncus mulieris 28-1]
          Length = 442

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 101/417 (24%), Positives = 163/417 (39%), Gaps = 111/417 (26%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 56  TQSEAATTRQ-----TEPNTGETNQTETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 108

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 109 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 161

Query: 115 WGILLQGGFSHAP-DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
              LL  G S    D  T+       R  G LQ  A+   +++   +F ++         
Sbjct: 162 VVGLLYNGVSSGQWDPATF-------RFTGPLQVYAVIVAIIATCHLFARN--------- 205

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                        MA   +   +A+L    +  W  T                 GV   L
Sbjct: 206 ------------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---L 236

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
           +P CN  G  DR +LG  HMY                + G L  D          PEGL+
Sbjct: 237 SPSCNPSGMWDRALLGA-HMY----------------YGGFLGHD----------PEGLV 269

Query: 294 SSVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQL 351
           +   ++L+   G   GH+ + ++  G      + + +  A+ +FGL L  T  +P  K+L
Sbjct: 270 AITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL--TIWVPAFKRL 327

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDI----------WNLKYPFLPLAWIGMNAMLVY 398
           +T S+  +      L+F+  +   D+            + +PF  L   G N++LVY
Sbjct: 328 WTPSFSLIAGAVGVLIFAVAFLCFDVPLRSGNSRIRERIAWPFTAL---GRNSLLVY 381


>gi|227875179|ref|ZP_03993321.1| possible heparan-alpha-glucosaminide N-acetyltransferase
           [Mobiluncus mulieris ATCC 35243]
 gi|227844084|gb|EEJ54251.1| possible heparan-alpha-glucosaminide N-acetyltransferase
           [Mobiluncus mulieris ATCC 35243]
          Length = 399

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 100/417 (23%), Positives = 163/417 (39%), Gaps = 111/417 (26%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 13  TQSEAATTRQ-----TEPNTGETNQAETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 65

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 66  PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 118

Query: 115 WGILLQGGFSHAP-DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
              LL  G S    D  T+       R  G LQ  A+   +++   +F ++         
Sbjct: 119 VVGLLYNGVSSGQWDPATF-------RFTGPLQVYAVIVTIIATCHLFARN--------- 162

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                        MA   +   +A+L    +  W  T                 GV   L
Sbjct: 163 ------------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---L 193

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
           +P CN  G  DR +LG  HMY                + G L  D          PEGL+
Sbjct: 194 SPSCNPSGMWDRALLGA-HMY----------------YGGFLGHD----------PEGLV 226

Query: 294 SSVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQL 351
           +   ++L+   G   GH+ + ++  G      + + +  A+ +FGL L+    +P  K+L
Sbjct: 227 AITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLILNIW--VPAFKRL 284

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDI----------WNLKYPFLPLAWIGMNAMLVY 398
           +T S+  +      L+F+  +   D+            + +PF  L   G N++LVY
Sbjct: 285 WTPSFSLIAGAVGVLIFAVAFLCFDVPLRSGNSRIRERIAWPFTAL---GRNSLLVY 338


>gi|383124758|ref|ZP_09945419.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
 gi|251841090|gb|EES69171.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
          Length = 376

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 79/396 (19%), Positives = 159/396 (40%), Gaps = 98/396 (24%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGCNLAD 74
           S   T RLASLDI RG  + L++             + P       +  H  W G    D
Sbjct: 5   SENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGFRFWD 64

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            VMP FLF+ G ++  +L +    + +     ++++ R   L  +G+++QG         
Sbjct: 65  LVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL------- 117

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
             G+D   I L    LQ IA+ Y + +++++                    + + W +  
Sbjct: 118 -LGLDSSHIYLYSNTLQSIAVGYFIAAVIQLH-------------------FSFRWQIGI 157

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            +L++++  +  T++ D                            P  N    +DR VLG
Sbjct: 158 TLLLLFIYWIPMTFLGD--------------------------FTPAGNFAEQVDRWVLG 191

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHF 308
                      R +        +G    +  +W  +P +    + SS++  ++ ++G   
Sbjct: 192 -----------RFR--------DGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVMLGAFA 232

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G ++   K +  ++ Q +++   +L+ GL + ++  +P+ K+L+T S   ++ G   L+ 
Sbjct: 233 GKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGGYCFLLM 291

Query: 369 SAIYALVDIWNLKYPFLPLAWI---GMNAMLVYVMA 401
           +  Y  +D    K     L W+   GMN++  Y++ 
Sbjct: 292 ALFYYWIDY---KGHSRGLNWLKVYGMNSITAYLLG 324


>gi|307700906|ref|ZP_07637931.1| putative membrane protein [Mobiluncus mulieris FB024-16]
 gi|307613901|gb|EFN93145.1| putative membrane protein [Mobiluncus mulieris FB024-16]
          Length = 442

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 101/417 (24%), Positives = 163/417 (39%), Gaps = 111/417 (26%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 56  TQSEAATTRQ-----TEPNTGETNQAETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 108

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 109 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 161

Query: 115 WGILLQGGFSHAP-DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
              LL  G S    D  T+       R  G LQ  A+   +++   +F ++         
Sbjct: 162 VVGLLYNGVSSGQWDPATF-------RFTGPLQVYAVIVAIIATCHLFARN--------- 205

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                        MA   +   +A+L    +  W  T                 GV   L
Sbjct: 206 ------------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---L 236

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
           +P CN  G  DR +LG  HMY                + G L  D          PEGL+
Sbjct: 237 SPSCNPSGMWDRALLGA-HMY----------------YGGFLGHD----------PEGLV 269

Query: 294 SSVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQL 351
           +   ++L+   G   GH+ + ++  G      + + +  A+ +FGL L  T  +P  K+L
Sbjct: 270 AITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL--TIWVPAFKRL 327

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDI----------WNLKYPFLPLAWIGMNAMLVY 398
           +T S+  +      L+F+  +   D+            + +PF  L   G N++LVY
Sbjct: 328 WTPSFSLIAGAVGVLIFAVAFLCFDVPLRSGNSRIREQIAWPFTAL---GRNSLLVY 381


>gi|298386962|ref|ZP_06996516.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298260112|gb|EFI02982.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 376

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/393 (19%), Positives = 156/393 (39%), Gaps = 92/393 (23%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGCNLAD 74
           S   T RLASLDI RG  + L++             + P       +  H  W G    D
Sbjct: 5   SENNTSRLASLDILRGFDLFLLVFFQPVFAALARQLNLPFLNDILYQFDHEVWEGFRFWD 64

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            VMP FLF+ G ++  +L +    + +     ++++ R   L  +G+++QG         
Sbjct: 65  LVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL------- 117

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
             G+D   I L    LQ IA+ Y + +++++                    + + W +  
Sbjct: 118 -LGLDSSHIYLYSNTLQSIAVGYFIAAVIQLH-------------------FSFRWQIGI 157

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            +L++++  +  T++ D                            P  N    +DR VLG
Sbjct: 158 TLLLLFIYWIPMTFLGD--------------------------FTPAGNFAEQVDRCVLG 191

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHF 308
                      R +        +G    +  +W  +P +    + SS++  ++ ++G   
Sbjct: 192 -----------RFR--------DGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVMLGAFA 232

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G ++   K +  ++ Q +++   +L+ GL + ++  +P+ K+L+T S   ++ G   L+ 
Sbjct: 233 GKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGGYCFLLM 291

Query: 369 SAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           +  Y  +D          L   GMN++  Y++ 
Sbjct: 292 ALFYYWIDYKGHSRGLNWLKVYGMNSITAYLLG 324


>gi|283781521|ref|YP_003372276.1| hypothetical protein Psta_3761 [Pirellula staleyi DSM 6068]
 gi|283439974|gb|ADB18416.1| conserved hypothetical protein [Pirellula staleyi DSM 6068]
          Length = 417

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 92/413 (22%), Positives = 158/413 (38%), Gaps = 96/413 (23%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS------------- 63
           + P ++     + L   RL SLD +RG    +M+ +   G   P+++             
Sbjct: 4   AAPSLAASTPAATLPA-RLLSLDAYRGF---VMLAMASRGFGIPKVAALPQFASHPTWQF 59

Query: 64  ------HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLL 113
                 H  W G    D + P F+F+VGVA+A +      + D   K+    IFR + L+
Sbjct: 60  LAGQLDHVAWVGSCFWDLIQPSFMFMVGVAMAYSCAARVSKGDPYWKMLLHAIFRAMVLI 119

Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
             G+ L+   S++ D+  +           V  +I L YL + L+      VQ       
Sbjct: 120 ALGVFLR---SNSSDQTNF-------TFMDVTSQIGLGYLPLFLLWGRKFWVQATA---- 165

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                         A  +LV Y AL          F +    +ADY              
Sbjct: 166 --------------AIVILVGYFAL----------FALYPLPAADYNYA----------- 190

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP----FEP 289
                AVG +D   +       H  W ++     D      +     +W   P    F+ 
Sbjct: 191 -----AVG-VDENWVHFQGFQRH--WEKNANVAHD------VDVKVLNWFPRPEPFMFDR 236

Query: 290 EG--LLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
            G   L+ + SI + I G+  G  +      L +    V  G A+L+ G  L+     P+
Sbjct: 237 GGYNTLNFIPSIATMIFGLIAGEWLRGGASGLKKFGILVGAGLAMLLIGWGLNELGVCPV 296

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
            K+++T S+   ++G   L+ +A Y ++++   ++   PL  IGMN++ +Y M
Sbjct: 297 VKRIWTPSWAIYSTGWTCLLLAAFYGVIELIQFRFWAFPLVVIGMNSIAIYCM 349


>gi|390958852|ref|YP_006422609.1| hypothetical protein Terro_3042 [Terriglobus roseus DSM 18391]
 gi|390413770|gb|AFL89274.1| Protein of unknown function (DUF1624) [Terriglobus roseus DSM
           18391]
          Length = 406

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 59/112 (52%), Gaps = 17/112 (15%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW---------NGCNLAD 74
           +  ++ +K QR+ SLDIFRGL +ALMI V+    +  EI   PW         N     D
Sbjct: 3   ETTRATVKPQRIQSLDIFRGLNIALMIFVN----ELHEIKGLPWWTYHAPGAANVMTYVD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGG 122
            V P FL IVG+++ LAL+    R D   +    V+ R++ L+  G++LQ  
Sbjct: 59  MVFPAFLVIVGMSLPLALQARIRRGDETPQLIWYVVLRSVALIVLGLILQNA 110


>gi|433678126|ref|ZP_20510025.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
           translucens pv. translucens DSM 18974]
 gi|430816762|emb|CCP40477.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
           translucens pv. translucens DSM 18974]
          Length = 384

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 5/121 (4%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMIL +   AG D + ++ HAPW G   AD   P FLF+VG A++
Sbjct: 16  ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGVLQR 147
            AL R       +++V  R+  +   G L+    F H   + ++  + +   R+ GVLQR
Sbjct: 76  FALDRSQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGSWSFIAIDQTRVPGVLQR 135

Query: 148 I 148
           I
Sbjct: 136 I 136


>gi|410657728|ref|YP_006910099.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
           DCA]
 gi|409020083|gb|AFV02114.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
           DCA]
          Length = 370

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 90/380 (23%), Positives = 156/380 (41%), Gaps = 96/380 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           T R+ +LD  R L+V L+ L     G      I+HAPW G    DF  P F+ + G ++A
Sbjct: 5   TNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMA 64

Query: 90  LALKR-IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +A ++ +P       ++I R   L+  G++           +++   +  +R  GVLQ +
Sbjct: 65  IAYRKHVP-----WVRLIRRFFVLIIIGLIFN-------SLVSWEFHLSQLRFTGVLQVL 112

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A + ++ +L+      V  K              W W   A +L++              
Sbjct: 113 AFTGIMTTLIT----RVSGK--------------WFWPFTAGLLIL-------------- 140

Query: 209 FTIINKDSADYGKVFNVTCGVRAKL-NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                  +A  G +   +      L +P  N  G ID  +   +H+Y H           
Sbjct: 141 -------AAYLGILLYTSQSFPGSLPSPDHNLSGMIDPFIFTKSHLYVH----------- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                            A ++PEG+ +  S+I ST+ G   G  +     ++ R      
Sbjct: 183 ---------------GDAGYDPEGICTLFSAIASTLFGYTAG--LFLNNKNIGR-NFLKI 224

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL-- 385
           +  A ++  LT   +N IP+ K+L+T S+V ++SGA  LV +  +    IW+ + P +  
Sbjct: 225 LALAAVLLLLTPLLSNFIPIGKRLWTPSFVTLSSGATILVLAFAHL---IWDPQIPVIRK 281

Query: 386 ---PLAW----IGMNAMLVY 398
              P+ W    IG NA+L+Y
Sbjct: 282 LLAPVYWLFEAIGRNAILLY 301


>gi|306818439|ref|ZP_07452162.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
 gi|304648612|gb|EFM45914.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
          Length = 399

 Score = 61.6 bits (148), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 101/417 (24%), Positives = 165/417 (39%), Gaps = 111/417 (26%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 13  TQSEAATTRQ-----TEPNTGETNQTETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 65

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 66  PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 118

Query: 115 WGILLQGGFSHAP-DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
              LL  G S    D  T+       R  G LQ  A+   +++   +F ++         
Sbjct: 119 VVGLLYNGVSSGQWDPATF-------RFTGPLQVYAVIVAIIATCHLFARN--------- 162

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                      W++ A +    +A+L    +  W  T                 GV   L
Sbjct: 163 -----------WMVWAGI-TAGVAVLQTGLLTWWAGT--------------CPSGV---L 193

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
           +P CN  G  DR +LG  HMY                + G L  D          PEGL+
Sbjct: 194 SPSCNPSGMWDRALLG-AHMY----------------YGGFLGHD----------PEGLV 226

Query: 294 SSVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQL 351
           +   ++L+   G   GH+ + ++  G      + + +  A+ +FGL L  T  +P  K+L
Sbjct: 227 AITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL--TIWVPAFKRL 284

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDI----------WNLKYPFLPLAWIGMNAMLVY 398
           +T S+  +      L+F+  +   D+            + +PF  L   G N++LVY
Sbjct: 285 WTPSFSLIAGAVGVLIFAVAFLCFDVPLRSGNSRIRERIAWPFTAL---GRNSLLVY 338


>gi|413922900|gb|AFW62832.1| hypothetical protein ZEAMMB73_935848 [Zea mays]
          Length = 1241

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 36/60 (60%)

Query: 222  VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
            V  V CGVR   +  CNAVG IDRK+LGI H+Y  P + RSK   +++      R+ AP+
Sbjct: 966  VLQVKCGVRGDTSSGCNAVGMIDRKILGIQHLYGRPVYARSKNYRKNTLAASSSRRKAPA 1025


>gi|423280893|ref|ZP_17259805.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
           610]
 gi|404583534|gb|EKA88212.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
           610]
          Length = 375

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/387 (21%), Positives = 153/387 (39%), Gaps = 93/387 (24%)

Query: 34  RLASLDIFRGLAVALMI--------LVDHAGGDW-----PEISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++        L       W      +  H  W G    D VMP F
Sbjct: 11  RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70

Query: 81  LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           LF+ G ++  +  +    PD+    +K+I R + L  +G+++QG           G++ +
Sbjct: 71  LFMTGASMPFSFSKFKDDPDKGTIYRKIIRRFILLFIFGMIVQGNL--------LGLNPK 122

Query: 138 MIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
            + L    LQ IA  YL+ +++ +                 FR      ++ A +L++Y 
Sbjct: 123 YLYLYSNTLQAIATGYLIAAIILLHCS--------------FRR---QLIVTALLLLIYW 165

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           A +  T++ D                            P  N    ID+ VLG       
Sbjct: 166 APM--TFLGD--------------------------FTPEGNFAEKIDKLVLG------- 190

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
               R +        +G    +  +W  +P +    + SS++   + ++G   G ++   
Sbjct: 191 ----RFR--------DGVYWNEDGTWNFSPHYNYTWIWSSLTFGATVMLGTFAGKIMKAG 238

Query: 316 KGHLAR-LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           K +  + ++  + +G AL+ F L   ++  +P+ K+L+T S    + G   L+  A Y  
Sbjct: 239 KDNRRKVVRTLLVIGIALVAFSLV--WSLQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYW 296

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           +D          L   GMN++  Y++ 
Sbjct: 297 IDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|313149262|ref|ZP_07811455.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|313138029|gb|EFR55389.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
          Length = 375

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/387 (21%), Positives = 151/387 (39%), Gaps = 93/387 (24%)

Query: 34  RLASLDIFRGLAVALMI--------LVDHAGGDW-----PEISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++        L       W      +  H  W G    D VMP F
Sbjct: 11  RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70

Query: 81  LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           LF+ G ++  +  +    PD+    +K+I R + L  +G+++QG           G++ +
Sbjct: 71  LFMTGASMPFSFSKFKDDPDKGPIYRKIIRRFILLFIFGMIVQGNL--------LGLNPK 122

Query: 138 MIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
            + L    LQ IA  YL+ +++ +                 FR      ++ A +L++Y 
Sbjct: 123 YLYLYSNTLQAIATGYLIAAIILLHCS--------------FRR---QLIVTALLLLIYW 165

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           A +  T++ D                            P  N    ID+ VLG  H    
Sbjct: 166 APM--TFLGD--------------------------FTPEGNFAEKIDKLVLG--HFRDG 195

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
             W                  +  +W  +P +    + SS++   + ++G   G ++   
Sbjct: 196 VYW-----------------NEDGTWNFSPHYNYTWIWSSLTFGATVMLGTFAGKIMKAG 238

Query: 316 KGHLAR-LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           K +  + ++  + +G AL+ F L   ++  +P+ K+L+T S    + G   L+  A Y  
Sbjct: 239 KDNRRKVVRTLLVIGIALVAFSLV--WSLQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYW 296

Query: 375 VDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           +D          L   GMN++  Y++ 
Sbjct: 297 IDYKGHSRGLNWLKIYGMNSITAYILG 323


>gi|255037019|ref|YP_003087640.1| hypothetical protein Dfer_3263 [Dyadobacter fermentans DSM 18053]
 gi|254949775|gb|ACT94475.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 380

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 93/404 (23%), Positives = 159/404 (39%), Gaps = 99/404 (24%)

Query: 16  ISEPDVSDQQE-KSHLKTQRLASLDIFRGLAVALMI-----LVDHAGGDW---------P 60
           +  P V  ++E +      RLAS+D  RG  + LMI      +   GG            
Sbjct: 1   MEAPSVVVKKEVRPSSSPGRLASIDALRGFDM-LMIAGGGQFIATLGGKTGISFIDAVAA 59

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFW 115
           +  H  WNG    DF+ P FLF+ G ++A ++     K IP       KV  R L L+  
Sbjct: 60  QFEHPAWNGFTFYDFIFPLFLFLAGTSLAFSVTGGLAKGIPPSV-IRNKVFKRMLILIAL 118

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           GIL +    +AP ++    D   IR   VL RI L+  + +++ +     Q         
Sbjct: 119 GILDK----NAPMDI---FDPAHIRYGSVLGRIGLATFISAILYMKFGTNQ--------- 162

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
              RLY     +   +LV Y   L    VP +        S D              L  
Sbjct: 163 ---RLY-----IGVGILVAYYLTLILVPVPGF-------GSGD--------------LTF 193

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
             N VG+IDR  +                        G L++         ++   LL+ 
Sbjct: 194 EGNLVGWIDRNFM-----------------------PGILKQ-------GTYDELALLTQ 223

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLS 355
             ++  T+ G   G V++      +++ + +      +  G+  +F  A P+NK L++ S
Sbjct: 224 FPALCLTLFGTVAGDVLLRENRGNSKIGKLLLFASTGISIGILWNF--AFPINKHLWSSS 281

Query: 356 YVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           ++ +TSG A  + +  Y ++D+   +        IG+N++++Y+
Sbjct: 282 FIMLTSGMAFAMLALFYWIIDVKGFQKWAFFFRVIGLNSLVIYL 325


>gi|375255119|ref|YP_005014286.1| hypothetical protein BFO_1396 [Tannerella forsythia ATCC 43037]
 gi|363406141|gb|AEW19827.1| putative membrane protein [Tannerella forsythia ATCC 43037]
          Length = 375

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 87/387 (22%), Positives = 147/387 (37%), Gaps = 91/387 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI-------------SHAPWNGCNLADFVMPF 79
           +RL SLD+ RG  +  ++++      W EI             +H  W G    D +MP 
Sbjct: 8   KRLVSLDLLRGFDLFCLLMLQPILMTWLEIADNPAWAPLARQFTHVEWRGVAFWDLIMPL 67

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGG-FSHAPDELTYGV 134
           F+F+ G+ +  AL +    A        + LK    L F G ++QG   +  P+      
Sbjct: 68  FMFMSGITVPFALSKYKRGAKPGHSFYLKLLKRFVILFFLGWIVQGNLLALDPN------ 121

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
             R       LQ IA+ Y++ +   +             RFS FR+      + A VL  
Sbjct: 122 --RFHIFANTLQAIAVGYVVTAFCYV-------------RFS-FRVQ-----LGATVLFF 160

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
              LL                      VF    G+     P  N    IDR VLG     
Sbjct: 161 IAYLL----------------------VFATVGGM--NWEPGTNIAEEIDRCVLG----- 191

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFGHVII 313
                 R +        +G + +   SW   P +    +LSS++ +++ + G   GH++ 
Sbjct: 192 ------RFR--------DGIITEADGSWKFDPAYHYTWILSSLNFVVTVMTGSFAGHILR 237

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             K    RL + +  G +L++  L +      PL K++++ S      G   L+    Y 
Sbjct: 238 LRKTARQRLMRLLITGVSLVVAALLMD--PVFPLIKRIWSSSMTLFYGGVCFLLMGLFYY 295

Query: 374 LVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           +VD+   ++      + GMN++  Y +
Sbjct: 296 VVDMKGYRFGTDWFKYYGMNSIAAYCL 322


>gi|393787642|ref|ZP_10375774.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
           CL02T12C05]
 gi|392658877|gb|EIY52507.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
           CL02T12C05]
          Length = 373

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 90/394 (22%), Positives = 149/394 (37%), Gaps = 103/394 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI-------------SHAPWNGCNLADFVMP 78
            +RLASLD+ RG  +  ++++      W EI             +H  W G    D +MP
Sbjct: 5   NKRLASLDLLRGFDLFCLLMLQPILMTWLEIENNPSLDPITNQFTHVEWQGVAFWDLIMP 64

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
            F+F+ G+ I  A+ +        +   FR  K    L F G ++QG            +
Sbjct: 65  LFMFMSGITIPFAMSKYKQGEKIDRHFYFRLFKRFFVLFFLGWVVQGNL--------LAL 116

Query: 135 DVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           D+R   +    LQ IA+ Y++ +L                      LY W      C   
Sbjct: 117 DIRQFHIFANTLQAIAVGYVVAAL----------------------LYVW------C--- 145

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                   ++     FT++   +  Y  VF  T G      P  N    IDR VLG    
Sbjct: 146 --------SFRTQISFTVLCFIT--YLLVF-ATIG-DMNYEPGTNIAEEIDRCVLG---- 189

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEG------LLSSVSSILSTIIGVH 307
                                LR D  +W +  +  +       +LSS++ I++ ++G  
Sbjct: 190 --------------------SLR-DGVTWTNGTWSFDSSYHYTWILSSLNFIVTVMLGSF 228

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            GH++   K  + RLK  +  G  L+   L +      P+ K++++ S      G   L+
Sbjct: 229 AGHILRLRKEPVQRLKVLLLTGGFLVAVALLMD--PLFPIIKRIWSSSMTLFYGGVCFLL 286

Query: 368 FSAIYALVDIWNLKYPFLP-LAWIGMNAMLVYVM 400
               Y L+DI   K   +  L + GMN+++ Y +
Sbjct: 287 MGIFYYLIDIKGWKSGAVNWLNYYGMNSIVAYCL 320


>gi|195167204|ref|XP_002024424.1| GL15027 [Drosophila persimilis]
 gi|194107797|gb|EDW29840.1| GL15027 [Drosophila persimilis]
          Length = 493

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 34/50 (68%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V PF 
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPFL 230



 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 81/175 (46%), Gaps = 22/175 (12%)

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           G  +   P   A GY+D +V G  H+Y  PA   S+ C         +R   PS     F
Sbjct: 287 GAHSNCTP--GAAGYVDMQVEGNAHIY--PA-SHSQLC---------VRLPVPSIRRESF 332

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA--- 344
               LLS V  +L    G+    +++HT       +  +      ++ G    F+     
Sbjct: 333 G--CLLSIVQVLLGAFAGLT---LLVHTTWQTRLRRWLLLSLLLGVVGGALCGFSREGGV 387

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           IP+NK L++LS+V VT   A L+ + +Y ++D+ +  +   P +  GMNA+++YV
Sbjct: 388 IPVNKNLWSLSFVFVTVSVALLLLALLYYIIDVRDGWWSGWPFSECGMNAIIMYV 442


>gi|409198223|ref|ZP_11226886.1| hypothetical protein MsalJ2_14356 [Marinilabilia salmonicolor JCM
           21150]
          Length = 394

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 64/120 (53%), Gaps = 11/120 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           T R+ S+DI R + VALM+ V+   G      W   + A  +G  LAD V P FLF VG+
Sbjct: 7   THRIKSIDILRAITVALMVFVNDLPGIRDIPQWLGHASAGHDGMFLADIVFPLFLFWVGM 66

Query: 87  AIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +I LA+     + D+    V+ ++ RT  L+F G+L+    S    E+T G+D  +  LC
Sbjct: 67  SIPLAVDGRQKKGDSDLTIVRHILKRTFSLVFIGVLMVNT-SRMSVEVT-GIDRNLWALC 124


>gi|440731410|ref|ZP_20911431.1| membrane protein [Xanthomonas translucens DAR61454]
 gi|440373102|gb|ELQ09871.1| membrane protein [Xanthomonas translucens DAR61454]
          Length = 384

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 5/121 (4%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMIL +   AG D + ++ HAPW G   AD   P FLF+VG A++
Sbjct: 16  ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGVLQR 147
            AL R       + +V  R+  +   G L+    F H   + ++  + +   R+ GVLQR
Sbjct: 76  FALDRSQPLGAFLCRVGKRSALIFLLGFLMYWFPFVHQGADGSWSFIAIDQTRVPGVLQR 135

Query: 148 I 148
           I
Sbjct: 136 I 136


>gi|399069322|ref|ZP_10749357.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
           AP07]
 gi|398045229|gb|EJL37978.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
           AP07]
          Length = 233

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 79/303 (26%), Positives = 122/303 (40%), Gaps = 92/303 (30%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAG--------GDWPEISHAPWNGCNLADFVMPF 79
           S  K  RLASLD+ RGL +  MI+V+ A           +  + HA W G   AD V P 
Sbjct: 3   SRPKAARLASLDVLRGLTIVGMIVVNTASYLHYVSGYAVFAGLEHAEWRGFTAADAVFPA 62

Query: 80  FLFIVGVAIALALKRIP------DR------ADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
           F+F+ GV+I LAL  +       +R        A+++++ R+ +L   G++L   +  A 
Sbjct: 63  FVFMTGVSIPLALGPLALGDGPIERGMAGLDGAALRRLLVRSGRLFLLGLILSNLYWMAT 122

Query: 128 DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
            E        + R  GVLQR+AL+        +  K +  + + +              +
Sbjct: 123 PESV------LFRPMGVLQRLALA---FLAAAVLYKTLGPRARMI--------------L 159

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           A  +L +Y           W  T++               G    L P  N VG+ DR V
Sbjct: 160 AVAILALY-----------WPLTLLPFPD-----------GTTDLLRPGANFVGWFDRAV 197

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           LG  H Y H                GPL           ++PEGLLS++ ++   ++GV 
Sbjct: 198 LG-AHTYVH----------------GPLG----------YDPEGLLSTLPAVAQALLGVA 230

Query: 308 FGH 310
            G 
Sbjct: 231 AGQ 233


>gi|392551353|ref|ZP_10298490.1| hypothetical protein PspoU_08780 [Pseudoalteromonas spongiae
           UST010723-006]
          Length = 379

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 74/160 (46%), Gaps = 34/160 (21%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS--HAPW 67
           +SD  +K+    +RLASLD  RG  +           AL +L   AG    E    H+ W
Sbjct: 1   MSDTNKKTK---KRLASLDALRGFDMFWILGGEKIFAALFVLTGWAGWKVAEAQTLHSQW 57

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFRTLKLLFWGILLQGG 122
           +G    D + P F+F+ GVA+ L+ KRI      DR     K   R L L F+G+L   G
Sbjct: 58  HGFTFYDLIFPLFIFLSGVAMGLSPKRIDHLPFVDRKPIYIKAFKRLLLLCFFGVLYNHG 117

Query: 123 FSHA----PDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
           +       P+E         +R   VL RIA+++ + +++
Sbjct: 118 WGTGVPLNPEE---------VRYASVLGRIAVAWFVAAML 148



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 69/139 (49%), Gaps = 16/139 (11%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF----ALLIFGLTL 339
           + P +PEG+LSS+ +I + + GV  G  I   +       +W T G      L+   +  
Sbjct: 209 NKPMDPEGILSSIPAIANALFGVIAGRYIKQAQ----ERGEWKTAGILFAAGLVALAVGW 264

Query: 340 HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            +    P+NK L+T S+V VT G + ++ +  YA+VD+ N +        IG N++++Y+
Sbjct: 265 LWNMVFPVNKDLWTSSFVMVTVGWSFILLAVFYAVVDLLNQQRAAYLFVIIGANSIVIYL 324

Query: 400 MA--------AEGIFAGFI 410
            +        +  +F GFI
Sbjct: 325 ASSLVNWEYISRSVFGGFI 343


>gi|326663866|ref|XP_696425.5| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Danio rerio]
          Length = 170

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 42/121 (34%), Positives = 65/121 (53%), Gaps = 10/121 (8%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG-HLARLKQWVTMGFALLIFGLTLHFTNA 344
           PF+PEG+L S++SIL   +G+  G +++H +  H   + +++  G  L I    L   + 
Sbjct: 2   PFDPEGVLGSINSILMAFLGLQAGKILLHYRDQHRQIITRFLMWGLILGIISAVLTKCSR 61

Query: 345 ----IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVY 398
               IP+NK L++LSYV   S  A +     Y  VD+  W    PF    + GMN++LVY
Sbjct: 62  NDGFIPVNKNLWSLSYVTTLSCFAFVALVFFYYTVDVKKWWSGAPFF---YPGMNSILVY 118

Query: 399 V 399
           V
Sbjct: 119 V 119


>gi|260911058|ref|ZP_05917694.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260634862|gb|EEX52916.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 409

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 81/366 (22%), Positives = 150/366 (40%), Gaps = 86/366 (23%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF--RTLK--LLFW- 115
           +I+H PW G    D +MP F+F+ G+ I  ++ +   R ++   V F  R LK  ++ W 
Sbjct: 83  QITHVPWQGFCFWDIIMPLFMFMSGITIPFSMAKY-QRGESKAGVGFLLRLLKRFVVLWV 141

Query: 116 -GILLQGGFSHAPDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
            G+++QG            +D R + L    LQ IA+ Y++V+L+ ++T           
Sbjct: 142 LGMVVQGNL--------LALDARQLHLYSNTLQSIAVGYVVVALLFVYTS---------- 183

Query: 174 RFSIFRLYCWHWLMA--ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRA 231
                    W   +A  A  +V Y+A+                  A +G++         
Sbjct: 184 ---------WRTQLAVVASCMVAYVAVF-----------------AIWGQM--------- 208

Query: 232 KLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL-RKDAPSWCHAPFEPE 290
                 N    IDR+VLG         WR           +G +   D   W  A +   
Sbjct: 209 DFTIDANICEEIDRQVLG--------RWR-----------DGVIWNGDQWQW-DATYHYT 248

Query: 291 GLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQ 350
            +LSS++ + +  +G   G V+  ++   ++L+  +  G  L++    L  +  +P+ K 
Sbjct: 249 WILSSLNFVGTVYLGYLAGVVLRTSQSGTSKLRTLLLSGVGLIVLAFAL--SPVVPIIKH 306

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFI 410
           +++ S      G   L+ +A Y  VD+       + L + G N+++ YV+     F+   
Sbjct: 307 IWSTSMTFFAGGICFLLMAAAYYWVDLKGHTRGLMWLRFYGTNSLVAYVLGEYVNFSSLT 366

Query: 411 NGWYYG 416
           +  +YG
Sbjct: 367 DSLFYG 372


>gi|260684359|ref|YP_003215644.1| hypothetical protein CD196_2626 [Clostridium difficile CD196]
 gi|260688018|ref|YP_003219152.1| hypothetical protein CDR20291_2673 [Clostridium difficile R20291]
 gi|260210522|emb|CBA65033.1| putative membrane protein [Clostridium difficile CD196]
 gi|260214035|emb|CBE06182.1| putative membrane protein [Clostridium difficile R20291]
          Length = 352

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 63/125 (50%), Gaps = 16/125 (12%)

Query: 49  MILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADA 101
           MI+ ++ G     +P++ HA W+G  LADF  PFF+  +GV I ++    LK        
Sbjct: 1   MIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTIPISINSKLKNNKSTLSI 60

Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
           +  +  R++ L+ +G  L   +   P       D+  +R+ GVLQR+ L Y + SLV + 
Sbjct: 61  ILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRILGVLQRMGLVYFVTSLVYLL 111

Query: 162 TKDVQ 166
            K + 
Sbjct: 112 LKKLN 116



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 59/114 (51%), Gaps = 3/114 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 170 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 227

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           P NK+L++ S+V + +G+  ++ S  Y + DI N    F P+  +G + +  Y+
Sbjct: 228 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTPIIALGSSPIFTYM 281


>gi|343086706|ref|YP_004776001.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342355240|gb|AEL27770.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 368

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 15/96 (15%)

Query: 31  KTQRLASLDIFRGLAVALMI-----LVDHAGGDWPEIS----------HAPWNGCNLADF 75
             +RL SLD +RG+ + L++     L     G +PE+S          H PWNG    D 
Sbjct: 8   SNKRLVSLDAYRGITMFLLVAESARLYGAFEGLFPEVSGWQMFFTQFTHHPWNGLRFWDL 67

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           + PFF+FIVGVA+  +L +  ++    +KV    LK
Sbjct: 68  IQPFFMFIVGVAMPFSLNKRLEKQGDRRKVTLHILK 103



 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 56/102 (54%), Gaps = 4/102 (3%)

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           T+ G   G +++    +  ++K  V  G   L+ G  L +    P+ K++ T S+V  + 
Sbjct: 211 TLWGAMCGKILLSKSAYSLKIKYLVIAGLIGLVLGYGLDWLGITPIIKRICTSSFVLASG 270

Query: 362 GAAALVFSAIYALVDI--WNLKYPFLPLAWIGMNAMLVYVMA 401
           G + LV +  Y L+D+  +N ++PF+ +  +GMN++ +Y+ A
Sbjct: 271 GWSILVLAFFYWLIDVKRYN-EWPFMFIV-VGMNSIFIYLFA 310


>gi|320105641|ref|YP_004181231.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
 gi|319924162|gb|ADV81237.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
          Length = 406

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 55/111 (49%), Gaps = 17/111 (15%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW---------NGCNLADFVMPFF 80
           L  QR+ SLDIFRGL +ALMI V+       EI   PW         +     D V P F
Sbjct: 10  LAPQRILSLDIFRGLNIALMIFVNELA----EIKGLPWWTYHAPGKVDVMTYVDMVFPGF 65

Query: 81  LFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAP 127
           LFI+G+AI LAL     + D+    +  +  R+  LL  GI+L+ G    P
Sbjct: 66  LFILGMAIPLALNARIRKGDSPATLLGYIALRSAALLVLGIILENGGRGDP 116


>gi|410660784|ref|YP_006913155.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
 gi|409023140|gb|AFV05170.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
          Length = 368

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 88/380 (23%), Positives = 153/380 (40%), Gaps = 96/380 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           T R+ +LD  R L+V L+ L     G      I+HAPW G    DF  P F+ + G ++A
Sbjct: 5   TNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMA 64

Query: 90  LALKR-IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +  ++ +P       ++I R   LL  G++           +++   +  +R  GVLQ +
Sbjct: 65  IVYRKHVP-----WVRLIRRFFVLLIIGLIFN-------SLVSWEFQLSELRFTGVLQVL 112

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A + ++ +L+      V  K              W W   A +L++              
Sbjct: 113 AFTGIMTALIT----RVSGK--------------WFWPFTAGLLIL-------------- 140

Query: 209 FTIINKDSADYGKVFNVTCGVRAKL-NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                  +A  G +   +      L +P  N  G  D  +   +H+Y H           
Sbjct: 141 -------AAYLGILLYTSQSFPGSLPSPDHNLSGMTDPFIFTKSHLYVH----------- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                            A ++PEG+ +  S+I ST+ G   G  +     ++ R      
Sbjct: 183 ---------------GDAGYDPEGICTLFSAIASTLFGYTAG--LFLNNKNIGR-DFLKI 224

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL-- 385
           +  A ++  LT   +N IP+ K+L+T S+V ++SGA  L+ +  +    IW+ + P +  
Sbjct: 225 LALAAVLLLLTPLLSNFIPIGKRLWTPSFVTLSSGATILILAFAHL---IWDPQIPVIRK 281

Query: 386 ---PLAW----IGMNAMLVY 398
              P  W    IG NA+L+Y
Sbjct: 282 LMAPFYWLFEAIGRNAILLY 301


>gi|315126665|ref|YP_004068668.1| hypothetical protein PSM_A1588 [Pseudoalteromonas sp. SM9913]
 gi|315015179|gb|ADT68517.1| conserved hypothetical protein [Pseudoalteromonas sp. SM9913]
          Length = 307

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 71/324 (21%), Positives = 132/324 (40%), Gaps = 80/324 (24%)

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           +FI+G A+  + K+    A+A +      L+L+  G ++      A +   +   +  +R
Sbjct: 1   MFIIGSAMFFSFKKTNSTANASQ-----VLRLIKRGAIIFA-IGLALNMYPFTTSIENLR 54

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           + GVLQRI ++Y+L S+  +           + R  +  L       +  +L+ Y  LL 
Sbjct: 55  ILGVLQRIGIAYILASICVLL----------LNRRGVISL-------SVVILIAYWLLLL 97

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
               P+  +T+ N                        N V  +D  VLG +H++      
Sbjct: 98  SVG-PNNAYTLEN------------------------NLVRTVDIAVLGESHLWQGKG-- 130

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                                     F+PEGLLS++ S++S + G     ++  +     
Sbjct: 131 ------------------------LAFDPEGLLSTLPSVVSVLFGFEVTRLLTSSSSQWT 166

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAI--PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
            +K+ + +G    + G+ L    ++  P+NK L+T ++V  TSG A +V +    L DI 
Sbjct: 167 SIKRLLVIG----VVGIALGQLGSLIMPINKSLWTSTFVIYTSGIACIVLAFFVWLCDIV 222

Query: 379 NLKYPFLPLAWIGMNAMLVYVMAA 402
             +    PL   G N + +YV++ 
Sbjct: 223 KPERLVNPLIVYGSNPLFIYVLSG 246


>gi|456890764|gb|EMG01555.1| putative membrane protein [Leptospira borgpetersenii str.
           200701203]
          Length = 270

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/187 (31%), Positives = 86/187 (45%), Gaps = 52/187 (27%)

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           +IDR++ G  H+     W+ SK                       ++PEG LS ++SI +
Sbjct: 77  WIDRRIFGEKHL-----WKFSKT----------------------WDPEGFLSGIASIAT 109

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT-------NAIPLNKQLYTL 354
           ++ GV  G ++   +G   R K  V     L IFGL   FT        ++P+NK L+T 
Sbjct: 110 SLFGVICGFILFRREG---RGKNRV-----LSIFGLGFLFTFVGLLWDRSLPMNKSLWTG 161

Query: 355 SYVCVTSGAAAL---VFSAIYALVDI--WN---LKYPFLPLAWIGMNAMLVYVMAAEGIF 406
           SY   T+G + L    F  + +L+ +  WN   LK  F P    G NA+LV+V    GI 
Sbjct: 162 SYAVYTTGLSFLCIGFFEYLDSLILLKKWNGLDLKIFFQPFFVFGKNAILVFV--GSGIL 219

Query: 407 AGFINGW 413
           A  +N W
Sbjct: 220 ARTLNFW 226


>gi|149437198|ref|XP_001516670.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Ornithorhynchus anatinus]
          Length = 176

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 66/121 (54%), Gaps = 12/121 (9%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHF 341
           ++PEG+L +++SI+   +GV  G +++  K      + R   W V MG   LI G+   F
Sbjct: 9   YDPEGILGTINSIVMAFLGVQAGKILLFYKEQHRQIMLRFLTWSVVMG---LISGVLTKF 65

Query: 342 TNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVY 398
           +     +P+NK L+++SYV   S  A +    IY  VD+  L +   P  + GMN++LVY
Sbjct: 66  SQNEGFVPINKNLWSISYVTTLSCFAFVALLLIYYFVDVKRL-WSGAPFFYPGMNSILVY 124

Query: 399 V 399
           V
Sbjct: 125 V 125


>gi|308050627|ref|YP_003914193.1| hypothetical protein Fbal_2917 [Ferrimonas balearica DSM 9799]
 gi|307632817|gb|ADN77119.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799]
          Length = 366

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 101/404 (25%), Positives = 154/404 (38%), Gaps = 129/404 (31%)

Query: 33  QRLASLDIFRG-----------LAVALMILVDHAGGDWP-------EISHAPWNGCNLAD 74
           QRL +LD  RG           L  AL++L       WP       ++ H+PW+G    D
Sbjct: 6   QRLQALDALRGFDMFWIIGGEKLFAALLLLTG-----WPLWQVAADQMLHSPWHGFTFYD 60

Query: 75  FVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFS----H 125
            + P F+F+ GV I L  + +      DR    +K + R L L   G+L   G+      
Sbjct: 61  LIFPLFIFLSGVTIGLQRQSLIGIAWSDRQPHYRKALKRLLLLALLGVLYNHGWGTGMPM 120

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH- 184
           A DE         IR   VL RI +++ L +++                        WH 
Sbjct: 121 ALDE---------IRYASVLGRIGMAWFLAAMI-----------------------AWHL 148

Query: 185 -W---LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
            W   LMA  VL +   L  G +            SA Y    ++   V     P     
Sbjct: 149 PWRSQLMAGLVLALGYGLAQGLW-----------GSAPYSSELSLNTWVDQHWLP----- 192

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
                   GI +              Q++P                 +PEGL+S +S+ L
Sbjct: 193 --------GITY--------------QNAP----------------LDPEGLMSHLSAAL 214

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           + ++GV  G+ +   +    R    V    AL    L       +P+NK L+TLS+V VT
Sbjct: 215 NALVGVWAGYWLRQPRSVWQRAG--VLALVALASLLLGWLLHPLVPVNKTLWTLSFVLVT 272

Query: 361 SGAAALVFSAIYALVDIWNLKYPFLP--LAWIGMNAMLVYVMAA 402
            G + L  +   A+VD   L +  L    A IG+N++ +Y+ +A
Sbjct: 273 VGWSGLFLAGFLAVVD--GLGWHRLGKLFALIGVNSIFIYLASA 314


>gi|291295418|ref|YP_003506816.1| hypothetical protein [Meiothermus ruber DSM 1279]
 gi|290470377|gb|ADD27796.1| conserved hypothetical protein [Meiothermus ruber DSM 1279]
          Length = 399

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 73/152 (48%), Gaps = 17/152 (11%)

Query: 14  LIISEPDVS---DQQEKSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHA 65
           ++I++P  +   + + K+     RL +LD  RGL V LM+LV++     A  D  ++ HA
Sbjct: 1   MMIAQPSTAIAVESKNKATPAGARLLALDGLRGLTVFLMLLVNNLALQEATPD--QLVHA 58

Query: 66  PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           P+ G  LAD V P+FLF +G AI  A     D+           +      I L G F  
Sbjct: 59  PFGGVTLADLVFPWFLFCMGAAIPYAASSF-DKQKLPLWRRLLRILRRTSLIFLLGLF-- 115

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
               LT  +    +   GVLQ IAL+Y L +L
Sbjct: 116 ----LTSALARTPVFALGVLQLIALAYCLAAL 143


>gi|284036950|ref|YP_003386880.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
 gi|283816243|gb|ADB38081.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
          Length = 404

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 69/132 (52%), Gaps = 16/132 (12%)

Query: 16  ISEPDVSDQQE-KSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAP--W 67
           +++ + +D+Q  +S   + R+ ++DI R + + LMI V+      A  DW E  H P   
Sbjct: 1   MTQLETADRQAYRSSSVSMRVDAIDILRAMTMILMIFVNDLWSLTAIPDWLE--HVPHGV 58

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
           +G  LAD V P FLFIVG+++  A+     + D     V  +I R++ LL  G+ L  G 
Sbjct: 59  DGIGLADVVFPGFLFIVGMSLPFAMNARRQKGDTNSALVSHIIMRSIALLVMGVFLVNG- 117

Query: 124 SHAPDELTYGVD 135
             + D+   G++
Sbjct: 118 -ESIDQKATGIN 128


>gi|392965639|ref|ZP_10331058.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
 gi|387844703|emb|CCH53104.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
          Length = 411

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 9/112 (8%)

Query: 15  IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAPWNG 69
           +I +  V+  + +  L  +R+ S+DIFR L +  MI V+         DW E S A  + 
Sbjct: 1   MIDQQVVASPETRRQLPEKRVHSIDIFRALTMLFMIFVNDLWTLIGIPDWLEHSPADVDF 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
             LAD V P FLFIVG++I  A++    + D+    ++ ++ R++ LL  G+
Sbjct: 61  LGLADVVFPCFLFIVGMSIPFAIQGRLAKGDSYGLIIRHIVVRSVALLIMGV 112


>gi|311746093|ref|ZP_07719878.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
 gi|126576311|gb|EAZ80589.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
          Length = 367

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 55/105 (52%), Gaps = 19/105 (18%)

Query: 31  KTQRLASLDIFRGLAVALMIL--------VDHAGGD-------WPEISHAPWNGCNLADF 75
           K+ RL SLD FRGL + L+I         +  A  D       + + +H PWNG    D 
Sbjct: 7   KSGRLVSLDTFRGLTMFLLIAEAAFVYESLLEAFPDPGILNSFFTQFTHHPWNGLRFWDL 66

Query: 76  VMPFFLFIVGVAIALAL-KRI---PDRADAVKKVIFRTLKLLFWG 116
           + PFF+FIVGVA+  +L KR+    +R++  K ++ R   L  +G
Sbjct: 67  IQPFFMFIVGVAMPFSLNKRLENQENRSEVTKHILKRCFYLFLFG 111



 Score = 45.1 bits (105), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 48/100 (48%)

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           TI G   G++++  K    ++K     G   LI G  L  T   P+ K++ T S+   + 
Sbjct: 210 TIWGAICGNLLLSKKSDQDKIKTLTIAGVIALIIGYGLDLTGITPIIKRISTSSFALASG 269

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           G A +  +  + L+D+   +    P   +GMN++ +Y+ A
Sbjct: 270 GWALITLAFSFWLIDVKKFQSKAFPFIIVGMNSIFIYLFA 309


>gi|404403948|ref|ZP_10995532.1| hypothetical protein AJC13_00860 [Alistipes sp. JC136]
          Length = 369

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 74/138 (53%), Gaps = 2/138 (1%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFT 342
           +  F+PEGLLS+V +I+S + G+  G  +   + G     K       A  I  + + ++
Sbjct: 198 YKTFDPEGLLSTVPAIVSAMFGMFTGEFLRRERPGLTGDRKALYMALAAAAITLVGIAWS 257

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
             +P+NK+L++ S+ CV +G +  +F+  Y L+D+   K   L    IG+N++ +Y +A 
Sbjct: 258 GVMPINKKLWSSSFTCVVTGYSLGMFALFYYLIDVRGWKRWTLFFRVIGLNSITIY-LAQ 316

Query: 403 EGIFAGFINGWYYGDPHN 420
             I  G I+ +++G   +
Sbjct: 317 RIIPFGRISDFFFGGAAS 334



 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 68/149 (45%), Gaps = 30/149 (20%)

Query: 32  TQRLASLDIFRGLAVALMI----LVDHAGGDWP---------EISHAPWNGCNLADFVMP 78
            QRL SLD  RG  +  ++    LV      WP          + HA W+G    D + P
Sbjct: 4   NQRLLSLDALRGFDMLFIMGFSGLVASLCALWPNPFTDAVAGSMGHAAWDGLTHHDTIFP 63

Query: 79  FFLFIVGVAIALALKRIPDRADA------VKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            FLFI GV+   +L +   RA+       + KVI R + L+  G++  G F         
Sbjct: 64  LFLFIAGVSFPFSLAK--QRANGLGERAILGKVIRRGVTLVVLGLVYNGLFK-------- 113

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
            +D   +R+  VL RI L+++  +++ I+
Sbjct: 114 -LDFASLRVASVLGRIGLAWMFAAILYIY 141


>gi|296121958|ref|YP_003629736.1| hypothetical protein Plim_1707 [Planctomyces limnophilus DSM 3776]
 gi|296014298|gb|ADG67537.1| conserved hypothetical protein [Planctomyces limnophilus DSM 3776]
          Length = 378

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 80/344 (23%), Positives = 134/344 (38%), Gaps = 62/344 (18%)

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGIL 118
            H  W GC+L D V P F+F+VGV+I  +L     +         + ++R++ L+  GI 
Sbjct: 32  EHVAWRGCSLWDMVQPSFMFLVGVSIPWSLAAQKSKNVSTGQGWVRAVWRSVLLVVLGIF 91

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           L      + D               VL +I L YL+V  V   ++ +             
Sbjct: 92  LISNNKPSTD----------FSFVNVLTQIGLGYLVVYAVAQSSRPLAP----------- 130

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
                   MAA +L  Y  LL+  +    Q  I +++ A             A+LN P  
Sbjct: 131 -------FMAAAILAAYW-LLFALWPVVSQGPIGSQELA-------------AQLNLP-- 167

Query: 239 AVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEG--LLSS 295
               IDR+ L G    ++      + A + D  F     ++ P      F   G   L+ 
Sbjct: 168 ----IDRQQLAGFGAHWNQ---HLNPAASFDRWFLNLFPRETPF----VFNRGGYQTLNF 216

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLS 355
           + SI + ++G+  G  I    G      + V  G  + + G  L  T   PL K+++T S
Sbjct: 217 IPSIATMLLGLAAGQTIREVPGAFPAAIRLVLGGVVIALLGWLLDITGVCPLVKRIWTPS 276

Query: 356 YVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           +   + G    +    + L +    +    PL   GMN++++YV
Sbjct: 277 WTLWSGGLCLSILGLFHFLTEALQKQILAWPLTIAGMNSIVLYV 320


>gi|196234160|ref|ZP_03132993.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
 gi|196221811|gb|EDY16348.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
          Length = 417

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 95/416 (22%), Positives = 152/416 (36%), Gaps = 87/416 (20%)

Query: 34  RLASLDIFRGLAVALMIL-------VDHAGGD---W----PEISHAPWNGCNLADFVMPF 79
           RL S+D +RGL + L++        V  A  D   W     +  H  W G  L D + P 
Sbjct: 32  RLGSIDAYRGLVMFLLLAEQFRTASVAKALPDSSFWRFLATQQEHVTWTGAVLHDMIQPS 91

Query: 80  FLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F F+VGVA+  ++     R  + +        R L L+  GI L+    H+    T+   
Sbjct: 92  FSFLVGVALPFSIGNRRARGQSPEATTGHAFLRALILVLLGIFLRST-GHSQTNFTFE-- 148

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
                    L +I L Y  + L+ + +  VQ                  W+    +LV Y
Sbjct: 149 -------DTLTQIGLGYGFLYLIALRSVRVQ------------------WIALVVILVGY 183

Query: 196 LALLYGTYVP----DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
                   +P    DW    +  D A          G  A  N   N     DR  L + 
Sbjct: 184 WLAFARWPLPGEDFDWVHAGVTPDFAGNAS------GFAAHWNKNTNLAWAFDRWFLNL- 236

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
                             P E P   +   +    F P        ++ + I+G+  G V
Sbjct: 237 -----------------FPREKPFLFNQGGYATLSFIP--------TLGTMILGLLAGEV 271

Query: 312 IIHTKGHLARLKQWVTMGFALLI-FGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           I   +    ++  W+ M   + +  G  L  T   P+ K+++T S+V  + G      + 
Sbjct: 272 IRSPRRDSEKMG-WLLMAAVVCVGLGWLLGATGVCPVVKRVWTPSWVLFSGGWCFFATAL 330

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFI-NGWYYGDPHNTLVCF 425
            Y ++D+   +    PL  IGMN++ +YVM  + +F  FI + W      NT   F
Sbjct: 331 FYRIIDLSGYRRWAFPLIVIGMNSIAIYVM--DHLFPSFIRDAWRTHVGANTFKVF 384


>gi|372268269|ref|ZP_09504317.1| hypothetical protein AlS89_10220 [Alteromonas sp. S89]
          Length = 365

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 70/131 (53%), Gaps = 11/131 (8%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVII-HTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           ++PEG+LS++ +I + + GV  G  +  H   H A LK     G A L+ G   H  +  
Sbjct: 199 YDPEGVLSTIPAIANALFGVFAGRWLSKHAGDHKAILKGLFAAGVACLVAGYVWH--SVY 256

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV------ 399
           P+NK+L+T S+V +T G   L+    Y LVD+W+        + IG NA+LVY+      
Sbjct: 257 PVNKELWTSSFVLITCGCCLLLLGLFYLLVDVWHWNSFTYFFSVIGCNAILVYLGTSLVN 316

Query: 400 --MAAEGIFAG 408
              +++G+F G
Sbjct: 317 WQYSSKGVFGG 327



 Score = 46.6 bits (109), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 63/146 (43%), Gaps = 23/146 (15%)

Query: 31  KTQRLASLDIFRGLAVALMI------LVDHAGGDWP-------EISHAPWNGCNLADFVM 77
           K QRLAS+D  RG  +  +I      L   A   W        ++ H PW+G    D + 
Sbjct: 4   KKQRLASVDALRGFDMFWIIGGEALFLPLFALTGWSIFQFGHAQMQHTPWHGFTFYDLIF 63

Query: 78  PFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           P F+F+ GV + LA K      +  RA   +K   R L L+  GIL   G+         
Sbjct: 64  PLFIFLSGVTLGLANKSLRGLPVSQRAPVYRKATKRLLLLVLLGILYNHGWGTGIPA--- 120

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
             D+  IR   VL RI  ++   +++
Sbjct: 121 --DLSEIRYASVLARIGFAWFFAAMI 144


>gi|375148919|ref|YP_005011360.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361062965|gb|AEW01957.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 397

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 53/94 (56%), Gaps = 9/94 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD-----WPEISHAPWNGCNLADFVMPFFLFIVGV 86
           TQRLAS+D+FR L + LMI V+  G       W E + A  +G  LAD V P FLFIVG+
Sbjct: 5   TQRLASIDVFRALTMLLMIFVNDLGTLKNIPLWLEHTKANEDGMGLADTVFPAFLFIVGL 64

Query: 87  AIALAL----KRIPDRADAVKKVIFRTLKLLFWG 116
           +I  A+     +   +++ +  ++ R+  LL  G
Sbjct: 65  SIPFAIGNRWAKGASQSNILGHILIRSFALLVMG 98


>gi|374309893|ref|YP_005056323.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358751903|gb|AEU35293.1| hypothetical protein AciX8_0944 [Granulicella mallensis MP5ACTX8]
          Length = 399

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 89/391 (22%), Positives = 151/391 (38%), Gaps = 82/391 (20%)

Query: 33  QRLASLDIFRGLAVALMIL----VDHAGGDWPE----------ISHAPWNGCNLADFVMP 78
           QR +++D +RG  +ALM+            +P+           SH  W G +L D + P
Sbjct: 13  QRNSAVDAYRGFVMALMLAEVFRFAFVAKSFPDNFLLHILAYNQSHVEWTGMSLHDMIQP 72

Query: 79  FFLFIVGVAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
            F F+VGVA+  +L+    + ++ K      I+R+  L+  GI L+   S A D      
Sbjct: 73  SFTFLVGVALPYSLRSRRRKGESFKYMLGHTIWRSFLLVALGIFLRSIHSTATD------ 126

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
                     L +I L Y    LV +                  R   W W+    VL  
Sbjct: 127 ----FTFEDTLTQIGLGYTFAFLVAL------------------RPARWLWITLGTVLFS 164

Query: 195 Y-LA-LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           Y LA  LY    P++ ++ +   S  +  +F    G  +  N   N     D   L   +
Sbjct: 165 YWLAWALYPAPGPNFPYSSVGVPSEWHQHLF---TGFMSHWNKNSNLGQAFDVWFL---N 218

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEG--LLSSVSSILSTIIGVHFGH 310
           ++  P           +PFE              F   G   LS + ++ + ++G+  G 
Sbjct: 219 LFPRP-----------TPFE--------------FNDGGYLTLSFIPTLGTMLLGLAAGR 253

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
            ++     +   K+ +     LL  GL  HF    P+ K+++T S+   + G   L  +A
Sbjct: 254 WLLEATPKIP-FKKLLLAAAVLLGSGLLFHFAGICPIVKRIWTPSWTLFSGGVCFLFLAA 312

Query: 371 IYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
              +VD+        PL  +G N++  Y+M+
Sbjct: 313 FSWVVDVKKQYRLAFPLIVVGTNSIAAYLMS 343


>gi|424665544|ref|ZP_18102580.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
           616]
 gi|404574617|gb|EKA79366.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
           616]
          Length = 375

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 68/146 (46%), Gaps = 25/146 (17%)

Query: 32  TQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCNLADFVMP 78
           + RLASLDI RG  + L++        L       W      +  H  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWALAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 79  FFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
            FLF+ G ++  +  +    PD+    +K++ R + L  +G+++QG           G++
Sbjct: 69  LFLFMTGASMPFSFSKFKDDPDKGPIYRKILKRFILLFIFGMIVQGNL--------LGLN 120

Query: 136 VRMIRL-CGVLQRIALSYLLVSLVEI 160
            + + L    LQ IA  YL+ +++++
Sbjct: 121 PKYLYLYSNTLQAIATGYLIAAIIQL 146


>gi|407789242|ref|ZP_11136344.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
 gi|407207220|gb|EKE77163.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
          Length = 364

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 68/117 (58%), Gaps = 2/117 (1%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           P +PEGLLS++ ++++ ++GV  G ++   +   ++          L +  L   ++   
Sbjct: 198 PADPEGLLSTLPAVVNALLGVVAGGLLRSPRQPWSKAVL--LAALGLGLLALGYLWSLVF 255

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
           PLNK L+T S+V VTSG +AL+ +  Y L+D+  L++  L  A IG NA+ +Y+ ++
Sbjct: 256 PLNKTLWTSSFVLVTSGWSALLLALFYVLIDLLRLRWLGLAFAVIGANAIAIYLASS 312



 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 28/49 (57%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRT 109
           +++H+ W+G    D + P F+F+ GV + LA KR        ++ ++R+
Sbjct: 48  QMAHSDWHGLTAYDGIFPLFIFLSGVTLGLADKRASALGGGARRALYRS 96


>gi|374312990|ref|YP_005059420.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358755000|gb|AEU38390.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 408

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 55/98 (56%), Gaps = 9/98 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-----WPEISHAPWNGCNLADFVMPFFLFIVG 85
           +T R+AS+DIFRGL +A+MI V+   G      W   + A  +     D V PFFLFI+G
Sbjct: 14  RTTRVASIDIFRGLTMAIMIFVNDLDGVQGLPWWTHHAKANIDVMTYVDMVFPFFLFIIG 73

Query: 86  VAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILL 119
           +++ LA    LK+ P        V+ R++ L+  G++L
Sbjct: 74  LSMPLAIRQRLKKNPSIPQLWLHVLIRSVSLVALGVIL 111


>gi|393784535|ref|ZP_10372698.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665516|gb|EIY59040.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
           CL02T12C01]
          Length = 378

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 74/148 (50%), Gaps = 26/148 (17%)

Query: 31  KTQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVM 77
            + RLASLDI RG  + L++    V  A G   D P       +  H  W G +L D VM
Sbjct: 10  NSSRLASLDILRGFDLFLLVFFQPVFVALGQQLDLPFLNRLVYQFDHEAWVGFHLWDLVM 69

Query: 78  PFFLFIVGVAIALALKRIPDRADA---VKKVIFRTLKLLF-WGILLQGGFSHAPDELTYG 133
           P FLF+ G ++  +L +    +     V + IFR + LLF +G+++QG           G
Sbjct: 70  PLFLFMTGASMPFSLSKYKISSAGCQFVYRRIFRRVVLLFLFGMIVQGNL--------LG 121

Query: 134 VDVRMIRL-CGVLQRIALSYLLVSLVEI 160
            D + I L    LQ IA+ YL+ +++++
Sbjct: 122 FDSQHIYLYSNTLQAIAVGYLIAAIIQL 149


>gi|322785713|gb|EFZ12351.1| hypothetical protein SINV_07836 [Solenopsis invicta]
          Length = 106

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 47/85 (55%)

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           L  +++I    +GV  G +++  KG   R+ +W+         G  LHFTN IP+NK L+
Sbjct: 10  LRILTAIFQVFLGVQAGVILMIYKGWKERVIRWLLWAVFYGCMGCILHFTNIIPINKSLW 69

Query: 353 TLSYVCVTSGAAALVFSAIYALVDI 377
           +LS+V V++  A    S  Y L+D+
Sbjct: 70  SLSFVFVSTSFALAFLSGCYLLIDV 94


>gi|408674314|ref|YP_006874062.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387855938|gb|AFK04035.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 391

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/390 (22%), Positives = 149/390 (38%), Gaps = 69/390 (17%)

Query: 33  QRLASLDIFRGLAVALMIL-VDHAGG---DWPEIS----------HAPWNGCNLADFVMP 78
            RL S DI+RG  + LM+  V H G      PE S          H  W GC+L D + P
Sbjct: 5   NRLTSADIYRGFVMLLMMAEVLHFGKVSEALPESSFWAFLAFHQDHVEWVGCSLHDLIQP 64

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
            F F+VGV +  ++ R   + +         LK     ILL           TY      
Sbjct: 65  SFSFLVGVVLPYSIARRLTQREGTNAAFLHALKRSLILILLGIFLRSQYKSQTY------ 118

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
                 L +I + Y  + L+   ++ VQ                   + A  +++V   L
Sbjct: 119 FTFEDTLSQIGMGYPFLFLLAFRSQKVQ-------------------ISALIIILVGYWL 159

Query: 199 LYGTY-VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
            +  Y +PD  F  +N   A   +  +      A  N   N     DR  L +       
Sbjct: 160 AFALYPLPDANFDYVNAGVAKDWE--HNLSDFSAHWNKNTNFAWAFDRWFLNL------- 210

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            + R KA      F G                   LS + ++ + I+G+  G+++     
Sbjct: 211 -FPREKAFL----FNGG--------------GYATLSFIPTLGTMILGLLAGNILKSETK 251

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 377
              +LKQ++ +G + L+ G+  +     P  K+++T ++V  + G   L  +  Y ++D+
Sbjct: 252 ADEKLKQFIILGVSGLMLGIIFNRLGICPNVKRIWTPTWVLFSGGLCFLFLAFFYWIIDV 311

Query: 378 WNLKYPFLPLAWIGMNAMLVYVMA-AEGIF 406
                    L  IGMN++  Y +A A G++
Sbjct: 312 RGKSDWAYFLKVIGMNSIAAYCIADALGVY 341


>gi|456861512|gb|EMF80162.1| hypothetical protein LEP1GSC188_2620 [Leptospira weilii serovar
          Topaz str. LT2116]
          Length = 88

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
          ++KS     R+ SLD+FRG+ V  MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 2  EKKSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAKWNGCTPTDLVFPFF 60


>gi|373954275|ref|ZP_09614235.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373890875|gb|EHQ26772.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 397

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 67/132 (50%), Gaps = 17/132 (12%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            R+ S+DIFR + + LMI V+   G     +W + +    +G  LAD V P FLFIVG++
Sbjct: 7   NRVHSIDIFRAVTMFLMIFVNDIDGVPGVPEWIKHAGERTDGLGLADIVFPAFLFIVGLS 66

Query: 88  IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I  A++    R D+  K    ++ R L L+F       GF HA  E TY  D  +I    
Sbjct: 67  IPHAIQSRISRGDSKTKIAAYIVMRALALIFI------GFIHANME-TYS-DTAVIAQPW 118

Query: 144 VLQRIALSYLLV 155
              +I LS+ L+
Sbjct: 119 WEIQITLSFFLI 130


>gi|317503636|ref|ZP_07961655.1| conserved hypothetical protein, partial [Prevotella salivae DSM
          15606]
 gi|315665261|gb|EFV04909.1| conserved hypothetical protein [Prevotella salivae DSM 15606]
          Length = 59

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 2/55 (3%)

Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFI 83
          KT R+ ++DI RG+ +A MILV++ GG   +  + HA W G    D V PFF+FI
Sbjct: 5  KTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWLGLTPTDLVFPFFMFI 59


>gi|332662942|ref|YP_004445730.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332331756|gb|AEE48857.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 394

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 99/440 (22%), Positives = 169/440 (38%), Gaps = 81/440 (18%)

Query: 34  RLASLDIFRGLAVALMIL----VDHAGGDWPEI----------SHAPWNGCNLADFVMPF 79
           RL S+D++RGL + LM+       H    +P+           SH PW GC+L D + P 
Sbjct: 9   RLGSVDVYRGLVMFLMMAEVLEFGHVAKAFPDSGFWAFLHFHQSHVPWVGCSLHDLIQPS 68

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           F F+VGVA+  +L     +  +  ++   T +     ILL        D+ T        
Sbjct: 69  FSFLVGVALPYSLASRMSQGQSNWQIWGHTFRRALILILLGIFLRSMYDKQT------NF 122

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
                L +I L Y+ +               ++G+ S  R     W+  A +L+ Y A  
Sbjct: 123 TFEDTLTQIGLGYIFLF--------------ALGQVSQRR----QWVALAIILLGYWAWF 164

Query: 200 YGTYVPDWQFTIINKD-SADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
               +P   F       +AD+        G  A  N   NA    DR  + +        
Sbjct: 165 ALFPLPGADFNWAEAGVAADWPHHLQ---GFAAHWNKNTNAAWVFDRWFMNL-------- 213

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
                      P E   R +   +    F P        ++ + I+G+  G  +   +  
Sbjct: 214 ----------FPREEAFRFNGGGYSTMSFVP--------TLGTMILGLIAGQWLKSAQVA 255

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIW 378
           +  L++ V  G  LL  G  L+     P  K+++T ++   + G   L+ +  Y LVD+ 
Sbjct: 256 VDLLRKLVLTGLILLTLGFLLNLLGVCPNVKRIWTPTWTLYSGGWCFLLLAGFYYLVDVL 315

Query: 379 NLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWY-----------YGDPHNTLVCFLF 427
                F  L  +GMN++  YV+A   I  GFI+  +           +G+ + TLV    
Sbjct: 316 QQGRWFYVLKVLGMNSIAAYVLAHTVI--GFIDSSFRIHVSQQYDLIFGEGYRTLVSGTV 373

Query: 428 IISYILHSFLWELRKFLYVQ 447
           +++       W  RK ++++
Sbjct: 374 LLAMQFLVLDWMYRKRIFIK 393


>gi|94985055|ref|YP_604419.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
 gi|94555336|gb|ABF45250.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
          Length = 573

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 49/79 (62%), Gaps = 5/79 (6%)

Query: 15  IISEP-DVSDQQEKSHLKTQ-RLASLDIFRGLAVALMILVDH-AGGDW--PEISHAPWNG 69
           ++S+P   +       ++T+ RL +LD +RGL V LM+LV++ A GD   P++ HAP+ G
Sbjct: 197 VLSDPAPTTSAGGAGPVQTRVRLTALDAWRGLTVLLMLLVNNVALGDLTPPQLQHAPFGG 256

Query: 70  CNLADFVMPFFLFIVGVAI 88
             L D V P+FLF  G A+
Sbjct: 257 LTLTDLVFPWFLFCAGAAL 275


>gi|403717790|ref|ZP_10942873.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
           100340]
 gi|403208927|dbj|GAB97556.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
           100340]
          Length = 461

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 95/246 (38%), Gaps = 51/246 (20%)

Query: 14  LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCN 71
           L  ++             ++R  SLD+ RGL + + + V+       W E  HA W G +
Sbjct: 61  LPATQSSTKSPPPAQSFPSRRFISLDVARGLMLVVSVAVNAWITAPAWFE--HAAWAGVH 118

Query: 72  LADFVMPFFLFIVGVAIALA-LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
             D V P F+ + G  +A A  +RIP     ++ ++ R + L   G+      +HA    
Sbjct: 119 PVDLVFPTFVALSGAGLAFAYARRIP-----LRPLLSRVIVLALAGLAYN---AHAQYLS 170

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
           T  +D    R+ GVLQ  A   L+++L+                  + R + W W +   
Sbjct: 171 TGQLDWATFRIPGVLQLYAAIVLVIALLHF----------------VLRRW-WAWPLFTI 213

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           V     AL             +N+        F   C   A L P CN  G  D  + G+
Sbjct: 214 VAATCFAL------------ALNR--------FAAGCPGGA-LTPECNPSGLFDPALFGV 252

Query: 251 NHMYHH 256
            H+YH 
Sbjct: 253 EHIYHQ 258


>gi|374372786|ref|ZP_09630447.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
 gi|373234862|gb|EHP54654.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
          Length = 357

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 18/102 (17%)

Query: 35  LASLDIFRGLAVALMIL--------VDHA-GGDWPE-----ISHAPWNGCNLADFVMPFF 80
           + SLD  RGL + L+ L        + HA  G W E       H PW+G +  D + P F
Sbjct: 1   MLSLDFMRGLIMVLLALESTGLYEHLSHASAGTWFEGIMQQFFHHPWHGLHFWDLIQPGF 60

Query: 81  LFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGIL 118
           +F+ GVA+A +L++   R      ++KK + R+  L FWG+L
Sbjct: 61  MFMAGVAMAYSLQKQKQRDYTWNRSLKKTLRRSGWLFFWGVL 102


>gi|408370371|ref|ZP_11168148.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
 gi|407744129|gb|EKF55699.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
          Length = 394

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 9/110 (8%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDH----AGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           + + R+ S+DI RG+ + LM+ V+         W   +    +G  LAD+V P FLF+VG
Sbjct: 1   MNSNRIMSIDIMRGITLFLMLFVNDLFIPGVPKWLVHTQEWEDGMGLADWVFPGFLFMVG 60

Query: 86  VAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELT 131
           ++I  A+K   ++  +  +    VI RTL LL  GIL+    S    ELT
Sbjct: 61  LSIPYAMKARKNKGQSNLRLWSHVIMRTLSLLLIGILMV-NISRVNPELT 109


>gi|153805867|ref|ZP_01958535.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
 gi|149130544|gb|EDM21750.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
          Length = 377

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 25/144 (17%)

Query: 32  TQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVMP 78
           + RLASLDI RG  + L++    V  A G   ++P       +  H  W G    D VMP
Sbjct: 11  SGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMP 70

Query: 79  FFLFIVGVAIALA---LKRIPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGV 134
            FLF+ G ++  +    K  P++    +K+I R + L  +G+++QG      PD L    
Sbjct: 71  LFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSLY--- 127

Query: 135 DVRMIRLCGVLQRIALSYLLVSLV 158
                     LQ IA  YL+ +++
Sbjct: 128 -----LYSNTLQAIATGYLIAAII 146


>gi|423219681|ref|ZP_17206177.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
           CL03T12C61]
 gi|392624886|gb|EIY18964.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
           CL03T12C61]
          Length = 375

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 25/144 (17%)

Query: 32  TQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVMP 78
           + RLASLDI RG  + L++    V  A G   ++P       +  H  W G    D VMP
Sbjct: 9   SGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMP 68

Query: 79  FFLFIVGVAIALA---LKRIPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGV 134
            FLF+ G ++  +    K  P++    +K+I R + L  +G+++QG      PD L    
Sbjct: 69  LFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSLY--- 125

Query: 135 DVRMIRLCGVLQRIALSYLLVSLV 158
                     LQ IA  YL+ +++
Sbjct: 126 -----LYSNTLQAIATGYLIAAII 144


>gi|456985620|gb|EMG21387.1| hypothetical protein LEP1GSC150_0590 [Leptospira interrogans
          serovar Copenhageni str. LT2050]
          Length = 77

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 3/57 (5%)

Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
          ++ L   R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 2  ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFF 58


>gi|224025513|ref|ZP_03643879.1| hypothetical protein BACCOPRO_02253, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224018749|gb|EEF76747.1| hypothetical protein BACCOPRO_02253 [Bacteroides coprophilus DSM
           18228]
          Length = 377

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 20/117 (17%)

Query: 25  QEKSHLKTQRLASLDIFRGLAV--------ALMILVDHAG-GDWPE------ISHAPWNG 69
           +    ++ +RL SLDI RGL +          M L    G  +W +       +H  W G
Sbjct: 1   KNMRKIQKERLESLDILRGLDLFILVGFQSVFMYLAQATGENNWIKTIFDVLFTHVEWEG 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVK-KVIFRTLK--LLFW--GILLQG 121
            +L D VMP FLF+ G +I  A+ R   + + +  K+ +R LK  +L W  G ++QG
Sbjct: 61  FHLWDQVMPLFLFMAGTSIPYAMARYKRKEEEISGKLFYRVLKRVVLLWIFGAIVQG 117


>gi|319902718|ref|YP_004162446.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
 gi|319417749|gb|ADV44860.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
          Length = 380

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 68/152 (44%), Gaps = 25/152 (16%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGC 70
           QQ+   + + RLASLD+ RG  + L++             + P       +  H  W G 
Sbjct: 6   QQDSLKISSSRLASLDVLRGFDLFLLVFFQPVLMSLGQQLNLPFMDVVLYQFDHEVWEGF 65

Query: 71  NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
              D +MP FLF+ GV++  +  +    PD+    +K+  R L L   G+++QG      
Sbjct: 66  RFWDLIMPLFLFMTGVSMPFSFAKYQSSPDKCIIYRKIFRRVLLLFLLGMVVQGNL---- 121

Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLV 158
                G++ + I      LQ IA+ YL+  ++
Sbjct: 122 ----LGLNPKHIYFYTNTLQAIAVGYLIAGMI 149


>gi|255035026|ref|YP_003085647.1| hypothetical protein Dfer_1233 [Dyadobacter fermentans DSM 18053]
 gi|254947782|gb|ACT92482.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 401

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 9/104 (8%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPF 79
             K    + RL S+D+FR + + LMI V+      A   W E S A  +   L+D V P 
Sbjct: 1   MNKVASSSLRLDSIDVFRAVTMLLMIFVNDFWTLEAVPKWLEHSKAEEDAMGLSDVVFPA 60

Query: 80  FLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILL 119
           FLFIVG++I  A+     + D     ++ +  RT  LL  GI +
Sbjct: 61  FLFIVGLSIPFAISNRRKKGDGNALIIRHIAERTFALLLMGIFI 104


>gi|189463412|ref|ZP_03012197.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
 gi|189429841|gb|EDU98825.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
          Length = 219

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 57/263 (21%), Positives = 102/263 (38%), Gaps = 81/263 (30%)

Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
           +KKVI RT+ L   G+ +                 + +R+ GVLQR+AL Y    ++   
Sbjct: 4   IKKVIKRTILLFLVGLFINWCAKGM-------CSFQELRIPGVLQRLALCYFFTVII--- 53

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
             ++ +K                          Y+  L    +  +Q  ++  +   YG 
Sbjct: 54  CTNIHEK--------------------------YIPALITVLLLIYQIILVTGNGFVYG- 86

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
                         P N +  ID+ +LG +H+Y+                          
Sbjct: 87  --------------PQNIIAVIDQYILGASHLYNDHG----------------------- 109

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
                 +PEG+LS++ SI  T+IG   G + I  +   ++L++   +G  LL  G    F
Sbjct: 110 -----IDPEGILSTIPSISHTLIGYCIGKICIEKENIHSKLEKLFLIGTVLLFAGYL--F 162

Query: 342 TNAIPLNKQLYTLSYVCVTSGAA 364
           +   P+NK++++ +YV +T GA 
Sbjct: 163 SYGCPINKKIWSPTYVFMTCGAG 185


>gi|317505448|ref|ZP_07963366.1| ABC superfamily ATP binding cassette transporter permease subunit
           [Prevotella salivae DSM 15606]
 gi|315663361|gb|EFV03110.1| ABC superfamily ATP binding cassette transporter permease subunit
           [Prevotella salivae DSM 15606]
          Length = 380

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 70/152 (46%), Gaps = 29/152 (19%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH-----------AGGD-----WPEISHAPWNGCNLAD 74
           K  RL SLDI RG  +A+++LV             A G        ++ H PW G    D
Sbjct: 9   KPNRLLSLDILRGADLAMLVLVQPILLKALETMQPAEGTVGHFIMGQLLHLPWEGFCFWD 68

Query: 75  FVMPFFLFIVGVAIALALKRIP--DRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            +MP F+F+ G+ I  A+ R    +R D    ++++ R + L   G++ QG         
Sbjct: 69  IIMPLFMFMSGITIPFAMARYKRGERIDGSFYRRILKRFVVLWILGMVCQGNL------- 121

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIF 161
               D++ + L    LQ IA+ Y+ V+ + +F
Sbjct: 122 -LAFDLQQLHLYSNTLQSIAVGYVAVAFLYVF 152


>gi|325106033|ref|YP_004275687.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324974881|gb|ADY53865.1| hypothetical protein Pedsa_3330 [Pedobacter saltans DSM 12145]
          Length = 397

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 60/108 (55%), Gaps = 9/108 (8%)

Query: 32  TQRLASLDIFRGLAVALMILVD--HAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + R+ S+DI RGL + LM+ V+  +  G   W   + A  +G  LAD+V P FLF+VG++
Sbjct: 6   SVRILSIDIMRGLTLFLMLFVNDLYEPGVPKWLVHTKANVDGMGLADWVFPGFLFMVGLS 65

Query: 88  IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELT 131
           I  A+K    + ++  K    ++ R L LLF GIL+       P ELT
Sbjct: 66  IPYAVKARKAKGESGFKIFVHILLRALSLLFIGILMLNADRVNP-ELT 112


>gi|326799399|ref|YP_004317218.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326550163|gb|ADZ78548.1| hypothetical protein Sph21_1988 [Sphingobacterium sp. 21]
          Length = 398

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 56/102 (54%), Gaps = 16/102 (15%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW--------NGCNLADFVMPF 79
             + ++R+ S+DI RGL + LM+ V+    D  E     W        +G  LAD+V P 
Sbjct: 4   KKVASERILSVDIMRGLTLLLMLFVN----DLFEPGVPAWLLHTKVDVDGMGLADWVFPG 59

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGI 117
           FLFIVGV++  A++   ++ ++ +++I     RTL LL  G+
Sbjct: 60  FLFIVGVSVPYAIRSRLNKGESKRQIIGHIAVRTLSLLIIGV 101


>gi|374373619|ref|ZP_09631279.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
 gi|373234592|gb|EHP54385.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
          Length = 397

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 81/188 (43%), Gaps = 31/188 (16%)

Query: 34  RLASLDIFRGLAVALMILVDH----AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R+ S+DI RG+ + LM+ V+         W   + A  +   LAD+V P FLF+VG++I 
Sbjct: 8   RIRSIDIMRGITLCLMLFVNDLYEPGVPHWLVHTKAETDSMGLADWVFPGFLFMVGLSIP 67

Query: 90  LALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL--------------- 130
            A+     + D     V  ++FR++ LL  G+L+  G    P                  
Sbjct: 68  FAIDSRRRKGDEWPQLVLHILFRSVSLLIIGLLMLNGGRVNPQLTGMPVLLWKSLVYLCI 127

Query: 131 -----TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQSVGRFSIFRLYCW 183
                TY V+ RM     +LQ   +  LL  LV IF   +    K    G + I  L  W
Sbjct: 128 FLVWNTYPVNKRMKPFFILLQLAGIGGLL-YLVWIFKAGIPGAIKWMETGWWGILGLIGW 186

Query: 184 HWLMAACV 191
            +L AA +
Sbjct: 187 GYLTAALI 194


>gi|300123408|emb|CBK24681.2| unnamed protein product [Blastocystis hominis]
          Length = 213

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 7/79 (8%)

Query: 7   ETTHHHPLIISEPDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWP-E 61
           E +      ++EP +  Q+EK  + +Q    R+ S+D+FRG+ + +MI  ++  G +   
Sbjct: 135 EASKTEASSVNEPLI--QKEKQSVVSQPMKSRVQSIDVFRGITICIMIFANYGAGQYSHS 192

Query: 62  ISHAPWNGCNLADFVMPFF 80
           + HA W+G   ADF  P +
Sbjct: 193 LMHAAWDGITFADFAFPLY 211


>gi|15806610|ref|NP_295325.1| hypothetical protein DR_1602 [Deinococcus radiodurans R1]
 gi|6459373|gb|AAF11168.1|AE002004_7 hypothetical protein DR_1602 [Deinococcus radiodurans R1]
          Length = 388

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 66/130 (50%), Gaps = 14/130 (10%)

Query: 34  RLASLDIFRGLAVALMILVDHA--GGDWP-EISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL +LD +RGL V LM+LV++   G   P ++SHA + G  L D V P+FLF  G A+  
Sbjct: 33  RLTALDAWRGLTVLLMLLVNNVALGDSTPRQLSHAHFGGLTLTDLVFPWFLFCAGAALPF 92

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF--SHAPDELTYGVDVRMIRLCGVLQRI 148
           +   + ++A      ++R L      + L G F  S     LT G+        GVLQ I
Sbjct: 93  SAAAM-NKAGVTGWPLYRRLLERAALLYLMGAFVTSVTSHRLTLGL--------GVLQLI 143

Query: 149 ALSYLLVSLV 158
           AL+    +L+
Sbjct: 144 ALASFFAALL 153


>gi|187735009|ref|YP_001877121.1| transmembrane protein [Akkermansia muciniphila ATCC BAA-835]
 gi|187425061|gb|ACD04340.1| putative transmembrane protein [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 373

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 63/147 (42%), Gaps = 25/147 (17%)

Query: 28  SHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCNLAD 74
           S  + QR+A++D  RG           L VA + L      +W     +H  W G    D
Sbjct: 5   SDTRPQRIAAIDALRGFDMFFLTGGLALVVAGINLFYDRSPEWLVKHSTHVAWEGFAAWD 64

Query: 75  FVMPFFLFIVGVAIALAL-KRIPDRA--DAVKKVIFRTLKLLFWGILLQGG-FSHAPDEL 130
            VMP FLFIVG A+  +  KRI          KV  R + L   G+++QG   S  P   
Sbjct: 65  LVMPLFLFIVGTAMPFSFSKRIGSEPLWKIYLKVARRVVVLFLLGMVVQGNLLSFEPS-- 122

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSL 157
                 RM   C  LQ IA  YL+ ++
Sbjct: 123 ------RMSLYCNTLQAIASGYLIAAI 143


>gi|392965134|ref|ZP_10330554.1| hypothetical protein BN8_01612 [Fibrisoma limi BUZ 3]
 gi|387846517|emb|CCH52600.1| hypothetical protein BN8_01612 [Fibrisoma limi BUZ 3]
          Length = 531

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 90/396 (22%), Positives = 153/396 (38%), Gaps = 70/396 (17%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALM---ILVDHAGGD-------WPEI----SHAP 66
           V+  ++   L   RL S+D +RG  + LM   IL  H   +       W  +    SH  
Sbjct: 133 VTAPRDSGGLAGDRLMSMDAYRGFVMLLMAAEILQFHRLHEAFPNSVLWGLLAYHQSHVE 192

Query: 67  WNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHA 126
           W GC+L D + P F F+VGVA+  +L    +R  +V+      L+     ILL       
Sbjct: 193 WAGCSLHDLIQPSFSFLVGVALPYSLAARLNRGQSVRMQFGHALRRSLILILLGIFLRST 252

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
             E TY            L +I L Y  + L+       Q                  W+
Sbjct: 253 HREQTY------FTFEDTLTQIGLGYPFLFLLGTTRPRTQ------------------WI 288

Query: 187 MAACVLVVY--LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
             A +LV Y  L +LY    PD+ ++ +     D+ + +    G+ A  N   N     D
Sbjct: 289 ALAGILVGYWLLFVLYPAPGPDFNYSAVGVP-PDWPEHYT---GIAAHFNKNSNVAWAFD 344

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           +  L +                   P E P   +   +    F P        ++ + ++
Sbjct: 345 QWFLNL------------------FPREKPFLFNGGGYATLSFIP--------TLGTMLL 378

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+  G  +       + L+Q++  G A L  GL L  T   P+ K+++T ++V  + G  
Sbjct: 379 GLQAGQWLRAGMPARSLLRQFLIAGAAGLALGLLLDVTGICPIVKRIWTPAWVLFSGGWC 438

Query: 365 ALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
             + +  Y ++D+   +     L  IGMN++ +Y +
Sbjct: 439 FWLLALFYGIIDVARKRRWAFFLTVIGMNSIAIYCL 474


>gi|332668157|ref|YP_004450945.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332336971|gb|AEE54072.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 387

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 17/103 (16%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           +K QRL S+DI R + + LMI V+    D   ++H P W        +G  L+D V P F
Sbjct: 1   MKNQRLPSIDILRAVTMLLMIFVN----DLWSLTHVPHWLLHTAAEEDGMGLSDVVFPAF 56

Query: 81  LFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILL 119
           LFIVG++I  ALK    +   +   +  ++ RT  LL  G+ +
Sbjct: 57  LFIVGLSIPHALKARLEKGASKGSVMLHILSRTFALLVMGLFM 99


>gi|436833713|ref|YP_007318929.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
 gi|384065126|emb|CCG98336.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
          Length = 415

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 87/406 (21%), Positives = 153/406 (37%), Gaps = 74/406 (18%)

Query: 27  KSHLKTQRLASLDIFRGLAVALM----ILVDH------AGGDWPEI----SHAPWNGCNL 72
           ++     RL S+D +RG  + LM    +  DH        G W  +    SH  W+GC+L
Sbjct: 23  QTEFPAGRLLSVDAYRGFVMLLMMGEVLHFDHLHEAFPGSGFWALLAYHQSHVDWSGCSL 82

Query: 73  ADFVMPFFLFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            D + P F F+VGVA+  ++A +++  ++  V+        L+   + +    +HA    
Sbjct: 83  HDLIQPSFSFLVGVALPYSIASRQLKGQSAGVQFGHALRRSLILILLGIFLRSTHASQTY 142

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
               D         L +I L Y ++ L+      V+ +              W W+    
Sbjct: 143 FTFED--------TLTQIGLGYPVLFLLGY----VRPR--------------WQWVALGV 176

Query: 191 VLVVYLALLYGTYVPDWQFTIINKD-SADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           +LV Y  L  G   P   F        AD+   +    G+ A  N   N     D   L 
Sbjct: 177 ILVGYWLLFVGYPAPGPAFAYPAVSVPADWPHHYT---GLMAHFNKNSNVAWAFDTWFLN 233

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
           +                   P E P   +   +          LS + ++ + ++G+  G
Sbjct: 234 L------------------FPREKPFLFNGGGYA--------TLSFIPTLGTMLLGLIAG 267

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
             +         L++ V  GF  L  G         P+ K+++T ++V  + G    + +
Sbjct: 268 QWLRLNLSPTDLLRRLVLAGFVGLAAGTLFELLGICPVVKRIWTPAWVLFSGGWCFWLLA 327

Query: 370 AIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYY 415
             Y +VD+   +    PL  IGMN++ +Y +    +  GFI   +Y
Sbjct: 328 FFYGVVDVGQKQRWAFPLVVIGMNSIAIYCLVH--LIEGFIINSFY 371


>gi|440749360|ref|ZP_20928608.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436482365|gb|ELP38488.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 401

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 50/94 (53%), Gaps = 9/94 (9%)

Query: 33  QRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +R+ S+D+FR + + LMI V+          W     A  +G  LAD V P FL IVG++
Sbjct: 11  RRVYSIDVFRAITMMLMIFVNDLWTLEGIPAWLGHVDAKEDGMGLADVVFPAFLVIVGLS 70

Query: 88  IALAL-KRIPDR---ADAVKKVIFRTLKLLFWGI 117
           I  AL KRI      A  +K + FRTL LL  G+
Sbjct: 71  IPFALSKRIEKGERLAGTLKHIFFRTLALLTMGV 104


>gi|333379187|ref|ZP_08470911.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
           22836]
 gi|332885455|gb|EGK05704.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
           22836]
          Length = 395

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 53/96 (55%), Gaps = 9/96 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVG 85
           K +R+AS+DI+R L +  MI V+          W E + A  +    +D V P FLFI+G
Sbjct: 8   KPKRIASIDIYRALTMFFMIFVNDLWSVSNVPHWLEHAAANEDMLGFSDIVFPSFLFILG 67

Query: 86  VAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
           ++I LA++    + D+    +K +I R++ LL  G+
Sbjct: 68  MSIPLAIEIRKKKGDSNSGILKHIIIRSIALLVMGL 103


>gi|256424049|ref|YP_003124702.1| hypothetical protein Cpin_5069 [Chitinophaga pinensis DSM 2588]
 gi|256038957|gb|ACU62501.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 390

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 9/94 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +QRL S+D FR L +  MI V+   G     +W +   A  +G   AD V P FLFIVG+
Sbjct: 5   SQRLLSIDAFRALTMLTMIFVNDVSGVKNIPEWIDHVKAQDDGMGFADTVFPAFLFIVGL 64

Query: 87  AIALALKRIPDRADAV----KKVIFRTLKLLFWG 116
           +I  A+ +   + D+       ++ R+L ++  G
Sbjct: 65  SIPFAIGKRISKQDSFFSIESHILLRSLAMIVMG 98


>gi|167764058|ref|ZP_02436185.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
           43183]
 gi|167698174|gb|EDS14753.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
           43183]
          Length = 394

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 60/114 (52%), Gaps = 15/114 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G      W   + A  +    +D + P FLF
Sbjct: 2   KNLTLQRVAAVDVFRALTMFLMLFVNDIPGLKNVPHWLMHAAADEDMLGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSHA 126
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH+
Sbjct: 62  CMGMSVSFAIQNRYKKGDTTTQVIAHIFWRTVALIAMGLFSLNSGGIEGGLSHS 115


>gi|292609605|ref|XP_002660455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Danio
           rerio]
          Length = 292

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 30/46 (65%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           +RL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P
Sbjct: 246 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFP 291


>gi|134025078|gb|AAI35092.1| Unknown (protein for IMAGE:7224994) [Danio rerio]
          Length = 291

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 30/46 (65%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           +RL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P
Sbjct: 245 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFP 290


>gi|427384705|ref|ZP_18881210.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727966|gb|EKU90825.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
           12058]
          Length = 398

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G      W E +    +    +D + P FLF
Sbjct: 6   KNLAPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHAEMNEDMMGFSDTIFPAFLF 65

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH
Sbjct: 66  CMGMSVSFAIQNRYRKGDTTLQVIAHVFWRTVALIAMGLFSLNSGGIEGGLSH 118


>gi|377572860|ref|ZP_09801940.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
           104925]
 gi|377538518|dbj|GAB47105.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
           104925]
          Length = 439

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 114/292 (39%), Gaps = 81/292 (27%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +  RL SLD+ RG+ + + ++V+      +W E  HA W G +  D V P F+ + G  +
Sbjct: 8   RGGRLESLDVCRGVMLVVSVVVNAWFTAPEWFE--HAAWTGVHPVDLVFPAFVTLSGAGM 65

Query: 89  ALAL-KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG-VDVRMIRLCGVLQ 146
           A+A  +R+P     V + + R L L   G+     F+ A   L  G VDV  +R  GVLQ
Sbjct: 66  AIAFARRVP-----VARQVRRVLVLTAAGL----AFAVAGQVLGTGAVDVATLRFTGVLQ 116

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
             A   L + LV +  +                   WHWL AA V+    A L  ++   
Sbjct: 117 LYAFLVLALGLVAVVVR-----------------RWWHWLAAAAVVAGAQAWLLASWASS 159

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                + K                      CN  G +D  V G  HMY            
Sbjct: 160 CPGGALTKA---------------------CNPSGVVDAAVFG-PHMY------------ 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHTKG 317
                 G L  D          PEG +++  ++++ ++G   G ++  H +G
Sbjct: 186 ----VMGRLGHD----------PEGFVAAAGALVTALVGAAAGRLMWEHRRG 223


>gi|345517559|ref|ZP_08797028.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
 gi|254837353|gb|EET17662.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
          Length = 359

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 65/144 (45%), Gaps = 25/144 (17%)

Query: 32  TQRLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLADFVMP 78
           + RL SLD+ RGL + L++             D+P       +  H  W G    D VMP
Sbjct: 7   SSRLDSLDMLRGLDLFLLVFFQPVLMSFGQQTDFPWMTSILYQFEHEVWVGFRFWDLVMP 66

Query: 79  FFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
            FLF+ GV++  +  +   I DR    +K+  R L L   G+++QG           G+D
Sbjct: 67  LFLFMTGVSMPFSFAKYRDISDRNAVYRKITRRFLLLFLLGMVVQGNL--------LGLD 118

Query: 136 VRMIRL-CGVLQRIALSYLLVSLV 158
              I L    LQ IA  YL+ +L+
Sbjct: 119 WEHIYLYNNTLQAIAAGYLIAALL 142


>gi|329960675|ref|ZP_08299018.1| conserved domain protein [Bacteroides fluxus YIT 12057]
 gi|328532548|gb|EGF59342.1| conserved domain protein [Bacteroides fluxus YIT 12057]
          Length = 394

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 63/118 (53%), Gaps = 23/118 (19%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMP 78
            +L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P
Sbjct: 2   KNLTPQRVAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHAKIDEDMLGFSDTIFP 57

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSHA 126
            FLF +G++++LA++    + +   +VI    +RT+ LL  G+       ++GG SH+
Sbjct: 58  AFLFCMGMSVSLAIQNRYKKGNTTLQVISHIFWRTIALLAMGLFSLNSGGIEGGLSHS 115


>gi|393788826|ref|ZP_10376952.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
           CL02T12C05]
 gi|392653932|gb|EIY47582.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
           CL02T12C05]
          Length = 376

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 64/145 (44%), Gaps = 26/145 (17%)

Query: 34  RLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++             + P       +  H  W G    D VMP F
Sbjct: 11  RLASLDILRGFDLFLLVFFQPVFVALARQLNLPFLDEVLYQFDHEVWEGFRFWDLVMPLF 70

Query: 81  LFIVGVAIALALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           LF+ G ++  +L +      D     ++++ R + L  +G+++QG           G D 
Sbjct: 71  LFMTGASMPFSLSKYKTASVDYWPVYRRILKRVILLFIFGMIVQGNL--------LGFDS 122

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
           + I      LQ IA+ Y + +++++
Sbjct: 123 KHIYFYSNTLQSIAVGYFIAAVIQL 147


>gi|429727718|ref|ZP_19262477.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
           anaerobius VPI 4330]
 gi|429151771|gb|EKX94625.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
           anaerobius VPI 4330]
          Length = 463

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 79/403 (19%), Positives = 156/403 (38%), Gaps = 83/403 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHA----PWNGCNLADFVMPFFLFIVGV 86
           +  R+ S+D  RGL V L + + + G +  +IS+A     WNG  L D ++P FL ++G 
Sbjct: 45  EDMRVQSIDYMRGLLVILSMFMINQGLE-NQISYAFQNSKWNGMTLLDILVPMFLLVIGS 103

Query: 87  AIALALKR----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +I   +K+      D    VK    +++ +   G++    +  A D          +RL 
Sbjct: 104 SIPFYVKKHYEENEDLRHIVKMSFIKSIIVFVIGLIFSCIYYPAND---------YVRLT 154

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           G +Q +A  Y++  L+ I    ++ K+ ++    I         M   V +++ A+ +  
Sbjct: 155 GPIQMMAFVYIMSLLLYIGFLKMRIKNNALTYIFIS--------MGIIVSIIFTAIGFAH 206

Query: 203 YVPDWQ---FTIINK---------DSADYGKVFNVTCGVRAKL----------NPPCNAV 240
            +   +   F +++K           AD   +     GV   L            P    
Sbjct: 207 SLKTGESSIFVVMDKALLSTFKSVSMADPEGILVCISGVSLGLIGLGLACILNKKPVENK 266

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
            YI          Y   +W +    ++ +        D  SW +          S+ +IL
Sbjct: 267 RYIR---------YKRTSWVKESGYSRKNVLH-----DIKSWINP--------RSIKAIL 304

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-IPLNKQLYTLSYVCV 359
           S    ++               ++ V M F  ++F +        +PLN+ + +L++V  
Sbjct: 305 SNYYRINL------------EARKLVDMLFIAILFYIVSKVMGIWLPLNRNILSLTFVLR 352

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
            S     +   +Y + DI  + +  L +  +G+N++ V V+  
Sbjct: 353 VSSYFYFMMFVLYIICDILAINFGTLLVKRLGLNSLAVIVITT 395


>gi|237722081|ref|ZP_04552562.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|293373568|ref|ZP_06619919.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|299145142|ref|ZP_07038210.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|229448950|gb|EEO54741.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|292631466|gb|EFF50093.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|298515633|gb|EFI39514.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 377

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 64/145 (44%), Gaps = 26/145 (17%)

Query: 34  RLASLDIFRGLAVALMIL-------------VDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++              +        +  H  W G    D VMP F
Sbjct: 12  RLASLDILRGFDLFLLVFFQPVFVALARQMNMSFLDSILYQFDHEVWEGFRFWDLVMPLF 71

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF----WGILLQGGFSHAPDELTYGVDV 136
           LF+ G ++  +L +      +   V  R LK +F    +G+++QG           G+D 
Sbjct: 72  LFMTGASMPFSLSKYIGTTGSYWPVYRRILKRVFLLFIFGMIVQGNL--------LGLDA 123

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
             + L    LQ IA+ YL+ +++++
Sbjct: 124 THLYLYSNTLQSIAVGYLIAAVIQL 148


>gi|456861595|gb|EMF80245.1| hypothetical protein LEP1GSC188_2621 [Leptospira weilii serovar
           Topaz str. LT2116]
          Length = 179

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 59/124 (47%), Gaps = 30/124 (24%)

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           +IDR ++G  H+     W+ SK                       ++PEGLLS V+SI +
Sbjct: 82  WIDRVIIGEKHL-----WKFSKT----------------------WDPEGLLSGVASIAT 114

Query: 302 TIIGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           ++ GV  G ++ +   G  +R+     +GF     GL   +  ++P+NK L+T SY   T
Sbjct: 115 SLFGVLCGFILFLRESGGRSRVFSIFGLGFLFTFVGLL--WDQSLPMNKSLWTGSYAAYT 172

Query: 361 SGAA 364
           +G A
Sbjct: 173 AGLA 176


>gi|333382416|ref|ZP_08474086.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828727|gb|EGK01419.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 394

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 9/108 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           K  R+AS+DIFR L +  MI V+          W E + A  +    +D V P FLFI+G
Sbjct: 8   KPVRVASIDIFRALTMFFMIFVNDFWSVSGVPHWLEHAAASEDMLGFSDVVFPSFLFILG 67

Query: 86  VAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDE 129
           ++I LA++    + +  K+    ++ R++ LL  G+      S   DE
Sbjct: 68  MSIPLAMESRMKKGETKKQILWHIVVRSVALLVMGLFTVNLESGVADE 115


>gi|430744438|ref|YP_007203567.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
           18658]
 gi|430016158|gb|AGA27872.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 55/102 (53%), Gaps = 7/102 (6%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW---PEISHAPWNGCNLADFVMPFFLF 82
           E +  KT R+ S+D FRG  VA M +V+  GG     P + H   N  + AD +MP FLF
Sbjct: 50  EAAGTKTGRIVSMDQFRGYTVAGMCVVNFLGGLQAIHPVLKHNN-NYFSYADTIMPSFLF 108

Query: 83  IVGVAIAL-ALKRIPDRADAV--KKVIFRTLKLLFWGILLQG 121
             G +  L ALKR+     A   ++ ++R+L L+   +++ G
Sbjct: 109 ACGFSYRLTALKRLDQFGPAAMYRRFVWRSLGLVLLSLMMYG 150


>gi|386821099|ref|ZP_10108315.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
 gi|386426205|gb|EIJ40035.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
          Length = 395

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 8/95 (8%)

Query: 33  QRLASLDIFRGLAVALMILVDH----AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+DI RGL + LM+ V+         W   S A  +   LAD+V P FLF+VG++I
Sbjct: 5   TRILSIDIMRGLTLFLMLFVNDLFEPGVPKWLVHSKATEDAMGLADWVFPGFLFMVGLSI 64

Query: 89  ALAL----KRIPDRADAVKKVIFRTLKLLFWGILL 119
             A     K+     + +K ++ RTL LL  G+ +
Sbjct: 65  PFAFLSRRKKGEGDLEILKHILVRTLSLLLIGVFM 99


>gi|389866878|ref|YP_006369119.1| hypothetical protein MODMU_5285 [Modestobacter marinus]
 gi|388489082|emb|CCH90660.1| conserved transmembrane protein of unknown function [Modestobacter
           marinus]
          Length = 327

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 41/71 (57%), Gaps = 4/71 (5%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           +RL  +D+ RGLAV  M++VD+ G      +  HA W+G ++AD V P FL + GV  ++
Sbjct: 2   RRLHGVDVLRGLAVVGMLVVDNRGNASIATQWHHAAWDGLHVADVVFPAFLLVAGV--SM 59

Query: 91  ALKRIPDRADA 101
              R  DR  A
Sbjct: 60  PFSRRADRPRA 70


>gi|115770385|ref|XP_001180412.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Strongylocentrotus purpuratus]
          Length = 78

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 47/78 (60%), Gaps = 6/78 (7%)

Query: 49  MILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----V 102
           M+L   A GD  +  +SHA W+G  +ADF+ P+F+FI+G +I L++  +  +  +     
Sbjct: 1   MLLAGGAYGDGHYWFVSHAIWSGITVADFMFPWFVFIMGTSIHLSINILLSKGQSYPSIY 60

Query: 103 KKVIFRTLKLLFWGILLQ 120
           KK++ R++ L   G+ +Q
Sbjct: 61  KKLVSRSITLFIMGVCIQ 78


>gi|224536805|ref|ZP_03677344.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521571|gb|EEF90676.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 394

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 61/113 (53%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G  + P  + HA  N      +D + P FLF
Sbjct: 2   KNLTPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH
Sbjct: 62  CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114


>gi|423223322|ref|ZP_17209791.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638858|gb|EIY32689.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 394

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 61/113 (53%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G  + P  + HA  N      +D + P FLF
Sbjct: 2   KNLTPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH
Sbjct: 62  CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114


>gi|431799248|ref|YP_007226152.1| hypothetical protein Echvi_3932 [Echinicola vietnamensis DSM 17526]
 gi|430790013|gb|AGA80142.1| Protein of unknown function (DUF1624) [Echinicola vietnamensis DSM
           17526]
          Length = 412

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 57/124 (45%), Gaps = 20/124 (16%)

Query: 4   IKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGD 58
           +   T    P  +SEP +         + +R  ++D+FR + + LMI V+         D
Sbjct: 3   VNNPTKTQRPSKVSEPII---------EAKRSYAIDVFRAVTMLLMIFVNDLWTLEGYPD 53

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLL 113
           W   +    +    +D + P FLFIVG++I  AL     KRIP +    + +I R L LL
Sbjct: 54  WLGHAAVGEDRLGFSDVIFPAFLFIVGLSIPFALQNRFRKRIP-KIKLAEHIILRGLALL 112

Query: 114 FWGI 117
             GI
Sbjct: 113 VMGI 116


>gi|445497064|ref|ZP_21463919.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
 gi|444787059|gb|ELX08607.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
          Length = 381

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 12/108 (11%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVG 85
           K  R+ ++D FRG+ + +MI V+   G      W E + A  +     D V P FLFIVG
Sbjct: 5   KPARVLAIDAFRGITILVMIFVNTLAGVRGMPAWMEHAPADADAMTFPDVVFPAFLFIVG 64

Query: 86  VAIALALKRIPDRADAV----KKVIFRTLKLLFWGILL---QGGFSHA 126
           ++I  A+ +     D      + V+ R   LL  G+ +   +GG++ A
Sbjct: 65  MSIPFAMAQRQAAGDTPAARWRHVLARAAGLLVLGVFMVNAEGGYNEA 112


>gi|395803959|ref|ZP_10483200.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
 gi|395433603|gb|EJF99555.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
          Length = 396

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMP 78
            + K +L  QR+ S+D  RG+ + +MI V+          W +   A  +     D V P
Sbjct: 1   MKIKENLFNQRIVSIDSLRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFP 60

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
            FLFIVG++I  A      + D+ K +   TLK
Sbjct: 61  AFLFIVGMSIPFAFNARLIKGDSPKTIWTHTLK 93


>gi|168705120|ref|ZP_02737397.1| hypothetical protein GobsU_36644 [Gemmata obscuriglobus UQM 2246]
          Length = 387

 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 44/92 (47%), Gaps = 10/92 (10%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG------GDWPEISHAPWNGC 70
           +EP        +     RLASLD FRG  V  M+LV+  G       D P ++H     C
Sbjct: 3   AEPPDPKGAAGAPASAPRLASLDQFRGYTVLGMLLVNFVGSFAVIKADVPVLAHHH-TYC 61

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAV 102
           + AD +MP FLF VG A  L   R   R DAV
Sbjct: 62  SYADTIMPQFLFAVGFAFRLTFAR---RRDAV 90


>gi|225013130|ref|ZP_03703543.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
 gi|225002750|gb|EEG40733.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
          Length = 365

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 14/88 (15%)

Query: 32  TQRLASLDIFRGLAVALMIL-VDHAGGDWPEIS-------------HAPWNGCNLADFVM 77
           +QRL SLD FRG+ + L++    H  G + +               H  W G +  D + 
Sbjct: 8   SQRLRSLDFFRGVVMFLLVAEFSHLFGVFMKTENETITAAADFLFHHVQWEGLHFWDLIQ 67

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKV 105
           PFF+FIVGV+I  +     ++ D+ K++
Sbjct: 68  PFFMFIVGVSIPYSYANRLEKGDSEKQI 95



 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 26/112 (23%), Positives = 55/112 (49%), Gaps = 2/112 (1%)

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
            ++V +   TI G+  G +++    H+ + K+ + +    L  G  L F    P+ K++ 
Sbjct: 203 FNAVPTAAHTIWGLMAGQLLMSKSSHIDKFKRLIVVAVICLALGYALSFFT--PVIKRIT 260

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEG 404
           T S++ ++ G   L  +  Y ++DI +     L    +G+N + +Y+ A+ G
Sbjct: 261 TTSFIFLSGGWTILALAICYWIIDIKSYIKKTLVFTVVGVNPLFIYLFASVG 312


>gi|270294981|ref|ZP_06201182.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274228|gb|EFA20089.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 394

 Score = 47.4 bits (111), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 89/191 (46%), Gaps = 34/191 (17%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P F
Sbjct: 4   LTLQRIAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHARMDEDMMGFSDTIFPAF 59

Query: 81  LFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSHAPDE- 129
           LF +G++++ A++    + D   +V+    +RT+ L+  G+       ++GG SH     
Sbjct: 60  LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSHPWFSI 119

Query: 130 -------LTYGVDVRMIRLCGVL--QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
                  LT+GV  + +    VL         LL++ + I+ KD+  K   +  + I  L
Sbjct: 120 LMVIGFFLTWGVYPKAVGTKKVLFTAMKTAGVLLLAFLVIY-KDMNGKPFQISWWGILGL 178

Query: 181 YCWHWLMAACV 191
             W + + A +
Sbjct: 179 IGWTYAVCAGI 189


>gi|418750786|ref|ZP_13307072.1| putative membrane protein [Leptospira licerasiae str. MMD4847]
 gi|404273389|gb|EJZ40709.1| putative membrane protein [Leptospira licerasiae str. MMD4847]
          Length = 322

 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 82/348 (23%), Positives = 138/348 (39%), Gaps = 99/348 (28%)

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYG 133
           + PFFLF VG +I  +   + +      K++ R++ L+F G+ L   G +S +       
Sbjct: 1   MFPFFLFAVGASIPFS---VSNGIQEFPKILKRSVILIFLGLFLNFFGEWSFSN------ 51

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                +R  GVLQRI  +Y   ++        ++K+          L   +W +   +  
Sbjct: 52  -----LRFPGVLQRIGFAYFFSAIA------YREKNLKFRIILFLTLLISYWYLQEFIPP 100

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                  G   P        K+  D+G                     ++DR+V G  H+
Sbjct: 101 P------GAAEPS------MKEGKDWG--------------------AWLDREVFGQAHL 128

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI- 312
                W+  K                       ++PEGLL+S +SI S   G+  G  + 
Sbjct: 129 -----WKFGKV----------------------WDPEGLLTSFTSIASVFCGIFAGEFLK 161

Query: 313 IHTKGHLARLK---QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           +H +   + L    + V   FA+L+ G         P+NK L+T +Y   T+G A L  S
Sbjct: 162 VHLEKKESPLSISGKIVLGAFAVLVVGGVWGIY--YPINKSLWTGTYSLWTAGWALLAVS 219

Query: 370 AIYALVDIWN------LKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN 411
            ++ +++ +N      L+   LP    G NA+LV+     GIFA  +N
Sbjct: 220 -LFLILEKYNRFGFGALQGFLLPF---GKNALLVFF--GSGIFARSLN 261


>gi|189464971|ref|ZP_03013756.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
           17393]
 gi|189437245|gb|EDV06230.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
           17393]
          Length = 394

 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 58/113 (51%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G      W E +    +    +D + P FLF
Sbjct: 2   KNLAPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHADINEDMMGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       + GG SH
Sbjct: 62  CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIAGGISH 114


>gi|305666718|ref|YP_003863005.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
 gi|88708942|gb|EAR01176.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
          Length = 395

 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 19/114 (16%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADF 75
            EKS  KT R+AS+D+ R L + LMI V+    D+  ++  P W        +    +D 
Sbjct: 1   MEKS--KTLRIASIDVLRALTMLLMIWVN----DFWTLTQVPKWLTHAKPNEDYLGFSDI 54

Query: 76  VMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           + P FLFIVG++I  A+     +   R+   K ++ R++ LL  G+ +    +H
Sbjct: 55  IFPLFLFIVGLSIPFAINNRMAKGEPRSIMFKHIVIRSISLLIIGVFMVNYETH 108


>gi|289422375|ref|ZP_06424221.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
           653-L]
 gi|289157210|gb|EFD05829.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
           653-L]
          Length = 463

 Score = 47.0 bits (110), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 75/401 (18%), Positives = 156/401 (38%), Gaps = 83/401 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHA----PWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+D  RGL V L + + + G +  +IS+A     WNG  L D ++P FL ++G +I
Sbjct: 47  MRVQSIDYMRGLLVILSMFMINQGLE-NQISYAFQNSKWNGMTLNDILVPMFLLVIGSSI 105

Query: 89  ALALKRIPDRADAVKKVI----FRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
              +K+  +  + ++ ++     +++ +   G++    +  A D          +RL G 
Sbjct: 106 PFYVKKHYEENEDIRHIVKMSFIKSIIVFLIGLIFSCIYYPANDY---------VRLTGP 156

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +Q +   Y++  L+ I    ++ K+ ++    I         M   V +++ A+     +
Sbjct: 157 IQMMVFVYIMSLLLYIGFLKMRIKNNALTYIFIS--------MGIIVSIIFTAIGLAHSL 208

Query: 205 PDWQ---FTIINK---------DSADYGKVFNVTCGVRAKL----------NPPCNAVGY 242
              +   F +++K           AD   +     GV   L            P     Y
Sbjct: 209 KTGESSIFVVMDKALLSTFKSVSMADPEGILVCISGVSLGLIGLGLACILNKKPVENKRY 268

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           I          Y   +W +    ++ +        D  SW +          S+ +ILS 
Sbjct: 269 IR---------YKRTSWVKESGYSRKNVLH-----DIKSWINP--------RSIKAILSN 306

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-IPLNKQLYTLSYVCVTS 361
              ++               ++ V M F  ++F +        +PLN+ + +L++V   S
Sbjct: 307 YYRINL------------EARKLVDMLFIAILFYIVSKVMGIWLPLNRNILSLTFVLRVS 354

Query: 362 GAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
                +   +Y + DI  + +  L +  +G+N++ V V+  
Sbjct: 355 SYFYFMMFVLYIICDILAINFGTLLVKRLGLNSLAVIVITT 395


>gi|146302719|ref|YP_001197310.1| hypothetical protein Fjoh_4992 [Flavobacterium johnsoniae UW101]
 gi|146157137|gb|ABQ07991.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 395

 Score = 47.0 bits (110), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 5/90 (5%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFL 81
           K +L  QR+ S+D  RG+ + +MI V+          W +   A  +     D V P FL
Sbjct: 4   KENLYNQRIISIDALRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFPAFL 63

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           FIVG+++  A      + D+ K +   TLK
Sbjct: 64  FIVGMSVPFAFNARLIKGDSPKVIWTHTLK 93


>gi|423304305|ref|ZP_17282304.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
           CL03T00C23]
 gi|423310581|ref|ZP_17288565.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
           CL03T12C37]
 gi|392681752|gb|EIY75109.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
           CL03T12C37]
 gi|392684891|gb|EIY78211.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
           CL03T00C23]
          Length = 394

 Score = 46.6 bits (109), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 60/115 (52%), Gaps = 23/115 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P F
Sbjct: 4   LTLQRIAAVDVFRALTMFLMLFVN----DIPRLKNVPHWLMHARMDEDMMGFSDTIFPAF 59

Query: 81  LFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
           LF +G++++ A++    + D   +V+    +RT+ L+  G+       ++GG SH
Sbjct: 60  LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114


>gi|254445881|ref|ZP_05059357.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
           DG1235]
 gi|198260189|gb|EDY84497.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
           DG1235]
          Length = 394

 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 9/96 (9%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            R+ S+DIFRGL + LMI V+          W E +    +    +D + P FLFIVG++
Sbjct: 3   SRIHSIDIFRGLTMLLMIWVNDFWSLTNVPTWLEHAPGDADAMGFSDIIFPAFLFIVGLS 62

Query: 88  IALALKRIPDRADA----VKKVIFRTLKLLFWGILL 119
           I  AL+    + D+    +  ++ R+  LL  G L+
Sbjct: 63  IPFALRSRLAKGDSKPTIITHILARSFALLLMGFLM 98


>gi|320450186|ref|YP_004202282.1| hypothetical protein TSC_c11130 [Thermus scotoductus SA-01]
 gi|320150355|gb|ADW21733.1| putative membrane protein [Thermus scotoductus SA-01]
          Length = 334

 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 43/75 (57%), Gaps = 8/75 (10%)

Query: 32  TQRLASLDIFRGLAVALMILVDH-AGGDWPEISHAPWNG-CNLADFVMPFFLFIVGVAIA 89
           + R  +LD FRGL VALM+ V++   G  P + H P+ G   LAD V P++L  +G AI 
Sbjct: 4   SARSLALDAFRGLTVALMLFVNNLPPGAPPYLEHGPFGGSVYLADLVFPWYLLAMGAAIP 63

Query: 90  LALKRIPDRADAVKK 104
            +      RA A+++
Sbjct: 64  FS------RASALRR 72


>gi|344238550|gb|EGV94653.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cricetulus
           griseus]
          Length = 423

 Score = 46.2 bits (108), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 22/116 (18%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
              ++PEG+L +++SI+   +GV  G + I      A  K     GF             
Sbjct: 279 QVAYDPEGILGTINSIVMAFLGVQ-GLISI------ALTKMSADEGF------------- 318

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            IP+NK L+++SYV   S  A  +   +Y +VD+  L +   P  + GMN++LVYV
Sbjct: 319 -IPINKNLWSISYVSTLSCFAFFILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV 372



 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 10/88 (11%)

Query: 24  QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           Q E  H      RL  +D FRG+A+ LM+ V++ GG +    H+ WN  +     +P  L
Sbjct: 188 QPETRHTSALPYRLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWN-VSWDKVRIPGVL 246

Query: 82  -------FIVGVAIALALKRIPDRADAV 102
                  F+V V   +  K +PDR   V
Sbjct: 247 QRLGVTYFVVAVLELIFSKPVPDRCALV 274


>gi|317477968|ref|ZP_07937151.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
 gi|316905882|gb|EFV27653.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
          Length = 394

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 60/115 (52%), Gaps = 23/115 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P F
Sbjct: 4   LTLQRIAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHARMDEDMMGFSDTIFPAF 59

Query: 81  LFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
           LF +G++++ A++    + D   +V+    +RT+ L+  G+       ++GG SH
Sbjct: 60  LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114


>gi|423213223|ref|ZP_17199752.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693683|gb|EIY86913.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 469

 Score = 45.8 bits (107), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 16/105 (15%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAPWN--------GCNLADFVMPF 79
             R  +LD  RG A+  M+L    V H    W   +  P +        G    D V PF
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPDHVFNPLLPGITWVDLVFPF 61

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FLF +G A   ++K+  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQ 106


>gi|149277363|ref|ZP_01883505.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
 gi|149232240|gb|EDM37617.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
          Length = 396

 Score = 45.8 bits (107), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 54/110 (49%), Gaps = 16/110 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPW--NGCNLADFVMPFFLFIVGVA 87
           QRL S+D  R L + LMI V+      D P  + HAP   N   LAD V P FL IVG++
Sbjct: 9   QRLVSIDALRALVMLLMIFVNDLWSLIDIPGWLEHAPGDANYMGLADVVFPAFLVIVGLS 68

Query: 88  IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           +  A+     + D  +     +++RT+ LL        GF H   E TYG
Sbjct: 69  VPYAIDSRRRKGDGNRAIFLHIVYRTIALLV------MGFFHVNME-TYG 111


>gi|149178821|ref|ZP_01857402.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
           8797]
 gi|148842362|gb|EDL56744.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
           8797]
          Length = 405

 Score = 45.8 bits (107), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAG--GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            +R+ SLD FRG  VA M LV++ G     P +       C+ AD +MP FLF VG A  
Sbjct: 14  NKRIVSLDQFRGYTVAGMFLVNYMGFFVVCPVVLKHHNTYCSYADTIMPHFLFAVGFAFR 73

Query: 90  LALKRIPDRADAVK---KVIFRTLKLL 113
           L   R    A AV    +V+ R L L+
Sbjct: 74  LTFGRRVQTAGAVSAYARVVRRLLGLV 100


>gi|313220553|emb|CBY31402.1| unnamed protein product [Oikopleura dioica]
          Length = 194

 Score = 45.8 bits (107), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 63/123 (51%), Gaps = 13/123 (10%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-- 344
           ++PEG+L S++SIL   +G+  G +    +    R  +    G  L   G  L   N   
Sbjct: 46  YDPEGILGSINSILIVFLGLQAGRIFNFYETFQQRAIRLSVWGTVLTAVGGALTGLNQFQ 105

Query: 345 ----IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD---IWNLKYPFLPLAWIGMNAMLV 397
               IP+ K L+TLS+V V +G   L+   +Y L+D   +W+      P  ++GMN++LV
Sbjct: 106 EGSNIPIAKNLWTLSFVLVMAGWGFLLLLVLYILIDHKKVWDGA----PFYFVGMNSILV 161

Query: 398 YVM 400
           Y++
Sbjct: 162 YLL 164


>gi|149276664|ref|ZP_01882807.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
 gi|149232333|gb|EDM37709.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
          Length = 359

 Score = 45.4 bits (106), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 18/109 (16%)

Query: 37  SLDIFRGLAVALM--------ILVDHAGGDWP------EISHAPWNGCNLADFVMPFFLF 82
           SLD+ RGL + L+        + + H    WP      +  H PW+G    D V P F+F
Sbjct: 2   SLDVMRGLIMILLCAESCLLYVSLQHLNPAWPASGLVEQFFHHPWHGLRFWDLVQPAFMF 61

Query: 83  IVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAP 127
           + G A+ ++  R  ++  +  +    ++ R+LKL   G+ L   ++  P
Sbjct: 62  MAGAAMYISYSRKLEKGSSWSQNWNHILIRSLKLFLCGVGLHCVYAGKP 110


>gi|329849634|ref|ZP_08264480.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
 gi|328841545|gb|EGF91115.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
          Length = 410

 Score = 45.4 bits (106), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 26  EKSHLKT-------QRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLA 73
           EK   KT        R+ ++D+ R L + LMI V+          W E   +  +G  L+
Sbjct: 6   EKGRAKTMPNKNQFSRVGAIDLVRALTMVLMIFVNDLWSLKGVPVWLEHVASGVDGMGLS 65

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGG 122
           D V P FLFIVG+++  A+     R D++   +     R++ LL  G+ L  G
Sbjct: 66  DVVFPAFLFIVGLSLPFAVSSRQARGDSLGSTVLHILGRSVALLVMGVFLVNG 118


>gi|387789753|ref|YP_006254818.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
 gi|379652586|gb|AFD05642.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
          Length = 389

 Score = 45.1 bits (105), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            RL S+D+ R L + LMI V+          W E      +G  L+D + P FLFIVG++
Sbjct: 6   NRLGSIDVIRALTMFLMIFVNDLWSLVNVPKWLEHVDVQTDGMGLSDVIFPAFLFIVGLS 65

Query: 88  IALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           I  +++    + D+    +K +  R+  LL  G       S+ P  L
Sbjct: 66  IPFSVENRIKKGDSTIQLLKHIFIRSFALLVIGFFHVNLESYNPGAL 112


>gi|456890770|gb|EMG01561.1| hypothetical protein LEP1GSC123_2562 [Leptospira borgpetersenii
          str. 200701203]
          Length = 74

 Score = 45.1 bits (105), Expect = 0.071,   Method: Composition-based stats.
 Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 3/46 (6%)

Query: 38 LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
          +D+FRG+ V  MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 1  MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 46


>gi|423287389|ref|ZP_17266240.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
           CL02T12C04]
 gi|392672504|gb|EIY65971.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
           CL02T12C04]
          Length = 470

 Score = 45.1 bits (105), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 48/106 (45%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP---------WNGCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P           G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++K+  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQ 107


>gi|338209612|ref|YP_004653659.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336303425|gb|AEI46527.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 398

 Score = 45.1 bits (105), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 9/98 (9%)

Query: 29  HLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
            +  +R+ S+D FR L + LMI V+          W E + A  +    +D + P FLFI
Sbjct: 2   QITLKRVPSIDAFRALTMLLMIFVNDFWSLSGIPYWLEHAKAEEDFLGFSDIIFPCFLFI 61

Query: 84  VGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
           +G+AI  A++    + D     V+ +I R++ L+  GI
Sbjct: 62  LGMAIPFAVQNRIAKGDTRWQIVRHIILRSVALIVMGI 99


>gi|406831132|ref|ZP_11090726.1| hypothetical protein SpalD1_05831 [Schlesneria paludicola DSM
           18645]
          Length = 508

 Score = 44.7 bits (104), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 10/108 (9%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD FRG  +  M+LV+  GG     P I     + C+ AD +MP FLF  G A+ L
Sbjct: 14  RLTSLDQFRGYTMLGMLLVNFIGGYKAVSPRILLHTHDYCSYADTIMPHFLFAAGFALRL 73

Query: 91  ALKRIPDRAD------AVKKVIFRTLKLLFW-GILLQGGFSHAPDELT 131
           +L R  +         A+++++   L  + W G    GG  H  + +T
Sbjct: 74  SLGRRMEAGGKMPWGRAIRRILGLALVAIIWYGYCDWGGVVHKFNTMT 121


>gi|403174292|ref|XP_003333277.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375170913|gb|EFP88858.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 386

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 65/136 (47%), Gaps = 5/136 (3%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE-ISHAPW--NGCNLADFVMPFFLFI 83
           +S +  +R  S+D+ RGL    M+LV+ AG   P  +SH          AD + P F+F 
Sbjct: 19  ESDILAKRDRSIDVLRGLTCLAMVLVNTAGPVRPSWLSHPTSIHQSITFADTLFPCFVFT 78

Query: 84  VGVAIALALKRIPD-RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG-VDVRMIRL 141
            G+A A + K   + R  ++K+ + R +KL   GI               G ++++  R+
Sbjct: 79  SGLASAQSKKNEQNGRNPSLKRTLIRAIKLNLIGIAYNNLIPRLAGLHGDGLLNLKTYRI 138

Query: 142 CGVLQRIALSYLLVSL 157
             VL  I +S L+ +L
Sbjct: 139 PSVLGTIGISSLVCTL 154


>gi|299149192|ref|ZP_07042253.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|298512859|gb|EFI36747.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 470

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 55/123 (44%), Gaps = 20/123 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP---------WNGCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P           G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGV 134
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q  +   P  L+   
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 135 DVR 137
           D+R
Sbjct: 119 DIR 121


>gi|424665794|ref|ZP_18102830.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
           616]
 gi|404574047|gb|EKA78798.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
           616]
          Length = 385

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 55/98 (56%), Gaps = 17/98 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
           QR+A++D+FR L + LM+ V+    D P + + P W G           +D + P FLF 
Sbjct: 7   QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62

Query: 84  VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGI 117
           +G++I+ A++    + D++ +VI    +RT+ L+  G+
Sbjct: 63  MGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGL 100


>gi|313148038|ref|ZP_07810231.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|313136805|gb|EFR54165.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
          Length = 385

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 55/98 (56%), Gaps = 17/98 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
           QR+A++D+FR L + LM+ V+    D P + + P W G           +D + P FLF 
Sbjct: 7   QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62

Query: 84  VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGI 117
           +G++I+ A++    + D++ +VI    +RT+ L+  G+
Sbjct: 63  MGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGL 100


>gi|406831133|ref|ZP_11090727.1| hypothetical protein SpalD1_05836 [Schlesneria paludicola DSM
          18645]
          Length = 415

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 39/69 (56%), Gaps = 4/69 (5%)

Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD FRG  +  MILV++ G      P +     + C+ AD +MP F F VG A
Sbjct: 11 SSPRLTSLDQFRGYTMVGMILVNYLGAYKEVTPRLFRHTNDYCSYADTIMPHFFFAVGFA 70

Query: 88 IALAL-KRI 95
          + L+L KRI
Sbjct: 71 MRLSLGKRI 79


>gi|372221469|ref|ZP_09499890.1| hypothetical protein MzeaS_04069 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 378

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 40/169 (23%), Positives = 76/169 (44%), Gaps = 28/169 (16%)

Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
           L    NA+  +D  +LG  H+Y                     +KD+      PF+PEGL
Sbjct: 174 LEMGSNAITRLDLFLLGEGHVY---------------------KKDS-----VPFDPEGL 207

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           LS++ SI++ ++G +F  V +   G+  +    V M    LI+ L   +    P++K+L+
Sbjct: 208 LSTLPSIVN-VLGGYFAGVYLKKNGNTFK-SIAVLMVAGFLIYTLGQWWNLIFPISKKLW 265

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMA 401
           T S+   T G    + + +   ++I  +++        G N + +Y+ +
Sbjct: 266 TSSFALHTIGLDLFILATLVYAIEIKKVRFGVYFFDVFGKNPLFIYLFS 314


>gi|293369241|ref|ZP_06615831.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|336417197|ref|ZP_08597524.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297813|ref|ZP_17275873.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
           CL03T12C18]
 gi|292635666|gb|EFF54168.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|335936517|gb|EGM98443.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664450|gb|EIY57988.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
           CL03T12C18]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 55/123 (44%), Gaps = 20/123 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP---------WNGCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P           G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGV 134
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q  +   P  L+   
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 135 DVR 137
           D+R
Sbjct: 119 DIR 121


>gi|336405631|ref|ZP_08586307.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
 gi|335937114|gb|EGM99020.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 17/105 (16%)

Query: 33  QRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP---------WNGCNLADFVMPF 79
            R  +LD  RG A+  M+L    V H    W   +  P           G    D V PF
Sbjct: 3   NRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFPF 62

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 63  FLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107


>gi|160883830|ref|ZP_02064833.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
 gi|156110915|gb|EDO12660.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 17/105 (16%)

Query: 33  QRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP---------WNGCNLADFVMPF 79
            R  +LD  RG A+  M+L    V H    W   +  P           G    D V PF
Sbjct: 3   NRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFPF 62

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 63  FLFAMGTAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107


>gi|295087641|emb|CBK69164.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP---------WNGCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P           G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107


>gi|373850799|ref|ZP_09593600.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
           bacterium TAV5]
 gi|372476964|gb|EHP36973.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
           bacterium TAV5]
          Length = 401

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 9/101 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
              R+AS+DI R L + LMI+V+          W   S +  +G  +AD V P FLF+VG
Sbjct: 9   NAGRVASIDILRALTMVLMIIVNDLFTLKNTPAWLGHSASGVDGIGVADVVFPAFLFLVG 68

Query: 86  VAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGG 122
           +++  AL+   ++ D   ++++    R+  L+  G+ L  G
Sbjct: 69  LSLPHALEARRNKGDTGLRLVWHVAVRSFALIVMGVFLVNG 109


>gi|430744193|ref|YP_007203322.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
           18658]
 gi|430015913|gb|AGA27627.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
           18658]
          Length = 368

 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 8/108 (7%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S P ++     S     R+ SLD FRG  V  M+ V+  G     P +     + C+ AD
Sbjct: 8   SAPTLTSTSAPSG---SRIVSLDQFRGYTVVGMLFVNFLGNFDALPAVFKHHNSYCSYAD 64

Query: 75  FVMPFFLFIVGVAIALA-LKRIPDR--ADAVKKVIFRTLKLLFWGILL 119
            +MP F F VG A  L  L+R+       AV  V+ R+L L+  G ++
Sbjct: 65  TIMPQFFFAVGFAYRLTFLRRLETSGIGGAVAAVLRRSLGLILLGFVI 112


>gi|336417194|ref|ZP_08597521.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297816|ref|ZP_17275876.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
           CL03T12C18]
 gi|335936514|gb|EGM98440.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664453|gb|EIY57991.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
           CL03T12C18]
          Length = 466

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 53/111 (47%), Gaps = 17/111 (15%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPEIS-HAPWNGCNLADFVMP 78
           T+R  +LD  RG A+  M+L            + HA    P+ +  A  +G    D V P
Sbjct: 2   TKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSH 125
           FFLF +G A   ++K+  ++ D   ++++  +K    L F+ I +Q  + H
Sbjct: 62  FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGAQLTFFAIFIQHFYPH 112


>gi|423281270|ref|ZP_17260181.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
           610]
 gi|404583178|gb|EKA87860.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
           610]
          Length = 385

 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 54/98 (55%), Gaps = 17/98 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
           QR+A++D+FR L + LM+ V+    D P + + P W G           +D + P FLF 
Sbjct: 7   QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62

Query: 84  VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGI 117
           +G++I+ A++    + D+  +VI    +RT+ L+  G+
Sbjct: 63  MGMSISFAVQNRYQKGDSPLQVIMHIFWRTVALIVMGL 100


>gi|436836802|ref|YP_007322018.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
 gi|384068215|emb|CCH01425.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
          Length = 401

 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 48/99 (48%), Gaps = 9/99 (9%)

Query: 33  QRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            R+ S+D+ R L + LMI V+      A   W E      +G  LAD V P FLFIVG++
Sbjct: 16  TRVDSIDVLRALTMVLMIFVNDLWSLTAIPGWLEHVPEGADGIGLADVVFPAFLFIVGLS 75

Query: 88  IALAL--KRIPDRADA--VKKVIFRTLKLLFWGILLQGG 122
           I  A+  +R     DA     V+ R   LL  G+ L  G
Sbjct: 76  IPFAIQHRRTRHETDAQIAGHVLTRAAALLVMGLWLVNG 114


>gi|428319838|ref|YP_007117720.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428243518|gb|AFZ09304.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 482

 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 48/104 (46%), Gaps = 16/104 (15%)

Query: 24  QQEKS---HLKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPW 67
           +QE S      +QR  +LD  RG+AV  M+L            + HA    P+ I +   
Sbjct: 3   KQENSLAVAAVSQRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQLPPPDHIFNNKL 62

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
            G    D V PFFLF +G AI LAL R   +    KKVI   L+
Sbjct: 63  PGLTWVDLVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILSILQ 106


>gi|406835226|ref|ZP_11094820.1| hypothetical protein SpalD1_26403 [Schlesneria paludicola DSM
           18645]
          Length = 508

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 54/119 (45%), Gaps = 22/119 (18%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW-- 67
           H  PL    PD             R+ S+D FRG AVA MI V+  GG    + H+ +  
Sbjct: 3   HDAPLTTDSPD-------------RVISMDQFRGYAVAAMIFVNFVGGF--GVVHSVFKH 47

Query: 68  --NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV---IFRTLKLLFWGILLQG 121
             N  + AD +M  F+F+VG +  L + R   R   +      + R+L L+F   LL G
Sbjct: 48  NDNYLSYADTIMANFMFMVGFSFRLTMLRRLKRMSWLATCWSYVRRSLLLVFVSTLLYG 106


>gi|404404857|ref|ZP_10996441.1| hypothetical protein AJC13_05463 [Alistipes sp. JC136]
          Length = 392

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 59/115 (51%), Gaps = 12/115 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH--AGGDWPE-ISHAPW--NGCNLADFVMPFFLFIVG 85
           +  R+AS+D+FRGL +  M+ V+   +  D P  + HA    +    +D + P FLFI+G
Sbjct: 4   QRNRIASIDVFRGLTMFFMLWVNSFWSLSDVPHWLQHAARGEDMLGFSDTIFPAFLFIMG 63

Query: 86  VAIALALKRIPDRADAVKKVIF----RTLKLLFWGIL---LQGGFSHAPDELTYG 133
            ++ LA+     + D+  K+++    RT  L+  G+L       FS A   L+ G
Sbjct: 64  ASVPLAVGSRRAKGDSTVKIVWHVFTRTFALVVMGLLTVNFGDAFSAAGTGLSRG 118


>gi|419719054|ref|ZP_14246346.1| acyltransferase [Lachnoanaerobaculum saburreum F0468]
 gi|383304805|gb|EIC96198.1| acyltransferase [Lachnoanaerobaculum saburreum F0468]
          Length = 555

 Score = 43.1 bits (100), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 9/98 (9%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLAD----FVMPF 79
           EK  +   R+  LD+ + L+ +++IL+  +   +   E+    W G  + +    F +P 
Sbjct: 203 EKKRVIKDRIIGLDVLKILSASMIILIHSSANLYNNHEVGTLAWKGGLILNVIPRFAVPA 262

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
           FL I G   AL L R  D+  A+KK ++  + L  W I
Sbjct: 263 FLMISG---ALLLGRKTDQRKAIKKAVYAGIALAIWSI 297


>gi|315650733|ref|ZP_07903787.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
           3986]
 gi|315487007|gb|EFU77335.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
           3986]
          Length = 555

 Score = 43.1 bits (100), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 9/98 (9%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLAD----FVMPF 79
           EK  +   R+  LD+ + L+ +++IL+  +   +   E+    W G  + +    F +P 
Sbjct: 203 EKKRVIKDRIIGLDVLKILSASMIILIHSSANLYNNHEVGTLAWKGGLILNVIPRFAVPA 262

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
           FL I G   AL L R  D+  A+KK ++  + L  W I
Sbjct: 263 FLMISG---ALLLGRKTDQQKAIKKAVYAGVALAIWSI 297


>gi|347736758|ref|ZP_08869314.1| hypothetical protein AZA_88975 [Azospirillum amazonense Y2]
 gi|346919651|gb|EGY01093.1| hypothetical protein AZA_88975 [Azospirillum amazonense Y2]
          Length = 175

 Score = 43.1 bits (100), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 2/107 (1%)

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           +S++ ++ + ++GV  G  +       AR++  +  G   ++ G       A P+NK L+
Sbjct: 1   MSTLPAVATALLGVQAGRWLARPVPLPARIRGLIVAGVVGVLLGEVWDL--AFPINKNLW 58

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
           T S+V +T+G A L   A+  +V++   +    P    G+N +L YV
Sbjct: 59  TSSFVALTAGMACLCLGALLWVVEVRGWRRWTRPALVFGVNPILAYV 105


>gi|406831131|ref|ZP_11090725.1| hypothetical protein SpalD1_05826 [Schlesneria paludicola DSM
          18645]
          Length = 520

 Score = 42.7 bits (99), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 2/65 (3%)

Query: 32 TQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
          + RL SLD FRG  +  M+LV++ G     P+I     + C+ AD +MP FLF  G A+ 
Sbjct: 13 SARLTSLDQFRGYTMLGMLLVNYLGSYHVCPQILKHSHDYCSYADTIMPQFLFAAGFAMR 72

Query: 90 LALKR 94
          L+L +
Sbjct: 73 LSLGK 77


>gi|237717694|ref|ZP_04548175.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|299149194|ref|ZP_07042255.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|229453013|gb|EEO58804.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|298512861|gb|EFI36749.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 466

 Score = 42.7 bits (99), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 20/122 (16%)

Query: 33  QRLASLDIFRGLAVALMIL------------VDHAGGDWPEIS-HAPWNGCNLADFVMPF 79
           +R  +LD  RG A+  M+L            + HA    P+ +  A  +G    D V PF
Sbjct: 3   KRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFPF 62

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FLF +G A   ++K+  ++ D   ++++    R ++L F+ I +Q  + H    L+   D
Sbjct: 63  FLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFYPHV---LSNPQD 119

Query: 136 VR 137
           VR
Sbjct: 120 VR 121


>gi|300867270|ref|ZP_07111930.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300334747|emb|CBN57096.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 486

 Score = 42.7 bits (99), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 46/104 (44%), Gaps = 13/104 (12%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPW 67
           V+ Q   +    +R  +LD  RG+AV  M+L            + HA    P  + +   
Sbjct: 7   VNPQDMSTPAVNKRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQEPPPSHLFNPKL 66

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
            G    D V PFFLF +G AI LAL R   +    KKVI   L+
Sbjct: 67  AGLTWVDLVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILSILQ 110


>gi|284035350|ref|YP_003385280.1| hypothetical protein Slin_0417 [Spirosoma linguale DSM 74]
 gi|283814643|gb|ADB36481.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
          linguale DSM 74]
          Length = 389

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 14/73 (19%)

Query: 34 RLASLDIFRGLAVALM----ILVDHAGGDWPEI----------SHAPWNGCNLADFVMPF 79
          RL S+D +RG  + LM    +  DH    +P+           SH  W GC+L D + P 
Sbjct: 4  RLMSMDAYRGFVMVLMAAEMLQFDHLHETFPDSAFWAFLAHHQSHVAWAGCSLHDLIQPS 63

Query: 80 FLFIVGVAIALAL 92
          F F+VGVA+  ++
Sbjct: 64 FSFLVGVALLFSM 76



 Score = 42.4 bits (98), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 23/108 (21%), Positives = 57/108 (52%)

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           LS + ++ + ++G+  G  +       A +++++ +G   +  G+ LH     P+ K+++
Sbjct: 225 LSFIPTLGTMLLGLQAGRWLRSDMDKRALIQRFLLVGGICMAVGILLHVAGICPIVKRIW 284

Query: 353 TLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVM 400
           T ++V  + G    + +  Y ++D+ N +    PL  +GMN++ +Y +
Sbjct: 285 TPAWVLFSGGWCFWLLALFYGIIDVANRRSWAFPLIVVGMNSIAIYCL 332


>gi|297299343|ref|XP_001102014.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Macaca mulatta]
          Length = 149

 Score = 42.4 bits (98), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 2/87 (2%)

Query: 314 HTKGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
            TK  L R   W   +G   ++          IP+NK L++LSYV   S  A  +   +Y
Sbjct: 13  QTKDILIRFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLY 72

Query: 373 ALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            +VD+  L +   P  + GMN++LVYV
Sbjct: 73  PVVDVKGL-WTGTPFFYPGMNSILVYV 98


>gi|326798253|ref|YP_004316072.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326549017|gb|ADZ77402.1| hypothetical protein Sph21_0826 [Sphingobacterium sp. 21]
          Length = 368

 Score = 42.0 bits (97), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 23/119 (19%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMI---------LVDHAGGDWPE-----ISHAPWNGCN 71
           E+  ++T R+ SLD+ RGL + L+          L       WP+       H  W+G  
Sbjct: 2   EEKKVQT-RILSLDVMRGLIMILLAAESCELYTALSSTYSSGWPQGIIHHFFHHEWHGLY 60

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAV------KKVIFRTLKLLFWGILLQGGFS 124
             D V P F+FI G ++ L+ +R   +A  V      K V +R+ KL   G+ L   +S
Sbjct: 61  FWDLVQPAFMFIAGTSLYLSFQR--KQAAGVSWSSHFKSVAWRSAKLFLCGVALHCVYS 117


>gi|186472139|ref|YP_001859481.1| hypothetical protein Bphy_3279 [Burkholderia phymatum STM815]
 gi|184194471|gb|ACC72435.1| conserved hypothetical protein [Burkholderia phymatum STM815]
          Length = 418

 Score = 42.0 bits (97), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 67/146 (45%), Gaps = 22/146 (15%)

Query: 23  DQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
            +Q+K+  + Q    RL  LD FRGL V L I+VDH GG    +S A  +   L D    
Sbjct: 43  KKQKKAEKRMQIPKNRLIELDFFRGL-VLLFIVVDHIGGSI--LSRATLHAYALCD-AAE 98

Query: 79  FFLFIVGVAIALALKRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS- 124
            F+F+ G A A A   +  R   ADA  +   R+L+L    L   +L+         FS 
Sbjct: 99  VFVFLGGFATATAYASLAKRHTEADARNRFFKRSLELYRAFLVTAVLMLVVSAVMSAFSI 158

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIAL 150
            AP+  T  +D  M     VL+ I L
Sbjct: 159 DAPNMATTDLDDMMDTPTAVLRDILL 184


>gi|320104555|ref|YP_004180146.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
 gi|319751837|gb|ADV63597.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
          Length = 399

 Score = 42.0 bits (97), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 83/199 (41%), Gaps = 19/199 (9%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
            R  +LD FRG  VA MI+V+  GG    P I       C+ AD +MP F   VG A   
Sbjct: 15  SRWDALDQFRGYTVAGMIVVNFVGGLAAVPAILKHHNTYCSYADTIMPQFFLAVGFAYRW 74

Query: 91  ALKRIPDRAD---AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
                 +R     AV+  + R L LL  G L+ G    A        D++ + + GVL +
Sbjct: 75  TFLNRLERGGWQAAVRHALGRNLGLLLVGFLMYGLDGKAESW----SDLKALGIRGVLIQ 130

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
                   +LV I    +      + R    RL    WL+ +C L V L+    ++  +W
Sbjct: 131 AFQRDFFQTLVHIAIASLWVLP-VINRSVWIRL---TWLLGSCTLHVVLS---SSFYYEW 183

Query: 208 QFTIINKDSADYGKVFNVT 226
              + N+   D G +  +T
Sbjct: 184 ---VTNRPGIDGGPLGFLT 199


>gi|312131791|ref|YP_003999131.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
           17132]
 gi|311908337|gb|ADQ18778.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
           17132]
          Length = 361

 Score = 42.0 bits (97), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 47/98 (47%), Gaps = 20/98 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFFL 81
           K  RL S+DIFR L +  MI V+    D   + + P W        +G   +D + P FL
Sbjct: 7   KKNRLLSIDIFRALTMFFMIFVN----DLFTVKNVPKWMLHTEMHEDGMGFSDVIFPIFL 62

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
            IVG++I  A      +AD  K +  RT  LL  G+ L
Sbjct: 63  LIVGMSIPFA------KADW-KGIGMRTFALLVMGVFL 93


>gi|262381364|ref|ZP_06074502.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296541|gb|EEY84471.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 410

 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 56/114 (49%), Gaps = 9/114 (7%)

Query: 32  TQRLASLDIFRGLAVALMILVDH--AGGDWPE-ISHAPW--NGCNLADFVMPFFLFIVGV 86
           TQR  ++DI R + + +MI V+      D P  + HA +  +   LAD V P FLF VG+
Sbjct: 9   TQRNIAIDILRAVTMCVMIFVNDFWTVHDVPHYLEHAAYGEDFMGLADVVFPAFLFAVGM 68

Query: 87  AIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           +I  A++R   +  + +  I     RTL LL  G  +    +   D+  + + V
Sbjct: 69  SIPFAIERRYAKGMSGESTILHILSRTLALLIMGAFIVNSEAGMADDALFPIGV 122


>gi|189463407|ref|ZP_03012192.1| hypothetical protein BACCOP_04126 [Bacteroides coprocola DSM 17136]
 gi|189429836|gb|EDU98820.1| hypothetical protein BACCOP_04126 [Bacteroides coprocola DSM 17136]
          Length = 467

 Score = 41.6 bits (96), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 54/123 (43%), Gaps = 28/123 (22%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
            QR  +LD  RG A+  M+L            + HA    PE I +    G    D V P
Sbjct: 2   NQRALALDALRGYAIITMVLSATIISSILPGWMSHAQTPPPEHIFNPEIPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRA--------DAVKKVIFRTLKLLFWGILLQGGFSH---AP 127
           FFLF +G A   ++ R  ++         DA+K    R ++L F+ I +Q  + +   +P
Sbjct: 62  FFLFAMGAAFPFSIGRHAEKGRSKLMLCYDAIK----RGIQLTFFAIFIQHFYPYVISSP 117

Query: 128 DEL 130
            +L
Sbjct: 118 QDL 120


>gi|331002841|ref|ZP_08326355.1| hypothetical protein HMPREF0491_01217 [Lachnospiraceae oral taxon
           107 str. F0167]
 gi|330413330|gb|EGG92698.1| hypothetical protein HMPREF0491_01217 [Lachnospiraceae oral taxon
           107 str. F0167]
          Length = 557

 Score = 41.6 bits (96), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 52/99 (52%), Gaps = 11/99 (11%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW---PEISHAPWNGCNLAD----FVMP 78
           E+  + T+RL  LD+ + ++ ALMI++ HA  +     +I  + W G  + +    F +P
Sbjct: 205 EEKKINTERLIGLDLLKIIS-ALMIILIHASANIYNNHDIGSSVWFGGLILNVIPRFAVP 263

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
            FL I G   AL L R  +   AV+K ++  L L+ W +
Sbjct: 264 TFLMISG---ALLLGRSTEPRKAVRKALYAGLALVVWSV 299


>gi|320107689|ref|YP_004183279.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
 gi|319926210|gb|ADV83285.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
          Length = 419

 Score = 41.6 bits (96), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 20/113 (17%)

Query: 33  QRLASLDIFRGLAVALMI----LVDHAGGDWPEI----------SHAPWNGCNLADFVMP 78
           QR  ++D +RGL + LM+      +     +P            SH  W G  L D + P
Sbjct: 33  QRNVAVDAYRGLVMLLMMGEVMQFEVVARSFPSSTIWRILSFNQSHVQWVGMGLHDMIQP 92

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAP 127
            F F+VGVA+  +L+    +  + +K+    I+R+  L+  GI L+    H+P
Sbjct: 93  SFTFLVGVALPYSLRSRQKKGQSFQKIVGHTIWRSFLLVALGIFLRS--IHSP 143


>gi|298384739|ref|ZP_06994299.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298263018|gb|EFI05882.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 473

 Score = 41.6 bits (96), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
             R  +LD  RG A+  M+L            + HA    P+ I +    G    D V P
Sbjct: 2   NNRAYALDALRGYAIITMVLSATIVTQVLPGWMSHAQTPPPDHIFNPSLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++ +  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIGKRAEKGDSKLKLVYEAVKRGVQLTFFAIFIQ 107


>gi|29345856|ref|NP_809359.1| hypothetical protein BT_0446 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383122991|ref|ZP_09943678.1| hypothetical protein BSIG_0267 [Bacteroides sp. 1_1_6]
 gi|29337749|gb|AAO75553.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251841914|gb|EES69994.1| hypothetical protein BSIG_0267 [Bacteroides sp. 1_1_6]
          Length = 469

 Score = 41.6 bits (96), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
             R  +LD  RG A+  M+L            + HA    P+ I +    G    D V P
Sbjct: 2   NNRAYALDALRGYAIITMVLSATIVTQVLPGWMSHAQTPPPDHIFNPSLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++ +  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIGKRAEKGDSKLKLVYEAVKRGVQLTFFAIFIQ 107


>gi|293369243|ref|ZP_06615833.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|292635668|gb|EFF54170.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 466

 Score = 41.6 bits (96), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 20/122 (16%)

Query: 33  QRLASLDIFRGLAVALMIL------------VDHAGGDWPEIS-HAPWNGCNLADFVMPF 79
           +R  +LD  RG A+  M+L            + HA    P+ +  A  +G    D V PF
Sbjct: 3   KRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFPF 62

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FLF +G A   ++K+  ++ D   ++++    R ++L F+ I +Q  +   P  L+   D
Sbjct: 63  FLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFY---PYVLSNPQD 119

Query: 136 VR 137
           VR
Sbjct: 120 VR 121


>gi|255532593|ref|YP_003092965.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
 gi|255345577|gb|ACU04903.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
          Length = 390

 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 5/73 (6%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           R  ++D+ R L + LMI V+  G       W +   A  +G   AD + P FLFIVG+++
Sbjct: 8   RFQAVDVLRALTMFLMIFVNDVGSVKYLPHWVDHVEADVDGMGFADTIFPAFLFIVGLSL 67

Query: 89  ALALKRIPDRADA 101
             AL+   ++  +
Sbjct: 68  PFALQSRMNKGKS 80


>gi|210622217|ref|ZP_03293007.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
 gi|210154351|gb|EEA85357.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
          Length = 483

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 40/184 (21%), Positives = 77/184 (41%), Gaps = 20/184 (10%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLK----TQRLASLDIFRGLAVALMILVDHAG-- 56
           E+  E +     +I+  +    Q +  +      +R  ++++  G+AV  +I     G  
Sbjct: 80  ELSREESSTEQTVINRGEKEQPQAREVVTGDPLKRRYTTVELIMGVAVIAIICSSGIGVL 139

Query: 57  GDWPE-ISHAPWNGCNLADFVMPFFL----FIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           G+ P  ++ + WNG +  D  +P  L    F++   + L +KR     +   K +   + 
Sbjct: 140 GEMPAFLAFSKWNGISFGDLGLPLLLASVCFMIPTEVELDVKRKKSFKEICIKKVKVGII 199

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS 171
           L   GIL+          L    +    R+ G+LQ IA+ Y+L SL+ +  +    K   
Sbjct: 200 LFVIGILIN---------LIGAWNFNSFRIMGILQMIAVVYMLGSLLYVLFRRFNFKSSV 250

Query: 172 VGRF 175
           +  F
Sbjct: 251 IAVF 254


>gi|291515652|emb|CBK64862.1| hypothetical protein AL1_27050 [Alistipes shahii WAL 8301]
          Length = 466

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 14/94 (14%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAG--GDWPE------------ISHAPWNGCNLADFVM 77
             R  ++DI RGLA+  M+L  +     D P             +  A   G    D V 
Sbjct: 2   NNRAYAVDILRGLAIVGMVLSGYIAWNPDLPAWLFHAQLPPPSFVFDASVAGITWVDLVF 61

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           PFFLF +G A  L+L R  +R D  ++++   LK
Sbjct: 62  PFFLFSMGAAFPLSLGRRLNRGDPPRRIVVSILK 95


>gi|317503638|ref|ZP_07961656.1| transmembrane protein, partial [Prevotella salivae DSM 15606]
 gi|315665239|gb|EFV04888.1| transmembrane protein [Prevotella salivae DSM 15606]
          Length = 322

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 55/238 (23%), Positives = 88/238 (36%), Gaps = 74/238 (31%)

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           IRL GV  R+A  Y+L S V I       K + +  F           + A  +  YL L
Sbjct: 70  IRLVGVFPRLAFCYMLASFVAI-----SMKHKYIPYF-----------IGALFVGYYLIL 113

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
            YG           N  + D   + NV                 ID  V G  H+Y    
Sbjct: 114 CYG-----------NGFAHDASNICNV-----------------IDEAVFGRQHLYQ--- 142

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
           W                  D P       +PEGLLS++ ++   +IG   G  ++     
Sbjct: 143 W------------------DIP-------DPEGLLSNLPALGHVLIGFCVGRSLMSATSI 177

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             ++++    G  L   G  L +    P++K+L+T ++  VT G A+   + +  ++D
Sbjct: 178 NEKIEKLFLFGALLTFAGFLLSY--GCPISKKLWTPTFALVTCGLASTTLAMLSWIID 233


>gi|224536474|ref|ZP_03677013.1| hypothetical protein BACCELL_01348 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521916|gb|EEF91021.1| hypothetical protein BACCELL_01348 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 193

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/120 (23%), Positives = 57/120 (47%), Gaps = 5/120 (4%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN---- 343
           +PEGLLS++ SI   ++G   G +++ +    +R     +    L + G  L F      
Sbjct: 8   DPEGLLSTIPSIAHVLLGFCVGRMMLDSNRAESREALLNSHLIKLFLVGAILTFAGFLLS 67

Query: 344 -AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAA 402
              P+NK++++ ++V  T G A+   + +  ++D+   K         G+N + +YV+  
Sbjct: 68  YGCPINKKIWSPTFVLTTCGLASSFLALLIWIIDVKGYKKWCTFFEAFGVNPLFMYVLGG 127


>gi|387791847|ref|YP_006256912.1| hypothetical protein Solca_2702 [Solitalea canadensis DSM 3403]
 gi|379654680|gb|AFD07736.1| hypothetical protein Solca_2702 [Solitalea canadensis DSM 3403]
          Length = 487

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 56/128 (43%), Gaps = 25/128 (19%)

Query: 16  ISEPDVSDQQEKSHLKTQ-----RLASLDIFRGLAVALMILVD-------------HAGG 57
           I+E + S + E S +K Q     R  +LD  RGLA+  M+                H   
Sbjct: 4   ITEIEAS-KPEASFVKAQIDKAPRSFALDALRGLAIIGMVFSGVFPHEALWPGYMFHGQV 62

Query: 58  DWPEISHAPW-NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----L 112
             P+  + P   G    D V PFFLF +G A  LA+ +     D  K VI   LK    L
Sbjct: 63  GPPDFKYTPEVPGITWVDLVFPFFLFSMGAAFPLAMNKKIQEGDQ-KGVILNVLKRFALL 121

Query: 113 LFWGILLQ 120
           +F+ I+L+
Sbjct: 122 VFFAIVLR 129


>gi|325102778|ref|YP_004272432.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324971626|gb|ADY50610.1| hypothetical protein Pedsa_0021 [Pedobacter saltans DSM 12145]
          Length = 466

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 49/111 (44%), Gaps = 30/111 (27%)

Query: 29  HLKTQRLASLDIFRGLAVALMIL-----------------VDHAGGDW-PEISHAPWNGC 70
            L  +R +SLD  RG+A+ LM+L                 V   G  + PEI    W   
Sbjct: 2   KLTVKRDSSLDSLRGIAIILMVLSGSIAFSILPGWMYHAQVPPPGHKFMPEIPGITW--- 58

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADA-------VKKVIFRTLKLLF 114
              D V PFFLF +G AI LA+K+  + + +       VK+ +  T   LF
Sbjct: 59  --VDLVFPFFLFSMGAAIPLAMKKKIENSSSLNIFISIVKRFVLLTFFALF 107


>gi|406830436|ref|ZP_11090030.1| hypothetical protein SpalD1_02314 [Schlesneria paludicola DSM
           18645]
          Length = 380

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 57/123 (46%), Gaps = 18/123 (14%)

Query: 34  RLASLDIFRGLAVALMILVD---HAGGDWPE----------ISHAPWNGCNLADFVMPFF 80
           R+ASLD  RG A+A++++      A  + P+          +SH    G +L D   P F
Sbjct: 27  RVASLDTLRGFAIAILLIATPLVSALKEVPQSATRDMLVWQLSHVKGEGISLFDVGWPAF 86

Query: 81  LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           L I GV++  +L R  +R +    A   ++ ++L    +   + GGFS  P    Y  DV
Sbjct: 87  LIIAGVSLNFSLARRLERGETRFAAWLDLVRKSLLCALFAFFVHGGFS-IPWAKVYFADV 145

Query: 137 RMI 139
             I
Sbjct: 146 LFI 148


>gi|323343607|ref|ZP_08083834.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
           33269]
 gi|323095426|gb|EFZ38000.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
           33269]
          Length = 468

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 17/111 (15%)

Query: 30  LKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFV 76
           +K +R  +LD  RG A+  MIL            + HA    P+ + +    G    D +
Sbjct: 1   MKQERAHALDALRGYAIMTMILSATEAFRVLPAWMYHAQVPPPDHVFNPSIYGITWVDLI 60

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGF 123
            PFFLF +G AI L+L R      +++K+      R LKL F+ I +   F
Sbjct: 61  FPFFLFSMGAAIPLSLGRQYKAGASLRKLCRKSAIRWLKLAFFAIFIYHTF 111


>gi|423299515|ref|ZP_17277540.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473324|gb|EKJ91846.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
           CL09T03C10]
          Length = 467

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 18/106 (16%)

Query: 33  QRLASLDIFRGLAVALMILVDH-AGGDWPE-ISHAPWN------------GCNLADFVMP 78
           +R  SLD FRG A+  M+L    A G  P  + HA               G    D V P
Sbjct: 2   KRAISLDAFRGYAIVTMVLSGTIASGVLPGWMYHAQMGPRSNYIFDPQLYGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGILLQ 120
           FFLF +G AI  ++    ++ + + K+I     R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAIPFSVGGKIEKGENLWKIIGECVLRGIRLAFFAIFIQ 107


>gi|160891390|ref|ZP_02072393.1| hypothetical protein BACUNI_03840 [Bacteroides uniformis ATCC 8492]
 gi|156858797|gb|EDO52228.1| hypothetical protein BACUNI_03840 [Bacteroides uniformis ATCC 8492]
          Length = 421

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 10/114 (8%)

Query: 15  IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNG 69
           I+ + D+SD+   S    +R  ++D+ R L +  MI V+     H    W E +    + 
Sbjct: 8   ILKQDDMSDKTVYS----RRNPAIDMLRALTMFTMIFVNDFWKVHDVPHWLEHAVYGEDF 63

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
             LAD V P FLF VG++I  A++R   +  + +  +   L   F  +L+ G F
Sbjct: 64  MGLADIVFPCFLFAVGMSIPYAIERRYAKGFSAESTLGHILSRTF-ALLVMGAF 116


>gi|345856865|ref|ZP_08809325.1| hypothetical protein DOT_0678 [Desulfosporosinus sp. OT]
 gi|344330006|gb|EGW41324.1| hypothetical protein DOT_0678 [Desulfosporosinus sp. OT]
          Length = 237

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 9/94 (9%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
            S + +QR   +D+ R LA+ LM+L  H   D  E +     G N+ D+  P + F++G 
Sbjct: 8   NSRISSQRYEEIDVLRALAIGLMVLF-HLAYDLKEFA-----GVNI-DYQAPLW-FVIGK 59

Query: 87  AIALALKRIPDRADAV-KKVIFRTLKLLFWGILL 119
             AL    I   +    K  + R LK+LFWG+++
Sbjct: 60  TSALLFIFISGLSSGFSKSSVRRGLKVLFWGMVV 93


>gi|189468533|ref|ZP_03017318.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
           17393]
 gi|189436797|gb|EDV05782.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
           17393]
          Length = 393

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 51/96 (53%), Gaps = 9/96 (9%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLFIVGVA 87
           QR+A++D+FR L +  M+ V+   G  + P  + HA  N      +D + P FLF +G++
Sbjct: 7   QRVAAVDVFRALTMFFMLFVNDIPGLKNVPHWLMHAEMNEDMMGFSDTIFPAFLFCMGMS 66

Query: 88  IALALKRIPDRADAVKKVIF----RTLKLLFWGILL 119
           I  A++    + D   ++I     RT+ L+  G+ +
Sbjct: 67  IPFAIQNRVKKGDTALQIISHISERTVALIAMGLFM 102


>gi|416973167|ref|ZP_11937347.1| OpgC protein, partial [Burkholderia sp. TJI49]
 gi|325520603|gb|EGC99672.1| OpgC protein [Burkholderia sp. TJI49]
          Length = 141

 Score = 40.0 bits (92), Expect = 2.5,   Method: Composition-based stats.
 Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 8/96 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGF 123
           +A   + +R D  A ++   R    ++   L+  G 
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEIYRAFLVTAGL 96


>gi|420256474|ref|ZP_14759317.1| hypothetical protein PMI06_09785 [Burkholderia sp. BT03]
 gi|398043145|gb|EJL36078.1| hypothetical protein PMI06_09785 [Burkholderia sp. BT03]
          Length = 373

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 62/140 (44%), Gaps = 18/140 (12%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           Q+       RL  LD FRGL V + I+VDH GG    +S A  +   L D     F+F+ 
Sbjct: 4   QKGMQTSKNRLIELDFFRGL-VLIFIVVDHIGGS--ILSRATLHAYALCD-AAEVFVFLG 59

Query: 85  GVAIALALKRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS-HAPDEL 130
           G A A A   +  R   ADA  +   R+L+L    L   +L+         FS  AP+  
Sbjct: 60  GFATATAYASLAKRHTEADARNRFFKRSLELYRAFLITAVLMLLVSAVMSAFSIDAPNMA 119

Query: 131 TYGVDVRMIRLCGVLQRIAL 150
           T  +D  M     VL+ I L
Sbjct: 120 TTDLDDMMDTPTAVLRDILL 139


>gi|187735023|ref|YP_001877135.1| hypothetical protein Amuc_0516 [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187425075|gb|ACD04354.1| conserved hypothetical protein [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 370

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 6/118 (5%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEG L  VS+    + G   G  +  ++    RL      G  LL+F          P
Sbjct: 198 FDPEGPLCVVSASALCMGGWIAGKFLKESRLPPVRLCT-AMAGAGLLLFCTARMLDGTYP 256

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDIWN----LKYPFLPLAWIGMNAMLVYVM 400
           + K ++T ++V   +G +    S  +  VD+WN     ++ F PL  IG+NA+  Y++
Sbjct: 257 IIKSMWTGTFVLAAAGISLTFLSLFHLAVDVWNEGKWAQWAF-PLRVIGINALAAYLI 313


>gi|256423178|ref|YP_003123831.1| hypothetical protein Cpin_4173 [Chitinophaga pinensis DSM 2588]
 gi|256038086|gb|ACU61630.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 349

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 14/100 (14%)

Query: 31  KTQRLASLDIFRGLAVALMI----LVDHAGGDW------PEISHAPWNGCNLADFVMPFF 80
            + RL SLD+ RGL + L+      V  +  +W       +  H PW+G    D V P F
Sbjct: 3   NSGRLLSLDVMRGLIMILLAGESCRVYESLHEWHDNAFIRQFFHHPWHGLRFWDLVQPAF 62

Query: 81  LFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
           + + G A+ ++    L++    +   K ++ R+LKL   G
Sbjct: 63  MLMAGTAMYISYQSKLRKGVSWSQNFKHILIRSLKLFLLG 102


>gi|377812665|ref|YP_005041914.1| hypothetical protein BYI23_B004200 [Burkholderia sp. YI23]
 gi|357937469|gb|AET91027.1| hypothetical protein BYI23_B004200 [Burkholderia sp. YI23]
          Length = 389

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 60/127 (47%), Gaps = 18/127 (14%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           +   +   K+QRL  LD FRGL V L+I++DH GG    +S    +   L D     F+F
Sbjct: 6   NTPSRMQQKSQRLVELDFFRGL-VLLIIVIDHIGGSM--LSRFTLHSFALND-AAEVFVF 61

Query: 83  IVGVAIALALKRIPD-RADAVKKVIF--RTLKL-----------LFWGILLQGGFSHAPD 128
           + G A A A   + + R+++  +V F  R  +L           L    +L+  F HAP+
Sbjct: 62  LGGFATATAYVSLAERRSESAARVRFLKRAFELYRAFVVTAVLMLVASFVLRPLFGHAPN 121

Query: 129 ELTYGVD 135
              + +D
Sbjct: 122 LALHDLD 128


>gi|261880015|ref|ZP_06006442.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270333306|gb|EFA44092.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 477

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 50/102 (49%), Gaps = 7/102 (6%)

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGLTLH-FTNAIPLNKQLYTLSYV 357
           T+  + F H++I     ++R + W  M   G  LL+ GL    F   +   K   T SY 
Sbjct: 323 TLTLLAFMHILISRSSGISR-ELWRKMYRAGAYLLVLGLVFEAFEGGV--RKDTATFSYY 379

Query: 358 CVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYV 399
            +TSG A L F + + L D++ +K+    L + G N M+ YV
Sbjct: 380 FITSGLAFLSFISFHLLCDVFQIKWLTHSLGYAGQNPMIAYV 421


>gi|333380436|ref|ZP_08472127.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826431|gb|EGJ99260.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 469

 Score = 39.3 bits (90), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 25/134 (18%)

Query: 33  QRLASLDIFRGLAVALMIL------------VDHAGGDWPEISHAPWN-GCNLADFVMPF 79
            R  +LD  RG A+  M+L            + HA    P     P   G    D V PF
Sbjct: 3   DRSCALDALRGYAIVTMVLSGAVVYGVLPGWMYHAQVPPPTHVFNPAAPGITWVDLVFPF 62

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGF----SHAPDELT 131
           FLF +G A   ++++  +R ++  K+I+    R+++L F+ I ++  +    S+  D   
Sbjct: 63  FLFAMGSAFPFSIRKRLERGESKLKLIYDALKRSIQLTFFAIFIRHFYPYVLSNPEDARA 122

Query: 132 YGVDVRMIRLCGVL 145
           +G+ +    LC VL
Sbjct: 123 WGLSL----LCFVL 132


>gi|421867873|ref|ZP_16299526.1| OpgC protein [Burkholderia cenocepacia H111]
 gi|358072286|emb|CCE50404.1| OpgC protein [Burkholderia cenocepacia H111]
          Length = 382

 Score = 38.9 bits (89), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 9/100 (9%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADF 75
           + P        +  +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+ 
Sbjct: 9   TRPAPEGAPMTAPARAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE- 66

Query: 76  VMPFFLFIVGVAIALALKRIPDRAD---AVKKVIFRTLKL 112
               F+F+ G A A+A   + +R D   A ++ I R  ++
Sbjct: 67  ---VFVFLGGFATAIAYNSLAERHDEAAARQRFIRRAFEI 103


>gi|312796297|ref|YP_004029219.1| OpgC protein [Burkholderia rhizoxinica HKI 454]
 gi|312168072|emb|CBW75075.1| OpgC [Burkholderia rhizoxinica HKI 454]
          Length = 330

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 9/88 (10%)

Query: 17  SEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDW-PEISHAPWNGCNL 72
           S P +      S+L    T RLA LD FRGL V L+I+VDH GG     ++   +  C+ 
Sbjct: 3   SRPALPFHPSPSYLMQPSTARLAELDFFRGL-VLLIIVVDHIGGSMLSRVTLHTYALCDA 61

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRAD 100
           A+     F+F+ G A A+    +  R D
Sbjct: 62  AE----VFVFLGGYATAIGWTTLAARCD 85


>gi|319900329|ref|YP_004160057.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
 gi|319415360|gb|ADV42471.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
          Length = 414

 Score = 38.5 bits (88), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 52/120 (43%), Gaps = 9/120 (7%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFF 80
             S   ++R  ++D+ R L +  MI V+     H    W E +    +   LAD V P F
Sbjct: 8   NTSATYSRRNLAIDMLRALTMFTMIFVNDFWKVHDIPRWLEHAGYGEDFMGLADVVFPCF 67

Query: 81  LFIVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           LF VG++I  A++R   +  + +     +  RT  LL  G  +         E+ Y + V
Sbjct: 68  LFAVGMSIPYAIERRYAKGFSAESTLGHIFLRTFALLVMGAFITNSEYRLSPEVPYPIGV 127


>gi|373954327|ref|ZP_09614287.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
          18603]
 gi|373890927|gb|EHQ26824.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
          18603]
          Length = 473

 Score = 38.5 bits (88), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 34/76 (44%), Gaps = 14/76 (18%)

Query: 33 QRLASLDIFRGLAVALMIL-------------VDHAGGDWPEISHAP-WNGCNLADFVMP 78
          QR  SLD  RG A+ LM+L             + HA    P     P   G    D V P
Sbjct: 11 QRANSLDALRGTAILLMVLSGSIAFGGILPGWMYHAQVPPPAHQFKPDLPGITWVDLVFP 70

Query: 79 FFLFIVGVAIALALKR 94
          FFLF +G AI LAL +
Sbjct: 71 FFLFAMGAAIPLALVK 86


>gi|402568588|ref|YP_006617932.1| hypothetical protein GEM_3848 [Burkholderia cepacia GG4]
 gi|402249785|gb|AFQ50238.1| hypothetical protein GEM_3848 [Burkholderia cepacia GG4]
          Length = 365

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|358344082|ref|XP_003636122.1| Magnesium transporter NIPA2 [Medicago truncatula]
 gi|355502057|gb|AES83260.1| Magnesium transporter NIPA2 [Medicago truncatula]
          Length = 328

 Score = 38.5 bits (88), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           G +  GG+ H   +LT+GVD++ IRL G+LQR
Sbjct: 105 GGVFTGGYVHRVSDLTFGVDLKQIRLMGILQR 136


>gi|170703183|ref|ZP_02893992.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
 gi|170131915|gb|EDT00434.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
          Length = 367

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|172063517|ref|YP_001811168.1| hypothetical protein BamMC406_4496 [Burkholderia ambifaria MC40-6]
 gi|171996034|gb|ACB66952.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6]
          Length = 367

 Score = 38.5 bits (88), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|115358787|ref|YP_775925.1| hypothetical protein Bamb_4038 [Burkholderia ambifaria AMMD]
 gi|115284075|gb|ABI89591.1| conserved hypothetical protein [Burkholderia ambifaria AMMD]
          Length = 384

 Score = 38.5 bits (88), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 23  RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 77

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 78  IAYNSLAERHDEAAARQRFIRRAFEI 103


>gi|406834557|ref|ZP_11094151.1| hypothetical protein SpalD1_23036 [Schlesneria paludicola DSM
           18645]
          Length = 534

 Score = 38.1 bits (87), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 35/67 (52%), Gaps = 6/67 (8%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCN----LADFVMPFFLFIVGVA 87
           T+RL SLD FRG  VA MILV+  G     + H+ +   N     AD +MP F F VG +
Sbjct: 75  TERLVSLDQFRGYTVAGMILVNFIGSF--AVVHSIFKHNNNYFSYADSIMPGFHFAVGYS 132

Query: 88  IALALKR 94
             L   R
Sbjct: 133 YRLTFLR 139


>gi|171315634|ref|ZP_02904868.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
 gi|171099166|gb|EDT43939.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
          Length = 367

 Score = 38.1 bits (87), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|134293811|ref|YP_001117547.1| hypothetical protein Bcep1808_5131 [Burkholderia vietnamiensis G4]
 gi|387904835|ref|YP_006335173.1| OpgC protein [Burkholderia sp. KJ006]
 gi|134136968|gb|ABO58082.1| conserved hypothetical protein [Burkholderia vietnamiensis G4]
 gi|387579727|gb|AFJ88442.1| OpgC protein [Burkholderia sp. KJ006]
          Length = 367

 Score = 38.1 bits (87), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|167584370|ref|ZP_02376758.1| hypothetical protein BuboB_03474 [Burkholderia ubonensis Bu]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           K  R A LD FRGL V L+I++DH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   KPGRYAELDFFRGL-VLLVIVIDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|107026114|ref|YP_623625.1| hypothetical protein Bcen_3760 [Burkholderia cenocepacia AU 1054]
 gi|116692702|ref|YP_838235.1| hypothetical protein Bcen2424_4608 [Burkholderia cenocepacia
           HI2424]
 gi|105895488|gb|ABF78652.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054]
 gi|116650702|gb|ABK11342.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|221196140|ref|ZP_03569187.1| putative membrane protein [Burkholderia multivorans CGD2M]
 gi|221202813|ref|ZP_03575832.1| putative membrane protein [Burkholderia multivorans CGD2]
 gi|221176747|gb|EEE09175.1| putative membrane protein [Burkholderia multivorans CGD2]
 gi|221182694|gb|EEE15094.1| putative membrane protein [Burkholderia multivorans CGD2M]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDESAARQRFIRRAFEI 86


>gi|161520584|ref|YP_001584011.1| hypothetical protein Bmul_4038 [Burkholderia multivorans ATCC
           17616]
 gi|189353228|ref|YP_001948855.1| OpgC protein [Burkholderia multivorans ATCC 17616]
 gi|221209834|ref|ZP_03582815.1| putative membrane protein [Burkholderia multivorans CGD1]
 gi|421474568|ref|ZP_15922594.1| OpgC protein [Burkholderia multivorans CF2]
 gi|160344634|gb|ABX17719.1| conserved hypothetical protein [Burkholderia multivorans ATCC
           17616]
 gi|189337250|dbj|BAG46319.1| OpgC protein [Burkholderia multivorans ATCC 17616]
 gi|221170522|gb|EEE02988.1| putative membrane protein [Burkholderia multivorans CGD1]
 gi|400231859|gb|EJO61520.1| OpgC protein [Burkholderia multivorans CF2]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDESAARQRFIRRAFEI 86


>gi|373459116|ref|ZP_09550883.1| hypothetical protein Calab_2944 [Caldithrix abyssi DSM 13497]
 gi|371720780|gb|EHO42551.1| hypothetical protein Calab_2944 [Caldithrix abyssi DSM 13497]
          Length = 471

 Score = 38.1 bits (87), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 45/104 (43%), Gaps = 16/104 (15%)

Query: 32  TQRLASLDIFRGLAVALMILVD------------HAGGDWPEISHAP-WNGCNLADFVMP 78
            +R+ +LD  RG A+  M+L              HA    PE    P   G    D V P
Sbjct: 5   NERVLNLDALRGFAILTMVLAGTIPYTGLPAWMYHAQLPPPERIFNPNLPGFTWVDLVFP 64

Query: 79  FFLFIVGVAIALALKRIPDRADAVK---KVIFRTLKLLFWGILL 119
            FLF +G AI LAL++   R    +    ++ RT  L F+ I L
Sbjct: 65  LFLFSLGAAIPLALEKRLTRQSLPRVGLHIVERTFLLAFFAIFL 108


>gi|421468975|ref|ZP_15917475.1| OpgC protein [Burkholderia multivorans ATCC BAA-247]
 gi|400230841|gb|EJO60585.1| OpgC protein [Burkholderia multivorans ATCC BAA-247]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDESAARQRFIRRAFEI 86


>gi|390575261|ref|ZP_10255366.1| hypothetical protein WQE_42474 [Burkholderia terrae BS001]
 gi|389932764|gb|EIM94787.1| hypothetical protein WQE_42474 [Burkholderia terrae BS001]
          Length = 367

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 18/132 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL  LD FRGL V + I+VDH GG    +S A  +   L D     F+F+ G A A A 
Sbjct: 6   NRLIELDFFRGL-VLIFIVVDHIGGS--ILSRATLHAYALCD-AAEVFVFLGGFATATAY 61

Query: 93  KRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS-HAPDELTYGVDVRM 138
             +  R   ADA  +   R+L+L    L   +L+         FS  AP+  T  +D  M
Sbjct: 62  ASLAKRHTEADARNRFFKRSLELYRAFLITAVLMLLVSAVMSAFSIDAPNMATTDLDDMM 121

Query: 139 IRLCGVLQRIAL 150
                VL+ I L
Sbjct: 122 DTPTAVLRDILL 133


>gi|206563639|ref|YP_002234402.1| hypothetical protein BCAM1788 [Burkholderia cenocepacia J2315]
 gi|444363993|ref|ZP_21164352.1| OpgC protein [Burkholderia cenocepacia BC7]
 gi|444373170|ref|ZP_21172575.1| OpgC protein [Burkholderia cenocepacia K56-2Valvano]
 gi|198039679|emb|CAR55648.1| putative membrane protein [Burkholderia cenocepacia J2315]
 gi|443592216|gb|ELT61038.1| OpgC protein [Burkholderia cenocepacia K56-2Valvano]
 gi|443593862|gb|ELT62568.1| OpgC protein [Burkholderia cenocepacia BC7]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|170738048|ref|YP_001779308.1| hypothetical protein Bcenmc03_5696 [Burkholderia cenocepacia MC0-3]
 gi|254248202|ref|ZP_04941522.1| hypothetical protein BCPG_03029 [Burkholderia cenocepacia PC184]
 gi|124874703|gb|EAY64693.1| hypothetical protein BCPG_03029 [Burkholderia cenocepacia PC184]
 gi|169820236|gb|ACA94818.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|78062083|ref|YP_371991.1| hypothetical protein Bcep18194_B1233 [Burkholderia sp. 383]
 gi|77969968|gb|ABB11347.1| conserved hypothetical protein [Burkholderia sp. 383]
          Length = 365

 Score = 38.1 bits (87), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.329    0.142    0.467 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,307,668,449
Number of Sequences: 23463169
Number of extensions: 302394392
Number of successful extensions: 913563
Number of sequences better than 100.0: 958
Number of HSP's better than 100.0 without gapping: 668
Number of HSP's successfully gapped in prelim test: 290
Number of HSP's that attempted gapping in prelim test: 909748
Number of HSP's gapped (non-prelim): 1764
length of query: 463
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 317
effective length of database: 8,933,572,693
effective search space: 2831942543681
effective search space used: 2831942543681
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 79 (35.0 bits)